Abstract
The quality of business information can significantly affect the operation level of enterprise. This paper analyses the problem of business information retrieval (BIR). A Bayesian Network Based business information retrieval model (BN-BIRM) is proposed by means of Bayesian network (BN) and information retrieval (IR) theory and a method for query adaptation is presented. In this model the customized query requirement of enterprise (CQR) is expressed in terms of the predefined illustrative documents related to business domain. The similarities between the documents and the query are evaluated with the conditional probabilities among the nodes in the BN. In the experiments, BN-BIRM is compared with the Belief Network model based on vector space model (VSM) ranking strategy and the Inference Network model based on TF-IDF ranking strategy. The experimental results show that BN-BIRM is effective for collecting business information on a large scale.
Similar content being viewed by others
References
Acid S, Campos LM, Fernandez-Luna JM, Huete JF (2003) An information retrieval model based on simple Bayesian networks. Proc Int J Intel Syst 18: 251–265
Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. Addison-Wesley, USA, pp 48–61
Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. In: Proceedings of the seventh international world wide web conference, pp 107–117
Calado P, Cristo M, Moura E (2006) Combining link-based and content-based methods for Web document classification. In: Proceedings of the twelfth international conference on information and knowledge management, Arlington, Virginia, USA, pp 540–549
Calado P, Ribeiro-Neto B, Ziviani N (2003) Local versus global link information in the Web. ACM Trans Inf Syst 21(1): 42–63
Campos LM, Fernandez-Luna JM, Huete JF (2002) A layered Bayesian network model for document retrieval. Lect Notes Comp Sci 2291: 169–182
Chen R, Sivakumar K, Kargupta H (2004) Collective mining of Bayesian networks from distributed heterogeneous data. Knowl Inf Syst 6: 164–187
De Falco I, Della Cioppa A, Iazzetta A et al (2005) An evolutionary approach for automatically extracting intelligible classification rules. Knowl Inf Syst 7: 179–201
Druzdzel MK, Van Der Gaag LC (1995) Elicitation of probabilities for belief networks: combining qualitative and quantitative information. In: Proceedings of 11th ann. conf. uncertainty in artificial intelligence (UAI ‘95), pp 141–148
Fenton NE, Neil M, Caballero JG (2007) Using ranked nodes to model qualitative judgments in Bayesian networks. IEEE Trans Knowl Data Eng 19(10): 1420–1432
Fung R, Del Favero B (1995) Applying Bayesian networks to information retrieval. Commun ACM 38(3):42–48, 57
Gao XZ, Murugesan S, Lo B (2005) Extraction of keyterms by simple text mining for business information retrieval. In: Proceedings of IEEE international conference on e-business engineering, pp 332–339
Genest D, Chein M (2005) A content-search information retrieval process based on conceptual graphs. Knowl Inf Syst 8: 292–309
Hawking D, Crimmins F, Craswell N, Upstill T (2004) How valuable is external link evidence when searching enterprise Webs? In: Proceedings of fifteenth Australasian database conference, Dunedin, NZ, pp 77–84
Heckerman D, Horvitz E (1998) Inferring informational goals from free-text queries: a Bayesian approach. In: Presented at fourteenth conference on uncertainty in artificial intelligence, Madison, WI
Kleinberg JM (1998) Authoritative sources in a hyperlinked environment. In: Proceedings of the ninth annual ACM-SIAM symposium on discrete algorithms, San Francisco, pp 668–677
Lee CY, Soo VW (2005) Ontology-based information retrieval and extraction. In: Proceedings of the 3rd international conference on information technology: research and education, pp 265–269
Luis M, Campos LM, Fernandez-Luna JM, Huete JF (2004) Bayesian networks and information retrieval: an introduction to the special issue. Inf Process Manage 40(5): 727–733
Macedo AA, Pimentel MGC, Guerrero JAC (2002) An infrastructure for open latent semantic linking. In: Proceedings of the thirteenth ACM conference on hypertext and hypermedia, College Park, Maryland, USA, pp 107–116
Mitchell TM (1997) Machine Learning. McGraw-Hill, USA, pp 184–185
Monti S, Carenini G (2000) Dealing with the Expert Inconsistency in Probability Elicitation. IEEE Trans Knowl Data Eng 12(4): 499–508
Oyama S, Kokuko T, Ishida T (2004) Domain-specific Web search with keyword spices. IEEE Trans Know Data Eng 16(1): 17–27
Qiu YG, Frei HP (1993) Concept based query expansion. In: Proceedings of the 16th ACM SIGIR conference on research and development in information retrieval, Pittsburgh, PA, USA, pp 160–169
Ribeiro-Neto B, Muntz R (1996) A belief network model for IR. In: Proceedings of the 19th ACM SIGIR conference on research and development in information retrieval, Zurich, Switzerland, pp 253–260
Ribeiro-Neto B, Silva I, Muntz R (2000) Soft computing in information retrieval: techniques and applications, 1st edn. Springer, Verlag
Rocchio JJ (1971) Relevance feedback in information retrieval. In: Salton G (ed) The SMART retrieval system-experiments in automatic document processing. Prentice Hall Inc., Englewood Cliffs
Sheng XW, Jiang MH (2003) An information retrieval system based on automatic query expansion and hopfield network. In: Proceedings of IEEE international conference on neural networks & signal processing, Nanjing. China, pp 1624–1627
Silva I, Ribeiro-Neto B, Calado P, Moura E, Ziviani N (2000) Link-based and content-Based evidential information in a Belief network model. In: Proceedings of the 23rd annual international ACM SIGIR conference on research and development in information retrieval, Athens, Greece, pp 96–103
Tombros A, Van Rijsbergen CJ (2004) Query-sensitive similarity measures for information retrieval. Knowl Inf Syst 6: 617–642
Turtle H, Croft W (1991) Evaluation of an inference network-based retrieval model. ACM Trans Inf Syst 9(3): 187–222
VanDer Gaag LC, Renooija S, Wittemana CLM, Alemanb BMP, Taal BG (2002) Probabilities for a probabilistic network: a case study in oesophageal cancer. Artif Intell Med 25(2): 123–148
Yang Y, Pedersen JP (1997) A comparative study on feature selection in text categorization. In: Proceeding of the 14th international conference on machine learning, pp 412–420
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was partially supported by Key Project of National Natural Science Foundation, grant 70431003, Innovative Research Team Project of National Natural Science Foundation, grant 60521003, and National Science Supporting plan, grant 2006BAH02A09 of People’s Republic of China.
Rights and permissions
About this article
Cite this article
Wang, Z., Wang, Q. & Wang, DW. Bayesian network based business information retrieval model. Knowl Inf Syst 20, 63–79 (2009). https://doi.org/10.1007/s10115-008-0151-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-008-0151-5