Skip to main content
Log in

Bayesian network based business information retrieval model

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

The quality of business information can significantly affect the operation level of enterprise. This paper analyses the problem of business information retrieval (BIR). A Bayesian Network Based business information retrieval model (BN-BIRM) is proposed by means of Bayesian network (BN) and information retrieval (IR) theory and a method for query adaptation is presented. In this model the customized query requirement of enterprise (CQR) is expressed in terms of the predefined illustrative documents related to business domain. The similarities between the documents and the query are evaluated with the conditional probabilities among the nodes in the BN. In the experiments, BN-BIRM is compared with the Belief Network model based on vector space model (VSM) ranking strategy and the Inference Network model based on TF-IDF ranking strategy. The experimental results show that BN-BIRM is effective for collecting business information on a large scale.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Acid S, Campos LM, Fernandez-Luna JM, Huete JF (2003) An information retrieval model based on simple Bayesian networks. Proc Int J Intel Syst 18: 251–265

    Article  MATH  Google Scholar 

  2. Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. Addison-Wesley, USA, pp 48–61

    Google Scholar 

  3. Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. In: Proceedings of the seventh international world wide web conference, pp 107–117

  4. Calado P, Cristo M, Moura E (2006) Combining link-based and content-based methods for Web document classification. In: Proceedings of the twelfth international conference on information and knowledge management, Arlington, Virginia, USA, pp 540–549

  5. Calado P, Ribeiro-Neto B, Ziviani N (2003) Local versus global link information in the Web. ACM Trans Inf Syst 21(1): 42–63

    Article  Google Scholar 

  6. Campos LM, Fernandez-Luna JM, Huete JF (2002) A layered Bayesian network model for document retrieval. Lect Notes Comp Sci 2291: 169–182

    Article  Google Scholar 

  7. Chen R, Sivakumar K, Kargupta H (2004) Collective mining of Bayesian networks from distributed heterogeneous data. Knowl Inf Syst 6: 164–187

    Google Scholar 

  8. De Falco I, Della Cioppa A, Iazzetta A et al (2005) An evolutionary approach for automatically extracting intelligible classification rules. Knowl Inf Syst 7: 179–201

    Article  Google Scholar 

  9. Druzdzel MK, Van Der Gaag LC (1995) Elicitation of probabilities for belief networks: combining qualitative and quantitative information. In: Proceedings of 11th ann. conf. uncertainty in artificial intelligence (UAI ‘95), pp 141–148

  10. Fenton NE, Neil M, Caballero JG (2007) Using ranked nodes to model qualitative judgments in Bayesian networks. IEEE Trans Knowl Data Eng 19(10): 1420–1432

    Article  Google Scholar 

  11. Fung R, Del Favero B (1995) Applying Bayesian networks to information retrieval. Commun ACM 38(3):42–48, 57

    Google Scholar 

  12. Gao XZ, Murugesan S, Lo B (2005) Extraction of keyterms by simple text mining for business information retrieval. In: Proceedings of IEEE international conference on e-business engineering, pp 332–339

  13. Genest D, Chein M (2005) A content-search information retrieval process based on conceptual graphs. Knowl Inf Syst 8: 292–309

    Article  Google Scholar 

  14. Hawking D, Crimmins F, Craswell N, Upstill T (2004) How valuable is external link evidence when searching enterprise Webs? In: Proceedings of fifteenth Australasian database conference, Dunedin, NZ, pp 77–84

  15. Heckerman D, Horvitz E (1998) Inferring informational goals from free-text queries: a Bayesian approach. In: Presented at fourteenth conference on uncertainty in artificial intelligence, Madison, WI

  16. Kleinberg JM (1998) Authoritative sources in a hyperlinked environment. In: Proceedings of the ninth annual ACM-SIAM symposium on discrete algorithms, San Francisco, pp 668–677

  17. Lee CY, Soo VW (2005) Ontology-based information retrieval and extraction. In: Proceedings of the 3rd international conference on information technology: research and education, pp 265–269

  18. Luis M, Campos LM, Fernandez-Luna JM, Huete JF (2004) Bayesian networks and information retrieval: an introduction to the special issue. Inf Process Manage 40(5): 727–733

    Article  Google Scholar 

  19. Macedo AA, Pimentel MGC, Guerrero JAC (2002) An infrastructure for open latent semantic linking. In: Proceedings of the thirteenth ACM conference on hypertext and hypermedia, College Park, Maryland, USA, pp 107–116

  20. Mitchell TM (1997) Machine Learning. McGraw-Hill, USA, pp 184–185

    MATH  Google Scholar 

  21. Monti S, Carenini G (2000) Dealing with the Expert Inconsistency in Probability Elicitation. IEEE Trans Knowl Data Eng 12(4): 499–508

    Article  Google Scholar 

  22. Oyama S, Kokuko T, Ishida T (2004) Domain-specific Web search with keyword spices. IEEE Trans Know Data Eng 16(1): 17–27

    Article  Google Scholar 

  23. Qiu YG, Frei HP (1993) Concept based query expansion. In: Proceedings of the 16th ACM SIGIR conference on research and development in information retrieval, Pittsburgh, PA, USA, pp 160–169

  24. Ribeiro-Neto B, Muntz R (1996) A belief network model for IR. In: Proceedings of the 19th ACM SIGIR conference on research and development in information retrieval, Zurich, Switzerland, pp 253–260

  25. Ribeiro-Neto B, Silva I, Muntz R (2000) Soft computing in information retrieval: techniques and applications, 1st edn. Springer, Verlag

    Google Scholar 

  26. Rocchio JJ (1971) Relevance feedback in information retrieval. In: Salton G (ed) The SMART retrieval system-experiments in automatic document processing. Prentice Hall Inc., Englewood Cliffs

    Google Scholar 

  27. Sheng XW, Jiang MH (2003) An information retrieval system based on automatic query expansion and hopfield network. In: Proceedings of IEEE international conference on neural networks & signal processing, Nanjing. China, pp 1624–1627

  28. Silva I, Ribeiro-Neto B, Calado P, Moura E, Ziviani N (2000) Link-based and content-Based evidential information in a Belief network model. In: Proceedings of the 23rd annual international ACM SIGIR conference on research and development in information retrieval, Athens, Greece, pp 96–103

  29. Tombros A, Van Rijsbergen CJ (2004) Query-sensitive similarity measures for information retrieval. Knowl Inf Syst 6: 617–642

    Article  Google Scholar 

  30. Turtle H, Croft W (1991) Evaluation of an inference network-based retrieval model. ACM Trans Inf Syst 9(3): 187–222

    Article  Google Scholar 

  31. VanDer Gaag LC, Renooija S, Wittemana CLM, Alemanb BMP, Taal BG (2002) Probabilities for a probabilistic network: a case study in oesophageal cancer. Artif Intell Med 25(2): 123–148

    Article  Google Scholar 

  32. Yang Y, Pedersen JP (1997) A comparative study on feature selection in text categorization. In: Proceeding of the 14th international conference on machine learning, pp 412–420

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zheng Wang.

Additional information

This work was partially supported by Key Project of National Natural Science Foundation, grant 70431003, Innovative Research Team Project of National Natural Science Foundation, grant 60521003, and National Science Supporting plan, grant 2006BAH02A09 of People’s Republic of China.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, Z., Wang, Q. & Wang, DW. Bayesian network based business information retrieval model. Knowl Inf Syst 20, 63–79 (2009). https://doi.org/10.1007/s10115-008-0151-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-008-0151-5

Keywords

Navigation