Skip to main content
Log in

Constructing tree-based knowledge structures from text corpus

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

A knowledge structure identifies how people think and displays a macro view of human perception. By discovering the hidden structural relations of knowledge, significant reasoning patterns are retrieved to enhance further knowledge sharing and distribution. However, the utilization of such approaches is apt to be limited due to the lack of hierarchical features and the problem of information overload, which make it difficult to enhance comprehension and provide effective navigation. To address these critical issues, we propose a new approach to construct a tree-based knowledge structure from corpus which can reveal the significant relations among knowledge objects and enhance user comprehension. The effectiveness of the proposed method is demonstrated with two representative public data sets. The evaluation results show that the method presented in this work achieves remarkable consistency with the domain-specific knowledge structure, and is capable of reflecting appropriate similarities among knowledge objects along with hierarchical implications in the document classification task.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bradley JH, Paul R, Seeman E (2006) Analyzingthe structure of expert knowledge. Inf Manag 43:77–91

    Article  Google Scholar 

  2. Chen NS, Kinshuk Wei CW, Chen HJ (2008) Mining e-learning domain concept map from academic articles. Comput Educ 50:1009–1021

    Article  Google Scholar 

  3. Tseng SS, Sue PC, Su JM, Weng JF, Tsai WN (2007) A new approach for constructing the concept map. Comput Educ 49:691–707

    Article  Google Scholar 

  4. Eppler MJ, RA Burkhard (2007) Visual representations in knowledge management: framework and cases. J Knowl Manag 11:112–122

    Article  Google Scholar 

  5. Murphy GL, Lassaline ME (1997) Hierarchical structure in concepts and the basic level of categorization. In: Lamberts K, Shanks D (eds) Knowledge, concepts and categories. MIT Press, Cambridge

    Google Scholar 

  6. Stumme G, Taouil R, Bastide Y, Pasquier N, Lakhal L (2002) Computing iceberg concept lattices with titanic. Data Knowl Eng 42:189–222

    Article  MATH  Google Scholar 

  7. Rajapakse RK, Denham M (2006) Text retrieval with more realistic concept matching and reinforcement learning. Inform Process Manag 42:1260–1275

    Article  Google Scholar 

  8. Belohlavek R, Dvorak J, Outrata J (2007) Fast factorization by similarity in formal concept analysis of data with fuzzy attributes. J Comput Syst Sci 73:1012–1022

    Article  MATH  MathSciNet  Google Scholar 

  9. Sparrow J (1998) Knowledge in organizations: access to thinking at work. Sage, London

    Google Scholar 

  10. Novak JD (1993) How do we learn our lesson? Taking students through the process. Sci Teach 60:50–55

    Google Scholar 

  11. Ruiz-Primo MA, Schultz SE, Li M, Shavelson RJ (2001) Comparison of the reliability and validity of sores from two concept-mapping techniques. J Res Sci Teach 38:260–278

    Article  Google Scholar 

  12. Wang J (2003) A knowledge network constructed by integrating classification, thesaurus, and metadata in digital library. Int Inf Libr Rev 35:383–397

    Article  Google Scholar 

  13. Xu JJ, Chen H (2005) Crimenet explorer: a framework for criminal network knowledge discovery. ACM Trans Inform Syst 23:201–226

    Article  Google Scholar 

  14. Schvaneveldt RW (1990) Pathfinder associative networks: studies in organization. Albex Publishing, Norwood

    Google Scholar 

  15. Chen RC, Liang JY, Pan RH (2008) Using recursive art network to construction domain ontology based on term frequency and inverse document frequency. Expert Syst Appl 34:488–501

    Article  Google Scholar 

  16. Fenza G, Loia V, Senatore S (2008) A hybrid approach to semantic web services matchmaking. Int J Approx Reason 48:808–828

    Article  Google Scholar 

  17. Lee CS, Kao YF, Kuo YH, Wang MH (2007) Automated ontology construction for unstructured text documents. Data Knowl Eng 60:547–566

    Article  Google Scholar 

  18. Reformat M, Ly C (2009) Ontological approach to development of computing with words based systems. Int J Approx Reason 50:72–91

    Article  Google Scholar 

  19. Lee CS, Jian ZW, Huang LK (2005) A fuzzy ontology and its application to news summarization. IEEE Trans Syst Man Cybern, Part B, Cybern 35:859–880

    Article  Google Scholar 

  20. Stumme G (2003) Off to new shores: conceptual knowledge discovery and processing. Int J Human-Comput Stud 59:287–325

    Article  Google Scholar 

  21. Priss U (2006) Formal concept analysis in information science. In: Cronin B (ed) Annual review of information science and technology (arist), vol 40. Information Today Medford, New Jersey, pp 521–543

    Google Scholar 

  22. Ganter B, Wille R (1999) Formal concept analysis: mathematical foundations. Springer, New York

    MATH  Google Scholar 

  23. Tho QT, Hui SC, Fong, Cao TH (2006) Automatic fuzzy ontology generation for semantic web. IEEE Trans Knowl Data Eng 18:842–856

    Article  Google Scholar 

  24. Chi YL (2007) Elicitation synergy of extracting conceptual tags and hierarchies. Expert Syst Appl 32:349–357

    Article  Google Scholar 

  25. Formica A, Missikoff M (2004) Inheritance processing and conflicts in structural generalization hierarchies. ACM Comput Surv 36:263–290

    Article  Google Scholar 

  26. Carpineto C, Romano G (2004) Exploiting the potential of concept lattices for information retrieval with credo. J Univ Comput Sci 10:985–1013

    Google Scholar 

  27. Everitt B (1993) Cluster analysis. Edward Arnold, London

    Google Scholar 

  28. Steinbach M, Karypis G, Kumar V (2000) A comparison of document clustering techniques. Paper presented at the KDD Workshop on Text Mining, Boston, MA, USA

  29. Treeratpituk P, Callan J (2006) Automatically labeling hierarchical clusters. Paper presented at the proceedings of the 6th national conference on digital government research, San Diego, USA

  30. Chung W, Chen H, Nunamaker JF Jr (2005) A visual framework for knowledge discovery on the web: an empirical study of business intelligence exploration. J Manag Inf Syst 21:57–84

    Google Scholar 

  31. Cimiano P, Hotho A, Staab S (2005) Learning concept hierarchies from text corpora using formal concept analysis. J Artif Intell Res 24:305–339

    MATH  Google Scholar 

  32. Lammari N, Metais E (2004) Building and maintaining ontologies: a set of algorithms. Data Knowl Eng 48:155–176

    Article  Google Scholar 

  33. Sanderson M, Lawrie D (2000) Build, testing and applying concept hierarchies. In: Advances in information retrieval: recent research from the center for intelligent information retrieval, vol 7. Springer, New York, pp 235–266

    Google Scholar 

  34. Yates RB, Neto BR (1999) Modern information retrieval. ACM Press, New York

    Google Scholar 

  35. Glover E Pennock DM, Lawrence S, Krovetz R (2002) Inferring hierarchical descriptions. In: Proceedings of the 20th international conference on information and knowledge management (CIKM), McLean, Virginia, pp 507–514

  36. Wu HC, Luk RWP, Wong KF, Kwok KL (2008) Interpreting TF-IDF term weights as making relevance decisions. ACM Trans Inf Syst 26:1–37

    Article  Google Scholar 

  37. Rodríguez MA, Egenhofer MJ (2003) Determining semantic similarity among entity classes from different ontologies. IEEE Trans Knowl Data Eng 15:442–456

    Article  Google Scholar 

  38. Tang J, Li J, Liang B, Huang X, Li Y, Wang K (2006) Using bayesian decision for ontology mapping. Web Semantics: Science, Services and Agents on the World Wide Web 4:243–262

    Article  Google Scholar 

  39. Fellbaum C (1998) Wordnet: an electronic lexical database. MIT Press, Cambridge

    MATH  Google Scholar 

  40. Li Y, Bandar ZA, McLean D (2003) An approach for measuring semantic similarity between words using multiple information source. IEEE Trans Knowl Data Eng 15:871–882

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sheng-Tun Li.

Additional information

The work of S.-T. Li is partly supported by National Science Council, Taiwan under contract NSC98-2410-H-006-007.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, ST., Tsai, FC. Constructing tree-based knowledge structures from text corpus. Appl Intell 33, 67–78 (2010). https://doi.org/10.1007/s10489-010-0243-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-010-0243-2

Keywords

Navigation