Text Mining for the Semantic Web

Grobelnik, Marko; Mladenić, Dunja; Witbrock, Michael

doi:10.1007/978-1-4899-7687-1_835

Definition

Text mining methods allow textual data to be incorporated into applications of semantic technologies on the Web. These techniques are appropriate when some of the data needed for a Semantic Web use scenario are in textual form. They range from simple processing of text to reduce vocabulary size, through shallow natural language processing to construct new semantic features or information retrieval to select relevant texts for analysis, to complex methods involving integrated visualization of semantic information, semantic search, semiautomatic ontology construction, and large-scale reasoning.
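
As an illustration of the simplest end of this range, the following Python sketch shows one way text might be processed to reduce vocabulary size, here by lowercasing, stripping punctuation, and dropping stop words. It is a minimal sketch under stated assumptions, not the authors' method: the stop-word list, the function names tokenize and reduce_vocabulary, and the two sample sentences are all hypothetical and chosen only for this example.

```python
import re

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"the", "a", "an", "of", "to", "in", "is", "are", "and", "for"}


def tokenize(text):
    """Lowercase the text and split it into purely alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def reduce_vocabulary(docs):
    """Return (raw_vocab, reduced_vocab) for a list of document strings.

    raw_vocab is built by naive whitespace splitting; reduced_vocab applies
    lowercasing, punctuation stripping, and stop-word removal.
    """
    raw_vocab = set()
    reduced_vocab = set()
    for doc in docs:
        raw_vocab.update(doc.split())
        reduced_vocab.update(t for t in tokenize(doc) if t not in STOP_WORDS)
    return raw_vocab, reduced_vocab


# Hypothetical sample documents.
docs = [
    "The Semantic Web is a web of data.",
    "Ontologies are the backbone of the semantic web.",
]

raw, reduced = reduce_vocabulary(docs)
print(len(raw), len(reduced))
print(sorted(reduced))
```

In this toy case the naive vocabulary treats "Web", "web", and "web." as three different terms, while the reduced vocabulary merges them into one. Real systems would typically add stemming or lemmatization on top of such a step.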

Motivation and Background

Semantic Web applications usually involve deep structured knowledge integrated by means of some kind of ontology. Text mining methods, on the other hand, support the discovery of structure in data and effectively support semantic technologies on data-driven tasks such as (semi)automatic ontology...

Author information

Authors and Affiliations

Artificial Intelligence Laboratory, Jožef Stefan Institute, Ljubljana, Slovenia
Marko Grobelnik & Dunja Mladenić
Cycorp Inc, 7718 Wood Hollow Dr, 78731, Austin, TX, USA
Michael Witbrock

Corresponding author

Correspondence to Marko Grobelnik.

Editor information

Editors and Affiliations

The University of New South Wales, Sydney, NSW, Australia
Claude Sammut
Faculty of Information Technology, Monash University, Melbourne, VIC, Australia
Geoffrey I. Webb

About this entry

Cite this entry

Grobelnik, M., Mladenić, D., Witbrock, M. (2017). Text Mining for the Semantic Web. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_835

DOI: https://doi.org/10.1007/978-1-4899-7687-1_835
Published: 14 April 2017
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7685-7
Online ISBN: 978-1-4899-7687-1
eBook Packages: Computer Science; Reference Module Computer Science and Engineering
