Bootstrapping of Semantic Relation Extraction for a Morphologically Rich Language: Semi-Supervised Learning of Semantic Relations

Balaji Jagan, Ranjani Parthasarathi, T V. Geetha

Source Title: International Journal on Semantic Web and Information Systems (IJSWIS)15(1)

ISSN: 1552-6283|EISSN: 1552-6291|EISBN13: 9781522564447|DOI: 10.4018/IJSWIS.2019010106

MLA

Jagan, Balaji, et al. "Bootstrapping of Semantic Relation Extraction for a Morphologically Rich Language: Semi-Supervised Learning of Semantic Relations." IJSWIS vol.15, no.1 2019: pp.119-149. http://doi.org/10.4018/IJSWIS.2019010106

APA

Jagan, B., Parthasarathi, R., & Geetha, T. V. (2019). Bootstrapping of Semantic Relation Extraction for a Morphologically Rich Language: Semi-Supervised Learning of Semantic Relations. International Journal on Semantic Web and Information Systems (IJSWIS), 15(1), 119-149. http://doi.org/10.4018/IJSWIS.2019010106

Chicago

Jagan, Balaji, Ranjani Parthasarathi, and T V. Geetha. "Bootstrapping of Semantic Relation Extraction for a Morphologically Rich Language: Semi-Supervised Learning of Semantic Relations," International Journal on Semantic Web and Information Systems (IJSWIS) 15, no.1: 119-149. http://doi.org/10.4018/IJSWIS.2019010106

Export Reference

Favorite Full-Issue Download

View Full Text HTML

View Full Text PDF

Abstract

This article focuses on the use of a bootstrapping approach for the extraction of semantic relations that exist between two different concepts in a Tamil text. The proposed system, bootstrapping approach to semantic UNL relation extraction (BASURE) extracts generic relations that exist between different components of a sentence by exploiting the morphological richness of Tamil. Tamil is essentially a partially free word order language which means that semantic relations that exist between the concepts can occur anywhere in the sentence not necessarily in a fixed order. Here, the authors use Universal Networking Language (UNL), an Interlingua framework, to represent the word-based features and aim to define UNL semantic relations that exist between any two constituents in a sentence. The morphological suffix, lexical category and UNL semantic constraints associated with a word are defined as tuples of the pattern used for bootstrapping. Most systems define the initial set of seed patterns manually. However, this article uses a rule-based approach to obtain word-based features that form tuples of the patterns. A bootstrapping approach is then applied to extract all possible instances from the corpus and to generate new patterns. Here, the authors also introduce the use of UNL ontology to discover the semantic similarity between semantic tuples of the pattern, hence, to learn new patterns from the text corpus in an iterative manner. The use of UNL Ontology makes this approach general and domain independent. The results obtained are evaluated and compared with existing approaches and it has been shown that this approach is generic, can extract all sentence based semantic UNL relations and significantly increases the performance of the generic semantic relation extraction system.

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.

Username or email: *

Password: *

Forgot individual login password?

Create individual account

Bootstrapping of Semantic Relation Extraction for a Morphologically Rich Language: Semi-Supervised Learning of Semantic Relations

MLA

APA

Chicago

Export Reference

Abstract

Request Access