Abstract:
Automatic Term Extraction is an important issue in Natural Language Processing. This paper presents a new approach of terminology extraction combining with machine learni...Show MoreMetadata
Abstract:
Automatic Term Extraction is an important issue in Natural Language Processing. This paper presents a new approach of terminology extraction combining with machine learning based on cascaded conditional random fields and corpus-based statistical model. In this approach, firstly, the low-layer and high-layer conditional random fields (CRFs) are used to extract the simple and compound terminologies respectively. Then, Domain Relevance (DR) and Domain Consensus (DC) degrees are calculated to acquire the final domain terminologies. Experimental results show that the precision, recall and F-score are 83.29%, 80.75%, 82.01% respectively. The comparison with CRFs and MI+T-value shows that the proposed method for extracting terminology is effective.
Date of Conference: 12-17 July 2015
Date Added to IEEE Xplore: 01 October 2015
ISBN Information: