Abstract:
MICA is a fast and accurate dependency parser for English that uses an automatically LTAG derived from Penn Treebank (PTB) using the Chen's approach. However, there is no...Show MoreMetadata
Abstract:
MICA is a fast and accurate dependency parser for English that uses an automatically LTAG derived from Penn Treebank (PTB) using the Chen's approach. However, there is no semantic representation related to its grammar. On the other hand, XTAG grammar is a hand crafted LTAG that its elementary trees were enriched with the semantic representation by experts. The linguistic knowledge embedded in the XTAG grammar caused it to being used in wide variety of natural language applications. However, the current XTAG parser is not as fast and accurate as well as the MICA parser. Generating an XTAG derivation tree from a MICA dependency structure could make a bridge between these two notions and gets the benefits of both models. Also, by having this conversion, the applications that use the XTAG parser, may get the helps from MICA parser too. In addition, it can enrich the MICA's grammar by semantic representation of XTAG grammar. In this paper, an unsupervised sequence tagger that maps any sequence of MICA elementary trees onto an XTAG elementary trees sequence is presented. The proposed sequence tagger is based on a Hidden Markov Model (HMM) proceeded by an EM-based algorithm for setting its initial parameters values. The trained model is tested on a part of PTB and about 82% accuracy for the detected sequences is achieved.
Published in: Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010)
Date of Conference: 21-23 August 2010
Date Added to IEEE Xplore: 30 September 2010
ISBN Information: