CCRFs: Cascaded conditional random fields for Chinese POS tagging | IEEE Conference Publication | IEEE Xplore

CCRFs: Cascaded conditional random fields for Chinese POS tagging


Abstract:

We present CCRFs (cascaded conditional random fields): a cascaded approach to scale conditional random fields (CRFs) for Chinese POS tagging (labeling). General CRFs work...Show More

Abstract:

We present CCRFs (cascaded conditional random fields): a cascaded approach to scale conditional random fields (CRFs) for Chinese POS tagging (labeling). General CRFs worked well on POS tagging, but met difficulty when dealing with a large training dataset and tag set because of high computation cost for training. CCRFs organize all tags in a hierarchy and run CRFs on each node of the hierarchy. In CCRFs framework, similar tags were treated as the same one in an upper layer, while they were distinguished in lower layer with more information from upper layer. The analysis of computation complexity shows that CCRFs can highly reduce both space complexity and time complexity for training. The experiments show that CCRFs can deal with large data sets from SIGHAN 2008 with common PCs, which cannot be handled by general CRFs under the same computational condition. Furthermore, we try to keep the good performance of CCRFs for POS tagging by selecting proper features for hierarchical POS tags clustering and introducing more rich and accurate features in the training and tagging phases. The final results show that CCRFs outperform maximum entropy model on all the three test datasets, and outperform enhanced maximum entropy model on the two of the three datasets.
Date of Conference: 24-27 September 2009
Date Added to IEEE Xplore: 06 November 2009
ISBN Information:
Conference Location: Dalian, China

Contact IEEE to Subscribe

References

References is not available for this document.