Abstract:
The state-of-the-art Chinese word segmentation systems obtain high accuracy in domains like newswire but suffer a significant performance degradation when they are used i...Show MoreMetadata
Abstract:
The state-of-the-art Chinese word segmentation systems obtain high accuracy in domains like newswire but suffer a significant performance degradation when they are used in other domains such as patents and literature. In this paper, we propose a neural domain adaptation approach which works through cross-domain embeddings and uses unlabeled target domain data to improve the cross-domain performance. Experiment results show that the proposed method achieves competitive performance with previous Chinese word segmentation domain adaptation methods.
Date of Conference: 05-07 December 2017
Date Added to IEEE Xplore: 22 February 2018
ISBN Information: