A named entity recognition dataset for Turkish | IEEE Conference Publication | IEEE Xplore

Abstract:

Named entity recognition is an important topic in natural language processing research. Studies of named entity recognition on Turkish texts are quite limited compared to those on other languages, and the lack of common datasets makes it harder to compare different approaches. This study presents a dataset of Turkish news articles annotated with named entities. The annotations cover the basic named entity types: person, location, and organization names. In addition, to serve as a reference for future studies, a rule-based named entity recognition system is evaluated on the final form of this dataset, and the corresponding evaluation results are presented. We expect this study to contribute to the advancement of named entity recognition research on Turkish texts.
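
The abstract does not say which metric was used for the evaluation. As an illustration only, the sketch below scores a system's output against gold annotations using exact-match precision, recall, and F1, a common choice for named entity recognition. The span format (start token, end token, type), the type labels, and the sample entities are all assumptions made for this example and are not taken from the dataset.

```python
from typing import Set, Tuple

# Hypothetical span format: (start_token, end_token, entity_type).
Entity = Tuple[int, int, str]


def score(gold: Set[Entity], predicted: Set[Entity]) -> Tuple[float, float, float]:
    """Return exact-match precision, recall, and F1 for predicted entities."""
    # A prediction counts only if its boundaries and type both match a gold entity.
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return precision, recall, f1


# Invented sample data: three gold entities, one of which the system mislabels.
gold = {(0, 2, "PERSON"), (5, 6, "LOCATION"), (9, 11, "ORGANIZATION")}
predicted = {(0, 2, "PERSON"), (5, 6, "ORGANIZATION"), (9, 11, "ORGANIZATION")}

precision, recall, f1 = score(gold, predicted)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

Exact matching is strict: a span with the right boundaries but the wrong type counts as both a false positive and a false negative. Other schemes that give partial credit exist, so scores are only comparable when the same scheme is used.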
Date of Conference: 16-19 May 2016
Date Added to IEEE Xplore: 23 June 2016
Conference Location: Zonguldak, Turkey