Skip to main content

Pattern Acquisition for Chinese Named Entity Recognition: A Supervised Learning Approach

  • Conference paper
  • First Online:
Advances in Information Systems (ADVIS 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2457))

Included in the following conference series:

Abstract

This paper presents a supervised learning method for the pattern acquisition for handcrafted rule-based Chinese named entity recognition systems. We automatically extracted low frequency patterns based on the predefined high-frequency patterns and manually validated the new patterns and outputs of terms. The experiments show that the number of person names extracted from the Chinese Treebank increased by 14.3% after the use of the new patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fei Xia: The Part-Of-Speech Tagging Guidelines for the Penn Chinese Treebank (3.0). October 17, 2000.

    Google Scholar 

  2. Andrew Borthwick: A Maximum Entropy Approach to Named Entity Recognition, Ph.D thesis. (1999). New York University. Department of Computer Science, Courant Institute.

    Google Scholar 

  3. Finkelstein-Landau, Michal and Morin, Emmanuel (1999): Extracting Semantic Relationships between Terms: Supervised vs. Unsupervised Methods, In proceedings of International Workshop on Ontological Engineering on the Global Information Infrastructure, Dagstuhl Castle, Germany, May 99, pp. 71–80.

    Google Scholar 

  4. Emmanual Morin, Christian Jacquemin: Project Corpus-Based Semantic Links on a Thesaurus, (ACL99), Pages 389–390, University of Maryland. June 20–26, 1999

    Google Scholar 

  5. Marti Hearst: Automated Discovery of WordNet Relations, in WordNet: An Electronic Lexical Database, Christiane Fellbaum (ed.), and MIT Press, 1998.

    Google Scholar 

  6. Marti Hearst, 1992: Automatic acquisition of hyponyms from large text corpora. In COLING’92, pages 539–545, Nantes.

    Google Scholar 

  7. Kaiyin Liu: Chinese Text Segmentation and Part of Speech Tagging, Chinese Business Publishing company, 2000

    Google Scholar 

  8. Douglas Appelt: Introduction to Information Extraction Technology, http://www.ai.sri.com/~appelt/ie-tutorial/IJCAI99.pdf

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fang, X., Sheng, H. (2002). Pattern Acquisition for Chinese Named Entity Recognition: A Supervised Learning Approach. In: Yakhno, T. (eds) Advances in Information Systems. ADVIS 2002. Lecture Notes in Computer Science, vol 2457. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36077-8_16

Download citation

  • DOI: https://doi.org/10.1007/3-540-36077-8_16

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00009-9

  • Online ISBN: 978-3-540-36077-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics