Chinese Entity Synonym Extraction from the Web

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 928)

Abstract

Entity synonyms play an important role in natural language processing applications such as query expansion and question answering. In web text, synonyms exhibit three main distributional characteristics: (1) they appear in parallel structures; (2) they occur with specific patterns in sentences; and (3) they are distributed in similar contexts. These characteristics are largely complementary. Existing methods, such as pattern-based and context-based methods, consider only one characteristic for synonym extraction and ignore this complementarity. To increase accuracy and recall, we propose a novel method that integrates the three characteristics to extract synonyms from the web, building an Entity Synonym Network (ESN) to incorporate synonymous knowledge. To further improve accuracy, we treat synonym detection as a ranking problem and use the Spreading Activation model as the ranking mechanism to detect hard noise in the ESN. Experimental results show that our method achieves better accuracy and recall than state-of-the-art methods.
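The abstract does not say how the ESN is constructed or which activation rule is used, so the following is only a minimal sketch of the general idea: candidate pairs from the three evidence types are merged into one weighted graph, and a generic spreading activation pass from a seed entity ranks its neighbours so that weakly supported candidates fall to the bottom. The function names, the per-source weights, the decay factor, the step count, and the toy entity pairs are all assumptions made for illustration and are not taken from the paper.

```python
from collections import defaultdict


def build_network(evidence, weights):
    """Merge candidate pairs from several extractors into one weighted, undirected graph.

    `evidence` maps a source name to a list of (entity, candidate) pairs;
    `weights` maps the same source names to an assumed confidence weight.
    """
    graph = defaultdict(dict)
    for source, pairs in evidence.items():
        w = weights[source]
        for a, b in pairs:
            graph[a][b] = graph[a].get(b, 0.0) + w
            graph[b][a] = graph[b].get(a, 0.0) + w
    return graph


def spreading_activation(graph, seed, decay=0.5, steps=3):
    """Spread activation from `seed`; each hop passes on a decayed share of energy.

    A node splits its outgoing energy among neighbours in proportion to edge weight.
    """
    activation = {seed: 1.0}
    frontier = {seed: 1.0}
    for _ in range(steps):
        next_frontier = defaultdict(float)
        for node, energy in frontier.items():
            neighbours = graph.get(node, {})
            total = sum(neighbours.values())
            if total == 0:
                continue
            for neighbour, w in neighbours.items():
                next_frontier[neighbour] += energy * decay * w / total
        for node, energy in next_frontier.items():
            activation[node] = activation.get(node, 0.0) + energy
        frontier = next_frontier
    return activation


def rank_candidates(graph, seed, **kwargs):
    """Return the candidates for `seed`, sorted by descending activation."""
    scores = spreading_activation(graph, seed, **kwargs)
    scores.pop(seed, None)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data (hypothetical): three evidence sources for one seed entity.
evidence = {
    "parallel": [("Peking University", "PKU")],
    "pattern": [("Peking University", "PKU"), ("Peking University", "Beida")],
    "context": [("Peking University", "Tsinghua University"), ("PKU", "Beida")],
}
# Assumed weights: context similarity is treated as the noisiest signal.
weights = {"parallel": 1.0, "pattern": 1.0, "context": 0.3}

graph = build_network(evidence, weights)
ranked = rank_candidates(graph, "Peking University")
for name, score in ranked:
    print(f"{name}\t{score:.3f}")
```

In this toy run the candidate supported only by context similarity ("Tsinghua University") receives the lowest score, which is the kind of hard noise a ranking threshold could then filter out.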

Corresponding author

Correspondence to Xiuxia Ma.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Ma, X., Luo, X., Huang, S., Guo, Y. (2020). Chinese Entity Synonym Extraction from the Web. In: Xu, Z., Choo, KK., Dehghantanha, A., Parizi, R., Hammoudeh, M. (eds) Cyber Security Intelligence and Analytics. CSIA 2019. Advances in Intelligent Systems and Computing, vol 928. Springer, Cham. https://doi.org/10.1007/978-3-030-15235-2_54
