skip to main content
10.1145/2484028.2484106acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

A novel topic model for automatic term extraction

Published: 28 July 2013 Publication History

Abstract

Automatic term extraction (ATE) aims at extracting domain-specific terms from a corpus of a certain domain. Termhood is one essential measure for judging whether a phrase is a term. Previous researches on termhood mainly depend on the word frequency information. In this paper, we propose to compute termhood based on semantic representation of words. A novel topic model, namely i-SWB, is developed to map the domain corpus into a latent semantic space, which is composed of some general topics, a background topic and a documents-specific topic. Experiments on four domains demonstrate that our approach outperforms the state-of-the-art ATE approaches.

References

[1]
Blei, D. M., Ng, A. Y., Jordan, M.I. 2003. Latent Dirichlet allocation, The Journal of Machine Learning Research, pp. 993--1022.
[2]
Chang J.-S. 2005. Domain specific word extraction from hierarchical web documents: a first step toward building lexicon trees from web corpora. In Proceedings of the 4th SIGHAN Workshop on Chinese Language Learning: 64--71.
[3]
Chemudugunta, C., Smyth, P. and Steyvers, M. 2006. Modeling general and specific aspects of documents with a probabilistic topic model. In NIPS 19, pp. 241--248.
[4]
Hisamitsu, T., Niwa, Y., and Tsujii, J. 2000. A method of measuring term representativeness - baseline method using co-occurrence distribution, COLING 2000, pp.320--326.
[5]
Frantzi, K., Ananiadou, S. and Mima, H. 2000. Automatic recognition of multi-word terms: the C-value/NC-value method. International Journal of Digital Libraries, 3(2): 117--132
[6]
Kageura, K. and Umino, B. 1996. Methods of automatic term recognition. Terminology, 3(2).

Cited By

View all
  • (2024)MOOC Concept Extraction from Chinese Texts: A Rule and Graph Propagation Based Method2024 11th International Conference on Behavioural and Social Computing (BESC)10.1109/BESC64747.2024.10780751(1-7)Online publication date: 16-Aug-2024
  • (2022)Towards Better Understanding with Uniformity and Explicit Regularization of Embeddings in Embedding-based Neural Topic Models2022 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN55064.2022.9892128(1-9)Online publication date: 18-Jul-2022
  • (2022)NMF-based approach to automatic term extractionExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.117179199:COnline publication date: 1-Aug-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
July 2013
1188 pages
ISBN:9781450320344
DOI:10.1145/2484028
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 July 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. term extraction
  2. termhood
  3. topic model

Qualifiers

  • Short-paper

Conference

SIGIR '13
Sponsor:

Acceptance Rates

SIGIR '13 Paper Acceptance Rate 73 of 366 submissions, 20%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)2
Reflects downloads up to 25 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)MOOC Concept Extraction from Chinese Texts: A Rule and Graph Propagation Based Method2024 11th International Conference on Behavioural and Social Computing (BESC)10.1109/BESC64747.2024.10780751(1-7)Online publication date: 16-Aug-2024
  • (2022)Towards Better Understanding with Uniformity and Explicit Regularization of Embeddings in Embedding-based Neural Topic Models2022 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN55064.2022.9892128(1-9)Online publication date: 18-Jul-2022
  • (2022)NMF-based approach to automatic term extractionExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.117179199:COnline publication date: 1-Aug-2022
  • (2022)LTWNN: A Novel Approach Using Sentence Embeddings for Extracting Diverse Concepts in MOOCsAI 2021: Advances in Artificial Intelligence10.1007/978-3-030-97546-3_62(763-774)Online publication date: 19-Mar-2022
  • (2021)An Unsupervised Semantic Model for Arabic/French Terminology ExtractionProceedings of International Conference on Emerging Technologies and Intelligent Systems10.1007/978-3-030-85990-9_5(49-59)Online publication date: 3-Dec-2021
  • (2019)Taxonomy Extraction for Customer Service Knowledge Base ConstructionSemantic Systems. The Power of AI and Knowledge Graphs10.1007/978-3-030-33220-4_13(175-190)Online publication date: 4-Nov-2019
  • (2018)ATR4SLanguage Resources and Evaluation10.5555/3270332.327037552:3(853-872)Online publication date: 1-Sep-2018
  • (2018)SemRe-RankACM Transactions on Knowledge Discovery from Data10.1145/320140812:5(1-41)Online publication date: 27-Jun-2018
  • (2018)My Approach = Your Apparatus?Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries10.1145/3197026.3197038(283-292)Online publication date: 23-May-2018
  • (2018)Weighting of Noun Phrases Based on Local Frequency of NounsRecent Advances on Soft Computing and Data Mining10.1007/978-3-319-72550-5_42(436-445)Online publication date: 12-Jan-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media