skip to main content
10.1145/1871437.1871730acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Incorporating terminology evolution for query translation in text retrieval with association rules

Published: 26 October 2010 Publication History

Abstract

Time-stamped documents such as newswire articles, blog posts and other web-pages are often archived online. When these archives cover long spans of time, the terminology within them could undergo significant changes. Hence, when users pose queries pertaining to historical information, over such documents, the queries need to be translated, taking into account these temporal changes, to provide accurate responses to users. For example, a query on Sri Lanka should automatically retrieve documents with its former name Ceylon. We call such concepts SITACs, i.e., Semantically Identical Temporally Altering Concepts. In order to discover SITACs, we propose an approach based on a novel framework constituting an integration of natural language processing, association rule mining, and contextual similarity as a learning technique. The proposed approach has been experimented with real data and has been found to yield good results with respect to efficiency and accuracy.

References

[1]
Berberich, K. et al. "Bridging the Terminology Gap in Web Archive Search!", SIGMOD's WebDB 2009.
[2]
Gutenberg Corpus: "U.S. Presidential Inaugural Addresses" www.gutenberg.net
[3]
Hasegawa, T., et al. "Discovering Relations among Named Entities from Large Corpora", ACL 2004.
[4]
Lesk, M. E. "Automatic sense disambiguation using machine readable dictionaries", International Conference on Systems Documentation, 1986.
[5]
Lin, D. "Using syntactic dependency as a local context to resolve word sense ambiguity", ACL 1997.
[6]
Lu, X. A. et al. (1994). "Query expansion/reduction and its impact on information retrieval effectiveness", TREC-3.
[7]
Mihalcea, et al. "Page rank on semantic networks, application to word sense disambiguation", COLING 2004.
[8]
Miller G.A et al. "Contextual correlates of semantic similarity", Language and Cognitive Processes, 6(1).
[9]
Norvag, K., et al. "Mining Association Rules in Temporal Document Collections", Dept. of Computer and Information, Systems (2006), NTNU, Norway.
[10]
Parthasarathy, S., et al. "Incremental and Inter-active Sequence Mining", CIKM 1999/
[11]
Strehl A. et al. "Impact of Similarity Measures on Web-page Clustering", AAAI 2000.
[12]
Stanford University, USA, Minipar.
[13]
University of Waikato, New Zealand, WEKA.

Cited By

View all
  • (2024)A Text-Based Predictive Maintenance Approach for Facility Management Requests Utilizing Association Rule Mining and Large Language ModelsMachine Learning and Knowledge Extraction10.3390/make60100136:1(233-258)Online publication date: 26-Jan-2024
  • (2021)Learning Latent Variable Models with Discriminant RegularizationAgents and Artificial Intelligence10.1007/978-3-030-71158-0_18(378-398)Online publication date: 14-Mar-2021
  • (2019)Across-Time Comparative Summarization of News ArticlesProceedings of the Twelfth ACM International Conference on Web Search and Data Mining10.1145/3289600.3291008(735-743)Online publication date: 30-Jan-2019
  • Show More Cited By

Index Terms

  1. Incorporating terminology evolution for query translation in text retrieval with association rules

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '10: Proceedings of the 19th ACM international conference on Information and knowledge management
      October 2010
      2036 pages
      ISBN:9781450300995
      DOI:10.1145/1871437
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 26 October 2010

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. association rules
      2. contextual similarity
      3. natural language processing
      4. ranking
      5. search
      6. web ir

      Qualifiers

      • Poster

      Conference

      CIKM '10

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 17 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Text-Based Predictive Maintenance Approach for Facility Management Requests Utilizing Association Rule Mining and Large Language ModelsMachine Learning and Knowledge Extraction10.3390/make60100136:1(233-258)Online publication date: 26-Jan-2024
      • (2021)Learning Latent Variable Models with Discriminant RegularizationAgents and Artificial Intelligence10.1007/978-3-030-71158-0_18(378-398)Online publication date: 14-Mar-2021
      • (2019)Across-Time Comparative Summarization of News ArticlesProceedings of the Twelfth ACM International Conference on Web Search and Data Mining10.1145/3289600.3291008(735-743)Online publication date: 30-Jan-2019
      • (2019)Mapping Entity Sets in News Archives Across TimeData Science and Engineering10.1007/s41019-019-00102-3Online publication date: 9-Sep-2019
      • (2019)Typicality-Based Across-Time Mapping of Entity Sets in Document ArchivesDatabase Systems for Advanced Applications10.1007/978-3-030-18576-3_21(350-366)Online publication date: 24-Apr-2019
      • (2017)Temporal Analog Retrieval using Transformation over Dual Hierarchical StructuresProceedings of the 2017 ACM on Conference on Information and Knowledge Management10.1145/3132847.3132917(717-726)Online publication date: 6-Nov-2017
      • (2017)Is Tofu the Cheese of Asia?Proceedings of the 26th International Conference on World Wide Web Companion10.1145/3041021.3055132(1033-1042)Online publication date: 3-Apr-2017
      • (2016)Causal Relationship Detection in Archival Collections of Product Reviews for Understanding Technology EvolutionACM Transactions on Information Systems10.1145/293775235:1(1-41)Online publication date: 11-Aug-2016
      • (2016)Accounting for Language Changes Over Time in Document Similarity SearchACM Transactions on Information Systems10.1145/293467135:1(1-26)Online publication date: 3-Sep-2016
      • (2016)Temporal Information RetrievalProceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval10.1145/2911451.2914805(1235-1238)Online publication date: 7-Jul-2016
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media