skip to main content
10.1145/1951365.1951442acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

SITAC: discovering semantically identical temporally altering concepts in text archives

Published: 21 March 2011 Publication History

Abstract

This paper demonstrates a system called SITAC based on our proposed approach to automate the discovery of concepts (called SITACs) in text sources that are identical semantically but alter their names over time. This system is developed to perform time-aware translation of queries over text corpora by incorporating terminology evolution, thus providing more accurate responses to users, e.g., query processing on Mumbai should automatically take into account its former name Bombay. The SITAC system constitutes a novel collaborative framework of natural language processing, association rule mining and contextual similarity.

References

[1]
Berberich, K. Bedathur, S., Sozio, M. and Weikum, G. "Bridging the Terminology Gap in Web Archive Search!", SIGMOD's WebDB 2009
[2]
Gutenberg EBook of U.S. Presidential Inaugural Addresses, www.gutenberg.net (Jan 2004), EBook Number 4938, Edition 11.
[3]
Hasegawa, T., Sekine S. and Grishman R., "Discovering Relations among Named Entities from Large Corpora", ACL (Aug 2004), pp. 415--422.
[4]
Jeh., G. and Widom., J., "SimRank: A Measure of Structural-Context Similarity". KDD (Jul 2002), pp. 538--543.
[5]
Kaluarachchi A., Varde A., Bedathur, S. Weikum, G., Peng, J. and Feldman, A., "Incorporating Terminology Evolution for Query Translation in Text Retrieval with Association Rules", CIKM (Oct 2010).
[6]
Kaluarachchi A., Varde A., Peng, J. and Feldman, A., "Intelligent Time Aware-Query Translation for Text Sources", AAAI (Jul 2010), pp. 1935--1936.
[7]
Lesh, N., Zaki, M. J. and Ogihara, M., "Mining Features for Sequence Classification". KDD (Aug 1999), pp. 342--346.
[8]
Norvag, K., Eriksen, T. O. and Skogstad, K. I, "Mining Association Rules in Temporal Document Collections", Dept. of Computer and Information, Systems (2006), NTNU, Norway.
[9]
Parthasarathy, S., Zaki, M. J., Ogihara, M., Dwarkadas, S., "Incremental and Interactive Sequence Mining". CIKM (Nov 1999), Kansas City, Missouri, pp. 251--258.
[10]
Roychoudhury D. and Varde A., "Terminology Evolution in Web and Text Mining Using Association Rules", Dept. of Computer Science (May 2009), Montclair State University, NJ.
[11]
Strehl A. Ghosh, J. and Mooney R., "Impact of Similarity Measures on Web-page Clustering", AAAI, (Jul 2000), pp. 58--64.

Cited By

View all
  • (2024)AI-Based Modeling for Textual Data on Solar Policies in Smart Energy Applications2024 15th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA62523.2024.10786713(1-8)Online publication date: 17-Jul-2024
  • (2024)Personalizing Text-to-Image Diffusion Models by Fine-Tuning Classification for AI ApplicationsIntelligent Systems and Applications10.1007/978-3-031-47721-8_44(642-658)Online publication date: 10-Jan-2024
  • (2022)Prediction Tool on Fine Particle Pollutants and Air Quality for Environmental EngineeringSN Computer Science10.1007/s42979-022-01068-23:3Online publication date: 7-Mar-2022
  • Show More Cited By

Index Terms

  1. SITAC: discovering semantically identical temporally altering concepts in text archives

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    EDBT/ICDT '11: Proceedings of the 14th International Conference on Extending Database Technology
    March 2011
    587 pages
    ISBN:9781450305280
    DOI:10.1145/1951365

    Sponsors

    • Microsoft Research: Microsoft Research

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 March 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. association rules
    2. information retrieval
    3. query processing
    4. ranking
    5. temporal changes
    6. text mining
    7. web search

    Qualifiers

    • Research-article

    Conference

    EDBT/ICDT '11
    Sponsor:
    • Microsoft Research
    EDBT/ICDT '11: EDBT/ICDT '11 joint conference
    March 21 - 24, 2011
    Uppsala, Sweden

    Acceptance Rates

    Overall Acceptance Rate 7 of 10 submissions, 70%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)AI-Based Modeling for Textual Data on Solar Policies in Smart Energy Applications2024 15th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA62523.2024.10786713(1-8)Online publication date: 17-Jul-2024
    • (2024)Personalizing Text-to-Image Diffusion Models by Fine-Tuning Classification for AI ApplicationsIntelligent Systems and Applications10.1007/978-3-031-47721-8_44(642-658)Online publication date: 10-Jan-2024
    • (2022)Prediction Tool on Fine Particle Pollutants and Air Quality for Environmental EngineeringSN Computer Science10.1007/s42979-022-01068-23:3Online publication date: 7-Mar-2022
    • (2016)Leveraging Cross-Domain Social Media Analytics to Understand TV Topics PopularityIEEE Computational Intelligence Magazine10.1109/MCI.2016.257251811:3(10-21)Online publication date: 1-Aug-2016
    • (2015)A Framework for Collocation Error Correction in Web Pages and Text DocumentsACM SIGKDD Explorations Newsletter10.1145/2830544.283054817:1(14-23)Online publication date: 29-Sep-2015
    • (2015)Visions and open challenges for a knowledge-based culturomicsInternational Journal on Digital Libraries10.1007/s00799-015-0139-115:2-4(169-187)Online publication date: 1-Apr-2015
    • (2013)Mining semantics for culturomicsProceedings of the 2013 international workshop on Mining unstructured big data using natural language processing10.1145/2513549.2513551(3-10)Online publication date: 28-Oct-2013
    • (2013)MeSoOnTVProceedings of the 24th ACM Conference on Hypertext and Social Media10.1145/2481492.2481518(208-213)Online publication date: 1-May-2013

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media