DOI: 10.1145/3308558.3313435
research-article

A Novel Unsupervised Approach for Precise Temporal Slot Filling from Incomplete and Noisy Temporal Contexts

Published: 13 May 2019

ABSTRACT

Temporal slot filling (TSF) is the task of extracting the values (also called facts) of specific attributes for a given entity from text and finding the time points at which those values were valid. Finding precise time points is challenging when the temporal contexts in the text are incomplete and noisy. In this work, we propose an unsupervised approach with two modules that mutually enhance each other: a reliability estimator for fact extractors, conditioned on the temporal contexts, and a fact trustworthiness estimator based on the extractors' reliability. The iterative learning process reduces the noise in the extractions. Experiments demonstrate that our approach, with this novel design, can accurately and efficiently extract precise temporal facts from newspaper corpora.
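The abstract does not state the update rules of the two modules, so the following is only a minimal sketch of a generic truth-discovery-style iteration, not the authors' algorithm. It assumes that each claim is a hypothetical (extractor, temporal context type, slot, value) tuple, that candidate values for the same slot compete with each other, that trustworthiness is a reliability-weighted vote share, and that reliability is the mean trustworthiness of the facts a source claims. The function name `estimate`, the context labels, and the toy data are all illustrative assumptions.

```python
from collections import defaultdict


def estimate(claims, n_iters=10):
    """Alternate between fact trustworthiness and extractor reliability.

    claims: iterable of (extractor, context, slot, value) tuples, where a
    "source" is an extractor conditioned on a temporal context type.
    Returns (reliability, trust) dictionaries.
    """
    claims = list(claims)
    sources = {(e, c) for e, c, _, _ in claims}
    # Assumption: every (extractor, context) pair starts equally reliable.
    reliability = {s: 0.5 for s in sources}
    trust = {}
    for _ in range(n_iters):
        # Module 2 (assumed form): trust of a fact is its share of the
        # reliability-weighted votes among competing values of its slot.
        votes = defaultdict(float)
        slot_total = defaultdict(float)
        for e, c, slot, value in claims:
            votes[(slot, value)] += reliability[(e, c)]
            slot_total[slot] += reliability[(e, c)]
        trust = {
            (slot, value): (v / slot_total[slot] if slot_total[slot] > 0 else 0.0)
            for (slot, value), v in votes.items()
        }
        # Module 1 (assumed form): reliability of a source is the mean
        # trust of the facts it claimed.
        sums = defaultdict(float)
        counts = defaultdict(int)
        for e, c, slot, value in claims:
            sums[(e, c)] += trust[(slot, value)]
            counts[(e, c)] += 1
        reliability = {s: sums[s] / counts[s] for s in sources}
    return reliability, trust


# Toy data: two slots, each asking "who held the attribute in that year".
s2010 = ("USA", "president", 2010)
s2005 = ("USA", "president", 2005)
claims = [
    ("p1", "explicit_year", s2010, "Obama"),
    ("p2", "explicit_year", s2010, "Obama"),
    ("p1", "doc_date", s2010, "Bush"),
    ("p1", "explicit_year", s2005, "Bush"),
    ("p2", "explicit_year", s2005, "Bush"),
    ("p1", "doc_date", s2005, "Obama"),
]
reliability, trust = estimate(claims)
```

In this toy run, the value backed by two sources in each slot gains trust, which in turn raises the reliability of the sources that claimed it and lowers that of the source that relied on the document date. This mirrors the mutual enhancement described in the abstract only in spirit.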


  • Published in

    WWW '19: The World Wide Web Conference
    May 2019
    3620 pages
    ISBN: 9781450366748
    DOI: 10.1145/3308558

    Copyright © 2019 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
