skip to main content
10.1145/2108616.2108647acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

ChronoSeeker: search engine for future and past events

Authors Info & Claims
Published:14 January 2010Publication History

ABSTRACT

In this paper, we propose an on-demand search engine called ChronoSeeker, which allows users to find past/future events based on their interest. Our goal is providing a search engine which can collect as many future/past events as possible relevant to user's query in obtaining various future scenarios considering both predictions and histories. Two technical issues are treated, (1) efficient search method for event information and (2) accurate filtering method for removing noises from search results. To search for event information effectively, our system expands a user query by some typical expressions related to event information such as year expressions, temporal modifiers and context terms. To remove noisy information, we selected five types of features for a machine learning technique to classify candidates into event information or not. Our experiment showed that filtering performance achieved an 85% F-measure, and that query expansion can collect dozens of times more CEs than those without expansion.

References

  1. R. Baeza-Yates. Searching the Future, ACM SIGIR Workshop MF/IR, 2005.Google ScholarGoogle Scholar
  2. P. Brun, H. Kawai, K. Kunieda, K. Yamada. ChronoSeeker: Future Opinion Extraction and Classification. The 2009 IEEE/WIC/ACM International Conference on Web Intelligence, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. D. Choudhury, H. Ssundaram, A. John and D. D. Seligman. Can Blog Communication Dynamics be Correlated with Stock Market Activity?, Proceedings of HT2008, pp. 55--60, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Gruhl, R. V. Guha, R. Kumar, J. Novak, A. Tomkins. The Predictive Power of Online Chatter. Proceedings of SIGKDD2005, pp. 78--87, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Itaya, T. Konishi, R. Tanaka, S. Doi, K. Yamada. Experiments on Personal Opinion Expression and Consensus Building using "Future Chronicle". Second International Symposium on Universal Communication, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Jatowt, K. Kanazawa, S. Oyama, K. Tanaka. Supporting analysis of future-related information in news archives and the web. 9th ACM/IEEE-CS joint conference on Digital libraries, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Kim, H. Alani, W. Hall, P. H. Lewis, D. E. Millard, N. R. Shadbolt, M. J. Weal. Artequakt. Generating Tailored Biographies with Automatically Annotated Fragments from the Web, Semantic Authoring. Annotation and Knowledge Markup Workshop in the 15th European Conference on Artificial Intelligence, 2002.Google ScholarGoogle Scholar
  8. R. Kimura, S. Oyama, K. Tanaka. Automatic Collection of Personal Histories for Generating Who's Who from the Web. (in Japanese), DBSJ Letters, 5(2), 2006.Google ScholarGoogle Scholar
  9. Y. Liu, X. Huang, A. An and X. Yu. ARSA: a Sentimentaware Model for Predicting Sales Performance Using Blogs. Proceedings of SIGIR 2007, pp. 607--614, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. I. Mani, J. Pustejovsky, B. Sundheim. Introduction to the special issue on temporal information processing. ACM Transactions on Asian Language Information Processing, 3(1):1--10, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. Mishne, N. Glance. Predicting Movie Sales from Blogger Sentiment. Proceedings of the Spring Symposia on Computational Approaches to Analyzing Weblogs, 2006.Google ScholarGoogle Scholar
  12. M. Pasca, D. Lin, J. Bigham, A. Lifchits, A. Jain. Organizing and Searching the World Wide Web of Facts - Step One: the One-Million Fact Extraction Challenge. The 21st National Conference on Artificial Intelligence, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Pepe, J. Bollen. Between Conjecture and Memento:Shaping a Collective Emotional Perception of the Future. Proceedings of the AAAI 2008 Spring Symposium on Emotion, Personality and Social Behavior, 2008.Google ScholarGoogle Scholar
  14. A. Podelko. Multiple Dimensions of Performance Requirements. 33rd International Computer Measurement Group Conference, 2007.Google ScholarGoogle Scholar
  15. V. N. Vapnik. The Nature of Statistical Learning Theory. Springer, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Wolfers, E. Zitzewitz, Prediction Markets. Journal of Economic Perspectives, 18(2):107--126, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  17. B. Wuthrich, D. Permunetilleke, S. Leung, V. Cho, J. Zhang, W. Lam. Daily Prediction of Major Stock Indices from textual WWW Data. Proceedings of SIGKDD1998, pp. 364--368, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  18. Zona Research. The Need for Speed II. Zona Market Bulletin, Issue 05, 2001.Google ScholarGoogle Scholar

Index Terms

  1. ChronoSeeker: search engine for future and past events

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ICUIMC '10: Proceedings of the 4th International Conference on Uniquitous Information Management and Communication
      January 2010
      550 pages
      ISBN:9781605588933
      DOI:10.1145/2108616

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 14 January 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate251of941submissions,27%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader