Skip to main content
Log in

Building text-based temporally linked event network for scientific big data analytics

  • Original Article
  • Published:
Personal and Ubiquitous Computing Aims and scope Submit manuscript

Abstract

Events formulate the world of the human being and could be regarded as the semantic units in different granularities for information organization. Extracting events and temporal information from texts plays an important role for information analytics in big data because of the wide use of multilingual texts. This paper surveys existing research work on text-based event temporal resolution and reasoning including identification of events, temporal information resolutions of events in English and Chinese texts, the rule-based temporal relation reasoning between events and relevant temporal representations. For the scientific big data analytics, we point out the shortcomings of existing research work and give the argument about the future research work for advancing identification of events, establishment of temporal relations and reasoning of temporal relations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Agichtein E, Ganti V (2004) Mining reference tables for automatic text segmentation. In: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 20–29

  2. Allen JF (1983) Maintaining knowledge about temporal intervals. Commun ACM 26(11):832–843

    Article  MATH  Google Scholar 

  3. Allen JF (1984) Towards a general theory of action and time. Artif Intell 23(2):123–154

    Article  MATH  Google Scholar 

  4. Alonso O, Gertz M, Baeza-Yates R (2009) Clustering and exploring search results using timeline constructions. In: Proceedings of the 18th ACM conference on information and knowledge management. ACM, pp 97–106

  5. Appelt DE, Hobbs JR, Bear J, Israel D, Tyson M (1993) Fastus: a finite-state processor for information extraction from real-world text. IJCAI 93:1172–1178

    Google Scholar 

  6. Asher N (1993) Reference to abstract objects in discourse, vol 50. Springer, Berlin

    Google Scholar 

  7. Bach E (1986) The algebra of events. Linguist Philos 9(1):5–16

    Google Scholar 

  8. Bittar A (2010) Building a timebank for french: a reference corpus annotated according to the ISO-TimeMl standard. Ph.D. thesis, Paris 7

  9. Borthwick A, Sterling J, Agichtein E, Grishman R (1998) Exploiting diverse knowledge sources via maximum entropy in named entity recognition. In: Proceedings of the sixth workshop on very large corpora, vol 182

  10. Califf ME, Mooney RJ (2003) Bottom–up relational learning of pattern matching rules for information extraction. J Mach Learn Res 4:177–210

    MathSciNet  MATH  Google Scholar 

  11. Chambers N, Jurafsky D (2008) Jointly combining implicit constraints improves temporal ordering. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 698–706

  12. Chambers N, Wang S, Jurafsky D (2007) Classifying temporal relations between events. In: Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions. Association for Computational Linguistics, pp 173–176

  13. Chang AX, Manning CD (2012) Sutime: a library for recognizing and normalizing time expressions. In: LREC, pp 3735–3740

  14. Cheng Y, Asahara M, Matsumoto Y (2007) Constructing a temporal relation tagged corpus of chinese based on dependency structure analysis. In: 14th International symposium on temporal representation and reasoning. IEEE, pp 59–69

  15. Cheng Y, Asahara M, Matsumoto Y (2008) Use of event types for temporal relation identification in chinese text. In: IJCNLP, pp 31–38

  16. Choi Y, Cardie C, Riloff E, Patwardhan S (2005) Identifying sources of opinions with conditional random fields and extraction patterns. In: Proceedings of the conference on human language technology and empirical methods in natural language processing. Association for Computational Linguistics, pp 355–362

  17. Cunningham H, Maynard D, Bontcheva K, Tablan V (2002) Gate: an architecture for development of robust hlt applications. In: Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, pp 168–175

  18. Dakka W, Gravano L, Ipeirotis PG (2012) Answering general time-sensitive queries. IEEE Trans Knowl Data Eng 24(2):220–235

    Article  Google Scholar 

  19. Davidson D (1967) Causal relations. J Philos 64(21):691–703

    Article  Google Scholar 

  20. Elkhlifi A, Faiz R (2009) Automatic annotation approach of events in news articles. Int J Comput Inf Sci 7(1):40–50

    Google Scholar 

  21. Gaizauskas R, Wilks Y (1998) Information extraction: beyond document retrieval. J Doc 54(1):70–105

    Article  Google Scholar 

  22. Gao J, Li J, Cai Z, Gao H (2015) Composite event coverage in wireless sensor networks with heterogeneous sensors. In: 2015 IEEE conference on computer communications (INFOCOM). IEEE, pp 217–225

  23. Grishman R, Sundheim B (1996) Message understanding conference-6: a brief history. COLING 96:466–471

    Google Scholar 

  24. Hitzeman J, Moens M, Grover C (1995) Algorithms for analysing the temporal structure of discourse. In: Proceedings of the seventh conference on European chapter of the Association for computational linguistics. Morgan Kaufmann, pp 253–260

  25. Hovy D, Fan J, Gliozzo A, Patwardhan S, Welty C (2012) When did that happen? Linking events and relations to timestamps. In: Proceedings of the 13th conference of the European chapter of the Association for Computational Linguistics. Association for Computational Linguistics, pp 185–193

  26. Kamp H (1981) A theory of truth and semantic representation. Formal semantics-the essential readings, pp 189–222

  27. Kulkarni A, Teevan J, Svore KM, Dumais ST (2011) Understanding temporal query dynamics. In: Proceedings of the fourth ACM international conference on Web search and data mining. ACM, pp 167–176

  28. Lascarides A, Asher N, Oberlander J (1992) Inferring discourse relations in context. In: Proceedings of the 30th annual meeting on Association for Computational Linguistics. Association for Computational Linguistics, pp 1–8

  29. Li W, Wong KF (2002) A word-based approach for modeling and discovering temporal relations embedded in Chinese sentences. TALIP 1(3):173–206

    Article  Google Scholar 

  30. Li W, Wong KF, Cao G, Yuan C (2004) Applying machine learning to chinese temporal relation resolution. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics. Association for Computational Linguistics, p 582

  31. Li X, Croft WB (2003) Time-based language models. In: Proceedings of the twelfth international conference on Information and knowledge management. ACM, pp 469–475

  32. Longacre RE (1996) The grammar of discourse. Springer, Berlin

    Book  Google Scholar 

  33. Malouf R (2002) A comparison of algorithms for maximum entropy parameter estimation. In: Proceedings of the 6th conference on Natural language learning. Association for Computational Linguistics, vol 20, pp 1–7

  34. Mani I, Schiffman B, Zhang J (2003) Inferring temporal ordering of events in news. In: Proceedings of the 2003 conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the proceedings of HLT-NAACL 2003–short papers-volume 2. Association for Computational Linguistics, pp 55–57

  35. Mani I, Verhagen M, Wellner B, Lee CM, Pustejovsky J (2006) Machine learning of temporal relations. In: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the Association for Computational Linguistics, ACL-44. Association for Computational, Linguistics, Stroudsburg, PA, USA. doi:10.3115/1220175.1220270, pp 753–760

  36. Mani I, Verhagen M, Wellner B, Lee CM, Pustejovsky J (2006) Machine learning of temporal relations. In: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp 753–760

  37. Marthi B, Milch B, Russell S (2003) First-order probabilistic models for information extraction. In: IJCAI 2003 workshop on learning statistical models from relational data

  38. Mazur P, Dale R (2010) Wikiwars: A new corpus for research on temporal expressions. In: Proceedings of the 2010 conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 913–922

  39. McCallum A, Freitag D, Pereira FC (2000) Maximum entropy markov models for information extraction and segmentation. In: ICML, pp 591–598

  40. Peng F, McCallum A (2006) Information extraction from research papers using conditional random fields. Inf Process Manag 42(4):963–979

    Article  Google Scholar 

  41. Pustejovsky J, Castano JM, Ingria R, Sauri R, Gaizauskas RJ, Setzer A, Katz G, Radev DR (2003) TimeML: robust specification of event and temporal expressions in text. New Direct Quest Answ 3:28–34

    Google Scholar 

  42. Pustejovsky J, Hanks P, Sauri R, See A, Gaizauskas R, Setzer A, Radev D, Sundheim B, Day D, Ferro L et al (2003) The timebank corpus. In: Corpus linguistics, vol 2003, p 40

  43. Qu P, Zhang J, Yao C, Zeng W (2016) Identifying long tail term from large-scale candidate pairs for big data-oriented patent analysis. Concurrency and computation: practice and experience doi:10.1002/cpe.3792

  44. Reichenbach H (2005) The tenses of verbs. In: Mani I, Pustejovsky J, Gaizauskas R (eds) The language of time: a reader. Oxford University Press, Oxford, UK, pp 71–78

  45. Riloff E et al (1993) Automatically constructing a dictionary for information extraction tasks. In: AAAI, pp 811–816

  46. Saurii R, Littman J, Knippen B, Gaizauskas R, Setzer A, Pustejovsky J (2005) TimeML annotation guidelines

  47. Schilder F, Katz G, Pustejovsky J (2007) Annotating, extracting and reasoning about time and events. In: Annotating, extracting and reasoning about time and events. Springer, pp 1–6

  48. Setzer A, Gaizauskas R (2002) On the importance of annotating temporal event-event relations in text. In: Proceedings of LREC workshop on annotation standards for temporal information in natural language, pp 52–60

  49. Seymore K, McCallum A, Rosenfeld R (1999) Learning hidden markov model structure for information extraction. In: AAAI-99 workshop on machine learning for information extraction, pp 37–42

  50. Strötgen J, Gertz M (2010) Heideltime: High quality rule-based extraction and normalization of temporal expressions. In: Proceedings of the 5th international workshop on semantic evaluation. Association for Computational Linguistics, pp 321–324

  51. Sun Y, Bie R, Zhang J (2016) Measuring semantic-based structural similarity in multi-relational networks. Int J Data Wareh Min 12(1):20–33

    Article  Google Scholar 

  52. Sun Y, Jara AJ (2014) An extensible and active semantic model of information organizing for the internet of things. Pers Ubiquit Comput 18(8):1821–1833

    Article  Google Scholar 

  53. Sun Y, Lu C, Bie R, Zhang J (2016) Semantic relation computing theory and its application. J Netw Comput Appl 59:219–229

    Article  Google Scholar 

  54. Sun Y, Song H, Jara AJ, Bie R (2016) Internet of things and big data analytics for smart and connected communities. IEEE Access 4:766–773

    Article  Google Scholar 

  55. Sun Y, Yan H, Lu C, Bie R, Zhou Z (2014) Constructing the web of events from raw data in the web of things. Mob Inf Syst 10(1):105–125

    Google Scholar 

  56. Sun Y, Yan H, Zhang J, Xia Y, Wang S, Bie R, Tian Y (2014) Organizing and querying the big sensing data with event-linked network in the internet of things. Int J Distrib Sens Netw. doi:10.1155/2014/218521

  57. UzZaman N, Llorens H, Allen J, Derczynski L, Verhagen M, Pustejovsky J (2012) Tempeval-3: evaluating events, time expressions, and temporal relations. arXiv preprint arXiv:1206.5333

  58. Vendler Z (1957) Verbs and times. Philos Rev 143–160

  59. Verhagen M, Gaizauskas R, Schilder F, Hepple M, Katz G, Pustejovsky J (2007) Semeval-2007 task 15: tempeval temporal relation identification. In: Proceedings of the 4th international workshop on semantic evaluations. Association for Computational Linguistics, pp 75–80

  60. Verhagen M, Gaizauskas R, Schilder F, Hepple M, Moszkowicz J, Pustejovsky J (2009) The tempeval challenge: identifying temporal relations in text. Lang Resour Eval 43(2):161–179

    Article  Google Scholar 

  61. Verhagen M, Sauri R, Caselli T, Pustejovsky J (2010) Semeval-2010 task 13: Tempeval-2. In: Proceedings of the 5th international workshop on semantic evaluation. Association for Computational Linguistics, pp 57–62

  62. Vilain MB (1982) A system for reasoning about time. In: AAAI, pp 197–201

  63. Whorf BL (1940) Science and linguistics

  64. Yoshikawa K, Riedel S, Asahara M, Matsumoto Y (2009) Jointly identifying temporal relations with markov logic. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP. Association for Computational Linguistics, vol 1, pp 405–413

  65. Zhang J, Sun Y (2012) Managing resources in internet of things with semantic hyper-network model. In: Proceedings of the 2012 IEEE 21st international workshop on enabling technologies: infrastructure for collaborative enterprises. IEEE Computer Society, pp 318–323

  66. Zhang J, Yao C, Qu P, Sun Y (2015) Text-based event temporal resolution and reasoning for information analytics in big data. In: 2015 international conference on identification, information and knowledge in the internet of things. IEEE, pp 78–81

  67. Zhang NR (2001) Hidden markov models for information extraction. Technical report. Stanford Natural Language Processing Group

Download references

Acknowledgments

This research work was partially supported by National Natural Science of China (Grant Nos. 71503240 and 61371185), Humanities and Social Sciences of Ministry of Education Planning Fund (Grant No. 16YJA710007) and the National Key Technology R&D Program (Grant No. 2015BAH25F01).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yunchuan Sun.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, J., Yao, C., Sun, Y. et al. Building text-based temporally linked event network for scientific big data analytics. Pers Ubiquit Comput 20, 743–755 (2016). https://doi.org/10.1007/s00779-016-0940-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00779-016-0940-x

Keywords

Navigation