Skip to main content

Narraport: Narrative-Based Interactions and Report Generation with Large Datasets

  • Conference paper
  • First Online:
Interactive Storytelling (ICIDS 2021)

Abstract

There is an increasing demand for rapid content filtering in relation to topics like digital forensics for legal cases, cybersecurity, and social media conduct monitoring. While there have been significant advances in algorithms and frameworks for media processing, this task requires an ensemble of tools and algorithms that are not well-understood by human analysts, thereby reducing their trustworthiness. In this paper, we present a novel perspective on this problem through the development of an intelligent system that generates reports from large email datasets in the form of short stories. The stories generated by the system are based on identifiable plot structures in popular media. These structures are used as semantic sensemaking templates to organize data for further filtering and triage. The end-to-end system, accessible through an interactive dashboard, incorporates unsupervised annotation modules (such as speech acts and sentiment), topic discovery, communication network analysis, character personality profiles, and automated text and visualization generators. This emerging application prototype is developed and internally deployed in collaboration with analysts and researchers actively working in this area.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Barot, C., Potts, C.M., Young, R.M.: A tripartite plan-based model of narrative for narrative discourse generation. In: Proceedings of the Joint Workshop on Intelligent Narrative Technologies and Social Believability in Games at the 11th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, pp. 2–8 (2015)

    Google Scholar 

  2. Barot, C., et al.: Bardic: generating multimedia narrative reports for game logs. In: Working Notes of the AIIDE Workshop on Intelligent Narrative Technologies (2017)

    Google Scholar 

  3. Battad, Z., Si, M.: Apply storytelling techniques for describing time-series data. In: Rouse, R., Koenitz, H., Haahr, M. (eds.) ICIDS 2018. LNCS, vol. 11318, pp. 483–488. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-04028-4_56

    Chapter  Google Scholar 

  4. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)

    MATH  Google Scholar 

  5. Booker, C.: The Seven Basic Plots: Why We Tell Stories. A&C Black (2004)

    Google Scholar 

  6. Brehmer, M., Lee, B., Bach, B., Riche, N.H., Munzner, T.: Timelines revisited: a design space and considerations for expressive storytelling. IEEE Trans. Vis. Comput. Graph. 23(9), 2151–2164 (2017)

    Article  Google Scholar 

  7. Engel, O.: Clusters, recipients and reciprocity: extracting more value from email communication networks. Proc. Soc. Behav. Sci. 10, 172–182 (2011)

    Article  Google Scholar 

  8. Erete, S., Ryou, E., Smith, G., Fassett, K.M., Duda, S.: Storytelling with data: examining the use of data by non-profit organizations. In: Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, CSCW 2016, pp. 1273–1283. Association for Computing Machinery, New York (2016)

    Google Scholar 

  9. Fikes, R.E., Nilsson, N.J.: STRIPS: a new approach to the application of theorem proving to problem solving. Artif. Intell. 2(3–4), 189–208 (1971)

    Article  Google Scholar 

  10. Freeman, H.: Shape description via the use of critical points. Pattern Recogn. 10(3), 159–166 (1978). https://doi.org/10.1016/0031-3203(78)90024-9. https://www.sciencedirect.com/science/article/pii/0031320378900249. The Proceedings of the IEEE Computer Society Conference

  11. Nahian, M.S.A., Tasrin, T., Gandhi, S., Gaines, R., Harrison, B.: A hierarchical approach for visual storytelling using image description. In: Cardona-Rivera, R.E., Sullivan, A., Young, R.M. (eds.) ICIDS 2019. LNCS, vol. 11869, pp. 304–317. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-33894-7_30

    Chapter  Google Scholar 

  12. Klimt, B., Yang, Y.: The Enron corpus: a new dataset for email classification research. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 217–226. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-30115-8_22

    Chapter  Google Scholar 

  13. Liu, S., Zhou, M.X., Pan, S., Song, Y., Qian, W., Cai, W., Lian, X.: TIARA: interactive, topic-based visual text summarization and analysis. ACM Trans. Intell. Syst. Technol. (TIST) 3(2), 1–28 (2012)

    Google Scholar 

  14. McKenna, S., Henry Riche, N., Lee, B., Boy, J., Meyer, M.: Visual narrative flow: exploring factors shaping data visualization story reading experiences. Comput. Graph. Forum 36(3), 377–387 (2017)

    Article  Google Scholar 

  15. Meehan, J.R.: TALE-SPIN, an interactive program that writes stories. In: IJCAI, vol. 77, pp. 91–98 (1977)

    Google Scholar 

  16. Oard, D., Webber, W., Kirsch, D.A., Golitsynskiy, S.: Avocado research email collection LDC2015T03. Linguistic Data Consortium, Philadelphia (2015). https://doi.org/10.35111/wqt6-jg60

  17. Robertson, J., Harrison, B., Jhala, A.: Interactive summarization for data filtering and triage. In: The Thirty-Third International Flairs Conference (2020)

    Google Scholar 

  18. Staff, E.: Timeline: a chronology of Enron corp. New York Times (2006). https://www.nytimes.com/2006/01/18/business/worldbusiness/timeline-a-chronology-of-enron-corp.html

  19. Vesanto, J., Hollmén, J.: An automated report generation tool for the data understanding phase. In: Abraham, A., Jain, L., van der Zwaag, B.J. (eds.) Innovations in Intelligent Systems. Studies in Fuzziness and Soft Computing, vol. 140, pp. 203–219. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-39615-4_8

    Chapter  Google Scholar 

  20. Wang, Y., Sun, Z., Zhang, H., Cui, W., Xu, K., Ma, X., Zhang, D.: DataShot: automatic generation of fact sheets from tabular data. IEEE Trans. Vis. Comput. Graph. 26(1), 895–905 (2020)

    Article  Google Scholar 

  21. Wilson, G., Banzhaf, W.: Discovery of email communication networks from the Enron corpus with a genetic algorithm using social network analysis. In: 2009 IEEE Congress on Evolutionary Computation, pp. 3256–3263 (2009)

    Google Scholar 

  22. Wongsuphasawat, K., et al.: Voyager 2: augmenting visual analysis with partial view specifications. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 2648–2659 (2017)

    Google Scholar 

  23. Young, R.M., Pollack, M.E., Moore, J.D.: Decomposition and causality in partial-order planning. In: AIPS, pp. 188–194 (1994)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Colin M. Potts .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Potts, C.M., Jhala, A. (2021). Narraport: Narrative-Based Interactions and Report Generation with Large Datasets. In: Mitchell, A., Vosmeer, M. (eds) Interactive Storytelling. ICIDS 2021. Lecture Notes in Computer Science(), vol 13138. Springer, Cham. https://doi.org/10.1007/978-3-030-92300-6_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-92300-6_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-92299-3

  • Online ISBN: 978-3-030-92300-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics