Abstract
There is an increasing demand for rapid content filtering in relation to topics like digital forensics for legal cases, cybersecurity, and social media conduct monitoring. While there have been significant advances in algorithms and frameworks for media processing, this task requires an ensemble of tools and algorithms that are not well-understood by human analysts, thereby reducing their trustworthiness. In this paper, we present a novel perspective on this problem through the development of an intelligent system that generates reports from large email datasets in the form of short stories. The stories generated by the system are based on identifiable plot structures in popular media. These structures are used as semantic sensemaking templates to organize data for further filtering and triage. The end-to-end system, accessible through an interactive dashboard, incorporates unsupervised annotation modules (such as speech acts and sentiment), topic discovery, communication network analysis, character personality profiles, and automated text and visualization generators. This emerging application prototype is developed and internally deployed in collaboration with analysts and researchers actively working in this area.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barot, C., Potts, C.M., Young, R.M.: A tripartite plan-based model of narrative for narrative discourse generation. In: Proceedings of the Joint Workshop on Intelligent Narrative Technologies and Social Believability in Games at the 11th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, pp. 2–8 (2015)
Barot, C., et al.: Bardic: generating multimedia narrative reports for game logs. In: Working Notes of the AIIDE Workshop on Intelligent Narrative Technologies (2017)
Battad, Z., Si, M.: Apply storytelling techniques for describing time-series data. In: Rouse, R., Koenitz, H., Haahr, M. (eds.) ICIDS 2018. LNCS, vol. 11318, pp. 483–488. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-04028-4_56
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)
Booker, C.: The Seven Basic Plots: Why We Tell Stories. A&C Black (2004)
Brehmer, M., Lee, B., Bach, B., Riche, N.H., Munzner, T.: Timelines revisited: a design space and considerations for expressive storytelling. IEEE Trans. Vis. Comput. Graph. 23(9), 2151–2164 (2017)
Engel, O.: Clusters, recipients and reciprocity: extracting more value from email communication networks. Proc. Soc. Behav. Sci. 10, 172–182 (2011)
Erete, S., Ryou, E., Smith, G., Fassett, K.M., Duda, S.: Storytelling with data: examining the use of data by non-profit organizations. In: Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, CSCW 2016, pp. 1273–1283. Association for Computing Machinery, New York (2016)
Fikes, R.E., Nilsson, N.J.: STRIPS: a new approach to the application of theorem proving to problem solving. Artif. Intell. 2(3–4), 189–208 (1971)
Freeman, H.: Shape description via the use of critical points. Pattern Recogn. 10(3), 159–166 (1978). https://doi.org/10.1016/0031-3203(78)90024-9. https://www.sciencedirect.com/science/article/pii/0031320378900249. The Proceedings of the IEEE Computer Society Conference
Nahian, M.S.A., Tasrin, T., Gandhi, S., Gaines, R., Harrison, B.: A hierarchical approach for visual storytelling using image description. In: Cardona-Rivera, R.E., Sullivan, A., Young, R.M. (eds.) ICIDS 2019. LNCS, vol. 11869, pp. 304–317. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-33894-7_30
Klimt, B., Yang, Y.: The Enron corpus: a new dataset for email classification research. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 217–226. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-30115-8_22
Liu, S., Zhou, M.X., Pan, S., Song, Y., Qian, W., Cai, W., Lian, X.: TIARA: interactive, topic-based visual text summarization and analysis. ACM Trans. Intell. Syst. Technol. (TIST) 3(2), 1–28 (2012)
McKenna, S., Henry Riche, N., Lee, B., Boy, J., Meyer, M.: Visual narrative flow: exploring factors shaping data visualization story reading experiences. Comput. Graph. Forum 36(3), 377–387 (2017)
Meehan, J.R.: TALE-SPIN, an interactive program that writes stories. In: IJCAI, vol. 77, pp. 91–98 (1977)
Oard, D., Webber, W., Kirsch, D.A., Golitsynskiy, S.: Avocado research email collection LDC2015T03. Linguistic Data Consortium, Philadelphia (2015). https://doi.org/10.35111/wqt6-jg60
Robertson, J., Harrison, B., Jhala, A.: Interactive summarization for data filtering and triage. In: The Thirty-Third International Flairs Conference (2020)
Staff, E.: Timeline: a chronology of Enron corp. New York Times (2006). https://www.nytimes.com/2006/01/18/business/worldbusiness/timeline-a-chronology-of-enron-corp.html
Vesanto, J., Hollmén, J.: An automated report generation tool for the data understanding phase. In: Abraham, A., Jain, L., van der Zwaag, B.J. (eds.) Innovations in Intelligent Systems. Studies in Fuzziness and Soft Computing, vol. 140, pp. 203–219. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-39615-4_8
Wang, Y., Sun, Z., Zhang, H., Cui, W., Xu, K., Ma, X., Zhang, D.: DataShot: automatic generation of fact sheets from tabular data. IEEE Trans. Vis. Comput. Graph. 26(1), 895–905 (2020)
Wilson, G., Banzhaf, W.: Discovery of email communication networks from the Enron corpus with a genetic algorithm using social network analysis. In: 2009 IEEE Congress on Evolutionary Computation, pp. 3256–3263 (2009)
Wongsuphasawat, K., et al.: Voyager 2: augmenting visual analysis with partial view specifications. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 2648–2659 (2017)
Young, R.M., Pollack, M.E., Moore, J.D.: Decomposition and causality in partial-order planning. In: AIPS, pp. 188–194 (1994)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Potts, C.M., Jhala, A. (2021). Narraport: Narrative-Based Interactions and Report Generation with Large Datasets. In: Mitchell, A., Vosmeer, M. (eds) Interactive Storytelling. ICIDS 2021. Lecture Notes in Computer Science(), vol 13138. Springer, Cham. https://doi.org/10.1007/978-3-030-92300-6_11
Download citation
DOI: https://doi.org/10.1007/978-3-030-92300-6_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92299-3
Online ISBN: 978-3-030-92300-6
eBook Packages: Computer ScienceComputer Science (R0)