Abstract
Selecting a subset of linguistic summaries and providing them in a user-friendly and compact form is a latent issue in the field of Linguistic Data Summarization. The paper proposes a method for filtering the most useful subset, for a given decision problem, from a set of composite linguistic summaries. Those summaries embody Evidence, Contrast or Emphasis relations, inspired by the Rhetorical Structure Theory. The summaries’ usefulness is determined according to the relevance of the attributes contained in each one. The strategy followed is based on first finding the Evidence relation whose nucleus contains the better possible representation of the problem attributes, then searching for a Contrast relation and an Emphasis relation that share that nucleus. The method output is a scheme that synthesizes and combines the texts of the three relations. The paper provides an illustrative example in which the most useful relations are found from a dataset of 63 crimes to solve a case of bank document forgery.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yager, R.R., Reformat, M.Z., To, N.D.: Drawing on the iPad to input fuzzy sets with an application to linguistic data science. Inf. Sci. (Ny) 479, 277–291 (2019). https://doi.org/10.1016/J.INS.2018.11.048
Yager, R.R.: A new approach to the summarization of data. Inf. Sci. (Ny) 28, 69–86 (1982)
Zadeh, L.A.: A computational approach to fuzzy quantifiers in natural languages. Comput. Math. Appl. 9(1), 149–184 (1983). https://doi.org/10.1016/0898-1221(83)90013-5
Pupo, I., Piñero, P.Y., Bello, R.E., García, R., Villavicencio, N.: Linguistic data summarization: a systematic review. In: Piñero Pérez, P.Y., Bello Pérez, R.E., Kacprzyk, J. (eds.) Artificial Intelligence in Project Management and Making Decisions, pp. 3–21. Springer International Publishing, Cham (2022). https://doi.org/10.1007/978-3-030-97269-1_1
Kuhn, T.: A survey and classification of controlled natural languages. Comput. Linguist. 40(1), 121–170 (2014)
Zadeh, L.A.: A prototype-centered approach to adding deduction capability to search engines-the concept of protoform. In: 2002 Annual Meeting of the North American Fuzzy Information Processing Society Proceedings, pp. 523–525 (2002)
Kacprzyk, J., Zadrozny, S.: Linguistic database summaries and their protoforms: towards natural language based knowledge discovery tools. Inform Sci (Ny) 173(4), 281–304 (2005). https://doi.org/10.1016/j.ins.2005.03.002
Ramos-Soto, A., Martin-Rodilla, P.: Enriching linguistic descriptions of data: a framework for composite protoforms. Fuzzy Sets Syst. 407, 1–26 (2021). https://doi.org/10.1016/j.fss.2019.11.013
Cornejo, M.E., Medina, J., Rubio-Manzano, C.: Linguistic descriptions of data via fuzzy formal concept analysis. In: Harmati, I.Á., Kóczy, L.T., Medina, J., Ramírez-Poussa, E. (eds.) Computational Intelligence and Mathematics for Tackling Complex Problems 3, pp. 119–125. Springer International Publishing, Cham (2022). https://doi.org/10.1007/978-3-030-74970-5_14
To, N.D., Reformat, M.Z., Yager, R.R.: Question-answering system with linguistic summarization. In: 2021 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 1–8 (2021). https://doi.org/10.1109/FUZZ45933.2021.9494389.
Trivino, G., Sugeno, M.: Towards linguistic descriptions of phenomena. Int. J. Approx. Reason. 54(1), 22–34 (2013). https://doi.org/10.1016/j.ijar.2012.07.004
Pérez, I., Piñero, P.Y., Al-subhi, S.H., Mahdi, G.S.S., Bello, R.E.: Linguistic data summarization with multilingual approach. In: Piñero Pérez, P.Y., Bello Pérez, R.E., Kacprzyk, J. (eds.) Artificial Intelligence in Project Management and Making Decisions, pp. 39–64. Springer International Publishing, Cham (2022). https://doi.org/10.1007/978-3-030-97269-1_3
Rodríguez, C.R., Peña, M., Zuev, D.S.: Extracting composite summaries from qualitative data. In: Heredia, Y.H., Núñez, V.M., Shulcloper, J.R. (eds.) Progress in Artificial Intelligence and Pattern Recognition: 7th International Workshop on Artificial Intelligence and Pattern Recognition, IWAIPR 2021, Havana, Cuba, October 5–7, 2021, Proceedings, pp. 260–269. Springer International Publishing, Cham (2021). https://doi.org/10.1007/978-3-030-89691-1_26
Rodríguez Rodríguez, C.R., Zuev, D.S., Peña Abreu, M.: Algorithms for linguistic description of categorical data. In: Piñero Pérez, P.Y., Bello Pérez, R.E., Kacprzyk, J. (eds.) UCIENCIA 2021. SCI, vol. 1035, pp. 79–97. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-97269-1_5
Mann, W.C., Thompson, S.A.: Rhetorical structure theory: toward a functional theory of text organization. Text 8(3), 243–281 (1988)
Hou, S., Zhang, S., Fei, C.: Rhetorical structure theory: a comprehensive review of theory, parsing methods and applications. Expert Syst. Appl. 157, 113421 (2020)
Rodríguez, C.R., Amoroso, Y., Zuev, D.S., Peña, M., Zulueta, Y.: M-LAMAC: a model for linguistic assessment of mitigating and aggravating circumstances of criminal responsibility using computing with words. Artif. Intell. Law (2023). https://doi.org/10.1007/s10506-023-09365-8
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Rodríguez Rodríguez, C.R., Peña Abreu, M., Zuev, D.S., Amoroso Fernández, Y., Zulueta Véliz, Y. (2024). A Novel Method for Filtering a Useful Subset of Composite Linguistic Summaries. In: Hernández Heredia, Y., Milián Núñez, V., Ruiz Shulcloper, J. (eds) Progress in Artificial Intelligence and Pattern Recognition. IWAIPR 2023. Lecture Notes in Computer Science, vol 14335. Springer, Cham. https://doi.org/10.1007/978-3-031-49552-6_16
Download citation
DOI: https://doi.org/10.1007/978-3-031-49552-6_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-49551-9
Online ISBN: 978-3-031-49552-6
eBook Packages: Computer ScienceComputer Science (R0)