ABSTRACT
Nowadays, it is almost a standard issue to generate summaries of texts automatically. In contrast, it is still a problem to identify the differences in the statements of the two publications. For the most part, this still requires a human being to read and evaluate at least excerpts of the relevant passages. Finding a so-called text differentiation with appropriate tools is becoming an increasingly interesting and important task to effectively cope with the daily flood of information on the WWW. For years, co-occurrence graphs have been a proven means of deriving statements of various kinds from texts. So-called text- representing centroids (TRC's) has often been an effective tool for identifying, comparing and categorizing texts or sections. The present article examines how a different form of co-occurrence graphs can take place and be helpful. First, different co-occurrence graphs are built from a larger corpus and various individual texts or text groups. Subsequently, the calculated difference graphs can be used to create summaries that precisely characterize the differences between texts. Experimental results show that this new method works well.
- Cancho, R. F. i., and Solé, R. V. (2001). The Small World of Human Language. Proc. R. Soc. Lond. B 268, 2261–2265. doi:10.1098/rspb.2001.1800Google ScholarCross Ref
- Unger, Herwig & Kubek, Mario & Ruamsuk, Yanakorn & Mingkhwan, A. (2022). A Concept for Recommender Systems Based on Text-Representing Centroids. 10.1007/978-3-030-90936-9_2.Google Scholar
- Supaporn Simcharoen and Herwig Unger. The brain: WebEngine version 2.0. In The Autonomous Web, chapter 4, pages 51-69. Springer, first edition, 2021.Google Scholar
- Jiawei Han, Micheline Kamber, Jian Pei. Data Mining Trends and Research Frontiers, Data Mining (Third Edition), Morgan Kaufmann, 2012, Pages 39-82, ISBN 9780123814791, doi.org/10.1016/B978-0-12-381479-1.00002-2.Google ScholarCross Ref
- HA Maurer, F Kappe, B Zaka. Plagiarism-A survey, Journal of Universal Computer Science, vol. 12, no. 8 (2006), 1050-1084Google Scholar
- n.n. spaCY: industrial-strength natural language processing. Website and download from https://spacy.io/, last visited October 5t, 2022Google Scholar
- Kubek, Mario & Unger, Herwig. (2016). Centroid Terms as Text Representatives. 10.1145/2960811.2967150.Google Scholar
- Kubek, Mario & Boehme, Thomas & Unger, Herwig. (2017). Empiric Experiments with Text Representing Centroids. Lecture Notes on Information Theory. 5. 23-28. 10.18178/lnit.5.1.23-28.Google ScholarCross Ref
- Kubek,M. M., T. Bo ̈hme, and Unger,H. Spreading Activation: A Fast Calculation Method for Text Centroids. In Proceedings of the 3rd International Conference on Communication and Information Processing (ICCIP 2017), New York, NY, USA, ACM, 2017.Google ScholarDigital Library
- Y. Ruamsuk, A. Mingkhwan and H. Unger, "Generating and Evaluating Text Summarisations using Text-representing Centroids (TRC)," 2022 Research, Invention, and Innovation Congress: Innovative Electricals and Electronics (RI2C), 2022, pp. 330-333, doi: 10.1109/RI2C56397.2022.9910272.Google ScholarCross Ref
- Y Ruamsuk, W Tirasopitlert, A Mingkhwan, H Unger - Medical Recommendation System using Co-Occurrence Graphs NU. International Journal of Science, 2020Google Scholar
Index Terms
- DIFFSTRACT: distinguishing the content of texts
Recommendations
Adjacent vertex-distinguishing edge and total chromatic numbers of hypercubes
An adjacent vertex-distinguishing edge coloring of a simple graph G is a proper edge coloring of G such that incident edge sets of any two adjacent vertices are assigned different sets of colors. A total coloring of a graph G is a coloring of both the ...
Some bounds on the neighbor-distinguishing index of graphs
A proper edge coloring of a graph G is neighbor-distinguishing if any two adjacent vertices have distinct sets consisting of colors of their incident edges. The neighbor-distinguishing index of G is the minimum number a ' ( G ) of colors in a neighbor-...
Equitable neighbour-sum-distinguishing edge and total colourings
With any (not necessarily proper) edge k-colouring :E(G){1,,k} of a graph G, one can associate a vertex colouring given by (v)=ev(e). A neighbour-sum-distinguishing edge k-colouring is an edge colouring whose associated vertex colouring is proper. The ...
Comments