research-article

DIFFSTRACT: distinguishing the content of texts

Authors:

Yanakorn Ruamsuk
The Creative Lab, Faculty of Industrial Technology and Management, KMUTNB, Thailand
ORCID: 0000-0001-5528-3261

Anirach Mingkhwan
The Creative Lab, Faculty of Industrial Technology and Management, KMUTNB, Thailand
ORCID: 0000-0002-4862-6462

Herwig Unger
Department of Communication Networks, Fern, Germany
ORCID: 0000-0002-8818-3600

NLPIR '22: Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval, December 2022, Pages 31–35. https://doi.org/10.1145/3582768.3582787

Published: 27 June 2023

ABSTRACT

Generating summaries of texts automatically is now almost routine. In contrast, identifying the differences between the statements of two publications remains a problem. For the most part, this still requires a human to read and evaluate at least excerpts of the relevant passages. Finding such a text differentiation with appropriate tools is becoming an increasingly interesting and important task for coping effectively with the daily flood of information on the WWW. For years, co-occurrence graphs have been a proven means of deriving statements of various kinds from texts. So-called text-representing centroids (TRCs) have often been an effective tool for identifying, comparing and categorizing texts or sections. The present article examines how differences between co-occurrence graphs can be formed and how they can be helpful. First, co-occurrence graphs are built from a larger corpus and from various individual texts or text groups. Subsequently, the calculated difference graphs can be used to create summaries that precisely characterize the differences between texts. Experimental results show that this new method works well.
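The two steps named in the abstract (building co-occurrence graphs, then computing a difference graph) can be illustrated with a minimal sketch. The abstract does not specify how the graphs are weighted or subtracted, so everything here is an assumption made for illustration: sentence-level co-occurrence, raw counts as edge weights, and a difference graph that keeps only edges occurring more often in the text than in the reference. The function names `cooccurrence_graph`, `difference_graph` and `top_edges` are hypothetical and do not come from the paper.

```python
import re
from collections import Counter
from itertools import combinations


def cooccurrence_graph(text):
    """Count how often each unordered word pair shares a sentence.

    Assumption: sentence-level co-occurrence with raw counts as weights.
    """
    edges = Counter()
    for sentence in re.split(r"[.!?]+", text.lower()):
        words = sorted(set(re.findall(r"[a-z]+", sentence)))
        for pair in combinations(words, 2):
            edges[pair] += 1
    return edges


def difference_graph(text_edges, reference_edges):
    """Keep edges that are stronger in the text than in the reference.

    Assumption: plain count subtraction; Counter subtraction drops every
    edge whose resulting weight is zero or negative.
    """
    return text_edges - reference_edges


def top_edges(diff, k=3):
    """Return the k heaviest difference edges, ties broken alphabetically."""
    return sorted(diff, key=lambda pair: (-diff[pair], pair))[:k]


text = cooccurrence_graph("Graphs model words. Centroids summarize graphs.")
reference = cooccurrence_graph("Graphs model words. Neural networks summarize texts.")
for pair in top_edges(difference_graph(text, reference)):
    print(pair)
```

The edges that survive the subtraction point at word pairs that are characteristic of one text relative to the other; a difference summary could then be assembled from the sentences containing those pairs. With a large reference corpus, raw counts would likely need normalizing before subtraction, which this sketch leaves out.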

Published in

NLPIR '22: Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval
December 2022
241 pages
ISBN: 9781450397629
DOI: 10.1145/3582768

Copyright © 2022 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
co-occurrence graph
text differentiation
text-representing centroids
text-summarizing
Qualifiers
- research-article
- Research
- Refereed limited
