A corpora-based detection of stylistic inconsistencies of text in the targeted subgenre

Hashimoto, Kiyota; Takeuchi, Kazuhiro; Ando, Hideaki

doi:10.1007/s10015-010-0840-5

A corpora-based detection of stylistic inconsistencies of text in the targeted subgenre

Original Article
Published: 31 December 2010

Volume 15, pages 486–490, (2010)
Cite this article

Artificial Life and Robotics Aims and scope Submit manuscript

Kiyota Hashimoto¹,
Kazuhiro Takeuchi² &
Hideaki Ando²

97 Accesses
1 Citation
Explore all metrics

Abstract

Any text, be it written by humans or generated by computers, must choose a specific style and be consistent. Textual style is based on the genre to which the text should belong. Each genre has its unique characteristics based on the style of narration, the topic, the target audience, and the author’s intentions. Thus, in order to write or produce better passages, a finer-grained style checker sensitive to differences among genres is to be developed. We propose a foundational method for this purpose: the flexible accumulation of data of the style features of atomic expressions based on any kind of contrastive sets of texts representing the good and bad examples for the targeted genre, an analysis of a given text using the style features, and a visualization to effectively help the author detect anomalies for the targeted genre. Any type of genre-sensitive style checker will implement our method or similar ones.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

How to Check for Plagiarism?

Testing of detection tools for AI-generated text

Article Open access 25 December 2023

Evaluating the efficacy of AI content detection tools in differentiating between human and AI-generated text

Article Open access 01 September 2023

References

Schiffrin D, Tannen D, Hamilton HE (2001) The handbook of discourse analysis. Blackwell, Oxford
Google Scholar
Strunk W Jr, White EB (2000) The elements of style. Longman, New York
Google Scholar
Williams JM (2008) Style: the basics of clarity and grace. Longman, New York
Google Scholar
Japanese Wikipedia dumped file http://download.wikimedia.org/jawiki/20071013/jawiki-20071013-pages-articles.xml.bz2
Uchiyama M, Chujo K (2007) Linking word distribution to technical vocabulary. Technical Report B, College of Industrial Technology, Nihon University, vol 40, pp 13–21
Google Scholar
Murata M, Isahara H (2002) Automatic detection of mis-spelled Japanese expressions using a new method for automatic extraction of negative examples based on positive examples. IEICE Trans Inf Syst E85-D(9):1416–1424
Google Scholar
Araki T, et al (2000) A method for detecting and correcting characters wrongly substituted, deleted, or inserted in Japanese strings using m-th order Markov model. Trans Inst Electron Inf Commun Eng D-II J83-D-II(6):1516–1528
Google Scholar

Download references

Author information

Authors and Affiliations

School of Humanities and Social Sciences, Osaka Prefecture University, 1-1 Gakuen-cho, Naka-ku, Sakai, 599-8531, Japan
Kiyota Hashimoto
Department of Engineering Informatics, Osaka Electro-Communication University, Neyagawa, Japan
Kazuhiro Takeuchi & Hideaki Ando

Authors

Kiyota Hashimoto
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiro Takeuchi
View author publications
You can also search for this author in PubMed Google Scholar
Hideaki Ando
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kiyota Hashimoto.

Additional information

This work was presented in part at the 15th International Symposium on Artificial Life and Robotics, Oita, Japan, February 4–6, 2010

About this article

Cite this article

Hashimoto, K., Takeuchi, K. & Ando, H. A corpora-based detection of stylistic inconsistencies of text in the targeted subgenre. Artif Life Robotics 15, 486–490 (2010). https://doi.org/10.1007/s10015-010-0840-5

Download citation

Received: 30 June 2010
Accepted: 30 June 2010
Published: 31 December 2010
Issue Date: December 2010
DOI: https://doi.org/10.1007/s10015-010-0840-5

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A corpora-based detection of stylistic inconsistencies of text in the targeted subgenre

Abstract

Access this article

Similar content being viewed by others

How to Check for Plagiarism?

Testing of detection tools for AI-generated text

Evaluating the efficacy of AI content detection tools in differentiating between human and AI-generated text

References

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Key words

Navigation

A corpora-based detection of stylistic inconsistencies of text in the targeted subgenre

Abstract

Access this article

Similar content being viewed by others

How to Check for Plagiarism?

Testing of detection tools for AI-generated text

Evaluating the efficacy of AI content detection tools in differentiating between human and AI-generated text

References

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Share this article

Key words

Search

Navigation