Abstract
Any text, be it written by humans or generated by computers, must choose a specific style and be consistent. Textual style is based on the genre to which the text should belong. Each genre has its unique characteristics based on the style of narration, the topic, the target audience, and the author’s intentions. Thus, in order to write or produce better passages, a finer-grained style checker sensitive to differences among genres is to be developed. We propose a foundational method for this purpose: the flexible accumulation of data of the style features of atomic expressions based on any kind of contrastive sets of texts representing the good and bad examples for the targeted genre, an analysis of a given text using the style features, and a visualization to effectively help the author detect anomalies for the targeted genre. Any type of genre-sensitive style checker will implement our method or similar ones.
Similar content being viewed by others
References
Schiffrin D, Tannen D, Hamilton HE (2001) The handbook of discourse analysis. Blackwell, Oxford
Strunk W Jr, White EB (2000) The elements of style. Longman, New York
Williams JM (2008) Style: the basics of clarity and grace. Longman, New York
Japanese Wikipedia dumped file http://download.wikimedia.org/jawiki/20071013/jawiki-20071013-pages-articles.xml.bz2
Uchiyama M, Chujo K (2007) Linking word distribution to technical vocabulary. Technical Report B, College of Industrial Technology, Nihon University, vol 40, pp 13–21
Murata M, Isahara H (2002) Automatic detection of mis-spelled Japanese expressions using a new method for automatic extraction of negative examples based on positive examples. IEICE Trans Inf Syst E85-D(9):1416–1424
Araki T, et al (2000) A method for detecting and correcting characters wrongly substituted, deleted, or inserted in Japanese strings using m-th order Markov model. Trans Inst Electron Inf Commun Eng D-II J83-D-II(6):1516–1528
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was presented in part at the 15th International Symposium on Artificial Life and Robotics, Oita, Japan, February 4–6, 2010
About this article
Cite this article
Hashimoto, K., Takeuchi, K. & Ando, H. A corpora-based detection of stylistic inconsistencies of text in the targeted subgenre. Artif Life Robotics 15, 486–490 (2010). https://doi.org/10.1007/s10015-010-0840-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10015-010-0840-5