Abstract
This paper presents a method for discovering homographs from textual corpora. The proposed method first extracts an N-partite graph expression of word dependencies, and then, generates near-synonymous word clusters by enumerating and combining maximum complete sub-components on the graph. The homographs are identified as the words that belong to multiple clusters. In our experiment, we applied the method to Japanese newspaper articles and detected 531 homograph candidates, of which 31 were confirmed to be actual homographs.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Hattori, S.: Iwanami course philosophy XI language. Iwanami Shoten, Tokyo (1968) (in Japanese)
Avis, D., Fukuda, K.: Reverse Search for Enumeration. Discrete Appl. Math. 65, 21–45 (1996)
Uno, T.: A Practical Fast Algorithm for Finding Clusters of Huge Networks. IPSJ SIGNotes Algorithms No. 088-001. Information Processing Society of Japan, Tokyo (2002) (in Japanese)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nakawatase, H., Aizawa, A. (2003). Discovering Homographs Using N-Partite Graph Clustering. In: Grieser, G., Tanaka, Y., Yamamoto, A. (eds) Discovery Science. DS 2003. Lecture Notes in Computer Science(), vol 2843. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39644-4_39
Download citation
DOI: https://doi.org/10.1007/978-3-540-39644-4_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20293-6
Online ISBN: 978-3-540-39644-4
eBook Packages: Springer Book Archive