Abstract
In this paper, we introduce a web-based integrated text and knowledge mining aid system in which information extraction and intelligent information retrieval/database access are combined using term-oriented natural language tools. Our work is placed within the BioPath research project whose overall aim is to link information extraction to expressed sequence data validation. The aim of the tool is to extract automatically terms, to cluster them, and to provide efficient access to heterogeneous biological and genomic databases and collections of texts, all wrapped into a user friendly workbench enabling users to use a wide range of textual and non textual resources effortlessly. For the evaluation, automatic term recognition and clustering techniques were applied in a domain of molecular biology. Besides English, the same workbench has been used for term recognition and clustering in Japanese.
This research is supported by LION BioScience, http://www.lionbioscience.com
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ananiadou, S., Albert, S., Schuhmann, D.: “Evaluation of Automatic Term Recognition of Nuclear Receptors from Medline”, Genome Informatics Series, vol.11, 2000
Frantzi, K. T., Ananiadou, S., Mima, H.: “Automatic Recognition of Multi-Word Terms: the C-value/NC-value method”, International Journal on Digital Libraries Vol. 3, No. 2, pp.115–130, 2000
Kehagia, K., Ananiadou S.: “Term Variation as an Integrated Part of Automatic Term Extraction”, in Proc. of 22nd Conference for Greek Language, Thessaloniki, Greece, 2001 (forthcoming)
Koyama, T., Yoshiokka, M., Kageura, K.: “The Construction of a Lexically Motivated Corpus—the Problem with Defining Lexical Units”, in Proc. of LREC 1998, pp.1015–1019, Granada, Spain, 1998
Maynard, D., Ananiadou, S.: “Identifying Terms by Their Family and Friends”, in Proc. of 18th International Conference on Computational Linguistics, COLING 2000, pp.530–536, Luxembourg, 2000
Maynard, D., Ananiadou, S.: “TRUCKS: a Model for Automatic Term Recognition”, in Journal of Natural Language Processing, Vol. 8, No. 1, pp.101–125, 2001
MEDLINE, National Library of Medicine, http://www.ncbi.nlm.nih.gov/PubMed/
Mima, H., Ananiadou, S., Tsujii, J.: “A Web-based Integrated Knowledge Mining Aid System Using Term-oriented Natural Language Processing”, in Proc. of The 5th Natural Language Processing Pacific Rim Symposium, NLPRS’99, 13–18, 1999
Mima, H., Ananiadou, S.: An Application and Evaluation of the C/NC-value Approach for the Automatic Term Recognition of Multi-Word in Japanese’, in Terminology 6:2, 2001 (forthcoming)
Nenadic, G.: “Local Grammars and Parsing Coordinaton of Nouns in Serbo-Croatian”, in Text, Speech and Dialogue-TSD 2000, Lecture Notes in Artificial Intelligence 1902, Springer Verlag, 2000
Oi K., Sumita E., Iida H.: “Document Retrieval Method Using Semantic Similarity and Word Sense Disambiguation in Japanese”, Journal of Natural Language Processing, Vol. 4, No. 3, pp.51–70, 1997
Ushioda A.: “Hierarchical Clustering of Words”, In Proc. of COLING’ 96, Copenhagen, Denmark, 1996
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mima, H., Ananiadou, S., Nenadić, G. (2001). The ATRACT Workbench: Automatic Term Recognition and Clustering for Terms. In: Matoušek, V., Mautner, P., Mouček, R., Taušer, K. (eds) Text, Speech and Dialogue. TSD 2001. Lecture Notes in Computer Science(), vol 2166. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44805-5_16
Download citation
DOI: https://doi.org/10.1007/3-540-44805-5_16
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42557-1
Online ISBN: 978-3-540-44805-1
eBook Packages: Springer Book Archive