The ATRACT Workbench: Automatic Term Recognition and Clustering for Terms

Mima, Hideki; Ananiadou, Sophia; Nenadić, Goran

doi:10.1007/3-540-44805-5_16

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2166)

Included in the following conference series:

International Conference on Text, Speech and Dialogue

Abstract

In this paper, we introduce a web-based integrated text and knowledge mining aid system in which information extraction and intelligent information retrieval/database access are combined using term-oriented natural language tools. Our work is part of the BioPath research project, whose overall aim is to link information extraction to expressed sequence data validation. The aim of the tool is to extract terms automatically, to cluster them, and to provide efficient access to heterogeneous biological and genomic databases and collections of texts, all wrapped into a user-friendly workbench that lets users work with a wide range of textual and non-textual resources effortlessly. For the evaluation, automatic term recognition and clustering techniques were applied in the domain of molecular biology. Besides English, the same workbench has been used for term recognition and clustering in Japanese.

This research is supported by LION BioScience, http://www.lionbioscience.com

Author information

Authors and Affiliations

Dept. of Information Science, University of Tokyo, Japan
Hideki Mima
Computer Science, University of Salford, UK
Sophia Ananiadou & Goran Nenadić

Editor information

Editors and Affiliations

Dept. of Computer Science and Engineering, University of West Bohemia in Plzeň, Faculty of Applied Sciences, Univerzitní 22, 306-14, Plzeň, Czech Republic
Václav Matoušek, Pavel Mautner, Roman Mouček & Karel Taušer

About this paper

Cite this paper

Mima, H., Ananiadou, S., Nenadić, G. (2001). The ATRACT Workbench: Automatic Term Recognition and Clustering for Terms. In: Matoušek, V., Mautner, P., Mouček, R., Taušer, K. (eds) Text, Speech and Dialogue. TSD 2001. Lecture Notes in Computer Science, vol 2166. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44805-5_16

DOI: https://doi.org/10.1007/3-540-44805-5_16
Published: 24 August 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42557-1
Online ISBN: 978-3-540-44805-1
eBook Packages: Springer Book Archive
