Skip to main content

The ATRACT Workbench: Automatic Term Recognition and Clustering for Terms

  • Conference paper
  • First Online:
Text, Speech and Dialogue (TSD 2001)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2166))

Included in the following conference series:

Abstract

In this paper, we introduce a web-based integrated text and knowledge mining aid system in which information extraction and intelligent information retrieval/database access are combined using term-oriented natural language tools. Our work is placed within the BioPath research project whose overall aim is to link information extraction to expressed sequence data validation. The aim of the tool is to extract automatically terms, to cluster them, and to provide efficient access to heterogeneous biological and genomic databases and collections of texts, all wrapped into a user friendly workbench enabling users to use a wide range of textual and non textual resources effortlessly. For the evaluation, automatic term recognition and clustering techniques were applied in a domain of molecular biology. Besides English, the same workbench has been used for term recognition and clustering in Japanese.

This research is supported by LION BioScience, http://www.lionbioscience.com

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ananiadou, S., Albert, S., Schuhmann, D.: “Evaluation of Automatic Term Recognition of Nuclear Receptors from Medline”, Genome Informatics Series, vol.11, 2000

    Google Scholar 

  2. Frantzi, K. T., Ananiadou, S., Mima, H.: “Automatic Recognition of Multi-Word Terms: the C-value/NC-value method”, International Journal on Digital Libraries Vol. 3, No. 2, pp.115–130, 2000

    Article  Google Scholar 

  3. Kehagia, K., Ananiadou S.: “Term Variation as an Integrated Part of Automatic Term Extraction”, in Proc. of 22nd Conference for Greek Language, Thessaloniki, Greece, 2001 (forthcoming)

    Google Scholar 

  4. Koyama, T., Yoshiokka, M., Kageura, K.: “The Construction of a Lexically Motivated Corpus—the Problem with Defining Lexical Units”, in Proc. of LREC 1998, pp.1015–1019, Granada, Spain, 1998

    Google Scholar 

  5. Maynard, D., Ananiadou, S.: “Identifying Terms by Their Family and Friends”, in Proc. of 18th International Conference on Computational Linguistics, COLING 2000, pp.530–536, Luxembourg, 2000

    Google Scholar 

  6. Maynard, D., Ananiadou, S.: “TRUCKS: a Model for Automatic Term Recognition”, in Journal of Natural Language Processing, Vol. 8, No. 1, pp.101–125, 2001

    Google Scholar 

  7. MEDLINE, National Library of Medicine, http://www.ncbi.nlm.nih.gov/PubMed/

  8. Mima, H., Ananiadou, S., Tsujii, J.: “A Web-based Integrated Knowledge Mining Aid System Using Term-oriented Natural Language Processing”, in Proc. of The 5th Natural Language Processing Pacific Rim Symposium, NLPRS’99, 13–18, 1999

    Google Scholar 

  9. Mima, H., Ananiadou, S.: An Application and Evaluation of the C/NC-value Approach for the Automatic Term Recognition of Multi-Word in Japanese’, in Terminology 6:2, 2001 (forthcoming)

    Google Scholar 

  10. Nenadic, G.: “Local Grammars and Parsing Coordinaton of Nouns in Serbo-Croatian”, in Text, Speech and Dialogue-TSD 2000, Lecture Notes in Artificial Intelligence 1902, Springer Verlag, 2000

    Google Scholar 

  11. Oi K., Sumita E., Iida H.: “Document Retrieval Method Using Semantic Similarity and Word Sense Disambiguation in Japanese”, Journal of Natural Language Processing, Vol. 4, No. 3, pp.51–70, 1997

    Google Scholar 

  12. Ushioda A.: “Hierarchical Clustering of Words”, In Proc. of COLING’ 96, Copenhagen, Denmark, 1996

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mima, H., Ananiadou, S., Nenadić, G. (2001). The ATRACT Workbench: Automatic Term Recognition and Clustering for Terms. In: Matoušek, V., Mautner, P., Mouček, R., Taušer, K. (eds) Text, Speech and Dialogue. TSD 2001. Lecture Notes in Computer Science(), vol 2166. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44805-5_16

Download citation

  • DOI: https://doi.org/10.1007/3-540-44805-5_16

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42557-1

  • Online ISBN: 978-3-540-44805-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics