Abstract
This paper presents LEXIK, an intelligent terminological architecture that efficiently obtains specialized lexical resources for compiling dictionaries and providing lexical support for different expert tasks. LEXIK is designed as a tool for creating a rich knowledge base for lexicography. It will process large amounts of data in a modular system that combines several applications and techniques, partially developed so far, for terminology extraction, definition generation, example extraction, and term banks. Such integration is a challenge for the field, which lacks an integrated system for extracting and defining terms from a non-preprocessed corpus.
Acknowledgments
LEXIK integrates different tools created at the Language Engineering Group, UNAM, by several undergraduate and postgraduate students; we thank all of them. We also thank DGAPA, UNAM, for funding through the projects PAPIIT IN403016 and PAPIIT AN400117, and CONACYT for funding through the project Ciencia Básica 178248.
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Sierra, G., Lázaro, J., Bel-Enguix, G. (2017). LEXIK. An Integrated System for Specialized Terminology. In: Sidorov, G., Herrera-Alcántara, O. (eds) Advances in Computational Intelligence. MICAI 2016. Lecture Notes in Computer Science, vol 10061. Springer, Cham. https://doi.org/10.1007/978-3-319-62434-1_7
DOI: https://doi.org/10.1007/978-3-319-62434-1_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62433-4
Online ISBN: 978-3-319-62434-1
eBook Packages: Computer Science (R0)