Abstract:
One of the major problems in the implementation of natural language processing (NLP) or machine translation (MT) is a complete lexicon: the place where the system's infor...Show MoreMetadata
Abstract:
One of the major problems in the implementation of natural language processing (NLP) or machine translation (MT) is a complete lexicon: the place where the system's information about words is stored. There are difficulties in deciding what information should be stored in a lexicon and even greater difficulties in acquiring this information in proper form. The OriNet system was designed to incorporate a multiple lexical database and tools under one consistent functional interface in order to facilitate systems requiring syntactic, semantic and lexical information of the Oriya language. We divide the work into two independent tasks. One is to write the source file that contains basic lexical data, the content of these files being the lexical substance of OriNet. The second is to create a set of programs that would accept the source files and process them ultimately to display for the user. This paper describes ongoing work on designing an object oriented model for the OriNet system. It uses object oriented programming, particularly the rich library of classes and programming principles which Java offers. It also provides a convenient tool to conceptualise the process of the OriNet system. This technique also allows flexibility and extensibility of the system with more robustness.
Published in: Language Engineering Conference, 2002. Proceedings
Date of Conference: 13-15 December 2002
Date Added to IEEE Xplore: 28 February 2003
Print ISBN:0-7695-1885-0