Abstract
In this paper we propose a novel natural language processing algorithm that builds a mapping between sets of semantic features and the words available in semantic dictionaries called wordnets. We treat wordnets as ontologies, paying particular attention to the hypernymy relation. The correctness of the proposal is verified experimentally on a selected set of semantic features. The plWordNet semantic dictionary serves as the reference source, providing the information required for the mapping. The algorithm is evaluated on an instance of a decision problem related to data classification. The quality measures of the classification are the false positive rate, the false negative rate and accuracy. We also propose a measure of the strength of membership (SOM) in a semantic feature class and evaluate its impact on these quality measures.
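To make the evaluation setting concrete, the following minimal sketch shows how a hypernymy-based membership test and the three quality measures named above could be computed. It is an illustration under stated assumptions, not the algorithm from the paper: the toy hypernymy graph, the helper names `has_feature` and `quality_measures`, and the gold labels are all hypothetical, and the SOM measure is not modelled because the abstract does not define it.

```python
# Toy hypernymy graph: each word points to its direct hypernym (None at a root).
# All entries are invented for illustration; they are NOT taken from plWordNet.
HYPERNYM = {
    "dog": "animal",
    "cat": "animal",
    "animal": "organism",
    "oak": "plant",
    "plant": "organism",
    "organism": None,
    "hammer": "tool",
    "tool": None,
}


def has_feature(word, feature_root, hypernym=HYPERNYM):
    """Return True if feature_root is the word itself or one of its hypernyms.

    Hypothetical helper: it assumes a semantic feature is represented by a
    single root node and that each word has at most one direct hypernym.
    """
    seen = set()
    current = word
    while current is not None and current not in seen:
        if current == feature_root:
            return True
        seen.add(current)  # guard against accidental cycles in the graph
        current = hypernym.get(current)
    return False


def quality_measures(predicted, actual):
    """Compute false positive rate, false negative rate and accuracy."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    tn = sum(1 for p, a in pairs if not p and not a)
    fp = sum(1 for p, a in pairs if p and not a)
    fn = sum(1 for p, a in pairs if not p and a)
    return {
        "fpr": fp / (fp + tn) if (fp + tn) else 0.0,
        "fnr": fn / (fn + tp) if (fn + tp) else 0.0,
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
    }


if __name__ == "__main__":
    # Hypothetical gold labels for an "animal" feature class.
    gold = {"dog": True, "cat": True, "oak": False, "hammer": False, "bee": True}
    words = list(gold)
    predicted = [has_feature(w, "animal") for w in words]
    actual = [gold[w] for w in words]
    print(quality_measures(predicted, actual))
```

In this toy run "bee" is missing from the graph, so it is predicted as a non-member and counted as a false negative; this is the kind of error the false negative rate captures.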
Notes
- 1. We use the term exact synonyms to denote words with identical meanings, and non-exact synonyms to denote words with similar but not identical meanings.
- 2. The executable file, a short manual and the input sets are available at https://github.com/tjastrzab/semantics.
Acknowledgments
The research was supported by the Institute of Informatics research grant no. BKM-515/RAU2/2015.
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Jastrząb, T., Kwiatkowski, G., Sadowski, P. (2016). Mapping of Selected Synsets to Semantic Features. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds) Beyond Databases, Architectures and Structures. Advanced Technologies for Data Mining and Knowledge Discovery. BDAS 2015, BDAS 2016. Communications in Computer and Information Science, vol 613. Springer, Cham. https://doi.org/10.1007/978-3-319-34099-9_28
DOI: https://doi.org/10.1007/978-3-319-34099-9_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-34098-2
Online ISBN: 978-3-319-34099-9
eBook Packages: Computer Science, Computer Science (R0)