Skip to main content

Abstract

In the paper we devise a novel algorithm related to the area of natural language processing. The algorithm is capable of building a mapping between the sets of semantic features and the words available in semantic dictionaries called wordnets. In our research we consider wordnets as ontologies, paying particular attention to hypernymy relation. The correctness of the proposal is verified experimentally based on a selected set of semantic features. plWordNet semantic dictionary is considered as a reference source, providing required information for the mapping. The algorithm is evaluated on an instance of a decision problem related to data classification. The quality measures of the classification include: false positive rate, false negative rate and accuracy. A measure of a strength of membership (SOM) in a semantic feature class is proposed and its impact on the aforementioned quality measures is evaluated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    We use here the term exact synonyms to denote words with identical meanings. On the other hand by non-exact synonyms we mean words with similar yet not identical meanings.

  2. 2.

    The executable file, short manual and input sets are available at https://github.com/tjastrzab/semantics.

References

  1. Ágel, V., Fischer, K.: Dependency grammar and valency theory. In: Heine, B., H.N.(eds.) The Oxford Handbook of Linguistic Analysis, pp. 223–255. Oxford University Press, UK, Oxford (2010)

    Google Scholar 

  2. Akiba, Y., Nakaiwa, H., Shirai, S., Ooyama, Y.: Interactive generalization of a translation example using queries based on a semantic hierarchy. In: Proceedings of 12th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2000), pp. 326–332. Vancouver, BC, USA (2000)

    Google Scholar 

  3. Arfaoui, N., Akaichi, J.: Automating schema integration technique case study: generating data warehouse schema from data mart schemas. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds.) BDAS 2015. CCIS, vol. 521, pp. 200–209. Springer, Heidelberg (2015)

    Google Scholar 

  4. Astrakhantsev, N.A., Turdakov, D.Y.: Automatic construction and enrichment of informal ontologies: a survey. Program. Comput. Softw. 39(1), 34–42 (2013)

    Article  Google Scholar 

  5. Bach, M., Kozielski, S., Świderski, M.: Zastosowanie ontologii do opisu semantyki relacyjnej bazy danych na potrzeby analizy zapytań w języku naturalnym. Stud. Informatica 30(2A(83)), 187–199 (2009). presented at BDAS’09

    Google Scholar 

  6. Biemann, C.: Ontology learning from text: a survey of methods. LDV Forum 20(2), 75–93 (2005)

    Google Scholar 

  7. Fujita, S., Bond, F.: Extending the coverage of a valency dictionary. In: Proceedings of the 2002 COLING workshop on Machine translation in Asia, vol. 16, pp. 1–7. Association for Computational Linguistics, Stroudsburg, PA, USA (2002)

    Google Scholar 

  8. Grund, D.: Komputerowa implementacja słownika syntaktyczno-generatywnego czasowników polskich. Stud. Informatica 21(3(41)), 243–256 (2000)

    Google Scholar 

  9. Hajnicz, E.: Automatyczne tworzenie semantycznych słowników walencyjnych. Akademicka Oficyna Wydawnicza EXIT, Warszawa (2011)

    Google Scholar 

  10. Hossain, J., Sani, F., Affendey, L.S., Ishak, I., Kasmiran, K.A.: Semantic schema matching approaches: A review. J. Theor. Appl. Inf. Technol. 62(1), 139–147 (2014)

    Google Scholar 

  11. Jagielski, J.: Język naturalny w systemach baz danych. Stud. Informatica 31(2B(90)), 281–290 (2010). presented at BDAS’10

    Google Scholar 

  12. Kawahara, D., Kurohashi, S.: Japanese case frame construction by coupling the verb and its closest case component. In: Proceedings of First International Conference on Human Language Technology Research (HLT 2001), pp. 204–210. Association for Computational Linguistics, Stroudsburg, PA, USA (2001)

    Google Scholar 

  13. Kulików, K.: Implementacja serwera analizy lingwistycznej dla systemu THETOS - translatora tekstu na język migowy. Stud. Informatica 24(3(55)), 171–178 (2003)

    Google Scholar 

  14. Mahdi, A.M., Tiun, S.: Utilizing wordnet for instance-based schema matching. In: Proceedings of the International Conference on Advances in Computer Science and Electronics Engineering (CSEE 2014), pp. 59–63. Institute of Research Engineers and Doctors (2014)

    Google Scholar 

  15. Manning, C.D.: Automatic acquisition of a large subcategorization dictionary from corpora. In: Proceedings of 31st Annual Meeting of the Association for Computational Linguistics (ACL-1993), pp. 235–242. Ohio State University, Columbus, Ohio, USA (1993)

    Google Scholar 

  16. Miller, G.A.: Nouns in wordnet: a lexical inheritance system. Int. J. Lexicogr. 3(4), 245–264 (1990)

    Article  Google Scholar 

  17. Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: Introduction to wordnet: an on-line lexical database. Int. J. Lexicogr. 3(4), 235–244 (1990)

    Article  Google Scholar 

  18. Mykowiecka, A.: Inżynieria lingwistyczna: komputerowe przetwarzanie tekstów w jȩzyku naturalnym. Wydawnictwo PJWSTK, Warszawa (2007)

    Google Scholar 

  19. Olson, D., Delen, D.: Performance evaluation for predictive modeling. In: Olson, D., Delen, D. (eds.) Advanced Data Mining Techniques, pp. 137–147. Springer, Berlin Heidelberg (2008)

    Chapter  Google Scholar 

  20. Piasecki, M., Szpakowicz, S., Broda, B.: Toward plWordNet 2.0. In: Bhattacharyya, P., Fellbaum, C., Vossen, P. (eds.) Proceedings of the 5th Global Wordnet Conference Principles, Construction and Application of Multilingual Wordnetsm, pp. 263–270. Narosa Publishing House (2010)

    Google Scholar 

  21. Polański, K.: Słownik syntaktyczno-generatywny czasowników polskich. Zakład Narodowy im. Ossolińskich, Wrocław (1980)

    Google Scholar 

  22. Przepiórkowski, A., Hajnicz, E., Patejuk, A., Woliński, M., Skwarski, F., Świdziński, M.: Walenty: Towards a comprehensive valence dictionary of Polish. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC), pp. 2785–2792. Reykjavik, Iceland (2014)

    Google Scholar 

  23. Suszczańska, N., Szmal, P., Simiński, K.: The deep parser for polish. In: Vetulani, Z., Uszkoreit, H. (eds.) LTC 2007. LNCS, vol. 5603, pp. 205–217. Springer, Heidelberg (2009)

    Google Scholar 

  24. Tesnière, L.: Elements of Structural Syntax. John Benjamins Publishing Company, New York (2015)

    Book  Google Scholar 

  25. Vetulani, Z.: Komunikacja człowieka z maszyną. Akademicka Oficyna Wydawnicza EXIT, Warszawa (2014)

    Google Scholar 

Download references

Acknowledgments

The research was supported by Institute of Informatics research grant no. BKM-515/RAU2/2015.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tomasz Jastrząb .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Jastrząb, T., Kwiatkowski, G., Sadowski, P. (2016). Mapping of Selected Synsets to Semantic Features. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds) Beyond Databases, Architectures and Structures. Advanced Technologies for Data Mining and Knowledge Discovery. BDAS BDAS 2015 2016. Communications in Computer and Information Science, vol 613. Springer, Cham. https://doi.org/10.1007/978-3-319-34099-9_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-34099-9_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-34098-2

  • Online ISBN: 978-3-319-34099-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics