Abstract
In this paper, we introduce a new and improved version of DeriSearch, a search engine and visualizer for word-formation networks.
Word-formation networks are datasets that express derivational, compounding and other word-formation relations between words. They are usually expressed as directed graphs, in which nodes correspond to words and edges to the relations between them. Some networks also add other linguistic information, such as morphological segmentation of the words or identification of the processes expressed by the relations.
Networks for morphologically rich languages with productive derivation or compounding have large connected components, which are difficult to visualize. For example, in the network for Czech, DeriNet 2.0, connected components over 500 words large contain of the vocabulary, including its most common parts. In the network for Latin, Word Formation Latin, over 10 000 words ( of the vocabulary) are in a single connected component.
With the recent release of the Universal Derivations collection of word-formation networks for several languages, there is a need for a searching and visualization tool that would allow browsing such complex data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Christ, O., Schulze, B.M., Hofmann, A., König, E.: The IMS Corpus Workbench: Corpus Query Processor (CQP) User’s Manual. University of Stuttgart, Germany (1999)
Culy, C., Litta, E., Passarotti, M.: Visual exploration of Latin derivational morphology. In: Proceedings of FLAIRS 2017, pp. 601–606 (2017)
Horák, A., Pala, K., Rambousek, A., Povolný, M.: DEBVisDic - first version of new client-server Wordnet browsing and editing tool. In: Proceedings of the Third International WordNet Conference (GWC 2006), pp. 325–328 (2005)
Kyjánek, L.: Morphological resources of derivational word-formation relations. Technical report ÚFAL TR-2018-61, ÚFAL MFF UK, Prague, Czechia (2018)
Kyjánek, L., Žabokrtský, Z., Ševčíková, M., Vidra, J.: Universal derivations kickoff: a collection of harmonized derivational resources for eleven languages. In: Proceedings of DeriMo 2019, Prague, Czechia, pp. 101–110 (2019)
Litta, E., Passarotti, M., Culy, C.: Formatio formosa est. Building a word formation lexicon for Latin. In: Proceedings of CLiC-IT 2016, pp. 185–189 (2016)
Pala, K., Šmerk, P.: Derivancze — derivational analyzer of Czech. In: Král, P., Matoušek, V. (eds.) TSD 2015. LNCS (LNAI), vol. 9302, pp. 515–523. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24033-6_58
Panocová, R.: Internationalisms with the suffix -ácia and their adaptation in Slovak. In: Proceedings of DeriMo 2017, Milano, Italy, pp. 129–139 (2017)
Rambousek, A., Horák, A., Klement, D., Kletečka, J.: New features in DEBVisDic for WordNet visualization and user feedback. In: Proceedings of RASLAN 2017 (2017)
Šojat, K., Srebačić, M., Tadić, M., Pavelić, T.: CroDeriV: a new resource for processing Croatian morphology. In: Proceedings of LREC 2014 (2014)
Talamo, L., Celata, C., Bertinetto, P.M.: DerIvaTario: an annotated lexicon of Italian derivatives. Word Struct. 9(1), 72–102 (2016)
Vidra, J.: Implementation of a search engine for DeriNet. In: Proceedings of ITAT 2015, Prague, Czechia, pp. 100–106 (2015)
Vidra, J., Žabokrtský, Z.: Online software components for accessing derivational networks. In: Proceedings of DeriMo 2017, Milano, Italy, pp. 129–139 (2017)
Vidra, J., Žabokrtský, Z., Ševčíková, M., Kyjánek, L.: Derinet 2.0: towards an all-in-one word-formation resource. In: Proceedings of DeriMo 2019, Prague, Czechia (2019)
Acknowledgments
This work was supported by the Grant No. GA19-14534S of the Czech Science Foundation, by the Charles University Grant Agency project No. 1176219 and by the SVV project No. 260 575. It uses language resources developed, stored, and distributed by the LINDAT/CLARIAH CZ project (LM2015071, LM2018101).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Vidra, J., Žabokrtský, Z. (2020). Next Step in Online Querying and Visualization of Word-Formation Networks. In: Sojka, P., Kopeček, I., Pala, K., Horák, A. (eds) Text, Speech, and Dialogue. TSD 2020. Lecture Notes in Computer Science(), vol 12284. Springer, Cham. https://doi.org/10.1007/978-3-030-58323-1_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-58323-1_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58322-4
Online ISBN: 978-3-030-58323-1
eBook Packages: Computer ScienceComputer Science (R0)