Next Step in Online Querying and Visualization of Word-Formation Networks

Text, Speech, and Dialogue (TSD 2020)

Abstract

In this paper, we introduce a new and improved version of DeriSearch, a search engine and visualizer for word-formation networks.

Word-formation networks are datasets that express derivational, compounding and other word-formation relations between words. They are usually expressed as directed graphs, in which nodes correspond to words and edges to the relations between them. Some networks also add other linguistic information, such as morphological segmentation of the words or identification of the processes expressed by the relations.
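The graph view described above can be sketched in a few lines of Python. This is only an illustration under assumptions: the class names (`Lexeme`, `Relation`, `WordFormationNetwork`), the method names, and the toy Czech entries with their segmentations are hypothetical and do not reflect the DeriNet data format or the DeriSearch implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Lexeme:
    """A node: one word, optionally with a morphological segmentation."""
    lemma: str
    segmentation: List[str] = field(default_factory=list)


@dataclass
class Relation:
    """A directed edge from a base word to a word formed from it."""
    source: str
    target: str
    process: Optional[str] = None  # e.g. "derivation" or "compounding"


class WordFormationNetwork:
    """Hypothetical in-memory network: words as nodes, relations as edges."""

    def __init__(self) -> None:
        self.lexemes: Dict[str, Lexeme] = {}
        self.relations: List[Relation] = []

    def add_lexeme(self, lemma: str, segmentation: Optional[List[str]] = None) -> None:
        self.lexemes[lemma] = Lexeme(lemma, list(segmentation or []))

    def add_relation(self, source: str, target: str, process: Optional[str] = None) -> None:
        # Both endpoints must already be nodes of the network.
        for lemma in (source, target):
            if lemma not in self.lexemes:
                raise KeyError(f"unknown lexeme: {lemma}")
        self.relations.append(Relation(source, target, process))

    def derived_from(self, lemma: str) -> List[str]:
        """Words formed directly from the given word (outgoing edges)."""
        return [r.target for r in self.relations if r.source == lemma]

    def bases_of(self, lemma: str) -> List[str]:
        """Words the given word is directly formed from (incoming edges)."""
        return [r.source for r in self.relations if r.target == lemma]


# Toy data for illustration only; the segmentations are assumptions.
network = WordFormationNetwork()
network.add_lexeme("učit")
network.add_lexeme("učitel", segmentation=["uči", "tel"])
network.add_lexeme("učitelka", segmentation=["uči", "tel", "ka"])
network.add_relation("učit", "učitel", process="derivation")
network.add_relation("učitel", "učitelka", process="derivation")
```

Storing the relation as its own record, rather than as a bare pair of words, leaves room for the extra linguistic information mentioned above, such as the word-formation process.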

Networks for morphologically rich languages with productive derivation or compounding have large connected components, which are difficult to visualize. For example, in the network for Czech, DeriNet 2.0, connected components over 500 words large contain of the vocabulary, including its most common parts. In the network for Latin, Word Formation Latin, over 10 000 words ( of the vocabulary) are in a single connected component.
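To make the notion of connected-component size concrete, the following sketch computes the components of a small network and the share of the vocabulary that falls into components above a size threshold. It is a minimal example under assumptions: the function names and the toy word list are hypothetical, edge direction is ignored when grouping words, and the code is not the procedure used to obtain the figures for DeriNet 2.0 or Word Formation Latin.

```python
from collections import defaultdict, deque
from typing import Dict, Iterable, List, Set, Tuple


def connected_components(words: Iterable[str],
                         edges: Iterable[Tuple[str, str]]) -> List[List[str]]:
    """Group words into connected components, ignoring edge direction."""
    neighbours: Dict[str, Set[str]] = defaultdict(set)
    for source, target in edges:
        neighbours[source].add(target)
        neighbours[target].add(source)

    seen: Set[str] = set()
    components: List[List[str]] = []
    for word in words:
        if word in seen:
            continue
        # Breadth-first search collects every word reachable from `word`.
        seen.add(word)
        queue = deque([word])
        component: List[str] = []
        while queue:
            current = queue.popleft()
            component.append(current)
            for neighbour in neighbours[current]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    queue.append(neighbour)
        components.append(component)
    return components


def share_in_large_components(components: List[List[str]], threshold: int) -> float:
    """Fraction of all words that lie in components larger than `threshold`."""
    total = sum(len(component) for component in components)
    large = sum(len(component) for component in components
                if len(component) > threshold)
    return large / total if total else 0.0


# Toy vocabulary for illustration only.
words = ["učit", "učitel", "učitelka", "pes", "psí", "dům"]
edges = [("učit", "učitel"), ("učitel", "učitelka"), ("pes", "psí")]

components = connected_components(words, edges)
sizes = sorted(len(component) for component in components)
share = share_in_large_components(components, threshold=2)
```

On a real network the same computation would be run with a threshold of 500 words; a breadth-first search over an adjacency map keeps the cost linear in the number of words and relations.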

With the recent release of the Universal Derivations collection of word-formation networks for several languages, there is a need for a search and visualization tool that allows browsing such complex data.

Acknowledgments

This work was supported by Grant No. GA19-14534S of the Czech Science Foundation, by Charles University Grant Agency project No. 1176219, and by SVV project No. 260 575. It uses language resources developed, stored, and distributed by the LINDAT/CLARIAH CZ project (LM2015071, LM2018101).

Corresponding author

Correspondence to Jonáš Vidra.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Vidra, J., Žabokrtský, Z. (2020). Next Step in Online Querying and Visualization of Word-Formation Networks. In: Sojka, P., Kopeček, I., Pala, K., Horák, A. (eds) Text, Speech, and Dialogue. TSD 2020. Lecture Notes in Computer Science, vol 12284. Springer, Cham. https://doi.org/10.1007/978-3-030-58323-1_15

  • DOI: https://doi.org/10.1007/978-3-030-58323-1_15

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-58322-4

  • Online ISBN: 978-3-030-58323-1

  • eBook Packages: Computer Science (R0)