Abstract
Carrying out research tasks is only inadequately supported, if not hindered, by current web search engines. This paper therefore proposes functional extensions of WebMap, a semantically induced overlay linking structure on the web to inherently facilitate research activities. These add-ons support the dynamic determination and regrouping of document clusters, the creation of a semantic signpost in the web, and the interactive tracing of topics back to their origins.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Roßrucker, G.: A Concept for a distributed Webmap. In: Supporting Web Search and Navigation by an Overlay Linking Structure. Studies in Big Data, vol. 142. Springer, Cham (2024). https://doi.org/10.1007/978-3-031-48393-6_3
Kubek, M., Unger, H.: Centroid terms as text representatives. In: Proceedings of the 2016 ACM Symposium on Document Engineering, pp. 99–102, ACM, New York, NY, USA (2016)
Jin, W., Srihari, R.K.: Graph-based text representation and knowledge discovery. In: Proceedings of the 2007 ACM Symposium on Applied Computing, ACM, New York, NY, USA (2007)
Biemann, C., Heyer, G., uasthoff, U.: Wissensrohstoff Text: Eine Einführung in das Text Mining. 2nd Edition, Springer Fachmedien Wiesbaden (2022)
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding (2019)
Beltagy, I., Lo, K., Cohan, A.: SciBERT: a pretrained language model for scientific text (2019)
Araci, D.: Finbert: Financial sentiment analysis with pre-trained language models (2019)
Kubek, M.: Concepts and Methods for a Librarian of the Web. In: Studies in Big Data, Vol. 62, Springer, Cham (2020)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46, 604–632 (1999) ACM, New York, NY, USA
Bock, H.H.: Automatische Klassifkation. Vandenhoeck & Ruprecht, Göttingen (1974)
Schnell, P.: Eine Methode zur Auffindung von Gruppen. Biometrische Zeitschrift, 6, 47–48 (1964)
Komkhao, M., Kubek, M., Halang, W.A.: Sequential clustering and condensing the meaning of texts into centroid terms. Inf. Technol. J. 14, 1–10 (2018)
Vaswani, A., et al.: Attention Is All You Need. arXiv:1706.03762v7 [cs.CL] (2017)
Minaee, S., Mikolov, T., et al.: Large Language Models: a survey. arXiv:2402.06196v2 [cs.CL] (2024)
Brown, T., et al.: Language models are few-shot learners. Adv. Neural. Inf. Process. Syst. 33, 1877–1901 (2020)
OpenAI: GPT-4 Technical Report (2023). https://arxiv.org/pdf/2303.08774v3.pdf
Touvron, H., et al.: LLaMA: open and efficient foundation language models. arXiv:2302.13971 [cs.CL] (2023)
Touvron, H., Martin, L., et al.: Llama2: open foundation and fine-tuned chat models. arXiv:2307.09288 [cs.CL] (2023)
Jiang, A.Q., Sablayrolles, A., et al.: Mistral 7B. arXiv:2310.06825v1 [cs.CL] (2023)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Pokharel, S., Roßrucker, G.P., Kubek, M.M. (2024). WebMap - Large Language Model-assisted Semantic Link Induction in the Web. In: Phillipson, F., Eichler, G., Erfurth, C., Fahrnberger, G. (eds) Innovations for Community Services. I4CS 2024. Communications in Computer and Information Science, vol 2109. Springer, Cham. https://doi.org/10.1007/978-3-031-60433-1_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-60433-1_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-60432-4
Online ISBN: 978-3-031-60433-1
eBook Packages: Computer ScienceComputer Science (R0)