Abstract
Tag-based browsing is a popular interaction model for navigating digital libraries. According to this model, users select descriptive tags to filter resources in the collections. Typical implementations of the model are based on inverted indexes. However, these implementations can require a considerable amount of set operations to update the browsing state. To palliate this inconvenience, it is possible to adopt suitable cache strategies. In this paper we describe and compare two of these strategies: (i) a query-based strategy, according to which previously computed browsing states are indexed by sets of selected tags; and (ii) a resource-based strategy, according to which browsing states are indexed by sets of filtered resources. Our comparison focused on runtime performance, and was carried out empirically, using a real-world web-based collection in the field of digital humanities. The results obtained show that the resource-based strategy clearly outperforms the query-based one.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
All the measures reported were taken on a machine with an Intel® Core™ i5-4660S 2.9 GHz processor, RAM 16 GB and Windows 10 OS. Browsing software was programmed in Java. The browsing cache was maintained in memory using Java’s HashMap s. Sets were managed using roaring bitmaps [4].
References
Bloehdorn, S., Görlitz, O., Schenk, S., Völkel, M.: TagFS - tag semantics for hierarchical file systems. In: Proceedings of the 6th International Conference on Knowledge Management (I-KNOW 2006) (2006)
Buendía, F., Gayoso-Cabada, J., Sierra, J.-L.: Using digital medical collections to support radiology training in e-learning platforms. In: Pammer-Schindler, V., Pérez-Sanagustín, M., Drachsler, H., Elferink, R., Scheffel, M. (eds.) EC-TEL 2018. LNCS, vol. 11082, pp. 566–569. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98572-5_46
Carpineto, C., Romano, G.: Concept Data Analysis: Theory and Applications. Wiley, Hoboken (2004)
Chambi, S., Lemire, D., Kaser, O., Godin, R.: Better bitmap performance with Roaring bitmaps. Softw.-Pract. Exp. 46(5), 709–719 (2016)
Culpepper, J.-S.; Moffat, A.: Efficient set intersection for inverted indexing. ACM Trans. Inf. Syst. 29(1), Article no. 1 (2010)
Dimitrov, D., Helic, D., Strohmaier, M.: Tag-based navigation and visualization. In: Brusilovsky, P., He, D. (eds.) Social Information Access. LNCS, vol. 10100, pp. 181–212. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-90092-6_6
Eck, O., Schaefer, D.: A semantic file system for integrated product data management. Adv. Eng. Inform. 25(2), 177–184 (2011)
Gayoso-Cabada, J., Gómez-Albarrán, M., Sierra, J.-L.: Tag-based browsing of digital collections with inverted indexes and browsing cache. In: Proceedings of the 6th Edition of the Technological Ecosystems for Enhancing Multiculturality Conference (TEEM 2018) (2018)
Gayoso-Cabada, J., Rodríguez-Cerezo, D., Sierra, J.-L.: Multilevel browsing of folksonomy-based digital collections. In: Cellary, W., Mokbel, Mohamed F., Wang, J., Wang, H., Zhou, R., Zhang, Y. (eds.) WISE 2016. LNCS, vol. 10042, pp. 43–51. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-48743-4_4
Gayoso-Cabada, J., Rodríguez-Cerezo, D., Sierra, J.-L.: Browsing digital collections with reconfigurable faceted thesauri. In: 25th International Conference on Information Systems Development (ISD), Katowize, Poland (2016)
Gayoso-Cabada, J., Rodríguez-Cerezo, D., Sierra, J.-L.: Browsing digital collections with reconfigurable faceted thesauri. In: Gołuchowski, J., Pańkowska, M., Linger, H., Barry, C., Lang, M., Schneider, C. (eds.) Complexity in Information Systems Development. LNISO, vol. 22, pp. 69–86. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-52593-8_5
Gifford, D.K., Jouvelot, P., Sheldon, M.A., O’Toole, J.W.: Semantic file systems. SIGOPS Oper. Syst. Rev. 25(5), 16–25 (1991)
Greene, G.-J., Dunaiski, M., Fischer, B.: Browsing publication data using tag clouds over concept lattices constructed by key-phrase extraction. In: Proceedings of Russian and South African Workshop on Knowledge Discovery Techniques Based on Formal Concept Analysis (RuZA 2015) (2015)
Helic, D., Trattner, C., Strohmaier, M., Andrews, K.: On the navigability of social tagging systems. In: 2010 IEEE Second International Conference on Social Computing (SocialCom 2010), pp. 161–168 (2010)
Hernandez, M.-E., Falconer, S.-M., Storey, M.-A., Carini, S., Sim, I.: Synchronized tag clouds for exploring semi-structured clinical trial data. In: Proceedings of the 2008 Conference of the Center for Advanced Studies on Collaborative Research: Meeting of Minds (CASCON 2008) (2008)
Kammerer, Y., Nairn, R., Pirolli, P., Chi, E.H.: Signpost from the masses: learning effects in an exploratory social tag search browser. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI 2009) (2009)
Kleinberg, J.: Navigation in a small world. Nature 406(6798), 845 (2000)
Koutrika, G., Zadeh, Z.-M., Garcia-Molina, H.: CourseCloud: summarizing and refining keyword searches over structured data. In: Proceedings of the 12th International Conference on Extending Database Technology (EDBT), pp. 1132–1135 (2009)
Leone, S., Geel, M., Müller, C., Norrie, M.C.: Exploiting tag clouds for database browsing and querying. In: Soffer, P., Proper, E. (eds.) CAiSE Forum 2010. LNBIP, vol. 72, pp. 15–28. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-17722-4_2
Lin, Y.-L., Brusilovsky, P., He, D.: Finding cultural heritage images through a Dual-Perspective Navigation Framework. Inf. Proc. Manag. 52(5), 820–839 (2016)
Redden, C.S.: Social bookmarking in academic libraries: trends and applications. J. Acad. Librariansh. 36(3), 219–227 (2010)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1986)
Schwartz, B., Tkachenko, V., Zaitsev, P.: High Performance MySQL, 3rd edn. O’Reilly Media, Sebastopol (2012)
Seltzer, M., Murphy, N.: Hierarchical file systems are dead. In: Proceedings of the 12th Conference on Hot Topics in Operating Systems (HotOS 2009) (2009)
Sierra, J.-L., Fernández-Valmayor, A., Guinea, M., Hernanz, H.: From research resources to learning objects: process model and virtualization experiences. Educ. Technol. Soc. 9(3), 56–68 (2006)
Sim, H., Kim, Y., Vazhkudai, S.S., Vallée, G.R., Lim, S.-H., Butt, A.R.: Tagit: an integrated indexing and search service for file systems. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2017) (2017)
Trattner, C., Lin, Y., Parra, D., Yue, Z., Real, W., Brusilovsky, P.: Evaluating tag-based information access in image collections. In: Proceedings of the 23rd ACM Conference on Hypertext and Social Media (HT 2012), pp. 113–122 (2012)
Watson, R., Dekeyser, S., Albadri, N.: Exploring the design space of metadata-focused file management systems. In: Proceedings of the Australasian Computer Science Week Multiconference (ACSW 2017) (2017)
Way, T., Eklund, P.: Social tagging for digital libraries using formal concept analysis. In: Proceedings of the 17th International Conference on Concept Lattices and Their Applications (CLA 2010) (2010)
Zobel, J., Moffat, A.: Inverted files for text search engines. ACM Comput. Surv. 33(2), Article 6 (2006)
Acknowledgements
This research is supported by the research projects TIN2014-52010-R and TIN2017-88092-R. Also, we would like to thank Mercedes Guinea and Alfredo Fernández-Valmayor (El Caño Foundation, Panamá), for their work on Chasqui.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Gayoso-Cabada, J., Gómez-Albarrán, M., Sierra, JL. (2018). Query-Based Versus Resource-Based Cache Strategies in Tag-Based Browsing Systems. In: Dobreva, M., Hinze, A., Žumer, M. (eds) Maturity and Innovation in Digital Libraries. ICADL 2018. Lecture Notes in Computer Science(), vol 11279. Springer, Cham. https://doi.org/10.1007/978-3-030-04257-8_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-04257-8_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04256-1
Online ISBN: 978-3-030-04257-8
eBook Packages: Computer ScienceComputer Science (R0)