Towards the Web in Your Pocket: Curated Data as a Service

  • Conference paper
Advanced Methods for Computational Collective Intelligence

Part of the book series: Studies in Computational Intelligence (SCI, volume 457)

Abstract

The Web has grown tremendously over the past two decades, as have the information needs of its users. The traditional “interface” between the vast data resources of the Web and its users is the search engine. However, search engines are increasingly challenged in providing the information needed for a particular context or application in a comprehensive, concise, and timely manner. To overcome this, we present a framework that does not just answer queries based on a pre-assembled index, but based on a subject-specific database that is curated by domain experts and dynamically generated based on vast user input.

Author information

Corresponding author

Correspondence to Stuart Dillon.

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dillon, S., Stahl, F., Vossen, G. (2013). Towards the Web in Your Pocket: Curated Data as a Service. In: Nguyen, N., Trawiński, B., Katarzyniak, R., Jo, GS. (eds) Advanced Methods for Computational Collective Intelligence. Studies in Computational Intelligence, vol 457. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34300-1_3

  • DOI: https://doi.org/10.1007/978-3-642-34300-1_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34299-8

  • Online ISBN: 978-3-642-34300-1

  • eBook Packages: Engineering, Engineering (R0)
