Skip to main content

A Short Account of Techniques for Assisting Users in Mastering Big Data

  • Chapter
  • First Online:
A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years

Part of the book series: Studies in Big Data ((SBD,volume 31))

  • 1959 Accesses

Abstract

One of the most challenging problems faced by the database community is to assist inexperienced or casual users, who need the support of a sophisticated system that guides them in making sense of the data. This problem becomes especially relevant in the case of Big Data, where the amount of data may quickly overwhelm users and discourage them from leveraging the richness of the data patrimony. In the last years, often in collaboration with other members of the Italian database community, we have developed several different techniques whose aim is both to reduce the size of the problem and to focus on the information that is most relevant to the user. To this end, most of these techniques fruitfully extract and exploit data semantics, for example by succinctly characterizing data via intensional properties such as integrity constraints or by tailoring the answer to the user context or preferences. Other techniques support the users in information exploration, for instance by extracting data not readily accessible (such as the Hidden Web) or by presenting them with appropriate summaries and suggesting possible exploration paths.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    This particular research is carried out in collaboration with Università della Basilicata.

  2. 2.

    Information overload is the difficulty of people in understanding an issue in the presence of too much information. The term is used by Alvin Toffler in his books Future Shock and The Third Wave.

  3. 3.

    Context evolution [47] is the research topic that takes this into account; however, if this task is performed by the designer, it makes his or her burden even heavier.

  4. 4.

    The PerLa web site - http://perlawsn.sourceforge.net/index.php.

References

  1. S. Amer-Yahia, S. Basu Roy, A. Chawla, G. Das, C. Yu, Group recommendation: Semantics and efficiency. PVLDB 2(1), 754–765 (2009)

    Google Scholar 

  2. M. Baldauf, S. Dustdar, F. Rosenberg, A survey on context-aware systems. Int. J. Ad Hoc Ubiquitous Comput. 2(4), 263–277 (2007)

    Article  Google Scholar 

  3. E. Baralis, P. Garza, E. Quintarelli, L. Tanca, Answering XML queries by means of data summaries. ACM Trans. Inf. Syst. 25(3), 10 (2007)

    Article  Google Scholar 

  4. N.D. Blas, M. Mazuran, P. Paolini, E. Quintarelli, L. Tanca, Exploratory computing: a challenge for visual interaction, in AVI (2014), pp. 361–362

    Google Scholar 

  5. C. Bolchini, C. Curino, E. Quintarelli, F.A. Schreiber, L. Tanca, A data-oriented survey of context models. SIGMOD Record 36(4), 19–26 (2007)

    Article  Google Scholar 

  6. C. Bolchini, E. Quintarelli, L. Tanca, Carve: Context-aware automatic view definition over relational databases. Inf. Syst. 38(1), 45–67 (2013)

    Article  Google Scholar 

  7. M. Buoncristiano, G. Mecca, E. Quintarelli, M. Roveri, D. Santoro, L. Tanca, Database challenges for exploratory computing. SIGMOD Record 44(2), 17–22 (2015)

    Article  Google Scholar 

  8. M. Buoncristiano, G. Mecca, E. Quintarelli, M. Roveri, D. Santoro, L. Tanca, Exploratory computing: What is there for the database researcher?, in 23rd Italian Symposium on Advanced Database Systems, SEBD 2015, Gaeta, Italy, 14–17 June 2015 (2015), pp. 128–135

    Google Scholar 

  9. A. Calì, D. Calvanese, D. Martinenghi, Dynamic query optimization under access limitations and dependencies. J. Univers. Comput. Sci. 15(21), 33–62 (2009)

    MathSciNet  MATH  Google Scholar 

  10. A. Calì, D. Martinenghi, Conjunctive query containment under access limitations, in ER 2008 (2008), pp. 326–340

    Google Scholar 

  11. A. Calì, D. Martinenghi, Querying Data under Access Limitations, in ICDE 2008 (2008), pp. 50–59

    Google Scholar 

  12. A. Calì, D. Martinenghi, Querying Incomplete Data over Extended ER Schemata. TPLP 10(3), 291–329 (2010)

    MathSciNet  MATH  Google Scholar 

  13. A. Calì, D. Martinenghi, Querying the deep web (tutorial), in EDBT 2010 (2010), pp. 724–727

    Google Scholar 

  14. A. Calì, D. Martinenghi, R. Torlone, Keyword queries over the deep web, in ER 2016 (2016), pp. 260–268

    Google Scholar 

  15. I. Catallo, E. Ciceri, P. Fraternali, D. Martinenghi, M. Tagliasacchi, Top-k diversity queries over bounded regions. TODS 38(2), 10 (2013)

    Article  MathSciNet  MATH  Google Scholar 

  16. I. Catallo, S. Coniglio, P. Fraternali, and D. Martinenghi. A workload-dependent task assignment policy for crowdsourcing. WWW J., to appear, 2017

    Google Scholar 

  17. F. Chiang, R. Miller, A unified model for data and constraint repair, in ICDE 2011 (2011), pp. 446–457

    Google Scholar 

  18. H. Christiansen, D. Martinenghi, Simplification of database integrity constraints revisited: A transformational approach, in LOPSTR 2003 (2004), pp. 178–197

    Google Scholar 

  19. H. Christiansen, D. Martinenghi, Simplification of integrity constraints for data integration, in FoIKS (2004), pp. 31–48

    Google Scholar 

  20. H. Christiansen, D. Martinenghi, On simplification of database integrity constraints. Fundam. Inform. 71(4), 371–417 (2006)

    MathSciNet  MATH  Google Scholar 

  21. E. Ciceri, P. Fraternali, D. Martinenghi, M. Tagliasacchi, Crowdsourcing for top-k query processing over uncertain data. TKDE 28(1), 41–53 (2016)

    MATH  Google Scholar 

  22. F. Colace, M.D. Santo, V. Moscato, A. Picariello, F.A. Schreiber, L. Tanca (eds.), Data Management in Pervasive Systems, Data-Centric Systems and Applications (Springer, 2015)

    Google Scholar 

  23. H. Decker, D. Martinenghi, Inconsistency-tolerant integrity checking. TKDE 23(2), 218–234 (2011)

    Google Scholar 

  24. K.R. Fowler, J. Schmalzel, Why do we care about measurement? IEEE Instrum. Meas. Mag. 7(1), 38–46 (2004)

    Article  Google Scholar 

  25. K.R. Fowler, J.L. Schmalzel, Sensors: The first stage in the measurement chain. IEEE Instrum. Meas. Mag. 7(3), 60–65 (2004)

    Article  Google Scholar 

  26. P. Fraternali, D. Martinenghi, M. Tagliasacchi, Top-k bounded diversification, in SIGMOD 2012 (2012), pp. 421–432

    Google Scholar 

  27. P. Garza, E. Quintarelli, E. Rabosio, L. Tanca, Reducing big data by means of context-aware tailoring, in New Trends in Databases and Information Systems - ADBIS 2016 Short Papers and Workshops, BigDap, DCSA, DC, Proceedings, Prague, Czech Republic, 28–31 August 2016 (2016), pp. 115–127

    Google Scholar 

  28. J. Grant, A. Hunter, Measuring inconsistency in knowledgebases. J. Intell. Inf. Syst. 27(2), 159–184 (2006)

    Article  Google Scholar 

  29. S. Idreos, O. Papaemmanouil, S. Chaudhuri, Overview of data exploration techniques, in Proceedings of the 2015 ACM SIGMOD (2015), pp. 277–281

    Google Scholar 

  30. I.F. Ilyas, G. Beskales, M.A. Soliman, A survey of top-k query processing techniques in relational database systems. ACM Comput. Surv. 40(4) (2008)

    Google Scholar 

  31. S.R. Madden, M.J. Franklin, J.M. Hellerstein, W. Hong, Tinydb: an acquisitional query processing system for sensor networks. ACM Trans. Database Syst. 30(1), 122–173 (2005)

    Article  Google Scholar 

  32. D. Maier, A.O. Mendelzon, Y. Sagiv, Testing implications of data dependencies. TODS 4, 455–469 (1979)

    Article  Google Scholar 

  33. D. Martinenghi, Simplification of integrity constraints with aggregates and arithmetic built-ins, in FQAS 2004 (2004), pp. 348–361

    Google Scholar 

  34. D. Martinenghi, H. Christiansen, H. Decker, Integrity checking and maintenance in relational and deductive databases - and beyond, in Intelligent Databases: Technologies and Applications, ed. by Z. Ma (2006), pp. 238–285. Chap. X

    Google Scholar 

  35. D. Martinenghi, M. Tagliasacchi, Proximity rank join. PVLDB 3(1), 352–363 (2010)

    Google Scholar 

  36. D. Martinenghi, M. Tagliasacchi, Cost-aware rank join with random and sorted access. TKDE 24(12), 2143–2155 (2012)

    Google Scholar 

  37. D. Martinenghi, M. Tagliasacchi, Proximity measures for rank join. TODS 37(1) (2012)

    Google Scholar 

  38. D. Martinenghi, R. Torlone, Taxonomy-based relaxation of query answering in relational databases. VLDB J. 23(5), 747–769 (2014)

    Article  Google Scholar 

  39. M. Mazuran, E. Quintarelli, R. Rossato, L. Tanca, Mining violations to relax relational database constraints, in DaWaK (2009), pp. 339–353

    Google Scholar 

  40. M. Mazuran, E. Quintarelli, L. Tanca, IQ4EC: intensional answers as a support to exploratory computing, in 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015, Campus des Cordeliers, Paris, France, 19-21 October 2015 (2015), pp. 1–10

    Google Scholar 

  41. M. Mazuran, E. Quintarelli, L. Tanca, S. Ugolini, Semi-automatic support for evolving functional dependencies, in Proceedings of the 19th International Conference on Extending Database Technology, EDBT 2016, Bordeaux, France, 15-16 March 2016 (2016), pp. 293–304

    Google Scholar 

  42. A. Miele, E. Quintarelli, E. Rabosio, L. Tanca, A data-mining approach to preference-based data ranking founded on contextual information. Inf. Syst. 38(4), 524–544 (2013)

    Article  Google Scholar 

  43. K. Morton, M. Balazinska, D. Grossman, J.D. Mackinlay, Support the data enthusiast: Challenges for next-generation data-analysis systems. PVLDB 7(6), 453–456 (2014)

    Google Scholar 

  44. J.-M. Nicolas, Logic for improving integrity checking in relational data bases. Acta Informatica 18, 227–253 (1982)

    Article  MathSciNet  MATH  Google Scholar 

  45. E. Panigati, E.A. Schreiber, Context-aware software approaches: a comparison and an integration proposal (discussion paper), in Proceedings of the 22nd Italian Symposium on Advanced database Systems (2014), pp. 175–184

    Google Scholar 

  46. E. Panigati, E.A. Schreiber, C. Zaniolo, Data streams and data stream management systems and languages, in ed. by Colace, et al. [22], pp. 93–111

    Google Scholar 

  47. E. Quintarelli, E. Rabosio, L. Tanca, A principled approach to context schema evolution in a data management perspective. Inf. Syst. 49, 65–101 (2015)

    Article  Google Scholar 

  48. E. Quintarelli, E. Rabosio, L. Tanca, Recommending new items to ephemeral groups using contextual user influence, in Proceedings of the 10th ACM Conference on Recommender Systems (2016), pp. 285–292

    Google Scholar 

  49. A. Rajaraman, Y. Sagiv, J.D. Ullman, Answering queries using templates with binding patterns, in PODS (1995), pp. 105–112

    Google Scholar 

  50. F.A. Schreiber, R. Camplani, M. Fortunato, and M. Marelli, Design of a declarative data language for pervasive systems. Art Deco Technical Report R. A. 11.1b (2008), http://perlawsn.sourceforge.net/documentation.php?official=1

  51. F.A. Schreiber, R. Camplani, M. Fortunato, M. Marelli, G. Rota, Perla: A language and middleware architecture for data management and integration in pervasive information systems. IEEE Trans. Software Eng. 38(2), 478–496 (2012)

    Article  Google Scholar 

  52. F.A. Schreiber, M. Roveri, Sensors and wireless sensor networks as data sources: Models and languages, in ed. by Colace, et al. [22], pp. 69–92

    Google Scholar 

  53. F.A. Schreiber, L. Tanca, R. Camplani, D. Viganó, Pushing context-awareness down to the core: more flexibility for the perla language, in Proceedings of the 6th PersDB 2012 Workshop (Co-located with VLDB 2012) (2012), pp. 1–6

    Google Scholar 

  54. M.A. Soliman et al., Ranking with uncertain scoring functions: semantics and sensitivity measures, in SIGMOD 2011 (2011), pp. 805–816

    Google Scholar 

  55. K. Stefanidis, E. Pitoura, P. Vassiliadis, Managing contextual preferences. Inf. Syst. 36(8), 1158–1180 (2011)

    Article  Google Scholar 

  56. J.W. Tukey, Exploratory Data Analysis (Addison-Wesley, Reading, 1977)

    MATH  Google Scholar 

  57. D. Tunkelang, Faceted Search, Synthesis Lectures on Information Concepts, Retrieval, and Services (Morgan & Claypool Publishers, 2009)

    Google Scholar 

  58. R. Want, An introduction to rfid technology. IEEE Pervasive Comput. 5(1), 25–33 (2006)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Davide Martinenghi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Cite this chapter

Martinenghi, D., Quintarelli, E., Schreiber, F.A., Tanca, L. (2018). A Short Account of Techniques for Assisting Users in Mastering Big Data. In: Flesca, S., Greco, S., Masciari, E., Saccà, D. (eds) A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years. Studies in Big Data, vol 31. Springer, Cham. https://doi.org/10.1007/978-3-319-61893-7_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-61893-7_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-61892-0

  • Online ISBN: 978-3-319-61893-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics