DOI: 10.1145/2783258.2788596
research-article

Exploiting Data Mining for Authenticity Assessment and Protection of High-Quality Italian Wines from Piedmont

Published: 10 August 2015

ABSTRACT

This paper discusses the data mining approach followed in the TRAQUASwine project, which aims to define data-analytic methods for assessing the authenticity of some of the highest-value Nebbiolo-based wines from the Piedmont region of Italy and for protecting them against counterfeits. This is a major issue in the wine market, where commercial fraud involving such products is estimated to be worth millions of euros. The objective is twofold: to show that the problem can be addressed without expensive and hyper-specialized wine analyses, and to demonstrate the actual usefulness of classification algorithms for data mining on the resulting chemical profiles. Following Wagstaff's proposal for the practical exploitation of machine learning (and data mining) approaches, we describe how the data were collected and prepared to produce different datasets, how suitable classification models were identified, and how the interpretation of the results suggests that classification techniques based on standard chemical profiling can play an active role in assessing the authenticity of the wines targeted by the study.


References

  1. F. Acevedo, J. Nez, S. Maldonado, E. Domínguez, and A. Narváez. Classification of wines produced in specific regions by UV-visible spectroscopy combined with support vector machines. J. Agric. Food Chem., 55:6842--6849, 2013.
  2. I. Arvanitoyannis, M. Katsota, E. Psarra, E. Soufleros, and S. Kallithraka. Application of quality control methods for assessing wine authenticity: Use of multivariate analysis (chemometrics). Trends in Food Science and Technology, 10:321--336, 1999.
  3. G. Cooper and E. Herskovits. A Bayesian method for the induction of probabilistic networks from data. Machine Learning, 9(4):309--347, 1992.
  4. P. Cortez, A. Cerdeira, F. Almeida, T. Matos, and J. Reis. Modeling wine preferences by data mining from physicochemical properties. Decision Support Systems, 47(4):547--553, 2009.
  5. S. Gómez-Meire, C. Campos, E. Falqué, F. Díaz, and F. Fdez-Riverola. Assuring the authenticity of northwest Spain white wine varieties using machine learning techniques. Food Research International, 60:230--240, 2014.
  6. M. Grzegorczyk. An introduction to Gaussian Bayesian Networks. In Systems Biology in Drug Discovery and Development, volume 662, pages 121--147. Springer, 2010.
  7. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. Witten. The WEKA data mining software: An update. SIGKDD Explorations, 11(1):10--18, 2009.
  8. M. A. Hall. Correlation-based Feature Subset Selection for Machine Learning. PhD thesis, University of Waikato, Hamilton, New Zealand, 1998.
  9. D. Mattera and S. Haykin. Support vector machines for dynamic reconstruction of a chaotic system. In B. Schölkopf, C. Burges, and A. Smola, editors, Advances in Kernel Methods, pages 211--241. MIT Press, 1999.
  10. J. Platt. Fast training of support vector machines using sequential minimal optimization. In B. Schölkopf, C. Burges, and A. Smola, editors, Advances in Kernel Methods, pages 185--208. MIT Press, 1999.
  11. J. Platt. Probability for SV machines. In A. Smola, P. Bartlett, B. Schölkopf, and D. Schuurmans, editors, Advances in Large Margin Classifiers, pages 61--74. MIT Press, 2000.
  12. L. Portinale and L. Saitta. Feature selection. Technical Report D.14.1, Mining Mart Project, 2002. http://mmart.cs.uni-dortmund.de/content/publications.html.
  13. P. Spirtes, C. Glymour, and R. Scheines. Causation, Prediction and Search. Springer Verlag, Berlin, 1993.
  14. B. Üstün, W. Melssen, and L. Buydens. Facilitating the application of support vector regression by using a universal Pearson VII function based kernel. Chemometrics and Intelligent Laboratory Systems, 81:29--40, 2006.
  15. A. Versari, V. Laurie, A. Ricci, L. Laghi, and G. Parpinello. Progress in authentication, typification and traceability of grapes and wines by chemometric approaches. Food Research International, 60:2--18, 2014.
  16. K. Wagstaff. Machine learning that matters. In Proceedings of the 29th International Conference on Machine Learning (ICML 12), Edinburgh, UK, 2012.
  17. Y. Zhao, S. Yu, B. Chu, N. Zhang, and X. Hu. Classification of three wine varieties based on ELM and PCA. In Lecture Notes in Computer Science, volume 7751, pages 647--654. 2013.

      • Published in

        KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
        August 2015
        2378 pages
        ISBN: 9781450336642
        DOI: 10.1145/2783258

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from ACM.

        Publisher

        Association for Computing Machinery

        New York, NY, United States


        Acceptance Rates

        KDD '15 Paper Acceptance Rate: 160 of 819 submissions, 20%. Overall Acceptance Rate: 1,133 of 8,635 submissions, 13%.
