A Paradox in ML Design: Less data for a smarter water metering cognification experience

Authors:
M. Roccetti, Department of Computer Science and Engineering, University of Bologna, Italy
G. Delnevo, Department of Computer Science and Engineering, University of Bologna, Italy
L. Casini, Department of Computer Science and Engineering, University of Bologna, Italy
N. Zagni, Department of Management, University of Bologna, Italy
G. Cappiello, Department of Management, University of Bologna, Italy

GoodTechs '19: Proceedings of the 5th EAI International Conference on Smart Objects and Technologies for Social Good
September 2019
Pages 201–206
https://doi.org/10.1145/3342428.3342685

Published: 25 September 2019

ABSTRACT

Many data scientists currently point out that how much Machine Learning (ML) research crosses into practice will depend not only on the ability of specialized algorithms to scrutinize positive and negative examples, but also on the quality of the data used to train those algorithms. Our experience in training a neural network with a huge dataset of over fifteen million water meter readings confirms this conjecture. In this paper, we report on the actions we took to extract from that database just those data that could correctly represent the complex statistical phenomenon in play. With an adequate reorganization of those data, we obtained an interesting, yet controversial, result. On the one hand, we improved the accuracy of predicting when a water meter fails or needs disassembly based on a history of water consumption measurements, thus making the meter maintenance process smarter; on the other hand, all this came with the paradox of a (statistical) transformation of the initial dataset: while we alleviate a problem with a restructured and more interpretable data model, we simultaneously change the replicated form of those data.

Index Terms

1. Computing methodologies
  1. Artificial intelligence
  2. Machine learning

Published in
GoodTechs '19: Proceedings of the 5th EAI International Conference on Smart Objects and Technologies for Social Good
September 2019
272 pages
ISBN: 9781450362610
DOI: 10.1145/3342428
General Chairs:
Armir Bujari, Università di Padova, Italy
Pietro Manzoni, Universitat Politecnica de Valencia, Spain
Program Chairs:
Anna Forster, University of Bremen, Germany
Edjair Mota, UFAM, Brazil
Ombretta Gaggi, Università di Padova, Italy
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Human-Machine-Bigdata Interaction Loop
Interactive machine learning
Machine learning design
Smart data
Water Metering
Qualifiers
- research-article
- Research
- Refereed limited