skip to main content
column

Model Selection Management Systems: The Next Frontier of Advanced Analytics

Published: 09 May 2016 Publication History

Abstract

John Boyd recognized in the 1960's the importance of situation awareness for military operations and introduced the notion of the OODA loop (Observe, Orient, Decide, and Act). Today we realize that many applications have to deal with situation awareness: Customer Relationship Management, Human Capital Management, Supply Chain Management, patient care, power grid management, and cloud services management, as well as any IoT (Internet of Things) related application; the list seems to be endless. Situation awareness requires applications to support the management of data, knowledge, processes, and other services such as social networking in an integrated way. These applications additionally require high personalization as well as rapid and continuous evolution. They must provide a wide variety of operational and functional requirements, including real time processing.
Handcrafting these applications is an almost impossible task requiring exhaustive resources for development and maintenance. Due to the resources and time involved in their development, these applications typically fall way short of the desired functionality, operational characteristics, and ease and speed of evolution. We - the authors - have developed a model enabling the development and maintenance of situation-aware applications in a declarative and therefore economical manner; we call this model KIDS - Knowledge Intensive Data-processing System.

References

[1]
Microsoft Azure ML. studio.azureml.net.
[2]
Oracle R Enterprise. www.oracle.com.
[3]
SAS Report on Analytics. sas.com/reg/wp/corp/23876.
[4]
M. Anderson et al. Brainwash: A Data System for Feature Engineering. In CIDR, 2013.
[5]
P. Domingos. A Few Useful Things to Know about Machine Learning. CACM, 2012.
[6]
X. Feng et al. Towards a Unified Architecture for in-RDBMS Analytics. In SIGMOD, 2012.
[7]
Y. Ganjisaffar et al. Distributed Tuning of Machine Learning Algorithms Using MapReduce Clusters. In LDMTA, 2011.
[8]
A. Ghoting et al. SystemML: Declarative Machine Learning on MapReduce. In ICDE, 2011.
[9]
I. Guyon et al. Feature Extraction: Foundations and Applications. New York: Springer-Verlag, 2001.
[10]
T. Hastie et al. Elements of Statistical Learning: Data mining, inference, and prediction. Springer-Verlag, 2001.
[11]
J. Hellerstein et al. The MADlib Analytics Library or MAD Skills, the SQL. In VLDB, 2012.
[12]
S. Kandel et al. Enterprise Data Analysis and Visualization: An Interview Study. IEEE TVCG, 2012.
[13]
T. Kraska et al. MLbase: A Distributed Machine-learning System. In CIDR, 2013.
[14]
A. Kumar et al. A Survey of the Existing Landscape of ML Systems. UW-Madison CS Tech. Rep. TR1827, 2015.
[15]
A. Kumar et al. Learning Generalized Linear Models Over Normalized Data. In SIGMOD, 2015.
[16]
A. Kumar et al. To Join or Not to Join? Thinking Twice about Joins before Feature Selection. In SIGMOD, 2016.
[17]
C. Ré et al. Feature Engineering for Knowledge Base Construction. IEEE Data Engineering Bulletin, 2014.
[18]
C. Zhang et al. Materialization Optimizations for Feature Selection Workloads. In SIGMOD, 2014.
[19]
Y. Zhang et al. I/O-Efficient Statistical Computing with RIOT. In ICDE, 2010.

Cited By

View all
  • (2024)DMRNet: Effective Network for Accurate Discharge Medication Recommendation2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00262(3393-3406)Online publication date: 13-May-2024
  • (2024)Non-Invasive Fairness in Learning Through the Lens of Data Drift2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00172(2164-2178)Online publication date: 13-May-2024
  • (2023)Saturn: An Optimized Data System for Multi-Large-Model Deep Learning WorkloadsProceedings of the VLDB Endowment10.14778/3636218.363622717:4(712-725)Online publication date: 1-Dec-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGMOD Record
ACM SIGMOD Record  Volume 44, Issue 4
December 2015
59 pages
ISSN:0163-5808
DOI:10.1145/2935694
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 May 2016
Published in SIGMOD Volume 44, Issue 4

Check for updates

Qualifiers

  • Column

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)69
  • Downloads (Last 6 weeks)1
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)DMRNet: Effective Network for Accurate Discharge Medication Recommendation2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00262(3393-3406)Online publication date: 13-May-2024
  • (2024)Non-Invasive Fairness in Learning Through the Lens of Data Drift2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00172(2164-2178)Online publication date: 13-May-2024
  • (2023)Saturn: An Optimized Data System for Multi-Large-Model Deep Learning WorkloadsProceedings of the VLDB Endowment10.14778/3636218.363622717:4(712-725)Online publication date: 1-Dec-2023
  • (2023)Provenance documentation to enable explainable and trustworthy AI: A literature reviewData Intelligence10.1162/dint_a_001195:1(139-162)Online publication date: 8-Mar-2023
  • (2023)The Many Facets of Data EquityJournal of Data and Information Quality10.1145/353342514:4(1-21)Online publication date: 7-Feb-2023
  • (2023)TCCC Decision Support With Machine Learning Prediction of Hemorrhage Risk, Shock ProbabilityMilitary Medicine10.1093/milmed/usad298188:Supplement_6(659-665)Online publication date: 8-Nov-2023
  • (2023)TSPredIT: Integrated Tuning of Data Preprocessing and Time Series Prediction ModelsTransactions on Large-Scale Data- and Knowledge-Centered Systems LIV10.1007/978-3-662-68014-8_2(41-55)Online publication date: 22-Sep-2023
  • (2022)A deep learning dataloader with shared data preparationProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3601517(17146-17156)Online publication date: 28-Nov-2022
  • (2022)SHiFTProceedings of the VLDB Endowment10.14778/3565816.356583116:2(304-316)Online publication date: 1-Oct-2022
  • (2022)Data Management for Machine Learning: A SurveyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3148237(1-1)Online publication date: 2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media