short-paper

Weak consistency and stochastic environments: harmonization of replicated machine learning models

Authors:
Tobias Herb, Technical University Berlin
Tim Jungnickel, Technical University Berlin
Christoph Alt, Technical University Berlin

PaPoC '16: Proceedings of the 2nd Workshop on the Principles and Practice of Consistency for Distributed Data, April 2016, Article No.: 8, Pages 1–3, https://doi.org/10.1145/2911151.2911161

Published: 18 April 2016

ABSTRACT

Many machine learning (ML) models are stochastic in nature. We aim to combine the principles of weak consistency with large-scale distributed machine learning. We see two interesting opportunities in this domain: (1) treating parallel ML algorithms based on model replication as a "collaborative task" in which local progress on models is exchanged instantaneously, and (2) making this exchange more efficient by exploiting the underlying stochastic nature. Based on this motivation, we extend the notion of consistency for replicated objects with intrinsic stochastic structure and introduce harmonization as the reconciliation principle that enables efficient consistency maintenance of these objects. As a concrete application, we present the harmonization of replicated ML models.


Published in
PaPoC '16: Proceedings of the 2nd Workshop on the Principles and Practice of Consistency for Distributed Data
April 2016
54 pages
ISBN: 9781450342964
DOI: 10.1145/2911151
Program Chairs: Peter Alvaro (UC Santa Cruz), Alysson Bessani (Faculdade de Ciências, Universidade de Lisboa, Portugal)
Copyright © 2016 ACM
Publisher: Association for Computing Machinery, New York, NY, United States
Author Tags
distributed systems
large-scale machine learning
stochastic gradient descent
weak consistency
Acceptance Rates
Overall Acceptance Rate: 34 of 47 submissions, 72%