skip to main content
10.1145/2637166.2637226acmotherconferencesArticle/Chapter ViewAbstractPublication PagesapsysConference Proceedingsconference-collections
research-article

D2P: a distributed deadline propagation approach to tolerate long-tail latency in datacenters

Published: 25 June 2014 Publication History

Abstract

We propose a Distributed Deadline Propagation (D2P) approach for datacenter applications to tolerate latency variability. The key idea of D2P is to allow local nodes to perceive global deadline information and to propagate the information among distributed nodes. Local nodes can leverage the information to do scheduling and adjust processing speed to reduce latency variability. Preliminary experimental results show that D2P has the potential of reducing the long-tail latency in datacenters by lever-aging propagated deadline information on the local nodes.

References

[1]
Dubbo distributed service framework. http://code.alibabatech.com/wiki/display/dubbo/Home.
[2]
Google's marissa mayer: Speed wins. http://www.zdnet.com/blog/btl/googles-marissa-mayer-speed-wins/3925.
[3]
Least laxity first. http://en.wikipedia.org/wiki/Least_slack_time_scheduling.
[4]
Alizadeh, M., Greenberg, A., Maltz, D. A., Padhye, J., Patel, P., Prabhakar, B., Sengupta, S., and Sridharan, M. Data center TCP (DCTCP). In Proceedings of the ACM SIGCOMM 2010 Conference (New York, NY, USA, 2010), SIGCOMM '10, ACM, pp. 63--74.
[5]
Barroso, L. A., Clidaras, J., and Holzle, U. The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis Lectures on Computer Architecture 8, 3 (2013), 1--154.
[6]
Chen, T., Chen, Y., Guo, Q., Temam, O., Wu, Y., and Hu, W. Statistical performance comparisons of computers. In High Performance Computer Architecture (HPCA), 2012 IEEE 18th International Symposium on (2012), IEEE, pp. 1--12.
[7]
Cipar, J., Ho, Q., Kim, J. K., Lee, S., Ganger, G. R., Gibson, G., Keeton, K., and Xing, E. Solving the straggler problem with bounded staleness. In Proceedings of the 14th USENIX Conference on Hot Topics in Operating Systems (Berkeley, CA, USA, 2013), HotOS'13, USENIX Association, pp. 22--22.
[8]
Dean, J. Achieving rapid response times in large online services. In Berkeley AMPLab Cloud Seminar (2012).
[9]
Dean, J., and Barroso, L. A. The tail at scale. Communications of the ACM 56, 2 (2013), 74--80.
[10]
Delimitrou, C., and Kozyrakis, C. Paragon: QoS-aware scheduling for heterogeneous datacenters. In Proceedings of the Eighteenth International Conference on Architectural Support for Programming Languages and Operating Systems (New York, NY, USA, 2013), ASPLOS '13, ACM, pp. 77--88.
[11]
Iyer, R. CQoS: a framework for enabling QoS in shared caches of CMP platforms. In Proceedings of the 18th annual international conference on Supercomputing (2004), ACM, pp. 257--266.
[12]
Iyer, R., Zhao, L., Guo, F., Illikkal, R., Makineni, S., Newell, D., Solihin, Y., Hsu, L., and Reinhardt, S. QoS policies and architecture for cache/memory in CMP platforms. In ACM SIGMETRICS Performance Evaluation Review (2007), vol. 35, ACM, pp. 25--36.
[13]
Krushevskaja, D., and Sandler, M. Understanding latency variations of black box services. In Proceedings of the 22Nd International Conference on World Wide Web (Republic and Canton of Geneva, Switzerland, 2013), WWW '13, International World Wide Web Conferences Steering Committee, pp. 703--714.
[14]
Krzysztof Ostrowski, Gideon Mann, and Mark Sandler. Diagnosing latency in multi-tier black-box services. In 5th Workshop on Large Scale Distributed Systems and Middleware (LADIS 2011) (2011).
[15]
Lin, J., Lu, Q., Ding, X., Zhang, Z., Zhang, X., and Sadayappan, P. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. In High Performance Computer Architecture, 2008. HPCA 2008. IEEE 14th International Symposium on (2008), IEEE, pp. 367--378.
[16]
Liu, L., Cui, Z., Xing, M., Bao, Y., Chen, M., and Wu, C. A software memory partition approach for eliminating bank-level interference in multicore systems. In Proceedings of the 21st international conference on Parallel architectures and compilation techniques (2012), ACM, pp. 367--376.
[17]
Nishtala, R., Fugal, H., Grimm, S., Kwiatkowski, M., Lee, H., Li, H. C., McElroy, R., Paleczny, M., Peek, D., Saab, P., Stafford, D., Tung, T., and Venkataramani, V. Scaling memcache at facebook. In Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation (Berkeley, CA, USA, 2013), nsdi'13, USENIX Association, pp. 385--398.
[18]
Ravindranath, L., Padhye, J., Agarwal, S., Mahajan, R., Obermiller, I., and Shayandeh, S. AppInsight: mobile app performance monitoring in the wild. In Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation (Berkeley, CA, USA, 2012), OSDI'12, USENIX Association, pp. 107--120.
[19]
Ravindranath, L., Padhye, J., Mahajan, R., and Balakrishnan, H. Timecard: Controlling user-perceived delays in server-based mobile applications. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (New York, NY, USA, 2013), SOSP '13, ACM, pp. 85--100.
[20]
Schurman, E., and J. Brutlag. The user and business impact of server delays.
[21]
Tang, L., Mars, J., and Soffa, M. L. Compiling for niceness: Mitigating contention for QoS in warehouse scale computers. In Proceedings of the Tenth International Symposium on Code Generation and Optimization (New York, NY, USA, 2012), CGO '12, ACM, pp. 1--12.
[22]
Tang, L., Mars, J., Wang, W., Dey, T., and Soffa, M. L. ReQoS: reactive Static/Dynamic compilation for QoS in warehouse scale computers. In Proceedings of the Eighteenth International Conference on Architectural Support for Programming Languages and Operating Systems (New York, NY, USA, 2013), ASPLOS '13, ACM, pp. 89--100.
[23]
Vamanan, B., Hasan, J., and Vijaykumar, T. Deadline-aware datacenter TCP (D2TCP). In Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (New York, NY, USA, 2012), SIGCOMM '12, ACM, pp. 115--126.
[24]
Wilson, C., Ballani, H., Karagiannis, T., and Rowtron, A. Better never than late: Meeting deadlines in datacenter networks. In Proceedings of the ACM SIGCOMM 2011 Conference (New York, NY, USA, 2011), SIGCOMM '11, ACM, pp. 50--61.
[25]
Xu, Y., Musgrave, Z., Noble, B., and Bailey, M. Bobtail: Avoiding long tails in the cloud. In Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation (Berkeley, CA, USA, 2013), nsdi'13, USENIX Association, pp. 329--342.
[26]
Zats, D., Das, T., Mohan, P., Borthakur, D., and Katz, R. DeTail: reducing the flow completion time tail in datacenter networks. In Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (New York, NY, USA, 2012), SIGCOMM '12, ACM, pp. 139--150.
[27]
Zhuravlev, S., Blagodurov, S., and Fedorova, A. Addressing shared resource contention in multicore processors via scheduling. In Proceedings of the Fifteenth Edition of ASPLOS on Architectural Support for Programming Languages and Operating Systems (New York, NY, USA, 2010), ASPLOS XV, ACM, pp. 129--142.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
APSys '14: Proceedings of 5th Asia-Pacific Workshop on Systems
June 2014
98 pages
ISBN:9781450330244
DOI:10.1145/2637166
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Chinese Academy of Sciences

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 June 2014

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

APSys'14
Sponsor:
APSys'14: Asia-Pacific Workshop on Systems
June 25 - 26, 2014
Beijing, China

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media