It is our great pleasure to welcome you to the Fifth International Workshop on Cloud Data Management (CloudDB 2013). This year we continue our tradition of serving as a premier forum for researchers and practitioners to present research results and share ideas and progress in the area of data management within cloud computing infrastructure. This broad area includes work in distributed storage, parallel algorithms, data mining, serving and analytic workloads, privacy, security, green computing, social workloads, and many others.
The call for papers attracted a wide range of submissions. The program committee accepted 4 papers on a variety of topics. In addition, the program includes a keynote speaker, Krishna Gade, from Twitter, speaking on Realtime Analytics at Twitter, a highly interesting topic for Cloud DB research. We hope these proceedings will serve as a valuable resource to learn about the latest and most exciting work in cloud computing.
We hope that you will find this program interesting and thought provoking and that the symposium will provide you with a valuable opportunity to share ideas with other researchers and practitioners from institutions around the world.
Proceeding Downloads
Realtime analytics @ twitter
In this talk, we will discuss the data pipeline at Twitter that collects, aggregates and processes large volumes of data in real time and also how it fits in the broader data infrastructure ecosystem. We will also discuss challenges we have faced and ...
Processing online aggregation on skewed data in mapreduce
In online aggregation, a system constantly maintains an estimate of the final answer to an aggregate query throughout execution, along with statistically meaningful bounds for the estimate's accuracy. Given the popularity of ad-hoc analytic query ...
SO-1SR: towards a self-optimizing one-copy serializability protocol for data management in the cloud
Clouds are very attractive environments for deploying different types of applications due to their pay-as-you-go cost model and their highly available and scalable infrastructure. Data management is an integral part of the applications deployed in the ...
Analysis of partitioning strategies for graph processing in bulk synchronous parallel models
Vertex centric computation implemented with a Bulk Synchronous Parallel (BSP) model is becoming a popular choice to analyze huge graphs. In this paper, we study the impact of the graph partitioning strategies for BSP by simulating different partitions ...
A SLA graph model for data services
Cloud computing has given rise to on-demand service provisioning and massive outsourcing of IT infrastructures and applications to virtual, commoditized ones. Despite the broad Service Level Agreement (SLA) usage in scientific settings, their role in ...
Index Terms
- Proceedings of the fifth international workshop on Cloud data management
Recommendations
Acceptance Rates
Year | Submitted | Accepted | Rate |
---|---|---|---|
CloudDB '13 | 6 | 4 | 67% |
CloudDB '09 | 11 | 8 | 73% |
Overall | 17 | 12 | 71% |