ABSTRACT
Many of the services that are critical to Google's ad business have historically been backed by MySQL. We have recently migrated several of these services to F1, a new RDBMS developed at Google. F1 implements rich relational database features, including a strictly enforced schema, a powerful parallel SQL query engine, general transactions, change tracking and notification, and indexing, and is built on top of a highly distributed storage system that scales on standard hardware in Google data centers. The store is dynamically sharded, supports transactionally-consistent replication across data centers, and is able to handle data center outages without data loss.
The strong consistency properties of F1 and its storage system come at the cost of higher write latencies compared to MySQL. Having successfully migrated a rich customer-facing application suite at the heart of Google's ad business to F1, with no downtime, we will describe how we restructured schema and applications to largely hide this increased latency from external users. The distributed nature of F1 also allows it to scale easily and to support significantly higher throughput for batch workloads than a traditional RDBMS.
With F1, we have built a novel hybrid system that combines the scalability, fault tolerance, transparent sharding, and cost benefits so far available only in "NoSQL" systems with the usability, familiarity, and transactional guarantees expected from an RDBMS.
Index Terms
- F1: the fault-tolerant distributed RDBMS supporting google's ad business
Recommendations
Automated Tool for NoSQL to SQL Migration
ICISE '22: Proceedings of the 7th International Conference on Information Systems EngineeringChoosing which database to use is one of the most important decisions an organization needs to make when working on a new microservice. When deciding on a modern database, one of the biggest decisions is to select the correct type of (relational or non-...
Vishleshan: Performance Comparison and Programming Process Mining Algorithms in Graph-Oriented and Relational Database Query Languages
IDEAS '15: Proceedings of the 19th International Database Engineering & Applications SymposiumProcess-Aware Information System (PAIS) are IT systems that manages, supports business processes and generate large event logs from execution of business processes. Process Mining consists of analyzing event logs generated by PAISs and discover business ...
YAMATO and ASUKA: DNA database management system
HICSS '95: Proceedings of the 28th Hawaii International Conference on System SciencesWe previously developed a relational schema for effectively building, integrating, and searching the DNA database on the relational database management system, SYBASE, at DDBJ. The schema is named the "DDBJ schema". The schema allowed us to implement a ...
Comments