DOI: 10.1145/1989323.1989445
research-article

Oracle database filesystem

Published: 12 June 2011

Abstract

Modern enterprise, web, and multimedia applications generate unstructured content at unforeseen volumes in the form of documents, text, and media files. Such content is generally associated with relational data such as user names, location tags, and timestamps. Storing unstructured content in a relational database would extend the same robustness, transactional consistency, data integrity, recoverability, and other data management features to both files and relational content. Although database systems are preferred for relational data management, poor performance of unstructured data storage, limited data transformation functionality, and the lack of interfaces based on filesystem standards may keep more than eighty-five percent of non-relational unstructured content out of databases in the coming decades.
We introduce Oracle Database Filesystem (DBFS) as a consolidated solution that unifies state-of-the-art network filesystem features with those of relational database management. DBFS is a novel shared-storage network filesystem developed in the RDBMS kernel that allows content management applications to transparently store and organize files using standard filesystem interfaces, in the same database that stores the associated relational content. The server component of DBFS is based on Oracle SecureFiles, a novel unstructured data storage engine within the RDBMS that provides filesystem-like or better storage performance for files within the database while fully leveraging relational data management features such as transaction atomicity, isolation, read consistency, temporality, and information lifecycle management.
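As a rough illustration of what "standard filesystem interfaces" means for an application, the sketch below writes a file with ordinary Python file I/O. It is not taken from the paper: the helper name `store_document` and the idea of passing a mount point are assumptions made here, and the demo uses a temporary directory as a stand-in for a mounted filesystem. Nothing in it is DBFS-specific, which is the point: code written against a regular directory would not need to change.

```python
import tempfile
from pathlib import Path


def store_document(mount_point: Path, relative_name: str, payload: bytes) -> Path:
    """Write payload under mount_point using plain filesystem calls.

    mount_point is a hypothetical directory where a filesystem is mounted;
    the function itself knows nothing about what backs that directory.
    """
    target = mount_point / relative_name
    # Create intermediate directories just as on any other filesystem.
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_bytes(payload)
    return target


if __name__ == "__main__":
    # Stand-in for a real mount point; a temporary directory keeps the demo runnable.
    with tempfile.TemporaryDirectory() as fake_mount:
        path = store_document(Path(fake_mount), "reports/2011/q2.txt", b"quarterly report")
        print(path.read_bytes())
```

The sketch shows only the application-facing side. Atomicity, read consistency, and the other database features described in the abstract would be provided by the server, not by anything in this code.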
We present a preliminary performance evaluation of DBFS that demonstrates more than 10 TB/hr throughput of filesystem read and write operations, sustained consistently over a period of 12 hours on an Oracle Exadata Database cluster of four server nodes. In terms of file storage, such performance is equivalent to ingesting more than 2500 million 100 KB document files in a single day. These initial results look very promising for DBFS as a universal storage solution for both relational and unstructured content.
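The files-per-day figure can be sanity-checked with a short calculation. The sketch below is not from the paper; the function name and the `binary_units` switch are assumptions, and the abstract does not say whether TB and KB are decimal or binary units, so both readings are computed.

```python
def files_per_day(throughput_tb_per_hr: float, file_size_kb: float,
                  binary_units: bool = True) -> float:
    """Number of files of the given size ingested in 24 hours at a sustained rate."""
    base = 1024 if binary_units else 1000
    bytes_per_tb = base ** 4   # TB -> bytes
    bytes_per_kb = base        # KB -> bytes
    bytes_per_day = throughput_tb_per_hr * bytes_per_tb * 24
    return bytes_per_day / (file_size_kb * bytes_per_kb)


if __name__ == "__main__":
    for binary in (False, True):
        label = "binary" if binary else "decimal"
        millions = files_per_day(10, 100, binary_units=binary) / 1e6
        print(f"{label} units: {millions:.0f} million files/day")
```

At exactly 10 TB/hr the decimal reading gives 2400 million files per day and the binary reading about 2577 million. Since the reported throughput is "more than" 10 TB/hr, the stated figure of more than 2500 million is plausible under either reading.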


Published In

SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
June 2011
1364 pages
ISBN: 9781450306614
DOI: 10.1145/1989323

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. RDBMS
  2. atomicity
  3. database filesystem
  4. performance and scalability
  5. read consistency
  6. securefiles
  7. temporality
  8. unstructured data

Conference

SIGMOD/PODS '11

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%
