skip to main content
10.1145/2383276.2383315acmotherconferencesArticle/Chapter ViewAbstractPublication PagescompsystechConference Proceedingsconference-collections
research-article

An approach for indexing file names in a directory

Published: 22 June 2012 Publication History

Abstract

The directory management is an essential part of the file systems. The carefully designed directory's internal structures can have major impact on the performance. The modern file systems tend to support millions of files per directory and the file name indexing is a milestone. Two main approaches exist -- b-trees and hashing. The current paper proposes a modification of the b-tree which combines the features of both. It is presented the modification -- the so-called bh-tree and an approach to index and maintain the file names in a directory. The aim is fast and parallelizable management of the indexed file names.

References

[1]
Chang, Ye-In, Chien-I Lee, Alternating Hashing for Expansible Files, IEEE Transactions On Knowledge And Data Engineering, Vol. 9, No. 1, January-February 1997, pp. 179--185
[2]
Comer, D., Ubiquitous B-Tree, ACM Computing Surveys (CSUR), Volume 11, Issue 2, June 1979, Pages: 121--137, ISSN:0360-0300
[3]
Cormen, Th. H., Ch. E. Leiserson, R. L. Rivest, Cl. Stein, Introduction to Algorithms, Second Edition, The MIT Press, 2001, ISBN 0-262-03293-7
[4]
EXT4 source code, http://www.kernel.org/
[5]
HFS Plus file system, http://developer.apple.com/mac/library/technotes/tn/tn1150.htm
[6]
Intel® 64 and IA-32 Architectures Software Developer's Manual, http://www.intel.com/products/processor/manuals/
[7]
JFS file system, http://jfs.sourceforge.net/
[8]
Litwin, W. A., N. Roussopoulos, G. Levy, W. Hong, Trie Hashing With Controlled Load, IEEE Transactions On Software Engineering, Vol. 17, No. 7, July 1991, pp. 678--691
[9]
Mathur, A. et al., The new ext4 filesystem: current status and future plans, Proceedings of the Linux Symposium, June 2007, Ottawa, Ontario, Canada, Vol. 2, pp. 21--33
[10]
McKusick, M. K., G. V. Neville-Neil, The Design and Implementation of the FreeBSD Operating System, Addison Wesley, 2004, ISBN: 0-201-70245-2
[11]
Mouza, C. du, W. Litwin, Ph. Rigaux, Th. Schwarz, AS-Index: A Structure For String Search Using n-grams and Algebraic Signatures, Proceeding of the 18th ACM conference on Information and knowledge management, 2009, pp. 295--304
[12]
NTFS file system, http://msdn.microsoft.com
[13]
Patil, S. V., G. A. Gibson, S. Lang, M. Polte, GIGA+: Scalable Directories for Shared File Systems, ACM Digital Library, Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing '07, 2007, pp. 26--29
[14]
Preslan, K. W., A. P. Barry, J. E. Brassow, G. M. Erickson, E. Nygaard, C. J. Sabol, S. R. Soltis, D. C. Teigland, M. T. O'Keefe, A 64-bit, Shared Disk File System for Linux, 16th IEEE Symposium on Mass Storage Systems, 1999, pp. 22--41
[15]
Rodeh, O., B-trees, Shadowing, and Clones, ACM Transactions on Storage, Volume 3, Issue 4, 2008, pp. 15:1--15:27
[16]
Russinovich, M. E., D. A. Solomon, Microsoft® Windows® Internals, Fourth Edition: Microsoft Windows Server#8482; 2003, Windows XP, and Windows 2000, Microsoft Press, 2004, ISBN: 0735619174
[17]
Sedgewick, R., Algorithms in Java, Third Edition, Addison Wesley, 2002, ISBN: 0-201-36120-5
[18]
SHA-1, http://csrc.nist.gov/groups/ST/toolkit/index.html
[19]
Silberschatz, A., P. Galvin, G. Gagne, Operating Systems Concepts, 7th Edition, John Wiley & Sons, 2005, ISBN 0-471-69466-5
[20]
Stallings, W., Operating Systems Internals and Design Principles, 5th Edition, Prentice Hall, 2004, ISBN-10: 0131479547
[21]
Tanenbaum, A., Modern Operating Systems, 2nd Edition, Prentice Hall, 2001, ISBN 0-13-092641-8
[22]
Wang, J., R. Min, Y. Zhu, Y. Hu, UCFS---A Novel User-Space, High Performance, Customized File System for Web Proxy Servers, IEEE Transactions On Computers, Vol. 51, No. 9, September 2002, pp. 1056--1073
[23]
XFS file system, http://xfs.org
[24]
Xing, J., J. Xiong, N. Sun, J. Ma, Adaptive and Scalable Metadata Management to Support A Trillion Files, ACM Digital Library, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, Portland, Oregon, 2009

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CompSysTech '12: Proceedings of the 13th International Conference on Computer Systems and Technologies
June 2012
440 pages
ISBN:9781450311939
DOI:10.1145/2383276
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 June 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. directory management
  2. file system
  3. indexing
  4. operating systems

Qualifiers

  • Research-article

Conference

CompSysTech'12

Acceptance Rates

Overall Acceptance Rate 241 of 492 submissions, 49%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 93
    Total Downloads
  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media