Abstract
Tertiary storage systems are used when secondary storage cannot satisfy data storage requirements or when tertiary storage is the more cost-effective option. New application domains require on-demand retrieval of data from these devices. This paper investigates issues in optimizing I/O time for a query whose data resides on automated tertiary storage containing multiple storage devices.
This work is supported by the DOE ASCI Alliance program under contract B347875 from Lawrence Livermore National Labs.
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
More, S., Choudhary, A. (2000). Scheduling Queries for Tape-Resident Data. In: Bode, A., Ludwig, T., Karl, W., Wismüller, R. (eds) Euro-Par 2000 Parallel Processing. Euro-Par 2000. Lecture Notes in Computer Science, vol 1900. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44520-X_181
DOI: https://doi.org/10.1007/3-540-44520-X_181
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67956-1
Online ISBN: 978-3-540-44520-3
eBook Packages: Springer Book Archive