skip to main content
10.1145/378993acmconferencesBook PagePublication PagesasplosConference Proceedingsconference-collections
ASPLOS IX: Proceedings of the ninth international conference on Architectural support for programming languages and operating systems
ACM2000 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
ASPLOS00: ASPLOS 2000 Conference Cambridge Massachusetts USA
ISBN:
978-1-58113-317-2
Published:
12 November 2000
Next Conference
Reflects downloads up to 07 Mar 2025Bibliometrics
Abstract

No abstract available.

Skip Table Of Content Section
Article
Free
Designing computer systems with MEMS-based storage

For decades the RAM-to-disk memory hierarchy gap has plagued computer architects. An exciting new storage technology based on microelectromechanical systems (MEMS) is poised to fill a large portion of this performance gap, significantly reduce system ...

Article
Free
Architecture and design of AlphaServer GS320

This paper describes the architecture and implementation of the AlphaServer GS320, a cache-coherent non-uniform memory access multiprocessor developed at Compaq. The AlphaServer GS320 architecture is specifically targeted at medium-scale multiprocessing ...

Article
Free
Timestamp snooping: an approach for extending SMPs

Symmetric muultiprocessor (SMP) servers provide superior performance for the commercial workloads that dominate the Internet. Our simulation results show that over one-third of cache misses by these applications result in cache-to-cache transfers, where ...

Article
Free
MemorIES3: a programmable, real-time hardware emulation tool for multiprocessor server design

Modern system design often requires multiple levels of simulation for design validation and performance debugging. However, while machines have gotten faster, and simulators have become more detailed, simulation speeds have not tracked machine speeds, ...

Article
Free
FLASH vs. (Simulated) FLASH: closing the simulation loop

Simulation is the primary method for evaluating computer systems during all phases of the design process. One significant problem with simulation is that it rarely models the system exactly, and quantifying the resulting simulator error can be ...

Article
Free
Using meta-level compilation to check FLASH protocol code

Building systems such as OS kernels and embedded software is difficult. An important source of this difficulty is the numerous rules they must obey: interrupts cannot be disabled for ~too long," global variables must be protected by locks, user pointers ...

Article
Free
Evaluating design alternatives for reliable communication on high-speed networks

We systematically evaluate the performance of five implementations of a single, user-level communication interface. Each implementation makes different architectural assumptions about the reliability of the network hardware and the capabilities of the ...

Article
Free
Communication scheduling

The high arithmetic rates of media processing applications require architectures with tens to hundreds of functional units, multiple register files, and explicit interconnect between functional units and register files. Communication scheduling enables ...

Article
Free
System architecture directions for networked sensors

Technological progress in integrated, low-power, CMOS communication devices and sensors makes a rich design space of networked sensors viable. They can be deeply embedded in the physical world and spread throughout our environment like smart dust. The ...

Article
Free
Power aware page allocation

One of the major challenges of post-PC computing is the need to reduce energy consumption, thereby extending the lifetime of the batteries that power these mobile devices. Memory is a particularly important target for efforts to improve energy ...

Article
Free
Hoard: a scalable memory allocator for multithreaded applications

Parallel, multithreaded C and C++ programs such as web servers, database managers, news servers, and scientific applications are becoming increasingly prevalent. For these applications, the memory allocator is often a bottleneck that severely limits ...

Article
Free
Thread-level parallelism and interactive performance of desktop applications

Multiprocessing is already prevalent in servers where multiple clients present an obvious source of thread-level parallelism. However, the case for multiprocessing is less clear for desktop applications. Nevertheless, architects are designing processors ...

Article
Free
Effective null pointer check elimination utilizing hardware trap

We present a new algorithm for eliminating null pointer checks from programs written in Java™. Our new algorithm is split into two phases. In the first phase, it moves null checks backward, and it is iterated for a few times with other ...

Article
Free
Frequent value locality and value-centric data cache design

By studying the behavior of programs in the SPECint95 suite we observed that six out of eight programs exhibit a new kind of value locality, the frequent value locality, according to which a few values appear very frequently in memory locations and are ...

Article
Free
Efficient and flexible value sampling

This paper presents novel sampling-based techniques for collecting statistical profiles of register contents, data values, and other information associated with instructions, such as memory latencies. Values of interest are sampled in response to ...

Article
Free
Architectural support for copy and tamper resistant software

Although there have been attempts to develop code transformations that yield tamper-resistant software, no reliable software-only methods are know. This paper studies the hardware implementation of a form of execute-only memory (XOM) that allows ...

Article
Free
Architectural support for fast symmetric-key cryptography

The emergence of the Internet as a trusted medium for commerce and communication has made cryptography an essential component of modern information systems. Cryptography provides the mechanisms necessary to implement accountability, accuracy, and ...

Article
Free
OceanStore: an architecture for global-scale persistent storage

OceanStore is a utility infrastructure designed to span the globe and provide continuous access to persistent information. Since this infrastructure is comprised of untrusted servers, data is protected through redundancy and cryptographic techniques. To ...

Article
Free
Software profiling for hot path prediction: less is more

Recently, there has been a growing interest in exploiting profile information in adaptive systems such as just-in-time compilers, dynamic optimizers and, binary translators. In this paper, we show that sophisticated software profiling schemes that ...

Article
Free
OS and compiler considerations in the design of the IA-64 architecture

Increasing demands for processor performance have outstripped the pace of process and frequency improvements, pushing designers to find ways of increasing the amount of work that can be processed in parallel. Traditional RISC architectures use hardware ...

Article
Free
Hardware support for dynamic activation of compiler-directed computation reuse

Compiler-directed Computation Reuse (CCR) enhances program execution speed and efficiency by eliminating dynamic computation redundancy. In this approach, the compiler designates large program regions for potential reuse. During run time, the execution ...

Article
Free
Symbiotic jobscheduling for a simultaneous multithreaded processor

Simultaneous Multithreading machines fetch and execute instructions from multiple instruction streams to increase system utilization and speedup the execution of jobs. When there are more jobs in the system than there is hardware to support simultaneous ...

Article
Free
An analysis of operating system behavior on a simultaneous multithreaded architecture

This paper presents the first analysis of operating system execution on a simultaneous multithreaded (SMT) processor. While SMT has been studied extensively over the past 6 years, previous research has focused entirely on user-mode execution. However, ...

Article
Free
Slipstream processors: improving both performance and fault tolerance

Processors execute the full dynamic instruction stream to arrive at the final output of a program, yet there exist shorter instruction streams that produce the same overall effect. We propose creating a shorter but otherwise equivalent version of the ...

Contributors
  • Massachusetts Institute of Technology

Index Terms

  1. Proceedings of the ninth international conference on Architectural support for programming languages and operating systems

      Recommendations

      Acceptance Rates

      ASPLOS IX Paper Acceptance Rate 24 of 114 submissions, 21%;
      Overall Acceptance Rate 535 of 2,713 submissions, 20%
      YearSubmittedAcceptedRate
      ASPLOS '193517421%
      ASPLOS '183195618%
      ASPLOS '173205317%
      ASPLOS '162325323%
      ASPLOS '152874817%
      ASPLOS '142174923%
      ASPLOS XV1813218%
      ASPLOS XIII1273124%
      ASPLOS XII1583824%
      ASPLOS X1752414%
      ASPLOS IX1142421%
      ASPLOS VIII1232823%
      ASPLOS VII1092523%
      Overall2,71353520%