Proceedings of the 14th annual international symposium on Computer architecture

ISCA '87: Proceedings of the 14th annual international symposium on Computer architecture

June 1987

1987 Proceeding

Editor:
D. St. Clair

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

ISCA87: The 14th Annual International Symposium on Computer Architecture, Pittsburgh, Pennsylvania, USA, June 2 - 5, 1987

ISBN:

978-0-8186-0776-9

Published:

01 June 1987

Sponsors:

SIGARCH

Bibliometrics

Reflects downloads up to 18 Jan 2025

Citation Count

1,209

Downloads (6 weeks)

419

Downloads (12 months)

2,708

Downloads (cumulative)

21,728

Abstract

No abstract available.

Article

Free

Branch folding in the CRISP microprocessor: reducing branch delay to zero

D. R. Ditzel,
H. R. McLellan

Pages 2–8
https://doi.org/10.1145/30350.30351

A new method of implementing branch instructions is presented. This technique has been implemented in the CRISP Microprocessor. With a combination of hardware and software techniques the execution time cost for many branches can be effectively reduced ...

Metrics
Total Citations: 130
Total Downloads: 1,254
Last 12 Months: 312
Last 6 weeks: 23

Article

Free

An evaluation of branch architectures

J. A. DeRosa,
H. M. Levy

Pages 10–16
https://doi.org/10.1145/30350.30352

Branch instructions form a significant fraction of executed instructions, and their design is thus a crucial component of any architecture. This paper examines three alternatives in the design of branch instructions: delayed vs. non-delayed branches, ...

Metrics
Total Citations: 52
Total Downloads: 824
Last 12 Months: 196
Last 6 weeks: 31

Article

Free

Checkpoint repair for out-of-order execution machines

W. W. Hwu,
Y. N. Patt

Pages 18–26
https://doi.org/10.1145/30350.30353

Out-of-order execution and branch prediction are two mechanisms that can be used profitably in the design of Supercomputers to increase performance. Unfortunately this means there must be some kind of repair mechanism, since situations do occur that ...

Metrics
Total Citations: 93
Total Downloads: 1,260
Last 12 Months: 154
Last 6 weeks: 37

Article

Free

Instruction issue logic for high-performance, interruptable pipelined processors

G. S. Sohi,
S. Vajapeyam

Pages 27–34
https://doi.org/10.1145/30350.30354

The performance of pipelined processors is severely limited by data dependencies. In order to achieve high performance, a mechanism to alleviate the effects of data dependencies must exist. If a pipelined CPU with multiple functional units is to be used ...

Metrics
Total Citations: 92
Total Downloads: 1,359
Last 12 Months: 68
Last 6 weeks: 7

Article

Free

Fast temporary storage for serial and parallel execution

J. Swensen,
Y. Patt

Pages 35–43
https://doi.org/10.1145/30350.30355

There is an apparent conflict between the hardware requirements for fast parallel execution and the hardware requirements for fast serial execution. For example, fast vector execution is achieved by maintaining high execution concurrency over extended ...

Metrics
Total Citations: 2
Total Downloads: 368
Last 12 Months: 24
Last 6 weeks: 8

Article

Free

Performance analysis and design of a logic simulation machine

K. Wong,
M. A. Franklin

Pages 46–55
https://doi.org/10.1145/30350.30356

The high costs associated with logic simulation of large VLSI circuits has led to the need for new computer architectures tailored to the simulation task. Such architectures have the potential for significant speed-ups over software-based logic ...

Metrics
Total Citations: 12
Total Downloads: 305
Last 12 Months: 43
Last 6 weeks: 8

Article

Free

A modular systolic architecture for image convolutions

K. Doshi,
P. Varman

Pages 56–63
https://doi.org/10.1145/30350.30357

This paper describes a modular, systolic design for two-dimensional convolution which is a frequent and computationally intensive operation in low-level image processing. The design consists of a one-dimensional array of homogeneous cells, each with a ...

Metrics
Total Citations: 5
Total Downloads: 423
Last 12 Months: 22
Last 6 weeks: 8

Article

Free

A template matching algorithm using optically-connected 3-D VLSI architecture

S. Fujita,
R. Aibara,
M. Yamashita,
T. Ae

Pages 64–70
https://doi.org/10.1145/30350.30358

Three-dimensional VLSI (in short, 3-D VLSI) is a new device technology that is expected to realize high performance systems. In this paper, we propose an image processing architecture based on 3-D VLSI consisting of optically-connected layers. Since the ...

Metrics
Total Citations: 5
Total Downloads: 313
Last 12 Months: 42
Last 6 weeks: 9

Article

Free

Mapping data flow programs on a VLSI array of processors

B. Mendelson,
G. M. Silberman

Pages 72–80
https://doi.org/10.1145/30350.30359

With the advent of VLSI, relatively large processing arrays may be realized in a single VLSI chip. Such regularly structured arrays take considerably less time to design and test, and fault-tolerance can easily be introduced into them. However, only a ...

Metrics
Total Citations: 15
Total Downloads: 480
Last 12 Months: 68
Last 6 weeks: 8

Article

Free

Analytical modeling and architectural modifications of a dataflow computer

D. Ghosal,
L. N. Bhuyan

Pages 81–89
https://doi.org/10.1145/30350.30360

Dataflow computers are an alternative to the von Neumann architectures and are capable of exploiting large amount of parallelism inherent in many computer applications. This paper deals with the performance analysis of the Manchester dataflow computer ...

Metrics
Total Citations: 7
Total Downloads: 360
Last 12 Months: 42
Last 6 weeks: 9

Article

Free

A unified resource management and execution control mechanism for data flow machines

M. Takesue

Pages 90–97
https://doi.org/10.1145/30350.30361

This paper presents a unified resource management and execution control mechanism for data flow machines. The mechanism integrates load control, depth-first execution control, cache memory control and a load balancing mechanism. All of these mechanisms ...

Metrics
Total Citations: 15
Total Downloads: 306
Last 12 Months: 56
Last 6 weeks: 9

Article

Free

High performance integrated Prolog processor IPP

S. Abe,
T. Bandoh,
S. Yamaguchi,
K. Kurosawa,
K. Kiriyama

Pages 100–107
https://doi.org/10.1145/30350.30362

To realize the highest performance possible for a sequential processor, and to realize utilization of a large amount of existing software, an integrated Prolog processor (IPP) and its optimized compiler are now being developed.

A tagged architecture ...

Metrics
Total Citations: 17
Total Downloads: 303
Last 12 Months: 30
Last 6 weeks: 5

Article

Free

Performance studies of a parallel Prolog architecture

B. S. Fagin,
A. M. Despain

Pages 108–116
https://doi.org/10.1145/30350.30363

This paper presents a new multiprocessor architecture for the parallel execution of logic programs, developed as part of the Aquarius Project. This architecture is designed to support AND-parallelism, OR-parallelism, and intelligent backtracking. We ...

Metrics
Total Citations: 10
Total Downloads: 264
Last 12 Months: 44
Last 6 weeks: 12

Article

Free

An experimental VLSI Prolog interpreter: preliminary measurements and results

P. L. Civera,
F. Maddaleno,
G. L. Piccinini,
M. Zamboni

Pages 117–126
https://doi.org/10.1145/30350.30364

This work presents the preliminary results of a project oriented to the design and VLSI implementation of a Prolog interpreter. Even if the interpretative approach is being considered an inefficient way to execute high level languages when compared to ...

Metrics
Total Citations: 4
Total Downloads: 298
Last 12 Months: 64
Last 6 weeks: 7

Article

Free

Deterministic and stochastic modeling of parallel garbage collection: towards real-time criteria

O. Ridoux

Pages 128–136
https://doi.org/10.1145/30350.30365

The study of garbage collection for a logic programming language machine has exhibited fundamental differences with the more popular functional programming garbage collection. These differences yield behaviours that cannot be observed with classical ...

Metrics
Total Citations: 0
Total Downloads: 252
Last 12 Months: 15
Last 6 weeks: 7

Article

The sharing of environment in AND-OR-parallel execution of logic programs

C. Sun,
Y. Tsu

Pages 137–144
https://doi.org/10.1145/30350.30366

Metrics
Total Citations: 4

Article

Free

Architectural issues in designing symbolic processors in optics

A. Guha,
R. Ramnarayan,
M. Derstine

Pages 145–151
https://doi.org/10.1145/30350.30367

This paper analyzes potential optical architectures for AI applications (such as knowledge-based systems). Our goal was to investigate architectures most suitable for implementation completely in optics. While optical computing appears to hold much ...

Metrics
Total Citations: 1
Total Downloads: 1,205
Last 12 Months: 51
Last 6 weeks: 10

Article

Free

Rearrangeability of multistage shuffle/exchange networks

A. Varma,
C. S. Raghavendra

Pages 154–162
https://doi.org/10.1145/30350.30368

In this paper we study the rearrangeability of multistage shuffle/exchange networks. Although a theoretical lower bound of (2 log₂N - 1) stages for rearrangeability of a network with N = 2ⁿ inputs and outputs has been known, the sufficiency of (2 log₂N -...

Metrics
Total Citations: 4
Total Downloads: 460
Last 12 Months: 66
Last 6 weeks: 9

Article

Free

Optimized mesh-connected networks for SIMD and MIMD architectures

R. Beivide,
E. Herrada,
J. L. Balcazar,
J. Labarta

Pages 163–170
https://doi.org/10.1145/30350.30369

A class of mesh networks with wrap-around links is obtained from a class of circulant graphs by means of a graph isomorphism. We demonstrate how to obtain, from the adjacency pattern of the graph, simple parameters that serve to construct a planar ...

Metrics
Total Citations: 16
Total Downloads: 625
Last 12 Months: 51
Last 6 weeks: 16

Article

Free

Performance evaluation of reduced bandwidth multistage interconnection networks

D. T. Harper,
J. R. Jump

Pages 171–175
https://doi.org/10.1145/30350.30370

This paper presents and evaluates a class of buffered interconnection networks which provide performance and cost levels intermediate to a bus and a delta network. These networks, referred to as hybrid networks, are formed by beginning with a delta ...

Metrics
Total Citations: 2
Total Downloads: 300
Last 12 Months: 51
Last 6 weeks: 9

Article

Free

Hardware support for interprocess communication

U. Ramachandran,
M. Solomon,
M. Vernon

Pages 178–188
https://doi.org/10.1145/30350.30371

In recent years there has been increasing interest in message-based operating systems, particularly in distributed environments. Such systems consist of a small message-passing kernel supporting a collection of system server processes that provide such ...

Metrics
Total Citations: 7
Total Downloads: 967
Last 12 Months: 78
Last 6 weeks: 10

Article

Free

Architecture of a message-driven processor

W. J. Dally,
L. Chao,
A. Chien,
S. Hassoun,
W. Horwat,
J. Kaplan,
P. Song,
B. Totty,
S. Wills

Pages 189–196
https://doi.org/10.1145/30350.30372

We propose a machine architecture for a high-performance processing node for a message-passing, MIMD concurrent computer. The principal mechanisms for attaining this goal are the direct execution and buffering of messages and a memory-based architecture ...

Metrics
Total Citations: 137
Total Downloads: 567
Last 12 Months: 54
Last 6 weeks: 11

Article

Free

Effect of storage allocation/reclamation methods on parallelism and storage requirements

M. Kumar

Pages 197–205
https://doi.org/10.1145/30350.30373

The write after read/write synchronizations (the anti- and output-dependence constraints) inhibit the parallelism exhibited by Fortran programs. These constraints can be avoided by allocating storage for the values generated in a program dynamically, so ...

Metrics
Total Citations: 12
Total Downloads: 232
Last 12 Months: 34
Last 6 weeks: 8

Article

Free

Cache design of a sub-micron CMOS system/370

J. H. Chang,
H. Chao,
K. So

Pages 208–213
https://doi.org/10.1145/30350.30374

An innovative cache accessing scheme based on high MRU (most recently used) hit ratio [1] is proposed for the design of a one-cycle cache in a CMOS implementation of System/370. It is shown that with this scheme the cache access time is reduced by 30 ~ ...

Metrics
Total Citations: 60
Total Downloads: 809
Last 12 Months: 74
Last 6 weeks: 10

Article

Free

An architectural perspective on a memory access controller

M. Freeman

Pages 214–223
https://doi.org/10.1145/30350.30375

In this paper a CMOS memory access controller chip is described that provides the basis for achieving high-performance 68020-based (68030-based) systems. This controller matches the speed of the memory system to that of the microprocessor by providing a ...

Metrics
Total Citations: 2
Total Downloads: 489
Last 12 Months: 50
Last 6 weeks: 8

Article

Free

Organization and analysis of a gracefully-degrading interleaved memory system

K. Cheung,
G. Sohi,
K. Saluja,
D. Pradhan

Pages 224–231
https://doi.org/10.1145/30350.30376

A hardware mechanism has been proposed to reconfigure an interleaved memory system. The reconfiguration scheme is such that, at any instant all fault-free memory banks in the memory system are utilized in interleaved manner. A performance metric is ...

Metrics
Total Citations: 5
Total Downloads: 295
Last 12 Months: 59
Last 6 weeks: 15

Article

Free

Correct memory operation of cache-based multiprocessors

C. Scheurich,
M. Dubois

Pages 234–243
https://doi.org/10.1145/30350.30377

This paper shows that cache coherence protocols can implement indivisible synchronization primitives reliably and can also enforce sequential consistency. Sequential consistency provides a commonly accepted model of behavior of multiprocessors. We ...

Metrics
Total Citations: 112
Total Downloads: 1,042
Last 12 Months: 92
Last 6 weeks: 8

Article

Free

Hierarchical cache/bus architecture for shared memory multiprocessors

A. W. Wilson

Pages 244–252
https://doi.org/10.1145/30350.30378

A new, large scale multiprocessor architecture is presented in this paper. The architecture consists of hierarchies of shared buses and caches. Extended versions of shared bus multicache coherency protocols are used to maintain coherency among all ...

Metrics
Total Citations: 183
Total Downloads: 1,817
Last 12 Months: 139
Last 6 weeks: 16

Article

Free

Multiprocessor cache design considerations

R. L. Lee,
P. C. Yew,
D. H. Lawrie

Pages 253–262
https://doi.org/10.1145/30350.30379

In this paper, cache design is explored for large high-performance multiprocessors with hundreds or thousands of processors and memory modules interconnected by a pipe-lined multi-stage network. The majority of the multiprocessor cache studies in the ...

Metrics
Total Citations: 45
Total Downloads: 1,230
Last 12 Months: 100
Last 6 weeks: 13

Article

Free

Performance evaluation of multiple register sets

R. J. Eickemeyer,
J. H. Patel

Pages 264–271
https://doi.org/10.1145/30350.30380

In this paper a DEC VAX with multiple register sets is evaluated under many differently sized register sets. Both the number of register sets and the number of registers per set were varied. Performance, measured in terms of memory traffic, is compared ...

Metrics
Total Citations: 11
Total Downloads: 382
Last 12 Months: 107
Last 6 weeks: 23

Index Terms

Proceedings of the 14th annual international symposium on Computer architecture
1. Computer systems organization
2. Hardware

Acceptance Rates

Overall Acceptance Rate 543 of 3,203 submissions, 17%

Year	Submitted	Accepted	Rate
ISCA '22	400	67	17%
ISCA '19	365	62	17%
ISCA '17	322	54	17%
ISCA '13	288	56	19%
ISCA '12	262	47	18%
ISCA '08	259	37	14%
ISCA '06	234	31	13%
ISCA '05	194	45	23%
ISCA '04	217	31	14%
ISCA '03	184	36	20%
ISCA '02	180	27	15%
ISCA '01	163	24	15%
ISCA '99	135	26	19%
Overall	3,203	543	17%
