Check for updates

# A Preliminary Architecture for a Basic Data-Flow Processor

Jack B. Dennis and David P. Misunas Project MAC Massachusetts Institute of Technology

<u>Abstract</u>: A processor is described which can achieve highly parallel execution of programs represented in dataflow form. The language implemented incorporates conditional and iteration mechanisms, and the processor is a step toward a practical data-flow processor for a Fortran-level data-flow language. The processor has a unique architecture which avoids the problems of processor switching and memory/processor interconnecion that usually limit the degree of realizable concurrent processing. The architecture offers an unusual solution to the problem of structuring and managing a two-level memory system.

## Introduction

Studies of concurrent operation within a computer system and of the representation of parallelism in a programming language have yielded a new form of program representation, known as data flow. Execution of a data-flow program is data-driven; that is, each instruction is enabled for execution just when each required operand has been supplied by the execution of a predecessor instruction. Data-flow representations for programs have been described by Karp and Miller [8], Rodriguez [11], Adams [1], Dennis and Fosseen [5], Bährs [2], Kosinski [9, 10], and Dennis [4].

We have developed an attractive architecture for a processor that executes elementary data-flow programs [6, 7]. The class of programs implemented by this processor corresponds to the model of Karp and Miller [8]. These data-flow programs are well suited to representing signal processing computations such as waveform generation. modulation and filtering, in which a group of operations is to be performed once for each sample (in time) of the signals being processed. This elementary data-flow processor avoids the problems of processor switching and processor/memory interconnection present in attempts to adapt conventional Von Neuman type machines for parallel computation. Sections of the machine communicate by the transmission of fixed size information packets, and the machine is organized so that the sections can tolerate delays in packet transmission without compromising effective utilization of the hardware.

We wish to expand the capabilities of the data-flow architecture, with the ultimate goal of developing a general purpose processor using a generalized data-flow language such as described by Dennis [4], Kosinski [9, 10] and Bährs [2]. As an intermediate step, we have developed a preliminary design for a basic data-flow processor that executes programs expressed in a more powerful language than the elementary machine, but still not achieving a generalized capability. The language of the basic machine is that described by Dennis and Fosseen [5], and includes constructs for expressing conditional and iterative execution of program parts.

In this paper we present solutions to the major problems faced in the development of the basic machine. A straightforward solution to the incorporation of decision capabilities in the machine is described. In addition, the growth in program size and complexity with the addition of the decision capability requires utilization of a two-level memory system. A design is presented in which only active instructions are in the operational memory of the processor, and each instruction is brought to that memory only when necessary for program execution, and remains there only as long as it is being utilized.

### The Elementary Processor

The Elementary Processor is designed to utilize the elementary data-flow language as its base language. A program in the elementary data-flow language is a directed graph in which the nodes are operators or links. These nodes are connected by arcs along which values (conveyed by tokens) may travel. An operator of the schema is enabled when tokens are present on all input arcs. The enabled operator may fire at any time, removing the tokens on its input arcs, computing a value from the operands associated with the input tokens, and associating that value with a result token placed on its output arc. A result may be sent to more than one destination by means of a link which removes a token on its input arc and places tokens on its output arcs bearing copies of the input value. An operator or a link cannot fire unless there is no token present on any output arc of that operator or link.

An example of a program in the elementary data-flow language is shown in Figure 1 and represents the following simple computation:



Figure 1. An elementary data-flow program.

<sup>&</sup>quot;The work reported here was supported by the National Science Foundation under research grant GJ-34671.



Figure 2. Organization of the elementary data-flow processor.

The rectangular boxes in Figure 1 are operators, and each arithmetic operator in the above computation is reflected in a corresponding operator in the program. The small dots are links. The large dots represent tokens holding values for the initial configuration of the program.

In the program of Figure 1, links L1 and L2 are initially enabled. The firing of Ll makes copies of the value a available to operators Al and A3; firing L2 presents the value b to operators A1 and A4. Once L1 and L2 have fired (in any order), operator Al is enabled since it will have a token on each of its input arcs. After Al has fired (completing the computation of a + b), link L3 will become enabled. The firing of L3 will enable the concurrent firing of operators A2 and A3, and so on.

The computations represented by an elementary program are performed in a data-driven manner; the enabling of an operator is determined only by the arrival of values on all input links, and no separate control signals are utilized. Such a scheme prompted the design of a processor organized as in Figure 2.

A data-flow schema to be executed is stored in the Memory of the processor. The Memory is organized into Instruction Cells, each Cell corresponding to an operator of the data-flow program. Each Instruction Cell (Figure 3) is composed of three registers. The first register holds an instruction (Figure 4) which specifies the operation to be performed and the address(es) of the register(s) to which the result of the operation is to be directed. The second and third registers hold the operands for use in execution of the instruction.

When a Cell contains an instruction and the necessary operands, it is enabled and signals the Arbitration Network that it is ready to transmit its contents as an operation packet to an Operation Unit which can perform the desired function. The operation packet flows through the Arbi-





tration Network which directs it to an appropriate Operation Unit by decoding the instruction portion of the packet.

The result of an operation leaves an Operation Unit as one or more <u>data packets</u>, consisting of the computed value and the address of a register in the Memory to which the value is to be delivered. The Distribution Network accepts data packets from the Operation Units and utilizes the address of each to direct the data item through the network to the correct register in the Memory. The Instruction Cell containing that register may then be enabled if an instruction and all operands are present in the Cell.

Many Instruction Cells may be enabled simultaneously, and it is the task of the Arbitration Network to efficiently deliver operation packets to Operation Units and to queue operation packets waiting for each Operation Unit. A structure for the Arbitration Network providing a path for operation packets from each Instruction Cell to each Operation Unit is presented in Figure 5. Each Arbitration Unit passes packets arriving at its input ports one-at-atime to its output port, using a round-robin discipline to resolve any ambiguity about which packets should be sent next. A Switch Unit assigns a packet at its input to one of its output ports, according to some property of the packet, in this case the operation code.

The Distribution Network is similarly organized using Switch Units to route data packets from the Operation Units to the Memory Registers specified by the destination addresses. A few Arbitration Units are required so data packets from different Operation Units can enter the network simultaneously.

Since the Arbitration Network has many input ports and only a few output ports, the rate of packet flow will be much greater at the output ports. Thus, a serial representation of packets is appropriate at the input ports to minimize the number of connections to the Memory, but a more parallel representation is required at the output ports so a high throughput may be achieved. Hence, serial-to-parallel conversion is performed in stages within the Arbitration Network. Similarly, parallel-toserial conversion of the value portion of each result packet occurs within the Distribution Network.



The Operation Units of the processor are pipelined in

destination

2

Figure 3. Operation of an Instruction Cell.

Figure 4. Instruction format.



Figure 6. Links of the basic data-flow language.

order to allow maximum throughput. The destination address(es) of an instruction are entered into identity pipelines of the Operation Units and are utilized to form data packets with the result when it appears.

A more detailed explanation of the elementary processor and its operation is given in [6]. We have completed designs for all units of the elementary processor in the form of speed-independent interconnections of a small set of basic asynchronous module types. These designs are presented in [7].

## The Basic Data-Flow Language

Our success in the architecture of the elementary dataflow processor led us to consider applying the concepts to the architecture of machines for more complete dataflow languages. For the first step in generalization, we have chosen a class of data-flow programs that correspond to a formal data-flow model studied by Dennis and Fosseen [5].

The representation of conditionals and iteration in data-flow form requires additional types of links and actors. The types of links and actors for the basic data-flow language are shown in Figures 6 and 7.

Data values pass through data links in the manner presented previously. The tokens transmitted by control links are known as <u>control tokens</u>, and each conveys a value of either <u>true</u> or <u>false</u>. A control token is generated at a decider which, upon receiving values from its input arcs, applies its associated predicate, and produces either a <u>true</u> or <u>false</u> control token at its output arc.

The control token produced at a decider can be combined with other control tokens by means of a Boolean operator (Figure 7f), allowing a decision to be built up from simpler decisions.

Control tokens direct the flow of data tokens by means of T-gates, F-gates, or merge actors (Figure 7c, d, e). A T-gate passes the data token on its input arc to its output arc when it receives a control token conveying



Figure 8. Data-flow representation of the basic program.



Figure 7. Actors of the basic data-flow language.

the value <u>true</u> at its control input. It will absorb the data token on its input arc and place nothing on its output arc if a <u>false</u>-valued control token is received. Similarly, the F-gate will pass its input data token to its output arc only on receipt of a <u>false</u>-valued token on the control input. Upon receipt of a <u>true</u>-valued to-ken, it will absorb the data token.

A merge actor has a true input, a false input, and a control input. It passes to its output arc a data token from the input arc corresponding to the value of the control token received. Any tokens on the other input are not affected.

As with the elementary schemas, a link or actor is not enabled to fire unless there is no token on any of its output arcs.

Using the actors and links of the basic data-flow language, conditionals and iteration can be easily represented. In illustration, Figure 8 gives a basic dataflow program for the following computation:



The control input arcs of the three merge actors carry <u>false</u>-valued tokens in the initial configuration so the input values of x and y and the constant 0 are admitted as initial values for the iteration. Once these values have been received, the predicate y < x is tested. If it is true, the value of x and the new value for y are cycled back into the body of the iteration through the T-gates and two merge nodes. Concurrently, the remaining T-gate and merge node return an incremented value of the iteration count n. When the output of the decider is <u>false</u>, the current values of y and n are delivered through the two F-gates, and the initial configuration is restored.

### The Basic Data-Flow Processor

Two problems must be faced in adapting the design of the

elementary data-flow processor for basic data-flow programs. The first task is to expand the architecture of the elementary machine to incorporate decision capability by implementing deciders, gates and merges. A fairly straightforward solution to this problem will be presented.

However, in contrast to elementary data-flow programs, the nodes of a basic data-flow program do not fire equally often during execution. As computation proceeds, different parts of the program become active or quiescent as iterations are initiated and completed, and as decisions lead to selection of alternate parts of a program for activation. Thus it would be wasteful to assign a Cell to each instruction for the duration of program execution. The basic data-flow processor must have a multi-level memory system such that only the active instructions of a program occupy the Instruction Cells of the processor. In the following sections we first show how decision capability may be realized by augmenting the elementary processor; then we show how an auxiliary memory system may be added so the Instruction Cells act as a cache for the most active instructions.

## Decision Capability

The organization of a basic data-flow processor without the two-level memory is shown in Fig. 9. As in the elementary processor, each Instruction Cell consists of three Registers and holds one instruction together with spaces for receiving its operands. Each instruction corresponds to an operator, a decider, or a Boolean operator of a basic data-flow program. The gate and merge actors of the data-flow program are not represented by separate instructions; rather, the function of the gates is incorporated into the instructions associated with operators and deciders in a manner that will be described shortly, and the function of the merge actors is implemented for free by the nature of the Distribution Network.

Instructions that represent operators are interpreted by the Operation Units to yield data packets as in the elementary processor. Instructions that represent deciders or Boolean operators are interpreted by the Decision Units to yield <u>control packets</u> having one of the two forms

| {gate,                                 | $\left\{ \frac{\text{true}}{\text{false}} \right\}$ , | $\langle address \rangle$ |
|----------------------------------------|-------------------------------------------------------|---------------------------|
| $\left\{ \frac{value}{value}, \right.$ | $\left\{ \frac{\text{true}}{\text{false}} \right\}$ , | $\langle address \rangle$ |

A gate-type control packet performs a gating function at the addressed operand register. A value-type control packet provides a Boolean operand value to an Instruction Cell that represents a Boolean operator.

The six formats for the contents of Instruction Cells in the basic processor are given in Figure 10. The use of each Register is specified in its leftmost field:

- I instruction register
- D operand register for data values
- B operand register for Boolean values

Only Registers specified to be operand registers of consistent type may be addressed by instructions of a valid program.

The remaining fields in the Instruction Cell formats are: an instruction code, op, pr or bo, that identifies the class and variation of the instruction in the Cell; from one to three destination addresses d1, d2, d3 that specify target operand registers for the packets generated by instruction execution; in the case of deciders and Boolean operators, a <u>result tag</u> t1, t2, t3 for each destination that specifies whether the control packet is of gate-type (tag <u>gate</u>) or of value type (tag <u>value</u>); and, for each operand register, a <u>gating code</u> g1, g2 and either a <u>data</u> <u>receiver</u> v1, v2 or a <u>control receiver</u> c1, c2.



Figure 9. Organization of a basic data - flow processor without two - level memory.

The gating codes permit representation of gate actors that control the reception of operand values by the operator or decider represented by the Instruction Cell. The meanings of the code values are as follows:

| <u>code value</u> | meaning                                                                                                                           |
|-------------------|-----------------------------------------------------------------------------------------------------------------------------------|
| no                | the associated operand is not gated.                                                                                              |
| <u>true</u>       | an operand value is accepted by arri-<br>val of a <u>true</u> gate packet; discarded<br>by arrival of a <u>false</u> gate packet. |
| <u>false</u>      | an operand value is accepted by arri-<br>val of a <u>false</u> gate packet; discarded<br>by arrival of a <u>true</u> gate packet. |
| cons              | the operand is a constant value.                                                                                                  |

The structure of a data or control receiver (Fig. 11) provides space to receive a data or Boolean value, and two flag fields in which the arrival of data and control packets is recorded. The <u>gate flap</u> is changed from <u>off</u> to <u>true</u> or <u>false</u> by a true or false gate-type control packet; the value flag is changed from <u>off</u> to <u>on</u> by a data packet or value type control packet according to the type of receiver.

| I | ор | dl | I | pr  | ti di | I | t  | 00       | tI         | d I |
|---|----|----|---|-----|-------|---|----|----------|------------|-----|
| D | 91 | vl | D | g١  | vl    | в | gl | ci       | t2         | d2  |
| D | g2 | v2 | D | g2  | ۷2    | в | g2 | c2       | 13         | d3  |
| I |    | dl | I |     | ti di | I |    | 0        | <u>† 1</u> | dl  |
|   | ор |    | - | - P | ·     |   | t  | 00       | 12         | d 2 |
| I | L  | d2 | I | ļ   | †2 d2 |   |    | <b>-</b> | <u>'</u>   |     |
| D | gl | vl | D | g١  | vi    | в | gl | ci       | t3         | d 3 |

pr - predicate code b o - Boolean operation code

| d I , d 2 , d 3<br>t I , t 2 , t 3<br>g I , g 2<br>v I , v 2 | destination addresses<br>result tags<br>gating codes<br>data receivers |
|--------------------------------------------------------------|------------------------------------------------------------------------|
| AL 62                                                        | control receivers                                                      |

cl,c2 control receivers

Figure IO. Instruction Cell formats for the basic processor.

Receiver:



. . . . . .

Instruction Cell Operation

The function of each Instruction Cell is to receive data and control packets, and, when the Cell becomes enabled, to transmit an operation or decision packet through the Arbitration Network and reset the Instruction Cell to its initial status. An Instruction Cell becomes enabled just when all three of its registers are enabled. A register specified to act as an instruction register is always enabled. Registers specified to act as operand registers change state with the arrival of packets directed to them. The state transitions and enabling rules for data operand registers are defined in Fig. 12.

In Fig. 12 the contents of an operand register are represented as follows:



The asterisk indicates that the Register is enabled. Events denoting arrival of data and control packets are labelled thus:

- d data packet
- t true gate-type control packet
- f <u>false</u> gate-type control packet

With this explanation of notation, the state changes and enabling rules given in Fig. 12 should be clear. Similar rules apply to the state changes and enabling of Boolean operand registers. Note that arrival of a gate-type control packet that does not match the gating code of the Register causes the associated data packet to be discar-



D, cons: (off, on) \*\*\*

Figure 12. State transition and enabling rules for data operand registers.

ded, and resets the Register to its starting condition.

The operation packets sent to Operation Units and <u>deci</u>-<u>sion packets</u> sent to Decision Units consist of the entire contents of the Instruction Cell except for the gating codes and receiver status fields. Thus the packets sent through the Arbitration Network have the following formats:

To the Operation Units:

op, v1, v2, d1 op, v1, d1, d2

To the Decision Units:

pr, vl, v2, t1, d1 pr, vl, t1, d1, t2, d2 bo, c1, c2, t1, d1, t2, d2, t3, d3 bo, c1, t1, d1, t2, d2, t3, d3

An initial configuration of Instruction Cells corresponding to the basic data-flow program of Fig. 8 is given in Fig. 13. For simplicity, Cells containing control distribution and data forwarding instructions are not shown. Instead, we have taken the liberty of writing any number of addresses in the destination fields of instructions.

The initial values of x and y are placed in Registers 2 and 5. Cells 1 and 2, containing these values, are then enabled and present to the Arbitration Network the operation packets

and

$$\left\{\begin{array}{c} x \\ \underline{ident}; \\ y \end{array}\right\}$$

ident; 8, 11, 14

These packets are directed to an identity Operation Unit which merely creates the desired data packets with the values of x and y and delivers the packets to the Distribution Network.

Upon receipt by the Memory of the data packets directed to Registers 7 and 8, cell 3 will be enabled and will transmit its decision packet to a Decision Unit to perform the <u>less</u> than function. The result of the decision will be returned through the Control Network as five control packets. If the result is <u>true</u>, Cells 4, 5 and 6 will be enabled and will send their contents through the





|    | cel | 3                          |
|----|-----|----------------------------|
| 06 | 1   | less gate (11,13,16,20,23) |
| 07 | D   | <u>no</u> (-)              |
| 08 | D   | <u>no</u> (-)              |

[ident (8,11,14)]

true (-)





Figure 13. Instruction Cell initialization for the basic data-flow program in Figure 8.

cell 4

10 1

09 1

IID

Arbitration Network to Operation Units capable of performing the identity and addition operations. If the result of the decision is <u>false</u>, output cells 7 and 8 will be enabled, and cells 4, 5, and 6 will have their gated operands deleted.

# Two-Level Memory Hierarchy

The high level of parallel activity achievable in dataflow processors makes a unique form of memory hierarchy feasible: the Instruction Cells are arranged to act as a cache for the most active instructions of the dataflow program. Individual instructions are retrieved from auxiliary memory (the Instruction Memory) as they become required by the progress of computation, and instructions are returned to the Instruction Memory when the Instruction Cells holding them are required for more active parts of the program.

The organization of a basic data-flow processor with Instruction Memory is given in Fig. 14.

# Instruction Memory

The Instruction Memory has a storage location for each possible register address of the basic processor. These storage locations are organized into groups of three locations identified by the address of the first location of the group. Each group can hold the contents of one Instruction Cell in the formats already given in Fig. 10.

A <u>memory command packet</u>  $\{a, \underline{retr}\}$  presented to the <u>com-</u> <u>mand port of the Instruction Memory, requests retrieval</u> of an <u>instruction packet</u>  $\{a, x\}$  in which x is the Cell contents stored in the group of locations specified by address a. The instruction packet is delivered at the <u>retrieve</u> port of the Instruction Memory.

An instruction packet  $\{a, x\}$  presented at the <u>store</u> port of the Instruction Memory requests storage of Cell contents x in the three-location group specified by address a. However, the storage is not effective until a memory command packet  $\{a, \underline{store}\}$  is received by the Instruction



Figure 14. Organization of the basic data - flow processor with auxiliary memory.

Memory at its <u>command</u> port, and any prior retrieval re-. quest has been honored. Similarly, retrieval requests are not honored until prior storage requests for the group have taken effect.

We envision that the Instruction Memory would be designed to handle large numbers of storage and retrieval requests concurrently, much as the input/output facilities of contemporary computer systems operate under software control.

#### <u>Cell Block Operation</u>

For application of the cache principle to the basic dataflow processor, an Instruction Memory address is divided into a <u>major address</u> and a <u>minor address</u>, each containing a number of bits of the address. One Cell Block of the processor is associated with each possible major address. All instructions having the same major address are processed by the Instruction Cells of the corresponding Cell Block. Thus the Distribution and Control Networks use the major address to direct data packets, control packets, and instruction packets to the appropriate Cell Block. The packets delivered to the Cell Block include the minor address, which is sufficient to determine how the packet should be treated by the Cell Block.

Operation and decision packets leaving a Cell Block have exactly the same format as before. Instruction packets leaving a Cell Block have the form  $\{m, x\}$  where m is a minor address and x is the contents of an Instruction Cell. The major address of the Cell Block is appended to each instruction packet as it travels through the Arbitration Network. In the same way, memory command packets leave the Cell Block with just a minor address, which is augmented by the major address of the Cell Block during its trip through the Memory Command Network.

Fig. 15 shows the structure of a Cell Block. Each Instruction Cell is able to hold any instruction whose major address is that of the Cell Block. Since many more instructions share a major address than there are Cells in a Cell Block, the Cell Block includes an Association Table which has an entry  $\{m, i\}$  for each Instruction Cell: m is the minor address of the instruction to which the Cell is assigned, and i is a Cell status indicator whose values have significance as follows:

| <u>status</u> value | meaning                                                                                                              |  |  |  |
|---------------------|----------------------------------------------------------------------------------------------------------------------|--|--|--|
| free                | the Cell is not assigned to any in-<br>struction                                                                     |  |  |  |
| engaged             | the Cell has been engaged for the in-<br>struction having minor address m, by<br>arrival of a data or control packet |  |  |  |
| occupied            | the Cell is occupied by an instruction with minor address m                                                          |  |  |  |

The Stack element of a Cell Block holds an ordering of the Instruction Cells as candidates for displacement of their contents by newly activated instructions. Only Cells in <u>occupied</u> status are candiates for displacement.

Operation of a Cell Block can be specified by giving two procedures -- one initiated by arrival of a data or control packet at the Cell Block, and the other activated by arrival of an instruction packet from the Instruction Memory.

- <u>Procedure</u> 1: Arrival of a data or control packet {n, y} where n is a minor address and y is the packet content.
- step 1. Does the Association Table have an entry with minor address n? If so, let p be the Cell corresponding to the entry, and go to step 5. Otherwise continue with step 2.
- step 2. If the Association Table shows that no Instruction Cell has status <u>free</u>, go to step 3. Otherwise let p be a Cell with status <u>free</u>. Let the Associa-



Figure 15. Structure of a Cell Block,

tion Table entry for p be {m, free}; go to step 4.

- step 3. Use the Stack to choose a Cell p in <u>occupied</u> status for preemption; let the Association Table entry for p be {m, <u>occupied</u>}; transmit the contents z of Cell p as an instruction packet {m, z} to the Instruction Memory via the Arbitration Network; transmit the memory command packet {m, <u>store</u>} to the Instruction Memory through the Memory Command Network.
- step 4. Make an entry {n, engaged} for Cell p in the Association Table; transmit the memory command packet {n, retr} to the Instruction Memory via the Memory Command Network.
- step 5. Update the operand register of Cell p having minor address n according to the content y of the data or control packet (the rules for updating are those given in Fig. 12). If Cell p is occupied the state change of the register must be consistent with the instruction code or the program is invalid. If Cell p is engaged, the changes must be consistent with the register status left by preceding packet arrivals.
- step 6. If Cell p is <u>occupied</u> and all three registers are enabled (according to the rules of Fig. 12), the Cell p is enabled: transmit an operation or decision packet to the Operation Units or Decision Units through the Arbitration Network; leave Cell p in <u>occupied</u> status holding the same instruction with its operand registers reset (receivers empty with the gate and value flags set to <u>off</u>). Change the order of Cells in the Stack to make Cell p the last candidate for displacement.
- $\frac{Procedure}{x} 2: \text{ Arrival of an instruction packet } \{n, x\}$  with minor address n and content x.
- <u>step</u> 1. Let p be the Instruction Cell with entry  $\{n, engaged\}$  in the Association Table.
- step 2. The status of the operand registers of Cell p must be consistent with the content x of the instruction packet, or the program is invalid. Update the contents of Cell p to incorporate the instruction and operand status information in the instruction packet.
- step 3. Change the Association Table entry for Cell p from [n, engaged] to [n, occupied].
- step 4. If all registers of Cell p are enabled, then

Cell p is enabled: transmit an operation or decision packet to the Operation Units or Decision Units through the Arbitration Network; leave Cell p in <u>occupied</u> status holding the same instruction with its operand registers reset. Change the order of Cells in the Stack to make Cell p the last candidate for displacement.

## <u>Conclusion</u>

The organization of a computer which allows the execution of programs represented in data-flow form offers a very promising solution to the problem of achieving highly parallel computation. Thus far, the design of two processors, the elementary and the basic data-flow processors, has been investigated. The elementary processor is attractive for stream-oriented signal processing applications. The basic processor described here is a first step toward a highly parallel processor for numerical algorithms expressed in a Fortran-like dataflow language. However, this goal requires further elaboration of the data-flow architecture to encompass arrays, concurrent activation of procedures, and some means of exploiting the sort of parallelism present in vector operations. We are optimistic that extensions of the architecture to provide these features can be devised, and we are hopeful that these concepts can be further extended to the design of computers for general-purpose computation based on more complete data-flow models such as presented by Dennis [4].

### References

- Adams, D. A. <u>A Computation Model With Data Flow Seguencing</u>. Technical Report CS 117, Computer Science Department, School of Humanities and Sciences, Stanford University, Stanford, Calif., December 1968.
- Bährs, A. Operation patterns (An extensible model of an extensible language). <u>Symposium on Theoretical Programming</u>, Novosibirsk, USSR, August 1972 (preprint).
- Dennis, J. B. Programming generality, parallelism and computer architecture. <u>Information Processing</u> <u>68</u>, North-Holland Publishing Co., Amsterdam 1969, 484-492.
- Dennis, J. B. First version of a data flow procedure language. <u>Symposium on Programming</u>, Institut de Programmation, University of Paris, Paris, France, April 1974, 241-271.
- Dennis, J. B., and J. B. Fosseen. <u>Introduction to</u> <u>Data Flow Schemas</u>. November 1973 (submitted for publication).
- Dennis, J. B., and D. P. Misunas. A computer architecture for highly parallel signal processing. <u>Proceedings of the ACM 1974 National Conference,</u> ACM, New York, November 1974.
- Dennis, J. B., and D. P. Misunas. <u>The Design of a</u> <u>Highly Parallel Computer for Signal Processing Ap-</u> <u>plications</u>: Computation Structures Group Memo 101, Project MAC, M.I.T., Cambridge, Mass., July 1974.
- Karp, R. M., and R. E. Miller. Properties of a model for parallel computations: determinacy, termination, queueing. <u>SIAM J. Appl. Math. 14</u> (November 1966), 1390-1411.
- Kosinski, P. R. <u>A Data Flow Programming Language</u>. Report RC 4264, IBM T. J. Watson Research Center, Yorktown Heights, N. Y., March 1973.
- Kosinski, P. R. A data flow language for operating systems programming. <u>Proceedings of ACM SIGPLAN-SIGOPS Interface Meeting, SIGPLAN Notices 8, 9</u> (September 1973), 89-94.
- Rodriguez, J. E. <u>A Graph Model for Parallel Computation</u>. Report TR-64, Project MAC, M.I.T., Cambridge, Mass., September 1969.