Figure 14.9 Two-Stage Instruction Pipeline one. Figure 14.9a depicts this approach. The pipeline has two independent stages. The first stage fetches an instruction and buffers it. When the second stage is free, the first stage passes it the buffered instruction. While the second stage is executing the instruction, the first stage takes advantage of any unused memory cycles to fetch and buffer the next instruction. This is called instruction prefetch or *fetch overlap*. Note that this approach, which involves instruction buffering, requires more registers. In general, pipelining requires registers to store data between stages. It should be clear that this process will speed up instruction execution. If the fetch and execute stages were of equal duration, the instruction cycle time would be halved. However, if we look more closely at this pipeline (Figure 14.9b), we will see that this doubling of execution rate is unlikely for two reasons: - 1. The execution time will generally be longer than the fetch time. Execution will involve reading and storing operands and the performance of some operation. Thus, the fetch stage may have to wait for some time before it can empty its buffer. - 2. A conditional branch instruction makes the address of the next instruction to be fetched unknown. Thus, the fetch stage must wait until it receives the next instruction address from the execute stage. The execute stage may then have to wait while the next instruction is fetched. Guessing can reduce the time loss from the second reason. A simple rule is the following: When a conditional branch instruction is passed on from the fetch to the execute stage, the fetch stage fetches the next instruction in memory after the branch instruction. Then, if the branch is not taken, no time is lost. If the branch is taken, the fetched instruction must be discarded and a new instruction fetched. While these factors reduce the potential effectiveness of the two-stage pipeline, some speedup occurs. To gain further speedup, the pipeline must have more stages. Let us consider the following decomposition of the instruction processing. - Fetch instruction (FI): Read the next expected instruction into a buffer. - Decode instruction (DI): Determine the opcode and the operand specifiers. - Calculate operands (CO): Calculate the effective address of each source operand. This may involve displacement, register indirect, indirect, or other forms of address calculation. - Fetch operands (FO): Fetch each operand from memory. Operands in registers need not be fetched. - Execute instruction (EI): Perform the indicated operation and store the result, if any, in the specified destination operand location. - Write operand (WO): Store the result in memory. With this decomposition, the various stages will be of more nearly equal duration. For the sake of illustration, let us assume equal duration. Using this assumption, Figure 14.10 shows that a six-stage pipeline can reduce the execution time for 9 instructions from 54 time units to 14 time units. Several comments are in order: The diagram assumes that each instruction goes through all six stages of the pipeline. This will not always be the case. For example, a load instruction does not need the WO stage. However, to simplify the pipeline hardware, the timing is set up assuming that each instruction requires all six stages. Also, the diagram assumes that all of the stages can be performed in parallel. In particular, it is assumed that there are no memory conflicts. For example, the FI, FO, and WO stages involve a memory access. The diagram implies that all these accesses can occur simultaneously. Most memory systems will not permit that. However, the desired value may be in cache, or the FO or WO stage may be null. Thus, much of the time, memory conflicts will not slow down the pipeline. | | Time | | | | | - | | | | | | | | | |---------------|------|----|----|----|----|----|----|----|----|----|----|----|----|----| | | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | | Instruction 1 | FI | DI | СО | FO | EI | wo | | | | | | | | | | Instruction 2 | | FI | DI | со | FO | EI | wo | | | | | | | | | Instruction 3 | | | FI | DI | СО | FO | EI | wo | | | | | | | | Instruction 4 | | | | FI | DI | со | FO | EI | wo | | | | | | | Instruction 5 | | | | | FI | DI | со | FO | EI | wo | | | | | | Instruction 6 | | | | | | FI | DI | со | FO | EI | wo | | | | | Instruction 7 | | | | | | | FI | DI | со | FO | EI | wo | | | | Instruction 8 | | | | | | | | FI | DI | СО | FO | EI | wo | | | Instruction 9 | | | | | | 11 | | | FI | DI | СО | FO | EI | wo | Figure 14.10 Timing Diagram for Instruction Pipeline Operation