ASPLOS VII Tutorial

The Importance of Decoupled Architectures

in Future Processor Designs

Matthew Farrens

Gary Tyson


Introduction

The ability to tolerate latencies (in particular, memory latencies) is becoming a critical component in high-performance processor designs. This tutorial will present an overview of decoupled architectures, a class of latency tolerant architectures and techniques that are becoming increasingly important in the drive for high performance. The history of decoupled architectures will be presented, followed by a survey of existing decoupled designs (the R10000, for example) and some motivation as to why this approach will have an impact on processor designs well into the next century.


Outline

Introduction to Decoupled Processing

Decoupled processing will be defined, and the goals of decoupling and the techniques employed will be discussed. The motivation for decoupling will then be presented, and the importance to current and future designs will be highlighted. Finally, factors limiting decoupled performance will be discussed.

Existing Decoupled Processors
Several existing decoupled processors will be detailed. Five classes will be defined, the existing processors will be classified, and example code will be shown to illustrate the differences.

Research on Decoupled Architectures

The research performed on decoupled processing will be summarized, including both past and future research domains.