ADSP-TS202S_06 AD [Analog Devices], ADSP-TS202S_06 Datasheet - Page 4

ADSP-TS202S_06

Manufacturer Part Number

ADSP-TS202S_06

Description

TigerSHARC Embedded Processor

Manufacturer

AD [Analog Devices]

Datasheet

1.ADSP-TS202S_06.pdf (48 pages)

Current page: 4 of 48
Download datasheet (2Mb)

ADSP-TS202S

The TigerSHARC DSP uses a Static Superscalar

This architecture is superscalar in that the ADSP-TS202S pro-

cessor’s core can execute simultaneously from one to four 32-bit

instructions encoded in a very large instruction word (VLIW)

instruction line using the DSP’s dual compute blocks. Because

the DSP does not perform instruction reordering at runtime—

the programmer selects which operations will execute in parallel

prior to runtime—the order of instructions is static.

With few exceptions, an instruction line, whether it contains

one, two, three, or four 32-bit instructions, executes with a

throughput of one cycle in a 10-deep processor pipeline.

For optimal DSP program execution, programmers must follow

the DSP’s set of instruction parallelism rules when encoding an

instruction line. In general, the selection of parallel instructions

that the DSP can execute in each cycle depends on both the

instruction line resources each instruction requires and on the

source and destination registers used in the instructions. The

programmer has direct control of three core components—the

IALUs, the compute blocks, and the program sequencer.

The ADSP-TS202S processor, in most cases, has a two-cycle

execution pipeline that is fully interlocked, so—whenever a

computation result is unavailable for another operation depen-

dent on it—the DSP automatically inserts one or more stall

cycles as needed. Efficient programming with dependency-free

instructions can eliminate most computational and memory

transfer data dependencies.

In addition, the ADSP-TS202S processor supports SIMD opera-

tions two ways—SIMD compute blocks and SIMD

computations. The programmer can load both compute blocks

with the same data (broadcast distribution) or different data

(merged distribution).

DUAL COMPUTE BLOCKS

The ADSP-TS202S processor has compute blocks that can

execute computations either independently or together as a sin-

gle-instruction, multiple-data (SIMD) engine. The DSP can

issue up to two compute instructions per compute block each

cycle, instructing the ALU, multiplier, or shifter to perform

independent, simultaneous operations. Each compute block can

execute eight 8-bit, four 16-bit, two 32-bit, or one 64-bit SIMD

computations in parallel with the operation in the other block.

These computation units support IEEE 32-bit single-precision

floating-point, extended-precision 40-bit floating point, and

8-, 16-, 32-, and 64-bit fixed-point processing.

The compute blocks are referred to as X and Y in assembly

syntax, and each block contains three computational units—an

ALU, a multiplier, and a 64-bit shifter—and a 32-word

†

Static Superscalar is a trademark of Analog Devices, Inc.

• Register File—each compute block has a multiported,

32-word, fully orthogonal register file used for transferring

data between the computation units and data buses and for

†

architecture.

Rev. C | Page 4 of 48 | December 2006

Using these features, the compute blocks can

DATA ALIGNMENT BUFFER (DAB)

The DAB is a quad-word FIFO that enables loading of quad-

word data from nonaligned addresses. Normally, load instruc-

tions must be aligned to their data size so that quad words are

loaded from a quad-aligned address. Using the DAB signifi-

cantly improves the efficiency of some applications, such as

FIR filters.

DUAL INTEGER ALU (IALU)

The ADSP-TS202S processor has two IALUs that provide pow-

erful address generation capabilities and perform many general-

purpose integer operations. The IALUs are referred to as J and

K in assembly syntax and have the following features:

As address generators, the IALUs perform immediate or indi-

rect (pre- and post-modify) addressing. They perform modulus

and bit-reverse operations with no constraints placed on mem-

ory addresses for the modulus data buffer placement. Each

IALU can specify either a single-, dual-, or quad-word access

from memory.

The IALUs have hardware support for circular buffers, bit

reverse, and zero-overhead looping. Circular buffers facilitate

efficient programming of delay lines and other data structures

required in digital signal processing, and they are commonly

used in digital filters and Fourier transforms. Each IALU pro-

vides registers for four circular buffers, so applications can set

• ALU—the ALU performs a standard set of arithmetic oper-

• Multiplier—the multiplier performs both fixed- and float-

• Shifter—the 64-bit shifter performs logical and arithmetic

• Provide 8 MACS per cycle peak and 7.1 MACS per cycle

• Execute six single-precision floating-point or execute 24

• Perform two complex 16-bit MACS per cycle

• Provide memory addresses for data and update pointers

• Support circular buffering and bit-reverse addressing

• Perform general-purpose integer operations, increasing

• Include a 31-word register file for each IALU

storing intermediate results. Instructions can access the

registers in the register file individually (word-aligned), in

sets of two (dual-aligned), or in sets of four (quad-aligned).

ations in both fixed- and floating-point formats. It also

performs logic and permute operations.

ing-point multiplication and fixed-point multiply and

accumulate.

shifts, bit and bit stream manipulation, and field deposit

and extraction operations.

sustained 16-bit performance and provide 2 MACS per

cycle peak and 1.8 MACS per cycle sustained 32-bit perfor-

mance (based on FIR)

fixed-point (16-bit) operations per cycle, providing

3G FLOPS or 12.0G/s regular operations performance at

500 MHz

programming flexibility

ADSP-TS202S_06 AD [Analog Devices], ADSP-TS202S_06 Datasheet - Page 4

ADSP-TS202S_06

Related parts for ADSP-TS202S_06