Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Spru 375 G

Download as pdf or txt
Download as pdf or txt
You are on page 1of 669

TMS320C55x DSP Algebraic Instruction Set Reference Guide

Literature Number: SPRU375G October 2002

IMPORTANT NOTICE Texas Instruments Incorporated and its subsidiaries (TI) reserve the right to make corrections, modifications, enhancements, improvements, and other changes to its products and services at any time and to discontinue any product or service without notice. Customers should obtain the latest relevant information before placing orders and should verify that such information is current and complete. All products are sold subject to TIs terms and conditions of sale supplied at the time of order acknowledgment. TI warrants performance of its hardware products to the specifications applicable at the time of sale in accordance with TIs standard warranty. Testing and other quality control techniques are used to the extent TI deems necessary to support this warranty. Except where mandated by government requirements, testing of all parameters of each product is not necessarily performed. TI assumes no liability for applications assistance or customer product design. Customers are responsible for their products and applications using TI components. To minimize the risks associated with customer products and applications, customers should provide adequate design and operating safeguards. TI does not warrant or represent that any license, either express or implied, is granted under any TI patent right, copyright, mask work right, or other TI intellectual property right relating to any combination, machine, or process in which TI products or services are used. Information published by TI regarding third party products or services does not constitute a license from TI to use such products or services or a warranty or endorsement thereof. Use of such information may require a license from a third party under the patents or other intellectual property of that third party, or a license from TI under the patents or other intellectual property of TI. Reproduction of information in TI data books or data sheets is permissible only if reproduction is without alteration and is accompanied by all associated warranties, conditions, limitations, and notices. Reproduction of this information with alteration is an unfair and deceptive business practice. TI is not responsible or liable for such altered documentation. Resale of TI products or services with statements different from or beyond the parameters stated by TI for that product or service voids all express and any implied warranties for the associated TI product or service and is an unfair and deceptive business practice. TI is not responsible or liable for any such statements.

Mailing Address: Texas Instruments Post Office Box 655303 Dallas, Texas 75265

Copyright 2002, Texas Instruments Incorporated

Preface

Read This First

About This Manual


The TMS320C55x is a fixed-point digital signal processor (DSP) in the TMS320 DSP family, and it can use either of two forms of the instruction set: a mnemonic form or an algebraic form. This book is a reference for the algebraic form of the instruction set. It contains information about the instructions used for all types of operations. For information on the mnemonic instruction set, see TMS320C55x DSP Mnemonic Instruction Set Reference Guide, SPRU374.

Notational Conventions
This book uses the following conventions.
- In syntax descriptions, the instruction is in a bold typeface. Portions of a

syntax in bold must be entered as shown. Here is an example of an instruction syntax: lms(Xmem, Ymem, ACx, ACy) lms is the instruction, and it has four operands: Xmem, Ymem, ACx, and ACy. When you use lms, the operands should be actual dual datamemory operand values and accumulator values. A comma and a space (optional) must separate the four values.
- Square brackets, [ and ], identify an optional parameter. If you use an

optional parameter, specify the information within the brackets; do not type the brackets themselves.

Contents

iii

Related Related Documentation Documentation From From Texas Texas Instruments Instruments / Trademarks

Related Documentation From Texas Instruments


The following books describe the C55x devices and related support tools. To obtain a copy of any of these TI documents, call the Texas Instruments Literature Response Center at (800) 477-8924. When ordering, please identify the book by its title and literature number. TMS320C55x Technical Overview (SPRU393). This overview is an introduction to the TMS320C55x digital signal processor (DSP). The TMS320C55x is the latest generation of fixed-point DSPs in the TMS320C5000 DSP platform. Like the previous generations, this processor is optimized for high performance and low-power operation. This book describes the CPU architecture, low-power enhancements, and embedded emulation features of the TMS320C55x. TMS320C55x DSP CPU Reference Guide (literature number SPRU371) describes the architecture, registers, and operation of the CPU for the TMS320C55x digital signal processors (DSPs). TMS320C55x DSP Mnemonic Instruction Set Reference Guide (literature number SPRU374) describes the mnemonic instructions individually. It also includes a summary of the instruction set, a list of the instruction opcodes, and a cross-reference to the algebraic instruction set. TMS320C55x Programmers Guide (literature number SPRU376) describes ways to optimize C and assembly code for the TMS320C55x DSPs and explains how to write code that uses special features and instructions of the DSP. TMS320C55x Optimizing C Compiler Users Guide (literature number SPRU281) describes the TMS320C55x C Compiler. This C compiler accepts ANSI standard C source code and produces assembly language source code for TMS320C55x devices. TMS320C55x Assembly Language Tools Users Guide (literature number SPRU280) describes the assembly language tools (assembler, linker, and other tools used to develop assembly language code), assembler directives, macros, common object file format, and symbolic debugging directives for TMS320C55x devices.

Trademarks
TMS320, TMS320C54x, TMS320C55x, C54x, and C55x are trademarks of Texas Instruments.
iv

Contents

Contents
1 Terms, Symbols, and Abbreviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-1 Lists and defines the terms, symbols, and abbreviations used in the TMS320C55x DSP algebraic instruction set summary and in the individual instruction descriptions. 1.1 1.2 1.3 Instruction Set Terms, Symbols, and Abbreviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-2 Instruction Set Conditional (cond) Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-7 Affect of Status Bits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-9 1.3.1 Accumulator Overflow Status Bit (ACOVx) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-9 1.3.2 C54CM Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-9 1.3.3 CARRY Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-9 1.3.4 FRCT Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-9 1.3.5 INTM Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-9 1.3.6 M40 Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-10 1.3.7 RDM Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-12 1.3.8 SATA Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-12 1.3.9 SATD Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-13 1.3.10 SMUL Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-13 1.3.11 SXMD Status Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-13 1.3.12 Test Control Status Bit (TCx) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-13 Instruction Set Notes and Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-14 1.4.1 Notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-14 1.4.2 Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-14 Nonrepeatable Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-20

1.4

1.5 2

Parallelism Features and Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-1 Describes the parallelism features and rules of the TMS320C55x DSP algebraic instruction set. 2.1 2.2 2.3 Parallelism Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parallelism Basics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Resource Conflicts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.3.1 Operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.3.2 Address Generation Units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.3.3 Buses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Soft-Dual Parallelism . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.1 Soft-Dual Parallelism of MAR Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Execute Conditionally Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Other Exceptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-2 2-3 2-4 2-4 2-4 2-5 2-5 2-6 2-6 2-7
v

2.4 2.5 2.6

Contents

Introduction to Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-1 Provides an introduction to the addressing modes of the TMS320C55x DSP. 3.1 Introduction to the Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-2 3.2 Absolute Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-3 3.2.1 k16 Absolute Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-3 3.2.2 k23 Absolute Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-3 3.2.3 I/O Absolute Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-3 3.3 Direct Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-4 3.3.1 DP Direct Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-4 3.3.2 SP Direct Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-5 3.3.3 Register-Bit Direct Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-5 3.3.4 PDP Direct Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-5 3.4 Indirect Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-6 3.4.1 AR Indirect Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-6 3.4.2 Dual AR Indirect Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-14 3.4.3 CDP Indirect Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-16 3.4.4 Coefficient Indirect Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-18 3.5 Circular Addressing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-20 Instruction Set Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4-1 Provides a summary of the TMS320C55x DSP algebraic instruction set. Instruction Set Descriptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-1 Detailed information on the TMS320C55x DSP algebraic instruction set. Absolute Distance (abdst) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-2 Absolute Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-4 Addition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-7 Addition with Absolute Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-27 Addition with Parallel Store Accumulator Content to Memory . . . . . . . . . . . . . . . . . . . . . . . . . . 5-29 Addition or Subtraction Conditionally (adsc) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-31 Addition or Subtraction Conditionally with Shift (ads2c) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-33 Addition, Subtraction, or Move Accumulator Content Conditionally (adsc) . . . . . . . . . . . . . . . 5-36 Bitwise AND . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-38 Bitwise AND Memory with Immediate Value and Compare to Zero . . . . . . . . . . . . . . . . . . . . . 5-47 Bitwise OR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-48 Bitwise Exclusive OR (XOR) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-57 Branch Conditionally (if goto) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-66 Branch Unconditionally (goto) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-70 Branch on Auxiliary Register Not Zero (if goto) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-74 Call Conditionally (if call) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-77 Call Unconditionally (call) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-83 Circular Addressing Qualifier (circular) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-87 Clear Accumulator, Auxiliary, or Temporary Register Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-88 Clear Memory Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-89 Clear Status Register Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-90 Compare Accumulator, Auxiliary, or Temporary Register Content . . . . . . . . . . . . . . . . . . . . . . . 5-93 Compare Accumulator, Auxiliary, or Temporary Register Content with AND . . . . . . . . . . . . . . 5-95 Compare Accumulator, Auxiliary, or Temporary Register Content with OR . . . . . . . . . . . . . . 5-100

4 5

vi

Contents

Compare Accumulator, Auxiliary, or Temporary Register Content Maximum (max) . . . . . . . Compare Accumulator, Auxiliary, or Temporary Register Content Minimum (min) . . . . . . . . Compare and Branch (compare goto) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Compare and Select Accumulator Content Maximum (max_diff) . . . . . . . . . . . . . . . . . . . . . . Compare and Select Accumulator Content Minimum (min_diff) . . . . . . . . . . . . . . . . . . . . . . . Compare Memory with Immediate Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Complement Accumulator, Auxiliary, or Temporary Register Bit (cbit) . . . . . . . . . . . . . . . . . . Complement Accumulator, Auxiliary, or Temporary Register Content . . . . . . . . . . . . . . . . . . Complement Memory Bit (cbit) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Compute Exponent of Accumulator Content (exp) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Compute Mantissa and Exponent of Accumulator Content (mant, exp) . . . . . . . . . . . . . . . . . Count Accumulator Bits (count) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Dual 16-Bit Additions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Dual 16-Bit Addition and Subtraction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Dual 16-Bit Subtractions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Dual 16-Bit Subtraction and Addition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Execute Conditionally (if execute) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Expand Accumulator Bit Field (field_expand) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Extract Accumulator Bit Field (field_extract) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Finite Impulse Response Filter, Antisymmetrical (firsn) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Finite Impulse Response Filter, Symmetrical (firs) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Idle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Least Mean Square (lms) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Linear Addressing Qualifier (linear) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load Accumulator from Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load Accumulator from Memory with Parallel Store Accumulator Content to Memory . . . . Load Accumulator Pair from Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load Accumulator with Immediate Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load Accumulator, Auxiliary, or Temporary Register from Memory . . . . . . . . . . . . . . . . . . . . . Load Accumulator, Auxiliary, or Temporary Register with Immediate Value . . . . . . . . . . . . . Load Auxiliary or Temporary Register Pair from Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load CPU Register from Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load CPU Register with Immediate Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load Extended Auxiliary Register from Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load Extended Auxiliary Register with Immediate Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Load Memory with Immediate Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Memory Delay (delay) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Memory-Mapped Register Access Qualifier (mmap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Modify Auxiliary Register Content (mar) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Modify Auxiliary Register Content with Parallel Multiply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Modify Auxiliary Register Content with Parallel Multiply and Accumulate . . . . . . . . . . . . . . . Modify Auxiliary Register Content with Parallel Multiply and Subtract . . . . . . . . . . . . . . . . . . Modify Auxiliary or Temporary Register Content (mar) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Modify Auxiliary or Temporary Register Content by Addition (mar) . . . . . . . . . . . . . . . . . . . . . Modify Auxiliary or Temporary Register Content by Subtraction (mar) . . . . . . . . . . . . . . . . . . Modify Data Stack Pointer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Modify Extended Auxiliary Register Content (mar) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Contents

5-105 5-108 5-111 5-114 5-120 5-126 5-128 5-129 5-130 5-131 5-132 5-134 5-135 5-140 5-145 5-154 5-159 5-166 5-167 5-168 5-170 5-172 5-173 5-175 5-176 5-185 5-187 5-190 5-193 5-199 5-203 5-204 5-207 5-209 5-210 5-211 5-212 5-213 5-214 5-216 5-218 5-223 5-225 5-229 5-233 5-237 5-238
vii

Contents

Move Accumulator Content to Auxiliary or Temporary Register . . . . . . . . . . . . . . . . . . . . . . . . Move Accumulator, Auxiliary, or Temporary Register Content . . . . . . . . . . . . . . . . . . . . . . . . . Move Auxiliary or Temporary Register Content to Accumulator . . . . . . . . . . . . . . . . . . . . . . . . Move Auxiliary or Temporary Register Content to CPU Register . . . . . . . . . . . . . . . . . . . . . . Move CPU Register Content to Auxiliary or Temporary Register . . . . . . . . . . . . . . . . . . . . . . Move Extended Auxiliary Register Content . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Move Memory to Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply with Parallel Multiply and Accumulate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply with Parallel Store Accumulator Content to Memory . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply and Accumulate (MAC) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply and Accumulate with Parallel Delay . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply and Accumulate with Parallel Load Accumulator from Memory . . . . . . . . . . . . . . . . Multiply and Accumulate with Parallel Multiply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply and Accumulate with Parallel Store Accumulator Content to Memory . . . . . . . . . . . Multiply and Subtract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply and Subtract with Parallel Load Accumulator from Memory . . . . . . . . . . . . . . . . . . . Multiply and Subtract with Parallel Multiply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply and Subtract with Parallel Multiply and Accumulate . . . . . . . . . . . . . . . . . . . . . . . . . . Multiply and Subtract with Parallel Store Accumulator Content to Memory . . . . . . . . . . . . . . Negate Accumulator, Auxiliary, or Temporary Register Content . . . . . . . . . . . . . . . . . . . . . . . No Operation (nop) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parallel Modify Auxiliary Register Contents (mar) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parallel Multiplies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parallel Multiply and Accumulates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Parallel Multiply and Subtracts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Peripheral Port Register Access Qualifiers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers (popboth) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Pop Top of Stack (pop) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Push Accumulator or Extended Auxiliary Register Content to Stack Pointers (pshboth) . . Push to Top of Stack (push) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Repeat Block of Instructions Unconditionally . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Repeat Single Instruction Conditionally (while/repeat) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Repeat Single Instruction Unconditionally (repeat) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Repeat Single Instruction Unconditionally and Decrement CSR (repeat) . . . . . . . . . . . . . . . Repeat Single Instruction Unconditionally and Increment CSR (repeat) . . . . . . . . . . . . . . . . Return Conditionally (if return) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Return Unconditionally (return) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Return from Interrupt (return_int) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Rotate Left Accumulator, Auxiliary, or Temporary Register Content . . . . . . . . . . . . . . . . . . . . Rotate Right Accumulator, Auxiliary, or Temporary Register Content . . . . . . . . . . . . . . . . . . . Round Accumulator Content (rnd) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Saturate Accumulator Content (saturate) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Set Accumulator, Auxiliary, or Temporary Register Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Set Memory Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Set Status Register Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
viii

5-239 5-240 5-242 5-243 5-245 5-247 5-248 5-255 5-267 5-269 5-271 5-286 5-288 5-290 5-292 5-294 5-302 5-304 5-306 5-311 5-313 5-315 5-316 5-317 5-319 5-326 5-328 5-330 5-331 5-338 5-339 5-346 5-357 5-360 5-365 5-367 5-370 5-372 5-374 5-376 5-378 5-380 5-382 5-384 5-385 5-386

Contents

Shift Accumulator Content Conditionally (sftc) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Shift Accumulator Content Logically . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Shift Accumulator, Auxiliary, or Temporary Register Content Logically . . . . . . . . . . . . . . . . . Signed Shift of Accumulator Content . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Signed Shift of Accumulator, Auxiliary, or Temporary Register Content . . . . . . . . . . . . . . . . . Software Interrupt (intr) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Software Reset (reset) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Software Trap (trap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Square . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Square and Accumulate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Square and Subtract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Square Distance (sqdst) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Store Accumulator Content to Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Store Accumulator Pair Content to Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Store Accumulator, Auxiliary, or Temporary Register Content to Memory . . . . . . . . . . . . . . . Store Auxiliary or Temporary Register Pair Content to Memory . . . . . . . . . . . . . . . . . . . . . . . Store CPU Register Content to Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Store Extended Auxiliary Register Content to Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Subtract Conditionally (subc) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Subtraction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Subtraction with Parallel Store Accumulator Content to Memory . . . . . . . . . . . . . . . . . . . . . . Swap Accumulator Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Swap Accumulator Pair Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Swap Auxiliary Register Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Swap Auxiliary Register Pair Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Swap Auxiliary and Temporary Register Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Swap Auxiliary and Temporary Register Pair Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . . Swap Auxiliary and Temporary Register Pairs Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . Swap Temporary Register Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Swap Temporary Register Pair Content (swap) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Test Accumulator, Auxiliary, or Temporary Register Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Test Accumulator, Auxiliary, or Temporary Register Bit Pair . . . . . . . . . . . . . . . . . . . . . . . . . . . Test Memory Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Test and Clear Memory Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Test and Complement Memory Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Test and Set Memory Bit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

5-389 5-391 5-394 5-397 5-406 5-411 5-413 5-417 5-419 5-422 5-425 5-428 5-430 5-450 5-453 5-457 5-458 5-462 5-463 5-465 5-490 5-492 5-493 5-494 5-495 5-496 5-498 5-500 5-502 5-503 5-504 5-506 5-508 5-511 5-512 5-513

Instruction Opcodes in Sequential Order . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6-1 The opcode in sequential order for each TMS320C55x DSP instruction syntax. 6.1 6.2 Instruction Set Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6-2 Instruction Set Opcode Symbols and Abbreviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6-16

Cross-Reference of Algebraic and Mnemonic Instruction Sets . . . . . . . . . . . . . . . . . . . . . . 7-1 Cross-Reference of TMS320C55x DSP Algebraic and Mnemonic Instruction Sets.
Contents ix

Figures

Figures
51 52 53 54 Status Registers Bit Mapping . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-92 Legal Uses of Repeat Block of Instructions Unconditionally (localrepeat) Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-350 Status Registers Bit Mapping . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-388 Effects of a Software Reset on Status Registers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-416

Tables
11 12 13 14 31 32 33 34 35 36 37 38 39 310 41 51 52 53 54 55 56 61 62 71
x

Instruction Set Terms, Symbols, and Abbreviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-2 Operators Used in Instruction Set . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-6 Instruction Set Conditional (cond) Field . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-7 Nonrepeatable Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-20 Addressing-Mode Operands . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-2 Absolute Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-3 Direct Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-4 Indirect Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-6 DSP Mode Operands for the AR Indirect Addressing Mode . . . . . . . . . . . . . . . . . . . . . . . . . 3-8 Control Mode Operands for the AR Indirect Addressing Mode . . . . . . . . . . . . . . . . . . . . . . 3-12 Dual AR Indirect Operands . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-15 CDP Indirect Operands . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-17 Coefficient Indirect Operands . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-19 Circular Addressing Pointers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-20 Algebraic Instruction Set Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4-3 Opcodes for Load CPU Register from Memory Instruction . . . . . . . . . . . . . . . . . . . . . . . . 5-206 Opcodes for Load CPU Register with Immediate Value Instruction . . . . . . . . . . . . . . . . . 5-208 Opcodes for Move Auxiliary or Temporary Register Content to CPU Register Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-244 Opcodes for Move CPU Register Content to Auxiliary or Temporary Register Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-246 Effects of a Software Reset on DSP Registers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5-414 Opcodes for Store CPU Register Content to Memory Instruction . . . . . . . . . . . . . . . . . . 5-461 Instruction Set Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6-2 Instruction Set Opcode Symbols and Abbreviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6-16 Cross-Reference of Algebraic and Mnemonic Instruction Sets . . . . . . . . . . . . . . . . . . . . . . 7-2

Chapter 1

Terms, Symbols, and Abbreviations


This chapter lists and defines the terms, symbols, and abbreviations used in the TMS320C55x DSP algebraic instruction set summary and in the individual instruction descriptions. Also provided are instruction set notes and rules and a list of nonrepeatable instructions.

Topic
1.1 1.2 1.3 1.4 1.5

Page
Instruction Set Terms, Symbols, and Abbreviations . . . . . . . . . . . . . . 1-2 Instruction Set Conditional (cond) Fields . . . . . . . . . . . . . . . . . . . . . . . 1-7 Affect of Status Bits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-9 Instruction Set Notes and Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-14 Nonrepeatable Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-20

1-1

Instruction Set Terms, Symbols, and Abbreviations

1.1 Instruction Set Terms, Symbols, and Abbreviations


Table 11 lists the terms, symbols, and abbreviations used and Table 12 lists the operators used in the instruction set summary and in the individual instruction descriptions.

Table 11. Instruction Set Terms, Symbols, and Abbreviations


Symbol [ ] ACB ACOVx ACw, ACx, ACy, ACz ARn_mod ARx, ARy AU Baddr BitIn BitOut BORROW C, Cycles Meaning Optional operands Bus that brings D-unit registers to A-unit and P-unit operators Accumulator overflow status bit: ACOV0, ACOV1, ACOV2, ACOV3 Accumulator: AC0, AC1, AC2, AC3 Content of selected auxiliary register (ARn) is premodified or postmodified in the address generation unit. Auxiliary register: AR0, AR1, AR2, AR3, AR4, AR5, AR6, AR7 A unit Register bit address Shifted bit in: Test control flag 2 (TC2) or CARRY status bit Shifted bit out: Test control flag 2 (TC2) or CARRY status bit Logical complement of CARRY status bit Execution in cycles. For conditional instructions, x/y field means: x cycle, if the condition is true. y cycle, if the condition is false. Coefficient address generation unit Value of CARRY status bit Coefficient indirect operand referencing a 16-bit or 32-bit value in data space Condition based on accumulator value (ACx), auxiliary register (ARx) value, temporary register (Tx) value, test control (TCx) flag, or CARRY status bit. See section 1.2. Coefficient Read bus Computed single-repeat register

CA CARRY Cmem cond CR CSR

1-2

Terms, Symbols, and Abbreviations

SPRU375G

Instruction Set Terms, Symbols, and Abbreviations

Table 11. Instruction Set Terms, Symbols, and Abbreviations (Continued)


Symbol DA DR dst Meaning Data address generation unit Data Read bus Destination accumulator (ACx), lower 16 bits of auxiliary register (ARx), or temporary register (Tx): AC0, AC1, AC2, AC3 AR0, AR1, AR2, AR3, AR4, AR5, AR6, AR7 T0, T1, T2, T3 D unit Data Write bus Data address label coded on x bits (absolute address) Indicates if the instruction contains a parallel enable bit. Constant bus Constant bus Unsigned constant coded on x bits Signed constant coded on x bits Long-word single data memory access (32-bit data access). Same legal inputs as Smem. Program address label coded on x bits (unsigned offset relative to program counter register) Program address label coded on x bits (signed offset relative to program counter register) If the optional M40 keyword is applied to the instruction, the instruction provides the option to locally set M40 to 1 for the execution of the instruction Operator(s) used by an instruction. Pipeline phase in which the instruction executes: AD Address D Decode R Read X Execute Program or data address label coded on x bits (absolute address)

DU DW Dx E KAB KDB kx Kx Lmem lx Lx M40 Operator Pipe, Pipeline

Px

SPRU375G

Terms, Symbols, and Abbreviations

1-3

Instruction Set Terms, Symbols, and Abbreviations

Table 11. Instruction Set Terms, Symbols, and Abbreviations (Continued)


Symbol RELOP Meaning Relational operators: == < >= != rnd RPTC S, Size SA saturate SHFT SHIFTW Smem SP src equal to less than greater than or equal to not equal to

If the optional rnd keyword is applied to the instruction, rounding is performed in the instruction Single-repeat counter register Instruction size in bytes. Stack address generation unit If the optional saturate keyword is applied to the input operand, the 40-bit output of the operation is saturated 4-bit immediate shift value, 0 to 15 6-bit immediate shift value, 32 to +31 Word single data memory access (16-bit data access) Data stack pointer Source accumulator (ACx), lower 16 bits of auxiliary register (ARx), or temporary register (Tx): AC0, AC1, AC2, AC3 AR0, AR1, AR2, AR3, AR4, AR5, AR6, AR7 T0, T1, T2, T3 System stack pointer Status register: ST0, ST1, ST2, ST3 Auxiliary register (ARx) or temporary register (Tx): AR0, AR1, AR2, AR3, AR4, AR5, AR6, AR7 T0, T1, T2, T3 Test control flag: TC1, TC2 Transition register: TRN0, TRN1 Temporary register (Tx): T0, T1, T2, T3

SSP STx TAx, TAy

TCx, TCy TRNx Tx, Ty

1-4

Terms, Symbols, and Abbreviations

SPRU375G

Instruction Set Terms, Symbols, and Abbreviations

Table 11. Instruction Set Terms, Symbols, and Abbreviations (Continued)


Symbol uns XAdst Meaning If the optional uns keyword is applied to the input operand, the operand is zero extended Destination extended register: All 23 bits of data stack pointer (XSP), system stack pointer (XSSP), data page pointer (XDP), coefficient data pointer (XCDP), and extended auxiliary register (XARx): XAR0, XAR1, XAR2, XAR3, XAR4, XAR5, XAR6, XAR7 All 23 bits of extended auxiliary register: XAR0, XAR1, XAR2, XAR3, XAR4, XAR5, XAR6, XAR7 Source extended register: All 23 bits of data stack pointer (XSP), system stack pointer (XSSP), data page pointer (XDP), coefficient data pointer (XCDP), and extended auxiliary register (XARx): XAR0, XAR1, XAR2, XAR3, XAR4, XAR5, XAR6, XAR7 Accumulator: AC0, AC1, AC2, AC3 Destination extended register: All 23 bits of data stack pointer (XSP), system stack pointer (XSSP), data page pointer (XDP), coefficient data pointer (XCDP), and extended auxiliary register (XARx): XAR0, XAR1, XAR2, XAR3, XAR4, XAR5, XAR6, XAR7 xsrc Accumulator: AC0, AC1, AC2, AC3 Source extended register: All 23 bits of data stack pointer (XSP), system stack pointer (XSSP), data page pointer (XDP), coefficient data pointer (XCDP), and extended auxiliary register (XARx): XAR0, XAR1, XAR2, XAR3, XAR4, XAR5, XAR6, XAR7 Xmem, Ymem Indirect dual data memory access (two data accesses)

XARx XAsrc

xdst

SPRU375G

Terms, Symbols, and Abbreviations

1-5

Instruction Set Terms, Symbols, and Abbreviations

Table 12. Operators Used in Instruction Set


Symbols + * + << <<< < > == & | ^
Note:

Operators ~ % >> >>> <= >= != Unary plus, minus, 1s complement Multiplication, division, modulo Addition, subtraction Signed left shift, right shift Logical left shift, logical right shift Less than, less than or equal to Greater than, greater than or equal to Equal to, not equal to Bitwise AND Bitwise OR Bitwise exclusive OR (XOR)

Evaluation Right to left Left to right Left to right Left to right Left to right Left to right Left to right Left to right Left to right Left to right Left to right

Unary +, , and * have higher precedence than the binary forms.

1-6

Terms, Symbols, and Abbreviations

SPRU375G

Instruction Set Conditional (cond) Fields

1.2 Instruction Set Conditional (cond) Fields


Table 13 lists the testing conditions available in the cond field of the conditional instructions.

Table 13. Instruction Set Conditional (cond) Field


Bit or Register Accumulator Condition (cond) Field For Condition to be True ...

Tests the accumulator (ACx) content against 0. The comparison against 0 depends on M40 status bit:
-

If M40 = 0, ACx(310) is compared to 0. If M40 = 1, ACx(390) is compared to 0. ACx content is equal to 0 ACx content is less than 0 ACx content is greater than 0 ACx content is not equal to 0 ACx content is less than or equal to 0 ACx content is greater than or equal to 0

ACx == #0 ACx < #0 ACx > #0 ACx != #0 ACx <= #0 ACx >= #0 Accumulator Overflow Status Bit

Tests the accumulator overflow status bit (ACOVx) against 1; when the optional ! symbol is used before the bit designation, the bit can be tested against 0. When this condition is used, the corresponding ACOVx is cleared to 0. overflow(ACx) !overflow(ACx) ACOVx bit is set to 1 ACOVx bit is cleared to 0

Auxiliary Register

Tests the auxiliary register (ARx) content against 0. ARx == #0 ARx < #0 ARx > #0 ARx != #0 ARx <= #0 ARx >= #0 ARx content is equal to 0 ARx content is less than 0 ARx content is greater than 0 ARx content is not equal to 0 ARx content is less than or equal to 0 ARx content is greater than or equal to 0

CARRY Status Bit

Tests the CARRY status bit against 1; when the optional ! symbol is used before the bit designation, the bit can be tested against 0. CARRY !CARRY CARRY bit is set to 1 CARRY bit is cleared to 0

SPRU375G

Terms, Symbols, and Abbreviations

1-7

Instruction Set Conditional (cond) Fields

Table 13. Instruction Set Conditional (cond) Field (Continued)


Bit or Register Temporary Register Condition (cond) Field For Condition to be True ...

Tests the temporary register (Tx) content against 0. Tx == #0 Tx < #0 Tx > #0 Tx != #0 Tx <= #0 Tx >= #0 Tx content is equal to 0 Tx content is less than 0 Tx content is greater than 0 Tx content is not equal to 0 Tx content is less than or equal to 0 Tx content is greater than or equal to 0

Test Control Flags

Tests the test control flags (TC1 and TC2) independently against 1; when the optional ! symbol is used before the flag designation, the flag can be tested independently against 0. TCx !TCx TCx flag is set to 1 TCx flag is cleared to 0

TC1 and TC2 can be combined with an AND (&), OR (|), and XOR (^) logical bit combinations: TC1 & TC2 !TC1 & TC2 TC1 & !TC2 !TC1 & !TC2 TC1 AND TC2 is equal to 1 TC1 AND TC2 is equal to 1 TC1 AND TC2 is equal to 1 TC1 AND TC2 is equal to 1

TC1 | TC2 !TC1 | TC2 TC1 | !TC2 !TC1 | !TC2

TC1 OR TC2 is equal to 1 TC1 OR TC2 is equal to 1 TC1 OR TC2 is equal to 1 TC1 OR TC2 is equal to 1

TC1 ^ TC2 !TC1 ^ TC2 TC1 ^ !TC2 !TC1 ^ !TC2

TC1 XOR TC2 is equal to 1 TC1 XOR TC2 is equal to 1 TC1 XOR TC2 is equal to 1 TC1 XOR TC2 is equal to 1

1-8

Terms, Symbols, and Abbreviations

SPRU375G

Affect of Status Bits

1.3 Affect of Status Bits


1.3.1 Accumulator Overflow Status Bit (ACOVx)
The ACOV[03] depends on M40:
- When M40 = 0, overflow is detected at bit position 31 - When M40 = 1, overflow is detected at bit position 39

If an overflow is detected, the destination accumulator overflow status bit is set to 1.

1.3.2

C54CM Status Bit


- When C54CM = 0, the enhanced mode, the CPU supports code originally

developed for a TMS320C55x DSP.

- When C54CM = 1, the compatible mode, all the C55x CPU resources

remain available; therefore, as you translate code, you can take advantage of the additional features on the C55x DSP to optimize your code. This mode must be set when you are porting code that was originally developed for a TMS320C54x DSP.

1.3.3

CARRY Status Bit


- When M40 = 0, the carry/borrow is detected at bit position 31 - When M40 = 1, the carry/borrow is detected at bit position 39

When performing a logical shift or signed shift that affects the CARRY status bit and the shift count is zero, the CARRY status bit is cleared to 0.

1.3.4

FRCT Status Bit


- When FRCT = 0, the fractional mode is OFF and results of multiply opera-

tions are not shifted.


- When FRCT = 1, the fractional mode is ON and results of multiply opera-

tions are shifted left by 1 bit to eliminate an extra sign bit.

1.3.5

INTM Status Bit


The INTM bit globally enables or disables the maskable interrupts. This bit has no effect on nonmaskable interrupts (those that cannot be blocked by software).
- When INTM = 0, all unmasked interrupts are enabled. - When INTM = 1, all maskable interrupts are disabled.

SPRU375G

Terms, Symbols, and Abbreviations

1-9

Affect of Status Bits

1.3.6

M40 Status Bit


- When M40 = 0: J J J J J

overflow is detected at bit position 31 the carry/borrow is detected at bit position 31 saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow) TMS320C54x DSP compatibility mode for conditional instructions, the comparison against 0 (zero) is performed on 32 bits, ACx(310)

- When M40 = 1: J J J J

overflow is detected at bit position 39 the carry/borrow is detected at bit position 39 saturation values are 7F FFFF FFFFh (positive overflow) or 80 0000 0000h (negative overflow) for conditional instructions, the comparison against 0 (zero) is performed on 40 bits, ACx(390)

1.3.6.1

M40 Status Bit When Sign Shifting In D-unit shifter:


- When shifting to the LSBs: J

when M40 = 0, the input to the shifter is modified according to SXMD and then the modified input is shifted according to the shift quantity: H H if SXMD = 0, 0 is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter if SXMD = 1, bit 31 of the source operand is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter

J J

bit 39 is extended according to SXMD the shifted-out bit is extracted at bit position 0

- When shifting to the MSBs: J J J 1-10

0 is inserted at bit position 0 if M40 = 0, the shifted-out bit is extracted at bit position 31 if M40 = 1, the shifted-out bit is extracted at bit position 39
SPRU375G

Terms, Symbols, and Abbreviations

Affect of Status Bits

- After shifting, unless otherwise noted, when M40 = 0: J J J

overflow is detected at bit position 31 (if an overflow is detected, the destination ACOVx bit is set) the carry/borrow is detected at bit position 31 if SATD = 1, when an overflow is detected, ACx saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow) TMS320C54x DSP compatibility mode

- After shifting, unless otherwise noted, when M40 = 1: J J J

overflow is detected at bit position 39 (if an overflow is detected, the destination ACOVx bit is set) the carry/borrow is detected at bit position 39 if SATD = 1, when an overflow is detected, ACx saturation values are 7F FFFF FFFFh (positive overflow) or 80 0000 0000h (negative overflow)

In A-unit ALU:
- When shifting to the LSBs, bit 15 is sign extended - When shifting to the MSBs, 0 is inserted at bit position 0 - After shifting, unless otherwise noted: J J

overflow is detected at bit position 15 (if an overflow is detected, the destination ACOVx bit is set) if SATA = 1, when an overflow is detected, register saturation values are 7FFFh (positive overflow) or 8000h (negative overflow)

1.3.6.2

M40 Status Bit When Logically Shifting In D-unit shifter:


- When shifting to the LSBs: J J J

if M40 = 0, 0 is inserted at bit position 31 and the guard bits (3932) of the destination accumulator are cleared if M40 = 1, 0 is inserted at bit position 39 the shifted-out bit is extracted at bit position 0 and stored in the CARRY status bit
Terms, Symbols, and Abbreviations 1-11

SPRU375G

Affect of Status Bits

- When shifting to the MSBs: J J

0 is inserted at bit position 0 if M40 = 0, the shifted-out bit is extracted at bit position 31 and stored in the CARRY status bit, and the guard bits (3932) of the destination accumulator are cleared if M40 = 1, the shifted-out bit is extracted at bit position 39 and stored in the CARRY status bit

In A-unit ALU:
- When shifting to the LSBs: J J

0 is inserted at bit position 15 the shifted-out bit is extracted at bit position 0 and stored in the CARRY status bit 0 is inserted at bit position 0 the shifted-out bit is extracted at bit position 15 and stored in the CARRY status bit

- When shifting to the MSBs: J J

1.3.7

RDM Status Bit


When the optional rnd or R keyword is applied to the instruction, then rounding is performed in the D-unit shifter. This is done according to RDM:
- When RDM = 0, the biased rounding to the infinite is performed. 8000h (215) is added to the 40-bit result of the shift result. - When RDM = 1, the unbiased rounding to the nearest is performed.

According to the value of the 17 LSBs of the 40-bit result of the shift result, 8000h (215) is added:
if( 8000h < bit(150) < 10000h) add 8000h to the 40-bit result of the shift result. else if( bit(150) == 8000h) if( bit(16) == 1) add 8000h to the 40-bit result of the shift result.

If a rounding has been performed, the 16 lowest bits of the result are cleared to 0.

1.3.8

SATA Status Bit


This status bit controls operations performed in the A unit.
- When SATA = 0, no saturation is performed. - When SATA = 1 and an overflow is detected, the destination register is

saturated to 7FFFh (positive overflow) or 8000h (negative overflow).


1-12 Terms, Symbols, and Abbreviations SPRU375G

Affect of Status Bits

1.3.9

SATD Status Bit


This status bit controls operations performed in the D unit.
- When SATD = 0, no saturation is performed. - When SATD = 1 and an overflow is detected, the destination register is

saturated.

1.3.10 SMUL Status Bit


- When SMUL = 0, the saturation mode is OFF. - When SMUL = 1, the saturation mode is ON. When SMUL = 1, FRCT = 1,

and SATD = 1, the result of 18000h 18000h is saturated to 00 7FFF FFFFh (regardless of the value of the M40 bit). This forces the product of the two negative numbers to be a positive number. For multiplyand-accumulate/subtract instructions, the saturation is performed after the multiplication and before the addition/subtraction.

1.3.11 SXMD Status Bit


This status bit controls operations performed in the D unit.
- When SXMD = 0, input operands are zero extended. - When SXMD = 1, input operands are sign extended.

1.3.12 Test Control Status Bit (TCx)


The test/control status bits (TC1 or TC2) hold the result of a test performed by the instruction.

SPRU375G

Terms, Symbols, and Abbreviations

1-13

Instruction Set Notes and Rules

1.4 Instruction Set Notes and Rules


1.4.1 Notes
- Algebraic syntax keywords and operand modifiers are case insensitive.

You can write:


abdst(*AR0, *ar1, AC0, ac1)

or
aBdST(*ar0, *aR1, aC0, Ac1)
- Operands for commutative operations (+, *, &, |, ^) can be arranged in any

order.
- Expression qualifiers can be specified in any order. For example, these

two instructions are equivalent:


AC0 = m40(rnd(uns(*AR0) * uns(*AR1))) AC0 = rnd(m40(uns(*AR0) * uns(*AR1)))
- Algebraic instructions must use parenthesis in the exact form shown in the

instruction set. For example, this instruction is legal:


AC0 = AC0 + (AC1 << T0)

while both of these instructions are illegal:


AC0 = AC0 + ((AC1 << T0)) AC0 = AC0 + AC1 << T0

1.4.2

Rules
- Simple instructions are not allowed to span multiple lines. One exception,

single instructions that use the , notation to imply parallelism. These instructions may be split up following the , notation. The following example shows a single instruction (dual multiply) occupying two lines:
ACx = m40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = m40(rnd(uns(Ymem) * uns(coef(Cmem))))
- User-defined parallelism instructions (using || notation) are allowed to

span multiple lines. For example, all of the following instructions are legal:
AC0 = AC1 || AC0 = AC1 || AC2 = AC3 AC0 = AC1 || AC2 = AC3 AC0 = AC1 || AC2 = AC3
1-14 Terms, Symbols, and Abbreviations

AC2 = AC3

SPRU375G

Instruction Set Notes and Rules

- The block repeat syntax uses braces to delimit the block that is to be

repeated:
blockrepeat { instr instr : instr } localrepeat { instr instr : instr }

The left opening brace must appear on the same line as the repeat keyword. The right closing brace must appear alone on a line (trailing comments allowed). Note that a label placed just inside the closing brace of the loop is effectively outside the loop. The following two code sequences are equivalent:
localrepeat { instr1 instr2 Label: } instr3

and
localrepeat { instr1 instr2 } Label: instr3

A label is the address of the first construct following the label that gets assembled into code in the object file. A closing brace does not generate any code and so the label marks the address of the first instruction that generates code, that is, instr3. In this example, goto Label exits the loop, which is somewhat unintuitive:
localrepeat { goto Label instr2 Label: } instr3
SPRU375G Terms, Symbols, and Abbreviations 1-15

Instruction Set Notes and Rules

1.4.2.1

Reserved Words Register names and algebraic syntax keywords are reserved. They may not be used as names of identifiers, labels, etc.

1.4.2.2

Literal and Address Operands Literals in the algebraic strings are denoted as K or k fields. In the Smem address modes that require an offset, the offset is also a literal (K16 or k3). 8-bit and 16-bit literals are allowed to be linktime-relocatable; for other literals, the value must be known at assembly time. Addresses are the elements of the algebraic strings denoted by P, L, and l. Further, 16-bit and 24-bit absolute address Smem modes are addresses, as is the dma Smem mode, denoted by the @ syntax. Addresses may be assembly-time constants or symbolic linktime-known constants or expressions. Both literals and addresses follow syntax rule 1. For addresses only, rules 2 and 3 also apply.

Rule 1
A valid address or literal is a # followed by one of the following:
- a number (#123) - an identifier (#FOO) - a parenthesized expression (#(FOO + 2))

Note that # is not used inside the expression.

Rule 2
When an address is used in a dma, the address does not need to have a leading #, be it a number, a symbol or an expression. These are all legal:
@#123 @123 @#foo @foo @#(foo+2) @(foo+2)

1-16

Terms, Symbols, and Abbreviations

SPRU375G

Instruction Set Notes and Rules

Rule 3
When used in contexts other than dma (such as branch targets or Smemabsolute address), addresses generally need a leading #. As a convenience, the # may be omitted in front of an identifier. These are all legal: Branch
goto goto goto goto #123 #foo foo #(foo+2)

Absolute Address
*(#123) *(#foo) *(foo) *(#(foo+2))

These are illegal:


goto 123 goto (foo+2) *(123) *((foo+2))

1.4.2.3

Memory Operands
- Syntax of Smem is the same as that of Lmem or Baddr. - In the following instruction syntaxes, Smem cannot reference to a

memory-mapped register (MMR). No instruction can access a byte within a memory-mapped register. If Smem is an MMR in one of the following syntaxes, the DSP sends a hardware bus-error interrupt (BERRINT) request to the CPU.
dst = uns(high_byte(Smem)) dst = uns(low_byte(Smem)) ACx = low_byte(Smem) << #SHIFTW ACx = high_byte(Smem) << #SHIFTW high_byte(Smem) = src low_byte(Smem) = src
- Syntax of Xmem is the same as that of Ymem. - Syntax of coefficient operands, Cmem:

*CDP *CDP+ *CDP *(CDP + T0), when C54CM = 0 *(CDP + AR0), when C54CM = 1

When an instruction uses a Cmem operand with paralleled instructions, the pointer modification of the Cmem operand must be the same for both instructions of the paralleled pair or the assembler generates an error. For example:
AC0 = AC0 + (*AR2+ * coef(*CDP+)), AC1 = AC1 + (*AR3+ * coef(*CDP+))
SPRU375G Terms, Symbols, and Abbreviations 1-17

Instruction Set Notes and Rules

- An optional mmr prefix is allowed to be specified for indirect memory

operands, for example, mmr(*AR0). This is an assertion by you that this is an access to a memory-mapped register. The assembler checks whether such access is legal in given circumstances. The mmr prefix is supported for Xmem, Ymem, indirect Smem, indirect Lmem, and Cmem operands. It is not supported for direct memory operands; it is expected that an explicit mmap() parallel instruction is used in conjunction with direct memory operands to indicate MMR access. Note that the mmr prefix is part of the syntax. It is an implementation restriction that mmr cannot exchange positions with other prefixes around the memory operand, such as dbl or uns. If several prefixes are specified, mmr must be the innermost prefix. Thus, uns(mmr(*AR0)) is legal, but mmr(uns(*AR0)) is not legal.
- The following indirect operands cannot be used for accesses to I/O

space. An instruction using one of these operands requires a 2-byte extension for the constant. This extension would prevent the use of the port() qualifier needed to indicate an I/O-space access.
*ARn(#K16) *+ARn(#K16) *CDP(#K16) *+CDP(#K16)

Also, the following instructions that include the delay operation cannot be used for accesses to I/O space:
delay(Smem) ACx = rnd(ACx + (Smem * coef(Cmem))) [,T3 = Smem], delay(Smem)

Any illegal access to I/O space will generate a hardware bus-error interrupt (BERRINT) to be handled by the CPU. 1.4.2.4 Operand Modifiers Operand modifiers look like function calls on operands. Note that uns is an operand modifier and an instruction modifier meaning unsigned. The operand modifier uns is used when the operand is modified on the way to the rest of the operation (multiply-and-accumulate). The instruction modifier uns is used when the whole operation is affected (multiply, register compare, compare and branch).
1-18 Terms, Symbols, and Abbreviations SPRU375G

Instruction Set Notes and Rules

Modifier dbl dual HI high_byte LO low_byte pair rnd saturate uns

Meaning Access a true 32-bit memory operand Access a 32-bit memory operand for use as two independent 16-bit halves of the given operation Access upper 16 bits of the accumulator Access the high byte of the memory location Access lower 16 bits of the accumulator Access the low byte of the memory location Dual register access Round Saturate Unsigned operand

When an instruction uses a Cmem operand with paralleled instructions and the Cmem operand is defined as unsigned (uns), both Cmem operands of the paralleled pair must be defined as unsigned (and reciprocally). When an instruction uses both Xmem and Ymem operands with paralleled instructions and the Xmem operand is defined as unsigned (uns), Ymem operand must also be defined as unsigned (and reciprocally). 1.4.2.5 Operator Syntax Rules Instructions that read and write the same operand can also be written in op-assign form. For example:
AC0 = AC0 + *AR4

can also be written:


AC0 += *AR4

This form is supported for these operations: +=, =, &=, |=, ^= Note that in certain instances use of op-assign notation results in ambiguous algebraic assembly. This happens if the op-assign operator is not delimited by white space, for example: *AR0+=#4 is ambiguous, is it *AR0 += #4 or *AR0+ = #4 ? The assembler always parses adjacent += as plus-assign; therefore, this instructions is parsed as *AR0 += #4. *AR0+=*AR1 is ambiguous, is it *AR0 += *AR1 or *AR0+ =*AR1 ? Once again, the first form, *AR0 += *AR1, is used. This is not a valid instruction an error is printed.
SPRU375G Terms, Symbols, and Abbreviations 1-19

Nonrepeatable Instructions

1.5 Nonrepeatable Instructions


Table 14 lists the instructions that cannot be used in a repeatable instruction.

Table 14. Nonrepeatable Instructions


Instruction Description Addition Algebraic Syntax That Cannot Be Repeated ACy = ACx + (uns(Smem) << #SHIFTW) Smem = Smem + K16 Bitwise AND Bitwise OR Bitwise Exclusive OR (XOR) Bitwise AND Memory with Immediate Value and Compare to Zero Branch Conditionally Smem = Smem & k16 Smem = Smem | k16 Smem = Smem ^ k16 TCx = Smem & k16 if (cond) goto l4 if (cond) goto L8 if (cond) goto L16 if (cond) goto P24 Branch Unconditionally goto ACx goto L7 goto L16 goto P24 Branch on Auxiliary Register Not Zero Call Conditionally if (ARn_mod != #0) goto L16 if (cond) call L16 if (cond) call P24 Call Unconditionally call ACx call L16 call P24 Clear Status Register Bit Compare and Branch Compare Memory with Immediate Value bit(STx, k4) = #0 compare (uns(src RELOP K8)) goto L8 TCx = (Smem == K16)

This instruction may not be repeated when using the *(#k23) absolute addressing mode to access the memory operand Smem.

1-20

Terms, Symbols, and Abbreviations

SPRU375G

Nonrepeatable Instructions

Table 14. Nonrepeatable Instructions (Continued)


Instruction Description Execute Conditionally Algebraic Syntax That Cannot Be Repeated if (cond) execute(AD_Unit) if (cond) execute(D_Unit) Idle Load Accumulator from Memory Load CPU Register from Memory idle ACx = uns(Smem) << #SHIFTW DP = Smem RETA = dbl(Lmem) Load CPU Register with Immediate Value Load Memory with Immediate Value Move CPU Register Content to Auxiliary or Temporary Register Multiply Multiply and Accumulate Repeat Block of Instructions Unconditionally DP = k16 Smem = K16 TAx = RPTC

ACx = rnd(Smem * K8)[, T3 = Smem] ACy = rnd(ACx + (Smem * K8))[, T3 = Smem ] localrepeat{} blockrepeat{}

Repeat Single Instruction Conditionally Repeat Single Instruction Unconditionally

while (cond && (RPTC < k8)) repeat repeat(k8) repeat(k16) repeat(CSR)

Repeat Single Instruction Unconditionally and Decrement CSR Repeat Single Instruction Unconditionally and Increment CSR Return Conditionally Return Unconditionally Return from Interrupt Round Accumulator Content

repeat(CSR), CSR = k4

repeat(CSR), CSR += TAx repeat(CSR), CSR += k4 if (cond) return return return_int ACy = rnd(ACx)

This instruction may not be repeated when using the *(#k23) absolute addressing mode to access the memory operand Smem.

SPRU375G

Terms, Symbols, and Abbreviations

1-21

Nonrepeatable Instructions

Table 14. Nonrepeatable Instructions (Continued)


Instruction Description Set Status Register Bit Software Interrupt Software Reset Software Trap Store Accumulator Content to Memory Algebraic Syntax That Cannot Be Repeated bit(STx, k4) = #1 intr(k5) reset trap(k5) Smem = HI(rnd(ACx << #SHIFTW)) Smem = HI(saturate(uns(rnd(ACx << #SHIFTW)))) Store CPU Register Content to Memory Subtraction dbl(Lmem) = RETA ACy = ACx (uns(Smem) << #SHIFTW)

This instruction may not be repeated when using the *(#k23) absolute addressing mode to access the memory operand Smem.

1-22

Terms, Symbols, and Abbreviations

SPRU375G

Chapter 2

Parallelism Features and Rules


This chapter describes the parallelism features and rules of the TMS320C55x DSP algebraic instruction set.

Topic
2.1 2.2 2.3 2.4 2.5 2.6

Page
Parallelism Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-2 Parallelism Basics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-3 Resource Conflicts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-4 Soft-Dual Parallelism . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-5 Execute Conditionally Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-6 Other Exceptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2-7

2-1

Parallelism Features

2.1 Parallelism Features


The C55x DSP architecture enables you to execute two instructions in parallel within the same cycle of execution. The types of parallelism are:
- Built-in parallelism within a single instruction.

Some instructions perform two different operations in parallel. A comma is used to separate the two operations. This type of parallelism is also called implied parallelism. For example:
AC0 = *AR0 * coef(*CDP), AC1 = *AR1 * coef(*CDP) This is a single instruction. The data referenced by AR0 is multiplied by the coefficient referenced by CDP. At the same time, the data referenced by AR1 is multiplied by the same coefficient (CDP).

- User-defined parallelism between two instructions.

Two instructions may be paralleled by you or the C compiler. The parallel bars, ||, are used to separate the two instructions to be executed in parallel. For example:
AC1 = *AR1 * *AR2+ || T1 = T1 ^ AR2 The first instruction performs a multiplication in the D-unit. The second instruction performs a logical operation in the A-unit ALU.

- Built-in parallelism can be combined with user-defined parallelism.

Parenthesis separators can be used to determine boundaries of the two instructions. For example:
(AC2 = *AR3+ * AC1, T3 = *AR3+) || AR1 = #5 The first instruction includes implied parallelism. The second instruction is paralleled by you.

2-2

Parallelism Features and Rules

SPRU375G

Parallelism Basics

2.2 Parallelism Basics


In the parallel pair, all of these constraints must be met:
- Total size of both instructions may not exceed 6 bytes. - No resource conflicts as detailed in section 2.3. - One instruction must have a parallel enable bit or the pair must qualify for

soft-dual parallelism as detailed in section 2.4.


- No memory operand may use an addressing mode that requires a

constant that is 16 bits or larger:


J J J J J J J

*abs16(#k16) *(#k23) *port(#k16) *ARn(K16) *+ARn(K16) *CDP(K16) *+CDP(K16)

- The following instructions cannot be in parallel: J J J J J J

if (cond) goto P24 if (cond) call P24 idle intr(k5) reset trap(k5)

- Neither instruction in the parallel pair can use any of these instruction or

operand modifiers:
J J J J J

circular() linear() mmap() readport() writeport()

- A particular register or memory location can only be written once per

pipeline phase. Violations of this rule take many forms. Loading the same register twice is a simple case. Other cases include:
J J

Conflicting address mode modifications (for example, *AR2+ versus *AR2) Combining a SWAP instruction (modifies all of its registers) with any other instruction that writes one of the same registers
Parallelism Features and Rules 2-3

SPRU375G

Parallelism Parallelism Basics Basics / Resource Conflicts

Modifying the data stack pointer (SP) or system stack pointer (SSP) in combination with: H H H H H all Push to Top of Stack (push) instructions all Pop Top of Stack (pop) instructions all Call Conditionally, if (cond) call; and Call Unconditionally, call, instructions all Return Conditionally, if (cond) return; Return Unconditionally, return; and Return from Interrupt, return_int, instructions trap and intr instructions

- When both instructions in a parallel pair modify a status bit, the value of

that status bit becomes undefined.

2.3 Resource Conflicts


Every instruction uses some set of operators, address generation units, and buses, collectively called resources, while executing. To determine which resources are used by a specific instruction, see Table 41. Two instructions in parallel use all the resources of the individual instructions. A resource conflict occurs when two instructions use a combination of resources that is not supported on the C55x device. This section details the resource conflicts.

2.3.1

Operators
You may use each of these operators only once:
-

D Unit ALU D Unit Shift D Unit Swap A Unit Swap A Unit ALU P Unit

For an instruction that uses multiple operators, any other instruction that uses one or more of those same operators may not be placed in parallel.

2.3.2

Address Generation Units


You may use no more than the indicated number of data address generation units:
- 2 Data Address (DA) Generation Units - 1 Coefficient Address (CA) Generation Unit - 1 Stack Address (SA) Generation Unit

2-4

Parallelism Features and Rules

SPRU375G

Resource Conflicts / Soft-Dual Parallelism

2.3.3

Buses
You may use no more than the indicated number of buses:
-

2 Data Read (DR) Buses 1 Coefficient Read (CR) Bus 2 Data Write (DW) Buses 1 ACB Bus brings D-unit registers to A-unit and P-unit operators 1 KAB Bus Constant Bus 1 KDB Bus Constant Bus

2.4 Soft-Dual Parallelism


Instructions that reference memory operands do not have parallel enable bits. Two such instructions may still be combined with a type of parallelism called soft-dual parallelism. The constraints of soft-dual parallelism are:
- Both memory operands must meet the constraints of the dual AR indirect

addressing mode (Xmem and Ymem), as described in section 3.4.2. The operands available for the dual AR indirect addressing mode are:
J J J J J J J J J J J

*ARn *ARn+ *ARn *(ARn + AR0) *(ARn + T0) *(ARn AR0) *(ARn T0) *ARn(AR0) *ARn(T0) *(ARn + T1) *(ARn T1)

- Neither instruction can contain any of the following: J

Instructions embedding high_byte(Smem) and low_byte(Smem). H H H H H H dst = uns(high_byte(Smem)) dst = uns(low_byte(Smem)) ACx = low_byte(Smem) << #SHIFTW ACx = high_byte(Smem) << #SHIFTW high_byte(Smem) = src low_byte(Smem) = src
Parallelism Features and Rules 2-5

SPRU375G

Execute Execute Conditionally Soft-Dual Conditionally Parallelism /Instructions Instructions Execute Conditionally / Other Exceptions Instructions

These instructions that read and write the same memory location: H H H H H H cbit(Smem, src) bit(Smem, src) = #0 bit(Smem, src) = #1 TCx = bit(Smem, k4), bit(Smem, k4) = #1 TCx = bit(Smem, k4), bit(Smem, k4) = #0 TCx = bit(Smem, k4), cbit(Smem, k4)

- With regard to soft-dual parallelism, the mar(Smem) instruction has the

same properties as any memory reference instruction.

2.4.1

Soft-Dual Parallelism of MAR Instructions


Although the following modify auxiliary register (MAR) instructions do not reference memory and do not have parallel enable bits, they may be combined together or with any other memory reference instructions (not limited to Xmem/ Ymem) to form soft-dual parallelism.
-

mar(TAy mar(TAx mar(TAy mar(TAx mar(TAy mar(TAx

+ + = =

TAx) k8) TAx) k8) TAx) k8)

Note that this is not the full list of MAR instructions; instructions mar(TAx = D16) and mar(Smem) are not included.

2.5 Execute Conditionally Instructions


The parallelization of the execute conditionally, if (cond) execute, instructions does not adhere to the descriptions in this chapter. All of the specific instances of legal parallelism are covered in the execute conditionally descriptions in Chapter 5.

2-6

Parallelism Features and Rules

SPRU375G

Other Exceptions

2.6 Other Exceptions


The following are other exceptions not covered elsewhere in this chapter.
- These instructions, when k4 is a value of 08, change the value of the XDP

register:
J J

bit(ST0, k4) = #1 bit(ST0, k4) = #0

Therefore, they may not be combined with any of these load-the-DP instructions:
J J J

DP = Smem XDP = dbl(Lmem) XDP = popboth()

- An instruction that reads the repeat counter register (RPTC) may not be

combined with any single-repeat instruction:


J J J

repeat() repeat(CSR) while (cond) repeat

SPRU375G

Parallelism Features and Rules

2-7

Chapter 3

Introduction to Addressing Modes


This chapter provides an introduction to the addressing modes of the TMS320C55x DSP.

Topic
3.1 3.2 3.3 3.4 3.5

Page
Introduction to the Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . 3-2 Absolute Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-3 Direct Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-4 Indirect Addressing Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-6 Circular Addressing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-20

3-1

Introduction to the Addressing Modes

3.1 Introduction to the Addressing Modes


The TMS320C55x DSP supports three types of addressing modes that enable flexible access to data memory, to memory-mapped registers, to register bits, and to I/O space:
- The absolute addressing mode allows you to reference a location by

supplying all or part of an address as a constant in an instruction.


- The direct addressing mode allows you to reference a location using an

address offset.
- The indirect addressing mode allows you to reference a location using a

pointer. Each addressing mode provides one or more types of operands. An instruction that supports an addressing-mode operand has one of the following syntax elements listed in Table 31.

Table 31. Addressing-Mode Operands


Syntax Element(s) Baddr Description When an instruction contains Baddr, that instruction can access one or two bits in an accumulator (AC0AC3), an auxiliary register (AR0AR7), or a temporary register (T0T3). Only the register bit test/set/clear/complement instructions support Baddr. As you write one of these instructions, replace Baddr with a compatible operand. When an instruction contains Cmem, that instruction can access a single word (16 bits) of data from data memory. As you write the instruction, replace Cmem with a compatible operand. When an instruction contains Lmem, that instruction can access a long word (32 bits) of data from data memory or from a memory-mapped registers. As you write the instruction, replace Lmem with a compatible operand. When an instruction contains Smem, that instruction can access a single word (16 bits) of data from data memory, from I/O space, or from a memory-mapped register. As you write the instruction, replace Smem with a compatible operand. When an instruction contains Xmem and Ymem, that instruction can perform two simultaneous 16-bit accesses to data memory. As you write the instruction, replace Xmem and Ymem with compatible operands.

Cmem Lmem

Smem

Xmem and Ymem

3-2

Introduction to Addressing Modes

SPRU375G

Absolute Addressing Modes

3.2 Absolute Addressing Modes


Table 32 lists the absolute addressing modes available.

Table 32. Absolute Addressing Modes


Addressing Mode k16 absolute Description This mode uses the 7-bit register called DPH (high part of the extended data page register) and a 16-bit unsigned constant to form a 23-bit data-space address. This mode is used to access a memory location or a memory-mapped register. This mode enables you to specify a full address as a 23-bit unsigned constant. This mode is used to access a memory location or a memory-mapped register. This mode enables you to specify an I/O address as a 16-bit unsigned constant. This mode is used to access a location in I/O space.

k23 absolute I/O absolute

3.2.1

k16 Absolute Addressing Mode


The k16 absolute addressing mode uses the operand *abs16(#k16), where k16 is a 16-bit unsigned constant. DPH (the high part of the extended data page register) and k16 are concatenated to form a 23-bit data-space address. An instruction using this addressing mode encodes the constant as a 2-byte extension to the instruction. Because of the extension, an instruction using this mode cannot be executed in parallel with another instruction.

3.2.2

k23 Absolute Addressing Mode


The k23 absolute addressing mode uses the *(#k23) operand, where k23 is a 23-bit unsigned constant. An instruction using this addressing mode encodes the constant as a 3-byte extension to the instruction (the most-significant bit of this 3-byte extension is discarded). Because of the extension, an instruction using this mode cannot be executed in parallel with another instruction. Instructions using the operand *(#k23) to access the memory operand Smem cannot be used in a repeatable instruction. See Table 14 for a list of these instructions.

3.2.3

I/O Absolute Addressing Mode


The I/O absolute addressing mode uses the *port(#k16) operand, where k16 is a 16-bit unsigned constant. An instruction using this addressing mode encodes the constant as a 2-byte extension to the instruction. Because of the extension, an instruction using this mode cannot be executed in parallel with another instruction. The delay() instruction cannot use this mode.

SPRU375G

Introduction to Addressing Modes

3-3

Direct Addressing Modes

3.3 Direct Addressing Modes


Table 33 lists the direct addressing modes available.

Table 33. Direct Addressing Modes


Addressing Mode DP direct Description This mode uses the main data page specified by DPH (high part of the extended data page register) in conjunction with the data page register (DP). This mode is used to access a memory location or a memory-mapped register. This mode uses the main data page specified by SPH (high part of the extended stack pointers) in conjunction with the data stack pointer (SP). This mode is used to access stack values in data memory. This mode uses an offset to specify a bit address. This mode is used to access one register bit or two adjacent register bits. This mode uses the peripheral data page register (PDP) and an offset to specify an I/O address. This mode is used to access a location in I/O space.

SP direct

Register-bit direct PDP direct

The DP direct and SP direct addressing modes are mutually exclusive. The mode selected depends on the CPL bit in status register ST1_55:
CPL 0 1 Addressing Mode Selected DP direct addressing mode SP direct addressing mode

The register-bit and PDP direct addressing modes are independent of the CPL bit.

3.3.1

DP Direct Addressing Mode


When an instruction uses the DP direct addressing mode, a 23-bit address is formed. The 7 MSBs are taken from DPH that selects one of the 128 main data pages (0 through 127). The 16 LSBs are the sum of two values:
- The value in the data page register (DP). DP identifies the start address

of a 128-word local data page within the main data page. This start address can be any address within the selected main data page.
- A 7-bit offset (Doffset) calculated by the assembler. The calculation

depends on whether you are accessing data memory or a memorymapped register (using the mmap() qualifier). The concatenation of DPH and DP is called the extended data page register (XDP). You can load DPH and DP individually, or you can use an instruction that loads XDP.
3-4 Introduction to Addressing Modes SPRU375G

Direct Addressing Modes

3.3.2

SP Direct Addressing Mode


When an instruction uses the SP direct addressing mode, a 23-bit address is formed. The 7 MSBs are taken from SPH. The 16 LSBs are the sum of the SP value and a 7-bit offset that you specify in the instruction. The offset can be a value from 0 to 127. The concatenation of SPH and SP is called the extended data stack pointer (XSP). You can load SPH and SP individually, or you can use an instruction that loads XSP. On the first main data page, addresses 00 0000h00 005Fh are reserved for the memory-mapped registers. If any of your data stack is in main data page 0, make sure it uses only addresses 00 0060h00 FFFFh on that page.

3.3.3

Register-Bit Direct Addressing Mode


In the register-bit direct addressing mode, the offset you supply in the operand, @bitoffset, is an offset from the LSB of the register. For example, if bitoffset is 0, you are addressing the LSB of a register. If bitoffset is 3, you are addressing bit 3 of the register. Only the register bit test/set/clear/complement instructions support this mode. These instructions enable you to access bits in the following registers only: the accumulators (AC0AC3), the auxiliary registers (AR0AR7), and the temporary registers (T0T3).

3.3.4

PDP Direct Addressing Mode


When an instruction uses the PDP direct addressing mode, a 16-bit I/O address is formed. The 9 MSBs are taken from the 9-bit peripheral data page register (PDP) that selects one of the 512 peripheral data pages (0 through 511). Each page has 128 words (0 to 127). You select a particular word by specifying a 7-bit offset (Poffset) in the instruction. For example, to access the first word on a page, use an offset of 0. You must use a readport() or writeport() instruction qualifier to indicate that you are accessing an I/O-space location rather than a data-memory location. You place the readport() or the writeport() instruction qualifier in parallel with the instruction that performs the I/O-space access.

SPRU375G

Introduction to Addressing Modes

3-5

Indirect Addressing Modes

3.4 Indirect Addressing Modes


Table 34 list the indirect addressing modes available. You may use these modes for linear addressing or circular addressing.

Table 34. Indirect Addressing Modes


Addressing Mode AR indirect Description This mode uses one of eight auxiliary registers (AR0AR7) to point to data. The way the CPU uses the auxiliary register to generate an address depends on whether you are accessing data space (memory or memory-mapped registers), individual register bits, or I/O space. This mode uses the same address-generation process as the AR indirect addressing mode. This mode is used with instructions that access two or more data-memory locations. This mode uses the coefficient data pointer (CDP) to point to data. The way the CPU uses CDP to generate an address depends on whether you are accessing data space (memory or memory-mapped registers), individual register bits, or I/O space. This mode uses the same address-generation process as the CDP indirect addressing mode. This mode is available to support instructions that can access a coefficient in data memory at the same time they access two other data-memory values using the dual AR indirect addressing mode.

Dual AR indirect

CDP indirect

Coefficient indirect

3.4.1

AR Indirect Addressing Mode


The AR indirect addressing mode uses an auxiliary register ARn (n = 0, 1, 2, 3, 4, 5, 6, or 7) to point to data. The way the CPU uses ARn to generate an address depends on the access type:
For An Access To ... Data space (memory or registers) ARn Contains ... The 16 least significant bits (LSBs) of a 23-bit address. The 7 most significant bits (MSBs) are supplied by ARnH, which is the high part of extended auxiliary register XARn. For accesses to data space, use an instruction that loads XARn; ARn can be individually loaded, but ARnH cannot be loaded. A bit number. Only the register bit test/set/clear/complement instructions support AR indirect accesses to register bits. These instructions enable you to access bits in the following registers only: the accumulators (AC0AC3), the auxiliary registers (AR0AR7), and the temporary registers (T0T3). A 16-bit I/O address.

A register bit (or bit pair)

I/O space

3-6

Introduction to Addressing Modes

SPRU375G

Indirect Addressing Modes

The AR indirect addressing-mode operand available depends on the ARMS bit of status register ST2_55:
ARMS 0 DSP Mode or Control Mode DSP mode. The CPU can use the list of DSP mode operands (Table 35), which provide efficient execution of DSP-intensive applications. Control mode. The CPU can use the list of control mode operands (Table 36), which enable optimized code size for control system applications.

Table 35 (page 3-8) introduces the DSP operands available for the AR indirect addressing mode. Table 36 (page 3-12) introduces the control mode operands. When using the tables, keep in mind that:
- Both pointer modification and address generation are linear or circular

according to the pointer configuration in status register ST2_55. The content of the appropriate 16-bit buffer start address register (BSA01, BSA23, BSA45, or BSA67) is added only if circular addressing is activated for the chosen pointer.
- All additions to and subtractions from the pointers are done modulo 64K.

You cannot address data across main data pages without changing the value in the extended auxiliary register (XARn).

SPRU375G

Introduction to Addressing Modes

3-7

Indirect Addressing Modes

Table 35. DSP Mode Operands for the AR Indirect Addressing Mode
Operand *ARn Pointer Modification ARn is not modified. Supported Access Types Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *ARn+ ARn is incremented after the address is generated: If 16-bit/1-bit operation: ARn = ARn + 1 If 32-bit/2-bit operation: ARn = ARn + 2 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *ARn ARn is decremented after the address is generated: If 16-bit/1-bit operation: ARn = ARn 1 If 32-bit/2-bit operation: ARn = ARn 2 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *+ARn ARn is incremented before the address is generated: If 16-bit/1-bit operation: ARn = ARn + 1 If 32-bit/2-bit operation: ARn = ARn + 2 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *ARn ARn is decremented before the address is generated: If 16-bit/1-bit operation: ARn = ARn 1 If 32-bit/2-bit operation: ARn = ARn 2 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *(ARn + AR0) The 16-bit signed constant in AR0 is added to ARn after the address is generated: ARn = ARn + AR0 This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem)

3-8

Introduction to Addressing Modes

SPRU375G

Indirect Addressing Modes

Table 35. DSP Mode Operands for the AR Indirect Addressing Mode (Continued)
Operand *(ARn + T0) Pointer Modification The 16-bit signed constant in T0 is added to ARn after the address is generated: ARn = ARn + T0 This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. *(ARn AR0) The 16-bit signed constant in AR0 is subtracted from ARn after the address is generated: ARn = ARn AR0 This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. *(ARn T0) The 16-bit signed constant in T0 is subtracted from ARn after the address is generated: ARn = ARn T0 This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. *ARn(AR0) ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant in AR0 is used as an offset from that base pointer. This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. *ARn(T0) ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant in T0 is used as an offset from that base pointer. This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. *ARn(T1) ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant in T1 is used as an offset from that base pointer. Supported Access Types Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem)

SPRU375G

Introduction to Addressing Modes

3-9

Indirect Addressing Modes

Table 35. DSP Mode Operands for the AR Indirect Addressing Mode (Continued)
Operand *(ARn + T1) Pointer Modification The 16-bit signed constant in T1 is added to ARn after the address is generated: ARn = ARn + T1 Supported Access Types Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *(ARn T1) The 16-bit signed constant in T1 is subtracted from ARn after the address is generated: ARn = ARn T1 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *(ARn + AR0B) The 16-bit signed constant in AR0 is added to ARn after the address is generated: ARn = ARn + AR0 (The addition is done with reverse carry propagation) This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. Note: When this bit-reverse operand is used, ARn cannot be used as a circular pointer. If ARn is configured in ST2_55 for circular addressing, the corresponding buffer start address register value (BSAxx) is added to ARn, but ARn is not modified so as to remain inside a circular buffer. *(ARn + T0B) The 16-bit signed constant in T0 is added to ARn after the address is generated: ARn = ARn + T0 (The addition is done with reverse carry propagation) This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. Note: When this bit-reverse operand is used, ARn cannot be used as a circular pointer. If ARn is configured in ST2_55 for circular addressing, the corresponding buffer start address register value (BSAxx) is added to ARn, but ARn is not modified so as to remain inside a circular buffer. Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem)

3-10

Introduction to Addressing Modes

SPRU375G

Indirect Addressing Modes

Table 35. DSP Mode Operands for the AR Indirect Addressing Mode (Continued)
Operand *(ARn AR0B) Pointer Modification The 16-bit signed constant in AR0 is subtracted from ARn after the address is generated: ARn = ARn AR0 (The subtraction is done with reverse carry propagation) This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. Note: When this bit-reverse operand is used, ARn cannot be used as a circular pointer. If ARn is configured in ST2_55 for circular addressing, the corresponding buffer start address register value (BSAxx) is added to ARn, but ARn is not modified so as to remain inside a circular buffer. *(ARn T0B) The 16-bit signed constant in T0 is subtracted from ARn after the address is generated: ARn = ARn T0 (The subtraction is done with reverse carry propagation) This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. Note: When this bit-reverse operand is used, ARn cannot be used as a circular pointer. If ARn is configured in ST2_55 for circular addressing, the corresponding buffer start address register value (BSAxx) is added to ARn, but ARn is not modified so as to remain inside a circular buffer. *ARn(#K16) ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant (K16) is used as an offset from that base pointer. Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Supported Access Types Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem)

Note: When an instruction uses this operand, the constant Register bit (Baddr) is encoded in a 2-byte extension to the instruction. Because of the extension, an instruction using this operand cannot be executed in parallel with another instruction. *+ARn(#K16) The 16-bit signed constant (K16) is added to ARn before the address is generated: ARn = ARn + K16 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem)

Note: When an instruction uses this operand, the constant Register bit (Baddr) is encoded in a 2-byte extension to the instruction. Because of the extension, an instruction using this operand cannot be executed in parallel with another instruction.

SPRU375G

Introduction to Addressing Modes

3-11

Indirect Addressing Modes

Table 36. Control Mode Operands for the AR Indirect Addressing Mode
Operand *ARn Pointer Modification ARn is not modified. Supported Access Types Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *ARn+ ARn is incremented after the address is generated: If 16-bit/1-bit operation: ARn = ARn + 1 If 32-bit/2-bit operation: ARn = ARn + 2 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *ARn ARn is decremented after the address is generated: If 16-bit/1-bit operation: ARn = ARn 1 If 32-bit/2-bit operation: ARn = ARn 2 Data-memory (Smem, Lmem) Memory-mapped register Smem, Lmem) Register bit (Baddr) I/O-space (Smem) *(ARn + AR0) The 16-bit signed constant in AR0 is added to ARn after the address is generated: ARn = ARn + AR0 This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. *(ARn + T0) The 16-bit signed constant in T0 is added to ARn after the address is generated: ARn = ARn + T0 This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. *(ARn AR0) The 16-bit signed constant in AR0 is subtracted from ARn after the address is generated: ARn = ARn AR0 This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem)

3-12

Introduction to Addressing Modes

SPRU375G

Indirect Addressing Modes

Table 36. Control Mode Operands for the AR Indirect Addressing Mode (Continued)
Operand *(ARn T0) Pointer Modification The 16-bit signed constant in T0 is subtracted from ARn after the address is generated: ARn = ARn T0 This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. *ARn(AR0) ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant in AR0 is used as an offset from that base pointer. This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. *ARn(T0) ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant in T0 is used as an offset from that base pointer. This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. *ARn(#K16) ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant (K16) is used as an offset from that base pointer. Note: When an instruction uses this operand, the constant is encoded in a 2-byte extension to the instruction. Because of the extension, an instruction using this operand cannot be executed in parallel with another instruction. *+ARn(#K16) The 16-bit signed constant (K16) is added to ARn before the address is generated: ARn = ARn + K16 Note: When an instruction uses this operand, the constant is encoded in a 2-byte extension to the instruction. Because of the extension, an instruction using this operand cannot be executed in parallel with another instruction. *ARn(short(#k3)) ARn is not modified. ARn is used as a base pointer. The 3-bit unsigned constant (k3) is used as an offset from that base pointer. k3 is in the range 1 to 7. Supported Access Types Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem) Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr)

Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr)

Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register bit (Baddr) I/O-space (Smem)

SPRU375G

Introduction to Addressing Modes

3-13

Indirect Addressing Modes

3.4.2

Dual AR Indirect Addressing Mode


The dual AR indirect addressing mode enables you to make two data-memory accesses through the eight auxiliary registers, AR0AR7. As with single AR indirect accesses to data space, the CPU uses an extended auxiliary register to create each 23-bit address. You can use linear addressing or circular addressing for each of the two accesses. You may use the dual AR indirect addressing mode for:
- Executing an instruction that makes two 16-bit data-memory accesses. In

this case, the two data-memory operands are designated in the instruction syntax as Xmem and Ymem. For example:
ACx = (Xmem << #16) + (Ymem << #16)
- Executing two instructions in parallel. In this case, both instructions must

each access a single memory value, designated in the instruction syntaxes as Smem or Lmem. For example:
dst = Smem || dst = src & Smem

The operand of the first instruction is treated as an Xmem operand, and the operand of the second instruction is treated as a Ymem operand. The available dual AR indirect operands are a subset of the AR indirect operands. The ARMS status bit does not affect the set of dual AR indirect operands available. Note: The assembler rejects code in which dual operands use the same auxiliary register with two different auxiliary register modifications. You can use the same ARn for both operands, if one of the operands is *ARn or *ARn(T0); neither modifies ARn. Table 37 (page 3-15) introduces the operands available for the dual AR indirect addressing mode. Note that:
- Both pointer modification and address generation are linear or circular

according to the pointer configuration in status register ST2_55. The content of the appropriate 16-bit buffer start address register (BSA01, BSA23, BSA45, or BSA67) is added only if circular addressing is activated for the chosen pointer.
- All additions to and subtractions from the pointers are done modulo 64K.

You cannot address data across main data pages without changing the value in the extended auxiliary register (XARn).
3-14 Introduction to Addressing Modes SPRU375G

Indirect Addressing Modes

Table 37. Dual AR Indirect Operands


Operand *ARn Pointer Modification ARn is not modified. Supported Access Types Data-memory (Smem, Lmem, Xmem, Ymem) Data-memory (Smem, Lmem, Xmem, Ymem)

*ARn+

ARn is incremented after the address is generated: If 16-bit operation: ARn = ARn + 1 If 32-bit operation: ARn = ARn + 2 ARn is decremented after the address is generated: If 16-bit operation: ARn = ARn 1 If 32-bit operation: ARn = ARn 2 The 16-bit signed constant in AR0 is added to ARn after the address is generated: ARn = ARn + AR0 This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time.

*ARn

Data-memory (Smem, Lmem, Xmem, Ymem)

*(ARn + AR0)

Data-memory (Smem, Lmem, Xmem, Ymem)

*(ARn + T0)

The 16-bit signed constant in T0 is added to ARn after the address is generated: ARn = ARn + T0 This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time.

Data-memory (Smem, Lmem, Xmem, Ymem)

*(ARn AR0)

The 16-bit signed constant in AR0 is subtracted from ARn after the address is generated: ARn = ARn AR0 This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time.

Data-memory (Smem, Lmem, Xmem, Ymem)

*(ARn T0)

The 16-bit signed constant in T0 is subtracted from ARn after the address is generated: ARn = ARn T0 This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time.

Data-memory (Smem, Lmem, Xmem, Ymem)

*ARn(AR0)

ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant in AR0 is used as an offset from that base pointer. This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time.

Data-memory (Smem, Lmem, Xmem, Ymem)

SPRU375G

Introduction to Addressing Modes

3-15

Indirect Addressing Modes

Table 37. Dual AR Indirect Operands (Continued)


Operand *ARn(T0) Pointer Modification ARn is not modified. ARn is used as a base pointer. The 16-bit signed constant in T0 is used as an offset from that base pointer. This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. *(ARn + T1) The 16-bit signed constant in T1 is added to ARn after the address is generated: ARn = ARn + T1 The 16-bit signed constant in T1 is subtracted from ARn after the address is generated: ARn = ARn T1 Data-memory (Smem, Lmem, Xmem, Ymem) Data-memory (Smem, Lmem, Xmem, Ymem) Supported Access Types Data-memory (Smem, Lmem, Xmem, Ymem)

*(ARn T1)

3.4.3

CDP Indirect Addressing Mode


The CDP indirect addressing mode uses the coefficient data pointer (CDP) to point to data. The way the CPU uses CDP to generate an address depends on the access type:
For An Access To ... Data space (memory or registers) CDP Contains ... The 16 least significant bits (LSBs) of a 23-bit address. The 7 most significant bits (MSBs) are supplied by CDPH, the high part of the extended coefficient data pointer (XCDP). A bit number. Only the register bit test/set/clear/complement instructions support CDP indirect accesses to register bits. These instructions enable you to access bits in the following registers only: the accumulators (AC0AC3), the auxiliary registers (AR0AR7), and the temporary registers (T0T3). A 16-bit I/O address.

A register bit (or bit pair)

I/O space

Table 38 (page 3-17) introduces the operands available for the CDP indirect addressing mode. Note that:
- Both pointer modification and address generation are linear or circular

according to the pointer configuration in status register ST2_55. The content of the 16-bit buffer start address register BSAC is added only if circular addressing is activated for CDP.
3-16 Introduction to Addressing Modes SPRU375G

Indirect Addressing Modes

- All additions to and subtractions from CDP are done modulo 64K. You can-

not address data across main data pages without changing the value of CDPH (the high part of the extended coefficient data pointer).

Table 38. CDP Indirect Operands


Operand *CDP Pointer Modification CDP is not modified. Supported Access Types Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register-bit (Baddr) I/O-space (Smem) *CDP+ CDP is incremented after the address is generated: If 16-bit/1-bit operation: CDP = CDP + 1 If 32-bit/2-bit operation: CDP = CDP + 2 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register-bit (Baddr) I/O-space (Smem) *CDP CDP is decremented after the address is generated: If 16-bit/1-bit operation: CDP = CDP 1 If 32-bit/2-bit operation: CDP = CDP 2 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem) Register-bit (Baddr) I/O-space (Smem) *CDP(#K16) CDP is not modified. CDP is used as a base pointer. The 16-bit signed constant (K16) is used as an offset from that base pointer. Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem)

Note: When an instruction uses this operand, the constant Register-bit (Baddr) is encoded in a 2-byte extension to the instruction. Because of the extension, an instruction using this operand cannot be executed in parallel with another instruction. *+CDP(#K16) The 16-bit signed constant (K16) is added to CDP before the address is generated: CDP = CDP + K16 Data-memory (Smem, Lmem) Memory-mapped register (Smem, Lmem)

Note: When an instruction uses this operand, the constant Register-bit (Baddr) is encoded in a 2-byte extension to the instruction. Because of the extension, an instruction using this operand cannot be executed in parallel with another instruction.

SPRU375G

Introduction to Addressing Modes

3-17

Indirect Addressing Modes

3.4.4

Coefficient Indirect Addressing Mode


The coefficient indirect addressing mode uses the same address-generation process as the CDP indirect addressing mode for data-space accesses. The coefficient indirect addressing mode is supported by select memory-tomemory move and memory initialization instructions and by the following arithmetical instructions:
-

Dual multiply (accumulate/subtract) Finite impulse response filter Multiply Multiply and accumulate Multiply and subtract

Instructions using the coefficient indirect addressing mode to access data are mainly instructions performing operations with three memory operands per cycle. Two of these operands (Xmem and Ymem) are accessed with the dual AR indirect addressing mode. The third operand (Cmem) is accessed with the coefficient indirect addressing mode. The Cmem operand is carried on the BB bus. Keep the following facts about the BB bus in mind as you use the coefficient indirect addressing mode:
- The BB bus is not connected to external memory. If a Cmem operand is

accessed through the BB bus, the operand must be in internal memory.


- Although the following instructions access Cmem operands, they do not

use the BB bus to fetch the 16-bit or 32-bit Cmem operand.


Instruction Syntax Smem = Cmem Cmem = Smem Lmem = dbl(Cmem) Description of Cmem Access 16-bit read from Cmem 16-bit write to Cmem 32-bit read from Cmem Bus Used to Access Cmem DB EB CB for most significant word (MSW) DB for least significant word (LSW) FB for MSW EB for LSW

dbl(Cmem) = Lmem

32-bit write to Cmem

3-18

Introduction to Addressing Modes

SPRU375G

Indirect Addressing Modes

Consider the following instruction syntax. In one cycle, two multiplications can be performed in parallel. One memory operand (Cmem) is common to both multiplications, while dual AR indirect operands (Xmem and Ymem) are used for the other values in the multiplication.
ACx = Xmem * Cmem, ACy = Ymem * Cmem

To access three memory values (as in the above example) in a single cycle, the value referenced by Cmem must be located in a memory bank different from the one containing the Xmem and Ymem values. Table 39 introduces the operands available for the coefficient indirect addressing mode. Note that:
- Both pointer modification and address generation are linear or circular

according to the pointer configuration in status register ST2_55. The content of the 16-bit buffer start address register BSAC is added only if circular addressing is activated for CDP.
- All additions to and subtractions from CDP are done modulo 64K. You can-

not address data across main data pages without changing the value of CDPH (the high part of the extended coefficient data pointer).

Table 39. Coefficient Indirect Operands


Operand *CDP *CDP+ Pointer Modification CDP is not modified.1 CDP is incremented after the address is generated: If 16-bit operation: CDP = CDP + 1 If 32-bit operation: CDP = CDP + 2 CDP is decremented after the address is generated: If 16-bit operation: CDP = CDP 1 If 32-bit operation: CDP = CDP 2 The 16-bit signed constant in AR0 is added to CDP after the address is generated: CDP = CDP + AR0 This operand is available when C54CM = 1. This operand is usable when .c54cm_on is active at assembly time. *(CDP + T0) The 16-bit signed constant in T0 is added to CDP after the address is generated: CDP = CDP + T0 This operand is available when C54CM = 0. This operand is usable when .c54cm_off is active at assembly time. Data-memory Supported Access Type Data-memory Data-memory

*CDP

Data-memory

*(CDP + AR0)

Data-memory

SPRU375G

Introduction to Addressing Modes

3-19

Circular Addressing

3.5 Circular Addressing


Circular addressing can be used with any of the indirect addressing modes. Each of the eight auxiliary registers (AR0AR7) and the coefficient data pointer (CDP) can be independently configured to be linearly or circularly modified as they act as pointers to data or to register bits, see Table 310. This configuration is done with a bit (ARnLC) in status register ST2_55. To choose circular modification, set the bit.

Table 310. Circular Addressing Pointers


Pointer AR0 AR1 AR2 AR3 AR4 AR5 AR6 AR7 CDP Linear/Circular Configuration Bit ST2_55(0) = AR0LC ST2_55(1) = AR1LC ST2_55(2) = AR2LC ST2_55(3) = AR3LC ST2_55(4) = AR4LC ST2_55(5) = AR5LC ST2_55(6) = AR6LC ST2_55(7) = AR7LC ST2_55(8) = CDPLC Supplier of Main Data Page AR0H AR1H AR2H AR3H AR4H AR5H AR6H AR7H CDPH Buffer Start Address Register BSA01 BSA01 BSA23 BSA23 BSA45 BSA45 BSA67 BSA67 BSAC Buffer Size Register BK03 BK03 BK03 BK03 BK47 BK47 BK47 BK47 BKC

Each auxiliary register ARn has its own linear/circular configuration bit in ST2_55:
ARnLC 0 1 ARn Is Used For ... Linear addressing Circular addressing

The CDPLC bit in status register ST2_55 configures the DSP to use CDP for linear addressing or circular addressing:
CDPLC 0 1 CDP Is Used For ... Linear addressing Circular addressing

You can use the circular addressing instruction qualifier, circular(), if you want every pointer used by the instruction to be modified circularly, just add the circular() qualifier in parallel with the instruction. The circular addressing instruction qualifier overrides the linear/circular configuration in ST2_55.
3-20 Introduction to Addressing Modes SPRU375G

Chapter 4

Instruction Set Summary


This chapter provides a summary of the TMS320C55x DSP algebraic instruction set (Table 41). With each instruction, you will find the availability of a parallel enable bit, word count (size), cycle time, what pipeline phase the instruction executes, in what operator unit the instruction executes, how many of each address generation unit is used, and how many of each bus is used. Table 41 does not list all of the resources that may be used by an instruction, it only lists those that may result in a resource conflict, and thus prevent two instructions from being in parallel. If an instruction lists nothing in a particular column, it means that particular resource will never be in conflict for that instruction. The column heads of Table 41 are:
- Instruction: In cases where the resource usage of an instruction varies

with the kinds of registers, you see the notation <name>-AU for A-unit registers and <name>-DU for D-unit registers. So, dst-AU is a destination that is an A-unit register and src-DU is a source that is a D-unit register. In the few cases where that notation is insufficient, you see the cases listed in the Notes column.
- E: Whether that instruction has a parallel enable bit - S: The size of the instruction in bytes - C: Number of cycles required for the instruction - Pipe: The pipeline phase in which the instruction executes: Name AD D R X Phase Address Decode Read Execute

- Operator: Which operator(s) are used by this instruction. When an instruc-

tion uses multiple operators, any other instruction that uses one or more of those same operators may not be placed in parallel.
4-1

Instruction Set Summary

- Address Generation Unit: How many of each address generation unit is

used. The address generation units are:


Name DA CA SA Unit Data Address Generation Unit Coefficient Address Generation Unit Stack Address Generation Unit

- Buses: How many of each bus is used. The buses are: Name DR CR DW ACB KAB KDB Bus Data Read Coefficient Read Data Write Brings D unit registers to A unit and P unit operators Constants Constants

4-2

Instruction Set Summary

SPRU375G

SPRU375G Instruction Set Summary 4-3

Table 41. Algebraic Instruction Set Summary


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Absolute Distance (page 5-2)


abdst(Xmem, Ymem, ACx, ACy) N 4 1 X DU_ALU 2 . . 2 . . . . .

Absolute Value (page 5-4)


dst-AU = |src-AU| dst-AU = |src-DU| dst-DU = |src| Y Y Y 2 2 2 1 1 1 X X X AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

Addition (page 5-7)


[1] dst-AU = dst-AU + src-AU dst-AU = dst-AU + src-DU dst-DU = dst-DU + src [2] dst-AU = dst-AU + k4 dst-DU = dst-DU + k4 [3] dst-AU = src-AU + K16 dst-AU = src-DU + K16 dst-DU = src + K16 [4] dst-AU = src-AU + Smem dst-AU = src-DU + Smem dst-DU = src + Smem [5] ACy = ACy + (ACx << Tx) Y Y Y Y Y N N N N N N Y 2 2 2 2 2 4 4 4 3 3 3 2 1 1 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X X X AU_ALU AU_ALU DU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT DU_ALU DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT . . . . . . . . 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . 1 . . . . 1 . . 1 . . . . . . . . . . . . . . . . . 1 1 1 1 1 . . . . See Note 1. See Note 1. See Note 1.

[6]

ACy = ACy + (ACx << #SHIFTW)

Instruction Set Summary

[7] [8]

ACy = ACx + (K16 << #16) ACy = ACx + (K16 << #SHFT)

N N

4 4

1 1

X X

. .

. .

. .

. .

. .

. .

. .

. .

1 1

[9]

ACy = ACx + (Smem << Tx)

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-4 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. [10] [11] [12] [13] [14] [15] [16] Instruction ACy = ACx + (Smem << #16) ACy = ACx + uns(Smem) + CARRY ACy = ACx + uns(Smem) ACy = ACx + (uns(Smem) << #SHIFTW) ACy = ACx + dbl(Lmem) ACx = (Xmem << #16) + (Ymem << #16) Smem = Smem + K16 E N N N N N N N S 3 3 3 4 3 3 4 C 1 1 1 1 1 1 1 Pipe X X X X X X X Operator DU_ALU DU_ALU DU_ALU DU_ALU + DU_SHIFT DU_ALU DU_ALU DU_ALU DA 1 1 1 1 1 2 1 CA . . . . . . . SA . . . . . . . DR 1 1 1 1 2 2 1 CR . . . . . . . DW . . . . . . 1 Buses ACB . . . . . . . KAB . . . . . . . KDB . . . . . . 1 Notes

Addition with Absolute Value (page 5-27)


ACy = rnd(ACy + |ACx|) Y 2 1 X DU_ALU . . . . . . . . .

Addition with Parallel Store Accumulator Content to Memory (page 5-29)


ACy = ACx + (Xmem << #16), Ymem = HI(ACy << T2) N 4 1 X DU_ALU + DU_SHIFT 2 . . 2 . 2 . . .

Addition or Subtraction Conditionally (page 5-31)


[1] [2] ACy = adsc(Smem, ACx, TC1) ACy = adsc(Smem, ACx, TC2) N N 3 3 1 1 X X DU_ALU DU_ALU 1 1 . . . . 1 1 . . . . . . . . . .

Addition or Subtraction Conditionally with Shift (page 5-33)


ACy = ads2c(Smem, ACx, Tx, TC1, TC2) N 3 1 X DU_ALU + DU_SHIFT 1 . . 1 . . . . . .

Addition, Subtraction, or Move Accumulator Content Conditionally (page 5-36)


ACy = adsc(Smem, ACx, TC1, TC2) N 3 1 X DU_ALU 1 . . 1 . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-5

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Bitwise AND (page 5-38)


[1] dst-AU = dst-AU & src-AU dst-AU = dst-AU & src-DU dst-DU = dst-DU & src [2] dst-AU = src-AU & k8 dst-AU = src-DU & k8 dst-DU = src & k8 [3] dst-AU = src-AU & k16 dst-AU = src-DU & k16 dst-DU = src & k16 [4] dst-AU = src-AU & Smem dst-AU = src-DU & Smem dst-DU = src & Smem [5] [6] [7] [8] ACy = ACy & (ACx <<< #SHIFTW) ACy = ACx & (k16 <<< #16) ACy = ACx & (k16 <<< #SHFT) Smem = Smem & k16 Y Y Y Y Y Y N N N N N N Y N N N 2 2 2 3 3 3 4 4 4 3 3 3 3 4 4 4 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X X X X X X X AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU DU_ALU + DU_SHIFT DU_ALU DU_ALU + DU_SHIFT AU_ALU . . . . . . . . . 1 1 1 . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 . 1 . . 1 . . 1 . . 1 . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1 1 1 . . . . 1 1 1 See Note 1. See Note 1. See Note 1. See Note 1.

Bitwise AND Memory with Immediate Value and Compare to Zero (page 5-47)
[1] [2] TC1 = Smem & k16 TC2 = Smem & k16 N N 4 4 1 1 X X AU_ALU AU_ALU 1 1 . . . . 1 1 . . . . . . . . 1 1

Instruction Set Summary

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-6 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Bitwise OR (page 5-48)


[1] dst-AU = dst-AU | src-AU dst-AU = dst-AU | src-DU dst-DU = dst-DU | src [2] dst-AU = src-AU | k8 dst-AU = src-DU | k8 dst-DU = src | k8 [3] dst-AU = src-AU | k16 dst-AU = src-DU | k16 dst-DU = src | k16 [4] dst-AU = src-AU | Smem dst-AU = src-DU | Smem dst-DU = src | Smem [5] [6] [7] [8] ACy = ACy | (ACx <<< #SHIFTW) ACy = ACx | (k16 <<< #16) ACy = ACx | (k16 <<< #SHFT) Smem = Smem | k16 Y Y Y Y Y Y N N N N N N Y N N N 2 2 2 3 3 3 4 4 4 3 3 3 3 4 4 4 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X X X X X X X AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU DU_ALU + DU_SHIFT DU_ALU DU_ALU + DU_SHIFT AU_ALU . . . . . . . . . 1 1 1 . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 . 1 . . 1 . . 1 . . 1 . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1 1 1 . . . . 1 1 1 See Note 1. See Note 1. See Note 1. See Note 1.

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-7

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Bitwise Exclusive OR (XOR) (page 5-57)


[1] dst-AU = dst-AU ^ src-AU dst-AU = dst-AU ^ src-DU dst-DU = dst-DU ^ src [2] dst-AU = src-AU ^ k8 dst-AU = src-DU ^ k8 dst-DU = src ^ k8 [3] dst-AU = src-AU ^ k16 dst-AU = src-DU ^ k16 dst-DU = src ^ k16 [4] dst-AU = src-AU ^ Smem dst-AU = src-DU ^ Smem dst-DU = src ^ Smem [5] [6] [7] [8] ACy = ACy ^ (ACx <<< #SHIFTW) ACy = ACx ^ (k16 <<< #16) ACy = ACx ^ (k16 <<< #SHFT) Smem = Smem ^ k16 Y Y Y Y Y Y N N N N N N Y N N N 2 2 2 3 3 3 4 4 4 3 3 3 3 4 4 4 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X X X X X X X AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU DU_ALU + DU_SHIFT DU_ALU DU_ALU + DU_SHIFT AU_ALU . . . . . . . . . 1 1 1 . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 . 1 . . 1 . . 1 . . 1 . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1 1 1 . . . . 1 1 1 See Note 1. See Note 1. See Note 1. See Note 1.

Branch Conditionally (page 5-66)


[1] [2] [3] [4] if (cond) goto l4 if (cond) goto L8 if (cond) goto L16 if (cond) goto P24 N Y N N 2 3 4 5 6/5 6/5 6/5 5/5 R R R R P_UNIT P_UNIT P_UNIT P_UNIT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Instruction Set Summary

. .

x/y cycles: x cycles = condition true, y cycles = condition false

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-8 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Branch Unconditionally (page 5-70)


[1] [2] [3] [4] goto ACx goto L7 goto L16 goto P24 N Y Y N 2 2 3 4 10 6 6 5 X AD AD D P_UNIT P_UNIT P_UNIT P_UNIT . . . . . . . . . . . . . . . . . . . . . . . . 1 . . . . . . . . . . .

These instructions execute in 3 cycles if the addressed instruction is in the instruction buffer unit.

Branch on Auxiliary Register Not Zero (page 5-74)


if (ARn_mod != #0) goto L16 x/y cycles: x cycles = condition true, y cycles = condition false N 4 6/5 AD P_UNIT 1 . . . . . . . .

Call Conditionally (page 5-77)


[1] [2] if (cond) call L16 if (cond) call P24 N N 4 5 6/5 5/5 R R P_UNIT P_UNIT 1 1 . . 1 1 . . . . 2 2 . . . . . .

x/y cycles: x cycles = condition true, y cycles = condition false

Call Unconditionally (page 5-83)


[1] [2] [3] call ACx call L16 call P24 N Y N 2 3 4 10 6 5 X AD D P_UNIT P_UNIT P_UNIT 1 1 1 . . . 1 1 1 . . . . . . 2 2 2 1 . . . . . . . .

Circular Addressing Qualifier (page 5-87)


circular() N 1 1 AD . . . . . . . . .

Clear Accumulator, Auxiliary, or Temporary Register Bit (page 5-88)


bit(src-AU, Baddr) = #0 bit(src-DU, Baddr) = #0 N N 3 3 1 1 X X AU_ALU DU_ALU 1 1 . . . . . . . . . . . . . . . .

Clear Memory Bit (page 5-89)


bit(Smem, src) = #0 N 3 1 X AU_ALU 1 . . 1 . 1 . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-9

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Clear Status Register Bit (page 5-90)


[1] [2] [3] [4] bit(ST0, k4) = #0 bit(ST1, k4) = #0 bit(ST2, k4) = #0 bit(ST3, k4) = #0 Y Y Y Y 2 2 2 2 1 1 1 1 X X X X AU_ALU AU_ALU AU_ALU AU_ALU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1

When this instruction is decoded to modify status bit CAFRZ (15), CAEN (14), or CACLR (13), the CPU pipeline is flushed and the instruction is executed in 5 cycles regardless of the instruction context.

Compare Accumulator, Auxiliary, or Temporary Register Content (page 5-93)


[1] TC1 = uns(src-AU RELOP dst-AU) TC1 = uns(src RELOP dst) TC1 = uns(src-DU RELOP dst-DU) [2] TC2 = uns(src-AU RELOP dst-AU) TC2 = uns(src RELOP dst) TC2 = uns(src-DU RELOP dst-DU) Y Y Y Y Y Y 3 3 3 3 3 3 1 1 1 1 1 1 X X X X X X AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 . . 1 . . . . . . . . . . . . . . See Note 2. . See Note 2.

Compare Accumulator, Auxiliary, or Temporary Register Content with AND (page 5-95)
[1] TCx = TCy & uns(src-AU RELOP dst-AU) TCx = TCy & uns(src RELOP dst) TCx = TCy & uns(src-DU RELOP dst-DU) [2] TCx = !TCy & uns(src-AU RELOP dst-AU) TCx = !TCy & uns(src RELOP dst) TCx = !TCy & uns(src-DU RELOP dst-DU) Y Y Y Y Y Y 3 3 3 3 3 3 1 1 1 1 1 1 X X X X X X AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 . . 1 . . . . . . . . . . . . . See Note 2. See Note 2.

Instruction Set Summary

Compare Accumulator, Auxiliary, or Temporary Register Content with OR (page 5-100)


[1] TCx = TCy | uns(src-AU RELOP dst-AU) TCx = TCy | uns(src RELOP dst) TCx = TCy | uns(src-DU RELOP dst-DU) Y Y Y 3 3 3 1 1 1 X X X AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 2.

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-10 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. [2] Instruction TCx = !TCy | uns(src-AU RELOP dst-AU) TCx = !TCy | uns(src RELOP dst) TCx = !TCy | uns(src-DU RELOP dst-DU) E Y Y Y S 3 3 3 C 1 1 1 Pipe X X X Operator AU_ALU AU_ALU DU_ALU DA . . . CA . . . SA . . . DR . . . CR . . . DW . . . Buses ACB . 1 . KAB . . . KDB . . . See Note 2. Notes

Compare Accumulator, Auxiliary, or Temporary Register Content Maximum (page 5-105)


dst-AU = max(src-AU, dst-AU) dst-AU = max(src-DU, dst-AU) dst-DU = max(src, dst-DU) Y Y Y 2 2 2 1 1 1 X X X AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

Compare Accumulator, Auxiliary, or Temporary Register Content Minimum (page 5-108)


dst-AU = min(src-AU, dst-AU) dst-AU = min(src-DU, dst-AU) dst-DU = min(src, dst-DU) Y Y Y 2 2 2 1 1 1 X X X AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

Compare and Branch (page 5-111)


compare (uns(src-AU RELOP K8)) goto L8 compare (uns(src-DU RELOP K8)) goto L8 x/y cycles: x cycles = condition true, y cycles = condition false N N 4 4 7/6 7/6 X X AU_ALU + P_UNIT DU_ALU + P_UNIT . . . . . . . . . . . . . . . . 1 1

Compare and Select Accumulator Content Maximum (page 5-114)


[1] [2] max_diff(ACx, ACy, ACz, ACw) max_diff_dbl(ACx, ACy, ACz, ACw, TRNx) Y Y 3 3 1 1 X X DU_ALU DU_ALU . . . . . . . . . . . . . . . . . .

Compare and Select Accumulator Content Minimum (page 5-120)


[1] [2] min_diff(ACx, ACy, ACz, ACw) min_diff_dbl(ACx, ACy, ACz, ACw, TRNx) Y Y 3 3 1 1 X X DU_ALU DU_ALU . . . . . . . . . . . . . . . . . .

Compare Memory with Immediate Value (page 5-126)


[1] [2] TC1 = (Smem == K16) TC2 = (Smem == K16) N N 4 4 1 1 X X AU_ALU AU_ALU 1 1 . . . . 1 1 . . . . . . . . 1 1

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-11

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Complement Accumulator, Auxiliary, or Temporary Register Bit (page 5-128)


cbit(src-AU, Baddr) cbit(src-DU, Baddr) N N 3 3 1 1 X X AU_ALU DU_ALU 1 1 . . . . . . . . . . . . . . . .

Complement Accumulator, Auxiliary, or Temporary Register Content (page 5-129)


dst-AU = ~src-AU dst-AU = ~src-DU dst-DU = ~src Y Y Y 2 2 2 1 1 1 X X X AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

Complement Memory Bit (page 5-130)


cbit(Smem, src) N 3 1 X AU_ALU 1 . . 1 . 1 . . .

Compute Exponent of Accumulator Content (page 5-131)


Tx = exp(ACx) Y 3 1 X DU_ALU + DU_SHIFT + AU_ALU . . . . . . 1 . .

Compute Mantissa and Exponent of Accumulator Content (page 5-132)


ACy = mant(ACx), Tx = exp(ACx) Y 3 1 X DU_ALU + DU_SHIFT + AU_ALU . . . . . . 1 . .

Count Accumulator Bits (page 5-134)


[1] Tx = count(ACx, ACy, TC1) Y 3 1 X DU_ALU + DU_SHIFT + AU_ALU DU_ALU + DU_SHIFT + AU_ALU . . . . . . 1 . .

[2]

Tx = count(ACx, ACy, TC2)

Instruction Set Summary

Dual 16-Bit Additions (page 5-135)


[1] HI(ACy) = HI(Lmem) + HI(ACx), LO(ACy) = LO(Lmem) + LO(ACx) HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) + Tx N 3 1 X DU_ALU 1 . . 2 . . . . .

[2]

DU_ALU

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-12 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Dual 16-Bit Addition and Subtraction (page 5-140)


[1] [2] HI(ACx) = Smem + Tx, LO(ACx) = Smem Tx HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) Tx N N 3 3 1 1 X X DU_ALU DU_ALU 1 1 . . . . 1 2 . . . . . . . . . .

Dual 16-Bit Subtractions (page 5-145)


[1] [2] [3] [4] HI(ACy) = HI(ACx) HI(Lmem), LO(ACy) = LO(ACx) LO(Lmem) HI(ACy) = HI(Lmem) HI(ACx), LO(ACy) = LO(Lmem) LO(ACx) HI(ACx) = Tx HI(Lmem), LO(ACx) = Tx LO(Lmem) HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) Tx N N N N 3 3 3 3 1 1 1 1 X X X X DU_ALU DU_ALU DU_ALU DU_ALU 1 1 1 1 . . . . . . . . 2 2 2 2 . . . . . . . . . . . . . . . . . . . .

Dual 16-Bit Subtraction and Addition (page 5-154)


[1] [2] HI(ACx) = Smem Tx, LO(ACx) = Smem + Tx HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) + Tx N N 3 3 1 1 X X DU_ALU DU_ALU 1 1 . . . . 1 2 . . . . . . . . . .

Execute Conditionally (page 5-159)


[1] [2] if (cond) execute(AD_Unit) if (cond) execute(D_Unit) N N 2 2 1 1 AD X P_UNIT P_UNIT . . . . . . . . . . . . . . . . . .

Expand Accumulator Bit Field (page 5-166)


dst-AU = field_expand(ACx, k16) N 4 1 X DU_ALU + DU_SHIFT + AU_ALU DU_ALU + DU_SHIFT . . . . . . 1 . 1

dst-DU = field_expand(ACx, k16)

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-13

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Extract Accumulator Bit Field (page 5-167)


dst-AU = field_extract(ACx, k16) N 4 1 X DU_ALU + DU_SHIFT + AU_ALU DU_ALU + DU_SHIFT . . . . . . 1 . 1

dst-DU = field_extract(ACx, k16)

Finite Impulse Response Filter, Antisymmetrical (page 5-168)


firsn(Xmem, Ymem, coef(Cmem), ACx, ACy) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Finite Impulse Response Filter, Symmetrical (page 5-170)


firs(Xmem, Ymem, coef(Cmem), ACx, ACy) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Idle (page 5-172)


idle N 4 ? D P_UNIT . . . . . . . . .

Least Mean Square (LMS) (page 5-173)


lms(Xmem, Ymem, ACx, ACy) N 4 1 X DU_ALU 2 . . 2 . . . . .

Linear Addressing Qualifier (page 5-175)


linear() N 1 1 AD . . . . . . . . .

Load Accumulator from Memory (page 5-176)


[1] ACx = rnd(Smem << Tx) N 3 1 X DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT DU_ALU 1 . . 1 . . . . .

[2]

ACx = low_byte(Smem) << #SHIFTW

[3]

ACx = high_byte(Smem) << #SHIFTW

Instruction Set Summary

[4] [5] [6]

ACx = Smem << #16 ACx = uns(Smem) ACx = uns(Smem) << #SHIFTW

N N N

2 3 4

1 1 1

X X X

1 1

. . .

. . .

1 1 1

. . .

. . .

. . .

. . .

. . .

DU_ALU + DU_SHIFT

[7]

ACx = M40(dbl(Lmem))

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-14 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. [8] Instruction LO(ACx) = Xmem, HI(ACx) = Ymem E N S 3 C 1 Pipe X Operator DA 2 CA . SA . DR 2 CR . DW . Buses ACB . KAB . KDB . Notes

Load Accumulator Pair from Memory (page 5-187)


[1] [2] pair(HI(ACx)) = Lmem pair(LO(ACx)) = Lmem N N 3 3 1 1 X X 1 1 . . . . 2 2 . . . . . . . . . .

Load Accumulator with Immediate Value (page 5-190)


[1] [2] ACx = K16 << #16 ACx = K16 << #SHFT N N 4 4 1 1 X X DU_ALU DU_ALU + DU_SHIFT . . . . . . . . . . . . . . . . 1 1

Load Accumulator from Memory with Parallel Store Accumulator Content to Memory (page 5-185)
ACy = Xmem << #16, Ymem = HI(ACx << T2) N 4 1 X DU_ALU + DU_SHIFT 2 . . 2 . 2 . . .

Load Accumulator, Auxiliary, or Temporary Register from Memory (page 5-193)


[1] [2] [3] dst = Smem dst = uns(high_byte(Smem)) dst = uns(low_byte(Smem)) N N N 2 3 3 1 1 1 X X X 1 1 1 . . . . . . 1 1 1 . . . . . . . . . . . . . . .

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value (page 5-199)
[1] [2] [3] dst = k4 dst = k4 dst = K16 Y Y N 2 2 4 1 1 1 X X X . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1

Load Auxiliary or Temporary Register Pair from Memory (page 5-203)


pair(TAx) = Lmem N 3 1 X 1 . . 2 . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-15

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Load CPU Register from Memory (page 5-204)


[1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] BK03 = Smem BK47 = Smem BKC = Smem BSA01 = Smem BSA23 = Smem BSA45 = Smem BSA67 = Smem BSAC = Smem BRC0 = Smem BRC1 = Smem CDP = Smem CSR = Smem DP = Smem DPH = Smem PDP = Smem SP = Smem SSP = Smem TRN0 = Smem TRN1 = Smem RETA = dbl(Lmem) N N N N N N N N N N N N N N N N N N N N 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 5 X X X X X X X X X X X X X X X X X X X X 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Instruction Set Summary

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-16 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Load CPU Register with Immediate Value (page 5-207)


[1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] BK03 = k12 BK47 = k12 BKC = k12 BRC0 = k12 BRC1 = k12 CSR = k12 DPH = k7 PDP = k9 BSA01 = k16 BSA23 = k16 BSA45 = k16 BSA67 = k16 BSAC = k16 CDP = k16 DP = k16 SP = k16 SSP = k16 Y Y Y Y Y Y Y Y N N N N N N N N N 3 3 3 3 3 3 3 3 4 4 4 4 4 4 4 4 4 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 AD AD AD AD AD AD AD AD AD AD AD AD AD AD AD AD AD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 . . . . . . . . . . . . . . . . .

Load Extended Auxiliary Register from Memory (page 5-209)


XAdst = dbl(Lmem) N 3 1 X 1 . . 2 . . . . .

Load Extended Auxiliary Register with Immediate Value (page 5-210)


XAdst = k23 N 6 1 AD 1 . . 1 . . . . .

Load Memory with Immediate Value (page 5-211)


[1] [2] Smem = K8 Smem = K16 N N 3 4 1 1 X X 1 1 . . . . . . . . 1 1 . . . . 1 1

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-17

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Memory Delay (page 5-212)


delay(Smem) N 2 1 X 2 1 . 1 1 1 . . .

Memory-Mapped Register Access Qualifier (page 5-213)


mmap() N 1 1 D . . . . . . . . .

Modify Auxiliary Register Content (page 5-214)


mar(Smem) N 2 1 AD 1 . . 1 . . . . .

Modify Auxiliary Register Content with Parallel Multiply (page 5-216)


mar(Xmem), ACx = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Modify Auxiliary Register Content with Parallel Multiply and Accumulate (page 5-218)
[1] [2] mar(Xmem), ACx = M40(rnd(ACx + (uns(Ymem) * uns(coef(Cmem))))) mar(Xmem), ACx = M40(rnd((ACx >> #16) + (uns(Ymem) * uns(coef(Cmem))))) N N 4 4 1 1 X X DU_ALU DU_ALU 2 2 1 1 . . 2 2 1 1 . . . . . . . .

Modify Auxiliary Register Content with Parallel Multiply and Subtract (page 5-223)
mar(Xmem), ACx = M40(rnd(ACx (uns(Ymem) * uns(coef(Cmem))))) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Modify Auxiliary or Temporary Register Content (page 5-225)


[1] [2] [3] mar(TAy = TAx) mar(TAx = P8) mar(TAx = D16) N N N 3 3 4 1 1 1 AD AD AD 1 1 1 . . . . . . . . . . . . . . . . . . . 1 1 . . .

Modify Auxiliary or Temporary Register Content by Addition (page 5-229)


[1] [2] mar(TAy + TAx) mar(TAx + P8) N N 3 3 1 1 AD AD 1 1 . . . . . . . . . . . . . 1 . .

Instruction Set Summary

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-18 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Modify Auxiliary or Temporary Register Content by Subtraction (page 5-233)


[1] [2] mar(TAy TAx) mar(TAx P8) N N 3 3 1 1 AD AD 1 1 . . . . . . . . . . . . . 1 . .

Modify Data Stack Pointer (SP) (page 5-237)


SP = SP + K8 Y 2 1 AD . . . . . . . 1 .

Modify Extended Auxiliary Register Content (page 5-238)


XAdst = mar(Smem) N 3 1 AD 1 . . 1 . . . . .

Move Accumulator Content to Auxiliary or Temporary Register (page 5-239)


TAx = HI(ACx) Y 2 1 X AU_ALU . . . . . . 1 . .

Move Accumulator, Auxiliary, or Temporary Register Content (page 5-240)


dst-AU = src-AU dst-AU = src-DU dst-DU = src Y Y Y 2 2 2 1 1 1 X X X AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

Move Auxiliary or Temporary Register Content to Accumulator (page 5-242)


HI(ACx) = TAx Y 2 1 X DU_ALU . . . . . . . . .

Move Auxiliary or Temporary Register Content to CPU Register (page 5-243)


[1] [2] [3] [4] [5] [6] BRC0 = TAx BRC1 = TAx CDP = TAx CSR = TAx SP = TAx SSP = TAx Y Y Y Y Y Y 2 2 2 2 2 2 1 1 1 1 1 1 X X X X X X AU_ALU AU_ALU AU_ALU AU_ALU AU_ALU AU_ALU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-19

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Move CPU Register Content to Auxiliary or Temporary Register (page 5-245)


[1] [2] [3] [4] [5] [6] TAx = BRC0 TAx = BRC1 TAx = CDP TAx = RPTC TAx = SP TAx = SSP Y Y Y Y Y Y 2 2 2 2 2 2 1 1 1 1 1 1 X X X X X X AU_ALU AU_ALU AU_ALU AU_ALU AU_ALU AU_ALU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Move Extended Auxiliary Register Content (page 5-247)


xdst-AU = xsrc-AU xdst-AU = xsrc-DU xdst-DU = xsrc N N N 2 2 2 1 1 1 X X X AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

Move Memory to Memory (page 5-248)


[1] [2] [3] [4] [5] [6] Smem = coef(Cmem) coef(Cmem) = Smem Lmem = dbl(coef(Cmem)) dbl(coef(Cmem)) = Lmem dbl(Ymem) = dbl(Xmem) Ymem = Xmem N N N N N N 3 3 3 3 3 3 1 1 1 1 1 1 X X X X X X 2 2 2 2 2 2 . . . . . . . . . . . . 1 1 2 2 2 2 . . . . . . 1 1 2 2 2 2 . . . . . . . . . . . . . . . . . .

Multiply (MPY) (page 5-255)


[1] [2] [3] [4] [5] [6] ACy = rnd(ACy * ACx) ACy = rnd(ACx * Tx) ACy = rnd(ACx * K8) ACy = rnd(ACx * K16) ACx = rnd(Smem * coef(Cmem))[, T3 = Smem] ACy = rnd(Smem * ACx)[, T3 = Smem] Y Y Y N N N 2 2 3 4 3 3 1 1 1 1 1 1 X X X X X X DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU . . . . 1 1 . . . . 1 . . . . . . . . . . . 1 1 . . . . 1 . . . . . . . . . . . . . . . . . . . .

Instruction Set Summary

. 1 1 . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-20 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. [7] [8] [9] Instruction ACx = rnd(Smem * K8)[, T3 = Smem] ACx = M40(rnd(uns(Xmem) * uns(Ymem)))[, T3 = Xmem] ACx = rnd(uns(Tx * Smem))[, T3 = Smem] E N N N S 4 4 3 C 1 1 1 Pipe X X X Operator DU_ALU DU_ALU DU_ALU DA 1 2 1 CA . . . SA . . . DR 1 2 1 CR . . . DW . . . Buses ACB . . . KAB . . . KDB 1 . . Notes

Multiply with Parallel Multiply and Accumulate (page 5-267)


ACx = M40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Multiply with Parallel Store Accumulator Content to Memory (page 5-269)


ACy = rnd(Tx * Xmem), Ymem = HI(ACx << T2) [, T3 = Xmem] N 4 1 X DU_ALU + DU_SHIFT 2 . . 2 . 2 . . .

Multiply and Accumulate (MAC) (page 5-271)


[1] [2] [3] [4] [5] [6] [7] [8] [9] [10] ACy = rnd(ACy + (ACx * Tx)) ACy = rnd((ACy * Tx) + ACx) ACy = rnd(ACx + (Tx * K8)) ACy = rnd(ACx + (Tx * K16)) ACx = rnd(ACx + (Smem * coef(Cmem)))[, T3 = Smem] ACy = rnd(ACy + (Smem * ACx))[, T3 = Smem] ACy = rnd(ACx + (Tx * Smem))[, T3 = Smem] ACy = rnd(ACx + (Smem * K8))[, T3 = Smem ] ACy = M40(rnd(ACx + (uns(Xmem) * uns(Ymem))))[, T3 = Xmem] ACy = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(Ymem))))[, T3 = Xmem] Y Y Y N N N N N N N 2 2 3 4 3 3 3 4 4 4 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU . . . . 1 1 1 1 2 2 . . . . 1 . . . . . . . . . . . . . . . . . . . 1 1 1 1 2 2 . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 . . . 1 . .

Multiply and Accumulate with Parallel Delay (page 5-286)


ACx = rnd(ACx + (Smem * coef(Cmem)))[, T3 = Smem], delay(Smem) N 3 1 X DU_ALU 2 1 . 1 1 1 . . .

Multiply and Accumulate with Parallel Load Accumulator from Memory (page 5-288)
ACx = rnd(ACx + (Tx * Xmem)), ACy = Ymem << #16 [, T3 = Xmem] N 4 1 X DU_ALU 2 . . 2 . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-21

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Multiply and Accumulate with Parallel Multiply (page 5-290)


ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Multiply and Accumulate with Parallel Store Accumulator Content to Memory (page 5-292)
ACy = rnd(ACy + (Tx * Xmem)), Ymem = HI(ACx << T2) [, T3 = Xmem] N 4 1 X DU_ALU + DU_SHIFT 2 . . 2 . 2 . . .

Multiply and Subtract (MAS) (page 5-294)


[1] [2] [3] [4] [5] ACy = rnd(ACy (ACx * Tx)) ACx = rnd(ACx (Smem * coef(Cmem)))[, T3 = Smem] ACy = rnd(ACy (Smem * ACx))[, T3 = Smem] ACy = rnd(ACx (Tx * Smem))[, T3 = Smem] ACy = M40(rnd(ACx (uns(Xmem) * uns(Ymem))))[, T3 = Xmem] Y N N N N 2 3 3 3 4 1 1 1 1 1 X X X X X DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU . 1 1 1 2 . 1 . . . . . . . . . 1 1 1 2 . 1 . . . . . . . . . . . . . . . . . . . . . . .

Multiply and Subtract with Parallel Load Accumulator from Memory (page 5-302)
ACx = rnd(ACx (Tx * Xmem)), ACy = Ymem << #16 [, T3 = Xmem] N 4 1 X DU_ALU 2 . . 2 . . . . .

Multiply and Subtract with Parallel Multiply (page 5-304)


ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Multiply and Subtract with Parallel Multiply and Accumulate (page 5-306)
[1] [2] ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) N N 4 4 1 1 X X DU_ALU DU_ALU 2 2 1 1 . . 2 2 1 1 . . . . . . . .

Instruction Set Summary

Multiply and Subtract with Parallel Store Accumulator Content to Memory (page 5-311)
ACy = rnd(ACy (Tx * Xmem)), Ymem = HI(ACx << T2) [, T3 = Xmem] N 4 1 X DU_ALU + DU_SHIFT 2 . . 2 . 2 . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-22 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Negate Accumulator, Auxiliary, or Temporary Register Content (page 5-313)


dst-AU = src-AU dst-AU = src-DU dst-DU = src Y Y Y 2 2 2 1 1 1 X X X AU_ALU AU_ALU DU_ALU . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

No Operation (NOP) (page 5-315)


[1] [2] nop nop_16 Y Y 1 2 1 1 D D . . . . . . . . . . . . . . . . . .

Parallel Modify Auxiliary Register Contents (page 5-316)


mar(Xmem) , mar(Ymem) , mar(coef(Cmem)) N 4 1 X 2 1 . 2 1 . . . .

Parallel Multiplies (page 5-317)


ACx = M40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Parallel Multiply and Accumulates (page 5-319)


[1] [2] [3] ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M4(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) N N N 4 4 4 1 1 1 X X X DU_ALU DU_ALU DU_ALU 2 2 2 1 1 1 . . . 2 2 2 1 1 1 . . . . . . . . . . . .

Parallel Multiply and Subtracts (page 5-326)


ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy (uns(Ymem) * uns(coef(Cmem))))) N 4 1 X DU_ALU 2 1 . 2 1 . . . .

Peripheral Port Register Access Qualifiers (page 5-328)


[1] [2] readport() writeport() N N 1 1 1 1 D D . . . . . . . . . . . . . . . . . .

Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers (page 5-330)
xdst = popboth() Y 2 1 X 1 . 1 2 . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-23

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Pop Top of Stack (page 5-331)


[1] [2] [3] [4] [5] [6] dst1, dst2 = pop() dst = pop() dst, Smem = pop() ACx = dbl(pop()) Smem = pop() dbl(Lmem) = pop() Y Y N Y N N 2 2 3 2 2 2 1 1 1 1 1 1 X X X X X X 1 1 1 1 1 1 . . . . . . 1 1 1 1 1 1 2 1 2 2 1 2 . . . . . . . . 1 . 1 2 . . . . . . . . . . . . . . . . . .

Push Accumulator or Extended Auxiliary Register Content to Stack Pointers (page 5-338)
pushboth(xsrc) Y 2 1 X 1 . 1 . . 2 . . .

Push to Top of Stack (page 5-339)


[1] [2] [3] [4] [5] [6] push(src1, src2) push(src) push(src, Smem) dbl(push(ACx)) push(Smem) push(dbl(Lmem)) Y Y N Y N N 2 2 3 2 2 2 1 1 1 1 1 1 X X X X X X 1 1 1 1 1 1 . . . . . . 1 1 1 1 1 1 . . 1 . 1 2 . . . . . . 2 1 2 2 1 2 . . . . . . . . . . . . . . . . . .

Repeat Block of Instructions Unconditionally (page 5-346)


[1] [2] localrepeat{ } blockrepeat{ } Y Y 2 3 1 1 AD AD P_UNIT P_UNIT . . . . . . . . . . . . . . . . . .

Repeat Single Instruction Conditionally (page 5-357) Instruction Set Summary


while (cond && (RPTC < k8)) repeat Y 3 1 AD P_UNIT . . . . . . . 1 .

Repeat Single Instruction Unconditionally (page 5-360)


[1] [2] [3] repeat(k8) repeat(k16) repeat(CSR) Y Y Y 2 3 2 1 1 1 AD AD AD P_UNIT P_UNIT P_UNIT . . . . . . . . . . . . . . . . . . . . . 1 1 . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-24 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Repeat Single Instruction Unconditionally and Decrement CSR (page 5-365)


repeat(CSR), CSR = k4 Y 2 1 X AU_ALU + P_UNIT . . . . . . . . 1

Repeat Single Instruction Unconditionally and Increment CSR (page 5-367)


[1] [2] repeat(CSR), CSR += TAx repeat(CSR), CSR += k4 Y Y 2 2 1 1 X X AU_ALU + P_UNIT AU_ALU + P_UNIT . . . . . . . . . . . . . . . . . 1

Return Conditionally (page 5-370)


if (cond) return x/y cycles: x cycles = condition true, y cycles = condition false Y 3 5/5 R P_UNIT 1 . 1 2 . . . . .

Return Unconditionally (page 5-372)


return Y 2 5 D P_UNIT 1 . 1 2 . . . . .

Return from Interrupt (page 5-374)


return_int Y 2 5 D P_UNIT 1 . 1 2 . . . . .

Rotate Left Accumulator, Auxiliary, or Temporary Register Content (page 5-376)


dst-AU = BitOut \\ src-AU \\ BitIn dst-AU = BitOut \\ src-DU \\ BitIn dst-DU = BitOut \\ src \\ BitIn Y Y Y 3 3 3 1 1 1 X X X AU_ALU AU_ALU DU_ALU + DU_SHIFT . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

Rotate Right Accumulator, Auxiliary, or Temporary Register Content (page 5-378)


dst-AU = BitIn // src-AU // BitOut dst-AU = BitIn // src-DU // BitOut dst-DU = BitIn // src // BitOut Y Y Y 3 3 3 1 1 1 X X X AU_ALU AU_ALU DU_ALU + DU_SHIFT . . . . . . . . . . . . . . . . . . . 1 . . . . . . . See Note 1.

Round Accumulator Content (page 5-380)


ACy = rnd(ACx) Y 2 1 X DU_ALU . . . . . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-25

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Saturate Accumulator Content (page 5-382)


ACy = saturate(rnd(ACx)) Y 2 1 X DU_ALU . . . . . . . . .

Set Accumulator, Auxiliary, or Temporary Register Bit (page 5-384)


bit(src-AU, Baddr) = #1 bit(src-DU, Baddr) = #1 N N 3 3 1 1 X X AU_ALU DU_ALU 1 1 . . . . . . . . . . . . . . . .

Set Memory Bit (page 5-385)


bit(Smem, src) = #1 N 3 1 X AU_ALU 1 . . 1 . 1 . . .

Set Status Register Bit (page 5-386)


[1] [2] [3] [4] bit(ST0, k4) = #1 bit(ST1, k4) = #1 bit(ST2, k4) = #1 bit(ST3, k4) = #1 Y Y Y Y 2 2 2 2 1 1 1 1 X X X X AU_ALU AU_ALU AU_ALU AU_ALU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1

When this instruction is decoded to modify status bit CAFRZ (15), CAEN (14), or CACLR (13), the CPU pipeline is flushed and the instruction is executed in 5 cycles regardless of the instruction context.

Shift Accumulator Content Conditionally (page 5-389)


[1] [2] ACx = sftc(ACx, TC1) ACx = sftc(ACx, TC2) Y Y 2 2 1 1 X X DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT . . . . . . . . . . . . . . . . . .

Shift Accumulator Content Logically (page 5-391)


[1] [2] ACy = ACx <<< Tx ACy = ACx <<< #SHIFTW Y Y 2 3 1 1 X X DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT . . . . . . . . . . . . . . . . . .

Instruction Set Summary

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-26 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Shift Accumulator, Auxiliary, or Temporary Register Content Logically (page 5-394)


[1] dst-AU = dst-AU <<< #1 dst-DU = dst-DU <<< #1 [2] dst-AU = dst-AU >>> #1 dst-DU = dst-DU >>> #1 Y Y Y Y 2 2 2 2 1 1 1 1 X X X X AU_ALU DU_ALU + DU_SHIFT AU_ALU DU_ALU + DU_SHIFT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Signed Shift of Accumulator Content (page 5-397)


[1] [2] [3] [4] ACy = ACx << Tx ACy = ACx << #SHIFTW ACy = ACx <<C Tx ACy = ACx <<C #SHIFTW Y Y Y Y 2 3 2 3 1 1 1 1 X X X X DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content (page 5-406)


[1] dst-AU = dst-AU >> #1 dst-DU = dst-DU >> #1 [2] dst-AU = dst-AU << #1 dst-DU = dst-DU << #1 Y Y Y Y 2 2 2 2 1 1 1 1 X X X X AU_ALU DU_ALU + DU_SHIFT AU_ALU DU_ALU + DU_SHIFT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Software Interrupt (page 5-411)


intr(k5) N 2 3 D P_UNIT 1 . 1 . . 2 . . .

Software Reset (page 5-413)


reset N 2 ? D P_UNIT . . . . . . . . .

Software Trap (page 5-417)


trap(k5) N 2 ? D P_UNIT 1 . 1 . . 2 . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-27

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Square (page 5-419)


[1] [2] ACy = rnd(ACx * ACx) ACx = rnd(Smem * Smem)[, T3 = Smem] Y N 2 3 1 1 X X DU_ALU DU_ALU . 1 . . . . . 1 . . . . . . . . . .

Square and Accumulate (page 5-422)


[1] [2] ACy = rnd(ACy + (ACx * ACx)) ACy = rnd(ACx + (Smem * Smem))[, T3 = Smem] Y N 2 3 1 1 X X DU_ALU DU_ALU . 1 . . . . . 1 . . . . . . . . . .

Square and Subtract (page 5-425)


[1] [2] ACy = rnd(ACy (ACx * ACx)) ACy = rnd(ACx (Smem * Smem))[, T3 = Smem] Y N 2 3 1 1 X X DU_ALU DU_ALU . 1 . . . . . 1 . . . . . . . . . .

Square Distance (page 5-428)


sqdst(Xmem, Ymem, ACx, ACy) N 4 1 X DU_ALU 2 . . 2 . . . . .

Store Accumulator Content to Memory (page 5-430)


[1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] Smem = HI(ACx) Smem = HI(rnd(ACx)) Smem = LO(ACx << Tx) Smem = HI(rnd(ACx << Tx)) Smem = LO(ACx << #SHIFTW) Smem = HI(ACx << #SHIFTW) Smem = HI(rnd(ACx << #SHIFTW)) Smem = HI(saturate(uns(rnd(ACx)))) Smem = HI(saturate(uns(rnd(ACx << Tx)))) Smem = HI(saturate(uns(rnd(ACx << #SHIFTW)))) dbl(Lmem) = ACx dbl(Lmem) = saturate(uns(ACx)) N N N N N N N N N N N N 2 3 3 3 3 3 4 3 3 4 3 3 1 1 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X X X DU_SHIFT DU_SHIFT DU_SHIFT DU_SHIFT DU_SHIFT DU_SHIFT DU_SHIFT DU_SHIFT DU_SHIFT DU_SHIFT 1 1 1 1 1 1 1 1 1 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1 1 1 1 1 1 1 2 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Instruction Set Summary

. . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-28 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. [13] Instruction HI(Lmem) = HI(ACx) >> #1, LO(Lmem) = LO(ACx) >> #1 Xmem = LO(ACx), Ymem = HI(ACx) E N S 3 C 1 Pipe X Operator DU_SHIFT DA 1 CA . SA . DR . CR . DW 2 Buses ACB . KAB . KDB . Notes

[14]

Store Accumulator Pair Content to Memory (page 5-450)


[1] [2] Lmem = pair(HI(ACx)) Lmem = pair(LO(ACx)) N N 3 3 1 1 X X 1 1 . . . . . . . . 2 2 . . . . . .

Store Accumulator, Auxiliary, or Temporary Register Content to Memory (page 5-453)


[1] [2] [3] Smem = src high_byte(Smem) = src low_byte(Smem) = src N N N 2 3 3 1 1 1 X X X 1 1 1 . . . . . . . . . . . . 1 1 1 . . . . . . . . .

Store Auxiliary or Temporary Register Pair Content to Memory (page 5-457)


Lmem = pair(TAx) N 3 1 X 1 . . . . 2 . . .

Store CPU Register Content to Memory (page 5-458)


[1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] Smem = BK03 Smem = BK47 Smem = BKC Smem = BSA01 Smem = BSA23 Smem = BSA45 Smem = BSA67 Smem = BSAC Smem = BRC0 Smem = BRC1 Smem = CDP Smem = CSR N N N N N N N N N N N N 3 3 3 3 3 3 3 3 3 3 3 3 1 1 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X X X 1 1 1 1 1 1 1 1 1 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 1 1 1 1 1 1 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-29

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. [13] [14] [15] [16] [17] [18] [19] [20] Instruction Smem = DP Smem = DPH Smem = PDP Smem = SP Smem = SSP Smem = TRN0 Smem = TRN1 dbl(Lmem) = RETA E N N N N N N N N S 3 3 3 3 3 3 3 3 C 1 1 1 1 1 1 1 5 Pipe X X X X X X X X Operator DA 1 1 1 1 1 1 1 1 CA . . . . . . . . SA . . . . . . . . DR . . . . . . . . CR . . . . . . . . DW 1 1 1 1 1 1 1 2 Buses ACB . . . . . . . . KAB . . . . . . . . KDB . . . . . . . . Notes

Store Extended Auxiliary Register Content to Memory (page 5-462)


dbl(Lmem) = XAsrc N 3 1 X 1 . . . . 2 . . .

Subtract Conditionally (page 5-463)


subc(Smem, ACx, ACy) N 3 1 X DU_ALU 1 . . 1 . . . . .

Subtraction (page 5-465)


[1] dst-AU = dst-AU src-AU dst-AU = dst-AU src-DU dst-DU = dst-DU src [2] dst-AU = dst-AU k4 dst-DU = dst-DU k4 [3] dst-AU = src-AU K16 dst-AU = src-DU K16 dst-DU = src K16 [4] dst-AU = src-AU Smem dst-AU = src-DU Smem dst-DU = src Smem Y Y Y Y Y N N N N N N 2 2 2 2 2 4 4 4 3 3 3 1 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X X AU_ALU AU_ALU DU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU AU_ALU AU_ALU DU_ALU . . . . . . . . 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . 1 . . . . 1 . . 1 . . . . . . . . . . . . . . . 1 1 1 1 1 . . . See Note 1. See Note 1. See Note 1.

Instruction Set Summary

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-30 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. [5] Instruction dst-AU = Smem src-AU dst-AU = Smem src-DU dst-DU = Smem src [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] ACy = ACy (ACx << Tx) ACy = ACy (ACx << #SHIFTW) ACy = ACx (K16 << #16) ACy = ACx (K16 << #SHFT) ACy = ACx (Smem << Tx) ACy = ACx (Smem << #16) ACy = (Smem << #16) ACx ACy = ACx uns(Smem) BORROW ACy = ACx uns(Smem) ACy = ACx (uns(Smem) << #SHIFTW) ACy = ACx dbl(Lmem) ACy = dbl(Lmem) ACx ACx = (Xmem << #16) (Ymem << #16) E N N N Y Y N N N N N N N N N N N S 3 3 3 2 3 4 4 3 3 3 3 3 4 3 3 3 C 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Pipe X X X X X X X X X X X X X X X X Operator AU_ALU AU_ALU DU_ALU DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT DU_ALU DU_ALU + DU_SHIFT DU_ALU + DU_SHIFT DU_ALU DU_ALU DU_ALU DU_ALU DU_ALU + DU_SHIFT DU_ALU DU_ALU DU_ALU DA 1 1 1 . . . . 1 1 1 1 1 1 1 1 2 CA . . . . . . . . . . . . . . . . SA . . . . . . . . . . . . . . . . DR 1 1 1 . . . . 1 1 1 1 1 1 2 2 2 CR . . . . . . . . . . . . . . . . DW . . . . . . . . . . . . . . . . Buses ACB . 1 . . . . . . . . . . . . . . KAB . . . . . . . . . . . . . . . . KDB . . . . . 1 1 . . . . . . . . . See Note 1. Notes

Subtraction with Parallel Store Accumulator Content to Memory (page 5-490)


ACy = (Xmem << #16) ACx, Ymem = HI(ACy << T2) N 4 1 X DU_ALU + DU_SHIFT 2 . . 2 . 2 . . .

Swap Accumulator Content (page 5-492)


[1] [2] swap(AC0, AC2) swap(AC1, AC3) Y Y 2 2 1 1 X X DU_SWAP DU_SWAP . . . . . . . . . . . . . . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

SPRU375G Instruction Set Summary 4-31

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Swap Accumulator Pair Content (page 5-493)


swap(pair(AC0), pair(AC2)) Y 2 1 X DU_SWAP . . . . . . . . .

Swap Auxiliary Register Content (page 5-494)


[1] [2] [3] swap(AR0, AR1) swap(AR0, AR2) swap(AR1, AR3) Y Y Y 2 2 2 1 1 1 AD AD AD AU_SWAP AU_SWAP AU_SWAP . . . . . . . . . . . . . . . . . . . . . . . . . . .

Swap Auxiliary Register Pair Content (page 5-495)


swap(pair(AR0), pair(AR2)) Y 2 1 AD AU_SWAP . . . . . . . . .

Swap Auxiliary and Temporary Register Content (page 5-496)


[1] [2] [3] [4] swap(AR4, T0) swap(AR5, T1) swap(AR6, T2) swap(AR7, T3) Y Y Y Y 2 2 2 2 1 1 1 1 AD AD AD AD AU_SWAP AU_SWAP AU_SWAP AU_SWAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Swap Auxiliary and Temporary Register Pair Content (page 5-498)


[1] [2] swap(pair(AR4), pair(T0)) swap(pair(AR6), pair(T2)) Y Y 2 2 1 1 AD AD AU_SWAP AU_SWAP . . . . . . . . . . . . . . . . . .

Swap Auxiliary and Temporary Register Pairs Content (page 5-500)


swap(block(AR4), block(T0)) Y 2 1 AD AU_SWAP . . . . . . . . .

Swap Temporary Register Content (page 5-502)


[1] [2] swap(T0, T2) swap(T1, T3) Y Y 2 2 1 1 AD AD AU_SWAP AU_SWAP . . . . . . . . . . . . . . . . . .

Instruction Set Summary

Swap Temporary Register Pair Content (page 5-503)


swap(pair(T0), pair(T2)) Y 2 1 AD AU_SWAP . . . . . . . . .

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

4-32 Instruction Set Summary SPRU375G

Instruction Set Summary

Table 41. Algebraic Instruction Set Summary (Continued)


Address Generation Unit No. Instruction E S C Pipe Operator DA CA SA DR CR DW Buses ACB KAB KDB Notes

Test Accumulator, Auxiliary, or Temporary Register Bit (page 5-504)


[1] TC1 = bit(src-AU, Baddr) TC1 = bit(src-DU, Baddr) [2] TC2 = bit(src-AU, Baddr) TC2 = bit(src-DU, Baddr) N N N N 3 3 3 3 1 1 1 1 X X X X AU_ALU DU_ALU AU_ALU DU_ALU 1 1 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Test Accumulator, Auxiliary, or Temporary Register Bit Pair (page 5-506)


bit(src-AU, pair(Baddr)) bit(src-DU, pair(Baddr)) N N 3 3 1 1 X X AU_ALU DU_ALU 1 1 . . . . . . . . . . . . . . . .

Test Memory Bit (page 5-508)


[1] [2] TCx = bit(Smem, src) TCx = bit(Smem, k4) N N 3 3 1 1 X X AU_ALU AU_ALU 1 1 . . . . 1 1 . . . . . . . . . 1

Test and Clear Memory Bit (page 5-511)


[1] [2] TC1 = bit(Smem, k4), bit(Smem, k4) = #0 TC2 = bit(Smem, k4), bit(Smem, k4) = #0 N N 3 3 1 1 X X AU_ALU AU_ALU 1 1 . . . . 1 1 . . 1 1 . . . . 1 1

Test and Complement Memory Bit (page 5-512)


[1] [2] TC1 = bit(Smem, k4), cbit(Smem, k4) TC2 = bit(Smem, k4), cbit(Smem, k4) N N 3 3 1 1 X X AU_ALU AU_ALU 1 1 . . . . 1 1 . . 1 1 . . . . 1 1

Test and Set Memory Bit (page 5-513)


[1] [2] TC1 = bit(Smem, k4), bit(Smem, k4) = #1 TC2 = bit(Smem, k4), bit(Smem, k4) = #1 N N 3 3 1 1 X X AU_ALU AU_ALU 1 1 . . . . 1 1 . . 1 1 . . . . 1 1

Notes:

1) dst-DU, src-AU or dst-DU, src-DU 2) dst-DU, src-AU or dst-AU, src-DU

Chapter 5

Instruction Set Descriptions


This chapter provides detailed information on the TMS320C55x DSP algebraic instruction set. See Section 1.1, Instruction Set Terms, Symbols, and Abbreviations, for definitions of symbols and abbreviations used in the description of each instruction. See Chapter 4 for a summary of the instruction set.

5-1

Absolute Distance (abdst)

Absolute Distance
Syntax Characteristics
No. [1] Syntax abdst(Xmem, Ymem, ACx, ACy) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0110 XXXM MMYY YMMM DDDD 1111 xxn% ACx, ACy, Xmem, Ymem This instruction executes two operations in parallel: one in the D-unit MAC and one in the D-unit ALU:
ACy = ACy + |HI(ACx)| ACx = (Xmem << #16) (Ymem << #16)

The absolute value of accumulator ACx content is computed and added to accumulator ACy content through the D-unit MAC. When an overflow is detected according to M40:
- the destination accumulator overflow status bit (ACOVy) is set - the destination register (ACy) is saturated according to SATD

The Ymem content shifted left 16 bits is subtracted from the Xmem content shifted left 16 bits in the D-unit ALU.
- Input operands (Xmem and Ymem) are sign extended to 40 bits according

to SXMD.
- CARRY status bit depends on M40. Subtraction borrow bit is reported in

CARRY status bit. It is the logical complement of CARRY status bit.


- When an overflow is detected according to M40: J J

the destination accumulator overflow status bit (ACOVx) is set the destination register (ACx) is saturated according to SATD

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, the subtract operation does not have any overflow detection, report, and saturation after the shifting operation. Status Bits Affected by Affects Repeat
5-2

C54CM, FRCT, M40, SATD, SXMD ACOVx, ACOVy, CARRY

This instruction can be repeated.


Instruction Set Descriptions SPRU375G

Absolute Distance (abdst)

See Also

See the following other related instructions:


- Square Distance

Example
Syntax abdst(*AR0+, *AR1, AC0, AC1) Description The absolute value of the content of AC0 is added to the content of AC1 and the result is stored in AC1. The content addressed by AR1 is subtracted from the content addressed by AR0 and the result is stored in AC0. The content of AR0 is incremented by 1.
After 00 0000 0000 00 E800 0000 202 302 3400 EF00 0 0 0 1 1 AC0 AC1 AR0 AR1 202 302 ACOV0 ACOV1 CARRY M40 SXMD 00 4500 0000 00 E800 0000 203 302 3400 EF00 0 0 0 1 1

Before AC0 AC1 AR0 AR1 202 302 ACOV0 ACOV1 CARRY M40 SXMD

SPRU375G

Instruction Set Descriptions

5-3

Absolute Value

Absolute Value
Syntax Characteristics
No. [1] Syntax dst = |src| Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst, src

0011 001E FSSS FDDD

This instruction computes the absolute value of the source register (src).
- When the destination register (dst) is an accumulator: J J

The operation is performed on 40 bits in the D-unit ALU. If an auxiliary or temporary register is the source operand of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended to 40 bits according to SXMD. If M40 = 0, the sign of the source register is extracted at bit position 31. If src(31) = 1, the source register content is negated. If src(31) = 0, the source register content is moved to the destination accumulator. If M40 = 1, the sign of the source register is extracted at bit position 39. If src(39) = 1, the source register content is negated. If src(39) = 0, the source register content is moved to the destination accumulator. During the 40-bit move operation, an overflow and CARRY bit status are detected according to M40: H H H The destination accumulator overflow status bit (ACOVx) is set. The destination register is saturated according to SATD. The CARRY status bit is updated as follows: If the result of the operation stored in the destination register is 0, CARRY is set; otherwise, CARRY is cleared.

- When the destination register (dst) is an auxiliary or temporary register: J J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. The sign of the source register is extracted at bit position 15. If src(15) = 1, the source register content is negated. If src(15) = 0, the source register content is moved to the destination register. Overflow is detected at bit position 15. The destination register is saturated according to SATA.
SPRU375G

J 5-4

Instruction Set Descriptions

Absolute Value

Compatibility with C54x devices (C54CM = 1) When C54CM =1, this instruction is executed as if M40 status bit was locally set to 1. To ensure compatibility versus overflow detection and saturation of destination accumulator, this instruction must be executed with M40 = 0. Status Bits Affected by Affects Repeat See Also C54CM, M40, SATA, SATD, SXMD ACOVx, CARRY

This instruction can be repeated. See the following other related instructions:
- Addition with Absolute Value

Example 1
Syntax AC1 = |AC0|
Before AC1 AC0 M40 00 0000 2000 82 0000 1234 1

Description The absolute value of the content of AC0 is stored in AC1.


After AC1 AC0 M40 7D FFFF EDCC 82 0000 1234 1

Example 2
Syntax AC1 = |AR1|
Before AC1 AR1 CARRY 00 0000 2000 0000 0

Description The absolute value of the content of AR1 is stored in AC1.


After AC1 AR1 CARRY 00 0000 0000 0000 1

Example 3
Syntax AC1 = |AR1| Description The absolute value of the content of AR1 is stored in AC1. Since SXMD = 1, AR1 content is sign extended. The resulting 40-bit data is negated since M40 = 0 and AR1(31) = 1.
After 00 0000 2000 8700 0 1 AC1 AR1 M40 SXMD 00 0000 7900 8700 0 1

Before AC1 AR1 M40 SXMD

SPRU375G

Instruction Set Descriptions

5-5

Absolute Value

Example 4
Syntax T1 = |AC0| Description The absolute value of the content of AC0(150) is stored in T1. The sign bit is extracted at AC0(15). Since AC0(15) = 0, T1 = AC0(150).
After 2000 80 0002 1234 T1 AC0 1234 80 0002 1234

Before T1 AC0

Example 5
Syntax T1 = |AC0| Description The absolute value of the content of AC0(150) is stored in T1. The sign bit is extracted at AC0(15). Since AC0(15) = 1, T1 equals the negated value of AC0(150).
After 2000 80 0002 9234 T1 AC0 6DCC 80 0002 9234

Before T1 AC0

5-6

Instruction Set Descriptions

SPRU375G

Addition

Addition
Syntax Characteristics
No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] Syntax dst = dst + src dst = dst + k4 dst = src + K16 dst = src + Smem ACy = ACy + (ACx << Tx) ACy = ACy + (ACx << #SHIFTW) ACy = ACx + (K16 << #16) ACy = ACx + (K16 << #SHFT) ACy = ACx + (Smem << Tx) ACy = ACx + (Smem << #16) ACy = ACx + uns(Smem) + CARRY ACy = ACx + uns(Smem) ACy = ACx + (uns(Smem) << #SHIFTW) ACy = ACx + dbl(Lmem) ACx = (Xmem << #16) + (Ymem << #16) Smem = Smem + K16 Parallel Enable Bit Yes Yes No No Yes Yes No No No No No No No No No No Size 2 2 4 3 2 3 4 4 3 3 3 3 4 3 3 4 Cycles 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Pipeline X X X X X X X X X X X X X X X X

Description Status Bits

These instructions perform an addition operation. Affected by Affects CARRY, C54CM, M40, SATA, SATD, SXMD ACOVx, ACOVy, CARRY

SPRU375G

Instruction Set Descriptions

5-7

Addition

See Also

See the following other related instructions:


- Addition or Subtraction Conditionally - Addition or Subtraction Conditionally with Shift - Addition with Absolute Value - Addition with Parallel Store Accumulator Content to Memory - Addition, Subtraction, or Move Accumulator Content Conditionally - Dual 16-Bit Additions - Dual 16-Bit Addition and Subtraction - Dual 16-Bit Subtraction and Addition - Subtraction

5-8

Instruction Set Descriptions

SPRU375G

Addition

Addition
Syntax Characteristics
No. [1] Syntax dst = dst + src Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst, src

0010 010E FSSS FDDD

This instruction performs an addition operation between two registers.


- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are sign extended to 40 bits according to SXMD. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended according to SXMD. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.

J J

- When the destination (dst) operand is an auxiliary or temporary register: J J J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. Addition overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 + AC1 Description The content of AC1 is added to the content of AC0 and the result is stored in AC0.

M40, SATA, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-9

Addition

Addition
Syntax Characteristics
No. [2] Syntax dst = dst + k4 Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst, k4

0100 000E kkkk FDDD

This instruction performs an addition operation between a register content and a 4-bit unsigned constant, k4.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.

- When the destination (dst) operand is an auxiliary or temporary register: J J J

The operation is performed on 16 bits in the A-unit ALU. Addition overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 + k4 Description The content of AC0 is added to an unsigned 4-bit value and the result is stored in AC0.

M40, SATA, SATD ACOVx, CARRY

This instruction can be repeated.

5-10

Instruction Set Descriptions

SPRU375G

Addition

Addition
Syntax Characteristics
No. [3] Syntax dst = src + K16 Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description dst, K16, src

0111 1011 KKKK KKKK KKKK KKKK FDDD FSSS

This instruction performs an addition operation between a register content and a 16-bit signed constant, K16.
- When the destination (dst) operand is an accumulator: J J

The operation is performed on 40 bits in the D-unit ALU. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended according to SXMD. The 16-bit constant, K16, is sign extended to 40 bits according to SXMD. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD. The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. Addition overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

J J J

- When the destination (dst) operand is an auxiliary or temporary register: J J J J

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC0 + #2E00h Description The content of AC0 is added to the signed 16-bit value (2E00h) and the result is stored in AC1.

M40, SATA, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-11

Addition

Addition
Syntax Characteristics
No. [4] Syntax dst = src + Smem Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description dst, Smem, src

1101 0110 AAAA AAAI FDDD FSSS

This instruction performs an addition operation between a register content and the content of a memory (Smem) location.
- When the destination (dst) operand is an accumulator: J J

The operation is performed on 40 bits in the D-unit ALU. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended according to SXMD. The content of the memory location is sign extended to 40 bits according to SXMD. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.

J J J

- When the destination (dst) operand is an auxiliary or temporary register: J J J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. Addition overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects
5-12 Instruction Set Descriptions

M40, SATA, SATD, SXMD ACOVx, CARRY


SPRU375G

Addition

Repeat Example
Syntax T1 = T0 + *AR3+

This instruction can be repeated.

Description The content of T0 is added to the content addressed by AR3 and the result is stored in T1. AR3 is incremented by 1.
After 0302 EF00 3300 0 0 AR3 302 T0 T1 CARRY 0303 EF00 3300 2200 1

Before AR3 302 T0 T1 CARRY

SPRU375G

Instruction Set Descriptions

5-13

Addition

Addition
Syntax Characteristics
No. [5] Syntax ACy = ACy + (ACx << Tx) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 101E DDSS ss00

This instruction performs an addition operation between an accumulator content ACy and an accumulator content ACx shifted by the content of Tx.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1:
- An intermediary shift operation is performed as if M40 is locally set to 1 and

no overflow detection, report, and saturation is done after the shifting operation.
- The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of

Tx define a shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 + (AC1 << T0) Description The content of AC1 shifted by the content of T0 is added to the content of AC0 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-14

Instruction Set Descriptions

SPRU375G

Addition

Addition
Syntax Characteristics
No. [6] Syntax ACy = ACy + (ACx << #SHIFTW) Parallel Enable Bit Yes Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, SHIFTW

0001 000E DDSS 0011 xxSH IFTW

This instruction performs an addition operation between an accumulator content ACy and an accumulator content ACx shifted by the 6-bit value, SHIFTW.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 + (AC1 << #31) Description The content of AC1 shifted left by 31 bits is added to the content of AC0 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-15

Addition

Addition
Syntax Characteristics
No. [7] Syntax ACy = ACx + (K16 << #16) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, K16

0111 1010 KKKK KKKK KKKK KKKK SSDD 000x

This instruction performs an addition operation between an accumulator content ACx and a 16-bit signed constant, K16, shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + (#2E00h << #16) Description A signed 16-bit value (2E00h) shifted left by 16 bits is added to the content of AC1 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-16

Instruction Set Descriptions

SPRU375G

Addition

Addition
Syntax Characteristics
No. [8] Syntax ACy = ACx + (K16 << #SHFT) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

0111 0000 KKKK KKKK KKKK KKKK SSDD SHFT ACx, ACy, K16, SHFT This instruction performs an addition operation between an accumulator content ACx and a 16-bit signed constant, K16, shifted left by the 4-bit value, SHFT.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + (#2E00h << #15) Description A signed 16-bit value (2E00h) shifted left by 15 bits is added to the content of AC1 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-17

Addition

Addition
Syntax Characteristics
No. [9] Syntax ACy = ACx + (Smem << Tx) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Tx, Smem

1101 1101 AAAA AAAI SSDD ss00

This instruction performs an addition operation between an accumulator content ACx and the content of a memory (Smem) location shifted by the content of Tx.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1:
- An intermediary shift operation is performed as if M40 is locally set to 1 and

no overflow detection, report, and saturation is done after the shifting operation.
- The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of

Tx define a shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat
5-18

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.


Instruction Set Descriptions SPRU375G

Addition

Example
Syntax AC0 = AC1 + (*AR1 << T0) Description The content addressed by AR1 shifted left by the content of T0 is added to the content of AC1 and the result is stored in AC0.
After 00 0000 0000 00 2300 0000 000C 0200 0300 0 0 0 0 AC0 AC1 T0 AR1 200 SXMD M40 ACOV0 CARRY 00 2330 0000 00 2300 0000 000C 0200 0300 0 0 0 1

Before AC0 AC1 T0 AR1 200 SXMD M40 ACOV0 CARRY

SPRU375G

Instruction Set Descriptions

5-19

Addition

Addition
Syntax Characteristics
No. [10] Syntax ACy = ACx + (Smem << #16) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 1110 AAAA AAAI SSDD 0100

This instruction performs an addition operation between an accumulator content ACx and the content of a memory (Smem) location shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. If the result

of the addition generates a carry, the CARRY status bit is set; otherwise, the CARRY status bit is not affected.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + (*AR3 << #16) Description The content addressed by AR3 shifted left by 16 bits is added to the content of AC1 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-20

Instruction Set Descriptions

SPRU375G

Addition

Addition
Syntax Characteristics
No. [11] Syntax ACy = ACx + uns(Smem) + CARRY Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 1111 AAAA AAAI SSDD 100u

This instruction performs an addition operation of the accumulator content ACx, the content of a memory (Smem) location, and the value of the CARRY status bit.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

- Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + uns(*AR3) + CARRY Description The CARRY status bit and the unsigned content addressed by AR3 are added to the content of AC1 and the result is stored in AC0.

CARRY, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-21

Addition

Addition
Syntax Characteristics
No. [12] Syntax ACy = ACx + uns(Smem) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 1111 AAAA AAAI SSDD 110u

This instruction performs an addition operation between an accumulator content ACx and the content of a memory (Smem) location.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

- Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + uns(*AR3) Description The unsigned content addressed by AR3 is added to the content of AC1 and the result is stored in AC0.

M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-22

Instruction Set Descriptions

SPRU375G

Addition

Addition
Syntax Characteristics
No. [13] Syntax ACy = ACx + (uns(Smem) << #SHIFTW) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1111 1001 AAAA AAAI uxSH IFTW SSDD 00xx ACx, ACy, SHIFTW, Smem This instruction performs an addition operation between an accumulator content ACx and the content of a memory (Smem) location shifted by the 6-bit value, SHIFTW.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

- The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax AC0 = AC1 + (uns(*AR3) << #31) Description The unsigned content addressed by AR3 shifted left by 31 bits is added to the content of AC1 and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-23

Addition

Addition
Syntax Characteristics
No. [14] Syntax ACy = ACx + dbl(Lmem) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Lmem

1110 1101 AAAA AAAI SSDD 000n

This instruction performs an addition operation between an accumulator content ACx and the content of data memory operand dbl(Lmem).
- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + dbl(*AR3+) Description The content (long word) addressed by AR3 and AR3 + 1 is added to the content of AC1 and the result is stored in AC0. Because this instruction is a long-operand instruction, AR3 is incremented by 2 after the execution.

M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-24

Instruction Set Descriptions

SPRU375G

Addition

Addition
Syntax Characteristics
No. [15] Syntax ACx = (Xmem << #16) + (Ymem << #16) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Xmem, Ymem

1000 0001 XXXM MMYY YMMM 00DD

This instruction performs an addition operation between the content of data memory operand Xmem shifted left 16 bits, and the content of data memory operand Ymem shifted left 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = (*AR3 << #16) + (*AR4 << #16) Description The content addressed by AR3 shifted left by 16 bits is added to the content addressed by AR4 shifted left by 16 bits and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-25

Addition

Addition
Syntax Characteristics
No. [16] Syntax Smem = Smem + K16 Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description K16, Smem

1111 0111 AAAA AAAI KKKK KKKK KKKK KKKK

This instruction performs an addition operation between a 16-bit signed constant, K16, and the content of a memory (Smem) location.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD and

shifted by 16 bits to the MSBs before being added.


- Addition overflow is detected at bit position 31. If an overflow is detected,

accumulator 0 overflow status bit (ACOV0) is set.


- Addition carry report in CARRY status bit is extracted at bit position 31. - If SATD is 1 when an overflow is detected, the result is saturated before

being stored in memory. Saturation values are 7FFFh or 8000h. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat SATD, SXMD ACOV0, CARRY

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax *AR3 = *AR3 + #2E00h Description The content addressed by AR3 is added to a signed 16-bit value (2E00h) and the result is stored back into the location addressed by AR3.

5-26

Instruction Set Descriptions

SPRU375G

Addition with Absolute Value

Addition with Absolute Value


Syntax Characteristics
No. [1] Syntax ACy = rnd(ACy + |ACx|) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy

0101 010E DDSS 000%

This instruction computes the absolute value of accumulator ACx and adds the result to accumulator ACy. This instruction is performed in the D-unit MAC:
- The absolute value of accumulator ACx is computed by multiplying

ACx(3216) by 00001h or 1FFFFh depending on bit 32 of the source accumulator.


- If FRCT = 1, the absolute value is multiplied by 2. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD.
- The result of the absolute value of the higher part of ACx is in the lower

part of ACy. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-27

Addition with Absolute Value

See Also

See the following other related instructions:


- Absolute Value - Addition - Addition or Subtraction Conditionally - Addition or Subtraction Conditionally with Shift - Addition, Subtraction, or Move Accumulator Content Conditionally

Example
Syntax AC0 = AC0 + |AC1| Description The absolute value of AC1 is added to the content of AC0 and the result is stored in AC0.

5-28

Instruction Set Descriptions

SPRU375G

Addition with Parallel Store Accumulator Content to Memory

Addition with Parallel Store Accumulator Content to Memory


Syntax Characteristics
No. [1] Syntax ACy = ACx + (Xmem << #16), Ymem = HI(ACy << T2) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0111 XXXM MMYY YMMM SSDD 100x xxxx ACx, ACy, T2, Xmem, Ymem This instruction performs two operations in parallel: addition and store. The first operation performs an addition between an accumulator content ACx and the content of data memory operand Xmem shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. When

C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation.
- When an overflow is detected, the accumulator is saturated according to

SATD. The second operation shifts the accumulator ACy by the content of T2 and stores ACy(3116) to data memory operand Ymem. If the 16-bit value in T2 is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value.
- The input operand is shifted in the D-unit shifter according to SXMD. - After the shift, the high part of the accumulator, ACy(3116), is stored to

the memory location. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When this instruction is executed with C54CM = 1, the 6 LSBs of T2 are used to determine the shift quantity. The 6 LSBs of T2 define a shift quantity within 32 to +31. When the 16-bit value in T2 is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1.
SPRU375G Instruction Set Descriptions 5-29

Addition with Parallel Store Accumulator Content to Memory

Status Bits

Affected by Affects

C54CM, M40, SATD, SXMD ACOVy, CARRY

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Addition - Store Accumulator Content to Memory - Subtraction with Parallel Store Accumulator Content to Memory

Example
Syntax AC0 = AC1 + (*AR3 << #16), *AR4 = HI(AC0 << T2) Description Both instructions are performed in parallel. The content addressed by AR3 shifted left by 16 bits is added to the content of AC1 and the result is stored in AC0. The content of AC0 is shifted by the content of T2, and AC0(3116) is stored at the address of AR4.

5-30

Instruction Set Descriptions

SPRU375G

Addition or Subtraction Conditionally (adsc)

Addition or Subtraction Conditionally


Syntax Characteristics
No. [1] [2] Syntax ACy = adsc(Smem, ACx, TC1) ACy = adsc(Smem, ACx, TC2) Parallel Enable Bit No No Size 3 3 Cycles 1 1 Pipeline X X

Opcode

TC1 TC2

1101 1110 AAAA AAAI SSDD 0000 1101 1110 AAAA AAAI SSDD 0001

Operands Description

ACx, ACy, Smem, TCx This instruction evaluates the selected TCx status bit and based on the result of the test, either an addition or a subtraction is performed. Evaluation of the condition on the TCx status bit is performed during the Execute phase of the instruction.
TC1 or TC2 0 1 Operation ACy = ACx (Smem << #16) ACy = ACx + (Smem << #16)

- TCx = 0, then ACy = ACx (Smem << #16):

This instruction subtracts the content of a memory (Smem) location shifted left by 16 bits from accumulator ACx and stores the result in accumulator ACy.
J J J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are sign extended to 40 bits according to SXMD. The shift operation is equivalent to the signed shift instruction. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.

- TCx = 1, then ACy = ACx + (Smem << #16):

This instruction performs an addition operation between accumulator ACx and the content of a memory (Smem) location shifted left by 16 bits and stores the result in accumulator ACy.
J J SPRU375G

The operation is performed on 40 bits in the D-unit ALU. Input operands are sign extended to 40 bits according to SXMD.
Instruction Set Descriptions 5-31

Addition or Subtraction Conditionally (adsc)

J J J

The shift operation is equivalent to the signed shift instruction. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat See Also C54CM, M40, SATD, SXMD, TCx ACOVy, CARRY

This instruction can be repeated. See the following other related instructions:
- Addition or Subtraction Conditionally with Shift - Addition, Subtraction, or Move Accumulator Content Conditionally

Example 1
Syntax AC0 = adsc(*AR3, AC1, TC1) Description If TC1 = 1, the content addressed by AR3 shifted left by 16 bits is added to the content of AC1 and the result is stored in AC0. If TC1 = 0, the content addressed by AR3 shifted left by 16 bits is subtracted from the content of AC1 and the result is stored in AC0.

Example 2
Syntax AC1 = adsc(*AR1, AC0, TC2) Description TC2 = 1, the content addressed by AR1 shifted left by 16 bits is added to the content of AC0 and the result is stored in AC1. The result generated an overflow and a carry.
After AC0 AC1 AR1 200 TC2 SXMD M40 ACOV1 CARRY

Before AC0 AC1 AR1 200 TC2 SXMD M40 ACOV1 CARRY

00 EC00 0000 00 0000 0000 0200 3300 1 0 0 0 0

00 EC00 0000 01 1F00 0000 0200 3300 1 0 0 1 1

5-32

Instruction Set Descriptions

SPRU375G

Addition or Subtraction Conditionally with Shift (ads2c)

Addition or Subtraction Conditionally with Shift


Syntax Characteristics
No. [1] Syntax ACy = ads2c(Smem, ACx, Tx, TC1, TC2) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description

1101 1101 AAAA AAAI SSDD ss10 ACx, ACy, Tx, Smem, TC1, TC2 This instruction evaluates the TC1 status bit and based on the result of the test, either an addition or a subtraction is performed; this instruction evaluates the TC2 status bit and based on the result of the test, either a shift left by 16 bits or the content of Tx is performed. Evaluation of the condition on the TCx status bits is performed during the Execute phase of the instruction.
TC1 0 0 1 1 TC2 0 1 0 1 Operation ACy = ACx (Smem << Tx) ACy = ACx (Smem << #16) ACy = ACx + (Smem << Tx) ACy = ACx + (Smem << #16)

- TC1 = 0 and TC2 = 0, then ACy = ACx (Smem << Tx):

This instruction subtracts the content of a memory (Smem) location shifted left by the content of Tx from an accumulator ACx and stores the result in accumulator ACy.
- TC1 = 0 and TC2 = 1, then ACy = ACx (Smem << #16):

This instruction subtracts the content of a memory (Smem) location shifted left by 16 bits from an accumulator ACx and stores the result in accumulator ACy.
J J J J

The operation is performed on 40 bits in the D-unit shifter. Input operands are sign extended to 40 bits according to SXMD. The shift operation is equivalent to the signed shift instruction. Overflow detection and CARRY status bit depends on M40. The subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit. When an overflow is detected, the accumulator is saturated according to SATD.
Instruction Set Descriptions 5-33

SPRU375G

Addition or Subtraction Conditionally with Shift (ads2c)

- TC1 = 1 and TC2 = 0, then ACy = ACx + (Smem << Tx):

This instruction performs an addition operation between an accumulator ACx and the content of a memory (Smem) location shifted left by the content of Tx and stores the result in accumulator ACy.
- TC1 = 1 and TC2 = 1, then ACy = ACx + (Smem << #16):

This instruction performs an addition operation between an accumulator ACx and the content of a memory (Smem) location shifted left by 16 bits and stores the result in accumulator ACy.
J J J J J

The operation is performed on 40 bits in the D-unit shifter. Input operands are sign extended to 40 bits according to SXMD. The shift operation is equivalent to the signed shift instruction. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1:
- An intermediary shift operation is performed as if M40 is locally set to 1 and

no overflow detection, report, and saturation is done after the shifting operation.
- The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of

Tx define a shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat See Also C54CM, M40, SATD, SXMD, TC1, TC2 ACOVy, CARRY

This instruction can be repeated. See the following other related instructions:
- Addition or Subtraction Conditionally - Addition, Subtraction, or Move Accumulator Content Conditionally

5-34

Instruction Set Descriptions

SPRU375G

Addition or Subtraction Conditionally with Shift (ads2c)

Example
Syntax AC2 = ads2c(*AR2, AC0, T1, TC1, TC2) Description TC1 = 1 and TC2 = 0, the content addressed by AR2 shifted left by the content of T1 is added to the content of AC0 and the result is stored in AC2. The result generated an overflow.

Before AC0 AC2 AR2 201 T1 TC1 TC2 M40 ACOV2 CARRY

00 EC00 0000 00 0000 0000 0201 3300 0002 1 0 0 0 0

After AC0 AC2 AR2 201 T1 TC1 TC2 M40 ACOV2 CARRY

00 EC00 0000 00 EC00 CC00 0201 3300 0002 1 0 0 1 0

SPRU375G

Instruction Set Descriptions

5-35

Addition, Subtraction, or Move Accumulator Content Conditionally (adsc)

Addition, Subtraction, or Move Accumulator Content Conditionally


Syntax Characteristics
No. [1] Syntax ACy = adsc(Smem, ACx, TC1, TC2) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem, TC1, TC2

1101 1110 AAAA AAAI SSDD 0010

This instruction evaluates the TCx status bits and based on the result of the test, an addition, a move, or a subtraction is performed. Evaluation of the condition on the TCx status bits is performed during the Execute phase of the instruction.
TC1 0 0 1 1 TC2 0 1 0 1 Operation ACy = ACx (Smem << #16) ACy = ACx ACy = ACx + (Smem << #16) ACy = ACx

- TC2 = 1, then ACy = ACx:

This instruction moves the content of ACx to ACy.


J J

The 40-bit move operation is performed in the D-unit ALU. During the 40-bit move operation, an overflow is detected according to M40: H H the destination accumulator overflow status bit (ACOVy) is set. the destination register (ACy) is saturated according to SATD.

- TC1 = 0 and TC2 = 0, then ACy = ACx (Smem << #16):

This instruction subtracts the content of a memory (Smem) location shifted left by 16 bits from accumulator ACx and stores the result in accumulator ACy.
J J J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are sign extended to 40 bits according to SXMD. The shift operation is equivalent to the signed shift instruction. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.
SPRU375G

5-36

Instruction Set Descriptions

Addition, Subtraction, or Move Accumulator Content Conditionally (adsc)

- TC1 = 1 and TC2 = 0, then ACy = ACx + (Smem << #16):

This instruction performs an addition operation between accumulator ACx and the content of a memory (Smem) location shifted left by 16 bits and stores the result in accumulator ACy.
J J J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are sign extended to 40 bits according to SXMD. The shift operation is equivalent to the signed shift instruction. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat See Also C54CM, M40, SATD, SXMD, TC1, TC2 ACOVy, CARRY

This instruction can be repeated. See the following other related instructions:
- Addition or Subtraction Conditionally - Addition or Subtraction Conditionally with Shift

Example
Syntax AC0 = adsc(*AR3, AC1, TC1, TC2) Description If TC2 = 1, the content of AC1 is stored in AC0. If TC2 = 0 and TC1 = 1, the content addressed by AR3 shifted left by 16 bits is added to the content of AC1 and the result is stored in AC0. If TC2 = 0 and TC1 = 0, the content addressed by AR3 shifted left by 16 bits is subtracted from the content of AC1 and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-37

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit Yes Yes No No Yes No No No

No. [1] [2] [3] [4] [5] [6] [7] [8]

Syntax dst = dst & src dst = src & k8 dst = src & k16 dst = src & Smem ACy = ACy & (ACx <<< #SHIFTW) ACy = ACx & (k16 <<< #16) ACy = ACx & (k16 <<< #SHFT) Smem = Smem & k16

Size 2 3 4 3 3 4 4 4

Cycles 1 1 1 1 1 1 1 1

Pipeline X X X X X X X X

Description

These instructions perform a bitwise AND operation:


- In the D-unit, if the destination operand is an accumulator. - In the A-unit ALU, if the destination operand is an auxiliary or temporary

register.
- In the A-unit ALU, if the destination operand is the memory.

Status Bits

Affected by Affects

C54CM none

See Also

See the following other related instructions:


- Bitwise AND Memory with Immediate Value and Compare to Zero - Bitwise OR - Bitwise Exclusive OR (XOR)

5-38

Instruction Set Descriptions

SPRU375G

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst = dst & src

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, src

0010 100E FSSS FDDD

This instruction performs a bitwise AND operation between two registers.


- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC1 = AC1 & AC0

This instruction can be repeated.

Description The content of AC0 is ANDed with the content of AC1 and the result is stored in AC1.

Before AC0 AC1 7E 2355 4FC0 0F E340 5678

After AC0 AC1 7E 2355 4FC0 0E 2340 4640

SPRU375G

Instruction Set Descriptions

5-39

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax dst = src & k8

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, k8, src

0001 100E kkkk kkkk FDDD FSSS

This instruction performs a bitwise AND operation between a source (src) register content and an 8-bit value, k8.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 & #FFh

This instruction can be repeated.

Description The content of AC1 is ANDed with the unsigned 8-bit value (FFh) and the result is stored in AC0.

5-40

Instruction Set Descriptions

SPRU375G

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax dst = src & k16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description dst, k16, src

0111 1101 kkkk kkkk kkkk kkkk FDDD FSSS

This instruction performs a bitwise AND operation between a source (src) register content and a 16-bit unsigned constant, k16.
-

When the destination (dst) operand is an accumulator:


J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 & #FFFFh

This instruction can be repeated.

Description The content of AC1 is ANDed with the unsigned 16-bit value (FFFFh) and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-41

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax dst = src & Smem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, Smem, src

1101 1001 AAAA AAAI FDDD FSSS

This instruction performs a bitwise AND operation between a source (src) register content and a memory (Smem) location.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 & *AR3

This instruction can be repeated.

Description The content of AC1 is ANDed with the content addressed by AR3 and the result is stored in AC0.

5-42

Instruction Set Descriptions

SPRU375G

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit Yes

No. [5]

Syntax ACy = ACy & (ACx <<< #SHIFTW)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, SHIFTW

0001 000E DDSS 0000 xxSH IFTW

This instruction performs a bitwise AND operation between an accumulator (ACy) content and an accumulator (ACx) content shifted by the 6-bit value, SHIFTW.
- The shift and AND operations are performed in one cycle in the D-unit

shifter.
- Input operands are zero extended to 40 bits. - The input operand (ACx) is shifted by a 6-bit immediate value in the D-unit

shifter.
- The CARRY status bit is not affected by the logical shift operation.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the intermediary logical shift is performed as if M40 is locally set to 1. The 8 upper bits of the 40-bit intermediary result are not cleared. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 & (AC1 <<< #30) Description The content of AC0 is ANDed with the content of AC1 logically shifted left by 30 bits and the result is stored in AC0.

C54CM none

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-43

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit No

No. [6]

Syntax ACy = ACx & (k16 <<< #16)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, k16

0111 1010 kkkk kkkk kkkk kkkk SSDD 010x

This instruction performs a bitwise AND operation between an accumulator (ACx) content and a 16-bit unsigned constant, k16, shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are zero extended to 40 bits. - The input operand (k16) is shifted 16 bits to the MSBs.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax

This instruction can be repeated.

Description The content of AC1 is ANDed with the unsigned 16-bit value (FFFFh) logically shifted left by 16 bits and the result is stored in AC0.

AC0 = AC1 & (#FFFFh <<< #16)

5-44

Instruction Set Descriptions

SPRU375G

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit No

No. [7]

Syntax ACy = ACx & (k16 <<< #SHFT)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

0111 0010 kkkk kkkk kkkk kkkk SSDD SHFT ACx, ACy, k16, SHFT This instruction performs a bitwise AND operation between an accumulator (ACx) content and a 16-bit unsigned constant, k16, shifted left by the 4-bit value, SHFT.
- The shift and AND operations are performed in one cycle in the D-unit

shifter.
- Input operands are zero extended to 40 bits. - The input operand (k16) is shifted by a 4-bit immediate value in the D-unit

shifter.
- The CARRY status bit is not affected by the logical shift operation.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax

This instruction can be repeated.

Description The content of AC1 is ANDed with the unsigned 16-bit value (FFFFh) logically shifted left by 15 bits and the result is stored in AC0.

AC0 = AC1 & (#FFFFh <<< #15)

SPRU375G

Instruction Set Descriptions

5-45

Bitwise AND

Bitwise AND
Syntax Characteristics
Parallel Enable Bit No

No. [8]

Syntax Smem = Smem & k16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description k16, Smem

1111 0100 AAAA AAAI kkkk kkkk kkkk kkkk

This instruction performs a bitwise AND operation between a memory (Smem) location and a 16-bit unsigned constant, k16.
- The operation is performed on 16 bits in the A-unit ALU. - The result is stored in memory.

Status Bits

Affected by Affects

none none

Repeat

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax *AR1 = *AR1 & #0FC0 Description The content addressed by AR1 is ANDed with the unsigned 16-bit value (FC0h) and the result is stored in the location addressed by AR1.
After 5678 *AR1 0640

Before *AR1

5-46

Instruction Set Descriptions

SPRU375G

Bitwise AND Memory with Immediate Value and Compare to Zero

Bitwise AND Memory with Immediate Value and Compare to Zero


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax TC1 = Smem & k16 TC2 = Smem & k16

Size 4 4

Cycles 1 1

Pipeline X X

Opcode

TC1 TC2

1111 0010 AAAA AAAI kkkk kkkk kkkk kkkk 1111 0011 AAAA AAAI kkkk kkkk kkkk kkkk

Operands Description

k16, Smem, TCx This instruction performs a bit field manipulation in the A-unit ALU. The 16-bit field mask, k16, is ANDed with the memory (Smem) operand and the result is compared to 0:
if( ((Smem) AND k16 ) == 0) TCx = 0 else TCx = 1

Status Bits

Affected by Affects

none TCx

Repeat

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated. See the following other related instructions:
- Bitwise AND

See Also

Example
Syntax TC1 = *AR0 & #0060h Description The unsigned 16-bit value (0060h) is ANDed with the content addressed by AR0. The result is 1, TC1 is set to 1.

Before *AR0 TC1 0040 0

After *AR0 TC1 0040 1

SPRU375G

Instruction Set Descriptions

5-47

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit Yes Yes No No Yes No No No

No. [1] [2] [3] [4] [5] [6] [7] [8]

Syntax dst = dst | src dst = src | k8 dst = src | k16 dst = src | Smem ACy = ACy | (ACx <<< #SHIFTW) ACy = ACx | (k16 <<< #16) ACy = ACx | (k16 <<< #SHFT) Smem = Smem | k16

Size 2 3 4 3 3 4 4 4

Cycles 1 1 1 1 1 1 1 1

Pipeline X X X X X X X X

Description

These instructions perform a bitwise OR operation:


- In the D-unit, if the destination operand is an accumulator. - In the A-unit ALU, if the destination operand is an auxiliary or temporary

register.
- In the A-unit ALU, if the destination operand is the memory.

Status Bits

Affected by Affects

C54CM none

See Also

See the following other related instructions:


- Bitwise AND - Bitwise Exclusive OR (XOR)

5-48

Instruction Set Descriptions

SPRU375G

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst = dst | src

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, src

0010 101E FSSS FDDD

This instruction performs a bitwise OR operation between two registers.


- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC0 | AC1

This instruction can be repeated.

Description The content of AC0 is ORed with the content of AC1 and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-49

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax dst = src | k8

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, k8, src

0001 101E kkkk kkkk FDDD FSSS

This instruction performs a bitwise OR operation between a source (src) register content and an 8-bit value, k8.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 | #FFh

This instruction can be repeated.

Description The content of AC1 is ORed with the unsigned 8-bit value (FFh) and the result is stored in AC0.

5-50

Instruction Set Descriptions

SPRU375G

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax dst = src | k16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description dst, k16, src

0111 1110 kkkk kkkk kkkk kkkk FDDD FSSS

This instruction performs a bitwise OR operation between a source (src) register content and a 16-bit unsigned constantk16.
-

When the destination (dst) operand is an accumulator:


J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 | #FFFFh

This instruction can be repeated.

Description The content of AC1 is ORed with the unsigned 16-bit value (FFFFh) and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-51

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax dst = src | Smem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, Smem, src

1101 1010 AAAA AAAI FDDD FSSS

This instruction performs a bitwise OR operation between a source (src) register content and a memory (Smem) location.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 | *AR3

This instruction can be repeated.

Description The content of AC1 is ORed with the content addressed by AR3 and the result is stored in AC0.

5-52

Instruction Set Descriptions

SPRU375G

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit Yes

No. [5]

Syntax ACy = ACy | (ACx <<< #SHIFTW)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, SHIFTW

0001 000E DDSS 0001 xxSH IFTW

This instruction performs a bitwise OR operation between an accumulator (ACy) content and and an accumulator (ACx) content shifted by the 6-bit value, SHIFTW.
- The shift and OR operations are performed in one cycle in the D-unit

shifter.
- Input operands are zero extended to 40 bits. - The input operand (ACx) is shifted by a 6-bit immediate value in the D-unit

shifter.
- The CARRY status bit is not affected by the logical shift operation.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the intermediary logical shift is performed as if M40 is locally set to 1. The 8 upper bits of the 40-bit intermediary result are not cleared. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC1 | (AC0 <<< #4) Description The content of AC1 is ORed with the content of AC0 logically shifted left by 4 bits and the result is stored in AC1.

C54CM none

This instruction can be repeated.

Before AC0 AC1 7E 2355 4FC0 0F E340 5678

After AC0 AC1 7E 2355 4FC0 0F F754 FE78

SPRU375G

Instruction Set Descriptions

5-53

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit No

No. [6]

Syntax ACy = ACx | (k16 <<< #16)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, k16

0111 1010 kkkk kkkk kkkk kkkk SSDD 011x

This instruction performs a bitwise OR operation between an accumulator (ACx) content and a 16-bit unsigned constant, k16, shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are zero extended to 40 bits. - The input operand (k16) is shifted 16 bits to the MSBs.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax

This instruction can be repeated.

Description The content of AC1 is ORed with the unsigned 16-bit value (FFFFh) logically shifted left by 16 bits and the result is stored in AC0.

AC0 = AC1 | (#FFFFh <<< #16)

5-54

Instruction Set Descriptions

SPRU375G

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit No

No. [7]

Syntax ACy = ACx | (k16 <<< #SHFT)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

0111 0011 kkkk kkkk kkkk kkkk SSDD SHFT ACx, ACy, k16, SHFT This instruction performs a bitwise OR operation between an accumulator (ACx) content and a 16-bit unsigned constant, k16, shifted left by the 4-bit value, SHFT.
- The shift and OR operations are performed in one cycle in the D-unit

shifter.
- Input operands are zero extended to 40 bits. - The input operand (k16) is shifted by a 4-bit immediate value in the D-unit

shifter.
- The CARRY status bit is not affected by the logical shift operation

Status Bits

Affected by Affects

none none

Repeat Example
Syntax

This instruction can be repeated.

Description The content of AC1 is ORed with the unsigned 16-bit value (FFFFh) logically shifted left by 15 bits and the result is stored in AC0.

AC0 = AC1 | (#FFFFh <<< #15)

SPRU375G

Instruction Set Descriptions

5-55

Bitwise OR

Bitwise OR
Syntax Characteristics
Parallel Enable Bit No

No. [8]

Syntax Smem = Smem | k16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description k16, Smem

1111 0101 AAAA AAAI kkkk kkkk kkkk kkkk

This instruction performs a bitwise OR operation between a memory (Smem) location and a 16-bit unsigned constant, k16.
- The operation is performed on 16 bits in the A-unit ALU. - The result is stored in memory.

Status Bits

Affected by Affects

none none

Repeat

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax *AR1 = *AR1 | #0FC0h Description The content addressed by AR1 is ORed with the unsigned 16-bit value (FC0h) and the result is stored in the location addressed by AR1.
After 5678 *AR1 5FF8

Before *AR1

5-56

Instruction Set Descriptions

SPRU375G

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit Yes Yes No No Yes No No No

No. [1] [2] [3] [4] [5] [6] [7] [8]

Syntax dst = dst ^ src dst = src ^ k8 dst = src ^ k16 dst = src ^ Smem ACy = ACy ^ (ACx <<< #SHIFTW) ACy = ACx ^ (k16 <<< #16) ACy = ACx ^ (k16 <<< #SHFT) Smem = Smem ^ k16

Size 2 3 4 3 3 4 4 4

Cycles 1 1 1 1 1 1 1 1

Pipeline X X X X X X X X

Description

These instructions perform a bitwise exclusive-OR (XOR) operation:


- In the D-unit, if the destination operand is an accumulator. - In the A-unit ALU, if the destination operand is an auxiliary or temporary

register.
- In the A-unit ALU, if the destination operand is the memory.

Status Bits

Affected by Affects

C54CM none

See Also

See the following other related instructions:


- Bitwise AND - Bitwise OR

SPRU375G

Instruction Set Descriptions

5-57

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst = dst ^ src

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, src

0010 110E FSSS FDDD

This instruction performs a bitwise exclusive-OR (XOR) operation between two registers.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC1 = AC1 ^ AC0

This instruction can be repeated.

Description The content of AC0 is XORed with the content of AC1 and the result is stored in AC1.

Before AC0 AC1 7E 2355 4FC0 0F E340 5678

After AC0 AC1 7E 2355 4FC0 71 C015 19B8

5-58

Instruction Set Descriptions

SPRU375G

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax dst = src ^ k8

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, k8, src

0001 110E kkkk kkkk FDDD FSSS

This instruction performs a bitwise exclusive-OR (XOR) operation between a source (src) register content and an 8-bit value, k8.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 ^ #FFh

This instruction can be repeated.

Description The content of AC1 is XORed with the unsigned 8-bit value (FFh) and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-59

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax dst = src ^ k16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description dst, k16, src

0111 1111 kkkk kkkk kkkk kkkk FDDD FSSS

This instruction performs a bitwise exclusive-OR (XOR) operation between a source (src) register content and a 16-bit unsigned constant, k16.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 ^ #FFFFh

This instruction can be repeated.

Description The content of AC1 is XORed with the unsigned 16-bit value (FFFFh) and the result is stored in AC0.

5-60

Instruction Set Descriptions

SPRU375G

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax dst = src ^ Smem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, Smem, src

1101 1011 AAAA AAAI FDDD FSSS

This instruction performs a bitwise exclusive-OR (XOR) operation between a source (src) register content and a memory (Smem) location.
- When the destination (dst) operand is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are zero extended to 40 bits. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat Example
Syntax AC0 = AC1 ^ *AR3

This instruction can be repeated.

Description The content of AC1 is XORed with the content addressed by AR3 and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-61

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit Yes

No. [5]

Syntax ACy = ACy ^ (ACx <<< #SHIFTW)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, SHIFTW

0001 000E DDSS 0010 xxSH IFTW

This instruction performs a bitwise exclusive-OR (XOR) operation between an accumulator (ACy) content and an accumulator (ACx) content shifted by the 6-bit value, SHIFTW.
- The shift and XOR operations are performed in one cycle in the D-unit

shifter.
- Input operands are zero extended to 40 bits. - The input operand (ACx) is shifted by a 6-bit immediate value in the D-unit

shifter.
- The CARRY status bit is not affected by the logical shift operation.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the intermediary logical shift is performed as if M40 is locally set to 1. The 8 upper bits of the 40-bit intermediary result are not cleared. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 ^ (AC1 <<< #30) Description The content of AC0 is XORed with the content of AC1 logically shifted left by 30 bits and the result is stored in AC0.

C54CM none

This instruction can be repeated.

5-62

Instruction Set Descriptions

SPRU375G

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit No

No. [6]

Syntax ACy = ACx ^ (k16 <<< #16)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, k16

0111 1010 kkkk kkkk kkkk kkkk SSDD 100x

This instruction performs a bitwise exclusive-OR (XOR) operation between an accumulator (ACx) content and a 16-bit unsigned constant, k16, shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are zero extended to 40 bits. - The input operand (k16) is shifted 16 bits to the MSBs.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax

This instruction can be repeated.

Description The content of AC1 is XORed with the unsigned 16-bit value (FFFFh) logically shifted left by 16 bits and the result is stored in AC0.

AC0 = AC1 ^ (#FFFFh <<< #16)

SPRU375G

Instruction Set Descriptions

5-63

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit No

No. [7]

Syntax ACy = ACx ^ (k16 <<< #SHFT)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

0111 0100 kkkk kkkk kkkk kkkk SSDD SHFT ACx, ACy, k16, SHFT This instruction performs a bitwise exclusive-OR (XOR) operation between an accumulator (ACx) content and a 16-bit unsigned constant, k16, shifted left by the 4-bit value, SHFT.
- The shift and XOR operations are performed in one cycle in the D-unit

shifter.
- Input operands are zero extended to 40 bits. - The input operand (k16) is shifted by a 4-bit immediate value in the D-unit

shifter.
- The CARRY status bit is not affected by the logical shift operation.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax

This instruction can be repeated.

Description The content of AC1 is XORed with the unsigned 16-bit value (FFFFh) logically shifted left by 15 bits and the result is stored in AC0.

AC0 = AC1 ^ (#FFFFh <<< #15)

5-64

Instruction Set Descriptions

SPRU375G

Bitwise Exclusive OR (XOR)

Bitwise Exclusive OR (XOR)


Syntax Characteristics
Parallel Enable Bit No

No. [8]

Syntax Smem = Smem ^ k16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description k16, Smem

1111 0110 AAAA AAAI kkkk kkkk kkkk kkkk

This instruction performs a bitwise exclusive-OR (XOR) operation between a memory (Smem) location and a 16-bit unsigned constant, k16.
- The operation is performed on 16 bits in the A-unit ALU. - The result is stored in memory.

Status Bits

Affected by Affects

none none

Repeat

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax *AR3 = *AR3 ^ #FFFFh Description The content addressed by AR3 is XORed with the unsigned 16-bit value (FFFFh) and the result is stored in the location addressed by AR3.

SPRU375G

Instruction Set Descriptions

5-65

Branch Conditionally (if goto)

Branch Conditionally
Syntax Characteristics
Parallel Enable Bit No Yes No No Cycles 6/5 6/5 6/5 5/5

No. [1] [2] [3] [4]

Syntax if (cond) goto l4 if (cond) goto L8 if (cond) goto L16 if (cond) goto P24

Size 2 3 4 5

Pipeline R R R R

x/y cycles: x cycles = condition true, y cycles = condition false

Description

These instructions evaluate a single condition defined by the cond field in the read phase of the pipeline. If the condition is true, a branch occurs to the program address label assembled into l4, Lx, or P24. There is a 1-cycle latency on the condition setting. A single condition can be tested as determined by the cond field of the instruction. See Table 13 for a list of conditions. The instruction selection depends on the branch offset between the current PC value and the program branch address specified by the label. These instructions cannot be repeated.

Status Bits

Affected by Affects

ACOVx, CARRY, C54CM, M40, TCx ACOVx

See Also

See the following other related instructions:


- Branch Unconditionally - Branch on Auxiliary Register Not Zero - Call Conditionally - Compare and Branch

5-66

Instruction Set Descriptions

SPRU375G

Branch Conditionally (if goto)

Branch Conditionally
Syntax Characteristics
Parallel Enable Bit No Cycles 6/5

No. [1]

Syntax if (cond) goto l4

Size 2

Pipeline R

x/y cycles: x cycles = condition true, y cycles = condition false

Opcode Operands Description cond, l4

0110 0lll 1CCC CCCC

This instruction evaluates a single condition defined by the cond field in the read phase of the pipeline. If the condition is true, a branch occurs to the program address label assembled into l4. There is a 1-cycle latency on the condition setting. A single condition can be tested as determined by the cond field of the instruction. See Table 13 for a list of conditions. Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1.

Status Bits

Affected by Affects

ACOVx, CARRY, C54CM, M40, TCx ACOVx

Repeat Example
Syntax if (AC0 != #0) goto branch

This instruction cannot be repeated.

Description The content of AC0 is not equal to 0, control is passed to the program address label defined by branch.

if (AC0 != #0) goto branch branch :


Before AC0 PC 00 0000 3000 004055 After AC0 PC 00 0000 3000 00405A

address: 004057 00405A

SPRU375G

Instruction Set Descriptions

5-67

Branch Conditionally (if goto)

Branch Conditionally
Syntax Characteristics
No. [2] [3] Syntax if (cond) goto L8 if (cond) goto L16 Parallel Enable Bit Yes No Size 3 4 Cycles 6/5 6/5 Pipeline R R

x/y cycles: x cycles = condition true, y cycles = condition false

Opcode

L8 L16

0000 010E xCCC CCCC LLLL LLLL 0110 1101 xCCC CCCC LLLL LLLL LLLL LLLL

Operands Description

cond, Lx This instruction evaluates a single condition defined by the cond field in the read phase of the pipeline. If the condition is true, a branch occurs to the program address label assembled into Lx. There is a 1-cycle latency on the condition setting. A single condition can be tested as determined by the cond field of the instruction. See Table 13 for a list of conditions. Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1.

Status Bits

Affected by Affects

ACOVx, CARRY, C54CM, M40, TCx ACOVx

Repeat Example
Syntax if (AC0 != #0) goto branch branch :

This instruction cannot be repeated.

Description The content of AC0 is not equal to 0, control is passed to the program address label defined by branch. 00305A

if (AC0 != #0) goto branch


Before AC0 PC 00 0000 3000 004055 After AC0 PC 00 0000 3000 00305A

address: 004057

5-68

Instruction Set Descriptions

SPRU375G

Branch Conditionally (if goto)

Branch Conditionally
Syntax Characteristics
No. [4] Syntax if (cond) goto P24 Parallel Enable Bit No Size 5 Cycles 5/5 Pipeline R

x/y cycles: x cycles = condition true, y cycles = condition false

Opcode Operands Description

0110 1000 xCCC CCCC PPPP PPPP PPPP PPPP PPPP PPPP cond, P24 This instruction evaluates a single condition defined by the cond field in the read phase of the pipeline. If the condition is true, a branch occurs to the program address label assembled into P24. There is a 1-cycle latency on the condition setting. A single condition can be tested as determined by the cond field of the instruction. See Table 13 for a list of conditions. Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1.

Status Bits

Affected by Affects

ACOVx, CARRY, C54CM, M40, TCx ACOVx

Repeat Example
Syntax if (AC0 != #0) goto branch

This instruction cannot be repeated.

Description The content of AC0 is not equal to 0, control is passed to the program address label defined by branch.

.sect code1 if (AC0 != #0) goto branch .sect code2 branch :


Before AC0 PC 00 0000 3000 004055 After AC0 PC 00 0000 3000 00F05A

address: 004057 00F05A

SPRU375G

Instruction Set Descriptions

5-69

Branch Unconditionally (goto)

Branch Unconditionally
Syntax Characteristics
Parallel Enable Bit No Yes Yes No

No. [1] [2] [3] [4]

Syntax goto ACx goto L7 goto L16 goto P24

Size 2 2 3 4

Cycles 10 6 6 5

Pipeline X AD AD D

This instruction executes in 3 cycles if the addressed instruction is in the instruction buffer unit.

Description

This instruction branches to a 24-bit program address defined by the content of the 24 lowest bits of an accumulator (ACx), or to a program address defined by the program address label assembled into Lx or P24. These instructions cannot be repeated.

Status Bits

Affected by Affects

none none

See Also

See the following other related instructions:


- Branch Conditionally - Branch on Auxiliary Register Not Zero - Call Unconditionally - Compare and Branch

5-70

Instruction Set Descriptions

SPRU375G

Branch Unconditionally (goto)

Branch Unconditionally
Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax goto ACx

Size 2

Cycles 10

Pipeline X

Opcode Operands Description ACx

1001 0001 xxxx xxSS

This instruction branches to a 24-bit program address defined by the content of the 24 lowest bits of an accumulator (ACx). Affected by Affects none none

Status Bits

Repeat Example
Syntax goto AC0
Before AC0 PC

This instruction cannot be repeated.

Description Program control is passed to the program address defined by the content of AC0(230).
After 00 0000 403D 001F0A AC0 PC 00 0000 403D 00403D

SPRU375G

Instruction Set Descriptions

5-71

Branch Unconditionally (goto)

Branch Unconditionally
Syntax Characteristics
Parallel Enable Bit Yes Yes Cycles 6 6

No. [2] [3]

Syntax goto L7 goto L16

Size 2 3

Pipeline AD AD

Executes in 3 cycles if the addressed instruction is in the instruction buffer unit.

Opcode

L7 L16

0100 101E 0LLL LLLL 0000 011E LLLL LLLL LLLL LLLL

Operands Description

Lx This instruction branches to a program address defined by a program address label assembled into Lx. Affected by Affects none none

Status Bits

Repeat Example
Syntax goto branch goto branch AC0 = #1 branch: AC0 = #0

This instruction cannot be repeated.

Description Program control is passed to the absolute address defined by branch.

address: 004044 006047

Before PC AC0 004042 00 0000 0001

After PC AC0 006047 00 0000 0000

5-72

Instruction Set Descriptions

SPRU375G

Branch Unconditionally (goto)

Branch Unconditionally
Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax goto P24

Size 4

Cycles 5

Pipeline D

Opcode Operands Description P24

0110 1010 PPPP PPPP PPPP PPPP PPPP PPPP

This instruction branches to a program address defined by a program address label assembled into P24. Affected by Affects none none

Status Bits

Repeat Example
Syntax goto branch goto branch AC0 = #1 branch: AC0 = #0

This instruction cannot be repeated.

Description Program control is passed to the absolute address defined by branch.

address: 004044 006047

Before PC AC0 004042 00 0000 0001

After PC AC0 006047 00 0000 0000

SPRU375G

Instruction Set Descriptions

5-73

Branch on Auxiliary Register Not Zero (if goto)

Branch on Auxiliary Register Not Zero


Syntax Characteristics
Parallel Enable Bit No Cycles 6/5

No. [1]

Syntax if (ARn_mod != #0) goto L16

Size 4

Pipeline AD

x/y cycles: x cycles = condition true, y cycles = condition false

Opcode Operands Description ARn_mod, L16

1111 1100 AAAA AAAI LLLL LLLL LLLL LLLL

This instruction performs a conditional branch (selected auxiliary register content not equal to 0) of the program counter (PC). The program branch address is specified as a 16-bit signed offset, L16, relative to PC. Use this instruction to branch within a 64K-byte window centered on the current PC value. The possible addressing operands can be grouped into three categories:
- ARx not modified (ARx as base pointer), some examples:

*AR1; No modification or offset *AR1(#15); Use 16-bit immediate value (15) as offset *AR1(T0); Use content of T0 as offset *AR1(short(#4)); Use 3-bit immediate value (4) as offset
- ARx modified before being compared to 0, some examples:

*AR1; Decrement by 1 before comparison *+AR1(#20); Add 16-bit immediate value (20) before comparison
- ARx modified after being compared to 0, some examples:

*AR1+; Increment by 1 after comparison *(AR1 T1); Subtract content of T1 after comparison 1) The content of the selected auxiliary register (ARn) is premodified in the address generation unit. 2) The (premodified) content of ARn is compared to 0 and sets the condition in the address phase of the pipeline. 3) If the condition is not true, a branch occurs. If the condition is true, the instructions are executed in sequence. 4) The content of ARn is postmodified in the address generation unit.
5-74 Instruction Set Descriptions SPRU375G

Branch on Auxiliary Register Not Zero (if goto)

Compatibility with C54x devices (C54CM = 1) When C54CM = 1: The premodifier *ARn(T0) is not available; *ARn(AR0) is available. The postmodifiers *(ARn + T0) and *(ARn T0) are not available; *(ARn + AR0) and *(ARn AR0) are available. The legality of the modifier usage is checked by the assembler when using the .c54cm_on and .c54cm_off assembler directives. Status Bits Affected by Affects Repeat See Also C54CM none

This instruction cannot be repeated. See the following other related instructions:
- Branch Conditionally - Branch Unconditionally - Compare and Branch

Example 1
Syntax Description if (*AR1(#6) != #0) goto branch The content of AR1 is compared to 0. The content is not 0, program control is passed to the program address label defined by branch. If (*AR1(#6) != #0) goto branch branch :
Before AR1 PC 0005 004004 After AR1 PC 0005 00400C

address: 004004 ; 00400A

00400C

SPRU375G

Instruction Set Descriptions

5-75

Branch on Auxiliary Register Not Zero (if goto)

Example 2
Syntax if (*AR3 != #0) goto branch Description The content of AR3 is compared to 0. The content is 0, program control is passed to the next instruction (the branch is not taken). AR3 is decremented by 1 after the comparison. address: 00400F ; ; 004013 004015

If (*AR3 != #0) goto branch branch :


Before AR3 PC 0000 00400F After AR3 PC

FFFF 004013

5-76

Instruction Set Descriptions

SPRU375G

Call Conditionally (if call)

Call Conditionally
Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax if (cond) call L16 if (cond) call P24

Size 4 5

Cycles 6/5 5/5

Pipeline R R

x/y cycles: x cycles = condition true, y cycles = condition false

Description

These instructions evaluate a single condition defined by the cond field in the read phase of the pipeline. If the condition is true, a subroutine call occurs to the program address defined by the program address label assembled into L16 or P24. There is a 1-cycle latency on the condition setting. A single condition can be tested as determined by the cond field of the instruction. See Table 13 for a list of conditions. Before beginning a called subroutine, the CPU automatically saves the value of two internal registers: the program counter (PC) and a loop context register. The CPU can use these values to re-establish the context of the interrupted program sequence when the subroutine is done. In the slow-return process (default), the return address (from the PC) and the loop context bits are stored to the stacks (in memory). When the CPU returns from a subroutine, the speed at which these values are restored is dependent on the speed of the memory accesses. In the fast-return process, the return address (from the PC) and the loop context bits are saved to registers, so that these values can always be restored quickly. These special registers are the return address register (RETA) and the control-flow context register (CFCT). You can read from or write to RETA and CFCT as a pair with dedicated, 32-bit load and store instructions. The instruction selection depends on the branch offset between the current PC value and program subroutine address specified by the label. These instructions cannot be repeated.

Status Bits

Affected by Affects

ACOVx, CARRY, C54CM, M40, TCx ACOVx


Instruction Set Descriptions 5-77

SPRU375G

Call Conditionally (if call)

See Also

See the following other related instructions:


- Branch Conditionally - Call Unconditionally - Return Conditionally - Return Unconditionally

5-78

Instruction Set Descriptions

SPRU375G

Call Conditionally (if call)

Call Conditionally
Syntax Characteristics
No. [1] Syntax if (cond) call L16 Parallel Enable Bit No Size 4 Cycles 6/5 Pipeline R

x/y cycles: x cycles = condition true, y cycles = condition false

Opcode Operands Description cond, L16

0110 1110 xCCC CCCC LLLL LLLL LLLL LLLL

This instruction evaluates a single condition defined by the cond field in the read phase of the pipeline. If the condition is true, a subroutine call occurs to the program address defined by the program address label assembled into L16. There is a 1-cycle latency on the condition setting. A single condition can be tested as determined by the cond field of the instruction. See Table 13 for a list of conditions. When a subroutine call occurs in the slow-return process (default), the return address (from the PC) and the loop context bits are stored to the stacks. For fast-return mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371).
- The data stack pointer (SP) is decremented by 1 word in the read phase

of the pipeline. The 16 LSBs of the return address, from the program counter (PC), of the called subroutine are pushed to the top of SP.
- The system stack pointer (SSP) is decremented by 1 word in the read

phase of the pipeline. The loop context bits concatenated with the 8 MSBs of the return address are pushed to the top of SSP.
- The PC is loaded with the subroutine program address. The active control

flow execution context flags are cleared.


System Stack (SSP) After Save Before Save SSP = x 1 SSP = x (Loop bits) bits):PC(23 PC(2316) Previously saved data After SP = y 1 Save Before Save SP = y Data Stack (SP) PC(150) Previously saved data

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1.
SPRU375G Instruction Set Descriptions 5-79

Call Conditionally (if call)

Status Bits

Affected by Affects

ACOVx, CARRY, C54CM, M40, TCx ACOVx

Repeat Example
Syntax

This instruction cannot be repeated.

Description The content of AC1 is equal to or greater than 2000h, control is passed to the program address label, subroutine. The program counter (PC) is loaded with the subroutine program address.

if (AC1 >= #2000h) call (subroutine)

5-80

Instruction Set Descriptions

SPRU375G

Call Conditionally (if call)

Call Conditionally
Syntax Characteristics
No. [2] Syntax if (cond) call P24 Parallel Enable Bit No Size 5 Cycles 5/5 Pipeline R

x/y cycles: x cycles = condition true, y cycles = condition false

Opcode Operands Description

0110 1001 xCCC CCCC PPPP PPPP PPPP PPPP PPPP PPPP cond, P24 This instruction evaluates a single condition defined by the cond field in the read phase of the pipeline. If the condition is true, a subroutine call occurs to the program address defined by the program address label assembled into P24. There is a 1-cycle latency on the condition setting. A single condition can be tested as determined by the cond field of the instruction. See Table 13 for a list of conditions. When a subroutine call occurs in the slow-return process (default), the return address (from the PC) and the loop context bits are stored to the stacks. For fast-return mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371).
- The data stack pointer (SP) is decremented by 1 word in the read phase

of the pipeline. The 16 LSBs of the return address, from the program counter (PC), of the called subroutine are pushed to the top of SP.
- The system stack pointer (SSP) is decremented by 1 word in the read

phase of the pipeline. The loop context bits concatenated with the 8 MSBs of the return address are pushed to the top of SSP.
- The PC is loaded with the subroutine program address. The active control

flow execution context flags are cleared.


System Stack (SSP) After Save Before Save SSP = x 1 SSP = x (Loop bits) bits):PC(23 PC(2316) Previously saved data After SP = y 1 Save Before Save SP = y Data Stack (SP) PC(150) Previously saved data

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1.
SPRU375G Instruction Set Descriptions 5-81

Call Conditionally (if call)

Status Bits

Affected by Affects

ACOVx, CARRY, C54CM, M40, TCx ACOVx

Repeat Example
Syntax if (TC1) call FOO

This instruction cannot be repeated.

Description If TC1 is set to 1, control is passed to the program address label (FOO) assembled into an absolute address defined by the 24-bit value. If TC1 is cleared to 0, the program counter is incremented by 6 and the next instruction is executed.

5-82

Instruction Set Descriptions

SPRU375G

Call Unconditionally (call)

Call Unconditionally
Syntax Characteristics
Parallel Enable Bit No Yes No

No. [1] [2] [3]

Syntax call ACx call L16 call P24

Size 2 3 4

Cycles 10 6 5

Pipeline X AD D

Description

This instruction passes control to a specified subroutine program address defined by the content of the 24 lowest bits of the accumulator, ACx, or a program address label assembled into L16 or P24. Before beginning a called subroutine, the CPU automatically saves the value of two internal registers: the program counter (PC) and a loop context register. The CPU can use these values to re-establish the context of the interrupted program sequence when the subroutine is done. In the slow-return process (default), the return address (from the PC) and the loop context bits are stored to the stacks (in memory). When the CPU returns from a subroutine, the speed at which these values are restored is dependent on the speed of the memory accesses. In the fast-return process, the return address (from the PC) and the loop context bits are saved to registers, so that these values can always be restored quickly. These special registers are the return address register (RETA) and the control-flow context register (CFCT). You can read from or write to RETA and CFCT as a pair with dedicated, 32-bit load and store instructions. These instructions cannot be repeated.

Status Bits

Affected by Affects

none none

See Also

See the following other related instructions:


- Branch Unconditionally - Call Conditionally - Return Conditionally - Return Unconditionally

SPRU375G

Instruction Set Descriptions

5-83

Call Unconditionally (call)

Call Unconditionally
Syntax Characteristics
No. [1] Syntax call ACx Parallel Enable Bit No Size 2 Cycles 10 Pipeline X

Opcode Operands Description ACx

1001 0010 xxxx xxSS

This instruction passes control to a specified subroutine program address defined by the content of the 24 lowest bits of the accumulator, ACx. In the slow-return process (default), the return address (from the PC) and the loop context bits are stored to the stacks. For fast-return mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371).
- The data stack pointer (SP) is decremented by 1 word in the address

phase of the pipeline. The 16 LSBs of the return address, from the program counter (PC), of the called subroutine are pushed to the top of SP.
- The system stack pointer (SSP) is decremented by 1 word in the address

phase of the pipeline. The loop context bits concatenated with the 8 MSBs of the return address are pushed to the top of SSP.
- The PC is loaded with the subroutine program address. The active control

flow execution context flags are cleared.


System Stack (SSP) After Save Before Save SSP = x 1 SSP = x PC(2316) (Loop bits) bits):PC(23 Previously saved data After SP = y 1 Save Before Save SP = y Data Stack (SP) PC(150) Previously saved data

Status Bits

Affected by Affects

none none

Repeat Example
Syntax call AC0

This instruction cannot be repeated.

Description Program control is passed to the program address defined by the content of AC0(230).

5-84

Instruction Set Descriptions

SPRU375G

Call Unconditionally (call)

Call Unconditionally
Syntax Characteristics
No. [2] Syntax call L16 Parallel Enable Bit Yes Size 3 Cycles 6 Pipeline AD

Opcode Operands Description L16

0000 100E LLLL LLLL LLLL LLLL

This instruction passes control to a specified subroutine program address defined by a program address label assembled into L16. In the slow-return process (default), the return address (from the PC) and the loop context bits are stored to the stacks. For fast-return mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371).
- The data stack pointer (SP) is decremented by 1 word in the address

phase of the pipeline. The 16 LSBs of the return address, from the program counter (PC), of the called subroutine are pushed to the top of SP.
- The system stack pointer (SSP) is decremented by 1 word in the address

phase of the pipeline. The loop context bits concatenated with the 8 MSBs of the return address are pushed to the top of SSP.
- The PC is loaded with the subroutine program address. The active control

flow execution context flags are cleared.


System Stack (SSP) After Save Before Save SSP = x 1 SSP = x PC(2316) (Loop bits) bits):PC(23 Previously saved data After SP = y 1 Save Before Save SP = y Data Stack (SP) PC(150) Previously saved data

Status Bits

Affected by Affects

none none

Repeat Example
Syntax call FOO

This instruction cannot be repeated.

Description Program control is passed to the program address label (FOO) assembled into the signed 16-bit offset value relative to the program counter register.

SPRU375G

Instruction Set Descriptions

5-85

Call Unconditionally (call)

Call Unconditionally
Syntax Characteristics
No. [3] Syntax call P24 Parallel Enable Bit No Size 4 Cycles 5 Pipeline D

Opcode Operands Description P24

0110 1100 PPPP PPPP PPPP PPPP PPPP PPPP

This instruction passes control to a specified subroutine program address defined by a program address label assembled into P24. In the slow-return process (default), the return address (from the PC) and the loop context bits are stored to the stacks. For fast-return mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371).
- The data stack pointer (SP) is decremented by 1 word in the address

phase of the pipeline. The 16 LSBs of the return address, from the program counter (PC), of the called subroutine are pushed to the top of SP.
- The system stack pointer (SSP) is decremented by 1 word in the address

phase of the pipeline. The loop context bits concatenated with the 8 MSBs of the return address are pushed to the top of SSP.
- The PC is loaded with the subroutine program address. The active control

flow execution context flags are cleared.


System Stack (SSP) After Save Before Save SSP = x 1 SSP = x PC(2316) (Loop bits) bits):PC(23 Previously saved data After SP = y 1 Save Before Save SP = y Data Stack (SP) PC(150) Previously saved data

Status Bits

Affected by Affects

none none

Repeat Example
Syntax call FOO

This instruction cannot be repeated.

Description Program control is passed to the program address label (FOO) assembled into an absolute address defined by the 24-bit value.

5-86

Instruction Set Descriptions

SPRU375G

Circular Addressing Qualifier (circular)

Circular Addressing Qualifier


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax circular()

Size 1

Cycles 1

Pipeline AD

Opcode Operands Description none

1001 1101

This instruction is an instruction qualifier that can be paralleled only with any instruction making an indirect Smem, Xmem, Ymem, Lmem, Baddr, or Cmem addressing. This instruction cannot be executed in parallel with any other types of instructions and it cannot be executed as a stand-alone instruction (assembler generates an error message). When this instruction is used in parallel, all modifications of ARx and CDP pointer registers used in the indirect addressing mode are done circularly (as if ST2_55 register bits 0 to 8 were set to 1).

Status Bits

Affected by Affects

none none

Repeat

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-87

Clear Accumulator, Auxiliary, or Temporary Register Bit

Clear Accumulator, Auxiliary, or Temporary Register Bit


Syntax Characteristics
No. [1] Syntax bit(src, Baddr) = #0 Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description Baddr, src

1110 1100 AAAA AAAI FSSS 001x

This instruction performs a bit manipulation:


- In the D-unit ALU, if the source (src) register operand is an accumulator. - In the A-unit ALU, if the source (src) register operand is an auxiliary or

temporary register. The instruction clears to 0 a single bit, as defined by the bit addressing mode, Baddr, of the source register. The generated bit address must be within:
- 039 when accessing accumulator bits (only the 6 LSBs of the generated

bit address are used to determine the bit position). If the generated bit address is not within 039, the selected register bit value does not change.
- 015 when accessing auxiliary or temporary register bits (only the 4 LSBs

of the generated address are used to determine the bit position). Status Bits Affected by Affects Repeat See Also none none

This instruction can be repeated. See the following other related instructions:
- Clear Memory Bit - Clear Status Register Bit - Complement Accumulator, Auxiliary, or Temporary Register Bit - Set Accumulator, Auxiliary, or Temporary Register Bit

Example
Syntax bit(AC0, AR3) = #0 Description The bit at the position defined by the content of AR3(40) in AC0 is cleared to 0.

5-88

Instruction Set Descriptions

SPRU375G

Clear Memory Bit

Clear Memory Bit


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax bit(Smem, src) = #0

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Smem, src

1110 0011 AAAA AAAI FSSS 1101

This instruction performs a bit manipulation in the A-unit ALU. The instruction clears to 0 a single bit, as defined by the content of the source (src) operand, of a memory (Smem) location. The generated bit address must be within 015 (only the 4 LSBs of the register are used to determine the bit position).

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Clear Accumulator, Auxiliary, or Temporary Register Bit - Clear Status Register Bit - Complement Memory Bit - Set Memory Bit

Example
Syntax bit(*AR3, AC0) = #0 Description The bit at the position defined by AC0(30) in the content addressed by AR3 is cleared to 0.

SPRU375G

Instruction Set Descriptions

5-89

Clear Status Register Bit

Clear Status Register Bit


Syntax Characteristics
Parallel Enable Bit Yes Yes Yes Yes

No. [1] [2] [3] [4]

Syntax bit(ST0, k4) = #0 bit(ST1, k4) = #0 bit(ST2, k4) = #0 bit(ST3, k4) = #0

Size 2 2 2 2

Cycles 1 1 1 1

Pipeline X X X X

When this instruction is decoded to modify status bit CAFRZ (15), CAEN (14), or CACLR (13), the CPU pipeline is flushed and the instruction is executed in 5 cycles regardless of the instruction context.

Opcode

ST0 ST1 ST2 ST3

0100 011E kkkk 0000 0100 011E kkkk 0010 0100 011E kkkk 0100 0100 011E kkkk 0110

Operands Description

k4, STx These instructions perform a bit manipulation in the A-unit ALU. These instructions clear to 0 a single bit, as defined by a 4-bit immediate value, k4, in the selected status register (ST0, ST1, ST2, or ST3). Compatibility with C54x devices (C54CM = 1) C55x DSP status registers bit mapping (Figure 51, page 5-92) does not correspond to C54x DSP status register bits.

Status Bits

Affected by Affects

none Selected status bits

Repeat See Also

This instruction cannot be repeated. See the following other related instructions:
- Clear Accumulator, Auxiliary, or Temporary Register Bit - Clear Memory Bit - Set Status Register Bit

5-90

Instruction Set Descriptions

SPRU375G

Clear Status Register Bit

Example
Syntax bit(ST2, #ST2_AR2LC) = #0; AR2LC = bit 2 Description The ST2 bit position defined by the label (ST2_AR2LC, bit 2) is cleared to 0.

Before ST2_55 0006

After ST2_55 0002

SPRU375G

Instruction Set Descriptions

5-91

Clear Status Register Bit

Figure 51. Status Registers Bit Mapping


ST0_55 15 ACOV2 R/W0 8 DP R/W0 ST1_55 15 BRAF R/W0 7 C16 R/W0 ST2_55 15 ARMS R/W0 7 AR7LC R/W0 ST3_55 15 CAFRZ R/W0 7 CBERR R/W0 14 CAEN R/W0 6 MPNMC R/Wpins 13 CACLR R/W0 5 SATA R/W0 4 Reserved 12 HINT R/W1 3 2 CLKOFF R/W0 1 SMUL R/W0 0 SST R/W0 11 Reserved (always write 1100b) 8 6 AR6LC R/W0 5 AR5LC R/W0 14 Reserved 13 12 DBGM R/W1 4 AR4LC R/W0 11 EALLOW R/W0 3 AR3LC R/W0 10 RDM R/W0 2 AR2LC R/W0 1 AR1LC R/W0 9 Reserved 8 CDPLC R/W0 0 AR0LC R/W0 14 CPL R/W0 6 FRCT R/W0 13 XF R/W1 5 C54CM R/W1 4 ASM R/W0 12 HM R/W0 11 INTM R/W1 10 M40 R/W0 9 SATD R/W0 8 SXMD R/W1 0 14 ACOV3 R/W0 13 TC1 R/W1 12 TC2 R/W1 11 CARRY R/W1 10 ACOV0 R/W0 9 ACOV1 R/W0 0

Legend: R = Read; W = Write; -n = Value after reset Highlighted bit: If you write to the protected address of the status register, a write to this bit has no effect, and the bit always appears as a 0 during read operations. The HINT bit is not used for all C55x host port interfaces (HPIs). Consult the documentation for the specific C55x DSP. The reset value of MPNMC may be dependent on the state of predefined pins at reset. To check this for a particular C55x DSP, see the boot loader section of its data sheet.

5-92

Instruction Set Descriptions

SPRU375G

Compare Accumulator, Auxiliary, or Temporary Register Content

Compare Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
No. [1] [2] Syntax TC1 = uns(src RELOP dst) TC2 = uns(src RELOP dst) Parallel Enable Bit Yes Yes Size 3 3 Cycles 1 1 Pipeline X X

Opcode

TC1 TC2

0001 001E FSSS cc00 FDDD xux0 0001 001E FSSS cc00 FDDD xux1

Operands Description

dst, RELOP, src, TCx This instruction performs a comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. If the comparison is true, the TCx status bit is set to 1; otherwise, it is cleared to 0. The comparison depends on the optional uns keyword and on M40 for accumulator comparisons. As the following table shows, the uns keyword specifies an unsigned comparison and M40 defines the comparison bit width for accumulator comparisons.
uns no no no no yes yes yes yes src TAx TAx ACx ACx TAx TAx ACx ACx dst TAy ACy TAy ACy TAy ACy TAy ACy Comparison Type 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU if M40 = 0, 32-bit signed comparison in D-unit ALU if M40 = 1, 40-bit signed comparison in D-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU if M40 = 0, 32-bit unsigned comparison in D-unit ALU if M40 = 1, 40-bit unsigned comparison in D-unit ALU

Compatibility with C54x devices (C54CM = 1) Contrary to the corresponding C54x instruction, the C55x register comparison instruction is performed in execute phase of the pipeline. When C54CM = 1, the conditions testing the accumulators content are all performed as if M40 was set to 1.
SPRU375G Instruction Set Descriptions 5-93

Compare Accumulator, Auxiliary, or Temporary Register Content

Status Bits

Affected by Affects

C54CM, M40 TCx

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Compare Accumulator, Auxiliary, or Temporary Register Content with AND - Compare Accumulator, Auxiliary, or Temporary Register Content with OR - Compare Accumulator, Auxiliary, or Temporary Register Content Maximum - Compare Accumulator, Auxiliary, or Temporary Register Content Minimum - Compare Memory with Immediate Value

Example 1
Syntax TC1= AC1 = = T1 Description The signed content of AC1(150) is compared to the content of T1 and because they are equal, TC1 is set to 1.
After 00 0028 0400 0400 0 AC1 T1 TC1 00 0028 0400 0400 1

Before AC1 T1 TC1

Example 2
Syntax TC1= T1 > = AC1 Description The content of T1 is compared to the signed content of AC1(150). The content of T1 is greater than the content of AC1, TC1 is set to 1.
After 0500 80 0000 0400 0 T1 AC1 TC1 0500 80 0000 0400 1

Before T1 AC1 TC1

5-94

Instruction Set Descriptions

SPRU375G

Compare Accumulator, Auxiliary, or Temporary Register Content with AND

Compare Accumulator, Auxiliary, or Temporary Register Content with AND


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax TCx = TCy & uns(src RELOP dst) TCx = !TCy & uns(src RELOP dst)

Size 3 3

Cycles 1 1

Pipeline X X

Description

These instructions perform a comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. Affected by Affects C54CM, M40, TCy TCx

Status Bits

See Also

See the following other related instructions:


- Compare Accumulator, Auxiliary, or Temporary Register Content - Compare Accumulator, Auxiliary, or Temporary Register Content with OR - Compare Accumulator, Auxiliary, or Temporary Register Content Maximum - Compare Accumulator, Auxiliary, or Temporary Register Content Minimum - Compare Memory with Immediate Value

SPRU375G

Instruction Set Descriptions

5-95

Compare Accumulator, Auxiliary, or Temporary Register Content with AND

Compare Accumulator, Auxiliary, or Temporary Register Content with AND

Syntax Characteristics
Parallel Enable Bit

No.

Syntax TCx = TCy & uns(src RELOP dst)

Size

Cycles

Pipeline

[1a] [1b]

TC1 = TC2 & uns(src RELOP dst) TC2 = TC1 & uns(src RELOP dst)

Yes Yes

3 3

1 1

X X

Opcode Operands Description dst, RELOP, src, TC1, TC2

0001 001E FSSS cc01 FDDD 0utt

This instruction performs a comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. If the comparison is true, the TCx status bit is set to 1; otherwise, it is cleared to 0. The result of the comparison is ANDed with TCy; TCx is updated with this operation. The comparison depends on the optional uns keyword and on M40 for accumulator comparisons. As the following table shows, the uns keyword specifies an unsigned comparison and M40 defines the comparison bit width for accumulator comparisons.
uns no no no no yes yes yes yes src TAx TAx ACx ACx TAx TAx ACx ACx dst TAy ACy TAy ACy TAy ACy TAy ACy Comparison Type 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU If M40 = 0, 32-bit signed comparison in D-unit ALU if M40 = 1, 40-bit signed comparison in D-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU If M40 = 0, 32-bit unsigned comparison in D-unit ALU if M40 = 1, 40-bit unsigned comparison in D-unit ALU

5-96

Instruction Set Descriptions

SPRU375G

Compare Accumulator, Auxiliary, or Temporary Register Content with AND

Compatibility with C54x devices (C54CM = 1) Contrary to the corresponding C54x instruction, the C55x register comparison instruction is performed in execute phase of the pipeline. When C54CM = 1, the conditions testing the accumulators content are all performed as if M40 was set to 1. Status Bits Affected by Affects Repeat Example
Syntax TC2 = TC1 & AC1 == AC2 Description The content of AC1(310) is compared to the content of AC2(310). The contents are equal (true), TC2 = TC1 & 1.
After 80 0028 0400 00 0028 0400 0 1 0 AC1 AC2 M40 TC1 TC2 80 0028 0400 00 0028 0400 0 1 1

C54CM, M40, TCy TCx

This instruction can be repeated.

Before AC1 AC2 M40 TC1 TC2

SPRU375G

Instruction Set Descriptions

5-97

Compare Accumulator, Auxiliary, or Temporary Register Content with AND

Compare Accumulator, Auxiliary, or Temporary Register Content with AND

Syntax Characteristics
Parallel Enable Bit

No.

Syntax TCx = !TCy & uns(src RELOP dst)

Size

Cycles

Pipeline

[2a] [2b]

TC1 = !TC2 & uns(src RELOP dst) TC2 = !TC1 & uns(src RELOP dst)

Yes Yes

3 3

1 1

X X

Opcode Operands Description dst, RELOP, src, TC1, TC2

0001 001E FSSS cc01 FDDD 1utt

This instruction performs a comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. If the comparison is true, the TCx status bit is set to 1; otherwise, it is cleared to 0. The result of the comparison is ANDed with the complement of TCy; TCx is updated with this operation. The comparison depends on the optional uns keyword and on M40 for accumulator comparisons. As the following table shows, the uns keyword specifies an unsigned comparison and M40 defines the comparison bit width for accumulator comparisons.
uns no no no no yes yes yes yes src TAx TAx ACx ACx TAx TAx ACx ACx dst TAy ACy TAy ACy TAy ACy TAy ACy Comparison Type 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU if M40 = 0, 32-bit signed comparison in D-unit ALU if M40 = 1, 40-bit signed comparison in D-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU if M40 = 0, 32-bit unsigned comparison in D-unit ALU if M40 = 1, 40-bit unsigned comparison in D-unit ALU

5-98

Instruction Set Descriptions

SPRU375G

Compare Accumulator, Auxiliary, or Temporary Register Content with AND

Compatibility with C54x devices (C54CM = 1) Contrary to the corresponding C54x instruction, the C55x register comparison instruction is performed in execute phase of the pipeline. When C54CM = 1, the conditions testing the accumulators content are all performed as if M40 was set to 1. Status Bits Affected by Affects Repeat Example
Syntax TC2 = !TC1 & AC1 == AC2 Description The content of AC1(310) is compared to the content of AC2(310). The contents are equal (true), TC2 = !TC1 & 1.
After 80 0028 0400 00 0028 0400 0 1 0 AC1 AC2 M40 TC1 TC2 80 0028 0400 00 0028 0400 0 1 0

C54CM, M40, TCy TCx

This instruction can be repeated.

Before AC1 AC2 M40 TC1 TC2

SPRU375G

Instruction Set Descriptions

5-99

Compare Accumulator, Auxiliary, or Temporary Register Content with OR

Compare Accumulator, Auxiliary, or Temporary Register Content with OR


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax TCx = TCy | uns(src RELOP dst) TCx = !TCy | uns(src RELOP dst)

Size 3 3

Cycles 1 1

Pipeline X X

Description

These instructions perform a comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. Affected by Affects C54CM, M40, TCy TCx

Status Bits

See Also

See the following other related instructions:


- Compare Accumulator, Auxiliary, or Temporary Register Content - Compare Accumulator, Auxiliary, or Temporary Register Content with AND - Compare Accumulator, Auxiliary, or Temporary Register Content Maximum - Compare Accumulator, Auxiliary, or Temporary Register Content Minimum - Compare Memory with Immediate Value

5-100

Instruction Set Descriptions

SPRU375G

Compare Accumulator, Auxiliary, or Temporary Register Content with OR

Compare Accumulator, Auxiliary, or Temporary Register Content with OR

Syntax Characteristics
Parallel Enable Bit

No.

Syntax TCx = TCy | uns(src RELOP dst)

Size

Cycles

Pipeline

[1a] [1b]

TC1 = TC2 | uns(src RELOP dst) TC2 = TC1 | uns(src RELOP dst)

Yes Yes

3 3

1 1

X X

Opcode Operands Description dst, RELOP, src, TC1, TC2

0001 001E FSSS cc10 FDDD 0utt

This instruction performs a comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. If the comparison is true, the TCx status bit is set to 1; otherwise, it is cleared to 0. The result of the comparison is ORed with TCy; TCx is updated with this operation. The comparison depends on the optional uns keyword and on M40 for accumulator comparisons. As the following table shows, the uns keyword specifies an unsigned comparison and M40 defines the comparison bit width for accumulator comparisons.
uns no no no no yes yes yes yes src TAx TAx ACx ACx TAx TAx ACx ACx dst TAy ACy TAy ACy TAy ACy TAy ACy Comparison Type 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU if M40 = 0, 32-bit signed comparison in D-unit ALU if M40 = 1, 40-bit signed comparison in D-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU if M40 = 0, 32-bit unsigned comparison in D-unit ALU if M40 = 1, 40-bit unsigned comparison in D-unit ALU

SPRU375G

Instruction Set Descriptions

5-101

Compare Accumulator, Auxiliary, or Temporary Register Content with OR

Compatibility with C54x devices (C54CM = 1) Contrary to the corresponding C54x instruction, the C55x register comparison instruction is performed in execute phase of the pipeline. When C54CM = 1, the conditions testing the accumulators content are all performed as if M40 was set to 1. Status Bits Affected by Affects Repeat Example
Syntax TC2 = TC1 | uns(AC1 != AR1) Description The unsigned content of AC1(150) is compared to the unsigned content of AR1. The contents are equal (false), TC2 = TC1 | 0.
After 00 8028 0400 0400 1 0 AC1 AR1 TC1 TC2 00 8028 0400 0400 1 1

C54CM, M40, TCy TCx

This instruction can be repeated.

Before AC1 AR1 TC1 TC2

5-102

Instruction Set Descriptions

SPRU375G

Compare Accumulator, Auxiliary, or Temporary Register Content with OR

Compare Accumulator, Auxiliary, or Temporary Register Content with OR

Syntax Characteristics
Parallel Enable Bit

No.

Syntax TCx = !TCy | uns(src RELOP dst)

Size

Cycles

Pipeline

[2a] [2b]

TC1 = !TC2 | uns(src RELOP dst) TC2 = !TC1 | uns(src RELOP dst)

Yes Yes

3 3

1 1

X X

Opcode Operands Description dst, RELOP, src, TC1, TC2

0001 001E FSSS cc10 FDDD 1utt

This instruction performs a comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. If the comparison is true, the TCx status bit is set to 1; otherwise, it is cleared to 0. The result of the comparison is ORed with the complement of TCy; TCx is updated with this operation. The comparison depends on the optional uns keyword and on M40 for accumulator comparisons. As the following table shows, the uns keyword specifies an unsigned comparison and M40 defines the comparison bit width for accumulator comparisons.
uns no no no no yes yes yes yes src TAx TAx ACx ACx TAx TAx ACx ACx dst TAy ACy TAy ACy TAy ACy TAy ACy Comparison Type 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU 16-bit signed comparison in A-unit ALU if M40 = 0, 32-bit signed comparison in D-unit ALU if M40 = 1, 40-bit signed comparison in D-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU 16-bit unsigned comparison in A-unit ALU if M40 = 0, 32-bit unsigned comparison in D-unit ALU if M40 = 1, 40-bit unsigned comparison in D-unit ALU

SPRU375G

Instruction Set Descriptions

5-103

Compare Accumulator, Auxiliary, or Temporary Register Content with OR

Compatibility with C54x devices (C54CM = 1) Contrary to the corresponding C54x instruction, the C55x register comparison instruction is performed in execute phase of the pipeline. When C54CM = 1, the conditions testing the accumulators content are all performed as if M40 was set to 1. Status Bits Affected by Affects Repeat Example
Syntax TC2 = !TC1 | uns(AC1 != AR1) Description The unsigned content of AC1(150) is compared to the unsigned content of AR1. The contents are equal (false), TC2 = !TC1 | 0.
After 00 8028 0400 0400 1 1 AC1 AR1 TC1 TC2 00 8028 0400 0400 1 0

C54CM, M40, TCy TCx

This instruction can be repeated.

Before AC1 AR1 TC1 TC2

5-104

Instruction Set Descriptions

SPRU375G

Compare Accumulator, Auxiliary, or Temporary Register Content Maximum (max)

Compare Accumulator, Auxiliary, or Temporary Register Content Maximum


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst = max(src, dst)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, src

0010 111E FSSS FDDD

This instruction performs a maximum comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. If the comparison is true, the TCx status bit is set to 1; otherwise, it is cleared to 0.
- When the destination operand (dst) is an accumulator: J

If an auxiliary or temporary register is the source operand (src) of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended to 40 bits according to SXMD. The operation is performed on 40 bits in the D-unit ALU: If M40 = 0, src(310) content is compared to dst(310) content. The extremum value is stored in dst. If the extremum value is the src content, the CARRY status bit is cleared to 0; otherwise, it is set to 1.
step1: if (src(310) > dst(310)) step2: { CARRY = 0; dst(390) = src(390) } else step3: CARRY = 1

If M40 = 1, src(390) content is compared to dst(390) content. The extremum value is stored in dst. If the extremum value is the src content, the CARRY status bit is cleared to 0; otherwise, it is set to 1.
step1: if (src(390) > dst(390)) step2: { CARRY = 0; dst(390) = src(390) } else step3: CARRY = 1
J SPRU375G

There is no overflow detection, overflow report, and saturation.


Instruction Set Descriptions 5-105

Compare Accumulator, Auxiliary, or Temporary Register Content Maximum (max)

- When the destination operand (dst) is an auxiliary or temporary register: J J

If an accumulator is the source operand (src) of the instruction, the 16 LSBs of the accumulator are used to perform the operation. The operation is performed on 16 bits in the A-unit ALU: The src(150) content is compared to the dst(150) content. The extremum value is stored in dst.
step1: if (src(150) > dst(150)) step2: dst = src

There is no overflow detection and saturation.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if M40 status bit was locally set to 1. When the destination operand (dst) is an auxiliary or temporary register, the instruction execution is not impacted by the C54CM status bit. When the destination operand (dst) is an accumulator, this instruction always compares the source operand (src) with AC1 as follows:
- If an auxiliary or temporary register is the source operand (src) of the

instruction, the 16 LSBs of the auxiliary or temporary register are sign extended to 40 bits according to SXMD
- The operation is performed on 40 bits in the D-unit ALU:

The src(390) content is compared to AC1(390) content. The extremum value is stored in dst. If the extremum value is the src content, the CARRY status bit is cleared to 0; otherwise, it is set to 1.
step1: if (src(390) > AC1(390)) step2: { CARRY = 0; dst(390) = src(390) } else step3: { CARRY = 1; dst(390) = AC1(390) }

There is no overflow detection, overflow report, and saturation. Status Bits Affected by Affects Repeat C54CM, M40, SXMD CARRY

This instruction can be repeated.

5-106

Instruction Set Descriptions

SPRU375G

Compare Accumulator, Auxiliary, or Temporary Register Content Maximum (max)

See Also

See the following other related instructions:


- Compare Accumulator, Auxiliary, or Temporary Register Content - Compare Accumulator, Auxiliary, or Temporary Register Content with AND - Compare Accumulator, Auxiliary, or Temporary Register Content with OR - Compare Accumulator, Auxiliary, or Temporary Register Content

Minimum
- Compare and Select Accumulator Content Maximum - Compare Memory with Immediate Value

Example 1
Syntax AC1 = max(AC2, AC1)
Before AC2 AC1 SXMD M40 CARRY

Description The content of AC2 is less than the content of AC1, the content of AC1 remains the same and the CARRY status bit is set to 1.
After AC2 AC1 SXMD M40 CARRY 00 0000 0000 00 8500 0000 1 0 1

00 0000 0000 00 8500 0000 1 0 0

Example 2
Syntax AC1 = max(AR1, AC1)
Before AR1 AC1 CARRY

Description The content of AR1 is less than the content of AC1, the content of AC1 remains the same and the CARRY status bit is set to 1.
After AR1 AC1 CARRY 8020 00 0000 0040 1

8020 00 0000 0040 0

Example 3
Syntax T1 = max(AC1, T1)
Before AC1 T1 CARRY

Description The content of AC1(150) is greater than the content of T1, the content of AC1(150) is stored in T1 and the CARRY status bit is cleared to 0.
After AC1 T1 CARRY 00 0000 8020 8020 0

00 0000 8020 8010 0

SPRU375G

Instruction Set Descriptions

5-107

Compare Accumulator, Auxiliary, or Temporary Register Content Minimum (min)

Compare Accumulator, Auxiliary, or Temporary Register Content Minimum


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst = min(src, dst)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, src

0011 000E FSSS FDDD

This instruction performs a minimum comparison in the D-unit ALU or in the A-unit ALU. Two accumulator, auxiliary registers, and temporary registers contents are compared. When an accumulator ACx is compared with an auxiliary or temporary register TAx, the 16 lowest bits of ACx are compared with TAx in the A-unit ALU. If the comparison is true, the TCx status bit is set to 1; otherwise, it is cleared to 0.
- When the destination operand (dst) is an accumulator: J

If an auxiliary or temporary register is the source operand (src) of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended to 40 bits according to SXMD. The operation is performed on 40 bits in the D-unit ALU: If M40 = 0, src(310) content is compared to dst(310) content. The extremum value is stored in dst. If the extremum value is the src content, the CARRY status bit is cleared to 0; otherwise, it is set to 1.
step1: if (src(310) < dst(310)) step2: { CARRY = 0; dst(390) = src(390) } else step3: CARRY = 1

If M40 = 1, src(390) content is compared to dst(390) content. The extremum value is stored in dst. If the extremum value is the src content, the CARRY status bit is cleared to 0; otherwise, it is set to 1.
step1: if (src(390) < dst(390)) step2: { CARRY = 0; dst(390) = src(390) } else step3: CARRY = 1
J 5-108

There is no overflow detection, overflow report, and saturation.


SPRU375G

Instruction Set Descriptions

Compare Accumulator, Auxiliary, or Temporary Register Content Minimum (min)

- When the destination operand (dst) is an auxiliary or temporary register: J J

If an accumulator is the source operand (src) of the instruction, the 16 LSBs of the accumulator are used to perform the operation. The operation is performed on 16 bits in the A-unit ALU: The src(150) content is compared to the dst(150) content. The extremum value is stored in dst.
step1: if (src(150) < dst(150)) step2: dst = src

There is no overflow detection and saturation.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if M40 status bit was locally set to 1. When the destination operand (dst) is an auxiliary or temporary register, the instruction execution is not impacted by the C54CM status bit. When the destination operand (dst) is an accumulator, this instruction always compares the source operand (src) with AC1 as follows:
- If an auxiliary or temporary register is the source operand (src) of the

instruction, the 16 LSBs of the auxiliary or temporary register are sign extended to 40 bits according to SXMD
- The operation is performed on 40 bits in the D-unit ALU:

The src(390) content is compared to AC1(390) content. The extremum value is stored in dst. If the extremum value is the src content, the CARRY status bit is cleared to 0; otherwise, it is set to 1.
step1: if (src(390) < AC1(390)) step2: { CARRY = 0; dst(390) = src(390) } else step3: { CARRY = 1; dst(390) = AC1(390) }

There is no overflow detection, overflow report, and saturation. Status Bits Affected by Affects Repeat C54CM, M40, SXMD CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-109

Compare Accumulator, Auxiliary, or Temporary Register Content Minimum (min)

See Also

See the following other related instructions:


- Compare Accumulator, Auxiliary, or Temporary Register Content - Compare Accumulator, Auxiliary, or Temporary Register Content with AND - Compare Accumulator, Auxiliary, or Temporary Register Content with OR - Compare Accumulator, Auxiliary, or Temporary Register Content

Maximum
- Compare and Select Accumulator Content Minimum - Compare Memory with Immediate Value

Example
Syntax T1 = min(AC1, T1)
Before AC1 T1 CARRY

Description The content of AC1(150) is greater than the content of T1, the content of T1 remains the same and the CARRY status bit is set to 1.
After AC1 T1 CARRY 00 8000 0000 8020 1

00 8000 0000 8020 0

5-110

Instruction Set Descriptions

SPRU375G

Compare and Branch (compare goto)

Compare and Branch


Syntax Characteristics
No. [1] Syntax compare (uns(src RELOP K8)) goto L8 Parallel Enable Bit No Size 4 Cycles 7/6 Pipeline X

x/y cycles: x cycles = condition true, y cycles = condition false

Opcode Operands Description

0110 1111 FSSS ccxu KKKK KKKK LLLL LLLL K8, L8, RELOP, src This instruction performs a comparison operation between a source (src) register content and an 8-bit signed value, K8. The instruction performs a comparison in the D-unit ALU or in the A-unit ALU. The comparison is performed in the execute phase of the pipeline. If the result of the comparison is true, a branch occurs. The program branch address is specified as an 8-bit signed offset, L8, relative to the program counter (PC). Use this instruction to branch within a 256-byte window centered on the current PC value. The comparison depends on the optional uns keyword and, for accumulator comparisons, on M40.
- In the case of an unsigned comparison, the 8-bit constant, K8, is zero

extended to:
J J

16 bits, if the source (src) operand is an auxiliary or temporary register. 40 bits, if the source (src) operand is an accumulator.

- In the case of a signed comparison, the 8-bit constant, K8, is sign

extended to:
J J

16 bits, if the source (src) operand is an auxiliary or temporary register. 40 bits, if the source (src) operand is an accumulator.

As the following table shows, the uns keyword specifies an unsigned comparison; M40 defines the comparison bit width of the accumulator.
uns no no yes yes src TAx ACx TAx ACx Comparison Type 16-bit signed comparison in A-unit ALU if M40 = 0, 32-bit signed comparison in D-unit ALU if M40 = 1, 40-bit signed comparison in D-unit ALU 16-bit unsigned comparison in A-unit ALU if M40 = 0, 32-bit unsigned comparison in D-unit ALU if M40 = 1, 40-bit unsigned comparison in D-unit ALU

SPRU375G

Instruction Set Descriptions

5-111

Compare and Branch (compare goto)

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the conditions testing the accumulator contents are all performed as if M40 was set to 1. Status Bits Affected by Affects Repeat See Also C54CM, M40 none

This instruction can be repeated. See the following other related instructions:
- Branch Conditionally - Branch Unconditionally - Branch on Auxiliary Register Not Zero

Example 1
Syntax compare (AC0 >= #12) goto branch Description The signed content of AC0 is compared to the sign-extended 8-bit value (12). Because the content of AC0 is greater than or equal to 12, program control is passed to the program address label defined by branch (004078h).

compare (AC0 >= #12) goto branch branch :


Before AC0 PC 00 0000 3000 004071 After AC0 PC 00 0000 3000 004078

address: 00 4075 00 4078

5-112

Instruction Set Descriptions

SPRU375G

Compare and Branch (compare goto)

Example 2
Syntax compare (T1 != #1) goto branch Description The content of T1 is not equal to 1, program control is passed to the next instruction (the branch is not taken).

compare (T1 != #1) goto branch branch :


Before T1 PC 0000 4079 After T1 PC 0000 407D

address: 00407D 004080

SPRU375G

Instruction Set Descriptions

5-113

Compare and Select Accumulator Content Maximum (max_diff)

Compare and Select Accumulator Content Maximum


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax max_diff(ACx, ACy, ACz, ACw) max_diff_dbl(ACx, ACy, ACz, ACw, TRNx)

Size 3 3

Cycles 1 1

Pipeline X X

Description

Instruction [1] performs two paralleled 16-bit extremum selections in the D-unit ALU. Instruction [2] performs a single 40-bit extremum selection in the D-unit ALU. Affected by Affects C54CM, M40, SATD ACOVw, CARRY

Status Bits

See Also

See the following other related instructions:


- Compare Accumulator, Auxiliary, or Temporary Register Content - Compare Accumulator, Auxiliary, or Temporary Register Content Maximum - Compare and Select Accumulator Content Minimum

5-114

Instruction Set Descriptions

SPRU375G

Compare and Select Accumulator Content Maximum (max_diff)

Compare and Select Accumulator Content Maximum


Syntax Characteristics
No. [1] Syntax max_diff(ACx, ACy, ACz, ACw) Parallel Enable Bit Yes Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACw, ACx, ACy, ACz

0001 000E DDSS 1100 SSDD nnnn

This instruction performs two paralleled 16-bit extremum selections in the D-unit ALU in one cycle. This instruction performs a dual maximum search. The two operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulators are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit data path). For each datapath (high and low):
- ACx and ACy are the source accumulators. - The differences are stored in accumulator ACw. - The subtraction computation is equivalent to the dual 16-bit subtractions

instruction.
- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVw) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh (positive) and 8000h (negative). For the operations performed in the ALU high part, saturation values are 00 7FFFh (positive) and FF 8000h (negative).
Instruction Set Descriptions 5-115

SPRU375G

Compare and Select Accumulator Content Maximum (max_diff)

- The extremum is stored in accumulator ACz. - The extremum is searched considering the selected bit width of the

accumulators:
J J

for the lower 16-bit data path, the sign bit is extracted at bit position 15 for the higher 24-bit data path, the sign bit is extracted at bit position 31

- According to the extremum found, a decision bit is shifted in TRNx from

the MSBs to the LSBs:


J J

TRN0 tracks the decision for the high part data path TRN1 tracks the decision for the low part data path If the extremum value is the ACx high or low part, the decision bit is cleared to 0; otherwise, it is set to 1:
TRN0 = TRN0 >> #1 TRN1 = TRN1 >> #1 ACw(3916) = ACy(3916) ACx(3916) ACw(150) = ACy(150) ACx(150) If (ACx(3116) > ACy(3116)) { bit(TRN0, 15) = #0 ; ACz(3916) = ACx(3916) } else { bit(TRN0, 15) = #1 ; ACz(3916) = ACy(3916) } if (ACx(150) > ACy(150)) { bit(TRN1, 15) = #0 ; ACz(150) = ACx(150) } else { bit(TRN1, 15) = #1 ; ACz(150) = ACy(150) }

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit data path (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat
5-116

C54CM, SATD ACOVw, CARRY

This instruction can be repeated.


Instruction Set Descriptions SPRU375G

Compare and Select Accumulator Content Maximum (max_diff)

Example
Syntax max_diff(AC0, AC1, AC2, AC1) Description The difference is stored in AC1. The content of AC0(3916) is subtracted from the content of AC1(3916) and the result is stored in AC1(3916). Since SATD = 1 and an overflow is detected, AC1(3916) = FF 8000h (saturation). The content of AC0(150) is subtracted from the content of AC1(150) and the result is stored in AC1(150). The maximum is stored in AC2. The content of TRN0 and TRN1 is shifted right 1 bit. AC0(3116) is greater than AC1(3116), AC0(3916) is stored in AC2(3916) and TRN0(15) is cleared to 0. AC0(150) is greater than AC1(150), AC0(150) is stored in AC2(150) and TRN1(15) is cleared to 0.
After AC0 AC1 AC2 SATD TRN0 TRN1 ACOV1 CARRY

Before AC0 AC1 AC2 SATD TRN0 TRN1 ACOV1 CARRY

10 2400 2222 90 0000 0000 00 0000 0000 1 1000 0100 0 1

10 2400 2222 FF 8000 DDDE 10 2400 2222 1 0800 0080 1 0

SPRU375G

Instruction Set Descriptions

5-117

Compare and Select Accumulator Content Maximum (max_diff_dbl)

Compare and Select Accumulator Content Maximum


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [2a] [2b]

Syntax max_diff_dbl(ACx, ACy, ACz, ACw, TRN0) max_diff_dbl(ACx, ACy, ACz, ACw, TRN1)

Size 3 3

Cycles 1 1

Pipeline X X

Opcode

TRN0 TRN1

0001 000E DDSS 1101 SSDD xxx0 0001 000E DDSS 1101 SSDD xxx1

Operands Description

ACw, ACx, ACy, ACz, TRNx This instruction performs a single 40-bit extremum selection in the D-unit ALU. This instruction performs a maximum search.
- ACx and ACy are the two source accumulators. - The difference between the source accumulators is stored in accumulator

ACw.
- The subtraction computation is equivalent to the subtraction instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD.
- The extremum between the source accumulators is stored in accumulator

ACz.
- The extremum computation is similar to the compare register content

maximum instruction. However, the CARRY status bit is not updated by the extremum search but by the subtraction instruction.
- According to the extremum found, a decision bit is shifted in TRNx from

the MSBs to the LSBs. If the extremum value is ACx, the decision bit is cleared to 0; otherwise, it is set to 1.

5-118

Instruction Set Descriptions

SPRU375G

Compare and Select Accumulator Content Maximum (max_diff_dbl)

If M40 = 0:
TRNx = TRNx >> #1 ACw(390) = ACy(390) ACx(390) If (ACx(310) > ACy(310)) { bit(TRNx, 15) = #0 ; ACz(390) = ACx(390) } else { bit(TRNx, 15) = #1 ; ACz(390) = ACy(390) }

If M40 = 1:
TRNx = TRNx >> #1 ACw(390) = ACy(390) ACx(390) If (ACx(390) > ACy(390)) { bit(TRNx, 15) = #0 ; ACz(390) = ACx(390) } else { bit(TRNx, 15) = #1 ; ACz(390) = ACy(390) }

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if M40 status bit was locally set to 1. However to ensure compatibility versus overflow detection and saturation of the destination accumulator, this instruction must be executed with M40 = 0. Status Bits Affected by Affects Repeat Example
Syntax max_diff_dbl(AC0, AC1, AC2, AC3, TRN1) Description The difference is stored in AC3. The content of AC0 is subtracted from the content of AC1 and the result is stored in AC3. The maximum is stored in AC2. The content of TRN1 is shifted right 1 bit. AC0 is greater than AC1, AC0 is stored in AC2 and TRN1(15) is cleared to 0.
10 00 10 F0 2400 8000 2400 5C00 2222 DDDE 2222 BBBC 1 1 0040 0 0

C54CM, M40, SATD ACOVw, CARRY

This instruction can be repeated.

Before AC0 AC1 AC2 AC3 M40 SATD TRN1 ACOV3 CARRY

10 00 00 00

2400 8000 0000 0000

2222 DDDE 0000 0000 1 1 0080 0 0

After AC0 AC1 AC2 AC3 M40 SATD TRN1 ACOV3 CARRY

SPRU375G

Instruction Set Descriptions

5-119

Compare and Select Accumulator Content Minimum (min_diff)

Compare and Select Accumulator Content Minimum


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax min_diff(ACx, ACy, ACz, ACw) min_diff_dbl(ACx, ACy, ACz, ACw, TRNx)

Size 3 3

Cycles 1 1

Pipeline X X

Description

Instruction [1] performs two paralleled 16-bit extremum selections in the D-unit ALU. Instruction [2] performs a single 40-bit extremum selection in the D-unit ALU. Affected by Affects C54CM, M40, SATD ACOVw, CARRY

Status Bits

See Also

See the following other related instructions:


- Compare Accumulator, Auxiliary, or Temporary Register Content - Compare Accumulator, Auxiliary, or Temporary Register Content Minimum - Compare and Select Accumulator Content Maximum

5-120

Instruction Set Descriptions

SPRU375G

Compare and Select Accumulator Content Minimum (min_diff)

Compare and Select Accumulator Content Minimum


Syntax Characteristics
No. [1] Syntax min_diff(ACx, ACy, ACz, ACw) Parallel Enable Bit Yes Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACw, ACx, ACy, ACz

0001 000E DDSS 1110 SSDD xxxx

This instruction performs two paralleled 16-bit extremum selections in the D-unit ALU in one cycle. This instruction performs a dual minimum search. The two operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulators are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit data path). For each datapath (high and low):
- ACx and ACy are the source accumulators. - The differences are stored in accumulator ACw. - The subtraction computation is equivalent to the dual 16-bit subtractions

instruction.
- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVw) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh (positive) and 8000h (negative). For the operations performed in the ALU high part, saturation values are 00 7FFFh (positive) and FF 8000h (negative).
Instruction Set Descriptions 5-121

SPRU375G

Compare and Select Accumulator Content Minimum (min_diff)

- The extremum is stored in accumulator ACz. - The extremum is searched considering the selected bit width of the

accumulators:
J J

for the lower 16-bit data path, the sign bit is extracted at bit position 15 for the higher 24-bit data path, the sign bit is extracted at bit position 31

- According to the extremum found, a decision bit is shifted in TRNx from

the MSBs to the LSBs:


J J

TRN0 tracks the decision for the high part data path TRN1 tracks the decision for the low part data path If the extremum value is the ACx high or low part, the decision bit is cleared to 0; otherwise, it is set to 1:
TRN0 = TRN0 >> #1 TRN1 = TRN1 >> #1 ACw(3916) = ACy(3916) ACx(3916) ACw(150) = ACy(150) ACx(150) If (ACx(3116) < ACy(3116)) { bit(TRN0, 15) = #0 ; ACz(3916) = ACx(3916) } else { bit(TRN0, 15) = #1 ; ACz(3916) = ACy(3916) } if (ACx(150) < ACy(150)) { bit(TRN1, 15) = #0 ; ACz(150) = ACx(150) } else { bit(TRN1, 15) = #1 ; ACz(150) = ACy(150) }

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit data path (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat
5-122

C54CM, SATD ACOVw, CARRY

This instruction can be repeated.


Instruction Set Descriptions SPRU375G

Compare and Select Accumulator Content Minimum (min_diff)

Example
Syntax min_diff(AC0, AC1, AC2, AC1) Description The difference is stored in AC1. The content of AC0(3916) is subtracted from the content of AC1(3916) and the result is stored in AC1(3916). Since SATD = 1 and an overflow is detected, AC1(3916) = FF 8000h (saturation). The content of AC0(150) is subtracted from the content of AC1(150) and the result is stored in AC1(150). The minimum is stored in AC2 (sign bit extracted at bits 31 and 15). The content of TRN0 and TRN1 is shifted right 1 bit. AC0(3116) is greater than or equal to AC1(3116), AC1(3916) is stored in AC2(3916) and TRN0(15) is set to 1. AC0(150) is greater than or equal to AC1(150), AC1(150) is stored in AC2(150) and TRN1(15) is set to 1.
After AC0 AC1 AC2 SATD TRN0 TRN1 ACOV1 CARRY

Before AC0 AC1 AC2 SATD TRN0 TRN1 ACOV1 CARRY

10 2400 2222 00 8000 DDDE 10 2400 2222 1 0800 0040 0 0

10 2400 2222 FF 8000 BBBC 00 8000 DDDE 1 8400 8020 1 1

SPRU375G

Instruction Set Descriptions

5-123

Compare and Select Accumulator Content Minimum (min_diff_dbl)

Compare and Select Accumulator Content Minimum


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [2a] [2b]

Syntax min_diff_dbl(ACx, ACy, ACz, ACw, TRN0) min_diff_dbl(ACx, ACy, ACz, ACw, TRN1)

Size 3 3

Cycles 1 1

Pipeline X X

Opcode

TRN0 TRN1

0001 000E DDSS 1111 SSDD xxx0 0001 000E DDSS 1111 SSDD xxx1

Operands Description

ACw, ACx, ACy, ACz, TRNx This instruction performs a single 40-bit extremum selection in the D-unit ALU. This instruction performs a minimum search.
- ACx and ACy are the two source accumulators. - The difference between the source accumulators is stored in accumulator

ACw.
- The subtraction computation is equivalent to the subtraction instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD.
- The extremum between the source accumulators is stored in accumulator

ACz.
- The extremum computation is similar to the compare register content

maximum instruction. However, the CARRY status bit is not updated by the extremum search but by the subtraction instruction.
- According to the extremum found, a decision bit is shifted in TRNx from

the MSBs to the LSBs. If the extremum value is ACx, the decision bit is cleared to 0; otherwise, it is set to 1.

5-124

Instruction Set Descriptions

SPRU375G

Compare and Select Accumulator Content Minimum (min_diff_dbl)

If M40 = 0:
TRNx = TRNx >> #1 ACw(390) = ACy(390) ACx(390) If (ACx(310) < ACy(310)) { bit(TRNx, 15) = #0 ; ACz(390) = ACx(390) } else { bit(TRNx, 15) = #1 ; ACz(390) = ACy(390) }

If M40 = 1:
TRNx = TRNx >> #1 ACw(390) = ACy(390) ACx(390) If (ACx(390) < ACy(390)) { bit(TRNx, 15) = #0 ; ACz(390) = ACx(390) } else { bit(TRNx, 15) = #1 ; ACz(390) = ACy(390) }

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if M40 status bit was locally set to 1. However to ensure compatibility versus overflow detection and saturation of the destination accumulator, this instruction must be executed with M40 = 0. Status Bits Affected by Affects Repeat Example
Syntax min_diff_dbl(AC0, AC1, AC2, AC3, TRN0) Description The difference is stored in AC3. The content of AC0 is subtracted from the content of AC1 and the result is stored in AC3. The minimum is stored in AC2. The content of TRN0 is shifted right 1 bit. If AC0 is less than AC1, AC0 is stored in AC2 and TRN0(15) is cleared to 0; otherwise, AC1 is stored in AC2 and TRN0(15) is set to 1.

C54CM, M40, SATD ACOVw, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-125

Compare Memory with Immediate Value

Compare Memory with Immediate Value


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax TC1 = (Smem == K16) TC2 = (Smem == K16)

Size 4 4

Cycles 1 1

Pipeline X X

Opcode

TC1 TC2

1111 0000 AAAA AAAI KKKK KKKK KKKK KKKK 1111 0001 AAAA AAAI KKKK KKKK KKKK KKKK

Operands Description

K16, Smem, TCx This instruction performs a comparison in the A-unit ALU. The data memory operand Smem is compared to the 16-bit signed constant, K16. If they are equal, the TCx status bit is set to 1; otherwise, it is cleared to 0.
if((Smem) == K16) TCx = 1 else TCx = 0

Status Bits

Affected by Affects

none TCx

Repeat

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated. See the following other related instructions:
- Compare Accumulator, Auxiliary, or Temporary Register Content

See Also

5-126

Instruction Set Descriptions

SPRU375G

Compare Memory with Immediate Value

Example 1
Syntax TC1 = (*AR1+ == #400h)
Before AR1 0285 TC1

Description The content addressed by AR1 is compared to the signed 16-bit value (400h). Because they are equal, TC1 is set to 1. AR1 is incremented by 1.
After AR1 0285 TC1 0286 0400 1

0285 0400 0

Example 2
Syntax TC2 = (*AR1 == #400h)
Before AR1 0285 TC2

Description The content addressed by AR1 is compared to the signed 16-bit value (400h). Because they are not equal, TC2 is cleared to 0.
After AR1 0285 TC2 0285 0000 0

0285 0000 0

SPRU375G

Instruction Set Descriptions

5-127

Complement Accumulator, Auxiliary, or Temporary Register Bit (cbit)

Complement Accumulator, Auxiliary, or Temporary Register Bit


Syntax Characteristics
No. [1] Syntax cbit(src, Baddr) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description Baddr, src

1110 1100 AAAA AAAI FSSS 011x

This instruction performs a bit manipulation:


- In the D-unit ALU, if the source (src) register operand is an accumulator. - In the A-unit ALU, if the source (src) register operand is an auxiliary or

temporary register. The instruction complements a single bit, as defined by the bit addressing mode, Baddr, of the source register. The generated bit address must be within:
- 039 when accessing accumulator bits (only the 6 LSBs of the generated

bit address are used to determine the bit position). If the generated bit address is not within 039, the selected register bit value does not change.
- 015 when accessing auxiliary or temporary register bits (only the 4 LSBs

of the generated address are used to determine the bit position). Status Bits Affected by Affects Repeat See Also none none

This instruction can be repeated. See the following other related instructions:
- Clear Accumulator, Auxiliary, or Temporary Register Bit - Complement Accumulator, Auxiliary, or Temporary Register Content - Complement Memory Bit - Set Accumulator, Auxiliary, or Temporary Register Bit

Example
Syntax cbit(T0, AR1)
Before T0 AR1 E000 000C

Description The bit at the position defined by the content of AR1(30) in T0 is complemented.
After T0 AR1 F000 000C

5-128

Instruction Set Descriptions

SPRU375G

Complement Accumulator, Auxiliary, or Temporary Register Content

Complement Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
No. [1] Syntax dst = ~src Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst, src

0011 011E FSSS FDDD

This instruction computes the 1s complement (bitwise complement) of the content of the source register (src).
- When the destination (dst) operand is an accumulator: J J

The bit inversion is performed on 40 bits in the D-unit ALU and the result is stored in the destination accumulator. If an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the auxiliary or temporary register are zero extended.

- When the destination (dst) operand is an auxiliary or temporary register: J J

The bit inversion is performed on 16 bits in the A-unit ALU and the result is stored in the destination auxiliary or temporary register. If an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation. none none

Status Bits

Affected by Affects

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Complement Accumulator, Auxiliary, or Temporary Register Bit - Negate Accumulator, Auxiliary, or Temporary Register Content

Example
Syntax AC1 = ~AC0
Before AC0 AC1 7E 2355 4FC0 00 2300 5678

Description The content of AC0 is complemented and the result is stored in AC1.
After AC0 AC1 7E 2355 4FC0 81 DCAA B03F

SPRU375G

Instruction Set Descriptions

5-129

Complement Memory Bit (cbit)

Complement Memory Bit


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax cbit(Smem, src)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Smem, src

1110 0011 AAAA AAAI FSSS 111x

This instruction performs a bit manipulation in the A-unit ALU. The instruction complements a single bit, as defined by the content of the source (src) operand, of a memory (Smem) location. The generated bit address must be within 015 (only the 4 LSBs of the register are used to determine the bit position).

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Clear Memory Bit - Complement Accumulator, Auxiliary, or Temporary Register Bit - Complement Accumulator, Auxiliary, or Temporary Register Content - Set Memory Bit

Example
Syntax cbit(*AR3, AC0) Description The bit at the position defined by AC0(30) in the content addressed by AR3 is complemented.

5-130

Instruction Set Descriptions

SPRU375G

Compute Exponent of Accumulator Content (exp)

Compute Exponent of Accumulator Content


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax Tx = exp(ACx)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Tx

0001 000E xxSS 1000 xxdd xxxx

This instruction computes the exponent of the source accumulator ACx in the D-unit shifter. The result of the operation is stored in the temporary register Tx. The A-unit ALU is used to make the move operation. This exponent is a signed 2s-complement value in the 8 to 31 range. The exponent is computed by calculating the number of leading bits in ACx and subtracting 8 from this value. The number of leading bits is the number of shifts to the MSBs needed to align the accumulator content on a signed 40-bit representation. ACx is not modified after the execution of this instruction. If ACx is equal to 0, Tx is loaded with 0. This instruction produces in Tx the opposite result than computed by the Compute Mantissa and Exponent of Accumulator Content instruction (page 5-132).

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Compute Mantissa and Exponent of Accumulator Content

Example
Syntax T1 = exp(AC0) Description The exponent is computed by subtracting 8 from the number of leading bits in the content of AC0. The exponent value is a signed 2s-complement value in the 8 to 31 range and is stored in T1.
After AC0 T1

Before AC0 T1

FF FFFF FFCB 0000

FF FFFF FFCB 0019

SPRU375G

Instruction Set Descriptions

5-131

Compute Mantissa and Exponent of Accumulator Content (mant, exp)

Compute Mantissa and Exponent of Accumulator Content


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax ACy = mant(ACx), Tx = exp(ACx)

Size 3

Cycles 1

Pipeline X2

Opcode Operands Description ACx, ACy, Tx

0001 000E DDSS 1001 xxdd xxxx

This instruction computes the exponent and mantissa of the source accumulator ACx. The computation of the exponent and the mantissa is executed in the D-unit shifter. The exponent is computed and stored in the temporary register Tx. The A-unit is used to make the move operation. The mantissa is stored in the accumulator ACy. The exponent is a signed 2s-complement value in the 31 to 8 range. The exponent is computed by calculating the number of leading bits in ACx and subtracting this value from 8. The number of leading bits is the number of shifts to the MSBs needed to align the accumulator content on a signed 40-bit representation. The mantissa is obtained by aligning the ACx content on a signed 32-bit representation. The mantissa is computed and stored in ACy.
- The shift operation is performed on 40 bits. J J

When shifting to the LSBs, bit 39 of ACx is extended to bit 31. When shifting to the MSBs, 0 is inserted at bit position 0.

- If ACx is equal to 0, Tx is loaded with 8000h.

This instruction produces in Tx the opposite result than computed by the Compute Exponent of Accumulator Content instruction (page 5-131). Status Bits Affected by Affects Repeat See Also none none

This instruction can be repeated. See the following other related instructions:
- Compute Exponent of Accumulator Content

5-132

Instruction Set Descriptions

SPRU375G

Compute Mantissa and Exponent of Accumulator Content (mant, exp)

Example 1
Syntax AC1 = mant(AC0), T1 = exp(AC0) Description The exponent is computed by subtracting the number of leading bits in the content of AC0 from 8. The exponent value is a signed 2s-complement value in the 31 to 8 range and is stored in T1. The mantissa is computed by aligning the content of AC0 on a signed 32-bit representation. The mantissa value is stored in AC1.
After AC0 AC1 T1

Before AC0 AC1 T1

21 0A0A 0A0A FF FFFF F001 0000

21 0A0A 0A0A 00 4214 1414 0007

Example 2
Syntax AC1 = mant(AC0), T1 = exp(AC0) Description The exponent is computed by subtracting the number of leading bits in the content of AC0 from 8. The exponent value is a signed 2s-complement value in the 31 to 8 range and is stored in T1. The mantissa is computed by aligning the content of AC0 on a signed 32-bit representation. The mantissa value is stored in AC1.
After AC0 AC1 T1

Before AC0 AC1 T1

00 E804 0000 FF FFFF F001 0000

00 E804 0000 00 7402 0000 0001

SPRU375G

Instruction Set Descriptions

5-133

Count Accumulator Bits (count)

Count Accumulator Bits


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax Tx = count(ACx, ACy, TC1) Tx = count(ACx, ACy, TC2)

Size 3 3

Cycles 1 1

Pipeline X X

Opcode

TC1 TC2

0001 000E xxSS 1010 SSdd xxx0 0001 000E XXSS 1010 SSdd xxx1

Operands Description

ACx, ACy, Tx, TCx This instruction performs bit field manipulation in the D-unit shifter. The result is stored in the selected temporary register (Tx). The A-unit ALU is used to make the move operation. Accumulator ACx is ANDed with accumulator ACy. The number of bits set to 1 in the intermediary result is evaluated and stored in the selected temporary register (Tx). If the number of bits is even, the selected TCx status bit is cleared to 0. If the number of bits is odd, the selected TCx status bit is set to 1.

Status Bits

Affected by Affects

none TCx

Repeat Example
Syntax

This instruction can be repeated.

Description The content of AC1 is ANDed with the content of AC2, the number of bits set to 1 in the result is evaluated and stored in T1. The number of bits set to 1 is odd, TC1 is set to 1.
After 7E 2355 4FC0 0F E340 5678 0000 0 AC1 AC2 T1 TC1 7E 2355 4FC0 0F E340 5678 000B 1

T1 = count(AC1, AC2, TC1)

Before AC1 AC2 T1 TC1

5-134

Instruction Set Descriptions

SPRU375G

Dual 16Bit Additions

Dual 16-Bit Additions


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax HI(ACy) = HI(Lmem) + HI(ACx), LO(ACy) = LO(Lmem) + LO(ACx) HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) + Tx

Size 3 3

Cycles 1 1

Pipeline X X

Description

These instructions perform two paralleled addition operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).

Status Bits

Affected by Affects

C54CM, SATD, SXMD ACOVx, ACOVy, CARRY

See Also

See the following other related instructions:


- Addition - Addition or Subtraction Conditionally - Addition or Subtraction Conditionally with Shift - Addition with Parallel Store Accumulator Content to Memory - Addition, Subtraction, or Move Accumulator Content Conditionally - Dual 16-Bit Addition and Subtraction - Dual 16-Bit Subtraction and Addition

SPRU375G

Instruction Set Descriptions

5-135

Dual 16Bit Additions

Dual 16-Bit Additions


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax HI(ACy) = HI(Lmem) + HI(ACx), LO(ACy) = LO(Lmem) + LO(ACx)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Lmem

1110 1110 AAAA AAAI SSDD 000x

This instruction performs two paralleled addition operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The data memory operand dbl(Lmem) is divided into two 16-bit parts: J J

the lower part is used as one of the 16-bit operands of the ALU low part the higher part is sign extended to 24 bits according to SXMD and is used in the ALU high part

- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVy) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
5-136 Instruction Set Descriptions SPRU375G

Dual 16Bit Additions

- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = HI(*AR3) + HI(AC1), LO(AC0) = LO(*AR3) + LO(AC1) Description Both instructions are performed in parallel. When the Lmem address is even (AR3 = even): The content of AC1(3916) is added to the content addressed by AR3 and the result is stored in AC0(3916). The content of AC1(150) is added to the content addressed by AR3 + 1 and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-137

Dual 16Bit Additions

Dual 16-Bit Additions


Syntax Characteristics
No. [2] Syntax HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) + Tx Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem, Tx

1110 1110 AAAA AAAI ssDD 100x

This instruction performs two paralleled addition operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The temporary register Tx: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- The data memory operand dbl(Lmem) is divided into two 16-bit parts: J J

the lower part is used as one of the 16-bit operands of the ALU low part the higher part is sign extended to 24 bits according to SXMD and is used in the ALU high part

- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVx) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.
SPRU375G

5-138

Instruction Set Descriptions

Dual 16Bit Additions

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = HI(*AR3) + T0, LO(AC0) = LO(*AR3) + T0 Description Both instructions are performed in parallel. When the Lmem address is even (AR3 = even): The content of T0 is added to the content addressed by AR3 and the result is stored in AC0(3916). The duplicated content of T0 is added to the content addressed by AR3 + 1 and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-139

Dual 16Bit Addition and Subtraction

Dual 16-Bit Addition and Subtraction


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax HI(ACx) = Smem + Tx, LO(ACx) = Smem Tx HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) Tx

Size 3 3

Cycles 1 1

Pipeline X X

Description

These instructions perform two paralleled addition and subtraction operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).

Status Bits

Affected by Affects

C54CM, SATD, SXMD ACOVx, ACOVy, CARRY

See Also

See the following other related instructions:


- Addition - Dual 16-Bit Additions - Dual 16-Bit Subtractions - Dual 16-Bit Subtraction and Addition - Subtraction

5-140

Instruction Set Descriptions

SPRU375G

Dual 16Bit Addition and Subtraction

Dual 16-Bit Addition and Subtraction


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax HI(ACx) = Smem + Tx, LO(ACx) = Smem Tx

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Smem, Tx

1101 1110 AAAA AAAI ssDD 1000

This instruction performs two paralleled arithmetical operations in one cycle: an addition and subtraction. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The data memory operand Smem: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- The temporary register Tx: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVx) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
SPRU375G Instruction Set Descriptions 5-141

Dual 16Bit Addition and Subtraction

- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC1) = *AR1 + T1, LO(AC1) = *AR1 T1 Description Both instructions are performed in parallel. The content addressed by AR1 is added to the content of T1 and the result is stored in AC1(3916). The duplicated content of T1 is subtracted from the duplicated content addressed by AR1 and the result is stored in AC1(150).
After 00 2300 0000 4000 0201 E300 1 1 0 0 AC1 T1 AR1 201 SXMD M40 ACOV0 CARRY 00 2300 A300 4000 0201 E300 1 1 0 1

C54CM, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

Before AC1 T1 AR1 201 SXMD M40 ACOV0 CARRY

5-142

Instruction Set Descriptions

SPRU375G

Dual 16Bit Addition and Subtraction

Dual 16-Bit Addition and Subtraction


Syntax Characteristics
No. [2] Syntax HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) Tx Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem, Tx

1110 1110 AAAA AAAI ssDD 110x

This instruction performs two paralleled arithmetical operations in one cycle: an addition and subtraction. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The temporary register Tx: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- The data memory operand dbl(Lmem) is divided into two 16-bit parts: J J

the lower part is used as one of the 16-bit operands of the ALU low part the higher part is sign extended to 24 bits according to SXMD and is used in the ALU high part

- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVx) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.
Instruction Set Descriptions 5-143

SPRU375G

Dual 16Bit Addition and Subtraction

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = HI(*AR3) + T0, LO(AC0) = LO(*AR3) T0 Description Both instructions are performed in parallel. When the Lmem address is even (AR3 = even): The content of T0 is added to the content addressed by AR3 and the result is stored in AC0(3916). The duplicated content of T0 is subtracted from the content addressed by AR3 + 1 and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

5-144

Instruction Set Descriptions

SPRU375G

Dual 16Bit Subtractions

Dual 16-Bit Subtractions


Syntax Characteristics
Parallel Enable bit No No No No

No. [1] [2] [3] [4]

Syntax HI(ACy) = HI(ACx) HI(Lmem), LO(ACy) = LO(ACx) LO(Lmem) HI(ACy) = HI(Lmem) HI(ACx), LO(ACy) = LO(Lmem) LO(ACx) HI(ACx) = Tx HI(Lmem), LO(ACx) = Tx LO(Lmem) HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) Tx

Size 3 3 3 3

Cycles 1 1 1 1

Pipeline X X X X

Description

These instructions perform two paralleled subtraction operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).

Status Bits

Affected by Affects

C54CM, SATD, SXMD ACOVx, ACOVy, CARRY

See Also

See the following other related instructions:


- Addition or Subtraction Conditionally - Addition or Subtraction Conditionally with Shift - Addition, Subtraction, or Move Accumulator Content Conditionally - Dual 16-Bit Addition and Subtraction - Dual 16-Bit Subtraction and Addition - Subtract Conditionally - Subtraction - Subtraction with Parallel Store Accumulator Content to Memory

SPRU375G

Instruction Set Descriptions

5-145

Dual 16Bit Subtractions

Dual 16-Bit Subtractions


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax HI(ACy) = HI(ACx) HI(Lmem), LO(ACy) = LO(ACx) LO(Lmem)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Lmem

1110 1110 AAAA AAAI SSDD 001x

This instruction performs two paralleled subtraction operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit data path).
- The data memory operand dbl(Lmem) is divided into two 16-bit parts: J J

the lower part is used as one of the 16-bit operands of the ALU low part the higher part is sign extended to 24 bits according to SXMD and is used in the ALU high part

- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVy) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
5-146 Instruction Set Descriptions SPRU375G

Dual 16Bit Subtractions

- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = HI(AC1) HI(*AR3), LO(AC0) = LO(AC1) LO(*AR3) Description Both instructions are performed in parallel. When the Lmem address is even (AR3 = even): The content addressed by AR3 (sign extended to 24 bits) is subtracted from the content of AC1(3916) and the result is stored in AC0(3916). The content addressed by AR3 + 1 is subtracted from the content of AC1(150) and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-147

Dual 16Bit Subtractions

Dual 16-Bit Subtractions


Syntax Characteristics
Parallel Enable bit No

No. [2]

Syntax HI(ACy) = HI(Lmem) HI(ACx), LO(ACy) = LO(Lmem) LO(ACx)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Lmem

1110 1110 AAAA AAAI SSDD 010x

This instruction performs two paralleled subtraction operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The data memory operand dbl(Lmem) is divided into two 16-bit parts: J J

the lower part is used as one of the 16-bit operands of the ALU low part the higher part is sign extended to 24 bits according to SXMD and is used in the ALU high part

- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVy) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
5-148 Instruction Set Descriptions SPRU375G

Dual 16Bit Subtractions

- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = HI(*AR3) HI(AC1), LO(AC0) = LO(*AR3) LO(AC1) Description Both instructions are performed in parallel. When the Lmem address is even (AR3 = even): The content of AC1(3916) is subtracted from the content addressed by AR3 and the result is stored in AC0(3916). The content of AC1(150) is subtracted from the content addressed by AR3 + 1 and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-149

Dual 16Bit Subtractions

Dual 16-Bit Subtractions


Syntax Characteristics
No. [3] Syntax HI(ACx) = Tx HI(Lmem), LO(ACx) = Tx LO(Lmem) Parallel Enable bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem, Tx

1110 1110 AAAA AAAI ssDD 011x

This instruction performs two paralleled subtraction operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The temporary register Tx: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- The data memory operand dbl(Lmem) is divided into two 16-bit parts: J J

the lower part is used as one of the 16-bit operands of the ALU low part the higher part is sign extended to 24 bits according to SXMD and is used in the ALU high part

- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVx) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.
SPRU375G

5-150

Instruction Set Descriptions

Dual 16Bit Subtractions

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = T0 HI(*AR3), LO(AC0) = T0 LO(*AR3) Description Both instructions are performed in parallel. When the Lmem address is even (AR3 = even): The content addressed by AR3 is subtracted from the content of T0 and the result is stored in AC0(3916). The content addressed by AR3 + 1 is subtracted from the duplicated content of T0 and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-151

Dual 16Bit Subtractions

Dual 16-Bit Subtractions


Syntax Characteristics
No. [4] Syntax HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) Tx Parallel Enable bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Tx, Lmem

1110 1110 AAAA AAAI ssDD 101x

This instruction performs two paralleled subtraction operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The temporary register Tx: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- The data memory operand dbl(Lmem) is divided into two 16-bit parts: J J

the lower part is used as one of the 16-bit operands of the ALU low part the higher part is sign extended to 24 bits according to SXMD and is used in the ALU high part

- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVx) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.
SPRU375G

5-152

Instruction Set Descriptions

Dual 16Bit Subtractions

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = HI(*AR3) T0, LO(AC0) = LO(*AR3) T0 Description Both instructions are performed in parallel. When the Lmem address is even (AR3 = even): The content of T0 is subtracted from the content addressed by AR3 and the result is stored in AC0(3916). The duplicated content of T0 is subtracted from the content addressed by AR3 + 1 and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-153

Dual 16Bit Subtraction and Addition

Dual 16-Bit Subtraction and Addition


Syntax Characteristics
Parallel Enable bit No No

No. [1] [2]

Syntax HI(ACx) = Smem Tx, LO(ACx) = Smem + Tx HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) + Tx

Size 3 3

Cycles 1 1

Pipeline X X

Description

These instructions perform two paralleled subtraction and addition operations in one cycle. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).

Status Bits

Affected by Affects

C54CM, SATD, SXMD ACOVx, ACOVy, CARRY

See Also

See the following other related instructions:


- Addition - Dual 16-Bit Additions - Dual 16-Bit Addition and Subtraction - Dual 16-Bit Subtractions - Subtraction

5-154

Instruction Set Descriptions

SPRU375G

Dual 16Bit Subtraction and Addition

Dual 16-Bit Subtraction and Addition


Syntax Characteristics
Parallel Enable bit No

No. [1]

Syntax HI(ACx) = Smem Tx, LO(ACx) = Smem + Tx

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Smem, Tx

1101 1110 AAAA AAAI ssDD 1001

This instruction performs two paralleled arithmetical operations in one cycle: a subtraction and addition. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The data memory operand Smem: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- The temporary register Tx: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVx) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
SPRU375G Instruction Set Descriptions 5-155

Dual 16Bit Subtraction and Addition

- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = *AR3 T0, LO(AC0) = *AR3 + T0 Description Both instructions are performed in parallel. The content of T0 is subtracted from the content addressed by AR3 and the result is stored in AC0(3916). The duplicated content of T0 is added to the duplicated content addressed by AR3 and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

5-156

Instruction Set Descriptions

SPRU375G

Dual 16Bit Subtraction and Addition

Dual 16-Bit Subtraction and Addition


Syntax Characteristics
No. [2] Syntax HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) + Tx Parallel Enable bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem, Tx

1110 1110 AAAA AAAI ssDD 111x

This instruction performs two paralleled arithmetical operations in one cycle: a subtraction and addition. The operations are executed on 40 bits in the D-unit ALU that is configured locally in dual 16-bit mode. The 16 lower bits of both the ALU and the accumulator are separated from their higher 24 bits (the 8 guard bits are attached to the higher 16-bit datapath).
- The temporary register Tx: J J

is used as one of the 16-bit operands of the ALU low part is duplicated and, according to SXMD, sign extended to 24 bits to be used in the ALU high part

- The data memory operand dbl(Lmem) is divided into two 16-bit parts: J J

the lower part is used as one of the 16-bit operands of the ALU low part the higher part is sign extended to 24 bits according to SXMD and is used in the ALU high part

- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- For each of the two computations performed in the ALU, an overflow

detection is made. If an overflow is detected on any of the data paths, the destination accumulator overflow status bit (ACOVx) is set.
J J

For the operations performed in the ALU low part, overflow is detected at bit position 15. For the operations performed in the ALU high part, overflow is detected at bit position 31.
Instruction Set Descriptions 5-157

SPRU375G

Dual 16Bit Subtraction and Addition

- For all instructions, the carry of the operation performed in the ALU high

part is reported in the CARRY status bit. The CARRY status bit is always extracted at bit position 31.
- Independently on each data path, if SATD = 1 when an overflow is

detected on the data path, a saturation is performed:


J J

For the operations performed in the ALU low part, saturation values are 7FFFh and 8000h. For the operations performed in the ALU high part, saturation values are 00 7FFFh and FF 8000h.

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, this instruction is executed as if SATD is locally cleared to 0. Overflow is only detected and reported for the computation performed in the higher 24-bit datapath (overflow is detected at bit position 31). Status Bits Affected by Affects Repeat Example
Syntax HI(AC0) = HI(*AR3) T0, LO(AC0) = LO(*AR3) + T0 Description Both instructions are performed in parallel. When the Lmem address is even (AR3 = even): The content of T0 is subtracted from the content addressed by AR3 and the result is stored in AC0(3916). The duplicated content of T0 is added to the content addressed by AR3 + 1 and the result is stored in AC0(150).

C54CM, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

5-158

Instruction Set Descriptions

SPRU375G

Execute Conditionally (if execute)

Execute Conditionally
Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax if (cond) execute(AD_Unit) if (cond) execute(D_Unit)

Size 2 2

Cycles 1 1

Pipeline AD X

Description

These instructions evaluate a single condition defined by the cond field and allow you to control execution of all operations implied by the instruction or part of the instruction. See Table 13 for a list of conditions. Instruction [1] allows you to control the entire execution flow from the address phase to the execute phase of the pipeline. Instruction [2] allows you to only control the execution flow from the execute phase of the pipeline. The use of a label, where control of the execute conditionally instruction ends, is optional.
- These instructions may be executed alone. - These instructions may be executed with two paralleled instructions. - These instructions may be executed with the instruction with which it is

paralleled.
- These instructions may be executed with the previous instruction. - These instructions may be executed with the previous instruction and two

paralleled instructions.
- These instructions cannot be repeated. - These instructions cannot be used as the last instruction in a repeat loop

structure.
- These instructions cannot control the execution of the following program

control instructions:
goto call return trap (cond) goto (cond) call (cond) return localrepeat intr idle reset repeat blockrepeat (cond) execute(AD_unit) (cond) execute(D_unit) while (cond) repeat return_int

Status Bits

Affected by Affects

ACOVx, CARRY, C54CM, M40, TCx ACOVx


Instruction Set Descriptions 5-159

SPRU375G

Execute Conditionally (if execute)

Execute Conditionally
Syntax Characteristics
No. [1] Syntax if (cond) execute(AD_Unit) Parallel Enable Bit No Size 2 Cycles 1 Pipeline AD

Opcode

1001 0110 0CCC CCCC 1001 1110 0CCC CCCC 1001 1111 0CCC CCCC The assembler selects the opcode depending on the instruction position in a paralleled pair.

Operands Description

cond This instruction evaluates a single condition defined by the cond field and allows you to control the execution flow of an instruction, or instructions, from the address phase to the execute phase of the pipeline. See Table 13 for a list of conditions. When this instruction moves into the address phase of the pipeline, the condition specified in the cond field is evaluated. If the tested condition is true, the conditional instruction(s) is read and executed; if the tested condition is false, the conditional instruction(s) is not read and program control is passed to the instruction following the conditional instruction(s) or to the program address defined by label. There is a 3-cycle latency for the condition testing.
- This instruction may be executed alone:

if(cond) execute(AD_unit) instruction_executes_conditionally label:


- This instruction may be executed with two paralleled instructions:

if(cond) execute(AD_unit) instruction_1_executes_conditionally || instruction_2_executes_conditionally label:


- This instruction may be executed with the instruction with which it is

paralleled:
if(cond) execute(AD_unit) || instruction_executes_conditionally label:
5-160 Instruction Set Descriptions SPRU375G

Execute Conditionally (if execute)

- This instruction may be executed with a previous instruction:

previous_instruction || if(cond) execute(AD_unit) instruction_executes_conditionally label:


- This instruction may be executed with a previous instruction and two

paralleled instructions:
previous_instruction || if(cond) execute(AD_unit) instruction_1_executes_conditionally || instruction_2_executes_conditionally label:

This instruction cannot be used as the last instruction in a repeat loop structure. This instruction cannot control the execution of the following program control instructions:
goto call return trap (cond) goto (cond) call (cond) return localrepeat intr idle reset repeat blockrepeat (cond) execute(AD_unit) (cond) execute(D_unit) while (cond) repeat return_int

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1. Status Bits Affected by Affects Repeat Example 1
Syntax if (TC1) execute(AD_unit) mar(*AR1+) AC1 = AC1 + *AR1
Before AC1 TC1 CARRY AR1 200 201 00 0000 4300 1 1 0200 2020 2021 After AC1 TC1 CARRY AR1 200 201 00 0000 6321 1 0 0201 2020 2021

ACOVx, CARRY, C54CM, M40, TCx ACOVx

This instruction cannot be repeated.

Description TC1 is equal to 1, the next instruction is executed (AR1 is incremented by 1). The content of AC1 is added to the content addressed by AR1 + 1 (2021h) and the result is stored in AC1.

SPRU375G

Instruction Set Descriptions

5-161

Execute Conditionally (if execute)

Example 2
Syntax if (TC1) execute(AD_unit) mar(*AR1+) AC1 = AC1 + *AR1
Before AC1 TC1 CARRY AR1 200 201 00 0000 4300 0 1 0200 2020 2021 After AC1 TC1 CARRY AR1 200 201 00 0000 6320 0 0 0200 2020 2021

Description TC1 is not equal to 1, the next instruction is not executed (AR1 is not incremented). The content of AC1 is added to the content addressed by AR1 (2020h) and the result is stored in AC1.

5-162

Instruction Set Descriptions

SPRU375G

Execute Conditionally (if execute)

Execute Conditionally
Syntax Characteristics
No. [2] Syntax if (cond) execute(D_Unit) Parallel Enable Bit No Size 2 Cycles 1 Pipeline X

Opcode

1001 0110 1CCC CCCC 1001 1110 1CCC CCCC 1001 1111 1CCC CCCC The assembler selects the opcode depending on the instruction position in a paralleled pair.

Operands Description

cond This instruction evaluates a single condition defined by the cond field and allows you to control the execution flow of an instruction, or instructions, from the execute phase of the pipeline. This instruction differs from instruction [1] because in this instruction operations performed in the address phase are always executed. See Table 13 for a list of conditions. When this instruction moves into the execute phase of the pipeline, the condition specified in the cond field is evaluated. If the tested condition is true, the conditional instruction(s) is read and executed; if the tested condition is false, the conditional instruction(s) is not read and program control is passed to the instruction following the conditional instruction(s) or to the program address defined by label. There is a 0-cycle latency for the condition testing.
- This instruction may be executed alone:

if(cond) execute(D_unit) instruction_executes_conditionally label:


- This instruction may be executed with two paralleled instructions:

if(cond) execute(D_unit) instruction_1_executes_conditionally || instruction_2_executes_conditionally label:


- This instruction may be executed with the instruction with which it is

paralleled. When this instruction syntax is used and the instruction to be executed conditionally is a store-to-memory instruction, there is a 1-cycle latency for the condition setting.
if(cond) execute(D_unit) || instruction_executes_conditionally label:
SPRU375G Instruction Set Descriptions 5-163

Execute Conditionally (if execute)

- This instruction may be executed with a previous instruction:

previous_instruction || if(cond) execute(D_unit) instruction_executes_conditionally label:


- This instruction may be executed with a previous instruction and two

paralleled instructions:
previous_instruction || if(cond) execute(D_unit) instruction_1_executes_conditionally || instruction_2_executes_conditionally label:

This instruction cannot be used as the last instruction in a repeat loop structure. This instruction cannot control the execution of the following program control instructions:
goto call return trap (cond) goto (cond) call (cond) return localrepeat intr idle reset repeat blockrepeat (cond) execute(AD_unit) (cond) execute(D_unit) while (cond) repeat return_int

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1. Status Bits Affected by Affects Repeat Example 1
Syntax if (TC1) execute(D_unit) mar(*AR1+) AC1 = AC1 + *AR1
Before AC1 TC1 CARRY AR1 200 201 00 0000 4300 1 1 0200 2020 2021 After AC1 TC1 CARRY AR1 200 201 00 0000 6321 1 0 0201 2020 2021

ACOVx, CARRY, C54CM, M40, TCx ACOVx

This instruction cannot be repeated.

Description TC1 is equal to 1, the next instruction is executed (AR1 is incremented by 1). The content of AC1 is added to the content addressed by AR1 + 1 (2021h) and the result is stored in AC1.

5-164

Instruction Set Descriptions

SPRU375G

Execute Conditionally (if execute)

Example 2
Syntax if (TC1) execute(D_unit) mar(*AR1+) AC1 = AC1 + *AR1
Before AC1 TC1 CARRY AR1 200 201 00 0000 4300 0 1 0200 2020 2021

Description TC1 is not equal to 1, the next instruction would not be executed; however, since the next instruction is a pointer modification, AR1 is incremented by 1 in the address phase. The content of AC1 is added to the content addressed by AR1 + 1 (2021h) and the result is stored in AC1.
After AC1 TC1 CARRY AR1 200 201 00 0000 6321 0 0 0201 2020 2021

SPRU375G

Instruction Set Descriptions

5-165

Expand Accumulator Bit Field (field_expand)

Expand Accumulator Bit Field


Syntax Characteristics
No. [1] Syntax dst = field_expand(ACx, k16) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description ACx, dst, k16

0111 0110 kkkk kkkk kkkk kkkk FDDD 01SS

This instruction performs a bit field manipulation in the D-unit shifter. When the destination register (dst) is an A-unit register (ARx or Tx), a dedicated bus carries the output of the D-unit shifter directly into dst. The 16-bit field mask, k16, is scanned from the least significant bits (LSBs) to the most significant bits (MSBs). According to the bit set to 1 in the bit field mask, the 16 LSBs of the source accumulator (ACx) bits are extracted and separated with 0 toward the MSBs. The result is stored in the dst.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Extract Accumulator Bit Field

Example
Syntax T2 = field_expand(AC0,#8024h) Description Each bit of the unsigned 16-bit value (8024h) is scanned from the LSB to the MSB to test for a 1. If the bit is set to 1, the bit in AC0 is extracted and separated with 0 toward the MSB in T2; otherwise, the corresponding bit in AC0 is not extracted. The result is stored in T2.

Execution #k16 (8024h) AC0(150) T2 1000 0000 0010 0100 0010 1011 0110 0101 1000 0000 0000 0100

Before AC0 T2 00 2300 2B65 0000

After AC0 T2 00 2300 2B65 8004

5-166

Instruction Set Descriptions

SPRU375G

Extract Accumulator Bit Field (field_extract)

Extract Accumulator Bit Field


Syntax Characteristics
No. [1] Syntax dst = field_extract(ACx, k16) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description ACx, dst, k16

0111 0110 kkkk kkkk kkkk kkkk FDDD 00SS

This instruction performs a bit field manipulation in the D-unit shifter. When the destination register (dst) is an A-unit register (ARx or Tx), a dedicated bus carries the output of the D-unit shifter directly into dst. The 16-bit field mask, k16, is scanned from the least significant bits (LSBs) to the most significant bits (MSBs). According to the bit set to 1 in the bit field mask, the corresponding 16 LSBs of the source accumulator (ACx) bits are extracted and packed toward the LSBs. The result is stored in the dst.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Expand Accumulator Bit Field

Example
Syntax T2 = field_extract(AC0,#8024h) Description Each bit of the unsigned 16-bit value (8024h) is scanned from the LSB to the MSB to test for a 1. If the bit is set to 1, the corresponding bit in AC0 is extracted and packed toward the LSB in T2; otherwise, the corresponding bit in AC0 is not extracted. The result is stored in T2.

Execution #k16 (8024h) AC0(150) T2 1000 0000 0010 0100 0101 0101 1010 1010 0000 0000 0000 0010

Before AC0 T2 00 2300 55AA 0000

After AC0 T2 00 2300 55AA 0002

SPRU375G

Instruction Set Descriptions

5-167

Finite Impulse Response Filter, Antisymmetrical (firsn)

Finite Impulse Response Filter, Antisymmetrical


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax firsn(Xmem, Ymem, coef(Cmem), ACx, ACy)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

1000 0101 XXXM MMYY YMMM 11mm DDx1 DDU% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel operations: multiply and accumulate (MAC), and subtraction. The firsn() operation is executed:
ACy = ACy + (ACx * Cmem), ACx = (Xmem << #16) (Ymem << #16)

The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of ACx(3216) and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACy.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. The second operation subtracts the content of data memory operand Ymem, shifted left 16 bits, from the content of data memory operand Xmem, shifted left 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. 5-168 Instruction Set Descriptions SPRU375G

Finite Impulse Response Filter, Antisymmetrical (firsn)

- The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat See Also C54CM, FRCT, M40, SATD, SMUL, SXMD ACOVx, ACOVy, CARRY

This instruction can be repeated. See the following other related instructions:
- Finite Impulse Response Filter, Symmetrical

Example
Syntax firsn(*AR0, *AR1, coef(*CDP), AC0, AC1) Description The content of AC0(3216) multiplied by the content addressed by the coefficient data pointer register (CDP) is added to the content of AC1 and the result is stored in AC1. The content addressed by AR1 shifted left by 16 bits is subtracted from the content addressed by AR0 shifted left by 16 bits and the result is stored in AC0.

Before AC0 AC1 *AR0 *AR1 *CDP ACOV0 ACOV1 CARRY FRCT SXMD

00 6900 0000 00 0023 0000 3400 EF00 A067 0 0 0 0 0

After AC0 AC1 *AR0 *AR1 *CDP ACOV0 ACOV1 CARRY FRCT SXMD

00 4500 0000 FF D8ED 3F00 3400 EF00 A067 0 0 0 0 0

SPRU375G

Instruction Set Descriptions

5-169

Finite Impulse Response Filter, Symmetrical (firs)

Finite Impulse Response Filter, Symmetrical


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax firs(Xmem, Ymem, coef(Cmem), ACx, ACy)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

1000 0101 XXXM MMYY YMMM 11mm DDx0 DDU% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel operations: multiply and accumulate (MAC), and addition. The firs() operation is executed:
ACy = ACy + (ACx * Cmem), ACx = (Xmem << #16) + (Ymem << #16)

The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of ACx(3216) and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACy.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. The second operation performs an addition operation between the content of data memory operand Xmem, shifted left 16 bits, and the content of data memory operand Ymem, shifted left 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. 5-170 Instruction Set Descriptions SPRU375G

Finite Impulse Response Filter, Symmetrical (firs)

- The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. - When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat See Also C54CM, FRCT, M40, SATD, SMUL, SXMD ACOVx, ACOVy, CARRY

This instruction can be repeated. See the following other related instructions:
- Finite Impulse Response Filter, Antisymmetrical

Example
Syntax firs(*AR0, *AR1, coef(*CDP), AC0, AC1) Description The content of AC0(3216) multiplied by the content addressed by the coefficient data pointer register (CDP) is added to the content of AC1 and the result is stored in AC1. The content addressed by AR0 shifted left by 16 bits is added to the content addressed by AR1 shifted left by 16 bits and the result is stored in AC0.

Before AC0 AC1 *AR0 *AR1 *CDP ACOV0 ACOV1 CARRY FRCT SXMD

00 6900 0000 00 0023 0000 3400 EF00 A067 0 0 0 0 0

After AC0 AC1 *AR0 *AR1 *CDP ACOV0 ACOV1 CARRY FRCT SXMD

00 2300 0000 FF D8ED 3F00 3400 EF00 A067 0 0 1 0 0

SPRU375G

Instruction Set Descriptions

5-171

idle

Idle
Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax idle

Size 4

Cycles ?

Pipeline D

Opcode Operands Description none

0111 1010 xxxx xxxx xxxx xxxx xxxx 110x

This instruction forces the program being executed to wait until an interrupt or a reset occurs. The power-down mode that the processor operates in depends on a configuration register accessible through the peripheral access mechanism. Affected by Affects INTM none

Status Bits

Repeat

This instruction cannot be repeated.

5-172

Instruction Set Descriptions

SPRU375G

Least Mean Square (lms)

Least Mean Square


Syntax Characteristics
No. [1] Syntax lms(Xmem, Ymem, ACx, ACy) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0110 XXXM MMYY YMMM DDDD 110x xxx% ACx, ACy, Xmem, Ymem This instruction performs two paralleled operations in one cycle: multiply and accumulate (MAC), and addition. The instruction is executed:
ACy = ACy + (Xmem * Ymem), ACx = rnd(ACx + (Xmem << #16))

The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, sign extended to 17 bits, and the content of data memory operand Ymem, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACy.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. The second operation performs an addition between an accumulator content and the content of data memory operand Xmem shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. When an

overflow is detected, the accumulator is saturated according to SATD.


- Rounding is performed according to RDM. SPRU375G Instruction Set Descriptions 5-173

Least Mean Square (lms)

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, the rounding is performed without clearing the 16 lowest bits of ACx. The addition operation has no overflow detection, report, and saturation after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax lms(*AR0, *AR1, AC0, AC1) Description The content addressed by AR0 multiplied by the content addressed by AR1 is added to the content of AC1 and the result is stored in AC1. The content addressed by AR0 shifted left by 16 bits is added to the content of AC0. The result is rounded and stored in AC0.
After AC0 AC1 *AR0 *AR1 ACOV0 ACOV1 CARRY FRCT

C54CM, FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy, CARRY

This instruction can be repeated.

Before AC0 AC1 *AR0 *AR1 ACOV0 ACOV1 CARRY FRCT

00 1111 2222 00 1000 0000 1000 2000 0 0 0 0

00 2111 0000 00 1200 0000 1000 2000 0 0 0 0

5-174

Instruction Set Descriptions

SPRU375G

Linear Addressing Qualifier (linear)

Linear Addressing Qualifier


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax linear()

Size 1

Cycles 1

Pipeline AD

Opcode Operands Description none

1001 1100

This instruction is an instruction qualifier that can be paralleled only with any instruction making an indirect Smem, Xmem, Ymem, Lmem, Baddr, or Cmem addressing. This instruction cannot be executed in parallel with any other types of instructions and it cannot be executed as a stand-alone instruction (assembler generates an error message). When this instruction is used in parallel, all modifications of ARx and CDP pointer registers used in the indirect addressing mode are done linearly (as if ST2_55 register bits 0 to 8 were cleared to 0).

Status Bits

Affected by Affects

none none

Repeat

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-175

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No No No No No No No No

No. [1] [2] [3] [4] [5] [6] [7] [8]

Syntax ACx = rnd(Smem << Tx) ACx = low_byte(Smem) << #SHIFTW ACx = high_byte(Smem) << #SHIFTW ACx = Smem << #16 ACx = uns(Smem) ACx = uns(Smem) << #SHIFTW ACx = M40(dbl(Lmem)) LO(ACx) = Xmem, HI(ACx) = Ymem

Size 3 3 3 2 3 4 3 3

Cycles 1 1 1 1 1 1 1 1

Pipeline X X X X X X X X

Description

This instruction loads a 16-bit signed constant, K16, the content of a memory (Smem) location, the content of a data memory operand (Lmem), or the content of dual data memory operands (Xmem and Ymem) to a selected accumulator (ACx). Affected by Affects C54CM, M40, RDM, SATD, SXMD ACOVx

Status Bits

See Also

See the following other related instructions:


- Load Accumulator from Memory with Parallel Store Accumulator Content

to Memory
- Load Accumulator Pair from Memory - Load Accumulator with Immediate Value - Load Accumulator, Auxiliary, or Temporary Register from Memory - Load Accumulator, Auxiliary, or Temporary Register with Immediate Value - Load Auxiliary or Temporary Register Pair from Memory - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Load Accumulator from Memory 5-176 Instruction Set Descriptions SPRU375G

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax ACx = rnd(Smem << Tx)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Smem, Tx

1101 1101 AAAA AAAI x%DD ss11

This instruction loads the content of a memory (Smem) location shifted by the content of Tx to the accumulator (ACx):
- The input operand is sign extended to 40 bits according to SXMD. - The input operand is shifted by the 4-bit value in the D-unit shifter. The shift

operation is equivalent to the signed shift instruction.


- Rounding is performed in the D-unit shifter according to RDM, if the

optional rnd keyword is applied to the input operand. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, no overflow detection, report, and saturation is done after the shifting operation. The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of Tx define a shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat Example
Syntax AC0 = *AR3 << T0 Description AC0 is loaded with the content addressed by AR3 shifted by the content of T0.

C54CM, M40, RDM, SATD, SXMD ACOVx

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-177

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax ACx = low_byte(Smem) << #SHIFTW

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, SHIFTW, Smem

1110 0001 AAAA AAAI DDSH IFTW

This instruction loads the low-byte content of a memory (Smem) location shifted by the 6-bit value, SHIFTW, to the accumulator (ACx):
- The content of the memory location is sign extended to 40 bits according

to SXMD.
- The input operand is shifted by the 6-bit value in the D-unit shifter. The shift

operation is equivalent to the signed shift instruction.


- In this instruction, Smem cannot reference to a memory-mapped register

(MMR). This instruction cannot access a byte within an MMR. If Smem is an MMR, the DSP sends a hardware bus-error interrupt (BERRINT) request to the CPU. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = low_byte(*AR3) << #31 Description The low-byte content addressed by AR3 is shifted left by 31 bits and loaded into AC0.

C54CM, M40, SATD, SXMD ACOVx

This instruction can be repeated.

5-178

Instruction Set Descriptions

SPRU375G

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax ACx = high_byte(Smem) << #SHIFTW

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, SHIFTW, Smem

1110 0010 AAAA AAAI DDSH IFTW

This instruction loads the high-byte content of a memory (Smem) location shifted by the 6-bit value, SHIFTW, to the accumulator (ACx):
- The content of the memory location is sign extended to 40 bits according

to SXMD.
- The input operand is shifted by the 6-bit value in the D-unit shifter. The shift

operation is equivalent to the signed shift instruction.


- In this instruction, Smem cannot reference to a memory-mapped register

(MMR). This instruction cannot access a byte within an MMR. If Smem is an MMR, the DSP sends a hardware bus-error interrupt (BERRINT) request to the CPU. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = high_byte(*AR3) << #31 Description The high-byte content addressed by AR3 is shifted left by 31 bits and loaded into AC0.

C54CM, M40, SATD, SXMD ACOVx

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-179

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax ACx = Smem << #16

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, Smem

1011 00DD AAAA AAAI

This instruction loads the content of a memory (Smem) location shifted left by 16 bits to the accumulator (ACx):
- The input operand is sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - The input operand is shifted left by 16 bits according to M40.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, overflow detection, report, and saturation is done after the shifting operation Status Bits Affected by Affects Repeat Example
Syntax AC1 = *AR3+ << #16 Description The content addressed by AR3 shifted left by 16 bits is loaded into AC1. AR3 is incremented by 1.
After 00 0200 FC00 0200 3400 AC1 AR3 200 00 3400 0000 0201 3400

C54CM, M40, SATD, SXMD ACOVx

This instruction can be repeated.

Before AC1 AR3 200

5-180

Instruction Set Descriptions

SPRU375G

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [5]

Syntax ACx = uns(Smem)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Smem

1101 1111 AAAA AAAI xxDD 010u

This instruction loads the content of a memory (Smem) location to the accumulator (ACx):
- The memory operand is extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

- The load operation in the accumulator uses a dedicated path independent

of the D-unit ALU, the D-unit shifter, and the D-unit MACs. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = uns(*AR3) Description The content addressed by AR3 is zero extended to 40 bits and loaded into AC0.

SXMD none

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-181

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [6]

Syntax ACx = uns(Smem) << #SHIFTW

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

1111 1001 AAAA AAAI uxSH IFTW xxDD 10xx ACx, SHIFTW, Smem This instruction loads the content of a memory (Smem) location, shifted by the 6-bit value, SHIFTW, to the accumulator (ACx):
- The memory operand is extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

- The input operand is shifted by the 6-bit value in the D-unit shifter. The shift

operation is equivalent to the signed shift instruction. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat C54CM, M40, SATD, SXMD ACOVx

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax AC0 = uns(*AR3) << #31 Description The content addressed by AR3 is zero extended to 40 bits, shifted left by 31 bits, and loaded into AC0.

5-182

Instruction Set Descriptions

SPRU375G

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [7]

Syntax ACx = M40(dbl(Lmem))

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Lmem

1110 1101 AAAA AAAI xxDD 100g

This instruction loads the content of data memory operand (Lmem) to the accumulator (ACx):
- The input operand is sign extended to 40 bits according to SXMD. - The load operation in the accumulator uses a dedicated path independent

of the D-unit ALU, the D-unit shifter, and the D-unit MACs.
- Status bit M40 is locally set to 1, if the optional M40 keyword is applied to

the input operand. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = dbl(*AR3) Description The content (long word) addressed by AR3 and AR3 + 1 is loaded into AC0. Because this instruction is a longoperand instruction, AR3 is decremented by 2 after the execution.

M40, SATD, SXMD ACOVx

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-183

Load Accumulator from Memory

Load Accumulator from Memory


Syntax Characteristics
No. [8] Syntax LO(ACx) = Xmem, HI(ACx) = Ymem Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Xmem, Ymem

1000 0001 XXXM MMYY YMMM 10DD

This instruction performs a dual 16-bit load of accumulator high and low parts. The operation is executed in dual 16-bit mode; however, it is independent of the 40-bit D-unit ALU. The 16 lower bits of the accumulator are separated from the higher 24 bits and the 8 guard bits are attached to the higher 16-bit datapath.
- The data memory operand Xmem is loaded as a 16-bit operand to the

destination accumulator (ACx) low part. And, according to SXMD the data memory operand Ymem is sign extended to 24 bits and is loaded to the destination accumulator (ACx) high part.
- For the load operations in higher accumulator bits, overflow detection is

performed at bit position 31. If an overflow is detected, the destination accumulator overflow status bit (ACOVx) is set.
- If SATD is 1 when an overflow is detected on the higher data path, a

saturation is performed with saturation value of 00 7FFFh. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, this instruction is executed as if SATD was locally cleared to 0. Status Bits Affected by Affects Repeat Example
Syntax LO(AC0) = *AR3, HI(AC0) = *AR4 Description The content at the location addressed by AR4, sign extended to 24 bits, is loaded into AC0(3916) and the content at the location addressed by AR3 is loaded into AC0(150).

C54CM, M40, SATD, SXMD ACOVx

This instruction can be repeated.

5-184

Instruction Set Descriptions

SPRU375G

Load Accumulator from Memory with Parallel Store Accumulator Content to Memory

Load Accumulator from Memory with Parallel Store Accumulator Content to Memory
Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax ACy = Xmem << #16, Ymem = HI(ACx << T2)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

1000 0111 XXXM MMYY YMMM SSDD 110x xxxx ACx, ACy, T2, Xmem, Ymem This instruction performs two operations in parallel: load and store. The first operation loads the content of data memory operand Xmem shifted left by 16 bits to the accumulator ACy.
- The input operand is sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - The input operand is shifted left by 16 bits according to M40.

The second operation shifts the accumulator ACx by the content of T2 and stores ACx(3116) to data memory operand Ymem. If the 16-bit value in T2 is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value.
- The input operand is shifted in the D-unit shifter according to SXMD. - After the shift, the high part of the accumulator, ACx(3116), is stored to

the memory location. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When this instruction is executed with C54CM = 1, the 6 LSBs of T2 are used to determine the shift quantity. The 6 LSBs of T2 define a shift quantity within 32 to +31. When the 16-bit value in T2 is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat
SPRU375G

C54CM, M40, SATD, SXMD ACOVy

This instruction can be repeated.


Instruction Set Descriptions 5-185

Load Accumulator from Memory with Parallel Store Accumulator Content to Memory

See Also

See the following other related instructions:


- Load Accumulator from Memory - Load Accumulator Pair from Memory - Load Accumulator with Immediate Value - Load Accumulator, Auxiliary, or Temporary Register from Memory - Load Accumulator, Auxiliary, or Temporary Register with Immediate Value

Example
Syntax AC0 = *AR3 << #16, *AR4 = HI(AC1 << T2) Description Both instructions are performed in parallel. The content addressed by AR3 shifted left by 16 bits is stored in AC0. The content of AC1 is shifted by the content of T2, and AC1(3116) is stored at the address of AR4.

5-186

Instruction Set Descriptions

SPRU375G

Load Accumulator from Memory

Load Accumulator Pair from Memory


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax pair(HI(ACx)) = Lmem pair(LO(ACx)) = Lmem

Size 3 3

Cycles 1 1

Pipeline X X

Description

This instruction loads the content of a data memory operand (Lmem) to the selected accumulator pair, ACx and AC(x + 1). Affected by Affects C54CM, M40, SATD, SXMD ACOVx, ACOV(x + 1)

Status Bits

See Also

See the following other related instructions:


- Load Accumulator from Memory - Load Accumulator from Memory with Parallel Store Accumulator Content

to Memory
- Load Accumulator with Immediate Value - Load Accumulator, Auxiliary, or Temporary Register from Memory - Load Accumulator, Auxiliary, or Temporary Register with Immediate Value - Load Auxiliary or Temporary Register Pair from Memory - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Load Accumulator from Memory

SPRU375G

Instruction Set Descriptions

5-187

Load Accumulator Pair from Memory

Load Accumulator Pair from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax pair(HI(ACx)) = Lmem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Lmem

1110 1101 AAAA AAAI xxDD 101x

This instruction loads the 16 highest bits of data memory operand (Lmem) to the 16 highest bits of the accumulator (ACx) and loads the 16 lowest bits of data memory operand (Lmem) to the 16 highest bits of accumulator AC(x + 1):
- The load operation in the accumulator uses a dedicated path independent

of the D-unit ALU, the D-unit shifter, and the D-unit MACs.
- Valid accumulators are AC0 and AC2.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, overflow detection, report, and saturation is done after the operation. Status Bits Affected by Affects Repeat Example
Syntax pair(HI(AC2)) = *AR3+ Description The 16 highest bits of the content at the location addressed by AR3 are loaded into AC2(3116) and the 16 lowest bits of the content at the location addressed by AR3 + 1 are loaded into AC3(3116). AR3 is incremented by 1.
After 00 0200 FC00 00 0000 0000 0200 3400 0FD3 AC2 AC3 AR3 200 201 00 3400 0000 00 0FD3 0000 0201 3400 0FD3

C54CM, M40, SATD, SXMD ACOVx, ACOV(x + 1)

This instruction can be repeated.

Before AC2 AC3 AR3 200 201

5-188

Instruction Set Descriptions

SPRU375G

Load Accumulator Pair from Memory

Load Accumulator Pair from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax pair(LO(ACx)) = Lmem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Lmem

1110 1101 AAAA AAAI xxDD 110x

This instruction loads the 16 highest bits of data memory operand (Lmem) to the 16 lowest bits of the accumulator (ACx) and loads the 16 lowest bits of data memory operand (Lmem) to the 16 lowest bits of accumulator AC(x + 1):
- The load operation in the accumulator uses a dedicated path independent

of the D-unit ALU, the D-unit shifter, and the D-unit MACs.
- Valid accumulators are AC0 and AC2.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured Status Bits Affected by Affects Repeat Example
Syntax pair(LO(AC0)) = *AR3 Description The 16 highest bits of the content at the location addressed by AR3 are loaded into AC0(150) and the 16 lowest bits of the content at the location addressed by AR3 + 1 are loaded into AC1(150).

M40, SXMD none

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-189

Load Accumulator with Immediate Value

Load Accumulator with Immediate Value


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax ACx = K16 << #16 ACx = K16 << #SHFT

Size 4 4

Cycles 1 1

Pipeline X X

Description

This instruction loads a 16-bit signed constant, K16, to a selected accumulator (ACx). Affected by Affects C54CM, M40, SATD, SXMD ACOVx

Status Bits

See Also

See the following other related instructions:


- Load Accumulator from Memory - Load Accumulator from Memory with Parallel Store Accumulator Content

to Memory
- Load Accumulator Pair from Memory - Load Accumulator, Auxiliary, or Temporary Register from Memory - Load Accumulator, Auxiliary, or Temporary Register with Immediate Value - Load Auxiliary or Temporary Register Pair from Memory - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Load Accumulator from Memory

5-190

Instruction Set Descriptions

SPRU375G

Load Accumulator with Immediate Value

Load Accumulator with Immediate Value


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax ACx = K16 << #16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description ACx, K16

0111 1010 KKKK KKKK KKKK KKKK xxDD 101x

This instruction loads the 16-bit signed constant, K16, shifted left by 16 bits to the accumulator (ACx):
- The 16-bit constant, K16, is sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - The input operand is shifted left by 16 bits according to M40.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = #2 << #16 Description AC0 is loaded with the signed 16-bit value (2) shifted left by 16 bits.

C54CM, M40, SATD, SXMD ACOVx

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-191

Load Accumulator with Immediate Value

Load Accumulator with Immediate Value


Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax ACx = K16 << #SHFT

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

0111 0101 KKKK KKKK KKKK KKKK xxDD SHFT ACx, K16, SHFT This instruction loads the 16-bit signed constant, K16, shifted left by the 4-bit value, SHFT, to the accumulator (ACx):
- The 16-bit constant, K16, is sign extended to 40 bits according to SXMD. - The input operand is shifted by the 4-bit value in the D-unit shifter. The shift

operation is equivalent to the signed shift instruction. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = #2 << #15 Description AC0 is loaded with the signed 16-bit value (2) shifted left by 15 bits.

C54CM, M40, SXMD none

This instruction can be repeated.

5-192

Instruction Set Descriptions

SPRU375G

Load Accumulator, Auxiliary, or Temporary Register from Memory

Load Accumulator, Auxiliary, or Temporary Register from Memory


Syntax Characteristics
Parallel Enable Bit No No No

No. [1] [2] [3]

Syntax dst = Smem dst = uns(high_byte(Smem)) dst = uns(low_byte(Smem))

Size 2 3 3

Cycles 1 1 1

Pipeline X X X

Description

This instruction loads the content of a memory (Smem) location to a selected destination (dst) register. Affected by Affects M40, SXMD none

Status Bits

See Also

See the following other related instructions:


- Load Accumulator from Memory - Load Accumulator from Memory with Parallel Store Accumulator Content

to Memory
- Load Accumulator Pair from Memory - Load Accumulator with Immediate Value - Load Accumulator, Auxiliary, or Temporary Register with Immediate Value - Load Auxiliary or Temporary Register Pair from Memory - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Load Accumulator from Memory - Store Accumulator, Auxiliary, or Temporary Register Content to Memory

SPRU375G

Instruction Set Descriptions

5-193

Load Accumulator, Auxiliary, or Temporary Register from Memory

Load Accumulator, Auxiliary, or Temporary Register from Memory


Syntax Characteristics
No. [1] Syntax dst = Smem Parallel Enable Bit No Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst, Smem

1010 FDDD AAAA AAAI

This instruction loads the content of a memory (Smem) location to the destination (dst) register.
- When the destination register is an accumulator: J J

The content of the memory location is sign extended to 40 bits according to SXMD. The load operation in the destination register uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.

- When the destination register is an auxiliary or temporary register: J J

The content of the memory location is sign extended to 16 bits. The load operation in the destination register uses a dedicated path independent of the A-unit ALU.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AR1 = *AR3+
Before AR1 AR3 200 FC00 0200 3400

M40, SXMD none

This instruction can be repeated.

Description AR1 is loaded with the content addressed by AR3. AR3 is incremented by 1.
After AR1 AR3 200 3400 0201 3400

5-194

Instruction Set Descriptions

SPRU375G

Load Accumulator, Auxiliary, or Temporary Register from Memory

Load Accumulator, Auxiliary, or Temporary Register from Memory


Syntax Characteristics
No. [2] Syntax dst = uns(high_byte(Smem)) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description dst, Smem

1101 1111 AAAA AAAI FDDD 000u

This instruction loads the high-byte content of a memory (Smem) location to the destination (dst) register.
- When the destination register is an accumulator: J

The memory operand is extended to 40 bits according to uns. H H If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

The load operation in the destination register uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.

- When the destination register is an auxiliary or temporary register: J

The memory operand is extended to 16 bits according to uns. H H If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 16 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 16 bits regardless of SXMD.

The load operation in the destination register uses a dedicated path independent of the A-unit ALU.

- In this instruction, Smem cannot reference to a memory-mapped register

(MMR). This instruction cannot access a byte within an MMR. If Smem is an MMR, the DSP sends a hardware bus-error interrupt (BERRINT) request to the CPU. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
SPRU375G Instruction Set Descriptions 5-195

Load Accumulator, Auxiliary, or Temporary Register from Memory

Status Bits

Affected by Affects

M40, SXMD none

Repeat Example
Syntax

This instruction can be repeated.

Description The high-byte content addressed by AR3 is zero extended to 40 bits and loaded into AC0.

AC0 = uns(high_byte(*AR3))

5-196

Instruction Set Descriptions

SPRU375G

Load Accumulator, Auxiliary, or Temporary Register from Memory

Load Accumulator, Auxiliary, or Temporary Register from Memory


Syntax Characteristics
No. [3] Syntax dst = uns(low_byte(Smem)) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description dst, Smem

1101 1111 AAAA AAAI FDDD 001u

This instruction loads the low-byte content of a memory (Smem) location to the destination (dst) register.
- When the destination register is an accumulator: J

The memory operand is extended to 40 bits according to uns. H H If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

The load operation in the destination register uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.

- When the destination register is an auxiliary or temporary register: J

The memory operand is extended to 16 bits according to uns. H H If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 16 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 16 bits regardless of SXMD.

The load operation in the destination register uses a dedicated path independent of the A-unit ALU.

- In this instruction, Smem cannot reference to a memory-mapped register

(MMR). This instruction cannot access a byte within an MMR. If Smem is an MMR, the DSP sends a hardware bus-error interrupt (BERRINT) request to the CPU. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
SPRU375G Instruction Set Descriptions 5-197

Load Accumulator, Auxiliary, or Temporary Register from Memory

Status Bits

Affected by Affects

M40, SXMD none

Repeat Example
Syntax

This instruction can be repeated.

Description The low-byte content addressed by AR3 is zero extended to 40 bits and loaded into AC0.

AC0 = uns(low_byte(*AR3))

5-198

Instruction Set Descriptions

SPRU375G

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value


Syntax Characteristics
Parallel Enable Bit Yes Yes No

No. [1] [2] [3]

Syntax dst = k4 dst = k4 dst = K16

Size 2 2 4

Cycles 1 1 1

Pipeline X X X

Description

This instruction loads a 4-bit unsigned constant, k4; the 2s complement representation of the 4-bit unsigned constant; or a 16-bit signed constant, K16, to a selected destination (dst) register. Affected by Affects M40, SXMD none

Status Bits

See Also

See the following other related instructions:


- Load Accumulator from Memory - Load Accumulator from Memory with Parallel Store Accumulator Content

to Memory
- Load Accumulator Pair from Memory - Load Accumulator with Immediate Value - Load Accumulator, Auxiliary, or Temporary Register from Memory - Load Auxiliary or Temporary Register Pair from Memory - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Load Accumulator from Memory

SPRU375G

Instruction Set Descriptions

5-199

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst = k4

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, k4

0011 110E kkkk FDDD

This instruction loads the 4-bit unsigned constant, k4, to the destination (dst) register.
- When the destination register is an accumulator: J J

The 4-bit constant, k4, is zero extended to 40 bits. The load operation in the destination register uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.

- When the destination register is an auxiliary or temporary register: J J

The 4-bit constant, k4, is zero extended to 16 bits. The load operation in the destination register uses a dedicated path independent of the A-unit ALU.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = #2 Description AC0 is loaded with the unsigned 4-bit value (2).

M40 none

This instruction can be repeated.

5-200

Instruction Set Descriptions

SPRU375G

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value


Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax dst = k4

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, k4

0011 111E kkkk FDDD

This instruction loads the 2s complement representation of the 4-bit unsigned constant, k4, to the destination (dst) register.
- When the destination register is an accumulator: J

The 4-bit constant, k4, is negated in the I-unit, loaded into the accumulator, and sign extended to 40 bits before being processed by the D-unit as a signed constant. The load operation in the destination register uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.

- When the destination register is an auxiliary or temporary register: J J

The 4-bit constant, k4, is zero extended to 16 bits and negated in the I-unit before being processed by the A-unit as a signed K16 constant. The load operation in the destination register uses a dedicated path independent of the A-unit ALU.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = #2 Description AC0 is loaded with a 2s complement representation of the unsigned 4-bit value (2).

M40 none

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-201

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value

Load Accumulator, Auxiliary, or Temporary Register with Immediate Value


Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax dst = K16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description dst, K16

0111 0110 KKKK KKKK KKKK KKKK FDDD 10xx

This instruction loads the 16-bit signed constant, K16, to the destination (dst) register.
- When the destination register is an accumulator, the 16-bit constant, K16,

is sign extended to 40 bits according to SXMD.


- When the destination register is an auxiliary or temporary register, the load

operation in the destination register uses a dedicated path independent of the A-unit ALU. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC1 = #248
Before AC1 00 0200 FC00

M40, SXMD none

This instruction can be repeated.

Description AC1 is loaded with the signed 16-bit value (248).


After AC1 00 0000 00F8

5-202

Instruction Set Descriptions

SPRU375G

Load Auxiliary or Temporary Register Pair from Memory

Load Auxiliary or Temporary Register Pair from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax pair(TAx) = Lmem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Lmem, TAx

1110 1101 AAAA AAAI FDDD 111x

This instruction loads the 16 highest bits of data memory operand (Lmem) to the temporary or auxiliary register (TAx) and loads the 16 lowest bits of data memory operand (Lmem) to temporary or auxiliary register TA(x + 1):
- The load operation in the temporary or auxiliary register uses a dedicated

path independent of the A-unit ALU.


- Valid auxiliary registers are AR0, AR2, AR4, and AR6. - Valid temporary registers are T0 and T2.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat See Also M40 none

This instruction can be repeated. See the following other related instructions:
- Load Accumulator, Auxiliary, or Temporary Register from Memory - Load Accumulator, Auxiliary, or Temporary Register with Immediate Value - Modify Auxiliary or Temporary Register Content

Example
Syntax pair(T0) = *AR2 Description The 16 highest bits of the content at the location addressed by AR2 are loaded into T0 and the 16 lowest bits of the content at the location addressed by AR2 + 1 are loaded into T1.

SPRU375G

Instruction Set Descriptions

5-203

Load CPU Register from Memory

Load CPU Register from Memory


Syntax Characteristics
Parallel Enable Bit No No No No No No No No No No No No No No No No No No No No

No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20]

Syntax BK03 = Smem BK47 = Smem BKC = Smem BSA01 = Smem BSA23 = Smem BSA45 = Smem BSA67 = Smem BSAC = Smem BRC0 = Smem BRC1 = Smem CDP = Smem CSR = Smem DP = Smem DPH = Smem PDP = Smem SP = Smem SSP = Smem TRN0 = Smem TRN1 = Smem RETA = dbl(Lmem)

Size 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3

Cycles 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 5

Pipeline X X X X X X X X X X X X X X X X X X X X

Opcode Operands

See Table 51 (page 5-206). Lmem, Smem

5-204

Instruction Set Descriptions

SPRU375G

Load CPU Register from Memory

Description

Instructions [1] through [19] load the content of a memory (Smem) location to the destination CPU register. This instruction uses a dedicated datapath independent of the A-unit ALU and the D-unit operators to perform the operation. The content of the memory location is zero extended to the bitwidth of the destination CPU register. The operation is performed in the execute phase of the pipeline. There is a 3-cycle latency between PDP, DP, SP, SSP, CDP, BSAx, BKx, BRCx, and CSR loads and their use in the address phase by the A-unit address generator units or by the P-unit loop control management. For instruction [10], when BRC1 is loaded, the block repeat save register (BRS1) is also loaded with the same value. Instruction [20] loads the content of data memory operand (Lmem) to the 24-bit RETA register (the return address of the calling subroutine) and to the 8-bit CFCT register (active control flow execution context flags of the calling subroutine):
- The 16 highest bits of Lmem are loaded into the CFCT register and into

the 8 highest bits of the RETA register.


- The 16 lowest bits of Lmem are loaded into the 16 lowest bits of the RETA

register. When instruction [20] is decoded, the CPU pipeline is flushed and the instruction is executed in 5 cycles, regardless of the instruction context. Status Bits Affected by Affects Repeat none none

Instructions [13] and [20] cannot be repeated; all other instructions can be repeated. See the following other related instructions:
- Load CPU Register with Immediate Value

See Also

SPRU375G

Instruction Set Descriptions

5-205

Load CPU Register from Memory

Table 51. Opcodes for Load CPU Register from Memory Instruction
No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] Syntax BK03 = Smem BK47 = Smem BKC = Smem BSA01 = Smem BSA23 = Smem BSA45 = Smem BSA67 = Smem BSAC = Smem BRC0 = Smem BRC1 = Smem CDP = Smem CSR = Smem DP = Smem DPH = Smem PDP = Smem SP = Smem SSP = Smem TRN0 = Smem TRN1 = Smem RETA = dbl(Lmem) Opcode

1101 1100 AAAA AAAI 1001 xx10 1101 1100 AAAA AAAI 1010 xx10 1101 1100 AAAA AAAI 1011 xx10 1101 1100 AAAA AAAI 0010 xx10 1101 1100 AAAA AAAI 0011 xx10 1101 1100 AAAA AAAI 0100 xx10 1101 1100 AAAA AAAI 0101 xx10 1101 1100 AAAA AAAI 0110 xx10 1101 1100 AAAA AAAI x001 xx11 1101 1100 AAAA AAAI x010 xx11 1101 1100 AAAA AAAI 0001 xx10 1101 1100 AAAA AAAI x000 xx11 1101 1100 AAAA AAAI 0000 xx10 1101 1100 AAAA AAAI 1100 xx10 1101 1100 AAAA AAAI 1111 xx10 1101 1100 AAAA AAAI 0111 xx10 1101 1100 AAAA AAAI 1000 xx10 1101 1100 AAAA AAAI x011 xx11 1101 1100 AAAA AAAI x100 xx11 1110 1101 AAAA AAAI xxxx 011x

5-206

Instruction Set Descriptions

SPRU375G

Load CPU Register with Immediate Value

Load CPU Register with Immediate Value


Syntax Characteristics
No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] Syntax BK03 = k12 BK47 = k12 BKC = k12 BRC0 = k12 BRC1 = k12 CSR = k12 DPH = k7 PDP = k9 BSA01 = k16 BSA23 = k16 BSA45 = k16 BSA67 = k16 BSAC = k16 CDP = k16 DP = k16 SP = k16 SSP = k16 Parallel Enable Bit Yes Yes Yes Yes Yes Yes Yes Yes No No No No No No No No No Size 3 3 3 3 3 3 3 3 4 4 4 4 4 4 4 4 4 Cycles 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Pipeline AD AD AD AD AD AD AD AD AD AD AD AD AD AD AD AD AD

Opcode Operands Description

See Table 52 (page 5-208). kx This instruction loads the unsigned constant, kx, to the destination CPU register. This instruction uses a dedicated datapath independent of the A-unit ALU and the D-unit operators to perform the operation. The constant is zero extended to the bitwidth of the destination CPU register. For instruction [5], when BRC1 is loaded, the block repeat save register (BRS1) is also loaded with the same value. The operation is performed in the address phase of the pipeline.

SPRU375G

Instruction Set Descriptions

5-207

Load CPU Register with Immediate Value

Status Bits

Affected by Affects

none none

Repeat See Also

Instruction [15] cannot be repeated; all other instructions can be repeated. See the following other related instructions:
- Load CPU Register from Memory

Table 52. Opcodes for Load CPU Register with Immediate Value Instruction
No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] Syntax BK03 = k12 BK47 = k12 BKC = k12 BRC0 = k12 BRC1 = k12 CSR = k12 DPH = k7 PDP = k9 BSA01 = k16 BSA23 = k16 BSA45 = k16 BSA67 = k16 BSAC = k16 CDP = k16 DP = k16 SP = k16 SSP = k16 Opcode

0001 011E kkkk kkkk kkkk 0100 0001 011E kkkk kkkk kkkk 0101 0001 011E kkkk kkkk kkkk 0110 0001 011E kkkk kkkk kkkk 1001 0001 011E kkkk kkkk kkkk 1010 0001 011E kkkk kkkk kkkk 1000 0001 011E xxxx xkkk kkkk 0000 0001 011E xxxk kkkk kkkk 0011 0111 1000 kkkk kkkk kkkk kkkk xxx0 011x 0111 1000 kkkk kkkk kkkk kkkk xxx0 100x 0111 1000 kkkk kkkk kkkk kkkk xxx0 101x 0111 1000 kkkk kkkk kkkk kkkk xxx0 110x 0111 1000 kkkk kkkk kkkk kkkk xxx0 111x 0111 1000 kkkk kkkk kkkk kkkk xxx0 010x 0111 1000 kkkk kkkk kkkk kkkk xxx0 000x 0111 1000 kkkk kkkk kkkk kkkk xxx1 000x 0111 1000 kkkk kkkk kkkk kkkk xxx0 001x

5-208

Instruction Set Descriptions

SPRU375G

Load Extended Auxiliary Register from Memory

Load Extended Auxiliary Register from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax XAdst = dbl(Lmem)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Lmem , XAdst

1110 1101 AAAA AAAI XDDD 1111

This instruction loads the lower 23 bits of the data addressed by data memory operand (Lmem) to the 23-bit destination register (XARx, XSP, XSSP, XDP, or XCDP). Affected by Affects none none

Status Bits

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Load Extended Auxiliary Register with Immediate Value - Modify Extended Auxiliary Register Content - Move Extended Auxiliary Register Content - Store Extended Auxiliary Register Content to Memory

Example
Syntax XAR1 = dbl(*AR3) Description The 7 lowest bits of the content at the location addressed by AR3 and the 16 bits of the content at the location addressed by AR3 + 1 are loaded into XAR1.
After 00 0000 0200 3492 0FD3 XAR1 AR3 200 201 12 0FD3 0200 3492 0FD3

Before XAR1 AR3 200 201

SPRU375G

Instruction Set Descriptions

5-209

Load Extended Auxiliary Register with Immediate Value

Load Extended Auxiliary Register with Immediate Value


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax XAdst = k23

Size 6

Cycles 1

Pipeline AD

Opcode Operands Description k23, XAdst

1110 1100 AAAA AAAI 0DDD 1110

This instruction loads a 23-bit unsigned constant (k23) into the 23-bit destination register (XARx, XSP, XSSP, XDP, or XCDP). This operation is completed in the address phase of the pipeline by the A-unit address generator. Data memory is not accessed. The premodification or postmodification of the auxiliary register (ARx), the use of *port(#K), and the use of the readport() or writeport() qualifier is not supported for this instruction. The use of auxiliary register offset operations is supported. If the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management also controls the result stored in XAdst.

Status Bits

Affected by Affects

ST2_55 none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Load Extended Auxiliary Register from Memory - Modify Extended Auxiliary Register Content - Move Extended Auxiliary Register Content - Store Extended Auxiliary Register Content to Memory

Example
Syntax XAR0 = #7FFFFFh Description The 23-bit value (7FFFFFh) is loaded into XAR0.

5-210

Instruction Set Descriptions

SPRU375G

Load Memory with Immediate Value

Load Memory with Immediate Value


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax Smem = K8 Smem = K16

Size 3 4

Cycles 1 1

Pipeline X X

Opcode

K8 K16

1110 0110 AAAA AAAI KKKK KKKK 1111 1011 AAAA AAAI KKKK KKKK KKKK KKKK

Operands Description

Kx, Smem These instructions initialize a data memory location. These instructions store an 8-bit signed constant, K8, or a 16-bit signed constant, K16, to a memory (Smem) location. They use a dedicated datapath to perform the operation. For instruction [1], the immediate value is always signed extended to 16 bits before being stored in memory.

Status Bits

Affected by Affects

none none

Repeat

Instruction [1] can be repeated. Instruction [2] cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated. See the following other related instructions:
- Move Memory to Memory

See Also

Example
Syntax *(#0501h) = #248
Before 0501 FC00

Description The signed 16-bit value (248) is loaded to address 501h.


After 0501 F800

SPRU375G

Instruction Set Descriptions

5-211

Memory Delay (delay)

Memory Delay
Syntax Characteristics
No. [1] Syntax delay(Smem) Parallel Enable Bit No Size 2 Cycles 1 Pipeline X

Opcode Operands Description Smem

1011 0110 AAAA AAAI

This instruction copies the content of the memory (Smem) location into the next higher address (Smem + 1). When the data is copied, the content of the addressed location remains the same. A dedicated datapath is used to make this memory move. When this instruction is executed, the two address register arithmetic units ARAU X and Y, of the A-unit data address generator unit, are used to compute the two addresses Smem and Smem + 1. The address generation is not affected by circular addressing; if Smem points to the end of a circular buffer, Smem + 1 will point to an address outside the circular buffer. The soft dual memory addressing mode mechanism cannot be applied to this instruction. This instruction cannot use the *port(#k16) addressing mode or be paralleled with the readport() or writeport() operand qualifier. This instruction cannot be used for accesses to I/O space. Any illegal access to I/O space generates a hardware bus-error interrupt (BERRINT) to be handled by the CPU.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax delay(*AR1+)

This instruction can be repeated.

Description The content addressed by AR1 is copied to the next higher address, AR1 + 1. AR1 is incremented by 1.
After 0200 3400 0D80 2030 AR1 200 201 202 0201 3400 3400 2030

Before AR1 200 201 202

5-212

Instruction Set Descriptions

SPRU375G

MemoryMapped Register Access Qualifier (mmap)

Memory-Mapped Register Access Qualifier


Syntax Characteristics
No. [1] Syntax mmap() Parallel Enable Bit No Size 1 Cycles 1 Pipeline D

Opcode Operands Description none

1001 1000

This is an operand qualifier that can be paralleled with any instruction making a Smem or Lmem direct memory access (dma). This operand qualifier allows you to locally prevent the dma access from being relative to the data stack pointer (SP) or the local data page register (DP). It forces the dma access to be relative to the memory-mapped register (MMR) data page start address, 00 0000h. This operand qualifier cannot be executed:
- as a stand-alone instruction (assembler generates an error message) - in parallel with instructions not embedding an Smem or Lmem data

memory operand
- in parallel with instructions loading or storing a byte to a register (see Load

Accumulator, Auxiliary, or Temporary Register from Memory instructions [2] and [3]; Load Accumulator from Memory instructions [2] and [3]; and Store Accumulator, Auxiliary, or Temporary Register Content to Memory instructions [2] and [3]) The MMRs are mapped as 16-bit data entities between addresses 0h and 5Fh. The scratch-pad memory that is mapped between addresses 60h and 7Fh of each main data pages of 64K words cannot be accessed through this mechanism. Any instruction using the mmap() modifier cannot be combined with any other user-defined parallelism instruction. Status Bits Affected by Affects Repeat Example
Syntax T2 = @(AC0_L)) || mmap() Description AC0_L is a keyword representing AC0(150). The content of AC0(150) is copied into T2.

none none

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-213

Modify Auxiliary Register Content (mar)

Modify Auxiliary Register Content


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax mar(Smem)

Size 2

Cycles 1

Pipeline AD

Opcode Operands Description Smem

1011 0100 AAAA AAAI

This instruction performs, in the A-unit address generation units, the auxiliary register modification specified by Smem as if a word single data memory operand access was made. The operation is performed in the address phase of the pipeline; however, data memory is not accessed. If the destination register is an auxiliary register and the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management controls the result stored in the destination register. Compatibility with C54x devices (C54CM = 1) In the translated code section, the mar() instruction must be executed with C54CM set to 1. When circular modification is selected for the destination auxiliary register, this instruction modifies the selected destination auxiliary register by using BK03 as the circular buffer size register; BK47 is not used.

Status Bits

Affected by Affects

ST2_55 none

Repeat

This instruction can be repeated.

5-214

Instruction Set Descriptions

SPRU375G

Modify Auxiliary Register Content (mar)

See Also

See the following other related instructions:


- Modify Auxiliary or Temporary Register Content - Modify Auxiliary or Temporary Register Content by Addition - Modify Auxiliary or Temporary Register Content by Subtraction - Modify Auxiliary Register Content with Parallel Multiply - Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Modify Auxiliary Register Content with Parallel Multiply and Subtract - Modify Extended Auxiliary Register Content - Parallel Modify Auxiliary Register Contents

Example
Syntax mar(*AR3+) Description The content of AR3 is incremented by 1.

SPRU375G

Instruction Set Descriptions

5-215

Modify Auxiliary Register Content with Parallel Multiply

Modify Auxiliary Register Content with Parallel Multiply


Syntax Characteristics
No. [1] Syntax mar(Xmem), ACx = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0010 XXXM MMYY YMMM 11mm uuxx DDg% ACx, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: modify auxiliary register (MAR) and multiply. The operations are executed in the two D-unit MACs. The first operation performs an auxiliary register modification. The auxiliary register modification is specified by the content of data memory operand Xmem. The second operation performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD.
5-216 Instruction Set Descriptions SPRU375G

Modify Auxiliary Register Content with Parallel Multiply

- This instruction provides the option to locally set M40 to 1 for the execution

of the instruction, if the optional M40 keyword is applied to the instruction.


- For this instruction, the Cmem operand is accessed through the BB bus;

on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx

Status Bits

Affected by Affects

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content - Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Modify Auxiliary Register Content with Parallel Multiply and Subtract - Multiply

Example
Syntax mar(*AR3+), AC0 = uns(*AR4) * uns(coef(*CDP)) Description Both instructions are performed in parallel. AR3 is incremented by 1. The unsigned content addressed by AR4 is multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-217

Modify Auxiliary Register Content with Parallel Multiply and Accumulate

Modify Auxiliary Register Content with Parallel Multiply and Accumulate


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax mar(Xmem), ACx = M40(rnd(ACx + (uns(Ymem) * uns(coef(Cmem))))) mar(Xmem), ACx = M40(rnd((ACx >> #16) + (uns(Ymem) * uns(coef(Cmem)))))

Size 4 4

Cycles 1 1

Pipeline X X

Description

These instructions perform two parallel operations in one cycle: modify auxiliary register (MAR), and multiply and accumulate (MAC). The operations are executed in the two D-unit MACs. Affected by Affects FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

See Also

See the following other related instructions:


- Modify Auxiliary Register Content - Modify Auxiliary Register Content with Parallel Multiply - Modify Auxiliary Register Content with Parallel Multiply and Subtract - Multiply and Accumulate

5-218

Instruction Set Descriptions

SPRU375G

Modify Auxiliary Register Content with Parallel Multiply and Accumulate

Modify Auxiliary Register Content with Parallel Multiply and Accumulate


Syntax Characteristics
No. [1] Syntax mar(Xmem), ACx = M40(rnd(ACx + (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0011 XXXM MMYY YMMM 11mm uuxx DDg% ACx, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: modify auxiliary register (MAR), and multiply and accumulate (MAC). The operations are executed in the two D-unit MACs. The first operation performs an auxiliary register modification. The auxiliary register modification is specified by the content of data memory operand Xmem. The second operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


SPRU375G Instruction Set Descriptions 5-219

Modify Auxiliary Register Content with Parallel Multiply and Accumulate

- When an overflow is detected, the accumulator is saturated according to

SATD.
- This instruction provides the option to locally set M40 to 1 for the execution

of the instruction, if the optional M40 keyword is applied to the instruction.


- For this instruction, the Cmem operand is accessed through the BB bus;

on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx

Status Bits

Affected by Affects

Repeat Example
Syntax

This instruction can be repeated.

Description

mar(*AR3+), Both instructions are performed in parallel. AR3 is incremented AC0 = AC0 + (uns(*AR4) * uns(coef(*CDP))) by 1. The unsigned content addressed by AR4 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is added to the content of AC0 and the result is stored in AC0.

5-220

Instruction Set Descriptions

SPRU375G

Modify Auxiliary Register Content with Parallel Multiply and Accumulate

Modify Auxiliary Register Content with Parallel Multiply and Accumulate


Syntax Characteristics
No. [2] Syntax mar(Xmem), ACx = M40(rnd((ACx >> #16) + (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0100 XXXM MMYY YMMM 01mm uuxx DDg% ACx, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: modify auxiliary register (MAR), and multiply and accumulate (MAC). The operations are executed in the two D-unit MACs. The first operation performs an auxiliary register modification. The auxiliary register modification is specified by the content of data memory operand Xmem. The second operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx shifted right by 16 bits. The shifting operation is performed with a sign extension of source accumulator ACx(39).
- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


SPRU375G Instruction Set Descriptions 5-221

Modify Auxiliary Register Content with Parallel Multiply and Accumulate

- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx

Status Bits

Affected by Affects

Repeat Example
Syntax

This instruction can be repeated.

Description Both instructions are performed in parallel. AR2 is incremented by 1. The unsigned content addressed by AR1 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is added to the content of AC0 shifted right by 16 bits and the result is stored in AC0. An overflow is detected in AC0.

mar(*AR2+), AC0 = ((AC0 >> #16) + (uns(*AR1) * uns(coef(*CDP))))

Before AC0 AC1 *AR1 AR2 *CDP ACOV0 ACOV1 CARRY M40 FRCT SATD

00 6900 0000 00 0023 0000 EF00 0201 A067 0 0 0 0 0 0

After AC0 AC1 *AR1 AR2 *CDP ACOV0 ACOV1 CARRY M40 FRCT SATD

00 95C0 9200 00 0023 0000 EF00 0202 A067 1 0 0 0 0 0

5-222

Instruction Set Descriptions

SPRU375G

Modify Auxiliary Register Content with Parallel Multiply and Subtract

Modify Auxiliary Register Content with Parallel Multiply and Subtract


Syntax Characteristics
No. [1] Syntax mar(Xmem), ACx = M40(rnd(ACx (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0101 XXXM MMYY YMMM 00mm uuxx DDg% ACx, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: modify auxiliary register (MAR), and multiply and subtract (MAS). The operations are executed in the two D-unit MACs. The first operation performs an auxiliary register modification. The auxiliary register modification is specified by the content of data memory operand Xmem. The second operation performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


SPRU375G Instruction Set Descriptions 5-223

Modify Auxiliary Register Content with Parallel Multiply and Subtract

- When an overflow is detected, the accumulator is saturated according to

SATD.
- This instruction provides the option to locally set M40 to 1 for the execution

of the instruction, if the optional M40 keyword is applied to the instruction.


- For this instruction, the Cmem operand is accessed through the BB bus;

on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx

Status Bits

Affected by Affects

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content - Modify Auxiliary Register Content with Parallel Multiply - Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Multiply and Subtract

Example
Syntax mar(*AR3+), AC0 = AC0 (uns(*AR4) * uns(coef(*CDP))) Description Both instructions are performed in parallel. AR3 is incremented by 1. The unsigned content addressed by AR4 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is subtracted from the content of AC0 and the result is stored in AC0.

5-224

Instruction Set Descriptions

SPRU375G

Modify Auxiliary or Temporary Register Content (mar)

Modify Auxiliary or Temporary Register Content


Syntax Characteristics
Parallel Enable Bit No No No

No. [1] [2] [3]

Syntax mar(TAy = TAx) mar(TAx = P8) mar(TAx = D16)

Size 3 3 4

Cycles 1 1 1

Pipeline AD AD AD

Description

These instructions perform, in the A-unit address generation units:


- a move from auxiliary or temporary register TAx to auxiliary or temporary

register TAy
- a load in the auxiliary or temporary registers TAx of a program address

defined by a program address label assembled into P8


- a load in the auxiliary or temporary registers TAx of the absolute data

address signed constant D16 The operation is performed in the address phase of the pipeline, however data memory is not accessed. Status Bits Affected by Affects See Also none none

See the following other related instructions:


- Load Auxiliary or Temporary Register from Memory - Modify Auxiliary Register Content - Modify Auxiliary or Temporary Register Content by Addition - Modify Auxiliary or Temporary Register Content by Subtraction - Modify Extended Auxiliary Register Content

SPRU375G

Instruction Set Descriptions

5-225

Modify Auxiliary or Temporary Register Content (mar)

Modify Auxiliary or Temporary Register Content


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax mar(TAy = TAx)

Size 3

Cycles 1

Pipeline AD

Opcode

0001 010E FSSS xxxx FDDD 0001 0001 010E FSSS xxxx FDDD 1001 The assembler selects the opcode depending on the instruction position in a paralleled pair.

Operands Description

TAx, TAy This instruction performs, in the A-unit address generation units, a move from the auxiliary or temporary register TAx to auxiliary or temporary register TAy. The operation is performed in the address phase of the pipeline; however, data memory is not accessed. Affected by Affects none none

Status Bits

Repeat Example 1
Syntax mar(AR0 = AR1)

This instruction can be repeated.

Description The content of AR1 is copied to AR0.

Example 2
Syntax mar(T0 = T1) Description The content of T1 is copied to T0.

5-226

Instruction Set Descriptions

SPRU375G

Modify Auxiliary or Temporary Register Content (mar)

Modify Auxiliary or Temporary Register Content


Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax mar(TAx = P8)

Size 3

Cycles 1

Pipeline AD

Opcode

0001 010E PPPP PPPP FDDD 0101 0001 010E PPPP PPPP FDDD 1101 The assembler selects the opcode depending on the instruction position in a paralleled pair.

Operands Description

TAx, P8 This instruction performs, in the A-unit address generation units, a load in the auxiliary or temporary registers TAx of a program address defined by a program address label assembled into P8. The operation is performed in the address phase of the pipeline; however, data memory is not accessed. Affected by Affects none none

Status Bits

Repeat Example 1
Syntax mar(AR0 = #255)

This instruction can be repeated.

Description The unsigned 8-bit value (255) is copied to AR0.

Example 2
Syntax mar(T0 = #255) Description The unsigned 8-bit value (255) is copied to T0.

SPRU375G

Instruction Set Descriptions

5-227

Modify Auxiliary or Temporary Register Content (mar)

Modify Auxiliary or Temporary Register Content


Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax mar(TAx = D16)

Size 4

Cycles 1

Pipeline AD

Opcode Operands Description TAx, D16

0111 0111 DDDD DDDD DDDD DDDD FDDD xxxx

This instruction performs, in the A-unit address generation units, a load in the auxiliary or temporary registers TAx of the absolute data address signed constant D16. The operation is performed in the address phase of the pipeline; however, data memory is not accessed. Affected by Affects none none

Status Bits

Repeat Example
Syntax mar(T1 = #FFFFh)

This instruction can be repeated.

Description The address FFFFh is copied to T1.

5-228

Instruction Set Descriptions

SPRU375G

Modify Auxiliary or Temporary Register Content by Addition (mar)

Modify Auxiliary or Temporary Register Content by Addition


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax mar(TAy + TAx) mar(TAx + P8)

Size 3 3

Cycles 1 1

Pipeline AD AD

Description

These instructions perform, in the A-unit address generation units:


- an addition between two auxiliary or temporary registers, TAx and TAy,

and stores the result in TAy


- an addition between the auxiliary or temporary registers TAx and a

program address defined by a program address label assembled into unsigned P8, and stores the result in TAx The operation is performed in the address phase of the pipeline, however data memory is not accessed. If the destination register is an auxiliary register and the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management controls the result stored in the destination register. Status Bits Affected by Affects See Also ST2_55 none

See the following other related instructions:


- Modify Auxiliary Register Content - Modify Auxiliary or Temporary Register Content - Modify Auxiliary or Temporary Register Content by Subtraction - Modify Extended Auxiliary Register Content

SPRU375G

Instruction Set Descriptions

5-229

Modify Auxiliary or Temporary Register Content by Addition (mar)

Modify Auxiliary or Temporary Register Content by Addition


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax mar(TAy + TAx)

Size 3

Cycles 1

Pipeline AD

Opcode

0001 010E FSSS xxxx FDDD 0000 0001 010E FSSS xxxx FDDD 1000 The assembler selects the opcode depending on the instruction position in a paralleled pair.

Operands Description

TAx, TAy This instruction performs, in the A-unit address generation units, an addition between two auxiliary or temporary registers, TAy and TAx, and stores the result in TAy. The content of TAx is considered signed. The operation is performed in the address phase of the pipeline; however, data memory is not accessed. If the destination register is an auxiliary register and the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management controls the result stored in the destination register. Compatibility with C54x devices (C54CM = 1) In the translated code section, the mar() instruction must be executed with C54CM set to 1. When circular modification is selected for the destination auxiliary register, this instruction modifies the selected destination auxiliary register by using BK03 as the circular buffer size register; BK47 is not used.

Status Bits

Affected by Affects

ST2_55 none

Repeat

This instruction can be repeated.

5-230

Instruction Set Descriptions

SPRU375G

Modify Auxiliary or Temporary Register Content by Addition (mar)

Example 1
Syntax mar(AR0 + T0)
Before XAR0 T0

Description The content of AR0 is added to the signed content of T0 and the result is stored in AR0.
After XAR0 T0

01 0000 8000

01 8000 8000

Example 2
Syntax mar(T0 + T1) Description The content of T0 is added to the content of T1 and the result is stored in T0.

SPRU375G

Instruction Set Descriptions

5-231

Modify Auxiliary or Temporary Register Content by Addition (mar)

Modify Auxiliary or Temporary Register Content by Addition


Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax mar(TAx + P8)

Size 3

Cycles 1

Pipeline AD

Opcode

0001 010E PPPP PPPP FDDD 0100 0001 010E PPPP PPPP FDDD 1100 The assembler selects the opcode depending on the instruction position in a paralleled pair.

Operands Description

TAx, P8 This instruction performs, in the A-unit address generation units, an addition between the auxiliary or temporary register TAx and a program address defined by a program address label assembled into unsigned P8, and stores the result in TAx. The operation is performed in the address phase of the pipeline; however, data memory is not accessed. If the destination register is an auxiliary register and the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management controls the result stored in the destination register. Compatibility with C54x devices (C54CM = 1) In the translated code section, the mar() instruction must be executed with C54CM set to 1. When circular modification is selected for the destination auxiliary register, this instruction modifies the selected destination auxiliary register by using BK03 as the circular buffer size register; BK47 is not used.

Status Bits

Affected by Affects

ST2_55 none

Repeat Example
Syntax mar(T0 + #255)

This instruction can be repeated.

Description The unsigned 8-bit value (255) is added to the content of T0 and the result is stored in T0.

5-232

Instruction Set Descriptions

SPRU375G

Modify Auxiliary or Temporary Register Content by Subtraction (mar)

Modify Auxiliary or Temporary Register Content by Subtraction


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax mar(TAy TAx) mar(TAx P8)

Size 3 3

Cycles 1 1

Pipeline AD AD

Description

These instructions perform, in the A-unit address generation units:


- a subtraction between two auxiliary or temporary registers, TAy and TAx,

and stores the result in TAy


- a subtraction between the auxiliary or temporary registers TAx and a

program address defined by a program address label assembled into unsigned P8, and stores the result in TAx The operation is performed in the address phase of the pipeline, however data memory is not accessed. If the destination register is an auxiliary register and the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management controls the result stored in the destination register. Status Bits Affected by Affects See Also ST2_55 none

See the following other related instructions:


- Modify Auxiliary Register Content - Modify Auxiliary or Temporary Register Content - Modify Auxiliary or Temporary Register Content by Addition - Modify Extended Auxiliary Register Content

SPRU375G

Instruction Set Descriptions

5-233

Modify Auxiliary or Temporary Register Content by Subtraction (mar)

Modify Auxiliary or Temporary Register Content by Subtraction


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax mar(TAy TAx)

Size 3

Cycles 1

Pipeline AD

Opcode

0001 010E FSSS xxxx FDDD 0010 0001 010E FSSS xxxx FDDD 1010 The assembler selects the opcode depending on the instruction position in a paralleled pair.

Operands Description

TAx, TAy This instruction performs, in the A-unit address generation units, a subtraction between two auxiliary or temporary registers, TAy and TAx, and stores the result in TAy. The content of TAx is considered signed. The operation is performed in the address phase of the pipeline; however, data memory is not accessed. If the destination register is an auxiliary register and the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management controls the result stored in the destination register. Compatibility with C54x devices (C54CM = 1) In the translated code section, the mar() instruction must be executed with C54CM set to 1. When circular modification is selected for the destination auxiliary register, this instruction modifies the selected destination auxiliary register by using BK03 as the circular buffer size register; BK47 is not used.

Status Bits

Affected by Affects

ST2_55 none

Repeat

This instruction can be repeated.

5-234

Instruction Set Descriptions

SPRU375G

Modify Auxiliary or Temporary Register Content by Subtraction (mar)

Example 1
Syntax mar(AR0 T0) Description The signed content of T0 is subtracted from the content of AR0 and the result is stored in AR0.
After XAR0 T0

Before XAR0 T0

01 8000 8000

01 0000 8000

Example 2
Syntax mar(T0 T1) Description The content of T1 is subtracted from the content of T0 and the result is stored in T0.

SPRU375G

Instruction Set Descriptions

5-235

Modify Auxiliary or Temporary Register Content by Subtraction (mar)

Modify Auxiliary or Temporary Register Content by Subtraction


Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax mar(TAx P8)

Size 3

Cycles 1

Pipeline AD

Opcode

0001 010E PPPP PPPP FDDD 0110 0001 010E PPPP PPPP FDDD 1110 The assembler selects the opcode depending on the instruction position in a paralleled pair.

Operands Description

TAx, P8 This instruction performs, in the A-unit address generation units, a subtraction between the auxiliary or temporary register TAx and a program address defined by a program address label assembled into unsigned P8, and stores the result in TAx. The operation is performed in the address phase of the pipeline; however, data memory is not accessed. If the destination register is an auxiliary register and the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management controls the result stored in the destination register. Compatibility with C54x devices (C54CM = 1) In the translated code section, the mar() instruction must be executed with C54CM set to 1. When circular modification is selected for the destination auxiliary register, this instruction modifies the selected destination auxiliary register by using BK03 as the circular buffer size register; BK47 is not used.

Status Bits

Affected by Affects

ST2_55 none

Repeat Example
Syntax mar(AR0 #255)

This instruction can be repeated.

Description The unsigned 8-bit value (255) is subtracted from the signed content of AR0 and the result is stored in AR0.

5-236

Instruction Set Descriptions

SPRU375G

Modify Data Stack Pointer

Modify Data Stack Pointer


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax SP = SP + K8

Size 2

Cycles 1

Pipeline AD

Opcode Operands Description K8

0100 111E KKKK KKKK

This instruction performs an addition in the A-unit data-address generation unit (DAGEN) in the address phase of the pipeline. The 8-bit signed constant, K8, is sign extended to 16 bits and added to the data stack pointer (SP). When in 32-bit stack configuration, the system stack pointer (SSP) is also modified. Updates of the SP and SSP (depending on the stack configuration) should not be executed in parallel with this instruction. Affected by Affects none none

Status Bits

Repeat Example
Syntax SP = SP + #127

This instruction can be repeated.

Description The 8-bit value (127) is sign extended to 16 bits and added to the stack pointer (SP).

SPRU375G

Instruction Set Descriptions

5-237

Modify Extended Auxiliary Register Content (mar)

Modify Extended Auxiliary Register Content


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax XAdst = mar(Smem)

Size 3

Cycles 1

Pipeline AD

Opcode Operands Description Smem, XAdst

1110 1100 AAAA AAAI XDDD 1110

This instruction computes the effective address specified by the Smem operand field and modifies the 23-bit destination register (XARx, XSP, XSSP, XDP, or XCDP). This operation is completed in the address phase of the pipeline by the A-unit address generator. Data memory is not accessed. The premodification or postmodification of the auxiliary register (ARx), the use of *port(#K), and the use of the readport() or writeport() qualifier is not supported for this instruction. The use of auxiliary register offset operations is supported. If the corresponding bit (ARnLC) in status register ST2_55 is set to 1, the circular buffer management also controls the result stored in XAdst.

Status Bits

Affected by Affects

ST2_55 none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Load Extended Auxiliary Register from Memory - Load Extended Auxiliary Register with Immediate Value - Modify Auxiliary Register Content - Move Extended Auxiliary Register Content - Store Extended Auxiliary Register Content to Memory

Example
Syntax XAR0 = mar(*AR1) Description The content of AR1 is loaded into XAR0.

5-238

Instruction Set Descriptions

SPRU375G

Move Accumulator Content to Auxiliary or Temporary Register

Move Accumulator Content to Auxiliary or Temporary Register


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax TAx = HI(ACx)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, TAx

0100 010E 00SS FDDD

This instruction moves the high part of the accumulator, ACx(3116), to the destination auxiliary or temporary register (TAx). The 16-bit move operation is performed in the A-unit ALU. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.

Status Bits

Affected by Affects

M40 none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Move Accumulator, Auxiliary, or Temporary Register Content - Move Auxiliary or Temporary Register Content to Accumulator

Example
Syntax AR2 = HI(AC0)
Before AC0 AR2 01 E500 0030 0200

Description The content of AC0(3116) is copied to AR2.


After AC0 AR2 01 E500 0030 E500

SPRU375G

Instruction Set Descriptions

5-239

Move Accumulator, Auxiliary, or Temporary Register Content

Move Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
No. [1] Syntax dst = src Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst, src

0010 001E FSSS FDDD

This instruction moves the content of the source (src) register to the destination (dst) register:
- When the destination (dst) register is an accumulator: J J

The 40-bit move operation is performed in the D-unit ALU. During the 40-bit move operation, an overflow is detected according to M40: H H the destination accumulator overflow status bit (ACOVx) is set. the destination register (ACx) is saturated according to SATD.

If the source (src) register is an auxiliary or temporary register, the 16 LSBs of the source register are sign extended to 40 bits according to SXMD.

- When the destination (dst) register is an auxiliary or temporary register: J J

The 16-bit move operation is performed in the A-unit ALU. If the source (src) register is an accumulator, the 16 LSBs of the accumulator are used to perform the operation.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat See Also M40, SATD, SXMD ACOVx

This instruction can be repeated. See the following other related instructions:
- Move Accumulator Content to Auxiliary or Temporary Register - Move Auxiliary or Temporary Register Content to Accumulator - Move Auxiliary or Temporary Register Content to CPU Register - Move Extended Auxiliary Register Content

5-240

Instruction Set Descriptions

SPRU375G

Move Accumulator, Auxiliary, or Temporary Register Content

Example
Syntax AC1 = AC0
Before AC0 AC1 M40 SATD ACOV1 01 E500 0030 00 2800 0200 0 0 0

Description The content of AC0 is copied to AC1. Because an overflow occurred, ACOV1 is set to 1.
After AC0 AC1 M40 SATD ACOV1 01 E500 0030 01 E500 0030 0 0 1

SPRU375G

Instruction Set Descriptions

5-241

Move Auxiliary or Temporary Register Content to Accumulator

Move Auxiliary or Temporary Register Content to Accumulator


Syntax Characteristics
No. [1] Syntax HI(ACx) = TAx Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, TAx

0101 001E FSSS 00DD

This instruction moves the content of the auxiliary or temporary register (TAx) to the high part of the accumulator, ACx(3116):
- The 16-bit move operation is performed in the D-unit ALU. - During the 16-bit move operation, an overflow is detected according to

M40:
J J

the destination accumulator overflow status bit (ACOVx) is set. the destination register (ACx) is saturated according to SATD.

- If the source (src) register is an auxiliary or temporary register, the

16 LSBs of the source register are sign extended to 40 bits according to SXMD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat See Also M40, SATD, SXMD ACOVx

This instruction can be repeated. See the following other related instructions:
- Move Accumulator Content to Auxiliary or Temporary Register - Move Accumulator, Auxiliary, or Temporary Register Content - Move Auxiliary or Temporary Register Content to CPU Register - Move Extended Auxiliary Register Content

Example
Syntax HI(AC0) = T0 Description The content of T0 is copied to AC0(3116).

5-242

Instruction Set Descriptions

SPRU375G

Move Auxiliary or Temporary Register Content to CPU Register

Move Auxiliary or Temporary Register Content to CPU Register


Syntax Characteristics
No. [1] [2] [3] [4] [5] [6] Syntax BRC0 = TAx BRC1 = TAx CDP = TAx CSR = TAx SP = TAx SSP = TAx Parallel Enable Bit Yes Yes Yes Yes Yes Yes Size 2 2 2 2 2 2 Cycles 1 1 1 1 1 1 Pipeline X X X X X X

Opcode Operands Description

See Table 53 (page 5-244). TAx This instruction moves the content of the auxiliary or temporary register (TAx) to the selected CPU register. All the move operations are performed in the execute phase of the pipeline and the A-unit ALU is used to transfer the content of the registers. There is a 3-cycle latency between SP, SSP, CDP, TAx, CSR, and BRCx update and their use in the address phase by the A-unit address generator units or by the P-unit loop control management. For instruction [2] when BRC1 is loaded with the content of TAx, the block repeat save register (BRS1) is also loaded with the same value.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Move Accumulator Content to Auxiliary or Temporary Register - Move Accumulator, Auxiliary, or Temporary Register Content - Move Auxiliary or Temporary Register Content to Accumulator - Move CPU Register Content to Auxiliary or Temporary Register - Move Extended Auxiliary Register Content

SPRU375G

Instruction Set Descriptions

5-243

Move Auxiliary or Temporary Register Content to CPU Register

Example
Syntax BRC1 = T1 Description The content of T1 is copied to the block repeat register (BRC1) and to the block repeat save register (BRS1).
After 0034 00EA 00EA T1 BRC1 BRS1 0034 0034 0034

Before T1 BRC1 BRS1

Table 53. Opcodes for Move Auxiliary or Temporary Register Content to CPU Register Instruction
No. [1] [2] [3] [4] [5] [6] Syntax BRC0 = TAx BRC1 = TAx CDP = TAx CSR = TAx SP = TAx SSP = TAx Opcode

0101 001E FSSS 1110 0101 001E FSSS 1101 0101 001E FSSS 1010 0101 001E FSSS 1100 0101 001E FSSS 1000 0101 001E FSSS 1001

5-244

Instruction Set Descriptions

SPRU375G

Move CPU Register Content to Auxiliary or Temporary Register

Move CPU Register Content to Auxiliary or Temporary Register


Syntax Characteristics
Parallel Enable Bit Yes Yes Yes Yes Yes Yes

No. [1] [2] [3] [4] [5] [6]

Syntax TAx = BRC0 TAx = BRC1 TAx = CDP TAx = SP TAx = SSP TAx = RPTC

Size 2 2 2 2 2 2

Cycles 1 1 1 1 1 1

Pipeline X X X X X X

Opcode Operands Description

See Table 54 (page 5-246). TAx This instruction moves the content of the selected CPU register to the auxiliary or temporary register (TAx). All the move operations are performed in the execute phase of the pipeline and the A-unit ALU is used to transfer the content of the registers. For instructions [1] and [2], BRCx is decremented in the address phase of the last instruction of a loop. These instructions have a 3-cycle latency requirement versus the last instruction of a loop. For instructions [3], [4], and [5], there is a 3-cycle latency between SP, SSP, CDP, and TAx update and their use in the address phase by the A-unit address generator units or by the P-unit loop control management.

Status Bits

Affected by Affects

none none

Repeat See Also

Instruction [6] cannot be repeated; all other instructions can be repeated. See the following other related instructions:
- Move Accumulator Content to Auxiliary or Temporary Register - Move Auxiliary or Temporary Register Content to CPU Register - Store CPU Register Content to Memory

SPRU375G

Instruction Set Descriptions

5-245

Move CPU Register Content to Auxiliary or Temporary Register

Example
Syntax T1 = BRC1
Before T1 BRC1 0034 00EA

Description The content of block repeat register (BRC1) is copied to T1.


After T1 BRC1 00EA 00EA

Table 54. Opcodes for Move CPU Register Content to Auxiliary or Temporary Register Instruction
No. [1] [2] [3] [4] [5] [6] Syntax TAx = BRC0 TAx = BRC1 TAx = CDP TAx = SP TAx = SSP TAx = RPTC Opcode

0100 010E 1100 FDDD 0100 010E 1101 FDDD 0100 010E 1010 FDDD 0100 010E 1000 FDDD 0100 010E 1001 FDDD 0100 010E 1110 FDDD

5-246

Instruction Set Descriptions

SPRU375G

Move Extended Auxiliary Register Content

Move Extended Auxiliary Register Content


Syntax Characteristics
No. [1] Syntax xdst = xsrc Parallel Enable Bit No Size 2 Cycles 1 Pipeline X

Opcode Operands Description xdst, xsrc

1001 0000 XSSS XDDD

This instruction moves the content of the source register (xsrc) to the destination register (xdst):
- When the destination register (xdst) is an accumulator (ACx) and the

source register (xsrc) is a 23-bit register (XARx, XSP, XSSP, XDP, or XCDP):
J J

The 23-bit move operation is performed in the D-unit ALU. The upper bits of ACx are filled with 0.

- When the source register (xsrc) is an accumulator (ACx) and the

destination register (xdst) is a 23-bit register (XARx, XSP, XSSP, XDP, or XCDP):
J J

The 23-bit move operation is performed in the A-unit ALU. The lower 23 bits of ACx are loaded into xdst.

- When both the source register (xsrc) and the destination register (xdst) are

accumulators, the Move Accumulator Content instruction (dst = src) is assembled. Status Bits Affected by Affects Repeat See Also none none

This instruction can be repeated. See the following other related instructions:
- Load Extended Auxiliary Register from Memory - Load Extended Auxiliary Register with Immediate Value - Modify Extended Auxiliary Register Content - Store Extended Auxiliary Register Content to Memory

Example
Syntax XAR1 = AC0 Description The lower 23 bits of AC0 are loaded into XAR1.

SPRU375G

Instruction Set Descriptions

5-247

Move Memory to Memory

Move Memory to Memory


Syntax Characteristics
Parallel Enable Bit No No No No No No

No. [1] [2] [3] [4] [5] [6]

Syntax Smem = coef(Cmem) coef(Cmem) = Smem Lmem = dbl(coef(Cmem)) dbl(coef(Cmem)) = Lmem dbl(Ymem) = dbl(Xmem) Ymem = Xmem

Size 3 3 3 3 3 3

Cycles 1 1 1 1 1 1

Pipeline X X X X X X

Description

These instructions store the content of a memory location to a memory location. They use a dedicated datapath to perform the operation. Affected by Affects none none

Status Bits

See Also

See the following other related instructions:


- Store Accumulator Content to Memory - Store Accumulator, Auxiliary, or Temporary Register Content to Memory - Store Auxiliary or Temporary Register Pair Content to Memory - Store CPU Register Content to Memory - Store Extended Auxiliary Register Content to Memory

5-248

Instruction Set Descriptions

SPRU375G

Move Memory to Memory

Move Memory to Memory


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax Smem = coef(Cmem)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Cmem, Smem

1110 1111 AAAA AAAI xxxx 00mm

This instruction stores the content of a data memory operand Cmem, addressed using the coefficient addressing mode, to a memory (Smem) location. For this instruction, the Cmem operand is not accessed through the BB bus. On all C55x-based devices, the Cmem operand may be mapped in external or internal memory space.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax *(#0500h) = coef(*CDP)

This instruction can be repeated.

Description The content addressed by the coefficient data pointer register (CDP) is copied to address 0500h.
After 3400 0000 *CDP 500 3400 3400

Before *CDP 500

SPRU375G

Instruction Set Descriptions

5-249

Move Memory to Memory

Move Memory to Memory


Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax coef(Cmem) = Smem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Cmem, Smem

1110 1111 AAAA AAAI xxxx 01mm

This instruction stores the content of a memory (Smem) location to a data memory (Cmem) location addressed using the coefficient addressing mode. For this instruction, the Cmem operand is not accessed through the BB bus. On all C55x-based devices, the Cmem operand may be mapped in external or internal memory space.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax coef(*CDP) = *AR3

This instruction can be repeated.

Description The content addressed by AR3 is copied in the location addressed by the coefficient data pointer register (CDP).

5-250

Instruction Set Descriptions

SPRU375G

Move Memory to Memory

Move Memory to Memory


Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax Lmem = dbl(coef(Cmem))

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Cmem, Lmem

1110 1111 AAAA AAAI xxxx 10mm

This instruction stores the content of two consecutive data memory (Cmem) locations, addressed using the coefficient addressing mode, to two consecutive data memory (Lmem) locations. For this instruction, the Cmem operand is not accessed through the BB bus. On all C55x-based devices, the Cmem operand may be mapped in external or internal memory space.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax

This instruction can be repeated.

Description The content (long word) addressed by the coefficient data pointer register (CDP) and CDP + 1 is copied in the location addressed by AR1 and AR1 + 1, respectively. After the memory store, CDP is incremented by the content of T0 (5).
After 0005 0200 0300 3400 0FD3 0000 0000 T0 CDP AR1 200 201 300 301 0005 0205 0300 3400 0FD3 3400 0FD3

*AR1 = dbl(coef(*(CDP + T0)))

Before T0 CDP AR1 200 201 300 301

SPRU375G

Instruction Set Descriptions

5-251

Move Memory to Memory

Move Memory to Memory


Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax dbl(coef(Cmem)) = Lmem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Cmem, Lmem

1110 1111 AAAA AAAI xxxx 11mm

This instruction stores the content of two consecutive data memory (Lmem) locations to two consecutive data memory (Cmem) locations addressed using the coefficient addressing mode. For this instruction, the Cmem operand is not accessed through the BB bus. On all C55x-based devices, the Cmem operand may be mapped in external or internal memory space.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax dbl(coef(*CDP)) = *AR3+

This instruction can be repeated.

Description The content (long word) addressed by AR3 and AR3 + 1 is copied in the location addressed by the coefficient data pointer register (CDP) and CDP + 1, respectively. Because this instruction is a long-operand instruction, AR3 is incremented by 2 after the execution.

5-252

Instruction Set Descriptions

SPRU375G

Move Memory to Memory

Move Memory to Memory


Syntax Characteristics
Parallel Enable Bit No

No. [5]

Syntax dbl(Ymem) = dbl(Xmem)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Xmem, Ymem

1000 0000 XXXM MMYY YMMM 00xx

This instruction stores the content of two consecutive data memory (Xmem) locations, addressed using the dual addressing mode, to two consecutive data memory (Ymem) locations. Affected by Affects none none

Status Bits

Repeat Example
Syntax dbl(*AR1) = dbl(*AR0)

This instruction can be repeated.

Description The content addressed by AR0 is copied in the location addressed by AR1 and the content addressed by AR0 + 1 is copied in the location addressed by AR1 + 1.
After 0300 0400 3400 0FD3 0000 0000 AR0 AR1 300 301 400 401 0300 0400 3400 0FD3 3400 0FD3

Before AR0 AR1 300 301 400 401

SPRU375G

Instruction Set Descriptions

5-253

Move Memory to Memory

Move Memory to Memory


Syntax Characteristics
Parallel Enable Bit No

No. [6]

Syntax Ymem = Xmem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Xmem, Ymem

1000 0000 XXXM MMYY YMMM 01xx

This instruction stores the content of data memory (Xmem) location, addressed using the dual addressing mode, to data memory (Ymem) location. Affected by Affects none none

Status Bits

Repeat Example
Syntax *AR3 = *AR5

This instruction can be repeated.

Description The content addressed by AR5 is copied in the location addressed by AR3.

5-254

Instruction Set Descriptions

SPRU375G

Multiply

Multiply
Syntax Characteristics
Parallel Enable Bit Yes Yes Yes No No No No No No

No. [1] [2] [3] [4] [5] [6] [7] [8] [9]

Syntax ACy = rnd(ACy * ACx) ACy = rnd(ACx * Tx) ACy = rnd(ACx * K8) ACy = rnd(ACx * K16) ACx = rnd(Smem * coef(Cmem))[, T3 = Smem] ACy = rnd(Smem * ACx)[, T3 = Smem] ACx = rnd(Smem * K8)[, T3 = Smem] ACx = M40(rnd(uns(Xmem) * uns(Ymem)))[, T3 = Xmem] ACx = rnd(uns(Tx * Smem))[, T3 = Smem]

Size 2 2 3 4 3 3 4 4 3

Cycles 1 1 1 1 1 1 1 1 1

Pipeline X X X X X X X X X

Description

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are:
- ACx(3216) - the content of Tx, sign extended to 17 bits - the 8-bit signed constant, K8, sign extended to 17 bits - the 16-bit signed constant, K16, sign extended to 17 bits - the content of a memory (Smem) location, sign extended to 17 bits - the content of a data memory operand Cmem, addressed using the

coefficient addressing mode, sign extended to 17 bits


- the content of data memory operand Xmem, extended to 17 bits, and the

content of data memory operand Ymem, extended to 17 bits Status Bits Affected by Affects FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

SPRU375G

Instruction Set Descriptions

5-255

Multiply

See Also

See the following other related instructions:


- Modify Auxiliary Register Content with Parallel Multiply - Multiply and Accumulate - Multiply and Accumulate with Parallel Multiply - Multiply and Subtract - Multiply and Subtract with Parallel Multiply - Multiply with Parallel Multiply and Accumulate - Multiply with Parallel Store Accumulator Content to Memory - Parallel Multiplies - Square

5-256

Instruction Set Descriptions

SPRU375G

Multiply

Multiply
Syntax Characteristics
No. [1] Syntax ACy = rnd(ACy * ACx) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy

0101 010E DDSS 011%

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are ACx(3216) and ACy(3216).
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC1 * AC0
Before AC0 AC1 M40 FRCT ACOV1

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

Description The content of AC1 is multiplied by the content of AC0 and the result is stored in AC1.
After AC0 AC1 M40 FRCT ACOV1

02 6000 3400 00 C000 0000 1 0 0

02 6000 3400 00 4800 0000 1 0 0

SPRU375G

Instruction Set Descriptions

5-257

Multiply

Multiply
Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax ACy = rnd(ACx * Tx)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 100E DDSS ss0%

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are ACx(3216) and the content of Tx, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 * T0 Description The content of AC1 is multiplied by the content of T0 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-258

Instruction Set Descriptions

SPRU375G

Multiply

Multiply
Syntax Characteristics
Parallel Enable Bit Yes

No. [3]

Syntax ACy = rnd(ACx * K8)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, K8

0001 111E KKKK KKKK SSDD xx0%

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are ACx(3216) and the 8-bit signed constant, K8, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 * #2 Description The content of AC1 is multiplied by a signed 8-bit value (2) and the result is stored in AC0.

FRCT, M40, RDM none

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-259

Multiply

Multiply
Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax ACy = rnd(ACx * K16)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, K16

0111 1001 KKKK KKKK KKKK KKKK SSDD xx0%

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are ACx(3216) and the 16-bit signed constant, K16, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 * #64 Description The content of AC1 is multiplied by a signed 16-bit value (64) and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-260

Instruction Set Descriptions

SPRU375G

Multiply

Multiply
Syntax Characteristics
No. [5] Syntax ACx = rnd(Smem * coef(Cmem))[, T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Cmem, Smem

1101 0001 AAAA AAAI U%DD 00mm This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Repeat Example
Syntax AC0 = *AR3 * coef(*CDP) Description The content addressed by AR3 is multiplied by the content addressed by the coefficient data pointer register (CDP) and the result is stored in AC0.

Affected by Affects

FRCT, M40, RDM, SATD, SMUL ACOVx

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-261

Multiply

Multiply
Syntax Characteristics
Parallel Enable Bit No

No. [6]

Syntax ACy = rnd(Smem * ACx) [,T3 = Smem]

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 0011 AAAA AAAI U%DD 00SS

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are ACx(3216) and the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = *AR3 * AC1 Description The content addressed by AR3 is multiplied by the content of AC1 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-262

Instruction Set Descriptions

SPRU375G

Multiply

Multiply
Syntax Characteristics
Parallel Enable Bit No

No. [7]

Syntax ACx = rnd(Smem * K8) [,T3 = Smem]

Size 4

Cycles 1

Pipeline X

Opcode Operands Description ACx, K8, Smem

1111 1000 AAAA AAAI KKKK KKKK xxDD x0U%

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits, and the 8-bit signed constant, K8, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat FRCT, M40, RDM none

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax AC0 = *AR3 * #2 Description The content addressed by AR3 is multiplied a signed 8-bit value (2) and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-263

Multiply

Multiply
Syntax Characteristics
No. [8] Syntax ACx = M40(rnd(uns(Xmem) * uns(Ymem)))[, T3 = Xmem] Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0110 XXXM MMYY YMMM xxDD 000g uuU% ACx, Xmem, Ymem This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of data memory operand Ymem, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. This instruction provides the option to store the 16-bit data memory operand Xmem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
5-264 Instruction Set Descriptions SPRU375G

Multiply

Status Bits

Affected by Affects

FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx

Repeat Example
Syntax

This instruction can be repeated.

Description The unsigned content addressed by AR3 is multiplied by the unsigned content addressed by AR4 and the result is stored in AC0.

AC0 = uns(*AR3) * uns(*AR4)

SPRU375G

Instruction Set Descriptions

5-265

Multiply

Multiply
Syntax Characteristics
No. [9] Syntax ACx = rnd(uns(Tx * Smem)) [,T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Smem, Tx

1101 0011 AAAA AAAI U%DD u1ss

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the instruction, the 32-bit result is zero extended to 40 bits. If the optional uns keyword is not applied to the instruction, the 32-bit result is sign extended to 40 bits.

- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = uns(T0 * *AR3) Description The content addressed by AR3 is multiplied by the content of T0 and the unsigned result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVx

This instruction can be repeated.

5-266

Instruction Set Descriptions

SPRU375G

Multiply with Parallel Multiply and Accumulate

Multiply with Parallel Multiply and Accumulate


Syntax Characteristics
No. [1] Syntax ACx = M40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0100 XXXM MMYY YMMM 10mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: multiply, and multiply and accumulate (MAC). The operations are executed in the two D-unit MACs. The first operation performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. The second operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - For the first operation, the 32-bit result of the multiplication is sign

extended to 40 bits.
- For the second operation, the 32-bit result of the multiplication is sign

extended to 40 bits and added to the source accumulator ACy shifted right by 16 bits. The shifting operation is performed with a sign extension of source accumulator ACy(39).
SPRU375G Instruction Set Descriptions 5-267

Multiply with Parallel Multiply and Accumulate

- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Multiply - Multiply and Accumulate - Parallel Multiply and Accumulates

Example
Syntax AC0 = uns(*AR3) * uns(coef(*CDP)), AC1 = (AC1 >> #16) + (uns(*AR4) * uns(coef(*CDP))) Description Both instructions are performed in parallel. The unsigned content addressed by AR3 is multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) and the result is stored in AC0. The unsigned content addressed by AR4 multiplied by the unsigned content addressed by CDP is added to the content of AC1 shifted right by 16 bits and the result is stored in AC1.

5-268

Instruction Set Descriptions

SPRU375G

Multiply with Parallel Store Accumulator Content to Memory

Multiply with Parallel Store Accumulator Content to Memory


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax ACy = rnd(Tx * Xmem), Ymem = HI(ACx << T2) [,T3 = Xmem]

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

1000 0111 XXXM MMYY YMMM SSDD 000x ssU% ACx, ACy, Tx, Xmem, Ymem This instruction performs two operations in parallel: multiply and store. The first operation performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the content of data memory operand Xmem, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD.
- This instruction provides the option to store the 16-bit data memory

operand Xmem in temporary register T3. The second operation shifts the accumulator ACx by the content of T2 and stores ACx(3116) to data memory operand Ymem. If the 16-bit value in T2 is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value.
- The input operand is shifted in the D-unit shifter according to SXMD. - After the shift, the high part of the accumulator, ACx(3116), is stored to

the memory location.


SPRU375G Instruction Set Descriptions 5-269

Multiply with Parallel Store Accumulator Content to Memory

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When this instruction is executed with C54CM = 1, the 6 LSBs of T2 are used to determine the shift quantity. The 6 LSBs of T2 define a shift quantity within 32 to +31. When the 16-bit value in T2 is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat See Also C54CM, FRCT, M40, RDM, SATD, SMUL, SXMD ACOVy

This instruction can be repeated. See the following other related instructions:
- Addition with Parallel Store Accumulator Content to Memory - Multiply - Multiply and Accumulate with Parallel Store Accumulator Content to

Memory
- Multiply and Subtract with Parallel Store Accumulator Content to Memory - Store Accumulator Content to Memory - Subtraction with Parallel Store Accumulator Content to Memory

Example
Syntax AC1 = rnd(T0 * *AR0+), *AR1+ = HI(AC0 << T2) Description Both instructions are performed in parallel. The content addressed by AR0 is multiplied by the content of T0. Since FRCT = 1, the result is multiplied by 2, rounded, and stored in AC1. The content of AC0 is shifted by the content of T2, and AC0(3116) is stored at the address of AR1. AR0 and AR1 are both incremented by 1.
After AC0 AC1 AR0 AR1 T0 T2 200 300 FRCT ACOV1 CARRY

Before AC0 AC1 AR0 AR1 T0 T2 200 300 FRCT ACOV1 CARRY

FF 8421 1234 00 0000 0000 0200 0300 4000 0004 4000 1111 1 0 0

FF 8421 1234 00 2000 0000 0201 0301 4000 0004 4000 4211 1 0 0

5-270

Instruction Set Descriptions

SPRU375G

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
Parallel Enable Bit Yes Yes Yes No No No No No No No

No. [1] [2] [3] [4] [5] [6] [7] [8] [9]

Syntax ACy = rnd(ACy + (ACx * Tx)) ACy = rnd((ACy * Tx) + ACx) ACy = rnd(ACx + (Tx * K8)) ACy = rnd(ACx + (Tx * K16)) ACx = rnd(ACx + (Smem * coef(Cmem)))[, T3 = Smem] ACy = rnd(ACy + (Smem * ACx))[, T3 = Smem] ACy = rnd(ACx + (Tx * Smem))[, T3 = Smem] ACy = rnd(ACx + (Smem * K8))[, T3 = Smem ] ACy = M40(rnd(ACx + (uns(Xmem) * uns(Ymem)))) [, T3 = Xmem]

Size 2 2 3 4 3 3 3 4 4 4

Cycles 1 1 1 1 1 1 1 1 1 1

Pipeline X X X X X X X X X X

[10] ACy = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(Ymem)))) [, T3 = Xmem]

Description

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are:
- ACx(3216) - the content of Tx, sign extended to 17 bits - the 8-bit signed constant, K8, sign extended to 17 bits - the 16-bit signed constant, K16, sign extended to 17 bits - the content of a memory (Smem) location, sign extended to 17 bits - the content of a data memory operand Cmem, addressed using the

coefficient addressing mode, sign extended to 17 bits


- the content of data memory operand Xmem, extended to 17 bits, and the

content of data memory operand Ymem, extended to 17 bits Status Bits Affected by Affects
SPRU375G

FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy


Instruction Set Descriptions 5-271

Multiply and Accumulate (MAC)

See Also

See the following other related instructions:


- Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Multiply and Accumulate with Parallel Delay - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Accumulate with Parallel Multiply - Multiply and Accumulate with Parallel Store Accumulator Content to

Memory
- Multiply and Subtract - Multiply and Subtract with Parallel Multiply and Accumulate - Multiply with Parallel Multiply and Accumulate - Parallel Multiply and Accumulates

5-272

Instruction Set Descriptions

SPRU375G

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax ACy = rnd(ACy + (ACx * Tx))

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 011E DDSS ss0%

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are ACx(3216) and the content of Tx, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 + (AC1 * T0) Description The content of AC1 multiplied by the content of T0 is added to the content of AC0 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-273

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax ACy = rnd((ACy * Tx) + ACx)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 100E DDSS ss1%

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are ACy(3216) and the content of Tx, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC1 = rnd((AC1 * T1) + AC0) Description The content of AC1 multiplied by the content of T1 is added to the content of AC0. The result is rounded and stored in AC1.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-274

Instruction Set Descriptions

SPRU375G

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
Parallel Enable Bit Yes

No. [3]

Syntax ACy = rnd(ACx + (Tx * K8))

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, K8, Tx

0001 111E KKKK KKKK SSDD ss1%

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the 8-bit signed constant, K8, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + (T0 * K8) Description The content of T0 multiplied by a signed 8-bit value is added to the content of AC1 and the result is stored in AC0.

FRCT, M40, RDM, SATD ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-275

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax ACy = rnd(ACx + (Tx * K16))

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

0111 1001 KKKK KKKK KKKK KKKK SSDD ss1% ACx, ACy, K16, Tx This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the 16-bit signed constant, K16, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + (T0 * #FFFFh) Description The content of T0 multiplied by a signed 16-bit value (FFFFh) is added to the content of AC1 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-276

Instruction Set Descriptions

SPRU375G

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
No. [5] Syntax ACx = rnd(ACx + (Smem * coef(Cmem)))[, T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Cmem, Smem

1101 0001 AAAA AAAI U%DD 01mm

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVx) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat
SPRU375G

FRCT, M40, RDM, SATD, SMUL ACOVx

This instruction can be repeated.


Instruction Set Descriptions 5-277

Multiply and Accumulate (MAC)

Example
Syntax AC2 = rnd(AC2 + (*AR1 * coef(*CDP))) Description The content addressed by AR1 multiplied by the content addressed by the coefficient data pointer register (CDP) is added to the content of AC2. The result is rounded and stored in AC2. The result generated an overflow.

Before AC2 AR1 CDP 302 202 ACOV2

00 EC00 0000 0302 0202 FE00 0040 0

After AC2 AR2 CDP 302 202 ACOV2

00 EC00 0000 0302 0202 FE00 0040 1

5-278

Instruction Set Descriptions

SPRU375G

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
No. [6] Syntax ACy = rnd(ACy + (Smem * ACx))[, T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 0010 AAAA AAAI U%DD 00SS

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are ACx(3216) and the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC1 + (*AR3 * AC0) Description The content addressed by AR3 multiplied by the content of AC0 is added to the content of AC1 and the result is stored in AC1.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-279

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
No. [7] Syntax ACy = rnd(ACx + (Tx * Smem))[, T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem, Tx

1101 0100 AAAA AAAI U%DD ssSS

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + (T0 * *AR3) Description The content addressed by AR3 multiplied by the content of T0 is added to the content of AC1 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-280

Instruction Set Descriptions

SPRU375G

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
No. [8] Syntax ACy = rnd(ACx + (Smem * K8))[, T3 = Smem] Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1111 1000 AAAA AAAI KKKK KKKK SSDD x1U% ACx, ACy, K8, Smem This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits, and the 8-bit signed constant, K8, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat FRCT, M40, RDM, SATD ACOVy

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax AC0 = AC1 + (*AR3 * #FFh) Description The content addressed by AR3 multiplied by a signed 8-bit value (FFh) is added to the content of AC1 and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-281

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
No. [9] Syntax ACy = M40(rnd(ACx + (uns(Xmem) * uns(Ymem)))) [, T3 = Xmem] Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0110 XXXM MMYY YMMM SSDD 001g uuU% ACx, ACy, Xmem, Ymem This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of data memory operand Ymem, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. This instruction provides the option to store the 16-bit data memory operand Xmem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
5-282 Instruction Set Descriptions SPRU375G

Multiply and Accumulate (MAC)

Status Bits

Affected by Affects

FRCT, M40, RDM, SATD, SMUL, SXMD ACOVy

Repeat Example
Syntax

This instruction can be repeated.

Description The unsigned content addressed by AR2 multiplied by the unsigned content addressed by AR3 is added to the content of AC3. The result is rounded and stored in AC3. The result generated an overflow. AR2 and AR3 are both incremented by 1.

AC3 = rnd(AC3 + (uns(*AR2+) * uns(*AR3+)))

Before AC3 AR2 AR3 ACOV3 302 202 M40 SATD FRCT

00 2300 EC00 302 202 0 FE00 7000 0 0 0

After AC3 AR2 AR3 ACOV3 302 202 M40 SATD FRCT

00 9221 0000 303 203 1 FE00 7000 0 0 0

SPRU375G

Instruction Set Descriptions

5-283

Multiply and Accumulate (MAC)

Multiply and Accumulate (MAC)


Syntax Characteristics
No. Syntax Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

[10] ACy = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(Ymem)))) [, T3 = Xmem]

Opcode Operands Description

1000 0110 XXXM MMYY YMMM SSDD 010g uuU% ACx, ACy, Xmem, Ymem This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of data memory operand Ymem, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx shifted right by 16 bits. The shifting operation is performed with a sign extension of source accumulator ACx(39).
- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. This instruction provides the option to store the 16-bit data memory operand Xmem in temporary register T3.
5-284 Instruction Set Descriptions SPRU375G

Multiply and Accumulate (MAC)

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = (AC1 >> #16) + (uns(*AR3) * uns(*AR4)) Description The unsigned content addressed by AR3 multiplied by the unsigned content addressed by AR4 is added to the content of AC1 shifted right by 16 bits and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL, SXMD ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-285

Multiply and Accumulate with Parallel Delay

Multiply and Accumulate with Parallel Delay


Syntax Characteristics
No. [1] Syntax ACx = rnd(ACx + (Smem * coef(Cmem)))[, T3 = Smem], delay(Smem) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Cmem, Smem

1101 0000 AAAA AAAI U%DD xxmm

This instruction performs a multiplication and an accumulation in the D-unit MAC in parallel with the delay memory instruction. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVx) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. The soft dual memory addressing mode mechanism cannot be applied to this instruction. This instruction cannot use the *port(#k16) addressing mode or be paralleled with the readport() or writeport() operand qualifier. This instruction cannot be used for accesses to I/O space. Any illegal access to I/O space generates a hardware bus-error interrupt (BERRINT) to be handled by the CPU.
5-286 Instruction Set Descriptions SPRU375G

Multiply and Accumulate with Parallel Delay

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 set to 0, compatibility is ensured. Status Bits Affected by Affects Repeat See Also FRCT, M40, RDM, SATD, SMUL ACOVx

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Multiply and Accumulate - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Accumulate with Parallel Multiply - Multiply and Accumulate with Parallel Store Accumulator Content to

Memory
- Multiply and Subtract with Parallel Multiply and Accumulate - Multiply with Parallel Multiply and Accumulate - Parallel Multiply and Accumulates

Example
Syntax AC0 = AC0 + (*AR3 * coef(*CDP)), delay(*AR3) Description The content addressed by AR3 multiplied by the content addressed by the coefficient data pointer register (CDP) is added to the content of AC0 and the result is stored in AC0. The content addressed by AR3 is copied into the next higher address.

SPRU375G

Instruction Set Descriptions

5-287

Multiply and Accumulate with Parallel Load Accumulator from Memory

Multiply and Accumulate with Parallel Load Accumulator from Memory


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax ACx = rnd(ACx + (Tx * Xmem)), ACy = Ymem << #16 [,T3 = Xmem]

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

1000 0110 XXXM MMYY YMMM DDDD 101x ssU% ACx, ACy, Tx, Xmem, Ymem This instruction performs two operations in parallel: multiply and accumulate (MAC), and load. The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the content of data memory operand Xmem, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVx) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD.
- This instruction provides the option to store the 16-bit data memory

operand Xmem in temporary register T3. The second operation loads the content of data memory operand Ymem shifted left by 16 bits to the accumulator ACy.
- The input operand is sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - The input operand is shifted left by 16 bits according to M40. 5-288 Instruction Set Descriptions SPRU375G

Multiply and Accumulate with Parallel Load Accumulator from Memory

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat See Also FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Multiply and Accumulate - Multiply and Accumulate with Parallel Delay - Multiply and Accumulate with Parallel Multiply - Multiply and Accumulate with Parallel Store Accumulator Content to

Memory
- Multiply and Subtract with Parallel Load Accumulator from Memory - Multiply with Parallel Multiply and Accumulate - Parallel Multiply and Accumulates

Example
Syntax AC0 = AC0 + (T0 * *AR3), AC1 = *AR4 << #16 Description Both instructions are performed in parallel. The content addressed by AR3 multiplied by the content of T0 is added to the content of AC0 and the result is stored in AC0. The content addressed by AR4 shifted left by 16 bits is stored in AC1.

SPRU375G

Instruction Set Descriptions

5-289

Multiply and Accumulate with Parallel Multiply

Multiply and Accumulate with Parallel Multiply


Syntax Characteristics
No. [1] Syntax ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0010 XXXM MMYY YMMM 01mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: multiply and accumulate (MAC), and multiply. The operations are executed in the two D-unit MACs. The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, sign extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. This second operation performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - For the first operation, the 32-bit result of the multiplication is sign

extended to 40 bits and added to the source accumulator ACx.


- For the second operation, the 32-bit result of the multiplication is sign

extended to 40 bits.
- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


5-290 Instruction Set Descriptions SPRU375G

Multiply and Accumulate with Parallel Multiply

- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits Repeat See Also

Affected by Affects

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Multiply and Accumulate - Multiply and Accumulate with Parallel Delay - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Accumulate with Parallel Store Accumulator Content to Memory - Multiply and Subtract with Parallel Multiply - Multiply with Parallel Multiply and Accumulate - Parallel Multiply and Accumulates

Example
Syntax AC0 = AC0 + (uns(*AR3) * uns(coef(*CDP))), AC1 = uns(*AR4) * uns(coef(*CDP)) Description Both instructions are performed in parallel. The unsigned content addressed by AR3 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is added to the content of AC0 and the result is stored in AC0. The unsigned content addressed by AR4 is multiplied by the unsigned content addressed by CDP and the result is stored in AC1.

SPRU375G

Instruction Set Descriptions

5-291

Multiply and Accumulate with Parallel Store Accumulator Content to Memory

Multiply and Accumulate with Parallel Store Accumulator Content to Memory


Syntax Characteristics
No. [1] Syntax ACy = rnd(ACy + (Tx * Xmem)), Ymem = HI(ACx << T2) [,T3 = Xmem] Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0111 XXXM MMYY YMMM SSDD 001x ssU% ACx, ACy, Tx, Xmem, Ymem This instruction performs two operations in parallel: multiply and accumulate (MAC), and store. The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the content of data memory operand Xmem, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD.
- This instruction provides the option to store the 16-bit data memory

operand Xmem in temporary register T3. The second operation shifts the accumulator ACx by the content of T2 and stores ACx(3116) to data memory operand Ymem. If the 16-bit value in T2 is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value.
- The input operand is shifted in the D-unit shifter according to SXMD. - After the shift, the high part of the accumulator, ACx(3116), is stored to

the memory location.


5-292 Instruction Set Descriptions SPRU375G

Multiply and Accumulate with Parallel Store Accumulator Content to Memory

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When this instruction is executed with C54CM = 1, the 6 LSBs of T2 are used to determine the shift quantity. The 6 LSBs of T2 define a shift quantity within 32 to +31. When the 16-bit value in T2 is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat See Also C54CM, FRCT, M40, RDM, SATD, SMUL, SXMD ACOVy

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Multiply and Accumulate - Multiply and Accumulate with Parallel Delay - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Accumulate with Parallel Multiply - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Multiply with Parallel Multiply and Accumulate - Parallel Multiply and Accumulates

Example
Syntax AC0 = AC0 + (T0 * *AR3), *AR4 = HI(AC1 << T2) Description Both instructions are performed in parallel. The content addressed by AR3 multiplied by the content of T0 is added to the content of AC0 and the result is stored in AC0. The content of AC1 is shifted by the content of T2, and AC1(3116) is stored at the address of AR4.

SPRU375G

Instruction Set Descriptions

5-293

Multiply and Subtract

Multiply and Subtract


Syntax Characteristics
No. [1] [2] [3] [4] [5] Syntax ACy = rnd(ACy (ACx * Tx)) ACx = rnd(ACx (Smem * coef(Cmem)))[, T3 = Smem] ACy = rnd(ACy (Smem * ACx))[, T3 = Smem] ACy = rnd(ACx (Tx * Smem))[, T3 = Smem] ACy = M40(rnd(ACx (uns(Xmem) * uns(Ymem)))) [, T3 = Xmem] Parallel Enable Bit Yes No No No No Size 2 3 3 3 4 Cycles 1 1 1 1 1 Pipeline X X X X X

Description

This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are:
- ACx(3216) - the content of Tx, sign extended to 17 bits - the content of a memory (Smem) location, sign extended to 17 bits - the content of a data memory operand Cmem, addressed using the

coefficient addressing mode, sign extended to 17 bits


- the content of data memory operand Xmem, extended to 17 bits, and the

content of data memory operand Ymem, extended to 17 bits Status Bits Affected by Affects See Also FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

See the following other related instructions:


- Modify Auxiliary Register Content with Parallel Multiply and Subtract - Multiply and Accumulate - Multiply and Subtract with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Multiply - Multiply and Subtract with Parallel Multiply and Accumulate - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Parallel Multiply and Subtracts

5-294

Instruction Set Descriptions

SPRU375G

Multiply and Subtract

Multiply and Subtract


Syntax Characteristics
No. [1] Syntax ACy = rnd(ACy (ACx * Tx)) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 011E DDSS ss1% This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are ACx(3216) and the content of Tx, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Repeat Example
Syntax AC1 = rnd(AC1 (AC0 * T1))
Before AC0 AC1 T1 M40 ACOV1 FRCT

Affected by Affects

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

Description The content of AC0 multiplied by the content of T1 is subtracted from the content of AC1. The result is rounded and stored in AC1.
After AC0 AC1 T1 M40 ACOV1 FRCT 00 EC00 0000 00 1680 0000 2000 0 0 0

00 EC00 0000 00 3400 0000 2000 0 0 0

SPRU375G

Instruction Set Descriptions

5-295

Multiply and Subtract

Multiply and Subtract


Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax ACx = rnd(ACx (Smem * coef(Cmem)))[, T3 = Smem]

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Cmem, Smem

1101 0001 AAAA AAAI U%DD 10mm

This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
5-296 Instruction Set Descriptions SPRU375G

Multiply and Subtract

Status Bits

Affected by Affects

FRCT, M40, RDM, SATD, SMUL ACOVx

Repeat Example
Syntax

This instruction can be repeated.

Description The content addressed by AR1 multiplied by the content addressed by the coefficient data pointer register (CDP) is subtracted from the content of AC2. The result is rounded and stored in AC2.

AC2 = rnd(AC2 (*AR1 * coef(*CDP)))

Before AC2 AR1 CDP 302 202 ACOV2 SATD RDM FRCT

00 EC00 0000 0302 0202 FE00 0040 0 0 0 0

After AC2 AR2 CDP 302 202 ACOV2 SATD RDM FRCT

00 EC01 0000 0302 0202 FE00 0040 1 0 0 0

SPRU375G

Instruction Set Descriptions

5-297

Multiply and Subtract

Multiply and Subtract


Syntax Characteristics
No. [3] Syntax ACy = rnd(ACy (Smem * ACx))[, T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 0010 AAAA AAAI U%DD 01SS

This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are ACx(3216) and the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 (*AR3 * AC1) Description The content addressed by AR3 multiplied by the content of AC1 is subtracted from the content of AC0 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-298

Instruction Set Descriptions

SPRU375G

Multiply and Subtract

Multiply and Subtract


Syntax Characteristics
No. [4] Syntax ACy = rnd(ACx (Tx * Smem))[, T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem, Tx

1101 0101 AAAA AAAI U%DD ssSS

This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 (T0 * *AR3) Description The content addressed by AR3 multiplied by the content of T0 is subtracted from the content of AC1 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-299

Multiply and Subtract

Multiply and Subtract


Syntax Characteristics
No. [5] Syntax ACy = M40(rnd(ACx (uns(Xmem) * uns(Ymem)))) [, T3 = Xmem] Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0110 XXXM MMYY YMMM SSDD 011g uuU% ACx, ACy, Xmem, Ymem This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of data memory operand Ymem, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. This instruction provides the option to store the 16-bit data memory operand Xmem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
5-300 Instruction Set Descriptions SPRU375G

Multiply and Subtract

Status Bits

Affected by Affects

FRCT, M40, RDM, SATD, SMUL, SXMD ACOVy

Repeat Example
Syntax

This instruction can be repeated.

Description The unsigned content addressed by AR2 multiplied by the unsigned content addressed by AR3 is subtracted from the content of AC3 and the result is stored in AC3. AR2 and AR3 are both incremented by 1.

AC3 = AC3 (uns(*AR2+) * uns(*AR3+))

Before AC3 AR2 AR3 ACOV3 302 202 FRCT

00 2300 EC00 302 202 0 FE00 7000 0

After AC3 AR2 AR3 ACOV3 302 202 FRCT

FF B3E0 EC00 303 203 0 FE00 7000 0

SPRU375G

Instruction Set Descriptions

5-301

Multiply and Subtract with Parallel Load Accumulator from Memory

Multiply and Subtract with Parallel Load Accumulator from Memory


Syntax Characteristics
No. [1] Syntax ACx = rnd(ACx (Tx * Xmem)), ACy = Ymem << #16 [,T3 = Xmem] Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0110 XXXM MMYY YMMM DDDD 100x ssU% ACx, ACy, Tx, Xmem, Ymem This instruction performs two operations in parallel: multiply and subtract (MAS), and load. The first operation performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the content of data memory operand Xmem, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD.
- This instruction provides the option to store the 16-bit data memory

operand Xmem in temporary register T3. The second operation loads the content of data memory operand Ymem shifted left by 16 bits to the accumulator ACy.
- The input operand is sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - The input operand is shifted left by 16 bits according to M40.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
5-302 Instruction Set Descriptions SPRU375G

Multiply and Subtract with Parallel Load Accumulator from Memory

Status Bits

Affected by Affects

FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply and Subtract - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Subtract - Multiply and Subtract with Parallel Multiply - Multiply and Subtract with Parallel Multiply and Accumulate - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Parallel Multiply and Subtracts

Example
Syntax AC0 = AC0 (T0 * *AR3), AC1 = *AR4 << #16 Description Both instructions are performed in parallel. The content addressed by AR3 multiplied by the content of T0 is subtracted from the content of AC0 and the result is stored in AC0. The content addressed by AR4 shifted left by 16 bits is stored in AC1.

SPRU375G

Instruction Set Descriptions

5-303

Multiply and Subtract with Parallel Multiply

Multiply and Subtract with Parallel Multiply


Syntax Characteristics
No. [1] Syntax ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0010 XXXM MMYY YMMM 10mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: multiply and subtract (MAS), and multiply. The operations are executed in the two D-unit MACs. The first operation performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. The second operation performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - For the first operation, the 32-bit result of the multiplication is sign

extended to 40 bits and subtracted from the source accumulator ACx.


- For the second operation, the 32-bit result of the multiplication is sign

extended to 40 bits.
- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


5-304 Instruction Set Descriptions SPRU375G

Multiply and Subtract with Parallel Multiply

- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply and Subtract - Multiply and Accumulate with Parallel Multiply - Multiply and Subtract - Multiply and Subtract with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Multiply and Accumulate - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Parallel Multiply and Subtracts

Example
Syntax AC0 = AC0 (uns(*AR3) * uns(coef(*CDP))), AC1 = uns(*AR4) * uns(coef(*CDP)) Description Both instructions are performed in parallel. The unsigned content addressed by AR3 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is subtracted from the content of AC0 and the result is stored in AC0. The unsigned content addressed by AR4 is multiplied by the unsigned content addressed by CDP and the result is stored in AC1.

SPRU375G

Instruction Set Descriptions

5-305

Multiply and Subtract with Parallel Multiply and Accumulate

Multiply and Subtract with Parallel Multiply and Accumulate


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem)))))

Size 4 4

Cycles 1 1

Pipeline X X

Description

These instructions perform two parallel operations in one cycle: multiply and subtract (MAS), and multiply and accumulate (MAC). The operations are executed in the two D-unit MACs. Affected by Affects FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

See Also

See the following other related instructions:


- Modify Auxiliary Register Content with Parallel Multiply and Subtract - Multiply and Subtract - Multiply and Subtract with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Multiply - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Parallel Multiply and Subtracts

5-306

Instruction Set Descriptions

SPRU375G

Multiply and Subtract with Parallel Multiply and Accumulate

Multiply and Subtract with Parallel Multiply and Accumulate


Syntax Characteristics
No. [1] Syntax ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0011 XXXM MMYY YMMM 01mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: multiply and subtract (MAS), and multiply and accumulate (MAC). The operations are executed in the two D-unit MACs. The first operation performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. The second operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - For the first operation, the 32-bit result of the multiplication is sign

extended to 40 bits and subtracted from the source accumulator ACx.


- For the second operation, the 32-bit result of the multiplication is sign

extended to 40 bits and added to the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


SPRU375G Instruction Set Descriptions 5-307

Multiply and Subtract with Parallel Multiply and Accumulate

- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat Example
Syntax

This instruction can be repeated.

Description Both instructions are performed in parallel. The unsigned content addressed by AR0 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is subtracted from the content of AC0. The result is rounded and stored in AC0. The unsigned content addressed by AR1 multiplied by the unsigned content addressed by CDP is added to the content of AC1. The result is rounded and stored in AC1.

AC0 = M40(rnd(AC0 (uns(*AR0) * uns(coef(*CDP))))), AC1 = M40(rnd(AC1 + (uns(*AR1) * uns(coef(*CDP)))))

Before AC0 AC1 *AR0 *AR1 *CDP ACOV0 ACOV1 CARRY FRCT

00 6900 0000 00 0023 0000 3400 EF00 A067 0 0 0 0

After AC0 AC1 *AR0 *AR1 *CDP ACOV0 ACOV1 CARRY FRCT

00 486B 0000 00 95E3 0000 3400 EF00 A067 0 0 0 0

5-308

Instruction Set Descriptions

SPRU375G

Multiply and Subtract with Parallel Multiply and Accumulate

Multiply and Subtract with Parallel Multiply and Accumulate


Syntax Characteristics
No. [2] Syntax ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0100 XXXM MMYY YMMM 00mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel operations in one cycle: multiply and subtract (MAS), and multiply and accumulate (MAC). The operations are executed in the two D-unit MACs. The first operation performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. The second operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - For the first operation, the 32-bit result of the multiplication is sign

extended to 40 bits and subtracted from the source accumulator ACx.


- For the second operation, the 32-bit result of the multiplication is sign

extended to 40 bits and added to the source accumulator ACy shifted right by 16 bits. The shifting operation is performed with a sign extension of source accumulator ACy(39).
SPRU375G Instruction Set Descriptions 5-309

Multiply and Subtract with Parallel Multiply and Accumulate

- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat Example
Syntax

This instruction can be repeated.

Description Both instructions are performed in parallel. The unsigned content addressed by AR3 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is subtracted from the content of AC0 and the result is stored in AC0. The unsigned content addressed by AR4 multiplied by the unsigned content addressed by CDP is added to the content of AC1 shifted right by 16 bits and the result is stored in AC1.

AC0 = AC0 (uns(*AR3) * uns(coef(*CDP))), AC1 = (AC1 >> #16) + (uns(*AR4) * uns(coef(*CDP)))

5-310

Instruction Set Descriptions

SPRU375G

Multiply and Subtract with Parallel Store Accumulator Content to Memory

Multiply and Subtract with Parallel Store Accumulator Content to Memory


Syntax Characteristics
No. [1] Syntax ACy = rnd(ACy (Tx * Xmem)), Ymem = HI(ACx << T2) [,T3 = Xmem] Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0111 XXXM MMYY YMMM SSDD 010x ssU% ACx, ACy, Tx, Xmem, Ymem This instruction performs two operations in parallel: multiply and subtract (MAS), and store. The first operation performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of Tx, sign extended to 17 bits, and the content of data memory operand Xmem, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD.
- This instruction provides the option to store the 16-bit data memory

operand Xmem in temporary register T3. The second operation shifts the accumulator ACx by the content of T2 and stores ACx(3116) to data memory operand Ymem. If the 16-bit value in T2 is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value.
- The input operand is shifted in the D-unit shifter according to SXMD. - After the shift, the high part of the accumulator, ACx(3116), is stored to

the memory location.


SPRU375G Instruction Set Descriptions 5-311

Multiply and Subtract with Parallel Store Accumulator Content to Memory

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When this instruction is executed with C54CM = 1, the 6 LSBs of T2 are used to determine the shift quantity. The 6 LSBs of T2 define a shift quantity within 32 to +31. When the 16-bit value in T2 is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat See Also C54CM, FRCT, M40, RDM, SATD, SMUL, SXMD ACOVy

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply and Subtract - Multiply and Accumulate with Parallel Store Accumulator Content to

Memory
- Multiply and Subtract - Multiply and Subtract with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Multiply - Multiply and Subtract with Parallel Multiply and Accumulate - Parallel Multiply and Subtracts

Example
Syntax AC0 = AC0 (T0 * *AR3), *AR4 = HI(AC1 << T2) Description Both instructions are performed in parallel. The content addressed by AR3 multiplied by the content of T0 is subtracted from the content of AC0 and the result is stored in AC0. The content of AC1 is shifted by the content of T2, and AC1(3116) is stored at the address of AR4.

5-312

Instruction Set Descriptions

SPRU375G

Negate Accumulator, Auxiliary, or Temporary Register Content

Negate Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst = src

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, src

0011 010E FSSS FDDD

This instruction computes the 2s complement of the content of the source register (src). This instruction clears the CARRY status bit to 0 for all nonzero values of src. If src equals 0, the CARRY status bit is set to 1.
- When the destination operand (dst) is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are sign extended to 40 bits according to SXMD. If an auxiliary or temporary register is the source operand (src) of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended according to SXMD. Overflow detection and CARRY status bit depends on M40. When an overflow is detected, the accumulator is saturated according to SATD.

J J

- When the destination operand (dst) is an auxiliary or temporary register: J J J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source operand (src) of the instruction, the 16 LSBs of the accumulator are used to perform the operation. Overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.

SPRU375G

Instruction Set Descriptions

5-313

Negate Accumulator, Auxiliary, or Temporary Register Content

Status Bits

Affected by Affects

M40, SATA, SATD, SXMD ACOVx, CARRY

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Complement Accumulator, Auxiliary, or Temporary Register Bit - Complement Accumulator, Auxiliary, or Temporary Register Content

Example
Syntax AC0 = AC1 Description The 2s complement of the content of AC1 is stored in AC0.

5-314

Instruction Set Descriptions

SPRU375G

No Operation (nop)

No Operation (nop)
Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax nop nop_16

Size 1 2

Cycles 1 1

Pipeline D D

Opcode Operands Description none

0010 000E

Instruction [1] increments the program counter register (PC) by 1 byte. Instruction [2] increments the PC by 2 bytes. Affected by Affects none none

Status Bits

Repeat Example
Syntax nop

This instruction can be repeated.

Description The program counter (PC) is incremented by 1 byte.

SPRU375G

Instruction Set Descriptions

5-315

Parallel Modify Auxiliary Register Contents

Parallel Modify Auxiliary Register Contents


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax mar(Xmem), mar(Ymem), mar(coef(Cmem))

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

1000 0101 XXXM MMYY YMMM 10mm xxxx xxxx Cmem, Xmem, Ymem This instruction performs three parallel modify auxiliary register (MAR) operations in one cycle. The auxiliary register modification is specified by:
- the content of data memory operand Xmem - the content of data memory operand Ymem - the content of a data memory operand Cmem, addressed using the

coefficient addressing mode Status Bits Affected by Affects Repeat See Also none none

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content - Modify Extended Auxiliary Register Content

Example
Syntax mar(*AR3+), mar(*AR4), mar(coef(*CDP)) Description AR3 is incremented by 1. AR4 is decremented by 1. CDP is not modified.

5-316

Instruction Set Descriptions

SPRU375G

Parallel Multiplies

Parallel Multiplies
Syntax Characteristics
No. [1] Syntax ACx = M40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0010 XXXM MMYY YMMM 00mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel multiply operations in one cycle. The operations are executed in the two D-unit MACs. The first operation performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. This second operation performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD.
SPRU375G Instruction Set Descriptions 5-317

Parallel Multiplies

This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply - Multiply - Multiply and Accumulate with Parallel Multiply - Multiply and Subtract with Parallel Multiply - Parallel Multiply and Accumulates - Parallel Multiply and Subtracts

Example
Syntax AC0 = uns(*AR3) * uns(coef(*CDP)), AC1 = uns(*AR4) * uns(coef(*CDP)) Description Both instructions are performed in parallel. The unsigned content addressed by AR3 is multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) and the result is stored in AC0. The unsigned content addressed by AR4 is multiplied by the unsigned content addressed by CDP and the result is stored in AC1.

5-318

Instruction Set Descriptions

SPRU375G

Parallel Multiply and Accumulates

Parallel Multiply and Accumulates


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M4(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem)))))

Size 4 4

Cycles 1 1

Pipeline X X

[3]

No

Description

These instructions perform two parallel multiply and accumulate (MAC) operations in one cycle. The operations are executed in the two D-unit MACs. Affected by Affects FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

See Also

See the following other related instructions:


- Modify Auxiliary Register Content with Parallel Multiply and Accumulate - Multiply and Accumulate - Multiply and Accumulate with Parallel Delay - Multiply and Accumulate with Parallel Load Accumulator from Memory - Multiply and Accumulate with Parallel Multiply - Multiply and Accumulate with Parallel Store Accumulator Content to Memory - Multiply and Subtract with Parallel Multiply and Accumulate - Multiply with Parallel Multiply and Accumulate - Parallel Multiplies - Parallel Multiply and Subtracts

SPRU375G

Instruction Set Descriptions

5-319

Parallel Multiply and Accumulates

Parallel Multiply and Accumulates


Syntax Characteristics
No. [1] Syntax ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0011 XXXM MMYY YMMM 00mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel multiply and accumulate (MAC) operations in one cycle. The operations are executed in the two D-unit MACs. The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. The second operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD.
5-320 Instruction Set Descriptions SPRU375G

Parallel Multiply and Accumulates

This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat Example
Syntax

This instruction can be repeated.

Description Both instructions are performed in parallel. The unsigned content addressed by AR3 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is added to the content of AC0 and the result is stored in AC0. The unsigned content addressed by AR4 multiplied by the unsigned content addressed by CDP is added to the content of AC1 and the result is stored in AC1.

AC0 = AC0 + (uns(*AR3) * uns(coef(*CDP))), AC1 = AC1 + (uns(*AR4) * uns(coef(*CDP)))

SPRU375G

Instruction Set Descriptions

5-321

Parallel Multiply and Accumulates

Parallel Multiply and Accumulates


Syntax Characteristics
No. [2] Syntax ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M4(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0011 XXXM MMYY YMMM 10mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel multiply and accumulate (MAC) operations in one cycle. The operations are executed in the two D-unit MACs. The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. The second operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - For the first operation, the 32-bit result of the multiplication is sign

extended to 40 bits and added to the source accumulator ACx shifted right by 16 bits. The shifting operation is performed with a sign extension of source accumulator ACx(39).
- For the second operation, the 32-bit result of the multiplication is sign

extended to 40 bits and added to the source accumulator ACy.


5-322 Instruction Set Descriptions SPRU375G

Parallel Multiply and Accumulates

- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat Example
Syntax

This instruction can be repeated.

Description Both instructions are performed in parallel. The unsigned content addressed by AR3 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is added to the content of AC0 shifted right by 16 bits and the result is stored in AC0. The unsigned content addressed by AR4 multiplied by the unsigned content addressed by CDP is added to the content of AC1 and the result is stored in AC1.

AC0 = (AC0 >> #16) + (uns(*AR3) * uns(coef(*CDP))), AC1 = AC1 + (uns(*AR4) * uns(coef(*CDP)))

SPRU375G

Instruction Set Descriptions

5-323

Parallel Multiply and Accumulates

Parallel Multiply and Accumulates


Syntax Characteristics
No. [3] Syntax ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0100 XXXM MMYY YMMM 11mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel multiply and accumulate (MAC) operations in one cycle. The operations are executed in the two D-unit MACs. The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. The second operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator shifted right by 16 bits. The shifting operation is performed with a sign extension of source accumulator bit 39.
- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


5-324 Instruction Set Descriptions SPRU375G

Parallel Multiply and Accumulates

- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat Example
Syntax

This instruction can be repeated.

Description Both instructions are performed in parallel. The unsigned content addressed by AR3 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is added to the content of AC0 shifted right by 16 bits and the result is stored in AC0. The unsigned content addressed by AR4 multiplied by the unsigned content addressed by CDP is added to the content of AC1 shifted right by 16 bits and the result is stored in AC1.

AC0 = (AC0 >> #16) + (uns(*AR3) * uns(coef(*CDP))), AC1 = (AC1 >> #16) + (uns(*AR4) * uns(coef(*CDP)))

SPRU375G

Instruction Set Descriptions

5-325

Parallel Multiply and Subtracts

Parallel Multiply and Subtracts


Syntax Characteristics
No. [1] Syntax ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy (uns(Ymem) * uns(coef(Cmem))))) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0101 XXXM MMYY YMMM 01mm uuDD DDg% ACx, ACy, Cmem, Xmem, Ymem This instruction performs two parallel multiply and subtract (MAS) operations in one cycle. The operations are executed in the two D-unit MACs. The first operation performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Xmem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits. The second operation performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of data memory operand Ymem, extended to 17 bits, and the content of a data memory operand Cmem, addressed using the coefficient addressing mode, extended to 17 bits.
- Input operands are extended to 17 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 17 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 17 bits according to SXMD.

- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit is set.


- When an overflow is detected, the accumulator is saturated according to

SATD.
5-326 Instruction Set Descriptions SPRU375G

Parallel Multiply and Subtracts

This instruction provides the option to locally set M40 to 1 for the execution of the instruction, if the optional M40 keyword is applied to the instruction. For this instruction, the Cmem operand is accessed through the BB bus; on some C55x-based devices, the BB bus is only connected to internal memory and not to external memory. To prevent the generation of a bus error, the Cmem operand must not be mapped on external memory. Each data flow can also disable the usage of the corresponding MAC unit, while allowing the modification of auxiliary registers in the three address generation units through the following instructions:
J J J

mar(Xmem) mar(Ymem) mar(Cmem) FRCT, M40, RDM, SATD, SMUL, SXMD ACOVx, ACOVy

Status Bits

Affected by Affects

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Modify Auxiliary Register Content with Parallel Multiply and Subtract - Multiply and Subtract - Multiply and Subtract with Parallel Load Accumulator from Memory - Multiply and Subtract with Parallel Multiply - Multiply and Subtract with Parallel Multiply and Accumulate - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Parallel Multiplies - Parallel Multiply and Accumulates

Example
Syntax AC0 = AC0 (uns(*AR3) * uns(coef(*CDP))), AC1 = AC1 (uns(*AR4) * uns(coef(*CDP))) Description Both instructions are performed in parallel. The unsigned content addressed by AR3 multiplied by the unsigned content addressed by the coefficient data pointer register (CDP) is subtracted from the content of AC0 and the result is stored in AC0. The unsigned content addressed by AR4 multiplied by the unsigned content addressed by CDP is subtracted from the content of AC1 and the result is stored in AC1.

SPRU375G

Instruction Set Descriptions

5-327

Peripheral Port Register Access Qualifiers (readport/writeport)

Peripheral Port Register Access Qualifiers


Syntax Characteristics
No. [1] [2] Syntax readport() writeport() Parallel Enable Bit No No Size 1 1 Cycles 1 1 Pipeline D D

Opcode

readport writeport

1001 1001 1001 1010

Operands Description

none These operand qualifiers allow you to locally disable access toward the data memory and enable access to the 64K-word I/O space. The I/O data location is specified by the Smem, Xmem, or Ymem fields.
- A readport() operand qualifier may be included in any instruction making

a word single data memory access Smem or Xmem that is used in a read operation, except instructions using delay().
- A writeport() operand qualifier may be included in any instruction making

a word single data memory access Smem or Ymem that is used in a write operation, except instructions using the delay().
- A readport() or writeport() operand qualifier cannot be used as a

stand-alone instruction (the assembler generates an error message). Any instruction making a word single data memory access Smem (except those listed above) can use the *port(#k16) addressing mode to access the 64K-word I/O space with an immediate address. When an instruction uses *port(#k16), the 16-bit unsigned constant, k16, is encoded in a 2-byte extension to the instruction. Because of the extension, an instruction using *port(#k16) cannot be executed in parallel with another instruction. The following indirect operands cannot be used for accesses to I/O space. An instruction using one of these operands requires a 2-byte extension to the instruction. Because of the extension, an instruction using one of the following indirect operands cannot be executed with these operand qualifiers.
- *ARn(#K16) - *+ARn(#K16) - *CDP(#K16) - *+CDP(#K16) 5-328 Instruction Set Descriptions SPRU375G

Peripheral Port Register Access Qualifiers (readport/writeport)

Status Bits

Affected by Affects

none none

Repeat Example 1
Syntax T2 = *AR3 || readport()

An instruction using this operand qualifier can be repeated.

Description The content addressed by AR3 (I/O address) is loaded into T2.

Example 2
Syntax *AR3 = T2 || writeport() Description The content of T2 is written to the location addressed by AR3 (I/O address).

SPRU375G

Instruction Set Descriptions

5-329

Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers (popboth)

Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax xdst = popboth()

Size 2

Cycles 1

Pipeline X

Opcode Operands Description xdst

0101 000E XDDD 0100

This instruction moves the content of two 16-bit data memory locations addressed by the data stack pointer (SP) and system stack pointer (SSP) to accumulator ACx or to the 23-bit destination register (XARx, XSP, XSSP, XDP, or XCDP). The content of xdst(150) is loaded from the location addressed by SP and the content of xdst(3116) is loaded from the location addressed by SSP. When xdst is a 23-bit register, the upper 9 bits of the data memory addressed by SSP are discarded and only the 7 lower bits of the data memory are loaded into the high part of xdst(2216). When xdst is an accumulator, the guard bits, ACx(3932), are reloaded (unchanged) with the current value and are not modified by this instruction.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Pop Top of Stack - Push to Top of Stack - Push Accumulator or Extended Auxiliary Register Content to Stack Pointers

5-330

Instruction Set Descriptions

SPRU375G

Pop Top of Stack (pop)

Pop Top of Stack


Syntax Characteristics
Parallel Enable Bit Yes Yes No Yes No No

No. [1] [2] [3] [4] [5] [6]

Syntax dst1, dst2 = pop() dst = pop() dst, Smem = pop() ACx = dbl(pop()) Smem = pop() dbl(Lmem) = pop()

Size 2 2 3 2 2 2

Cycles 1 1 1 1 1 1

Pipeline X X X X X X

Description

These instructions move the content of the data memory location addressed by the data stack pointer (SP) to:
- an accumulator, auxiliary, or temporary register - a data memory location

When the destination register is an accumulator, the guard bits and the 16 higher bits of the accumulator, ACx(3916), are reloaded (unchanged) with the current value and are not modified by these instructions. The increment operation performed on SP is done by the A-unit address generator dedicated to the stack addressing management. Status Bits Affected by Affects See Also none none

See the following other related instructions:


- Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers - Push to Top of Stack - Push Accumulator or Extended Auxiliary Register Content to Stack Pointers

SPRU375G

Instruction Set Descriptions

5-331

Pop Top of Stack (pop)

Pop Top of Stack


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst1, dst2 = pop()

Size 2

Cycles 1

Pipeline X

Opcode
Note: FSSS = dst1, FDDD = dst2

0011 101E FSSS FDDD dst1, dst2 This instruction moves the content of the 16-bit data memory location pointed by SP to destination register dst1 and moves the content of the 16-bit data memory location pointed by SP + 1 to destination register dst2. When the destination register, dst1 or dst2, is an accumulator, the content of the 16-bit data memory operand is moved to the destination accumulator low part, ACx(150). The guard bits and the 16 higher bits of the accumulator, ACx(3916), are reloaded (unchanged) with the current value and are not modified by this instruction. SP is incremented by 2.

Operands Description

Status Bits

Affected by Affects

none none

Repeat Example
Syntax AC0, AC1 = pop()

This instruction can be repeated.

Description The content of the memory location pointed by the data stack pointer (SP) is copied to AC0(150) and the content of the memory location pointed by SP + 1 is copied to AC1(150). Bits 3916 of the accumulators are unchanged. The SP is incremented by 2.
After 00 4500 0000 F7 5678 9432 0300 4890 2300 AC0 AC1 SP 300 301 00 4500 4890 F7 5678 2300 0302 4890 2300

Before AC0 AC1 SP 300 301

5-332

Instruction Set Descriptions

SPRU375G

Pop Top of Stack (pop)

Pop Top of Stack


Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax dst = pop()

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst

0101 000E FDDD x010

This instruction moves the content of the 16-bit data memory location pointed by SP to destination register dst. When the destination register, dst, is an accumulator, the content of the 16-bit data memory operand is moved to the destination accumulator low part, ACx(150). The guard bits and the 16 higher bits of the accumulator, ACx(3916), are reloaded (unchanged) with the current value and are not modified by this instruction. SP is incremented by 1.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax AC0 = pop()

This instruction can be repeated.

Description The content of the memory location pointed by the data stack pointer (SP) is copied to AC0(150). Bits 3916 of AC0 are unchanged. The SP is incremented by 1.

SPRU375G

Instruction Set Descriptions

5-333

Pop Top of Stack (pop)

Pop Top of Stack


Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax dst, Smem = pop()

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, Smem

1110 0100 AAAA AAAI FDDD x1xx

This instruction moves the content of the 16-bit data memory location pointed by SP to destination register dst and moves the content of the 16-bit data memory location pointed by SP + 1 to data memory (Smem) location. When the destination register, dst, is an accumulator, the content of the 16-bit data memory operand is moved to the destination accumulator low part, ACx(150). The guard bits and the 16 higher bits of the accumulator, ACx(3916), are reloaded (unchanged) with the current value and are not modified by this instruction. SP is incremented by 2.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax AC0, *AR3 = pop()

This instruction can be repeated.

Description The content of the memory location pointed by the data stack pointer (SP) is copied to AC0(150) and the content of the memory location pointed by SP + 1 is copied to the location addressed by AR3. Bits 3916 of AC0 are unchanged. The SP is incremented by 2.

5-334

Instruction Set Descriptions

SPRU375G

Pop Top of Stack (pop)

Pop Top of Stack


Syntax Characteristics
Parallel Enable Bit Yes

No. [4]

Syntax ACx = dbl(pop())

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx

0101 000E xxDD x011

This instruction moves the content of the 16-bit data memory location pointed by SP to the accumulator high part ACx(3116) and moves the content of the 16-bit data memory location pointed by SP + 1 to the accumulator low part ACx(150). The guard bits of the accumulator, ACx(3932), are reloaded (unchanged) with the current value and are not modified by this instruction. SP is incremented by 2.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax AC1 = dbl(pop())

This instruction can be repeated.

Description The content of the memory location pointed by the data stack pointer (SP) is copied to AC1(3116) and the content of the memory location pointed by SP + 1 is copied to AC1(150). Bits 3932 of AC1 are unchanged. The SP is incremented by 2.
After 03 3800 FC00 0304 5644 F800 AC1 SP 304 305 03 5644 F800 0306 5644 F800

Before AC1 SP 304 305

SPRU375G

Instruction Set Descriptions

5-335

Pop Top of Stack (pop)

Pop Top of Stack


Syntax Characteristics
Parallel Enable Bit No

No. [5]

Syntax Smem = pop()

Size 2

Cycles 1

Pipeline X

Opcode Operands Description Smem

1011 1011 AAAA AAAI

This instruction moves the content of the 16-bit data memory location pointed by SP to data memory (Smem) location. SP is incremented by 1. Affected by Affects none none

Status Bits

Repeat Example
Syntax *AR1 = pop()

This instruction can be repeated.

Description The content of the memory location pointed by the data stack pointer (SP) is copied to the location addressed by AR1. The SP is incremented by 1.
After 0200 0300 3400 6903 AR1 SP 200 300 0200 0301 6903 6903

Before AR1 SP 200 300

5-336

Instruction Set Descriptions

SPRU375G

Pop Top of Stack (pop)

Pop Top of Stack


Syntax Characteristics
Parallel Enable Bit No

No. [6]

Syntax dbl(Lmem) = pop()

Size 2

Cycles 1

Pipeline X

Opcode Operands Description Lmem

1011 1000 AAAA AAAI

This instruction moves the content of the 16-bit data memory location pointed by SP to the 16 highest bits of data memory location Lmem and moves the content of the 16-bit data memory location pointed by SP + 1 to the 16 lowest bits of data memory location Lmem. When Lmem is at an even address, the two 16-bit values popped from the stack are stored at memory location Lmem in the same order. When Lmem is at an odd address, the two 16-bit values popped from the stack are stored at memory location Lmem in the reverse order. SP is incremented by 2.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax dbl(*AR3) = pop()

This instruction can be repeated.

Description The content of the memory location pointed by the data stack pointer (SP) is copied to the 16 highest bits of the location addressed by AR3 and the content of the memory location pointed by SP + 1 is copied to the 16 lowest bits of the location addressed by AR3. Because this instruction is a long-operand instruction, AR3 is decremented by 2 after the execution. The SP is incremented by 2.

SPRU375G

Instruction Set Descriptions

5-337

Push Accumulator or Extended Auxiliary Register Content to Stack Pointers (pshboth)

Push Accumulator or Extended Auxiliary Register Content to Stack Pointers


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax pshboth(xsrc)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description xsrc

0101 000E XSSS 0101

This instruction moves the lower 32 bits of ACx or the content of the 23-bit source register (XARx, XSP, XSSP, XDP, or XCDP) to the two 16-bit memory locations addressed by the data stack pointer (SP) and system stack pointer (SSP). The content of xsrc(150) is moved to the location addressed by SP and the content of xsrc(3116) is moved to the location addressed by SSP. When xsrc is a 23-bit register, the upper 9 bits of the location addressed by SSP are filled with 0.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers - Pop Top of Stack - Push to Top of Stack

5-338

Instruction Set Descriptions

SPRU375G

Push to Top of Stack (push)

Push to Top of Stack


Syntax Characteristics
Parallel Enable Bit Yes Yes No Yes No No

No. [1] [2] [3] [4] [5] [6]

Syntax push(src1, src2) push(src) push(src, Smem) dbl(push(ACx)) push(Smem) push(dbl(Lmem))

Size 2 2 3 2 2 2

Cycles 1 1 1 1 1 1

Pipeline X X X X X X

Description

These instructions move one or two operands to the data memory location addressed by the data stack pointer (SP). The operands may be:
- an accumulator, auxiliary, or temporary register - a data memory location

The decrement operation performed on SP is done by the A-unit address generator dedicated to the stack addressing management. Status Bits Affected by Affects See Also none none

See the following other related instructions:


- Pop Top of Stack - Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers - Push Accumulator or Extended Auxiliary Register Content to Stack Pointers

SPRU375G

Instruction Set Descriptions

5-339

Push to Top of Stack (push)

Push to Top of Stack


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax push(src1, src2)

Size 2

Cycles 1

Pipeline X

Opcode
Note: FSSS = src1, FDDD = src2

0011 100E FSSS FDDD src1, src2 This instruction decrements SP by 2, then moves the content of the source register src1 to the 16-bit data memory location pointed by SP and moves the content of the source register src2 to the 16-bit data memory location pointed by SP + 1. When the source register, src1 or src2, is an accumulator, the source accumulator low part, ACx(150), is moved to the 16-bit data memory operand.

Operands Description

Status Bits

Affected by Affects

none none

Repeat Example
Syntax push(AR0, AC1)

This instruction can be repeated.

Description The data stack pointer (SP) is decremented by 2. The content of AR0 is copied to the memory location pointed by SP and the content of AC1(150) is copied to the memory location pointed by SP + 1.
After 0300 03 5644 F800 0300 0000 0000 5890 AR0 AC1 SP 2FE 2FF 300 0300 03 5644 F800 02FE 0300 F800 5890

Before AR0 AC1 SP 2FE 2FF 300

5-340

Instruction Set Descriptions

SPRU375G

Push to Top of Stack (push)

Push to Top of Stack


Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax push(src)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description src

0101 000E FSSS x110

This instruction decrements SP by 1, then moves the content of the source register (src) to the 16-bit data memory location pointed by SP. When the source register is an accumulator, the source accumulator low part, ACx(150), is moved to the 16-bit data memory operand. Affected by Affects none none

Status Bits

Repeat Example
Syntax push(AC0)

This instruction can be repeated.

Description The data stack pointer (SP) is decremented by 1. The content of AC0(150) is copied to the memory location pointed by SP.

SPRU375G

Instruction Set Descriptions

5-341

Push to Top of Stack (push)

Push to Top of Stack


Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax push(src, Smem)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Smem, src

1110 0100 AAAA AAAI FSSS x0xx

This instruction decrements SP by 2, then moves the content of the source register (src) to the 16-bit data memory location pointed by SP and moves the content of the data memory (Smem) location to the 16-bit data memory location pointed by SP + 1. When the source register is an accumulator, the source accumulator low part, ACx(150), is moved to the 16-bit data memory operand.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax push(AC0, *AR3)

This instruction can be repeated.

Description The data stack pointer (SP) is decremented by 2. The content of AC0(150) is copied to the memory location pointed by SP and the content addressed by AR3 is copied to the memory location pointed by SP + 1.

5-342

Instruction Set Descriptions

SPRU375G

Push to Top of Stack (push)

Push to Top of Stack


Syntax Characteristics
Parallel Enable Bit Yes

No. [4]

Syntax dbl(push(ACx))

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx

0101 000E xxSS x111

This instruction decrements SP by 2, then moves the content of the accumulator high part ACx(3116) to the 16-bit data memory location pointed by SP and moves the content of the accumulator low part ACx(150) to the 16-bit data memory location pointed by SP + 1. Affected by Affects none none

Status Bits

Repeat Example
Syntax dbl(push(AC0))

This instruction can be repeated.

Description The data stack pointer (SP) is decremented by 2. The content of AC0(3116) is copied to the memory location pointed by SP and the content of AC0(150) is copied to the memory location pointed by SP + 1.

SPRU375G

Instruction Set Descriptions

5-343

Push to Top of Stack (push)

Push to Top of Stack


Syntax Characteristics
Parallel Enable Bit No

No. [5]

Syntax push(Smem)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description Smem

1011 0101 AAAA AAAI

This instruction decrements SP by 1, then moves the content of the data memory (Smem) location to the 16-bit data memory location pointed by SP. Affected by Affects none none

Status Bits

Repeat Example
Syntax push(*AR1)

This instruction can be repeated.

Description The data stack pointer (SP) decremented by 1. The content addressed by AR1 is copied to the memory location pointed by SP.
After 6903 0305 0000 0300 *AR1 SP 304 305 6903 0304 6903 0300

Before *AR1 SP 304 305

5-344

Instruction Set Descriptions

SPRU375G

Push to Top of Stack (push)

Push to Top of Stack


Syntax Characteristics
Parallel Enable Bit No

No. [6]

Syntax push(dbl(Lmem))

Size 2

Cycles 1

Pipeline X

Opcode Operands Description Lmem

1011 0111 AAAA AAAI

This instruction decrements SP by 2, then moves the 16 highest bits of data memory location Lmem to the 16-bit data memory location pointed by SP and moves the 16 lowest bits of data memory location Lmem to the 16-bit data memory location pointed by SP + 1. When Lmem is at an even address, the two 16-bit values pushed onto the stack are stored at memory location Lmem in the same order. When Lmem is at an odd address, the two 16-bit values pushed onto the stack are stored at memory location Lmem in the reverse order.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax push(dbl(*AR3))

This instruction can be repeated.

Description The data stack pointer (SP) is decremented by 2. The 16 highest bits of the content at the location addressed by AR3 are copied to the memory location pointed by SP and the 16 lowest bits of the content at the location addressed by AR3 are copied to the memory location pointed by SP + 1. Because this instruction is a long-operand instruction, AR3 is decremented by 2 after the execution.

SPRU375G

Instruction Set Descriptions

5-345

Repeat Block of Instructions Unconditionally

Repeat Block of Instructions Unconditionally


Syntax Characteristics
No. [1] [2] Syntax localrepeat{} blockrepeat{} Parallel Enable Bit Yes Yes Size 2 3 Cycles 1 1 Pipeline AD AD

Description

These instructions repeat a block of instructions the number of times specified by:
- the content of BRC0 + 1, if no loop has already been detected. - the content of BRS1 + 1, if one level of the loop has already been detected.

Loop structures defined by these instructions must have the following characteristics:
- The minimum number of instructions executed within one loop iteration is 2. - The minimum number of cycles executed within one loop iteration is 2. - The maximum loop size is 64K bytes. - The block-repeat counter registers (BRCx) must be read 3 full cycles

before the end of the loops in order to extract the correct loop iteration number from these registers without any pipeline stall.
- The block-repeat operation can only be cleared by branching to a

destination address outside the active block-repeat loop.


- C54CM bit in ST1_55 cannot be modified within a block-repeat loop.

These instructions cannot be repeated. See section 1.5 for a list of instructions that cannot be used in a repeat block mechanism. Status Bits Affected by Affects See Also none none

See the following other related instructions:


- Repeat Single Instruction Conditionally - Repeat Single Instruction Unconditionally - Repeat Single Instruction Unconditionally and Decrement CSR - Repeat Single Instruction Unconditionally and Increment CSR

5-346

Instruction Set Descriptions

SPRU375G

Repeat Block of Instructions Unconditionally (localrepeat)

Repeat Block of Instructions Unconditionally


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax localrepeat{}

Size 2

Cycles 1

Pipeline AD

Opcode Operands Description none

0100 101E 1lll llll

This instruction repeats a block of instructions the number of times specified by:
- the content of BRC0 + 1, if no loop has already been detected. In this case: J J

In the address phase of the pipeline, RSA0 is loaded with the program address of the first instruction of the loop. The program address of the last instruction of the loop (that may be two parallel instructions) is computed in the address phase of the pipeline and stored in REA0. BRC0 is decremented at the address phase of the last instruction of the loop when its content is not equal to 0. BRC0 contains 0 after the block-repeat operation has ended.

J J

- the content of BRS1 + 1, if one level of the loop has already been detected.

In this case:
J J J

BRC1 is loaded with the content of BRS1 in the address phase of the repeat block instruction. In the address phase of the pipeline, RSA1 is loaded with the program address of the first instruction of the loop. The program address of the last instruction of the loop (that may be two parallel instructions) is computed in the address phase of the pipeline and stored in REA1. BRC1 is decremented at the address phase of the last instruction of the loop when its content is not equal to 0. BRC1 contains 0 after the block-repeat operation has ended. BRS1 content is not impacted by the block-repeat operation.
Instruction Set Descriptions 5-347

J J J SPRU375G

Repeat Block of Instructions Unconditionally (localrepeat)

Loop structures defined by this instruction must have the following characteristics:
- The minimum number of instructions executed within one loop iteration is 2. - The minimum number of cycles executed within one loop iteration is 2. - The maximum loop size is 64K bytes. - The block-repeat operation can only be cleared by branching to a

destination address outside the active block-repeat loop.


- The block-repeat counter registers (BRCx) must be read 3 full cycles

before the end of the loops in order to extract the correct loop iteration number from these registers without any pipeline stall.
- C54CM bit in ST1_55 cannot be modified within a block-repeat loop. - The following instructions cannot be used as the last instruction in the loop

structure:
while (cond && (RPTC < k8)) repeat if (cond) execute(AD_Unit) if (cond) execute(D_Unit) repeat(k8) repeat(k16) repeat(CSR) repeat(CSR), CSR += k4 repeat(CSR), CSR += TAx repeat(CSR), CSR = k4

A local loop is defined as when all the code of the loop is repeatedly executed from within the instruction buffer queue (IBQ):
- All the code of the local loop must fit within the 64-byte, 4-byte-aligned IBQ;

therefore, local repeat blocks are limited to 64 bytes minus the 0 to 3 bytes of first-instruction misalignment. The 64th byte of the IBQ can only occur in a paralleled instruction. See Figure 52 for legal uses of the localrepeat instruction.
- The following instructions cannot be used as the last instruction in the local

loop:
while (cond && (RPTC < k8)) repeat if (cond) execute(AD_Unit) if (cond) execute(D_Unit) repeat(k8) repeat(k16) repeat(CSR) repeat(CSR), CSR += k4 repeat(CSR), CSR += TAx repeat(CSR), CSR = k4

- Nested local repeat block instructions are allowed. - See section 1.5 for a list of instructions that cannot be used in the local loop

code.
5-348 Instruction Set Descriptions SPRU375G

Repeat Block of Instructions Unconditionally (localrepeat)

- The only branch instructions allowed in a localrepeat structure are the

branch instructions with a target branch address pointing to an instruction included within the loop code and being at a higher address than the branching instruction. In this case, the branch conditionally instruction is executed in 3 cycles and the condition is evaluated in the address phase of the pipeline (there is a 3-cycle latency on the condition setting). Compatibility with C54x devices (C54CM = 1) When C54CM =1:
- This instruction only uses block-repeat level 0; block-repeat level 1 is

disabled.
- The block-repeat active flag (BRAF) is set to 1. BRAF is cleared to 0 at the

end of the block-repeat operation when BRC0 contains 0.


- You can stop an active block-repeat operation by clearing BRAF to 0. - Block-repeat control registers for level 1 are not used. Nested

block-repeat operations are supported using the C54x convention with context save/restore and BRAF. The control-flow context register (CFCT) values are not used.
- BRAF is automatically cleared to 0 when a far branch (FB) or far call

(FCALL) instruction is executed. Status Bits Affected by Affects Repeat Example


Syntax localrepeat Description A block of instructions is repeated as defined by the content of BRC0 + 1. Address BRC0 = #3 localrepeat { } *?: Unchanged **DTZ: Decrease till zero 004003 004005 00400D BRC0 0003 ?* ? DTZ** 0000 RSA0 0000 4005 ? ? 4005 REA0 0000 400D ? ? 400D BRS1 0000 ? ? ? 0000

none none

This instruction cannot be repeated.

SPRU375G

Instruction Set Descriptions

5-349

Repeat Block of Instructions Unconditionally (localrepeat)

Figure 52. Legal Uses of Repeat Block of Instructions Unconditionally (localrepeat) Instruction
(a) 60-Byte Unaligned LoopLegal Use localrepeat { 1st instruction Last instruction }
next instruction

; no alignment directive

} 60-byte loop body

The entire localrepeat block and the next instruction reside in the IBQ, this code is accepted by the assembler.

(b) 61-Byte Unaligned Loop with Single Instruction at End of LoopIllegal Use localrepeat { 1st instruction Last instruction (nonparalleled = single) }
next instruction

; no alignment directive

} 61-byte loop body

The localrepeat instruction is not aligned; the next instruction may not be fetched in the IBQ. Because the last instruction of the localrepeat block is a nonparalleled (single) instruction, the CPU must confirm that the next instruction does not have a parallel enable bit; therefore, this code is rejected by the assembler.

5-350

Instruction Set Descriptions

SPRU375G

Repeat Block of Instructions Unconditionally (localrepeat)

Figure 52. Legal Uses of Repeat Block of Instructions Unconditionally (localrepeat) Instruction (Continued)
(c) 61-Byte Unaligned Loop with Paralleled Instruction at End of LoopLegal Use localrepeat { 1st instruction Last instruction (paralleled) }
next instruction

; no alignment directive

} 61-byte loop body

The localrepeat instruction is not aligned; the next instruction may not be fetched in the IBQ. Because the last instruction of the localrepeat block is a paralleled instruction, the CPU does not need to confirm that the next instruction does not have a parallel enable bit; therefore, this code is accepted by the assembler.

(d) 61-Byte Aligned Loop with Single Instruction at End of LoopLegal Use
align 4

; alignment directive 1st instruction

localrepeat { Last instruction (nonparalleled = single) }


next instruction

} 61-byte loop body

The localrepeat instruction is aligned, so the entire localrepeat block and the next instruction reside in the IBQ. Because the next instruction is in the IBQ, the CPU can confirm that the next instruction does not have a parallel enable bit; therefore, this code is accepted by the assembler.

SPRU375G

Instruction Set Descriptions

5-351

Repeat Block of Instructions Unconditionally (localrepeat)

Figure 52. Legal Uses of Repeat Block of Instructions Unconditionally (localrepeat) Instruction (Continued)
(e) 62-Byte Unaligned LoopIllegal Use localrepeat { 1st instruction Last instruction }
next instruction

; no alignment directive

} 62-byte loop body

The localrepeat instruction is not aligned; the entire localrepeat block may not reside in the IBQ. Because the last instruction of the localrepeat block may not reside in the IBQ, this code is rejected by the assembler.

(f) 62-Byte Aligned Loop with Single Instruction at End of LoopLegal Use
align 4 nop_16||nop

; alignment directive ; 3-byte instruction 1st instruction

localrepeat { Last instruction (nonparalleled = single) }


next instruction

} 62-byte loop body

The nop instructions are aligned so the localrepeat instruction, the entire localrepeat block, and the next instruction reside in the IBQ. Because the next instruction is in the IBQ, the CPU can confirm that the next instruction does not have a parallel enable bit; therefore, this code is accepted by the assembler.

5-352

Instruction Set Descriptions

SPRU375G

Repeat Block of Instructions Unconditionally (localrepeat)

Figure 52. Legal Uses of Repeat Block of Instructions Unconditionally (localrepeat) Instruction (Continued)
(g) 64-Byte Aligned Loop with Paralleled Instruction at End of LoopLegal Use
align 4 nop_16

; alignment directive ; 2-byte instruction 1st instruction

localrepeat { Last instruction (paralleled) }


next instruction

} 64-byte loop body

The nop instruction is aligned, so the localrepeat instruction and the entire localrepeat block reside in the IBQ; the next instruction is not fetched in the IBQ. Because the last instruction of the localrepeat block is a paralleled instruction, the CPU does not need to confirm that the next instruction does not have a parallel enable bit; therefore, this code is accepted by the assembler.

SPRU375G

Instruction Set Descriptions

5-353

Repeat Block of Instructions Unconditionally (blockrepeat)

Repeat Block of Instructions Unconditionally


Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax blockrepeat{}

Size 3

Cycles 1

Pipeline AD

Opcode Operands Description none

0000 111E llll llll llll llll

This instruction repeats a block of instructions the number of times specified by:
- the content of BRC0 + 1, if no loop has already been detected. In this case: J J

In the address phase of the pipeline, RSA0 is loaded with the program address of the first instruction of the loop. The program address of the last instruction of the loop (that may be two parallel instructions) is computed in the address phase of the pipeline and stored in REA0. BRC0 is decremented at the address phase of the last instruction of the loop when its content is not equal to 0. BRC0 contains 0 after the block-repeat operation has ended.

J J

- the content of BRS1 + 1, if one level of the loop has already been detected.

In this case:
J J J

BRC1 is loaded with the content of BRS1 in the address phase of the repeat block instruction. In the address phase of the pipeline, RSA1 is loaded with the program address of the first instruction of the loop. The program address of the last instruction of the loop (that may be two parallel instructions) is computed in the address phase of the pipeline and stored in REA1. BRC1 is decremented at the address phase of the last instruction of the loop when its content is not equal to 0. BRC1 contains 0 after the block-repeat operation has ended. BRS1 content is not impacted by the block-repeat operation.
SPRU375G

J J J 5-354

Instruction Set Descriptions

Repeat Block of Instructions Unconditionally (blockrepeat)

Loop structures defined by these instructions must have the following characteristics:
- The minimum number of instructions executed within one loop iteration is 2. - The minimum number of cycles executed within one loop iteration is 2. - The maximum loop size is 64K bytes. - The block-repeat operation can only be cleared by branching to a

destination address outside the active block-repeat loop.


- The block-repeat counter registers (BRCx) must be read 3 full cycles

before the end of the loops in order to extract the correct loop iteration number from these registers without any pipeline stall.
- C54CM bit in ST1_55 cannot be modified within a block-repeat loop. - The following instructions cannot be used as the last instruction in the loop

structure:
while (cond && (RPTC < k8)) repeat if (cond) execute(AD_Unit) if (cond) execute(D_Unit) repeat(k8) repeat(k16) repeat(CSR) repeat(CSR), CSR += k4 repeat(CSR), CSR += TAx repeat(CSR), CSR = k4

- See section 1.5 for a list of instructions that cannot be used in the

block-repeat loop code. Compatibility with C54x devices (C54CM = 1) When C54CM =1:
- This instruction only uses block-repeat level 0; block-repeat level 1 is

disabled.
- The block-repeat active flag (BRAF) is set to 1. BRAF is cleared to 0 at the

end of the block-repeat operation when BRC0 contains 0.


- You can stop an active block-repeat operation by clearing BRAF to 0. - Block-repeat control registers for level 1 are not used. Nested

block-repeat operations are supported using the C54x convention with context save/restore and BRAF. The control-flow context register (CFCT) values are not used.
- BRAF is automatically cleared to 0 when a far branch (FB) or far call

(FCALL) instruction is executed.


SPRU375G Instruction Set Descriptions 5-355

Repeat Block of Instructions Unconditionally (blockrepeat)

Status Bits

Affected by Affects

none none

Repeat Example
Syntax blockrepeat

This instruction cannot be repeated.

Description A block of instructions is repeated as defined by the content of BRC0 + 1. A second loop of instructions is repeated as defined by the content of BRS1 + 1 (BRC1 is loaded with the content of BRS1). Address BRC0 0003 ?* 004006 004009 00400B 00400D 004015 } 004017 ? ? ? ? ? DTZ** 0000 RSA0 0000 ? 4009 ? ? ? ? ? 4009 REA0 0000 ? 4017 ? ? ? ? ? 4017 BRS1 0000 0001 ? ? ? ? ? ? 0001 BRC1 0000 0001 ? ? (BRS1) ? DTZ** ? 0000 RSA1 0000 ? ? ? 400D ? ? ? 400D REA1 0000 ? ? ? 4015 ? ? ? 4015

BRC0 = #3 BRC1 = #1 blockrepeat { localrepeat {

} *?: Unchanged **DTZ: Decrease till zero

5-356

Instruction Set Descriptions

SPRU375G

Repeat Single Instruction Conditionally (while/repeat)

Repeat Single Instruction Conditionally


Syntax Characteristics
No. [1] Syntax while (cond && (RPTC < k8)) repeat Parallel Enable Bit Yes Size 3 Cycles 1 Pipeline AD

Opcode Operands Description cond, k8

0000 000E xCCC CCCC kkkk kkkk

This instruction evaluates a single condition defined by the cond field and as long as the condition is true, the next instruction or the next two paralleled instructions is repeated the number of times specified by an 8-bit immediate value, k8 + 1. The maximum number of executions of a given instruction or paralleled instructions is 28 1 (255). See Table 13 for a list of conditions. The 8 LSBs of the repeat counter register (RPTC):
- Are loaded with the immediate value at the address phase of the pipeline. - Are decremented by 1 in the decode phase of the repeated instruction.

The 8 MSBs of RPTC:


- Are loaded with the cond code at the address phase of the pipeline. - Are untouched during the while/repeat structure execution.

At each step of the iteration, the condition defined by the cond field is tested in the execute phase of the pipeline. When the condition becomes false, the instruction repetition stops.
- If the condition becomes false at any execution of the repeated instruction,

the 8 LSBs of RPTC are corrected to indicate exactly how many iterations were not performed.
- Since the condition is evaluated in the execute phase of the repeated

instruction, when the condition is tested false, some of the succeeding iterations of that repeated instruction may have gone through the address, access, and read phases of the pipeline. Therefore, they may have modified the pointer registers used in the DAGEN units to generate data memory operands addresses in the address phase. When the while/repeat structure is exited, reading the computed single-repeat register (CSR) content enables you to determine how many instructions have gone through the address phase of the pipeline. You may then use the Repeat Single Instruction Unconditionally instruction [3] to rewind the pointer registers. Note that this must only be performed when a false condition has been met inside the while/repeat structure.
SPRU375G Instruction Set Descriptions 5-357

Repeat Single Instruction Conditionally (while/repeat)

- The following table provides the 8 LSBs of RPTC and CSR once the

while/repeat structure is exited.


If the condition is met At 1st iteration At 2nd iteration At 3rd iteration At RPTCinit 2 iteration At RPTCinit 1 iteration At RPTCinit iteration At RPTCinit + 1 iteration Never RPTC[7:0] content after exiting loop RPTCinit + 1 RPTCinit RPTC 1 4 3 2 1 0 CSR content after exiting loop 4 4 4 3 2 1 0 0

RPTCinit is the number of requested iterations minus 1.

The repeat single mechanism triggered by this instruction is interruptible. Saving and restoring the RPTC content in ISRs enables you to preserve the while/repeat structure context. When the while/repeat structure contains any form of a store-to-memory instruction, the store-to-memory instruction is only disabled one cycle after the condition is evaluated to be false. Therefore, the store-to-memory instruction is executed once more than other processing instructions updating CPU registers. This enables you to store the last values obtained in these registers when the condition was met. Instead of programming a number of iterations (minus 1) equal to 0, it is recommended that you use the conditional execute() structure. This instruction cannot be used as the last instruction in a repeat loop structure. See section 1.5 for a list of instructions that cannot be used in a repeat single mechanism. Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1. Status Bits Affected by Affects Repeat
5-358

ACOVx, CARRY, C54CM, M40, TCx ACOVx

This instruction cannot be repeated.


Instruction Set Descriptions SPRU375G

Repeat Single Instruction Conditionally (while/repeat)

See Also

See the following other related instructions:


- Repeat Block of Instructions Unconditionally - Repeat Single Instruction Unconditionally - Repeat Single Instruction Unconditionally and Decrement CSR - Repeat Single Instruction Unconditionally and Increment CSR

Example
Syntax while (AC1 > #0 && (RPTC < #7)) repeat Description As long as the content of AC1 is greater than 0 and the repeat counter is not equal to 0, the next single instruction is repeated as defined by the unsigned 8-bit value (7) + 1. At the address phase of the pipeline, RPTC is automatically initialized to 4107h and then is immediately decreased to 4106h. address: 004004 004008 00400B
After 00 2359 0340 0340 AC1 T0 00 1FC2 7B40 0340

while (AC1 > #0 && (RPTC < #7)) repeat AC1 = AC1 (T0 * *AR1)
Before AC1 T0 *AR1

2354 *AR1 2354 RPTC 4106 RPTC 0000 At the address phase of the pipeline, RPTC is automatically initialized to 4107h and then is immediately decreased to 4106h.

SPRU375G

Instruction Set Descriptions

5-359

Repeat Single Instruction Unconditionally (repeat)

Repeat Single Instruction Unconditionally


Syntax Characteristics
Parallel Enable Bit Yes Yes Yes

No. [1] [2] [3]

Syntax repeat(k8) repeat(k16) repeat(CSR)

Size 2 3 2

Cycles 1 1 1

Pipeline AD AD AD

Description

This instruction repeats the next instruction or the next two paralleled instructions the number of times specified by the content of the computed single repeat register (CSR) + 1 or an immediate value, kx + 1. This value is loaded into the repeat counter register (RPTC). The maximum number of executions of a given instruction or paralleled instructions is 216 1 (65535). The repeat single mechanism triggered by these instructions is interruptible. These instructions cannot be repeated. These instructions cannot be used as the last instruction in a repeat loop structure. Two paralleled instructions can be repeated when following the parallelism general rules. See section 1.5 for a list of instructions that cannot be used in a repeat single mechanism.

Status Bits

Affected by Affects

none none

See Also

See the following other related instructions:


- Repeat Block of Instructions Unconditionally - Repeat Single Instruction Conditionally - Repeat Single Instruction Unconditionally and Decrement CSR - Repeat Single Instruction Unconditionally and Increment CSR

5-360

Instruction Set Descriptions

SPRU375G

Repeat Single Instruction Unconditionally (repeat)

Repeat Single Instruction Unconditionally


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax repeat(k8) repeat(k16)

Size 2 3

Cycles 1 1

Pipeline AD AD

Opcode

k8 k16

0100 110E kkkk kkkk 0000 110E kkkk kkkk kkkk kkkk

Operands Description

kx This instruction repeats the next instruction or the next two paralleled instructions the number of times specified by an immediate value, kx + 1. The repeat counter register (RPTC):
- Is loaded with the immediate value in the address phase of the pipeline. - Is decremented by 1 in the decode phase of the repeated instruction. - Contains 0 at the end of the repeat single mechanism. - Must not be accessed when it is being decremented in the repeat single

mechanism. The repeat single mechanism triggered by this instruction is interruptible. Two paralleled instructions can be repeated when following the parallelism general rules. This instruction cannot be used as the last instruction in a repeat loop structure. See section 1.5 for a list of instructions that cannot be used in a repeat single mechanism. Status Bits Affected by Affects Repeat
SPRU375G

none none

This instruction cannot be repeated.


Instruction Set Descriptions 5-361

Repeat Single Instruction Unconditionally (repeat)

Example 1
Syntax repeat(#3) AC1 = AC1 + *AR3+ * *AR4+
Before AC1 AR3 AR4 200 201 202 203 400 401 402 403 00 0000 0000 0200 0400 AC03 3468 FE00 23DC D768 6987 3400 7900 After AC1 AR3 AR4 200 201 202 203 400 401 402 403 00 3376 AD10 0204 0404 AC03 3468 FE00 23DC D768 6987 3400 7900

Description The single instruction following the repeat instruction is repeated four times.

Example 2
Syntax repeat(#513) Description A single instruction is repeated as defined by the unsigned 16-bit value + 1 (513 + 1).

5-362

Instruction Set Descriptions

SPRU375G

Repeat Single Instruction Unconditionally (repeat)

Repeat Single Instruction Unconditionally


Syntax Characteristics
Parallel Enable Bit Yes

No. [3]

Syntax repeat(CSR)

Size 2

Cycles 1

Pipeline AD

Opcode Operands Description none

0100 100E xxxx x000

This instruction repeats the next instruction or the next two paralleled instructions the number of times specified by the content of the computed single repeat register (CSR) + 1. The repeat counter register (RPTC):
- Is loaded with CSR content in the address phase of the pipeline. - Is decremented by 1 in the decode phase of the repeated instruction. - Contains 0 at the end of the repeat single mechanism. - Must not be accessed when it is being decremented in the repeat single

mechanism. The repeat single mechanism triggered by this instruction is interruptible. Two paralleled instructions can be repeated when following the parallelism general rules. This instruction cannot be used as the last instruction in a repeat loop structure. See section 1.5 for a list of instructions that cannot be used in a repeat single mechanism. Status Bits Affected by Affects Repeat none none

This instruction cannot be repeated.

SPRU375G

Instruction Set Descriptions

5-363

Repeat Single Instruction Unconditionally (repeat)

Example
Syntax repeat(CSR) AC1 = AC1 + *AR3+ * *AR4+
Before AC1 CSR AR3 AR4 200 201 202 203 400 401 402 403 00 0000 0000 0003 0200 0400 AC03 3468 FE00 23DC D768 6987 3400 7900

Description The single instruction following the repeat instruction is repeated as defined by the content of CSR + 1.
After AC1 CSR AR3 AR4 200 201 202 203 400 401 402 403 00 3376 AD10 0003 0204 0404 AC03 3468 FE00 23DC D768 6987 3400 7900

5-364

Instruction Set Descriptions

SPRU375G

Repeat Single Instruction Unconditionally and Decrement CSR (repeat)

Repeat Single Instruction Unconditionally and Decrement CSR


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax repeat(CSR), CSR = k4

Size 2

Cycles 1

Pipeline X

Opcode Operands Description k4

0100 100E kkkk x011

This instruction repeats the next instruction or the next two paralleled instructions the number of times specified by the content of the computed single repeat register (CSR) + 1. The repeat counter register (RPTC):
- Is loaded with CSR content in the address phase of the pipeline. - Is decremented by 1 in the decode phase of the repeated instruction. - Contains 0 at the end of the repeat single mechanism. - Must not be accessed when it is being decremented in the repeat single

mechanism. With the A-unit ALU, this instruction allows the content of CSR to be decremented by k4. The CSR modification is performed in the execute phase of the pipeline; there is a 3-cycle latency between the CSR modification and its usage in the address phase. The repeat single mechanism triggered by this instruction is interruptible. Two paralleled instructions can be repeated when following the parallelism general rules. This instruction cannot be used as the last instruction in a repeat loop structure. See section 1.5 for a list of instructions that cannot be used in a repeat single mechanism. Status Bits Affected by Affects Repeat
SPRU375G

none none

This instruction cannot be repeated.


Instruction Set Descriptions 5-365

Repeat Single Instruction Unconditionally and Decrement CSR (repeat)

See Also

See the following other related instructions:


- Repeat Block of Instructions Unconditionally - Repeat Single Instruction Conditionally - Repeat Single Instruction Unconditionally - Repeat Single Instruction Unconditionally and Increment CSR

Example
Syntax repeat(CSR), CSR = #2 Description A single instruction is repeated as defined by the content of CSR + 1. The content of CSR is decremented by the unsigned 4-bit value (2).

5-366

Instruction Set Descriptions

SPRU375G

Repeat Single Instruction Unconditionally and Increment CSR (repeat)

Repeat Single Instruction Unconditionally and Increment CSR


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax repeat(CSR), CSR += TAx repeat(CSR), CSR = k4

Size 2 2

Cycles 1 1

Pipeline X X

Description

These instructions repeat the next instruction or the next two paralleled instructions the number of times specified by the content of the computed single repeat register (CSR) + 1. This value is loaded into the repeat counter register (RPTC). The maximum number of executions of a given instruction or paralleled instructions is 216 1 (65535). With the A-unit ALU, these instructions allow the content of CSR to be incremented. The CSR modification is performed in the execute phase of the pipeline; there is a 3-cycle latency between the CSR modification and its usage in the address phase. The repeat single mechanism triggered by these instructions is interruptible. Two paralleled instructions can be repeated when following the parallelism general rules. These instructions cannot be repeated. These instructions cannot be used as the last instruction in a repeat loop structure. See section 1.5 for a list of instructions that cannot be used in a repeat single mechanism.

Status Bits

Affected by Affects

none none

See Also

See the following other related instructions:


- Repeat Block of Instructions Unconditionally - Repeat Single Instruction Conditionally - Repeat Single Instruction Unconditionally - Repeat Single Instruction Unconditionally and Decrement CSR

SPRU375G

Instruction Set Descriptions

5-367

Repeat Single Instruction Unconditionally and Increment CSR (repeat)

Repeat Single Instruction Unconditionally and Increment CSR


Syntax Characteristics
No. [1] Syntax repeat(CSR), CSR += TAx Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description TAx

0100 100E FSSS x001

This instruction repeats the next instruction or the next two paralleled instructions the number of times specified by the content of the computed single repeat register (CSR) + 1. The repeat counter register (RPTC):
- Is loaded with CSR content in the address phase of the pipeline. - Is decremented by 1 in the decode phase of the repeated instruction. - Contains 0 at the end of the repeat single mechanism. - Must not be accessed when it is being decremented in the repeat single

mechanism. With the A-unit ALU, this instruction allows the content of CSR to be incremented by the content of TAx. The CSR modification is performed in the execute phase of the pipeline; there is a 3-cycle latency between the CSR modification and its usage in the address phase. The repeat single mechanism triggered by this instruction is interruptible. Two paralleled instructions can be repeated when following the parallelism general rules. This instruction cannot be used as the last instruction in a repeat loop structure. See section 1.5 for a list of instructions that cannot be used in a repeat single mechanism. Status Bits Affected by Affects Repeat Example
Syntax repeat(CSR), CSR += T1 Description A single instruction is repeated as defined by the content of CSR + 1. The content of CSR is incremented by the content of temporary register T1.

none none

This instruction cannot be repeated.

5-368

Instruction Set Descriptions

SPRU375G

Repeat Single Instruction Unconditionally and Increment CSR (repeat)

Repeat Single Instruction Unconditionally and Increment CSR


Syntax Characteristics
No. [2] Syntax repeat(CSR), CSR += k4 Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description k4

0100 100E kkkk x010

This instruction repeats the next instruction or the next two paralleled instructions the number of times specified by the content of the computed single repeat register (CSR) + 1. The repeat counter register (RPTC):
- Is loaded with CSR content in the address phase of the pipeline. - Is decremented by 1 in the decode phase of the repeated instruction. - Contains 0 at the end of the repeat single mechanism. - Must not be accessed when it is being decremented in the repeat single

mechanism. With the A-unit ALU, this instruction allows the content of CSR to be incremented by k4. The CSR modification is performed in the execute phase of the pipeline; there is a 3-cycle latency between the CSR modification and its usage in the address phase. The repeat single mechanism triggered by this instruction is interruptible. Two paralleled instructions can be repeated when following the parallelism general rules. This instruction cannot be used as the last instruction in a repeat loop structure. See section 1.5 for a list of instructions that cannot be used in a repeat single mechanism. Status Bits Affected by Affects Repeat Example
Syntax repeat(CSR), CSR += #2 Description A single instruction is repeated as defined by the content of CSR + 1. The content of CSR is incremented by the unsigned 4-bit value (2).

none none

This instruction cannot be repeated.

SPRU375G

Instruction Set Descriptions

5-369

Return Conditionally (if return)

Return Conditionally
Syntax Characteristics
Parallel Enable Bit Yes Cycles 5/5

No. [1]

Syntax if (cond) return

Size 3

Pipeline R

x/y cycles: x cycles = condition true, y cycles = condition false

Opcode Operands Description cond

0000 001E xCCC CCCC xxxx xxxx

This instructions evaluates a single condition defined by the cond field in the read phase of the pipeline. If the condition is true, a return occurs to the return address of the calling subroutine. There is a 1-cycle latency on the condition setting. A single condition can be tested as determined by the cond field of the instruction. See Table 13 for a list of conditions. After returning from a called subroutine, the CPU restores the value of two internal registers: the program counter (PC) and a loop context register. The CPU uses these values to re-establish the context of the program sequence. In the slow-return process (default), the return address (from the PC) and the loop context bits are restored from the stacks (in memory). When the CPU returns from a subroutine, the speed at which these values are restored is dependent on the speed of the memory accesses. In the fast-return process, the return address (from the PC) and the loop context bits are restored from the return address register (RETA) and the control-flow context register (CFCT). You can read from or write to RETA and CFCT as a pair with dedicated, 32-bit load and store instructions. For fastreturn mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371). When a return from a subroutine occurs:
- The loop context bits concatenated with the 8 MSBs of the return address

are popped from the top of the system stack pointer (SSP). The SSP is incremented by 1 word in the read phase of the pipeline.
- The 16 LSBs of the return address are popped from the top of the data

stack pointer (SP). The SP is incremented by 1 word in the read phase of the pipeline.
5-370 Instruction Set Descriptions SPRU375G

Return Conditionally (if return)

System Stack (SSP) Before Return SSP = x (Loop bits):PC(2316) Previously stored data Before Return SP = y

Data Stack (SP) PC(150) Previously stored data

After SSP = x + 1 Return R t

After SP = y + 1 Return R t

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, the comparison of accumulators to 0 is performed as if M40 was set to 1. Status Bits Affected by Affects Repeat See Also ACOVx, CARRY, C54CM, M40, TCx ACOVx

This instruction cannot be repeated. See the following other related instructions:
- Call Conditionally - Call Unconditionally - Return from Interrupt - Return Unconditionally

Example
Syntax if (ACOV0 = #0) return Description The AC0 overflow bit is equal to 0, the program counter (PC) is loaded with the return address of the calling subroutine.
After 0 ACOV0 PC SP 0 (return address)

Before ACOV0 PC SP

SPRU375G

Instruction Set Descriptions

5-371

Return Unconditionally (return)

Return Unconditionally
Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax return

Size 2

Cycles 5

Pipeline D

Opcode Operands Description none

0100 100E xxxx x100

This instruction passes control back to the calling subroutine. After returning from a called subroutine, the CPU restores the value of two internal registers: the program counter (PC) and a loop context register. The CPU uses these values to re-establish the context of the program sequence. In the slow-return process (default), the return address (from the PC) and the loop context bits are restored from the stacks (in memory). When the CPU returns from a subroutine, the speed at which these values are restored is dependent on the speed of the memory accesses. In the fast-return process, the return address (from the PC) and the loop context bits are restored from the return address register (RETA) and the control-flow context register (CFCT). You can read from or write to RETA and CFCT as a pair with dedicated, 32-bit load and store instructions. For fastreturn mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371).
- The loop context bits concatenated with the 8 MSBs of the return address

are popped from the top of the system stack pointer (SSP). The SSP is incremented by 1 word in the address phase of the pipeline.
- The 16 LSBs of the return address are popped from the top of the data

stack pointer (SP). The SP is incremented by 1 word in the address phase of the pipeline.
System Stack (SSP) Before Return SSP = x (Loop bits):PC(2316) Previously stored data Before Return SP = y Data Stack (SP) PC(150) Previously stored data

After SSP = x + 1 Return R

After SP = y + 1 Return R

5-372

Instruction Set Descriptions

SPRU375G

Return Unconditionally (return)

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction cannot be repeated. See the following other related instructions:
- Call Conditionally - Call Unconditionally - Return Conditionally - Return from Interrupt

Example
Syntax return Description The program counter is loaded with the return address of the calling subroutine.

SPRU375G

Instruction Set Descriptions

5-373

Return from Interrupt (return_int)

Return from Interrupt


Syntax Characteristics
No. [1] Syntax return_int Parallel Enable Bit Yes Size 2 Cycles 5 Pipeline D

Opcode Operands Description none

0100 100E xxxx x101

This instruction passes control back to the interrupted task. After returning from an interrupt service routine (ISR), the CPU automatically restores the value of some CPU registers and two internal registers: the program counter (PC) and a loop context register. The CPU uses these values to re-establish the context of the program sequence. In the slow-return process (default), the return address (from the PC), the loop context bits, and some CPU registers are restored from the stacks (in memory). When the CPU returns from an ISR, the speed at which these values are restored is dependent on the speed of the memory accesses. In the fast-return process, the return address (from the PC) and the loop context bits are restored from the return address register (RETA) and the control-flow context register (CFCT). You can read from or write to RETA and CFCT as a pair with dedicated, 32-bit load and store instructions. Some CPU registers are restored from the stacks (in memory). For fast-return mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371).
- The loop context bits concatenated with the 8 MSBs of the return address

are popped from the top of the system stack pointer (SSP). The SSP is incremented by 1 word in the address phase of the pipeline.
- The 16 LSBs of the return address are popped from the top of the data

stack pointer (SP). The SP is incremented by 1 word in the address phase of the pipeline.
- The debug status register (DBSTAT) content is popped from the top of

SSP. The SSP is incremented by 1 word in the access phase of the pipeline.
- The status register 1 (ST1_55) content is popped from the top of SP. The

SP is incremented by 1 word in the access phase of the pipeline.


- The 7 higher bits of status register 0 (ST0_55) concatenated with 9 zeroes

are popped from the top of SSP. The SSP is incremented by 1 word in the read phase of the pipeline.
5-374 Instruction Set Descriptions SPRU375G

Return from Interrupt (return_int)

- The status register 2 (ST2_55) content is popped from the top of SP. The

SP is incremented by 1 word in the read phase of the pipeline.


System Stack (SSP) Before SSP = x Return R t SSP = x + 1 SSP = x + 2 After SSP = x + 3 Return R (Loop bits):PC(2316) DBSTAT ST0_55(159) Previously stored data Before SP = y Return R t SP = y + 1 SP = y + 2 After SP = y + 3 Return R Data Stack (SP) PC(150) ST1_55 ST2_55 Previously stored data

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction cannot be repeated. See the following other related instructions:
- Return Conditionally - Return Unconditionally - Software Interrupt - Software Trap

Example
Syntax return_int Description The program counter (PC) is loaded with the return address of the interrupted task.

SPRU375G

Instruction Set Descriptions

5-375

Rotate Left Accumulator, Auxiliary, or Temporary Register Content

Rotate Left Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
No. Syntax dst = BitOut \\ src \\ BitIn [1] [2] [3] [4] dst = TC2 \\ src \\ TC2 dst = TC2 \\ src \\ CARRY dst = CARRY \\ src \\ TC2 dst = CARRY \\ src \\ CARRY Yes Yes Yes Yes 3 3 3 3 1 1 1 1 X X X X Parallel Enable Bit Size Cycles Pipeline

Opcode Operands Description dst, src

0001 001E FSSS xx11 FDDD 0xvv

This instruction performs a bitwise rotation to the MSBs. Both TC2 and CARRY can be used to shift in one bit (BitIn) or to store the shifted out bit (BitOut). The one bit in BitIn is shifted into the source (src) operand and the shifted out bit is stored to BitOut.
- When the destination (dst) operand is an accumulator: J J J J

if an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the register are zero extended to 40 bits the operation is performed on 40 bits in the D-unit shifter BitIn is inserted at bit position 0 BitOut is extracted at a bit position according to M40

- When the destination (dst) operand is an auxiliary or temporary register: J J J J

if an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation the operation is performed on 16 bits in the A-unit ALU BitIn is inserted at bit position 0 BitOut is extracted at bit position 15

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects
5-376 Instruction Set Descriptions

CARRY, M40, TC2 CARRY, TC2


SPRU375G

Rotate Left Accumulator, Auxiliary, or Temporary Register Content

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Rotate Right Accumulator, Auxiliary, or Temporary Register Content

Example
Syntax AC1 = CARRY \\ AC1 \\ TC2 Description The value of TC2 (1) before the execution of the instruction is shifted into the LSB of AC1 and bit 31 shifted out from AC1 is stored in the CARRY status bit. The rotated value is stored in AC1. Because M40 = 0, the guard bits (3932) are cleared.
After 0F E340 5678 1 1 0 AC1 TC2 CARRY M40 00 C680 ACF1 1 1 0

Before AC1 TC2 CARRY M40

SPRU375G

Instruction Set Descriptions

5-377

Rotate Right Accumulator, Auxiliary, or Temporary Register Content

Rotate Right Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
No. Syntax dst = BitIn // src // BitOut [1] [2] [3] [4] dst = TC2 // src // TC2 dst = TC2 // src // CARRY dst = CARRY // src // TC2 dst = CARRY // src // CARRY Yes Yes Yes Yes 3 3 3 3 1 1 1 1 X X X X Parallel Enable Bit Size Cycles Pipeline

Opcode Operands Description dst, src

0001 001E FSSS xx11 FDDD 1xvv

This instruction performs a bitwise rotation to the LSBs. Both TC2 and CARRY can be used to shift in one bit (BitIn) or to store the shifted out bit (BitOut). The one bit in BitIn is shifted into the source (src) operand and the shifted out bit is stored to BitOut.
- When the destination (dst) operand is an accumulator: J J J J

if an auxiliary or temporary register is the source (src) operand of the instruction, the 16 LSBs of the register are zero extended to 40 bits the operation is performed on 40 bits in the D-unit shifter BitIn is inserted at a bit position according to M40 BitOut is extracted at bit position 0

- When the destination (dst) operand is an auxiliary or temporary register: J J J J

if an accumulator is the source (src) operand of the instruction, the 16 LSBs of the accumulator are used to perform the operation the operation is performed on 16 bits in the A-unit ALU BitIn is inserted at bit position 15 BitOut is extracted at bit position 0

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects
5-378 Instruction Set Descriptions

CARRY, M40, TC2 CARRY, TC2


SPRU375G

Rotate Right Accumulator, Auxiliary, or Temporary Register Content

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Rotate Left Accumulator, Auxiliary, or Temporary Register Content

Example
Syntax AC1 = TC2 // AC0 // TC2 Description The value of TC2 (1) before the execution of the instruction is shifted into bit 31 of AC0 and the LSB shifted out from AC0 is stored in TC2. The rotated value is stored in AC1. Because M40 = 0, the guard bits (3932) are cleared.
After 5F B000 1234 00 C680 ACF1 1 0 AC0 AC1 TC2 M40 5F B000 1234 00 D800 091A 0 0

Before AC0 AC1 TC2 M40

SPRU375G

Instruction Set Descriptions

5-379

Round Accumulator Content (rnd)

Round Accumulator Content


Syntax Characteristics
No. [1] Syntax ACy = rnd(ACx) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy

0101 010E DDSS 101%

This instruction performs a rounding of the source accumulator ACx in the D-unit ALU.
- The rounding operation depends on RDM: J J

When RDM = 0, the biased rounding to the infinite is performed. 8000h (215) is added to the 40-bit source accumulator ACx. When RDM = 1, the unbiased rounding to the nearest is performed. According to the value of the 17 LSBs of the 40-bit source accumulator ACx, 8000h (215) is added:
if( 8000h < bit(150) < 10000h) add 8000h to the 40-bit source accumulator ACx else if( bit(150) == 8000h) if( bit(16) == 1) add 8000h to the 40-bit source accumulator ACx

If a rounding has been performed, the 16 lowest bits of the result are cleared to 0.
- Addition overflow detection depends on M40. - No addition carry report is stored in CARRY status bit. - If an overflow is detected, the destination accumulator overflow status bit

(ACOVy) is set.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, the rounding is performed without clearing the LSBs of accumulator ACx.
5-380 Instruction Set Descriptions SPRU375G

Round Accumulator Content (rnd)

Status Bits

Affected by Affects

C54CM, M40, RDM, SATD ACOVy

Repeat Example
Syntax AC1 = rnd(AC0)

This instruction cannot be repeated.

Description The content of AC0 is added to 8000h, the 16 LSBs are cleared to 0, and the result is stored in AC1. M40 is cleared to 0, so overflow is detected at bit 31; SATD is cleared to 0, so AC1 is not saturated.
After EF 0FF0 8023 00 0000 0000 1 0 0 0 AC0 AC1 RDM M40 SATD ACOV1 EF 0FF0 8023 EF 0FF1 0000 1 0 0 1

Before AC0 AC1 RDM M40 SATD ACOV1

SPRU375G

Instruction Set Descriptions

5-381

Saturate Accumulator Content (saturate)

Saturate Accumulator Content


Syntax Characteristics
No. [1] Syntax ACy = saturate(rnd(ACx)) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy

0101 010E DDSS 110%

This instruction performs a saturation of the source accumulator ACx to the 32-bit width frame in the D-unit ALU.
- A rounding is performed if the optional rnd keyword is applied to the

instruction. The rounding operation depends on RDM:


J J

When RDM = 0, the biased rounding to the infinite is performed. 8000h (215) is added to the 40-bit source accumulator ACx. When RDM = 1, the unbiased rounding to the nearest is performed. According to the value of the 17 LSBs of the 40-bit source accumulator ACx, 8000h (215) is added:
if( 8000h < bit(150) < 10000h) add 8000h to the 40-bit source accumulator ACx else if( bit(150) == 8000h) if( bit(16) == 1) add 8000h to the 40-bit source accumulator ACx

If a rounding has been performed, the 16 lowest bits of the result are cleared to 0.
- An overflow is detected at bit position 31. - No addition carry report is stored in CARRY status bit. - If an overflow is detected, the destination accumulator overflow status bit

(ACOVy) is set.
- When an overflow is detected, the destination register is saturated.

Saturation values are 00 7FFF FFFFh FF 8000 0000h (negative overflow). Compatibility with C54x devices (C54CM = 1)

(positive

overflow)

or

When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, the rounding is performed without clearing the LSBs of accumulator ACx.
5-382 Instruction Set Descriptions SPRU375G

Saturate Accumulator Content (saturate)

Status Bits

Affected by Affects

C54CM, RDM ACOVy

Repeat Example 1
Syntax AC1 = saturate(AC0)

This instruction can be repeated.

Description The 32-bit width content of AC0 is saturated and the saturated value, FF 8000 0000, is stored in AC1.
After EF 0FF0 8023 00 0000 0000 0 AC0 AC1 ACOV1 EF 0FF0 8023 FF 8000 0000 1

Before AC0 AC1 ACOV1

Example 2
Syntax AC1 = saturate(rnd(AC0))
Before AC0 AC1 RDM ACOV1 00 7FFF 8000 00 0000 0000 0 0

Description The 32-bit width content of AC0 is saturated. The saturated value, 00 7FFF FFFFh, is rounded, 16 LSBs are cleared, and stored in AC1.
After AC0 AC1 RDM ACOV1 00 7FFF 8000 00 7FFF 0000 0 1

SPRU375G

Instruction Set Descriptions

5-383

Set Accumulator, Auxiliary, or Temporary Register Bit

Set Accumulator, Auxiliary, or Temporary Register Bit


Syntax Characteristics
No. [1] Syntax bit(src, Baddr) = #1 Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description Baddr, src

1110 1100 AAAA AAAI FSSS 000x

This instruction performs a bit manipulation:


- In the D-unit ALU, if the source (src) register operand is an accumulator. - In the A-unit ALU, if the source (src) register operand is an auxiliary or

temporary register. The instruction sets to 1 a single bit, as defined by the bit addressing mode, Baddr, of the source register. The generated bit address must be within:
- 039 when accessing accumulator bits (only the 6 LSBs of the generated

bit address are used to determine the bit position). If the generated bit address is not within 039, the selected register bit value does not change.
- 015 when accessing auxiliary or temporary register bits (only the 4 LSBs

of the generated address are used to determine the bit position). Status Bits Affected by Affects Repeat See Also none none

This instruction can be repeated. See the following other related instructions:
- Clear Accumulator, Auxiliary, or Temporary Register Bit - Complement Accumulator, Auxiliary, or Temporary Register Bit - Set Memory Bit - Set Status Register Bit

Example
Syntax bit(AC0, AR3) = #1 Description The bit at the position defined by the content of AR3(40) in AC0 is set to 1.

5-384

Instruction Set Descriptions

SPRU375G

Set Memory Bit

Set Memory Bit


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax bit(Smem, src) = #1

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Smem, src

1110 0011 AAAA AAAI FSSS 1100

This instruction performs a bit manipulation in the A-unit ALU. The instruction sets to 1 a single bit, as defined by the content of the source (src) operand, of a memory (Smem) location. The generated bit address must be within 015 (only the 4 LSBs of the register are used to determine the bit position).

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Clear Memory Bit - Complement Memory Bit - Set Accumulator, Auxiliary, or Temporary Register Bit - Set Status Register Bit

Example
Syntax bit(*AR3, AC0) = #1 Description The bit at the position defined by AC0(30) in the content addressed by AR3 is set to 1.

SPRU375G

Instruction Set Descriptions

5-385

Set Status Register Bit

Set Status Register Bit


Syntax Characteristics
Parallel Enable Bit Yes Yes Yes Yes

No. [1] [2] [3] [4]

Syntax bit(ST0, k4) = #1 bit(ST1, k4) = #1 bit(ST2, k4) = #1 bit(ST3, k4) = #1

Size 2 2 2 2

Cycles 1 1 1 1

Pipeline X X X X

When this instruction is decoded to modify status bit CAFRZ (15), CAEN (14), or CACLR (13), the CPU pipeline is flushed and the instruction is executed in 5 cycles regardless of the instruction context.

Opcode

ST0 ST1 ST2 ST3

0100 011E kkkk 0001 0100 011E kkkk 0011 0100 011E kkkk 0101 0100 011E kkkk 0111

Operands Description

k4, STx These instructions perform a bit manipulation in the A-unit ALU. These instructions set to 1 a single bit, as defined by a 4-bit immediate value, k4, in the selected status register (ST0, ST1, ST2, or ST3). Compatibility with C54x devices (C54CM = 1) C55x DSP status registers bit mapping (Figure 53, page 5-388) does not correspond to C54x DSP status register bits.

Status Bits

Affected by Affects

none Selected status bits

Repeat See Also

This instruction cannot be repeated. See the following other related instructions:
- Clear Status Register Bit - Set Accumulator, Auxiliary, or Temporary Register Bit - Set Memory Bit

5-386

Instruction Set Descriptions

SPRU375G

Set Status Register Bit

Example
Syntax bit(ST0, ST0_CARRY) = #1; ST0_CARRY = bit 11 Description The ST0 bit position defined by the label (ST0_CARRY, bit 11) is set to 1.

Before ST0 0000

After ST0 0800

SPRU375G

Instruction Set Descriptions

5-387

Set Status Register Bit

Figure 53. Status Registers Bit Mapping


ST0_55 15 ACOV2 R/W0 8 DP R/W0 ST1_55 15 BRAF R/W0 7 C16 R/W0 ST2_55 15 ARMS R/W0 7 AR7LC R/W0 ST3_55 15 CAFRZ R/W0 7 CBERR R/W0 14 CAEN R/W0 6 MPNMC R/Wpins 13 CACLR R/W0 5 SATA R/W0 4 Reserved 12 HINT R/W1 3 2 CLKOFF R/W0 1 SMUL R/W0 0 SST R/W0 11 Reserved (always write 1100b) 8 6 AR6LC R/W0 5 AR5LC R/W0 14 Reserved 13 12 DBGM R/W1 4 AR4LC R/W0 11 EALLOW R/W0 3 AR3LC R/W0 10 RDM R/W0 2 AR2LC R/W0 1 AR1LC R/W0 9 Reserved 8 CDPLC R/W0 0 AR0LC R/W0 14 CPL R/W0 6 FRCT R/W0 13 XF R/W1 5 C54CM R/W1 4 ASM R/W0 12 HM R/W0 11 INTM R/W1 10 M40 R/W0 9 SATD R/W0 8 SXMD R/W1 0 14 ACOV3 R/W0 13 TC1 R/W1 12 TC2 R/W1 11 CARRY R/W1 10 ACOV0 R/W0 9 ACOV1 R/W0 0

Legend: R = Read; W = Write; -n = Value after reset Highlighted bit: If you write to the protected address of the status register, a write to this bit has no effect, and the bit always appears as a 0 during read operations. The HINT bit is not used for all C55x host port interfaces (HPIs). Consult the documentation for the specific C55x DSP. The reset value of MPNMC may be dependent on the state of predefined pins at reset. To check this for a particular C55x DSP, see the boot loader section of its data sheet.

5-388

Instruction Set Descriptions

SPRU375G

Shift Accumulator Content Conditionally (sftc)

Shift Accumulator Content Conditionally


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax ACx = sftc(ACx, TC1) ACx = sftc(ACx, TC2)

Size 2 2

Cycles 1 1

Pipeline X X

Opcode

TC1 TC2

0101 101E DDxx xx10 0101 101E DDxx xx11

Operands Description

ACx, TCx If the source accumulator ACx(390) is equal to 0, this instruction sets the TCx status bit to 1. If the source accumulator ACx(310) has two sign bits:
- this instruction shifts left the 32-bit accumulator ACx by 1 bit - the TCx status bit is cleared to 0

If the source accumulator ACx(310) does not have two sign bits, this instruction sets the TCx status bit to 1. The sign bits are extracted at bit positions 31 and 30. Status Bits Affected by Affects Repeat See Also none TCx

This instruction can be repeated. See the following other related instructions:
- Shift Accumulator Content Logically - Shift Accumulator, Auxiliary, or Temporary Register Content Logically - Signed Shift of Accumulator Content - Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

SPRU375G

Instruction Set Descriptions

5-389

Shift Accumulator Content Conditionally (sftc)

Example 1
Syntax AC0 = sftc(AC0, TC1) Description Because AC0(31) XORed with AC0(30) equals 1, the content of AC0 is not shifted left and TC1 is set to 1.
After AC0 TC1

Before AC0 TC1

FF 8765 0055 0

FF 8765 0055 1

Example 2
Syntax AC0 = sftc(AC0, TC2) Description Because AC0(31) XORed with AC0(30) equals 0, the content of AC0 is shifted left by 1 bit and TC2 is cleared to 0.
After AC0 TC2

Before AC0 TC2

00 1234 0000 0

00 2468 0000 0

5-390

Instruction Set Descriptions

SPRU375G

Shift Accumulator Content Logically

Shift Accumulator Content Logically


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax ACy = ACx <<< Tx ACy = ACx <<< #SHIFTW

Size 2 3

Cycles 1 1

Pipeline X X

Description

These instructions perform an unsigned shift by an immediate value, SHIFTW, or the content of a temporary register (Tx) in the D-unit shifter. Affected by Affects C54CM, M40 CARRY

Status Bits

See Also

See the following other related instructions:


- Shift Accumulator Content Conditionally - Shift Accumulator, Auxiliary, or Temporary Register Content Logically - Signed Shift of Accumulator Content - Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

SPRU375G

Instruction Set Descriptions

5-391

Shift Accumulator Content Logically

Shift Accumulator Content Logically


Syntax Characteristics
No. [1] Syntax ACy = ACx <<< Tx Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 110E DDSS ss00

This instruction shifts by the temporary register (Tx) content the accumulator (ACx) content and stores the shifted-out bit in the CARRY status bit. If the 16-bit value contained in Tx is out of the 32 to +31 range, the shift is saturated to 32 or +31 and the shift operation is performed with this value. However, no overflow is reported when such saturation occurs.
- The operation is performed on 40 bits in the D-unit shifter. - The shift operation is performed according to M40. - The CARRY status bit contains the shifted-out bit. When the shift count is

zero, Tx = 0, the CARRY status bit is cleared to 0. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, the 6 LSBs of Tx define the shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC0 >>> T0 Description The content of AC0 is logically shifted right by the content of T0 and the result is stored in AC1. There is a right shift because the content of T0 is negative (6). Because M40 = 0, the guard bits (3932) are cleared.
After 5F B000 1234 00 C680 ACF0 FFFA 0 AC0 AC1 T0 M40 5F B000 1234 00 02C0 0048 FFFA 0

C54CM, M40 CARRY

This instruction can be repeated.

Before AC0 AC1 T0 M40

5-392

Instruction Set Descriptions

SPRU375G

Shift Accumulator Content Logically

Shift Accumulator Content Logically


Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax ACy = ACx <<< #SHIFTW

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, SHIFTW

0001 000E DDSS 0111 xxSH IFTW

This instruction shifts by a 6-bit value, SHIFTW, the accumulator (ACx) content and stores the shifted-out bit in the CARRY status bit.
- The operation is performed on 40 bits in the D-unit shifter. - The shift operation is performed according to M40. - The CARRY status bit contains the shifted-out bit. When the shift count is

zero, SHIFTW = 0, the CARRY status bit is cleared to 0. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 <<< #31 Description The content of AC1 is logically shifted left by 31 bits and the result is stored in AC0.

M40 CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-393

Shift Accumulator, Auxiliary, or Temporary Register Content Logically

Shift Accumulator, Auxiliary, or Temporary Register Content Logically


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax dst = dst <<< #1 dst = dst >>> #1

Size 2 2

Cycles 1 1

Pipeline X X

Description

These instructions perform an unsigned shift by 1 bit:


- In the D-unit shifter, if the destination operand is an accumulator (ACx). - In the A-unit ALU, if the destination operand is an auxiliary or temporary

register (TAx). Status Bits Affected by Affects See Also C54CM, M40 CARRY

See the following other related instructions:


- Shift Accumulator Content Conditionally - Shift Accumulator Content Logically - Signed Shift of Accumulator Content - Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

5-394

Instruction Set Descriptions

SPRU375G

Shift Accumulator, Auxiliary, or Temporary Register Content Logically

Shift Accumulator, Auxiliary, or Temporary Register Content Logically


Syntax Characteristics
No. [1] Syntax dst = dst <<< #1 Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst

0101 000E FDDD x000

This instruction shifts left by 1 bit the input operand (dst). The CARRY status bit contains the shifted-out bit.
- When the destination operand (dst) is an accumulator: J J J

The operation is performed on 40 bits in the D-unit shifter. 0 is inserted at bit position 0. The shifted-out bit is extracted at a bit position according to M40.

- When the destination operand (dst) is an auxiliary or temporary register: J J J

The operation is performed on 16 bits in the A-unit ALU. 0 is inserted at bit position 0. The shifted-out bit is extracted at bit position 15 and stored in the CARRY status bit.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC1 <<< #1 Description The content of AC1 is logically shifted left by 1 bit and the result is stored in AC1. Because M40 = 0, the CARRY status bit is extracted at bit 31 and the guard bits (3932) are cleared.
After 8F E340 5678 0 0 AC1 CARRY M40 00 C680 ACF0 1 0

M40 CARRY

This instruction can be repeated.

Before AC1 CARRY M40

SPRU375G

Instruction Set Descriptions

5-395

Shift Accumulator, Auxiliary, or Temporary Register Content Logically

Shift Accumulator, Auxiliary, or Temporary Register Content Logically


Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax dst = dst >>> #1

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst

0101 000E FDDD x001

This instruction shifts right by 1 bit the input operand (dst). The CARRY status bit contains the shifted-out bit.
- When the destination operand (dst) is an accumulator: J J J

The operation is performed on 40 bits in the D-unit shifter. 0 is inserted at a bit position according to M40. The shifted-out bit is extracted at bit position 0 and stored in the CARRY status bit.

When the destination operand (dst) is an auxiliary or temporary register:


J J J

The operation is performed on 16 bits in the A-unit ALU. 0 is inserted at bit position 15. The shifted-out bit is extracted at bit position 0 and stored in the CARRY status bit.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 >>> #1 Description The content of AC0 is logically shifted right by 1 bit and the result is stored in AC0.

M40 CARRY

This instruction can be repeated.

5-396

Instruction Set Descriptions

SPRU375G

Signed Shift of Accumulator Content

Signed Shift of Accumulator Content


Syntax Characteristics
Parallel Enable Bit Yes Yes Yes Yes

No. [1] [2] [3] [4]

Syntax ACy = ACx << Tx ACy = ACx <<C Tx ACy = ACx << #SHIFTW ACy = ACx <<C #SHIFTW

Size 2 2 3 3

Cycles 1 1 1 1

Pipeline X X X X

Description

These instructions perform a signed shift by an immediate value, SHIFTW, or by the content of a temporary register (Tx) in the D-unit shifter. Affected by Affects C54CM, M40, SATA, SATD, SXMD ACOVx, ACOVy, CARRY

Status Bits

See Also

See the following other related instructions:


- Shift Accumulator Content Conditionally - Shift Accumulator Content Logically - Shift Accumulator, Auxiliary, or Temporary Register Content Logically - Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

SPRU375G

Instruction Set Descriptions

5-397

Signed Shift of Accumulator Content

Signed Shift of Accumulator Content


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax ACy = ACx << Tx

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 110E DDSS ss01

This instruction shifts by the temporary register (Tx) content the accumulator (ACx) content. If the 16-bit value contained in Tx is out of the 32 to +31 range, the shift is saturated to 32 or +31 and the shift operation is performed with this value; a destination accumulator overflow is reported when such saturation occurs.
- The operation is performed on 40 bits in the D-unit shifter. - When M40 = 0, the input to the shifter is modified according to SXMD and

then the modified input is shifted by the Tx content:


J J

if SXMD = 0, 0 is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter if SXMD = 1, bit 31 of the source operand is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter

- The sign position of the source operand is compared to the shift quantity.

This comparison depends on M40:


J J

if M40 =0, comparison is performed versus bit 31 if M40 =1, comparison is performed versus bit 39

- 0 is inserted at bit position 0. - The shifted-out bit is extracted according to M40. - After shifting, unless otherwise noted, when M40 = 0: J J

overflow is detected at bit position 31 (if an overflow is detected, the destination ACOVy bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow)
SPRU375G

5-398

Instruction Set Descriptions

Signed Shift of Accumulator Content

- After shifting, unless otherwise noted, when M40 = 1: J J

overflow is detected at bit position 39 (if an overflow is detected, the destination ACOVy bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 7F FFFF FFFFh (positive overflow) or 80 0000 0000h (negative overflow)

Compatibility with C54x devices (C54CM = 1) When C54CM = 1:


- These instructions are executed as if M40 status bit was locally set to 1. - There is no overflow detection, overflow report, and saturation performed

by the D-unit shifter.


- The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of

Tx define a shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 << T0 Description The content of AC1 is shifted by the content of T0 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-399

Signed Shift of Accumulator Content

Signed Shift of Accumulator Content


Syntax Characteristics
No. [2] Syntax ACy = ACx <<C Tx Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 110E DDSS ss10

This instruction shifts by the temporary register (Tx) content the accumulator (ACx) content and stores the shifted-out bit in the CARRY status bit. If the 16-bit value contained in Tx is out of the 32 to +31 range, the shift is saturated to 32 or +31 and the shift operation is performed with this value; a destination accumulator overflow is reported when such saturation occurs.
- The operation is performed on 40 bits in the D-unit shifter. - When M40 = 0, the input to the shifter is modified according to SXMD and

then the modified input is shifted by the Tx content:


J J

if SXMD = 0, 0 is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter if SXMD = 1, bit 31 of the source operand is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter

- The sign position of the source operand is compared to the shift quantity.

This comparison depends on M40:


J J

if M40 =0, comparison is performed versus bit 31 if M40 =1, comparison is performed versus bit 39

- 0 is inserted at bit position 0. - The shifted-out bit is extracted according to M40 and stored in the CARRY

status bit. When the shift count is zero, Tx = 0, the CARRY status bit is cleared to 0.
- After shifting, unless otherwise noted, when M40 = 0: J J

overflow is detected at bit position 31 (if an overflow is detected, the destination ACOVy bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow)
SPRU375G

5-400

Instruction Set Descriptions

Signed Shift of Accumulator Content

- After shifting, unless otherwise noted, when M40 = 1: J J

overflow is detected at bit position 39 (if an overflow is detected, the destination ACOVy bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 7F FFFF FFFFh (positive overflow) or 80 0000 0000h (negative overflow)

Compatibility with C54x devices (C54CM = 1) When C54CM = 1:


- These instructions are executed as if M40 status bit was locally set to 1. - There is no overflow detection, overflow report, and saturation performed

by the D-unit shifter.


- The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of

Tx define a shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat Example
Syntax AC2 = AC2 <<C T1 Description The content of AC2 is shifted left by the content of T1 and the saturated result is stored in AC2. The shifted out bit is stored in the CARRY status bit. Since SATD = 1 and M40 = 0, AC2 = FF 8000 0000 (saturation).
After AC2 T1 CARRY M40 ACOV2 SXMD SATD

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

Before AC2 T1 CARRY M40 ACOV2 SXMD SATD

80 AA00 1234 0005 0 0 0 1 1

FF 8000 0000 0005 1 0 1 1 1

SPRU375G

Instruction Set Descriptions

5-401

Signed Shift of Accumulator Content

Signed Shift of Accumulator Content


Syntax Characteristics
No. [3] Syntax ACy = ACx << #SHIFTW Parallel Enable Bit Yes Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, SHIFTW

0001 000E DDSS 0101 xxSH IFTW

This instruction shifts by a 6-bit value, SHIFTW, the accumulator (ACx) content.
- The operation is performed on 40 bits in the D-unit shifter. - When M40 = 0, the input to the shifter is modified according to SXMD and

then the modified input is shifted by the 6-bit value, SHIFTW:


J J

if SXMD = 0, 0 is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter if SXMD = 1, bit 31 of the source operand is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter

- The sign position of the source operand is compared to the shift quantity.

This comparison depends on M40:


J J

if M40 =0, comparison is performed versus bit 31 if M40 =1, comparison is performed versus bit 39

- 0 is inserted at bit position 0. - The shifted-out bit is extracted according to M40. - After shifting, unless otherwise noted, when M40 = 0: J J

overflow is detected at bit position 31 (if an overflow is detected, the destination ACOVy bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow)

- After shifting, unless otherwise noted, when M40 = 1: J J

overflow is detected at bit position 39 (if an overflow is detected, the destination ACOVy bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 7F FFFF FFFFh (positive overflow) or 80 0000 0000h (negative overflow)
SPRU375G

5-402

Instruction Set Descriptions

Signed Shift of Accumulator Content

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, these instructions are executed as if M40 status bit was locally set to 1. There is no overflow detection, overflow report, and saturation performed by the D-unit shifter. Status Bits Affected by Affects Repeat Example 1
Syntax AC0 = AC1 << #31 Description The content of AC1 is shifted left by 31 bits and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy

This instruction can be repeated.

Example 2
Syntax AC0 = AC1 << #32 Description The content of AC1 is shifted right by 32 bits and the result is stored in AC0.

SPRU375G

Instruction Set Descriptions

5-403

Signed Shift of Accumulator Content

Signed Shift of Accumulator Content


Syntax Characteristics
Parallel Enable Bit Yes

No. [4]

Syntax ACy = ACx <<C #SHIFTW

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, SHIFTW

0001 000E DDSS 0110 xxSH IFTW

This instruction shifts by a 6-bit value, SHIFTW, the accumulator (ACx) content and stores the shifted-out bit in the CARRY status bit.
- The operation is performed on 40 bits in the D-unit shifter. - When M40 = 0, the input to the shifter is modified according to SXMD and

then the modified input is shifted by the 6-bit value, SHIFTW:


J J

if SXMD = 0, 0 is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter if SXMD = 1, bit 31 of the source operand is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter

- The sign position of the source operand is compared to the shift quantity.

This comparison depends on M40:


J J

if M40 =0, comparison is performed versus bit 31 if M40 =1, comparison is performed versus bit 39

- 0 is inserted at bit position 0. - The shifted-out bit is extracted according to M40 and stored in the CARRY

status bit. When the shift count is zero, SHIFTW = 0, the CARRY status bit is cleared to 0.
- After shifting, unless otherwise noted, when M40 = 0: J J

overflow is detected at bit position 31 (if an overflow is detected, the destination ACOVy bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow)
SPRU375G

5-404

Instruction Set Descriptions

Signed Shift of Accumulator Content

- After shifting, unless otherwise noted, when M40 = 1: J J

overflow is detected at bit position 39 (if an overflow is detected, the destination ACOVy bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 7F FFFF FFFFh (positive overflow) or 80 0000 0000h (negative overflow)

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, these instructions are executed as if M40 status bit was locally set to 1. There is no overflow detection, overflow report, and saturation performed by the D-unit shifter. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC0 <<C #5 Description The content of AC0 is shifted right by 5 bits and the result is stored in AC1. The shifted out bit is stored in the CARRY status bit.
After AC0 AC1 CARRY SXMD

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

Before AC0 AC1 CARRY SXMD

FF 8765 0055 00 4321 1234 0 1

FF 8765 0055 FF FC3B 2802 1 1

SPRU375G

Instruction Set Descriptions

5-405

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
Parallel Enable Bit Yes Yes

No. [1] [2]

Syntax dst = dst >> #1 dst = dst << #1

Size 2 2

Cycles 1 1

Pipeline X X

Description

These instructions perform a shift of 1 bit:


- In the D-unit shifter, if the destination operand is an accumulator (ACx). - In the A-unit ALU, if the destination operand is an auxiliary or temporary

register (TAx). Status Bits Affected by Affects See Also C54CM, M40, SATA, SATD, SXMD ACOVx, ACOVy, CARRY

See the following other related instructions:


- Shift Accumulator Content Conditionally - Shift Accumulator Content Logically - Shift Accumulator, Auxiliary, or Temporary Register Content Logically - Signed Shift of Accumulator Content

5-406

Instruction Set Descriptions

SPRU375G

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax dst = dst >> #1

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst

0100 010E 01x0 FDDD

This instruction shifts right by 1 bit the content of the destination register (dst). If the destination operand (dst) is an accumulator:
- The operation is performed on 40 bits in the D-unit shifter. - When M40 = 0, the input to the shifter is modified according to SXMD and

then the modified input is shifted right by 1 bit:


J J

if SXMD = 0, 0 is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter if SXMD = 1, bit 31 of the source operand is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter

- Bit 39 is extended according to SXMD - The shifted-out bit is extracted at bit position 0. - After shifting, unless otherwise noted, when M40 = 0: J J

overflow is detected at bit position 31 if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow)

- After shifting, unless otherwise noted, when M40 = 1: J J

overflow is detected at bit position 39 if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 7F FFFF FFFFh (positive overflow) or 80 0000 0000h (negative overflow)

SPRU375G

Instruction Set Descriptions

5-407

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

If the destination operand (dst) is an auxiliary or temporary register:


- The operation is performed on 16 bits in the A-unit ALU. - Bit 15 is sign extended. - After shifting, unless otherwise noted: J J

overflow is detected at bit position 15 if SATA = 1, when an overflow is detected, the destination register saturation values are 7FFFh (positive overflow) or 8000h (negative overflow)

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, these instructions are executed as if M40 status bit was locally set to 1. There is no overflow detection, overflow report, and saturation performed by the D-unit shifter. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 >> #1 Description The content of AC0 is shifted right by 1 bit and the result is stored in AC0.

C54CM, M40, SATA, SATD, SXMD none

This instruction can be repeated.

5-408

Instruction Set Descriptions

SPRU375G

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content


Syntax Characteristics
No. [2] Syntax dst = dst << #1 Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst

0100 010E 01x1 FDDD

This instruction shifts left by 1 bit the content of the destination register (dst). If the destination operand (dst) is an accumulator:
- The operation is performed on 40 bits in the D-unit shifter. - When M40 = 0, the input to the shifter is modified according to SXMD and

then the modified input is shifted left by 1 bit:


J J

if SXMD = 0, 0 is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter if SXMD = 1, bit 31 of the source operand is substituted for the guard bits (3932) as the input, instead of ACx(3932), to the shifter

- The sign position of the source operand is compared to the shift quantity.

This comparison depends on M40:


J J

if M40 =0, comparison is performed versus bit 31 if M40 =1, comparison is performed versus bit 39

- 0 is inserted at bit position 0. - The shifted-out bit is extracted according to M40. - After shifting, unless otherwise noted, when M40 = 0: J J

overflow is detected at bit position 31 (if an overflow is detected, the destination ACOVx bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow)

- After shifting, unless otherwise noted, when M40 = 1: J J

overflow is detected at bit position 39 (if an overflow is detected, the destination ACOVx bit is set) if SATD = 1, when an overflow is detected, the destination accumulator saturation values are 7F FFFF FFFFh (positive overflow) or 80 0000 0000h (negative overflow)
Instruction Set Descriptions 5-409

SPRU375G

Signed Shift of Accumulator, Auxiliary, or Temporary Register Content

If the destination operand (dst) is an auxiliary or temporary register:


- The operation is performed on 16 bits in the A-unit ALU. - 0 is inserted at bit position 0. - After shifting, unless otherwise noted: J J

overflow is detected at bit position 15 (if an overflow is detected, the destination ACOVx bit is set) if SATA = 1, when an overflow is detected, the destination register saturation values are 7FFFh (positive overflow) or 8000h (negative overflow)

Compatibility with C54x devices (C54CM = 1) When C54CM = 1, these instructions are executed as if M40 status bit was locally set to 1. There is no overflow detection, overflow report, and saturation performed by the D-unit shifter. Status Bits Affected by Affects Repeat Example
Syntax T2 = T2 << #1
Before T2 SATA

C54CM, M40, SATA, SATD, SXMD ACOVx

This instruction can be repeated.

Description The content of T2 is shifted left by 1 bit and the result is stored in T2.
After T2 SATA

EF27 1

DE4E 1

5-410

Instruction Set Descriptions

SPRU375G

Software Interrupt (intr)

Software Interrupt
Syntax Characteristics
No. [1] Syntax intr(k5) Parallel Enable Bit No Size 2 Cycles 3 Pipeline D

Opcode Operands Description k5

1001 0101 0xxk kkkk

This instruction passes control to a specified interrupt service routine (ISR) and interrupts are globally disabled (INTM bit is set to 1 after ST1_55 content is pushed onto the data stack pointer). The ISR address is stored at the interrupt vector address defined by the content of an interrupt vector pointer (IVPD or IVPH) combined with the 5-bit constant, k5. This instruction is executed regardless of the value of INTM bit. Note: DBSTAT (the debug status register) holds debug context information used during emulation. Make sure the ISR does not modify the value that will be returned to DBSTAT. Before beginning an ISR, the CPU automatically saves the value of some CPU registers and two internal registers: the program counter (PC) and a loop context register. The CPU can use these values to re-establish the context of the interrupted program sequence when the ISR is done. In the slow-return process (default), the return address (from the PC), the loop context bits, and some CPU registers are stored to the stacks (in memory). When the CPU returns from an ISR, the speed at which these values are restored is dependent on the speed of the memory accesses. In the fast-return process, the return address (from the PC) and the loop context bits are saved to registers, so that these values can always be restored quickly. These special registers are the return address register (RETA) and the control-flow context register (CFCT). You can read from or write to RETA and CFCT as a pair with dedicated, 32-bit load and store instructions. Some CPU registers are saved to the stacks (in memory). For fast-return mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371). When control is passed to the ISR:
- The data stack pointer (SP) is decremented by 1 word in the address

phase of the pipeline. The status register 2 (ST2_55) content is pushed to the top of SP.
SPRU375G Instruction Set Descriptions 5-411

Software Interrupt (intr)

- The system stack pointer (SSP) is decremented by 1 word in the address

phase of the pipeline. The 7 higher bits of status register 0 (ST0_55) concatenated with 9 zeroes are pushed to the top of SSP.
- The SP is decremented by 1 word in the access phase of the pipeline. The

status register 1 (ST1_55) content is pushed to the top of SP.


- The SSP is decremented by 1 word in the access phase of the pipeline.

The debug status register (DBSTAT) content is pushed to the top of SSP.
- The SP is decremented by 1 word in the read phase of the pipeline. The

16 LSBs of the return address, from the program counter (PC), of the called subroutine are pushed to the top of SP.
- The SSP is decremented by 1 word in the read phase of the pipeline. The

loop context bits concatenated with the 8 MSBs of the return address are pushed to the top of SSP.
- The PC is loaded with the ISR program address. The active control flow

execution context flags are cleared.


System Stack (SSP) After S Save SSP = x 3 SSP = x 2 SSP = x 1 Before S Save SSP = x (Loop bits):PC(2316) DBSTAT ST0_55(159) Previously saved data Before S Save After SP = y 3 S Save SP = y 2 SP = y 1 SP = y Data Stack (SP) PC(150) ST1_55 ST2_55 Previously saved data

Status Bits

Affected by Affects

none INTM

Repeat See Also

This instruction cannot be repeated. See the following other related instructions:
- Return from Interrupt - Software Trap

Example
Syntax intr(#3) Description Program control is passed to the specified interrupt service routine. The interrupt vector address is defined by the content of an interrupt vector pointer (IVPD) combined with the unsigned 5-bit value (3).

5-412

Instruction Set Descriptions

SPRU375G

Software Reset (reset)

Software Reset
Syntax Characteristics
Parallel Enable bit No

No. [1]

Syntax reset

Size 2

Cycles ?

Pipeline D

Opcode Operands Description none

1001 0100 xxxx xxxx

This instruction performs a nonmaskable software reset that can be used any time to put the device in a known state. The reset instruction affects ST0_55, ST1_55, ST2_55, IFR0, IFR1, and T2 (Table 55 and Figure 54); status register ST3_55 and interrupt vectors pointer registers (IVPD and IVPH) are not affected. When the reset instruction is acknowledged, the INTM is set to 1 to disable maskable interrupts. All pending interrupts in IFR0 and IFR1 are cleared. The initialization of the system control register, the interrupt vectors pointer, and the peripheral registers is different from the initialization performed by a hardware reset.

Status Bits

Affected by Affects

none IFR0, IFR1, ST0_55, ST1_55, ST2_55

Repeat

This instruction cannot be repeated.

SPRU375G

Instruction Set Descriptions

5-413

Software Reset (reset)

Table 55. Effects of a Software Reset on DSP Registers


Register T2 IFR0 IFR1 ST0_55 Bit All All All ACOV2 ACOV3 TC1 TC2 CARRY ACOV0 ACOV1 DP ST1_55 BRAF CPL XF HM Reset Value 0 0 0 0 0 1 1 1 0 0 0 0 0 1 0 Comment All bits are cleared. To ensure TMS320C54x DSP compatibility, instructions affected by ASM bit will use a shift count of 0 (no shift). All pending interrupt flags are cleared. All pending interrupt flags are cleared. AC2 overflow flag is cleared. AC3 overflow flag is cleared. Test control flag 1 is cleared. Test control flag 2 is cleared. CARRY bit is cleared. AC0 overflow flag is cleared. AC1 overflow flag is cleared. All bits are cleared, data page 0 is selected. This flag is cleared. The DP (rather than SP) direct addressing mode is selected. Direct accesses to data space are made relative to the data page register (DP). External flag is set. When an active HOLD signal forces the DSP to place its external interface in the high-impedance state, the DSP continues executing code from internal memory. Maskable interrupts are globally disabled. 32-bit (rather than 40-bit) computation mode is selected for the D unit. CPU will not saturate overflow results in the D unit. Sign-extension mode is on. Dual 16-bit mode is off. For an instruction that is affected by C16, the Dunit ALU performs one 32-bit operation rather than two parallel 16-bit operations. Results of multiply operations are not shifted. TMS320C54x-compatibility mode is on. Instructions affected by ASM will use a shift count of 0 (no shift).

INTM M40 SATD SXMD C16

1 0 0 1 0

FRCT C54CM ASM

0 1 0

5-414

Instruction Set Descriptions

SPRU375G

Software Reset (reset)

Table 55. Effects of a Software Reset on DSP Registers (Continued)


Register ST2_55 Bit ARMS DBGM EALLOW RDM CDPLC AR7LC AR6LC AR5LC AR4LC AR3LC AR2LC AR1LC AR0LC Reset Value 0 1 0 0 0 0 0 0 0 0 0 0 0 Comment When you use the AR indirect addressing mode, the DSP mode (rather than control mode) operands are available. Debug events are disabled. A program cannot write to the non-CPU emulation registers. When an instruction specifies that an operand should be rounded, the CPU uses rounding to the infinite (rather than rounding to the nearest). CDP is used for linear addressing (rather than circular addressing). AR7 is used for linear addressing. AR6 is used for linear addressing. AR5 is used for linear addressing. AR4 is used for linear addressing. AR3 is used for linear addressing. AR2 is used for linear addressing. AR1 is used for linear addressing. AR0 is used for linear addressing.

SPRU375G

Instruction Set Descriptions

5-415

Software Reset (reset)

Figure 54. Effects of a Software Reset on Status Registers


ST0_55 15 ACOV2 0 8 DP 0 14 ACOV3 0 13 TC1 1 12 TC2 1 11 CARRY 1 10 ACOV0 0 9 ACOV1 0 0

ST1_55 15 BRAF 0 7 C16 0 14 CPL 0 6 FRCT 0 13 XF 1 5 C54CM 1 4 ASM 0 12 HM 0 11 INTM 1 10 M40 0 9 SATD 0 8 SXMD 1 0

ST2_55 15 ARMS 0 7 AR7LC 0 6 AR6LC 0 5 AR5LC 0 14 Reserved 13 12 DBGM 1 4 AR4LC 0 11 EALLOW 0 3 AR3LC 0 10 RDM 0 2 AR2LC 0 1 AR1LC 0 9 Reserved 8 CDPLC 0 0 AR0LC 0

5-416

Instruction Set Descriptions

SPRU375G

Software Trap (trap)

Software Trap
Syntax Characteristics
No. [1] Syntax trap(k5) Parallel Enable Bit No Size 2 Cycles ? Pipeline D

Opcode Operands Description k5

1001 0101 1xxk kkkk

This instruction passes control to a specified interrupt service routine (ISR) and this instruction does not affect INTM bit in ST1_55. The ISR address is stored at the interrupt vector address defined by the content of an interrupt vector pointer (IVPD or IVPH) combined with the 5-bit constant, k5. This instruction is executed regardless of the value of INTM bit . This instruction is not maskable. Note: DBSTAT (the debug status register) holds debug context information used during emulation. Make sure the ISR does not modify the value that will be returned to DBSTAT. Before beginning an ISR, the CPU automatically saves the value of some CPU registers and two internal registers: the program counter (PC) and a loop context register. The CPU can use these values to re-establish the context of the interrupted program sequence when the ISR is done. In the slow-return process (default), the return address (from the PC), the loop context bits, and some CPU registers are stored to the stacks (in memory). When the CPU returns from an ISR, the speed at which these values are restored is dependent on the speed of the memory accesses. In the fast-return process, the return address (from the PC) and the loop context bits are saved to registers, so that these values can always be restored quickly. These special registers are the return address register (RETA) and the control-flow context register (CFCT). You can read from or write to RETA and CFCT as a pair with dedicated, 32-bit load and store instructions. Some CPU registers are saved to the stacks (in memory). For fast-return mode operation, see the TMS320C55x DSP CPU Reference Guide (SPRU371). When control is passed to the ISR:
- The data stack pointer (SP) is decremented by 1 word in the address

phase of the pipeline. The status register 2 (ST2_55) content is pushed to the top of SP.
SPRU375G Instruction Set Descriptions 5-417

Software Trap (trap)

- The system stack pointer (SSP) is decremented by 1 word in the address

phase of the pipeline. The 7 higher bits of status register 0 (ST0_55) concatenated with 9 zeroes are pushed to the top of SSP.
- The SP is decremented by 1 word in the access phase of the pipeline. The

status register 1 (ST1_55) content is pushed to the top of SP.


- The SSP is decremented by 1 word in the access phase of the pipeline.

The debug status register (DBSTAT) content is pushed to the top of SSP.
- The SP is decremented by 1 word in the read phase of the pipeline. The

16 LSBs of the return address, from the program counter (PC), of the called subroutine are pushed to the top of SP.
- The SSP is decremented by 1 word in the read phase of the pipeline. The

loop context bits concatenated with the 8 MSBs of the return address are pushed to the top of SSP.
- The PC is loaded with the ISR program address. The active control flow

execution context flags are cleared.


System Stack (SSP) After S Save SSP = x 3 SSP = x 2 SSP = x 1 Before S Save SSP = x (Loop bits):PC(2316) DBSTAT ST0_55(159) Previously saved data Before S Save After SP = y 3 S Save SP = y 2 SP = y 1 SP = y Data Stack (SP) PC(150) ST1_55 ST2_55 Previously saved data

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction cannot be repeated. See the following other related instructions:
- Return from Interrupt - Software Interrupt

Example
Syntax trap(5) Description Program control is passed to the specified interrupt service routine. The interrupt vector address is defined by the content of an interrupt vector pointer (IVPD) combined with the unsigned 5-bit value (5).

5-418

Instruction Set Descriptions

SPRU375G

Square

Square
Syntax Characteristics
Parallel Enable Bit Yes No

No. [1] [2]

Syntax ACy = rnd(ACx * ACx) ACx = rnd(Smem * Smem)[, T3 = Smem]

Size 2 3

Cycles 1 1

Pipeline X X

Description

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are:
- ACx(3216) - the content of a memory (Smem) location, sign extended to 17 bits

Status Bits

Affected by Affects

FRCT, M40, RDM, SATD, SMUL ACOVx, ACOVy

See Also

See the following other related instructions:


- Multiply - Square and Accumulate - Square and Subtract - Square Distance

SPRU375G

Instruction Set Descriptions

5-419

Square

Square
Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax ACy = rnd(ACx * ACx)

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy

0101 010E DDSS 100%

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are ACx(3216).
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 * AC1 Description The content of AC1 is squared and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-420

Instruction Set Descriptions

SPRU375G

Square

Square
Syntax Characteristics
Parallel Enable Bit No

No. [2]

Syntax ACx = rnd(Smem * Smem)[, T3 = Smem]

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Smem

1101 0011 AAAA AAAI U%DD 10xx

This instruction performs a multiplication in the D-unit MAC. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits. - Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVx) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = *AR3 * *AR3 Description The content addressed by AR3 is squared and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVx

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-421

Square and Accumulate

Square and Accumulate


Syntax Characteristics
Parallel Enable Bit Yes No

No. [1] [2]

Syntax ACy = rnd(ACy + (ACx * ACx)) ACy = rnd(ACx + (Smem * Smem)) [,T3 = Smem]

Size 2 3

Cycles 1 1

Pipeline X X

Description

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are:
- ACx(3216) - the content of a memory (Smem) location, sign extended to 17 bits

Status Bits

Affected by Affects

FRCT, M40, RDM, SATD, SMUL ACOVx, ACOVy

See Also

See the following other related instructions:


- Multiply and Accumulate - Square - Square Distance - Square and Subtract

5-422

Instruction Set Descriptions

SPRU375G

Square and Accumulate

Square and Accumulate


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax ACy = rnd(ACy + (ACx * ACx))

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy

0101 010E DDSS 001%

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are ACx(3216).
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 + (AC1 * AC1) Description The content of AC1 squared is added to the content of AC0 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-423

Square and Accumulate

Square and Accumulate


Syntax Characteristics
No. [2] Syntax ACy = rnd(ACx + (Smem * Smem)) [,T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 0010 AAAA AAAI U%DD 10SS

This instruction performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 + (*AR3 * *AR3) Description The content addressed by AR3 squared is added to the content of AC1 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-424

Instruction Set Descriptions

SPRU375G

Square and Subtract

Square and Subtract


Syntax Characteristics
Parallel Enable Bit Yes No

No. [1] [2]

Syntax ACy = rnd(ACy (ACx * ACx)) ACy = rnd(ACx (Smem * Smem))[, T3 = Smem]

Size 2 3

Cycles 1 1

Pipeline X X

Description

This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are:
- ACx(3216) - the content of a memory (Smem) location, sign extended to 17 bits

Status Bits

Affected by Affects

FRCT, M40, RDM, SATD, SMUL ACOVx, ACOVy

See Also

See the following other related instructions:


- Multiply and Subtract - Square - Square and Accumulate - Square Distance

SPRU375G

Instruction Set Descriptions

5-425

Square and Subtract

Square and Subtract


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax ACy = rnd(ACy (ACx * ACx))

Size 2

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy

0101 010E DDSS 010%

This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are ACx(3216).
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACy.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC1 (AC0 * AC0) Description The content of AC0 squared is subtracted from the content of AC1 and the result is stored in AC1.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

5-426

Instruction Set Descriptions

SPRU375G

Square and Subtract

Square and Subtract


Syntax Characteristics
No. [2] Syntax ACy = rnd(ACx (Smem * Smem))[, T3 = Smem] Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 0010 AAAA AAAI U%DD 11SS

This instruction performs a multiplication and a subtraction in the D-unit MAC. The input operands of the multiplier are the content of a memory (Smem) location, sign extended to 17 bits.
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and

subtracted from the source accumulator ACx.


- Rounding is performed according to RDM, if the optional rnd keyword is

applied to the instruction.


- Overflow detection depends on M40. If an overflow is detected, the

destination accumulator overflow status bit (ACOVy) is set.


- When an overflow is detected, the accumulator is saturated according to

SATD. This instruction provides the option to store the 16-bit data memory operand Smem in temporary register T3. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 (*AR3 * *AR3) Description The content addressed by AR3 squared is subtracted from the content of AC1 and the result is stored in AC0.

FRCT, M40, RDM, SATD, SMUL ACOVy

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-427

Square Distance (sqdst)

Square Distance
Syntax Characteristics
No. [1] Syntax sqdst(Xmem, Ymem, ACx, ACy) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0110 XXXM MMYY YMMM DDDD 1110 xxn% ACx, ACy, Xmem, Ymem This instruction performs two parallel operations: multiply and accumulate (MAC), and subtract:
ACy = ACy + (ACx * ACx), ACx = (Xmem << #16) (Ymem << #16)

The first operation performs a multiplication and an accumulation in the D-unit MAC. The input operands of the multiplier are ACx(3216).
- If FRCT = 1, the output of the multiplier is shifted left by 1 bit. - Multiplication overflow detection depends on SMUL. - The 32-bit result of the multiplication is sign extended to 40 bits and added

to the source accumulator ACy.


- Addition overflow detection depends on M40. If an overflow is detected,

the destination accumulator overflow status bit (ACOVy) is set.


- When an addition overflow is detected, the accumulator is saturated

according to SATD. The second operation subtracts the content of data memory operand Ymem, shifted left 16 bits, from the content of data memory operand Xmem, shifted left 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD.
5-428 Instruction Set Descriptions SPRU375G

Square Distance (sqdst)

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, during the subtraction an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat See Also C54CM, FRCT, M40, SATD, SMUL, SXMD ACOVx, ACOVy, CARRY

This instruction can be repeated. See the following other related instructions:
- Absolute Distance - Square - Square and Accumulate - Square and Subtract

Example
Syntax sqdst(*AR0, *AR1, AC0, AC1) Description The content of AC0 squared is added to the content of AC1 and the result is stored in AC1. The content addressed by AR1 shifted left by 16 bits is subtracted from the content addressed by AR0 shifted left by 16 bits and the result is stored in AC0.
After FF ABCD 0000 00 0000 0000 0055 00AA 0 0 0 0 AC0 AC1 *AR0 *AR1 ACOV0 ACOV1 CARRY FRCT FF FFAB 0000 00 1BB1 8229 0055 00AA 0 0 0 0

Before AC0 AC1 *AR0 *AR1 ACOV0 ACOV1 CARRY FRCT

SPRU375G

Instruction Set Descriptions

5-429

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] Syntax Smem = HI(ACx) Smem = HI(rnd(ACx)) Smem = LO(ACx << Tx) Smem = HI(rnd(ACx << Tx)) Smem = LO(ACx << #SHIFTW) Smem = HI(ACx << #SHIFTW) Smem = HI(rnd(ACx << #SHIFTW)) Smem = HI(saturate(uns(rnd(ACx)))) Smem = HI(saturate(uns(rnd(ACx << Tx)))) Smem = HI(saturate(uns(rnd(ACx << #SHIFTW)))) dbl(Lmem) = ACx dbl(Lmem) = saturate(uns(ACx)) HI(Lmem) = HI(ACx) >> #1, LO(Lmem) = LO(ACx) >> #1 Xmem = LO(ACx), Ymem = HI(ACx) Parallel Enable Bit No No No No No No No No No No No No No No Size 2 3 3 3 3 3 4 3 3 4 3 3 3 3 Cycles Pipeline 1 1 1 1 1 1 1 1 1 1 1 1 1 1 X X X X X X X X X X X X X X

Description

This instruction stores the content of the selected accumulator (ACx) to a memory (Smem) location, to a data memory operand (Lmem), or to dual data memory operands (Xmem and Ymem). Affected by Affects C54CM, RDM, SXMD none

Status Bits

5-430

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

See Also

See the following other related instructions:


- Addition with Parallel Store Accumulator Content to Memory - Load Accumulator from Memory with Parallel Store Accumulator Content

to Memory
- Load Accumulator, Auxiliary, or Temporary Register from Memory - Multiply and Accumulate with Parallel Store Accumulator Content to Memory - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Multiply with Parallel Store Accumulator Content to Memory - Store Accumulator Pair Content to Memory - Store Accumulator, Auxiliary, or Temporary Register Content to Memory - Store Auxiliary or Temporary Register Pair Content to Memory - Subtraction with Parallel Store Accumulator Content to Memory

SPRU375G

Instruction Set Descriptions

5-431

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [1] Syntax Smem = HI(ACx) Parallel Enable Bit No Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, Smem

1011 11SS AAAA AAAI

This instruction stores the high part of the accumulator, ACx(3116), to the memory (Smem) location. The store operation to the memory location uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs. Affected by Affects none none

Status Bits

Repeat Example
Syntax *AR3 = HI(AC0)

This instruction can be repeated.

Description The content of AC0(3116) is stored at the location addressed by AR3.

5-432

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [2] Syntax Smem = HI(rnd(ACx)) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Smem

1110 1000 AAAA AAAI SSxx x0x%

This instruction stores the high part of the accumulator, ACx(3116), to the memory (Smem) location. Rounding is performed in the D-unit shifter according to RDM, if the optional rnd keyword is applied to the input operand. Affected by Affects RDM none

Status Bits

Repeat Example
Syntax *AR3 = HI(rnd(AC0))

This instruction can be repeated.

Description The content of AC0(3116) is rounded and stored at the location addressed by AR3.

SPRU375G

Instruction Set Descriptions

5-433

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [3] Syntax Smem = LO(ACx << Tx) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Smem, Tx

1110 0111 AAAA AAAI SSss 00xx

This instruction shifts the accumulator, ACx, by the content of Tx and stores the low part of the accumulator, ACx(150), to the memory (Smem) location. If the 16-bit value in Tx is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value. The input operand is shifted in the D-unit shifter according to SXMD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with C54CM = 1, the 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of Tx define a shift quantity within 32 to +31. When the 16-bit value in Tx is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1.

Status Bits

Affected by Affects

C54CM, SXMD none

Repeat Example
Syntax *AR3 = LO(AC0 << T0)

This instruction can be repeated.

Description The content of AC0 is shifted by the content of T0 and AC0(150) is stored at the location addressed by AR3.

5-434

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [4] Syntax Smem = HI(rnd(ACx << Tx)) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Smem, Tx

1110 0111 AAAA AAAI SSss 10x%

This instruction shifts the accumulator, ACx, by the content of Tx and stores high part of the accumulator, ACx(3116), to the memory (Smem) location. If the 16-bit value in Tx is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value. The input operand is shifted in the D-unit shifter according to SXMD. Rounding is performed in the D-unit shifter according to RDM, if the optional rnd keyword is applied to the input operand. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with C54CM = 1, the 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of Tx define a shift quantity within 32 to +31. When the 16-bit value in Tx is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1.

Status Bits

Affected by Affects

C54CM, RDM, SXMD none

Repeat Example
Syntax

This instruction can be repeated.

Description The content of AC0 is shifted by the content of T0, is rounded, and AC0(3116) is stored at the location addressed by AR3.

*AR3 = HI(rnd(AC0 << T0))

SPRU375G

Instruction Set Descriptions

5-435

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [5] Syntax Smem = LO(ACx << #SHIFTW) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, SHIFTW, Smem

1110 1001 AAAA AAAI SSSH IFTW

This instruction shifts the accumulator, ACx, by the 6-bit value, SHIFTW, and stores the low part of the accumulator, ACx(150), to the memory (Smem) location. The input operand is shifted by the 6-bit value in the D-unit shifter according to SXMD. Affected by Affects SXMD none

Status Bits

Repeat Example
Syntax *AR3 = LO(AC0 << #31)

This instruction can be repeated.

Description The content of AC0 is shifted left by 31 bits and AC0(150) is stored at the location addressed by AR3.

5-436

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [6] Syntax Smem = HI(ACx << #SHIFTW) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, SHIFTW, Smem

1110 1010 AAAA AAAI SSSH IFTW

This instruction shifts the accumulator, ACx, by the 6-bit value, SHIFTW, and stores the high part of the accumulator, ACx(3116), to the memory (Smem) location. The input operand is shifted by the 6-bit value in the D-unit shifter according to SXMD. Affected by Affects SXMD none

Status Bits

Repeat Example
Syntax *AR3 = HI(AC0 << #31)

This instruction can be repeated.

Description The content of AC0 is shifted left by 31 bits and AC0(3116) is stored at the location addressed by AR3.

SPRU375G

Instruction Set Descriptions

5-437

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [7] Syntax Smem = HI(rnd(ACx << #SHIFTW)) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1111 1010 AAAA AAAI xxSH IFTW SSxx x0x% ACx, SHIFTW, Smem This instruction shifts the accumulator, ACx, by the 6-bit value, SHIFTW, and stores the high part of the accumulator, ACx(3116), to the memory (Smem) location. The input operand is shifted by the 6-bit value in the D-unit shifter according to SXMD. Rounding is performed in the D-unit shifter according to RDM, if the optional rnd keyword is applied to the input operand. Affected by Affects RDM, SXMD none

Status Bits

Repeat

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax *AR3 = HI(rnd(AC0 << #31)) Description The content of AC0 is shifted left by 31 bits, is rounded, and AC0(3116) is stored at the location addressed by AR3.

5-438

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [8] Syntax Smem = HI(saturate(uns(rnd(ACx)))) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Smem

1110 1000 AAAA AAAI SSxx x1u%

This instruction stores the high part of the accumulator, ACx(3116), to the memory (Smem) location.
- Input operands are considered signed or unsigned according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is considered unsigned. If the optional uns keyword is not applied to the input operand, the content of the memory location is considered signed.

- If the optional rnd keyword is applied to the input operand, rounding is

performed in the D-unit shifter according to RDM.


- When a rounding overflow is detected and if the optional saturate keyword

is applied to the input operand, the 40-bit output of the operation is saturated:
J J

If the optional uns keyword is applied to the input operand, saturation value is 00 FFFF FFFFh. If the optional uns keyword is not applied, saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow).

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with C54CM = 1, overflow detection at the output of the shifter consists of checking if the sign of the input operand is identical to the most-significant bits of the 40-bit result of the round operation:
- If the optional uns keyword is applied to the input operand, then bits 3932

of the result are compared to 0.


- If the optional uns keyword is not applied to the input operand, then bits

3931 of the result are compared to bit 39 of the input operand and SXMD. Status Bits Affected by Affects
SPRU375G

C54CM, RDM, SXMD none


Instruction Set Descriptions 5-439

Store Accumulator Content to Memory

Repeat Example
Syntax

This instruction can be repeated.

Description The unsigned content of AC0 is rounded, is saturated, and AC0(3116) is stored at the location addressed by AR3.

*AR3 = HI(saturate(uns(rnd(AC0))))

5-440

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [9] Syntax Smem = HI(saturate(uns(rnd(ACx << Tx)))) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Smem, Tx

1110 0111 AAAA AAAI SSss 11u%

This instruction shifts the accumulator, ACx, by the content of Tx and stores the high part of the accumulator, ACx(3116), to the memory (Smem) location. If the 16-bit value in Tx is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value.
- Input operands are considered signed or unsigned according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is considered unsigned. If the optional uns keyword is not applied to the input operand, the content of the memory location is considered signed.

- The input operand is shifted in the D-unit shifter according to SXMD. - When shifting, the sign position of the input operand is compared to the

shift quantity.
J J

If the optional uns keyword is applied to the input operand, this comparison is performed against bit 32 of the shifted operand. If the optional uns keyword is not applied, this comparison is performed against bit 31 of the shifted operand that is considered signed (the sign is defined by bit 39 of the input operand and SXMD). An overflow is generated accordingly.

- If the optional rnd keyword is applied to the input operand, rounding is

performed in the D-unit shifter according to RDM.


- When a shift or rounding overflow is detected and if the optional saturate

keyword is applied to the input operand, the 40-bit output of the operation is saturated:
J J

If the optional uns keyword is applied to the input operand, saturation value is 00 FFFF FFFFh. If the optional uns keyword is not applied, saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow).
Instruction Set Descriptions 5-441

SPRU375G

Store Accumulator Content to Memory

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with C54CM = 1:
- Overflow detection at the output of the shifter consists of checking if the

sign of the input operand is identical to the most-significant bits of the 40-bit result of the shift and round operation.
J J

If the optional uns keyword is applied to the input operand, then bits 3932 of the result are compared to 0. If the optional uns keyword is not applied to the input operand, then bits 3931 of the result are compared to bit 39 of the input operand and SXMD.

- The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of

Tx define a shift quantity within 32 to +31. When the 16-bit value in Tx is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat Example
Syntax *AR3 = HI(saturate(uns(rnd(AC0 << T0)))) Description The unsigned content of AC0 is shifted by the content of T0, is rounded, is saturated, and AC0(3116) is stored at the location addressed by AR3.

C54CM, RDM, SXMD none

This instruction can be repeated.

5-442

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [10] Syntax Smem = HI(saturate(uns(rnd(ACx << #SHIFTW)))) Parallel Enable Bit No Size 4 Cycles Pipeline 1 X

Opcode Operands Description

1111 1010 AAAA AAAI uxSH IFTW SSxx x1x% ACx, SHIFTW, Smem This instruction shifts the accumulator, ACx, by the 6-bit value, SHIFTW, and stores the high part of the accumulator, ACx(3116), to the memory (Smem) location.
- Input operands are considered signed or unsigned according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is considered unsigned. If the optional uns keyword is not applied to the input operand, the content of the memory location is considered signed.

- The input operand is shifted by the 6-bit value in the D-unit shifter

according to SXMD.
- When shifting, the sign position of the input operand is compared to the

shift quantity.
J J

If the optional uns keyword is applied to the input operand, this comparison is performed against bit 32 of the shifted operand. If the optional uns keyword is not applied, this comparison is performed against bit 31 of the shifted operand that is considered signed (the sign is defined by bit 39 of the input operand and SXMD). An overflow is generated accordingly.

- If the optional rnd keyword is applied to the input operand, rounding is

performed in the D-unit shifter according to RDM.


- When a shift or rounding overflow is detected and if the optional saturate

keyword is applied to the input operand, the 40-bit output of the operation is saturated:
J J

If the optional uns keyword is applied to the input operand, saturation value is 00 FFFF FFFFh. If the optional uns keyword is not applied, saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow).
Instruction Set Descriptions 5-443

SPRU375G

Store Accumulator Content to Memory

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with C54CM = 1, overflow detection at the output of the shifter consists of checking if the sign of the input operand is identical to the most-significant bits of the 40-bit result of the shift and round operation.
- If the optional uns keyword is applied to the input operand, then bits 3932

of the result are compared to 0.


- If the optional uns keyword is not applied to the input operand, then bits

3931 of the result are compared to bit 39 of the input operand and SXMD. Status Bits Affected by Affects Repeat C54CM, RDM, SXMD none

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax *AR3 = HI(saturate(uns(rnd(AC0 << #31)))) Description The unsigned content of AC0 is shifted left by 31 bits, is rounded, is saturated, and AC0(3116) is stored at the location addressed by AR3.

5-444

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [11] Syntax dbl(Lmem) = ACx Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem

1110 1011 AAAA AAAI xxSS 10x0

This instruction stores the content of the accumulator, ACx(310), to the data memory operand (Lmem). The store operation to the memory location uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs. Affected by Affects none none

Status Bits

Repeat Example
Syntax dbl(*AR3) = AC0

This instruction can be repeated.

Description The content of AC0 is stored at the locations addressed by AR3 and AR3 + 1.

SPRU375G

Instruction Set Descriptions

5-445

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [12] Syntax dbl(Lmem) = saturate(uns(ACx)) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem

1110 1011 AAAA AAAI xxSS 10u1

This instruction stores the content of the accumulator, ACx(310), to the data memory operand (Lmem).
- Input operands are considered signed or unsigned according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is considered unsigned. If the optional uns keyword is not applied to the input operand, the content of the memory location is considered signed.

- The 40-bit output of the operation is saturated: J J

If the optional uns keyword is applied to the input operand, saturation value is 00 FFFF FFFFh. If the optional uns keyword is not applied, saturation values are 00 7FFF FFFFh (positive overflow) or FF 8000 0000h (negative overflow).

- The store operation to the memory location uses the D-unit shifter.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with C54CM = 1, overflow detection at the output of the shifter consists of checking if the sign of the input operand is identical to the most-significant bits of the 40-bit result of the shift and round operation.
- If the optional uns keyword is applied to the input operand, then bits 3932

of the result are compared to 0.


- If the optional uns keyword is not applied to the input operand, then bits

3931 of the result are compared to bit 39 of the input operand and SXMD. Status Bits Affected by Affects
5-446 Instruction Set Descriptions

C54CM, SXMD none


SPRU375G

Store Accumulator Content to Memory

Repeat Example
Syntax

This instruction can be repeated.

Description The unsigned content of AC0 is saturated and stored at the locations addressed by AR3 and AR3 + 1.

dbl(*AR3) = saturate(uns(AC0))

SPRU375G

Instruction Set Descriptions

5-447

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [13] Syntax HI(Lmem) = HI(ACx) >> #1, LO(Lmem) = LO(ACx) >> #1 Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem

1110 1011 AAAA AAAI xxSS 1101

This instruction performs two store operations in parallel and is executed in the D-unit shifter:
- The 16 highest bits of the accumulator, ACx(3116), shifted right by 1 bit

(bit 31 is sign extended according to SXMD), are stored to the 16 highest bits of the data memory operand (Lmem).
- The 16 lowest bits, ACx(150), shifted right by 1 bit (bit 15 is sign extended

according to SXMD), are stored to the 16 lowest bits of the data memory operand (Lmem). Status Bits Affected by Affects Repeat Example
Syntax HI(*AR1) = HI(AC0) >> #1, LO(*AR1) = LO(AC0) >> #1 Description The content of AC0(3116), shifted right by 1 bit, is stored at the location addressed by AR1 and the content of AC0(150), shifted right by 1 bit, is stored at the location addressed by AR1 + 1.

SXMD none

This instruction can be repeated.

5-448

Instruction Set Descriptions

SPRU375G

Store Accumulator Content to Memory

Store Accumulator Content to Memory


Syntax Characteristics
No. [14] Syntax Xmem = LO(ACx), Ymem = HI(ACx) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Xmem, Ymem

1000 0000 XXXM MMYY YMMM 10SS

This instruction performs two store operations in parallel:


- The 16 lowest bits of the accumulator, ACx(150), are stored to data

memory operand Xmem.


- The 16 highest bits, ACx(3116), are stored to data memory operand Ymem.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax *AR1 = LO(AC0), *AR2 = HI(AC0)
Before AC0 AR1 AR2 200 201

This instruction can be repeated.

Description The content of AC0(150) is stored at the location addressed by AR1 and the content of AC0(3116) is stored at the location addressed by AR2.
After 01 4500 0030 0200 0201 3400 0FD3 AC0 AR1 AR2 200 201 01 4500 0030 0200 0201 0030 4500

SPRU375G

Instruction Set Descriptions

5-449

Store Accumulator Pair Content to Memory

Store Accumulator Pair Content to Memory


Syntax Characteristics
No. [1] [2] Syntax Lmem = pair(HI(ACx)) Lmem = pair(LO(ACx)) Parallel Enable Bit No No Size 3 3 Cycles Pipeline 1 1 X X

Description

This instruction stores the content of the selected accumulator pair, ACx and AC(x + 1), to a data memory operand (Lmem). Affected by Affects none none

Status Bits

See Also

See the following other related instructions:


- Addition with Parallel Store Accumulator Content to Memory - Load Accumulator from Memory with Parallel Store Accumulator Content

to Memory
- Load Accumulator, Auxiliary, or Temporary Register from Memory - Multiply and Accumulate with Parallel Store Accumulator Content to Memory - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Multiply with Parallel Store Accumulator Content to Memory - Store Accumulator Content to Memory - Store Accumulator, Auxiliary, or Temporary Register Content to Memory - Store Auxiliary or Temporary Register Pair Content to Memory - Subtraction with Parallel Store Accumulator Content to Memory

5-450

Instruction Set Descriptions

SPRU375G

Store Accumulator Pair Content to Memory

Store Accumulator Pair Content to Memory


Syntax Characteristics
No. [1] Syntax Lmem = pair(HI(ACx)) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem

1110 1011 AAAA AAAI xxSS 1110

This instruction stores the 16 highest bits of the accumulator, ACx(3116), to the 16 highest bits of the data memory operand (Lmem) and stores the 16 highest bits of AC(x + 1) to the16 lowest bits of data memory operand (Lmem):
- The store operation to the memory location uses a dedicated path

independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.
- Valid accumulators are AC0 and AC2.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax *AR1+ = pair(HI(AC0))

This instruction can be repeated.

Description The content of AC0(3116) is stored at the location addressed by AR1 and the content of AC1(3116) is stored at the location addressed by AR1 + 1. AR1 is incremented by 2.
After 01 4500 0030 03 5644 F800 0200 3400 0FD3 AC0 AC1 AR1 200 201 01 4500 0030 03 5644 F800 0202 4500 5644

Before AC0 AC1 AR1 200 201

SPRU375G

Instruction Set Descriptions

5-451

Store Accumulator Pair Content to Memory

Store Accumulator Pair Content to Memory


Syntax Characteristics
No. [2] Syntax Lmem = pair(LO(ACx)) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, Lmem

1110 1011 AAAA AAAI xxSS 1111

This instruction stores the 16 lowest bits of the accumulator, ACx(150), to the 16 highest bits of the data memory operand (Lmem) and stores the 16 lowest bits of AC(x + 1) to the16 lowest bits of data memory operand (Lmem):
- The store operation to the memory location uses a dedicated path

independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.
- Valid accumulators are AC0 and AC2.

Status Bits

Affected by Affects

none none

Repeat Example
Syntax *AR3 = pair(LO(AC0))

This instruction can be repeated.

Description The content of AC0(150) is stored at the location addressed by AR3 and the content of AC1(150) is stored at the location addressed by AR3 + 1.

5-452

Instruction Set Descriptions

SPRU375G

Store Accumulator, Auxiliary, or Temporary Register Content to Memory

Store Accumulator, Auxiliary, or Temporary Register Content to Memory


Syntax Characteristics
No. [1] [2] [3] Syntax Smem = src high_byte(Smem) = src low_byte(Smem) = src Parallel Enable Bit No No No Size 2 3 3 Cycles 1 1 1 Pipeline X X X

Description

This instruction stores the content of the selected source (src) register to a memory (Smem) location. Affected by Affects none none

Status Bits

See Also

See the following other related instructions:


- Addition with Parallel Store Accumulator Content to Memory - Load Accumulator from Memory with Parallel Store Accumulator Content

to Memory
- Load Accumulator, Auxiliary, or Temporary Register from Memory - Multiply and Accumulate with Parallel Store Accumulator Content to Memory - Multiply and Subtract with Parallel Store Accumulator Content to Memory - Multiply with Parallel Store Accumulator Content to Memory - Store Accumulator Content to Memory - Store Accumulator Pair Content to Memory - Store Auxiliary or Temporary Register Pair Content to Memory - Subtraction with Parallel Store Accumulator Content to Memory

SPRU375G

Instruction Set Descriptions

5-453

Store Accumulator, Auxiliary, or Temporary Register Content to Memory

Store Accumulator, Auxiliary, or Temporary Register Content to Memory


Syntax Characteristics
No. [1] Syntax Smem = src Parallel Enable Bit No Size 2 Cycles 1 Pipeline X

Opcode Operands Description Smem, src

1100 FSSS AAAA AAAI

This instruction stores the content of the source (src) register to a memory (Smem) location.
- When the source register is an accumulator: J J

The low part of the accumulator, ACx(150), is stored to the memory location. The store operation to the memory location uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.

- When the source register is an auxiliary or temporary register: J J

The content of the auxiliary or temporary register is stored to the memory location. The store operation to the memory location uses a dedicated path independent of the A-unit ALU. none none

Status Bits

Affected by Affects

Repeat Example
Syntax *(#0E10h) = AC0
Before AC0 0E10

This instruction can be repeated.

Description The content of AC0(150) is stored at location E10h.


After 23 0400 6500 0000 AC0 0E10 23 0400 6500 6500

5-454

Instruction Set Descriptions

SPRU375G

Store Accumulator, Auxiliary, or Temporary Register Content to Memory

Store Accumulator, Auxiliary, or Temporary Register Content to Memory


Syntax Characteristics
No. [2] Syntax high_byte(Smem) = src Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description Smem, src

1110 0101 AAAA AAAI FSSS 01x0

This instruction stores the low byte (bits 70) of the source (src) register to the high byte (bits 158) of the memory (Smem) location. The low byte (bits 70) of Smem is unchanged.
- When the source register is an accumulator: J J

The low part of the accumulator, ACx(70), is stored to the high byte of the memory location. The store operation to the memory location uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.

- When the source register is an auxiliary or temporary register: J J

The low part (bits 70) content of the auxiliary or temporary register is stored to the high byte of the memory location. The store operation to the memory location uses a dedicated path independent of the A-unit ALU.

- In this instruction, Smem cannot reference to a memory-mapped register

(MMR). This instruction cannot access a byte within an MMR. If Smem is an MMR, the DSP sends a hardware bus-error interrupt (BERRINT) request to the CPU. Status Bits Affected by Affects Repeat Example
Syntax high_byte(*AR1) = AC1 Description The content of AC1(70) is stored in the high byte (bits 158) at the location addressed by AR1.
After 20 FC00 6788 0200 6903 AC1 AR1 200 20 FC00 6788 0200 8803

none none

This instruction can be repeated.

Before AC1 AR1 200

SPRU375G

Instruction Set Descriptions

5-455

Store Accumulator, Auxiliary, or Temporary Register Content to Memory

Store Accumulator, Auxiliary, or Temporary Register Content to Memory


Syntax Characteristics
No. [3] Syntax low_byte(Smem) = src Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description Smem, src

1110 0101 AAAA AAAI FSSS 01x1

This instruction stores the low byte (bits 70) of the source (src) register to the low byte (bits 70) of the memory (Smem) location. The high byte (bits 158) of Smem is unchanged.
- When the source register is an accumulator: J J

The low part of the accumulator, ACx(70), is stored to the low byte of the memory location. The store operation to the memory location uses a dedicated path independent of the D-unit ALU, the D-unit shifter, and the D-unit MACs.

- When the source register is an auxiliary or temporary register: J J

The low part (bits 70) content of the auxiliary or temporary register is stored to the low byte of the memory location. The store operation to the memory location uses a dedicated path independent of the A-unit ALU.

- In this instruction, Smem cannot reference to a memory-mapped register

(MMR). This instruction cannot access a byte within an MMR. If Smem is an MMR, the DSP sends a hardware bus-error interrupt (BERRINT) request to the CPU. Status Bits Affected by Affects Repeat Example
Syntax low_byte(*AR3) = AC0 Description The content of AC0(70) is stored in the low byte (bits 70) at the location addressed by AR3.

none none

This instruction can be repeated.

5-456

Instruction Set Descriptions

SPRU375G

Store Auxiliary or Temporary Register Pair Content to Memory

Store Auxiliary or Temporary Register Pair Content to Memory


Syntax Characteristics
No. [1] Syntax Lmem = pair(TAx) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description TAx, Lmem

1110 1011 AAAA AAAI FSSS 1100

This instruction stores the content of the temporary or auxiliary register (TAx) to the 16 highest bits of the data memory operand (Lmem) and stores the content of TA(x + 1) to the 16 lowest bits of data memory operand (Lmem):
- The store operation to the memory location uses a dedicated path

independent of the A-unit ALU.


- Valid auxiliary registers are AR0, AR2, AR4, and AR6. - Valid temporary registers are T0 and T2.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Load Accumulator, Auxiliary, or Temporary Register from Memory - Store Accumulator, Auxiliary, or Temporary Register Content to Memory

Example
Syntax *AR2 = pair(T0) Description The content of T0 is stored at the location addressed by AR2 and the content of T1 is stored at the location addressed by AR2 + 1.

SPRU375G

Instruction Set Descriptions

5-457

Store CPU Register Content to Memory

Store CPU Register Content to Memory

Syntax Characteristics
Parallel Enable Bit No No No No No No No No No No No No No No No No No No No No

No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20]

Syntax Smem = BK03 Smem = BK47 Smem = BKC Smem = BSA01 Smem = BSA23 Smem = BSA45 Smem = BSA67 Smem = BSAC Smem = BRC0 Smem = BRC1 Smem = CDP Smem = CSR Smem = DP Smem = DPH Smem = PDP Smem = SP Smem = SSP Smem = TRN0 Smem = TRN1 dbl(Lmem) = RETA

Size 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3

Cycles 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 5

Pipeline X X X X X X X X X X X X X X X X X X X X

Opcode Operands
5-458

See Table 56 (page 5-461). Lmem, Smem


Instruction Set Descriptions SPRU375G

Store CPU Register Content to Memory

Description

These instructions store the content of the selected source CPU register to a memory (Smem) location or a data memory operand (Lmem). For instructions [9] and [10], the block repeat register (BRCx) is decremented in the address phase of the last instruction of the loop. These instructions have a 3-cycle latency requirement versus the last instruction of the loop. For instruction [20], the content of the 24-bit RETA register (the return address of the calling subroutine) and the 8-bit CFCT register (active control flow execution context flags of the calling subroutine) are stored to the data memory operand (Lmem):
- The content of the CFCT register and the 8 highest bits of the RETA

register are stored in the 16 highest bits of Lmem.


- The 16 lowest bits of the RETA register are stored in the 16 lowest bits of

Lmem. When instruction [20] is decoded, the CPU pipeline is flushed and the instruction is executed in 5 cycles, regardless of the instruction context. Status Bits Affected by Affects Repeat See Also none none

Instruction [20] cannot be repeated; all other instructions can be repeated. See the following other related instructions:
- Load CPU Register from Memory - Load CPU Register with Immediate Value - Move CPU Register Content to Auxiliary or Temporary Register - Store Accumulator Content to Memory - Store Accumulator Pair Content to Memory - Store Accumulator, Auxiliary, or Temporary Register Content to Memory - Store Auxiliary or Temporary Register Pair Content to Memory

Example 1
Syntax *AR1+ = SP Description The content of the data stack pointer (SP) is stored in the location addressed by AR1. AR1 is incremented by 1.
After 0200 0200 0000 AR1 SP 200 0201 0200 0200

Before AR1 SP 200

SPRU375G

Instruction Set Descriptions

5-459

Store CPU Register Content to Memory

Example 2
Syntax *AR1+ = SSP Description The content of the system stack pointer (SSP) is stored in the location addressed by AR1. AR1 is incremented by 1.
After 0201 0000 00FF AR1 SSP 201 0202 0000 0000

Before AR1 SSP 201

Example 3
Syntax *AR1+ = TRN0 Description The content of the transition register (TRN0) is stored in the location addressed by AR1. AR1 is incremented by 1.
After 0202 3490 0000 AR1 TRN0 202 0203 3490 3490

Before AR1 TRN0 202

Example 4
Syntax *AR1+ = TRN1 Description The content of the transition register (TRN1) is stored in the location addressed by AR1. AR1 is incremented by 1.
After 0203 0020 0000 AR1 TRN1 203 0204 0020 0020

Before AR1 TRN1 203

Example 5
Syntax dbl(*AR3) = RETA Description The contents of the RETA and CFCT are stored in the location addressed by AR3 and AR3 + 1.

5-460

Instruction Set Descriptions

SPRU375G

Store CPU Register Content to Memory

Table 56. Opcodes for Store CPU Register Content to Memory Instruction
No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] Syntax Smem = BK03 Smem = BK47 Smem = BKC Smem = BSA01 Smem = BSA23 Smem = BSA45 Smem = BSA67 Smem = BSAC Smem = BRC0 Smem = BRC1 Smem = CDP Smem = CSR Smem = DP Smem = DPH Smem = PDP Smem = SP Smem = SSP Smem = TRN0 Smem = TRN1 dbl(Lmem) = RETA Opcode

1110 0101 AAAA AAAI 1001 10xx 1110 0101 AAAA AAAI 1010 10xx 1110 0101 AAAA AAAI 1011 10xx 1110 0101 AAAA AAAI 0010 10xx 1110 0101 AAAA AAAI 0011 10xx 1110 0101 AAAA AAAI 0100 10xx 1110 0101 AAAA AAAI 0101 10xx 1110 0101 AAAA AAAI 0110 10xx 1110 0101 AAAA AAAI x001 11xx 1110 0101 AAAA AAAI x010 11xx 1110 0101 AAAA AAAI 0001 10xx 1110 0101 AAAA AAAI x000 11xx 1110 0101 AAAA AAAI 0000 10xx 1110 0101 AAAA AAAI 1100 10xx 1110 0101 AAAA AAAI 1111 10xx 1110 0101 AAAA AAAI 0111 10xx 1110 0101 AAAA AAAI 1000 10xx 1110 0101 AAAA AAAI x011 11xx 1110 0101 AAAA AAAI x100 11xx 1110 1011 AAAA AAAI xxxx 01xx

SPRU375G

Instruction Set Descriptions

5-461

Store Extended Auxiliary Register Content to Memory

Store Extended Auxiliary Register Content to Memory


Syntax Characteristics
Parallel Enable Bit No

No. [1]

Syntax dbl(Lmem) = XAsrc

Size 3

Cycles 1

Pipeline X

Opcode Operands Description Lmem, XAsrc

1110 1101 AAAA AAAI XSSS 0101

This instruction moves the content of the 23-bit source register (XARx, XSP, XSSP, XDP, or XCDP) to the 32-bit data memory location addressed by data memory operand (Lmem). The upper 9 bits of the data memory are filled with 0: Affected by Affects none none

Status Bits

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Load Extended Auxiliary Register from Memory - Load Extended Auxiliary Register with Immediate Value - Modify Extended Auxiliary Register Content - Move Extended Auxiliary Register Content

Example
Syntax dbl(*AR3) = XAR1 Description The 7 highest bits of XAR1 are moved to the 7 lowest bits of the location addressed by AR3, the 9 highest bits are filled with 0, and the 16 lowest bits of XAR1 are moved to the location addressed by AR3 + 1.
After 7F 3492 0200 3765 0FD3 XAR1 AR3 200 201 7F 3492 0200 007F 3492

Before XAR1 AR3 200 201

5-462

Instruction Set Descriptions

SPRU375G

Subtract Conditionally (subc)

Subtract Conditionally
Syntax Characteristics
No. [1] Syntax subc(Smem, ACx, ACy) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 1110 AAAA AAAI SSDD 0011

This instruction performs a conditional subtraction in the D-unit ALU. The D-unit shifter is not used to perform the memory operand shift.
- The 16-bit data memory operand Smem is sign extended to 40 bits

according to SXMD, shifted left by 15 bits, and subtracted from the content of the source accumulator ACx.
J J

The shift operation is equivalent to the signed shift instruction. Overflow and carry bit is always detected at bit position 31. The subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit. If an overflow is detected and reported in accumulator overflow bit ACOVy, no saturation is performed on the result of the operation.

- If the result of the subtraction is greater than 0 (bit 39 = 0), the result is

shifted left by 1 bit, added to 1, and stored in the destination accumulator ACy.
- If the result of the subtraction is less than 0 (bit 39 = 1), the source

accumulator ACx is shifted left by 1 bit and stored in the destination accumulator ACy.
if ((ACx (Smem << #15)) >= 0) ACy = (ACx (Smem << #15)) << #1 + 1 else ACy = ACx << #1

This instruction is used to make a 16 step 16-bit by 16-bit division. The divisor and the dividend are both assumed to be positive in this instruction. SXMD affects this operation:
- If SXMD = 1, the divisor must have a 0 value in the most significant bit - If SXMD = 0, any 16-bit divisor value produces the expected result

The dividend, which is in the source accumulator ACx, must be positive (bit 31 = 0) during the computation.
SPRU375G Instruction Set Descriptions 5-463

Subtract Conditionally (subc)

Status Bits

Affected by Affects

SXMD ACOVy, CARRY

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Addition or Subtraction Conditionally - Addition or Subtraction Conditionally with Shift - Addition, Subtraction, or Move Accumulator Content Conditionally - Dual 16-Bit Subtraction and Addition - Subtraction - Subtraction with Parallel Store Accumulator Content to Memory

Example 1
Syntax subc(*AR1, AC0, AC1) Description The content addressed by AR1 shifted left by 15 bits is subtracted from the content of AC0. The result is greater than 0; therefore, the result is shifted left by 1 bit, added to 1, and the new result stored in AC1. The result generated an overflow and a carry.
After AC0 AC1 AR1 300 SXMD ACOV1 CARRY 23 4300 0000 46 8400 0001 300 200 0 1 1

Before AC0 AC1 AR1 300 SXMD ACOV1 CARRY

23 4300 0000 00 0000 0000 300 200 0 0 0

Example 2
Syntax repeat (CSR) subc(*AR1, AC1, AC1) Description The content addressed by AR1 shifted left by 15 bits is subtracted from the content of AC1. The result is greater than 0; therefore, the result is shifted left by 1 bit, added to 1, and the new result stored in AC1. The content addressed by AR1 shifted left by 15 bits is subtracted from the content of AC1. The result is greater than 0; therefore, the result is shifted left by 1 bit, added to 1, and the new result stored in AC1. The result generated a carry.
After AC1 AR1 200 CSR ACOV1 CARRY 00 1A18 0007 200 0100 0 0 1

Before AC1 AR1 200 CSR ACOV1 CARRY

00 0746 0000 200 0100 1 0 0

5-464

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit Yes Yes No No No Yes Yes No No No No No No No No No No No

No. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18]

Syntax dst = dst src dst = dst k4 dst = src K16 dst = src Smem dst = Smem src ACy = ACy (ACx << Tx) ACy = ACy (ACx << #SHIFTW) ACy = ACx (K16 << #16) ACy = ACx (K16 << #SHFT) ACy = ACx (Smem << Tx) ACy = ACx (Smem << #16) ACy = (Smem << #16) ACx ACy = ACx uns(Smem) BORROW ACy = ACx uns(Smem) ACy = ACx (uns(Smem) << #SHIFTW) ACy = ACx dbl(Lmem) ACy = dbl(Lmem) ACx ACx = (Xmem << #16) (Ymem << #16)

Size 2 2 4 3 3 2 3 4 4 3 3 3 3 3 4 3 3 3

Cycles 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

Pipeline X X X X X X X X X X X X X X X X X X

Description Status Bits

These instructions perform a subtraction operation. Affected by Affects CARRY, C54CM, M40, SATA, SATD, SXMD ACOVx, ACOVy, CARRY

SPRU375G

Instruction Set Descriptions

5-465

Subtraction

See Also

See the following other related instructions:


- Addition - Addition or Subtraction Conditionally - Addition or Subtraction Conditionally with Shift - Addition, Subtraction, or Move Accumulator Content Conditionally - Dual 16-Bit Addition and Subtraction - Dual 16-Bit Subtractions - Dual 16-Bit Subtraction and Addition - Subtract Conditionally - Subtraction with Parallel Store Accumulator Content to Memory

5-466

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
No. [1] Syntax dst = dst src Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description dst, src

0010 011E FSSS FDDD

This instruction performs a subtraction operation between two registers.


- When the destination operand (dst) is an accumulator: J J J

The operation is performed on 40 bits in the D-unit ALU. Input operands are sign extended to 40 bits according to SXMD. If an auxiliary or temporary register is the source operand (src) of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended according to SXMD. Overflow detection and CARRY status bit depends on M40. The subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit. When an overflow is detected, the accumulator is saturated according to SATD.

- When the destination operand (dst) is an auxiliary or temporary register: J J J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source operand (src) of the instruction, the 16 LSBs of the accumulator are used to perform the operation. Overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 AC1 Description The content of AC1 is subtracted from the content of AC0 and the result is stored in AC0.

M40, SATA, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-467

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit Yes

No. [2]

Syntax dst = dst k4

Size 2

Cycles 1

Pipeline X

Opcode Operands Description dst, k4

0100 011E kkkk FDDD

This instruction subtracts a 4-bit unsigned constant, k4, from a register.


- When the destination operand (dst) is an accumulator: J J

The operation is performed on 40 bits in the D-unit ALU. Overflow detection and CARRY status bit depends on M40. The subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit. When an overflow is detected, the accumulator is saturated according to SATD.

- When the destination operand (dst) is an auxiliary or temporary register: J J J

The operation is performed on 16 bits in the A-unit ALU. Overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 #15 Description An unsigned 4-bit value (15) is subtracted from the content of AC0 and the result is stored in AC0.

M40, SATA, SATD ACOVx, CARRY

This instruction can be repeated.

5-468

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [3]

Syntax dst = src K16

Size 4

Cycles 1

Pipeline X

Opcode Operands Description dst, K16, src

0111 1100 KKKK KKKK KKKK KKKK FDDD FSSS

This instruction subtracts a 16-bit signed constant, K16, from a register.


- When the destination operand (dst) is an accumulator: J J

The operation is performed on 40 bits in the D-unit ALU. If an auxiliary or temporary register is the source operand (src) of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended according to SXMD. The 16-bit constant, K16, is sign extended to 40 bits according to SXMD. Overflow detection and CARRY status bit depends on M40. The subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit. When an overflow is detected, the accumulator is saturated according to SATD.

J J

- When the destination operand (dst) is an auxiliary or temporary register: J J J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source operand (src) of the instruction, the 16 LSBs of the accumulator are used to perform the operation. Overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
SPRU375G Instruction Set Descriptions 5-469

Subtraction

Status Bits

Affected by Affects

M40, SATA, SATD, SXMD ACOVx, CARRY

Repeat Example
Syntax AC0 = AC1 FFFFh

This instruction can be repeated.

Description A signed 16-bit value (FFFFh) is subtracted from the content of AC1 and the result is stored in AC0.

5-470

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [4]

Syntax dst = src Smem

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, Smem, src

1101 0111 AAAA AAAI FDDD FSSS

This instruction subtracts the content of a memory (Smem) location from a register content.
- When the destination operand (dst) is an accumulator: J J

The operation is performed on 40 bits in the D-unit ALU. If an auxiliary or temporary register is the source operand (src) of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended according to SXMD. The content of the memory location is sign extended to 40 bits according to SXMD. Overflow detection and CARRY status bit depends on M40. The subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit. When an overflow is detected, the accumulator is saturated according to SATD.

J J

- When the destination operand (dst) is an auxiliary or temporary register: J J J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source operand (src) of the instruction, the 16 LSBs of the accumulator are used to perform the operation. Overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
SPRU375G Instruction Set Descriptions 5-471

Subtraction

Status Bits

Affected by Affects

M40, SATA, SATD, SXMD ACOVx, CARRY

Repeat Example
Syntax AC0 = AC1 *AR3

This instruction can be repeated.

Description The content addressed by AR3 is subtracted from the content of AC1 and the result is stored in AC0.

5-472

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [5]

Syntax dst = Smem src

Size 3

Cycles 1

Pipeline X

Opcode Operands Description dst, Smem, src

1101 1000 AAAA AAAI FDDD FSSS

This instruction subtracts a register content from the content of a memory (Smem) location.
- When the destination operand (dst) is an accumulator: J J

The operation is performed on 40 bits in the D-unit ALU. If an auxiliary or temporary register is the source operand (src) of the instruction, the 16 LSBs of the auxiliary or temporary register are sign extended according to SXMD. The content of the memory location is sign extended to 40 bits according to SXMD. Overflow detection and CARRY status bit depends on M40. The subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit. When an overflow is detected, the accumulator is saturated according to SATD.

J J

- When the destination operand (dst) is an auxiliary or temporary register: J J J J

The operation is performed on 16 bits in the A-unit ALU. If an accumulator is the source operand (src) of the instruction, the 16 LSBs of the accumulator are used to perform the operation. Overflow detection is done at bit position 15. When an overflow is detected, the destination register is saturated according to SATA.

Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured.
SPRU375G Instruction Set Descriptions 5-473

Subtraction

Status Bits

Affected by Affects

M40, SATA, SATD, SXMD ACOVx, CARRY

Repeat Example
Syntax AC0 = *AR3 AC1

This instruction can be repeated.

Description The content of AC1 is subtracted from the content addressed by AR3 and the result is stored in AC0.

5-474

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
No. [6] Syntax ACy = ACy (ACx << Tx) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Tx

0101 101E DDSS ss01

This instruction subtracts an accumulator content ACx shifted by the content of Tx from an accumulator content ACy.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1:
- An intermediary shift operation is performed as if M40 is locally set to 1 and

no overflow detection, report, and saturation is done after the shifting operation.
- The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of

Tx define a shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 (AC1 << T0) Description The content of AC1 shifted by the content of T0 is subtracted from the content of AC0 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-475

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit Yes

No. [7]

Syntax ACy = ACy (ACx << #SHIFTW)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, SHIFTW

0001 000E DDSS 0100 xxSH IFTW

This instruction subtracts an accumulator content ACx shifted by the 6-bit value, SHIFTW, from an accumulator content ACy.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC0 (AC1 << #31) Description The content of AC1 shifted left by 31 bits is subtracted from the content of AC0 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-476

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [8]

Syntax ACy = ACx (K16 << #16)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, K16

0111 1010 KKKK KKKK KKKK KKKK SSDD 001x

This instruction subtracts the 16-bit signed constant, K16, shifted left by 16 bits from an accumulator content ACx.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 (FFFFh << #16) Description A signed 16-bit value (FFFFh) shifted left by 16 bits is subtracted from the content of AC1 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-477

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [9]

Syntax ACy = ACx (K16 << #SHFT)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

0111 0001 KKKK KKKK KKKK KKKK SSDD SHFT ACx, ACy, K16, SHFT This instruction subtracts the 16-bit signed constant, K16, shifted left by the 4-bit value, SHFT, from an accumulator content ACx.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC1 = AC0 (#9800h << #5) Description A signed 16-bit value (9800h) shifted left by 5 bits is subtracted from the content of AC0 and the result is stored in AC1.

M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-478

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
No. [10] Syntax ACy = ACx (Smem << Tx) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Smem, Tx

1101 1101 AAAA AAAI SSDD ss01

This instruction subtracts the content of a memory (Smem) location shifted by the content of Tx from an accumulator content ACx.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1:
- An intermediary shift operation is performed as if M40 is locally set to 1 and

no overflow detection, report, and saturation is done after the shifting operation.
- The 6 LSBs of Tx are used to determine the shift quantity. The 6 LSBs of

Tx define a shift quantity within 32 to +31. When the value is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 (*AR3 << T0) Description The content addressed by AR3 shifted by the content of T0 is subtracted from the content of AC1 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-479

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [11]

Syntax ACy = ACx (Smem << #16)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 1110 AAAA AAAI SSDD 0101

This instruction subtracts the content of a memory (Smem) location shifted left by 16 bits from an accumulator content ACx.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. If the result

of the subtraction generates a borrow, the CARRY status bit is cleared; otherwise, the CARRY status bit is not affected.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 (*AR3 << #16) Description The content addressed by AR3 shifted left by 16 bits is subtracted from the content of AC1 and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-480

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [12]

Syntax ACy = (Smem << #16) ACx

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 1110 AAAA AAAI SSDD 0110

This instruction subtracts an accumulator content ACx from the content of a memory (Smem) location shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = (*AR3 << #16) AC1 Description The content of AC1 is subtracted from the content addressed by AR3 shifted left by 16 bits and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-481

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [13]

Syntax ACy = ACx uns(Smem) BORROW

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 1111 AAAA AAAI SSDD 101u

This instruction subtracts the logical complement of the CARRY status bit (borrow) and the content of a memory (Smem) location from an accumulator content ACx.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

- Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat
5-482

CARRY, M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.


Instruction Set Descriptions SPRU375G

Subtraction

Example
Syntax AC1 = AC0 uns(*AR1) BORROW Description The complement of the CARRY bit (1) and the unsigned content addressed by AR1 (F000h) are subtracted from the content of AC0 and the result is stored in AC1.
After 00 EC00 0000 00 0000 0000 0302 F000 0 AC0 AC1 AR1 302 CARRY 00 EC00 0000 00 EBFF 0FFF 0302 F000 1

Before AC0 AC1 AR1 302 CARRY

SPRU375G

Instruction Set Descriptions

5-483

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [14]

Syntax ACy = ACx uns(Smem)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Smem

1101 1111 AAAA AAAI SSDD 111u

This instruction subtracts the content of a memory (Smem) location from an accumulator content ACx.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

- Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = AC1 uns(*AR3) Description The unsigned content addressed by AR3 is subtracted from the content of AC1 and the result is stored in AC0.

M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-484

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [15]

Syntax ACy = ACx (uns(Smem) << #SHIFTW)

Size 4

Cycles 1

Pipeline X

Opcode Operands Description

1111 1001 AAAA AAAI uxSH IFTW SSDD 01xx ACx, ACy, SHIFTW, Smem This instruction subtracts the content of a memory (Smem) location shifted by the 6-bit value, SHIFTW, from an accumulator content ACx.
- The operation is performed on 40 bits in the D-unit shifter. - Input operands are extended to 40 bits according to uns. J J

If the optional uns keyword is applied to the input operand, the content of the memory location is zero extended to 40 bits. If the optional uns keyword is not applied to the input operand, the content of the memory location is sign extended to 40 bits according to SXMD.

- The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects
SPRU375G

C54CM, M40, SATD, SXMD ACOVy, CARRY


Instruction Set Descriptions 5-485

Subtraction

Repeat

This instruction cannot be repeated when using the *(#k23) absolute addressing mode to access the memory operand (Smem); when using other addressing modes, this instruction can be repeated.

Example
Syntax AC0 = AC1 (uns(*AR3) << #31) Description The unsigned content addressed by AR3 shifted left by 31 bits is subtracted from the content of AC1 and the result is stored in AC0.

5-486

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
No. [16] Syntax ACy = ACx dbl(Lmem) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description ACx, ACy, Lmem

1110 1101 AAAA AAAI SSDD 001n

This instruction subtracts the content of data memory operand dbl(Lmem) from an accumulator content ACx.
- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax Description AC0 = AC1 dbl(*AR3+) The content (long word) addressed by AR3 and AR3 + 1 is subtracted from the content of AC1 and the result is stored in AC0. Because this instruction is a long-operand instruction, AR3 is incremented by 2 after the execution.

M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-487

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [17]

Syntax ACy = dbl(Lmem) ACx

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, ACy, Lmem

1110 1101 AAAA AAAI SSDD 010x

This instruction subtracts an accumulator content ACx from the content of data memory operand dbl(Lmem).
- The data memory operand dbl(Lmem) addresses are aligned: J J

if Lmem address is even: most significant word = Lmem, least significant word = Lmem + 1 if Lmem address is odd: most significant word = Lmem, least significant word = Lmem 1

- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. Status Bits Affected by Affects Repeat Example
Syntax AC0 = dbl(*AR3) AC1 Description The content of AC1 is subtracted from the content (long word) addressed by AR3 and AR3 + 1 and the result is stored in AC0.

M40, SATD, SXMD ACOVy, CARRY

This instruction can be repeated.

5-488

Instruction Set Descriptions

SPRU375G

Subtraction

Subtraction
Syntax Characteristics
Parallel Enable Bit No

No. [18]

Syntax ACx = (Xmem << #16) (Ymem << #16)

Size 3

Cycles 1

Pipeline X

Opcode Operands Description ACx, Xmem, Ymem

1000 0001 XXXM MMYY YMMM 01DD

This instruction subtracts the content of data memory operand Ymem, shifted left 16 bits, from the content of data memory operand Xmem, shifted left 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit.
- When an overflow is detected, the accumulator is saturated according to

SATD. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation. Status Bits Affected by Affects Repeat Example
Syntax AC0 = (*AR3 << #16) (*AR4 << #16) Description The content addressed by AR4 shifted left by 16 bits is subtracted from the content addressed by AR3 shifted left by 16 bits and the result is stored in AC0.

C54CM, M40, SATD, SXMD ACOVx, CARRY

This instruction can be repeated.

SPRU375G

Instruction Set Descriptions

5-489

Subtraction with Parallel Store Accumulator Content to Memory

Subtraction with Parallel Store Accumulator Content to Memory


Syntax Characteristics
No. [1] Syntax ACy = (Xmem << #16) ACx, Ymem = HI(ACy << T2) Parallel Enable Bit No Size 4 Cycles 1 Pipeline X

Opcode Operands Description

1000 0111 XXXM MMYY YMMM SSDD 101x xxxx ACx, ACy, T2, Xmem, Ymem This instruction performs two operations in parallel: subtraction and store. The first operation subtracts an accumulator content from the content of data memory operand Xmem shifted left by 16 bits.
- The operation is performed on 40 bits in the D-unit ALU. - Input operands are sign extended to 40 bits according to SXMD. - The shift operation is equivalent to the signed shift instruction. - Overflow detection and CARRY status bit depends on M40. The

subtraction borrow bit is reported in the CARRY status bit; the borrow bit is the logical complement of the CARRY status bit. When C54CM = 1, an intermediary shift operation is performed as if M40 is locally set to 1 and no overflow detection, report, and saturation is done after the shifting operation.
- When an overflow is detected, the accumulator is saturated according to

SATD. The second operation shifts the accumulator ACy by the content of T2 and stores ACy(3116) to data memory operand Ymem. If the 16-bit value in T2 is not within 32 to +31, the shift is saturated to 32 or +31 and the shift is performed with this value.
- The input operand is shifted in the D-unit shifter according to SXMD. - After the shift, the high part of the accumulator, ACy(3116), is stored to

the memory location. Compatibility with C54x devices (C54CM = 1) When this instruction is executed with M40 = 0, compatibility is ensured. When this instruction is executed with C54CM = 1, the 6 LSBs of T2 are used to determine the shift quantity. The 6 LSBs of T2 define a shift quantity within 32 to +31. When the 16-bit value in T2 is between 32 to 17, a modulo 16 operation transforms the shift quantity to within 16 to 1.
5-490 Instruction Set Descriptions SPRU375G

Subtraction with Parallel Store Accumulator Content to Memory

Status Bits

Affected by Affects

C54CM, M40, SATD, SXMD ACOVy, CARRY

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Addition or Subtraction Conditionally - Addition or Subtraction Conditionally with Shift - Addition, Subtraction, or Move Accumulator Content Conditionally - Dual 16-Bit Addition and Subtraction - Dual 16-Bit Subtractions - Dual 16-Bit Subtraction and Addition - Subtraction - Subtract Conditionally

Example
Syntax AC0 = (*AR3 << #16) AC1, *AR4 = HI(AC0 << T2) Description Both instructions are performed in parallel. The content of AC1 is subtracted from the content addressed by AR3 shifted left by 16 bits and the result is stored in AC0. The content of AC0 is shifted by the content of T2, and AC0(3116) is stored at the address of AR4.

SPRU375G

Instruction Set Descriptions

5-491

Swap Accumulator Content (swap)

Swap Accumulator Content


Syntax Characteristics
No. Syntax swap(ACx, ACy) [1] [2] swap(AC0, AC2) swap(AC1, AC3) Yes Yes 2 2 1 1 X X Parallel Enable Bit Size Cycles Pipeline

Opcode

swap(AC0, AC2) swap(AC1, AC3)

0101 111E 0000 0000 0101 111E 0000 0001

Operands Description

ACx, ACy This instruction performs parallel moves between two accumulators. These operations are performed in a dedicated datapath independent of the D-unit operators. This instruction moves the content of the first accumulator (ACx) to the second accumulator (ACy), and reciprocally moves the content of the second accumulator to the first accumulator. Accumulator swapping is performed in the execute phase of the pipeline.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Swap Accumulator Pair Content - Swap Auxiliary Register Content - Swap Auxiliary and Temporary Register Content - Swap Temporary Register Content

Example
Syntax swap(AC0, AC2)
Before AC0 AC2 01 E500 0030 00 2800 0200

Description The content of AC0 is moved to AC2 and the content of AC2 is moved to AC0.
After AC0 AC2 00 2800 0200 01 E500 0030

5-492

Instruction Set Descriptions

SPRU375G

Swap Accumulator Pair Content (swap)

Swap Accumulator Pair Content


Syntax Characteristics
No. [1] Syntax swap(pair(AC0), pair(AC2)) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline X

Opcode Operands Description AC0, AC2

0101 111E 0001 0000

This instruction performs two parallel moves between four accumulators (AC0 and AC2, AC1 and AC3) in one cycle. These operations are performed in a dedicated datapath independent of the D-unit operators. Accumulator swapping is performed in the execute phase of the pipeline. This instruction performs two parallel moves:
- the content of AC0 to AC2, and reciprocally the content of AC2 to AC0 - the content of AC1 to AC3, and reciprocally the content of AC3 to AC1

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Swap Accumulator Content - Swap Auxiliary Register Pair Content - Swap Auxiliary and Temporary Register Pair Content - Swap Temporary Register Pair Content

Example
Syntax swap(pair(AC0), pair(AC2)) Description The following two swap instructions are performed in parallel: the content of AC0 is moved to AC2 and the content of AC2 is moved to AC0, and the content of AC1 is moved to AC3 and the content of AC3 is moved to AC1.
After 01 E500 0030 00 FFFF 0000 00 2800 0200 00 8800 0800 AC0 AC1 AC2 AC3 00 2800 0200 00 8800 0800 01 E500 0030 00 FFFF 0000

Before AC0 AC1 AC2 AC3

SPRU375G

Instruction Set Descriptions

5-493

Swap Auxiliary Register Content (swap)

Swap Auxiliary Register Content


Syntax Characteristics
No. Syntax swap(ARx, ARy) [1] [2] [3] swap(AR0, AR1) swap(AR0, AR2) swap(AR1, AR3) Yes Yes Yes 2 2 2 1 1 1 AD AD AD Parallel Enable Bit Size Cycles Pipeline

Opcode

swap(AR0, AR1) swap(AR0, AR2) swap(AR1, AR3)

0101 111E 0011 1000 0101 111E 0000 1000 0101 111E 0000 1001

Operands Description

ARx, ARy This instruction performs parallel moves between two auxiliary registers. These operations are performed in a dedicated datapath independent of the A-unit operators. This instruction moves the content of the first auxiliary register (ARx) to the second auxiliary register (ARy), and reciprocally moves the content of the second auxiliary register to the first auxiliary register. Auxiliary register swapping is performed in the address phase of the pipeline.

Status Bits Repeat See Also

Affected by Affects

none none

This instruction can be repeated. See the following other related instructions:
- Swap Accumulator Content - Swap Auxiliary and Temporary Register Content - Swap Auxiliary Register Pair Content - Swap Temporary Register Content

Example
Syntax swap(AR0, AR2)
Before AR0 AR2 6500 0300

Description The content of AR0 is moved to AR2 and the content of AR2 is moved to AR0.
After AR0 AR2 0300 6500

5-494

Instruction Set Descriptions

SPRU375G

Swap Auxiliary Register Pair Content (swap)

Swap Auxiliary Register Pair Content


Syntax Characteristics
No. [1] Syntax swap(pair(AR0), pair(AR2)) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline AD

Opcode Operands Description AR0, AR2

0101 111E 0001 1000

This instruction performs two parallel moves between four auxiliary registers (AR0 and AR2, AR1 and AR3) in one cycle. These operations are performed in a dedicated datapath independent of the A-unit operators. Auxiliary register swapping is performed in the address phase of the pipeline. This instruction performs two parallel moves:
- the content of AR0 to AR2, and reciprocally the content of AR2 to AR0 - the content of AR1 to AR3, and reciprocally the content of AR3 to AR1

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Swap Accumulator Pair Content - Swap Auxiliary Register Content - Swap Auxiliary and Temporary Register Pair Content - Swap Temporary Register Pair Content

Example
Syntax swap(pair(AR0), pair(AR2)) Description The following two swap instructions are performed in parallel: the content of AR0 is moved to AR2 and the content of AR2 is moved to AR0, and the content of AR1 is moved to AR3 and the content of AR3 is moved to AR1.
After 0200 0300 6788 0200 AR0 AR1 AR2 AR3 6788 0200 0200 0300

Before AR0 AR1 AR2 AR3

SPRU375G

Instruction Set Descriptions

5-495

Swap Auxiliary and Temporary Register Content (swap)

Swap Auxiliary and Temporary Register Content


Syntax Characteristics
No. Syntax swap(ARx, Tx) [1] [2] [3] [4] swap(AR4, T0) swap(AR5, T1) swap(AR6, T2) swap(AR7, T3) Yes Yes Yes Yes 2 2 2 2 1 1 1 1 AD AD AD AD Parallel Enable Bit Size Cycles Pipeline

Opcode

swap(AR4, T0) swap(AR5, T1) swap(AR6, T2) swap(AR7, T3)

0101 111E 0000 1100 0101 111E 0000 1101 0101 111E 0000 1110 0101 111E 0000 1111

Operands Description

ARx, Tx This instruction performs parallel moves between auxiliary registers and temporary registers. These operations are performed in a dedicated datapath independent of the A-unit operators. This instruction moves the content of the auxiliary register (ARx) to the temporary register (Tx), and reciprocally moves the content of the temporary register to the auxiliary register. Auxiliary and temporary register swapping is performed in the address phase of the pipeline.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Swap Accumulator Content - Swap Auxiliary Register Content - Swap Auxiliary and Temporary Register Pair Content - Swap Auxiliary and Temporary Register Pairs Content - Swap Temporary Register Content

5-496

Instruction Set Descriptions

SPRU375G

Swap Auxiliary and Temporary Register Content (swap)

Example
Syntax swap(AR4, T0)
Before T0 AR4 6500 0300

Description The content of AR4 is moved to T0 and the content of T0 is moved to AR4.
After T0 AR4 0300 6500

SPRU375G

Instruction Set Descriptions

5-497

Swap Auxiliary and Temporary Register Pair Content (swap)

Swap Auxiliary and Temporary Register Pair Content


Syntax Characteristics
No. Syntax swap(pair(ARx), pair(Tx)) [1] [2] swap(pair(AR4), pair(T0)) swap(pair(AR6), pair(T2)) Yes Yes 2 2 1 1 AD AD Parallel Enable Bit Size Cycles Pipeline

Opcode

swap(pair(AR4), pair(T0)) swap(pair(AR6), pair(T2))

0101 111E 0001 1100 0101 111E 0001 1110

Operands Description

ARx, Tx This instruction performs two parallel moves between two auxiliary registers and two temporary registers in one cycle. These operations are performed in a dedicated datapath independent of the A-unit operators. Auxiliary and temporary register swapping is performed in the address phase of the pipeline. Instruction [1] performs two parallel moves:
- the content of AR4 to T0, and reciprocally the content of T0 to AR4 - the content of AR5 to T1, and reciprocally the content of T1 to AR5

Instruction [2] performs two parallel moves:


- the content of AR6 to T2, and reciprocally the content of T2 to AR6 - the content of AR7 to T3, and reciprocally the content of T3 to AR7

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Swap Accumulator Pair Content - Swap Auxiliary Register Pair Content - Swap Auxiliary and Temporary Register Content - Swap Auxiliary and Temporary Register Pairs Content - Swap Temporary Register Pair Content

5-498

Instruction Set Descriptions

SPRU375G

Swap Auxiliary and Temporary Register Pair Content (swap)

Example
Syntax swap(pair(AR4), pair(T0)) Description The following two swap instructions are performed in parallel: the content of AR4 is moved to T0 and the content of T0 is moved to AR4, and the content of AR5 is moved to T1 and the content of T1 is moved to AR5.
After 0200 0300 6788 0200 AR4 AR5 T0 T1 6788 0200 0200 0300

Before AR4 AR5 T0 T1

SPRU375G

Instruction Set Descriptions

5-499

Swap Auxiliary and Temporary Register Pairs Content (swap)

Swap Auxiliary and Temporary Register Pairs Content


Syntax Characteristics
Parallel Enable Bit Yes

No. [1]

Syntax swap(block(AR4), block(T0))

Size 2

Cycles 1

Pipeline AD

Opcode Operands Description AR4, T0

0101 111E 0010 1100

This instruction performs four parallel moves between four auxiliary registers (AR4, AR5, AR6, and AR7) and four temporary registers (T0, T1, T2, and T3) in one cycle. These operations are performed in a dedicated datapath independent of the A-unit operators. Auxiliary and temporary register swapping is performed in the address phase of the pipeline. This instruction performs four parallel moves:
- the content of AR4 to T0, and reciprocally the content of T0 to AR4 - the content of AR5 to T1, and reciprocally the content of T1 to AR5 - the content of AR6 to T2, and reciprocally the content of T2 to AR6 - the content of AR7 to T3, and reciprocally the content of T3 to AR7

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Swap Auxiliary and Temporary Register Content - Swap Auxiliary and Temporary Register Pair Content

5-500

Instruction Set Descriptions

SPRU375G

Swap Auxiliary and Temporary Register Pairs Content (swap)

Example
Syntax swap (block(AR4), block(T0)) Description The following four swap instructions are performed in parallel: the content of AR4 is moved to T0 and the content of T0 is moved to AR4, the content of AR5 is moved to T1 and the content of T1 is moved to AR5, the content of AR6 is moved to T2 and the content of T2 is moved to AR6, and the content of AR7 is moved to T3 and the content of T3 is moved to AR7.
After 0200 0300 0240 0400 0030 0200 3400 0FD3 AR4 AR5 AR6 AR7 T0 T1 T2 T3 0030 0200 3400 0FD3 0200 0300 0240 0400

Before AR4 AR5 AR6 AR7 T0 T1 T2 T3

SPRU375G

Instruction Set Descriptions

5-501

Swap Temporary Register Content (swap)

Swap Temporary Register Content


Syntax Characteristics
No. Syntax swap(Tx, Ty) [1] [2] swap(T0, T2) swap(T1, T3) Yes Yes 2 2 1 1 AD AD Parallel Enable Bit Size Cycles Pipeline

Opcode

swap(T0, T2) swap(T1, T3)

0101 111E 0000 0100 0101 111E 0000 0101

Operands Description

Tx, Ty This instruction performs parallel moves between two temporary registers. These operations are performed in a dedicated datapath independent of the A-unit operators. This instruction moves the content of the first temporary register (Tx) to the second temporary register (Ty), and reciprocally moves the content of the second temporary register to the first temporary register. Temporary register swapping is performed in the address phase of the pipeline.

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Swap Accumulator Content - Swap Auxiliary Register Content - Swap Auxiliary and Temporary Register Content - Swap Temporary Register Pair Content

Example
Syntax swap(T0, T2)
Before T0 T2 6500 0300

Description The content of T0 is moved to T2 and the content of T2 is moved to T0.


After T0 T2 0300 6500

5-502

Instruction Set Descriptions

SPRU375G

Swap Temporary Register Pair Content (swap)

Swap Temporary Register Pair Content


Syntax Characteristics
No. [1] Syntax swap(pair(T0), pair(T2)) Parallel Enable Bit Yes Size 2 Cycles 1 Pipeline AD

Opcode Operands Description T0, T2

0101 111E 0001 0100

This instruction performs two parallel moves between four temporary registers (T0 and T2, T1 and T3) in one cycle. These operations are performed in a dedicated datapath independent of the A-unit operators. Temporary register swapping is performed in the address phase of the pipeline. This instruction performs two parallel moves:
- the content of T0 to T2, and reciprocally the content of T2 to T0 - the content of T1 to T3, and reciprocally the content of T3 to T1

Status Bits

Affected by Affects

none none

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Swap Accumulator Pair Content - Swap Auxiliary Register Pair Content - Swap Auxiliary and Temporary Register Pair Content - Swap Temporary Register Content

Example
Syntax swap(pair(T0), pair(T2)) Description The following two swap instructions are performed in parallel: the content of T0 is moved to T2 and the content of T2 is moved to T0, and the content of T1 is moved to T3 and the content of T3 is moved to T1.
After 0200 0300 6788 0200 T0 T1 T2 T3 6788 0200 0200 0300

Before T0 T1 T2 T3

SPRU375G

Instruction Set Descriptions

5-503

Test Accumulator, Auxiliary, or Temporary Register Bit

Test Accumulator, Auxiliary, or Temporary Register Bit


Syntax Characteristics
No. [1] [2] Syntax TC1 = bit(src, Baddr) TC2 = bit(src, Baddr) Parallel Enable Bit No No Size 3 3 Cycles 1 1 Pipeline X X

Opcode

TC1 TC2

1110 1100 AAAA AAAI FSSS 1000 1110 1100 AAAA AAAI FSSS 1001

Operands Description

Baddr, src, TCx This instruction performs a bit manipulation:


- In the D-unit ALU, if the source (src) register operand is an accumulator. - In the A-unit ALU, if the source (src) register operand is an auxiliary or

temporary register. The instruction tests a single bit of the source register location as defined by the bit addressing mode, Baddr. The tested bit is copied into the selected TCx status bit. The generated bit address must be within:
- 039 when accessing accumulator bits (only the 6 LSBs of the generated

bit address are used to determine the bit position). If the generated bit address is not within 039, 0 is stored into the selected TCx status bit.
- 015 when accessing auxiliary or temporary register bits (only the 4 LSBs

of the generated address are used to determine the bit position). Status Bits Affected by Affects Repeat See Also none TCx

This instruction can be repeated. See the following other related instructions:
- Clear Accumulator, Auxiliary, or Temporary Register Bit - Complement Accumulator, Auxiliary, or Temporary Register Bit - Set Accumulator, Auxiliary, or Temporary Register Bit - Test Accumulator, Auxiliary, or Temporary Register Bit Pair - Test Memory Bit

5-504

Instruction Set Descriptions

SPRU375G

Test Accumulator, Auxiliary, or Temporary Register Bit

Example
Syntax TC1 = bit(T0, @#12) Description The bit at the position defined by the register bit address (12) in T0 is tested and the tested bit is copied into TC1.
After FE00 0 T0 TC1 FE00 1

Before T0 TC1

SPRU375G

Instruction Set Descriptions

5-505

Test Accumulator, Auxiliary, or Temporary Register Bit Pair

Test Accumulator, Auxiliary, or Temporary Register Bit Pair


Syntax Characteristics
No. [1] Syntax bit(src, pair(Baddr)) Parallel Enable Bit No Size 3 Cycles 1 Pipeline X

Opcode Operands Description Baddr, src

1110 1100 AAAA AAAI FSSS 010x

This instruction performs a bit manipulation:


- In the D-unit ALU, if the source (src) register operand is an accumulator. - In the A-unit ALU, if the source (src) register operand is an auxiliary or

temporary register. The instruction tests two consecutive bits of the source register location as defined by the bit addressing mode, Baddr and Baddr + 1. The tested bits are copied into status bits TC1 and TC2:
J J

TC1 tests the bit that is defined by Baddr TC2 tests the bit defined by Baddr + 1

The generated bit address must be within:


- 038 when accessing accumulator bits (only the 6 LSBs of the generated

bit address are used to determine the bit position). If the generated bit address is not within 038:
J J

If the generated bit address is 39, bit 39 of the register is stored into TC1 and 0 is stored into TC2. In all other cases, 0 is stored into TC1 and TC2.

- 014 when accessing auxiliary or temporary register bits (only the 4 LSBs

of the generated address are used to determine the bit position). If the generated bit address is not within 014:
J J

If the generated bit address is 15, bit 15 of the register is stored into TC1 and 0 is stored into TC2. In all other cases, 0 is stored into TC1 and TC2. none TC1, TC2
SPRU375G

Status Bits

Affected by Affects

5-506

Instruction Set Descriptions

Test Accumulator, Auxiliary, or Temporary Register Bit Pair

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Clear Accumulator, Auxiliary, or Temporary Register Bit - Complement Accumulator, Auxiliary, or Temporary Register Bit - Set Accumulator, Auxiliary, or Temporary Register Bit - Test Accumulator, Auxiliary, or Temporary Register Bit - Test Memory Bit

Example
Syntax bit(AC0, pair(AR1(T0))) Description The bit at the position defined by the content of AR1(T0) in AC0 is tested and the tested bit is copied into TC1. The bit at the position defined by the content of AR1(T0) + 1 in AC0 is tested and the tested bit is copied into TC2.
After E0 1234 0000 0026 0001 0 0 AC0 AR1 T0 TC1 TC2 E0 1234 0000 0026 0001 1 0

Before AC0 AR1 T0 TC1 TC2

SPRU375G

Instruction Set Descriptions

5-507

Test Memory Bit

Test Memory Bit


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax TCx = bit(Smem, src) TCx = bit(Smem, k4)

Size 3 3

Cycles 1 1

Pipeline X X

Description

These instructions perform a bit manipulation in the A-unit ALU. These instructions test a single bit of a memory (Smem) location. The bit tested is defined by either the content of the source (src) operand or a 4-bit immediate value, k4. The tested bit is copied into the selected TCx status bit. For instruction [1], the generated bit address must be within 015 (only the 4 LSBs of the register are used to determine the bit position).

Status Bits

Affected by Affects

none TCx

See Also

See the following other related instructions:


- Clear Memory Bit - Complement Memory Bit - Set Memory Bit - Test Accumulator, Auxiliary, or Temporary Register Bit - Test Accumulator, Auxiliary, or Temporary Register Bit Pair - Test and Clear Memory Bit - Test and Complement Memory Bit - Test and Set Memory Bit

5-508

Instruction Set Descriptions

SPRU375G

Test Memory Bit

Test Memory Bit


Syntax Characteristics
Parallel Enable Bit No No

No. [1a] [1b]

Syntax TC1 = bit(Smem, src) TC2 = bit(Smem, src)

Size 3 3

Cycles 1 1

Pipeline X X

Opcode

TC1 TC2

1110 0000 AAAA AAAI FSSS xxx0 1110 0000 AAAA AAAI FSSS xxx1

Operands Description

Smem, src, TCx This instruction performs a bit manipulation in the A-unit ALU. This instruction tests a single bit of a memory (Smem) location. The bit tested is defined by the content of the source (src) operand. The tested bit is copied into the selected TCx status bit. The generated bit address must be within 015 (only the 4 LSBs of the register are used to determine the bit position).

Status Bits

Affected by Affects

none TCx

Repeat Example
Syntax TC1 = bit(*AR0, AC0)

This instruction can be repeated.

Description The bit at the position defined by AC0(30) in the content addressed by AR0 is tested and the tested bit is copied into TC1.
After 00 0000 0008 00C0 0 AC0 *AR0 TC1 00 0000 0008 00C0 0

Before AC0 *AR0 TC1

SPRU375G

Instruction Set Descriptions

5-509

Test Memory Bit

Test Memory Bit


Syntax Characteristics
Parallel Enable Bit No No

No. [2a] [2b]

Syntax TC1 = bit(Smem, k4) TC2 = bit(Smem, k4)

Size 3 3

Cycles 1 1

Pipeline X X

Opcode

TC1 TC2

1101 1100 AAAA AAAI kkkk xx00 1101 1100 AAAA AAAI kkkk xx01

Operands Description

k4, Smem, TCx This instruction performs a bit manipulation in the A-unit ALU. This instruction tests a single bit of a memory (Smem) location. The bit tested is defined by a 4-bit immediate value, k4. The tested bit is copied into the selected TCx status bit. Affected by Affects none TCx

Status Bits

Repeat Example
Syntax TC1 = bit(*AR3, #12)

This instruction can be repeated.

Description The bit at the position defined by an unsigned 4-bit value (12) in the content addressed by AR3 is tested and the tested bit is copied into TC1.

5-510

Instruction Set Descriptions

SPRU375G

Test and Clear Memory Bit

Test and Clear Memory Bit


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax TC1 = bit(Smem, k4), bit(Smem, k4) = #0 TC2 = bit(Smem, k4), bit(Smem, k4) = #0

Size 3 3

Cycles 1 1

Pipeline X X

Opcode

TC1 TC2

1110 0011 AAAA AAAI kkkk 010x 1110 0011 AAAA AAAI kkkk 011x

Operands Description

k4, Smem, TCx This instruction performs a bit manipulation in the A-unit ALU. The instruction tests a single bit, as defined by a 4-bit immediate value, k4, of a memory (Smem) location. The tested bit is copied into status bit TCx and is cleared to 0 in Smem. Affected by Affects none TCx

Status Bits

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Clear Memory Bit - Complement Memory Bit - Set Memory Bit - Test and Complement Memory Bit - Test and Set Memory Bit - Test Memory Bit

Example
Syntax TC1 = bit(*AR3, #12), bit(*AR3, #12) = #0 Description The bit at the position defined by the unsigned 4-bit value (12) in the content addressed by AR3 is tested and the tested bit is copied into TC1. The selected bit (12) in the content addressed by AR3 is cleared to 0.

SPRU375G

Instruction Set Descriptions

5-511

Test and Complement Memory Bit

Test and Complement Memory Bit


Syntax Characteristics
No. [1] [2] Syntax TC1 = bit(Smem, k4), cbit(Smem, k4) TC2 = bit(Smem, k4), cbit(Smem, k4) Parallel Enable Bit No No Size 3 3 Cycles 1 1 Pipeline X X

Opcode

TC1 TC2

1110 0011 AAAA AAAI kkkk 100x 1110 0011 AAAA AAAI kkkk 101x

Operands Description

k4, Smem, TCx This instruction performs a bit manipulation in the A-unit ALU. The instruction tests a single bit, as defined by a 4-bit immediate value, k4, of a memory (Smem) location and the tested bit is copied into status bit TCx and is complemented in Smem. Affected by Affects none TCx

Status Bits

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Clear Memory Bit - Complement Memory Bit - Set Memory Bit - Test and Clear Memory Bit - Test and Set Memory Bit - Test Memory Bit

Example
Syntax TC1 = bit(*AR0, #12), cbit(*AR0, #12) Description The bit at the position defined by the unsigned 4-bit value (12) in the content addressed by AR0 is tested and the tested bit is copied into TC1. The selected bit (12) in the content addressed by AR0 is complemented.

Before *AR0 TC1 0040 0

After *AR0 TC1 1040 0

5-512

Instruction Set Descriptions

SPRU375G

Test and Set Memory Bit

Test and Set Memory Bit


Syntax Characteristics
Parallel Enable Bit No No

No. [1] [2]

Syntax TC1 = bit(Smem, k4), bit(Smem, k4) = #1 TC2 = bit(Smem, k4), bit(Smem, k4) = #1

Size 3 3

Cycles 1 1

Pipeline X X

Opcode

TC1 TC2

1110 0011 AAAA AAAI kkkk 000x 1110 0011 AAAA AAAI kkkk 001x

Operands Description

k4, Smem, TCx This instruction performs a bit manipulation in the A-unit ALU. The instruction tests a single bit, as defined by a 4-bit immediate value, k4, of a memory (Smem) location. The tested bit is copied into status bit TCx and is set to 1 in Smem. Affected by Affects none TCx

Status Bits

Repeat See Also

This instruction can be repeated. See the following other related instructions:
- Clear Memory Bit - Complement Memory Bit - Set Memory Bit - Test and Clear Memory Bit - Test and Complement Memory Bit - Test Memory Bit

Example
Syntax TC1 = bit(*AR3, #12), bit(*AR3, #12) = #1 Description The bit at the position defined by the unsigned 4-bit value (12) in the content addressed by AR3 is tested and the tested bit is copied into TC1. The selected bit (12) in the content addressed by AR3 is set to 1.

SPRU375G

Instruction Set Descriptions

5-513

Chapter 6

Instruction Opcodes in Sequential Order


This chapter provides the opcode in sequential order for each TMS320C55x DSP instruction syntax.

Topic
6.1 6.2

Page
Instruction Set Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6-2 Instruction Set Opcode Symbols and Abbreviations . . . . . . . . . . . . 6-16

6-1

Instruction Set Opcodes

6.1 Instruction Set Opcodes


Table 61 lists the opcodes of the instruction set. See Table 62 (page 6-16) for a list of the symbols and abbreviations used in the instruction set opcode. See Table 11 (page 1-2) and Table 12 (page 1-6) for a list of the terms, symbols, and abbreviations used in the algebraic syntax.

Table 61. Instruction Set Opcodes


Opcode 0000000E xCCCCCCC kkkkkkkk 0000001E xCCCCCCC xxxxxxxx 0000010E xCCCCCCC LLLLLLLL 0000011E LLLLLLLL LLLLLLLL 0000100E LLLLLLLL LLLLLLLL 0000110E kkkkkkkk kkkkkkkk 0000111E llllllll llllllll 0001000E DDSS0000 xxSHIFTW 0001000E DDSS0001 xxSHIFTW 0001000E DDSS0010 xxSHIFTW 0001000E DDSS0011 xxSHIFTW 0001000E DDSS0100 xxSHIFTW 0001000E DDSS0101 xxSHIFTW 0001000E DDSS0110 xxSHIFTW 0001000E DDSS0111 xxSHIFTW 0001000E xxSS1000 xxddxxxx 0001000E DDSS1001 xxddxxxx 0001000E xxSS1010 SSddxxxt 0001000E DDSS1100 SSDDnnnn 0001000E DDSS1101 SSDDxxxr 0001000E DDSS1110 SSDDxxxx 0001000E DDSS1111 SSDDxxxr 0001001E FSSScc00 FDDDxuxt 0001001E FSSScc01 FDDD0utt 0001001E FSSScc01 FDDD1utt 0001001E FSSScc10 FDDD0utt 0001001E FSSScc10 FDDD1utt 0001001E FSSSxx11 FDDD0xvv Algebraic syntax while (cond && (RPTC < k8)) repeat if (cond) return if (cond) goto L8 goto L16 call L16 repeat(k16) blockrepeat{} ACy = ACy & (ACx <<< #SHIFTW) ACy = ACy | (ACx <<< #SHIFTW) ACy = ACy ^ (ACx <<< #SHIFTW) ACy = ACy + (ACx << #SHIFTW) ACy = ACy (ACx << #SHIFTW) ACy = ACx << #SHIFTW ACy = ACx <<C #SHIFTW ACy = ACx <<< #SHIFTW Tx = exp(ACx) ACy = mant(ACx), Tx = exp(ACx) Tx = count(ACx,ACy,TCx) max_diff(ACx,ACy,ACz,ACw) max_diff_dbl(ACx,ACy,ACz,ACw,TRNx) min_diff(ACx,ACy,ACz,ACw) min_diff_dbl(ACx,ACy,ACz,ACw,TRNx) TCx = uns(src RELOP dst) TCx = TCy & uns(src RELOP dst) TCx = !TCy & uns(src RELOP dst) TCx = TCy | uns(src RELOP dst) TCx = !TCy | uns(src RELOP dst) dst = BitOut \\ src \\ BitIn

6-2

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 0001001E FSSSxx11 FDDD1xvv 0001010E FSSSxxxx FDDD0000 0001010E FSSSxxxx FDDD0001 0001010E FSSSxxxx FDDD0010 0001010E PPPPPPPP FDDD0100 0001010E PPPPPPPP FDDD0101 0001010E PPPPPPPP FDDD0110 0001010E FSSSxxxx FDDD1000 0001010E FSSSxxxx FDDD1001 0001010E FSSSxxxx FDDD1010 0001010E PPPPPPPP FDDD1100 0001010E PPPPPPPP FDDD1101 0001010E PPPPPPPP FDDD1110 0001011E xxxxxkkk kkkk0000 0001011E xxxkkkkk kkkk0011 0001011E kkkkkkkk kkkk0100 0001011E kkkkkkkk kkkk0101 0001011E kkkkkkkk kkkk0110 0001011E kkkkkkkk kkkk1000 0001011E kkkkkkkk kkkk1001 0001011E kkkkkkkk kkkk1010 0001100E kkkkkkkk FDDDFSSS 0001101E kkkkkkkk FDDDFSSS 0001110E kkkkkkkk FDDDFSSS 0001111E KKKKKKKK SSDDxx0% 0001111E KKKKKKKK SSDDss1% 0010000E 0010001E FSSSFDDD 0010010E FSSSFDDD 0010011E FSSSFDDD 0010100E FSSSFDDD 0010101E FSSSFDDD 0010110E FSSSFDDD Algebraic syntax dst = BitIn // src // BitOut mar(TAy + TAx) mar(TAy = TAx) mar(TAy TAx) mar(TAx + P8) mar(TAx = P8) mar(TAx P8) mar(TAy + TAx) mar(TAy = TAx) mar(TAy TAx) mar(TAx + P8) mar(TAx = P8) mar(TAx P8) DPH = k7 PDP = k9 BK03 = k12 BK47 = k12 BKC = k12 CSR = k12 BRC0 = k12 BRC1 = k12 dst = src & k8 dst = src | k8 dst = src ^ k8 ACy = rnd(ACx * K8) ACy = rnd(ACx + (Tx * K8)) nop dst = src dst = dst + src dst = dst src dst = dst & src dst = dst | src dst = dst ^ src

SPRU375G

Instruction Opcodes in Sequential Order

6-3

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 0010111E FSSSFDDD 0011000E FSSSFDDD 0011001E FSSSFDDD 0011010E FSSSFDDD 0011011E FSSSFDDD 0011100E FSSSFDDD (Note: FSSS = src1, FDDD = src2) 0011101E FSSSFDDD (Note: FSSS = dst1, FDDD = dst2) 0011110E kkkkFDDD 0011111E kkkkFDDD 0100000E kkkkFDDD 0100001E kkkkFDDD 0100010E 00SSFDDD 0100010E 01x0FDDD 0100010E 01x1FDDD 0100010E 1000FDDD 0100010E 1001FDDD 0100010E 1010FDDD 0100010E 1100FDDD 0100010E 1101FDDD 0100010E 1110FDDD 0100011E kkkk0000 0100011E kkkk0001 0100011E kkkk0010 0100011E kkkk0011 0100011E kkkk0100 0100011E kkkk0101 0100011E kkkk0110 0100011E kkkk0111 0100100E xxxxx000 0100100E FSSSx001 0100100E kkkkx010 dst = max(src, dst) dst = min(src, dst) dst = |src| dst = src dst = ~src push(src1, src2) dst1, dst2 = pop() dst = k4 dst = k4 dst = dst + k4 dst = dst k4 TAx = HI(ACx) dst = dst >> #1 dst = dst << #1 TAx = SP TAx = SSP TAx = CDP TAx = BRC0 TAx = BRC1 TAx = RPTC bit(ST0, k4) = #0 bit(ST0, k4) = #1 bit(ST1, k4) = #0 bit(ST1, k4) = #1 bit(ST2, k4) = #0 bit(ST2, k4) = #1 bit(ST3, k4) = #0 bit(ST3, k4) = #1 repeat(CSR) repeat(CSR), CSR += TAx repeat(CSR), CSR += k4 Algebraic syntax

6-4

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 0100100E kkkkx011 0100100E xxxxx100 0100100E xxxxx101 0100101E 0LLLLLLL 0100101E 1lllllll 0100110E kkkkkkkk 0100111E KKKKKKKK 0101000E FDDDx000 0101000E FDDDx001 0101000E FDDDx010 0101000E xxDDx011 0101000E FSSSx110 0101000E xxSSx111 0101000E XDDD0100 0101000E XSSS0101 0101001E FSSS00DD 0101001E FSSS1000 0101001E FSSS1001 0101001E FSSS1010 0101001E FSSS1100 0101001E FSSS1101 0101001E FSSS1110 0101010E DDSS000% 0101010E DDSS001% 0101010E DDSS010% 0101010E DDSS011% 0101010E DDSS100% 0101010E DDSS101% 0101010E DDSS110% 0101011E DDSSss0% 0101011E DDSSss1% 0101100E DDSSss0% 0101100E DDSSss1% Algebraic syntax repeat(CSR), CSR = k4 return return_int goto L7 localrepeat{} repeat(k8) SP = SP + K8 dst = dst <<< #1 dst = dst >>> #1 dst = pop() ACx = dbl(pop()) push(src) dbl(push(ACx)) xdst = popboth() pshboth(xsrc) HI(ACx) = TAx SP = TAx SSP = TAx CDP = TAx CSR = TAx BRC1 = TAx BRC0 = TAx ACy = rnd(ACy + |ACx|) ACy = rnd(ACy + (ACx * ACx)) ACy = rnd(ACy (ACx * ACx)) ACy = rnd(ACy * ACx) ACy = rnd(ACx * ACx) ACy = rnd(ACx) ACy = saturate(rnd(ACx)) ACy = rnd(ACy + (ACx * Tx)) ACy = rnd(ACy (ACx * Tx)) ACy = rnd(ACx * Tx) ACy = rnd((ACy * Tx) + ACx)

SPRU375G

Instruction Opcodes in Sequential Order

6-5

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 0101101E DDSSss00 0101101E DDSSss01 0101101E DDxxxx1t 0101110E DDSSss00 0101110E DDSSss01 0101110E DDSSss10 0101111E 00kkkkkk 01100lll lCCCCCCC 01101000 xCCCCCCC PPPPPPPP PPPPPPPP PPPPPPPP 01101001 xCCCCCCC PPPPPPPP PPPPPPPP PPPPPPPP 01101010 PPPPPPPP PPPPPPPP PPPPPPPP 01101100 PPPPPPPP PPPPPPPP PPPPPPPP 01101101 xCCCCCCC LLLLLLLL LLLLLLLL 01101110 xCCCCCCC LLLLLLLL LLLLLLLL 01101111 FSSSccxu KKKKKKKK LLLLLLLL 01110000 KKKKKKKK KKKKKKKK SSDDSHFT 01110001 KKKKKKKK KKKKKKKK SSDDSHFT 01110010 kkkkkkkk kkkkkkkk SSDDSHFT 01110011 kkkkkkkk kkkkkkkk SSDDSHFT 01110100 kkkkkkkk kkkkkkkk SSDDSHFT 01110101 KKKKKKKK KKKKKKKK xxDDSHFT 01110110 kkkkkkkk kkkkkkkk FDDD00SS 01110110 kkkkkkkk kkkkkkkk FDDD01SS 01110110 KKKKKKKK KKKKKKKK FDDD10xx 01110111 DDDDDDDD DDDDDDDD FDDDxxxx 01111000 kkkkkkkk kkkkkkkk xxx0000x 01111000 kkkkkkkk kkkkkkkk xxx0001x 01111000 kkkkkkkk kkkkkkkk xxx0010x 01111000 kkkkkkkk kkkkkkkk xxx0011x 01111000 kkkkkkkk kkkkkkkk xxx0100x 01111000 kkkkkkkk kkkkkkkk xxx0101x Algebraic syntax ACy = ACy + (ACx << Tx) ACy = ACy (ACx << Tx) ACx = sftc(ACx,TCx) ACy = ACx <<< Tx ACy = ACx << Tx ACy = ACx <<C Tx swap( ) if (cond) goto l4 if (cond) goto P24 if (cond) call P24 goto P24 call P24 if (cond) goto L16 if (cond) call L16 compare (uns(src RELOP K8)) goto L8 ACy = ACx + (K16 << #SHFT) ACy = ACx (K16 << #SHFT) ACy = ACx & (k16 <<< #SHFT) ACy = ACx | (k16 <<< #SHFT) ACy = ACx ^ (k16 <<< #SHFT) ACx = K16 << #SHFT dst = field_extract(ACx,k16) dst = field_expand(ACx,k16) dst = K16 mar(TAx = D16) DP = k16 SSP = k16 CDP = k16 BSA01 = k16 BSA23 = k16 BSA45 = k16

6-6

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 01111000 kkkkkkkk kkkkkkkk xxx0110x 01111000 kkkkkkkk kkkkkkkk xxx0111x 01111000 kkkkkkkk kkkkkkkk xxx1000x 01111001 KKKKKKKK KKKKKKKK SSDDxx0% 01111001 KKKKKKKK KKKKKKKK SSDDss1% 01111010 KKKKKKKK KKKKKKKK SSDD000x 01111010 KKKKKKKK KKKKKKKK SSDD001x 01111010 kkkkkkkk kkkkkkkk SSDD010x 01111010 kkkkkkkk kkkkkkkk SSDD011x 01111010 kkkkkkkk kkkkkkkk SSDD100x 01111010 KKKKKKKK KKKKKKKK xxDD101x 01111010 xxxxxxxx xxxxxxxx xxxx110x 01111011 KKKKKKKK KKKKKKKK FDDDFSSS 01111100 KKKKKKKK KKKKKKKK FDDDFSSS 01111101 kkkkkkkk kkkkkkkk FDDDFSSS 01111110 kkkkkkkk kkkkkkkk FDDDFSSS 01111111 kkkkkkkk kkkkkkkk FDDDFSSS 10000000 XXXMMMYY YMMM00xx 10000000 XXXMMMYY YMMM01xx 10000000 XXXMMMYY YMMM10SS 10000001 XXXMMMYY YMMM00DD 10000001 XXXMMMYY YMMM01DD 10000001 XXXMMMYY YMMM10DD 10000010 XXXMMMYY YMMM00mm uuDDDDg% 10000010 XXXMMMYY YMMM01mm uuDDDDg% BSA67 = k16 BSAC = k16 SP = k16 ACy = rnd(ACx * K16) ACy = rnd(ACx + (Tx * K16)) ACy = ACx + (K16 << #16) ACy = ACx (K16 << #16) ACy = ACx & (k16 <<< #16) ACy = ACx | (k16 <<< #16) ACy = ACx ^ (k16 <<< #16) ACx = K16 << #16 idle dst = src + K16 dst = src K16 dst = src & k16 dst = src | k16 dst = src ^ k16 dbl(Ymem) = dbl(Xmem) Ymem = Xmem Xmem = LO(ACx), Ymem = HI(ACx) ACx = (Xmem << #16) + (Ymem << #16) ACx = (Xmem << #16) (Ymem << #16) LO(ACx) = Xmem, HI(ACx) = Ymem ACx = M40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) mar(Xmem), ACx = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Algebraic syntax

10000010 XXXMMMYY YMMM10mm uuDDDDg%

10000010 XXXMMMYY YMMM11mm uuxxDDg%

SPRU375G

Instruction Opcodes in Sequential Order

6-7

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 10000011 XXXMMMYY YMMM00mm uuDDDDg% Algebraic syntax ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem)))))
ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem)))))

10000011 XXXMMMYY YMMM01mm uuDDDDg%

10000011 XXXMMMYY YMMM10mm uuDDDDg%

10000011 XXXMMMYY YMMM11mm uuxxDDg% 10000100 XXXMMMYY YMMM00mm uuDDDDg%

mar(Xmem), ACx = M40(rnd(ACx + (uns(Ymem) * uns(coef(Cmem)))))


ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) mar(Xmem), ACx = M40(rnd((ACx >> #16) + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem)))))

10000100 XXXMMMYY YMMM01mm uuxxDDg%

10000100 XXXMMMYY YMMM10mm uuDDDDg%

10000100 XXXMMMYY YMMM11mm uuDDDDg%

10000101 XXXMMMYY YMMM00mm uuxxDDg% 10000101 XXXMMMYY YMMM01mm uuDDDDg%

mar(Xmem), ACx = M40(rnd(ACx (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy (uns(Ymem) * uns(coef(Cmem))))) mar(Xmem) ,mar(Ymem) ,mar(coef(Cmem)) firs(Xmem, Ymem, coef(Cmem), ACx, ACy) firsn(Xmem, Ymem, coef(Cmem), ACx, ACy) ACx = M40(rnd(uns(Xmem) * uns(Ymem))) [,T3 = Xmem] ACy = M40(rnd(ACx + (uns(Xmem) * uns(Ymem)))) [,T3 = Xmem]
ACy = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(Ymem)))) [,T3 = Xmem]

10000101 XXXMMMYY YMMM10mm xxxxxxxx 10000101 XXXMMMYY YMMM11mm DDx0DDU% 10000101 XXXMMMYY YMMM11mm DDx1DDU% 10000110 XXXMMMYY YMMMxxDD 000guuU% 10000110 XXXMMMYY YMMMSSDD 001guuU% 10000110 XXXMMMYY YMMMSSDD 010guuU% 10000110 XXXMMMYY YMMMSSDD 011guuU%

ACy = M40(rnd(ACx (uns(Xmem) * uns(Ymem)))) [,T3 = Xmem]

6-8

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 10000110 XXXMMMYY YMMMDDDD 100xssU% 10000110 XXXMMMYY YMMMDDDD 101xssU% 10000110 XXXMMMYY YMMMDDDD 110xxxx% 10000110 XXXMMMYY YMMMDDDD 1110xxn% 10000110 XXXMMMYY YMMMDDDD 1111xxn% 10000111 XXXMMMYY YMMMSSDD 000xssU% 10000111 XXXMMMYY YMMMSSDD 001xssU% 10000111 XXXMMMYY YMMMSSDD 010xssU% 10000111 XXXMMMYY YMMMSSDD 100xxxxx 10000111 XXXMMMYY YMMMSSDD 101xxxxx 10000111 XXXMMMYY YMMMSSDD 110xxxxx 10010000 XSSSXDDD 10010001 xxxxxxSS 10010010 xxxxxxSS 10010100 xxxxxxxx 10010101 0xxkkkkk 10010101 1xxkkkkk 10010110 0CCCCCCC 10010110 1CCCCCCC 10011000 10011001 10011010 10011100 10011101 10011110 0CCCCCCC 10011110 1CCCCCCC 10011111 0CCCCCCC Algebraic syntax ACx = rnd(ACx (Tx * Xmem)), ACy = Ymem << #16 [,T3 = Xmem] ACx = rnd(ACx + (Tx * Xmem)), ACy = Ymem << #16 [,T3 = Xmem] lms(Xmem, Ymem, ACx, ACy) sqdst(Xmem, Ymem, ACx, ACy) abdst(Xmem, Ymem, ACx, ACy) ACy = rnd(Tx * Xmem), Ymem = HI(ACx << T2) [,T3 = Xmem] ACy = rnd(ACy + (Tx * Xmem)), Ymem = HI(ACx << T2) [,T3 = Xmem] ACy = rnd(ACy (Tx * Xmem)), Ymem = HI(ACx << T2) [,T3 = Xmem] ACy = ACx + (Xmem << #16), Ymem = HI(ACy << T2) ACy = (Xmem << #16) ACx, Ymem = HI(ACy << T2) ACy = Xmem << #16, Ymem = HI(ACx << T2) xdst = xsrc goto ACx call ACx reset intr(k5) trap(k5) if (cond) execute(AD_unit) if (cond) execute(D_unit) mmap() readport() writeport() linear() circular() if (cond) execute(AD_unit) if (cond) execute(D_unit) if (cond) execute(AD_unit)

SPRU375G

Instruction Opcodes in Sequential Order

6-9

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 10011111 1CCCCCCC 1010FDDD AAAAAAAI 101100DD AAAAAAAI 10110100 AAAAAAAI 10110101 AAAAAAAI 10110110 AAAAAAAI 10110111 AAAAAAAI 10111000 AAAAAAAI 10111011 AAAAAAAI 101111SS AAAAAAAI 1100FSSS AAAAAAAI 11010000 AAAAAAAI U%DDxxmm 11010001 AAAAAAAI U%DD00mm 11010001 AAAAAAAI U%DD01mm 11010001 AAAAAAAI U%DD10mm 11010010 AAAAAAAI U%DD00SS 11010010 AAAAAAAI U%DD01SS 11010010 AAAAAAAI U%DD10SS 11010010 AAAAAAAI U%DD11SS 11010011 AAAAAAAI U%DD00SS 11010011 AAAAAAAI U%DD10xx 11010011 AAAAAAAI U%DDu1ss 11010100 AAAAAAAI U%DDssSS 11010101 AAAAAAAI U%DDssSS 11010110 AAAAAAAI FDDDFSSS 11010111 AAAAAAAI FDDDFSSS 11011000 AAAAAAAI FDDDFSSS 11011001 AAAAAAAI FDDDFSSS 11011010 AAAAAAAI FDDDFSSS 11011011 AAAAAAAI FDDDFSSS 11011100 AAAAAAAI kkkkxx00 11011100 AAAAAAAI kkkkxx01 Algebraic syntax if (cond) execute(D_unit) dst = Smem ACx = Smem << #16 mar(Smem) push(Smem) delay(Smem) push(dbl(Lmem)) dbl(Lmem) = pop() Smem = pop() Smem = HI(ACx) Smem = src ACx = rnd(ACx + (Smem * coef(Cmem))) [,T3 = Smem], delay(Smem) ACx = rnd(Smem * coef(Cmem)) [,T3 = Smem] ACx = rnd(ACx + (Smem * coef(Cmem))) [,T3 = Smem] ACx = rnd(ACx (Smem * coef(Cmem))) [,T3 = Smem] ACy = rnd(ACy + (Smem * ACx)) [,T3 = Smem] ACy = rnd(ACy (Smem * ACx)) [,T3 = Smem] ACy = rnd(ACx + (Smem * Smem)) [,T3 = Smem] ACy = rnd(ACx (Smem * Smem)) [,T3 = Smem] ACy = rnd(Smem * ACx) [,T3 = Smem] ACx = rnd(Smem * Smem) [,T3 = Smem] ACx = rnd(uns(Tx * Smem)) [,T3 = Smem] ACy = rnd(ACx + (Tx * Smem)) [,T3 = Smem] ACy = rnd(ACx (Tx * Smem)) [,T3 = Smem] dst = src + Smem dst = src Smem dst = Smem src dst = src & Smem dst = src | Smem dst = src ^ Smem TC1 = bit(Smem, k4) TC2 = bit(Smem, k4)

6-10

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 11011100 AAAAAAAI 0000xx10 11011100 AAAAAAAI 0001xx10 11011100 AAAAAAAI 0010xx10 11011100 AAAAAAAI 0011xx10 11011100 AAAAAAAI 0100xx10 11011100 AAAAAAAI 0101xx10 11011100 AAAAAAAI 0110xx10 11011100 AAAAAAAI 0111xx10 11011100 AAAAAAAI 1000xx10 11011100 AAAAAAAI 1001xx10 11011100 AAAAAAAI 1010xx10 11011100 AAAAAAAI 1011xx10 11011100 AAAAAAAI 1100xx10 11011100 AAAAAAAI 1111xx10 11011100 AAAAAAAI x000xx11 11011100 AAAAAAAI x001xx11 11011100 AAAAAAAI x010xx11 11011100 AAAAAAAI x011xx11 11011100 AAAAAAAI x100xx11 11011101 AAAAAAAI SSDDss00 11011101 AAAAAAAI SSDDss01 11011101 AAAAAAAI SSDDss10 11011101 AAAAAAAI x%DDss11 11011110 AAAAAAAI SSDD0000 11011110 AAAAAAAI SSDD0001 11011110 AAAAAAAI SSDD0010 11011110 AAAAAAAI SSDD0011 11011110 AAAAAAAI SSDD0100 11011110 AAAAAAAI SSDD0101 11011110 AAAAAAAI SSDD0110 11011110 AAAAAAAI ssDD1000 DP = Smem CDP = Smem BSA01 = Smem BSA23 = Smem BSA45 = Smem BSA67 = Smem BSAC = Smem SP = Smem SSP = Smem BK03 = Smem BK47 = Smem BKC = Smem DPH = Smem PDP = Smem CSR = Smem BRC0 = Smem BRC1 = Smem TRN0 = Smem TRN1 = Smem ACy = ACx + (Smem << Tx) ACy = ACx (Smem << Tx) ACy = ads2c(Smem, ACx, Tx, TC1, TC2) ACx = rnd(Smem << Tx) ACy = adsc(Smem, ACx, TC1) ACy = adsc(Smem, ACx, TC2) ACy = adsc(Smem, ACx, TC1, TC2) subc(Smem, ACx, ACy) ACy = ACx + (Smem << #16) ACy = ACx (Smem << #16) ACy = (Smem << #16) ACx HI(ACx) = Smem + Tx, LO(ACx) = Smem Tx Algebraic syntax

SPRU375G

Instruction Opcodes in Sequential Order

6-11

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 11011110 AAAAAAAI ssDD1001 11011111 AAAAAAAI FDDD000u 11011111 AAAAAAAI FDDD001u 11011111 AAAAAAAI xxDD010u 11011111 AAAAAAAI SSDD100u 11011111 AAAAAAAI SSDD101u 11011111 AAAAAAAI SSDD110u 11011111 AAAAAAAI SSDD111u 11100000 AAAAAAAI FSSSxxxt 11100001 AAAAAAAI DDSHIFTW 11100010 AAAAAAAI DDSHIFTW 11100011 AAAAAAAI kkkk000x 11100011 AAAAAAAI kkkk001x 11100011 AAAAAAAI kkkk010x 11100011 AAAAAAAI kkkk011x 11100011 AAAAAAAI kkkk100x 11100011 AAAAAAAI kkkk101x 11100011 AAAAAAAI FSSS1100 11100011 AAAAAAAI FSSS1101 11100011 AAAAAAAI FSSS111x 11100100 AAAAAAAI FSSSx0xx 11100100 AAAAAAAI FDDDx1xx 11100101 AAAAAAAI FSSS01x0 11100101 AAAAAAAI FSSS01x1 11100101 AAAAAAAI 000010xx 11100101 AAAAAAAI 000110xx 11100101 AAAAAAAI 001010xx 11100101 AAAAAAAI 001110xx 11100101 AAAAAAAI 010010xx 11100101 AAAAAAAI 010110xx 11100101 AAAAAAAI 011010xx 11100101 AAAAAAAI 011110xx Algebraic syntax HI(ACx) = Smem Tx, LO(ACx) = Smem + Tx dst = uns(high_byte(Smem)) dst = uns(low_byte(Smem)) ACx = uns(Smem) ACy = ACx + uns(Smem) + CARRY ACy = ACx uns(Smem) BORROW ACy = ACx + uns(Smem) ACy = ACx uns(Smem) TCx = bit(Smem, src) ACx = low_byte(Smem) << #SHIFTW ACx = high_byte(Smem) << #SHIFTW TC1 = bit(Smem, k4), bit(Smem, k4) = #1 TC2 = bit(Smem, k4), bit(Smem, k4) = #1 TC1 = bit(Smem, k4), bit(Smem, k4) = #0 TC2 = bit(Smem, k4), bit(Smem, k4) = #0 TC1 = bit(Smem, k4), cbit(Smem, k4) TC2 = bit(Smem, k4), cbit(Smem, k4) bit(Smem, src) = #1 bit(Smem, src) = #0 cbit(Smem, src) push(src, Smem) dst, Smem = pop() high_byte(Smem) = src low_byte(Smem) = src Smem = DP Smem = CDP Smem = BSA01 Smem = BSA23 Smem = BSA45 Smem = BSA67 Smem = BSAC Smem = SP

6-12

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 11100101 AAAAAAAI 100010xx 11100101 AAAAAAAI 100110xx 11100101 AAAAAAAI 101010xx 11100101 AAAAAAAI 101110xx 11100101 AAAAAAAI 110010xx 11100101 AAAAAAAI 111110xx 11100101 AAAAAAAI x00011xx 11100101 AAAAAAAI x00111xx 11100101 AAAAAAAI x01011xx 11100101 AAAAAAAI x01111xx 11100101 AAAAAAAI x10011xx 11100110 AAAAAAAI KKKKKKKK 11100111 AAAAAAAI SSss00xx 11100111 AAAAAAAI SSss10x% 11100111 AAAAAAAI SSss11u% 11101000 AAAAAAAI SSxxx0x% 11101000 AAAAAAAI SSxxx1u% 11101001 AAAAAAAI SSSHIFTW 11101010 AAAAAAAI SSSHIFTW 11101011 AAAAAAAI xxxx01xx 11101011 AAAAAAAI xxSS10x0 11101011 AAAAAAAI xxSS10u1 11101011 AAAAAAAI FSSS1100 11101011 AAAAAAAI xxSS1101 11101011 AAAAAAAI xxSS1110 11101011 AAAAAAAI xxSS1111 11101100 AAAAAAAI FSSS000x 11101100 AAAAAAAI FSSS001x 11101100 AAAAAAAI FSSS010x 11101100 AAAAAAAI FSSS011x 11101100 AAAAAAAI FSSS100t 11101100 AAAAAAAI XDDD1110 Smem = SSP Smem = BK03 Smem = BK47 Smem = BKC Smem = DPH Smem = PDP Smem = CSR Smem = BRC0 Smem = BRC1 Smem = TRN0 Smem = TRN1 Smem = K8 Smem = LO(ACx << Tx) Smem = HI(rnd(ACx << Tx)) Smem = HI(saturate(uns(rnd(ACx << Tx)))) Smem = HI(rnd(ACx)) Smem = HI(saturate(uns(rnd(ACx)))) Smem = LO(ACx << #SHIFTW) Smem = HI(ACx << #SHIFTW) dbl(Lmem) = RETA dbl(Lmem) = ACx dbl(Lmem) = saturate(uns(ACx)) Lmem = pair(TAx) HI(Lmem) = HI(ACx) >> #1, LO(Lmem) = LO(ACx) >> #1 Lmem = pair(HI(ACx)) Lmem = pair(LO(ACx)) bit(src, Baddr) = #1 bit(src, Baddr) = #0 bit(src, pair(Baddr)) cbit(src, Baddr) TCx = bit(src, Baddr) XAdst = mar(Smem) Algebraic syntax

SPRU375G

Instruction Opcodes in Sequential Order

6-13

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 11101101 AAAAAAAI SSDD000n 11101101 AAAAAAAI SSDD001n 11101101 AAAAAAAI SSDD010x 11101101 AAAAAAAI xxxx011x 11101101 AAAAAAAI xxDD100g 11101101 AAAAAAAI xxDD101x 11101101 AAAAAAAI xxDD110x 11101101 AAAAAAAI FDDD111x 11101101 AAAAAAAI XDDD1111 11101101 AAAAAAAI XSSS0101 11101110 AAAAAAAI SSDD000x 11101110 AAAAAAAI SSDD001x 11101110 AAAAAAAI SSDD010x 11101110 AAAAAAAI ssDD011x 11101110 AAAAAAAI ssDD100x 11101110 AAAAAAAI ssDD101x 11101110 AAAAAAAI ssDD110x 11101110 AAAAAAAI ssDD111x 11101111 AAAAAAAI xxxx00mm 11101111 AAAAAAAI xxxx01mm 11101111 AAAAAAAI xxxx10mm 11101111 AAAAAAAI xxxx11mm 11110000 AAAAAAAI KKKKKKKK KKKKKKKK 11110001 AAAAAAAI KKKKKKKK KKKKKKKK 11110010 AAAAAAAI kkkkkkkk kkkkkkkk 11110011 AAAAAAAI kkkkkkkk kkkkkkkk 11110100 AAAAAAAI kkkkkkkk kkkkkkkk Algebraic syntax ACy = ACx + dbl(Lmem) ACy = ACx dbl(Lmem) ACy = dbl(Lmem) ACx RETA = dbl(Lmem) ACx = M40(dbl(Lmem)) pair(HI(ACx)) = Lmem pair(LO(ACx)) = Lmem pair(TAx) = Lmem XAdst = dbl(Lmem) dbl(Lmem) = XAsrc HI(ACy) = HI(Lmem) + HI(ACx), LO(ACy) = LO(Lmem) + LO(ACx) HI(ACy) = HI(ACx) HI(Lmem), LO(ACy) = LO(ACx) LO(Lmem) HI(ACy) = HI(Lmem) HI(ACx), LO(ACy) = LO(Lmem) LO(ACx) HI(ACx) = Tx HI(Lmem), LO(ACx) = Tx LO(Lmem) HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) + Tx HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) Tx HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) Tx HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) + Tx Smem = coef(Cmem) coef(Cmem) = Smem Lmem = dbl(coef(Cmem)) dbl(coef(Cmem)) = Lmem TC1 = (Smem == K16) TC2 = (Smem == K16) TC1 = Smem & k16 TC2 = Smem & k16 Smem = Smem & k16

6-14

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcodes

Table 61. Instruction Set Opcodes (Continued)


Opcode 11110101 AAAAAAAI kkkkkkkk kkkkkkkk 11110110 AAAAAAAI kkkkkkkk kkkkkkkk 11110111 AAAAAAAI KKKKKKKK KKKKKKKK 11111000 AAAAAAAI KKKKKKKK xxDDx0U% 11111000 AAAAAAAI KKKKKKKK SSDDx1U% 11111001 AAAAAAAI uxSHIFTW SSDD00xx 11111001 AAAAAAAI uxSHIFTW SSDD01xx 11111001 AAAAAAAI uxSHIFTW xxDD10xx 11111010 AAAAAAAI xxSHIFTW SSxxx0x% 11111010 AAAAAAAI uxSHIFTW SSxxx1x% 11111011 AAAAAAAI KKKKKKKK KKKKKKKK 11111100 AAAAAAAI LLLLLLLL LLLLLLLL Algebraic syntax Smem = Smem | k16 Smem = Smem ^ k16 Smem = Smem + K16 ACx = rnd(Smem * K8) [,T3 = Smem] ACy = rnd(ACx + (Smem * K8)) [,T3 = Smem] ACy = ACx + (uns(Smem) << #SHIFTW) ACy = ACx (uns(Smem) << #SHIFTW) ACx = uns(Smem) << #SHIFTW Smem = HI(rnd(ACx << #SHIFTW)) Smem = HI(saturate(uns(rnd(ACx << #SHIFTW)))) Smem = K16 if (ARn_mod != #0) goto L16

SPRU375G

Instruction Opcodes in Sequential Order

6-15

Instruction Set Opcode Symbols and Abbreviations

6.2 Instruction Set Opcode Symbols and Abbreviations


Table 62 lists the symbols and abbreviations used in the instruction set opcode.

Table 62. Instruction Set Opcode Symbols and Abbreviations


Bit Field Name % Bit Field Value 0 1 Bit Field Description Rounding is disabled Rounding is enabled

AAAA AAAI AAAA AAA0 AAAA AAA1 0001 0001 0011 0001 0101 0001 0111 0001 1001 0001 1011 0001 1101 0001 1111 0001 PPP0 0001 PPP0 0011 PPP0 0101 PPP0 0111 PPP0 1001 PPP0 1011 PPP0 1101 PPP0 1111 PPP1 0011 PPP1 0101

Smem addressing mode: @dma, direct memory address (dma) direct access Smem indirect memory access: ABS16(#k16) *(#k23) *port(#k16) *CDP *CDP+ *CDP *CDP(#K16) *+CDP(#K16) *ARn *ARn+ *ARn *(ARn + T0), when C54CM = 0 *(ARn + T0), when C54CM = 1 *(ARn T0), when C54CM = 0 *(ARn T0), when C54CM = 1 *ARn(T0), when C54CM = 0 *ARn(T0), when C54CM = 1 *ARn(#K16) *+ARn(#K16) *(ARn + T1), when ARMS = 0 *ARn(short(#1)), when ARMS = 1 *(ARn T1), when ARMS = 0 *ARn(short(#2)), when ARMS = 1

6-16

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcode Symbols and Abbreviations

Table 62. Instruction Set Opcode Symbols and Abbreviations (Continued)


Bit Field Name Bit Field Value PPP1 0111 PPP1 1001 PPP1 1011 PPP1 1101 PPP1 1111 Bit Field Description *ARn(T1), when ARMS = 0 *ARn(short(#3)), when ARMS = 1 *+ARn, when ARMS = 0 *ARn(short(#4)), when ARMS = 1 *ARn, when ARMS = 0 *ARn(short(#5)), when ARMS = 1 *(ARn + T0B), when ARMS = 0 *ARn(short(#6)), when ARMS = 1 *(ARn T0B), when ARMS = 0 *ARn(short(#7)), when ARMS = 1

PPP encodes an auxiliary register (ARn) as for XXX and YYY.

cc 00 01 10 11

Relational operators (RELOP): == < >= != (equal to) (less than) (greater than or equal to) (not equal to)

CCC CCCC 000 FSSS 001 FSSS 010 FSSS 011 FSSS 100 FSSS 101 FSSS 110 00SS 110 0100 110 0101 110 0110 110 0111

Conditional field (cond) on source accumulator, auxiliary, or temporary register; TCx; and CARRY: src == 0 src != 0 src < 0 src <= 0 src > 0 src >= 0 (source is equal to 0) (source is not equal to 0) (source is less than 0) (source is less than or equal to 0) (source is greater than 0) (source is greater than or equal to 0)

overflow(ACx) (source accumulator overflow status bit (ACOVx) is tested against 1) TC1 TC2 CARRY Reserved (status bit is tested against 1) (status bit is tested against 1) (status bit is tested against 1)

SPRU375G

Instruction Opcodes in Sequential Order

6-17

Instruction Set Opcode Symbols and Abbreviations

Table 62. Instruction Set Opcode Symbols and Abbreviations (Continued)


Bit Field Name Bit Field Value 110 1000 110 1001 110 1010 110 1011 110 11xx 111 00SS 111 0100 111 0101 111 0110 111 0111 111 1000 111 1001 111 1010 111 1011 111 1100 111 1101 111 1110 111 1111 Bit Field Description TC1 & TC2 TC1 & !TC2 !TC1 & TC2 !TC1 & !TC2 Reserved !overflow(ACx)(source accumulator overflow status bit (ACOVx) is tested against 0) !TC1 !TC2 !CARRY Reserved TC1 | TC2 TC1 | !TC2 !TC1 | TC2 !TC1 | !TC2 TC1 ^ TC2 TC1 ^ !TC2 !TC1 ^ TC2 !TC1 ^ !TC2 (status bit is tested against 0) (status bit is tested against 0) (status bit is tested against 0)

dd 00 01 10 11

Destination temporary register (Tx, Ty): Temporary register 0 (T0) Temporary register 1 (T1) Temporary register 2 (T2) Temporary register 3 (T3)

6-18

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcode Symbols and Abbreviations

Table 62. Instruction Set Opcode Symbols and Abbreviations (Continued)


Bit Field Name DD 00 01 10 11 Bit Field Value Bit Field Description Destination accumulator register (ACw, ACx, ACy, ACz): Accumulator 0 (AC0) Accumulator 1 (AC1) Accumulator 2 (AC2) Accumulator 3 (AC3)

DDD . . . D

Data address label coded on n bits (absolute address)

0 1

Parallel Enable bit is cleared to 0 Parallel Enable bit is set to 1

FDDD FSSS 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001 1010 1011 1100 1101 1110 1111

Destination or Source accumulator, auxiliary, or temporary register (dst, src, TAx, TAy): Accumulator 0 (AC0) Accumulator 1 (AC1) Accumulator 2 (AC2) Accumulator 3 (AC3) Temporary register 0 (T0) Temporary register 1 (T1) Temporary register 2 (T2) Temporary register 3 (T3) Auxiliary register 0 (AR0) Auxiliary register 1 (AR1) Auxiliary register 2 (AR2) Auxiliary register 3 (AR3) Auxiliary register 4 (AR4) Auxiliary register 5 (AR5) Auxiliary register 6 (AR6) Auxiliary register 7 (AR7)

SPRU375G

Instruction Opcodes in Sequential Order

6-19

Instruction Set Opcode Symbols and Abbreviations

Table 62. Instruction Set Opcode Symbols and Abbreviations (Continued)


Bit Field Name g Bit Field Value 0 1 Bit Field Description 40 keyword is not applied 40 keyword is applied; M40 is locally set to 1

kk kkkk 00 0000 00 0001 00 0100 00 0101 00 1000 00 1001 00 1100 00 1101 00 1110 00 1111 01 0000 01 0001 01 0100 01 0101 01 1000 01 1001 01 1100 01 1101 01 1110 01 1111 10 1000 10 1100 11 1000 11 1100 1x 0000 1x 0001

Swap code for Swap Register Content instruction: swap(AC0, AC2) swap(AC1, AC3) swap(T0, T2) swap(T1, T3) swap(AR0, AR2) swap(AR1, AR3) swap(AR4, T0) swap(AR5, T1) swap(AR6, T2) swap(AR7, T3) swap(pair(AC0), pair(AC2)) Reserved swap(pair(T0), pair(T2)) Reserved swap(pair(AR0), pair(AR2)) Reserved swap(pair(AR4), pair(T0)) Reserved swap(pair(AR6), pair(T2)) Reserved Reserved swap(block(AR4), block(T0)) swap(AR0, AR1) Reserved Reserved Reserved

6-20

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcode Symbols and Abbreviations

Table 62. Instruction Set Opcode Symbols and Abbreviations (Continued)


Bit Field Name Bit Field Value 1x 0100 1x 0101 1x 1001 1x 1101 1x 1110 1x 1111 Bit Field Description Reserved Reserved Reserved Reserved Reserved Reserved

kkk . . . k

Unsigned constant of n bits

KKK . . . K

Signed constant of n bits

lll . . . l

Program address label coded on n bits (unsigned offset relative to program counter register)

LLL . . . L

Program address label coded on n bits (signed offset relative to program counter register)

mm 00 01 10 11

Coefficient addressing mode (Cmem): *CDP *CDP+ *CDP *(CDP + T0)

MMM 000 001 010 011 100 101

Modifier option for Xmem or Ymem addressing mode: *ARn *ARn+ *ARn *(ARn + T0), when C54CM = 0 *(ARn + AR0), when C54CM = 1 *(ARn + T1) *(ARn T0), when C54CM = 0 *(ARn AR0), when C54CM = 1

SPRU375G

Instruction Opcodes in Sequential Order

6-21

Instruction Set Opcode Symbols and Abbreviations

Table 62. Instruction Set Opcode Symbols and Abbreviations (Continued)


Bit Field Name Bit Field Value 110 111 Bit Field Description *(ARn T1) *ARn(T0), when C54CM = 0 *ARn(AR0), when C54CM = 1

Reserved bit

PPP . . . P

Program or data address label coded on n bits (absolute address)

0 1

Select TRN0 Select TRN1

SHFT

4-bit immediate shift value, 0 to 15

SHIFTW

6-bit immediate shift value, 32 to +31

ss 00 01 10 11

Source temporary register (Tx, Ty): Temporary register 0 (T0) Temporary register 1 (T1) Temporary register 2 (T2) Temporary register 3 (T3)

SS 00 01 10 11

Source accumulator register (ACw, ACx, ACy, ACz): Accumulator 0 (AC0) Accumulator 1 (AC1) Accumulator 2 (AC2) Accumulator 3 (AC3)

6-22

Instruction Opcodes in Sequential Order

SPRU375G

Instruction Set Opcode Symbols and Abbreviations

Table 62. Instruction Set Opcode Symbols and Abbreviations (Continued)


Bit Field Name tt Bit Field Value 00 01 10 11 Bit Field Description Bit 0: destination TCy bit of Compare Register Content instruction Bit 1: source TCx bit of Compare Register Content instruction When value = 0: TC1 is selected When value = 1: TC2 is selected

0 1

uns keyword is not applied; operand is considered signed uns keyword is applied; operand is considered unsigned

0 1

No update of T3 with Smem or Xmem content T3 is updated with Smem or Xmem content

vv

00 01 10 11

Bit 0: shifted-out bit of Rotate instruction Bit 1: shifted-in bit of Rotate instruction When value = 0: CARRY is selected When value = 1: TC2 is selected

Reserved bit

XDDD XSSS 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001

Destination or Source accumulator or extended register. All 23 bits of stack pointer (XSP), system stack pointer (XSSP), data page pointer (XDP), coefficient data pointer (XCDP), and extended auxiliary register (XARx). Accumulator 0 (AC0) Accumulator 1 (AC1) Accumulator 2 (AC2) Accumulator 3 (AC3) Stack pointer (XSP) System stack pointer (XSSP) Data page pointer (XDP) Coefficient data pointer (XCDP) Auxiliary register 0 (XAR0) Auxiliary register 1 (XAR1)

SPRU375G

Instruction Opcodes in Sequential Order

6-23

Instruction Set Opcode Symbols and Abbreviations

Table 62. Instruction Set Opcode Symbols and Abbreviations (Continued)


Bit Field Name Bit Field Value 1010 1011 1100 1101 1110 1111 Bit Field Description Auxiliary register 2 (XAR2) Auxiliary register 3 (XAR3) Auxiliary register 4 (XAR4) Auxiliary register 5 (XAR5) Auxiliary register 6 (XAR6) Auxiliary register 7 (XAR7)

XXX YYY 000 001 010 011 100 101 110 111

Auxiliary register designation for Xmem or Ymem addressing mode: Auxiliary register 0 (AR0) Auxiliary register 1 (AR1) Auxiliary register 2 (AR2) Auxiliary register 3 (AR3) Auxiliary register 4 (AR4) Auxiliary register 5 (AR5) Auxiliary register 6 (AR6) Auxiliary register 7 (AR7)

6-24

Instruction Opcodes in Sequential Order

SPRU375G

Chapter 7

Cross-Reference of Algebraic and Mnemonic Instruction Sets


This chapter provides a cross-reference between the TMS320C55x DSP algebraic instruction set and the mnemonic instruction set (Table 71). For more information on the mnemonic instruction set, see TMS320C55x DSP Mnemonic Instruction Set Reference Guide, SPRU374.

7-1

7-2 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets


Algebraic Syntax Absolute Distance abdst(Xmem, Ymem, ACx, ACy) Absolute Value dst = |src| Addition dst = dst + src dst = dst + k4 dst = src + K16 dst = src + Smem ACy = ACy + (ACx << Tx) ACy = ACy + (ACx << #SHIFTW) ACy = ACx + (K16 << #16) ACy = ACx + (K16 << #SHFT) ACy = ACx + (Smem << Tx) ACy = ACx + (Smem << #16) ACy = ACx + uns(Smem) + CARRY ACy = ACx + uns(Smem) ACy = ACx + (uns(Smem) << #SHIFTW) ACy = ACx + dbl(Lmem) ACx = (Xmem << #16) + (Ymem << #16) Smem = Smem + K16 Mnemonic Syntax ABDST: Absolute Distance ABDST Xmem, Ymem, ACx, ACy ABS: Absolute Value ABS [src,] dst ADD: Addition ADD [src,] dst ADD k4, dst ADD K16, [src,] dst ADD Smem, [src,] dst ADD ACx << Tx, ACy ADD ACx << #SHIFTW, ACy ADD K16 << #16, [ACx,] ACy ADD K16 << #SHFT, [ACx,] ACy ADD Smem << Tx, [ACx,] ACy ADD Smem << #16, [ACx,] ACy ADD [uns(]Smem[)], CARRY, [ACx,] ACy ADD [uns(]Smem[)], [ACx,] ACy ADD [uns(]Smem[)] << #SHIFTW, [ACx,] ACy ADD dbl(Lmem), [ACx,] ACy ADD Xmem, Ymem, ACx ADD K16, Smem

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-3

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Addition with Absolute Value ACy = rnd(ACy + |ACx|) Addition with Parallel Store Accumulator Content to Memory ACy = ACx + (Xmem << #16), Ymem = HI(ACy << T2) Addition or Subtraction Conditionally ACy = adsc(Smem, ACx, TCx) Addition or Subtraction Conditionally with Shift ACy = ads2c(Smem, ACx, Tx, TC1, TC2) Addition, Subtraction, or Move Accumulator Content Conditionally ACy = adsc(Smem, ACx, TC1, TC2) Bitwise AND dst = dst & src dst = src & k8 dst = src & k16 dst = src & Smem ACy = ACy & (ACx <<< #SHIFTW) ACy = ACx & (k16 <<< #16) ACy = ACx & (k16 <<< #SHFT) Mnemonic Syntax ADDV: Addition with Absolute Value ADD[R]V [ACx,] ACy ADD::MOV: Addition with Parallel Store Accumulator Content to Memory ADD Xmem << #16, ACx, ACy :: MOV HI(ACy << T2), Ymem ADDSUBCC: Addition or Subtraction Conditionally ADDSUBCC Smem, ACx, TCx, ACy Cross-Reference of Algebraic and Mnemonic Instruction Sets ADDSUB2CC: Addition or Subtraction Conditionally with Shift ADDSUB2CC Smem, ACx, Tx, TC1, TC2, ACy ADDSUBCC: Addition, Subtraction, or Move Accumulator Content Conditionally ADDSUBCC Smem, ACx, TC1, TC2, ACy AND: Bitwise AND AND src, dst AND k8,src, dst AND k16, src, dst AND Smem, src, dst AND ACx << #SHIFTW[, ACy] AND k16 << #16, [ACx,] ACy AND k16 << #SHFT, [ACx,] ACy

7-4 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Smem = Smem & k16 Bitwise AND Memory with Immediate Value and Compare to Zero TCx = Smem & k16 Bitwise OR dst = dst | src dst = src | k8 dst = src | k16 dst = src | Smem ACy = ACy | (ACx <<< #SHIFTW) ACy = ACx | (k16 <<< #16) ACy = ACx | (k16 <<< #SHFT) Smem = Smem | k16 Bitwise Exclusive OR (XOR) dst = dst ^ src dst = src ^ k8 dst = src ^ k16 dst = src ^ Smem ACy = ACy ^ (ACx <<< #SHIFTW) ACy = ACx ^ (k16 <<< #16) ACy = ACx ^ (k16 <<< #SHFT) Mnemonic Syntax AND k16, Smem BAND: Bitwise AND Memory with Immediate Value and Compare to Zero BAND Smem, k16, TCx OR: Bitwise OR OR src, dst OR k8, src, dst OR k16, src, dst OR Smem, src, dst OR ACx << #SHIFTW[, ACy] OR k16 << #16, [ACx,] ACy OR k16 << #SHFT, [ACx,] ACy OR k16, Smem XOR: Bitwise Exclusive OR (XOR) XOR src, dst XOR k8, src, dst XOR k16, src, dst XOR Smem, src, dst XOR ACx << #SHIFTW[, ACy] XOR k16 << #16, [ACx,] ACy XOR k16 << #SHFT, [ACx,] ACy

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-5

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Smem = Smem ^ k16 Branch Conditionally if (cond) goto l4 if (cond) goto L8 if (cond) goto L16 if (cond) goto P24 Branch Unconditionally goto ACx goto L7 goto L16 goto P24 Branch on Auxiliary Register Not Zero if (ARn_mod != #0) goto L16 Call Conditionally if (cond) call L16 if (cond) call P24 Mnemonic Syntax XOR k16, Smem BCC: Branch Conditionally BCC l4, cond BCC L8, cond BCC L16, cond BCC P24, cond B: Branch Unconditionally B ACx Cross-Reference of Algebraic and Mnemonic Instruction Sets B L7 B L16 B P24 BCC: Branch on Auxiliary Register Not Zero BCC L16, ARn_mod != #0 CALLCC: Call Conditionally CALLCC L16, cond CALLCC P24, cond

7-6 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Call Unconditionally call ACx call L16 call P24 Circular Addressing Qualifier circular() Clear Accumulator, Auxiliary, or Temporary Register Bit bit(src, Baddr) = #0 Clear Memory Bit bit(Smem, src) = #0 Clear Status Register Bit bit(STx, k4) = #0 Mnemonic Syntax CALL: Call Unconditionally CALL ACx CALL L16 CALL P24 .CR: Circular Addressing Qualifier <instruction>.CR BCLR: Clear Accumulator, Auxiliary, or Temporary Register Bit BCLR Baddr, src BCLR: Clear Memory Bit BCLR src, Smem BCLR: Clear Status Register Bit BCLR k4, STx_55 BCLR fname Compare Accumulator, Auxiliary, or Temporary Register Content TCx = uns(src RELOP dst) Compare Accumulator, Auxiliary, or Temporary Register Content with AND TCx = TCy & uns(src RELOP dst) TCx = !TCy & uns(src RELOP dst) CMP: Compare Accumulator, Auxiliary, or Temporary Register Content CMP[U] src RELOP dst, TCx CMPAND: Compare Accumulator, Auxiliary, or Temporary Register Content with AND CMPAND[U] src RELOP dst, TCy, TCx CMPAND[U] src RELOP dst, !TCy, TCx

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-7

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Compare Accumulator, Auxiliary, or Temporary Register Content with OR TCx = TCy | uns(src RELOP dst) TCx = !TCy | uns(src RELOP dst) Compare Accumulator, Auxiliary, or Temporary Register Content Maximum dst = max(src, dst) Compare Accumulator, Auxiliary, or Temporary Register Content Minimum dst = min(src, dst) Compare and Branch compare (uns(src RELOP K8)) goto L8 Compare and Select Accumulator Content Maximum max_diff(ACx, ACy, ACz, ACw) max_diff_dbl(ACx, ACy, ACz, ACw, TRNx) Compare and Select Accumulator Content Minimum min_diff(ACx, ACy, ACz, ACw) min_diff_dbl(ACx, ACy, ACz, ACw, TRNx) Compare Memory with Immediate Value TCx = (Smem == K16) Mnemonic Syntax CMPOR: Compare Accumulator, Auxiliary, or Temporary Register Content with OR CMPOR[U] src RELOP dst, TCy, TCx CMPOR[U] src RELOP dst, !TCy, TCx MAX: Compare Accumulator, Auxiliary, or Temporary Register Content Maximum MAX [src,] dst MIN: Compare Accumulator, Auxiliary, or Temporary Register Content Minimum Cross-Reference of Algebraic and Mnemonic Instruction Sets MIN [src,] dst BCC: Compare and Branch BCC[U] L8, src RELOP K8 MAXDIFF: Compare and Select Accumulator Content Maximum MAXDIFF ACx, ACy, ACz, ACw DMAXDIFF ACx, ACy, ACz, ACw, TRNx MINDIFF: Compare and Select Accumulator Content Minimum MINDIFF ACx, ACy, ACz, ACw DMINDIFF ACx, ACy, ACz, ACw, TRNx CMP: Compare Memory with Immediate Value CMP Smem == K16, TCx

7-8 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Complement Accumulator, Auxiliary, or Temporary Register Bit cbit(src, Baddr) Complement Accumulator, Auxiliary, or Temporary Register Content dst = ~src Complement Memory Bit cbit(Smem, src) Compute Exponent of Accumulator Content Tx = exp(ACx) Compute Mantissa and Exponent of Accumulator Content ACy = mant(ACx), Tx = exp(ACx) Mnemonic Syntax BNOT: Complement Accumulator, Auxiliary, or Temporary Register Bit BNOT Baddr, src NOT: Complement Accumulator, Auxiliary, or Temporary Register Content NOT [src,] dst BNOT: Complement Memory Bit BNOT src, Smem EXP: Compute Exponent of Accumulator Content EXP ACx, Tx MANT::NEXP: Compute Mantissa and Exponent of Accumulator Content MANT ACx, ACy :: NEXP ACx, Tx BCNT: Count Accumulator Bits BCNT ACx, ACy, TCx, Tx

Count Accumulator Bits Tx = count(ACx, ACy, TCx)

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-9

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Dual 16-Bit Additions HI(ACy) = HI(Lmem) + HI(ACx), LO(ACy) = LO(Lmem) + LO(ACx) HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) + Tx Mnemonic Syntax ADD: Dual 16-Bit Additions ADD dual(Lmem), [ACx,] ACy ADD dual(Lmem), Tx, ACx

Dual 16-Bit Addition and Subtraction HI(ACx) = Smem + Tx, LO(ACx) = Smem Tx HI(ACx) = HI(Lmem) + Tx, LO(ACx) = LO(Lmem) Tx

ADDSUB: Dual 16-Bit Addition and Subtraction ADDSUB Tx, Smem, ACx ADDSUB Tx, dual(Lmem), ACx Cross-Reference of Algebraic and Mnemonic Instruction Sets

Dual 16-Bit Subtractions HI(ACy) = HI(ACx) HI(Lmem), LO(ACy) = LO(ACx) LO(Lmem) HI(ACy) = HI(Lmem) HI(ACx), LO(ACy) = LO(Lmem) LO(ACx) HI(ACx) = Tx HI(Lmem), LO(ACx) = Tx LO(Lmem) HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) Tx

SUB: Dual 16-Bit Subtractions SUB dual(Lmem), [ACx,] ACy SUB ACx, dual(Lmem), ACy SUB dual(Lmem), Tx, ACx SUB Tx, dual(Lmem), ACx

Dual 16-Bit Subtraction and Addition HI(ACx) = Smem Tx, LO(ACx) = Smem + Tx HI(ACx) = HI(Lmem) Tx, LO(ACx) = LO(Lmem) + Tx

SUBADD: Dual 16-Bit Subtraction and Addition SUBADD Tx, Smem, ACx SUBADD Tx, dual(Lmem), ACx

7-10 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Execute Conditionally if (cond) execute(AD_Unit) if (cond) execute(D_Unit) Expand Accumulator Bit Field dst = field_expand(ACx, k16) Extract Accumulator Bit Field dst = field_extract(ACx, k16) Finite Impulse Response Filter, Antisymmetrical firsn(Xmem, Ymem, coef(Cmem), ACx, ACy) Finite Impulse Response Filter, Symmetrical firs(Xmem, Ymem, coef(Cmem), ACx, ACy) Idle idle Least Mean Square (LMS) lms(Xmem, Ymem, ACx, ACy) Linear Addressing Qualifier linear() Mnemonic Syntax XCC: Execute Conditionally XCC [label, ]cond XCCPART [label, ]cond BFXPA: Expand Accumulator Bit Field BFXPA k16, ACx, dst BFXTR: Extract Accumulator Bit Field BFXTR k16, ACx, dst FIRSSUB: Finite Impulse Response Filter, Antisymmetrical FIRSSUB Xmem, Ymem, Cmem, ACx, ACy FIRSADD: Finite Impulse Response Filter, Symmetrical FIRSADD Xmem, Ymem, Cmem, ACx, ACy IDLE IDLE LMS: Least Mean Square LMS Xmem, Ymem, ACx, ACy .LR: Linear Addressing Qualifier <instruction>.LR

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-11

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Load Accumulator from Memory ACx = rnd(Smem << Tx) ACx = low_byte(Smem) << #SHIFTW ACx = high_byte(Smem) << #SHIFTW ACx = Smem << #16 ACx = uns(Smem) ACx = uns(Smem) << #SHIFTW ACx = M40(dbl(Lmem)) LO(ACx) = Xmem, HI(ACx) = Ymem Load Accumulator from Memory with Parallel Store Accumulator Content to Memory ACy = Xmem << #16, Ymem = HI(ACx << T2) Load Accumulator Pair from Memory pair(HI(ACx)) = Lmem pair(LO(ACx)) = Lmem Load Accumulator with Immediate Value ACx = K16 << #16 ACx = K16 << #SHFT Mnemonic Syntax MOV: Load Accumulator from Memory MOV [rnd(]Smem << Tx[)], ACx MOV low_byte(Smem) << #SHIFTW, ACx MOV high_byte(Smem) << #SHIFTW, ACx MOV Smem << #16, ACx MOV [uns(]Smem[)], ACx MOV [uns(]Smem[)] << #SHIFTW, ACx MOV[40] dbl(Lmem), ACx MOV Xmem, Ymem, ACx Cross-Reference of Algebraic and Mnemonic Instruction Sets

MOV::MOV: Load Accumulator from Memory with Parallel Store Accumulator Content to Memory MOV Xmem << #16, ACy :: MOV HI(ACx << T2), Ymem MOV: Load Accumulator Pair from Memory MOV dbl(Lmem), pair(HI(ACx)) MOV dbl(Lmem), pair(LO(ACx)) MOV: Load Accumulator with Immediate Value MOV K16 << #16, ACx MOV K16 << #SHFT, ACx

7-12 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Load Accumulator, Auxiliary, or Temporary Register from Memory dst = Smem dst = uns(high_byte(Smem)) dst = uns(low_byte(Smem)) Load Accumulator, Auxiliary, or Temporary Register with Immediate Value dst = k4 dst = k4 dst = K16 Load Auxiliary or Temporary Register Pair from Memory pair(TAx) = Lmem Load CPU Register from Memory BK03 = Smem BK47 = Smem BKC = Smem BSA01 = Smem BSA23 = Smem BSA45 = Smem BSA67 = Smem BSAC = Smem BRC0 = Smem Mnemonic Syntax MOV: Load Accumulator, Auxiliary, or Temporary Register from Memory MOV Smem, dst MOV [uns(]high_byte(Smem)[)], dst MOV [uns(]low_byte(Smem)[)], dst MOV: Load Accumulator, Auxiliary, or Temporary Register with Immediate Value MOV k4, dst MOV k4, dst MOV K16, dst MOV: Load Auxiliary or Temporary Register Pair from Memory MOV dbl(Lmem), pair(TAx) MOV: Load CPU Register from Memory MOV Smem, BK03 MOV Smem, BK47 MOV Smem, BKC MOV Smem, BSA01 MOV Smem, BSA23 MOV Smem, BSA45 MOV Smem, BSA67 MOV Smem, BSAC MOV Smem, BRC0

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-13

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax BRC1 = Smem CDP = Smem CSR = Smem DP = Smem DPH = Smem PDP = Smem SP = Smem SSP = Smem TRN0 = Smem TRN1 = Smem RETA = dbl(Lmem) Load CPU Register with Immediate Value BK03 = k12 BK47 = k12 BKC = k12 BRC0 = k12 BRC1 = k12 CSR = k12 DPH = k7 PDP = k9 BSA01 = k16 BSA23 = k16 Mnemonic Syntax MOV Smem, BRC1 MOV Smem, CDP MOV Smem, CSR MOV Smem, DP MOV Smem, DPH MOV Smem, PDP MOV Smem, SP MOV Smem, SSP MOV Smem, TRN0 Cross-Reference of Algebraic and Mnemonic Instruction Sets MOV Smem, TRN1 MOV dbl(Lmem), RETA MOV: Load CPU Register with Immediate Value MOV k12, BK03 MOV k12, BK47 MOV k12, BKC MOV k12, BRC0 MOV k12, BRC1 MOV k12, CSR MOV k7, DPH MOV k9, PDP MOV k16, BSA01 MOV k16, BSA23

7-14 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax BSA45 = k16 BSA67 = k16 BSAC = k16 CDP = k16 DP = k16 SP = k16 SSP = k16 Load Extended Auxiliary Register from Memory XAdst = dbl(Lmem) Load Extended Auxiliary Register with Immediate Value XAdst = k23 Load Memory with Immediate Value Smem = K8 Smem = K16 Memory Delay delay(Smem) Memory-Mapped Register Access Qualifier mmap() Mnemonic Syntax MOV k16, BSA45 MOV k16, BSA67 MOV k16, BSAC MOV k16, CDP MOV k16, DP MOV k16, SP MOV k16, SSP MOV: Load Extended Auxiliary Register from Memory MOV dbl(Lmem), XAdst AMOV: Load Extended Auxiliary Register with Immediate Value AMOV k23, XAdst MOV: Load Memory with Immediate Value MOV K8, Smem MOV K16, Smem DELAY: Memory Delay DELAY Smem mmap: Memory-Mapped Register Access Qualifier mmap

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-15

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Modify Auxiliary Register Content mar(Smem) Modify Auxiliary Register Content with Parallel Multiply mar(Xmem), ACx = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Modify Auxiliary Register Content with Parallel Multiply and Accumulate mar(Xmem), ACx = M40(rnd(ACx + (uns(Ymem) * uns(coef(Cmem))))) mar(Xmem), ACx = M40(rnd((ACx >> #16) + (uns(Ymem) * uns(coef(Cmem))))) Modify Auxiliary Register Content with Parallel Multiply and Subtract mar(Xmem), ACx = M40(rnd(ACx (uns(Ymem) * uns(coef(Cmem))))) Modify Auxiliary or Temporary Register Content mar(TAy = TAx) mar(TAx = P8) mar(TAx = D16) Mnemonic Syntax AMAR: Modify Auxiliary Register Content AMAR Smem AMAR::MPY: Modify Auxiliary Register Content with Parallel Multiply AMAR Xmem :: MPY[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACx AMAR::MAC: Modify Auxiliary Register Content with Parallel Multiply and Accumulate AMAR Xmem :: MAC[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACx AMAR Xmem :: MAC[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACx >> #16 AMAR::MAS: Modify Auxiliary Register Content with Parallel Multiply and Subtract AMAR Xmem :: MAS[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACx AMOV: Modify Auxiliary or Temporary Register Content AMOV TAx, TAy AMOV P8, TAx AMOV D16, TAx

Cross-Reference of Algebraic and Mnemonic Instruction Sets

7-16 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Modify Auxiliary or Temporary Register Content by Addition mar(TAy + TAx) mar(TAx + P8) Mnemonic Syntax AADD: Modify Auxiliary or Temporary Register Content by Addition AADD TAx, TAy AADD P8, TAx

Modify Auxiliary or Temporary Register Content by Subtraction ASUB: Modify Auxiliary or Temporary Register Content by Subtraction mar(TAy TAx) mar(TAx P8) Modify Data Stack Pointer SP = SP + K8 Modify Extended Auxiliary Register Content XAdst = mar(Smem) Move Accumulator Content to Auxiliary or Temporary Register TAx = HI(ACx) Move Accumulator, Auxiliary, or Temporary Register Content dst = src Move Auxiliary or Temporary Register Content to Accumulator HI(ACx) = TAx ASUB TAx, TAy ASUB P8, TAx AADD: Modify Data Stack Pointer (SP) AADD K8, SP AMAR: Modify Extended Auxiliary Register Content AMAR Smem, XAdst MOV: Move Accumulator Content to Auxiliary or Temporary Register MOV HI(ACx), TAx MOV: Move Accumulator, Auxiliary, or Temporary Register Content MOV src, dst MOV: Move Auxiliary or Temporary Register Content to Accumulator MOV TAx, HI(ACx)

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-17

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Mnemonic Syntax

Move Auxiliary or Temporary Register Content to CPU Register MOV: Move Auxiliary or Temporary Register Content to CPU Register BRC0 = TAx BRC1 = TAx CDP = TAx CSR = TAx SP = TAx SSP = TAx MOV TAx, BRC0 MOV TAx, BRC1 MOV TAx, CDP MOV TAx, CSR MOV TAx, SP MOV TAx, SSP

Move CPU Register Content to Auxiliary or Temporary Register MOV: Move CPU Register Content to Auxiliary or Temporary Register TAx = BRC0 TAx = BRC1 TAx = CDP TAx = RPTC TAx = SP TAx = SSP Move Extended Auxiliary Register Content xdst = xsrc MOV BRC0, TAx MOV BRC1, TAx MOV CDP, TAx MOV RPTC, TAx MOV SP, TAx MOV SSP, TAx MOV: Move Extended Auxiliary Register Content MOV xsrc, xdst

Cross-Reference of Algebraic and Mnemonic Instruction Sets

7-18 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Move Memory to Memory Smem = coef(Cmem) coef(Cmem) = Smem Lmem = dbl(coef(Cmem)) dbl(coef(Cmem)) = Lmem dbl(Ymem) = dbl(Xmem) Ymem = Xmem Multiply ACy = rnd(ACy * ACx) ACy = rnd(ACx * Tx) ACy = rnd(ACx * K8) ACy = rnd(ACx * K16) ACx = rnd(Smem * coef(Cmem))[, T3 = Smem] ACy = rnd(Smem * ACx)[, T3 = Smem] ACx = rnd(Smem * K8)[, T3 = Smem] ACx = M40(rnd(uns(Xmem) * uns(Ymem)))[, T3 = Xmem] ACx = rnd(uns(Tx * Smem))[, T3 = Smem] Multiply with Parallel Multiply and Accumulate ACx = M40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) Mnemonic Syntax MOV: Move Memory to Memory MOV Cmem, Smem MOV Smem, Cmem MOV Cmem, dbl(Lmem) MOV dbl(Lmem), Cmem MOV dbl(Xmem), dbl(Ymem) MOV Xmem, Ymem MPY: Multiply MPY[R] [ACx,] ACy MPY[R] Tx, [ACx,] ACy MPYK[R] K8, [ACx,] ACy MPYK[R] K16, [ACx,] ACy MPYM[R] [T3 = ]Smem, Cmem, ACx MPYM[R] [T3 = ]Smem, [ACx,] ACy MPYMK[R] [T3 = ]Smem, K8, ACx MPYM[R][40] [T3 = ][uns(]Xmem[)], [uns(]Ymem[)], ACx MPYM[R][U] [T3 = ]Smem, Tx, ACx MPY::MAC: Multiply with Parallel Multiply and Accumulate MPY[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx :: MAC[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy >> #16

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-19

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Multiply with Parallel Store Accumulator Content to Memory ACy = rnd(Tx * Xmem), Ymem = HI(ACx << T2) [,T3 = Xmem] Multiply and Accumulate (MAC) ACy = rnd(ACy + (ACx * Tx)) ACy = rnd((ACy * Tx) + ACx) ACy = rnd(ACx + (Tx * K8)) ACy = rnd(ACx + (Tx * K16)) ACx = rnd(ACx + (Smem * coef(Cmem)))[, T3 = Smem] ACy = rnd(ACy + (Smem * ACx))[, T3 = Smem] ACy = rnd(ACx + (Tx * Smem))[, T3 = Smem] ACy = rnd(ACx + (Smem * K8))[, T3 = Smem ] ACy = M40(rnd(ACx + (uns(Xmem) * uns(Ymem))))[, T3 = Xmem] ACy = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(Ymem)))) [, T3 = Xmem] Multiply and Accumulate with Parallel Delay ACx = rnd(ACx + (Smem * coef(Cmem)))[, T3 = Smem], delay(Smem) Multiply and Accumulate with Parallel Load Accumulator from Memory ACx = rnd(ACx + (Tx * Xmem)), ACy = Ymem << #16 [,T3 = Xmem] Mnemonic Syntax MPYM::MOV: Multiply with Parallel Store Accumulator Content to Memory MPYM[R] [T3 = ]Xmem, Tx, ACy :: MOV HI(ACx << T2), Ymem MAC: Multiply and Accumulate MAC[R] ACx, Tx, ACy[, ACy] MAC[R] ACy, Tx, ACx, ACy MACK[R] Tx, K8, [ACx,] ACy MACK[R] Tx, K16, [ACx,] ACy Cross-Reference of Algebraic and Mnemonic Instruction Sets MACM[R] [T3 = ]Smem, Cmem, ACx MACM[R] [T3 = ]Smem, [ACx,] ACy MACM[R] [T3 = ]Smem, Tx, [ACx,] ACy MACMK[R] [T3 = ]Smem, K8, [ACx,] ACy MACM[R][40] [T3 = ][uns(]Xmem[)], [uns(]Ymem[)], [ACx,] ACy MACM[R][40] [T3 = ][uns(]Xmem[)], [uns(]Ymem[)], ACx >> #16 [, ACy] MACMZ: Multiply and Accumulate with Parallel Delay MACM[R]Z [T3 = ]Smem, Cmem, ACx

MACM::MOV: Multiply and Accumulate with Parallel Load Accumulator from Memory MACM[R] [T3 = ]Xmem, Tx, ACx :: MOV Ymem << #16, ACy

7-20 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Multiply and Accumulate with Parallel Multiply ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Multiply and Accumulate with Parallel Store Accumulator Content to Memory ACy = rnd(ACy + (Tx * Xmem)), Ymem = HI(ACx << T2) [,T3 = Xmem] Multiply and Subtract ACy = rnd(ACy (ACx * Tx)) ACx = rnd(ACx (Smem * coef(Cmem)))[, T3 = Smem] ACy = rnd(ACy (Smem * ACx))[, T3 = Smem] ACy = rnd(ACx (Tx * Smem))[, T3 = Smem] ACy = M40(rnd(ACx (uns(Xmem) * uns(Ymem))))[, T3 = Xmem] Multiply and Subtract with Parallel Load Accumulator from Memory ACx = rnd(ACx (Tx * Xmem)), ACy = Ymem << #16 [,T3 = Xmem] Multiply and Subtract with Parallel Multiply ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Mnemonic Syntax MAC::MPY: Multiply and Accumulate with Parallel Multiply MAC[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx :: MPY[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy MACM::MOV: Multiply and Accumulate with Parallel Store Accumulator Content to Memory MACM[R] [T3 = ]Xmem, Tx, ACy :: MOV HI(ACx << T2), Ymem MAS: Multiply and Subtract MAS[R] Tx, [ACx,] ACy MASM[R] [T3 = ]Smem, Cmem, ACx MASM[R] [T3 = ]Smem, [ACx,] ACy MASM[R] [T3 = ]Smem, Tx, [ACx,] ACy MASM[R][40] [T3 = ][uns(]Xmem[)], [uns(]Ymem[)], [ACx,] ACy MASM::MOV: Multiply and Subtract with Parallel Load Accumulator from Memory MASM[R] [T3 = ]Xmem, Tx, ACx :: MOV Ymem << #16, ACy MAS::MPY: Multiply and Subtract with Parallel Multiply MAS[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx :: MPY[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-21

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Multiply and Subtract with Parallel Multiply and Accumulate ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) Multiply and Subtract with Parallel Store Accumulator Content to Memory ACy = rnd(ACy (Tx * Xmem)), Ymem = HI(ACx << T2) [,T3 = Xmem] Negate Accumulator, Auxiliary, or Temporary Register Content dst = src No Operation nop nop_16 Parallel Modify Auxiliary Register Contents mar(Xmem), mar(Ymem), mar(coef(Cmem)) Parallel Multiplies ACx = M40(rnd(uns(Xmem) * uns(coef(Cmem)))), ACy = M40(rnd(uns(Ymem) * uns(coef(Cmem)))) Mnemonic Syntax MAS::MAC: Multiply and Subtract with Parallel Multiply and Accumulate MAS[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx :: MAC[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy MAS[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx :: MAC[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy >> #16 MASM::MOV: Multiply and Subtract with Parallel Store Accumulator Content to Memory MASM[R] [T3 = ]Xmem, Tx, ACy :: MOV HI(ACx << T2), Ymem Cross-Reference of Algebraic and Mnemonic Instruction Sets NEG: Negate Accumulator, Auxiliary, or Temporary Register Content NEG [src,] dst NOP: No Operation NOP NOP_16 AMAR: Parallel Modify Auxiliary Register Contents AMAR Xmem, Ymem, Cmem MPY::MPY: Parallel Multiplies MPY[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx :: MPY[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy

7-22 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Parallel Multiply and Accumulates ACx = M40(rnd(ACx + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M4(rnd(ACy + (uns(Ymem) * uns(coef(Cmem))))) ACx = M40(rnd((ACx >> #16) + (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd((ACy >> #16) + (uns(Ymem) * uns(coef(Cmem))))) Parallel Multiply and Subtracts ACx = M40(rnd(ACx (uns(Xmem) * uns(coef(Cmem))))), ACy = M40(rnd(ACy (uns(Ymem) * uns(coef(Cmem))))) Peripheral Port Register Access Qualifiers readport() writeport() Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers xdst = popboth() Pop Top of Stack dst1, dst2 = pop() dst = pop() dst, Smem = pop() ACx = dbl(pop()) Smem = pop() Mnemonic Syntax MAC::MAC: Parallel Multiply and Accumulates MAC[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx :: MAC[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy MAC[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx >> #16 :: MAC[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy MAC[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx >> #16 :: MAC[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy >> #16 MAS::MAS: Parallel Multiply and Subtracts MAS[R][40] [uns(]Xmem[)], [uns(]Cmem[)], ACx :: MAS[R][40] [uns(]Ymem[)], [uns(]Cmem[)], ACy port: Peripheral Port Register Access Qualifiers port(Smem) port(Smem) POPBOTH: Pop Accumulator or Extended Auxiliary Register Content from Stack Pointers POPBOTH xdst POP: Pop Top of Stack POP dst1, dst2 POP dst POP dst, Smem POP ACx POP Smem

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-23

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax dbl(Lmem) = pop() Push Accumulator or Extended Auxiliary Register Content to Stack Pointers pshboth(xsrc) Push to Top of Stack push(src1, src2) push(src) push(src, Smem) dbl(push(ACx)) push(Smem) push(dbl(Lmem)) Repeat Block of Instructions Unconditionally localrepeat{ } blockrepeat{ } Repeat Single Instruction Conditionally while (cond && (RPTC < k8)) repeat Repeat Single Instruction Unconditionally repeat(k8) repeat(k16) repeat(CSR) Mnemonic Syntax POP dbl(Lmem) PSHBOTH: Push Accumulator or Extended Auxiliary Register Content to Stack Pointers PSHBOTH xsrc PSH: Push to Top of Stack PSH src1, src2 PSH src PSH src, Smem Cross-Reference of Algebraic and Mnemonic Instruction Sets PSH ACx PSH Smem PSH dbl(Lmem) RPTB: Repeat Block of Instructions Unconditionally RPTBLOCAL pmad RPTB pmad RPTCC: Repeat Single Instruction Conditionally RPTCC k8, cond RPT: Repeat Single Instruction Unconditionally RPT k8 RPT k16 RPT CSR

7-24 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Repeat Single Instruction Unconditionally and Decrement CSR repeat(CSR), CSR = k4 Repeat Single Instruction Unconditionally and Increment CSR repeat(CSR), CSR += TAx repeat(CSR), CSR += k4 Return Conditionally if (cond) return Return Unconditionally return Return from Interrupt return_int Rotate Left Accumulator, Auxiliary, or Temporary Register Content dst = BitOut \\ src \\ BitIn Rotate Right Accumulator, Auxiliary, or Temporary Register Content dst = BitIn // src // BitOut Mnemonic Syntax RPTSUB: Repeat Single Instruction Unconditionally and Decrement CSR RPTSUB CSR, k4 RPTADD: Repeat Single Instruction Unconditionally and Increment CSR RPTADD CSR, TAx RPTADD CSR, k4 RETCC: Return Conditionally RETCC cond RET: Return Unconditionally RET RETI: Return from Interrupt RETI ROL: Rotate Left Accumulator, Auxiliary, or Temporary Register Content ROL BitOut, src, BitIn, dst ROR: Rotate Right Accumulator, Auxiliary, or Temporary Register Content ROR BitIn, src, BitOut, dst

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-25

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Round Accumulator Content ACy = rnd(ACx) Saturate Accumulator Content ACy = saturate(rnd(ACx)) Set Accumulator, Auxiliary, or Temporary Register Bit bit(src, Baddr) = #1 Set Memory Bit bit(Smem, src) = #1 Set Status Register Bit bit(STx, k4) = #1 Mnemonic Syntax ROUND: Round Accumulator Content ROUND [ACx,] ACy SAT: Saturate Accumulator Content SAT[R] [ACx,] ACy BSET: Set Accumulator, Auxiliary, or Temporary Register Bit BSET Baddr, src BSET: Set Memory Bit BSET src, Smem BSET: Set Status Register Bit BSET k4, STx_55 BSET fname Shift Accumulator Content Conditionally ACx = sftc(ACx, TCx) Shift Accumulator Content Logically ACy = ACx <<< Tx ACy = ACx <<< #SHIFTW SFTCC: Shift Accumulator Content Conditionally SFTCC ACx, TCx SFTL: Shift Accumulator Content Logically SFTL ACx, Tx[, ACy] SFTL ACx, #SHIFTW[, ACy] Cross-Reference of Algebraic and Mnemonic Instruction Sets

7-26 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Shift Accumulator, Auxiliary, or Temporary Register Content Logically dst = dst <<< #1 dst = dst >>> #1 Signed Shift of Accumulator Content ACy = ACx << Tx ACy = ACx << #SHIFTW ACy = ACx <<C Tx ACy = ACx <<C #SHIFTW Signed Shift of Accumulator, Auxiliary, or Temporary Register Content dst = dst >> #1 dst = dst << #1 Software Interrupt intr(k5) Software Reset reset Software Trap trap(k5) Mnemonic Syntax SFTL: Shift Accumulator, Auxiliary, or Temporary Register Content Logically SFTL dst, #1 SFTL dst, #1 SFTS: Signed Shift of Accumulator Content SFTS ACx, Tx[, ACy] SFTS ACx, #SHIFTW[, ACy] SFTSC ACx, Tx[, ACy] SFTSC ACx, #SHIFTW[, ACy] SFTS: Signed Shift of Accumulator, Auxiliary, or Temporary Register Content SFTS dst, #1 SFTS dst, #1 INTR: Software Interrupt INTR k5 RESET: Software Reset RESET TRAP: Software Trap TRAP k5

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-27

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Square ACy = rnd(ACx * ACx) ACx = rnd(Smem * Smem)[, T3 = Smem] Square and Accumulate ACy = rnd(ACy + (ACx * ACx)) ACy = rnd(ACx + (Smem * Smem))[, T3 = Smem] Square and Subtract ACy = rnd(ACy (ACx * ACx)) ACy = rnd(ACx (Smem * Smem))[, T3 = Smem] Square Distance sqdst(Xmem, Ymem, ACx, ACy) Store Accumulator Content to Memory Smem = HI(ACx) Smem = HI(rnd(ACx)) Smem = LO(ACx << Tx) Smem = HI(rnd(ACx << Tx)) Smem = LO(ACx << #SHIFTW) Smem = HI(ACx << #SHIFTW) Smem = HI(rnd(ACx << #SHIFTW)) Smem = HI(saturate(uns(rnd(ACx)))) Mnemonic Syntax SQR: Square SQR[R] [ACx,] ACy SQRM[R] [T3 = ]Smem, ACx SQA: Square and Accumulate SQA[R] [ACx,] ACy SQAM[R] [T3 = ]Smem, [ACx,] ACy SQS: Square and Subtract SQS[R] [ACx,] ACy SQSM[R] [T3 = ]Smem, [ACx,] ACy SQDST: Square Distance SQDST Xmem, Ymem, ACx, ACy MOV: Store Accumulator Content to Memory MOV HI(ACx), Smem MOV [rnd(]HI(ACx)[)], Smem MOV ACx << Tx, Smem MOV [rnd(]HI(ACx << Tx)[)], Smem MOV ACx << #SHIFTW, Smem MOV HI(ACx << #SHIFTW), Smem MOV [rnd(]HI(ACx << #SHIFTW)[)], Smem MOV [uns(] [rnd(]HI[(saturate](ACx)[)))], Smem Cross-Reference of Algebraic and Mnemonic Instruction Sets

7-28 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Smem = HI(saturate(uns(rnd(ACx << Tx)))) Smem = HI(saturate(uns(rnd(ACx << #SHIFTW)))) dbl(Lmem) = ACx dbl(Lmem) = saturate(uns(ACx)) HI(Lmem) = HI(ACx) >> #1, LO(Lmem) = LO(ACx) >> #1 Xmem = LO(ACx), Ymem = HI(ACx) Store Accumulator Pair Content to Memory Lmem = pair(HI(ACx)) Lmem = pair(LO(ACx)) Store Accumulator, Auxiliary, or Temporary Register Content to Memory Smem = src high_byte(Smem) = src low_byte(Smem) = src Store Auxiliary or Temporary Register Pair Content to Memory Lmem = pair(TAx) Store CPU Register Content to Memory Smem = BK03 Smem = BK47 Mnemonic Syntax MOV [uns(] [rnd(]HI[(saturate](ACx << Tx)[)))], Smem MOV [uns(] [rnd(]HI[(saturate](ACx << #SHIFTW)[)))], Smem MOV ACx, dbl(Lmem) MOV [uns(]saturate(ACx)[)], dbl(Lmem) MOV ACx >> #1, dual(Lmem) MOV ACx, Xmem, Ymem

MOV: Store Accumulator Pair Content to Memory MOV pair(HI(ACx)), dbl(Lmem) MOV pair(LO(ACx)), dbl(Lmem) MOV: Store Accumulator, Auxiliary, or Temporary Register Content to Memory MOV src, Smem MOV src, high_byte(Smem) MOV src, low_byte(Smem) MOV: Store Auxiliary or Temporary Register Pair Content to Memory MOV pair(TAx), dbl(Lmem) MOV: Store CPU Register Content to Memory MOV BK03, Smem MOV BK47, Smem

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-29

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Smem = BKC Smem = BSA01 Smem = BSA23 Smem = BSA45 Smem = BSA67 Smem = BSAC Smem = BRC0 Smem = BRC1 Smem = CDP Smem = CSR Smem = DP Smem = DPH Smem = PDP Smem = SP Smem = SSP Smem = TRN0 Smem = TRN1 dbl(Lmem) = RETA Store Extended Auxiliary Register Content to Memory dbl(Lmem) = XAsrc Subtract Conditionally subc(Smem, ACx, ACy) Mnemonic Syntax MOV BKC, Smem MOV BSA01, Smem MOV BSA23, Smem MOV BSA45, Smem MOV BSA67, Smem MOV BSAC, Smem MOV BRC0, Smem MOV BRC1, Smem MOV CDP, Smem Cross-Reference of Algebraic and Mnemonic Instruction Sets MOV CSR, Smem MOV DP, Smem MOV DPH, Smem MOV PDP, Smem MOV SP, Smem MOV SSP, Smem MOV TRN0, Smem MOV TRN1, Smem MOV RETA, dbl(Lmem) MOV: Store Extended Auxiliary Register Content to Memory MOV XAsrc, dbl(Lmem) SUBC: Subtract Conditionally SUBC Smem, [ACx,] ACy

7-30 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Subtraction dst = dst src dst = dst k4 dst = src K16 dst = src Smem dst = Smem src ACy = ACy (ACx << Tx) ACy = ACy (ACx << #SHIFTW) ACy = ACx (K16 << #16) ACy = ACx (K16 << #SHFT) ACy = ACx (Smem << Tx) ACy = ACx (Smem << #16) ACy = (Smem << #16) ACx ACy = ACx uns(Smem) BORROW ACy = ACx uns(Smem) ACy = ACx (uns(Smem) << #SHIFTW) ACy = ACx dbl(Lmem) ACy = dbl(Lmem) ACx ACx = (Xmem << #16) (Ymem << #16) Subtraction with Parallel Store Accumulator Content to Memory ACy = (Xmem << #16) ACx, Ymem = HI(ACy << T2) Mnemonic Syntax SUB: Subtraction SUB [src,] dst SUB k4, dst SUB K16, [src,] dst SUB Smem, [src,] dst SUB src, Smem, dst SUB ACx << Tx, ACy SUB ACx << #SHIFTW, ACy SUB K16 << #16, [ACx,] ACy SUB K16 << #SHFT, [ACx,] ACy SUB Smem << Tx, [ACx,] ACy SUB Smem << #16, [ACx,] ACy SUB ACx, Smem << #16, ACy SUB [uns(]Smem[)], BORROW, [ACx,] ACy SUB [uns(]Smem[)], [ACx,] ACy SUB [uns(]Smem[)] << #SHIFTW, [ACx,] ACy SUB dbl(Lmem), [ACx,] ACy SUB ACx, dbl(Lmem), ACy SUB Xmem, Ymem, ACx SUB::MOV: Subtraction with Parallel Store Accumulator Content to Memory SUB Xmem << #16, ACx, ACy :: MOV HI(ACy << T2), Ymem

SPRU375G Cross-Reference of Algebraic and Mnemonic Instruction Sets 7-31

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Swap Accumulator Content swap(ACx, ACy) Swap Accumulator Pair Content swap(pair(AC0), pair(AC2)) Swap Auxiliary Register Content swap(ARx, ARy) Swap Auxiliary Register Pair Content swap(pair(AR0), pair(AR2)) Swap Auxiliary and Temporary Register Content swap(ARx, Tx) Swap Auxiliary and Temporary Register Pair Content swap(pair(ARx), pair(Tx)) Swap Auxiliary and Temporary Register Pairs Content swap(block(AR4), block(T0)) Swap Temporary Register Content swap(Tx, Ty) Swap Temporary Register Pair Content swap(pair(T0), pair(T2)) Mnemonic Syntax SWAP: Swap Accumulator Content SWAP ACx, ACy SWAPP: Swap Accumulator Pair Content SWAPP AC0, AC2 SWAP: Swap Auxiliary Register Content SWAP ARx, ARy SWAPP: Swap Auxiliary Register Pair Content SWAPP AR0, AR2 SWAP: Swap Auxiliary and Temporary Register Content SWAP ARx, Tx SWAPP: Swap Auxiliary and Temporary Register Pair Content SWAPP ARx, Tx SWAP4: Swap Auxiliary and Temporary Register Pairs Content SWAP4 AR4, T0 SWAP: Swap Temporary Register Content SWAP Tx, Ty SWAPP: Swap Temporary Register Pair Content SWAPP T0, T2 Cross-Reference of Algebraic and Mnemonic Instruction Sets

7-32 Cross-Reference of Algebraic and Mnemonic Instruction Sets SPRU375G

Cross-Reference of Algebraic and Mnemonic Instruction Sets

Table 71. Cross-Reference of Algebraic and Mnemonic Instruction Sets (Continued)


Algebraic Syntax Test Accumulator, Auxiliary, or Temporary Register Bit TCx = bit(src, Baddr) Test Accumulator, Auxiliary, or Temporary Register Bit Pair bit(src, pair(Baddr)) Test Memory Bit TCx = bit(Smem, src) TCx = bit(Smem, k4) Test and Clear Memory Bit TCx = bit(Smem, k4), bit(Smem, k4) = #0 Test and Complement Memory Bit TCx = bit(Smem, k4), cbit(Smem, k4) Test and Set Memory Bit TCx = bit(Smem, k4), bit(Smem, k4) = #1 Mnemonic Syntax BTST: Test Accumulator, Auxiliary, or Temporary Register Bit BTST Baddr, src, TCx BTSTP: Test Accumulator, Auxiliary, or Temporary Register Bit Pair BTSTP Baddr, src BTST: Test Memory Bit BTST src, Smem, TCx BTST k4, Smem, TCx BTSTCLR: Test and Clear Memory Bit BTSTCLR k4, Smem, TCx

BTSTNOT: Test and Complement Memory Bit BTSTNOT k4, Smem, TCx

BTSTSET: Test and Set Memory Bit BTSTSET k4, Smem, TCx

Index

Index
A
abdst 5-2 absolute addressing modes 3-3 I/O absolute 3-3 k16 absolute 3-3 k23 absolute 3-3 Absolute Distance (abdst) 5-2 Absolute Value 5-4 Addition 5-7 Addition or Subtraction Conditionally (adsc) 5-31 Addition or Subtraction Conditionally with Shift (ads2c) 5-33 Addition with Absolute Value 5-27 Addition with Parallel Store Accumulator Content to Memory 5-29 Addition, Subtraction, or Move Accumulator Content Conditionally (adsc) 5-36 addressing modes absolute 3-3 direct 3-4 indirect 3-6 introduction 3-2 ads2c 5-33 adsc 5-31, 5-36 affect of status bits 1-9 algebraic instruction set cross-reference to mnemonic instruction set 7-1 AND 5-38 Antisymmetrical Finite Impulse Response Filter (firsn) 5-168 arithmetic absolute distance 5-2 absolute value 5-4 addition 5-7 addition or subtraction conditionally 5-31, 5-36 addition or subtraction conditionally with shift 5-33 addition with absolute value 5-27 compare memory with immediate value 5-126 compute exponent of accumulator content 5-131 compute mantissa and exponent of accumulator content 5-132 dual 16-bit addition and subtraction 5-140 dual 16-bit additions 5-135 dual 16-bit subtraction and addition 5-154 dual 16-bit subtractions 5-145 finite impulse response filter, antisymmetrical 5-168 finite impulse response filter, symmetrical 5-170 least mean square 5-173 multiply 5-255 multiply and accumulate 5-271 multiply and subtract 5-294 negation 5-313 round accumulator content 5-380 saturate accumulator content 5-382 square 5-419 square and accumulate 5-422 square and subtract 5-425 square distance 5-428 subtract conditionally 5-463 subtraction 5-465

B
bit field comparison bit field counting bit field expand bit field extract 5-47 5-134 5-166 5-167

Index-1

Index

bit manipulation bitwise AND memory with immediate value and compare to zero 5-47 clear accumulator, auxiliary, or temporary register bit 5-88 clear memory bit 5-89 clear status register bit 5-90 complement accumulator, auxiliary, or temporary register bit 5-128 complement accumulator, auxiliary, or temporary register content 5-129 complement memory bit 5-130 expand accumulator bit field 5-166 extract accumulator bit field 5-167 set accumulator, auxiliary, or temporary register bit 5-384 set memory bit 5-385 set status register bit 5-386 test accumulator, auxiliary, or temporary register bit 5-504 test accumulator, auxiliary, or temporary register bit pair 5-506 test and clear memory bit 5-511 test and complement memory bit 5-512 test and set memory bit 5-513 test memory bit 5-508 Bitwise AND 5-38 Bitwise AND Memory with Immediate Value and Compare to Zero 5-47 bitwise complement 5-129 Bitwise Exclusive OR (XOR) 5-57 Bitwise OR 5-48 blockrepeat 5-346 branch conditionally 5-66 on auxiliary register not zero 5-74 unconditionally 5-70 Branch Conditionally (if goto) 5-66 Branch on Auxiliary Register Not Zero (if goto) 5-74 Branch Unconditionally (goto) 5-70

C
call 5-83 conditionally 5-77 unconditionally 5-83 Call Conditionally (if call) 5-77 Index-2

Call Unconditionally (call) 5-83 cbit 5-128, 5-130 circular 5-87 circular addressing 3-20 Circular Addressing Qualifier (circular) 5-87 clear accumulator bit 5-88 auxiliary register bit 5-88 memory bit 5-89 status register bit 5-90 temporary register bit 5-88 Clear Accumulator Bit 5-88 Clear Auxiliary Register Bit 5-88 Clear Memory Bit 5-89 Clear Status Register Bit 5-90 Clear Temporary Register Bit 5-88 compare accumulator, auxiliary, or temporary register content 5-93 accumulator, auxiliary, or temporary register content maximum 5-105 accumulator, auxiliary, or temporary register content minimum 5-108 accumulator, auxiliary, or temporary register content with AND 5-95 accumulator, auxiliary, or temporary register content with OR 5-100 and branch 5-111 and select accumulator content maximum 5-114 and select accumulator content minimum 5-120 memory with immediate value 5-126 Compare Accumulator Content 5-93 Compare Accumulator Content Maximum (max) 5-105 Compare Accumulator Content Minimum (min) 5-108 Compare Accumulator Content with AND 5-95 Compare Accumulator Content with OR 5-100 Compare and Branch 5-111 compare and goto 5-111 Compare and Select Accumulator Content Maximum (max_diff) 5-114 Compare and Select Accumulator Content Minimum (min_diff) 5-120 Compare Auxiliary Register Content 5-93 Compare Auxiliary Register Content Maximum (max) 5-105

Index

Compare Auxiliary Register Content Minimum (min) 5-108 Compare Auxiliary Register Content with AND 5-95 Compare Auxiliary Register Content with OR 5-100 compare maximum 5-105 Compare Memory with Immediate Value 5-126 compare minimum 5-108 Compare Temporary Register Content 5-93 Compare Temporary Register Content Maximum (max) 5-105 Compare Temporary Register Content Minimum (min) 5-108 Compare Temporary Register Content with AND 5-95 Compare Temporary Register Content with OR 5-100 complement accumulator bit 5-128 accumulator content 5-129 auxiliary register bit 5-128 auxiliary register content 5-129 memory bit 5-130 temporary register bit 5-128 temporary register content 5-129 Complement Accumulator Bit (cbit) 5-128 Complement Accumulator Content 5-129 Complement Auxiliary Register Bit (cbit) 5-128 Complement Auxiliary Register Content 5-129 Complement Memory Bit (cbit) 5-130 Complement Temporary Register Bit (cbit) 5-128 Complement Temporary Register Content 5-129 Compute Exponent of Accumulator Content (exp) 5-131 Compute Mantissa and Exponent of Accumulator Content 5-132 cond field 1-7 conditional addition or subtraction 5-31 addition or subtraction with shift 5-33 addition, subtraction, or move accumulator content 5-36 branch 5-66

call 5-77 execute 5-159 repeat single instruction 5-357 return 5-370 shift 5-389 subtract 5-463 count 5-134 Count Accumulator Bits (count) 5-134 Cross-Reference to Algebraic and Mnemonic Instruction Sets 7-1

D
delay 5-212 direct addressing modes 3-4 DP direct 3-4 PDP direct 3-5 register-bit direct 3-5 SP direct 3-5 Dual 16-Bit Addition and Subtraction Dual 16-Bit Additions 5-135 dual 16-bit arithmetic addition and subtraction 5-140 additions 5-135 subtraction and addition 5-154 subtractions 5-145 Dual 16-Bit Subtraction and Addition Dual 16-Bit Subtractions 5-145

5-140

5-154

E
Execute Conditionally (if execute) 5-159 exp 5-131, 5-132 Expand Accumulator Bit Field (field_expand) 5-166 extended auxiliary register (XAR) load from memory 5-209 load with immediate value 5-210 modify content 5-238 move content 5-247 pop content from stack pointers 5-330 push content to stack pointers 5-338 store to memory 5-462 Extract Accumulator Bit Field (field_extract)

5-167

Index-3

Index

F
field_expand 5-166 field_extract 5-167 finite impulse response (FIR) filter antisymmetrical 5-168 symmetrical 5-170 firs 5-170 firsn 5-168

instruction set opcode abbreviations 6-16 symbols 6-16 instruction set opcodes 6-2 instruction set summary 4-1 instruction set terms, symbols, and abbreviations 1-2 interrupt 5-411 intr 5-411

G
goto 5-70

L
Least Mean Square (lms) 5-173 linear 5-175 Linear Addressing Qualifier (linear) 5-175 List of Algebraic Instruction Opcodes 6-1 lms 5-173 load accumulator from memory 5-176 accumulator from memory with parallel store accumulator content to memory 5-185 accumulator pair from memory 5-187 accumulator with immediate value 5-190 accumulator, auxiliary, or temporary register from memory 5-193 accumulator, auxiliary, or temporary register with immediate value 5-199 auxiliary or temporary register pair from memory 5-203 CPU register from memory 5-204 CPU register with immediate value 5-207 extended auxiliary register (XAR) from memory 5-209 extended auxiliary register (XAR) with immediate value 5-210 memory with immediate value 5-211 Load Accumulator from Memory 5-176, 5-193 Load Accumulator from Memory with Parallel Store Accumulator Content to Memory 5-185 Load Accumulator Pair from Memory 5-187 Load Accumulator with Immediate Value 5-190, 5-199 Load Auxiliary Register from Memory 5-193 Load Auxiliary Register Pair from Memory 5-203 Load Auxiliary Register with Immediate Value 5-199 Load CPU Register from Memory 5-204

I
idle 5-172 if call 5-77 if execute 5-159 if goto 5-66, 5-74 if return 5-370 indirect addressing modes 3-6 AR indirect 3-6 CDP indirect 3-16 coefficient indirect 3-18 dual AR indirect 3-14 initialize memory 5-211 instruction qualifier circular addressing 5-87 linear addressing 5-175 memory-mapped register access 5-213 instruction set abbreviations 1-2 affect of status bits 1-9 conditional fields 1-7 nonrepeatable instructions 1-20 notes 1-14 opcode symbols and abbreviations 6-16 opcodes 6-2 operators 1-6 rules 1-14 symbols 1-2 terms 1-2 instruction set conditional fields 1-7 instruction set notes and rules 1-14 Index-4

Index

Load CPU Register with Immediate Value 5-207 Load Extended Auxiliary Register (XAR) from Memory 5-209 Load Extended Auxiliary Register (XAR) with Immediate Value 5-210 Load Memory with Immediate Value 5-211 Load Temporary Register from Memory Load Temporary Register with Immediate Value 5-199 localrepeat 5-346 logical bitwise AND 5-38 bitwise OR 5-48 bitwise XOR 5-57 count accumulator bits 5-134 shift accumulator content logically 5-391 shift accumulator, auxiliary, or temporary register content logically 5-394 5-193 5-203 Load Temporary Register Pair from Memory

M
mant 5-132 mar 5-214, 5-225, 5-229, 5-233, 5-238, 5-316 max 5-105 5-114 max_diff

max_diff_dbl 5-114 memory bit clear 5-89 complement (not) 5-130 set 5-385 test 5-508 test and clear 5-511 test and complement 5-512 test and set 5-513 Memory Delay (delay) 5-212 Memory-Mapped Register Access Qualifier (mmap) 5-213 min 5-108 min_diff 5-120 min_diff_dbl 5-120 mmap 5-213 mnemonic instruction set cross-reference to algebraic instruction set 7-1

modify auxiliary or temporary register content 5-225 auxiliary or temporary register content by addition 5-229 auxiliary or temporary register content by subtraction 5-233 auxiliary register content 5-214 auxiliary register content with parallel multiply 5-216 auxiliary register content with parallel multiply and accumulate 5-218 auxiliary register content with parallel multiply and subtract 5-223 data stack pointer 5-237 extended auxiliary register (XAR) content 5-238 Modify Auxiliary Register Content (mar) 5-214, 5-225 Modify Auxiliary Register Content by Addition (mar) 5-229 Modify Auxiliary Register Content by Subtraction (mar) 5-233 Modify Auxiliary Register Content with Parallel Multiply (mar) 5-216 Modify Auxiliary Register Content with Parallel Multiply and Accumulate (mar) 5-218 Modify Auxiliary Register Content with Parallel Multiply and Subtract (mar) 5-223 Modify Data Stack Pointer 5-237 Modify Extended Auxiliary Register Content (mar) 5-238 Modify Temporary Register Content (mar) 5-225 Modify Temporary Register Content by Addition (mar) 5-229 Modify Temporary Register Content by Subtraction (mar) 5-233 move accumulator content to auxiliary or temporary register 5-239 accumulator, auxiliary, or temporary register content 5-240 auxiliary or temporary register content to accumulator 5-242 auxiliary or temporary register content to CPU register 5-243 CPU register content to auxiliary or temporary register 5-245 extended auxiliary register content 5-247 memory delay 5-212 memory to memory 5-248

Index-5

Index

move (continued) pop accumulator or extended auxiliary register content from stack pointers 5-330 pop top of stack 5-331 push accumulator or extended auxiliary register content to stack pointers 5-338 push to top of stack 5-339 swap accumulator content 5-492 swap accumulator pair content 5-493 swap auxiliary and temporary register content 5-496 swap auxiliary and temporary register pair content 5-498 swap auxiliary and temporary register pairs content 5-500 swap auxiliary register content 5-494 swap auxiliary register pair content 5-495 swap temporary register content 5-502 swap temporary register pair content 5-503 Move Accumulator Content 5-240 Move Accumulator Content to Auxiliary Register 5-239 Move Accumulator Content to Temporary Register 5-239 Move Auxiliary Register Content 5-240 Move Auxiliary Register Content to Accumulator 5-242 Move Auxiliary Register Content to CPU Register 5-243 Move CPU Register Content to Auxiliary Register 5-245 Move CPU Register Content to Temporary Register 5-245 Move Extended Auxiliary Register (XAR) Content 5-247 Move Memory to Memory 5-248 5-240 Move Temporary Register Content

Multiply and Accumulate with Parallel Multiply 5-290 Multiply and Accumulate with Parallel Store Accumulator Content to Memory 5-292 Multiply and Subtract 5-294 Multiply and Subtract with Parallel Load Accumulator from Memory 5-302 Multiply and Subtract with Parallel Multiply 5-304 Multiply and Subtract with Parallel Multiply and Accumulate 5-306 Multiply and Subtract with Parallel Store Accumulator Content to Memory 5-311 Multiply with Parallel Multiply and Accumulate 5-267 Multiply with Parallel Store Accumulator Content to Memory 5-269

N
Negate Accumulator Content 5-313 5-313 5-313 Negate Auxiliary Register Content Negate Temporary Register Content negation accumulator content 5-313 auxiliary register content 5-313 temporary register content 5-313 No Operation (nop) nop 5-315 5-315 1-20 nonrepeatable instructions

O
operand qualifier OR 5-48 5-328

Move Temporary Register Content to Accumulator 5-242 Move Temporary Register Content to CPU Register 5-243 Multiply 5-255 Multiply and Accumulate (MAC) 5-271 5-286 Multiply and Accumulate with Parallel Delay Multiply and Accumulate with Parallel Load Accumulator from Memory 5-288 Index-6

P
Parallel Modify Auxiliary Register Contents (mar) 5-316 Parallel Multiplies 5-317 5-319 5-326 Parallel Multiply and Accumulates Parallel Multiply and Subtracts

Index

parallel operations addition with parallel store accumulator content to memory 5-29 load accumulator from memory with parallel store accumulator content to memory 5-185 modify auxiliary register content with parallel multiply 5-216 modify auxiliary register content with parallel multiply and accumulate 5-218 modify auxiliary register content with parallel multiply and subtract 5-223 modify auxiliary register contents 5-316 multiplies 5-317 multiply and accumulate with parallel delay 5-286 multiply and accumulate with parallel load accumulator from memory 5-288 multiply and accumulate with parallel multiply 5-290 multiply and accumulate with parallel store accumulator content to memory 5-292 multiply and accumulates 5-319 multiply and subtract with parallel load accumulator from memory 5-302 multiply and subtract with parallel multiply 5-304 multiply and subtract with parallel multiply and accumulate 5-306 multiply and subtract with parallel store accumulator content to memory 5-311 multiply and subtracts 5-326 multiply with parallel multiply and accumulate 5-267 multiply with parallel store accumulator content to memory 5-269 subtraction with parallel store accumulator content to memory 5-490 parallelism basics parallelism features 2-3 2-2 5-328

program control branch conditionally 5-66 branch on auxiliary register not zero 5-74 branch unconditionally 5-70 call conditionally 5-77 call unconditionally 5-83 compare and branch 5-111 execute conditionally 5-159 idle 5-172 no operation 5-315 repeat block of instructions unconditionally 5-346 repeat single instruction conditionally 5-357 repeat single instruction unconditionally 5-360 repeat single instruction unconditionally and decrement CSR 5-365 repeat single instruction unconditionally and increment CSR 5-367 return conditionally 5-370 return from interrupt 5-374 return unconditionally 5-372 software interrupt 5-411 software reset 5-413 software trap 5-417 pshboth 5-338 push 5-339 Push Accumulator Content to Stack Pointers (pshboth) 5-338 Push Extended Auxiliary Register (XAR) Content to Stack Pointers (pshboth) 5-338 Push to Top of Stack (push) 5-339

R
readport 5-328 register bit clear 5-88 complement (not) 5-128 set 5-384 test 5-504 test bit pair 5-506 repeat 5-360, 5-365, 5-367 Repeat Block of Instructions Unconditionally 5-346 Repeat Single Instruction Conditionally (while repeat) 5-357 Repeat Single Instruction Unconditionally (repeat) 5-360 Repeat Single Instruction Unconditionally and Decrement CSR (repeat) 5-365

Peripheral Port Register Access Qualifiers pop 5-331

Pop Accumulator Content from Stack Pointers (popboth) 5-330 Pop Extended Auxiliary Register (XAR) Content from Stack Pointers (popboth) 5-330 Pop Top of Stack (pop) popboth 5-330 5-331

Index-7

Index

Repeat Single Instruction Unconditionally and Increment CSR (repeat) 5-367 reset 5-413 resource conflicts in a parallel pair 2-4 return 5-372 Return Conditionally (if return) 5-370 Return from Interrupt (return_int) 5-374 Return Unconditionally (return) 5-372 return_int 5-374 rnd 5-380 Rotate Left Accumulator Content 5-376 Rotate Left Auxiliary Register Content 5-376 Rotate Left Temporary Register Content 5-376 Rotate Right Accumulator Content 5-378 Rotate Right Auxiliary Register Content 5-378 Rotate Right Temporary Register Content 5-378 Round Accumulator Content (rnd) 5-380 rounding 5-380

S
saturate 5-382 Saturate Accumulator Content (saturate) 5-382 set accumulator bit 5-384 auxiliary register bit 5-384 memory bit 5-385 status register bit 5-386 temporary register bit 5-384 Set Accumulator Bit 5-384 Set Auxiliary Register Bit 5-384 Set Memory Bit 5-385 Set Status Register Bit 5-386 Set Temporary Register Bit 5-384 sftc 5-389 Shift Accumulator Content Conditionally (sftc) 5-389 Shift Accumulator Content Logically 5-391, 5-394 Shift Auxiliary Register Content Logically 5-394 shift conditionally 5-389 shift logically 5-391, 5-394 Shift Temporary Register Content Logically 5-394 Signed Shift of Accumulator Content 5-397, 5-406 Signed Shift of Auxiliary Register Content 5-406 Signed Shift of Temporary Register Content 5-406 Index-8

soft-dual parallelism 2-5 Software Interrupt (intr) 5-411 Software Reset (reset) 5-413 Software Trap (trap) 5-417 sqdst 5-428 Square 5-419 Square and Accumulate 5-422 Square and Subtract 5-425 Square Distance (sqdst) 5-428 status register bit clear 5-90 set 5-386 store accumulator content to memory 5-430 accumulator pair content to memory 5-450 accumulator, auxiliary, or temporary register content to memory 5-453 auxiliary or temporary register pair content to memory 5-457 CPU register content to memory 5-458 extended auxiliary register (XAR) to memory 5-462 Store Accumulator Content to Memory 5-430, 5-453 Store Accumulator Pair Content to Memory 5-450 Store Auxiliary Register Content to Memory 5-453 Store Auxiliary Register Pair Content to Memory 5-457 Store CPU Register Content to Memory 5-458 Store Extended Auxiliary Register (XAR) to Memory 5-462 Store Temporary Register Content to Memory 5-453 Store Temporary Register Pair Content to Memory 5-457 subc 5-463 Subtract Conditionally 5-463 Subtraction 5-465 Subtraction with Parallel Store Accumulator Content to Memory 5-490 swap 5-492, 5-493, 5-494, 5-495, 5-496, 5-498, 5-500, 5-502, 5-503 Swap Accumulator Content (swap) 5-492 Swap Accumulator Pair Content (swap) 5-493 Swap Auxiliary and Temporary Register Content (swap) 5-496 Swap Auxiliary and Temporary Register Pair Content (swap) 5-498

Index

Swap Auxiliary and Temporary Register Pairs Content (swap) 5-500 Swap Auxiliary Register Content (swap) 5-494 Swap Auxiliary Register Pair Content (swap) 5-495 Swap Temporary Register Content (swap) 5-502 Swap Temporary Register Pair Content (swap) 5-503 Symmetrical Finite Impulse Response Filter (firs) 5-170

Test Memory Bit 5-508 Test Temporary Register Bit 5-504 Test Temporary Register Bit Pair 5-506 trap 5-417

U
unconditional branch 5-70 call 5-83 repeat block of instructions 5-346 repeat single instruction 5-360 repeat single instruction and decrement CSR 5-365 repeat single instruction and increment CSR 5-367 return 5-372 return from interrupt 5-374

T
test accumulator bit 5-504 accumulator bit pair 5-506 auxiliary register bit 5-504 auxiliary register bit pair 5-506 memory bit 5-508 temporary register bit 5-504 temporary register bit pair 5-506 Test Accumulator Bit 5-504 Test Accumulator Bit Pair 5-506 Test and Clear Memory Bit 5-511 Test and Complement Memory Bit 5-512 Test and Set Memory Bit 5-513 Test Auxiliary Register Bit 5-504 Test Auxiliary Register Bit Pair 5-506

W
while repeat 5-357 writeport 5-328

X
XOR 5-57

Index-9

You might also like