[Indigo]<Dragon>Documentation>PrincOps>SingleProcOp.tioga!5

Bland, February 20, 1986 3:58:49 pm PST

/ivy/bland/doc/SingleProcOp.tioga

DRAFT

SINGLE PROCESSOR OPERATION - VERSION 3

FOR INTERNAL XEROX USE ONLY

2.0 Single Processor Operation

A Dragon Processor consists of an Instruction Fetch Unit (IFU) and an Execution Unit (EU). The IFU and EU form a single logical entity and constitute the fixed-point execution engine of a Dragon Processor. The IFU reads and decodes a stream of variable-length machine instructions. It then directs the EU to perform the arithmetic, logical, shifting, fetching, and storing operations required. The IFU and EU make use of a pipeline which is 4 machine cycles long. As an instruction progresses through the pipeline, less and less work is done in the IFU and more is performed in the EU.

Each Dragon Processor contains 128 registers for use in local frames, 12 registers for constants, and 16 auxiliary registers. The following figure shows the logical organization of a Dragon Processor's registers.

IFU/EU - A Logical Model

[Artwork node; type 'ArtworkInterpress on' to command tool]

The names of the registers come from the enumerated type, ProcessorRegister, which specifies logical names for physical registers. Here is a Cedar definition for ProcessorRegister:

ProcessorRegister: TYPE = MACHINE DEPENDENT {

Stack (0) the EU Stack (128 registers)

Spare (128) not used; possibly non-existent register

ToKBus (129) send result on K bus to IFU

MAR (130) MemoryAddressRegister

Field (131) Field register

Constants (132) Base of EU constant register (12 regs)

AuxRegs (144) Base of EU aux registers (16 regs)

[160..239] don't correspond to any existing registers

YoungestL (240) youngest L in IFU stack

YoungestPC (241) youngest PC in IFU stack

EldestL (242) eldest L in IFU stack

EldestPC (243) eldest PC in IFU stack (rd removes, wt adds)

Status (244) IFU status

SLimit (245) stack limit register

[246..255] don't correspond to any existing registers

};

Some of the elements enumerated in ProcessorRegister are accessed from the IFU; some are accessed from the EU. The following sections on the IFU and EU provide explanations of these elements, as appropriate.
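As an illustration of the numbering above, here is a minimal Python sketch that maps a register number in [0..255] to its logical name. The class and function names (ProcessorRegister as a Python enum, describe) are hypothetical and chosen only for this example; the numeric values are taken from the Cedar definition.

```python
from enum import IntEnum


class ProcessorRegister(IntEnum):
    """Named single registers and region bases from the Cedar definition."""
    STACK = 0          # base of the EU Stack (128 registers)
    SPARE = 128
    TO_K_BUS = 129
    MAR = 130
    FIELD = 131
    CONSTANTS = 132    # base of the 12 constant registers
    AUX_REGS = 144     # base of the 16 auxiliary registers
    YOUNGEST_L = 240
    YOUNGEST_PC = 241
    ELDEST_L = 242
    ELDEST_PC = 243
    STATUS = 244
    S_LIMIT = 245


def describe(reg: int) -> str:
    """Return a readable name for register number reg (hypothetical helper)."""
    if not 0 <= reg <= 255:
        raise ValueError("register number must be in [0..255]")
    if reg < 128:
        return f"Stack[{reg}]"
    if 132 <= reg < 144:
        return f"Constants[{reg - 132}]"
    if 144 <= reg < 160:
        return f"AuxRegs[{reg - 144}]"
    try:
        return ProcessorRegister(reg).name
    except ValueError:
        # [160..239] and [246..255] do not correspond to existing registers.
        return "unassigned"
```

For example, describe(143) names the last constant register and describe(200) reports an unassigned number.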

2.1 Instruction Fetch Unit

The IFU contains a small stack of the most recent procedure contexts. A procedure context for the IFU includes a program counter (PC) and an index (L) into the EU stack which is the base register of the local variables.

The IFU has nine accessible registers which are defined below. These registers may be read by the LIP (Load from Internal Processor register) instruction and written by the SIP (Store to Internal Processor register) instruction.

PC (Program Counter)

PC is the program counter (a byte address). Instruction space is limited to 2^32 bytes (roughly 4 gigabytes), even though the full Dragon address space is 2^32 words.

S (Stack pointer)

S is a 7-bit index into Stack.

SLimit (Stack Limit)

SLimit gives the limit for S. A stack overflow occurs when an instruction modifies S in such a way that the new value of S is in the range [SLimit..SLimit+16).

L (Local frame base)

L is a 7-bit index into Stack which serves as the base register for the current local frame. The first 16 registers at or above L are easily addressable through the instruction set.

EldestL

EldestL contains L of the oldest procedure context in the IFU stack.

EldestPC

EldestPC contains the PC of the oldest procedure context in the IFU stack. EldestPC is read by the LIP (Load from Internal Processor register) instruction. A read does the following:

Stack[S+1] ← IFUStack[Eldest].PCBits; S ← S + 1;

Eldest ← Eldest + 1;

EldestPC is written by the SIP (Store to Internal Processor register) instruction:

IFUStack[Eldest - 1].PCBits ← Stack[S]; S ← S - 1;

Eldest ← Eldest - 1;

If traps are enabled and there are already 11 entries in the IFU Stack, any instruction which would cause another entry will instead generate an IFU overflow trap. Upon entry into the overflow routine, the PC and L of the offending instruction are recorded as the last values in the IFU Stack.
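The read and write behaviour of EldestPC can be pictured with a small Python model. This is only a sketch under stated assumptions: the IFU stack is modelled as a deque of [pc, l] entries with the eldest at the left, the class and method names are hypothetical, and the L of a newly written entry is left unset because only the PC transfer is described here.

```python
from collections import deque

EU_STACK_SIZE = 128


class IFUStackModel:
    """Toy model: a deque of [pc, l] contexts, eldest at the left."""

    def __init__(self):
        self.contexts = deque()
        self.eu_stack = [0] * EU_STACK_SIZE
        self.s = 0

    def read_eldest_pc(self):
        # Stack[S+1] <- IFUStack[Eldest].PCBits; S <- S + 1; Eldest <- Eldest + 1
        pc, _l = self.contexts.popleft()
        self.s = (self.s + 1) % EU_STACK_SIZE
        self.eu_stack[self.s] = pc

    def write_eldest_pc(self):
        # IFUStack[Eldest - 1].PCBits <- Stack[S]; S <- S - 1; Eldest <- Eldest - 1
        pc = self.eu_stack[self.s]
        self.s = (self.s - 1) % EU_STACK_SIZE
        # The L of the new eldest entry is not set by this operation.
        self.contexts.appendleft([pc, None])
```

In this model a read removes the eldest context and pushes its PC onto the EU stack, and a write pops the EU stack and adds a new eldest context, matching the "rd removes, wt adds" note in the register list.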

YoungestL

YoungestL contains the L of the most recent procedure context in the IFU stack.

YoungestPC

YoungestPC contains the PC of the most recent procedure context in the IFU stack. This is useful in trap routines that need to examine code, or in cases where the trap routine wants to skip the instruction that caused the trap.

Status

Status logically contains only three bits of information: mode (user or kernel), traps (enabled or disabled), and rescheduling (pending or not pending). To facilitate partial changes to the status word during write (i.e. SIP) operations, each status bit is paired with a control bit that selects either the old or the new value as the result of the write. This control bit is called the keep bit because, when it is true, the old value of the field is kept. When the status word is read into Stack[S+1] (i.e. by LIP), all of its keep bits are set to false, so that a saved status word can later be written back with SIP to restore the saved state.

Status: MACHINE DEPENDENT RECORD [
reserved: 26 Bits ← false, -- reserved bits are not currently used and must be set to 0
userModeKeep: Bool ← false, -- TRUE => when writing, keep old value
userMode: Bool ← false, -- TRUE => user, FALSE => kernel
trapsEnabledKeep: Bool ← false, -- TRUE => when writing, keep old value
trapsEnabled: Bool ← false, -- TRUE => traps enabled
rescheduleKeep: Bool ← false, -- TRUE => when writing, keep old value
reschedule: Bool ← false, -- TRUE => reschedule pending
];
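The keep-bit mechanism can be sketched in Python as follows. The bit positions are an assumption: the record is taken to be packed in declaration order from the most significant bit, so that reschedule is bit 0 (least significant) and each keep bit sits immediately above its value bit. The function names are hypothetical.

```python
# Assumed positions of the value bits; each keep bit is at (value bit + 1).
VALUE_BITS = {"userMode": 4, "trapsEnabled": 2, "reschedule": 0}


def write_status(old: int, new: int) -> int:
    """Result of writing the word `new` to a Status register holding `old`."""
    result = 0
    for bit in VALUE_BITS.values():
        keep = (new >> (bit + 1)) & 1
        source = old if keep else new   # keep bit true: the old value survives
        result |= source & (1 << bit)
    return result


def read_status(status: int) -> int:
    """Value delivered by a read: value bits only, all keep bits false."""
    mask = sum(1 << bit for bit in VALUE_BITS.values())
    return status & mask
```

For example, with old = 0b010100 (user mode, traps enabled), writing 0b101001 keeps userMode and trapsEnabled and sets reschedule, giving 0b010101. Because a read clears the keep bits, writing back a previously read word replaces all three fields.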

Note: Arithmetic involving S and L is always performed modulo the size of the EU stack, without detection of underflow or overflow; it produces values in the range [0..127]. If traps are enabled and the stack pointer S is in the range [SLimit..SLimit + 16) at the end of any instruction, an EU stack overflow trap occurs.
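A minimal Python sketch of the modulo arithmetic and the overflow test described in this note. It assumes that the range [SLimit..SLimit + 16) is itself interpreted modulo the size of the EU stack; the function names are hypothetical.

```python
EU_STACK_SIZE = 128


def add_mod_stack(index: int, delta: int) -> int:
    """S or L arithmetic: wraps silently, always yields a value in [0..127]."""
    return (index + delta) % EU_STACK_SIZE


def stack_overflow(s: int, s_limit: int) -> bool:
    """True when S lies in [SLimit..SLimit + 16), taken modulo the stack size."""
    return (s - s_limit) % EU_STACK_SIZE < 16
```

With SLimit = 120, the values 120 through 127 and 0 through 7 all count as overflow in this model, because the 16-register window wraps past the top of the stack.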

2.2 The Execution Unit

The Execution Unit (EU) contains a 32-bit Arithmetic Logic Unit (ALU) and a Field Unit (FU) for shifting, masking and inserting fields. The EU contains the address and data pathway to its data cache (or caches) but does not control these caches. It also contains several multiplexers to select operands and implement pipeline short-circuits.

The Execution Unit contains a bank of registers that are 32 bits wide. These registers contain the most recent elements of the data stack for a process. There are also registers that contain constants, as well as special purpose quantities. This architecture permits most elementary operations involving local variables to be performed in 1 EU cycle. However, this architecture requires special attention be paid to migrating the contents of a process stack between registers and memory.

The EU has the following registers:

Stack

Stack indicates the 128 registers used for local variables in recent local frames.

Locals

Locals indicates the 16 local registers in Stack used for the current local frame. The L register in the IFU points at the base of Locals. When the local frame has fewer than 16 registers, the excess locals are synonymous with lower positions on the Stack.

AuxRegs

AuxRegs indicates the 16 auxiliary registers. Most of these registers will be used for the runtime support of higher-level languages. It is illegal to write into the first 8 auxiliary registers when in User mode.

Constants

Constants indicates the 12 registers normally used to hold constants. Although these registers are not really constant as far as the hardware is concerned, they are used to hold constants for runtime environments. They are more general than the AuxRegs in that they can be used in more addressing modes. It is illegal to write any constant register in User mode.

Field (Field unit register)

Field indicates a special register which can be loaded by the FSDB instruction; the value in this register is used by the RFU (Register Field Unit) instruction to control the Field Unit.

MAR (Memory Address Register)

The MAR register is loaded with the memory address whenever an EU memory reference is delayed by the Wait signal from the cache. In the event of a page or write protect fault, MAR must be read by the fault routine before it issues any EU memory references.

ToKBus

ToKBus is loaded with the value being sent from the EU back to the IFU on SFC, SFCI, SJ, and SIP instructions. Writing this register with SIP serves no useful purpose and might interfere with normal use of the K bus.

Carry

A 1-bit register, discussed in Section 2.2.1.1, Arithmetic Operations, below.

2.2.1 Arithmetic and Logical Unit (ALU)

2.2.1.1 Arithmetic Operations

The Dragon Processor provides efficient instruction-level support for 32-bit 2's-complement integer arithmetic. N-precision 2's-complement arithmetic is also supported but needs a little instruction-level support. In contrast, 32-bit cardinal arithmetic and 1's-complement arithmetic are only incidentally supported.

Arithmetic operands are treated in one of three ways: as signed numbers (or 2's-complement integers) in the range [-2^31..2^31); as unsigned numbers (or cardinals) in the range [0..2^32); or as Lisp integers in the range [-2^29..2^29), where the top three bits must be either all 0's or all 1's.

These different interpretations of the operands require different handling of the Carry and overflow conditions. Accordingly, five distinct operations are defined for addition/subtraction operations:

Unsigned

On addition, the 1-bit Carry supplies the adder's carry-in and is loaded from the adder's carry-out; on subtraction, the complement of the Carry bit supplies the adder's carry-in, and Carry is loaded from the complement of the adder's carry-out. No traps are taken. The only instructions using this kind of arithmetic are RUADD and RUSUB, and executing one of these operations is the only way to load Carry with the value 1. Unsigned (= cardinal) arithmetic is used for the low-order terms of multiple-precision arithmetic.

Signed

On addition, the Carry bit supplies the carry-in, and on subtraction it supplies the complement of carry-in, just as in Unsigned arithmetic. Carry is always loaded with 0 at the end of the instruction. Overflow, which causes a Trap, is defined to occur when the numerical result is not in the range [-2^31..2^31), or, equivalently, when the carry out of bit 0 is unequal to the carry out of bit 1. The integer overflow trap, like other traps, occurs before any machine state has been modified, so no information is lost when it is taken. Many instructions use this kind of arithmetic.

Lisp

The Carry is not used as an input, and is always set to 0. If either of the operands or the result is not in the range [-2^29..2^29) (top three bits being all 0s or all 1s), a Lisp NaN (Not a Number) trap is taken. The trap is handled by software. It has no side effects, so no information is lost when it is taken.

Small Integer

Integers in the range [0..127]. The Carry bit is neither used nor set. No overflow checking is performed, and no traps can result. The small integer interpretation is used exclusively for operations involving S or L that are performed modulo the size of the EU stack.

Vanilla

The 1-bit Carry flip-flop is not used as an input and is not modified. No traps are taken. The only instructions using this kind of arithmetic are RVADD and RVSUB, which are intended for situations where overflow may occur but does not represent an error.
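The Carry and trap rules above can be summarized in a short Python sketch operating on 32-bit words held as non-negative integers. This is an illustrative model under stated assumptions, not a description of the hardware: the function names and the Trap exception are hypothetical, and each function returns the result word together with the new Carry where Carry is affected.

```python
MASK32 = 0xFFFFFFFF


class Trap(Exception):
    """Stands in for a hardware trap; no state is modified when it is raised."""


def to_signed(word: int) -> int:
    return word - (1 << 32) if word & 0x80000000 else word


def unsigned_add(a: int, b: int, carry: int):
    total = a + b + carry                       # Carry is the carry-in
    return total & MASK32, total >> 32          # Carry <- carry-out


def unsigned_sub(a: int, b: int, carry: int):
    total = a + (~b & MASK32) + (carry ^ 1)     # A + ~B + ~Carry
    return total & MASK32, (total >> 32) ^ 1    # Carry <- ~carry-out


def signed_add(a: int, b: int, carry: int):
    exact = to_signed(a) + to_signed(b) + carry
    if not -(1 << 31) <= exact < (1 << 31):
        raise Trap("integer overflow")
    return exact & MASK32, 0                    # Carry is always cleared


def is_lisp_int(word: int) -> bool:
    return (word >> 29) in (0, 7)               # top three bits all 0s or all 1s


def lisp_add(a: int, b: int):
    result = (a + b) & MASK32                   # Carry is not an input
    if not (is_lisp_int(a) and is_lisp_int(b) and is_lisp_int(result)):
        raise Trap("Lisp NaN")
    return result, 0


def vanilla_add(a: int, b: int) -> int:
    return (a + b) & MASK32                     # Carry untouched, never traps
```

As a usage example, a two-word addition can use unsigned_add for the low-order words and signed_add for the high-order words: unsigned_add(0xFFFFFFFF, 1, 0) gives (0, 1), and feeding that Carry into signed_add(0, 0, 1) gives (1, 0).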

Instructions facilitating multiply and divide do not exist but will be provided later by an Arithmetic Unit (AU) that will be part of every Dragon processor. The AU will also support both 32-bit and 64-bit IEEE standard floating point arithmetic. It will be controlled from the P bus using IO instructions. In all other respects the AU is not presently defined.

The Carry bit may only be addressed indirectly. To read the Carry bit into Rc execute a RUADD instruction: Stack[S+1] ← Constant[0] + Constant[0]; S ← S + 1. This will yield a 1 on the top of Stack if the Carry was set to 1, and a 0 if it was not. To set the Carry bit, load the saved Carry bit onto the Stack and execute a RUSUB instruction: Stack[S] ← Constant[0] - Stack[S]; S ← S - 1. This instruction will set the Carry bit in the EU and a word will be discarded from Stack.
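The read and restore sequence can be traced with a small Python model. It assumes that Constant[0] holds the value 0 and models Stack as a Python list; the class and method names are hypothetical.

```python
MASK32 = 0xFFFFFFFF


class CarryModel:
    """Toy EU state: the Carry bit, a stack, and Constant[0] (assumed to be 0)."""

    def __init__(self, carry: int = 0):
        self.carry = carry
        self.stack = []
        self.constant0 = 0

    def read_carry(self):
        # RUADD: Stack[S+1] <- Constant[0] + Constant[0]; S <- S + 1
        total = self.constant0 + self.constant0 + self.carry
        self.stack.append(total & MASK32)
        self.carry = total >> 32        # carry-out is 0 here, so Carry is cleared

    def restore_carry(self):
        # RUSUB: Stack[S] <- Constant[0] - Stack[S]; S <- S - 1
        saved = self.stack.pop()        # the result word is discarded with the pop
        total = self.constant0 + (~saved & MASK32) + (self.carry ^ 1)
        self.carry = (total >> 32) ^ 1  # Carry <- complement of carry-out
```

In this model the read leaves Carry at 0, and the restore step reproduces the saved value when Carry is 0 on entry, which is the state the read leaves behind.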

The notation A-B-Carry is an abbreviation for A+~B+~Carry, where ~B is the 32-bit complement of B, and ~Carry is the 1-bit complement of Carry. The notation A-B, where A and B are 32-bit quantities, denotes the 3-way, signed addition of A+~B+1. It is worth noting the clever trick of complementing the value in Carry for subtraction, which allows the signed subtraction instructions also to be used for the high-order subtraction of two n-precision numbers.

Lisp arithmetic is intended for an implementation in which pointers to storage and numbers occur within the same 32-bit space. For example, it is tentatively proposed that the 32-bit address space be divided according to the high-order 3 bits of an address as follows:

7 Negative Lisp integers
6 Negative floating point numbers
5 Negative floating point numbers
4 Cons storage
3 Ref storage
2 Positive floating point numbers
1 Positive floating point numbers
0 Positive Lisp integers and Kernel

With this arrangement, arbitrary-precision integers can be implemented as follows: An integer in the range [-2^29..2^29) is encoded as a Lisp integer; integers outside this range are stored in a vector in Ref space where the first word of an entry gives the number of 32-bit words in the integer, and words 1 to n are the n-precision integer. Under the assumption that integers in the range [-2^29..2^29) are vastly more common than those outside this range, addition and subtraction are compiled, respectively, into Lisp add or subtract instructions, which will trap and be interpreted by the NaN trap software in the uncommon case. (Arbitrary-precision floating point numbers embedded in pointers like the Lisp integers have also been discussed, as suggested in this table.)
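The proposed division of the 32-bit space and the two integer encodings can be sketched in Python. Several details are assumptions made for illustration only: the function names are hypothetical, the vector is shown as a plain list rather than as storage in Ref space, and the words of a large integer are placed least significant first, which the text does not specify.

```python
MASK32 = 0xFFFFFFFF

REGIONS = {
    0: "positive Lisp integer or Kernel",
    1: "positive floating point",
    2: "positive floating point",
    3: "Ref storage",
    4: "Cons storage",
    5: "negative floating point",
    6: "negative floating point",
    7: "negative Lisp integer",
}


def region(word: int) -> str:
    """Classify a 32-bit word by its high-order 3 bits."""
    return REGIONS[(word & MASK32) >> 29]


def encode_integer(n: int):
    """Return ('lisp', word) for small integers, else ('ref', [count, w1..wn])."""
    if -(1 << 29) <= n < (1 << 29):
        return ("lisp", n & MASK32)
    # Enough 32-bit words to hold n in 2's complement, sign bit included.
    count = (n.bit_length() + 1 + 31) // 32
    value = n & ((1 << (32 * count)) - 1)
    words = [(value >> (32 * i)) & MASK32 for i in range(count)]
    return ("ref", [count] + words)
```

For example, encode_integer(-1) yields the Lisp integer 0xFFFFFFFF, which falls in region 7, while encode_integer(2^29) no longer fits and is returned as a one-word vector.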

To support 32-bit cardinal arithmetic, instructions which trapped on adder carry-out = 1 (i.e., on Cardinal overflow) would be needed; these do not exist, though the Vanilla or Unsigned arithmetic operations might be used. It is possible to carry out cardinal addition and subtraction as follows: First, complement the sign bit of each operand. Then execute an integer operation. Finally, complement the sign bit of the result. It can easily be shown that a cardinal comparison is equivalent to an integer comparison with the sign bit of the two operands complemented.
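The closing claim about comparison can be checked with a few lines of Python. The sketch covers only the comparison; the function names are hypothetical.

```python
SIGN_BIT = 0x80000000


def to_signed(word: int) -> int:
    """Interpret a 32-bit word as a 2's-complement integer."""
    return word - (1 << 32) if word & SIGN_BIT else word


def cardinal_less(a: int, b: int) -> bool:
    """Compare two 32-bit cardinals using only a signed comparison."""
    return to_signed(a ^ SIGN_BIT) < to_signed(b ^ SIGN_BIT)
```

Complementing the sign bit maps the cardinal range [0..2^32) onto the integer range [-2^31..2^31) while preserving order, which is why the signed comparison gives the cardinal answer.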

2.2.1.2 Logical Operations

Logical operations are performed on 32-bit quantities that are treated as arrays of 32 booleans. For example, the Or instruction reads: Stack[S-1] ← Stack[S] or Stack[S-1]; S ← S - 1. It performs a logical OR for each of the 32 bit positions.

2.2.1.3 Comparative Operations

Comparative operations are performed on 32-bit signed quantities. The Carry bit is not used or set. For example, the Jump Not Equal Byte Byte instruction reads: If Stack[S] ~= zExt[literal], then PC ← PC + sExt[displacement]; S ← S - 1. Note: Numbers that are zero-extended are understood to be positive (see the definition of zExt in Section 2.3).

2.2.2 Instruction Execution and Sequencing

2.2.3 Field Unit Operations

The field unit enables shifting, rotation, insertion, and masking of fields. It takes two words, and produces a one word result, under the control of a field descriptor. The field descriptor is a Machine Dependent Record. This means that it may not be packed for storage efficiency or arranged in an order that differs from the left to right order of the original record declaration. It is supplied either through a 16-bit constant or through the Field register in the EU; it has the following format:

[Artwork node; type 'ArtworkInterpress on' to command tool]

The four fields are defined as follows:

r1 - r3

r1 - r3 are bits that are not currently used and must be set to zero.

insert

insert governs the choice of background and low bits of the mask. If insert is false, the output of the instruction is the logical And of the shifted double word and the width mask. If insert is true, the instruction performs an insert operation.

width

Width gives the number of right-justified ones in the mask. (If width = 0, there are no ones. If width = 32, the mask is entirely ones.)

shift

Shift gives the number of bits by which to left-shift the double word.

A Cedar program, entitled FieldUnit, describes the operation of the Field Unit. Here are definitions used by the FieldUnit program:

DragonTooth: CEDAR DEFINITIONS = {

Dragon teeth determine how big dragon bytes are.

Bit: TYPE = MACHINE DEPENDENT {zero (0), one (1)};

Zero: Bit = zero;

One: Bit = one;

BitsPerWord: CARDINAL = 32;

BitIndex: TYPE = CARDINAL [0..BitsPerWord);

FieldWidth: TYPE = CARDINAL [0..BitsPerWord];

ShiftIndex: TYPE = CARDINAL [0..BitsPerWord];

Word: TYPE = ARRAY BitIndex OF Bit;

ZeroesWord: Word = ALL[Zero];

OnesWord: Word = ALL[One];

Here is the user interface to the FieldUnitImpl procedure:

DIRECTORY

FieldUnit,

DragonTooth USING [Word, ZeroesWord, OnesWord],

DragonToothOps USING [BitWiseAnd, BitWiseMultiplex, DoubleWordShiftLeft];

FieldUnitImpl: CEDAR PROGRAM

IMPORTS DragonToothOps

EXPORTS FieldUnit = {

Operate: PUBLIC PROCEDURE [left, right: DragonTooth.Word, fieldOp: FieldUnit.FieldDescriptor] RETURNS [output: DragonTooth.Word] = {

There are two cases,

either

extracting a right-justified field from left and inserting it into the middle of right.

widthMask: DragonTooth.Word ← DragonToothOps.DoubleWordShiftLeft[left: DragonTooth.ZeroesWord, right: DragonTooth.OnesWord, shiftAmount: fieldOp.width];

A mask for a right-justified field of length fieldOp.width.

shifted: DragonTooth.Word ← DragonToothOps.DoubleWordShiftLeft[left: left, right: right, shiftAmount: fieldOp.shift];

shift the contents of the field into position.

IF fieldOp.insert

THEN {

This is the hard case, where a right-justified field is extracted from left and inserted into the middle of right. In this case the operands aren't named very well.

`left' holds the right-justified source bits to be inserted,

and so `shifted' holds the source bits shifted into alignment with the destination bits.

`right' holds the word into which the field is inserted,

`widthMask' holds a mask for the field plus `shift' extra bits on the right.

`shiftMask' (defined below) will be used to mask off those extra bits.

The mask for the field is constructed and then used to extract the source bits from `shifted' and to extract the bits around the destination from `right'.

shiftMask: DragonTooth.Word ← DragonToothOps.DoubleWordShiftLeft[left: DragonTooth.OnesWord, right: DragonTooth.ZeroesWord, shiftAmount: fieldOp.shift];

fieldMask: DragonTooth.Word ← DragonToothOps.BitWiseAnd[widthMask, shiftMask];

note that if fieldOp.shift ≥ fieldOp.width then fieldMask is all 0's.

output ← DragonToothOps.BitWiseMultiplex[ifZero: right, ifOne: shifted, selector: fieldMask];

the source bits from `shifted' and the surrounding bits of `right'.

}

ELSE {

The easy case, extract a right-justified field from the shifted Words.

output ← DragonToothOps.BitWiseAnd[shifted, widthMask];

};
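For readers who want to experiment with the Operate procedure, here is a Python transliteration operating on 32-bit integers. It is a sketch, not the Cedar program itself: DragonToothOps is not shown in this document, so the behaviour of DoubleWordShiftLeft (return the high-order word of the 64-bit quantity left:right after shifting left) is an assumption, and the descriptor fields are passed as separate arguments.

```python
MASK32 = 0xFFFFFFFF


def double_word_shift_left(left: int, right: int, shift_amount: int) -> int:
    """High word of the 64-bit value left:right shifted left by 0..32 bits (assumed)."""
    double = ((left & MASK32) << 32) | (right & MASK32)
    return ((double << shift_amount) >> 32) & MASK32


def operate(left: int, right: int, insert: bool, width: int, shift: int) -> int:
    # Mask with `width` right-justified ones.
    width_mask = double_word_shift_left(0, MASK32, width)
    # Shift the contents of the field into position.
    shifted = double_word_shift_left(left, right, shift)
    if insert:
        # Mask off the `shift` extra low-order bits covered by width_mask.
        shift_mask = double_word_shift_left(MASK32, 0, shift)
        field_mask = width_mask & shift_mask
        # Bitwise multiplex: `shifted` where the mask is 1, `right` elsewhere.
        return (shifted & field_mask) | (right & ~field_mask & MASK32)
    # Extract a right-justified field from the shifted words.
    return shifted & width_mask
```

Under these assumptions, operate(0, 0x12345678, False, 8, 8) extracts the top byte 0x12; operate(x, x, False, 32, 8) rotates x left by 8 bits; and operate(0xAB, 0x11111111, True, 16, 8) inserts the byte 0xAB into bits 8 through 15 of the right operand, giving 0x1111AB11. Note that in the insert case width counts the field plus the shift amount.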

2.3 Instruction formats

There are eleven instruction formats. They vary in length from 1 to 5 bytes and specify operations on from 0 to 3 operands. Nine of the formats have an opcode occupying 8 bits. Two formats have a 4-bit opcode followed by a 4-bit specification for an operand. The operands may be specified implicitly or explicitly; they may be registers or be determined from an index register and an offset. In all cases the way in which the operands are specified is determined implicitly from the opcode.

Here are brief definitions of terms and abbreviations used in describing the instruction formats and instructions:

Implicit (abbreviated I): The locations of the operands are implicit in the opcode of the instruction. Some of the instructions specify some operands implicitly and some operands explicitly. The term implicit is used to describe the format for a particular instruction if and only if no operands are designated explicitly.

Literal (abbreviated L): The instruction contains the operand itself, rather than an address or other information describing where the operand is. A frequently-used synonym for the term literal is immediate.

Register (abbreviated R): The address fields of the instruction specify register operands.

Indexed Register (abbreviated X): Indexed-Register instructions have an operand whose address is the sum of an offset and the contents of a register.

Mem: memory word

Mem[addr]: the contents of the register or memory word at address addr

Aux: auxiliary register

Const: constants register

ProcReg: processor register

ProcBus: processor bus

FieldDesc: field unit descriptor

FieldOp: field operation

FP: floating point

Offset: a non-negative byte displacement.

Displacement: a signed byte displacement.

sExt[x]: x, a 2's-complement number, is sign-extended to the width of the destination.

zExt[x]: x, an unsigned (positive) number, is zero-extended to the width of the destination.

m: an integer in the range [0..8)

n: an integer in the range [0..8)
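The two extension operations can be illustrated in Python for a 32-bit destination. The function names and the explicit bits parameter (the width of x) are choices made for this sketch.

```python
MASK32 = 0xFFFFFFFF


def z_ext(x: int, bits: int) -> int:
    """Zero-extend the low `bits` bits of x to 32 bits."""
    return x & ((1 << bits) - 1)


def s_ext(x: int, bits: int) -> int:
    """Sign-extend the low `bits` bits of x (2's complement) to 32 bits."""
    x &= (1 << bits) - 1
    sign = 1 << (bits - 1)
    return ((x ^ sign) - sign) & MASK32
```

For a literal byte 0xFF, z_ext gives 0x000000FF (the value 255), while s_ext gives 0xFFFFFFFF (the value -1).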

Implicit Operand Specification:
I Implicit

[Artwork node; type 'ArtworkInterpress on' to command tool]

This format is used to perform stack operations. The operand (if any) is implicitly specified by the opcode.

Example: Add. ADD: Stack[S-1] ← Stack[S] + Stack[S-1] + Carry; Carry ← 0; S ← S - 1;

Literal Operand Specification:
LB Literal Byte

[Artwork node; type 'ArtworkInterpress on' to command tool]

For LB instructions the literal byte following the opcode is used in 1 of 3 ways. It is zero-extended to 32 bits for stack operations: operand ← zExt[literal]. It is used as a 32-bit signed displacement when computing a new PC: operand ← sExt[literal]. And, it is used to calculate a displacement from S or L: operand ← literal. In calculating displacement from S or L, all arithmetic is performed modulo the size of the EU Stack.

Example: Add Byte. ADDB: Stack[S] ← Stack[S] + zExt[literal] + Carry; Carry ← 0.

LH Literal Halfword

[Artwork node; type 'ArtworkInterpress on' to command tool]

For LH instructions the literal halfword following the opcode is used in 1 of 3 ways. It is zero-extended to 32 bits for stack operations: operand ← zExt[literal]. It is used as a 32-bit signed displacement when computing a new PC: operand ← sExt[literal]. The low-order 13 bits are used as a descriptor for Field Unit operations: operand ← Low13Bits[literal].

Example: Load Immediate Double Byte. Stack[S+1] ← zExt[literal]; S ← S + 1.

LW Literal Word

[Artwork node; type 'ArtworkInterpress on' to command tool]

For LW instructions the operand is the 32-bit literal quantity following the opcode, operand ← literal.

Example: Load Immediate Quad Byte. LIQB: Stack[S+1] ← literal; S ← S + 1.

LBD Literal Byte Displacement

[Artwork node; type 'ArtworkInterpress on' to command tool]

LBD instructions have two operands. The first operand is a literal used for comparison with the top of the stack, operand1 ← zExt[literal]. The second operand is a signed displacement, operand2 ← sExt[displacement], used in computing the new PC, PC ← PC+operand2.

Example: Jump Equal Byte Byte. JEBB: If Stack[S] = zExt[literal] then PC ← PC + sExt[displacement]; S ← S - 1.
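The JEBB example can be traced with a short Python sketch. It rests on several assumptions that the text does not settle: the instruction is taken to be 3 bytes long (opcode, literal, displacement), the displacement is applied to the address of the instruction itself, the stack is popped whether or not the jump is taken, and the function name is hypothetical.

```python
MASK32 = 0xFFFFFFFF
LBD_LENGTH = 3  # assumed: opcode byte, literal byte, displacement byte


def jebb(stack: list, pc: int, literal: int, displacement: int) -> int:
    """Execute Jump Equal Byte Byte; return the new PC and pop the stack."""
    top = stack.pop()                      # S <- S - 1 (assumed unconditional)
    operand1 = literal & 0xFF              # zExt[literal]
    disp = displacement & 0xFF
    operand2 = disp - 0x100 if disp & 0x80 else disp   # sExt[displacement]
    if top == operand1:
        return (pc + operand2) & MASK32    # jump taken
    return (pc + LBD_LENGTH) & MASK32      # fall through to the next instruction
```

With the top of stack equal to 7, a literal of 7, and a displacement byte of 0xFE, the sketch moves PC back by 2 bytes; with a literal of 8 it falls through to PC + 3.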

The Registers to Register, Quick Register and Register Displacement formats take operands from registers. The Registers to Register format selects a destination operand (Rc) and 2 source operands (Ra and Rb). The Quick Register format is a tighter form of the Registers to Register format; it provides the same decoding as the Registers to Register format for the Rb operand, but limits the possibilities for Ra and Rc to 4 different pairs of locations. The Register Displacement format provides the same decoding as the Registers to Register format for the Rb operand and gives 4 possibilities for the second operand, Rs. (Rs stands for short source register.)

The algorithms used to select Rc, Ra, Rb, and Rs are given here by the CEDAR program, OperandSpecifierImpl.

Here are the definitions used by the CEDAR implementation.

OperandSpecifier: CEDAR DEFINITIONS = {

Here are the type definitions used in accessing Locals[], AuxRegs[] and Constants[]:

AuxiliaryRegisterIndex: TYPE = CARDINAL [0..15];

LocalRegisterIndex: TYPE = CARDINAL [0..15];

ConstantRegisterIndex: TYPE = CARDINAL [0..11];

ShortConstantIndex: TYPE = ConstantRegisterIndex [0..1];

Some types for strong type checking.

SourceDeltaS: TYPE = INTEGER [-1..0];

DestinationDeltaS: TYPE = INTEGER [0..+1];

Here are the abstract locations and specifiers for source operands (Ra and Rb):

SourceLocation: TYPE = {AuxReg, Local, Constant, Top, Under};

SourceSpecifier: TYPE = RECORD [

SELECT location: SourceLocation FROM

AuxReg => [aux: AuxiliaryRegisterIndex],

Local => [local: LocalRegisterIndex],

Constant => [constant: ConstantRegisterIndex],

Top => [deltaS: SourceDeltaS],

Under => [deltaS: SourceDeltaS],

ENDCASE

];

Here are the abstract locations and specifiers for destination operands (Rc):

DestinationLocation: TYPE = {AuxReg, Local, Constant, Top, Under, Push};

DestinationSpecifier: TYPE = RECORD [

SELECT location: DestinationLocation FROM

AuxReg => [aux: AuxiliaryRegisterIndex],

Local => [local: LocalRegisterIndex],

Constant => [constant: ConstantRegisterIndex],

Top => [-- deltaS: 0 --],

Under => [-- deltaS: 0 --],

Push => [-- deltaS: 1 --],

ENDCASE

];

Here are the abstract locations and specifiers for shortCASpecifier operands (Rc and Ra):

ShortCASpecifier: TYPE = RECORD [

location: ShortCASelector

];

These are the supporting definitions that reflect the mappings between bit patterns and names

SourceSelector: TYPE = MACHINE DEPENDENT {

Constant0 (0), Constant1, Constant2, Constant3,

Constant4 (4), Constant5, Constant6, Constant7,

Constant8 (8), Constant9, Constant10, Constant11,

Top (12), Under, PopTop, PopUnder (15)

};

DestinationSelector: TYPE = MACHINE DEPENDENT {

Constant0 (0), Constant1, Constant2, Constant3,

Constant4 (4), Constant5, Constant6, Constant7,

Constant8 (8), Constant9, Constant10, Constant11,

Top (12), Under, Push, Reserved (15)

};

ShortCASelector: TYPE = MACHINE DEPENDENT {

TopAtop(0), PushAtop, PushA0, PushA1 (3)

};

ShortSourceSelector: TYPE = MACHINE DEPENDENT {

Constant0 (0), Constant1, Top, PopTop (3)

};

Translations from bit patterns to operand specifiers

SourceOperand: PUBLIC PROCEDURE [auxFlag: BOOL, operandFlag: BOOL, operandSelector: SourceSelector] RETURNS [SourceSpecifier];

DestinationOperand: PUBLIC PROCEDURE [auxFlag: BOOL, operandFlag: BOOL, operandSelector: DestinationSelector] RETURNS [DestinationSpecifier];

ShortCAOperand: PUBLIC PROCEDURE [operandSelector: OperandSpecifier.ShortCASelector] RETURNS [C: OperandSpecifier.DestinationSpecifier, A: OperandSpecifier.SourceSpecifier];

ShortSourceOperand: PUBLIC PROCEDURE [operandSelector: ShortSourceSelector] RETURNS [SourceSpecifier];

Here is the code that actually selects the registers to be used for destination operands (Rc), source operands (Ra and Rb) and short source operands (Rs):