This segment of the tutorial introduces branches, particularly conditional branches and function calls. This is Part 3 of a series. If you haven't yet, you may wish to review at least Part 1 and Part 2.

1 Unconditional Branches and Jumps
2 Conditional Branches
- 2.1 Signed Comparisons
  - 2.1.1 How Signed Arithmetic Works With Conditional Branches
  - 2.1.2 Gotchas With the CP1610 Compare Instruction
- 2.2 Unsigned Comparisons
  - 2.2.1 How Unsigned Arithmetic Works With Conditional Branches
3 Function Calls
4 Indirect Branches and Jump Tables

Unconditional Branches and Jumps

Jump and Unconditional Branch instructions are like "goto" instructions found in higher-level languages. They tell the CPU to immediately start executing code from somewhere else as opposed to executing the next instruction in sequence. They're useful to tell the CPU to skip a section of code, or to repeat a section of code. Be careful when repeating a block of code with an unconditional branch: If there isn't a conditional branch somewhere in that block, it could end up running forever.

The following table lists the instructions:

Mnemonic	Description	Cycles	Size
B	Branch to label	9	2 words
J	Jump to label	12	3 words
JD	Jump to label while disabling interrupts	12	3 words
JE	Jump to label while enabling interrupts	12	3 words
JR	Jump to address in register	7	1 word

As you can see, the primary difference between branches and jumps is that branches are smaller and faster. Branches encode their "target address," the address being jumped to, as a relative offset from the current address. Jumps, on the other hand, store the actual address of the target. In most cases, especially in a 16-bit ROM, there are few reasons to use a J instruction, although the combination instructions, JD and JE can be useful.

There is also a pseudo-instruction, JR, that allows "jumping to a location held in a register." It is really a pseudonym for "MOVR Rx, R7". Because it is a MOVR instruction, it will modify the Sign Flag and Zero Flag, which may be confusing if you're not expecting it. Otherwise, it is an efficient method for jumping to an address held in an register, such as when returning from a CALL.

Conditional Branches

Conditional branches are similar to "if ... goto" type constructs found in other languages. They tell the CPU to start executing at another location if a particular condition is true. This allows us to build "if-then-else" types of statements, as well as "for" and "while" loops.

The CP1610 has a rich set of conditional branch instructions. They work by looking at the CPU's flag bits to decide when to branch. Instructions like CMP and DECR set these flags, making it easy to write those if/else statements and for/while loops. Even fancier uses are possible with some creativity.

The following table summarizes the conditional branches.

Mnemonic	Name	Branch taken when...	Mnemonic	Name	Branch taken when...
BC	Branch on Carry	`C = 1`	BNC	Branch on No Carry	`C = 0`
BOV	Branch on OVerflow	`OV = 1`	BNOV	Branch on No OVerflow	`OV = 0`
BPL	Branch if PLus	`S = 0`	BMI	Branch on MInus	`S = 1`
BEQ	Branch if EQual	`Z = 1`	BNEQ	Branch on Not Equal	`Z = 0`
BZE	Branch on ZEro	`Z = 1`	BNZE	Branch on Not ZEro	`Z = 0`
BLT	Branch if Less Than	`S <> OV`	BGE	Branch if Greater than or Equal	`S = OV`
BNGE	Branch if Not Greater than or Equal	`S <> OV`	BNLT	Branch if Not Less Than	`S = OV`
BLE	Branch if Less than or Equal	`Z = 1 OR S <> OV`	BGT	Branch if Greater Than	`Z = 0 AND S = OV`
BNGT	Branch if Not Greather Than	`Z = 1 OR S <> OV`	BNLE	Branch if Not Less than or Equal	`Z = 0 AND S = OV`
BUSC	Branch on Unequal Sign and Carry	`S <> C`	BESC	Branch on Equal Sign and Carry	`S = C`

Conditional branches are most often used with numeric comparisons, or as the branch at the end of a loop. The following sections illustrate how numeric comparisons, increment and decrement work in concert with branches.

Conditional branches can also be paired with other instructions that manipulate the flags. For instance, shift instructions, as described in Part 4 update sign, zero, carry and overflow flags depending the operation performed. This can lead to interesting and creative combinations of shifts and branches.

Another use of flags and branches is to pass status information in CPU flags (such as the Carry Flag) and then act on that information later. The SETC and CLRC instructions make it easy to manipulate the Carry Flag to pass this status information around.

Signed Comparisons

The following branches are particularly useful when comparing signed numbers. These are the conditional branches you will use most when writing "if-then-else" statements, since most numbers are signed, or can be treated as signed.

Mnemonic		Branch taken when...	Mnemonic		Branch taken when...
BEQ	BZE	`Z = 1`	BNEQ	BNZE	`Z = 0`
BLT	BNGE	`S <> OV`	BGE	BNLT	`S = OV`
BLE	BNGT	`Z = 1 OR S <> OV`	BGT	BNLE	`Z = 0 AND S = OV`
BOV		`OV = 1`	BNOV		`OV = 0`

Note: One pair of branches shown above—BOV and BNOV—are useful in this context only for detecting overflow and little else. I included them here for completeness. These actually find more use paired up with shift instructions. Those are described Part 4 of this tutorial.

How Signed Arithmetic Works With Conditional Branches

The next couple of paragraphs describe how these branches work with compares. Most of the time, you do not need to think about these details when writing programs, so feel free to skim it for now and come back to it later.

The compare instruction compares two numbers by subtracting them, and then setting the flags based on the result. This provides a lot of information about the relative values of the two numbers, as this table shows (ignoring overflow):

If this is true...	...then this also must be true...	...which implies the flags get set as follows (if you ignore overflow).
`x = y`	`x - y = 0`	S = 0	Z = 1
`x < y`	`x - y < 0`	S = 1	Z = 0
`x > y`	`x - y > 0`	S = 0	Z = 0

That is, we can determine whether two numbers are equal or not by looking at the Zero Flag. We can determine if one's less than the other by looking at the Sign Flag. At least, that would be true if there was no such thing as overflow.

The CP1610 can only work with 16 bits at a time. If you try to subtract two numbers whose values are very far apart, such as, in the worst case 32767 - (-32768), you will trigger an overflow, because the results don't fit in 16 bits. Overflow causes the sign of the result to be the opposite of what you would get if no overflow had occurred. The branches take this into account and look at the overflow bit in addition to the sign bit to decide whether one number is greater than or less than another. The following table illustrates the relationships both with and without overflow.

If this is true...	...then the flags get set as follows...			...which matches these branches.
`x = y`	S = 0	Z = 1	OV = 0	BEQ, BGE, BLE
`x < y`	S = 1	Z = 0	OV = 0	BNEQ, BLT, BLE
`x < y`	S = 0	Z = 0	OV = 1	BNEQ, BLT, BLE
`x > y`	S = 0	Z = 0	OV = 0	BNEQ, BGT, BGE
`x > y`	S = 1	Z = 0	OV = 1	BNEQ, BGT, BGE

As you can see, the flags and the branches cooperate quite nicely.

Gotchas With the CP1610 Compare Instruction

The syntax for the CP1610's compare instruction can confuse things slightly, since it does a "subtract from". Consider the following example:


   MVII    #1, R0  ; R0 = 1
   MVII    #2, R1  ; R1 = 2
   CMPR    R0, R1  ; Subtract R0 from R1 to set flags
   BLT     label   ; Is this taken?

This computes "R1 - R0", not "R0 - R1". It compares R0 to R1 by subtracting R0 from R1. In this example, that leaves S=0 and OV=0. R1 is not less than R0, so the branch is not taken. In other words, to determine if a given branch is taken, you have to read right-to-left. "Is R1 less than R0?" In this case, the answer is no, so the branch is not taken.

Unsigned Comparisons

These branches are useful when comparing unsigned numbers. Most often, these get used with pointers into your game ROM, or with certain fixed point values such as screen coordinates.

Mnemonic		Branch taken when...	Mnemonic		Branch taken when...
BC		`C = 1`	BNC		`C = 0`
BEQ	BZE	`Z = 1`	BNEQ	BNZE	`Z = 0`

The unsigned comparisons require some additional explanation to be useful. After comparing two unsigned numbers, the BC instruction will branch if the second number is greater than or equal to the first. The BNC instruction will branch if the second number is smaller than the first. The CP1610 does not offer unsigned equivalents for "branch if greater than" or "branch if less than or equal." You can get similar effects, though, by combining BC and BNC with BEQ or BNEQ to separate out the "equals", "greater-than" and "less-than" cases.

How Unsigned Arithmetic Works With Conditional Branches

The following couple of paragraphs explain how BC and BNC work in concert with other instructions to provide unsigned comparisons. Feel free to skim this for now and come back to it later.

Unsigned arithmetic is actually pretty similar to signed arithmetic, except that there isn't a sign bit. Furthermore, it turns out that all the interesting information ends up in the carry bit. Read "How Signed Arithmetic Works With Conditional Branches" above to get the basics.

As mentioned before, the compare instruction compares two numbers by subtracting them and setting the flags. When subtracting two unsigned numbers, the Carry flag doubles as a "do not borrow" flag. With that in mind (and handwaving just how carry functions that way for now), we have:

If this is true...	...then this also must be true...	...which implies the flags get set as follows
`x = y`	`x - y = 0`	C = 1 (no borrow)	Z = 1
`x > y`	`x - y > 0`	C = 1 (no borrow)	Z = 0
`x < y`	`x - y < 0`	C = 0 (borrow needed)	Z = 0

As you can see, the carry flag gets set whenever 'x' is greater than or equal to 'y'. The carry flag is clear whenever 'x' is less than 'y'. This is why BC works as the equivalent of an unsigned "branch if greater than or equal," and BNC works as an unsigned version of "branch if less than."

http://www.medschart.net/ ambien 307321 http://www.cheapprix.com/ propecia 8-) http://www.meddeal.net/ ultram tramadol used to treat ocd hukj http://www.medicalbest.net/ phentermine prescription diet pills ynrar http://www.medscareonline.net/ tramadol =-[

Function Calls

Functions, also known as procedures or subroutines, are isolated bits of code that perform some action. These bits of code are likely to be called from many places. Functions provide a useful way to encapsulate functionality and structure your program. The Jump to SubRoutine family of instructions provide a handy mechanism for doing this:

Mnemonic		Description	Cycles	Size
JSR	CALL	Jump to SubRoutine at label	12	3 words
JSRD		Jump to SubRoutine at label while disabling interrupts	12	3 words
JSRE		Jump to SubRoutine at label while enabling interrupts	12	3 words

Each of the JSR instructions take two arguments: A register and a label (or address). The CPU puts the return address in the specified register and then jumps to the specified label. The return address is the address of the word following the JSR instruction. The CALL label instruction is a pseudonym for JSR R5, label.

Simple Call/Return

Many functions perform a simple task and then return. The screen clearing example above is an example of this. With a little extra code, we can turn that into a function named CLRSCR:

CLRSCR PROC
       ; Clear the screen
       MVII    #$0200,  R4   ; Point to BACKTAB
       CLRR    R0            ; Set R0 = $0000
       MVII    #240,    R1   ; Use R1 as our "loop counter"
@@loop:
       MVO@    R0,      R4   ; Write a blank to the screen.
       DECR    R1            ; Count down the loop counter
       BNEQ    @@loop        ; Keep looping until counter goes to zero

       JR      R5            ; Return from function call
       ENDP

The additions to the code are minor. PROC and ENDP directives tell the assembler where the "procedure" begins and ends. This mainly provides a scope for local labels, such as @@loop in this example. The Assembly Syntax Overview describes these directives in a little more detail.

The other main addition is the JR R5 instruction. What this does is return from the function back to the code that called it. How does this work?

When calling this function with CALL CLRSCR (or JSR R5, CLRSCR), the CPU will copy the address of the instruction following the call into R5, and then branch to the function. The JR R5 instruction tells the CPU to jump back to that location, in effect returning from the called function.

This works as long as the function doesn't use R5 itself. If the function uses R5 for some purpose, it needs to save the return address somewhere—either another register, in a location in memory, or on the stack. Saving the return address on the stack is the most popular. Here's the same function, modified to save the return address on the stack, even though it doesn't strictly need to:

CLRSCR PROC
       PSHR    R5            ; save return address on the stack

       ; Clear the screen
       MVII    #$0200,  R4   ; Point to BACKTAB
       CLRR    R0            ; Set R0 = $0000
       MVII    #240,    R1   ; Use R1 as our "loop counter"
@@loop:
       MVO@    R0,      R4   ; Write a blank to the screen.
       DECR    R1            ; Count down the loop counter
       BNEQ    @@loop        ; Keep looping until counter goes to zero

       PULR    PC            ; Return from function call
       ENDP

The PSHR R5 instruction saves the return address on the stack. The PULR PC pops the top item off the stack, and puts it in the program counter. (Note that PC is just a synonym for R7.)

Passing Arguments to Functions

In the previous CLRSCR example, the function itself didn't take any inputs. (Inputs to functions are also referred to as arguments or parameters.) One way to pass arguments into a function is to set up the values in CPU registers. The caller and callee need to agree on the meaning of the various registers for this to work.

Suppose we wanted to generalize CLRSCR a little bit, and have it just be a generic memory-fill function. That function would need three arguments:

The address of memory to fill
The number of locations to fill
The value to fill with

An example implementation of FILLMEM appears below. It expects the address of the location to fill In R4, the number of locations to fill in R1 and the value to fill in R0.

FILLMEM PROC
@@loop: MVO@    R0, R4
        DECR    R1
        BNEQ    @@loop
        JR      R5
        ENDP

This is somewhat shorter than the CLRSCR code, mainly because the instructions that set up R0, R1 and R4 are omitted. FILLMEM expects the calling function to set these up. It's now possible to write CLRSCR in terms of a call to FILLMEM. This isn't the best way to write CLRSCR, but it is a useful example to illustrate the concepts.

CLRSCR  PROC
        PSHR    R5          ; Save return address

        MVII    #$200, R4   ; Point to BACKTAB
        MVII    #240,  R1   ; Clear 240 locations
        CLRR    R0          ; Prepare to write zeros

        CALL    FILLMEM     ; Fill the screen with zeros

        PULR    PC          ; Return to our caller
        ENDP

(Note: For a more compact way of writing CLRSCR and FILLMEM, take a look at the optimized implementation in SDK-1600.)

Call Chaining: Having One Function Return for Another

Sometimes, the last thing you do at the end of one function is call another. When that function returns, the only work left to do is to return again. While that works, it's often possible to "chain" the calls—that is, branch to the next function and let it return for you. Note that this is purely an optimization. If it is at all unclear to you, I recommend you steer clear of it unless you're in a crunch. Feel free to skip this section and come back to it if you need it later.

Suppose we write a function that spins for a moment and then clears the screen. There are many ways to do this, each with different advantages and disadvantages. The code below has the advantage of being easy to write and understand. This first version shows a normal call/return sequence:

; Delay for a moment and then clear the screen
DLYCLR  PROC
        PSHR    R5          ; Save the return address

        MVII    #$FFFF, R0 
        ; Loop a bunch of times doing nothing
@@spin: NOP
        NOP
        DECR    R0
        BNEQ    @@spin

        ; Now clear the screen
        CALL    CLRSCR

        PULR    PC          ; Return to our caller
        ENDP

Here, we save our return address, then we call out to CLRSCR. At the end, CLRSCR returns to us, and we return to our caller. This can be made more efficient by having CLRSCR return directly to our caller for us. This saves stack space (which is often at a premium) and can save cycles. (This example is clearly not cycle-count sensitive.)

Consider the following version of the above example:

; Delay for a moment and then clear the screen
DLYCLR  PROC 
        MVII    #$FFFF, R0 
        ; Loop a bunch of times doing nothing
@@spin: NOP
        NOP
        DECR    R0
        BNEQ    @@spin

        ; Now clear the screen and return
        B       CLRSCR
        ENDP

Notice how this version of the code doesn't save the return address, and in fact doesn't do anything with it. Instead of using CALL to invoke CLRSCR, it merely branches to it. Since the return address for DLYCLR is already in R5, when CLRSCR completes and branches to it, it'll return to DLYCLR's caller. This is what "call chaining" is all about: In this case, CLRSCR returns on behalf of DLYCLR. Rather than having DLYCLR call CLRSCR, it instead branches—aka. chains—to it.

It's not always possible to chain calls like this. In particular, if you pass arguments after your CALL instruction, as described in the previous section, you cannot chain to that call.

As a pragmatic measure, I suggest avoiding call-chaining unless you need the stack space, code space or cycles that it saves. Write your code initially without it, and optimize later if you need it.

Indirect Branches and Jump Tables

"It was a 'Jump to Conclusions' mat. You see, it would be this mat that you would put on the floor... and would have different conclusions written on it that you could jump to." -- Tom Smykowski, Office Space

Indirect branches are branches whose destination isn't encoded into the instruction itself. You've seen a special version of this above: The JR R5 and PULR PC sequences for returning from function calls are both forms of indirect branches.

These sorts of branches are useful for many things. You can implement multi-way branches similar to C's switch/case construct (or BASIC's ON .. GOTO), or call-backs and so on. The following sections explore some of these uses.

Jump Vectors

Jump vectors are locations in memory that hold addresses of functions to jump to. For example, when an interrupt occurs, the EXEC does a tiny bit of housekeeping (it saves the registers on the stack), and then it looks in locations $100-$101 for the address of an interrupt service routine to jump to. This allows the game to change what function handles interrupts to suit its needs, despite the fact that the interrupt is handled initially by the EXEC ROM.

The following code illustrates how to set up the interrupt service vector. Since that vector is in 8-bit memory and program addresses are 16 bits, the vector occupies two locations.

        MVII    #MYISR, R0    ; MYISR is the label of the function that will get called
        MVO     R0,     $100  ; Write out lower 8 bits of "MYISR" to location $100
        SWAP    R0            ; Swap upper/lower halves of address
        MVO     R0,     $101  ; Write out upper 8 bits of "MYISR" to location $101

Other uses for jump vectors include setting up dispatch addresses for interesting events, such as receiving controller input, a timer expiring, or two objects colliding. Jumping via a vector is easy. The following code snippets show how to do this for addresses in 8-bit and 16-bit memory.

        ; Jumping via vector stored in 8-bit memory
        MVII    #@@ret, R5    ; Set up return address in R5 (if needed)
        MVII    #vector,R4    ; Address of vector in 8-bit memory
        SDBD                  ; Read vector as Double Byte Data
        MVI@    R4,     R7    ; Read vector into program counter
@@ret:

        ; Jumping via vector stored in 16-bit memory
        MVII    #@@ret, R5    ; Set up return address in R5 (if needed)
        MVI     vector, R7    ; Read vector into program counter
@@ret:

Space Patrol uses a variation on this technique to manage the "bad guys." Instead of storing the address of the "thinker" associated with each bad guy, it instead stores a small integer index saying which thinker to call. A separate table expands that small integer into an actual thinker address. That blends the jump-vector idea with the next topic, jump tables.

Adding to the Program Counter

Sometimes, you'll run across a situation where the various places you're branching to are a fixed number of words apart. This next technique is a little tricky, but is very efficient for handling this case.

For example, suppose I want to shift a number left 4, 3, 2 or 1 positions. I know that the SLL opcode is only 1 word long. If I put four SLL instructions in a row, and then jump to the appropriate one in the list, I will get the desired overall shift amount. The basic idea is this:

        MAGIC_BRANCH   ; skips 0, 1, 2, or 3 of the following instructions
        SLL     R0, 1
        SLL     R0, 1
        SLL     R0, 1
        SLL     R0, 1

The "MAGIC_BRANCH" is really just a sequence that adds a register to the program counter based on how many instructions we intend to skip. Suppose R1 held the shift amount. The following example show how to use that to skip some number of SLL instructions based on that:

        NEGR    R1        ; \_ Subtract R1 from 4 so it's now 3..0
        ADDI    #4, R1    ; /
        ADDR    R1, R7    ; Skip 0, 1, 2 or 3 of following instructions
        SLL     R0, 1
        SLL     R0, 1
        SLL     R0, 1
        SLL     R0, 1

The code subtracts the shift amount from 4, because a shift of 4 needs to skip 0 instructions, whereas a shift of 1 needs to skip 3 instructions. When you add to the program counter, as this example does, the value in the program counter at the time of the add is the address of the next instruction. This is why a value of 0 skips zero instructions.

You can also use this technique in combination with branches as an alternate way of implementing a switch/case statement. This method is larger and slower than most other methods, though. It made more sense with 10-bit wide ROMs than with today's 16-bit wide ROMs. The following example shows the switch/case from the previous section rewritten to use this technique. Note that branch instructions are 2 words long.

        MVI     X,       R1    ; get the value of 'X'.  Presumably it's 0..3.
        SLL     R1,      1     ; multiply X by 2
        ADD@    R1,      R7    ; jump to one of the following three branch inst. or @@case3
        B       @@case0
        B       @@case1
        B       @@case2
@@case3 ; do stuff for case 3
        B       @@done

@@case0 ; do stuff for case 0
        B       @@done

@@case1 ; do stuff for case 1
        B       @@done

@@case2 ; do stuff for case 2

@@done:

The example also shows a minor optimization: The highest numbered case can always be put right at the end of the list of branches, rather than having a branch to it there.

Clever (or Silly) Branch Tricks

The program counter, R7, can be accessed by almost any of the CPU's instructions. This leads to several cute tricks that border on being a bit too clever. Since these show up in actual code, it's worth at least explaining how they work.

Using DECR to Spin "Forever"

Sometimes, your program will need to simply "sit and spin" when trying to synchronize with an interrupt, say as part of an initialization sequence. (Interrupts and their uses will be covered in more detail in a future tutorial.) The DECR PC instruction decrements the program counter after executing the instruction. This puts the program counter right back at the same instruction, resulting in an infinite loop. The only thing that will break out of this infinite loop is an interrupt.

Here's an example:

        ; Prepare to continue initialization after interrupt
        MVII    #INIT,  R0      ;\
        MVO     R0,     $100    ; |_ Set interrupt service vector to point to "INIT"
        SWAP    R0              ; |
        MVO     R0,     $101    ;/

        EIS                     ; Enable interrupts
 
        DECR    PC              ; Spin until interrupt happens

Then, code at INIT can continue with initialization: Set up the STIC and GRAM, reset the stack pointer, and so on.

Introducing the Instruction Set Part 3

Contents

Unconditional Branches and Jumps

Conditional Branches

Signed Comparisons

How Signed Arithmetic Works With Conditional Branches

Gotchas With the CP1610 Compare Instruction

Unsigned Comparisons

How Unsigned Arithmetic Works With Conditional Branches

Function Calls

Simple Call/Return

Passing Arguments to Functions

Call Chaining: Having One Function Return for Another

Indirect Branches and Jump Tables

Jump Vectors

Adding to the Program Counter

Clever (or Silly) Branch Tricks

Using DECR to Spin "Forever"

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools