12.4 Target Language

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

12.4 Target Language

The target language is x86 assembly language. We summarize here the language in the form of tables. The syntax for the assembly language used by us is known as the AT&T syntax. It is the one supported by the GNU tool chain that becomes standard with every Linux distribution. However, the official syntax for x86 assembly language (known as the Intel syntax) is different. It is the same assembly language for the same platform, but it looks different. See Section 12.11 for some of the more important differences.

12.4.1 x86 Instructions

We do not plan to give here a description of all of hundreds of instructions in the instruction set of x86 architecture, but only those which will be used in the assembly language outputs which are given. The reader should consult appropriate documentation from the manufacturer for full details.

x86 Flags

O	Overflow flag. This is set to true if the destination operand was not large enough to hold the result of the instruction.
S	Sign flag. This is set to the sign of the last result.
Z	Zero flag. This flag is set to true if the result of the instruction is zero.
A	Auxiliary carry flag. This flag is set for carries and borrows between the third and fourth bits. It is not used by us.
P	Parity flag. This flag is set to true if the low byte of the last result had an even number of 1-bit.
C	Carry flag. This flag is used in arithmetic to indicate whether the result should be carried over to an additional byte. If the carry flag is set that usually means that the destination register could not hold the full result. It is up to the programmer to decide on what action to take, for example, propagate the result to another byte, signal an error or ignore it entirely.

Other flags exist, but they are much less important and not used by us.

The flags which get affected by a particular instruction are shown in the instruction set tables given below.

The source (src) and destination (dst) operands are listed giving the type of operands they take. An operand is shown as a code which tells whether the operand can be an immediate-mode value (I), a register (R) or a memory address (M). Note that in x86 assembly language you cannot have more than one operand, being a memory location.

Data Transfer Instructions

These instructions are mostly used for moving data from one place to another. They are given in Table 12.1.

Table 12.1 Data transfer instructions

Integer Instructions

These are the basic computing instructions that operate on signed or unsigned integers, given in Table 12.2. The logic instructions are given in Table 12.3. The flow control instructions are given in Table 12.4.

Table 12.2 Integer instructions

Table 12.3 Logic instructions

Table 12.4 Flow control instructions

Condition codes are:

[n]a[e] – above (unsigned greater than). An n can be added for not and an e can be added for “or equal to”
[n]b[e] – below (unsigned less than)
[n]e – equal to
[n]z – zero
[n]g[e] – greater than (signed comparison)
[n]I[e] – less than (signed comparison)
[n]c – carry flag set
[n]o – overflow flag set
[n]p – parity flag set
[n]s – sign flag set
ecxz – %ecx is zero

12.4.2 Assembler Directives

These are instructions to the assembler and linker, instead of instructions to the processor. The most essential of them are given in Table 12.5.

Table 12.5 Assembler directives

Op	Operands	Remarks
.ascii	Quoted string	Takes the given quoted string and converts it into byte data.
.byte	Values	Takes a comma-separated list of values and inserts them right there in the program as data.
.endr	—	Ends a repeating section defined with .rept.
.equ	Label, value	Sets the given label equivalent to the given value. The value can be a number, a character or a constant expression that evaluates to a number or character. From that point on, use of the label will be substituted for the given value.
.globl	Label	Sets the given label as global, meaning that it can be used from separately compiled object files.
.include	File	Includes the given file just as if it were typed in right there.
.lcomm	Symbol, size	This is used in the .bss section to specify storage that should be allocated when the program is executed. Defines the symbol with the address where the storage will be located, and makes sure that it is the given number of bytes long.
.long	Values	Takes a sequence of numbers separated by commas, and inserts those numbers as 4-byte words right where they are in the program.
.rept	Count	Repeats everything between this directive and the .endr directives the number of times specified.
.section	Section name	Switches the section that is being worked on. Common sections include .text (for code), .data (for data embedded in the program itself) and .bss (for uninitialized global data).
.type	Symbol, ©function	Tells the linker that the given symbol is a function.

12.4.3 Floating-point Instructions

The x86 architecture implements the IEEE-754 standard. The IEEE-754 32-bit precision format divides the 32-bit value into three different sections:

Sign size = 1-bit, bit-range: 31–31;

Biased exponent size = 8-bits, bit-range: 30–23; minimum value = –126 and maximum value = 127;

Mantissa size = 23-bits, bit-range: 22–0.

The mantissa expresses a fraction of the form 1.x where x is expressed by the mantissa section. Let s be the sign bit, be be the biased exponent and m_i, i ∈ {1, 2, …, 23} be the mantissa, then the value of most of the IEEE-754 encoded floats is given by:

There are certain special cases for invalid numbers, infinity and zero, etc. There are positive and negative infinities, positive and negative NaNs (Not-a-Numbers).

The special cases are:

if exponent is 0 and fraction is 0, the number is ±0 (depending on the sign bit),
if exponent = 255 and fraction is 0, the number is ±infinity (again depending on the sign bit), and
if exponent = 255 and fraction is not 0, the number being represented is not a number (NaN).

Some sample values (in hex) are: positive zero: 00 00 00 00, negative zero: 80 00 00 00, positive 1: 3F 80 00 00, negative 1: BF 80 00 00.

There are five possible exceptions with floating-point numbers:

Invalid operation (e.g., square root of a negative number)
Division by zero
Overflow (a result is too large to be represented correctly)
Underflow (a result is very small (outside the normal range) and is inexact)
Inexact

The FPU contains 8 80-bit stack elements called %st(0), %st(1), …, %st(7). They are implemented as a stack so that %st(0) always refers to the top element, %st(1) to the one below the top, etc. This makes it quite tricky to use.

FPU code is rather fragile and often hard to read. It is strongly recommended that you read a good instruction set reference simultaneously when you are working with FPU code. We take some easy examples to see how the FPU stack works. Here is the state of the FPU at the beginning of the program:

%st(0) undefined

%st(1) undefined