Fixed Point Arithmetic and the Packed
Decimal Format
This
set of lectures discusses the idea of fixed
point arithmetic and
its implementation by Packed Decimal Format on the IBM Mainframes.
The
topics include
1. A review of IBM S/370 floating point formats,
focusing on
the precision with which real
numbers can be stored.
2. The difference between the precision
requirements of business
applications and those of
scientific applications.
3. An overview of the Packed Decimal format, as
implemented
on the IBM S/370 and
predecessors.
We begin
with a review of IBM notation for denoting bit numbers
in a byte or word. From our viewpoint,
it is a bit non–standard.
IBM S/370: Terminology and Notation
The IBM 370 is a byte–addressable machine; each byte
has a unique address.
The standard storage sizes on the IBM 370 are byte,
halfword, and fullword.
Byte 8 binary bits
Halfword 16 binary bits 2 bytes
Fullword 32 binary bits 4 bytes.
In IBM terminology, the leftmost bit is bit zero, so
we have the following.
Byte
0 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
Halfword
0 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
11 |
12 |
13 |
14 |
15 |
Fullword
0 – 7 |
8 – 15 |
16 – 23 |
24 – 31 |
Comment: The IBM 370 seems to be a “big endian”
machine.
S/370: Available Floating Point Formats
There are three available formats for representing
floating point numbers.
Single precision 4
bytes 32 bits: 0 – 31
Double precision 8
bytes 64 bits: 0 – 63
Extended precision 16 bytes 128 bits; 0 – 127.
The standard representation of the fields is as
follows.
Format |
Sign bit |
Exponent bits |
Fraction
bits |
Single |
0 |
1 – 7 |
8 – 31 |
Double |
0 |
1 – 7 |
8 – 63 |
Extended |
0 |
1 – 7 |
8 – 127 |
NOTE: Unlike the IEEE–754 format, greater
precision is not
accompanied by a greater
range of exponents.
The precision of the format depends on the number of
bits used for the fraction.
Single precision 24
bit fraction 1 part in 224 7 digits precision *
Double precision 56
bit fraction 1 part in 256 16 digits precision **
*224 = 16,777,216 ** 256 » (100.30103)56 » 1016.86 » 7·1016.
Precision Example: Slightly Exaggerated
Consider a banking problem. Banks lend each other money overnight.
At 3% annual interest, the overnight interest on
$1,000,000 is $40.492.
Suppose my bank lends your bank $10,000,000 (ten
million).
You owe me $404.92 in interest; $10,000,404.92 in
total.
With seven significant digits, the amount might be
calculated as $10,000,400.
My bank loses $4.92.
I want my books to balance to the penny. I do not like floating point arithmetic.
TRUE STORY
When DEC (the Digital Equipment Corporation) was marketing
their PDP–11
to a large New York bank, it supported integer and floating point arithmetic.
At this time, the PDP–11 did not support decimal
arithmetic.
The bank told DEC something like this:
“Add decimal arithmetic and we shall
buy a few thousand. Without
it – no sale.”
What
do you think that DEC did?
Precision Example: Weather Modeling
Suppose a weather model that relies on three data to
describe each point.
1. The temperature in Kelvins. Possible values are 200 K to 350 K.
2. The pressure in millibars. Typical values are around 1000.
3. The percent humidity. Possible ranges are 0.00 through 1.00 (100%).
Consider the errors associated with single precision
floating point arithmetic.
The values are precise to 1 part in 107.
The
maximum temperature errors resulting at any step in the calculation would be:
3.5·10–5 Kelvins
1.0·10–4 millibars
1.0·10–5 percent in humidity.
As cumulative errors tend to cancel each other out, it
is hard to imagine
the cumulated error in the computation of any of these values
as becoming significant to the result.
Example: A
weather prediction accurate to ±1 Kelvin is considered excellent.
Encoding Decimal Digits
There are ten decimal digits. As 23 < 10 £ 24, we require four binary
bits in order to represent each of the digits.
Here are the standard encodings. Note the resemblance to hexadecimal,
except that the non–decimal hexadecimal digits are not shown.
0 0000 5 0101
1 0001 6 0110
2 0010 7 0111
3 0011 8 1000
4 0100 9 1001
Note that the binary codes 1010, 1011, 1100, 1101, 1110, 1111
for hexadecimal digits A B C D E F
are not used to represent decimal digits.
Packed decimal representation will have other uses for
these binary codes.
Zoned Decimal Data
Remember that all textual data in computing are input
and output
as character codes. IBM S/370 uses 8–bit
EBCDIC to represent characters.
On input, these character codes are immediately
converted to Zoned Decimal
format, which uses 8–bit (one byte) codes to represent each digit. With a
slight exception, it is identical to EBCDIC.
Here are the EBCDIC representations of each digit,
shown as 2 hex digits.
Digit Code Digit Code
Hex Binary Hex Binary
0 F0 1111 0000 5 F5 1111 0101
1 F1 1111 0001 6 F6 1111 0110
2 F2 1111 0010 7 F7 1111 0111
3 F3 1111 0011 8 F8 1111 1000
4 F4 1111 0100 9 F9 1111 1001
Packed Decimal Data
Packed decimal representation makes use of the fact
that only four
binary bits are required to represent any decimal digit.
Numbers can be packed,
two digits to a byte.
How do we represent signed numbers in this
representation?
The answer is that we must include a sign “half byte”.
The IBM format for packed decimal allows for an
arbitrary number
of digits in each number stored. The
range is from 1 to 31, inclusive.
After adding the sign “half byte”, the number of
hexadecimal digits
used to represent any number ranges from 2 through 32.
Numbers with an even digit count are converted to
proper format by
the addition of a leading zero; 1234 becomes 01234.
The system must allocate an integral number of bytes
to store each datum,
so each number stored in Packed Decimal format must have
an odd number of digits.
Packed Decimal: The Sign “Half Byte”
In the S/370 Packed Decimal format, the sign is stored
“to the right” of
the string of digits as the least significant hexadecimal digit.
The standard calls for use of all six non–decimal
hexadecimal digits
as sign digits. The standard is as
follows:
Binary Hex Sign Comments
1010 A +
1011 B –
1100 C + The standard plus sign
1101 D – The standard minus sign
1110 E +
1111 F + A common plus sign, resulting from a
shortcut
in translation from Zoned format.
Zoned Decimal Data
The zoned
decimal format is a modification of the EBCDIC format.
The zoned decimal format seems to be a modification to
facilitate
processing decimal strings of variable length.
The length of zoned data may be from 1 to 16 digits,
stored in 1 to 16 bytes.
We have the address of the first byte for the decimal
data,
but need some “tag” to denote the last (rightmost) byte.
The assembler places a “sign zone” for the rightmost
byte of the zoned data.
The common standard is X’C’ for non–negative numbers, and
X’D’
for negative numbers.
The format is used for constants possibly containing a
decimal point, but
it does not store the decimal point.
As an example, we consider the string “–123.45”.
Note that the format requires one byte per digit
stored.
Creating the Zoned Representation
Here is how the assembler generates the zoned decimal
format.
Consider the string “–123.45”.
The EBCDIC character representation is as follows.
Character |
– |
1 |
2 |
3 |
. |
4 |
5 |
Code |
6D |
F1 |
F2 |
F3 |
4B |
F4 |
F5 |
The
decimal point (code 4B) is not stored.
A bit
later we shall see the reason for this.
The sign
character is implicitly stored in the rightmost digit.
The zoned data representation is as follows.
1 |
2 |
3 |
4 |
5 |
F1 |
F2 |
F3 |
F4 |
D5 |
The
string “F1 F2 F3 F4 C5” would indicate a positive number,
with digits “12345”.
Packed Decimal Data
The packed decimal format is the one preferred by
business for financial use.
Zoned decimal format is not used for arithmetic, but just for conversions.
As is suggested by the name, the packed format is more
compact.
Zoned
format one digit per byte
Packed
format two digits per byte (mostly)
In the packed format, the rightmost byte stores the
sign in its rightmost part,
so the rightmost byte of packed format data contains only one digit.
All other bytes in the packed format contain two
digits, each with value in 0 – 9.
This implies that each packed constants always has an odd number of digits.
A leading 0 may be inserted, as needed.
The standard sign fields are: negative X’D’
non–negative X’C’
The length may be from 1 to 16 bytes, or 1 to 31
decimal digits.
Examples +7 | 7C |
–
13 | 01 | 3D |
Example: Addition of Two Packed Decimal
Values
Consider two integer
values, stored in packed decimal format.
Note that 32–bit two’s complement representation would
limit the integer
to a bit more than nine digits: –2, 147, 483, 648 through 2, 147, 483, 647.
Integers stored as packed decimals can have 31 digits
and vary between
–9, 999, 999, 999, 999, 999, 999,
999, 999, 999, 999 and
+9, 999, 999, 999, 999, 999, 999,
999, 999, 999, 999 (9.99·1030)
Consider the addition of the numbers –97 and 12,541.
–97
would be expanded to –097 and stored as 097D.
12,541
would be stored as 12541C.
The CPU does what we would do. It first pads the smaller number with 0’s.
00097D
12541C
It then performs the operation, denoted as “12541 –
97”, and stores the
result as 12444, or 12444C in packed decimal format.
Fixed Point: Where is the Decimal Point?
We shall now consider a sequence of numbers, in which
each member is
divided by ten to get the next one. This
will be a short finite sequence.
The number 12345 is
represented in packed decimal as 12345C.
The number 1234.5 is
represented in packed decimal as 12345C.
The number 123.45 is
represented in packed decimal as 12345C.
The number 12.345 is
represented in packed decimal as 12345C.
The number 1.2345 is
represented in packed decimal as 12345C.
The number .12345 is
represented in packed decimal as 12345C.
Note that the format does not store the decimal point
or any indication
of the location of the decimal point.
That is up to the code.
Put another way, what is the sum of 012C and 4C?
Is this really 12 + 4, with sum 16, represented as 016C?
Is this really 12.0 + 0.4, with sum 12.4, represented
as 124C?
Fixed Point: Where is the Decimal Point?
(Page 2)
Consider the following addition problem: 1.23 +
10.11405.
The answer is obviously 11.34405.
But the Packed Decimal format does not store the
decimal point, so
1.23 is stored
as 123C and
10.11405 is stored as 1011405C.
Using our previous logic, we line up the numbers 0000123C
1011405C
The result is denoted 10111528C, which might be 10.111528.
It cannot represent the correct answer.
This is a “feature” of fixed point arithmetic, in which all numbers are stored
with the decimal point in the same place.
In this system, the code
must guarantee that 1.23 is stored as 0123000C.
The result is now 0123000C
1011405C, stored as 1134405C.
The code must interpret and output this as the value
11.34405.