A substantial amount of information is added to the audio data before the compact disc is recorded. Figure 9.1 illustrates the encoding process and shows the various information to be recorded.
There are usually two audio channels with 16-bit coding, sampled at 44.1 kHz. So, the bit rate, after combining both channels, is:
44.1 × 16 × 2 = 1.4112 × 106 bit s–1
Most of the errors which occur on a medium such as CD are random. However, from time to time burst errors may occur due to fingerprints, dust or scratches on the disc surface. To cope with both random and burst errors, Sony and Philips developed the cross-interleave Reed–Solomon error-correction code (CIRC). CIRC is a very powerful combination of several error correction techniques.
It is useful to be able to measure an error-correcting system’s ability to correct errors, and as far as the compact disc medium is concerned it is the maximum length of a burst error which is critical. Also, the greater the number of errors received, the greater the probability of some errors being uncorrectable. The number of errors received is defined as the bit error rate (BER). An important system specification, therefore, is the number of data samples per unit time, called the sample interpolation rate, which have to be interpolated (rather than corrected) for given BER values. The lower this rate is, the better the system. Then, if burst errors cannot be corrected, an important specification is the maximum length of a burst error which can be interpolated. Finally, it is important to know the number of undetected errors, resulting in audible clicks. Any specification of an error-correcting system must take all these factors into account. Table 9.1 is a list of all relevant specifications of the CIRC system used in CD.
The CIRC principle is as follows (refer to Figure 9.2):
Aspect | Specification |
Maximum correctable burst error length | 4000 bits (i.e., 2.5 mm on the disc surface) |
Maximum interpolatable burst error length | 12 300 bits (i.e., 7.7 mm on the disc surface) |
Sample interpolation rate | One sample every 10 hours at a BER of 10–4 |
1000 samples every minute at a BER of 10–3 | |
Undetected error samples | Less than one every 750 hours at a BER of 10–3 |
Negligible at a BER of 10–4 | |
Code rate | On average, four bits are recorded for every three data bits |
The C2 encoder outputs 28 8-bit symbols for 24 symbols at its input: it is therefore called a (24, 28) encoder. The C1 encoder outputs 32 symbols for 28 symbols input: it is a (28, 32) encoder.
The bit rate at the output of the CIRC encoder is:
One 8-bit control word is added to every 32-symbol block of data from the encoder. The compact disc standard defines eight additional channels of information or subcodes that can be added to the music information; these subcodes are called P, Q, R, S, T, U, V and W. At the time of writing, only the P and Q subcodes are commonly used:
The other subcodes carry information relating to possible enhancements, such as text and graphics, but will not be discussed here.
Each subcode word is 98 bits long and, as each bit of the control word corresponds to each subcode (i.e., P, Q, R, S, T, U, V, W), a total of 98 complete data blocks or frames must be read from the disc to read each subcode word. This is illustrated in Figure 9.3.
After addition of the control word, the new data rate becomes:
Figure 9.4a illustrates the structure of the 98-bit Q subcode word. The R, S, T, U, V and W subcode words are similar. The first 2 bits are synchronizing bits, S0 and S1. They are necessary to allow the decoder to distinguish the control word in a block from the audio information, and always contain the same data.
The next 4 bits are control bits, indicating the number of channels and pre-emphasis used, as follows:
0000 two audio channels/no pre-emphasis
1000 four audio channels/no pre-emphasis
0001 two audio channels/with pre-emphasis
1001 four audio channels/with pre-emphasis
Four address bits indicate the mode of the subsequent data to follow. For the subcode, three modes are defined.
At the end of the subcode word, a 16-bit CRCC error-correction code, calculated on control, address and data information, is inserted. The CRCC uses the polynomial P(x) = x16 + x12 + x5 + 1.
The three modes of data in Q subcode words are used to carry various information.
Mode 1 (address = 0001)
This is the most important mode, and the only one which is of use during normal playback. At least nine of 20 consecutive subcode words must carry data in mode 1 format. Two different situations are possible, depending whether the subcode is in the lead-in track or not.
When in the lead-in track, the data are in the format illustrated in Figure 9.4b. The 72-bit section comprises nine 8-bit parts:
• |
TNO – containing information relating to track number: two digits in BCD form (i.e., 2 × 4 bits). Is 00 during lead-in. |
• |
POINT/PMIN/PSEC/PFRAME – containing information relating to the table of contents (TOC). They are repeated three times. POINT indicates the successive track numbers, while PMIN, PSEC and PFRAME indicate the starting time of that track. Furthermore, if POINT = A0, PMIN gives the physical track number of the first piece of music (PSEC and PFRAME are zero); if POINT = A1, PMIN indicates the last track on the disc, and if POINT = A2, the starting point of the lead-out track is given in PMIN, PSEC and PFRAME. Table 9.2 shows the encoding of the TOC on a disc which contains six pieces of music. |
• |
ZERO – 8 bits, all zero. |
Frame number | POINT | PMIN, PSEC, PFRAME |
n | 01 | 00, 02, 32 |
n + 1 | 01 | 00, 02, 32 |
n + 2 | 01 | 00, 02, 32 |
n + 3 | 02 | 10, 15, 12 |
n + 4 | 02 | 10, 15, 12 |
n + 5 | 02 | 10, 15, 12 |
n + 6 | 03 | 16, 28, 63 |
n + 7 | 03 | 16, 28, 63 |
n + 8 | 03 | 16, 28, 63 |
n + 9 | 04 | 16, 28, 63 |
n + 10 | 04 | 16, 28, 63 |
n + 11 | 04 | 16, 28, 63 |
n + 12 | 05 | 16, 28, 63 |
n + 13 | 05 | 16, 28, 63 |
n + 14 | 05 | 16, 28, 63 |
n + 15 | 06 | 49, 10, 33 |
n + 16 | 06 | 49, 10, 33 |
n + 17 | 06 | 49, 10, 33 |
n + 18 | A0 | 01, 00, 00 |
n + 19 | A0 | 01, 00, 00 |
n + 20 | A0 | 01, 00, 00 |
n + 21 | A1 | 06, 00, 00 |
n + 22 | A1 | 06, 00, 00 |
n + 23 | A1 | 06, 00, 00 |
n + 24 | A2 | 52, 48, 41 |
n + 25 | A2 | 52, 48, 41 |
n + 26 | A2 | 52, 48, 41 |
n + 27 | 01 | 00, 02, 32 |
n + 28 | 01 | 00, 02, 32 |
. | . | . |
. | . | . |
. | . | . |
In music and lead-out tracks, data are in the format illustrated in Figure 9.4c. The 72-bit section now comprises:
• |
TNO – current track number: two digits in BCD form (01 to 99). |
• |
POINT – index number within a track: two digits in BCD form (01 to 99). If POINT = 00 it indicates a pause in between tracks. |
• |
MIN/SEC/FRAME – indicates running time within a track: each part consists of digits in BCD form. There are 75 frames in a second (00 to 74). Time is counted down during a pause, with a value zero at the end of the pause. During lead-in and lead-out tracks, the time increases. |
• |
AMIN/ASEC/AFRAME – indicates the running time of the disc in the same format as above. At the start of the programme area, it is set to zero. |
• |
ZERO – 8 bits, all zero. |
Figure 9.5 shows a timing diagram of P subcode and Q subcode status during complete reading of a disc containing four selections (of which selections three and four fade out and in consecutively without an actual pause).
If mode 2 data are present, at least one of 100 successive subcode words must contain it. It is of importance only to the manufacturer of the disc, containing the disc catalogue number. The 98-bit, Q subcode word in mode 2 is shown in Figure 9.6. The structure is similar to that of mode 1, with the following differences:
• |
N1 to N13 – catalogue number of the disc expressed in 13 digits of BCD, according to the UPC/EAN standard for bar coding. The catalogue number is constant for any one disc. If no catalogue number is present, N1 to N13 are all zero, or mode 2 subcode words may not even appear. |
• |
ZERO – these 12 bits are zero. |
Mode 3 (address = 0111)
Like mode 2 data, if mode 3 is present, at least one of 100 successive subcode words will contain it.
Mode 3 is used to assign each selection with a unique number, according to the 12-character International Standard Recording Code (ISRC), defined in DIN-31-621.
If no ISRC number is assigned, mode 3 subcode words are not present. During lead-in and lead-out tracks, mode 3 subcode words are not used, and the ISRC number must only change immediately after the track number (TNO) has been changed.
The 98-bit, Q subcode word in mode 3 is shown in Figure 9.7. The structure is similar to that of mode 1, with the following differences:
• |
I1 to I12 – the 12 characters of the selection’s ISRC number. Characters I1 and I2 give the code corresponding to country. Characters I3 to I5 give a code for the owner. Characters I6 and I7 give the year of recording. Characters I8 to I12 give the recording’s serial number. Characters I1 to I5 are coded in a 6-bit format according to Table 9.3, while characters I6 to I12 are 4-bit BCD numbers. |
• |
00 – these 2 bits are zero. |
• |
ZERO – these 4 bits are zero. |
EFM, or eight-to-fourteen modulation, is a technique which converts each 8-bit symbol into a 14-bit symbol, with the purpose of aiding the recording and playback procedure by reducing required bandwidth, reducing the signal’s DC content and adding extra synchronization information. A timing diagram of signals at this stage of CD encoding is given in Figure 9.8.
The procedure is to use 14-bit codewords to represent all possible combinations of the 8-bit code. An 8-bit code represents 256 (i.e., 28) possible combinations, as shown in Table 9.4. A 14-bit code, on the other hand, represents 16 384 (i.e., 214) different combinations, as shown in Table 9.5. Of the 16 384 14-bit codewords, only 256 are selected, having combinations which aid processing of the signal.
For instance, by choosing codewords which give low numbers of individual bit inversions (i.e., 1 to 0, or 0 to 1) between consecutive bits, the bandwidth is reduced. Similarly, by choosing codewords with only limited numbers of consecutive bits with the same logic level, overall DC content is reduced.
A ROM-based look-up table, say, can then be used to assign all 256 combinations of the 8-bit code to the 256 chosen combinations within the 14-bit code. Some examples are listed in Table 9.6.
In addition to EFM modulation, three extra bits, known as merging bits, are added to each 14-bit symbol, with the purpose of further lowering DC content of the signal. Exact values of the merging bits depend on the adjacent symbols.
Finally, the data bits are changed from NRZ into NRZI (non-return to zero inverted) format, by converting each positive-going pulse of the NRZ signal into a single transition. The resultant signal has a minimum length of 3T (i.e., three clock periods) and a maximum of 11T (i.e., 11 clock periods), as shown in Figure 9.9.
The bit rate is now:
To the signal, comprising 33 symbols of 17 bits (i.e., a total of 561 bits), a sync word and its three merging bits are added, giving 588 bits in total (Figure 9.10). Sync words have two main functions: (1) they indicate the start of each frame; (2) sync word frequency is used to control the player’s motor speed.
The 588-bit-long signal block is known as an information frame.
The final bit rate, recorded on the CD, consequently becomes:
The frame frequency fframe is:
And, as subcodes are in blocks of 98 frames, the subcode frequency f sc is:
Playing time is calculated by counting blocks of subcode (i.e., 75 blocks = 1 second). A 60-minute-long CD consequently contains:
60 × 60 × 7350 = 26 460 000 frames
As each frame comprises 33 × 8 = 264 bits of information, a 1-hour-long CD actually contains 6 985 440 000 bits of information, or 873 180 000 bytes! Of this, the subcode area contains some 25.8 kbytes (200 Mbits). This gigantic data storage capacity of the CD medium is also used for more general purposes on the CD-ROM (compact disc read-only memory), which is derived directly from the audio CD.
3.137.178.122