Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

4.3.2 BICM Pseudo-Capacity

The results from the previous section point out the fact that binary labeling strongly affects the BICM-GMI. Moreover, we have seen that the BICM-GMI curves for a given binary labeling intersect each other for different constellation sizes, which naturally leads to a quite general question: What is the best constellation, binary labeling, and input distribution for the BICM at a given SNR? Once this question is answered, approaching the fundamental limit will depend only on a good design of the binary encoder/decoder.

To formalize the previous question, and in analogy with (4.25), we define the BICM pseudo-capacity for a given constellation size $c04-math-0341$ as

4.64

where the optimization is over the constellation and its binary labeling, as well as over the vector of bitwise probabilities $c04-math-0343$ that induce a symbol-wise PMF (via (2.72)) satisfying the energy constraint $c04-math-0344$ . We use the name BICM pseudo-capacity because, in theory, the quantity in (4.64) might not be the largest achievable rate, which simply comes from the fact that the BICM-GMI is not the largest achievable rate for given $c04-math-0345$ . Nevertheless, the numerical results we present in this section will, in fact, show that the BICM pseudo-capacity can close the gap to the AWGN capacity, and thus, it is indeed a meaningful quantity to study.

Similarly to the CM capacity, the BICM pseudo-capacity in (4.64) represents an upper bound on the number of bits per symbol that can be reliably communicated using BICM, i.e., when for each SNR, the constellation, its labeling, and the bits' distribution are optimized.

In general, solutions of the problem in (4.64) are difficult to find, even when both the constellation and the labeling are fixed. This is because, unlike the CM-MI, the BICM-GMI is not a concave function of the input distribution. To clarify this, consider the following example.

c04f013 — **Figure 4.13** BICM-GMI $c04-math-0355$ for 8PAM labeled by the BRGC as a function of the bit probabilities for $c04-math-0356$ and $c04-math-0357$ ( $c04-math-0358$ ). The filled black circle shows the optimal value

**Figure 4.13** BICM-GMI $c04-math-0355$ for 8PAM labeled by the BRGC as a function of the bit probabilities for $c04-math-0356$ and $c04-math-0357$ ( $c04-math-0358$ ). The filled black circle shows the optimal value

Although not explicitly mentioned, the results in Fig. 4.13 are obtained by scaling the constellation for each $c04-math-0359$ so that $c04-math-0360$ . To formally define the problem in Example 4.17, we define the BICM constellation and labeling-constrained pseudo-capacity as

4.65

where the optimization is over all the $c04-math-0362$ and $c04-math-0363$ is chosen so that $c04-math-0364$ is satisfied.

For constellations with a small number of points with a fixed binary labeling, we may use an exhaustive search together with an efficient numerical implementation of the BICM-GMI (which we show in Section 4.5) to solve the problem in (4.65). While this approach has obvious complexity limitations, i.e., the complexity grows exponentially with the number of bits per symbol $c04-math-0365$ , it allows us to obtain an insight into the importance of the bit-level probabilistic shaping in BICM.

Example 4.18 (BICM Pseudo-capacity for $c04-math-0366$ and Different Labelings)

In Fig. 4.14, we show the BICM constellation and labeling-constrained pseudo-capacity in (4.65) for an 8PAM constellation labeled by the BRGC, NBC, and FBC. We performed a grid search (with steps of 0.01) over the three variables defining the input distribution: $c04-math-0367$ , $c04-math-0368$ , and $c04-math-0369$ . For each SNR value, we selected the input distribution that maximizes the BICM-GMI . The results in this figure show how, by properly selecting the input distribution, the BICM-GMI increases and gets very close to the AWGN capacity for $c04-math-0370$ . Interestingly, the results in Fig. 4.14 show that if the input distribution is optimized, the NBC is not the optimal binary labeling for low SNR anymore. The figure also shows that the BRGC with an optimized input distribution achieves the SL.

c04f014 — **Figure 4.14** Functions $c04-math-0371$ and $c04-math-0372$ in (4.16) for 8PAM over the AWGN channel and the BRGC, NBC, and FBC. The function $c04-math-0373$ in (4.17) and the SL in (4.18) are also shown

On the basis of the results in Examples 4.15–4.18, some concluding remarks about the BICM pseudo-capacity can be made. First of all, these results show the suboptimality of the BRGC in terms of maximizing the BICM-GMI for a given equally spaced constellation and uniform input distributions. For 8PAM and for low and medium SNR values, the NBC and the FBC give higher BICM capacities than the BRGC. Moreover, for asymptotically low rates, the optimum binary labeling for $c04-math-0374$ PAM constellations is the NBC. The results in these examples also show that a BICM system can be optimum for asymptotically low rates (i.e., it can achieve the SL). This can be obtained by using probabilistic shaping or by properly selecting the binary labeling.

4.3.3 Useful Relationships

In this section, we provide some useful relationships for the analysis of information-theoretic aspects of BICM transmission.

To show the usefulness of (4.66), consider (for simplicity) the AWGN channel. In this case, the conditional MI $c04-math-0388$ in (4.66) is given by (4.4) and can be expressed as

4.70

4.71

4.72

where

4.73

and where (4.72) follows from (2.75) and from the fact that $c04-math-0393$ , $c04-math-0394$ .

In (4.73), we use $c04-math-0395$ to denote a random variable with support $c04-math-0396$ and PMF $c04-math-0397$ given by (2.75), i.e., when the symbols are drawn from the subconstellation $c04-math-0398$ created by the binary labeling (see, e.g., Fig. 2.16). As the r.h.s. of (4.73) is an MI between $c04-math-0399$ and $c04-math-0400$ , we use the notation $c04-math-0401$ . Using (4.72) in (4.66) we obtain

4.74

The expression in (4.74) shows that the BICM-GMI can be expressed as a difference of CM-MIs. This interpretation is useful because it shows that a good characterization of MIs for arbitrary (sub)constellations leads to a good characterization of the BICM-GMI. This is particularly interesting as plenty of results already exist in the literature regarding CM-MIs, which, because of (4.74), can be “reused” to study the BICM-GMI. This approach has been taken, e.g., to study the low- and high-SNR behavior of BICM, where the asymptotic behavior of CM-MIs for arbitrary constellations is used to study the asymptotic behavior of the BICM-GMI.

We note that (4.76) generalizes the L-values defined in Section 3.3 allowing us to deal with arbitrary mismatched metrics $c04-math-0416$ . Theorem 5.1 demonstrates also that the distribution of the generalized L-values (4.76) suffices to calculate the BICM-GMI.

An immediate consequence of Theorem 5.1 is that when $c04-math-0417$ is given by (4.33), $c04-math-0418$ , $c04-math-0419$ , and $c04-math-0420$ . Thus,

4.81

In the case of uniformly distributed bits $c04-math-0422$ and for any generalized logarithmic-likelihood ratio (LLR), (4.75) becomes

4.82

Furthermore, if we assume the symmetry condition (3.74) is satisfied⁵ (i.e., $c04-math-0424$ , we obtain from (4.82)

4.83

4.84

Equivalently, we can express (4.83) as

4.85

where

4.86

4.87

and (4.87), inspired by (3.94), is a numerically stable⁶ version of (4.86).

Knowing the observations

4.88

and the transmitted bits $c04-math-0433$ , (4.85) can be transformed into a simple unbiased estimator of $c04-math-0434$ as

4.89

In the case of matched bit metrics $c04-math-0436$ with symmetric L-values and uniformly distributed bits, we obtain the estimate of the bit-level GMI via Monte Carlo integration

4.90

In a simple case of transmission over the scalar channel, i.e., when the model relating the channel outcome $c04-math-0438$ and the input $c04-math-0439$ is well known, we may calculate the GMI via the formulas provided in Section 4.5 with much better accuracy than the one obtained from (4.89) and (4.90). On the other hand, as it happens very often, the L-values are calculated using nonlinear expressions, and are affected by various random variables. Then, the direct integrals of Section 4.5 are too involved to derive and/or tedious to use. Moreover, because the PDFs $c04-math-0440$ or $c04-math-0441$ are also difficult to estimate (analytically or via histograms), the estimates (4.89) and (4.90) become very convenient.

4.4 BICM, CM, and MLC: A Comparison

4.4.1 Achievable Rates

Having analyzed CM and BICM, it is interesting to compare and relate them to multilevel coding (MLC), another popular coded modulation scheme we presented in Section 2.3.2. To this end, we start rewriting the CM-MI using the random variables $c04-math-0458$ , i.e.,

4.97

4.98

To obtain (4.97), we used the fact that the mapping $c04-math-0461$ is bijective, and to obtain (4.98), we used the chain rule of mutual information.

The quantity $c04-math-0462$ in (4.98) represents an MI between the bit $c04-math-0463$ and the output, conditioned on the knowledge of the bits $c04-math-0464$ . This bit-level MI represents the maximum rate that can be used at the $c04-math-0465$ th bit position, given a perfect knowledge of the previous $c04-math-0466$ bits.

MLC with multistage decoding (MSD) (see Fig. 2.5) is in fact a direct application of (4.98), i.e., $c04-math-0467$ parallel encoders send the encoded oddata using bits' positions $c04-math-0468$ , each of the encoders uses a coding rate $c04-math-0469$ . At the receiver side, the first bit level is decoded and the decisions are passed to the second decoder, which then passes the decisions to the third decoder, and so on, as shown in Fig. 2.6 (a).

Although different mapping rules produce different conditional MIs $c04-math-0470$ , the sum (4.98) remains constant. In other words, the sum of achievable rates using MLC encoding/decoding does not depend on the labeling and is equal to CM-MI. On the other hand, when MLC is based on parallel decoding of the individual levels (PDL), i.e., when no information is passed between the $c04-math-0471$ decoders (see Fig. 2.6b), BICM and MLC are equivalent in terms of achievable rates, although the main difference still remains: instead of $c04-math-0472$ encoder–decoder pairs, BICM uses only one pair.

Using (4.98) and (4.54), we can express the MI loss caused by BICM as

4.99

which clearly shows that the BICM-GMI will be approximately equal to the CM if the differences $c04-math-0474$ are small for all $c04-math-0475$ .

Example 4.23 (CM, MLC, and BICM for $c04-math-0476$ )

Consider an 8PAM constellation labeled by the BRGC and BSGC shown in Fig. 2.13 over the AWGN channel. In Fig. 4.15, we show the conditional MIs $c04-math-0477$ for the BRGC (Fig. 4.15 (a)) and the BSGC (Fig. 4.15 (b)). The AWGN capacity $c04-math-0478$ in (4.9), the CM-MI, and BICM-GMI are also shown. While the obtained conditional MIs are quite different, from (4.98), we know that their sum yields nothing but the CM-MI. In this figure, we also show the unconditional MIs $c04-math-0479$ whose sum yields the BICM-GMI $c04-math-0480$ . In the case of the BRGC, the differences between $c04-math-0481$ and $c04-math-0482$ (i.e., the terms in the r.h.s. of (4.99)) are small, which brings the BICM-GMI very close to the CM-MI. On the other hand, using the labeling BSGC, the difference becomes significant for the bit position $c04-math-0483$ , and this causes a significant loss in terms of achievable rates when using BICM rather than MLC.

c04f015 — **Figure 4.15** MIs defining the achievable rates for 8PAM labeled by (a) the BRGC and (b) the BSGC over the AWGN channel (see Fig. 2.13). The AWGN capacity $c04-math-0484$ , the CM-MI, and BICM-GMI are also shown. The pairs $c04-math-0485$ and $c04-math-0486$ (see (4.109) and (4.113)) for $c04-math-0487$ are also shown (circles and triangles, respectively)

We have already demonstrated in Theorem 4.10 that the GMI is upper bounded by the MI, which indicates that the BICM-GMI is upper bounded by the CM-MI. This property can be also demonstrated analyzing the terms in the r.h.s. of (4.99). We note that, although often evoked in the literature, the data processing theorem cannot be used for the proof because the bits $c04-math-0488$ , the bit $c04-math-0489$ , and the observation $c04-math-0490$ do not form a Markov chain.

We are now in a position to establish the following inequalities

4.104

which hold for any $c04-math-0501$ , $c04-math-0502$ , $c04-math-0503$ , $c04-math-0504$ , and $c04-math-0505$ , and where $c04-math-0506$ and $c04-math-0507$ are related via (2.72). The second inequality in (4.104) is obvious because of constraining the input distribution to be discrete and unoptimized. The first inequality is due to Theorem 4.24 and is tight only if the bits $c04-math-0508$ conditioned on $c04-math-0509$ are independent, i.e.,

4.105

The expression in (4.105) holds, e.g., when using a 4QAM constellation with Gray labeling. In this case the bits $c04-math-0511$ and $c04-math-0512$ modulate the orthogonal dimensions of the constellation: $c04-math-0513$ and $c04-math-0514$ , respectively.

Similarly to (4.104), for fading channels, we obtain

4.106

Furthermore, because $c04-math-0516$ and $c04-math-0517$ are concave functions of the SNR, it follows from Jensen's inequality that

4.107

4.108

On the other hand, the BICM-GMI is not always a concave function of the SNR,⁷ and thus, we cannot guarantee an inequality similar to (4.107).

4.4.2 Robustness of the Encoding Scheme

While the rates achievable using MLC are never smaller than those attainable using BICM, it is important to note that the design of MLC, i.e., the choice of the coding rates $c04-math-0520$ for each of the coding levels, requires the knowledge of all the conditional MIs $c04-math-0521$ . This brings up an issue related to the robustness of the design with respect to lack of complete knowledge of the channel model.

To clarify this issue, we now assume that the MLC encoding/decoding scheme is designed for the AWGN channel for a coding rate $c04-math-0522$ . We also assume that the design is done so as to operate at the SNR decoding threshold $c04-math-0523$ , i.e., we design the encoders to ensure successful decoding at the SNR limit defined by the CM-MI, which we denote as

4.109

where $c04-math-0525$ is the inverse of the MI function $c04-math-0526$ defined in Section 4.1.

The decoding threshold of each of the MLC levels is given by

4.110

where $c04-math-0528$ is the inverse of the conditional MI $c04-math-0529$ .

We can thus define the SNR decoding threshold as the value of SNR, for which the message at all MLC levels can be decoded

4.111

In order to operate at the CM-MI limit with $c04-math-0531$ we need to satisfy the following:

4.112

that is, the SNR decoding thresholds for each of the decoders must be the same.

If a BICM scheme is designed for the same coding rate $c04-math-0533$ , the corresponding SNR decoding threshold is given by

4.113

where $c04-math-0535$ is the inverse function of the BICM-GMI.

To illustrate the previous definitions, we show in Fig. 4.15 the pairs $c04-math-0536$ and $c04-math-0537$ for a rate $c04-math-0538$ . The difference $c04-math-0539$ can be understood as the SNR penalty caused by using a BICM scheme for a given coding rate $c04-math-0540$ . For the BRGC, the penalty for $c04-math-0541$ is close to zero, however, for the BSGC it increases up to about $c04-math-0542$ .

We are now ready to see what happens if MLC and BICM are used when transmitting over a non-AWGN channel. In other words, we are interested in studying how the SNR decoding thresholds change if the design made for the AWGN channel is used in a different channel. This can be the case, e.g., when transmitting over a fading channel which was not foreseen during the design.

If the encoders with the rates $c04-math-0543$ with $c04-math-0544$ (designed so as to satisfy (4.112) for the AWGN channel) are used in a fading channel, the corresponding SNR decoding thresholds are given by

4.114

where $c04-math-0546$ is the inverse of the conditional MI $c04-math-0547$ .

Again, successful decoding at all decoding levels of MLC, is possible if $c04-math-0548$ is greater than all the individual SNR decoding thresholds in (4.114), i.e.,

4.115

Since the form of $c04-math-0550$ varies with $c04-math-0551$ , the values of the corresponding inverse functions $c04-math-0552$ will not be the same. Consequently, we cannot guarantee that the condition for the average SNR, similar to (4.112), will be satisfied, and, in general, we will have $c04-math-0553$ .

Therefore, the following relationship holds

4.116

and the difference $c04-math-0555$ should be interpreted as the SNR penalty because of a “code design mismatch”.

We emphasize that the mismatch happens because the coding rates $c04-math-0556$ were selected using the functions $c04-math-0557$ . If, on the other hand, we rather used $c04-math-0558$ , then the code designed would be matched to the fading channel. In such a case, the effect of the code design mismatch would appear if MLC were used over the AWGN channel. More generally, the code design mismatch phenomenon will be observed when the model of the channel used for the design is mismatched with respect to the actual channel.

Of course, in BICM transmission, the SNR decoding threshold is similar to (4.113), i.e.,

4.117

In many cases of practical interest, $c04-math-0560$ but in general, this relationship depends on the mapping as we discussed after (4.108). Of course, the relationship $c04-math-0561$ always holds, and we interpret the difference $c04-math-0562$ as the SNR gap because of the suboptimality of BICM. Unlike in the case of MLC, there is no effect of the code design mismatch: BICM remains “matched” to the channel independently of the underlying channel model.

Example 4.25 (Code Mismatch for $c04-math-0563$ )

Let us investigate the effect of the code design mismatch for an 8PAM constellation labeled by the BRGC. Figure 4.16 shows the rates achievable by MLC and BICM together with the SNR decoding thresholds for MLC. The pairs $c04-math-0564$ as well as $c04-math-0565$ , where $c04-math-0566$ are shown by circles. The rates $c04-math-0567$ are chosen for operation over the AWGN channel, so the vertical alignment of the circles means that $c04-math-0568$ . Thus, the code is designed to operate at the CM–MI curve. The pair $c04-math-0569$ is shown by a triangle and we see that the SNR penalty $c04-math-0570$ is relatively small.

If we now consider the case where the same MLC encoders are used in a Rayleigh fading channel, we see the mismatch: the pairs $c04-math-0571$ shown as squares are not aligned vertically. This results in the SNR decoding thresholds of MLC (the pair $c04-math-0572$ is shown by the top square) being larger than the SNR decoding threshold for BICM (the pair $c04-math-0573$ is shown by a diamond).

By varying the value of $c04-math-0574$ , we can find pairs $c04-math-0575$ , which we show in Fig. 4.16 using a dashed line. For clarity of presentation, we only consider $c04-math-0576$ . These results illustrate that while MLC has the potential to operate at the limits of CM-MI, and thus, also to outperform BICM, the latter, having a binary encoder independent of the channel model, is more robust to variations in the conditions of operation.

c04f016 — **Figure 4.16** Achievable rates for 8PAM using the BRGC. In MLC, the rates $c04-math-0577$ are designed for the AWGN channel and then used in a Rayleigh fading channel. For $c04-math-0578$ , the circles correspond to the pairs $c04-math-0579$ and $c04-math-0580$ , the triangle shows $c04-math-0581$ , the diamond shows $c04-math-0582$ and the squares indicate $c04-math-0583$ and $c04-math-0584$ . The dashed line corresponds to the set of pairs $c04-math-0585$ for $c04-math-0586$

4.5 Numerical Calculation of Mutual Information

In Sections 4.2 and 4.3, we have shown various examples of the CM-MI and the BICM-GMI calculated for the AWGN channel. While we cannot avoid the problem of enumerating the constellation points, we may efficiently calculate the multidimensional integrals (over $c04-math-0587$ ) appearing in the MI expressions using Gauss–Hermite (GH) quadratures. The methods we discussed in Section 4.3.3 may also be used but the numerical quadratures are much more accurate for small values of $c04-math-0588$ and, what is more important, they are much more reliable for being used in a numerical optimization.

For any function $c04-math-0589$ with bounded $c04-math-0590$ th derivative, the $c04-math-0591$ -points GH quadrature

4.118

is asymptotically ( $c04-math-0593$ ) exact if we use the quadrature nodes $c04-math-0594$ as the $c04-math-0595$ th root of the Hermite polynomial

4.119

and the weights defined as

4.120

The values of $c04-math-0598$ and $c04-math-0599$ can be easily found for different values of $c04-math-0600$ , which determines the trade-off between the computation speed and the accuracy of the quadrature. The expression in (4.118) can be straightforwardly expanded to a quadrature over an $c04-math-0601$ -dimensional space as

4.121

where $c04-math-0603$ and $c04-math-0604$ . It is important to note that the expansion we use in (4.121) is not unique; however, it is a simple way of generalizing (4.118) to $c04-math-0605$ dimensions.

The CM-MI in (4.23) and in (4.24) can be expressed, respectively, as

4.122

4.123

where $c04-math-0608$ and $c04-math-0609$ are shown in Table 4.1. To obtain (4.122), we first used the substitution $c04-math-0610$ , then we used $c04-math-0611$ , and then split the logarithm of the quotient as a difference of logarithms. To obtain (4.123), we used $c04-math-0612$ in $c04-math-0613$ and split the logarithm of the multiplication as the sum of the logarithms.

c04-math-0620 — **Table 4.1** Six functions used in (4.122)–(4.129) to evaluate the CM-MI and the BICM-GMI

The BICM-GMI in (4.62) and (4.63) can be expressed, respectively, as

4.124

4.125

where $c04-math-0616$ and $c04-math-0617$ are shown in Table 4.1. To obtain (4.124), we first used (4.66), then the substitution $c04-math-0618$ , and then again split the logarithm of the quotient (in the conditional expectation (4.4)) as a difference of logarithms. The expression (4.125) can be obtained by repeating the procedure and using a uniform input distribution.

Using (4.121)–(4.123), we obtain the following ready-to-use expressions for the CM-MI

4.126

4.127

where (4.126) and (4.127) shall be used to evaluate the CM-MI with arbitrary and uniform input distributions, respectively.

Similarly, by using (4.121)–(4.125), we obtain the following ready-to-use expressions for the BICM-GMI

4.128

4.129

which shall be used to numerically evaluate the BICM-GMI with arbitrary and uniform input distributions, respectively.

Example 4.26 (Accuracy of MI Calculations)

Consider the case of 2PAM and 8PAM transmission over the AWGN channel. We can compare the accuracy of the MI calculation via (4.126) as a function of $c04-math-0635$ . Assuming the calculation for $c04-math-0636$ as exact, we show in Fig. 4.17 the absolute error of $c04-math-0637$ for different values of $c04-math-0638$ .

It is interesting to compare these results with those obtained via the unbiased estimator (4.90) based on Monte Carlo integration. The latter is a random variable with mean equal to $c04-math-0639$ and variance

4.130

where

4.131

where $c04-math-0642$ is given by (4.86).

For large $c04-math-0643$ , we can model $c04-math-0644$ as a Gaussian random variable and then, the estimation error is approximated as

4.132

which means that to maintain the same (small) probability of error, we have to maintain $c04-math-0646$ : to increase the accuracy by one order of magnitude ( $c04-math-0647$ -fold decrease of $c04-math-0648$ ), we have to increase the number of samples $c04-math-0649$ by two orders of magnitude. Thus, in the simple cases we analyzed here, the numerical quadratures offer a clear implementation-complexity advantage over the estimator (4.90). However, if the number of dimensions $c04-math-0650$ and/or the number of independent variables affecting the observation $c04-math-0651$ are increased, the numerical quadratures quickly become computationally demanding and tedious to implement. Then, the estimators (4.89) and (4.90) become simpler and viable alternative to the numerical quadratures.

c04f017 — **Figure 4.17** Average value of absolute error when evaluating $c04-math-0633$ using different number of quadrature points $c04-math-0634$ for (a) 2PAM and (b) 8PAM. The left axis indicates the values of the error (shown with solid lines) while the right axis indicates the values of the MI (thick dashed lines)

**Figure 4.17** Average value of absolute error when evaluating $c04-math-0633$ using different number of quadrature points $c04-math-0634$ for (a) 2PAM and (b) 8PAM. The left axis indicates the values of the error (shown with solid lines) while the right axis indicates the values of the MI (thick dashed lines)

All the results presented in this chapter were obtained with $c04-math-0652$ , which we found to provide an adequate accuracy in all numerical examples.

4.6 Bibliographical Notes

All the concepts in this chapter can be traced back to Shannon's works [1, 2]. For a detailed treatment of MIs, channel capacity, and so on, we refer the reader to standard textbook such as [3, 4].

The CM-MI has received different names in the literature. It is called joint capacity in [5], (constellation) constrained capacity in [6, 7], and coded modulation capacity in [8–12]. We use the name CM-MI to indicate that no optimization over the input distribution is performed.

The information-theoretical model for BICM was first introduced by Caire et al. using $c04-math-0653$ parallel memoryless BICO channels [8, Fig. 3]. Martinez et al. later refined this model in [10], where it was shown that the model of parallel channels leads to correct conclusions about achievable rates, but may yield incorrect error exponents/cutoff rates. Error exponents for BICM were also studied in [13].

The BICM-GMI has received different names in the literature. It is called parallel decoding capacity in [5], receiver constrained capacity in [6], and BICM capacity in [8–12]. Again, we refrain from using the word “capacity” to emphasize the lack of optimization over the input distribution. We adopted the name BICM-GMI, which fits well the finding of Martinez et al. [10] who recognized the BICM decoder as a mismatched decoder and showed that the BICM-GMI in (4.54) corresponds to an achievable rate of such a decoder. Theorem 4.11 is thus adopted from [10] and was also used in [14]. The results on the optimality of matched processing in Theorem 4.10 and Corollary 4.12 were inspired by [15].

As we indicated, the BICM-GMI is an achievable rate but it has not be proven to be the largest achievable rate. For example, a different achievable rate—the so-called LM rate—has been recently studied in [16, Part I]. Finding the largest achievable rate remains as an open research problem. Despite this cautionary statement, the BICM-GMI predicts well the performance of capacity-approaching codes, as shown in Example 4.13 (see also [17, Section 3], [18, Section IV]). The generator polynomials of the TENC used in Example 4.13 were chosen on the basis of the results in [19] and some of the puncturing patterns used were taken from [19, 20].

Changing the distribution of the constellation symbols—known as signal/constellation shaping—has been studied for a number years, see [21–24]. In the context of BICM, geometrical shaping (i.e., design of the constellation under the assumption of uniform distribution of the symbols) was studied in [5, 25, 26], and probabilistic shaping (i.e., varying the probabilities of the symbols and/or bits when using the fixed constellation) was proposed in [27, 28] and developed further in [16, 29–33]. Recently, a numerical algorithm to obtain optimal input distributions in BICM was proposed in [34]. As the BICM-GMI is not the largest achievable rate, probabilistic or geometric shaping in BICM yields what we call in Section 4.3.2 a “pseudo-capacity.”

The expression in (4.66) was first introduced in [9, Proposition 1], [7, eq. (65)] and later generalized in [35, Theorem 2] to multidimensional constellations and arbitrary input distributions. An alternative proof for Theorem 4.22 was presented in [16, Appendix B.1.1]. Achievable rates for 4D constellations for BICM in the context of coherent optical communications have been recently studied in [17].

The solution of (4.64) is in general unknown but results are available in the low-SNR regime. The low-SNR regime of the CM-MI has been studied in [36, 37], where conditions for constellations to achieve the SL are given. The fact that the SL is not always achieved for BICM was first noticed in [9] and later shown to be caused by the selection of the binary labeling [11]. The behavior of BICM in the low-SNR regime was studied in [9, 11, 35, 38, 39] as a function of the constellation and the binary labeling, assuming a uniform input distribution. Results for arbitrary input distributions under the indepence assumption of the bits have been recently presented in [40]. It is shown in [40] that probabilistic shaping offers no extra degrees of freedom in addition to what is provided by geometrical shaping for BICM in the low-SNR regime.

In terms of binary labelings, the NBC was shown to be optimal for $c04-math-0654$ PAM in the low-SNR regime [11, 35] and the FBC was analyzed in [41] for uncoded transmission. The 49 labelings that give different BICM-GMI in Fig. 4.12 correspond to the 49 classes in [42, Tables A.1 and A.2] and the 7 different classes of labelings that appear in Fig. 4.12 for high SNR correspond to those in [42, Table A.1]. The optimality of a Gray code was conjectured in [8, Section III-C] in a rather general setup, i.e., without specifying the Gray code, the constellation, or the SNR regime. This conjecture was later disproved in [43] where it is shown that for low and medium SNR, there exist other labelings that give a higher BICM-GMI (see also [12, Chapter 3]). The numerical results presented in [12, Chapter 3], [44] show that in the high-SNR regime, Gray codes are indeed optimal. A formal proof for the optimality of Gray codes in the high-SNR regime for arbitrary 1D constellations was recently given in [45]. A simple approximation for the BICM-GMI that can be used to optimize the binary labeling of the constellation was recently introduced in [18].

GH quadratures can be found in [46, Section 7.3.4] and tables with the coefficient $c04-math-0655$ and $c04-math-0656$ for different values of $c04-math-0657$ in [46, Appendix 7.3(b)]. Ready-to-use expressions for computing the CM-MI and BICM-GMI are given in [44].

References

[1] Shannon, C. E. (1948) A mathematical theory of communications. Bell Syst. Tech. J., 27, 379–423 and 623–656.
[2] Shannon, C. E. (1949) Communication in the presence of noise. Proc. IRE, 37 (1), 10–21.
[3] Cover, T. and Thomas, J. (2006) Elements of Information Theory, 2nd edn., John Wiley & Sons, Inc., Hoboken, NJ.
[4] MacKay, D. J. C. (2005) Information Theory, Inference, and Learning Algorithms, Cambridge University Press.
[5] Barsoum, M., Jones, C., and Fitz, M. (2007) Constellation design via capacity maximization. IEEE International Symposium on Information Theory (ISIT), June 2007, Nice, France.
[6] Schreckenbach, F. (2007) Iterative decoding of bit-interleaved coded modulation. PhD dissertation, Technische Universität München, Munich, Germany.
[7] Brännström, F. and Rasmussen, L. K. (2009) Classification of unique mappings for 8PSK based on bit-wise distance spectra. IEEE Trans. Inf. Theory, 55 (3), 1131–1145.
[8] Caire, G., Taricco, G., and Biglieri, E. (1998) Bit-interleaved coded modulation. IEEE Trans. Inf. Theory, 44 (3), 927–946.
[9] Martinez, A., Guillén i Fàbregas, A., and Caire, G. (2008) Bit-interleaved coded modulation in the wideband regime. IEEE Trans. Inf. Theory, 54 (12), 5447–5455.
[10] Martinez, A., Guillén i Fàbregas, A., Caire, G., and Willems, F. M. J. (2009) Bit-interleaved coded modulation revisited: a mismatched decoding perspective. IEEE Trans. Inf. Theory, 55 (6), 2756–2765.
[11] Stierstorfer, C. and Fischer, R. F. H. (2009) Asymptotically optimal mappings for BICM with $c04-math-0658$ -PAM and $c04-math-0659$ -QAM. IET Electron. Lett., 45 (3), 173–174.
[12] Stierstorfer, C. (2009) A bit-level-based approach to coded multicarrier transmission. PhD dissertation, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.
[13] Wachsmann, U., Fischer, R. F. H., and Huber, J. B. (1999) Multilevel codes: theoretical concepts and practical design rules. IEEE Trans. Inf. Theory, 45 (5), 1361–1391.
[14] Nguyen, T. and Lampe, L. (2011) Bit-interleaved coded modulation with mismatched decoding metrics. IEEE Trans. Commun., 59 (2), 437–447.
[15] Jaldén, J., Fertl, P., and Matz, G. (2010) On the generalized mutual information of BICM systems with approximate demodulation. IEEE Information Theory Workshop (ITW), January 2010, Cairo, Egypt.
[16] Peng, L. (2012) Fundamentals of bit-interleaved coded modulation and reliable source transmission. PhD dissertation, University of Cambridge, Cambridge.
[17] Alvarado, A. and Agrell, E. (2014) Achievable rates for four-dimensional coded modulation with a bit-wise receiver. Optical Fiber Conference (OFC), March 2014, San Francisco, CA.
[18] Alvarado, A., Brännström, F., and Agrell, E. (2014) A simple approximation for the bit-interleaved coded modulation capacity. IEEE Commun. Lett., 18 (3), 495–498.
[19] Açikel, O. and Ryan, W. (1999) Punctured turbo-codes for BPSK/QPSK channels. IEEE Trans. Commun., 47 (9), 1325–1323.
[20] Kousa, M. A. and Mugaibel, A. H. (2002) Puncturing effects on turbo codes. Proc. IEE, 149 (3), 132–138.
[21] Calderbank, A. R. and Ozarow, L. H. (1990) Nonequiprobable signaling on the Gaussian channel. IEEE Trans. Inf. Theory, 36 (4), 726–740.
[22] Forney, G. D. Jr. and Wei, L.-F. (1989) Multidimensional constellations—Part I: Introduction, figures of merit, and generalized cross constellations. IEEE J. Sel. Areas Commun., 7 (6), 877–892.
[23] Forney, G. D. Jr. (1989) Multidimensional constellations—Part II: voronoi constellations. IEEE J. Sel. Areas Commun., 7 (6), 941–957.
[24] Fischer, R. F. H. (2002) Precoding and Signal Shaping for Digital Transmission, John Wiley & Sons, Inc..
[25] Sommer, D. and Fettweis, G. P. (2000) Signal shaping by non-uniform QAM for AWGN channels and applications using turbo coding. International ITG Conference on Source and Channel Coding (SCC), January 2000, Munich, Germany.
[26] Goff, S. Y. L. (2003) Signal constellations for bit-interleaved coded modulation. IEEE Trans. Inf. Theory, 49 (1), 307–313.
[27] Le Goff, S. Y., Sharif, B. S., and Jimaa, S. A. (2004) A new bit-interleaved coded modulation scheme using shaping coding. IEEE Global Telecommunications Conference (GLOBECOM), November–December 2004, Dallas, TX.
[28] Raphaeli, D. and Gurevitz, A. (2004) Constellation shaping for pragmatic turbo-coded modulation with high spectral efficiency. IEEE Trans. Commun., 52 (3), 341–345.
[29] Le Goff, S., Sharif, B. S., and Jimaa, S. A. (2005) Bit-interleaved turbo-coded modulation using shaping coding. IEEE Commun. Lett., 9 (3), 246–248.
[30] Le Goff, S., Khoo, B. K., Tsimenidis, C. C., and Sharif, B. S. (2007) Constellation shaping for bandwidth-efficient turbo-coded modulation with iterative receiver. IEEE Trans. Wirel. Commun., 6 (6), 2223–2233.
[31] Guillén i Fàbregas, A. and Martinez, A. (2010) Bit-interleaved coded modulation with shaping. IEEE Information Theory Workshop (ITW), August–September 2010, Dublin, Ireland.
[32] Peng, L., Guillén i Fàbregas, A., and Martinez, A. (2012) Mismatched shaping schemes for bit-interleaved coded modulation. IEEE International Symposium on Information Theory (ISIT), July 2012, Cambridge, MA.
[33] Valenti, M. and Xiang, X. (2012) Constellation shaping for bit-interleaved LDPC coded APSK. IEEE Trans. Commun., 60 (10), 2960–2970.
[34] Böcherer, G., Altenbach, F., Alvarado, A., Corroy, S., and Mathar, R. (2012) An efficient algorithm to calculate BICM capacity. IEEE International Symposium on Information Theory (ISIT), July 2012, Cambridge, MA.
[35] Agrell, E. and Alvarado, A. (2011) Optimal alphabets and binary labelings for BICM at low SNR. IEEE Trans. Inf. Theory, 57 (10), 6650–6672.
[36] Verdú, S. (2002) Spectral efficiency in the wideband regime. IEEE Trans. Inf. Theory, 48 (6), 1319–1343.
[37] Prelov, V. V. and Verdú, S. (2004) Second-order asymptotics of mutual information. IEEE Trans. Inf. Theory, 50 (8), 1567–1580.
[38] Alvarado, A., Agrell, E., Guillén i Fàbregas, A., and Martinez, A. (2010) Corrections to ‘Bit-interleaved coded modulation in the wideband regime’. IEEE Trans. Inf. Theory, 56 (12), 6513.
[39] Stierstorfer, C. and Fischer, R. F. H. (2008) Mappings for BICM in UWB scenarios. International ITG Conference on Source and Channel Coding (SCC), January 2008, Ulm, Germany.
[40] Agrell, E. and Alvarado, A. (2013) Signal shaping for BICM at low SNR. IEEE Trans. Inf. Theory, 59 (4), 2396–2410.
[41] Lassing, J., Ström, E. G., Agrell, E., and Ottosson, T. (2003) Unequal bit-error protection in coherent $c04-math-0660$ -ary PSK. IEEE Vehicular Technology Conference (VTC-Fall), October 2003, Orlando, FL.
[42] Brännström, F. (2004) Convergence analysis and design of multiple concatenated codes. PhD dissertation, Chalmers University of Technology, Göteborg, Sweden.
[43] Stierstorfer, C. and Fischer, R. F. H. (2007) (Gray) Mappings for bit-interleaved coded modulation. IEEE Vehicular Technology Conference (VTC-Spring), April 2007, Dublin, Ireland.
[44] Alvarado, A., Brännström, F., and Agrell, E. (2011) High SNR bounds for the BICM capacity. IEEE Information Theory Workshop (ITW), October 2011, Paraty, Brazil.
[45] Alvarado, A., Brännström, F., Agrell, E., and Koch, T. (2014) High-SNR asymptotics of mutual information for discrete constellations with applications to BICM. IEEE Trans. Inf. Theory, 60 (2), 1061–1076.
[46] Hildebrand, F. B. (ed.) (1981) Handbook of Applicable Mathematics: Numerical Methods, Vol. 3, John Wiley & Sons, Inc.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Function	Expression
$c04-math-0619$	$c04-math-0620$
$c04-math-0621$	$c04-math-0622$
$c04-math-0623$	$c04-math-0624$
	$c04-math-0625$
$c04-math-0626$	$c04-math-0627$
	$c04-math-0628$

Table of Contents for 4.4 BICM, CM, and MLC: A Comparison

Create new playlist

Sign In

Sign Up

4.3.2 BICM Pseudo-Capacity

4.3.3 Useful Relationships

4.4 BICM, CM, and MLC: A Comparison

4.4.1 Achievable Rates

4.4.2 Robustness of the Encoding Scheme

4.5 Numerical Calculation of Mutual Information

4.6 Bibliographical Notes

References

Table of Contents for
4.4 BICM, CM, and MLC: A Comparison