The analysis shown in Section 5.2 is slightly more involved in the case of 2D constellations. In what follows, we will first describe how to find the tessellation regions , and then, we will find the PDF of the L-values for arbitrary 2D constellations.
Finding the tessellation regions for 1D constellations is relatively simple and can be done by inspection, as we did, e.g., in Example 5.5. A more structured approach is required for 2D constellations, because as we will see, even with very regular constellations such as QAM, the regions may significantly change in a function of the binary labeling.
We start by rewriting (5.27) as
where , , and is given by (3.54).
These tessellation regions are mutually disjoint
and span the whole space
Moreover, because in (5.69) is defined by linear inequalities, the tessellation region may be (i) a (convex) polygon, (ii) a region that extends to infinity (i.e., an “infinite” polygon), or (iii) an empty set (if the inequalities in (5.69) are contradictory).
It should be rather clear that the regions cannot be deduced from the so-called Voronoi regions of the constellation (see (3.100)). The Voronoi regions depend solely on the constellation , while depends also the on the labeling (via and ), or equivalently, on the subconstellations and . However, the tessellation regions can be obtained as the intersection of the Voronoi regions of the subconstellations and , as explained in the following example.
Finding the tessellation regions in Examples 5.6 and 5.7 is relatively simple, however, this is not always the case. For example, when a 16PAM constellation labeled by the M16 (shown in Fig. 2.15) is considered, the labeling cannot be factorized as the ordered direct product of two labelings, so the tessellation regions are difficult to find by inspection. As we will see in Example 5.8, the regions in this case are very irregular, and thus, to find them, we require an algorithmic support.
In order to find , we need to remove redundant constraints from (5.69). The linear inequalities and in (5.69) can be reformulated in a matrix form6 as
where
is an matrix,7 and where and are matrices whose columns are taken from the respective sets
Similarly, the vector in (5.72) is
where and are column vectors whose elements are taken from the sets
respectively. To make (5.72) correspond to (5.69), we also assume that the matrix and the vector are ordered in the same way. In other words, if the column of is defined by a particular pair of indices , the same pair defines the corresponding element of . The same applies to and whose entries are defined by the pairs .
In general, (5.72) has redundant inequalities, i.e., the number of linear forms defining the region is smaller than . Removing the redundant inequalities in (5.72) is a “classical” problem in computational geometry, and to solve it, we first require the region to be closed, i.e., infinite polygons (which indeed exists, as shown in Fig. 5.13) are artificially closed by adding constraints forming a square with arbitrarily large dimensions
where is a vector of ones. We can keep track of these artificial constraints and, if they appear in the final solution, we should eliminate them, which would produce an infinite polygon. However, for practical purposes, retaining the artificial inequalities has negligible effect when is sufficiently large.
We then focus on the augmented problem
where is an matrix and is an column vector. To solve (5.80), we translate the coordinates via to obtain
so that after the translation, satisfies all the inequalities in (5.82). The problem then boils down to finding a solution of
Various methods may be used to find a feasible solution of the set of inequalities in (5.83). Here, we have solved the following quadratic optimization problem:
Note that the objective function is defined arbitrarily as we are interested in finding any translation point satisfying the constraints. If the solution of (5.84) does not exist, it means that the inequalities (5.72) are contradictory and the tessellation region is empty.
After has been found, (5.82) can be expressed as
where . The next step is to normalize each inequality in (5.85) to obtain
where the normalization is such that the th row of is equal to the th row of divided by the (positive by definition) th entry of .
The original problem in (5.72) expressed as in (5.86) is now in a standard form and can be solved using the duality between the line and points description. This duality states that removing redundant inequalities from (5.86) is equivalent to finding the minimum convex region enclosing the rows of , where each row is treated as a point in 2D space. We thus have to find a convex hull of these points, which can be done efficiently using numerical routines available, e.g., in MATLAB® or Mathematica®.
As we will see in the next section, to compute the PDF of the L-values, we need not only the linear pieces defining the tessellation regions but also the vertices of the polygons. Finding these vertices is again relatively simple and corresponds to a vertex enumeration problem treated in computational geometry.
In the previous section, we have shown how to find the tessellation regions for arbitrary 2D constellations and labelings. The next step toward finding the PDF of the L-values is to find the CDFs as required by (5.31). We start by rewriting (5.69) as
where we use
to denote each of the nonredundant linear forms defining the tessellation regions , i.e., for each there exists a mapping between and the indices defining . These nonredundant linear forms are the ones we have already shown, e.g., in Fig. 5.13: if , , and , the region is an infinite polygon defined by linear forms; if , , and , we obtain ; and for the region (the closed polygon containing and ), we have . We also define the vertices of the tessellation regions as .
The CDF (5.32) can be expressed as
where
contains observations which satisfy (this region is shown shaded in Fig. 5.16) and
where given by (5.43). The equation defines the line perpendicular to , as schematically shown in Fig. 5.16.
The region in (5.90) is a subset of (5.87) and depends on . Further, to simplify the presentation, we assume that the tessellation region is a closed polygon defined by nonredundant inequalities. Then the line either (i) intersects8 two sides of polygon and these sides are then active, or (ii) does not have a common point with the polygon. The first case is critical to the calculation of the CDF. We note that while the indices of the linear forms defining the active sides change with , they remain constant as long as, varying , the line does not cross one of the vertices of the polygon; i.e., the indices change when satisfies , which may occur for distinct values of defined by
Thus, denoting by the sorted values of in (5.92), we find the intervals
such that
For a given value of , we will use the indices and to denote the two sides of the polygon which remain active for all .
To clarify the above discussion, consider the example shown in Fig. 5.16. In this case, we have and , , and . For the particular value of in Fig. 5.16, so and . Furthermore, in this figure we schematically show the values of as a projection of the vertices on the line connecting and which becomes the “axis” for the arguments of the PDF (see also Fig. 3.2). The origin of this axis corresponds to . Note that the origin does not necessarily belong to .
The calculation of (5.89) requires now a 2D-equivalent of the integration shown in (5.48). This is practically the same problem we need to solve when evaluating the bit- and symbol-error rates, so the approach we have adopted here is also explained in detail in Section 6.1.2. More particularly, we decompose the observation space into elementary wedges (see also (6.34)) defined as
Integration over such wedge can be expressed as9
where is the bivariate Q-function defined in (2.11).
As shown in Fig. 5.16, the union of the wedges and the polygon over which the integration must be carried out yields the entire observation space. In the particular case shown in Fig. 5.16 we can write
where we used shorthand notation to denote the parameters of the linear form (5.91). We note that the linear form , being redundant for the definition of (for the particular value of used in this example), does not affect the integration result.
We are now ready to find the conditional PDF of the L-values .
Analyzing Theorem 5.10, we note that the indicator function in (5.106) takes into account the variation of the indices and because of the change of moving throughout the intervals . As we see, the form of the PDF does not depend uniquely on , , and , but also on other symbols from the constellation which define the polygon's sides. We also note that the Gaussian piece we have already obtained in the case of 1D constellations, is still present, but it is now multiplied by the “correction” terms (5.107).
In this section, we consider an 8PSK constellation, which is probably the simplest nontrivial example of a 2D constellation. Furthermore, because of its practical relevance, we consider the BRGC. The constellation and labeling are shown in Fig. 5.14.
The tessellation regions are simple to find as we have already shown it in Example 5.7: they are always either (i) the Voronoi regions of the constellation symbols (e.g., , in Fig. 5.14) or (ii) a union of two Voronoi regions (e.g., , or any , see again Fig. 5.14). This property spares us the need to apply the algorithmic approach of Section 5.3.1, which means we do not need to set the artificial constraints (5.79). Second, because of symmetry of the BRGC, the regions also have a notable symmetry, which we will exploit to simplify the analysis.
As for PSK constellations , we obtain
and thus,
The expression in (5.112) shows that all the lines delimiting the regions pass through the origin and so does the line .
Let us focus on the case which also gives us the results for (as explained in Example 5.7). This case is shown in Fig. 5.17. The piece of the PDF depends on the form of the tessellation region as well as on the position of the symbol with respect to this region. An inspection of the constellation in Fig. 5.17 lets us conclude the following:
thus, it is sufficient to find for , conditioned on .
i.e., .Moreover, for we observe the following relationship:
i.e., .
After differentiation and generalization to other regions we obtain the following relationships:
The advantage of (5.116)–(5.118) is that we can now express the PDF using (5.33) as
The same can be done for the PDF pieces conditioned on , namely,
which yields
Thus, instead of varying the regions for a constant as suggested by (5.33), in (5.120) and (5.125), we limit our considerations to two regions and change the symbols . We emphasize that this is possible in the particular case of an PSK constellation labeled by the BRGC because of the symmetries of the constellation and the binary labeling. This is very convenient because we only need to find and for .
The same can be done for . In this case, only one symbol needs to be considered, namely, . It can be shown that in this case
In order to reuse the results obtained for , we can further exploit the equivalence (up to a rotation) of the tessellation regions and (see Fig. 5.13), i.e., for and .
Up to now, we have shown the PDF can be expressed as a function of and . The next step is to use Theorem 5.10. Before doing this, however, we study some properties of PSK constellations.
More specifically, using , where (see (2.52)), we find
where
Similarly, we can find
We can now analyze the contribution of the region . From (5.106), we obtain
where , and . Here, , and , so (5.107) becomes
where we used .
Note that there is only one correction term in (5.136) because the region is a semi-infinite polygon and the line intersects only one of its sides. Equivalently, we might set an artificial constraint to close the polygon, e.g., , where and . Then and , cf. (5.107).
Similarly, for we obtain
where , , and and , so
with , and .
The form of is shown in Fig. 5.18 for and , where we identify the pieces . Similarly, we show the corresponding results for in Fig. 5.19. We can observe that for , the term has a “dominating” contribution to . In particular, in Fig. 5.18 (a), we see that for , takes on the largest values, while in Fig. 5.18 (b), we see that for , the piece dominates the others. Finally, in Fig. 5.19 we see that for , is again the dominating piece.
18.221.12.52