Loading [MathJax]/extensions/MathMenu.js
MIMO Per-Tone Equalizer Design for Long Reach xDSL | IEEE Journals & Magazine | IEEE Xplore

MIMO Per-Tone Equalizer Design for Long Reach xDSL


Abstract:

Recently, long reach x-digital subscriber line (LR-xDSL) has been proposed to extend the reach of conventional DSL systems. The extended loop lengths are characterized by...Show More

Abstract:

Recently, long reach x-digital subscriber line (LR-xDSL) has been proposed to extend the reach of conventional DSL systems. The extended loop lengths are characterized by a longer channel impulse response (CIR), which requires a similarly longer cyclic prefix (CP) to successfully eliminate the inter-symbol interference (ISI) between successive time-domain discrete multi-tone (DMT) symbols and inter-carrier interference (ICI) between the carriers or tones of the same DMT symbol. This adds a large overhead to the transmitted symbols and results in throughput loss. A per-tone equalizer (PTEQ) is an attractive alternative to deal with extended loop lengths. However, it imposes a large initialization computational complexity and memory requirement, hindering the use of a PTEQ in practical multiple-input multiple-output (MIMO) scenarios. To tackle this problem, a specific structure in the MIMO DSL channel, namely that the combined ISI and ICI signal power from the crosstalk channels is significantly lower than the desired and combined ISI and ICI signal power from the direct channels, may be exploited in deriving a novel low complexity/memory solution, here referred as sparse MIMO PTEQ, with negligible impact (≈ 0.5% drop) on performance compared to a full MIMO PTEQ. For a conventional DSL binder size of 16 lines and a PTEQ order of 3, the proposed sparse MIMO PTEQ performs at 42% of the initialization computational complexity and 29.7% of the memory requirement, with negligible performance degradation, compared to a full MIMO PTEQ. The initialization computational complexity and memory requirement is further reduced by the proposed diagonal MIMO PTEQ which operates at 0.4% of the initialization computational complexity of a full MIMO PTEQ and requires 6.25% memory compared to a full MIMO PTEQ, with some degradation in the performance compared to the full MIMO PTEQ. The diagonal MIMO PTEQ has the additional benefit that it can be applied in both upstream and downstream sc...
Page(s): 51 - 64
Date of Publication: 22 December 2021
Electronic ISSN: 2644-125X

Funding Agency:


SECTION I.

Introduction

Digital subscriber line (DSL) technologies hold a major market-share of broadband communication [1]. They are based on discrete multi-tone (DMT) modulation, which is a multicarrier modulation technique similar to orthogonal frequency division multiplexing (OFDM). DMT is used in wireline communication, where channels are slowly time-varying and the noise in the channel is assumed stationary. Hence channels are assumed to be known after an initial training and quasi-static. This allows DMT to use bit-loading and adaptive power loading in addition to basic OFDM operation. In DMT on the transmitter side, the input bits to be transmitted are converted into parallel bit streams, each stream to be carried on a carrier or tone (a discrete sub-band in the available bandwidth). In each stream, the bits are converted into higher order QAM symbols (up to 16384-QAM [2]), which are then modulated on to the discrete tones by an inverse discrete Fourier transform (IDFT) operation to obtain time-domain symbols. A cyclic prefix (CP) is added which—provided that its length at least matches the length of the channel impulse response (CIR), and the transmitter and receiver are properly synchronized—eliminates inter-symbol interference (ISI) between successive time-domain DMT symbols and inter-carrier interference (ICI) between the tones of the same DMT symbol. On the receiver side, the CP is removed from the received time-domain symbols and subsequently a discrete Fourier transform (DFT) is applied. Finally, the received QAM symbols are equalized with a single-tap complex frequency-domain equalizer (FEQ).

In an ideal scenario, the combination of CP and FEQ works perfectly to counter ISI and ICI. In older generations of DSL, such as ADSL [3] (and its later versions ADSL2 and ADSL2+), the allowed loop lengths ranged up to 5000 meters and the available channel bandwidth ranged up to only a few MHz. Hence, these older generations have been characterized by a long CIR and low crosstalk levels enabling a per-line single-input single-output (SISO) design. Although the use of a similarly long CP eliminates ISI and ICI, it also increases the DMT symbol length, thereby introducing an overhead and decreasing the achievable throughput. An efficient way to deal with this problem has been the use of a channel shortening filter—commonly known as a time-domain equalizer (TEQ) [4]. However in later generations of DSL (e.g., VDSL2 [5], G.fast [6], G.mgfast [7]) the copper loop lengths have been effectively shortened with the deployment of fibre to the distribution point unit (DPU), The typical loop lengths for VDSL2, G.fast and G.mgfast are 1200m, 250m and 100m, respectively. Hence, channel shortening has not been used in the later generations of DSL, confining its use to ADSL.

Recently long reach VDSL2 (LR-VDSL2) has been proposed with the purpose of providing high data rates (possibly up to $40 Mbit/s$ for downstream [8]) and a longer reach than conventional VDSL2 (up to 2100m) for areas where optical fibre cannot easily be deployed (due to geographical or financial barriers) [9]. Hence, the need for transmitting data over longer loops again motivates the use of a TEQ for LR-VDSL2. Moreover, long reach G.fast (LR-G.fast) is also being considered (within Q4/15 ITU standardization). A recent use case proposed under Q4/15 ITU standardization is for G.mgfast, which is arguably also the last DSL standard. In G.mgfast deployment, one of the proposed scenarios is a dual mode DPU, to serve G.mgfast for shorter loop lengths and G.fast and LR-G.fast for longer loop lengths.

These long reach extensions of existing DSL standards (LR-xDSL) will enable VDSL2 and G.fast DPUs to be connected to lines with a wider variety of lengths. All the lines connected to the same DPU have to use the same CP length in order to simplify the system and allow for efficient vectoring. If the CP length is chosen according to the longest line length then shorter lines may undergo a substantial throughput loss in order to enable ISI/ICI-free transmission on the longer lines, further motivating the use of a TEQ for LR-xDSL. However, since crosstalk can no longer be neglected in VDSL2 and later generations of DSL, a multiple-input multiple-output (MIMO) TEQ design is required for a joint shortening of both direct and crosstalk channels. Nevertheless, in the case of LR-VDSL2, besides ISI/ICI, there is an additional issue of echo and near-end crosstalk (NEXT) generation when considering a too short CP. This can be partly solved by echo cancellation techniques [10]–​[12]. Moreover, it is emphasized that the main application use-case of the paper is LR-G.fast, in which this issue of NEXT and echo does not appear. For ISI/ICI cancellation with short CP, the MIMO TEQ has been shown to do a fairly good job in terms of shortening the CIRs [9]. However, the TEQ design procedure is generally not related to true bitrate optimization of the system. Therefore, a good TEQ in terms of the channel shortening, may indeed not guarantee a corresponding significant improvement in bitrates. In [13], a SISO bitrate maximizing TEQ (BM-TEQ) has been proposed to maximize the total bitrate for a given filter order, but this comes with a large initialization computational complexity due to a non-linear non-convex cost function. Similarly, in [14], [15] a genetic algorithm based blind channel shortening equalizer structure has been proposed, for the optimal SISO TEQ coefficients search. However again, a large initialization computational complexity is incurred. Moreover, the bitrate achieved by a TEQ is known to be sensitive to the synchronization delay and their relation is non-smooth [16].

A MIMO per-tone equalizer (PTEQ) [17], [18] provides a solution to these problems faced by a MIMO TEQ. In a MIMO PTEQ, the TEQ is moved to after the DFT operations at the receiver side. Hence, each tone has its own filter, designed to maximize the signal-to-noise ratio (SNR) by solving a minimum mean squared error (MMSE) problem for each tone separately. Not only does a PTEQ provide an upper limit to the performance of a TEQ, it also yields a relation between SNR and synchronisation delay that is smooth and predictable [18], thereby simplifying determining the optimal synchronization delay. Furthermore, it has been shown that the runtime computational complexity of the MIMO PTEQ is equal or less than the runtime computational complexity of a MIMO TEQ [18]. Moreover, recently in [19] a transmitter side per-tone precoder has been proposed, which can be applied at the DPU in a downstream scenario, thus reducing the computational load on customer-premises equipment (CPE). Finally, apart from ISI/ICI cancellation, the MIMO PTEQ also performs crosstalk cancellation, removing the need for a separate entity (vectoring) for crosstalk cancellation [20]–​[22].

A. Main Contributions

Despite the advantages of the MIMO PTEQ, a large initialization computational complexity and memory requirement hinder its applicability in practical scenarios. To tackle this problem, a specific structure in the MIMO DSL channel, namely that the combined ISI and ICI signal power from the crosstalk channels is significantly lower than the desired and combined ISI and ICI signal power from the direct channels, is exploited in deriving two new MIMO PTEQ structures, that reduce the initialization computational complexity and memory requirement of a full MIMO PTEQ. The two proposed structures are the sparse MIMO PTEQ and diagonal MIMO PTEQ.

  1. The proposed sparse MIMO PTEQ results in negligible loss of performance. From the simulation results obtained in Section VII it is observed that for an order-1 full MIMO PTEQ and an order-1 sparse MIMO PTEQ, the average difference in achieved data rate, over all synchronization delays, is only ≈ 0.5%. The average difference in the peak data rate is only ≈ 0.1%.

  2. The proposed diagonal MIMO PTEQ results a in further reduction of the initialization computational complexity, runtime computational complexity and memory requirement at the cost of a small performance loss. The diagonal MIMO PTEQ has the additional benefit that it can be applied in both upstream and downstream scenarios, in contrast to the full and sparse MIMO PTEQ structures which can be used only in upstream scenarios.

A similar reduction in the initialization computational complexity and memory requirement has been proposed for the MIMO TEQ in [9]. The basic ideas proposed in this paper and in [9] are similar in that both papers aim at reducing the computational complexity and memory requirement, for MIMO PTEQ and MIMO TEQ, respectively. However, the MIMO TEQ, discussed in [9], aims at channel shortening while the MIMO PTEQ aims at bit rate maximization. Therefore, the underlying system equations, constrained optimization problems and cost functions are very different in the two papers.

B. Organization and Notation

The paper is organized as follows: Section II provides a brief overview of the MIMO DSL system model and reviews the full MIMO PTEQ design equations. Section III and Section IV present the sparse and diagonal MIMO PTEQ structure, respectively. Section V and Section VI present the initialization computational complexity and memory requirement of proposed MIMO PTEQ structures, respectively. Section VII reports the results and finally Section VIII concludes the paper.

Lower-case and upper-case boldface letters are used to denote vectors (row vectors and column vectors) and matrices, respectively, with $\mathbf {a}_{l:m}$ representing elements of vector $\mathbf {a}$ with indices from $l$ to $m$ . Further, $(.)^{T}$ is used to represent the transpose operation, $(.)^{H}$ the Hermitian transpose operation, $(.)^{\ast}$ complex conjugation and $\mathbb {E}[.]$ the expected value operation.

SECTION II.

MIMO PTEQ

The system considered is a cable binder with $M$ lines (users) corresponding to an $M \times M$ baseband communication system with additive white Gaussian noise and time-dispersive channels of length $L$ . In VDSL2 and later generations of DSL, transmit pulse shaping is also applied to the DMT symbols to reduce the effect of echo and near-end crosstalk (NEXT) [23]. A CP of length $\nu + \beta $ and a cyclic suffix (CS) of length $\mu + \beta $ are then added to the DMT symbol of size $N$ ($N$ represents the IDFT (DFT) size used at the transmitter (receiver)). A transmit pulse shaping with a window (usually a raised-cosine window) with roll-off length of $\beta $ is applied to both ends of the symbol and eventually successive symbols are overlapped with an overlap length of $\beta $ , as shown in Fig. 1.

FIGURE 1. - Time-domain transmitted symbol for xDSL.
FIGURE 1.

Time-domain transmitted symbol for xDSL.

Hence, the time-domain received signal for user $m = 1,\ldots, M$ is given in (1), shown at the bottom of the page.

Here, $T$ is the equalizer length, $s$ denotes the symbol period after the overlapping of successive symbols, given by $s = N + \nu +\mu +\beta $ . Vector $\ddot {\mathbf {x}}^{l}_{(k)} \triangleq \begin{bmatrix} X^{l}_{(k),1}, X^{l}_{(k),2} \cdots X^{l}_{(k),N} \end{bmatrix}^{T}$ represents the $k^{th}$ symbol of user $l$ , with $X^{l}_{(k),i}$ the QAM-symbol for tone $i = 1,\ldots, N$ . Matrix $\mathbf {\mathbb {I}}_{N}$ is an $N \times N$ IDFT matrix. Matrix $\mathbf {C}$ adds the CS and CP to the symbols and performs transmit pulse shaping. The structure of the matrix $\mathbf {C}$ is as follows\begin{align*} \mathbf {C} &= \left [{\begin{array}{ccc} \mathbf {0} & \mathbf {0} & | \mathbf {C}_{1} \\ \hline & \mathbf {I}_{N} & \\ \hline \mathbf {C}_{2} |& \mathbf {0} & \mathbf {0} \end{array} }\right] \tag{2a} \\ \mathbf {C}_{1} &= \begin{bmatrix} \begin{array}{c|c} \mathbf {I}_{\beta } \text{diag}\left ({\boldsymbol {\varpi }_{1:\beta }}\right) & \mathbf {0}_{\beta \times \nu } \\ \hline \mathbf {0}_{\nu \times \beta } & \mathbf {I}_{\nu } \end{array} \end{bmatrix} \tag{2b} \\ \mathbf {C}_{2}&=\begin{bmatrix} \begin{array}{c|c} \mathbf {I}_{\mu } & \mathbf {0}_{\mu \times \beta } \\ \hline \mathbf {0}_{\beta \times \mu } & \mathbf {I}_{\beta } \text{diag}(\boldsymbol {\varpi }_{\beta:end}) \end{array} \end{bmatrix}\tag{2c}\end{align*} View SourceRight-click on figure for MathML and additional features. where, $\boldsymbol {\varpi }$ is the raised cosine window given as \begin{equation*} \boldsymbol {\varpi }_{l} = \frac {1}{2}\left [{1 + cos\left ({\frac {\pi }{\beta } \left ({l - \beta }\right) }\right) }\right] \quad 1 \leq l \leq 2\beta\tag{3}\end{equation*} View SourceRight-click on figure for MathML and additional features. The$\boxed{\mathbf {I}}$ in formula (1) represents an identity matrix of dimension $N+\nu + \mu +2\beta $ which in the compound matrix has $\beta $ rows overlap with the next and/or previous$\boxed{\mathbf {I}}$ . Finally, $h^{ml}_{j}$ , which is a part of channel matrix $\mathbf {H}^{ml}$ , represents the $j^{th}$ tap of the channel from user $l$ to user $m$ , and $\mathbf {O_{1}}$ and $\mathbf {O_{2}}$ are all-zero matrices with dimensions $(N+T-1) \times (N+2\nu +\mu +2\beta -L -T+2+\Delta)$ and $(N+T-1)\times (N+\nu +2\mu +2\beta -\Delta)$ receptively, where $\Delta $ is the so-called synchronization delay—a design parameter. For ease of notation we do not include $\Delta $ in the system equations and variables, from here on.

The MIMO PTEQ, as described in [18], is an $M \times M$ equalizer of length $T$ (order $T-1$ ), applied to the output of a sliding DFT (SDFT) of the received signals $\mathbf {y}^{m}_{[k]}$ (Fig. 2(a)), i.e., \begin{equation*} {\ddot {\mathbf {y}}}^{m}_{\left [{k}\right],i} = \mathbf {F}_{i} \mathbf {y}^{m}_{\left [{k}\right]}\tag{4}\end{equation*} View SourceRight-click on figure for MathML and additional features. where ${\ddot {\mathbf {y}}}^{m}_{[k],i}$ is the $T \times 1$ SDFT output for user $m$ on tone $i$ , with \begin{align*} \mathbf {F}_{i} = \left [{\begin{array}{ccccccc} \boxed{\text{row}_{i}\left ({\mathbf {\mathbb {F}_{N}}}\right)} &{}\cdots &0 \\ 0 \boxed{\text{row}_{i}\left ({\mathbf {\mathbb {F}_{N}}}\right)} & \cdots &0 \\ \vdots & \ddots & \vdots \\ 0&{}\cdots & \boxed{\text{row}_{i}\left ({\mathbf {\mathbb {F}_{N}}}\right)} \\ \end{array} }\right]\tag{5}\end{align*} View SourceRight-click on figure for MathML and additional features. where $\mathbf {\mathbb {F}_{N}}$ is an $N\times N$ DFT matrix and $\text{row}_{i}(\mathbf {\mathbb {F}_{N}})$ represents the $i^{th}$ row of the DFT matrix $\mathbf {\mathbb {F}_{N}}$ . These SDFT outputs on tone $i$ can be combined for all users in a single $MT \times 1$ vector \begin{align*} \underbrace {\left [{ \begin{array}{c} {\ddot {\mathbf {y}}}^{1}_{\left [{k}\right],i} \\ {\ddot {\mathbf {y}}}^{2}_{\left [{k}\right],i} \\ \vdots \\ {\ddot {\mathbf {y}}}^{M}_{\left [{k}\right],i} \end{array} }\right] }_{{\ddot {\mathbf {y}}}_{\left [{k}\right],i}} = \underbrace {\left [{\begin{array}{ccccc} \mathbf {F}_{i} & \mathbf {0}& \cdots & \mathbf {0} \\ \mathbf {0} & \mathbf {F}_{i} & \cdots & \mathbf {0} \\ \vdots & \ddots & \ddots & \vdots \\ \mathbf {0} & \cdots & \cdots & \mathbf {F}_{i}\\ \end{array} }\right]}_{\check {\mathbf {F}}_{i}} \underbrace {\left [{ \begin{array}{c} \mathbf {y}^{1}_{\left [{k}\right]} \\ \mathbf {y}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {y}^{M}_{\left [{k}\right]} \end{array} }\right]}_{ \mathbf {y}_{\left [{k}\right]}}\tag{6}\end{align*} View SourceRight-click on figure for MathML and additional features. The aim of a MIMO PTEQ is to generate an estimate of the $k^{th}$ transmitted symbol $({X}^{m}_{(k),i})$ from the SDFT output vector on tone $i$ (${\ddot {\mathbf {y}}}_{[k],i}$ ), for all the users $m$ and tones $i$ . Hence, the estimated $k^{th}$ transmitted symbol for user $m$ on tone $i$ , with the full MIMO PTEQ, is given as \begin{equation*} \hat {X}^{m}_{\left ({k}\right),i} = \mathbf {w}^{m}_{i} {\ddot {\mathbf {y}}}_{\left [{k}\right],i}\tag{7}\end{equation*} View SourceRight-click on figure for MathML and additional features. Here, $\mathbf {w}^{m}_{i}$ contains the MIMO PTEQ coefficients for user $m$ on tone $i$ , i.e., \begin{equation*} \mathbf {w}^{m}_{i} = \left [{\mathbf {w}^{m1}_{i} \mathbf {w}^{m2}_{i} \cdots \mathbf {w}^{mm}_{i} \cdots \mathbf {w}^{mM}_{i} }\right]\tag{8}\end{equation*} View SourceRight-click on figure for MathML and additional features. where, each $\mathbf {w}^{ml}_{i}$ represents an equalizer of order $T-1$ :\begin{equation*} \mathbf {w}^{ml}_{i} = \left [{ w^{ml}_{\left ({T-1}\right),i} w^{ml}_{\left ({T-2}\right),i} \cdots w^{ml}_{\left ({0}\right),i} }\right]\tag{9}\end{equation*} View SourceRight-click on figure for MathML and additional features. These MIMO PTEQ coefficients for tone $i$ can be stacked for all users in a single matrix, yielding a full MIMO PTEQ matrix for tone $i$ \begin{align*} \mathbf {W}_{i} = \left [{\begin{matrix} \mathbf {w}_{i}^{11} & \mathbf {w}_{i}^{12} & \cdots & \mathbf {w}_{i}^{1M} \\ \mathbf {w}_{i}^{21} & \mathbf {w}_{i}^{22} & \cdots & \mathbf {w}_{i}^{2M} \\ \vdots & \ddots & \\ \mathbf {w}_{i}^{M1} & \mathbf {w}_{i}^{M2}& \cdots & \mathbf {w}_{i}^{MM} \end{matrix}}\right]\tag{10}\end{align*} View SourceRight-click on figure for MathML and additional features. Using (1), the estimated $k^{th}$ transmitted symbol for user $m$ on tone $i$ , (7) can be rewritten as \begin{align*} \hat {X}^{m}_{\left ({k}\right),i}=&\mathbf {w}^{m}_{i} \check {\mathbf {F}}_{i} \left ({\left [{\begin{array}{cccc} \mathbf {H}^{11} & \mathbf {H}^{12} & \cdots & \mathbf {H}^{1M} \\ \mathbf {H}^{21} & \mathbf {H}^{22} & \cdots & \mathbf {H}^{2M} \\ \vdots & \vdots & \cdots & \vdots \\ \mathbf {H}^{M1} & \mathbf {H}^{M2} & \cdots & \mathbf {H}^{MM} \end{array} }\right]\left [{ \begin{array}{c} \breve {\mathbf {x}}^{1}_{\left [{k}\right]} \\ \breve {\mathbf {x}}^{2}_{\left [{k}\right]} \\ \vdots \\ \breve {\mathbf {x}}^{M}_{\left [{k}\right]} \end{array} }\right] }\right. \\&\qquad \qquad \left.{+ \left [{ \begin{array}{c} \mathbf {n}^{1}_{\left [{k}\right]} \\ \mathbf {n}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {n}^{M}_{\left [{k}\right]} \end{array} }\right] }\right)\tag{11}\end{align*} View SourceRight-click on figure for MathML and additional features.

FIGURE 2. - Block diagram of 
$2 \times 2$
 full MIMO PTEQ.
FIGURE 2.

Block diagram of $2 \times 2$ full MIMO PTEQ.

Based on (5) and (6), it can be observed that $M T$ DFT operations are required to generate ${\ddot {\mathbf {y}}}_{[k],i}$ , leading to a large runtime computational complexity. However, in [17], a more efficient difference terms based implementation of the SDFT for the PTEQ has been proposed as follows. The SDFT matrix $\mathbf {F}_{i}$ can be decomposed [19] into a lower triangular matrix and a sparse matrix, i.e., \begin{align*} \mathbf {F}_{i}=\underbrace {\left [{\begin{array}{cccc} \alpha ^{0i} & \cdots & 0 \\ \vdots & \ddots & \vdots \\ \alpha ^{\left ({T-1}\right)i} & \cdots & \alpha ^{0i}\\ \end{array} }\right]}_{\mathbf {L}_{i}} \underbrace {\left [{ \begin{array}{cc|c} \,\,\,\,\text{row}_{i}\left ({\mathbf {\mathbb {F}_{N}}}\right) & \mathbf {0}\\ \hline -\alpha ^{i} \mathbf {I}_{T-1} & \mathbf {0} & \alpha ^{i} \mathbf {I}_{T-1} \end{array}}\right]}_{{\underline {\text{F}}}_{i}} \\\tag{12}\end{align*} View SourceRight-click on figure for MathML and additional features. where $\alpha = \exp ^{-j2\pi /N} $ . The first row of the matrix ${\underline {\text {F}}}_{i}$ corresponds to a single DFT operation and the other rows result in $T-1$ so-called difference terms. Hence, utilizing the difference terms implementation instead of the SDFT, (7) can be rewritten as \begin{equation*} \hat {X}^{m}_{\left ({k}\right),i} = \mathbf {v}^{m}_{i} \check {{\underline {\text {F}}}}_{i} \mathbf {y}_{\left [{k}\right]}\tag{13}\end{equation*} View SourceRight-click on figure for MathML and additional features. where $\check {{\underline {\text {F}}}}_{i}$ is given as \begin{align*} \check {{\underline {\text {F}}}}_{i} = \left [{\begin{array}{ccccc} {\underline {\text {F}}}_{i} & \mathbf {0}& \cdots & \mathbf {0} \\ \mathbf {0} & {\underline {\text {F}}}_{i} & \cdots & \mathbf {0} \\ \vdots & \ddots & \ddots & \vdots \\ \mathbf {0} & \cdots & \cdots & {\underline {\text {F}}}_{i}\\ \end{array} }\right]\tag{14}\end{align*} View SourceRight-click on figure for MathML and additional features. and $\mathbf {v}^{m}_{i}$ are the modified PTEQ coefficients with $\mathbf {v}^{m,l}_{i} = \mathbf {L}_{i}^{T}\mathbf {w}^{m,l}_{i}$ , which will then be computed directly instead of the original PTEQ coefficients $\mathbf {w}^{m}_{i}$ . Similar to $\mathbf {W}_{i} $ in (10), the full MIMO PTEQ matrix for tone $i$ with modified PTEQ coefficients $\mathbf {v}^{m}_{i}$ can be written as \begin{align*} \mathbf {V}_{i} = \left [{\begin{matrix} \mathbf {v}_{i}^{11} & \mathbf {v}_{i}^{12} & \cdots & \mathbf {v}_{i}^{1M} \\ \mathbf {v}_{i}^{21} & \mathbf {v}_{i}^{22} & \cdots & \mathbf {v}_{i}^{2M} \\ \vdots & \ddots & \\ \mathbf {v}_{i}^{M1} & \mathbf {v}_{i}^{M2}& \cdots & \mathbf {v}_{i}^{MM} \end{matrix}}\right]\tag{15}\end{align*} View SourceRight-click on figure for MathML and additional features. where, each $\mathbf {v}^{ml}_{i}$ represents an equalizer of order $T-1$ :\begin{equation*} \mathbf {v}^{ml}_{i} = \left [{ v^{ml}_{\left ({T-1}\right),i} v^{ml}_{\left ({T-2}\right),i} \cdots v^{ml}_{\left ({0}\right),i} }\right]\tag{16}\end{equation*} View SourceRight-click on figure for MathML and additional features. Finally, the optimization problem for the optimal (MMSE based) full MIMO PTEQ filter coefficients for user $m$ on tone $i$ is defined as \begin{align*} \underset {\mathbf {\mathbf {v}^{m}_{i}}}{\text{minimize }}&\mathbb {E}\left [{ \left |{ \hat {X}^{m}_{\left ({k}\right),i} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right] \\&\left ({or}\right) \\ \underset {\mathbf {\mathbf {v}^{m}_{i}}}{\text{minimize }}&\mathbb {E}\left [{ \left |{ \mathbf {v}^{m}_{i} \check {{\underline {\text {F}}}}_{i} \mathbf {y}_{\left [{k}\right]} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right]\tag{17}\end{align*} View SourceRight-click on figure for MathML and additional features. where, the latter can be obtained from the former by substituting the definition of $\hat {X}^{m}_{(k),i}$ from (13). The solution to the optimization problem (17) is given by (see Appendix) \begin{equation*} \left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} = \mathbb {E}\left [{{\ddot {\mathbf {y}}}_{\left [{k}\right],i}\left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H} }\right]^{-1}\mathbb {E}\left [{ {\ddot {\mathbf {y}}}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right]\tag{18}\end{equation*} View SourceRight-click on figure for MathML and additional features. The solution can be further written as a function of the channel matrices, input correlation and noise correlation matrices, based on (1). This is left out here for conciseness (see also Sections III and IV for similar derivations).

Furthermore, it is to be noticed that if the CP length is too short, the received signals after the DMT demodulation (before a soft/hard decision operation) are generally improper [24], [25]. However, [26] proposed a widely-linear PTEQ (WL-PTEQ) considering impropriety of the received signals and observed that the achieved bitrates for various CP lengths (shorter than CIR) roughly coincide for the PTEQ and WL-PTEQ for equalizer order greater than 0. Therefore, this paper ignores the impropriety of the received signals after DMT demodulation.

SECTION III.

Sparse MIMO PTEQ

The full MIMO PTEQ design described in Section II thus circumvents the difficult bitrate maximizing MIMO TEQ design: The minimization in (17) directly corresponds to the maximization of the achievable bitrate for user $m$ on tone $i$ [17], [18]. However, despite the advantages of the MIMO PTEQ, its large initialization computational complexity and memory requirement hinder its applicability in practical scenarios. In this section, a first solution to this problem is proposed with a sparse MIMO PTEQ.

For MIMO DSL systems (without a TEQ/PTEQ), the combined ISI and ICI signal power from the crosstalk channels is negligible compared to the desired and the combined ISI and ICI power from the direct channels. Fig. 3 shows this structure for a $2 \times 2$ MIMO DSL system (without a TEQ/PTEQ) where the CP is shorter than the CIR (no CS is used here). The desired signal power and combined ISI and ICI power, contributed by measured dispersive DSL channels, are calculated [27] and plotted for both direct channels and crosstalk channels. It can be further observed that the combined ISI and ICI signal power from the direct channels is comparable and past a certain frequency even higher than the desired signal power from the direct channels. Thus, a multi-taps equalizer, represented by the diagonal elements of MIMO PTEQ matrix is used to combat the combined ISI and ICI signal power from the direct channels.

FIGURE 3. - Power distribution as desired signal power and combined ISI and ICI power for two measured 
$2 \times 2$
 MIMO DSL channels by two Tier-1 operators. Desired signal power from direct channels (
$-\!\!\!-\!\!\!-\!\!\!\!\!|$
), combined ISI and ICI signal power from direct channels (
$-\!-\!-\!\!\!\!\!\circ$
), desired signal power from crosstalk channels (
$-\!-\!-\!\!\!\!\!\square$
), combined ISI and ICI signal power from crosstalk channels (
$-\!-\!-\!\!\!\!\!\diamond$
).
FIGURE 3.

Power distribution as desired signal power and combined ISI and ICI power for two measured $2 \times 2$ MIMO DSL channels by two Tier-1 operators. Desired signal power from direct channels ($-\!\!\!-\!\!\!-\!\!\!\!\!|$ ), combined ISI and ICI signal power from direct channels ($-\!-\!-\!\!\!\!\!\circ$ ), desired signal power from crosstalk channels ($-\!-\!-\!\!\!\!\!\square$ ), combined ISI and ICI signal power from crosstalk channels ($-\!-\!-\!\!\!\!\!\diamond$ ).

However, comparatively, the combined ISI and ICI signal power from the crosstalk channels is negligible, especially in the lower frequencies, which is generally the high SNR region in DSL systems and hence contributes most to the data rate. Therefore, this structure can be exploited to significantly simplify the full MIMO PTEQ without impacting the performance, namely by reducing the MIMO PTEQ coefficients responsible for crosstalk ISI and ICI cancellation to a single tap scalar. The sparse MIMO PTEQ matrix elements for tone $i$ are then defined as \begin{align*} \mathbf {v}^{ml}_{i}=&\left [{ v^{ml}_{\left ({T-1}\right),i} v^{ml}_{\left ({T-2}\right),i} \cdots v^{ml}_{\left ({0}\right),i} }\right] \quad \forall m=l \\ \mathbf {v}^{ml}_{i}=&\left [{ v^{ml}_{\left ({0}\right),i} 0 \cdots 0 }\right] \equiv v^{ml}_{i} \underline {{\mathbf {e}} } \quad \forall m \neq l\tag{19}\end{align*} View SourceRight-click on figure for MathML and additional features. where $\underline {{\mathbf {e}} } = [1 \,\,0 \,\,0 \,\,\cdots \,\,0]_{1\times T}$ . Hence, the sparse MIMO PTEQ coefficients for user $m$ on tone $i$ are given as \begin{equation*} \mathbf {v}^{m}_{i} = \left [{v^{m1}_{i} \underline {{\mathbf {e}} } v^{m2}_{i} \underline {{\mathbf {e}} } \cdots \mathbf {v}_{i}^{mm} \cdots v^{mM}_{i} \underline {{\mathbf {e}} } }\right]\tag{20}\end{equation*} View SourceRight-click on figure for MathML and additional features. and the sparse MIMO PTEQ matrix for tone $i$ is given as \begin{align*} \mathbf {V}_{i}=\left [{\begin{matrix} \mathbf {v}_{i}^{11} & v^{12}_{i} \underline {{\mathbf {e}} } & \cdots & v^{1M}_{i} \underline {{\mathbf {e}} } \\ v^{21}_{i} \underline {{\mathbf {e}} } & \mathbf {v}_{i}^{22} & \cdots & v^{2M}_{i} \underline {{\mathbf {e}} } \\ \vdots & \ddots & \\ v^{M1}_{i} \underline {{\mathbf {e}} } & v^{M2}_{i} \underline {{\mathbf {e}} } & \cdots & \mathbf {v}_{i}^{MM} \end{matrix}}\right]\tag{21}\end{align*} View SourceRight-click on figure for MathML and additional features. The estimated $k^{th}$ transmitted symbol for user $m$ on tone $i$ , with the sparse MIMO PTEQ is given as \begin{equation*} \hat {X}^{m}_{\left ({k}\right),i} = \mathbf {v}^{m}_{i} \check {{\underline {\text {F}}}}_{i} \mathbf {y}_{\left [{k}\right]}\tag{22}\end{equation*} View SourceRight-click on figure for MathML and additional features. which, using (20) and (14), is equivalent to (23), shown at the bottom of the page,

where, $\check {{\underline {\text {F}}}}^{m}_{i}$ now has $T+M-1$ rows (compared to $MT$ rows for $\check {{\underline {\text {F}}}}_{i}$ in (13)).

Hence the MMSE based optimization problem for the sparse MIMO PTEQ filter coefficients for user $m$ on tone $i$ is given as \begin{equation*} \underset {\tilde {\mathbf {v}}^{\mathbf {m}}_{\mathbf {i}}}{\text{minimize }} \mathbb {E}\left [{ \left |{ \tilde {\mathbf {v}}^{m}_{i} \check {{\underline {\text {F}}}}^{m}_{i}\mathbf {y}_{\left [{k}\right]} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right]\tag{24}\end{equation*} View SourceRight-click on figure for MathML and additional features. Denoting $\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} = \check {{\underline {\text {F}}}}^{m}_{i} \mathbf {y}_{[k]} $ . The solution to the optimization problem (24) is given by \begin{equation*} \left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} = \left ({\mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i}}\right)^{H}}\right]}\right)^{-1} \left ({\mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right]}\right)\tag{25}\end{equation*} View SourceRight-click on figure for MathML and additional features. The solution can be further written as a function of the channel matrices, input correlation and noise correlation matrices. The first part of (25) ($\mathbb {E}[\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} (\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i})^{H}]$ ) can be expanded as \begin{align*} \mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i}}\right)^{H}}\right]=&\check {{\underline {\text {F}}}}^{m}_{i} \mathbb {E} \left [{\left [{ \begin{array}{c} \mathbf {y}^{1}_{\left [{k}\right]} \\ \mathbf {y}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {y}^{M}_{\left [{k}\right]} \end{array} }\right] \left [{ \begin{array}{c} \mathbf {y}^{1}_{\left [{k}\right]} \\ \mathbf {y}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {y}^{M}_{\left [{k}\right]} \end{array} }\right]^{H}}\right] \check {{\underline {\text {F}}}}^{m^{H}}_{i} \\=&\check {{\underline {\text {F}}}}^{m}_{i} \mathbf {R_{yy}} \check {{\underline {\text {F}}}}^{m^{H}}_{i}\tag{26}\end{align*} View SourceRight-click on figure for MathML and additional features. The output correlation matrix $\mathbf {R_{yy}}$ is given as \begin{align*} \mathbf {R_{yy}} = \left [{\begin{matrix} \mathbb {E}\left [{\mathbf {y}^{1}_{\left [{k}\right]}\mathbf {y}^{1^{H}}_{\left [{k}\right]}}\right] & \mathbb {E}\left [{\mathbf {y}^{1}_{\left [{k}\right]} \mathbf {y}^{2^{H}}_{\left [{k}\right]}}\right] & \cdots & \mathbb {E}\left [{\mathbf {y}^{1}_{\left [{k}\right]} \mathbf {y}^{M^{H}}_{\left [{k}\right]}}\right] \\ \mathbb {E}\left [{\mathbf {y}^{2}_{\left [{k}\right]}\mathbf {y}^{1^{H}}_{\left [{k}\right]}}\right] & \mathbb {E}\left [{\mathbf {y}^{2}_{\left [{k}\right]} \mathbf {y}^{2^{H}}_{\left [{k}\right]}}\right] & \cdots & \mathbb {E}\left [{\mathbf {y}^{2}_{\left [{k}\right]} \mathbf {y}^{M^{H}}_{\left [{k}\right]}}\right] \\ \vdots & \vdots & \cdots & \vdots \\ \mathbb {E}\left [{\mathbf {y}^{M}_{\left [{k}\right]} \mathbf {y}^{1^{H}}_{\left [{k}\right]}}\right] & \mathbb {E}\left [{\mathbf {y}^{M}_{\left [{k}\right]} \mathbf {y}^{2^{H}}_{\left [{k}\right]}}\right] & \cdots & \mathbb {E}\left [{\mathbf {y}^{M}_{\left [{k}\right]} \mathbf {y}^{M^{H}}_{\left [{k}\right]}}\right] \end{matrix}}\right] \\{}\tag{27}\end{align*} View SourceRight-click on figure for MathML and additional features. which can be written as a function of the known input correlation and noise correlation matrices, assuming the noise on different lines are independent \begin{align*} \mathbb {E}\left [{\mathbf {y}^{p}_{\left [{k}\right]} \mathbf {y}^{q^{H}}_{\left [{k}\right]}}\right]=&\sum _{l = 1}^{M}\mathbf {H}^{p,l} \mathbf {R}_{\breve {\mathbf {x}}^{l}}\mathbf {H}^{q,l^{H}} \forall p \neq q \\ \mathbb {E}\left [{\mathbf {y}^{p}_{\left [{k}\right]} \mathbf {y}^{q^{H}}_{\left [{k}\right]}}\right]=&\sum _{l = 1}^{M}\mathbf {H}^{p,l} \mathbf {R}_{\breve {\mathbf {x}}^{l}}\mathbf {H}^{q,l^{H}} + \mathbf {R}_{\mathbf {n}^{p}} \forall p = q\tag{28}\end{align*} View SourceRight-click on figure for MathML and additional features. where, $\mathbf {R}_{\breve {\mathbf {x}}^{l}}$ is the input symbols correlation matrix $\mathbf {R}_{\breve {\mathbf {x}}^{l}} = \mathbb {E}[\breve {\mathbf {x}}^{l}_{[k]} \breve {\mathbf {x}}^{l^{H}}_{[k]}]$ and $\mathbf {R}_{\mathbf {n}^{p}}$ is the noise correlation matrix $\mathbf {R}_{\mathbf {n}^{p}} = \mathbb {E}[\mathbf {n}^{p}_{[k]} \mathbf {n}^{p^{H}}_{[k]}]$ .

The second part of the (25) ($\mathbb {E}[\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} ({X}^{m}_{(k),i})^{\ast}]$ ) can be expanded as \begin{align*} \mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right] = \mathbb {E}\left [{ \left ({\check {{\underline {\text {F}}}}^{m}_{i} \left [{ \begin{array}{c} \mathbf {y}^{1}_{\left [{k}\right]} \\ \mathbf {y}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {y}^{M}_{\left [{k}\right]} \end{array} }\right]}\right) \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right] \\ = \mathbb {E}\left [{ \left ({\check {{\underline {\text {F}}}}^{m}_{i}\left [{\begin{array}{c} \sum _{l = 1}^{M}\mathbf {H}^{1l} \breve {\mathbf {x}}^{l}_{\left [{k}\right]} + \mathbf {n}^{1}_{\left [{k}\right]} \\ \sum _{l = 1}^{M}\mathbf {H}^{2l} \breve {\mathbf {x}}^{l}_{\left [{k}\right]} + \mathbf {n}^{2}_{\left [{k}\right]} \\ \vdots \\ \sum _{l = 1}^{M}\mathbf {H}^{Ml} \breve {\mathbf {x}}^{l}_{\left [{k}\right]} + \mathbf {n}^{M}_{\left [{k}\right]} \end{array}}\right]}\right) \left ({\breve {\mathbf {x}}^{m}_{\left [{k}\right]}}\right)^{H} \mathbf {e}_{i} }\right]\tag{29}\end{align*} View SourceRight-click on figure for MathML and additional features. where $\mathbf {e}_{i} = [0 \,\,\cdots \,\,0 \,\,1\,\,0\,\,\cdots \,\,0]^{T}_{1\times 3N}$ , with the 1 in the $(N+i)^{th}$ position. Finally (29) can be rewritten as \begin{align*} \mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right] = \check {{\underline {\text {F}}}}^{m}_{i} \left [{ \begin{array}{c} \mathbf {H}^{1m} \mathbf {R}_{\breve {\mathbf {x}}^{m}}\mathbf {e}_{i} \\ \mathbf {H}^{2m}\mathbf {R}_{\breve {\mathbf {x}}^{m}} \mathbf {e}_{i} \\ \vdots \\ \mathbf {H}^{Mm} \mathbf {R}_{\breve {\mathbf {x}}^{m}} \mathbf {e}_{i} \end{array} }\right]\tag{30}\end{align*} View SourceRight-click on figure for MathML and additional features. Thus, the optimal solution for sparse MIMO PTEQ coefficients for user $m$ on tone $i$ is given as (31), at the bottom of the next page.

SECTION IV.

Diagonal MIMO PTEQ

A further reduction of the initialization computational complexity and memory requirement can be achieved by considering a diagonal MIMO PTEQ. In addition to achieving a reduction in the initialization computational complexity and memory requirement, a diagonal MIMO PTEQ can also be used in downstream scenarios, where no receiver coordination is possible. The diagonal MIMO PTEQ matrix elements for tone $i$ are then defined as \begin{align*} \mathbf {v}^{ml}_{i}=&\left [{ v^{ml}_{\left ({T-1}\right),i} v^{ml}_{\left ({T-2}\right),i} \cdots v^{ml}_{\left ({0}\right),i} }\right] \quad \forall m=l \\ \mathbf {v}^{ml}_{i}=&\left [{ \mathbf {0} }\right]_{1\times T} \quad \forall m \neq l\tag{32}\end{align*} View SourceRight-click on figure for MathML and additional features. Hence, the diagonal MIMO PTEQ coefficients for user $m$ on tone $i$ are given as \begin{equation*} \mathbf {v}^{m}_{i} = \left [{\mathbf {0} \mathbf {0} \cdots \mathbf {v}_{i}^{mm} \cdots \mathbf {0} }\right]\tag{33}\end{equation*} View SourceRight-click on figure for MathML and additional features. and the diagonal MIMO PTEQ matrix for tone $i$ is given as \begin{align*} \mathbf {V}_{i} = \left [{\begin{matrix} \mathbf {v}_{i}^{11} & \mathbf {0} & \cdots & \mathbf {0} \\ \mathbf {0} & \mathbf {v}_{i}^{22} & \cdots & \mathbf {0} \\ \vdots & \ddots & \\ \mathbf {0} & \mathbf {0}& \cdots & \mathbf {v}_{i}^{MM} \end{matrix}}\right]\tag{34}\end{align*} View SourceRight-click on figure for MathML and additional features. The estimated $k^{th}$ transmitted symbol for user $m$ on tone $i$ , with the diagonal MIMO PTEQ is given as \begin{equation*} \hat {X}^{m}_{\left ({k}\right),i} = \mathbf {v}^{m}_{i} \check {{\underline {\text {F}}}}_{i} \mathbf {y}_{\left [{k}\right]}\tag{35}\end{equation*} View SourceRight-click on figure for MathML and additional features. which, using (33) and (14) is equivalent to \begin{align*} \hat {X}^{m}_{\left ({k}\right),i}=&\mathbf {v}_{i}^{mm} {\underline {\text {F}}}_{i} \mathbf {y}^{m}_{\left [{k}\right]} \\=&\mathbf {v}^{mm}_{i}{\underline {\text {F}}}_{i} \left ({\left [{ \mathbf {H}^{m1} \mathbf {H}^{m2} \cdots \mathbf {H}^{mM} }\right]\left [{\begin{array}{c} \breve {\mathbf {x}}^{1}_{\left [{k}\right]} \\ \breve {\mathbf {x}}^{2}_{\left [{k}\right]} \\ \vdots \\ \breve {\mathbf {x}}^{M}_{\left [{k}\right]} \end{array}}\right]+ \mathbf {n}^{m}_{\left [{k}\right]} }\right)\tag{36}\end{align*} View SourceRight-click on figure for MathML and additional features. where ${\underline {\text {F}}}_{i}$ has $T$ rows (compared to $T+M-1$ rows for $\check {{\underline {\text {F}}}}^{m}_{i}$ in (23)). Similar to (24), the MMSE based optimization problem for the diagonal PTEQ filter coefficients for user $m$ on tone $i$ is given as \begin{equation*} \underset {\mathbf {\mathbf {v}^{mm}_{i}}}{\text{minimize }} \quad \mathbb {E}\left [{ \left |{ \mathbf {v}^{mm}_{i} {\underline {\text {F}}}_{i} \mathbf {y}^{m}_{\left [{k}\right]} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right]\tag{37}\end{equation*} View SourceRight-click on figure for MathML and additional features. The solution is given as \begin{equation*} \left ({\mathbf {v}^{mm}_{i}}\right)^{H} = \left ({\mathbb {E}\left [{{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i} \left ({{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i}}\right)^{H}}\right]}\right)^{-1} \left ({\mathbb {E}\left [{{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right]}\right)\tag{38}\end{equation*} View SourceRight-click on figure for MathML and additional features. where ${\dot {\mathbf {y}}}^{m}_{[k],i} = {\underline {\text {F}}}_{i} \mathbf {y}^{m}_{[k]}$ . The solution can be further written as a function of the channel matrices, input correlation and noise correlation matrices. The first part of (38) ($\mathbb {E}[{\dot {\mathbf {y}}}^{m}_{[k],i} ({\dot {\mathbf {y}}}^{m}_{[k],i})^{H}]$ ) can be expanded as \begin{align*} \mathbb {E}\left [{{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i} \left ({{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i}}\right)^{H}}\right]=&{\underline {\text {F}}}_{i} \mathbb {E} \left [{ \mathbf {y}^{m}_{\left [{k}\right]} \mathbf {y}^{m^{H}}_{\left [{k}\right]} }\right] {\underline {\text {F}}}^{H}_{i} \\=&{\underline {\text {F}}}_{i} \mathbf {R}_{\mathbf {y}^{m}\mathbf {y}^{m}} {\underline {\text {F}}}^{H}_{i}\tag{39}\end{align*} View SourceRight-click on figure for MathML and additional features.

The second part of (38) ($\mathbb {E}[{\dot {\mathbf {y}}}^{m}_{[k],i} ({X}^{m}_{(k),i})^{\ast}]$ ) can be expanded as \begin{align*} \mathbb {E}\left [{{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right]=&\mathbb {E}\left [{ \left ({{\underline {\text {F}}}_{i} \mathbf {y}^{m}_{\left [{k}\right]} }\right) \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right] \\=&\mathbb {E}\left [{ \left ({{\underline {\text {F}}}_{i} \sum _{l = 1}^{M}\mathbf {H}^{ml} \breve {\mathbf {x}}^{l}_{\left [{k}\right]} + \mathbf {n}^{m}_{\left [{k}\right]} }\right)\left ({\breve {\mathbf {x}}^{m}_{\left [{k}\right]}}\right)^{H} \mathbf {e}_{i} }\right] \\ \tag{40}\end{align*} View SourceRight-click on figure for MathML and additional features. which can be written as a function of known input correlation matrix \begin{equation*} \mathbb {E}\left [{ {\dot {\mathbf {y}}}^{m}_{i}\left [{k}\right] \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right] = {\underline {\text {F}}}_{i} \mathbf {H}^{mm} \mathbf {R}_{\breve {\mathbf {x}}^{m}} \mathbf {e}_{i}\tag{41}\end{equation*} View SourceRight-click on figure for MathML and additional features. Thus, the optimal solution for the diagonal PTEQ coefficients for user $m$ on tone $i$ is given as (42), at the bottom of the page.

It can be observed that the full MIMO PTEQ and the sparse MIMO PTEQ require signal coordination among users at the receiver side. Hence, the users are required to have their receivers physically co-located. The requirement of signal coordination can be observed from the formulation of the estimated $k^{th}$ transmitted symbol for user $m$ on tone $i$ for both the full MIMO PTEQ and the sparse MIMO PTEQ in (13) and (22), respectively. The required signal coordination at the receiver side is only possible in the upstream scenario where the receivers are co-located at the DPU. In the downstream scenario, since the users are not co-located, the signal coordination at the receiver side is not realizable. Therefore, the full MIMO PTEQ and sparse MIMO PTEQ can not be applied. However, the diagonal MIMO PTEQ does not require signal coordination, as can be observed from the formulation of the estimated $k^{th}$ transmitted symbol for user $m$ on tone $i$ with diagonal PTEQ in (36). Therefore, the diagonal MIMO PTEQ can be applied in both upstream and downstream scenarios.

SECTION V.

Initialization Computational Complexity

Computing the optimal sparse or diagonal MIMO PTEQ coefficients, can be based on (31) and (42), shown at the bottom of the page if first all the channel matrices (and noise correlation matrices) are estimated. Alternatively these coefficients can be computed based on received data correlations, i.e., with (25) and (38) (and (18) for full MIMO PTEQ). The latter option will be analysed here. Computing the coefficients can then be divided in 3 tasks: $(i)$ computing the SDFT of the received signals using the difference terms implementation, $(ii)$ computing the correlation matrices of the input signals to the PTEQ and subsequently $(iii)$ solving the sets of linear equations to compute the optimal MIMO PTEQ coefficients.

  1. Computing the SDFT of the received signals using the difference terms implementation, as described in (12), requires $M$ DFT operations and $M(T-1)$ complex add operations per DMT symbol. This task is common to all three discussed MIMO PTEQs.

The computational complexity of the remaining tasks is described below.

A. Full MIMO PTEQ

  1. Computing the correlation matrix $\mathbb {E}[{\ddot {\mathbf {y}}}_{[k],i} ({\ddot {\mathbf {y}}}_{[k],i})^{H}]$ requires $\mathcal {O}(M^{2}T^{2})$ arithmetic operations, where $MT$ is the length of the vector ${\ddot {\mathbf {y}}}_{[k],i}$ . For $i = 1\cdots N$ , the total required number of arithmetic operations becomes $\mathcal {O}(N M^{2}T^{2})$ .

  2. Solving (18) as $(\tilde {\mathbf {v}}^{m}_{i})^{H} = (\mathbb {E}[{\ddot {\mathbf {y}}}_{[k],i} ({\ddot {\mathbf {y}}}_{[k],i})^{H}])^{-1} \cdot (\mathbb {E}[{\ddot {\mathbf {y}}}_{[k],i} ({X}^{m}_{(k),i})^{\ast}])$ can be done with $\mathcal {O}\left({\frac {1}{3}M^{3}T^{3}}\right)$ arithmetic operations. For $i = 1\cdots N$ , the total required number of arithmetic operations becomes $\mathcal {O}\left({\frac {1}{3}N M^{3}T^{3}}\right)$ .

B. Sparse MIMO PTEQ

  1. Computing the correlation matrix $\mathbb {E}[\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} (\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i})^{H}]$ requires $\mathcal {O}((T+M-1)^{2})$ arithmetic operations, where $M+T-1$ is the length of the vector ${\ddot {\mathbf {y}}}^{m}_{[k],i}$ . For $i = 1 \cdots N$ and $m = 1\cdots M$ , the total required arithmetic operations becomes $\mathcal {O}(NM(T+M-1)^{2})$ .

  2. Solving (25) as $(\tilde {\mathbf {v}}^{m}_{i})^{H} = (\mathbb {E}[\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} (\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i})^{H}])^{-1}\cdot (\mathbb {E}[\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} ({X}^{m}_{(k),i})^{\ast}])$ can be done with $\mathcal {O}\left({\frac {1}{3}(T+M-1)^{3}}\right)$ arithmetic operations. For $i = 1\cdots N$ and $m = 1 \cdots M$ , the total required number of arithmetic operations becomes $\mathcal {O}\left({\frac {1}{3}NM(T+M-1)^{3}}\right)$ .

C. Diagonal MIMO PTEQ

  1. Computing the correlation matrix $\mathbb {E}[{\dot {\mathbf {y}}}^{m}_{[k],i} ({\dot {\mathbf {y}}}^{m}_{[k],i})^{H}]$ requires $\mathcal {O}(T^{2})$ arithmetic operations, where $T$ is the length of the vector ${\dot {\mathbf {y}}}^{m}_{[k],i}$ . For $i = 1\cdots N$ and $m = 1\cdots M$ , the total required number of arithmetic operations becomes $\mathcal {O}(NMT^{2})$ .

  2. Solving (38) as $(\mathbf {v}^{mm}_{i})^{H} = (\mathbb {E}[{\dot {\mathbf {y}}}^{m}_{[k],i} ({\dot {\mathbf {y}}}^{m}_{[k],i})^{H}])^{-1} \cdot (\mathbb {E}[{\dot {\mathbf {y}}}^{m}_{[k],i} ({X}^{m}_{(k),i})^{\ast}])$ can be done with $\mathcal {O}\left({\frac {1}{3}T^{3}}\right)$ arithmetic operations. For $i = 1\cdots N$ and $m = 1 \cdots M$ , the total required number of arithmetic operations becomes $\mathcal {O}\left({\frac {1}{3}NMT^{3}}\right)$ .

Fig. 5 shows the % ratio of the initialization computational complexity of a sparse MIMO PTEQ and a diagonal MIMO PTEQ compared to a full MIMO PTEQ, for different MIMO PTEQ orders. The % ratio is calculated based on the computationally most expensive task in computing the optimal MIMO PTEQ coefficients, which corresponds to solving (18), (25) and (38) for the full, sparse and diagonal MIMO PTEQ, respectively. The % ratio is calculated as:\begin{align*} \% \text{ratio}=&\dfrac {\text{complexity (sparse or diagonal MIMO PTEQ)}}{\text{complexity (full MIMO PTEQ)}} \\&\times 100.\tag{43}\end{align*} View SourceRight-click on figure for MathML and additional features.
FIGURE 4. - 
$2 \times 2$
 sparse MIMO PTEQ for tone 
$i$
.
FIGURE 4.

$2 \times 2$ sparse MIMO PTEQ for tone $i$ .

FIGURE 5. - % ratio of initialization computational complexity of sparse MIMO PTEQ with PTEQ order-1 (
$-\!\!\!-\!\!\!-\!\!\!\!\!|$
), order-3 (
$-\!\!\!-\!\!\!-\!\!\!\!\!\diamond$
), order-7 (
$-\!\!\!-\!\!\!-\!\!\!\!\!\circ$
), diagonal MIMO PTEQ (__) and MIMO TEQ order-7 (__) compared to a full MIMO PTEQ with the same filter order.
FIGURE 5.

% ratio of initialization computational complexity of sparse MIMO PTEQ with PTEQ order-1 ($-\!\!\!-\!\!\!-\!\!\!\!\!|$ ), order-3 ($-\!\!\!-\!\!\!-\!\!\!\!\!\diamond$ ), order-7 ($-\!\!\!-\!\!\!-\!\!\!\!\!\circ$ ), diagonal MIMO PTEQ (__) and MIMO TEQ order-7 (__) compared to a full MIMO PTEQ with the same filter order.

Fig. 5 also includes the % ratio of the initialization computational complexity of a UNCDc-Zxc design method based MIMO TEQ (order 7) [9] compared to a full MIMO PTEQ (order 7). The lower orders of MIMO TEQ shows much higher % ratio of the initialization computational complexity compared to a full MIMO PTEQ of the same order and hence are omitted from Fig. 5. Furthermore, since, the computationally most expensive task in computing the optimal MIMO PTEQ coefficients for both the diagonal MIMO PTEQ and full MIMO PTEQ depends on $T^{3}$ , it can be observed that the % ratio of the initialization computational complexity for the diagonal MIMO PTEQ is independent of the equalizer order. Finally, it can be noticed that the sparse MIMO PTEQ provides a reduction in the initialization computational complexity compared to a full MIMO PTEQ for all filter orders larger than 1. For filter order 1, it can be observed from (43) that the sparse MIMO PTEQ has a higher initialization computational complexity compared to a full MIMO PTEQ. This is because, for the sparse MIMO PTEQ, the filter coefficients are computed separately for each user $m$ (25). However, for the full MIMO PTEQ the filter coefficients for all the users are computed jointly (18). Therefore in the sparse MIMO PTEQ, for lower filter orders, the increase in the computational cost caused by the separate computations of filter coefficients for each user becomes more prominent than the decrease in the computational cost caused by the reduction in the required filter coefficients to be computed, compared to a full MIMO PTEQ.

SECTION VI.

Memory Requirement

In comparison to a MIMO TEQ, a full MIMO PTEQ needs to store $N$ times more filter coefficients, which hinders its applicability in practical scenarios. Compared to a full MIMO PTEQ, the proposed sparse MIMO PTEQ and the diagonal MIMO PTEQ show a significant reduction in the memory requirement to store the PTEQ coefficients $(\mathbf {V}_{i})$ . A full MIMO PTEQ as shown in (15), requires $NM^{2}T$ coefficients to be stored. A sparse MIMO PTEQ, as shown in (21), requires $NM(M+T-1)$ coefficients to be stored. A diagonal MIMO PTEQ, as shown in (34) requires only $NMT$ coefficients to be stored. An equivalent reduction can be seen in the runtime computational complexity. A full MIMO PTEQ performs $NM^{2}T$ complex multiplications to compute the filtered outputs, while a sparse and a diagonal MIMO PTEQ perform $NM(M+T-1)$ and $NMT$ complex multiplications respectively to compute the filtered outputs. Fig. 6 shows the % ratio of memory requirement and the runtime computational complexity for a sparse MIMO PTEQ and a diagonal MIMO PTEQ compared to a full MIMO PTEQ, for different MIMO PTEQ orders. The % ratio in Fig. 6 is based on (43) for memory requirement and runtime complexity. Similar to the initialization computational complexity ratio in Fig. 5, it can be noted that the % ratio in the memory requirement and the runtime computational complexity for the diagonal MIMO PTEQ is also independent of the equalizer order, as the memory requirement and the runtime computational complexity for both the diagonal MIMO PTEQ and full MIMO PTEQ depends on $T$ .

FIGURE 6. - % ratio in memory requirement (and runtime computational complexity) of sparse MIMO PTEQ with PTEQ order-1 (
$-\!\!\!-\!\!\!-\!\!\!\!\!|$
), order-3 (
$-\!\!\!-\!\!\!-\!\!\!\!\!\diamond$
), order-7 (
$-\!\!\!-\!\!\!-\!\!\!\!\!\circ$
) and diagonal MIMO PTEQ (__) compared to a full MIMO PTEQ with the same filter order.
FIGURE 6.

% ratio in memory requirement (and runtime computational complexity) of sparse MIMO PTEQ with PTEQ order-1 ($-\!\!\!-\!\!\!-\!\!\!\!\!|$ ), order-3 ($-\!\!\!-\!\!\!-\!\!\!\!\!\diamond$ ), order-7 ($-\!\!\!-\!\!\!-\!\!\!\!\!\circ$ ) and diagonal MIMO PTEQ (__) compared to a full MIMO PTEQ with the same filter order.

SECTION VII.

Results

The simulation setup is kept similar to [9]. The G.fast 106b profile [28] is considered here for the simulation of a $5 \times 5$ MIMO DSL system, i.e., a 5-line DSL system with 2048 tones. A total transmit power of $8 dBm$ and a noise power of $-140 \,\,dBm/Hz$ is considered. A practical approach is chosen for the transmit power distribution over tones as follows. Initially, the power is allocated to tones according to the power spectral density (PSD) mask specification [29]. Based on that, a PTEQ filter is designed and the number of bits that can be transmitted on each tone is calculated. The tones for which the number of transmitted bits is smaller than 1, are rejected and left unused. The remaining power is finally distributed over the used tones,while making sure not to violate the PSD mask constraints [29], and then the PTEQ filter coefficients are updated one last time. The simulations are performed for both a theoretical channel model and measured channels. The theoretical channel (length 600m) is based on the KHM model, suggested in [30], while the measured channel data corresponds to cable binders of two Tier-1 operators (channel 1 of length 728m and channel 2 of length 600m). The data rates are computed with a bit-cap of 14 bits. Moreover for the simulations, the input correlation matrices ($\mathbf {R}_{\breve {\mathbf {x}}^{j}}$ ) are considered to be diagonal, assuming that the transmitted symbols $\ddot {\mathbf {x}}^{j}_{(k)}$ are proper,—i.e., that their real and imaginary parts are uncorrelated and have equal variance—and are independent over users, tones, and time. Furthermore, we assume perfect knowledge of the channel state information (CSI). This assumption is standard in wireline communication systems, especially in DSL systems [31]. Table 1 summarizes the simulation parameters.

TABLE 1 Simulation Parameters
Table 1- 
Simulation Parameters

Fig. 7(a), Fig. 7(b) and Fig. 7(c) compare the performance of the full MIMO PTEQ with the proposed sparse MIMO PTEQ in terms of total achieved rates for a five line DSL system, for different channels. The rates are calculated over a range of synchronisation delays, to also characterize the robustness of the MIMO PTEQ design against the synchronization delay. From the simulation results, it can be noticed that the sparse PTEQ attains similar performance as the full MIMO PTEQ, while providing a reduction in the initialization computational complexity and the memory requirement. As discussed in Section III, the ISI and ICI power caused by crosstalk channels is expected to be small. Hence, ignoring the crosstalk ISI and ICI cancellation indeed does not significantly affect the performance of the MIMO PTEQ and a single tap equalizer for crosstalk cancellation works similar to a T tap equalizer. Furthermore, it can be observed from Fig. 7(a), Fig. 7(b) and Fig. 7(c) that the performance of the full MIMO PTEQ and sparse MIMO PTEQ shows insignificant improvement, while increasing the order from PTEQ order 1 to PTEQ order 7. This suggests that the channel impulse responses used for the simulations can be approximated with a rational transfer function that has few poles and an order-1 MIMO PTEQ is already good enough to cancel them. An impulse response whose approximated rational transfer function has a higher number of poles will require a higher order PTEQ, and the system performance will not saturate at order-1 MIMO PTEQ.

FIGURE 7. - Performance comparison of MIMO PTEQ structures (and MIMO TEQ-3 (__) [9]) for different DSL channels and filter orders (CP = 128).
FIGURE 7.

Performance comparison of MIMO PTEQ structures (and MIMO TEQ-3 (__) [9]) for different DSL channels and filter orders (CP = 128).

Fig. 7(d), Fig. 7(e) and Fig. 7(f) compare the performance of the full MIMO PTEQ and the proposed diagonal MIMO PTEQ. Though the diagonal PTEQ further reduces the initialization computational complexity and memory requirement, an average performance drop of 29.3% is observed in the absence of crosstalk cancellation. This drop in performance can likely be avoided by using crosstalk precompensation, also referred to as downstream vectoring, at the transmitter side, but designing the vectoring together with the diagonal MIMO PTEQ is complex and remains as a topic for future work.

Fig. 8 extends the results of Fig. 7 with the same channels and simulation parameters but with a CP length of 64 samples. Finally, Fig. 9 provides simulation results for non-LR-G.fast channels. The theoretical channel (length 300m) is again based on the KHM model, suggested in [30], while the measured channel data corresponds to cable binders of two Tier-1 operators (channel 1 of length 250m and channel 2 of length 300m). The simulation parameters are kept the same (Table 1) with a CP length of 64 samples. The results in Fig. 8 and Fig. 9 further bolster the conclusions drawn from Fig. 7.

FIGURE 8. - Performance comparison of MIMO PTEQ structures (and MIMO TEQ-3 (__) [9]) for different DSL channels and filter orders (CP = 64).
FIGURE 8.

Performance comparison of MIMO PTEQ structures (and MIMO TEQ-3 (__) [9]) for different DSL channels and filter orders (CP = 64).

FIGURE 9. - Performance comparison of MIMO PTEQ structures (and MIMO TEQ-3 (__) [9]) for different DSL channels (non-LR-G.fast) and filter orders (CP = 64).
FIGURE 9.

Performance comparison of MIMO PTEQ structures (and MIMO TEQ-3 (__) [9]) for different DSL channels (non-LR-G.fast) and filter orders (CP = 64).

SECTION VIII.

Conclusion

The paper is motivated by the potential use of a MIMO PTEQ in newly emerging LR-xDSL and tackles the large initialization computational complexity and memory requirement for the full MIMO PTEQ. A specific structure in the MIMO DSL channel is exploited, namely that the combined ISI and ICI signal power from the crosstalk channels is significantly lower than the desired and combined ISI and ICI signal power from the direct channels, to derive a very low complexity/memory solution, referred as sparse MIMO PTEQ, with negligible impact (≈ 0.5% drop) on the performance compared to a full MIMO PTEQ. For a conventional DSL binder size of 16 lines and a PTEQ order of 3, the proposed sparse MIMO PTEQ operates at 42% of the initialization computational complexity and 29.7% of the memory requirement, with negligible performance degradation, compared to a full MIMO PTEQ. Even larger reductions in initialization computational complexity and memory requirement are achieved by the proposed diagonal MIMO PTEQ, which operates at 0.4% of the initialization computational complexity and 6.25% of the memory requirement compared to a full MIMO PTEQ. It also allows the MIMO PTEQ implementation in upstream as well as downstream scenarios. However, in absence of crosstalk cancellation the performance of diagonal MIMO PTEQ drops significantly compared to a full MIMO PTEQ. This drop in performance can likely be avoided by using crosstalk precompensation (i.e., downstream vectoring) at the transmitter side. Finally, the applicability of the proposed models could potentially be interesting for wireless systems as well, which are characterized by a similar structure.

Appendix

Optimal MIMO PTEQ coefficients

The optimization problem (MMSE) for optimal PTEQ filter coefficients for user $m$ and tone $i$ is given as \begin{equation*} \underset {\tilde {\mathbf {v}}^{\mathbf {m}}_{\mathbf {i}}}{\text{minimize }} \mathbb {E}\left [{ \left |{ \tilde {\mathbf {v}}^{m}_{i}{\ddot {\mathbf {y}}}_{\left [{k}\right],i} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right]\tag{44}\end{equation*} View SourceRight-click on figure for MathML and additional features. where ${\ddot {\mathbf {y}}}_{[k],i}= \check {{\underline {\text {F}}}}_{i} \cdot \check {\mathbf {y}}_{[k]} $ . The mean squared error can be expanded as \begin{align*} \varepsilon ^{m}_{i}=&\mathbb {E}\left [{\left ({\tilde {\mathbf {v}}^{m}_{i} {\ddot {\mathbf {y}}}_{\left [{k}\right],i} - {X}^{m}_{\left ({k}\right),i} }\right)\left ({\tilde {\mathbf {v}}^{m}_{i} {\ddot {\mathbf {y}}}_{\left [{k}\right],i} - {X}^{m}_{\left ({k}\right),i} }\right)^{H}}\right] \\=&\tilde {\mathbf {v}}^{m}_{i}\mathbb {E}\left [{ {\ddot {\mathbf {y}}}_{\left [{k}\right],i} \left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H}}\right]\left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} - \tilde {\mathbf {v}}^{m}_{i}\mathbb {E}\left [{ {\ddot {\mathbf {y}}}_{\left [{k}\right],i}\left ({{X}^{m}_{\left ({k}\right),i}}\right)^{H}}\right] \\&- \mathbb {E}\left [{{X}^{m}_{\left ({k}\right),i} \left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H}}\right]\left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} + \mathbb {E}\left [{{X}^{m}_{\left ({k}\right),i}\left ({{X}^{m}_{\left ({k}\right),i}}\right)^{H}}\right]\tag{45}\end{align*} View SourceRight-click on figure for MathML and additional features. Taking the derivative of the mean squared error with respect to $(\tilde {\mathbf {v}}^{m}_{i})^{H}$ yields \begin{align*} \frac {\partial \varepsilon ^{m}_{i}}{\partial \left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H}}=&2\mathbb {E}\left [{{\ddot {\mathbf {y}}}_{\left [{k}\right],i}\left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H} }\right]\left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} - 2\mathbb {E}\left [{{\ddot {\mathbf {y}}}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{H}}\right] \\{}\tag{46}\end{align*} View SourceRight-click on figure for MathML and additional features. Equating the derivative to 0 to find the optimal $\tilde {\mathbf {v}}^{m}_{i}$ yields \begin{equation*} \left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} = \mathbb {E}\left [{{\ddot {\mathbf {y}}}_{\left [{k}\right],i}\left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H} }\right]^{-1}\mathbb {E}\left [{ {\ddot {\mathbf {y}}}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right].\tag{47}\end{equation*} View SourceRight-click on figure for MathML and additional features.

References

References is not available for this document.