Introduction
Digital subscriber line (DSL) technologies hold a major market-share of broadband communication [1]. They are based on discrete multi-tone (DMT) modulation, which is a multicarrier modulation technique similar to orthogonal frequency division multiplexing (OFDM). DMT is used in wireline communication, where channels are slowly time-varying and the noise in the channel is assumed stationary. Hence channels are assumed to be known after an initial training and quasi-static. This allows DMT to use bit-loading and adaptive power loading in addition to basic OFDM operation. In DMT on the transmitter side, the input bits to be transmitted are converted into parallel bit streams, each stream to be carried on a carrier or tone (a discrete sub-band in the available bandwidth). In each stream, the bits are converted into higher order QAM symbols (up to 16384-QAM [2]), which are then modulated on to the discrete tones by an inverse discrete Fourier transform (IDFT) operation to obtain time-domain symbols. A cyclic prefix (CP) is added which—provided that its length at least matches the length of the channel impulse response (CIR), and the transmitter and receiver are properly synchronized—eliminates inter-symbol interference (ISI) between successive time-domain DMT symbols and inter-carrier interference (ICI) between the tones of the same DMT symbol. On the receiver side, the CP is removed from the received time-domain symbols and subsequently a discrete Fourier transform (DFT) is applied. Finally, the received QAM symbols are equalized with a single-tap complex frequency-domain equalizer (FEQ).
In an ideal scenario, the combination of CP and FEQ works perfectly to counter ISI and ICI. In older generations of DSL, such as ADSL [3] (and its later versions ADSL2 and ADSL2+), the allowed loop lengths ranged up to 5000 meters and the available channel bandwidth ranged up to only a few MHz. Hence, these older generations have been characterized by a long CIR and low crosstalk levels enabling a per-line single-input single-output (SISO) design. Although the use of a similarly long CP eliminates ISI and ICI, it also increases the DMT symbol length, thereby introducing an overhead and decreasing the achievable throughput. An efficient way to deal with this problem has been the use of a channel shortening filter—commonly known as a time-domain equalizer (TEQ) [4]. However in later generations of DSL (e.g., VDSL2 [5], G.fast [6], G.mgfast [7]) the copper loop lengths have been effectively shortened with the deployment of fibre to the distribution point unit (DPU), The typical loop lengths for VDSL2, G.fast and G.mgfast are 1200m, 250m and 100m, respectively. Hence, channel shortening has not been used in the later generations of DSL, confining its use to ADSL.
Recently long reach VDSL2 (LR-VDSL2) has been proposed with the purpose of providing high data rates (possibly up to
These long reach extensions of existing DSL standards (LR-xDSL) will enable VDSL2 and G.fast DPUs to be connected to lines with a wider variety of lengths. All the lines connected to the same DPU have to use the same CP length in order to simplify the system and allow for efficient vectoring. If the CP length is chosen according to the longest line length then shorter lines may undergo a substantial throughput loss in order to enable ISI/ICI-free transmission on the longer lines, further motivating the use of a TEQ for LR-xDSL. However, since crosstalk can no longer be neglected in VDSL2 and later generations of DSL, a multiple-input multiple-output (MIMO) TEQ design is required for a joint shortening of both direct and crosstalk channels. Nevertheless, in the case of LR-VDSL2, besides ISI/ICI, there is an additional issue of echo and near-end crosstalk (NEXT) generation when considering a too short CP. This can be partly solved by echo cancellation techniques [10]–[12]. Moreover, it is emphasized that the main application use-case of the paper is LR-G.fast, in which this issue of NEXT and echo does not appear. For ISI/ICI cancellation with short CP, the MIMO TEQ has been shown to do a fairly good job in terms of shortening the CIRs [9]. However, the TEQ design procedure is generally not related to true bitrate optimization of the system. Therefore, a good TEQ in terms of the channel shortening, may indeed not guarantee a corresponding significant improvement in bitrates. In [13], a SISO bitrate maximizing TEQ (BM-TEQ) has been proposed to maximize the total bitrate for a given filter order, but this comes with a large initialization computational complexity due to a non-linear non-convex cost function. Similarly, in [14], [15] a genetic algorithm based blind channel shortening equalizer structure has been proposed, for the optimal SISO TEQ coefficients search. However again, a large initialization computational complexity is incurred. Moreover, the bitrate achieved by a TEQ is known to be sensitive to the synchronization delay and their relation is non-smooth [16].
A MIMO per-tone equalizer (PTEQ) [17], [18] provides a solution to these problems faced by a MIMO TEQ. In a MIMO PTEQ, the TEQ is moved to after the DFT operations at the receiver side. Hence, each tone has its own filter, designed to maximize the signal-to-noise ratio (SNR) by solving a minimum mean squared error (MMSE) problem for each tone separately. Not only does a PTEQ provide an upper limit to the performance of a TEQ, it also yields a relation between SNR and synchronisation delay that is smooth and predictable [18], thereby simplifying determining the optimal synchronization delay. Furthermore, it has been shown that the runtime computational complexity of the MIMO PTEQ is equal or less than the runtime computational complexity of a MIMO TEQ [18]. Moreover, recently in [19] a transmitter side per-tone precoder has been proposed, which can be applied at the DPU in a downstream scenario, thus reducing the computational load on customer-premises equipment (CPE). Finally, apart from ISI/ICI cancellation, the MIMO PTEQ also performs crosstalk cancellation, removing the need for a separate entity (vectoring) for crosstalk cancellation [20]–[22].
A. Main Contributions
Despite the advantages of the MIMO PTEQ, a large initialization computational complexity and memory requirement hinder its applicability in practical scenarios. To tackle this problem, a specific structure in the MIMO DSL channel, namely that the combined ISI and ICI signal power from the crosstalk channels is significantly lower than the desired and combined ISI and ICI signal power from the direct channels, is exploited in deriving two new MIMO PTEQ structures, that reduce the initialization computational complexity and memory requirement of a full MIMO PTEQ. The two proposed structures are the sparse MIMO PTEQ and diagonal MIMO PTEQ.
The proposed sparse MIMO PTEQ results in negligible loss of performance. From the simulation results obtained in Section VII it is observed that for an order-1 full MIMO PTEQ and an order-1 sparse MIMO PTEQ, the average difference in achieved data rate, over all synchronization delays, is only ≈ 0.5%. The average difference in the peak data rate is only ≈ 0.1%.
The proposed diagonal MIMO PTEQ results a in further reduction of the initialization computational complexity, runtime computational complexity and memory requirement at the cost of a small performance loss. The diagonal MIMO PTEQ has the additional benefit that it can be applied in both upstream and downstream scenarios, in contrast to the full and sparse MIMO PTEQ structures which can be used only in upstream scenarios.
A similar reduction in the initialization computational complexity and memory requirement has been proposed for the MIMO TEQ in [9]. The basic ideas proposed in this paper and in [9] are similar in that both papers aim at reducing the computational complexity and memory requirement, for MIMO PTEQ and MIMO TEQ, respectively. However, the MIMO TEQ, discussed in [9], aims at channel shortening while the MIMO PTEQ aims at bit rate maximization. Therefore, the underlying system equations, constrained optimization problems and cost functions are very different in the two papers.
B. Organization and Notation
The paper is organized as follows: Section II provides a brief overview of the MIMO DSL system model and reviews the full MIMO PTEQ design equations. Section III and Section IV present the sparse and diagonal MIMO PTEQ structure, respectively. Section V and Section VI present the initialization computational complexity and memory requirement of proposed MIMO PTEQ structures, respectively. Section VII reports the results and finally Section VIII concludes the paper.
Lower-case and upper-case boldface letters are used to denote vectors (row vectors and column vectors) and matrices, respectively, with
MIMO PTEQ
The system considered is a cable binder with
Hence, the time-domain received signal for user


The MIMO PTEQ, as described in [18], is an \begin{equation*} {\ddot {\mathbf {y}}}^{m}_{\left [{k}\right],i} = \mathbf {F}_{i} \mathbf {y}^{m}_{\left [{k}\right]}\tag{4}\end{equation*}
\begin{align*} \mathbf {F}_{i} = \left [{\begin{array}{ccccccc} \boxed{\text{row}_{i}\left ({\mathbf {\mathbb {F}_{N}}}\right)} &{}\cdots &0 \\ 0 \boxed{\text{row}_{i}\left ({\mathbf {\mathbb {F}_{N}}}\right)} & \cdots &0 \\ \vdots & \ddots & \vdots \\ 0&{}\cdots & \boxed{\text{row}_{i}\left ({\mathbf {\mathbb {F}_{N}}}\right)} \\ \end{array} }\right]\tag{5}\end{align*}
\begin{align*} \underbrace {\left [{ \begin{array}{c} {\ddot {\mathbf {y}}}^{1}_{\left [{k}\right],i} \\ {\ddot {\mathbf {y}}}^{2}_{\left [{k}\right],i} \\ \vdots \\ {\ddot {\mathbf {y}}}^{M}_{\left [{k}\right],i} \end{array} }\right] }_{{\ddot {\mathbf {y}}}_{\left [{k}\right],i}} = \underbrace {\left [{\begin{array}{ccccc} \mathbf {F}_{i} & \mathbf {0}& \cdots & \mathbf {0} \\ \mathbf {0} & \mathbf {F}_{i} & \cdots & \mathbf {0} \\ \vdots & \ddots & \ddots & \vdots \\ \mathbf {0} & \cdots & \cdots & \mathbf {F}_{i}\\ \end{array} }\right]}_{\check {\mathbf {F}}_{i}} \underbrace {\left [{ \begin{array}{c} \mathbf {y}^{1}_{\left [{k}\right]} \\ \mathbf {y}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {y}^{M}_{\left [{k}\right]} \end{array} }\right]}_{ \mathbf {y}_{\left [{k}\right]}}\tag{6}\end{align*}
\begin{equation*} \hat {X}^{m}_{\left ({k}\right),i} = \mathbf {w}^{m}_{i} {\ddot {\mathbf {y}}}_{\left [{k}\right],i}\tag{7}\end{equation*}
\begin{equation*} \mathbf {w}^{m}_{i} = \left [{\mathbf {w}^{m1}_{i} \mathbf {w}^{m2}_{i} \cdots \mathbf {w}^{mm}_{i} \cdots \mathbf {w}^{mM}_{i} }\right]\tag{8}\end{equation*}
\begin{equation*} \mathbf {w}^{ml}_{i} = \left [{ w^{ml}_{\left ({T-1}\right),i} w^{ml}_{\left ({T-2}\right),i} \cdots w^{ml}_{\left ({0}\right),i} }\right]\tag{9}\end{equation*}
\begin{align*} \mathbf {W}_{i} = \left [{\begin{matrix} \mathbf {w}_{i}^{11} & \mathbf {w}_{i}^{12} & \cdots & \mathbf {w}_{i}^{1M} \\ \mathbf {w}_{i}^{21} & \mathbf {w}_{i}^{22} & \cdots & \mathbf {w}_{i}^{2M} \\ \vdots & \ddots & \\ \mathbf {w}_{i}^{M1} & \mathbf {w}_{i}^{M2}& \cdots & \mathbf {w}_{i}^{MM} \end{matrix}}\right]\tag{10}\end{align*}
\begin{align*} \hat {X}^{m}_{\left ({k}\right),i}=&\mathbf {w}^{m}_{i} \check {\mathbf {F}}_{i} \left ({\left [{\begin{array}{cccc} \mathbf {H}^{11} & \mathbf {H}^{12} & \cdots & \mathbf {H}^{1M} \\ \mathbf {H}^{21} & \mathbf {H}^{22} & \cdots & \mathbf {H}^{2M} \\ \vdots & \vdots & \cdots & \vdots \\ \mathbf {H}^{M1} & \mathbf {H}^{M2} & \cdots & \mathbf {H}^{MM} \end{array} }\right]\left [{ \begin{array}{c} \breve {\mathbf {x}}^{1}_{\left [{k}\right]} \\ \breve {\mathbf {x}}^{2}_{\left [{k}\right]} \\ \vdots \\ \breve {\mathbf {x}}^{M}_{\left [{k}\right]} \end{array} }\right] }\right. \\&\qquad \qquad \left.{+ \left [{ \begin{array}{c} \mathbf {n}^{1}_{\left [{k}\right]} \\ \mathbf {n}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {n}^{M}_{\left [{k}\right]} \end{array} }\right] }\right)\tag{11}\end{align*}
Based on (5) and (6), it can be observed that \begin{align*} \mathbf {F}_{i}=\underbrace {\left [{\begin{array}{cccc} \alpha ^{0i} & \cdots & 0 \\ \vdots & \ddots & \vdots \\ \alpha ^{\left ({T-1}\right)i} & \cdots & \alpha ^{0i}\\ \end{array} }\right]}_{\mathbf {L}_{i}} \underbrace {\left [{ \begin{array}{cc|c} \,\,\,\,\text{row}_{i}\left ({\mathbf {\mathbb {F}_{N}}}\right) & \mathbf {0}\\ \hline -\alpha ^{i} \mathbf {I}_{T-1} & \mathbf {0} & \alpha ^{i} \mathbf {I}_{T-1} \end{array}}\right]}_{{\underline {\text{F}}}_{i}} \\\tag{12}\end{align*}
\begin{equation*} \hat {X}^{m}_{\left ({k}\right),i} = \mathbf {v}^{m}_{i} \check {{\underline {\text {F}}}}_{i} \mathbf {y}_{\left [{k}\right]}\tag{13}\end{equation*}
\begin{align*} \check {{\underline {\text {F}}}}_{i} = \left [{\begin{array}{ccccc} {\underline {\text {F}}}_{i} & \mathbf {0}& \cdots & \mathbf {0} \\ \mathbf {0} & {\underline {\text {F}}}_{i} & \cdots & \mathbf {0} \\ \vdots & \ddots & \ddots & \vdots \\ \mathbf {0} & \cdots & \cdots & {\underline {\text {F}}}_{i}\\ \end{array} }\right]\tag{14}\end{align*}
\begin{align*} \mathbf {V}_{i} = \left [{\begin{matrix} \mathbf {v}_{i}^{11} & \mathbf {v}_{i}^{12} & \cdots & \mathbf {v}_{i}^{1M} \\ \mathbf {v}_{i}^{21} & \mathbf {v}_{i}^{22} & \cdots & \mathbf {v}_{i}^{2M} \\ \vdots & \ddots & \\ \mathbf {v}_{i}^{M1} & \mathbf {v}_{i}^{M2}& \cdots & \mathbf {v}_{i}^{MM} \end{matrix}}\right]\tag{15}\end{align*}
\begin{equation*} \mathbf {v}^{ml}_{i} = \left [{ v^{ml}_{\left ({T-1}\right),i} v^{ml}_{\left ({T-2}\right),i} \cdots v^{ml}_{\left ({0}\right),i} }\right]\tag{16}\end{equation*}
\begin{align*} \underset {\mathbf {\mathbf {v}^{m}_{i}}}{\text{minimize }}&\mathbb {E}\left [{ \left |{ \hat {X}^{m}_{\left ({k}\right),i} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right] \\&\left ({or}\right) \\ \underset {\mathbf {\mathbf {v}^{m}_{i}}}{\text{minimize }}&\mathbb {E}\left [{ \left |{ \mathbf {v}^{m}_{i} \check {{\underline {\text {F}}}}_{i} \mathbf {y}_{\left [{k}\right]} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right]\tag{17}\end{align*}
\begin{equation*} \left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} = \mathbb {E}\left [{{\ddot {\mathbf {y}}}_{\left [{k}\right],i}\left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H} }\right]^{-1}\mathbb {E}\left [{ {\ddot {\mathbf {y}}}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right]\tag{18}\end{equation*}
Furthermore, it is to be noticed that if the CP length is too short, the received signals after the DMT demodulation (before a soft/hard decision operation) are generally improper [24], [25]. However, [26] proposed a widely-linear PTEQ (WL-PTEQ) considering impropriety of the received signals and observed that the achieved bitrates for various CP lengths (shorter than CIR) roughly coincide for the PTEQ and WL-PTEQ for equalizer order greater than 0. Therefore, this paper ignores the impropriety of the received signals after DMT demodulation.
Sparse MIMO PTEQ
The full MIMO PTEQ design described in Section II thus circumvents the difficult bitrate maximizing MIMO TEQ design: The minimization in (17) directly corresponds to the maximization of the achievable bitrate for user
For MIMO DSL systems (without a TEQ/PTEQ), the combined ISI and ICI signal power from the crosstalk channels is negligible compared to the desired and the combined ISI and ICI power from the direct channels. Fig. 3 shows this structure for a
Power distribution as desired signal power and combined ISI and ICI power for two measured
However, comparatively, the combined ISI and ICI signal power from the crosstalk channels is negligible, especially in the lower frequencies, which is generally the high SNR region in DSL systems and hence contributes most to the data rate. Therefore, this structure can be exploited to significantly simplify the full MIMO PTEQ without impacting the performance, namely by reducing the MIMO PTEQ coefficients responsible for crosstalk ISI and ICI cancellation to a single tap scalar. The sparse MIMO PTEQ matrix elements for tone \begin{align*} \mathbf {v}^{ml}_{i}=&\left [{ v^{ml}_{\left ({T-1}\right),i} v^{ml}_{\left ({T-2}\right),i} \cdots v^{ml}_{\left ({0}\right),i} }\right] \quad \forall m=l \\ \mathbf {v}^{ml}_{i}=&\left [{ v^{ml}_{\left ({0}\right),i} 0 \cdots 0 }\right] \equiv v^{ml}_{i} \underline {{\mathbf {e}} } \quad \forall m \neq l\tag{19}\end{align*}
\begin{equation*} \mathbf {v}^{m}_{i} = \left [{v^{m1}_{i} \underline {{\mathbf {e}} } v^{m2}_{i} \underline {{\mathbf {e}} } \cdots \mathbf {v}_{i}^{mm} \cdots v^{mM}_{i} \underline {{\mathbf {e}} } }\right]\tag{20}\end{equation*}
\begin{align*} \mathbf {V}_{i}=\left [{\begin{matrix} \mathbf {v}_{i}^{11} & v^{12}_{i} \underline {{\mathbf {e}} } & \cdots & v^{1M}_{i} \underline {{\mathbf {e}} } \\ v^{21}_{i} \underline {{\mathbf {e}} } & \mathbf {v}_{i}^{22} & \cdots & v^{2M}_{i} \underline {{\mathbf {e}} } \\ \vdots & \ddots & \\ v^{M1}_{i} \underline {{\mathbf {e}} } & v^{M2}_{i} \underline {{\mathbf {e}} } & \cdots & \mathbf {v}_{i}^{MM} \end{matrix}}\right]\tag{21}\end{align*}
\begin{equation*} \hat {X}^{m}_{\left ({k}\right),i} = \mathbf {v}^{m}_{i} \check {{\underline {\text {F}}}}_{i} \mathbf {y}_{\left [{k}\right]}\tag{22}\end{equation*}
Hence the MMSE based optimization problem for the sparse MIMO PTEQ filter coefficients for user \begin{equation*} \underset {\tilde {\mathbf {v}}^{\mathbf {m}}_{\mathbf {i}}}{\text{minimize }} \mathbb {E}\left [{ \left |{ \tilde {\mathbf {v}}^{m}_{i} \check {{\underline {\text {F}}}}^{m}_{i}\mathbf {y}_{\left [{k}\right]} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right]\tag{24}\end{equation*}
\begin{equation*} \left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} = \left ({\mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i}}\right)^{H}}\right]}\right)^{-1} \left ({\mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right]}\right)\tag{25}\end{equation*}
\begin{align*} \mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i}}\right)^{H}}\right]=&\check {{\underline {\text {F}}}}^{m}_{i} \mathbb {E} \left [{\left [{ \begin{array}{c} \mathbf {y}^{1}_{\left [{k}\right]} \\ \mathbf {y}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {y}^{M}_{\left [{k}\right]} \end{array} }\right] \left [{ \begin{array}{c} \mathbf {y}^{1}_{\left [{k}\right]} \\ \mathbf {y}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {y}^{M}_{\left [{k}\right]} \end{array} }\right]^{H}}\right] \check {{\underline {\text {F}}}}^{m^{H}}_{i} \\=&\check {{\underline {\text {F}}}}^{m}_{i} \mathbf {R_{yy}} \check {{\underline {\text {F}}}}^{m^{H}}_{i}\tag{26}\end{align*}
\begin{align*} \mathbf {R_{yy}} = \left [{\begin{matrix} \mathbb {E}\left [{\mathbf {y}^{1}_{\left [{k}\right]}\mathbf {y}^{1^{H}}_{\left [{k}\right]}}\right] & \mathbb {E}\left [{\mathbf {y}^{1}_{\left [{k}\right]} \mathbf {y}^{2^{H}}_{\left [{k}\right]}}\right] & \cdots & \mathbb {E}\left [{\mathbf {y}^{1}_{\left [{k}\right]} \mathbf {y}^{M^{H}}_{\left [{k}\right]}}\right] \\ \mathbb {E}\left [{\mathbf {y}^{2}_{\left [{k}\right]}\mathbf {y}^{1^{H}}_{\left [{k}\right]}}\right] & \mathbb {E}\left [{\mathbf {y}^{2}_{\left [{k}\right]} \mathbf {y}^{2^{H}}_{\left [{k}\right]}}\right] & \cdots & \mathbb {E}\left [{\mathbf {y}^{2}_{\left [{k}\right]} \mathbf {y}^{M^{H}}_{\left [{k}\right]}}\right] \\ \vdots & \vdots & \cdots & \vdots \\ \mathbb {E}\left [{\mathbf {y}^{M}_{\left [{k}\right]} \mathbf {y}^{1^{H}}_{\left [{k}\right]}}\right] & \mathbb {E}\left [{\mathbf {y}^{M}_{\left [{k}\right]} \mathbf {y}^{2^{H}}_{\left [{k}\right]}}\right] & \cdots & \mathbb {E}\left [{\mathbf {y}^{M}_{\left [{k}\right]} \mathbf {y}^{M^{H}}_{\left [{k}\right]}}\right] \end{matrix}}\right] \\{}\tag{27}\end{align*}
\begin{align*} \mathbb {E}\left [{\mathbf {y}^{p}_{\left [{k}\right]} \mathbf {y}^{q^{H}}_{\left [{k}\right]}}\right]=&\sum _{l = 1}^{M}\mathbf {H}^{p,l} \mathbf {R}_{\breve {\mathbf {x}}^{l}}\mathbf {H}^{q,l^{H}} \forall p \neq q \\ \mathbb {E}\left [{\mathbf {y}^{p}_{\left [{k}\right]} \mathbf {y}^{q^{H}}_{\left [{k}\right]}}\right]=&\sum _{l = 1}^{M}\mathbf {H}^{p,l} \mathbf {R}_{\breve {\mathbf {x}}^{l}}\mathbf {H}^{q,l^{H}} + \mathbf {R}_{\mathbf {n}^{p}} \forall p = q\tag{28}\end{align*}
The second part of the (25) (\begin{align*} \mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right] = \mathbb {E}\left [{ \left ({\check {{\underline {\text {F}}}}^{m}_{i} \left [{ \begin{array}{c} \mathbf {y}^{1}_{\left [{k}\right]} \\ \mathbf {y}^{2}_{\left [{k}\right]} \\ \vdots \\ \mathbf {y}^{M}_{\left [{k}\right]} \end{array} }\right]}\right) \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right] \\ = \mathbb {E}\left [{ \left ({\check {{\underline {\text {F}}}}^{m}_{i}\left [{\begin{array}{c} \sum _{l = 1}^{M}\mathbf {H}^{1l} \breve {\mathbf {x}}^{l}_{\left [{k}\right]} + \mathbf {n}^{1}_{\left [{k}\right]} \\ \sum _{l = 1}^{M}\mathbf {H}^{2l} \breve {\mathbf {x}}^{l}_{\left [{k}\right]} + \mathbf {n}^{2}_{\left [{k}\right]} \\ \vdots \\ \sum _{l = 1}^{M}\mathbf {H}^{Ml} \breve {\mathbf {x}}^{l}_{\left [{k}\right]} + \mathbf {n}^{M}_{\left [{k}\right]} \end{array}}\right]}\right) \left ({\breve {\mathbf {x}}^{m}_{\left [{k}\right]}}\right)^{H} \mathbf {e}_{i} }\right]\tag{29}\end{align*}
\begin{align*} \mathbb {E}\left [{\mathbf {{\stackrel {\circ }{y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right] = \check {{\underline {\text {F}}}}^{m}_{i} \left [{ \begin{array}{c} \mathbf {H}^{1m} \mathbf {R}_{\breve {\mathbf {x}}^{m}}\mathbf {e}_{i} \\ \mathbf {H}^{2m}\mathbf {R}_{\breve {\mathbf {x}}^{m}} \mathbf {e}_{i} \\ \vdots \\ \mathbf {H}^{Mm} \mathbf {R}_{\breve {\mathbf {x}}^{m}} \mathbf {e}_{i} \end{array} }\right]\tag{30}\end{align*}
Diagonal MIMO PTEQ
A further reduction of the initialization computational complexity and memory requirement can be achieved by considering a diagonal MIMO PTEQ. In addition to achieving a reduction in the initialization computational complexity and memory requirement, a diagonal MIMO PTEQ can also be used in downstream scenarios, where no receiver coordination is possible. The diagonal MIMO PTEQ matrix elements for tone \begin{align*} \mathbf {v}^{ml}_{i}=&\left [{ v^{ml}_{\left ({T-1}\right),i} v^{ml}_{\left ({T-2}\right),i} \cdots v^{ml}_{\left ({0}\right),i} }\right] \quad \forall m=l \\ \mathbf {v}^{ml}_{i}=&\left [{ \mathbf {0} }\right]_{1\times T} \quad \forall m \neq l\tag{32}\end{align*}
\begin{equation*} \mathbf {v}^{m}_{i} = \left [{\mathbf {0} \mathbf {0} \cdots \mathbf {v}_{i}^{mm} \cdots \mathbf {0} }\right]\tag{33}\end{equation*}
\begin{align*} \mathbf {V}_{i} = \left [{\begin{matrix} \mathbf {v}_{i}^{11} & \mathbf {0} & \cdots & \mathbf {0} \\ \mathbf {0} & \mathbf {v}_{i}^{22} & \cdots & \mathbf {0} \\ \vdots & \ddots & \\ \mathbf {0} & \mathbf {0}& \cdots & \mathbf {v}_{i}^{MM} \end{matrix}}\right]\tag{34}\end{align*}
\begin{equation*} \hat {X}^{m}_{\left ({k}\right),i} = \mathbf {v}^{m}_{i} \check {{\underline {\text {F}}}}_{i} \mathbf {y}_{\left [{k}\right]}\tag{35}\end{equation*}
\begin{align*} \hat {X}^{m}_{\left ({k}\right),i}=&\mathbf {v}_{i}^{mm} {\underline {\text {F}}}_{i} \mathbf {y}^{m}_{\left [{k}\right]} \\=&\mathbf {v}^{mm}_{i}{\underline {\text {F}}}_{i} \left ({\left [{ \mathbf {H}^{m1} \mathbf {H}^{m2} \cdots \mathbf {H}^{mM} }\right]\left [{\begin{array}{c} \breve {\mathbf {x}}^{1}_{\left [{k}\right]} \\ \breve {\mathbf {x}}^{2}_{\left [{k}\right]} \\ \vdots \\ \breve {\mathbf {x}}^{M}_{\left [{k}\right]} \end{array}}\right]+ \mathbf {n}^{m}_{\left [{k}\right]} }\right)\tag{36}\end{align*}
\begin{equation*} \underset {\mathbf {\mathbf {v}^{mm}_{i}}}{\text{minimize }} \quad \mathbb {E}\left [{ \left |{ \mathbf {v}^{mm}_{i} {\underline {\text {F}}}_{i} \mathbf {y}^{m}_{\left [{k}\right]} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right]\tag{37}\end{equation*}
\begin{equation*} \left ({\mathbf {v}^{mm}_{i}}\right)^{H} = \left ({\mathbb {E}\left [{{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i} \left ({{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i}}\right)^{H}}\right]}\right)^{-1} \left ({\mathbb {E}\left [{{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right]}\right)\tag{38}\end{equation*}
\begin{align*} \mathbb {E}\left [{{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i} \left ({{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i}}\right)^{H}}\right]=&{\underline {\text {F}}}_{i} \mathbb {E} \left [{ \mathbf {y}^{m}_{\left [{k}\right]} \mathbf {y}^{m^{H}}_{\left [{k}\right]} }\right] {\underline {\text {F}}}^{H}_{i} \\=&{\underline {\text {F}}}_{i} \mathbf {R}_{\mathbf {y}^{m}\mathbf {y}^{m}} {\underline {\text {F}}}^{H}_{i}\tag{39}\end{align*}
The second part of (38) (\begin{align*} \mathbb {E}\left [{{\dot {\mathbf {y}}}^{m}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast} }\right]=&\mathbb {E}\left [{ \left ({{\underline {\text {F}}}_{i} \mathbf {y}^{m}_{\left [{k}\right]} }\right) \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right] \\=&\mathbb {E}\left [{ \left ({{\underline {\text {F}}}_{i} \sum _{l = 1}^{M}\mathbf {H}^{ml} \breve {\mathbf {x}}^{l}_{\left [{k}\right]} + \mathbf {n}^{m}_{\left [{k}\right]} }\right)\left ({\breve {\mathbf {x}}^{m}_{\left [{k}\right]}}\right)^{H} \mathbf {e}_{i} }\right] \\ \tag{40}\end{align*}
\begin{equation*} \mathbb {E}\left [{ {\dot {\mathbf {y}}}^{m}_{i}\left [{k}\right] \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right] = {\underline {\text {F}}}_{i} \mathbf {H}^{mm} \mathbf {R}_{\breve {\mathbf {x}}^{m}} \mathbf {e}_{i}\tag{41}\end{equation*}
It can be observed that the full MIMO PTEQ and the sparse MIMO PTEQ require signal coordination among users at the receiver side. Hence, the users are required to have their receivers physically co-located. The requirement of signal coordination can be observed from the formulation of the estimated
Initialization Computational Complexity
Computing the optimal sparse or diagonal MIMO PTEQ coefficients, can be based on (31) and (42), shown at the bottom of the page if first all the channel matrices (and noise correlation matrices) are estimated. Alternatively these coefficients can be computed based on received data correlations, i.e., with (25) and (38) (and (18) for full MIMO PTEQ). The latter option will be analysed here. Computing the coefficients can then be divided in 3 tasks:
Computing the SDFT of the received signals using the difference terms implementation, as described in (12), requires
DFT operations and$M$ complex add operations per DMT symbol. This task is common to all three discussed MIMO PTEQs.$M(T-1)$
A. Full MIMO PTEQ
Computing the correlation matrix
requires$\mathbb {E}[{\ddot {\mathbf {y}}}_{[k],i} ({\ddot {\mathbf {y}}}_{[k],i})^{H}]$ arithmetic operations, where$\mathcal {O}(M^{2}T^{2})$ is the length of the vector$MT$ . For${\ddot {\mathbf {y}}}_{[k],i}$ , the total required number of arithmetic operations becomes$i = 1\cdots N$ .$\mathcal {O}(N M^{2}T^{2})$ Solving (18) as
can be done with$(\tilde {\mathbf {v}}^{m}_{i})^{H} = (\mathbb {E}[{\ddot {\mathbf {y}}}_{[k],i} ({\ddot {\mathbf {y}}}_{[k],i})^{H}])^{-1} \cdot (\mathbb {E}[{\ddot {\mathbf {y}}}_{[k],i} ({X}^{m}_{(k),i})^{\ast}])$ arithmetic operations. For$\mathcal {O}\left({\frac {1}{3}M^{3}T^{3}}\right)$ , the total required number of arithmetic operations becomes$i = 1\cdots N$ .$\mathcal {O}\left({\frac {1}{3}N M^{3}T^{3}}\right)$
B. Sparse MIMO PTEQ
Computing the correlation matrix
requires$\mathbb {E}[\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} (\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i})^{H}]$ arithmetic operations, where$\mathcal {O}((T+M-1)^{2})$ is the length of the vector$M+T-1$ . For${\ddot {\mathbf {y}}}^{m}_{[k],i}$ and$i = 1 \cdots N$ , the total required arithmetic operations becomes$m = 1\cdots M$ .$\mathcal {O}(NM(T+M-1)^{2})$ Solving (25) as
can be done with$(\tilde {\mathbf {v}}^{m}_{i})^{H} = (\mathbb {E}[\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} (\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i})^{H}])^{-1}\cdot (\mathbb {E}[\mathbf {{\stackrel {\circ }{y}}}^{m}_{[k],i} ({X}^{m}_{(k),i})^{\ast}])$ arithmetic operations. For$\mathcal {O}\left({\frac {1}{3}(T+M-1)^{3}}\right)$ and$i = 1\cdots N$ , the total required number of arithmetic operations becomes$m = 1 \cdots M$ .$\mathcal {O}\left({\frac {1}{3}NM(T+M-1)^{3}}\right)$
C. Diagonal MIMO PTEQ
Computing the correlation matrix
requires$\mathbb {E}[{\dot {\mathbf {y}}}^{m}_{[k],i} ({\dot {\mathbf {y}}}^{m}_{[k],i})^{H}]$ arithmetic operations, where$\mathcal {O}(T^{2})$ is the length of the vector$T$ . For${\dot {\mathbf {y}}}^{m}_{[k],i}$ and$i = 1\cdots N$ , the total required number of arithmetic operations becomes$m = 1\cdots M$ .$\mathcal {O}(NMT^{2})$ Solving (38) as
can be done with$(\mathbf {v}^{mm}_{i})^{H} = (\mathbb {E}[{\dot {\mathbf {y}}}^{m}_{[k],i} ({\dot {\mathbf {y}}}^{m}_{[k],i})^{H}])^{-1} \cdot (\mathbb {E}[{\dot {\mathbf {y}}}^{m}_{[k],i} ({X}^{m}_{(k),i})^{\ast}])$ arithmetic operations. For$\mathcal {O}\left({\frac {1}{3}T^{3}}\right)$ and$i = 1\cdots N$ , the total required number of arithmetic operations becomes$m = 1 \cdots M$ .$\mathcal {O}\left({\frac {1}{3}NMT^{3}}\right)$

% ratio of initialization computational complexity of sparse MIMO PTEQ with PTEQ order-1 (
Fig. 5 also includes the % ratio of the initialization computational complexity of a UNCDc-Zxc design method based MIMO TEQ (order 7) [9] compared to a full MIMO PTEQ (order 7). The lower orders of MIMO TEQ shows much higher % ratio of the initialization computational complexity compared to a full MIMO PTEQ of the same order and hence are omitted from Fig. 5. Furthermore, since, the computationally most expensive task in computing the optimal MIMO PTEQ coefficients for both the diagonal MIMO PTEQ and full MIMO PTEQ depends on
Memory Requirement
In comparison to a MIMO TEQ, a full MIMO PTEQ needs to store
% ratio in memory requirement (and runtime computational complexity) of sparse MIMO PTEQ with PTEQ order-1 (
Results
The simulation setup is kept similar to [9]. The G.fast 106b profile [28] is considered here for the simulation of a
Fig. 7(a), Fig. 7(b) and Fig. 7(c) compare the performance of the full MIMO PTEQ with the proposed sparse MIMO PTEQ in terms of total achieved rates for a five line DSL system, for different channels. The rates are calculated over a range of synchronisation delays, to also characterize the robustness of the MIMO PTEQ design against the synchronization delay. From the simulation results, it can be noticed that the sparse PTEQ attains similar performance as the full MIMO PTEQ, while providing a reduction in the initialization computational complexity and the memory requirement. As discussed in Section III, the ISI and ICI power caused by crosstalk channels is expected to be small. Hence, ignoring the crosstalk ISI and ICI cancellation indeed does not significantly affect the performance of the MIMO PTEQ and a single tap equalizer for crosstalk cancellation works similar to a T tap equalizer. Furthermore, it can be observed from Fig. 7(a), Fig. 7(b) and Fig. 7(c) that the performance of the full MIMO PTEQ and sparse MIMO PTEQ shows insignificant improvement, while increasing the order from PTEQ order 1 to PTEQ order 7. This suggests that the channel impulse responses used for the simulations can be approximated with a rational transfer function that has few poles and an order-1 MIMO PTEQ is already good enough to cancel them. An impulse response whose approximated rational transfer function has a higher number of poles will require a higher order PTEQ, and the system performance will not saturate at order-1 MIMO PTEQ.
Performance comparison of MIMO PTEQ structures (and MIMO TEQ-3 (__) [9]) for different DSL channels and filter orders (CP = 128).
Fig. 7(d), Fig. 7(e) and Fig. 7(f) compare the performance of the full MIMO PTEQ and the proposed diagonal MIMO PTEQ. Though the diagonal PTEQ further reduces the initialization computational complexity and memory requirement, an average performance drop of 29.3% is observed in the absence of crosstalk cancellation. This drop in performance can likely be avoided by using crosstalk precompensation, also referred to as downstream vectoring, at the transmitter side, but designing the vectoring together with the diagonal MIMO PTEQ is complex and remains as a topic for future work.
Fig. 8 extends the results of Fig. 7 with the same channels and simulation parameters but with a CP length of 64 samples. Finally, Fig. 9 provides simulation results for non-LR-G.fast channels. The theoretical channel (length 300m) is again based on the KHM model, suggested in [30], while the measured channel data corresponds to cable binders of two Tier-1 operators (channel 1 of length 250m and channel 2 of length 300m). The simulation parameters are kept the same (Table 1) with a CP length of 64 samples. The results in Fig. 8 and Fig. 9 further bolster the conclusions drawn from Fig. 7.
Performance comparison of MIMO PTEQ structures (and MIMO TEQ-3 (__) [9]) for different DSL channels and filter orders (CP = 64).
Conclusion
The paper is motivated by the potential use of a MIMO PTEQ in newly emerging LR-xDSL and tackles the large initialization computational complexity and memory requirement for the full MIMO PTEQ. A specific structure in the MIMO DSL channel is exploited, namely that the combined ISI and ICI signal power from the crosstalk channels is significantly lower than the desired and combined ISI and ICI signal power from the direct channels, to derive a very low complexity/memory solution, referred as sparse MIMO PTEQ, with negligible impact (≈ 0.5% drop) on the performance compared to a full MIMO PTEQ. For a conventional DSL binder size of 16 lines and a PTEQ order of 3, the proposed sparse MIMO PTEQ operates at 42% of the initialization computational complexity and 29.7% of the memory requirement, with negligible performance degradation, compared to a full MIMO PTEQ. Even larger reductions in initialization computational complexity and memory requirement are achieved by the proposed diagonal MIMO PTEQ, which operates at 0.4% of the initialization computational complexity and 6.25% of the memory requirement compared to a full MIMO PTEQ. It also allows the MIMO PTEQ implementation in upstream as well as downstream scenarios. However, in absence of crosstalk cancellation the performance of diagonal MIMO PTEQ drops significantly compared to a full MIMO PTEQ. This drop in performance can likely be avoided by using crosstalk precompensation (i.e., downstream vectoring) at the transmitter side. Finally, the applicability of the proposed models could potentially be interesting for wireless systems as well, which are characterized by a similar structure.
AppendixOptimal MIMO PTEQ coefficients
Optimal MIMO PTEQ coefficients
The optimization problem (MMSE) for optimal PTEQ filter coefficients for user \begin{equation*} \underset {\tilde {\mathbf {v}}^{\mathbf {m}}_{\mathbf {i}}}{\text{minimize }} \mathbb {E}\left [{ \left |{ \tilde {\mathbf {v}}^{m}_{i}{\ddot {\mathbf {y}}}_{\left [{k}\right],i} - {X}^{m}_{\left ({k}\right),i} }\right |^{2}}\right]\tag{44}\end{equation*}
\begin{align*} \varepsilon ^{m}_{i}=&\mathbb {E}\left [{\left ({\tilde {\mathbf {v}}^{m}_{i} {\ddot {\mathbf {y}}}_{\left [{k}\right],i} - {X}^{m}_{\left ({k}\right),i} }\right)\left ({\tilde {\mathbf {v}}^{m}_{i} {\ddot {\mathbf {y}}}_{\left [{k}\right],i} - {X}^{m}_{\left ({k}\right),i} }\right)^{H}}\right] \\=&\tilde {\mathbf {v}}^{m}_{i}\mathbb {E}\left [{ {\ddot {\mathbf {y}}}_{\left [{k}\right],i} \left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H}}\right]\left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} - \tilde {\mathbf {v}}^{m}_{i}\mathbb {E}\left [{ {\ddot {\mathbf {y}}}_{\left [{k}\right],i}\left ({{X}^{m}_{\left ({k}\right),i}}\right)^{H}}\right] \\&- \mathbb {E}\left [{{X}^{m}_{\left ({k}\right),i} \left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H}}\right]\left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} + \mathbb {E}\left [{{X}^{m}_{\left ({k}\right),i}\left ({{X}^{m}_{\left ({k}\right),i}}\right)^{H}}\right]\tag{45}\end{align*}
\begin{align*} \frac {\partial \varepsilon ^{m}_{i}}{\partial \left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H}}=&2\mathbb {E}\left [{{\ddot {\mathbf {y}}}_{\left [{k}\right],i}\left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H} }\right]\left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} - 2\mathbb {E}\left [{{\ddot {\mathbf {y}}}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{H}}\right] \\{}\tag{46}\end{align*}
\begin{equation*} \left ({\tilde {\mathbf {v}}^{m}_{i}}\right)^{H} = \mathbb {E}\left [{{\ddot {\mathbf {y}}}_{\left [{k}\right],i}\left ({{\ddot {\mathbf {y}}}_{\left [{k}\right],i}}\right)^{H} }\right]^{-1}\mathbb {E}\left [{ {\ddot {\mathbf {y}}}_{\left [{k}\right],i} \left ({{X}^{m}_{\left ({k}\right),i}}\right)^{\ast}}\right].\tag{47}\end{equation*}