1 Introduction
 By a 
 $\mathbb {Z}$
 system we mean a tuple
$\mathbb {Z}$
 system we mean a tuple 
 $\textbf {X} = (X,T,\mu _{X})$
 where X is a compact, metric space, T is a continuous action of
$\textbf {X} = (X,T,\mu _{X})$
 where X is a compact, metric space, T is a continuous action of 
 $\mathbb {Z}$
 on X, and
$\mathbb {Z}$
 on X, and 
 $\mu _{X}$
 is a Borel probability measure on X that is T invariant. Ergodic
$\mu _{X}$
 is a Borel probability measure on X that is T invariant. Ergodic 
 $\mathbb {Z}$
 systems
$\mathbb {Z}$
 systems 
 $(X,T,\mu _{X})$
 and
$(X,T,\mu _{X})$
 and 
 $(Y,S,\mu _{Y})$
 are disjoint if
$(Y,S,\mu _{Y})$
 are disjoint if 
 $\mu _{X} \otimes \mu _{Y}$
 is the only probability measure on
$\mu _{X} \otimes \mu _{Y}$
 is the only probability measure on 
 $X \times Y$
 that has
$X \times Y$
 that has 
 $\mu _{X}$
 and
$\mu _{X}$
 and 
 $\mu _{Y}$
 as its marginals and is invariant under the diagonal action
$\mu _{Y}$
 as its marginals and is invariant under the diagonal action 
 $(T \times S)^{n} = T^{n} \times S^{n}$
. (Any probability measure on
$(T \times S)^{n} = T^{n} \times S^{n}$
. (Any probability measure on 
 $X \times Y$
 with these two properties is called a joining of the two
$X \times Y$
 with these two properties is called a joining of the two 
 $\mathbb {Z}$
 systems.) The notion of disjointness, introduced in Furstenberg’s seminal paper [Reference FurstenbergFur67], is an extreme form of non-isomorphism. In particular, if systems have a non-trivial factor in common, then they cannot be disjoint. Furstenberg asked whether the converse is true. Rudolph [Reference RudolphRud79] answered this question by producing (from his construction in the same paper of a
$\mathbb {Z}$
 systems.) The notion of disjointness, introduced in Furstenberg’s seminal paper [Reference FurstenbergFur67], is an extreme form of non-isomorphism. In particular, if systems have a non-trivial factor in common, then they cannot be disjoint. Furstenberg asked whether the converse is true. Rudolph [Reference RudolphRud79] answered this question by producing (from his construction in the same paper of a 
 $\mathbb {Z}$
 system with minimal self-joinings) two
$\mathbb {Z}$
 system with minimal self-joinings) two 
 $\mathbb {Z}$
 systems that are not disjoint and yet share no common factor.
$\mathbb {Z}$
 systems that are not disjoint and yet share no common factor.
 Perhaps motivated by Furstenberg’s question, Berg [Reference BergBer71, Reference BergBer72] considered the case when one of the systems is measurably distal. Recall that a 
 $\mathbb {Z}$
 system is measurably distal if it belongs to the smallest class of
$\mathbb {Z}$
 system is measurably distal if it belongs to the smallest class of 
 $\mathbb {Z}$
 systems that contains the trivial system and is closed under factors, group extensions, and inverse limits; see §2 for the definitions of these notions. That the above definition of measurably distal is equivalent to Parry’s original definition [Reference Parry, Auslander and GottschalkPar68] in terms of separating sieves was proved by Zimmer [Reference ZimmerZim76a] (cf. §2.5).
$\mathbb {Z}$
 systems that contains the trivial system and is closed under factors, group extensions, and inverse limits; see §2 for the definitions of these notions. That the above definition of measurably distal is equivalent to Parry’s original definition [Reference Parry, Auslander and GottschalkPar68] in terms of separating sieves was proved by Zimmer [Reference ZimmerZim76a] (cf. §2.5).
 To describe Berg’s result, recall that the Kronecker factor of an ergodic 
 $\mathbb {Z}$
 system is the largest factor of the system that is isomorphic to a rotation on a compact abelian group. Berg proved that disjointness of the Kronecker factors of two ergodic
$\mathbb {Z}$
 system is the largest factor of the system that is isomorphic to a rotation on a compact abelian group. Berg proved that disjointness of the Kronecker factors of two ergodic 
 $\mathbb {Z}$
 systems is equivalent to both disjointness and the absence of a common factor when one of the systems is measurably distal. We say that systems
$\mathbb {Z}$
 systems is equivalent to both disjointness and the absence of a common factor when one of the systems is measurably distal. We say that systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 are Kronecker disjoint if their Kronecker factors are disjoint.
$\textbf {Y}$
 are Kronecker disjoint if their Kronecker factors are disjoint.
Theorem 1.1. (Berg [Reference BergBer71, Reference BergBer72])
 Let 
 $\textbf {X}$
 be an ergodic and measurably distal
$\textbf {X}$
 be an ergodic and measurably distal 
 $\mathbb {Z}$
 system and let
$\mathbb {Z}$
 system and let 
 $\textbf {Y}$
 be an ergodic
$\textbf {Y}$
 be an ergodic 
 $\mathbb {Z}$
 system. The following are equivalent:
$\mathbb {Z}$
 system. The following are equivalent: 
- 
(i)  $\textbf {X}$
 and $\textbf {X}$
 and $\textbf {Y}$
 are disjoint; $\textbf {Y}$
 are disjoint;
- 
(ii)  $\textbf {X}$
 and $\textbf {X}$
 and $\textbf {Y}$
 are Kronecker disjoint; $\textbf {Y}$
 are Kronecker disjoint;
- 
(iii)  $\textbf {X}$
 and $\textbf {X}$
 and $\textbf {Y}$
 have no non-trivial common factor. $\textbf {Y}$
 have no non-trivial common factor.
 The key ingredient in the proof of Theorem 1.1 is a weakening of the notion of disjointness that is shown to be preserved by group extensions, factors, and inverse limits. The definition of this weakened property (called ‘quasi-disjointness’ in [Reference BergBer71]) is as follows. Given ergodic 
 $\mathbb {Z}$
 systems
$\mathbb {Z}$
 systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
, let
$\textbf {Y}$
, let 
 $\alpha $
 and
$\alpha $
 and 
 $\beta $
 be the factor maps from
$\beta $
 be the factor maps from 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
, respectively, to their maximal common Kronecker factor
$\textbf {Y}$
, respectively, to their maximal common Kronecker factor 
 $\mathsf {K}(\textbf {X},\textbf {Y})$
. The
$\mathsf {K}(\textbf {X},\textbf {Y})$
. The 
 $\mathbb {Z}$
 systems
$\mathbb {Z}$
 systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 are quasi-disjoint if the property
$\textbf {Y}$
 are quasi-disjoint if the property 
- 
(BQD) for almost every k in  $\mathsf {K}(\textbf {X},\textbf {Y})$
 there is exactly one joining of the systems $\mathsf {K}(\textbf {X},\textbf {Y})$
 there is exactly one joining of the systems $\textbf {X}$
 and $\textbf {X}$
 and $\textbf {Y}$
 giving full measure to $\textbf {Y}$
 giving full measure to $\gamma ^{-1}(k)$ $\gamma ^{-1}(k)$
holds, where 
 $\gamma (x,y)=\alpha (x)-\beta (y)$
. The main results in [Reference BergBer71, Reference BergBer72] imply that
$\gamma (x,y)=\alpha (x)-\beta (y)$
. The main results in [Reference BergBer71, Reference BergBer72] imply that 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 satisfy (BQD) whenever
$\textbf {Y}$
 satisfy (BQD) whenever 
 $\textbf {X}$
 is an ergodic and measurably distal
$\textbf {X}$
 is an ergodic and measurably distal 
 $\mathbb {Z}$
 system and
$\mathbb {Z}$
 system and 
 $\textbf {Y}$
 is an ergodic
$\textbf {Y}$
 is an ergodic 
 $\mathbb {Z}$
 system.
$\mathbb {Z}$
 system.
 In this paper, we introduce a new definition of quasi-disjointness that applies to measure-preserving actions of any countable group G. To describe it, we recall the following notions. A G system is a tuple 
 $(X,T,\mu _{X})$
 where X is a compact, metric space, T is a continuous left action of G on X, and
$(X,T,\mu _{X})$
 where X is a compact, metric space, T is a continuous left action of G on X, and 
 $\mu _{X}$
 is a Borel probability measure on X that is T invariant. The Kronecker factor of a G system
$\mu _{X}$
 is a Borel probability measure on X that is T invariant. The Kronecker factor of a G system 
 $\textbf {X}$
 is the factor
$\textbf {X}$
 is the factor 
 $\mathsf {K} \textbf {X}$
 corresponding to the subspace of
$\mathsf {K} \textbf {X}$
 corresponding to the subspace of 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 spanned by functions f with the property that
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 spanned by functions f with the property that 
 $\{ f \circ T^{g} : g \in G \}$
 has compact closure. Disjointness of G systems is defined just as for
$\{ f \circ T^{g} : g \in G \}$
 has compact closure. Disjointness of G systems is defined just as for 
 $\mathbb {Z}$
 systems.
$\mathbb {Z}$
 systems.
 Although it is the case for ergodic 
 $\mathbb {Z}$
 systems, the Kronecker factor of an ergodic G system cannot generally be modeled by a rotation on a compact, abelian group. Thus, it is not clear how to modify (BQD) or Berg’s proofs to apply to actions of more general groups. We instead make the following definition, which is more general and easier to handle than (BQD).
$\mathbb {Z}$
 systems, the Kronecker factor of an ergodic G system cannot generally be modeled by a rotation on a compact, abelian group. Thus, it is not clear how to modify (BQD) or Berg’s proofs to apply to actions of more general groups. We instead make the following definition, which is more general and easier to handle than (BQD).
Definition 1.2. Two G systems 
 $\textbf {X} = (X,T,\mu _{X})$
 and
$\textbf {X} = (X,T,\mu _{X})$
 and 
 $\textbf {Y} = (Y,S,\mu _{Y})$
 are quasi-disjoint if the only joining of
$\textbf {Y} = (Y,S,\mu _{Y})$
 are quasi-disjoint if the only joining of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 that projects to the product measure on the product
$\textbf {Y}$
 that projects to the product measure on the product 
 $\mathsf {K}\textbf {X} \times \mathsf {K} \textbf {Y}$
 of their Kronecker factors is the trivial joining
$\mathsf {K}\textbf {X} \times \mathsf {K} \textbf {Y}$
 of their Kronecker factors is the trivial joining 
 $\mu _{X} \otimes \mu _{Y}$
.
$\mu _{X} \otimes \mu _{Y}$
.
 Our first result (proved in §3) is that ergodic 
 $\mathbb {Z}$
 systems
$\mathbb {Z}$
 systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are quasi-disjoint according to Definition 1.2 if and only if they satisfy (BQD), justifying the use of the terminology ‘quasi-disjoint’.
$\textbf {Z}$
 are quasi-disjoint according to Definition 1.2 if and only if they satisfy (BQD), justifying the use of the terminology ‘quasi-disjoint’.
Theorem 1.3. Ergodic 
 $\mathbb {Z}$
 systems
$\mathbb {Z}$
 systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 are quasi-disjoint if and only if they satisfy (BQD).
$\textbf {Y}$
 are quasi-disjoint if and only if they satisfy (BQD).
Our main result is an extension of Berg’s main results in [Reference BergBer71, Reference BergBer72] to G systems. Recall that a G system is measurably distal if it belongs to the smallest class of G systems that is closed under factors, group extensions, and inverse limits: notions that are defined in §2.
Theorem 1.4. If G is a countable group and 
 $\textbf {X}$
 is a measurably distal G system, then
$\textbf {X}$
 is a measurably distal G system, then 
 $\textbf {X}$
 is quasi-disjoint from any other G system
$\textbf {X}$
 is quasi-disjoint from any other G system 
 $\textbf {Y}$
.
$\textbf {Y}$
.
As a consequence of Theorem 1.4 we obtain the following characterizations of disjointness from a measurably distal system.
Corollary 1.5. If G is a countable group, 
 $\textbf {X}$
 is a measurably distal G system, and
$\textbf {X}$
 is a measurably distal G system, and 
 $\textbf {Y}$
 is a G system, then the following are equivalent:
$\textbf {Y}$
 is a G system, then the following are equivalent: 
- 
(i)  $\textbf {X}$
 and $\textbf {X}$
 and $\textbf {Y}$
 are disjoint; $\textbf {Y}$
 are disjoint;
- 
(ii)  $\textbf {X}$
 and $\textbf {X}$
 and $\textbf {Y}$
 are Kronecker disjoint; $\textbf {Y}$
 are Kronecker disjoint;
- 
(iii)  $\textbf {X}$
 and $\textbf {X}$
 and $\textbf {Y}$
 have no non-trivial common factor. $\textbf {Y}$
 have no non-trivial common factor.
 If, in addition, 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 are ergodic, then (i)–(iii) are also equivalent to:
$\textbf {Y}$
 are ergodic, then (i)–(iii) are also equivalent to: 
- 
(iv) the product system  $\textbf {X}\times \textbf {Y}$
 is ergodic. $\textbf {X}\times \textbf {Y}$
 is ergodic.
 If 
 $\textbf {Y}$
 is measurably distal and both
$\textbf {Y}$
 is measurably distal and both 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 are ergodic, then, as pointed out to us by Glasner, the structure theory of measurably distal systems together with [Reference GlasnerGla03, Theorem 3.30] provide an alternative approach to proving Corollary 1.5.
$\textbf {Y}$
 are ergodic, then, as pointed out to us by Glasner, the structure theory of measurably distal systems together with [Reference GlasnerGla03, Theorem 3.30] provide an alternative approach to proving Corollary 1.5.
 We offer two applications of our results. The first is a Wiener–Wintner-type result with distal weights for measure-preserving actions of countable amenable groups, proved in §5. Recall that, when dealing with actions of amenable groups, one uses Følner sequences to average orbits, where a Følner sequence in a countable group G is a sequence 
 $N \mapsto \Phi _{N}$
 of finite, non-empty subsets of G such that
$N \mapsto \Phi _{N}$
 of finite, non-empty subsets of G such that 
 $$ \begin{align*} \frac{|\Phi_{N} \cap g^{-1} \Phi_{N}|}{|\Phi_{N}|} \to 1 \end{align*} $$
$$ \begin{align*} \frac{|\Phi_{N} \cap g^{-1} \Phi_{N}|}{|\Phi_{N}|} \to 1 \end{align*} $$
for all g in G. Lindenstrauss [Reference LindenstraussLin01] proved that the pointwise ergodic theorem holds for actions of amenable groups along tempered Følner sequences: those for which there is 
 $C> 0$
 with
$C> 0$
 with 
 $$ \begin{align*} \bigg| \bigcup_{K < N} \Phi_{K}^{-1} \Phi_{N} \bigg| \le C |\Phi_{N}| \end{align*} $$
$$ \begin{align*} \bigg| \bigcup_{K < N} \Phi_{K}^{-1} \Phi_{N} \bigg| \le C |\Phi_{N}| \end{align*} $$
for all 
 $N \ge 2$
.
$N \ge 2$
.
Theorem 1.6. Let G be a countable discrete amenable group, let 
 $\Phi $
 be a tempered Følner sequence on G and let
$\Phi $
 be a tempered Følner sequence on G and let 
 $\textbf {Y} = (Y,S,\mu _{Y})$
 be an ergodic G system. For every
$\textbf {Y} = (Y,S,\mu _{Y})$
 be an ergodic G system. For every 
 $\phi $
 in
$\phi $
 in 
 $\operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
, there is a conull set
$\operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
, there is a conull set 
 $Y^{\prime } \subset Y$
 with the following property: for any uniquely ergodic topological G system
$Y^{\prime } \subset Y$
 with the following property: for any uniquely ergodic topological G system 
 $(X,T)$
, with unique invariant measure
$(X,T)$
, with unique invariant measure 
 $\mu _{X}$
, such that the G system
$\mu _{X}$
, such that the G system 
 $\textbf {X}=(X,T,\mu _{X})$
 is measurably distal and Kronecker disjoint from
$\textbf {X}=(X,T,\mu _{X})$
 is measurably distal and Kronecker disjoint from 
 $\textbf {Y}$
, and for any
$\textbf {Y}$
, and for any 
 $f\in \mathsf {C}(X)$
, any
$f\in \mathsf {C}(X)$
, any 
 $x\in X$
 and any
$x\in X$
 and any 
 $y\in Y^{\prime }$
 we have
$y\in Y^{\prime }$
 we have 
 $$ \begin{align} \lim_{N \to\infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} f(T^{g} x)\phi(S^{g} y) = \int f\,\textit{d} \mu_{X} \int \phi\,\textit{d} \mu_{Y}. \end{align} $$
$$ \begin{align} \lim_{N \to\infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} f(T^{g} x)\phi(S^{g} y) = \int f\,\textit{d} \mu_{X} \int \phi\,\textit{d} \mu_{Y}. \end{align} $$
One can quickly derive the classical Wiener–Wintner theorem [Reference Wiener and WintnerWW41] from Theorem 1.6, which we do in §5.
The second type of application that we offer is to the theory of multiple recurrence. It is somewhat surprising that only Kronecker disjointness is needed for the following theorems, even though multicorrelations are typically governed by nilrotations (cf. [Reference Host and KraHK05, Lei10, Reference ZieglerZie07]), which, in general, are of a higher complexity than rotations on compact abelian groups. However, as nilsystems are distal, Theorem 1.1 allows us to deduce disjointness of nilsystems from disjointness of their Kronecker factors.
Theorem 1.7. Let 
 $\textbf {X}=(X,T,\mu _{X})$
 and
$\textbf {X}=(X,T,\mu _{X})$
 and 
 $\textbf {Y}=(Y,S,\mu _{Y})$
 be ergodic
$\textbf {Y}=(Y,S,\mu _{Y})$
 be ergodic 
 $\mathbb {Z}$
 systems and assume
$\mathbb {Z}$
 systems and assume 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 are Kronecker disjoint. Then for every
$\textbf {Y}$
 are Kronecker disjoint. Then for every 
 $k,\ell \in \mathbb {N}$
, any
$k,\ell \in \mathbb {N}$
, any 
 $f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
 and any
$f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
 and any 
 $g_{1},\ldots ,g_{\ell }\in \operatorname {\textrm {L}}^{\infty }(Y,\mu _{Y})$
 we have
$g_{1},\ldots ,g_{\ell }\in \operatorname {\textrm {L}}^{\infty }(Y,\mu _{Y})$
 we have 
 $$ \begin{align*} \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in}f_{i} \; S^{jn}g_{j}=\bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\prod_{i=1}^{k} T^{in}f_{i}\bigg) \bigg( \lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\prod_{j=1}^{\ell} S^{jn}g_{j}\bigg) \end{align*} $$
$$ \begin{align*} \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in}f_{i} \; S^{jn}g_{j}=\bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\prod_{i=1}^{k} T^{in}f_{i}\bigg) \bigg( \lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\prod_{j=1}^{\ell} S^{jn}g_{j}\bigg) \end{align*} $$
in 
 $\operatorname {\textrm {L}}^{2}(X\times Y,\mu _{X} \otimes \mu _{Y})$
.
$\operatorname {\textrm {L}}^{2}(X\times Y,\mu _{X} \otimes \mu _{Y})$
.
Theorem 1.8. Let 
 $(Y,S)$
 be a topological
$(Y,S)$
 be a topological 
 $\mathbb {Z}$
 system and let
$\mathbb {Z}$
 system and let 
 $\mu _{Y}$
 be an ergodic S invariant Borel probability measure on Y. For all
$\mu _{Y}$
 be an ergodic S invariant Borel probability measure on Y. For all 
 $G\in \operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
, there exists a set
$G\in \operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
, there exists a set 
 $Y^{\prime }\subset Y$
 with
$Y^{\prime }\subset Y$
 with 
 $\mu _{Y}(Y^{\prime })=1$
 such that for any ergodic
$\mu _{Y}(Y^{\prime })=1$
 such that for any ergodic 
 $\mathbb {Z}$
 system
$\mathbb {Z}$
 system 
 $(X,T,\mu _{X})$
 which is Kronecker disjoint from
$(X,T,\mu _{X})$
 which is Kronecker disjoint from 
 $(Y,S,\mu _{Y})$
, any
$(Y,S,\mu _{Y})$
, any 
 $k\in \mathbb {N}$
, any
$k\in \mathbb {N}$
, any 
 $f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
, and any
$f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
, and any 
 $y\in Y^{\prime }$
,
$y\in Y^{\prime }$
, 
 $$ \begin{align*} \lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} G(S^{n}y)\prod_{i=1}^{k} T^{in} f_{i} = \bigg( \int_{Y} G \,\mathsf{d}\mu_{Y} \bigg) \cdot \bigg( \lim_{N\to\infty}\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} T^{in} f_{i} \bigg) \end{align*} $$
$$ \begin{align*} \lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} G(S^{n}y)\prod_{i=1}^{k} T^{in} f_{i} = \bigg( \int_{Y} G \,\mathsf{d}\mu_{Y} \bigg) \cdot \bigg( \lim_{N\to\infty}\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} T^{in} f_{i} \bigg) \end{align*} $$
in 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. Moreover, if
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. Moreover, if 
 $(Y,S)$
 is uniquely ergodic and
$(Y,S)$
 is uniquely ergodic and 
 $G\in \mathsf {C}(Y)$
, then we can take
$G\in \mathsf {C}(Y)$
, then we can take 
 $Y^{\prime }=Y$
.
$Y^{\prime }=Y$
.
1.1 Structure of the paper
 In §2, we review basic results and facts regarding Kronecker factors and distal systems, which are needed in the subsequent sections. In §3, we discuss Berg’s notion of quasi-disjointness for 
 $\mathbb {Z}$
 systems in more detail and give a proof of Theorem 1.3. In §4, we provide a proof of Theorem 1.4 by showing that quasi-disjointness lifts through group extensions, is preserved by passing to factors and is preserved under taking inverse limits. Sections 5 and 6 contain numerous applications of our main results to questions about pointwise convergence in ergodic theory and to the theory of multiple recurrence, including proofs of Theorems 1.7 and 1.8. Finally, in §7 we formulate some natural open questions.
$\mathbb {Z}$
 systems in more detail and give a proof of Theorem 1.3. In §4, we provide a proof of Theorem 1.4 by showing that quasi-disjointness lifts through group extensions, is preserved by passing to factors and is preserved under taking inverse limits. Sections 5 and 6 contain numerous applications of our main results to questions about pointwise convergence in ergodic theory and to the theory of multiple recurrence, including proofs of Theorems 1.7 and 1.8. Finally, in §7 we formulate some natural open questions.
2 Preliminaries
In this section, we present various preliminary results, which will be of use throughout the paper, on G systems, their Kronecker factors, and their joinings. We conclude with a brief discussion of topologically and measurably distal systems. Throughout this paper, G denotes a countable group.
2.1 Measure-preserving systems
 By a topological Gsystem we mean a pair 
 $(X,T)$
 where X is a compact metric space and T is a continuous left action of G on X. A Gsystem is a tuple
$(X,T)$
 where X is a compact metric space and T is a continuous left action of G on X. A Gsystem is a tuple 
 $\textbf {X} = (X,T,\mu _{X})$
 where
$\textbf {X} = (X,T,\mu _{X})$
 where 
 $(X,T)$
 is a topological G system and
$(X,T)$
 is a topological G system and 
 $\mu _{X}$
 is a T invariant Borel probability measure on X. The product of two G systems
$\mu _{X}$
 is a T invariant Borel probability measure on X. The product of two G systems 
 $\textbf {X} = (X,T,\mu _{X})$
 and
$\textbf {X} = (X,T,\mu _{X})$
 and 
 $\textbf {Z} = (Z,R,\mu _{Z})$
 is the system
$\textbf {Z} = (Z,R,\mu _{Z})$
 is the system 
 $\textbf {X} \times \textbf {Z} = (X \times Z, T \times R, \mu _{X} \otimes \mu _{Z})$
 where
$\textbf {X} \times \textbf {Z} = (X \times Z, T \times R, \mu _{X} \otimes \mu _{Z})$
 where 
 $T \times R$
 is the diagonal action
$T \times R$
 is the diagonal action 
 $(T \times R)^{g} = T^{g} \times R^{g}$
.
$(T \times R)^{g} = T^{g} \times R^{g}$
.
 A G system 
 $\textbf {Z} = (Z,R,\mu _{Z})$
 is a factor of a G system
$\textbf {Z} = (Z,R,\mu _{Z})$
 is a factor of a G system 
 $\textbf {X} = (X,T,\mu _{X})$
 if there is a G invariant, conull (that is full measure) subset
$\textbf {X} = (X,T,\mu _{X})$
 if there is a G invariant, conull (that is full measure) subset 
 $X^{\prime }$
 of X and a measurable, measure-preserving, G equivariant map
$X^{\prime }$
 of X and a measurable, measure-preserving, G equivariant map 
 $X^{\prime } \to Z$
. Any such map, together with its conull, invariant domain, is called a factor map.
$X^{\prime } \to Z$
. Any such map, together with its conull, invariant domain, is called a factor map.
 Given a factor 
 $\textbf {Z}$
 of a G system
$\textbf {Z}$
 of a G system 
 $\textbf {X}$
, the associated factor map induces an isometric embedding of
$\textbf {X}$
, the associated factor map induces an isometric embedding of 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
 in
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
 in 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. Denote by
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. Denote by 
 $\mathbb {E}({\cdot }\vert {\textbf {Z}})$
 the orthogonal projection from
$\mathbb {E}({\cdot }\vert {\textbf {Z}})$
 the orthogonal projection from 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 to this embedded copy of
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 to this embedded copy of 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. Given
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. Given 
 $f \in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
, we can think of
$f \in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
, we can think of 
 $\mathbb {E}({f}\vert {\textbf {Z}})$
 as a function either on X or on Z.
$\mathbb {E}({f}\vert {\textbf {Z}})$
 as a function either on X or on Z.
2.2 Disintegrations
 Given a factor map 
 $\pi : \textbf {X} \to \textbf {Y}$
 of G systems one can always find an almost-surely defined, measurable family
$\pi : \textbf {X} \to \textbf {Y}$
 of G systems one can always find an almost-surely defined, measurable family 
 $y \mapsto \mu _{y}$
 of Borel probability measures on X such that
$y \mapsto \mu _{y}$
 of Borel probability measures on X such that 
 $$ \begin{align*} \int f\,\textit{d} \mu_{X} = \iint f\,\textit{d} \mu_{y}\,\textit{d} \mu_{Y}(y) \end{align*} $$
$$ \begin{align*} \int f\,\textit{d} \mu_{X} = \iint f\,\textit{d} \mu_{y}\,\textit{d} \mu_{Y}(y) \end{align*} $$
for all f in 
 $\operatorname {\textrm {L}}^{1}(X,\mu _{X})$
 and that
$\operatorname {\textrm {L}}^{1}(X,\mu _{X})$
 and that 
 $T^{g} \mu _{y} = \mu _{S^{g} y}$
 for all
$T^{g} \mu _{y} = \mu _{S^{g} y}$
 for all 
 $g \in G$
 almost surely. Moreover, the family
$g \in G$
 almost surely. Moreover, the family 
 $y \mapsto \mu _{y}$
 is uniquely determined almost surely by these properties. If
$y \mapsto \mu _{y}$
 is uniquely determined almost surely by these properties. If 
 $\textbf {Y}$
 is the factor corresponding (via [Reference ZimmerZim76b, Corollary 2.2]) to the
$\textbf {Y}$
 is the factor corresponding (via [Reference ZimmerZim76b, Corollary 2.2]) to the 
 $\sigma $
-algebra of T invariant sets, then the resulting disintegration
$\sigma $
-algebra of T invariant sets, then the resulting disintegration 
 $y \mapsto \mu _{y}$
 is a version of the ergodic decomposition of
$y \mapsto \mu _{y}$
 is a version of the ergodic decomposition of 
 $\mu _{X}$
. We refer the reader to [Reference Einsiedler and WardEW11, Ch. 5] for details on disintegrations of measures.
$\mu _{X}$
. We refer the reader to [Reference Einsiedler and WardEW11, Ch. 5] for details on disintegrations of measures.
2.3 The Kronecker factor
 Every G system 
 $\textbf {X}= (X,T,\mu _{X})$
 induces a right action of G on
$\textbf {X}= (X,T,\mu _{X})$
 induces a right action of G on 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 defined by
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 defined by 
 $(T^{g} f)(x) = f(T^{g} x)$
. A function
$(T^{g} f)(x) = f(T^{g} x)$
. A function 
 $f\in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 is almost periodic if its orbit
$f\in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 is almost periodic if its orbit 
 $\{T^{g}f:g\in G\}$
 has compact closure in the strong topology of
$\{T^{g}f:g\in G\}$
 has compact closure in the strong topology of 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. We write
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. We write 
 $\mathsf {AP}(\textbf {X})$
 for the closed subspace of
$\mathsf {AP}(\textbf {X})$
 for the closed subspace of 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 spanned by almost periodic functions. A G system
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 spanned by almost periodic functions. A G system 
 $\textbf {X}$
 is almost periodic if
$\textbf {X}$
 is almost periodic if 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X}) = \mathsf {AP}(\textbf {X})$
. It follows from [Reference de Leeuw and GlicksbergLG61, Lemma 4.3] that
$\operatorname {\textrm {L}}^{2}(X,\mu _{X}) = \mathsf {AP}(\textbf {X})$
. It follows from [Reference de Leeuw and GlicksbergLG61, Lemma 4.3] that 
 $\mathsf {AP}(\textbf {X}) $
 coincides with the subspace of
$\mathsf {AP}(\textbf {X}) $
 coincides with the subspace of 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 spanned by finite-dimensional, T invariant subspaces of
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 spanned by finite-dimensional, T invariant subspaces of 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. There is a G invariant, countably generated sub-
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. There is a G invariant, countably generated sub-
 $\sigma $
-algebra
$\sigma $
-algebra 
 $\mathscr {A}$
 of the Borel
$\mathscr {A}$
 of the Borel 
 $\sigma $
-algebra of X such that
$\sigma $
-algebra of X such that 
 $\mathsf {AP}(\textbf {X}) = \operatorname {\textrm {L}}^{2}(X,\mathscr {A},\mu _{X})$
 (cf. [Reference Furstenberg and KatznelsonFK91, Lemma 3.1]). By [Reference ZimmerZim76b, Corollary 2.2] the
$\mathsf {AP}(\textbf {X}) = \operatorname {\textrm {L}}^{2}(X,\mathscr {A},\mu _{X})$
 (cf. [Reference Furstenberg and KatznelsonFK91, Lemma 3.1]). By [Reference ZimmerZim76b, Corollary 2.2] the 
 $\sigma $
-algebra
$\sigma $
-algebra 
 $\mathscr {A}$
 corresponds to a factor
$\mathscr {A}$
 corresponds to a factor 
 $\mathsf {K} \textbf {X}$
 of
$\mathsf {K} \textbf {X}$
 of 
 $\textbf {X}$
 called the Kronecker factor.
$\textbf {X}$
 called the Kronecker factor.
Proposition 2.1. For any two G systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
, we have
$\textbf {Z}$
, we have 
 $\mathsf {AP}( \textbf {X} \times \textbf {Z} ) = \mathsf {AP}(\textbf {X}) \otimes \mathsf {AP}(\textbf {Z})$
.
$\mathsf {AP}( \textbf {X} \times \textbf {Z} ) = \mathsf {AP}(\textbf {X}) \otimes \mathsf {AP}(\textbf {Z})$
.
Proof. Given a G system 
 $\textbf {Y}$
, denote by
$\textbf {Y}$
, denote by 
 $\mathsf {WM}(\textbf {Y})$
 the closure in
$\mathsf {WM}(\textbf {Y})$
 the closure in 
 $\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 of the collection of vectors f with the property that
$\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 of the collection of vectors f with the property that 
 $0$
 belongs to the weak closure of
$0$
 belongs to the weak closure of 
 $\{ T^{g} f : g \in G \}$
. By [Reference de Leeuw and GlicksbergLG61, Corollary 4.12] one can write
$\{ T^{g} f : g \in G \}$
. By [Reference de Leeuw and GlicksbergLG61, Corollary 4.12] one can write 
 $\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 as the direct sum
$\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 as the direct sum 
 $\mathsf {AP}(\textbf {Y}) \oplus \mathsf {WM}(\textbf {Y})$
.
$\mathsf {AP}(\textbf {Y}) \oplus \mathsf {WM}(\textbf {Y})$
.
 Fix now G systems 
 $\textbf {X} = (X,T,\mu _{X})$
 and
$\textbf {X} = (X,T,\mu _{X})$
 and 
 $\textbf {Z} = (Z,R,\mu _{Z})$
. We have
$\textbf {Z} = (Z,R,\mu _{Z})$
. We have 
 $$ \begin{align*} \operatorname{\textrm{L}}^{2}(X \times Z, \mu_{X} \otimes \mu_{Z})) &= (\mathsf{AP}(\textbf{X}) \otimes \mathsf{AP}(\textbf{Z})) \oplus (\mathsf{WM}(\textbf{X}) \otimes \mathsf{AP}(\textbf{Z}))\\ &\quad\oplus (\mathsf{AP}(\textbf{X}) \otimes \mathsf{WM}(\textbf{Z})) \oplus (\mathsf{WM}(\textbf{X}) \otimes \mathsf{WM}(\textbf{Z})) \end{align*} $$
$$ \begin{align*} \operatorname{\textrm{L}}^{2}(X \times Z, \mu_{X} \otimes \mu_{Z})) &= (\mathsf{AP}(\textbf{X}) \otimes \mathsf{AP}(\textbf{Z})) \oplus (\mathsf{WM}(\textbf{X}) \otimes \mathsf{AP}(\textbf{Z}))\\ &\quad\oplus (\mathsf{AP}(\textbf{X}) \otimes \mathsf{WM}(\textbf{Z})) \oplus (\mathsf{WM}(\textbf{X}) \otimes \mathsf{WM}(\textbf{Z})) \end{align*} $$
holds. Certainly 
 $\mathsf {AP}(\textbf {X}) \otimes \mathsf {AP}(\textbf {Z}) \subset \mathsf {AP}(\textbf {X} \times \textbf {Z})$
. If f belongs to
$\mathsf {AP}(\textbf {X}) \otimes \mathsf {AP}(\textbf {Z}) \subset \mathsf {AP}(\textbf {X} \times \textbf {Z})$
. If f belongs to 
 $\mathsf {WM}(\textbf {X})$
, then
$\mathsf {WM}(\textbf {X})$
, then 
 $f \otimes g$
 belongs to
$f \otimes g$
 belongs to 
 $\mathsf {WM}(\textbf {X} \times \textbf {Z})$
 for every g in
$\mathsf {WM}(\textbf {X} \times \textbf {Z})$
 for every g in 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. Similarly, if g belongs to
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. Similarly, if g belongs to 
 $\mathsf {WM}(\textbf {Z})$
, then
$\mathsf {WM}(\textbf {Z})$
, then 
 $f \otimes g$
 belongs to
$f \otimes g$
 belongs to 
 $\mathsf {WM}(\textbf {X} \times \textbf {Z})$
 for every f in
$\mathsf {WM}(\textbf {X} \times \textbf {Z})$
 for every f in 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. This gives
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. This gives 
 $$ \begin{align*} (\mathsf{WM}(\textbf{X}) \otimes \mathsf{AP}(\textbf{Z})) \oplus (\mathsf{AP}(\textbf{X}) \otimes \mathsf{WM}(\textbf{Z})) \oplus (\mathsf{WM}(\textbf{X}) \otimes \mathsf{WM}(\textbf{Z})) \subset \mathsf{WM}(\textbf{X} \otimes \textbf{Z}) \end{align*} $$
$$ \begin{align*} (\mathsf{WM}(\textbf{X}) \otimes \mathsf{AP}(\textbf{Z})) \oplus (\mathsf{AP}(\textbf{X}) \otimes \mathsf{WM}(\textbf{Z})) \oplus (\mathsf{WM}(\textbf{X}) \otimes \mathsf{WM}(\textbf{Z})) \subset \mathsf{WM}(\textbf{X} \otimes \textbf{Z}) \end{align*} $$
and the result follows.
 When 
 $\textbf {X}$
 is ergodic [Reference MackeyMac64, Theorem 1] allows us to model the system
$\textbf {X}$
 is ergodic [Reference MackeyMac64, Theorem 1] allows us to model the system 
 $\mathsf {K} \textbf {X}$
 as a homogeneous space
$\mathsf {K} \textbf {X}$
 as a homogeneous space 
 $K/H$
 where K is a compact group and H is a closed subgroup with the action of G on
$K/H$
 where K is a compact group and H is a closed subgroup with the action of G on 
 $K/H$
 given by a homomorphism
$K/H$
 given by a homomorphism 
 $G \to K$
 with dense image. Write
$G \to K$
 with dense image. Write 
 $\mathsf {K} X$
 for the underlying space of any such modeling of
$\mathsf {K} X$
 for the underlying space of any such modeling of 
 $\mathsf {K} \textbf {X}$
. For ergodic
$\mathsf {K} \textbf {X}$
. For ergodic 
 $\mathbb {Z}$
 systems the Kronecker factor can be explicitly described by the system’s discrete spectrum.
$\mathbb {Z}$
 systems the Kronecker factor can be explicitly described by the system’s discrete spectrum.
Definition 2.2. Let 
 $\textbf {X}=(X,T,\mu _{X})$
 be a
$\textbf {X}=(X,T,\mu _{X})$
 be a 
 $\mathbb {Z}$
 system. The discrete spectrum of
$\mathbb {Z}$
 system. The discrete spectrum of 
 $\textbf {X}$
, denoted by
$\textbf {X}$
, denoted by 
 $\mathsf {Eig}(\textbf {X})$
, is defined to be the set of all eigenvalues of T when viewed as a unitary operator
$\mathsf {Eig}(\textbf {X})$
, is defined to be the set of all eigenvalues of T when viewed as a unitary operator 
 $T:\operatorname {\textrm {L}}^{2}(X,\mu _{X}) \to \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
,
$T:\operatorname {\textrm {L}}^{2}(X,\mu _{X}) \to \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
, 
 $$ \begin{align*} \mathsf{Eig}(\textbf{X})=\{\zeta\in\mathbb{C}:\text{there exists } f\in\operatorname{\textrm{L}}^{2}(X,\mu_{X}) \text{ with }f\neq 0 \text{ and } Tf=\zeta f\}. \end{align*} $$
$$ \begin{align*} \mathsf{Eig}(\textbf{X})=\{\zeta\in\mathbb{C}:\text{there exists } f\in\operatorname{\textrm{L}}^{2}(X,\mu_{X}) \text{ with }f\neq 0 \text{ and } Tf=\zeta f\}. \end{align*} $$
 Given two ergodic G systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 we can (using the Bohr compactification of G, for instance) assume that
$\textbf {Z}$
 we can (using the Bohr compactification of G, for instance) assume that 
 $\mathsf {K} \textbf {X}$
 and
$\mathsf {K} \textbf {X}$
 and 
 $\mathsf {K} \textbf {Z}$
 are modeled by homogeneous spaces of the same compact topological group K. That is, we may assume
$\mathsf {K} \textbf {Z}$
 are modeled by homogeneous spaces of the same compact topological group K. That is, we may assume 
 $\mathsf {K} X = K/H_{\textbf {X}}$
 and
$\mathsf {K} X = K/H_{\textbf {X}}$
 and 
 $\mathsf {K} Z = K/H_{\textbf {Z}}$
 for some compact topological group K and closed subgroups
$\mathsf {K} Z = K/H_{\textbf {Z}}$
 for some compact topological group K and closed subgroups 
 $H_{\textbf {X}}$
,
$H_{\textbf {X}}$
, 
 $H_{\textbf {Z}}$
 thereof, the action of G on both spaces determined by a homomorphism
$H_{\textbf {Z}}$
 thereof, the action of G on both spaces determined by a homomorphism 
 $G \to K$
 with dense image. The system
$G \to K$
 with dense image. The system 
 $K/ \langle H_{\textbf {X}},H_{\textbf {Z}} \rangle $
 is a factor of both
$K/ \langle H_{\textbf {X}},H_{\textbf {Z}} \rangle $
 is a factor of both 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 and serves as a model for
$\textbf {Z}$
 and serves as a model for 
 $\mathsf {K}(\textbf {X},\textbf {Z})$
, their joint Kronecker factor by which we mean the largest almost periodic factor of both
$\mathsf {K}(\textbf {X},\textbf {Z})$
, their joint Kronecker factor by which we mean the largest almost periodic factor of both 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
.
$\textbf {Z}$
.
Proposition 2.3. (Cf. [Reference FurstenbergFur81, Lemma 4.18])
 If 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are ergodic G systems and
$\textbf {Z}$
 are ergodic G systems and 
 $\textbf {X} \times \textbf {Z}$
 is not ergodic, then
$\textbf {X} \times \textbf {Z}$
 is not ergodic, then 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
 contain isomorphic, finite-dimensional, invariant subspaces of non-constant functions.
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
 contain isomorphic, finite-dimensional, invariant subspaces of non-constant functions.
Proof. If 
 $\textbf {X} \times \textbf {Z}$
 is not ergodic, then there is a non-constant function f in
$\textbf {X} \times \textbf {Z}$
 is not ergodic, then there is a non-constant function f in 
 $\operatorname {\textrm {L}}^{2}(X \times Z, \mu _{X} \otimes \mu _{Z})$
 that is
$\operatorname {\textrm {L}}^{2}(X \times Z, \mu _{X} \otimes \mu _{Z})$
 that is 
 $T \times R$
 invariant. Therefore,
$T \times R$
 invariant. Therefore, 
 $\langle {f}, {\phi } \rangle = \langle {f}, {(T \times R)^{g} \phi } \rangle $
 for all
$\langle {f}, {\phi } \rangle = \langle {f}, {(T \times R)^{g} \phi } \rangle $
 for all 
 $\phi \in \operatorname {\textrm {L}}^{2}(X\times Z,\mu _{X}\otimes \mu _{Z})$
 and
$\phi \in \operatorname {\textrm {L}}^{2}(X\times Z,\mu _{X}\otimes \mu _{Z})$
 and 
 $g \in G$
. In particular, if
$g \in G$
. In particular, if 
 $\phi \in \mathsf {WM}(\textbf {X} \times \textbf {Z})$
, then
$\phi \in \mathsf {WM}(\textbf {X} \times \textbf {Z})$
, then 
 $\langle {f}, {\phi } \rangle = 0$
. It follows that f belongs to
$\langle {f}, {\phi } \rangle = 0$
. It follows that f belongs to 
 $\mathsf {AP}(\textbf {X} \times \textbf {Z})$
 and, therefore (by Proposition 2.1), to
$\mathsf {AP}(\textbf {X} \times \textbf {Z})$
 and, therefore (by Proposition 2.1), to 
 $\mathsf {AP}(\textbf {X}) \otimes \mathsf {AP}(\textbf {Z})$
. Now
$\mathsf {AP}(\textbf {X}) \otimes \mathsf {AP}(\textbf {Z})$
. Now 
 $\mathsf {AP}(\textbf {X})$
 and
$\mathsf {AP}(\textbf {X})$
 and 
 $\mathsf {AP}(\textbf {Z})$
 are spanned by finite-dimensional, invariant subspaces of
$\mathsf {AP}(\textbf {Z})$
 are spanned by finite-dimensional, invariant subspaces of 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
, respectively, thus
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
, respectively, thus 
 $$ \begin{align*} f = \sum_{\iota,\kappa} g_{\iota} \otimes h_{\kappa} \end{align*} $$
$$ \begin{align*} f = \sum_{\iota,\kappa} g_{\iota} \otimes h_{\kappa} \end{align*} $$
where 
 $\iota $
 and
$\iota $
 and 
 $\kappa $
 enumerate the finite-dimensional subrepresentations of
$\kappa $
 enumerate the finite-dimensional subrepresentations of 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
, respectively, and
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
, respectively, and 
 $g_{\iota } \otimes h_{\kappa }$
 is the projection of f on the corresponding subrepresentation of
$g_{\iota } \otimes h_{\kappa }$
 is the projection of f on the corresponding subrepresentation of 
 $\operatorname {\textrm {L}}^{2}(X \times Z, \mu _{X} \otimes \mu _{Z})$
. The fact that f is invariant implies only terms of the form
$\operatorname {\textrm {L}}^{2}(X \times Z, \mu _{X} \otimes \mu _{Z})$
. The fact that f is invariant implies only terms of the form 
 $g_{\iota } \otimes h_{\iota ^{*}}$
 contribute to the sum, where
$g_{\iota } \otimes h_{\iota ^{*}}$
 contribute to the sum, where 
 $\iota ^{*}$
 denotes the contragradient of the representation
$\iota ^{*}$
 denotes the contragradient of the representation 
 $\iota $
. As f is non-constant there must be a non-trivial, finite-dimensional representation of G that appears as a subrepresentation of both
$\iota $
. As f is non-constant there must be a non-trivial, finite-dimensional representation of G that appears as a subrepresentation of both 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
.
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
.
2.4 Joinings
 Given G systems 
 $\textbf {X} = (X,T,\mu _{X})$
 and
$\textbf {X} = (X,T,\mu _{X})$
 and 
 $\textbf {Z} = (Z,R,\mu _{Z})$
, a measure
$\textbf {Z} = (Z,R,\mu _{Z})$
, a measure 
 $\unicode{x3bb} $
 on
$\unicode{x3bb} $
 on 
 $X \times Z$
 is a joining of
$X \times Z$
 is a joining of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 if
$\textbf {Z}$
 if 
 $\unicode{x3bb} $
 is invariant under the diagonal action
$\unicode{x3bb} $
 is invariant under the diagonal action 
 $T \times R$
 and the pushforwards of
$T \times R$
 and the pushforwards of 
 $\unicode{x3bb} $
 under the two coordinate projection maps
$\unicode{x3bb} $
 under the two coordinate projection maps 
 $\pi _{\textbf {X}}:X\times Z\to X$
 and
$\pi _{\textbf {X}}:X\times Z\to X$
 and 
 $\pi _{\textbf {Z}} : X\times Z\to Z$
 satisfy
$\pi _{\textbf {Z}} : X\times Z\to Z$
 satisfy 
 $\pi _{\textbf {X}}(\unicode{x3bb} ) = \mu _{X}$
 and
$\pi _{\textbf {X}}(\unicode{x3bb} ) = \mu _{X}$
 and 
 $\pi _{\textbf {Z}}(\unicode{x3bb} )=\mu _{Z}$
. We write
$\pi _{\textbf {Z}}(\unicode{x3bb} )=\mu _{Z}$
. We write 
 $\mathcal {J}(\textbf {X},\textbf {Z})$
 for the set of all joinings of
$\mathcal {J}(\textbf {X},\textbf {Z})$
 for the set of all joinings of 
 $\textbf {X}$
 with
$\textbf {X}$
 with 
 $\textbf {Z}$
 and
$\textbf {Z}$
 and 
 $\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Z})$
 for the set of joinings of
$\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Z})$
 for the set of joinings of 
 $\textbf {X}$
 with
$\textbf {X}$
 with 
 $\textbf {Z}$
 that are ergodic. The product
$\textbf {Z}$
 that are ergodic. The product 
 $\mu _{X} \otimes \mu _{Z}$
 is always a joining of
$\mu _{X} \otimes \mu _{Z}$
 is always a joining of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
. When
$\textbf {Z}$
. When 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are ergodic, the set
$\textbf {Z}$
 are ergodic, the set 
 $\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Z})$
 is always non-empty because the measures in the ergodic decomposition of
$\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Z})$
 is always non-empty because the measures in the ergodic decomposition of 
 $\mu _{X} \otimes \mu _{Z}$
 can be shown to be ergodic joinings of
$\mu _{X} \otimes \mu _{Z}$
 can be shown to be ergodic joinings of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
.
$\textbf {Z}$
.
 One says that G systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are disjoint if
$\textbf {Z}$
 are disjoint if 
 $\mu _{X} \otimes \mu _{Z}$
 is their only joining. We say that G systems
$\mu _{X} \otimes \mu _{Z}$
 is their only joining. We say that G systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are Kronecker disjoint if their Kronecker factors
$\textbf {Z}$
 are Kronecker disjoint if their Kronecker factors 
 $\mathsf {K} \textbf {X}$
 and
$\mathsf {K} \textbf {X}$
 and 
 $\mathsf {K} \textbf {Z}$
 are disjoint.
$\mathsf {K} \textbf {Z}$
 are disjoint.
2.5 Distal systems
 Given a G system 
 $\textbf {X} = (X,T,\mu _{X})$
 denote by
$\textbf {X} = (X,T,\mu _{X})$
 denote by 
 $\mathsf {Aut}(\textbf {X})$
 the group of invertible, measurable, measure-preserving maps on
$\mathsf {Aut}(\textbf {X})$
 the group of invertible, measurable, measure-preserving maps on 
 $(X,\mu _{X})$
 that commute with T, where two automorphisms are identified if they coincide
$(X,\mu _{X})$
 that commute with T, where two automorphisms are identified if they coincide 
 $\mu _{X}$
 almost everywhere. As X is a compact metric space, the group
$\mu _{X}$
 almost everywhere. As X is a compact metric space, the group 
 $\mathsf {Aut}(\textbf {X})$
 is metrizable and as such becomes a Polish topological group. Given a compact subgroup L of
$\mathsf {Aut}(\textbf {X})$
 is metrizable and as such becomes a Polish topological group. Given a compact subgroup L of 
 $\mathsf {Aut}(\textbf {X})$
, the associated sub-
$\mathsf {Aut}(\textbf {X})$
, the associated sub-
 $\sigma $
-algebra of L invariant sets determines a factor of
$\sigma $
-algebra of L invariant sets determines a factor of 
 $\textbf {X}$
. One says that a G system
$\textbf {X}$
. One says that a G system 
 $\textbf {X}$
 is a group extension of a G system
$\textbf {X}$
 is a group extension of a G system 
 $\textbf {Y}$
 if
$\textbf {Y}$
 if 
 $\textbf {Y}$
 is (isomorphic to) a factor of
$\textbf {Y}$
 is (isomorphic to) a factor of 
 $\textbf {X}$
 via a compact subgroup of
$\textbf {X}$
 via a compact subgroup of 
 $\mathsf {Aut}(\textbf {X})$
 in the above fashion. As stated in the introduction, a G system is measurably distal if it belongs to the smallest class of G systems that contains the trivial one-point system and is closed under group extensions, factor maps, and inverse limits.
$\mathsf {Aut}(\textbf {X})$
 in the above fashion. As stated in the introduction, a G system is measurably distal if it belongs to the smallest class of G systems that contains the trivial one-point system and is closed under group extensions, factor maps, and inverse limits.
 We recall that a topological G system 
 $(X,T)$
 is topologically distal if
$(X,T)$
 is topologically distal if 
 $$ \begin{align*} \inf \{ \mathsf{d}(T^{g} x, T^{g} x^{\prime}) : g \in G \}> 0 \end{align*} $$
$$ \begin{align*} \inf \{ \mathsf{d}(T^{g} x, T^{g} x^{\prime}) : g \in G \}> 0 \end{align*} $$
for all 
 $x \ne x^{\prime }$
 in X. In the case
$x \ne x^{\prime }$
 in X. In the case 
 $G = \mathbb {Z}$
 Parry [Reference Parry, Auslander and GottschalkPar68] modified this definition to apply to measure-preserving actions. We now recall Zimmer’s generalization [Reference ZimmerZim76a, Definition 8.5] of Parry’s definition to actions of countable groups. Given a G system
$G = \mathbb {Z}$
 Parry [Reference Parry, Auslander and GottschalkPar68] modified this definition to apply to measure-preserving actions. We now recall Zimmer’s generalization [Reference ZimmerZim76a, Definition 8.5] of Parry’s definition to actions of countable groups. Given a G system 
 $\textbf {X} = (X,T,\mu _{X})$
 a sequence
$\textbf {X} = (X,T,\mu _{X})$
 a sequence 
 $n \mapsto A_{n}$
 of Borel subsets of X with
$n \mapsto A_{n}$
 of Borel subsets of X with 
 $\mu _{X}(A_{n})> 0$
 and
$\mu _{X}(A_{n})> 0$
 and 
 $\mu _{X}(A_{n}) \to 0$
 is called a separating sieve if there is a conull set
$\mu _{X}(A_{n}) \to 0$
 is called a separating sieve if there is a conull set 
 $X^{\prime } \subset X$
 such that, whenever
$X^{\prime } \subset X$
 such that, whenever 
 $x,x^{\prime } \in X^{\prime }$
 and, for each
$x,x^{\prime } \in X^{\prime }$
 and, for each 
 $n \in \mathbb {N}$
, one can find
$n \in \mathbb {N}$
, one can find 
 $g_{n} \in G$
 with
$g_{n} \in G$
 with 
 $\{ T^{g_{n}} x, T^{g_{n}} x^{\prime } \} \subset A_{n}$
, one has
$\{ T^{g_{n}} x, T^{g_{n}} x^{\prime } \} \subset A_{n}$
, one has 
 $x = x^{\prime }$
. Zimmer [Reference ZimmerZim76a, Theorem 8.7] proved that a non-atomic G system
$x = x^{\prime }$
. Zimmer [Reference ZimmerZim76a, Theorem 8.7] proved that a non-atomic G system 
 $\textbf {X}$
 is measurably distal if and only if it has a separating sieve. Using this characterization of distality, one can show that every topological G system
$\textbf {X}$
 is measurably distal if and only if it has a separating sieve. Using this characterization of distality, one can show that every topological G system 
 $(X,T)$
 that is topologically distal has the property that for every T invariant Borel probability measure
$(X,T)$
 that is topologically distal has the property that for every T invariant Borel probability measure 
 $\mu _{X}$
 on X the G system
$\mu _{X}$
 on X the G system 
 $(X,T,\mu _{X})$
 is measurably distal. Lindenstrauss [Reference LindenstraussLin99] has proved a partial converse to this result for
$(X,T,\mu _{X})$
 is measurably distal. Lindenstrauss [Reference LindenstraussLin99] has proved a partial converse to this result for 
 $\mathbb {Z}$
 systems by showing that every measurably distal
$\mathbb {Z}$
 systems by showing that every measurably distal 
 $\mathbb {Z}$
 system can be modeled by a topologically distal
$\mathbb {Z}$
 system can be modeled by a topologically distal 
 $\mathbb {Z}$
 system equipped with an invariant Borel probability measure.
$\mathbb {Z}$
 system equipped with an invariant Borel probability measure.
3 Proof of Theorem 1.3
 In this section, we prove Theorem 1.3. It follows a preparatory discussion parameterizing the space of joinings of ergodic, almost-periodic 
 $\mathbb {Z}$
 systems and describing the ergodic decomposition of the product of two such
$\mathbb {Z}$
 systems and describing the ergodic decomposition of the product of two such 
 $\mathbb {Z}$
 systems.
$\mathbb {Z}$
 systems.
 As described in §2.3, the Kronecker factor of an ergodic 
 $\mathbb {Z}$
 system can be modeled as an ergodic rotation on a compact abelian group. Given two ergodic rotations on compact, abelian groups
$\mathbb {Z}$
 system can be modeled as an ergodic rotation on a compact abelian group. Given two ergodic rotations on compact, abelian groups 
 ${\textbf X}=(X,T,\mu _{X})$
 and
${\textbf X}=(X,T,\mu _{X})$
 and 
 ${\textbf Y}=(Y,S,\mu _{Y})$
, their ergodic joinings can be easily described as follows: let
${\textbf Y}=(Y,S,\mu _{Y})$
, their ergodic joinings can be easily described as follows: let 
 $e_{X}$
 be the identity of the compact abelian group X, let
$e_{X}$
 be the identity of the compact abelian group X, let 
 $e_{Y}$
 be the identity of Y, and let H be the subgroup
$e_{Y}$
 be the identity of Y, and let H be the subgroup 
 $$ \begin{align*} H = \overline{\{(T\times S)^{n}(e_{X},e_{Y}):n\in\mathbb{Z}\}} \end{align*} $$
$$ \begin{align*} H = \overline{\{(T\times S)^{n}(e_{X},e_{Y}):n\in\mathbb{Z}\}} \end{align*} $$
of 
 $X\times Y$
. Given an ergodic joining
$X\times Y$
. Given an ergodic joining 
 $\unicode{x3bb} \in \mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Y})$
, let
$\unicode{x3bb} \in \mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Y})$
, let 
 $(x_{0},y_{0})\in X\times Y$
 be a generic point. The support of
$(x_{0},y_{0})\in X\times Y$
 be a generic point. The support of 
 $\unicode{x3bb} $
 is the orbit closure of
$\unicode{x3bb} $
 is the orbit closure of 
 $(x_{0},y_{0})$
, which is
$(x_{0},y_{0})$
, which is 
 $(x_{0},y_{0})+H$
. Hence, the pushforward of
$(x_{0},y_{0})+H$
. Hence, the pushforward of 
 $\unicode{x3bb} $
 under the map
$\unicode{x3bb} $
 under the map 
 $(x,y)\mapsto (x-x_{0},y-y_{0})$
 is a measure on H invariant under
$(x,y)\mapsto (x-x_{0},y-y_{0})$
 is a measure on H invariant under 
 $T\times S$
 and, hence, must be the Haar measure on H. It follows that
$T\times S$
 and, hence, must be the Haar measure on H. It follows that 
 $\unicode{x3bb} $
 is the Haar measure on the coset
$\unicode{x3bb} $
 is the Haar measure on the coset 
 $(x_{0},y_{0})+H$
.
$(x_{0},y_{0})+H$
.
 Now let 
 $K = (X\times Y)/H$
 be the group of cosets of H and write
$K = (X\times Y)/H$
 be the group of cosets of H and write 
 $\mu _{K}$
 for Haar measure on K. Define a map
$\mu _{K}$
 for Haar measure on K. Define a map 
 $R : K \to K$
 by
$R : K \to K$
 by 
 $R:(x,y)+H\mapsto (Tx,y)+H=(x,S^{-1}y)+H$
. Let
$R:(x,y)+H\mapsto (Tx,y)+H=(x,S^{-1}y)+H$
. Let 
 $\alpha :X\to K$
 be defined by
$\alpha :X\to K$
 be defined by 
 $\alpha (x)=(x,e_{Y})+H$
 and let
$\alpha (x)=(x,e_{Y})+H$
 and let 
 $\beta :Y\to K$
 be defined by
$\beta :Y\to K$
 be defined by 
 $\beta (y)=(e_{X},-y)+H$
. It is easy to check that both
$\beta (y)=(e_{X},-y)+H$
. It is easy to check that both 
 $\alpha $
 and
$\alpha $
 and 
 $\beta $
 are factor maps onto
$\beta $
 are factor maps onto 
 $(K,R)$
 and, hence, either
$(K,R)$
 and, hence, either 
 ${\textbf X}\times {\textbf Y}$
 is ergodic (and, thus,
${\textbf X}\times {\textbf Y}$
 is ergodic (and, thus, 
 $K=\{\operatorname {\textrm {id}}\}$
), or
$K=\{\operatorname {\textrm {id}}\}$
), or 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 share a non-trivial common factor.
$\textbf {Y}$
 share a non-trivial common factor.
 Note that 
 $\gamma (x,y)=(x,y)+H$
 from
$\gamma (x,y)=(x,y)+H$
 from 
 ${\textbf X}\times {\textbf Y}$
 to K is the maximal invariant factor (because every invariant function is constant along cosets of H). In fact,
${\textbf X}\times {\textbf Y}$
 to K is the maximal invariant factor (because every invariant function is constant along cosets of H). In fact, 
 $(K,R)$
 is the maximal common factor of
$(K,R)$
 is the maximal common factor of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
.
$\textbf {Y}$
.
 As 
 $\gamma ^{-1}((x,y)+H)=(x,y)+H$
, there exists exactly one ergodic joining living in that pre-image, namely, the Haar measure. For each
$\gamma ^{-1}((x,y)+H)=(x,y)+H$
, there exists exactly one ergodic joining living in that pre-image, namely, the Haar measure. For each 
 $k\in K$
, let
$k\in K$
, let 
 $\unicode{x3bb} _{k}\in \mathcal {J}_{\mathsf {e}}({\textbf X},{\textbf Y})$
 be the unique joining such that
$\unicode{x3bb} _{k}\in \mathcal {J}_{\mathsf {e}}({\textbf X},{\textbf Y})$
 be the unique joining such that 
 $\unicode{x3bb} _{k}(\gamma ^{-1}(k))=1$
. Observe that
$\unicode{x3bb} _{k}(\gamma ^{-1}(k))=1$
. Observe that 

is, therefore, the ergodic decomposition of 
 $\mu _{X} \otimes \mu _{Y}$
.
$\mu _{X} \otimes \mu _{Y}$
.
Proof of Theorem 1.3
 First, suppose that 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 are quasi-disjoint. Let
$\textbf {Y}$
 are quasi-disjoint. Let 
 $(K,\mu _{K})$
 be a model for their joint Kronecker factor and let
$(K,\mu _{K})$
 be a model for their joint Kronecker factor and let 
 $k \mapsto \unicode{x3bb} _{k}$
 be a measurable map from K into the space
$k \mapsto \unicode{x3bb} _{k}$
 be a measurable map from K into the space 
 $\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Y})$
 of ergodic joinings of
$\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Y})$
 of ergodic joinings of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 such that
$\textbf {Y}$
 such that 
 $\unicode{x3bb} _{k}$
 gives full measure to
$\unicode{x3bb} _{k}$
 gives full measure to 
 $\gamma ^{-1}(k)$
 for almost every k. Then
$\gamma ^{-1}(k)$
 for almost every k. Then 

is a joining of 
 ${\textbf X}$
 and
${\textbf X}$
 and 
 ${\textbf Y}$
. We claim that the projection of
${\textbf Y}$
. We claim that the projection of 
 $\eta $
 to
$\eta $
 to 
 $\mathsf {K} \textbf {X} \times \mathsf {K} \textbf {Y}$
 is the product measure. Indeed, let
$\mathsf {K} \textbf {X} \times \mathsf {K} \textbf {Y}$
 is the product measure. Indeed, let 
 $\pi :{\textbf X}\times {\textbf Y}\to \mathsf {K}{\textbf X}\times \mathsf {K}{\textbf Y}$
 be the corresponding factor map. We can decompose
$\pi :{\textbf X}\times {\textbf Y}\to \mathsf {K}{\textbf X}\times \mathsf {K}{\textbf Y}$
 be the corresponding factor map. We can decompose 
 $\gamma =\tilde \gamma \circ \pi $
 for some
$\gamma =\tilde \gamma \circ \pi $
 for some 
 $\tilde \gamma :\mathsf {K} X\times \mathsf {K} Y\to K$
. As
$\tilde \gamma :\mathsf {K} X\times \mathsf {K} Y\to K$
. As 
 $\pi (\unicode{x3bb} _{k})$
 gives full measure to
$\pi (\unicode{x3bb} _{k})$
 gives full measure to 
 $\tilde \gamma ^{-1}(k)$
, the discussion preceding this proof implies it is uniquely determined by k. In particular, in view of equation 2, we have
$\tilde \gamma ^{-1}(k)$
, the discussion preceding this proof implies it is uniquely determined by k. In particular, in view of equation 2, we have 

as claimed. However, then by quasi-disjointness 
 $\eta = \mu _{\textbf X} \otimes \mu _{\textbf Y}$
. By uniqueness of the ergodic disintegration, the map
$\eta = \mu _{\textbf X} \otimes \mu _{\textbf Y}$
. By uniqueness of the ergodic disintegration, the map 
 $k\to \unicode{x3bb} _{k}$
 is uniquely defined almost everywhere, so
$k\to \unicode{x3bb} _{k}$
 is uniquely defined almost everywhere, so 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 satisfy (BQD).
$\textbf {Y}$
 satisfy (BQD).
 Conversely, suppose that 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 satisfy (BQD). Let
$\textbf {Y}$
 satisfy (BQD). Let 
 $\eta $
 be a joining of
$\eta $
 be a joining of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 that projects to the product measure on
$\textbf {Y}$
 that projects to the product measure on 
 $\mathsf {K} \textbf {X} \times \mathsf {K} \textbf {Y}$
. Then
$\mathsf {K} \textbf {X} \times \mathsf {K} \textbf {Y}$
. Then 
 $\gamma \eta $
 is the Haar measure on the maximal common Kronecker factor
$\gamma \eta $
 is the Haar measure on the maximal common Kronecker factor 
 $\mathsf {K}(\textbf {X},\textbf {Y})$
 of
$\mathsf {K}(\textbf {X},\textbf {Y})$
 of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
. The ergodic disintegration of
$\textbf {Y}$
. The ergodic disintegration of 
 $\eta $
 is the same as the disintegration of
$\eta $
 is the same as the disintegration of 
 $\eta $
 over
$\eta $
 over 
 $\mathsf {K}(\textbf {X},\textbf {Y})$
. By (BQD), this disintegration is, in turn, the same as the disintegration of
$\mathsf {K}(\textbf {X},\textbf {Y})$
. By (BQD), this disintegration is, in turn, the same as the disintegration of 
 $\mu \otimes \nu $
 over
$\mu \otimes \nu $
 over 
 $\mathsf {K}(\textbf {X},\textbf {Y})$
. Therefore,
$\mathsf {K}(\textbf {X},\textbf {Y})$
. Therefore, 
 $\eta = \mu \otimes \nu $
.
$\eta = \mu \otimes \nu $
.
4 Quasi-disjointness for measurably distal systems
 In this section, we give a proof of Theorem 1.4. The proof comprises three parts, covered in the following three subsections. The first part consists of showing that quasi-disjointness lifts through group extensions. The second part proves that quasi-disjointness is preserved when passing to a factor and the third part consists of showing that quasi-disjointness is preserved by inverse limits. As, starting from the trivial system, such operations exhaust the class of distal systems, these three parts combined indeed yield a complete proof of Theorem 1.4. We conclude this section with an example of a 
 $\mathbb {Z}$
 system that is not measurably distal but is quasi-disjoint from every ergodic system.
$\mathbb {Z}$
 system that is not measurably distal but is quasi-disjoint from every ergodic system.
4.1 Quasi-disjointness lifts through group extensions
The purpose of this subsection is to prove the following result.
Theorem 4.1. Let 
 ${\textbf X}=(X,T,\mu _{X})$
,
${\textbf X}=(X,T,\mu _{X})$
, 
 ${\textbf Y}=(Y,S,\mu _{Y})$
, and
${\textbf Y}=(Y,S,\mu _{Y})$
, and 
 ${\textbf Z}=(Z,R,\mu _{Z})$
 be G systems and assume that
${\textbf Z}=(Z,R,\mu _{Z})$
 be G systems and assume that 
 ${\textbf X}$
 is a group extension of
${\textbf X}$
 is a group extension of 
 ${\textbf Y}$
. If
${\textbf Y}$
. If 
 ${\textbf Y}$
 is quasi-disjoint from
${\textbf Y}$
 is quasi-disjoint from 
 ${\textbf Z}$
, then so is
${\textbf Z}$
, then so is 
 ${\textbf X}$
.
${\textbf X}$
.
For the proof of Theorem 4.1, we borrow ideas from the proof of [Reference FurstenbergFur67, Theorem 1.4]. See [Reference GlasnerGla03, Theorem 3.30] for a version of that result applicable to actions of countable groups.
Proof of Theorem 4.1
 Let 
 $\pi _{\textbf {X},\mathsf {K}\textbf {X}}$
 and
$\pi _{\textbf {X},\mathsf {K}\textbf {X}}$
 and 
 $\pi _{\textbf {Z},\mathsf {K}\textbf {Z}}$
 denote the projection maps from
$\pi _{\textbf {Z},\mathsf {K}\textbf {Z}}$
 denote the projection maps from 
 $\textbf {X}$
 onto
$\textbf {X}$
 onto 
 $\mathsf {K}\textbf {X}$
 and from
$\mathsf {K}\textbf {X}$
 and from 
 $\textbf {Z}$
 onto
$\textbf {Z}$
 onto 
 $\mathsf {K}\textbf {Z}$
, respectively. Let
$\mathsf {K}\textbf {Z}$
, respectively. Let 
 $\unicode{x3bb} \in \mathcal {J}(\textbf {X},\textbf {Z})$
 be a joining of
$\unicode{x3bb} \in \mathcal {J}(\textbf {X},\textbf {Z})$
 be a joining of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 with the property that
$\textbf {Z}$
 with the property that 
 $(\pi _{\textbf {X},\mathsf {K}\textbf {X}}\times \pi _{\textbf {Z},\mathsf {K}\textbf {Z}})(\unicode{x3bb} )=\mu _{\mathsf {K} X}\otimes \mu _{\mathsf {K} Z}$
. We want to show that
$(\pi _{\textbf {X},\mathsf {K}\textbf {X}}\times \pi _{\textbf {Z},\mathsf {K}\textbf {Z}})(\unicode{x3bb} )=\mu _{\mathsf {K} X}\otimes \mu _{\mathsf {K} Z}$
. We want to show that 
 $\unicode{x3bb} =\mu _{X}\otimes \mu _{Z}$
.
$\unicode{x3bb} =\mu _{X}\otimes \mu _{Z}$
.
 Let 
 $\pi _{\textbf {X},\textbf {Y}}$
 denote the factor map from
$\pi _{\textbf {X},\textbf {Y}}$
 denote the factor map from 
 $\textbf {X}$
 onto
$\textbf {X}$
 onto 
 $\textbf {Y}$
. Note that
$\textbf {Y}$
. Note that 
 ${(\pi _{\textbf {X},\textbf {Y}}\times \text {id}_{\textbf {Z}})}(\unicode{x3bb} )$
 is a joining of
${(\pi _{\textbf {X},\textbf {Y}}\times \text {id}_{\textbf {Z}})}(\unicode{x3bb} )$
 is a joining of 
 $\textbf {Y}$
 with
$\textbf {Y}$
 with 
 $\textbf {Z}$
 whose projection onto the product of the Kronecker factors
$\textbf {Z}$
 whose projection onto the product of the Kronecker factors 
 $\mathsf {K}\textbf {Y}\times \mathsf {K}\textbf {Z}$
 equals
$\mathsf {K}\textbf {Y}\times \mathsf {K}\textbf {Z}$
 equals 
 $\mu _{KY}\otimes \mu _{KZ}$
. As
$\mu _{KY}\otimes \mu _{KZ}$
. As 
 ${\textbf Y}$
 is quasi-disjoint from
${\textbf Y}$
 is quasi-disjoint from 
 ${\textbf Z}$
, we conclude that
${\textbf Z}$
, we conclude that 
 ${(\pi _{\textbf {X},\textbf {Y}}\times \text {id}_{\textbf {Z}})}(\unicode{x3bb} )=\mu _{Y}\otimes \mu _{Z}$
.
${(\pi _{\textbf {X},\textbf {Y}}\times \text {id}_{\textbf {Z}})}(\unicode{x3bb} )=\mu _{Y}\otimes \mu _{Z}$
.
 As 
 $\textbf {X}$
 is a group extension of
$\textbf {X}$
 is a group extension of 
 $\textbf {Y}$
, there exists a compact group
$\textbf {Y}$
, there exists a compact group 
 $L\leqslant \mathsf {Aut}(\textbf {X})$
 such that the factor
$L\leqslant \mathsf {Aut}(\textbf {X})$
 such that the factor 
 $\textbf {Y}$
 corresponds to the sub-
$\textbf {Y}$
 corresponds to the sub-
 $\sigma $
-algebra of L invariant subsets of
$\sigma $
-algebra of L invariant subsets of 
 $\textbf {X}$
 (cf. §2.5). Let
$\textbf {X}$
 (cf. §2.5). Let 
 $\mu _{L}$
 denote the normalized Haar measure on L and let
$\mu _{L}$
 denote the normalized Haar measure on L and let 
 $\textbf {L}=(L,Q,\mu _{L})$
, where Q is the action of L on itself by left multiplication. Given
$\textbf {L}=(L,Q,\mu _{L})$
, where Q is the action of L on itself by left multiplication. Given 
 $\psi \in \operatorname {\textrm {L}}^{\infty }(L,\mu _{L})$
 with
$\psi \in \operatorname {\textrm {L}}^{\infty }(L,\mu _{L})$
 with 
 $\psi \geqslant 0$
 we define a new joining
$\psi \geqslant 0$
 we define a new joining 
 $\unicode{x3bb} _{\psi }\in \mathcal {J}(\textbf {X},\textbf {Z})$
 by
$\unicode{x3bb} _{\psi }\in \mathcal {J}(\textbf {X},\textbf {Z})$
 by 

for all Borel sets 
 $A \subset X \times Z$
. Let
$A \subset X \times Z$
. Let 
 $\unicode{x3bb} _{1}$
 denote the measure defined by (3) with
$\unicode{x3bb} _{1}$
 denote the measure defined by (3) with 
 $\psi = 1$
. We claim that
$\psi = 1$
. We claim that 
 $\unicode{x3bb} _{1} = \mu _{X}\otimes \mu _{Z}$
. To verify this claim, let
$\unicode{x3bb} _{1} = \mu _{X}\otimes \mu _{Z}$
. To verify this claim, let 
 $f \in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
,
$f \in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
, 
 $g\in \operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
, and define
$g\in \operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
, and define 
 $f^{\prime }(x)=\int _{L} f(lx)\,\textit {d}\mu _{L}(l)$
 for all
$f^{\prime }(x)=\int _{L} f(lx)\,\textit {d}\mu _{L}(l)$
 for all 
 $x\in X$
. Note that
$x\in X$
. Note that 
 $f^{\prime }$
 is L invariant and, hence, there exists
$f^{\prime }$
 is L invariant and, hence, there exists 
 $h\in \operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 such that
$h\in \operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 such that 
 $h\circ \pi _{\textbf {X},\textbf {Y}}=f^{\prime }$
. We have
$h\circ \pi _{\textbf {X},\textbf {Y}}=f^{\prime }$
. We have 

because 
 $(\pi _{\textbf {X},\textbf {Y}} \times \operatorname {\textrm {id}}_{\textbf {Z}}) \unicode{x3bb} = \mu _{Y} \otimes \mu _{Z}$
. This shows that indeed
$(\pi _{\textbf {X},\textbf {Y}} \times \operatorname {\textrm {id}}_{\textbf {Z}}) \unicode{x3bb} = \mu _{Y} \otimes \mu _{Z}$
. This shows that indeed 
 $\unicode{x3bb} _{1} = \mu _{X} \otimes \mu _{Z}$
.
$\unicode{x3bb} _{1} = \mu _{X} \otimes \mu _{Z}$
.
Next, observe that

for all Borel sets 
 $A \subset X \times Z$
. Inequality (4) shows that
$A \subset X \times Z$
. Inequality (4) shows that 
 $\unicode{x3bb} _{\psi }$
 is absolutely continuous with respect to
$\unicode{x3bb} _{\psi }$
 is absolutely continuous with respect to 
 $\mu _{X}\otimes \mu _{Z}$
. Let
$\mu _{X}\otimes \mu _{Z}$
. Let 
 $F_{\psi }$
 denote the Radon–Nikodym derivative of
$F_{\psi }$
 denote the Radon–Nikodym derivative of 
 $\unicode{x3bb} _{\psi }$
 with respect to
$\unicode{x3bb} _{\psi }$
 with respect to 
 $\mu _{X}\otimes \mu _{Z}$
. It also follows from (4) that
$\mu _{X}\otimes \mu _{Z}$
. It also follows from (4) that 
 $\| {F_{\psi }} \|_{\infty } \leqslant \| {\psi } \|_{\infty }$
 and so
$\| {F_{\psi }} \|_{\infty } \leqslant \| {\psi } \|_{\infty }$
 and so 
 $F_{\psi }\in \operatorname {\textrm {L}}^{\infty }(X \times Z, \mu _{X} \otimes \mu _{Z})$
. Moreover, since
$F_{\psi }\in \operatorname {\textrm {L}}^{\infty }(X \times Z, \mu _{X} \otimes \mu _{Z})$
. Moreover, since 
 $\unicode{x3bb} _{\psi }$
 is a
$\unicode{x3bb} _{\psi }$
 is a 
 $T \times R$
 invariant measure, we conclude that
$T \times R$
 invariant measure, we conclude that 
 $F_{\psi }$
 is a
$F_{\psi }$
 is a 
 $T \times R$
 invariant function in
$T \times R$
 invariant function in 
 $\operatorname {\textrm {L}}^{\infty }(X \times Z, \mu _{X} \otimes \mu _{Z})$
. As any
$\operatorname {\textrm {L}}^{\infty }(X \times Z, \mu _{X} \otimes \mu _{Z})$
. As any 
 $T \times R$
 invariant function is almost periodic, it follows that
$T \times R$
 invariant function is almost periodic, it follows that 
 $F_{\psi }\in \mathsf {AP}( \textbf {X} \times \textbf {Z} )$
. Therefore,
$F_{\psi }\in \mathsf {AP}( \textbf {X} \times \textbf {Z} )$
. Therefore, 
 $F_{\psi } \in \mathsf {AP}(\textbf {X}) \otimes \mathsf {AP}(\textbf {Z})$
 by Proposition 2.1. It follows that for all
$F_{\psi } \in \mathsf {AP}(\textbf {X}) \otimes \mathsf {AP}(\textbf {Z})$
 by Proposition 2.1. It follows that for all 
 $f\in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and
$f\in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and 
 $g\in \operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
,
$g\in \operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
, 

As 
 $(\pi _{\textbf {X},\mathsf {K}\textbf {X}}\times \pi _{\textbf {Z},\mathsf {K}\textbf {Z}})(\unicode{x3bb} )=\mu _{\mathsf {K} X}\otimes \mu _{\mathsf {K} Z}$
 we have
$(\pi _{\textbf {X},\mathsf {K}\textbf {X}}\times \pi _{\textbf {Z},\mathsf {K}\textbf {Z}})(\unicode{x3bb} )=\mu _{\mathsf {K} X}\otimes \mu _{\mathsf {K} Z}$
 we have 
 $$ \begin{align*} \int_{X\times Z} \mathbb{E}(l f | \mathsf{K}\textbf{X}) \otimes \mathbb{E}(g|\mathsf{K}\textbf{Z})\,\textit{d}\unicode{x3bb} & = \int_{X\times Z} \mathbb{E}(l f | \mathsf{K}\textbf{X}) \otimes \mathbb{E}(g|\mathsf{K}\textbf{Z})\,\textit{d}(\mu_{X}\otimes\mu_{Z}) \\ & = \int_{X\times Z} f\otimes g \,\textit{d}(\mu_{X}\otimes\mu_{Z}). \end{align*} $$
$$ \begin{align*} \int_{X\times Z} \mathbb{E}(l f | \mathsf{K}\textbf{X}) \otimes \mathbb{E}(g|\mathsf{K}\textbf{Z})\,\textit{d}\unicode{x3bb} & = \int_{X\times Z} \mathbb{E}(l f | \mathsf{K}\textbf{X}) \otimes \mathbb{E}(g|\mathsf{K}\textbf{Z})\,\textit{d}(\mu_{X}\otimes\mu_{Z}) \\ & = \int_{X\times Z} f\otimes g \,\textit{d}(\mu_{X}\otimes\mu_{Z}). \end{align*} $$
for all 
 $l \in L$
. We conclude that
$l \in L$
. We conclude that 

so, in particular, for any 
 $\psi \in \operatorname {\textrm {L}}^{\infty }(L,\mu _{L})$
 with
$\psi \in \operatorname {\textrm {L}}^{\infty }(L,\mu _{L})$
 with 
 $\psi \geqslant 0$
 and
$\psi \geqslant 0$
 and 
 $\int _{L} \psi \,\textit {d}\mu _{L} = 1$
, the measure
$\int _{L} \psi \,\textit {d}\mu _{L} = 1$
, the measure 
 $\unicode{x3bb} _{\psi }$
 coincides with
$\unicode{x3bb} _{\psi }$
 coincides with 
 $\mu _{X}\otimes \mu _{Z}$
.
$\mu _{X}\otimes \mu _{Z}$
.
 Finally, allowing 
 $\psi $
 to run though an approximate identity, one can approximate
$\psi $
 to run though an approximate identity, one can approximate 
 $\unicode{x3bb} $
 by
$\unicode{x3bb} $
 by 
 $\unicode{x3bb} _{\psi }$
 and thereby conclude that
$\unicode{x3bb} _{\psi }$
 and thereby conclude that 
 $\unicode{x3bb} = \mu _{X}\otimes \mu _{Z}$
, which finishes the proof.
$\unicode{x3bb} = \mu _{X}\otimes \mu _{Z}$
, which finishes the proof.
4.2 Quasi-disjointness passes to factors
In this subsection, we prove the following theorem.
Theorem 4.2. Let 
 $\textbf {X}=(X,T,\mu _{X})$
,
$\textbf {X}=(X,T,\mu _{X})$
, 
 $\textbf {Y}=(Y,S,\mu _{Y})$
, and
$\textbf {Y}=(Y,S,\mu _{Y})$
, and 
 $\textbf {Z}=(Z,R,\mu _{Z})$
 be G systems and suppose that
$\textbf {Z}=(Z,R,\mu _{Z})$
 be G systems and suppose that 
 $\textbf {Y}$
 is a factor of
$\textbf {Y}$
 is a factor of 
 $\textbf {X}$
. If
$\textbf {X}$
. If 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are quasi-disjoint, then
$\textbf {Z}$
 are quasi-disjoint, then 
 $\textbf {Y}$
 and
$\textbf {Y}$
 and 
 $\textbf {Z}$
 are also quasi-disjoint.
$\textbf {Z}$
 are also quasi-disjoint.
 For the proof of Theorem 4.2, we need to recall the definition of relatively independent joinings. Let 
 $\textbf {X}=(X,T,\mu _{X})$
,
$\textbf {X}=(X,T,\mu _{X})$
, 
 $\textbf {Y}=(Y,S,\mu _{Y})$
, and
$\textbf {Y}=(Y,S,\mu _{Y})$
, and 
 $\textbf {Z}=(Z,R,\mu _{Z})$
 be G systems and suppose that
$\textbf {Z}=(Z,R,\mu _{Z})$
 be G systems and suppose that 
 $\textbf {Y}$
 is a factor of both
$\textbf {Y}$
 is a factor of both 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
. Let
$\textbf {Z}$
. Let 
 $\pi _{\textbf {X},\textbf {Y}}:X\to Y$
 and
$\pi _{\textbf {X},\textbf {Y}}:X\to Y$
 and 
 $\pi _{\textbf {Z},\textbf {Y}}:Z\to Y$
 denote the respective factor maps. Using
$\pi _{\textbf {Z},\textbf {Y}}:Z\to Y$
 denote the respective factor maps. Using 
 $\pi _{\textbf {X},\textbf {Y}}$
, we can embed
$\pi _{\textbf {X},\textbf {Y}}$
, we can embed 
 $\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 into
$\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 into 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. Likewise, through
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. Likewise, through 
 $\pi _{\textbf {Z},\textbf {Y}}$
 we can identify
$\pi _{\textbf {Z},\textbf {Y}}$
 we can identify 
 $\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 as a subspace of
$\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
 as a subspace of 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. The relatively independent joining of
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. The relatively independent joining of
 $\textbf {X}$
with
$\textbf {X}$
with
 $\textbf {Z}$
over
$\textbf {Z}$
over
 $\textbf {Y}$
 is the triple
$\textbf {Y}$
 is the triple 
 $\textbf {X}\times _{\textbf {Y}}\textbf {Z}=(X\times Z,T\times R,\mu _{X}\otimes _{\textbf {Y}}\mu _{Z})$
, where
$\textbf {X}\times _{\textbf {Y}}\textbf {Z}=(X\times Z,T\times R,\mu _{X}\otimes _{\textbf {Y}}\mu _{Z})$
, where 
 $\mu _{X}\otimes _{\textbf {Y}}\mu _{Z}$
 denotes the unique measure on
$\mu _{X}\otimes _{\textbf {Y}}\mu _{Z}$
 denotes the unique measure on 
 $X\times Z$
 with the property that
$X\times Z$
 with the property that 
 $$ \begin{align} \int_{X\times Z} f \otimes g \,\textit{d}(\mu_{X}\otimes_{\textbf{Y}}\mu_{Z}) = \int_{Y}\mathbb{E}(f|\textbf{Y})\mathbb{E}(g|\textbf{Y})\,\textit{d}\mu_{Y} \end{align} $$
$$ \begin{align} \int_{X\times Z} f \otimes g \,\textit{d}(\mu_{X}\otimes_{\textbf{Y}}\mu_{Z}) = \int_{Y}\mathbb{E}(f|\textbf{Y})\mathbb{E}(g|\textbf{Y})\,\textit{d}\mu_{Y} \end{align} $$
for all 
 $f \in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and all
$f \in \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and all 
 $g \in \operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
.
$g \in \operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
.
Lemma 4.3. Let 
 $\textbf {X}=(X,T,\mu _{X})$
 and
$\textbf {X}=(X,T,\mu _{X})$
 and 
 $\textbf {Y}=(Y,S,\mu _{Y})$
 be G systems and suppose that
$\textbf {Y}=(Y,S,\mu _{Y})$
 be G systems and suppose that 
 $\textbf {Y}$
 is a factor of
$\textbf {Y}$
 is a factor of 
 $\textbf {X}$
. Then the relatively independent joining
$\textbf {X}$
. Then the relatively independent joining 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
 is a factor of
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
 is a factor of 
 $\textbf {X}$
.
$\textbf {X}$
.
Proof. Let 
 $\pi _{\textbf {X},\textbf {Y}}:X\to Y$
 denote the factor map from
$\pi _{\textbf {X},\textbf {Y}}:X\to Y$
 denote the factor map from 
 $\textbf {X}$
 to
$\textbf {X}$
 to 
 $\textbf {Y}$
 and let
$\textbf {Y}$
 and let 
 $\pi _{\textbf {X},\mathsf {K}\textbf {X}}:X\to \mathsf {K} X$
 denote the factor map from
$\pi _{\textbf {X},\mathsf {K}\textbf {X}}:X\to \mathsf {K} X$
 denote the factor map from 
 $\textbf {X}$
 to
$\textbf {X}$
 to 
 $\mathsf {K}\textbf {X}$
. Let
$\mathsf {K}\textbf {X}$
. Let 
 $\tau :X\to \mathsf {K} X\times Y$
 be defined as
$\tau :X\to \mathsf {K} X\times Y$
 be defined as 
 $\tau (x)=(\pi _{\textbf {X},\mathsf {K}\textbf {X}}(x),\pi _{\textbf {X},\textbf {Y}}(x))$
 for all
$\tau (x)=(\pi _{\textbf {X},\mathsf {K}\textbf {X}}(x),\pi _{\textbf {X},\textbf {Y}}(x))$
 for all 
 $x\in X$
. We claim that
$x\in X$
. We claim that 
 $\tau $
 is a factor map from
$\tau $
 is a factor map from 
 $\textbf {X}$
 onto
$\textbf {X}$
 onto 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
. Once this claim is verified, the proof is complete.
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
. Once this claim is verified, the proof is complete.
 To show that 
 $\tau $
 is a factor map from
$\tau $
 is a factor map from 
 $\textbf {X}$
 onto
$\textbf {X}$
 onto 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
, we must show that the pushforward of
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
, we must show that the pushforward of 
 $\mu _{X}$
 under
$\mu _{X}$
 under 
 $\tau $
 equals
$\tau $
 equals 
 $\mu _{\mathsf {K} X}\otimes _{\mathsf {K}\textbf {Y}}\mu _{Y}$
. It suffices to show that
$\mu _{\mathsf {K} X}\otimes _{\mathsf {K}\textbf {Y}}\mu _{Y}$
. It suffices to show that 
 $$ \begin{align} \int_{\mathsf{K} X\times Y} f\otimes g \,\textit{d} (\tau \mu_{X}) = \int_{\mathsf{K} X\times Y} f\otimes g \,\textit{d} \mu_{\mathsf{K} X}\otimes_{\mathsf{K}\textbf{Y}}\mu_{Y} \end{align} $$
$$ \begin{align} \int_{\mathsf{K} X\times Y} f\otimes g \,\textit{d} (\tau \mu_{X}) = \int_{\mathsf{K} X\times Y} f\otimes g \,\textit{d} \mu_{\mathsf{K} X}\otimes_{\mathsf{K}\textbf{Y}}\mu_{Y} \end{align} $$
for all 
 $f\in \operatorname {\textrm {L}}^{2}(\mathsf {K} X, \mu _{K})$
 and all
$f\in \operatorname {\textrm {L}}^{2}(\mathsf {K} X, \mu _{K})$
 and all 
 $g\in \operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
.
$g\in \operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
.
 By definition, the right-hand side of (6) equals 
 $\int _{\mathsf {K} Y}\mathbb {E}(f|\mathsf {K}\textbf {Y})\mathbb {E}(g|\mathsf {K}\textbf {Y})\,\textit {d}\mu _{\mathsf {K} Y}$
, which can be rewritten as
$\int _{\mathsf {K} Y}\mathbb {E}(f|\mathsf {K}\textbf {Y})\mathbb {E}(g|\mathsf {K}\textbf {Y})\,\textit {d}\mu _{\mathsf {K} Y}$
, which can be rewritten as 
 $$ \begin{align*} \int_{X}\mathbb{E}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}} |\mathsf{K}\textbf{Y})\mathbb{E}(g\circ \pi_{\textbf{X},\textbf{Y}}|\mathsf{K}\textbf{Y})\,\textit{d}\mu_{X}. \end{align*} $$
$$ \begin{align*} \int_{X}\mathbb{E}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}} |\mathsf{K}\textbf{Y})\mathbb{E}(g\circ \pi_{\textbf{X},\textbf{Y}}|\mathsf{K}\textbf{Y})\,\textit{d}\mu_{X}. \end{align*} $$
The left-hand side of (6) equals
 $$ \begin{align*} \int_{X}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}})(g\circ \pi_{\textbf{X},\textbf{Y}})\,\textit{d}\mu_{X}. \end{align*} $$
$$ \begin{align*} \int_{X}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}})(g\circ \pi_{\textbf{X},\textbf{Y}})\,\textit{d}\mu_{X}. \end{align*} $$
As 
 $f\circ \pi _{\textbf {X},\mathsf {K}\textbf {X}}=\mathbb {E}(f\circ \pi _{\textbf {X},\mathsf {K}\textbf {X}}|\mathsf {K}\textbf {X})$
 and
$f\circ \pi _{\textbf {X},\mathsf {K}\textbf {X}}=\mathbb {E}(f\circ \pi _{\textbf {X},\mathsf {K}\textbf {X}}|\mathsf {K}\textbf {X})$
 and 
 $g\circ \pi _{\textbf {X},\textbf {Y}}=\mathbb {E}(g\circ \pi _{\textbf {X},\textbf {Y}}|\textbf {Y})$
 we have that
$g\circ \pi _{\textbf {X},\textbf {Y}}=\mathbb {E}(g\circ \pi _{\textbf {X},\textbf {Y}}|\textbf {Y})$
 we have that 
 $$ \begin{align*} \int_{X}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}})(g\circ \pi_{\textbf{X},\textbf{Y}})\,\textit{d}\mu_{X} = \int_{X}\mathbb{E}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}}|\mathsf{K}\textbf{X})\mathbb{E}(g\circ \pi_{\textbf{X},\textbf{Y}}|\textbf{Y}) \,\textit{d}\mu_{X}. \end{align*} $$
$$ \begin{align*} \int_{X}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}})(g\circ \pi_{\textbf{X},\textbf{Y}})\,\textit{d}\mu_{X} = \int_{X}\mathbb{E}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}}|\mathsf{K}\textbf{X})\mathbb{E}(g\circ \pi_{\textbf{X},\textbf{Y}}|\textbf{Y}) \,\textit{d}\mu_{X}. \end{align*} $$
Hence, (6) is equivalent to
 $$ \begin{align} \int_{X}\mathbb{E}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}}|\mathsf{K}\textbf{X})\mathbb{E}(g\circ \pi_{\textbf{X},\textbf{Y}}|\textbf{Y}) \,\textit{d}\mu_{X} = \int_{X}\mathbb{E}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}} |\mathsf{K}\textbf{Y})\mathbb{E}(g\circ \pi_{\textbf{X},\textbf{Y}}|\mathsf{K}\textbf{Y})\,\textit{d}\mu_{X}. \end{align} $$
$$ \begin{align} \int_{X}\mathbb{E}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}}|\mathsf{K}\textbf{X})\mathbb{E}(g\circ \pi_{\textbf{X},\textbf{Y}}|\textbf{Y}) \,\textit{d}\mu_{X} = \int_{X}\mathbb{E}(f\circ \pi_{\textbf{X},\mathsf{K}\textbf{X}} |\mathsf{K}\textbf{Y})\mathbb{E}(g\circ \pi_{\textbf{X},\textbf{Y}}|\mathsf{K}\textbf{Y})\,\textit{d}\mu_{X}. \end{align} $$
 However, because 
 $\mathbb {E}(\cdot |\mathsf {K}\textbf {X})$
 and
$\mathbb {E}(\cdot |\mathsf {K}\textbf {X})$
 and 
 $\mathbb {E}(\cdot |\textbf {Y})$
 are orthogonal projections onto
$\mathbb {E}(\cdot |\textbf {Y})$
 are orthogonal projections onto 
 $\mathsf {AP}({\textbf {X}})$
 and
$\mathsf {AP}({\textbf {X}})$
 and 
 $\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
, respectively, it follows immediately from
$\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
, respectively, it follows immediately from 
 $\mathsf {AP}(\textbf {X})\cap \operatorname {\textrm {L}}^{2}(Y,\mu _{Y})=\mathsf {AP}(\textbf {Y})$
 that (7) is true.
$\mathsf {AP}(\textbf {X})\cap \operatorname {\textrm {L}}^{2}(Y,\mu _{Y})=\mathsf {AP}(\textbf {Y})$
 that (7) is true.
Lemma 4.4. Let 
 $\textbf {A} = (A,T_{A},\mu _{A})$
,
$\textbf {A} = (A,T_{A},\mu _{A})$
, 
 $\textbf {B} = (B,T_{B},\mu _{B})$
,
$\textbf {B} = (B,T_{B},\mu _{B})$
, 
 $\textbf {C} = (C,T_{C},\mu _{C})$
, and
$\textbf {C} = (C,T_{C},\mu _{C})$
, and 
 $\textbf {D} = (D,T_{D},\mu _{D})$
 be G systems. If
$\textbf {D} = (D,T_{D},\mu _{D})$
 be G systems. If 
 $\textbf {B}$
 is a factor of
$\textbf {B}$
 is a factor of 
 $\textbf {A}$
 and
$\textbf {A}$
 and 
 $\textbf {D}$
 is a factor of
$\textbf {D}$
 is a factor of 
 $\textbf {C}$
, then the induced map from
$\textbf {C}$
, then the induced map from 
 $\mathcal {J}(\textbf {A}, \textbf {C})$
 to
$\mathcal {J}(\textbf {A}, \textbf {C})$
 to 
 $\mathcal {J}(\textbf {B},\textbf {D})$
 is surjective.
$\mathcal {J}(\textbf {B},\textbf {D})$
 is surjective.
Proof. Write 
 $\pi _{\textbf {A},\textbf {B}}$
 and
$\pi _{\textbf {A},\textbf {B}}$
 and 
 $\pi _{\textbf {C},\textbf {D}}$
 for the factor maps. Let
$\pi _{\textbf {C},\textbf {D}}$
 for the factor maps. Let 
 $b \mapsto \mu _{A,b}$
 and
$b \mapsto \mu _{A,b}$
 and 
 $d \mapsto \mu _{C,d}$
 be disintegrations (cf. §2.2) of
$d \mapsto \mu _{C,d}$
 be disintegrations (cf. §2.2) of 
 $\mu _{A}$
 and
$\mu _{A}$
 and 
 $\mu _{C}$
 over
$\mu _{C}$
 over 
 $\textbf {B}$
 and
$\textbf {B}$
 and 
 $\textbf {D}$
, respectively. We have
$\textbf {D}$
, respectively. We have 
 $T_{A}^{g} \mu _{A,b} = \mu _{A,T_{B}^{g} b}$
 and
$T_{A}^{g} \mu _{A,b} = \mu _{A,T_{B}^{g} b}$
 and 
 $T_{C}^{g} \mu _{C,d} = \mu _{C,T_{D}^{g} d}$
 almost surely for all
$T_{C}^{g} \mu _{C,d} = \mu _{C,T_{D}^{g} d}$
 almost surely for all 
 $g \in G$
.
$g \in G$
.
 Fix a joining 
 $\unicode{x3bb} _{B,D}$
 of
$\unicode{x3bb} _{B,D}$
 of 
 $\textbf {B}$
 and
$\textbf {B}$
 and 
 $\textbf {D}$
. We claim that
$\textbf {D}$
. We claim that 

is a joining of 
 $\textbf {A}$
 and
$\textbf {A}$
 and 
 $\textbf {C}$
 with
$\textbf {C}$
 with 
 $(\pi _{\textbf {A},\textbf {B}} \times \pi _{\textbf {C},\textbf {D}}) \unicode{x3bb} _{A,C} = \unicode{x3bb} _{B,D}$
. First note that
$(\pi _{\textbf {A},\textbf {B}} \times \pi _{\textbf {C},\textbf {D}}) \unicode{x3bb} _{A,C} = \unicode{x3bb} _{B,D}$
. First note that 

for all Borel sets 
 $E \subset A$
 because
$E \subset A$
 because 
 $\unicode{x3bb} _{B,D}$
 is a joining, so the left marginal of
$\unicode{x3bb} _{B,D}$
 is a joining, so the left marginal of 
 $\unicode{x3bb} _{A,C}$
 is
$\unicode{x3bb} _{A,C}$
 is 
 $\mu _{A}$
. Similarly, its right marginal is
$\mu _{A}$
. Similarly, its right marginal is 
 $\mu _{C}$
.
$\mu _{C}$
.
 For all 
 $f \in \mathsf {C}(B)$
 and all
$f \in \mathsf {C}(B)$
 and all 
 $h \in \mathsf {C}(D)$
 we have
$h \in \mathsf {C}(D)$
 we have 

by disintegration properties, so 
 $(\pi _{\textbf {A},\textbf {B}} \times \pi _{\textbf {C},\textbf {D}})\unicode{x3bb} _{A,C} = \unicode{x3bb} _{B,D}$
.
$(\pi _{\textbf {A},\textbf {B}} \times \pi _{\textbf {C},\textbf {D}})\unicode{x3bb} _{A,C} = \unicode{x3bb} _{B,D}$
.
 Finally, for any 
 $f \in \mathsf {C}(A)$
 and any
$f \in \mathsf {C}(A)$
 and any 
 $h \in \mathsf {C}(C)$
, we calculate that
$h \in \mathsf {C}(C)$
, we calculate that 

so 
 $\unicode{x3bb} _{A,C}$
 is
$\unicode{x3bb} _{A,C}$
 is 
 $T_{A} \times T_{C}$
 invariant.
$T_{A} \times T_{C}$
 invariant.
From Lemma 4.4, we obtain the following immediate corollary.
Corollary 4.5. Let 
 $\textbf {A}=(A,T_{A},\mu _{A})$
,
$\textbf {A}=(A,T_{A},\mu _{A})$
, 
 $\textbf {B}=(B,T_{B},\mu _{B})$
, and
$\textbf {B}=(B,T_{B},\mu _{B})$
, and 
 $\textbf {C}=(C,T_{C},\mu _{C})$
 be G systems and suppose that
$\textbf {C}=(C,T_{C},\mu _{C})$
 be G systems and suppose that 
 $\textbf {B}$
 is a factor of
$\textbf {B}$
 is a factor of 
 $\textbf {A}$
. Let
$\textbf {A}$
. Let 
 $\pi _{\textbf {A},\textbf {B}}$
 denote the factor map from
$\pi _{\textbf {A},\textbf {B}}$
 denote the factor map from 
 $\textbf {A}$
 onto
$\textbf {A}$
 onto 
 $\textbf {B}$
. Then, for any joining
$\textbf {B}$
. Then, for any joining 
 $\unicode{x3bb} \in \mathcal {J}(\textbf {B},\textbf {C})$
, there exists a joining
$\unicode{x3bb} \in \mathcal {J}(\textbf {B},\textbf {C})$
, there exists a joining 
 $\unicode{x3bb} ^{\prime }\in \mathcal {J}(\textbf {A},\textbf {C})$
 such that the pushforward of
$\unicode{x3bb} ^{\prime }\in \mathcal {J}(\textbf {A},\textbf {C})$
 such that the pushforward of 
 $\unicode{x3bb} ^{\prime }$
 under the factor map
$\unicode{x3bb} ^{\prime }$
 under the factor map 
 $\pi _{\textbf {A},\textbf {B}}\times \operatorname {\mathrm {id}}_{\textbf {C}}$
 equals
$\pi _{\textbf {A},\textbf {B}}\times \operatorname {\mathrm {id}}_{\textbf {C}}$
 equals 
 $\unicode{x3bb} $
.
$\unicode{x3bb} $
. 

We also need the following lemmas for the proof of Theorem 4.2.
Lemma 4.6. Let 
 $\pi : \textbf {X} \to \textbf {Y}$
 be a factor map of G systems. If
$\pi : \textbf {X} \to \textbf {Y}$
 be a factor map of G systems. If 
 $f \in \mathsf {AP}(\textbf {Y})$
, then
$f \in \mathsf {AP}(\textbf {Y})$
, then 
 $f \circ \pi \in \mathsf {AP}(\textbf {X})$
.
$f \circ \pi \in \mathsf {AP}(\textbf {X})$
.
Proof. This follows from the fact that 
 $\pi : \operatorname {\textrm {L}}^{2}(Y,\mu _{Y}) \to \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 is an isometric embedding.
$\pi : \operatorname {\textrm {L}}^{2}(Y,\mu _{Y}) \to \operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 is an isometric embedding.
Lemma 4.7. Let 
 $\pi : \textbf {X} \to \textbf {Y}$
 be a factor map of G systems. Let
$\pi : \textbf {X} \to \textbf {Y}$
 be a factor map of G systems. Let 
 $f \in \mathsf {WM}(\textbf {Y})$
 and define
$f \in \mathsf {WM}(\textbf {Y})$
 and define 
 $h=f \circ \pi $
. Then
$h=f \circ \pi $
. Then 
 $h \in \mathsf {WM}(\textbf {X})$
.
$h \in \mathsf {WM}(\textbf {X})$
.
Proof. We need to prove that 0 belongs to the weak closure of 
 $\{ T^{g} h : g \in G \}$
 in
$\{ T^{g} h : g \in G \}$
 in 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. Note that
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. Note that 
 $\mathbb {E}({T^{g} h}\vert {\textbf {Y}})=T^{g}\mathbb {E}({h}\vert {\textbf {Y}})=T^{g}h$
. Fix
$\mathbb {E}({T^{g} h}\vert {\textbf {Y}})=T^{g}\mathbb {E}({h}\vert {\textbf {Y}})=T^{g}h$
. Fix 
 $\xi $
 in
$\xi $
 in 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. We have
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
. We have 
 $$ \begin{align*} \langle {T^{g} h}, {\xi} \rangle = \langle {\mathbb{E}({T^{g} h}\vert{\textbf{Y}})}, {\mathbb{E}({\xi}\vert{\textbf{Y}})} \rangle, \end{align*} $$
$$ \begin{align*} \langle {T^{g} h}, {\xi} \rangle = \langle {\mathbb{E}({T^{g} h}\vert{\textbf{Y}})}, {\mathbb{E}({\xi}\vert{\textbf{Y}})} \rangle, \end{align*} $$
so the fact that 
 $f \in \mathsf {WM}(\textbf {Y})$
 implies that
$f \in \mathsf {WM}(\textbf {Y})$
 implies that 
 $\{ T^{g} h : g \in G \}$
 contains 0 in its closure.
$\{ T^{g} h : g \in G \}$
 contains 0 in its closure.
Proof of Theorem 4.2
 Let 
 $\pi _{\textbf {Y},\mathsf {K}\textbf {Y}}$
 and
$\pi _{\textbf {Y},\mathsf {K}\textbf {Y}}$
 and 
 $\pi _{\textbf {Z},\mathsf {K}\textbf {Z}}$
 denote the factor maps from
$\pi _{\textbf {Z},\mathsf {K}\textbf {Z}}$
 denote the factor maps from 
 $\textbf {Y}$
 onto
$\textbf {Y}$
 onto 
 $\mathsf {K}\textbf {Y}$
 and from
$\mathsf {K}\textbf {Y}$
 and from 
 $\textbf {Z}$
 onto
$\textbf {Z}$
 onto 
 $\mathsf {K}\textbf {Z}$
, respectively. Let
$\mathsf {K}\textbf {Z}$
, respectively. Let 
 $\unicode{x3bb} \in \mathcal {J}(\textbf {Y},\textbf {Z})$
 be a joining of
$\unicode{x3bb} \in \mathcal {J}(\textbf {Y},\textbf {Z})$
 be a joining of 
 $\textbf {Y}$
 and
$\textbf {Y}$
 and 
 $\textbf {Z}$
 with the property that
$\textbf {Z}$
 with the property that 
 $(\pi _{\textbf {Y},\mathsf {K}\textbf {Y}}\times \pi _{\textbf {Z},\mathsf {K}\textbf {Z}})(\unicode{x3bb} )=\mu _{\mathsf {K} Y} \otimes \mu _{\mathsf {K} Z}$
. Let
$(\pi _{\textbf {Y},\mathsf {K}\textbf {Y}}\times \pi _{\textbf {Z},\mathsf {K}\textbf {Z}})(\unicode{x3bb} )=\mu _{\mathsf {K} Y} \otimes \mu _{\mathsf {K} Z}$
. Let 
 $\textbf {W}=(Y\times Z, S\times R, \unicode{x3bb} )$
. Our goal is to show that
$\textbf {W}=(Y\times Z, S\times R, \unicode{x3bb} )$
. Our goal is to show that 
 $\textbf {W}=\textbf {Y}\times \textbf {Z}$
 or, equivalently, that
$\textbf {W}=\textbf {Y}\times \textbf {Z}$
 or, equivalently, that 
 $\unicode{x3bb} =\mu _{Y}\otimes \mu _{Z}$
.
$\unicode{x3bb} =\mu _{Y}\otimes \mu _{Z}$
.
 Observe that 
 $\mathsf {K}\textbf {Y}$
 is a factor of both
$\mathsf {K}\textbf {Y}$
 is a factor of both 
 $\textbf {W}$
 and
$\textbf {W}$
 and 
 $\mathsf {K}\textbf {X}$
. Hence, we can consider the relatively independent joining
$\mathsf {K}\textbf {X}$
. Hence, we can consider the relatively independent joining 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 with corresponding measure
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 with corresponding measure 
 $\mu _{\mathsf {K} X}\otimes _{\mathsf {K}\textbf {Y}}\unicode{x3bb} $
. Note that the underlying space of
$\mu _{\mathsf {K} X}\otimes _{\mathsf {K}\textbf {Y}}\unicode{x3bb} $
. Note that the underlying space of 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 is
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 is 
 $\mathsf {K} X \times Y \times Z$
. Let
$\mathsf {K} X \times Y \times Z$
. Let 
 $\pi _{1,2} :\mathsf {K} X \times Y \times Z \to \mathsf {K} X \times Y$
 denote the projection onto the first and second coordinates and let
$\pi _{1,2} :\mathsf {K} X \times Y \times Z \to \mathsf {K} X \times Y$
 denote the projection onto the first and second coordinates and let 
 $\pi _{3} : \mathsf {K} X \times Y \times Z\to Z$
 denote the projection onto the third coordinate. Observe that
$\pi _{3} : \mathsf {K} X \times Y \times Z\to Z$
 denote the projection onto the third coordinate. Observe that 
 $\pi _{3}$
 is a factor map from
$\pi _{3}$
 is a factor map from 
 $\mathsf {K}\textbf {X} \times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 onto
$\mathsf {K}\textbf {X} \times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 onto 
 $\textbf {Z}$
 and
$\textbf {Z}$
 and 
 $\pi _{1,2}$
 is a factor map from
$\pi _{1,2}$
 is a factor map from 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 onto
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 onto 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
, the relatively independent joining of
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
, the relatively independent joining of 
 $\mathsf {K}\textbf {X}$
 with
$\mathsf {K}\textbf {X}$
 with 
 $\textbf {Y}$
 over
$\textbf {Y}$
 over 
 $\mathsf {K}\textbf {Y}$
. This shows that
$\mathsf {K}\textbf {Y}$
. This shows that 
 $\mu _{\mathsf {K} X} \otimes _{\mathsf {K}\textbf {Y}} \unicode{x3bb} \in \mathcal {J}(\mathsf {K} \textbf {X} \times _{\mathsf {K}\textbf {Y}} \textbf {Y}, \textbf {Z})$
.
$\mu _{\mathsf {K} X} \otimes _{\mathsf {K}\textbf {Y}} \unicode{x3bb} \in \mathcal {J}(\mathsf {K} \textbf {X} \times _{\mathsf {K}\textbf {Y}} \textbf {Y}, \textbf {Z})$
.
 Let 
 $\tau $
 denote the factor map from
$\tau $
 denote the factor map from 
 $\textbf {X}$
 to
$\textbf {X}$
 to 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
 in the proof of Lemma 4.3. We can now apply Corollary 4.5 with
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
 in the proof of Lemma 4.3. We can now apply Corollary 4.5 with 
 $\textbf {A}=\textbf {X}$
,
$\textbf {A}=\textbf {X}$
, 
 $\textbf {B}=\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
, and
$\textbf {B}=\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
, and 
 $\textbf {C}=\textbf {Z}$
 to find a joining
$\textbf {C}=\textbf {Z}$
 to find a joining 
 $\rho \in \mathcal {J}(\textbf {X},\textbf {Z})$
 with the property that
$\rho \in \mathcal {J}(\textbf {X},\textbf {Z})$
 with the property that 
 $(\tau \times \operatorname {\textrm {id}}_{\textbf {Z}})(\rho ) = \mu _{\mathsf {K} X}\otimes _{\mathsf {K}\textbf {Y}}\unicode{x3bb} $
. Let
$(\tau \times \operatorname {\textrm {id}}_{\textbf {Z}})(\rho ) = \mu _{\mathsf {K} X}\otimes _{\mathsf {K}\textbf {Y}}\unicode{x3bb} $
. Let 
 ${\textbf W}^{\prime }=(X\times Z,T\times R,\rho )$
.
${\textbf W}^{\prime }=(X\times Z,T\times R,\rho )$
.
 Let 
 $\pi _{1,3} : \mathsf {K} X \times Y \times Z \to \mathsf {K} X \times Z$
 denote the projection onto the first and third coordinates. We claim that
$\pi _{1,3} : \mathsf {K} X \times Y \times Z \to \mathsf {K} X \times Z$
 denote the projection onto the first and third coordinates. We claim that 
 $\pi _{1,3}(\mu _{\mathsf {K} X} \otimes _{\mathsf {K} \textbf {Y}} \unicode{x3bb} ) = \mu _{\mathsf {K} X} \otimes \mu _{Z}$
. This claim implies that the diagram
$\pi _{1,3}(\mu _{\mathsf {K} X} \otimes _{\mathsf {K} \textbf {Y}} \unicode{x3bb} ) = \mu _{\mathsf {K} X} \otimes \mu _{Z}$
. This claim implies that the diagram 

of factor maps commutes, giving 
 $(\pi _{\textbf {X},\mathsf {K} \textbf {X}} \times \pi _{\textbf {Z},\mathsf {K} \textbf {Z}}) \rho = \mu _{\mathsf {K} X} \otimes \mu _{\mathsf {K} Z}$
 whence
$(\pi _{\textbf {X},\mathsf {K} \textbf {X}} \times \pi _{\textbf {Z},\mathsf {K} \textbf {Z}}) \rho = \mu _{\mathsf {K} X} \otimes \mu _{\mathsf {K} Z}$
 whence 
 $\rho = \mu _{X} \otimes \mu _{Z}$
 because
$\rho = \mu _{X} \otimes \mu _{Z}$
 because 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are assumed to be quasi-disjointness. If follows that the measure
$\textbf {Z}$
 are assumed to be quasi-disjointness. If follows that the measure 
 $(\tau \times \operatorname {\textrm {id}}_{\textbf {Z}})(\rho )$
 on
$(\tau \times \operatorname {\textrm {id}}_{\textbf {Z}})(\rho )$
 on 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 is the product of the measures from
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {W}$
 is the product of the measures from 
 $\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
 and
$\mathsf {K}\textbf {X}\times _{\mathsf {K}\textbf {Y}}\textbf {Y}$
 and 
 $\textbf {Z}$
 and, hence,
$\textbf {Z}$
 and, hence, 
 $\unicode{x3bb} = \mu _{Y} \otimes \mu _{Z}$
 as desired.
$\unicode{x3bb} = \mu _{Y} \otimes \mu _{Z}$
 as desired.
 It remains to prove the claim. Fix f in 
 $\mathsf {AP}(\textbf {X})$
 and
$\mathsf {AP}(\textbf {X})$
 and 
 $\phi $
 in
$\phi $
 in 
 $\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. We have
$\operatorname {\textrm {L}}^{2}(Z,\mu _{Z})$
. We have 
 $$ \begin{align*} \int f \otimes \phi \,\textit{d} \pi_{1,3} (\mu_{\mathsf{K} X} \otimes_{\mathsf{K} \textbf{Y}} \unicode{x3bb}) &= \int f \otimes 1 \otimes \phi \,\textit{d} (\mu_{\mathsf{K} X} \otimes_{\mathsf{K} \textbf{Y}} \unicode{x3bb})\\ &= \int \mathbb{E}({f}\vert{\mathsf{K}\textbf{Y}}) \mathbb{E}({1 \otimes\phi}\vert{\mathsf{K}\textbf{Y}}) \,\textit{d} \mu_{\mathsf{K} Y}, \end{align*} $$
$$ \begin{align*} \int f \otimes \phi \,\textit{d} \pi_{1,3} (\mu_{\mathsf{K} X} \otimes_{\mathsf{K} \textbf{Y}} \unicode{x3bb}) &= \int f \otimes 1 \otimes \phi \,\textit{d} (\mu_{\mathsf{K} X} \otimes_{\mathsf{K} \textbf{Y}} \unicode{x3bb})\\ &= \int \mathbb{E}({f}\vert{\mathsf{K}\textbf{Y}}) \mathbb{E}({1 \otimes\phi}\vert{\mathsf{K}\textbf{Y}}) \,\textit{d} \mu_{\mathsf{K} Y}, \end{align*} $$
so it suffices to prove that
 $$ \begin{align} \mathbb{E}({1 \otimes \phi}\vert{\mathsf{K} \textbf{Y}}) = \int \phi \,\textit{d} \mu_{Z} \end{align} $$
$$ \begin{align} \mathbb{E}({1 \otimes \phi}\vert{\mathsf{K} \textbf{Y}}) = \int \phi \,\textit{d} \mu_{Z} \end{align} $$
in 
 $\operatorname {\textrm {L}}^{2}(Y \times Z, \unicode{x3bb} )$
. Fix
$\operatorname {\textrm {L}}^{2}(Y \times Z, \unicode{x3bb} )$
. Fix 
 $\psi $
 in
$\psi $
 in 
 $\mathsf {AP}(\textbf {Y})$
. We have
$\mathsf {AP}(\textbf {Y})$
. We have 
 $$ \begin{align*} \int (\psi \otimes 1) (1 \otimes \phi) \,\textit{d} \unicode{x3bb} = \int (\psi \otimes 1) \mathbb{E}({1 \otimes \phi}\vert{\mathsf{K} \textbf{W}}) \,\textit{d} \unicode{x3bb} \end{align*} $$
$$ \begin{align*} \int (\psi \otimes 1) (1 \otimes \phi) \,\textit{d} \unicode{x3bb} = \int (\psi \otimes 1) \mathbb{E}({1 \otimes \phi}\vert{\mathsf{K} \textbf{W}}) \,\textit{d} \unicode{x3bb} \end{align*} $$
because 
 $\psi \otimes 1$
 is in
$\psi \otimes 1$
 is in 
 $\mathsf {AP}(\textbf {W})$
 by Lemma 4.6. However,
$\mathsf {AP}(\textbf {W})$
 by Lemma 4.6. However, 
 $$ \begin{align*} \mathbb{E}({1 \otimes \phi}\vert{\mathsf{K} \textbf{W}}) = \mathbb{E}({1 \otimes \mathbb{E}({\phi}\vert{\mathsf{K} \textbf{Z}})}\vert{\mathsf{K} \textbf{W}}) \end{align*} $$
$$ \begin{align*} \mathbb{E}({1 \otimes \phi}\vert{\mathsf{K} \textbf{W}}) = \mathbb{E}({1 \otimes \mathbb{E}({\phi}\vert{\mathsf{K} \textbf{Z}})}\vert{\mathsf{K} \textbf{W}}) \end{align*} $$
by Lemmas 4.6 and 4.7 upon writing 
 $\phi = \mathbb {E}({\phi }\vert {\mathsf {K} \textbf {Z}}) + ( \phi - \mathbb {E}({\phi }\vert {\mathsf {K} \textbf {Z}}))$
. Thus, we calculate that
$\phi = \mathbb {E}({\phi }\vert {\mathsf {K} \textbf {Z}}) + ( \phi - \mathbb {E}({\phi }\vert {\mathsf {K} \textbf {Z}}))$
. Thus, we calculate that 
 $$ \begin{align*} \int (\psi \otimes 1) (1 \otimes \phi) \,\textit{d} \unicode{x3bb} & = \int (\psi \otimes 1) (1 \otimes \mathbb{E}({\phi}\vert{\mathsf{K} \textbf{Z}})) \,\textit{d} \unicode{x3bb} \\ & = \int \psi \otimes \mathbb{E}({\phi}\vert{\mathsf{K} \textbf{Z}}) \,\textit{d} \unicode{x3bb} \\ & = \int \mathbb{E}({\psi}\vert{\mathsf{K} \textbf{Y}}) \otimes \mathbb{E}({\phi}\vert{\mathsf{K} \textbf{Z}}) \,\textit{d} \unicode{x3bb} = \int \psi \,\textit{d} \mu_{Y} \int \phi \,\textit{d} \mu_{Z}, \end{align*} $$
$$ \begin{align*} \int (\psi \otimes 1) (1 \otimes \phi) \,\textit{d} \unicode{x3bb} & = \int (\psi \otimes 1) (1 \otimes \mathbb{E}({\phi}\vert{\mathsf{K} \textbf{Z}})) \,\textit{d} \unicode{x3bb} \\ & = \int \psi \otimes \mathbb{E}({\phi}\vert{\mathsf{K} \textbf{Z}}) \,\textit{d} \unicode{x3bb} \\ & = \int \mathbb{E}({\psi}\vert{\mathsf{K} \textbf{Y}}) \otimes \mathbb{E}({\phi}\vert{\mathsf{K} \textbf{Z}}) \,\textit{d} \unicode{x3bb} = \int \psi \,\textit{d} \mu_{Y} \int \phi \,\textit{d} \mu_{Z}, \end{align*} $$
where we have used, in the last equality, the fact that 
 $\unicode{x3bb} $
 projects to the product joining of
$\unicode{x3bb} $
 projects to the product joining of 
 $\mathsf {K} \textbf {Y}$
 and
$\mathsf {K} \textbf {Y}$
 and 
 $\mathsf {K} \textbf {Z}$
. This establishes (8) and, therefore, the claim.
$\mathsf {K} \textbf {Z}$
. This establishes (8) and, therefore, the claim.
4.3 Quasi-disjointness is preserved by inverse limits
Theorem 4.8. Let 
 ${\textbf X}$
 and
${\textbf X}$
 and 
 ${\textbf Z}$
 be G systems and assume that
${\textbf Z}$
 be G systems and assume that 
 ${\textbf X}$
 is the inverse limit of a sequence
${\textbf X}$
 is the inverse limit of a sequence 
 $n \mapsto {\textbf X}_{n}$
 of G systems. If each
$n \mapsto {\textbf X}_{n}$
 of G systems. If each 
 ${\textbf X}_{n}$
 is quasi-disjoint from
${\textbf X}_{n}$
 is quasi-disjoint from 
 ${\textbf Z}$
, then so is
${\textbf Z}$
, then so is 
 ${\textbf X}$
.
${\textbf X}$
.
Proof. Fix a joining 
 $\unicode{x3bb} $
 of
$\unicode{x3bb} $
 of 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 whose projection to a joining of
$\textbf {Z}$
 whose projection to a joining of 
 $\mathsf {K} \textbf {X}$
 with
$\mathsf {K} \textbf {X}$
 with 
 $\mathsf {K} \textbf {Z}$
 is the product measure. For every n, the system
$\mathsf {K} \textbf {Z}$
 is the product measure. For every n, the system 
 $\mathsf {K} \textbf {X}_{n}$
 is a factor of
$\mathsf {K} \textbf {X}_{n}$
 is a factor of 
 $\mathsf {K} \textbf {X}$
 so
$\mathsf {K} \textbf {X}$
 so 
 $\unicode{x3bb} $
 projects to the product joining of
$\unicode{x3bb} $
 projects to the product joining of 
 $\mathsf {K} \textbf {X}_{n}$
 with
$\mathsf {K} \textbf {X}_{n}$
 with 
 $\mathsf {K} \textbf {Z}$
. As
$\mathsf {K} \textbf {Z}$
. As 
 $\textbf {X}_{n}$
 and
$\textbf {X}_{n}$
 and 
 $\textbf {Z}$
 are assumed quasi-disjoint, the projection of
$\textbf {Z}$
 are assumed quasi-disjoint, the projection of 
 $\unicode{x3bb} $
 to a joining of
$\unicode{x3bb} $
 to a joining of 
 $\textbf {X}_{n}$
 with
$\textbf {X}_{n}$
 with 
 $\textbf {Z}$
 is the product measure
$\textbf {Z}$
 is the product measure 
 $\mu _{X_{n}} \otimes \mu _{Z}$
. Therefore,
$\mu _{X_{n}} \otimes \mu _{Z}$
. Therefore, 
 $\unicode{x3bb} (A \times B) = \mu _{X}(A) \mu _{Z}(B)$
 for all measurable
$\unicode{x3bb} (A \times B) = \mu _{X}(A) \mu _{Z}(B)$
 for all measurable 
 $A \subset X_{n}$
 and all
$A \subset X_{n}$
 and all 
 $B \subset Z$
 for all
$B \subset Z$
 for all 
 $n \in \mathbb {N}$
. In other words,
$n \in \mathbb {N}$
. In other words, 
 $\unicode{x3bb} $
 and
$\unicode{x3bb} $
 and 
 $\mu _{X} \otimes \mu _{Z}$
 agree on all measurable sets of the form
$\mu _{X} \otimes \mu _{Z}$
 agree on all measurable sets of the form 
 $A \times B$
 where
$A \times B$
 where 
 $A \subset X_{n}$
 for some
$A \subset X_{n}$
 for some 
 $n \in \mathbb {N}$
 and
$n \in \mathbb {N}$
 and 
 $B \subset Z$
. As the
$B \subset Z$
. As the 
 $\sigma $
-algebra generated by such sets is the Borel
$\sigma $
-algebra generated by such sets is the Borel 
 $\sigma $
-algebra on
$\sigma $
-algebra on 
 $X \times Z$
 we must have
$X \times Z$
 we must have 
 $\unicode{x3bb} = \mu _{X} \otimes \mu _{Z}$
 as desired.
$\unicode{x3bb} = \mu _{X} \otimes \mu _{Z}$
 as desired.
4.4 An example
 It follows from Theorem 1.1 that any ergodic measurably distal 
 $\mathbb {Z}$
 system is quasi-disjoint from itself. However, the converse is not true.
$\mathbb {Z}$
 system is quasi-disjoint from itself. However, the converse is not true.
Example 4.9. There exists an ergodic 
 $\mathbb {Z}$
 system
$\mathbb {Z}$
 system 
 $\textbf {X}$
 which is not measurably distal but is quasi-disjoint from itself and every other ergodic system.
$\textbf {X}$
 which is not measurably distal but is quasi-disjoint from itself and every other ergodic system.
Proof. In [Reference Glasner and WeissGW89, Theorem 2.2], Glasner and Weiss constructed a continuous 
 $\mathbb {Z}$
 action on a compact, metric space X that is minimal and uniquely ergodic, but for which the corresponding
$\mathbb {Z}$
 action on a compact, metric space X that is minimal and uniquely ergodic, but for which the corresponding 
 $\mathbb {Z}$
 system
$\mathbb {Z}$
 system 
 $(X,T,\mu _{X})$
 is not measurably distal. Moreover, they prove for their system that the only invariant measure on the set
$(X,T,\mu _{X})$
 is not measurably distal. Moreover, they prove for their system that the only invariant measure on the set 
 $\tilde X=\{\mu \in {\mathcal M}(X):\pi _{{\textbf X},\mathsf {K}{\textbf X}}(\mu )=\mu _{\mathsf {K}{\textbf X}}\}$
 is the Dirac measure
$\tilde X=\{\mu \in {\mathcal M}(X):\pi _{{\textbf X},\mathsf {K}{\textbf X}}(\mu )=\mu _{\mathsf {K}{\textbf X}}\}$
 is the Dirac measure 
 $\delta _{\mu _{X}}$
 at the unique invariant measure
$\delta _{\mu _{X}}$
 at the unique invariant measure 
 $\mu _{X}\in \tilde X$
 of X.
$\mu _{X}\in \tilde X$
 of X.
 Given any ergodic system 
 ${\textbf Y}$
, let
${\textbf Y}$
, let 
 $\unicode{x3bb} \in \mathcal {J}_{\mathsf {e}}({\textbf X},{\textbf Y})$
 and assume that
$\unicode{x3bb} \in \mathcal {J}_{\mathsf {e}}({\textbf X},{\textbf Y})$
 and assume that 
 $\unicode{x3bb} $
 projects to the product measure in
$\unicode{x3bb} $
 projects to the product measure in 
 $\mathsf {K}{\textbf X}\times \mathsf {K}{\textbf Y}$
. Let
$\mathsf {K}{\textbf X}\times \mathsf {K}{\textbf Y}$
. Let 
 $\unicode{x3bb} =\int _{Y}\unicode{x3bb} _{y}\,\textit {d}\mu _{Y}(y)$
 be the disintegration of
$\unicode{x3bb} =\int _{Y}\unicode{x3bb} _{y}\,\textit {d}\mu _{Y}(y)$
 be the disintegration of 
 $\unicode{x3bb} $
 with respect to the factor map
$\unicode{x3bb} $
 with respect to the factor map 
 $(X\times Y,\unicode{x3bb} )\to (Y,\mu _{Y})$
. Observe that
$(X\times Y,\unicode{x3bb} )\to (Y,\mu _{Y})$
. Observe that 

and so 
 $\pi _{{\textbf X},\mathsf {K}{\textbf X}}(\unicode{x3bb} _{y})=\mu _{\mathsf {K}{\textbf X}}$
, which implies that
$\pi _{{\textbf X},\mathsf {K}{\textbf X}}(\unicode{x3bb} _{y})=\mu _{\mathsf {K}{\textbf X}}$
, which implies that 
 $\unicode{x3bb} _{y}\in \tilde X$
.
$\unicode{x3bb} _{y}\in \tilde X$
.
 Let 
 $\nu $
 be the measure on
$\nu $
 be the measure on 
 $\tilde X$
 obtained as the pushforward of
$\tilde X$
 obtained as the pushforward of 
 $\mu _{Y}$
 by the map
$\mu _{Y}$
 by the map 
 $y\mapsto \unicode{x3bb} _{y}$
. As
$y\mapsto \unicode{x3bb} _{y}$
. As 
 $\nu $
 is invariant, it follows that
$\nu $
 is invariant, it follows that 
 $\nu =\delta _{\mu _{X}}$
 and, therefore,
$\nu =\delta _{\mu _{X}}$
 and, therefore, 
 $\unicode{x3bb} =\mu _{X}\otimes \mu _{Y}$
. We conclude that
$\unicode{x3bb} =\mu _{X}\otimes \mu _{Y}$
. We conclude that 
 ${\textbf X}$
 and
${\textbf X}$
 and 
 ${\textbf Y}$
 are quasi-disjoint as desired.
${\textbf Y}$
 are quasi-disjoint as desired.
5 Proof of Theorem 1.6
In this section, we prove Theorem 1.6.
Proof of Theorem 1.6
 In view of a version for amenable groups of the Jewett–Krieger theorem [Reference RosenthalRos86], we can assume that 
 $\textbf {Y}$
 is uniquely ergodic. Fix
$\textbf {Y}$
 is uniquely ergodic. Fix 
 $(X,T,\mu _{X})$
 Kronecker disjoint from
$(X,T,\mu _{X})$
 Kronecker disjoint from 
 $(Y,S,\mu _{Y})$
, a function
$(Y,S,\mu _{Y})$
, a function 
 $f\in \mathsf {C}(X)$
, and a point
$f\in \mathsf {C}(X)$
, and a point 
 $x\in X$
. Kronecker disjointness together with Theorem 1.4 implies that the sequence
$x\in X$
. Kronecker disjointness together with Theorem 1.4 implies that the sequence 
 $$ \begin{align*} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} \delta_{(T^{g} x,S^{g} y)} \end{align*} $$
$$ \begin{align*} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} \delta_{(T^{g} x,S^{g} y)} \end{align*} $$
of measures converges to 
 $\mu _{X} \otimes \mu _{Y}$
 for every
$\mu _{X} \otimes \mu _{Y}$
 for every 
 $y\in Y$
. This implies that we can take
$y\in Y$
. This implies that we can take 
 $Y^{\prime } = Y$
 when
$Y^{\prime } = Y$
 when 
 $\phi $
 is continuous.
$\phi $
 is continuous.
 For the general case, fix 
 $\phi $
 in
$\phi $
 in 
 $\operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
. Let
$\operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
. Let 
 $k \mapsto \phi _{k}$
 be a sequence in
$k \mapsto \phi _{k}$
 be a sequence in 
 $\mathsf {C}(Y)$
 with
$\mathsf {C}(Y)$
 with 
 $\phi _{k} \to \phi $
 in
$\phi _{k} \to \phi $
 in 
 $\operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
. Let
$\operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
. Let 
 $Y^{\prime }$
 be the set of points
$Y^{\prime }$
 be the set of points 
 $y\in Y$
 such that
$y\in Y$
 such that 
 $$ \begin{align} \lim_{N\to\infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} |\phi_{k}(S^{g}y)-\phi(S^{g}y)| = \| {\phi_{k}-\phi} \|_{1} \end{align} $$
$$ \begin{align} \lim_{N\to\infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} |\phi_{k}(S^{g}y)-\phi(S^{g}y)| = \| {\phi_{k}-\phi} \|_{1} \end{align} $$
for all 
 $k \in \mathbb {N}$
. In view of Lindenstrauss’ pointwise ergodic theorem [Reference LindenstraussLin01], we have that
$k \in \mathbb {N}$
. In view of Lindenstrauss’ pointwise ergodic theorem [Reference LindenstraussLin01], we have that 
 $\mu _{Y}(Y^{\prime })=1$
. Next, let
$\mu _{Y}(Y^{\prime })=1$
. Next, let 
 $y\in Y^{\prime }$
, let
$y\in Y^{\prime }$
, let 
 $f\in \mathsf {C}(X)$
,
$f\in \mathsf {C}(X)$
, 
 $x\in X$
, and let
$x\in X$
, and let 
 $k,N\in \mathbb {N}$
. By rescaling, assume that
$k,N\in \mathbb {N}$
. By rescaling, assume that 
 $\sup \{ f(x) : x \in X \} \leqslant 1$
. We have
$\sup \{ f(x) : x \in X \} \leqslant 1$
. We have 
 $$ \begin{align} &\bigg| \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} f(T^{g} x)\phi_{k}(S^{g} y) - \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} f(T^{g} x)\phi(S^{g} y) \bigg|\nonumber\\ &\quad\leqslant \frac1{|\Phi_{N}|}\sum_{g\in \Phi_{N}}|\phi_{k}(S^{g}y)-\phi(S^{g}y)|, \end{align} $$
$$ \begin{align} &\bigg| \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} f(T^{g} x)\phi_{k}(S^{g} y) - \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} f(T^{g} x)\phi(S^{g} y) \bigg|\nonumber\\ &\quad\leqslant \frac1{|\Phi_{N}|}\sum_{g\in \Phi_{N}}|\phi_{k}(S^{g}y)-\phi(S^{g}y)|, \end{align} $$
 $$ \begin{align} \lim_{N\to\infty}\frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} f(T^{g} x)\phi_{k}(S^{g} y) = \int_{X}f\,\textit{d}\mu_{X}\int_{Y}\phi_{k}\,\textit{d}\mu_{Y} \end{align} $$
$$ \begin{align} \lim_{N\to\infty}\frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} f(T^{g} x)\phi_{k}(S^{g} y) = \int_{X}f\,\textit{d}\mu_{X}\int_{Y}\phi_{k}\,\textit{d}\mu_{Y} \end{align} $$
and putting (9), (10), and (11) together, we obtain
 $$ \begin{align*} \limsup_{N\to\infty} \bigg|\frac{1}{|\Phi_{N}|} \sum_{g\in \Phi_{N}} f(T^{g} x) \phi(S^{g} y) -\int_{X} f \,\textit{d} \mu_{X} \int_{Y}\phi_{k} \,\textit{d} \mu_ Y \bigg| \leqslant \| {\phi_{k}-\phi} \|_{1} \end{align*} $$
$$ \begin{align*} \limsup_{N\to\infty} \bigg|\frac{1}{|\Phi_{N}|} \sum_{g\in \Phi_{N}} f(T^{g} x) \phi(S^{g} y) -\int_{X} f \,\textit{d} \mu_{X} \int_{Y}\phi_{k} \,\textit{d} \mu_ Y \bigg| \leqslant \| {\phi_{k}-\phi} \|_{1} \end{align*} $$
for every 
 $k\in \mathbb {N}$
. As
$k\in \mathbb {N}$
. As 
 $\phi _{k}\to \phi $
 in
$\phi _{k}\to \phi $
 in 
 $\operatorname {\textrm {L}}^{1}$
, we obtain the desired result.
$\operatorname {\textrm {L}}^{1}$
, we obtain the desired result.
We now see how to derive the classical Wiener–Wintner theorem from Theorem 1.6.
Corollary 5.1. (Wiener–Wintner theorem)
 Let 
 ${\textbf X}=(X,T,\mu _{X})$
 be a
${\textbf X}=(X,T,\mu _{X})$
 be a 
 $\mathbb {Z}$
 system and let
$\mathbb {Z}$
 system and let 
 $f\in \operatorname {\textrm {L}}^{1}(X,\mu _{X})$
. There exists a set
$f\in \operatorname {\textrm {L}}^{1}(X,\mu _{X})$
. There exists a set 
 $X_{0}\subset X$
 with
$X_{0}\subset X$
 with 
 $\mu (X_{0})=1$
 such that for every
$\mu (X_{0})=1$
 such that for every 
 $\alpha \in \mathbb {R}$
 and every
$\alpha \in \mathbb {R}$
 and every 
 $x\in X_{0}$
 the limit
$x\in X_{0}$
 the limit 
 $$ \begin{align} \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N}f(T^{n}x)e(n\alpha) \end{align} $$
$$ \begin{align} \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N}f(T^{n}x)e(n\alpha) \end{align} $$
exists.
Proof. Denote by 
 $\mathbb {T}$
 the circle
$\mathbb {T}$
 the circle 
 $\mathbb {R}/\mathbb {Z}$
. For each
$\mathbb {R}/\mathbb {Z}$
. For each 
 $\alpha \in \mathbb {R}$
, let
$\alpha \in \mathbb {R}$
, let 
 $R_{\alpha }:\mathbb {T}\to \mathbb {T}$
 be the rotation
$R_{\alpha }:\mathbb {T}\to \mathbb {T}$
 be the rotation 
 $R_{\alpha }:t\mapsto t+\alpha $
. The pointwise ergodic theorem of Birkhoff applied to the system
$R_{\alpha }:t\mapsto t+\alpha $
. The pointwise ergodic theorem of Birkhoff applied to the system 
 $(X\times \mathbb {T}, T\times R_{\alpha })$
 implies that there exists a set
$(X\times \mathbb {T}, T\times R_{\alpha })$
 implies that there exists a set 
 $X_{\alpha }\subset X$
 with full measure such that, for every
$X_{\alpha }\subset X$
 with full measure such that, for every 
 $x\in X_{\alpha }$
, the limit (12) exists. The discrete spectrum
$x\in X_{\alpha }$
, the limit (12) exists. The discrete spectrum 
 $\mathsf {Eig}({\textbf X})$
 of
$\mathsf {Eig}({\textbf X})$
 of 
 $\textbf {X}$
 (cf. Definition 2.2) is at most countable, so the intersection
$\textbf {X}$
 (cf. Definition 2.2) is at most countable, so the intersection 
 $X_{1}=\bigcap _{\alpha \in \mathsf {Eig}({\textbf X})}X_{\alpha }$
 still has full measure.
$X_{1}=\bigcap _{\alpha \in \mathsf {Eig}({\textbf X})}X_{\alpha }$
 still has full measure.
 Next, let 
 $X_{2}\subset X$
 be the full measure set given by Theorem 1.6 applied to
$X_{2}\subset X$
 be the full measure set given by Theorem 1.6 applied to 
 ${\textbf X}$
 in place of
${\textbf X}$
 in place of 
 ${\textbf Y}$
. As for every
${\textbf Y}$
. As for every 
 $\alpha \notin \mathsf {Eig}({\textbf X})$
 the systems
$\alpha \notin \mathsf {Eig}({\textbf X})$
 the systems 
 ${\textbf X}$
 and
${\textbf X}$
 and 
 $(\mathbb {T},R_{\alpha })$
 are Kronecker disjoint, it follows that for every
$(\mathbb {T},R_{\alpha })$
 are Kronecker disjoint, it follows that for every 
 $x\in X_{2}$
, the limit in (12) exists. Therefore, the limit exists for every
$x\in X_{2}$
, the limit in (12) exists. Therefore, the limit exists for every 
 $\alpha \in \mathbb {R}$
 and every
$\alpha \in \mathbb {R}$
 and every 
 $x\in X_{0}=X_{1}\cap X_{2}$
.
$x\in X_{0}=X_{1}\cap X_{2}$
.
6 Applications to multicorrelation sequences and multiple recurrence
 In this section, we present applications of our main results to the theory of multiple recurrence. In particular, this section contains proofs of Theorems 1.7 and 1.8. We remark that Theorems 1.7 and 1.8 only concern measure-preserving 
 $\mathbb {Z}$
 systems. The analogs for more general groups G remain open. We formulate some open questions in this direction in §7.
$\mathbb {Z}$
 systems. The analogs for more general groups G remain open. We formulate some open questions in this direction in §7.
6.1 Preliminaries on nilmanifolds
In [Reference Host and KraHK05], Host and Kra established a structure theorem for multiple ergodic averages which revealed a deep connection between multi-correlation sequences and single-orbit dynamics on compact nilmanifolds. In the proofs of Theorems 1.7 and 1.8, we make use of refinements of the Host–Kra structure theorem which appeared in [Reference Bergelson, Host and KraBHK05, Reference Host and KraHK09, Reference Moreira and RichterMR19, Reference Moreira and RichterMR21]. The purpose of this subsection is to give an overview of these results and some related methods that we use in the subsequent sections.
 We begin with the definition of a nilmanifold. A closed subgroup 
 $\Gamma $
 of G is called uniform if
$\Gamma $
 of G is called uniform if 
 $G/\Gamma $
 is compact or, equivalently, if there exists a compact set K such that
$G/\Gamma $
 is compact or, equivalently, if there exists a compact set K such that 
 $K\Gamma = G$
. Let G be a k-step nilpotent Lie group and let
$K\Gamma = G$
. Let G be a k-step nilpotent Lie group and let 
 $\Gamma \subset G$
 be a uniform and discrete subgroup of G. The quotient space
$\Gamma \subset G$
 be a uniform and discrete subgroup of G. The quotient space 
 $G/\Gamma $
 is called a nilmanifold. Naturally, G acts continuously and transitively on
$G/\Gamma $
 is called a nilmanifold. Naturally, G acts continuously and transitively on 
 $G/\Gamma $
 via left-multiplication, that is,
$G/\Gamma $
 via left-multiplication, that is, 
 $a(g\Gamma )=(ag)\Gamma $
 for all
$a(g\Gamma )=(ag)\Gamma $
 for all 
 $a\in G$
 and all
$a\in G$
 and all 
 $g\Gamma \in G/\Gamma $
. On any nilmanifold
$g\Gamma \in G/\Gamma $
. On any nilmanifold 
 $G/\Gamma $
, there exists a unique G invariant Borel probability measure called the Haar measure of
$G/\Gamma $
, there exists a unique G invariant Borel probability measure called the Haar measure of
 $G/\Gamma $
, which we denote by
$G/\Gamma $
, which we denote by 
 $\mu _{G/\Gamma }$
 (cf. [Reference RaghunathanRag72, Lemma 1.4]). Given a fixed group element
$\mu _{G/\Gamma }$
 (cf. [Reference RaghunathanRag72, Lemma 1.4]). Given a fixed group element 
 $a\in G$
 the map
$a\in G$
 the map 
 $R:G/\Gamma \to G/\Gamma $
 defined by
$R:G/\Gamma \to G/\Gamma $
 defined by 
 $R(x)=ax$
 for all
$R(x)=ax$
 for all 
 $x=g\Gamma \in G/\Gamma $
 is a niltranslation and the resulting
$x=g\Gamma \in G/\Gamma $
 is a niltranslation and the resulting 
 $\mathbb {Z}$
 system
$\mathbb {Z}$
 system 
 $(G/\Gamma ,R,\mu _{G/\Gamma })$
 is called a k-step nilsystem. We remark that
$(G/\Gamma ,R,\mu _{G/\Gamma })$
 is called a k-step nilsystem. We remark that 
 $(G/\Gamma ,R,\mu _{G/\Gamma })$
 is ergodic if and only if R acts transitively on
$(G/\Gamma ,R,\mu _{G/\Gamma })$
 is ergodic if and only if R acts transitively on 
 $G/\Gamma $
; in fact, if R is transitive, then
$G/\Gamma $
; in fact, if R is transitive, then 
 $\mu _{G/\Gamma }$
 is the unique R invariant Borel probability measure on
$\mu _{G/\Gamma }$
 is the unique R invariant Borel probability measure on 
 $G/\Gamma $
 (cf. [Reference Auslander, Green and HahnAGH63, Reference ParryPar69]). Finally, for any
$G/\Gamma $
 (cf. [Reference Auslander, Green and HahnAGH63, Reference ParryPar69]). Finally, for any 
 $x\in G/\Gamma $
, the orbit closure
$x\in G/\Gamma $
, the orbit closure 
 $Y=\overline {\{R^{n}x:n\in \mathbb {Z}\}}\subset G/\Gamma $
 is a sub-nilmanifold of
$Y=\overline {\{R^{n}x:n\in \mathbb {Z}\}}\subset G/\Gamma $
 is a sub-nilmanifold of 
 $G/\Gamma $
, meaning that there exists a closed subgroup H of G such that
$G/\Gamma $
, meaning that there exists a closed subgroup H of G such that 
 $a\in H$
,
$a\in H$
, 
 $Y=Hx$
 and
$Y=Hx$
 and 
 $\Lambda =H\cap \Gamma $
 is a uniform and discrete subgroup of H. In this case, there exists a unique H invariant Borel probability measure
$\Lambda =H\cap \Gamma $
 is a uniform and discrete subgroup of H. In this case, there exists a unique H invariant Borel probability measure 
 $\mu _{Y}$
 on Y, called the Haar measure of the sub-nilmanifold Y, and the system
$\mu _{Y}$
 on Y, called the Haar measure of the sub-nilmanifold Y, and the system 
 $(Y,R, \mu _{Y})$
 is a nilsystem, as it is isomorphic to
$(Y,R, \mu _{Y})$
 is a nilsystem, as it is isomorphic to 
 $(H/\Lambda ,R,\mu _{H/\Lambda })$
 (cf. [Reference LeibmanLei06]). For more information on nilmanifolds and nilsystems, we refer the reader to [Reference Auslander, Green and HahnAGH63, Reference ParryPar69, Reference ParryPar70, Reference RaghunathanRag72].
$(H/\Lambda ,R,\mu _{H/\Lambda })$
 (cf. [Reference LeibmanLei06]). For more information on nilmanifolds and nilsystems, we refer the reader to [Reference Auslander, Green and HahnAGH63, Reference ParryPar69, Reference ParryPar70, Reference RaghunathanRag72].
Theorem 6.1. (See [Reference Moreira and RichterMR21, Revised Theorem 7.1])
 Let 
 $k\in \mathbb {N}$
, let
$k\in \mathbb {N}$
, let 
 ${\textbf X}=(G/\Gamma ,R,\mu _{G/\Gamma })$
 be an ergodic nilrotation and assume that
${\textbf X}=(G/\Gamma ,R,\mu _{G/\Gamma })$
 be an ergodic nilrotation and assume that 
 $G/\Gamma $
 is connected. Define
$G/\Gamma $
 is connected. Define 
 $S= R\times R^{2}\times \cdots \times R^{k}$
 and for every
$S= R\times R^{2}\times \cdots \times R^{k}$
 and for every 
 $x\in G/\Gamma $
 consider the sub-nilmanifold
$x\in G/\Gamma $
 consider the sub-nilmanifold 
 $\Omega ({\textbf X},x)$
 of
$\Omega ({\textbf X},x)$
 of 
 $(G/\Gamma )^{k}=G^{k}/\Gamma ^{k}$
 defined as
$(G/\Gamma )^{k}=G^{k}/\Gamma ^{k}$
 defined as 
 $$ \begin{align*} \Omega({\textbf X},x)=\overline{\{S^{n}(x,x,\ldots, x): n\in\mathbb{Z}\}}. \end{align*} $$
$$ \begin{align*} \Omega({\textbf X},x)=\overline{\{S^{n}(x,x,\ldots, x): n\in\mathbb{Z}\}}. \end{align*} $$
Let 
 $\mu _{\Omega ({\textbf X},x)}$
 denote the Haar measure on
$\mu _{\Omega ({\textbf X},x)}$
 denote the Haar measure on 
 $\Omega ({\textbf X},x)$
 and let
$\Omega ({\textbf X},x)$
 and let 
 $\theta \in [0,1)$
. If
$\theta \in [0,1)$
. If 
 $e(\theta )\notin \mathsf {Eig}(\textbf {X})$
, then for almost every
$e(\theta )\notin \mathsf {Eig}(\textbf {X})$
, then for almost every 
 $x\in G/\Gamma $
,
$x\in G/\Gamma $
, 
 $e(\theta )\notin \mathsf {Eig}(\Omega ({\textbf X},x),S,\mu _{\Omega ({\textbf X},x)})$
.
$e(\theta )\notin \mathsf {Eig}(\Omega ({\textbf X},x),S,\mu _{\Omega ({\textbf X},x)})$
.
 For our purposes, we need a generalization of Theorem 6.1 that holds for nilmanifolds 
 $G/\Gamma $
 that are not necessarily connected.
$G/\Gamma $
 that are not necessarily connected.
Theorem 6.2. Let 
 $k\in \mathbb {N}$
 and let
$k\in \mathbb {N}$
 and let 
 ${\textbf Z}=(G/\Gamma ,R,\mu _{G/\Gamma })$
 be an ergodic nilsystem. Define
${\textbf Z}=(G/\Gamma ,R,\mu _{G/\Gamma })$
 be an ergodic nilsystem. Define 
 $S= R\times R^{2}\times \cdots \times R^{k}$
 and
$S= R\times R^{2}\times \cdots \times R^{k}$
 and 
 $$ \begin{align*} \Omega({\textbf Z},x)=\overline{\{S^{n}(x,x,\ldots, x): n\in\mathbb{Z}\}} \subset (G/\Gamma)^{k}. \end{align*} $$
$$ \begin{align*} \Omega({\textbf Z},x)=\overline{\{S^{n}(x,x,\ldots, x): n\in\mathbb{Z}\}} \subset (G/\Gamma)^{k}. \end{align*} $$
For any 
 $\theta \in [0,1)$
, if
$\theta \in [0,1)$
, if 
 $e(\theta )\notin \mathsf {Eig}({\textbf Z})$
 then for almost every
$e(\theta )\notin \mathsf {Eig}({\textbf Z})$
 then for almost every 
 $x\in G/\Gamma $
,
$x\in G/\Gamma $
, 
 $e(\theta )\notin \mathsf {Eig}(\Omega ({\textbf Z},x), S,\mu _{\Omega ({\textbf Z},x)})$
.
$e(\theta )\notin \mathsf {Eig}(\Omega ({\textbf Z},x), S,\mu _{\Omega ({\textbf Z},x)})$
.
To derive Theorem 6.2 from Theorem 6.1, we need the following well-known lemma regarding nilsystems.
Lemma 6.3. (Cf. [Reference Auslander, Green and HahnAGH63, Reference LeibmanLei05, Reference ParryPar69])
 Suppose 
 ${\textbf Z}=(G/\Gamma ,R,\mu _{G/\Gamma })$
 is a nilsystem. Then the following are equivalent:
${\textbf Z}=(G/\Gamma ,R,\mu _{G/\Gamma })$
 is a nilsystem. Then the following are equivalent: 
- 
(i)  $G/\Gamma $
 is connected and $G/\Gamma $
 is connected and ${\textbf Z}$
 is ergodic; ${\textbf Z}$
 is ergodic;
- 
(ii)  ${\textbf Z}$
 is totally ergodic. ${\textbf Z}$
 is totally ergodic.
Proof of Theorem 6.2
 Suppose 
 $G/\Gamma $
 is not connected. As
$G/\Gamma $
 is not connected. As 
 $G/\Gamma $
 is compact, it splits into finitely many distinct connected components
$G/\Gamma $
 is compact, it splits into finitely many distinct connected components 
 $Z_{0},Z_{1},\ldots ,Z_{t-1}$
. It is also straightforward to show that
$Z_{0},Z_{1},\ldots ,Z_{t-1}$
. It is also straightforward to show that 
 $Z_{i}$
 is itself a nilmanifold with Haar measure
$Z_{i}$
 is itself a nilmanifold with Haar measure 
 $\mu _{Z_{i}}$
 and that
$\mu _{Z_{i}}$
 and that 
 $\mu _{G/\Gamma }={1}/{t}(\mu _{Z_{0}}+\mu _{Z_{1}}+\cdots +\mu _{Z_{t-1}})$
. The ergodic niltranslation
$\mu _{G/\Gamma }={1}/{t}(\mu _{Z_{0}}+\mu _{Z_{1}}+\cdots +\mu _{Z_{t-1}})$
. The ergodic niltranslation 
 $R:G/\Gamma \to G/\Gamma $
 cyclically permutes these connected components, so after re-indexing them, if necessary, we have
$R:G/\Gamma \to G/\Gamma $
 cyclically permutes these connected components, so after re-indexing them, if necessary, we have 
 $R^{tn+i}Z_{0}=Z_{i}$
 for all
$R^{tn+i}Z_{0}=Z_{i}$
 for all 
 $n\in \mathbb {N}$
 and
$n\in \mathbb {N}$
 and 
 $i\in \{0,1,\ldots ,t-1\}$
. In particular, for every
$i\in \{0,1,\ldots ,t-1\}$
. In particular, for every 
 $i\in \{0,\ldots ,t-1\}$
 the component
$i\in \{0,\ldots ,t-1\}$
 the component 
 $Z_{i}$
 is
$Z_{i}$
 is 
 $R^{t}$
 invariant and
$R^{t}$
 invariant and 
 $R^{t}:Z_{i}\to Z_{i}$
 is an ergodic niltranslation on
$R^{t}:Z_{i}\to Z_{i}$
 is an ergodic niltranslation on 
 $Z_{i}$
. Let
$Z_{i}$
. Let 
 ${\textbf Z}_{i}=(Z_{i},R^{t},\mu _{Z_{i}})$
.
${\textbf Z}_{i}=(Z_{i},R^{t},\mu _{Z_{i}})$
.
 As 
 $Z_{i}$
 is connected, it follows from Lemma 6.3 that
$Z_{i}$
 is connected, it follows from Lemma 6.3 that 
 ${\textbf Z}_{i}$
 is totally ergodic. This means that
${\textbf Z}_{i}$
 is totally ergodic. This means that 
 $\mathsf {Eig}({\textbf Z}_{i})$
 contains no roots of unity. Also note that the function
$\mathsf {Eig}({\textbf Z}_{i})$
 contains no roots of unity. Also note that the function 
 $\sum _{i=0}^{t-1} e({i}/{t})1_{Z_{i}}$
 is an eigenfunction for R with eigenvalue
$\sum _{i=0}^{t-1} e({i}/{t})1_{Z_{i}}$
 is an eigenfunction for R with eigenvalue 
 $e({i}/{t})$
, where
$e({i}/{t})$
, where 
 $e(x)=e^{2\pi i x}$
 for all
$e(x)=e^{2\pi i x}$
 for all 
 $x\in \mathbb {R}$
. We conclude that
$x\in \mathbb {R}$
. We conclude that 
 $$ \begin{align*} \mathsf{Eig}({\textbf Z})\cap\{\text{roots of unity}\}=\bigg\{1,e\bigg(\dfrac1t\bigg),e\bigg(\dfrac2t\bigg),\ldots,e\bigg(\dfrac{t-1}t\bigg)\bigg\}.\end{align*} $$
$$ \begin{align*} \mathsf{Eig}({\textbf Z})\cap\{\text{roots of unity}\}=\bigg\{1,e\bigg(\dfrac1t\bigg),e\bigg(\dfrac2t\bigg),\ldots,e\bigg(\dfrac{t-1}t\bigg)\bigg\}.\end{align*} $$
On the other hand, if 
 $\zeta $
 is not a root of unity, then
$\zeta $
 is not a root of unity, then 
 $\zeta $
 is an eigenvalue for R if and only if
$\zeta $
 is an eigenvalue for R if and only if 
 $\zeta ^{t}$
 is an eigenvalue for
$\zeta ^{t}$
 is an eigenvalue for 
 $R^{t}$
; therefore,
$R^{t}$
; therefore, 
 $$ \begin{align*} \mathsf{Eig}({\textbf Z})=(\mathsf{Eig}({\textbf Z}_{0}))^{{1}/{t}}\cdot \bigg\{1,e\bigg(\dfrac1t\bigg),e\bigg(\dfrac2t\bigg),\ldots,e\bigg(\dfrac{t-1}t\bigg)\bigg\}. \end{align*} $$
$$ \begin{align*} \mathsf{Eig}({\textbf Z})=(\mathsf{Eig}({\textbf Z}_{0}))^{{1}/{t}}\cdot \bigg\{1,e\bigg(\dfrac1t\bigg),e\bigg(\dfrac2t\bigg),\ldots,e\bigg(\dfrac{t-1}t\bigg)\bigg\}. \end{align*} $$
 Let 
 $x\in Z$
 and let
$x\in Z$
 and let 
 $i_{0}\in \{0,1,\ldots ,t-1\}$
 be such that
$i_{0}\in \{0,1,\ldots ,t-1\}$
 be such that 
 $x\in Z_{i_{0}}$
. Define
$x\in Z_{i_{0}}$
. Define 
 $$ \begin{align*}\Omega({\textbf Z}_{i_{0}},x)= \overline{\{S^{tn}(x,x,\ldots, x): n\in\mathbb{Z}\}}.\end{align*} $$
$$ \begin{align*}\Omega({\textbf Z}_{i_{0}},x)= \overline{\{S^{tn}(x,x,\ldots, x): n\in\mathbb{Z}\}}.\end{align*} $$
Observe that for 
 $i\in \{0,\ldots ,t-1\}$
,
$i\in \{0,\ldots ,t-1\}$
, 
 $$ \begin{align*} S^{i}\Omega({\textbf Z}_{i_{0}},x)\subset Z_{i_{0}+i}\times Z_{i_{0}+2i} \times \cdots \times Z_{i_{0}+ki} \end{align*} $$
$$ \begin{align*} S^{i}\Omega({\textbf Z}_{i_{0}},x)\subset Z_{i_{0}+i}\times Z_{i_{0}+2i} \times \cdots \times Z_{i_{0}+ki} \end{align*} $$
and, hence, 
 $S^{i}\Omega ({\textbf Z}_{i_{0}},x)\cap S^{j}\Omega ({\textbf Z}_{i_{0}},x)=\emptyset $
 for
$S^{i}\Omega ({\textbf Z}_{i_{0}},x)\cap S^{j}\Omega ({\textbf Z}_{i_{0}},x)=\emptyset $
 for 
 $i\neq j$
. As
$i\neq j$
. As 
 ${\textbf Z}_{i_{0}}$
 is totally ergodic, it follows from Theorem 6.1 that for almost every
${\textbf Z}_{i_{0}}$
 is totally ergodic, it follows from Theorem 6.1 that for almost every 
 $x\in Z_{i_{0}}$
, the nilsystem
$x\in Z_{i_{0}}$
, the nilsystem 
 $(\Omega ({\textbf Z}_{i_{0}},x), S^{t})$
 is also totally ergodic. In view of Lemma 6.3, this means that
$(\Omega ({\textbf Z}_{i_{0}},x), S^{t})$
 is also totally ergodic. In view of Lemma 6.3, this means that 
 $\Omega ({\textbf Z}_{i_{0}},x)$
 is connected. We deduce that for almost every
$\Omega ({\textbf Z}_{i_{0}},x)$
 is connected. We deduce that for almost every 
 $x\in Z$
, the nilmanifold
$x\in Z$
, the nilmanifold 
 $\Omega ({\textbf Z},x)$
 has t connected components, because
$\Omega ({\textbf Z},x)$
 has t connected components, because 
 $$ \begin{align*} \Omega({\textbf Z},x)=\bigcup_{i=0}^{t-1}S^{i} \Omega({\textbf Z}_{i_{0}},x), \end{align*} $$
$$ \begin{align*} \Omega({\textbf Z},x)=\bigcup_{i=0}^{t-1}S^{i} \Omega({\textbf Z}_{i_{0}},x), \end{align*} $$
where 
 $\Omega ({\textbf Z}_{i_{0}},x), S^{1}\Omega ({\textbf Z}_{i_{0}},x),\ldots , S^{t-1}\Omega ({\textbf Z}_{i_{0}},x)$
 are connected and distinct.
$\Omega ({\textbf Z}_{i_{0}},x), S^{1}\Omega ({\textbf Z}_{i_{0}},x),\ldots , S^{t-1}\Omega ({\textbf Z}_{i_{0}},x)$
 are connected and distinct.
 Finally, suppose 
 $\theta \in [0,1)$
 is such that
$\theta \in [0,1)$
 is such that 
 $e(\theta )\notin \mathsf {Eig}({\textbf Z})$
. Then
$e(\theta )\notin \mathsf {Eig}({\textbf Z})$
. Then 
 $e(\theta )^{t}\notin \mathsf {Eig}({\textbf Z}_{i})$
 for all i. From Theorem 6.1, we deduce that for almost every
$e(\theta )^{t}\notin \mathsf {Eig}({\textbf Z}_{i})$
 for all i. From Theorem 6.1, we deduce that for almost every 
 $x\in Z$
,
$x\in Z$
, 
 $e(\theta )^{t}\notin \mathsf {Eig}(\Omega ({\textbf Z}_{i_{0}},x),S^{t},\mu _{\Omega ({\textbf Z}_{i_{0}},x)})$
 (where, again,
$e(\theta )^{t}\notin \mathsf {Eig}(\Omega ({\textbf Z}_{i_{0}},x),S^{t},\mu _{\Omega ({\textbf Z}_{i_{0}},x)})$
 (where, again, 
 $i_{0}\in \{0,\ldots ,t-1\}$
 is such that
$i_{0}\in \{0,\ldots ,t-1\}$
 is such that 
 $x\in Z_{i_{0}}$
) and, hence,
$x\in Z_{i_{0}}$
) and, hence, 
 $e(\theta )\notin \mathsf {Eig}(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
.
$e(\theta )\notin \mathsf {Eig}(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
.
Corollary 6.4. Let 
 $k\in \mathbb {N}$
 and let
$k\in \mathbb {N}$
 and let 
 ${\textbf Z}=(G/\Gamma ,R,\mu _{G/\Gamma })$
 be an ergodic nilsystem. Define
${\textbf Z}=(G/\Gamma ,R,\mu _{G/\Gamma })$
 be an ergodic nilsystem. Define 
 $S= R\times R^{2}\times \cdots \times R^{k}$
 and
$S= R\times R^{2}\times \cdots \times R^{k}$
 and 
 $$ \begin{align*} \Omega({\textbf Z},x)=\overline{\{S^{n}(x,x,\ldots, x): n\in\mathbb{Z}\}} \subset (G/\Gamma)^{k}. \end{align*} $$
$$ \begin{align*} \Omega({\textbf Z},x)=\overline{\{S^{n}(x,x,\ldots, x): n\in\mathbb{Z}\}} \subset (G/\Gamma)^{k}. \end{align*} $$
Let 
 $\textbf {X}=(X,T,\mu _{X})$
 be a
$\textbf {X}=(X,T,\mu _{X})$
 be a 
 $\mathbb {Z}$
 system. If
$\mathbb {Z}$
 system. If 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are Kronecker disjoint, then for almost every
$\textbf {Z}$
 are Kronecker disjoint, then for almost every 
 $x\in G/\Gamma $
, the
$x\in G/\Gamma $
, the 
 $\mathbb {Z}$
 systems
$\mathbb {Z}$
 systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
 are Kronecker disjoint.
$(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
 are Kronecker disjoint.
Proof. By way of contradiction, assume that there is a positive measure set 
 $X^{\prime }\subset G/\Gamma $
 such that
$X^{\prime }\subset G/\Gamma $
 such that 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
 are not Kronecker disjoint whenever
$(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
 are not Kronecker disjoint whenever 
 $x\in X^{\prime }$
. This means that for any
$x\in X^{\prime }$
. This means that for any 
 $x\in X^{\prime }$
, we can find some
$x\in X^{\prime }$
, we can find some 
 $\theta _{x}\in [0,1)$
 such that
$\theta _{x}\in [0,1)$
 such that 
 $e(\theta _{x})$
 is a common eigenvalue for the systems
$e(\theta _{x})$
 is a common eigenvalue for the systems 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
. As
$(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
. As 
 $\textbf {X}$
 only possesses countably many eigenvalues, there exists a positive measure subset
$\textbf {X}$
 only possesses countably many eigenvalues, there exists a positive measure subset 
 $X^{\prime \prime }\subset X^{\prime }$
 such that
$X^{\prime \prime }\subset X^{\prime }$
 such that 
 $\theta _{x}=\theta $
 is constant for all
$\theta _{x}=\theta $
 is constant for all 
 $x\in X^{\prime \prime }$
. As
$x\in X^{\prime \prime }$
. As 
 $e(\theta )$
 belongs to
$e(\theta )$
 belongs to 
 $\mathsf {Eig}(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
 for all
$\mathsf {Eig}(\Omega ({\textbf Z},x),S,\mu _{\Omega ({\textbf Z},x)})$
 for all 
 $x\in X^{\prime \prime }$
 and
$x\in X^{\prime \prime }$
 and 
 $X^{\prime \prime }$
 has positive measure, it follows from Theorem 6.2 that
$X^{\prime \prime }$
 has positive measure, it follows from Theorem 6.2 that 
 $e(\theta )$
 belongs to
$e(\theta )$
 belongs to 
 $\mathsf {Eig}(\textbf {Z})$
. This contradicts the assumption that
$\mathsf {Eig}(\textbf {Z})$
. This contradicts the assumption that 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 are Kronecker disjoint.
$\textbf {Z}$
 are Kronecker disjoint.
6.2 Proofs of Theorems 1.7 and 1.8
The following result, which is used in the proofs of Theorems 1.7 and 1.8, is contained implicitly in [Reference Host and KraHK09, §7.3].
Theorem 6.5. (Cf. [Reference Host and KraHK09, §7.3])
 Let 
 $k\in \mathbb {N}$
, let
$k\in \mathbb {N}$
, let 
 $\textbf {X}=(X,T,\mu _{X})$
 be a
$\textbf {X}=(X,T,\mu _{X})$
 be a 
 $\mathbb {Z}$
 system, and let
$\mathbb {Z}$
 system, and let 
 $f_{1},\ldots ,f_{k} \in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
. Then, for every
$f_{1},\ldots ,f_{k} \in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
. Then, for every 
 $\varepsilon>0$
, there exists a k-step nilsystem
$\varepsilon>0$
, there exists a k-step nilsystem 
 $(G/\Gamma ,R,\mu _{G/\Gamma })$
, which is a factor of
$(G/\Gamma ,R,\mu _{G/\Gamma })$
, which is a factor of 
 $(X,T,\mu _{X})$
, and there exist continuous functions
$(X,T,\mu _{X})$
, and there exist continuous functions 
 $g_{1},\ldots ,g_{k}\in \mathsf {C}(G/\Gamma )$
 such that for every bounded complex-valued sequence
$g_{1},\ldots ,g_{k}\in \mathsf {C}(G/\Gamma )$
 such that for every bounded complex-valued sequence 
 $(a_{n})_{n\in \mathbb {N}}$
, one has
$(a_{n})_{n\in \mathbb {N}}$
, one has 
 $$ \begin{align*} \limsup_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} a_{n}\prod_{i=0}^{k} T^{in} f_{i} - \frac{1}{N} \sum_{n=1}^{N} a_{n}\prod_{i=0}^{k} (R^{in}g_{i}) \circ \pi} \bigg\|_{2} \le \varepsilon \sup_{n\in\mathbb{N}}{|a_{n}|}, \end{align*} $$
$$ \begin{align*} \limsup_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} a_{n}\prod_{i=0}^{k} T^{in} f_{i} - \frac{1}{N} \sum_{n=1}^{N} a_{n}\prod_{i=0}^{k} (R^{in}g_{i}) \circ \pi} \bigg\|_{2} \le \varepsilon \sup_{n\in\mathbb{N}}{|a_{n}|}, \end{align*} $$
where 
 $\pi :X\to G/\Gamma $
 denotes the factor map from
$\pi :X\to G/\Gamma $
 denotes the factor map from 
 $(X,T,\mu _{X})$
 onto
$(X,T,\mu _{X})$
 onto 
 $(G/\Gamma ,R,\mu _{G/\Gamma })$
.
$(G/\Gamma ,R,\mu _{G/\Gamma })$
.
We also need the following lemma.
Lemma 6.6. Let 
 $(Y,S)$
 be a topological
$(Y,S)$
 be a topological 
 $\mathbb {Z}$
 system, let
$\mathbb {Z}$
 system, let 
 $\mu _{Y}$
 be an ergodic S invariant Borel probability measure on Y, and let
$\mu _{Y}$
 be an ergodic S invariant Borel probability measure on Y, and let 
 $G\in \operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
. Then there exists a set
$G\in \operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
. Then there exists a set 
 $Y^{\prime }\subset Y$
 with
$Y^{\prime }\subset Y$
 with 
 $\mu _{Y}(Y^{\prime })=1$
 such that for any ergodic nilsystem system
$\mu _{Y}(Y^{\prime })=1$
 such that for any ergodic nilsystem system 
 $(G/\Gamma ,\mu _{G/\Gamma },R)$
 which is Kronecker disjoint from
$(G/\Gamma ,\mu _{G/\Gamma },R)$
 which is Kronecker disjoint from 
 $(Y,S,\mu _{Y})$
, any
$(Y,S,\mu _{Y})$
, any 
 $F\in \mathsf {C}(G/\Gamma )$
, any
$F\in \mathsf {C}(G/\Gamma )$
, any 
 $x\in G/\Gamma $
, and any
$x\in G/\Gamma $
, and any 
 $y\in Y^{\prime }$
 we have
$y\in Y^{\prime }$
 we have 
 $$ \begin{align*} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} F(R^{n} x)G(S^{n} y)=\int_{G/\Gamma} F\,\textit{d}\mu_{G/\Gamma}\int_{Y} G\,\textit{d}\mu_{Y}. \end{align*} $$
$$ \begin{align*} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} F(R^{n} x)G(S^{n} y)=\int_{G/\Gamma} F\,\textit{d}\mu_{G/\Gamma}\int_{Y} G\,\textit{d}\mu_{Y}. \end{align*} $$
Moreover, if 
 $(Y,S)$
 is uniquely ergodic and
$(Y,S)$
 is uniquely ergodic and 
 $g\in \mathsf {C}(Y)$
, then we can take
$g\in \mathsf {C}(Y)$
, then we can take 
 $Y^{\prime }=Y$
.
$Y^{\prime }=Y$
.
Proof. As any nilsystem is measurably distal (see [Reference LeibmanLei05, Theorem 2.14]), Lemma 6.6 is a special case of Theorem 1.6.
Proof of Theorem 1.7
 Let 
 $\textbf {X}=(X,T,\mu _{X})$
 and
$\textbf {X}=(X,T,\mu _{X})$
 and 
 $\textbf {Y}=(Y,S,\mu _{Y})$
 be
$\textbf {Y}=(Y,S,\mu _{Y})$
 be 
 $\mathbb {Z}$
 systems and assume
$\mathbb {Z}$
 systems and assume 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 are Kronecker disjoint. Let
$\textbf {Y}$
 are Kronecker disjoint. Let 
 $k,\ell \in \mathbb {N}$
,
$k,\ell \in \mathbb {N}$
, 
 $f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
 and
$f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
 and 
 $g_{1},\ldots ,g_{\ell }\in \operatorname {\textrm {L}}^{\infty }(Y,\mu _{Y})$
. According to [Reference Host and KraHK05, Reference ZieglerZie07], the limit
$g_{1},\ldots ,g_{\ell }\in \operatorname {\textrm {L}}^{\infty }(Y,\mu _{Y})$
. According to [Reference Host and KraHK05, Reference ZieglerZie07], the limit 
 $$ \begin{align*} F= \lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N}\prod_{i=1}^{k} T^{in}f_{i} \end{align*} $$
$$ \begin{align*} F= \lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N}\prod_{i=1}^{k} T^{in}f_{i} \end{align*} $$
exists in 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and the limit
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
 and the limit 
 $$ \begin{align*} G= \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N}\prod_{j=1}^{\ell} S^{jn}g_{j} \end{align*} $$
$$ \begin{align*} G= \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N}\prod_{j=1}^{\ell} S^{jn}g_{j} \end{align*} $$
exists in 
 $\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
. Moreover, in view of [Reference TaoTao08] also the limit
$\operatorname {\textrm {L}}^{2}(Y,\mu _{Y})$
. Moreover, in view of [Reference TaoTao08] also the limit 
 $$ \begin{align*} H = \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N}\prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in}f_{i} S^{jn}g_{j} \end{align*} $$
$$ \begin{align*} H = \lim_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N}\prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in}f_{i} S^{jn}g_{j} \end{align*} $$
exists in 
 $\operatorname {\textrm {L}}^{2}(X\times Y,\mu _{X} \otimes \mu _{Y})$
. Our goal is to show that
$\operatorname {\textrm {L}}^{2}(X\times Y,\mu _{X} \otimes \mu _{Y})$
. Our goal is to show that 
 $$ \begin{align} H=F\otimes G. \end{align} $$
$$ \begin{align} H=F\otimes G. \end{align} $$
 We can assume without loss of generality that 
 $\| {f_{i}} \|_{\infty } \le 1$
 and
$\| {f_{i}} \|_{\infty } \le 1$
 and 
 $\| {g_{j}} \|_{\infty } \le 1$
. Fix
$\| {g_{j}} \|_{\infty } \le 1$
. Fix 
 $\varepsilon> 0$
. First, we apply Theorem 6.5 to find a k-step nilsystem
$\varepsilon> 0$
. First, we apply Theorem 6.5 to find a k-step nilsystem 
 $(G_{X}/\Gamma _{X},R_{X},\mu _{G_{X}/\Gamma _{X}})$
, which is a factor of
$(G_{X}/\Gamma _{X},R_{X},\mu _{G_{X}/\Gamma _{X}})$
, which is a factor of 
 $(X,T,\mu _{X})$
, and a set of continuous functions
$(X,T,\mu _{X})$
, and a set of continuous functions 
 $\tilde {f}_{1},\ldots ,\tilde {f}_{k}\in \mathsf {C}(G_{X}/\Gamma _{X})$
 such that for every bounded complex-valued sequence
$\tilde {f}_{1},\ldots ,\tilde {f}_{k}\in \mathsf {C}(G_{X}/\Gamma _{X})$
 such that for every bounded complex-valued sequence 
 $(a_{n})_{n\in \mathbb {N}}$
 one has
$(a_{n})_{n\in \mathbb {N}}$
 one has 
 $$ \begin{align*} \limsup_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} a_{n} \prod_{i=1}^{k} T^{in}f_{i} - \frac{1}{N} \sum_{n=1}^{N} a_{n} \prod_{i=1}^{k} (R_{X}^{in}\tilde{f}_{i})\circ \pi} \bigg\|_{2} \le \varepsilon \sup_{n\in\mathbb{N}}|a_{n}|, \end{align*} $$
$$ \begin{align*} \limsup_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} a_{n} \prod_{i=1}^{k} T^{in}f_{i} - \frac{1}{N} \sum_{n=1}^{N} a_{n} \prod_{i=1}^{k} (R_{X}^{in}\tilde{f}_{i})\circ \pi} \bigg\|_{2} \le \varepsilon \sup_{n\in\mathbb{N}}|a_{n}|, \end{align*} $$
where 
 $\pi :X\to G_{X}/\Gamma _{X}$
 denotes the factor map from
$\pi :X\to G_{X}/\Gamma _{X}$
 denotes the factor map from 
 $(X,T,\mu _{X})$
 onto
$(X,T,\mu _{X})$
 onto 
 $(G_{X}/\Gamma _{X},R_{X}, \mu _{G_{X}/\Gamma _{X}})$
. In particular, if we choose
$(G_{X}/\Gamma _{X},R_{X}, \mu _{G_{X}/\Gamma _{X}})$
. In particular, if we choose 
 $a_{n}=\prod _{j=1}^{\ell } S^{jn}g_{j}(y)$
 as y runs through Y, we obtain
$a_{n}=\prod _{j=1}^{\ell } S^{jn}g_{j}(y)$
 as y runs through Y, we obtain 
 $$ \begin{align} &\sup_{y\in Y} \limsup_{N\to\infty} \bigg\| \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in} f_{i} S^{jn}g_{j}(y)\nonumber\\ &\quad - \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k}\prod_{j=1}^{\ell} ( (R_{X}^{in}\tilde{f}_{i})\circ\pi) S^{jn}g_{j}(y) \bigg\|_{2} \le \varepsilon. \end{align} $$
$$ \begin{align} &\sup_{y\in Y} \limsup_{N\to\infty} \bigg\| \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in} f_{i} S^{jn}g_{j}(y)\nonumber\\ &\quad - \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k}\prod_{j=1}^{\ell} ( (R_{X}^{in}\tilde{f}_{i})\circ\pi) S^{jn}g_{j}(y) \bigg\|_{2} \le \varepsilon. \end{align} $$
From (14), it follows that
 $$ \begin{align} \limsup_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N}\prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in}f_{i} S^{jn}g_{j} - \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} ( (R_{X}^{in}\tilde{f}_{i}) \circ \pi ) S^{jn}g_{j}} \bigg\|_{2} \le \varepsilon. \end{align} $$
$$ \begin{align} \limsup_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N}\prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in}f_{i} S^{jn}g_{j} - \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} ( (R_{X}^{in}\tilde{f}_{i}) \circ \pi ) S^{jn}g_{j}} \bigg\|_{2} \le \varepsilon. \end{align} $$
 Similarly, we can pick a 
 $\ell $
-step nilsystem
$\ell $
-step nilsystem 
 $(G_{Y}/\Gamma _{Y},R_{Y},\mu _{G_{Y}/\Gamma _{Y}})$
, which is a factor of
$(G_{Y}/\Gamma _{Y},R_{Y},\mu _{G_{Y}/\Gamma _{Y}})$
, which is a factor of 
 $(Y,S,\mu _{Y})$
, and a set of continuous functions
$(Y,S,\mu _{Y})$
, and a set of continuous functions 
 $\tilde {g}_{1},\ldots ,\tilde {g}_{\ell }\in \mathsf {C}(G_{Y}/\Gamma _{Y})$
 such that, for every bounded complex-valued sequence
$\tilde {g}_{1},\ldots ,\tilde {g}_{\ell }\in \mathsf {C}(G_{Y}/\Gamma _{Y})$
 such that, for every bounded complex-valued sequence 
 $(b_{n})_{n\in \mathbb {N}}$
, one has
$(b_{n})_{n\in \mathbb {N}}$
, one has 
 $$ \begin{align*} \limsup_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} b_{n} \prod_{j=1}^{\ell} S^{jn}g_{j} - \frac{1}{N} \sum_{n=1}^{N} b_{n} \prod_{j=1}^{\ell} (R_{Y}^{jn} \tilde{g}_{j}) \circ \eta } \bigg\|_{2} \le \varepsilon \sup_{n\in\mathbb{N}}|b_{n}|, \end{align*} $$
$$ \begin{align*} \limsup_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} b_{n} \prod_{j=1}^{\ell} S^{jn}g_{j} - \frac{1}{N} \sum_{n=1}^{N} b_{n} \prod_{j=1}^{\ell} (R_{Y}^{jn} \tilde{g}_{j}) \circ \eta } \bigg\|_{2} \le \varepsilon \sup_{n\in\mathbb{N}}|b_{n}|, \end{align*} $$
where 
 $\eta :Y\to G_{Y}/\Gamma _{Y}$
 denotes the factor map from
$\eta :Y\to G_{Y}/\Gamma _{Y}$
 denotes the factor map from 
 $(Y,S,\mu _{Y})$
 onto
$(Y,S,\mu _{Y})$
 onto 
 $(G_{Y}/\Gamma _{Y},R_{Y}, \mu _{G_{Y}/\Gamma _{Y}})$
. If we set
$(G_{Y}/\Gamma _{Y},R_{Y}, \mu _{G_{Y}/\Gamma _{Y}})$
. If we set 
 $b_{n}=\prod _{i=1}^{k} (R_{X}^{in}\tilde {f}_{i})\circ \pi (x)$
, then we obtain
$b_{n}=\prod _{i=1}^{k} (R_{X}^{in}\tilde {f}_{i})\circ \pi (x)$
, then we obtain 
 $$ \begin{align} &\sup_{x\in X} \limsup_{N\to\infty} \bigg\| \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} (R_{X}^{in}\tilde{f}_{i})(\pi x) S^{jn}g_{j}\nonumber\\ &\quad - \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} (R_{X}^{in} \tilde{f}_{i})(\pi x)((R_{Y}^{jn}\tilde{g}_{j}) \circ \eta) \bigg\|_{2} \leqslant \varepsilon \end{align} $$
$$ \begin{align} &\sup_{x\in X} \limsup_{N\to\infty} \bigg\| \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} (R_{X}^{in}\tilde{f}_{i})(\pi x) S^{jn}g_{j}\nonumber\\ &\quad - \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} (R_{X}^{in} \tilde{f}_{i})(\pi x)((R_{Y}^{jn}\tilde{g}_{j}) \circ \eta) \bigg\|_{2} \leqslant \varepsilon \end{align} $$
and, hence,
 $$ \begin{align} &\limsup_{N\to\infty} \bigg\| \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} ( (R_{X}^{in} \tilde{f}_{i})\circ\pi ) S^{jn}g_{j} \nonumber\\ &\quad- \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} ((R_{X}^{in}\tilde{f}_{i})\circ\pi)((R_{Y}^{jn}\tilde{g}_{j})\circ \eta) \bigg\|_{2} \le \varepsilon. \end{align} $$
$$ \begin{align} &\limsup_{N\to\infty} \bigg\| \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} ( (R_{X}^{in} \tilde{f}_{i})\circ\pi ) S^{jn}g_{j} \nonumber\\ &\quad- \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} ((R_{X}^{in}\tilde{f}_{i})\circ\pi)((R_{Y}^{jn}\tilde{g}_{j})\circ \eta) \bigg\|_{2} \le \varepsilon. \end{align} $$
Combining (15) and (17) yields
 $$ \begin{align} \limsup_{N \to \infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in} f_{i} S^{jn} g_{j} - \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} ((R_{X}^{in}\tilde{f}_{i})\circ\pi )( (R_{Y}^{jn}\tilde{g}_{j})\circ \eta) } \bigg\|_{2} \!\le 2\varepsilon. \end{align} $$
$$ \begin{align} \limsup_{N \to \infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} T^{in} f_{i} S^{jn} g_{j} - \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} \prod_{j=1}^{\ell} ((R_{X}^{in}\tilde{f}_{i})\circ\pi )( (R_{Y}^{jn}\tilde{g}_{j})\circ \eta) } \bigg\|_{2} \!\le 2\varepsilon. \end{align} $$
 Next, we claim that for almost every 
 $x\in G_{X}/\Gamma _{X}$
 and almost every
$x\in G_{X}/\Gamma _{X}$
 and almost every 
 $y\in G_{Y}/\Gamma _{Y}$
, we have
$y\in G_{Y}/\Gamma _{Y}$
, we have 
 $$ \begin{align} &\lim_{N\to\infty}\bigg|\frac1N\sum_{n=1}^{N} \prod_{i=1}^{k}\prod_{j=1}^{\ell} \tilde{f}_{i}(R_{X}^{in}x)\tilde{g}_{j}(R_{Y}^{jn} y) \nonumber\\ &\quad- \bigg(\frac1N\sum_{n=1}^{N} \prod_{i=1}^{k}\tilde{f}_{i}(R_{X}^{in}x)\bigg) \bigg(\frac1N\sum_{n=1}^{N} \prod_{j=1}^{\ell} \tilde{g}_{j}(R_{Y}^{jn} y) \bigg)\bigg|=0. \end{align} $$
$$ \begin{align} &\lim_{N\to\infty}\bigg|\frac1N\sum_{n=1}^{N} \prod_{i=1}^{k}\prod_{j=1}^{\ell} \tilde{f}_{i}(R_{X}^{in}x)\tilde{g}_{j}(R_{Y}^{jn} y) \nonumber\\ &\quad- \bigg(\frac1N\sum_{n=1}^{N} \prod_{i=1}^{k}\tilde{f}_{i}(R_{X}^{in}x)\bigg) \bigg(\frac1N\sum_{n=1}^{N} \prod_{j=1}^{\ell} \tilde{g}_{j}(R_{Y}^{jn} y) \bigg)\bigg|=0. \end{align} $$
Assume for now that this claim holds. It follows from (14) and (16) that
 $$ \begin{align} &\limsup_{N\to\infty} \bigg\| \bigg( \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k}R_{X}^{in}\tilde{f}_{i}\circ\pi\bigg) \bigg(\frac{1}{N} \sum_{n=1}^{N} \prod_{j=1}^{\ell} R_{Y}^{jn}\tilde{g}_{j} \circ\eta\bigg) \nonumber\\ &\quad - \bigg(\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} T^{in}f_{i}\bigg) \bigg(\frac{1}{N} \sum_{n=1}^{N} \prod_{j=1}^{\ell} S^{jn}g_{j} \bigg) \bigg\|_{2} \le 2\varepsilon. \end{align} $$
$$ \begin{align} &\limsup_{N\to\infty} \bigg\| \bigg( \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k}R_{X}^{in}\tilde{f}_{i}\circ\pi\bigg) \bigg(\frac{1}{N} \sum_{n=1}^{N} \prod_{j=1}^{\ell} R_{Y}^{jn}\tilde{g}_{j} \circ\eta\bigg) \nonumber\\ &\quad - \bigg(\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k} T^{in}f_{i}\bigg) \bigg(\frac{1}{N} \sum_{n=1}^{N} \prod_{j=1}^{\ell} S^{jn}g_{j} \bigg) \bigg\|_{2} \le 2\varepsilon. \end{align} $$
Thus, combining (20) with (19) and (18) gives
 $$ \begin{align*} \lim_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k}\prod_{j=1}^{\ell} T^{in}f_{i} S^{jn}g_{j} - \bigg(\frac{1}{N} \sum_{n=1}^{N}\prod_{i=1}^{k} T^{in}f_{i} \bigg) \cdot \bigg( \frac{1}{N} \sum_{n=1}^{N} \prod_{j=1}^{\ell} S^{jn}g_{j}\bigg)} \bigg\|_{2} \leqslant 4\varepsilon. \end{align*} $$
$$ \begin{align*} \lim_{N\to\infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{k}\prod_{j=1}^{\ell} T^{in}f_{i} S^{jn}g_{j} - \bigg(\frac{1}{N} \sum_{n=1}^{N}\prod_{i=1}^{k} T^{in}f_{i} \bigg) \cdot \bigg( \frac{1}{N} \sum_{n=1}^{N} \prod_{j=1}^{\ell} S^{jn}g_{j}\bigg)} \bigg\|_{2} \leqslant 4\varepsilon. \end{align*} $$
As 
 $\varepsilon>0$
 was chosen arbitrarily, the proof of (13) is complete.
$\varepsilon>0$
 was chosen arbitrarily, the proof of (13) is complete.
 It remains to show that (19) is true. Define 
 $S_{X}= R_{X}\times R_{X}^{2}\times \cdots \times R_{X}^{k}$
 and for every
$S_{X}= R_{X}\times R_{X}^{2}\times \cdots \times R_{X}^{k}$
 and for every 
 $x\in G_{X}/\Gamma _{X}$
 consider
$x\in G_{X}/\Gamma _{X}$
 consider 
 $\Omega ({\textbf X},x)=\overline {\{S_{X}^{n}(x,x,\ldots , x): n\in \mathbb {Z}\}}$
. In addition, define
$\Omega ({\textbf X},x)=\overline {\{S_{X}^{n}(x,x,\ldots , x): n\in \mathbb {Z}\}}$
. In addition, define 
 $F=\tilde {f}_{1}\otimes \cdots \otimes \tilde {f}_{k}$
. Similarly, we define
$F=\tilde {f}_{1}\otimes \cdots \otimes \tilde {f}_{k}$
. Similarly, we define 
 $S_{Y}=R_{Y}\times R_{Y}^{2}\times \cdots \times R_{Y}^{\ell }$
,
$S_{Y}=R_{Y}\times R_{Y}^{2}\times \cdots \times R_{Y}^{\ell }$
, 
 $\Omega ({\textbf Y},y)=\overline {\{S_{Y}^{n}(y,y,\ldots , y): n\in \mathbb {Z}\}}$
 and
$\Omega ({\textbf Y},y)=\overline {\{S_{Y}^{n}(y,y,\ldots , y): n\in \mathbb {Z}\}}$
 and 
 $G=\tilde {g}_{1}\otimes \cdots \otimes \tilde {g}_{\ell }$
. Hence, (19) can be rewritten as
$G=\tilde {g}_{1}\otimes \cdots \otimes \tilde {g}_{\ell }$
. Hence, (19) can be rewritten as 
 $$ \begin{align} &\lim_{N\to\infty}\bigg|\frac1N\sum_{n=1}^{N} F(S_{X}^{n}(x,\ldots,x)) G(S_{Y}^{n}(y,\ldots,y)) \nonumber\\ &\quad-\bigg(\frac1N\sum_{n=1}^{N} F(S_{X}^{n}(x,\ldots,x))\bigg) \bigg(\frac1N\sum_{n=1}^{N}G(S_{Y}^{n}(y,\ldots,y)) \bigg)\bigg|=0. \end{align} $$
$$ \begin{align} &\lim_{N\to\infty}\bigg|\frac1N\sum_{n=1}^{N} F(S_{X}^{n}(x,\ldots,x)) G(S_{Y}^{n}(y,\ldots,y)) \nonumber\\ &\quad-\bigg(\frac1N\sum_{n=1}^{N} F(S_{X}^{n}(x,\ldots,x))\bigg) \bigg(\frac1N\sum_{n=1}^{N}G(S_{Y}^{n}(y,\ldots,y)) \bigg)\bigg|=0. \end{align} $$
 Note that 
 $(G_{X}/\Gamma _{X},R_{X},\mu _{G_{X}/\Gamma _{X}})$
 and
$(G_{X}/\Gamma _{X},R_{X},\mu _{G_{X}/\Gamma _{X}})$
 and 
 $(G_{Y}/\Gamma _{Y},R_{Y},\mu _{G_{Y}/\Gamma _{Y}})$
 are Kronecker disjoint, because
$(G_{Y}/\Gamma _{Y},R_{Y},\mu _{G_{Y}/\Gamma _{Y}})$
 are Kronecker disjoint, because 
 $(X,T,\mu _{X})$
 and
$(X,T,\mu _{X})$
 and 
 $(Y,S,\mu _{Y})$
 are Kronecker disjoint. In view of Corollary 6.4 it therefore follows that for
$(Y,S,\mu _{Y})$
 are Kronecker disjoint. In view of Corollary 6.4 it therefore follows that for 
 $\mu _{X}$
-almost every
$\mu _{X}$
-almost every 
 $x\in G_{X}/\Gamma _{X} $
 and for
$x\in G_{X}/\Gamma _{X} $
 and for 
 $\mu _{Y}$
-almost every
$\mu _{Y}$
-almost every 
 $y\in G_{Y}/\Gamma _{Y}$
 the two nilsystems
$y\in G_{Y}/\Gamma _{Y}$
 the two nilsystems 
 $(\Omega ({\textbf X},x), S_{X}, \mu _{\Omega ({\textbf X},x)})$
 and
$(\Omega ({\textbf X},x), S_{X}, \mu _{\Omega ({\textbf X},x)})$
 and 
 $(\Omega ({\textbf Y},y), S_{Y}, \mu _{\Omega ({\textbf Y},y)})$
 are Kronecker disjoint. We can now apply Lemma 6.6 to conclude that for almost every
$(\Omega ({\textbf Y},y), S_{Y}, \mu _{\Omega ({\textbf Y},y)})$
 are Kronecker disjoint. We can now apply Lemma 6.6 to conclude that for almost every 
 $x\in G_{X}/\Gamma _{X} $
 and almost every
$x\in G_{X}/\Gamma _{X} $
 and almost every 
 $y\in G_{Y}/\Gamma _{Y}$
, we have
$y\in G_{Y}/\Gamma _{Y}$
, we have 
 $$ \begin{align} &\lim_{N\to\infty}\frac1N\sum_{n=1}^{N} F(S_{X}^{n}(x,\ldots,x))G(S_{Y}^{n}(y,\ldots,y))\nonumber\\ &\quad= \int_{\Omega({\textbf X},x)} F\,\textit{d}\mu_{\Omega({\textbf X},x)} \int_{\Omega({\textbf Y},y)} G\,\textit{d}\mu_{\Omega({\textbf Y},y)}. \end{align} $$
$$ \begin{align} &\lim_{N\to\infty}\frac1N\sum_{n=1}^{N} F(S_{X}^{n}(x,\ldots,x))G(S_{Y}^{n}(y,\ldots,y))\nonumber\\ &\quad= \int_{\Omega({\textbf X},x)} F\,\textit{d}\mu_{\Omega({\textbf X},x)} \int_{\Omega({\textbf Y},y)} G\,\textit{d}\mu_{\Omega({\textbf Y},y)}. \end{align} $$
As 
 $(\Omega ({\textbf X},x), S_{X})$
 and
$(\Omega ({\textbf X},x), S_{X})$
 and 
 $(\Omega ({\textbf Y},y), S_{Y})$
 are uniquely ergodic, we have that
$(\Omega ({\textbf Y},y), S_{Y})$
 are uniquely ergodic, we have that 
 $$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} F(S_{X}^{n}(x,\ldots,x))&=\int_{\Omega({\textbf X},x)} F\,\textit{d}\mu_{\Omega({\textbf X},x)}, \end{align} $$
$$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} F(S_{X}^{n}(x,\ldots,x))&=\int_{\Omega({\textbf X},x)} F\,\textit{d}\mu_{\Omega({\textbf X},x)}, \end{align} $$
 $$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S_{Y}^{n}(x,\ldots,x))&=\int_{\Omega({\textbf Y},y)} G\,\textit{d}\mu_{\Omega({\textbf Y},y)}. \end{align} $$
$$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S_{Y}^{n}(x,\ldots,x))&=\int_{\Omega({\textbf Y},y)} G\,\textit{d}\mu_{\Omega({\textbf Y},y)}. \end{align} $$
Combining (22) with (23) and (24) yields (21), which, in turn, implies (19). This completes the proof of Theorem 1.7.
Proof of Theorem 1.8
 Let 
 $(Y,S)$
 be a topological
$(Y,S)$
 be a topological 
 $\mathbb {Z}$
 system, let
$\mathbb {Z}$
 system, let 
 $\mu _{Y}$
 be an S invariant Borel probability measure on Y, and let
$\mu _{Y}$
 be an S invariant Borel probability measure on Y, and let 
 $G\in \operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
. First, we apply Lemma 6.6 to find a set
$G\in \operatorname {\textrm {L}}^{1}(Y,\mu _{Y})$
. First, we apply Lemma 6.6 to find a set 
 $Y^{\prime }\subset Y$
 with
$Y^{\prime }\subset Y$
 with 
 $\mu _{Y}(Y^{\prime })=1$
 such that for any ergodic nilsystem system
$\mu _{Y}(Y^{\prime })=1$
 such that for any ergodic nilsystem system 
 $(G/\Gamma ,\mu _{G/\Gamma },R)$
 which is Kronecker disjoint from
$(G/\Gamma ,\mu _{G/\Gamma },R)$
 which is Kronecker disjoint from 
 $(Y,S,\mu _{Y})$
, any
$(Y,S,\mu _{Y})$
, any 
 $F\in \mathsf {C}(G/\Gamma )$
, any
$F\in \mathsf {C}(G/\Gamma )$
, any 
 $x\in G/\Gamma $
, and any
$x\in G/\Gamma $
, and any 
 $y\in Y^{\prime }$
 we have
$y\in Y^{\prime }$
 we have 
 $$ \begin{align*} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} F(x)G(y)=\int_{G/\Gamma} F\,\textit{d}\mu_{G/\Gamma}\int_{Y} G\,\textit{d}\mu_{Y}. \end{align*} $$
$$ \begin{align*} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} F(x)G(y)=\int_{G/\Gamma} F\,\textit{d}\mu_{G/\Gamma}\int_{Y} G\,\textit{d}\mu_{Y}. \end{align*} $$
Lemma 6.6 also guarantees that if 
 $(Y,S)$
 is uniquely ergodic and
$(Y,S)$
 is uniquely ergodic and 
 $G\in \mathsf {C}(Y)$
, then we can take
$G\in \mathsf {C}(Y)$
, then we can take 
 $Y^{\prime }=Y$
. Now let
$Y^{\prime }=Y$
. Now let 
 $(X,T,\mu _{X})$
 be a
$(X,T,\mu _{X})$
 be a 
 $\mathbb {Z}$
 system that is Kronecker disjoint from
$\mathbb {Z}$
 system that is Kronecker disjoint from 
 $(Y,S,\mu _{Y})$
. Fix
$(Y,S,\mu _{Y})$
. Fix 
 $k\in \mathbb {N}$
 and let
$k\in \mathbb {N}$
 and let 
 $f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
. Our goal is to show that for any
$f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
. Our goal is to show that for any 
 $y\in Y^{\prime }$
 we have
$y\in Y^{\prime }$
 we have 
 $$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S^{n}y)\prod_{i=1}^{k} T^{in}f_{i}=\bigg(\int_{Y} G\,\textit{d}\mu_{Y}\bigg)\cdot\bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\prod_{i=1}^{k} T^{in}f_{i}\bigg) \end{align} $$
$$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S^{n}y)\prod_{i=1}^{k} T^{in}f_{i}=\bigg(\int_{Y} G\,\textit{d}\mu_{Y}\bigg)\cdot\bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\prod_{i=1}^{k} T^{in}f_{i}\bigg) \end{align} $$
in 
 $\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
.
$\operatorname {\textrm {L}}^{2}(X,\mu _{X})$
.
 Note that (25) is trivially true if G is a constant function. Hence, by replacing G with 
 $G-\int _{Y} G\,\textit {d}\mu _{Y}$
 if necessary, we can assume without loss of generality that
$G-\int _{Y} G\,\textit {d}\mu _{Y}$
 if necessary, we can assume without loss of generality that 
 $\int _{Y} G\,\textit {d}\mu _{Y}=0$
. In this case, (25) reduces to
$\int _{Y} G\,\textit {d}\mu _{Y}=0$
. In this case, (25) reduces to 
 $$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S^{n}y)\prod_{i=1}^{k} T^{in}f_{i}=0. \end{align} $$
$$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S^{n}y)\prod_{i=1}^{k} T^{in}f_{i}=0. \end{align} $$
We can also assume without loss of generality that 
 $\| {G} \|_{\infty } \le 1$
 and that
$\| {G} \|_{\infty } \le 1$
 and that 
 $\| {f_{i}} \|_{\infty } \le 1$
 for all
$\| {f_{i}} \|_{\infty } \le 1$
 for all 
 $i=1,\ldots ,k$
.
$i=1,\ldots ,k$
.
 Fix 
 $\varepsilon> 0$
. We apply Theorem 6.5 to find a k-step nilsystem
$\varepsilon> 0$
. We apply Theorem 6.5 to find a k-step nilsystem 
 $(G/\Gamma ,R,\mu _{G/\Gamma })$
, which is a factor of
$(G/\Gamma ,R,\mu _{G/\Gamma })$
, which is a factor of 
 $(X,T,\mu _{X})$
, and a set of continuous functions
$(X,T,\mu _{X})$
, and a set of continuous functions 
 $\tilde {f}_{1},\ldots ,\tilde {f}_{k}\in \mathsf {C}(G/\Gamma )$
 such that
$\tilde {f}_{1},\ldots ,\tilde {f}_{k}\in \mathsf {C}(G/\Gamma )$
 such that 
 $$ \begin{align*} \limsup_{N \to \infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} G(S^{n}y) \prod_{i=1}^{k} T^{in}f_{i} - \frac{1}{N} \sum_{n=1}^{N} G(S^{n}y) \prod_{i=1}^{k} (R^{in}\tilde{f}_{i})\circ \pi } \bigg\|_{2} \le \varepsilon, \end{align*} $$
$$ \begin{align*} \limsup_{N \to \infty} \bigg\| {\frac{1}{N} \sum_{n=1}^{N} G(S^{n}y) \prod_{i=1}^{k} T^{in}f_{i} - \frac{1}{N} \sum_{n=1}^{N} G(S^{n}y) \prod_{i=1}^{k} (R^{in}\tilde{f}_{i})\circ \pi } \bigg\|_{2} \le \varepsilon, \end{align*} $$
where 
 $\pi :X\to G/\Gamma $
 denotes the factor map from
$\pi :X\to G/\Gamma $
 denotes the factor map from 
 $(X,T,\mu _{X})$
 onto
$(X,T,\mu _{X})$
 onto 
 $(G/\Gamma ,R,\mu _{G/\Gamma })$
. Therefore, to show (26) it suffices to show that for almost every
$(G/\Gamma ,R,\mu _{G/\Gamma })$
. Therefore, to show (26) it suffices to show that for almost every 
 $x\in G/\Gamma $
 one has
$x\in G/\Gamma $
 one has 
 $$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S^{n}y) \prod_{i=1}^{k} R^{in}\tilde{f}_{i}(x)=0. \end{align} $$
$$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S^{n}y) \prod_{i=1}^{k} R^{in}\tilde{f}_{i}(x)=0. \end{align} $$
Define 
 $S= R\times R^{2}\times \cdots \times R^{k}$
,
$S= R\times R^{2}\times \cdots \times R^{k}$
, 
 $Y_{x}=\overline {\{S^{n}(x,x,\ldots , x): n\in \mathbb {Z}\}}$
 and also
$Y_{x}=\overline {\{S^{n}(x,x,\ldots , x): n\in \mathbb {Z}\}}$
 and also 
 $F=\tilde {f}_{1}\otimes \cdots \otimes \tilde {f}_{k}$
. Clearly, (27) is equivalent to
$F=\tilde {f}_{1}\otimes \cdots \otimes \tilde {f}_{k}$
. Clearly, (27) is equivalent to 
 $$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S^{n}y) S^{n} F(x,\ldots, x)=0. \end{align} $$
$$ \begin{align} \lim_{N\to\infty}\frac1N\sum_{n=1}^{N} G(S^{n}y) S^{n} F(x,\ldots, x)=0. \end{align} $$
It follows from Corollary 6.4 that for almost every 
 $x\in G_{X}/\Gamma _{X} $
, the two systems
$x\in G_{X}/\Gamma _{X} $
, the two systems 
 $(Y_{x}, S, \mu _{Y_{x}}, S)$
 and
$(Y_{x}, S, \mu _{Y_{x}}, S)$
 and 
 $(Y,S,\mu _{Y})$
 are Kronecker disjoint. Hence, Lemma 6.6 implies that for almost every
$(Y,S,\mu _{Y})$
 are Kronecker disjoint. Hence, Lemma 6.6 implies that for almost every 
 $x\in G_{X}/\Gamma _{X} $
, (28) holds. This finishes the proof.
$x\in G_{X}/\Gamma _{X} $
, (28) holds. This finishes the proof.
6.3 An example of multiple recurrence
In this section, we obtain an application of Theorem 1.4 to multiple recurrence.
Definition 6.7. Let q be an integer 
 $\geqslant 2$
. A function
$\geqslant 2$
. A function 
 $w:\mathbb {N}\to \mathbb {C}$
 is called strongly q-multiplicative if
$w:\mathbb {N}\to \mathbb {C}$
 is called strongly q-multiplicative if 
 $w(n)=w(a_{0})\cdots w(a_{k})$
, where
$w(n)=w(a_{0})\cdots w(a_{k})$
, where 
 $$ \begin{align*}n=\sum_{i=0}^{k}a_{i}q^{i},\quad 0\leqslant a_{i}\leqslant q-1\end{align*} $$
$$ \begin{align*}n=\sum_{i=0}^{k}a_{i}q^{i},\quad 0\leqslant a_{i}\leqslant q-1\end{align*} $$
is the base q expansion of n. For convenience, we assume 
 $w(0)=1$
.
$w(0)=1$
.
 The 
 $\{-1,1\}$
-valued Thue–Morse sequence is an example of a strongly
$\{-1,1\}$
-valued Thue–Morse sequence is an example of a strongly 
 $2$
-multiplicative sequence, obtained by letting
$2$
-multiplicative sequence, obtained by letting 
 $w(0)=1$
 and
$w(0)=1$
 and 
 $w(1)=-1$
.
$w(1)=-1$
.
 When a strongly q-multiplicative function takes only finitely many values, say 
 $w:\mathbb {N}\to {\mathcal A}\subset \mathbb {C}$
, we can identify the function with a point (which, by a slight abuse of notation, we also denote by w) in the symbolic space
$w:\mathbb {N}\to {\mathcal A}\subset \mathbb {C}$
, we can identify the function with a point (which, by a slight abuse of notation, we also denote by w) in the symbolic space 
 ${\mathcal A}^{\mathbb {N}}$
. Let
${\mathcal A}^{\mathbb {N}}$
. Let 
 $S:{\mathcal A}^{\mathbb {N}}\to {\mathcal A}^{\mathbb {N}}$
 be the usual shift map.
$S:{\mathcal A}^{\mathbb {N}}\to {\mathcal A}^{\mathbb {N}}$
 be the usual shift map.
Proposition 6.8. [Reference LiardetLia87]
 Let 
 $w:\mathbb {N}\to {\mathcal A}\subset \mathbb {C}$
 be a strongly q-multiplicative function. Then the orbit closure
$w:\mathbb {N}\to {\mathcal A}\subset \mathbb {C}$
 be a strongly q-multiplicative function. Then the orbit closure 
 ${\textbf Y}=\Big (\overline {\{S^{n}w:n\in \mathbb {N}\}},S\Big )$
 is a uniquely ergodic system. Moreover, the discrete spectrum
${\textbf Y}=\Big (\overline {\{S^{n}w:n\in \mathbb {N}\}},S\Big )$
 is a uniquely ergodic system. Moreover, the discrete spectrum 
 $\mathsf {Eig}({\textbf Y})$
 with respect to the unique invariant measure
$\mathsf {Eig}({\textbf Y})$
 with respect to the unique invariant measure 
 $\mu _{Y}$
 is contained in the set
$\mu _{Y}$
 is contained in the set 
 $\{e(a/q^{n}):a,n\in \mathbb {N}\}$
.
$\{e(a/q^{n}):a,n\in \mathbb {N}\}$
.
Putting this proposition together with Theorem 1.8, we obtain the following corollary.
Corollary 6.9. Let 
 $w:\mathbb {N}\to \mathbb {C}$
 be a strongly q-multiplicative function taking only finite many values and let
$w:\mathbb {N}\to \mathbb {C}$
 be a strongly q-multiplicative function taking only finite many values and let 
 ${\textbf X}=(X,T,\mu _{X})$
 be a system with
${\textbf X}=(X,T,\mu _{X})$
 be a system with 
 $\mathsf {Eig}({\textbf X})\cap \{e(a/q^{n}):a,n\in \mathbb {N}\}=\{1\}$
. For every
$\mathsf {Eig}({\textbf X})\cap \{e(a/q^{n}):a,n\in \mathbb {N}\}=\{1\}$
. For every 
 $f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X)$
, we have
$f_{1},\ldots ,f_{k}\in \operatorname {\textrm {L}}^{\infty }(X)$
, we have 
 $$ \begin{align*}\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}w(n)\prod_{i=1}^{k}T^{in}f_{i}= \bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}w(n)\bigg)\cdot\bigg( \lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\prod_{i=1}^{k}T^{in}f_{i}\bigg).\end{align*} $$
$$ \begin{align*}\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}w(n)\prod_{i=1}^{k}T^{in}f_{i}= \bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}w(n)\bigg)\cdot\bigg( \lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\prod_{i=1}^{k}T^{in}f_{i}\bigg).\end{align*} $$
We can use this to derive a multiple recurrence result for level sets of strongly q-multiplicative functions.
Theorem 6.10. Let 
 $m,q\in \mathbb {N}$
, let
$m,q\in \mathbb {N}$
, let 
 ${\mathcal A}\subset \mathbb {C}$
 be the set of mth roots of
${\mathcal A}\subset \mathbb {C}$
 be the set of mth roots of 
 $1$
, and let
$1$
, and let 
 $w:\mathbb {N}\to {\mathcal A}$
 be a strongly q-multiplicative function. For every
$w:\mathbb {N}\to {\mathcal A}$
 be a strongly q-multiplicative function. For every 
 $z\in {\mathcal A}$
, the level set
$z\in {\mathcal A}$
, the level set 
 $R=\{n\in \mathbb {N}:w(n)=z\}$
 either has
$R=\{n\in \mathbb {N}:w(n)=z\}$
 either has 
 $0$
 density or satisfies the following multiple recurrence property: let
$0$
 density or satisfies the following multiple recurrence property: let 
 ${\textbf X}=(X,T,\mu _{X})$
 be a
${\textbf X}=(X,T,\mu _{X})$
 be a 
 $\mathbb {Z}$
 system and assume that
$\mathbb {Z}$
 system and assume that 
 $\mathsf {Eig}({\textbf X})$
 contains no non-trivial qth root of
$\mathsf {Eig}({\textbf X})$
 contains no non-trivial qth root of 
 $1$
. Then, for every
$1$
. Then, for every 
 $A\subset X$
 with
$A\subset X$
 with 
 $\mu (A)>0$
 and every
$\mu (A)>0$
 and every 
 $k\in \mathbb {N}$
, there exists
$k\in \mathbb {N}$
, there exists 
 $n\in R$
 such that
$n\in R$
 such that 
 $$ \begin{align*}\mu(A\cap T^{-n}A\cap T^{-2n}A\cap\cdots\cap T^{-kn}A)>0.\end{align*} $$
$$ \begin{align*}\mu(A\cap T^{-n}A\cap T^{-2n}A\cap\cdots\cap T^{-kn}A)>0.\end{align*} $$
Proof. The indicator function 
 $1_{R}(n)$
 of
$1_{R}(n)$
 of 
 $R=\{n\in \mathbb {N}:w(n)=z\}$
 can be expressed as
$R=\{n\in \mathbb {N}:w(n)=z\}$
 can be expressed as 
 $\delta _{z}\circ w(n)$
, where
$\delta _{z}\circ w(n)$
, where 
 $\delta _{z}:{\mathcal A}\to \mathbb {C}$
 is the function
$\delta _{z}:{\mathcal A}\to \mathbb {C}$
 is the function 
 $\delta _{z}(u)=1$
 if
$\delta _{z}(u)=1$
 if 
 $u=z$
 and
$u=z$
 and 
 $\delta _{z}(u)=0$
 otherwise. The space of functions from
$\delta _{z}(u)=0$
 otherwise. The space of functions from 
 $\mathcal A$
 to
$\mathcal A$
 to 
 $\mathbb {C}$
 is a vector space spanned by the functions
$\mathbb {C}$
 is a vector space spanned by the functions 
 $u\mapsto u^{j}$
,
$u\mapsto u^{j}$
, 
 $j=0,\ldots ,m-1$
, and, in particular,
$j=0,\ldots ,m-1$
, and, in particular, 
 $\delta _{z}$
 is a linear combination of the functions
$\delta _{z}$
 is a linear combination of the functions 
 $u^{j}$
. It follows that
$u^{j}$
. It follows that 
 $1_{R}(n)$
 is a linear combination of the functions
$1_{R}(n)$
 is a linear combination of the functions 
 $w^{j}(n)$
 for
$w^{j}(n)$
 for 
 $j=0,\ldots ,m-1$
.
$j=0,\ldots ,m-1$
.
 Observe that each power 
 $w^{j}$
 of w is a strongly q-multiplicative function and, thus, Corollary 6.9 applied to the functions
$w^{j}$
 of w is a strongly q-multiplicative function and, thus, Corollary 6.9 applied to the functions 
 $f_{i}=1_{A}$
 implies that
$f_{i}=1_{A}$
 implies that 
 $$ \begin{align*}&\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}w^{j}(n)\mu\bigg(\bigcap_{i=0}^{k}T^{-in}A\bigg)\\ &\quad=\bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}w^{j}(n)\bigg)\cdot\bigg( \lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\mu\bigg(\bigcap_{i=0}^{k}T^{-in}A\bigg) \bigg).\end{align*} $$
$$ \begin{align*}&\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}w^{j}(n)\mu\bigg(\bigcap_{i=0}^{k}T^{-in}A\bigg)\\ &\quad=\bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}w^{j}(n)\bigg)\cdot\bigg( \lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\mu\bigg(\bigcap_{i=0}^{k}T^{-in}A\bigg) \bigg).\end{align*} $$
By linearity, this implies that
 $$ \begin{align*}&\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}1_{R}(n)\mu\bigg(\bigcap_{i=0}^{k}T^{-in}A\bigg) \\ &\quad=\bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}1_{R}(n)\bigg)\cdot\bigg( \lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\mu\bigg(\bigcap_{i=0}^{k}T^{-in}A\bigg) \bigg).\end{align*} $$
$$ \begin{align*}&\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}1_{R}(n)\mu\bigg(\bigcap_{i=0}^{k}T^{-in}A\bigg) \\ &\quad=\bigg(\lim_{N\to\infty}\frac1N\sum_{n=1}^{N}1_{R}(n)\bigg)\cdot\bigg( \lim_{N\to\infty}\frac1N\sum_{n=1}^{N}\mu\bigg(\bigcap_{i=0}^{k}T^{-in}A\bigg) \bigg).\end{align*} $$
If R has positive density, then the first factor on the right-hand side of the previous equation is positive. The fact that the second factor is also positive is the content of Furstenberg’s multiple recurrence theorem [Reference FurstenbergFur77, Theorem 11.13]. Therefore, the left-hand side has to be positive as well and this implies the desired conclusion.
7 Some open questions
 It is tempting to define 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Z}$
 to be quasi-disjoint if the natural map from
$\textbf {Z}$
 to be quasi-disjoint if the natural map from 
 $\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Z})$
 to
$\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Z})$
 to 
 $\mathcal {J}_{\mathsf {e}}(\mathsf {K} \textbf {X},\mathsf {K} \textbf {Z})$
 is a bijection. However, [Reference BergBer71, Example 2] shows this is, in fact, a strictly stronger notion than quasi-disjointness, which amounts to requiring that the above map be a bijection almost everywhere with respect to the measure on
$\mathcal {J}_{\mathsf {e}}(\mathsf {K} \textbf {X},\mathsf {K} \textbf {Z})$
 is a bijection. However, [Reference BergBer71, Example 2] shows this is, in fact, a strictly stronger notion than quasi-disjointness, which amounts to requiring that the above map be a bijection almost everywhere with respect to the measure on 
 $\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Z})$
 given by the ergodic decomposition of the product measure. In light of this we ask the following question.
$\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Z})$
 given by the ergodic decomposition of the product measure. In light of this we ask the following question.
Question 7.1. Is it true that a system 
 $\textbf {X}$
 is quasi-disjoint from
$\textbf {X}$
 is quasi-disjoint from 
 $\textbf {Y}$
 if and only if the support of the measure appearing in the ergodic decomposition of
$\textbf {Y}$
 if and only if the support of the measure appearing in the ergodic decomposition of 
 $\mu \times \nu $
 equals
$\mu \times \nu $
 equals 
 $\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Y})$
.
$\mathcal {J}_{\mathsf {e}}(\textbf {X},\textbf {Y})$
.
The notion of disjointness can be described in terms of factor maps. We ask whether a similar characterization of quasi-disjointness is possible.
Question 7.2. Is is true that 
 ${\textbf X}$
 and
${\textbf X}$
 and 
 ${\textbf Y}$
 are quasi-disjoint if and only if any system
${\textbf Y}$
 are quasi-disjoint if and only if any system 
 ${\textbf Z}$
 which has
${\textbf Z}$
 which has 
 ${\textbf X}$
,
${\textbf X}$
, 
 ${\textbf Y}$
 and
${\textbf Y}$
 and 
 $\mathsf {K}{\textbf X}\times \mathsf {K}{\textbf Y}$
 as factors, also has
$\mathsf {K}{\textbf X}\times \mathsf {K}{\textbf Y}$
 as factors, also has 
 ${\textbf X}\times {\textbf Y}$
 as a factor?
${\textbf X}\times {\textbf Y}$
 as a factor?
We are also interested in the following potential extensions of our theorem and its applications.
Question 7.3. Is a system 
 ${\textbf X}$
 quasi-disjoint from any ergodic system if and only if it is quasi-disjoint from itself?
${\textbf X}$
 quasi-disjoint from any ergodic system if and only if it is quasi-disjoint from itself?
We expect the following question, which seeks a generalization of Theorem 1.7 to countable, amenable groups, to be quite difficult.
Question 7.4. Fix a countable, amenable group G with a Følner sequence 
 $\Phi $
. Let
$\Phi $
. Let 
 $\textbf {X} = (X,T,\mu _{X})$
 be an ergodic
$\textbf {X} = (X,T,\mu _{X})$
 be an ergodic 
 $G^{k}$
 system and let
$G^{k}$
 system and let 
 $\textbf {Y} = (Y,S,\mu _{Y})$
 be an ergodic
$\textbf {Y} = (Y,S,\mu _{Y})$
 be an ergodic 
 $G^{\ell }$
 system. Given
$G^{\ell }$
 system. Given 
 $1 \le i \le j \le k$
, write
$1 \le i \le j \le k$
, write 
 $T_{[i,j]}$
 for the G action induced by the inclusion of G in
$T_{[i,j]}$
 for the G action induced by the inclusion of G in 
 $G^{k}$
 diagonally on the coordinates
$G^{k}$
 diagonally on the coordinates 
 $i,\ldots ,j$
 and similarly for
$i,\ldots ,j$
 and similarly for 
 $S_{[i,j]}$
 with
$S_{[i,j]}$
 with 
 $1 \le i \le j \le \ell $
. Under what conditions on
$1 \le i \le j \le \ell $
. Under what conditions on 
 $\textbf {X}$
 and
$\textbf {X}$
 and 
 $\textbf {Y}$
 do we have
$\textbf {Y}$
 do we have 
 $$ \begin{align*} &\lim_{N \to \infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} \prod_{i=1}^{k} T_{[1,i]}^{g} f_{i} \prod_{j=1}^{\ell} S_{[1,j]}^{g} h_{j}\\ &\quad= \bigg( \lim_{N \to \infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} \prod_{i=1}^{k} T_{[1,i]}^{g} f_{i}\bigg) \bigg( \lim_{N \to \infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} \prod_{j=1}^{\ell} S_{[1,j]}^{g} h_{j} \bigg) \end{align*} $$
$$ \begin{align*} &\lim_{N \to \infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} \prod_{i=1}^{k} T_{[1,i]}^{g} f_{i} \prod_{j=1}^{\ell} S_{[1,j]}^{g} h_{j}\\ &\quad= \bigg( \lim_{N \to \infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} \prod_{i=1}^{k} T_{[1,i]}^{g} f_{i}\bigg) \bigg( \lim_{N \to \infty} \frac{1}{|\Phi_{N}|} \sum_{g \in \Phi_{N}} \prod_{j=1}^{\ell} S_{[1,j]}^{g} h_{j} \bigg) \end{align*} $$
in 
 $\operatorname {\textrm {L}}^{2}(X \times Y, \mu _{X} \otimes \mu _{Y})$
 for all
$\operatorname {\textrm {L}}^{2}(X \times Y, \mu _{X} \otimes \mu _{Y})$
 for all 
 $f_{1},\ldots ,f_{k}$
 in
$f_{1},\ldots ,f_{k}$
 in 
 $\operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
 and all
$\operatorname {\textrm {L}}^{\infty }(X,\mu _{X})$
 and all 
 $h_{1},\ldots ,h_{\ell }$
 in
$h_{1},\ldots ,h_{\ell }$
 in 
 $\operatorname {\textrm {L}}^{\infty }(Y,\mu _{Y})$
?
$\operatorname {\textrm {L}}^{\infty }(Y,\mu _{Y})$
?
Acknowledgements
We would like to thank Vitaly Bergelson for helpful comments on an early version of the paper, Eli Glasner for fruitful discussions on disjointness of measurably distal systems, and Pavel Zorin-Kranich for useful remarks on Wiener–Wintner results. We would like to thank the referee for suggesting improvements to the readability of the paper. We would also like to thank the referee of an earlier version of this paper for pointing out errors in the statements of Theorems 6.1 and 6.2 in light of work by Liang and Qiu, which have since been corrected. The first author was supported by the NSF grant DMS-1700147. The third author gratefully acknowledges the support of the NSF via grants DMS-1246989 and DMS-1703597.
 
  
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
