1. Introduction
 We consider the classical problem for irrotational steady waves on the surface of an ideal fluid of finite depth. The fluid occupies an infinite region with a rigid flat bottom and is separated from the air by an unknown free surface. For plane steady waves there are three important physical quantities:  $Q$,
$Q$,  $m$ and
$m$ and  $S$. Here
$S$. Here  $Q$ is the total head,
$Q$ is the total head,  $m$ is the mass flux and
$m$ is the mass flux and  $S$ is the flow force constant. The importance of these parameters was justified by Benjamin & Lighthill (Reference Benjamin and Lighthill1954) in their study of long waves. It was shown that a combination of
$S$ is the flow force constant. The importance of these parameters was justified by Benjamin & Lighthill (Reference Benjamin and Lighthill1954) in their study of long waves. It was shown that a combination of  $Q$,
$Q$,  $m$ and
$m$ and  $S$ determines the cnoidal wave-train uniquely. Furthermore, non-dimensional quantities
$S$ determines the cnoidal wave-train uniquely. Furthermore, non-dimensional quantities  $Q/Q_c$ and
$Q/Q_c$ and  $S/S_c$ of an arbitrary cnoidal wave realise points in a two-dimensional cuspidal region as in figure 1 whose boundary corresponds to laminar flows. Thus, Benjamin and Lighthill conjectured the latter property to hold among all exact gravity waves, irrespective of amplitude or wavelength.
$S/S_c$ of an arbitrary cnoidal wave realise points in a two-dimensional cuspidal region as in figure 1 whose boundary corresponds to laminar flows. Thus, Benjamin and Lighthill conjectured the latter property to hold among all exact gravity waves, irrespective of amplitude or wavelength.

Figure 1. Parameter region for steady water waves.
 In order to reformulate this conjecture in a mathematically convenient way we introduce non-dimensional variables proposed by Keady & Norbury (Reference Keady and Norbury1978), where the mass flux and the gravitational constant are scaled to unity. In new variables, constants  $Q$ and
$Q$ and  $S$ are scaled
$S$ are scaled  $r$ and
$r$ and  ${\mathcal {S}}$ respectively. According to conjecture, every steady flow, irrespective of amplitude or wavelength, will realise a point within a certain region in the
${\mathcal {S}}$ respectively. According to conjecture, every steady flow, irrespective of amplitude or wavelength, will realise a point within a certain region in the  $(r,{\mathcal {S}})$-plane. The latter planar region is determined by all points
$(r,{\mathcal {S}})$-plane. The latter planar region is determined by all points  $(r,{\mathcal {S}})$ for which
$(r,{\mathcal {S}})$ for which
 \begin{equation} {\mathcal{S}}_-(r) \leq {\mathcal{S}} \leq {\mathcal{S}}_+(r), \end{equation}
\begin{equation} {\mathcal{S}}_-(r) \leq {\mathcal{S}} \leq {\mathcal{S}}_+(r), \end{equation}
where  ${\mathcal {S}}_+(r)$ and
${\mathcal {S}}_+(r)$ and  ${\mathcal {S}}_-(r)$ are the flow force constants corresponding to the subcritical and supercritical laminar flows respectively. The boundary of this region is a cusped curve representing all uniform streams (see figure 1). Thus, the conjecture can be stated as follows.
${\mathcal {S}}_-(r)$ are the flow force constants corresponding to the subcritical and supercritical laminar flows respectively. The boundary of this region is a cusped curve representing all uniform streams (see figure 1). Thus, the conjecture can be stated as follows.
 Benjamin and Lighthill Conjecture. For any two-dimensional steady wave the corresponding constants  $r$ and
$r$ and  ${\mathcal {S}}$ are subject to the inequalities (1.1).
${\mathcal {S}}$ are subject to the inequalities (1.1).
We emphasise that the conjecture concerns all steady waves, not necessarily periodic or solitary waves. For instance, sub-harmonic bifurcations were found numerically by Vanden-Broeck (Reference Vanden-Broeck1983). Such waves can have several crests in every minimal period.
 The conjecture restricted to Stokes waves (periodic waves with exactly one crest and one trough in every minimal period) was verified by Benjamin (Reference Benjamin1995), although the left inequality in (1.1) was obtained earlier by Keady & Norbury (Reference Keady and Norbury1975). Later, Kozlov & Kuznetsov (Reference Kozlov and Kuznetsov2009, Reference Kozlov and Kuznetsov2011) proved (1.1) for arbitrary solutions under weak regularity assumptions, provided the Bernoulli constant  $r$ is close to its critical value
$r$ is close to its critical value  $r_c = 3/2$. It was extended to the rotational setting in Kozlov, Kuznetsov & Lokharu (Reference Kozlov, Kuznetsov and Lokharu2017), again for
$r_c = 3/2$. It was extended to the rotational setting in Kozlov, Kuznetsov & Lokharu (Reference Kozlov, Kuznetsov and Lokharu2017), again for  $r \approx r_c$, which guarantees that solutions are of small amplitude. The left inequality in (1.1) for periodic waves with a favourable vorticity was obtained by Keady & Norbury (Reference Keady and Norbury1978). Finally, a complete answer was given in Lokharu (Reference Lokharu2020), where the conjecture was proved in its general form for rotational waves with an arbitrary vorticity distribution.
$r \approx r_c$, which guarantees that solutions are of small amplitude. The left inequality in (1.1) for periodic waves with a favourable vorticity was obtained by Keady & Norbury (Reference Keady and Norbury1978). Finally, a complete answer was given in Lokharu (Reference Lokharu2020), where the conjecture was proved in its general form for rotational waves with an arbitrary vorticity distribution.
 Another important question about the conjecture is whether all points of the region between curves  ${\mathcal {S}}_-$ and
${\mathcal {S}}_-$ and  ${\mathcal {S}}_+$ in figure 1 correspond to some steady motion. It was already noted by Benjamin & Lighthill (Reference Benjamin and Lighthill1954) that the answer is negative and that the diagram should be complemented by the third barrier corresponding to waves of extreme form. This was partly approved by Cokelet (Reference Cokelet1977) who numerically found the barrier, which, however, consists of waves that are only close to stagnation but not extreme.
${\mathcal {S}}_+$ in figure 1 correspond to some steady motion. It was already noted by Benjamin & Lighthill (Reference Benjamin and Lighthill1954) that the answer is negative and that the diagram should be complemented by the third barrier corresponding to waves of extreme form. This was partly approved by Cokelet (Reference Cokelet1977) who numerically found the barrier, which, however, consists of waves that are only close to stagnation but not extreme.
 In the present paper we prove a new and surprisingly simple inequality  ${\mathcal {S}} > r^2/2$ that is valid for all steady waves, irrespective of geometry and amplitude. The new inequality corresponds to a barrier defined by the curve
${\mathcal {S}} > r^2/2$ that is valid for all steady waves, irrespective of geometry and amplitude. The new inequality corresponds to a barrier defined by the curve  ${\mathcal {S}} = \frac 12 r^2$ that intersects the lower boundary at a point with
${\mathcal {S}} = \frac 12 r^2$ that intersects the lower boundary at a point with  $F=2$ (recovering the well-known bound for the Froude number) and is asymptotically close to the upper boundary
$F=2$ (recovering the well-known bound for the Froude number) and is asymptotically close to the upper boundary  ${\mathcal {S}} = {\mathcal {S}}_+(r)$. This shows that all steady waves, including waves of greatest height, correspond to a significantly smaller part (grey region in figure 1) of the region determined by the inequalities (1.1).
${\mathcal {S}} = {\mathcal {S}}_+(r)$. This shows that all steady waves, including waves of greatest height, correspond to a significantly smaller part (grey region in figure 1) of the region determined by the inequalities (1.1).
 In addition to the significant improvement in the Benjamin and Lighthill conjecture, the new result can be used to study solutions of the problem for large  $r$. More precisely, we prove that all steady waves with large Bernoulli constants are necessarily small in their amplitude, which is of order
$r$. More precisely, we prove that all steady waves with large Bernoulli constants are necessarily small in their amplitude, which is of order  $O(r^{-2})$. Thus, every extreme wave with the mass flux
$O(r^{-2})$. Thus, every extreme wave with the mass flux  $m=1$ in water of depth
$m=1$ in water of depth  $d$ has the amplitude of order
$d$ has the amplitude of order  $1/d^2$. In the physical variables the result states that the amplitude
$1/d^2$. In the physical variables the result states that the amplitude  $a$ of an arbitrary Stokes wave is bounded by
$a$ of an arbitrary Stokes wave is bounded by  $C m^2 g/Q^2$, where
$C m^2 g/Q^2$, where  $m$ is the relative mass flux,
$m$ is the relative mass flux,  $g$ is the gravitational constant,
$g$ is the gravitational constant,  $Q$ is the total head and
$Q$ is the total head and  $C$ is an explicit absolute constant.
$C$ is an explicit absolute constant.
2. Statement of the problem
We consider the classical model for two-dimensional steady waves on water of finite depth. We neglect the effects of surface tension and consider an ideal fluid of constant (unit) density. In the corresponding moving reference frame stationary Euler equations are given by
 $$\begin{gather} (u-c)u_X + vu_Y ={-}P_X, \end{gather}$$
$$\begin{gather} (u-c)u_X + vu_Y ={-}P_X, \end{gather}$$ $$\begin{gather}(u-c)v_X + vv_Y ={-}P_Y-g, \end{gather}$$
$$\begin{gather}(u-c)v_X + vv_Y ={-}P_Y-g, \end{gather}$$ $$\begin{gather}u_X + v_Y = 0, \end{gather}$$
$$\begin{gather}u_X + v_Y = 0, \end{gather}$$ $$\begin{gather}u_Y - v_X = 0, \end{gather}$$
$$\begin{gather}u_Y - v_X = 0, \end{gather}$$
which holds true in a two-dimensional fluid domain  $D_{\eta ^\star }$, defined by the inequality
$D_{\eta ^\star }$, defined by the inequality
 \begin{equation} 0 < Y < \eta^\star(X). \end{equation}
\begin{equation} 0 < Y < \eta^\star(X). \end{equation}
Here  $(u,v)$ are components of the velocity field,
$(u,v)$ are components of the velocity field,  $Y = \eta ^\star (X)$ is the surface profile,
$Y = \eta ^\star (X)$ is the surface profile,  $c$ is the wave speed,
$c$ is the wave speed,  $P$ is the pressure and
$P$ is the pressure and  $g$ is the gravitational constant. The corresponding boundary conditions are
$g$ is the gravitational constant. The corresponding boundary conditions are
 $$\begin{gather} v = 0 \text{ on } Y=0, \end{gather}$$
$$\begin{gather} v = 0 \text{ on } Y=0, \end{gather}$$ $$\begin{gather}v = (u-c)\eta^\star_X \text{ on } Y=\eta^\star, \end{gather}$$
$$\begin{gather}v = (u-c)\eta^\star_X \text{ on } Y=\eta^\star, \end{gather}$$ $$\begin{gather}P = P_{{atm}} \text{ on } Y=\eta^\star. \end{gather}$$
$$\begin{gather}P = P_{{atm}} \text{ on } Y=\eta^\star. \end{gather}$$ We reformulate these equations in terms of a stream function  $\psi ^\star$, defined implicitly by the relations
$\psi ^\star$, defined implicitly by the relations
 \begin{equation} \psi^\star_y = c-u, \quad \psi^\star_x = v. \end{equation}
\begin{equation} \psi^\star_y = c-u, \quad \psi^\star_x = v. \end{equation}
This determines  $\psi ^\star$ up to an additive constant, while relations (2.1f),(2.1f) require
$\psi ^\star$ up to an additive constant, while relations (2.1f),(2.1f) require  $\psi ^\star$ to be constant along the boundaries. Thus, by subtracting a suitable constant we assume that
$\psi ^\star$ to be constant along the boundaries. Thus, by subtracting a suitable constant we assume that
 \begin{equation} \psi^\star= m^\star,
\quad Y = \eta^\star; \quad \psi^\star= 0, \quad Y =
0.
\end{equation}
\begin{equation} \psi^\star= m^\star,
\quad Y = \eta^\star; \quad \psi^\star= 0, \quad Y =
0.
\end{equation}
Here  $m^\star$ is the mass flux, defined by
$m^\star$ is the mass flux, defined by
 \begin{equation} m^\star= \int_0^{\eta^\star} (c-u) \, \textrm{d} Y. \end{equation}
\begin{equation} m^\star= \int_0^{\eta^\star} (c-u) \, \textrm{d} Y. \end{equation}The corresponding problem for the stream function is
 $$\begin{gather} {\rm \Delta} \psi^\star=0 \text{ in } D_{\eta^\star}, \end{gather}$$
$$\begin{gather} {\rm \Delta} \psi^\star=0 \text{ in } D_{\eta^\star}, \end{gather}$$ $$\begin{gather}\tfrac 12\lvert \boldsymbol{\nabla}\psi^\star \rvert^2 + g Y = Q^\star \text{ on }Y=\eta^\star, \end{gather}$$
$$\begin{gather}\tfrac 12\lvert \boldsymbol{\nabla}\psi^\star \rvert^2 + g Y = Q^\star \text{ on }Y=\eta^\star, \end{gather}$$ $$\begin{gather}\psi^\star= m^\star \text{ on }Y=\eta^\star, \end{gather}$$
$$\begin{gather}\psi^\star= m^\star \text{ on }Y=\eta^\star, \end{gather}$$ $$\begin{gather}\psi^\star= 0 \text{ on }Y=0. \end{gather}$$
$$\begin{gather}\psi^\star= 0 \text{ on }Y=0. \end{gather}$$In the following we will use the non-dimensional variables proposed by Keady & Norbury (Reference Keady and Norbury1978), where the gravitational constant and the relative flux are scaled to unity. More precisely, we put
 \begin{align} x &= \lambda X, \quad y = \lambda Y,\quad \eta(x) = \lambda \eta^\star(\lambda^{{-}1} x), \quad \psi(x,y) = (m^\star)^{{-}1}\psi^\star(X,Y),\notag\\ r &= Q/(m^\star \lambda)^2, \end{align}
\begin{align} x &= \lambda X, \quad y = \lambda Y,\quad \eta(x) = \lambda \eta^\star(\lambda^{{-}1} x), \quad \psi(x,y) = (m^\star)^{{-}1}\psi^\star(X,Y),\notag\\ r &= Q/(m^\star \lambda)^2, \end{align}where
 \begin{equation} \lambda = \left(\frac{g}{m^2}\right)^{1/3}. \end{equation}
\begin{equation} \lambda = \left(\frac{g}{m^2}\right)^{1/3}. \end{equation}The corresponding non-dimensional problem is
 $$\begin{gather} {\rm \Delta} \psi=0 \text{ in } D_\eta = \{(x,y): 0 < y < \eta\}, \end{gather}$$
$$\begin{gather} {\rm \Delta} \psi=0 \text{ in } D_\eta = \{(x,y): 0 < y < \eta\}, \end{gather}$$ $$\begin{gather}\tfrac 12\lvert \boldsymbol{\nabla}\psi \rvert^2 + y = r \text{ on }y=\eta, \end{gather}$$
$$\begin{gather}\tfrac 12\lvert \boldsymbol{\nabla}\psi \rvert^2 + y = r \text{ on }y=\eta, \end{gather}$$ $$\begin{gather}\psi = 1 \text{ on }y=\eta, \end{gather}$$
$$\begin{gather}\psi = 1 \text{ on }y=\eta, \end{gather}$$ $$\begin{gather}\psi = 0 \text{ on }y=0. \end{gather}$$
$$\begin{gather}\psi = 0 \text{ on }y=0. \end{gather}$$ Here  $r$ is referred to as the Bernoulli constant. The above problem admits another spatial constant of motion (see Benjamin (Reference Benjamin1984) for more details), the flow force, defined as
$r$ is referred to as the Bernoulli constant. The above problem admits another spatial constant of motion (see Benjamin (Reference Benjamin1984) for more details), the flow force, defined as
 \begin{equation} {\mathcal{S}} = \int_0^{\eta}\left(\frac12(\psi_y^2 - \psi_x^2) - y + r \right)\, {\textrm{d} y}. \end{equation}
\begin{equation} {\mathcal{S}} = \int_0^{\eta}\left(\frac12(\psi_y^2 - \psi_x^2) - y + r \right)\, {\textrm{d} y}. \end{equation}
After taking the  $x$-derivative in (2.9) and using (2.8a) together with the boundary relation (2.8b), one verifies that
$x$-derivative in (2.9) and using (2.8a) together with the boundary relation (2.8b), one verifies that  ${\mathcal {S}}$ is independent of
${\mathcal {S}}$ is independent of  $x$. Our main result can now be stated as follows.
$x$. Our main result can now be stated as follows.
Theorem 2.1 Let  $(\psi ,\eta ) \in C^{2,\gamma }(\overline {D_\eta }) \times C^{2,\gamma }(\mathbb {R})$ for some
$(\psi ,\eta ) \in C^{2,\gamma }(\overline {D_\eta }) \times C^{2,\gamma }(\mathbb {R})$ for some  $\gamma \in (0,1)$ be a solution to (2.8) with
$\gamma \in (0,1)$ be a solution to (2.8) with  $\sup \eta < r$, other than a laminar flow. Then
$\sup \eta < r$, other than a laminar flow. Then  ${\mathcal {S}} > \frac 12 r^2$, where
${\mathcal {S}} > \frac 12 r^2$, where  $r$ and
$r$ and  ${\mathcal {S}}$ are the Bernoulli and the flow force constants respectively, defined by (2.8b) and (2.9).
${\mathcal {S}}$ are the Bernoulli and the flow force constants respectively, defined by (2.8b) and (2.9).
 By laminar flows we mean a solution independent of the  $x$-variable. Let us compare the quantity
$x$-variable. Let us compare the quantity  ${\mathcal {S}}_+(r)$ and the bound
${\mathcal {S}}_+(r)$ and the bound  $\frac 12 r^2$ from the theorem. Note that
$\frac 12 r^2$ from the theorem. Note that
 \begin{equation} {\mathcal{S}}_+(r) = \frac{1}{2 d_+(r)} - \frac12 d_+^2(r) + r d_+(r), \end{equation}
\begin{equation} {\mathcal{S}}_+(r) = \frac{1}{2 d_+(r)} - \frac12 d_+^2(r) + r d_+(r), \end{equation}
where  $d_+(r) > 1$ (the depth of the subcritical laminar flow) is the unique root of
$d_+(r) > 1$ (the depth of the subcritical laminar flow) is the unique root of  $\frac 12 d^{-2} + d = r$. Thus, for large
$\frac 12 d^{-2} + d = r$. Thus, for large  $r$, we find
$r$, we find  $d_+(r) \sim r$ and more precisely,
$d_+(r) \sim r$ and more precisely,
 \begin{equation} d_+(r) = r - \frac{1}{2 r^2} + O(r^{{-}3}) \ \text{as} \ r \to +\infty. \end{equation}
\begin{equation} d_+(r) = r - \frac{1}{2 r^2} + O(r^{{-}3}) \ \text{as} \ r \to +\infty. \end{equation}Therefore, we obtain
 \begin{align} {\mathcal{S}}_+(r) & = \frac{1}{2r} \left(1 + O(r^{{-}3})\right) - \frac12 \left( r^2 - \frac{1}{r} + O(r^{{-}2}) \right) + r \left(r - \frac{1}{2r^2} + O(r^{{-}3})\right) \nonumber\\ & = \frac{r^2}{2} + \frac{1}{2r} + O(r^{{-}2}). \end{align}
\begin{align} {\mathcal{S}}_+(r) & = \frac{1}{2r} \left(1 + O(r^{{-}3})\right) - \frac12 \left( r^2 - \frac{1}{r} + O(r^{{-}2}) \right) + r \left(r - \frac{1}{2r^2} + O(r^{{-}3})\right) \nonumber\\ & = \frac{r^2}{2} + \frac{1}{2r} + O(r^{{-}2}). \end{align}
We see that the curve  ${\mathcal {S}} = \frac 12 r^2$ is below the upper boundary
${\mathcal {S}} = \frac 12 r^2$ is below the upper boundary  ${\mathcal {S}} = {\mathcal {S}}_+(r)$ (see figure 1) for large
${\mathcal {S}} = {\mathcal {S}}_+(r)$ (see figure 1) for large  $r$ and is asymptotically accurate. On the other hand, curves
$r$ and is asymptotically accurate. On the other hand, curves  ${\mathcal {S}} = {\mathcal {S}}_-(r)$ and
${\mathcal {S}} = {\mathcal {S}}_-(r)$ and  ${\mathcal {S}} = \frac 12 r^2$ have one point of intersection, for which the Froude number
${\mathcal {S}} = \frac 12 r^2$ have one point of intersection, for which the Froude number  $F:=d^{-3/2} = 2$. Indeed, because
$F:=d^{-3/2} = 2$. Indeed, because
 \begin{equation} {\mathcal{S}}_-(r) = \frac{1}{2 d_-(r)} - \frac12 d_-^2(r) + r d_-(r), \end{equation}
\begin{equation} {\mathcal{S}}_-(r) = \frac{1}{2 d_-(r)} - \frac12 d_-^2(r) + r d_-(r), \end{equation}
the equality  ${\mathcal {S}}_-(r) = \frac 12 r^2$ is equivalent to
${\mathcal {S}}_-(r) = \frac 12 r^2$ is equivalent to  $(r-d_-)^2 = d_-^{-1}$, which in view of
$(r-d_-)^2 = d_-^{-1}$, which in view of  $r = \frac 12 d_-^{-2} + d_-$, leads to
$r = \frac 12 d_-^{-2} + d_-$, leads to  $d^{-3/2} = 2$. This recovers the well known bound
$d^{-3/2} = 2$. This recovers the well known bound  $F < 2$ for the Froude number of solitary waves; see Starr (Reference Starr1947), Amick & Toland (Reference Amick and Toland1981), McLeod (Reference McLeod1984).
$F < 2$ for the Froude number of solitary waves; see Starr (Reference Starr1947), Amick & Toland (Reference Amick and Toland1981), McLeod (Reference McLeod1984).
Using the result of Theorem 2.1 we can prove the following statement.
Theorem 2.2 There exists a constant  $C>0$ such that the inequalities
$C>0$ such that the inequalities
 \begin{equation} max{\eta} - min{\eta} < C r^{{-}2} \quad \text{and} \quad 0 < \psi_y < C r^{{-}1} \end{equation}
\begin{equation} max{\eta} - min{\eta} < C r^{{-}2} \quad \text{and} \quad 0 < \psi_y < C r^{{-}1} \end{equation}
hold true for any Stokes wave solution in  $D_\eta$. The constant
$D_\eta$. The constant  $C$ is independent of
$C$ is independent of  $r$.
$r$.
 Let us discuss the result, which might look surprising. First, we consider a sufficiently deep water wave  $\eta ^\star$ with
$\eta ^\star$ with  $c^2/(g d)$, being small, where
$c^2/(g d)$, being small, where  $d$ is the average water depth, being large. Then
$d$ is the average water depth, being large. Then  $m^\star \approx c d$ and
$m^\star \approx c d$ and  $Q^\star \approx g d$, so that
$Q^\star \approx g d$, so that  $r \approx (c^2/(g d))^{-1/3}$ is large. If
$r \approx (c^2/(g d))^{-1/3}$ is large. If  $a^\star$ is the amplitude of
$a^\star$ is the amplitude of  $\eta ^\star$, then the corresponding amplitude of
$\eta ^\star$, then the corresponding amplitude of  $\eta$ is
$\eta$ is
 \begin{equation} a = a_\star O(r^{{-}2}), \end{equation}
\begin{equation} a = a_\star O(r^{{-}2}), \end{equation}
which has the same order as the bound in Theorem 2.1. Thus, deep water waves correspond to  $(\psi ,\eta )$ with large
$(\psi ,\eta )$ with large  $r$, while the opposite is also true in view of (2.14a,b). Therefore, our result is sharp for deep water waves.
$r$, while the opposite is also true in view of (2.14a,b). Therefore, our result is sharp for deep water waves.
 Comparing variables, we find the corresponding inequality for the amplitude  $a$ in the original physical variables:
$a$ in the original physical variables:
 \begin{equation} a < C \frac{1}{\lambda r^2} = C \frac{m^2 g}{Q^2}. \end{equation}
\begin{equation} a < C \frac{1}{\lambda r^2} = C \frac{m^2 g}{Q^2}. \end{equation}
On the other hand, we have  $m \leq c d$ and
$m \leq c d$ and  $Q \geq g d$, so that
$Q \geq g d$, so that
 \begin{equation} a < C c^2 g^{{-}1}. \end{equation}
\begin{equation} a < C c^2 g^{{-}1}. \end{equation}This inequality is valid for all Stokes waves, irrespective of wavelength or amplitude, including extreme waves. This is confirmed numerically in several studies, such as Cokelet (Reference Cokelet1977) and Thomas (Reference Thomas1975).
 The result of Theorem 2.2 has another important meaning. We can imagine two extreme Stokes waves with  $m=1$ and corresponding mean depths
$m=1$ and corresponding mean depths  $d_1 = 10$ and
$d_1 = 10$ and  $d_2 = 1000$. The first wave has an amplitude of order
$d_2 = 1000$. The first wave has an amplitude of order  $10^{-2}$, while for the second it is of order
$10^{-2}$, while for the second it is of order  $10^{-6}$. Now we can consider two Stokes waves with the same depths and the same amplitude, of order
$10^{-6}$. Now we can consider two Stokes waves with the same depths and the same amplitude, of order  $10^{-2}$. Then the underlying flows of such waves must be very different and the mass flux of the second wave corresponding to the depth
$10^{-2}$. Then the underlying flows of such waves must be very different and the mass flux of the second wave corresponding to the depth  $d_2$ must be significantly greater. In other words, the wave with depth
$d_2$ must be significantly greater. In other words, the wave with depth  $d_2$ must be travelling much faster. Of course, this observation is also true for usual Stokes waves without stagnation points. This resembles the well known property of surface waves.
$d_2$ must be travelling much faster. Of course, this observation is also true for usual Stokes waves without stagnation points. This resembles the well known property of surface waves.
3. Proof of Theorem 2.1
3.1. Flow force function formulation
Based on the definition for the flow force constant (2.9), we introduce the corresponding flow force function
 \begin{equation} F(x,y) = \int_0^{y}\left(\frac12(\psi_y^2(x,y') - \psi_x^2(x,y')) - y' + r \right)\, {\textrm{d} y}'. \end{equation}
\begin{equation} F(x,y) = \int_0^{y}\left(\frac12(\psi_y^2(x,y') - \psi_x^2(x,y')) - y' + r \right)\, {\textrm{d} y}'. \end{equation}
Just as in Basu (Reference Basu2020) we can reformulate the water-wave problem in terms of the function  $F$. It is straightforward to obtain
$F$. It is straightforward to obtain
 \begin{equation} F_x = \psi_x \psi_y, \quad F_y = \tfrac12(\psi_y^2 - \psi_x^2) - y + r. \end{equation}
\begin{equation} F_x = \psi_x \psi_y, \quad F_y = \tfrac12(\psi_y^2 - \psi_x^2) - y + r. \end{equation}Thus, we arrive at an equivalent formulation given by
 $$\begin{gather} {\rm \Delta} F + 1=0 \quad \text{for } 0 < y < \eta, \end{gather}$$
$$\begin{gather} {\rm \Delta} F + 1=0 \quad \text{for } 0 < y < \eta, \end{gather}$$ $$\begin{gather}\tfrac 12 (-\eta' F_x + F_y) + y = r \text{ on }y=\eta, \end{gather}$$
$$\begin{gather}\tfrac 12 (-\eta' F_x + F_y) + y = r \text{ on }y=\eta, \end{gather}$$ $$\begin{gather}F = {\mathcal{S}} \text{ on }y=\eta, \end{gather}$$
$$\begin{gather}F = {\mathcal{S}} \text{ on }y=\eta, \end{gather}$$ $$\begin{gather}F = 0 \text{ on }y=0. \end{gather}$$
$$\begin{gather}F = 0 \text{ on }y=0. \end{gather}$$A reformulation as given above was already covered in Constantin & Strauss (Reference Constantin and Strauss2011) with an arbitrary vorticity. This reformulation is useful for an analysis of solutions with weak regularity.
 First, we show that  $F > 0$ for
$F > 0$ for  $y>0$. In fact, one can verify that
$y>0$. In fact, one can verify that
 \begin{equation} F_y \geq \psi_y^2 > 0 \quad \text{for } 0 \leq y \leq \eta. \end{equation}
\begin{equation} F_y \geq \psi_y^2 > 0 \quad \text{for } 0 \leq y \leq \eta. \end{equation}
It is enough to apply the maximum principle to the subharmonic function  $\varPhi =\frac 12(\psi _x^2 + \psi _y^2) + y$. A direct calculation gives
$\varPhi =\frac 12(\psi _x^2 + \psi _y^2) + y$. A direct calculation gives
 \begin{equation} {\rm \Delta} \varPhi = \psi_{xx}^2 + 2\psi_{xy}^2 + \psi_{yy}^2 \geq 0, \end{equation}
\begin{equation} {\rm \Delta} \varPhi = \psi_{xx}^2 + 2\psi_{xy}^2 + \psi_{yy}^2 \geq 0, \end{equation}
while  $\varPhi _y = 1$ on
$\varPhi _y = 1$ on  $y=0$. Thus, by the maximum principle, the maximum of
$y=0$. Thus, by the maximum principle, the maximum of  $\varPhi$ is attained at the surface, where
$\varPhi$ is attained at the surface, where  $\varPhi = r$ is constant. This shows that
$\varPhi = r$ is constant. This shows that  $0 < \varPhi < r$ in
$0 < \varPhi < r$ in  $D_\eta$. Using this in (3.2a,b) we obtain (3.4).
$D_\eta$. Using this in (3.2a,b) we obtain (3.4).
 Using equations for the flow force function  $F$ one can obtain an interesting formula for the flow force constant
$F$ one can obtain an interesting formula for the flow force constant  ${\mathcal {S}}$ of a Stokes wave. For this purpose we apply Green's first identity to find
${\mathcal {S}}$ of a Stokes wave. For this purpose we apply Green's first identity to find
 \begin{align} \int_{-\varLambda}^{\varLambda}\int_0^{\eta(x)} \{ ( {\rm \Delta} F \cdot y + F_y \} \, \textrm{d} x\, \textrm{d} y = \int_{-\varLambda}^{\varLambda} \eta(x) (F_x(x,\eta(x)),F_y(x,\eta(x))) \cdot (-\eta'(x),1)\, {\textrm{d} x}. \end{align}
\begin{align} \int_{-\varLambda}^{\varLambda}\int_0^{\eta(x)} \{ ( {\rm \Delta} F \cdot y + F_y \} \, \textrm{d} x\, \textrm{d} y = \int_{-\varLambda}^{\varLambda} \eta(x) (F_x(x,\eta(x)),F_y(x,\eta(x))) \cdot (-\eta'(x),1)\, {\textrm{d} x}. \end{align}
Here  $\varLambda$ is the half period, while
$\varLambda$ is the half period, while  $\eta (x)$ is even with respect to
$\eta (x)$ is even with respect to  $x$. Thus, using (3.2a,b) and (3.3) we conclude
$x$. Thus, using (3.2a,b) and (3.3) we conclude
 \begin{equation} 2 {\mathcal{S}} \varLambda - \frac12 \int_{-\varLambda}^{\varLambda} \eta^2\, {\textrm{d} x} = 2 \int_{-\varLambda}^{\varLambda} \eta (r-\eta)\, {\textrm{d} x}. \end{equation}
\begin{equation} 2 {\mathcal{S}} \varLambda - \frac12 \int_{-\varLambda}^{\varLambda} \eta^2\, {\textrm{d} x} = 2 \int_{-\varLambda}^{\varLambda} \eta (r-\eta)\, {\textrm{d} x}. \end{equation}Therefore, we obtain
 \begin{equation} {\mathcal{S}} = \frac{1}{2\varLambda} \int_{-\varLambda}^{\varLambda} \left( 2r\eta -\frac32 \eta^2 \right)\, {\textrm{d} x}. \end{equation}
\begin{equation} {\mathcal{S}} = \frac{1}{2\varLambda} \int_{-\varLambda}^{\varLambda} \left( 2r\eta -\frac32 \eta^2 \right)\, {\textrm{d} x}. \end{equation}This equality is similar to Starr's identity for the Froude number of a solitary wave; see Starr (Reference Starr1947).
 In what follows it is convenient to have the ‘mass flux’  $F = {\mathcal {S}}$ scaled to
$F = {\mathcal {S}}$ scaled to  $1$. This suggests new variables
$1$. This suggests new variables
 \begin{equation} X = {\mathcal{S}}^{{-}1/2} x, \quad Y = {\mathcal{S}}^{{-}1/2} y, \quad \zeta(X) = {\mathcal{S}}^{{-}1/2} \eta(x), \quad \bar{F}(X,Y) = {\mathcal{S}}^{{-}1} F(x,y). \end{equation}
\begin{equation} X = {\mathcal{S}}^{{-}1/2} x, \quad Y = {\mathcal{S}}^{{-}1/2} y, \quad \zeta(X) = {\mathcal{S}}^{{-}1/2} \eta(x), \quad \bar{F}(X,Y) = {\mathcal{S}}^{{-}1} F(x,y). \end{equation}The scaled problem is
 $$\begin{gather} {\rm \Delta} \bar{F} + 1=0 \text{ for } 0 < Y < \zeta, \end{gather}$$
$$\begin{gather} {\rm \Delta} \bar{F} + 1=0 \text{ for } 0 < Y < \zeta, \end{gather}$$ $$\begin{gather}\tfrac 12 (-\zeta_X \bar{F}_X + \bar{F}_Y) + Y = R:=r {\mathcal{S}}^{{-}1/2} \text{ on }Y=\zeta, \end{gather}$$
$$\begin{gather}\tfrac 12 (-\zeta_X \bar{F}_X + \bar{F}_Y) + Y = R:=r {\mathcal{S}}^{{-}1/2} \text{ on }Y=\zeta, \end{gather}$$ $$\begin{gather}\bar{F} = 1 \text{ on }Y=\zeta, \end{gather}$$
$$\begin{gather}\bar{F} = 1 \text{ on }Y=\zeta, \end{gather}$$ $$\begin{gather}\bar{F} = 0 \text{ on }Y=0. \end{gather}$$
$$\begin{gather}\bar{F} = 0 \text{ on }Y=0. \end{gather}$$Furthermore, in view of (3.4), we additionally have
 \begin{equation} \inf \bar{F}_Y > 0. \end{equation}
\begin{equation} \inf \bar{F}_Y > 0. \end{equation}
Note that  $\psi _y$ is always separated from zero for any smooth steady wave as in Theorem 2.1.
$\psi _y$ is always separated from zero for any smooth steady wave as in Theorem 2.1.
 We are going to prove certain bounds for the Bernoulli constant  $R$ in (3.10b). Note that the system (3.10) is very similar to the stream-function formulation of the water problem with constant vorticity, for which the desired bounds were obtained in Kozlov, Kuznetsov & Lokharu (Reference Kozlov, Kuznetsov and Lokharu2015). Thus, a similar argument can be applied here and we adapt it below.
$R$ in (3.10b). Note that the system (3.10) is very similar to the stream-function formulation of the water problem with constant vorticity, for which the desired bounds were obtained in Kozlov, Kuznetsov & Lokharu (Reference Kozlov, Kuznetsov and Lokharu2015). Thus, a similar argument can be applied here and we adapt it below.
3.2. Stream solutions
 In order to obtain bounds for  $R$ we need to study stream solutions to (3.10). These are pairs
$R$ we need to study stream solutions to (3.10). These are pairs  $\bar {F} = U(Y;d), \zeta (X) = d$, parametrised by the depth
$\bar {F} = U(Y;d), \zeta (X) = d$, parametrised by the depth  $d > 0$. Using this ansatz in (3.10), one finds
$d > 0$. Using this ansatz in (3.10), one finds
 \begin{equation} U(Y;d) ={-} \tfrac12 Y^2 + (d^{{-}1} + \tfrac12 d) Y. \end{equation}
\begin{equation} U(Y;d) ={-} \tfrac12 Y^2 + (d^{{-}1} + \tfrac12 d) Y. \end{equation}The corresponding Bernoulli constant is given by
 \begin{equation} R(d) = \frac{1}{2d} + \frac{3d}{4}. \end{equation}
\begin{equation} R(d) = \frac{1}{2d} + \frac{3d}{4}. \end{equation}
In view of (3.10e) we are interested in unidirectional solutions only, for which  $U_Y > 0$ on
$U_Y > 0$ on  $[0,d]$. As a result we obtain a restriction on
$[0,d]$. As a result we obtain a restriction on  $d$ such that
$d$ such that
 \begin{equation} 0 < d < d_0 = \sqrt{2}, \end{equation}
\begin{equation} 0 < d < d_0 = \sqrt{2}, \end{equation}
where the critical value  $d_0$ is characterised by the relation
$d_0$ is characterised by the relation  $U_Y(d_0;d_0) = 0$, while
$U_Y(d_0;d_0) = 0$, while  $U_Y(Y;d_0) > 0$ for
$U_Y(Y;d_0) > 0$ for  $Y \in [0,d_0)$. Let us put
$Y \in [0,d_0)$. Let us put
 \begin{equation} R_0 = R(d_0) = \sqrt{2} \quad \text{and} \quad R_c = \sqrt{\frac32}, \quad d_c = \sqrt{\frac23}. \end{equation}
\begin{equation} R_0 = R(d_0) = \sqrt{2} \quad \text{and} \quad R_c = \sqrt{\frac32}, \quad d_c = \sqrt{\frac23}. \end{equation}
Note that  $R_c = R(d_c)$ is the global minimum of
$R_c = R(d_c)$ is the global minimum of  $R(d)$. Thus, for any
$R(d)$. Thus, for any  $R \in (R_c,R_0)$ there are two solutions
$R \in (R_c,R_0)$ there are two solutions  $d = d_-(R)$ and
$d = d_-(R)$ and  $d = R_+(d)$ with
$d = R_+(d)$ with  $d_-(R) < d_+(R)$ to the equation
$d_-(R) < d_+(R)$ to the equation
 \begin{equation} R = R(d). \end{equation}
\begin{equation} R = R(d). \end{equation}
For  $R>R_0$ there is only one root
$R>R_0$ there is only one root  $d = d_-(R)$. These depths are analogous to the subcritical and supercritical depths of conjugate laminar flows of the original water-wave problem.
$d = d_-(R)$. These depths are analogous to the subcritical and supercritical depths of conjugate laminar flows of the original water-wave problem.
3.3. Bounds for the Bernoulli constant
Our aim is to prove the following theorem:
Theorem 3.1 Let  $(\bar {F},\zeta )$ be an arbitrary non-trivial (other than a stream) solution to (3.10). Then the corresponding Bernoulli constant
$(\bar {F},\zeta )$ be an arbitrary non-trivial (other than a stream) solution to (3.10). Then the corresponding Bernoulli constant  $R$ is subject to the inequality
$R$ is subject to the inequality  $R < R_0$.
$R < R_0$.
 Note that the statement of Theorem 2.1 follows directly from the upper bound  $R < R_0$. Indeed, a non-trivial solution
$R < R_0$. Indeed, a non-trivial solution  $(\psi ,\eta )$ of the original system (2.8) with the Bernoulli constant
$(\psi ,\eta )$ of the original system (2.8) with the Bernoulli constant  $r$ generates a solution to (3.10) with
$r$ generates a solution to (3.10) with  $R = r\sqrt {{\mathcal {S}}}^{-1}$. Now, Theorem 3.1 gives
$R = r\sqrt {{\mathcal {S}}}^{-1}$. Now, Theorem 3.1 gives  $r\sqrt {{\mathcal {S}}}^{-1} < R_0 = \sqrt {2}$, which is equivalent to
$r\sqrt {{\mathcal {S}}}^{-1} < R_0 = \sqrt {2}$, which is equivalent to  ${\mathcal {S}} > \frac 12 r^2$ as stated in Theorem 2.1.
${\mathcal {S}} > \frac 12 r^2$ as stated in Theorem 2.1.
 Proof of Theorem 3.1. Our argument is based on a comparison of a given solution  $(\bar {F},\zeta )$ to different stream solutions from § 3.2. For this purpose we will apply the partial hodograph transformation introduced in Dubreil-Jacotin (Reference Dubreil-Jacotin1934) but for the flow force function formulation (3.10). This is possible because
$(\bar {F},\zeta )$ to different stream solutions from § 3.2. For this purpose we will apply the partial hodograph transformation introduced in Dubreil-Jacotin (Reference Dubreil-Jacotin1934) but for the flow force function formulation (3.10). This is possible because  $\bar {F}_Y > 0$ everywhere by (3.4). Thus, we introduce new independent variables
$\bar {F}_Y > 0$ everywhere by (3.4). Thus, we introduce new independent variables
 \begin{equation} q = X, \quad p = \bar{F}(X,Y), \end{equation}
\begin{equation} q = X, \quad p = \bar{F}(X,Y), \end{equation}
while the new unknown function  $h(q,p)$ (height function) is defined from the identity
$h(q,p)$ (height function) is defined from the identity
 \begin{equation} h(q,p) = Y. \end{equation}
\begin{equation} h(q,p) = Y. \end{equation}
Note that  $h$ is related to the flow force function
$h$ is related to the flow force function  $\bar {F}$ through the formulas
$\bar {F}$ through the formulas
 \begin{equation} \bar{F}_X ={-} \frac{h_q}{h_p}, \quad \bar{F}_Y = \frac{1}{h_p}, \end{equation}
\begin{equation} \bar{F}_X ={-} \frac{h_q}{h_p}, \quad \bar{F}_Y = \frac{1}{h_p}, \end{equation}where
 \begin{equation} \inf_S h_p > 0 \quad \text{and} \quad \sup_S h_p <{+}\infty. \end{equation}
\begin{equation} \inf_S h_p > 0 \quad \text{and} \quad \sup_S h_p <{+}\infty. \end{equation}
An equivalent problem for  $h$ is given by
$h$ is given by
 $$\begin{gather} \left( \frac{1+h_q^2}{2h_p^2} + p \right)_p - \left(\frac{h_q}{h_p}\right)_q =0 \text{ in } \varGamma = \mathbb{R} \times (0,1), \end{gather}$$
$$\begin{gather} \left( \frac{1+h_q^2}{2h_p^2} + p \right)_p - \left(\frac{h_q}{h_p}\right)_q =0 \text{ in } \varGamma = \mathbb{R} \times (0,1), \end{gather}$$ $$\begin{gather}\frac{1+h_q^2}{2h_p} + h = R \text{ on }p=1, \end{gather}$$
$$\begin{gather}\frac{1+h_q^2}{2h_p} + h = R \text{ on }p=1, \end{gather}$$ $$\begin{gather}h = 0 \text{ on }p=0. \end{gather}$$
$$\begin{gather}h = 0 \text{ on }p=0. \end{gather}$$
The surface profile  $\zeta$ becomes the boundary value of
$\zeta$ becomes the boundary value of  $h$ on
$h$ on  $p = 1$ such that
$p = 1$ such that
 \begin{equation} h(q,1) = \zeta(q), \quad q \in \mathbb{R}. \end{equation}
\begin{equation} h(q,1) = \zeta(q), \quad q \in \mathbb{R}. \end{equation}
For a detailed derivation of (3.19) we refer to Basu (Reference Basu2020). Applying a similar transformation for the stream functions  $U(Y;d)$, we obtain the corresponding height functions
$U(Y;d)$, we obtain the corresponding height functions  $H(p;d)$, subject to
$H(p;d)$, subject to
 \begin{equation} \left(\frac{1}{2H_p^2} + p\right)_p = 0, \quad H(0) = 0, \quad H(1) = d, \quad \frac{1}{2 H_p(1;d)} + d = R(d). \end{equation}
\begin{equation} \left(\frac{1}{2H_p^2} + p\right)_p = 0, \quad H(0) = 0, \quad H(1) = d, \quad \frac{1}{2 H_p(1;d)} + d = R(d). \end{equation}
Because the domain for  $h$ and
$h$ and  $H$ is the same, we can compare these functions using the maximum principle. More precisely, we put
$H$ is the same, we can compare these functions using the maximum principle. More precisely, we put
 \begin{equation} w^{(d)} = h - H({\cdot};d), \end{equation}
\begin{equation} w^{(d)} = h - H({\cdot};d), \end{equation}
and using the corresponding equations for  $h$ and
$h$ and  $H$ one finds that
$H$ one finds that  $w^{(d)}$ solves the homogeneous elliptic equation
$w^{(d)}$ solves the homogeneous elliptic equation
 \begin{equation} \frac{1+h_q^2}{h_p^2} w^{(d)}_{pp} - 2\frac{h_q}{h_p} w^{(d)}_{qp} + w^{(d)}_{qq} - w^{(d)}_p + \frac{(w^{(d)}_q)^2 H_{pp}}{h_p^2} - \frac{w^{(d)}_p (h_p + H_p) H_{pp}}{h_p^2 H_p^2} = 0. \end{equation}
\begin{equation} \frac{1+h_q^2}{h_p^2} w^{(d)}_{pp} - 2\frac{h_q}{h_p} w^{(d)}_{qp} + w^{(d)}_{qq} - w^{(d)}_p + \frac{(w^{(d)}_q)^2 H_{pp}}{h_p^2} - \frac{w^{(d)}_p (h_p + H_p) H_{pp}}{h_p^2 H_p^2} = 0. \end{equation}
This equation can be obtained directly by substituting  $h = H + w^{(d)}$ into (3.19b) and then using the corresponding equation for
$h = H + w^{(d)}$ into (3.19b) and then using the corresponding equation for  $H$. Thus, every
$H$. Thus, every  $w^{(d)}$ is subject to the maximum principle; see Vitolo (Reference Vitolo2007) for an elliptic maximum principle in unbounded domains.
$w^{(d)}$ is subject to the maximum principle; see Vitolo (Reference Vitolo2007) for an elliptic maximum principle in unbounded domains.
The proof relies on two lemmas.
Lemma 3.2 There exists  $0 < d < d_0$ such that
$0 < d < d_0$ such that  $\sup _\mathbb {R} \zeta < d$.
$\sup _\mathbb {R} \zeta < d$.
Proof. Note that when  $d \to d_0 = \sqrt {2}$ we have
$d \to d_0 = \sqrt {2}$ we have  $H_p(1;d) \to +\infty$. Thus, we can choose
$H_p(1;d) \to +\infty$. Thus, we can choose  $d < \sqrt {2}$ for which
$d < \sqrt {2}$ for which  $w^{(d)}_p < 0$ everywhere on
$w^{(d)}_p < 0$ everywhere on  $p=1$. Let us show that the supremum of
$p=1$. Let us show that the supremum of  $w:=w^{(d)}$ in
$w:=w^{(d)}$ in  $\varGamma$ cannot be attained on
$\varGamma$ cannot be attained on  $p=1$. This claim is trivial when the supremum of
$p=1$. This claim is trivial when the supremum of  $w$ is attained at some point. The remaining possibility is that there exists an unbounded sequence
$w$ is attained at some point. The remaining possibility is that there exists an unbounded sequence  $\{q_j\}$ for which
$\{q_j\}$ for which  $\sup _\varGamma w = \lim _{j \to +\infty } w(q_j,1)$. Then we consider shifts
$\sup _\varGamma w = \lim _{j \to +\infty } w(q_j,1)$. Then we consider shifts  $w_j(q,p) = w(q - q_j,p)$ and use a compactness argument as follows. The regularity assumption from Theorem 2.1 and the relation (3.19a) guarantee that
$w_j(q,p) = w(q - q_j,p)$ and use a compactness argument as follows. The regularity assumption from Theorem 2.1 and the relation (3.19a) guarantee that
 \begin{equation} \|h\|_{C^{2,\gamma}(\bar{\varGamma})} <{+} \infty, \end{equation}
\begin{equation} \|h\|_{C^{2,\gamma}(\bar{\varGamma})} <{+} \infty, \end{equation}
and so the norms  $\|w_j\|_{C^{2,\gamma }(\bar {\varGamma })}$ are uniformly bounded. By the compactness argument there exists a subsequence
$\|w_j\|_{C^{2,\gamma }(\bar {\varGamma })}$ are uniformly bounded. By the compactness argument there exists a subsequence  $\{q_{j_k}\}$ and a function
$\{q_{j_k}\}$ and a function  $w_0 \in C^{2}(\bar {\varGamma })$ with the following property: for any compact
$w_0 \in C^{2}(\bar {\varGamma })$ with the following property: for any compact  $K \subset \bar {S}$, functions
$K \subset \bar {S}$, functions  $w_{j_k}$ converge to
$w_{j_k}$ converge to  $w_0$ in
$w_0$ in  $C^2(K)$. Thus, the limiting function
$C^2(K)$. Thus, the limiting function  $w_0$ attains its global maximum at
$w_0$ attains its global maximum at  $(0,1)$, while
$(0,1)$, while  $(w_0)_q(0,1) \leq 0$ because
$(w_0)_q(0,1) \leq 0$ because  $w_p < 0$ on
$w_p < 0$ on  $p=1$ by the choice of
$p=1$ by the choice of  $d$. But this contradicts the Hopf lemma and then we have shown that the supremum of
$d$. But this contradicts the Hopf lemma and then we have shown that the supremum of  $w$ can not be attained for
$w$ can not be attained for  $p=1$. Now the maximum principle for
$p=1$. Now the maximum principle for  $w$ forces
$w$ forces  $w \geq 0$ in
$w \geq 0$ in  $\varGamma$ and then
$\varGamma$ and then  $\zeta \leq d$. This finishes the proof of the lemma.
$\zeta \leq d$. This finishes the proof of the lemma.
Lemma 3.3 For any  $q \in \mathbb {R}$ we have
$q \in \mathbb {R}$ we have  $\zeta (d) > d_-(R)$.
$\zeta (d) > d_-(R)$.
Proof. By Lemma 3.2 we can find  $0 < d_{\dagger} < d_0$ such that
$0 < d_{\dagger} < d_0$ such that  $\inf _{\mathbb {R}} \zeta = d_{\dagger}$; note that
$\inf _{\mathbb {R}} \zeta = d_{\dagger}$; note that  $\inf _{\mathbb {R}} \zeta > 0$ as shown in Kozlov & Kuznetsov (Reference Kozlov and Kuznetsov2007) (Theorem 2.3). Thus, the function
$\inf _{\mathbb {R}} \zeta > 0$ as shown in Kozlov & Kuznetsov (Reference Kozlov and Kuznetsov2007) (Theorem 2.3). Thus, the function  $w:=w^{(d_{\dagger} )}$ is non-negative along the boundaries and so must be strictly positive in
$w:=w^{(d_{\dagger} )}$ is non-negative along the boundaries and so must be strictly positive in  $\varGamma$ by the maximum principle. The zero infimum of
$\varGamma$ by the maximum principle. The zero infimum of  $w$ is attained at the surface by the choice of
$w$ is attained at the surface by the choice of  $d$, so that there exists a minimising sequence
$d$, so that there exists a minimising sequence  $\{q_j\}$ such that
$\{q_j\}$ such that  $w(q_j,1) \to 0$ as
$w(q_j,1) \to 0$ as  $j \to +\infty$. Then we claim that
$j \to +\infty$. Then we claim that  $w_q(q_j,1) \to 0$ as
$w_q(q_j,1) \to 0$ as  $j \to +\infty$ and
$j \to +\infty$ and  $\liminf w_p(q_j,1) \leq 0$. These statements are trivial if the sequence
$\liminf w_p(q_j,1) \leq 0$. These statements are trivial if the sequence  $\{q_j\}$ is bounded. Otherwise we can use a similar compactness argument as in the proof of Lemma 3.2. Thus, passing to the limit in (3.19c) we obtain
$\{q_j\}$ is bounded. Otherwise we can use a similar compactness argument as in the proof of Lemma 3.2. Thus, passing to the limit in (3.19c) we obtain
 \begin{equation} R = \lim_{j \to +\infty} h(q_j,1) + \frac{1+h_q^2(q_j,1)}{2h_p^2(q_j,1)} \geq d_{\dagger}+ \frac{1}{2H_p^2(1;d_{\dagger})} = R(d_{\dagger}), \end{equation}
\begin{equation} R = \lim_{j \to +\infty} h(q_j,1) + \frac{1+h_q^2(q_j,1)}{2h_p^2(q_j,1)} \geq d_{\dagger}+ \frac{1}{2H_p^2(1;d_{\dagger})} = R(d_{\dagger}), \end{equation}
which implies that  $\zeta (q) \geq d_{\dagger} \geq d_-(R)$ for all
$\zeta (q) \geq d_{\dagger} \geq d_-(R)$ for all  $q \in \mathbb {R}$. The inequality here is strict until
$q \in \mathbb {R}$. The inequality here is strict until  $\zeta$ is not a constant everywhere, which follows from the Hopf lemma.
$\zeta$ is not a constant everywhere, which follows from the Hopf lemma.
 Now we can prove that  $R < R_0$. Let us choose
$R < R_0$. Let us choose  $d_\star < d_0$ such that
$d_\star < d_0$ such that  $\sup _{\mathbb {R}} \zeta = d_\star$. Just as before we can find a maximising sequence
$\sup _{\mathbb {R}} \zeta = d_\star$. Just as before we can find a maximising sequence  $q_j$, possibly unbounded, such that
$q_j$, possibly unbounded, such that
 \begin{equation} \lim_{j \to +\infty} \zeta(q_j) = d_\star, \quad \lim_{j \to +\infty} w_q^{(d_\star)}(q_j,1) = 0, \quad \lim_{j \to +\infty} w_p^{(d_\star)}(q_j,1) \geq 0. \end{equation}
\begin{equation} \lim_{j \to +\infty} \zeta(q_j) = d_\star, \quad \lim_{j \to +\infty} w_q^{(d_\star)}(q_j,1) = 0, \quad \lim_{j \to +\infty} w_p^{(d_\star)}(q_j,1) \geq 0. \end{equation}
Thus, taking the corresponding limit in (3.19c) and using relations (3.26a–c) we obtain  $R \leq R(d_\star )$. Assuming
$R \leq R(d_\star )$. Assuming  $R \geq R_0$ we would conclude that
$R \geq R_0$ we would conclude that  $d_\star < d_-(R)$, which contradicts the statement of Lemma 3.3. Therefore, we have shown
$d_\star < d_-(R)$, which contradicts the statement of Lemma 3.3. Therefore, we have shown  $R < R_0$. This finishes the proof of the theorem.
$R < R_0$. This finishes the proof of the theorem.
3.4. Proof of Theorem 2.2
 First, we show that the corresponding inequality for  $\psi _y$ is valid along the bottom. For this purpose we consider the function
$\psi _y$ is valid along the bottom. For this purpose we consider the function
 \begin{equation} f = F - {\mathcal{S}} \psi, \end{equation}
\begin{equation} f = F - {\mathcal{S}} \psi, \end{equation}
which equals zero on  $y = \eta$ and
$y = \eta$ and  $y=0$. From (3.3a) we find that
$y=0$. From (3.3a) we find that  ${\rm \Delta} f = - 1$ in
${\rm \Delta} f = - 1$ in  $D_\eta$, and so
$D_\eta$, and so  $f_y > 0$ on
$f_y > 0$ on  $y=0$ by the maximum principle. This gives
$y=0$ by the maximum principle. This gives
 \begin{equation} \tfrac12 \psi_y^2 + r - {\mathcal{S}} \psi_y > 0 \text{ on } y = 0. \end{equation}
\begin{equation} \tfrac12 \psi_y^2 + r - {\mathcal{S}} \psi_y > 0 \text{ on } y = 0. \end{equation}
Now we note that because the function  $\frac 12 \psi _x^2+\frac 12 \psi _y^2 + y$ attains its maximum at the upper boundary, we have
$\frac 12 \psi _x^2+\frac 12 \psi _y^2 + y$ attains its maximum at the upper boundary, we have  $\frac 12 \psi _y^2 \leq r$ in
$\frac 12 \psi _y^2 \leq r$ in  $D_\eta$. Using this in (3.28), we obtain
$D_\eta$. Using this in (3.28), we obtain
 \begin{equation} \psi_y < 2r {\mathcal{S}}^{{-}1} < 4 r^{{-}1}, \end{equation}
\begin{equation} \psi_y < 2r {\mathcal{S}}^{{-}1} < 4 r^{{-}1}, \end{equation}
where the right inequality is valid because  ${\mathcal {S}} > \frac 12 r^2$ by Theorem 2.1. The opposite inequality
${\mathcal {S}} > \frac 12 r^2$ by Theorem 2.1. The opposite inequality  $\psi _y > r^{-1}$ for
$\psi _y > r^{-1}$ for  $y=0$ follows from the maximum principle applied to the function
$y=0$ follows from the maximum principle applied to the function  $g = \psi - r^{-1}y$, which attains its minimum on
$g = \psi - r^{-1}y$, which attains its minimum on  $y=0$. Thus, we have shown that
$y=0$. Thus, we have shown that
 \begin{equation} r^{{-}1} < \psi_y < 4 r^{{-}1} \text{ on } y = 0. \end{equation}
\begin{equation} r^{{-}1} < \psi_y < 4 r^{{-}1} \text{ on } y = 0. \end{equation}
Now we are ready to prove the main claim. It is enough to show that  $r-\check {\eta } < C r^{-2}$, where
$r-\check {\eta } < C r^{-2}$, where  $\check {\eta } = min_\mathbb {R} \eta$. Let us consider the function
$\check {\eta } = min_\mathbb {R} \eta$. Let us consider the function
 \begin{equation} g(x,y)={-}k \psi+\tfrac{1}{2} (\psi_y^2+\psi_x^2)+y, \quad (x,y) \in D_\eta, \end{equation}
\begin{equation} g(x,y)={-}k \psi+\tfrac{1}{2} (\psi_y^2+\psi_x^2)+y, \quad (x,y) \in D_\eta, \end{equation}
where  $k$ is to be determined later. A direct computation shows that
$k$ is to be determined later. A direct computation shows that
 \begin{equation} {\rm \Delta} g \geq 0, \end{equation}
\begin{equation} {\rm \Delta} g \geq 0, \end{equation}
so that the maximum of  $g$ is attained at the boundary of
$g$ is attained at the boundary of  $D_\eta$. At the bottom we have
$D_\eta$. At the bottom we have
 \begin{equation} g_y = 1- k \psi_y > 0, \end{equation}
\begin{equation} g_y = 1- k \psi_y > 0, \end{equation}
by (3.30), provided  $k = r/4$. Therefore, the maximum of
$k = r/4$. Therefore, the maximum of  $g$ is attained at the surface and the Hopf lemma gives
$g$ is attained at the surface and the Hopf lemma gives
 \begin{equation} g_y = \frac{\eta'' \psi_y^2-k (\eta'^2+1) \psi_y+1}{\eta'^2+1} > 0, \quad y = \eta(x). \end{equation}
\begin{equation} g_y = \frac{\eta'' \psi_y^2-k (\eta'^2+1) \psi_y+1}{\eta'^2+1} > 0, \quad y = \eta(x). \end{equation}This inequality is equivalent to
 \begin{equation} 2 \eta'' (r-\eta) (1+\eta'^2)^{{-}1}-k \sqrt{2(r-\eta)(1+\eta'^2)}+1 \geq 0. \end{equation}
\begin{equation} 2 \eta'' (r-\eta) (1+\eta'^2)^{{-}1}-k \sqrt{2(r-\eta)(1+\eta'^2)}+1 \geq 0. \end{equation}
Let  $x_0$ be such that
$x_0$ be such that  $\eta ''(x_0) = 0$, then the latter inequality shows that
$\eta ''(x_0) = 0$, then the latter inequality shows that
 \begin{equation} r-\eta(x_0) < 8 r^{{-}2}. \end{equation}
\begin{equation} r-\eta(x_0) < 8 r^{{-}2}. \end{equation}
Without loss of generality we can assume that  $\eta (0) = \check {\eta }$. If
$\eta (0) = \check {\eta }$. If  $r-\check {\eta } \geq 16 r^{-2}$, the proof is complete. Otherwise, if
$r-\check {\eta } \geq 16 r^{-2}$, the proof is complete. Otherwise, if  $r-\check {\eta } > 16 r^{-2}$ we put
$r-\check {\eta } > 16 r^{-2}$ we put  $\delta = r-\check {\eta }$ and note that
$\delta = r-\check {\eta }$ and note that
 \begin{equation} \tfrac12 \delta < r- \eta(x) \leq \delta, \end{equation}
\begin{equation} \tfrac12 \delta < r- \eta(x) \leq \delta, \end{equation}
for all  $x \in [0,\delta /2]$. Here we used the fact that
$x \in [0,\delta /2]$. Here we used the fact that  $|\eta '| < 1$ on
$|\eta '| < 1$ on  $\mathbb {R}$ for any Stokes wave solution; see Strauss & Wheeler (Reference Strauss and Wheeler2016). Furthermore,
$\mathbb {R}$ for any Stokes wave solution; see Strauss & Wheeler (Reference Strauss and Wheeler2016). Furthermore,  $\eta ''(x) > 0$ on
$\eta ''(x) > 0$ on  $[0,\delta ]$ by (3.36). Now (3.35) leads to
$[0,\delta ]$ by (3.36). Now (3.35) leads to
 \begin{equation} 2 \delta \eta''(x) \geq k \sqrt{\delta} - 1, \quad x \in [0,\delta/2]. \end{equation}
\begin{equation} 2 \delta \eta''(x) \geq k \sqrt{\delta} - 1, \quad x \in [0,\delta/2]. \end{equation}
After integration over  $[0,\delta ]$, we conclude that
$[0,\delta ]$, we conclude that
 \begin{equation} k \sqrt{\delta} - 1 \leq 4 \eta'(\delta/2) \leq 4. \end{equation}
\begin{equation} k \sqrt{\delta} - 1 \leq 4 \eta'(\delta/2) \leq 4. \end{equation}
Hence,  $\delta < 16^2 r^{-2}$ by the choice of
$\delta < 16^2 r^{-2}$ by the choice of  $k$, and we have proved that
$k$, and we have proved that  $r-\check {\eta } < 16^2 r^{-2}$. This inequality implies that
$r-\check {\eta } < 16^2 r^{-2}$. This inequality implies that  $\psi _y < C r^{-1}$ along the surface, so that the interior estimate for
$\psi _y < C r^{-1}$ along the surface, so that the interior estimate for  $\psi _y$ follows from the maximum principle. This finishes the proof of Theorem 2.2.
$\psi _y$ follows from the maximum principle. This finishes the proof of Theorem 2.2.
Funding
This research received no specific grant from any funding agency, commercial or not-for-profit sectors.
Declaration of interest
The authors report no conflict of interest.
 
 












