Abstract: The variance of a linear statistic defined on multisets of necklaces is explored. Upper and lower bounds with optimal constants are obtained.
Keywords: Turán–Kubilius inequality, polynomials over a finite field, additive function.
Summary: The variance of a linear statistic defined on the multiset of random necklaces is examined. Sharp upper and lower bounds are obtained.
Keywords: Turán–Kubilius inequality, polynomials over a finite field, additive function.
Effective bounds of the variance of statistics on multisets of necklaces

Received: 28 October 2020
Published: 18 February 2021
Let (P, ‖·‖) be an initial set of weighted objects and

π(j) := #{p ∈ P : ‖p‖ = j} < ∞

for every j = 1, 2, … . Examine the set G, with the extended weight function ‖·‖, of multisets comprised of p ∈ P. Namely, a ∈ G if a = {p_1, …, p_r} and ‖a‖ = ‖p_1‖ + ⋯ + ‖p_r‖, including the empty multiset ∅ of weight 0. Then

m(n) := #{a ∈ G : ‖a‖ = n} = \sum_{l(\bar k) = n} \prod_{j=1}^{n} \binom{\pi(j) + k_j - 1}{k_j},

where l(\bar k) = 1k_1 + ⋯ + nk_n if \bar k = (k_1, …, k_n) ∈ ℤ₊ⁿ.
In the present paper, we deal with the multisets for which m(n) = q^n, where q ≥ 2 is an arbitrary natural number. If q is a prime power, then G may be interpreted as the set of monic polynomials over a finite field F_q; then P is the subset of irreducible polynomials. For an arbitrary such q, there exist combinatorial constructions, called multisets of necklaces, satisfying m(n) = q^n (see [1, Example 2.12, p. 43]). For multisets, we have the following relations
q^n = \sum_{d \mid n} d\,\pi(d), \qquad \pi(n) = \frac{1}{n} \sum_{d \mid n} \mu(d)\, q^{n/d},    (1)

where, in the summations, d runs over the natural divisors of n and μ(d) stands for the Möbius function. The equalities are equivalent to the formal power series relation

\prod_{j=1}^{\infty} \bigl(1 - z^j\bigr)^{-\pi(j)} = \frac{1}{1 - qz}.
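These relations can be checked numerically. The sketch below (plain Python; the helper names `mobius`, `necklace_count` and `euler_product_coeffs` are illustrative, not from the original) computes π(n) by the Möbius formula and verifies that the Taylor coefficients of the truncated product equal q^n:

```python
from math import comb

def mobius(n):
    # Moebius function mu(n) via trial division
    res, d = 1, 2
    while d * d <= n:
        if n % d == 0:
            n //= d
            if n % d == 0:
                return 0      # a squared prime factor
            res = -res
        d += 1
    return -res if n > 1 else res

def necklace_count(n, q):
    # pi(n) = (1/n) * sum_{d | n} mu(d) * q^(n/d)
    s = sum(mobius(d) * q ** (n // d) for d in range(1, n + 1) if n % d == 0)
    assert s % n == 0          # the count is always an integer
    return s // n

def euler_product_coeffs(q, N):
    # Taylor coefficients of prod_{j <= N} (1 - z^j)^(-pi(j)) up to z^N
    series = [1] + [0] * N
    for j in range(1, N + 1):
        pj = necklace_count(j, q)
        factor = [0] * (N + 1)
        for m in range(N // j + 1):
            factor[j * m] = comb(pj + m - 1, m)   # expansion of (1 - z^j)^(-pj)
        series = [sum(series[i] * factor[k - i] for i in range(k + 1))
                  for k in range(N + 1)]
    return series

q, N = 3, 8
assert euler_product_coeffs(q, N) == [q ** n for n in range(N + 1)]
```

Factors with j > N contribute only to powers above z^N, so the truncation is exact up to degree N.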
Take an a ∈ G_n uniformly at random, that is, sample it with probability ν_n({a}) = q^{-n}, n ∈ ℕ, and ν_0({∅}) = 1. If k_j(a) ≥ 0 is the number of elements p_i of weight j in a ∈ G_n, then \bar k(a) = (k_1(a), …, k_n(a)) is the structure vector of a ∈ G_n satisfying l(\bar k(a)) = n. Its distribution is
ν_n\bigl(\bar k(a) = \bar k\bigr) = q^{-n} \prod_{j=1}^{n} \binom{\pi(j) + k_j - 1}{k_j}\, 1\{l(\bar k) = n\},

where 1{·} stands for the indicator function.
We are interested in the distribution with respect to ν_n of the linear statistic

h(a) = \sum_{j=1}^{n} c_j k_j(a), \qquad \bar c = (c_1, …, c_n) ∈ ℝ^n.
The number of components in a is such a function; namely, it equals k_1(a) + ⋯ + k_n(a). We refer to [1] for more sophisticated examples.
The present paper is devoted to the variance of h(a), which is a sum of dependent random variables (r.vs), as the relation l(\bar k(a)) = n for each a shows. Estimating it, we propose an approach that overcomes the technical obstacles stemming from this dependence.
In the sequel, the expectations and variances with respect to ν_n will be denoted by E_n and V_n, while, when the probability space (Ω, F, P) is not specified, we will use the notation E and V, respectively. The summation indices i, j, l, k, m, m_1 and m_2 will be natural numbers.
Theorem 1 If and , then
The sketch of the proof is given at the beginning of Section 2.
It is known [1] that, for a fixed j, the r.v. k_j(a) converges in distribution to a r.v. γ_j distributed according to the negative binomial law NB(π(j), q^{-j}). If γ_1, γ_2, … are mutually independent, define the statistic Y_n = c_1γ_1 + ⋯ + c_nγ_n. We shall see that the first sum on the right-hand side in (4) is close to VY_n; therefore, estimating V_n h(\bar c), we use the following quadratic forms:
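Under the parameterisation standard for random multisets, P(γ_j = k) = C(π(j)+k−1, k) q^{−jk}(1 − q^{−j})^{π(j)}, so Eγ_j = π(j)q^{−j}/(1 − q^{−j}) and Vγ_j = π(j)q^{−j}/(1 − q^{−j})². This parameterisation and the helper names below are assumptions for illustration, not quoted from the original; the sketch checks the closed forms against a truncated summation of the pmf:

```python
from math import comb

def nb_pmf(r, x, k):
    # NB(r, x): P(k) = C(r + k - 1, k) * x^k * (1 - x)^r, with 0 < x < 1
    return comb(r + k - 1, k) * x ** k * (1 - x) ** r

def nb_moments(r, x, K=400):
    # mean and variance from a (numerically exact) truncated pmf sum
    mean = sum(k * nb_pmf(r, x, k) for k in range(K))
    second = sum(k * k * nb_pmf(r, x, k) for k in range(K))
    return mean, second - mean ** 2

q, j = 2, 3
r = 2                       # pi(3) = (2^3 - 2)/3 = 2 for q = 2
x = q ** (-j)               # the second NB parameter, x = q^{-j}
mean, var = nb_moments(r, x)
assert abs(mean - r * x / (1 - x)) < 1e-12        # E gamma_j
assert abs(var - r * x / (1 - x) ** 2) < 1e-12    # V gamma_j
```

Since the γ_j are independent, VY_n = Σ_j c_j² Vγ_j, which is the quantity compared with the first sum in (4).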
Theorem 2 If n ≥ 2, then
The inequality becomes an equality for
Corollary 1 If n ≥ 2 and, then
The inequalities are trivial for functions proportional to if because of then. A shift of eliminates this inconvenience. Observe that either of and attain their minimums in at
Theorem 3 If n ≥ 3, then
Both inequalities are sharp.
Corollary 2 If n ≥ 3 and for every
The proofs of the last two theorems presented in Section 2 are built upon the ideas and auxiliary results obtained in [4], [2] and [5].
We first recall known facts about random multisets, which can be found in [3] and [1, Section 2.3]. Let \bar γ(x) = (γ_1(x), γ_2(x), …) be the infinite-dimensional vector of independent r.vs having the negative binomial distributions NB(π(j), x^j), namely,
where 0 < x ≤ q^{-1}. Then γ_j(q^{-1}) = γ_j, which has been introduced in the Introduction. For convenience, we extend \bar k(a) to \bar k(a) := (k_1(a), …, k_n(a), 0, …) and use infinite-dimensional vectors. The latter r.v. is well defined if 0 < x < q^{-1}, since the condition of the Borel–Cantelli lemma is satisfied:
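A hedged sketch of this Borel–Cantelli bound (reconstructed from standard estimates, not quoted from the original): since jπ(j) ≤ q^j by (1) and Markov's inequality gives P(γ_j(x) ≥ 1) ≤ Eγ_j(x),

```latex
\sum_{j\ge 1} P\bigl(\gamma_j(x) \ge 1\bigr)
  \le \sum_{j\ge 1} \frac{\pi(j)\,x^j}{1 - x^j}
  \le \frac{1}{1-x}\sum_{j\ge 1} \frac{(qx)^j}{j} < \infty,
  \qquad 0 < x < q^{-1},
```

because qx < 1 makes the last series convergent.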
Lemma 1 and 0 < x < q−1, then
Proof. Actually, this is Lemma 2.2 in [3] stated there for Fq[t]. The details remain the same in the more general case.
Lemma 2 For a functional such that , we have
Proof. Apply Lemma 1 in the double averaging as follows:
Proof of Theorem 1. It is straightforward. Applying the last lemma with the relevant Ψ, one can easily find the needed mixed moments of k_j(a), 1 ≤ j ≤ n, and, further, the variance of the linear combination h(a).
To prove Theorems 2 and 3, we will apply the following lemmas concerning particular matrices and quadratic forms.
Lemma 3 Let U = ((u_{ij})), i, j ≤ n, be the symmetric matrix with the entries
The spectrum of U is the set {1, −1/2, 1/3, …, (−1)^{n−1}/n}. The eigenvectors corresponding to the first three eigenvalues are proportional to those given below, where r = 1, 2, 3 and, for j ≤ n,
Proof. This is the byproduct of works [4] and [2].
In what follows, we fix the orthogonal basis of ℝ^n comprised of the eigenvectors of U; transposed vectors are marked by the usual superscript.
Lemma 4 If and 1 ≤ m ≤ n and n ≥ 2, then
If n ≥ 3 and
then
Moreover, each bound in (9) and (11) is achieved, respectively, for the eigenvectors with r = 2, 1, 3 that have been defined in Lemma 3.
Proof. Inequalities (9) are seen from Lemma 3 after the substitution (m < n), since the extreme eigenvalues are 1 and −1/2.
After the same substitution, we further examine the quadratic form with the matrix U. Condition (10) singles out the subspace of vectors orthogonal to the first eigenvector. In other words, under (10), only the values of the form attained on the subspace L ⊂ ℝ^n spanned by the remaining eigenvectors count. Hence
Returning to bm, from this we obtain inequality (11).
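The spectral mechanism behind Lemma 4 can be illustrated on a generic symmetric matrix (a random symmetric A stands in for U, whose exact entries are those of Lemma 3): the quadratic form is squeezed between the extreme eigenvalues, and restricting the argument to the orthogonal complement of the top eigenvector, the analogue of condition (10), sharpens the upper bound to the next eigenvalue:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 6
A = rng.standard_normal((n, n))
A = (A + A.T) / 2                        # generic symmetric matrix, a stand-in for U

vals, vecs = np.linalg.eigh(A)           # eigenvalues in ascending order
b = rng.standard_normal(n)
quad = b @ A @ b
norm2 = b @ b

# Rayleigh bounds: lambda_min |b|^2 <= b' A b <= lambda_max |b|^2
assert vals[0] * norm2 - 1e-9 <= quad <= vals[-1] * norm2 + 1e-9

# Removing the top-eigenvector component (the analogue of condition (10))
v_top = vecs[:, -1]
b_perp = b - (b @ v_top) * v_top
quad_perp = b_perp @ A @ b_perp
assert quad_perp <= vals[-2] * (b_perp @ b_perp) + 1e-9   # bound drops to the next eigenvalue
```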
Proof of Theorem 2. After grouping the summands, expression (4) can be rewritten as follows:
Now evidently estimate (5) follows from Lemma 4 with
Moreover, it becomes an equality if we take satisfying
which by the Möbius inversion formula and (1) may be rewritten as (6).
To prove the first assertion of Corollary 1, it suffices to estimate the inner sum in , namely,
Further, using the expression of VYn, we just estimate the remainder:
Plugging both estimates into (5), we obtain the first inequality in Corollary 1 with ≤ instead of <. In fact, the inequality is strict, since Cauchy’s inequality applied in the last step is strict whenever the sequences involved are not proportional; the exceptional proportional case is treated separately.
Proof of Theorem 3. Observe that for every . Hence the right-hand inequality follows from (5) applied to the shifted statistic.
To get the lower bound of variance, we combine (4) and (11). We start with
where
and m ≤ n. By the definition of t_c, the latter sequence satisfies condition (10). Hence, by (11),
This and (4) imply the lower bound. Moreover, the latter is sharp, since Lemma 4 assures this by the choice of a particular sequence.