1. Introduction

ALGEBRA

Algebra

2314-4114 2314-4106

Hindawi Publishing Corporation

970157

10.1155/2014/970157

970157

Research Article

On Determinantal Varieties of Hankel Matrices

http://orcid.org/0000-0002-1432-7413

Ballico

Edoardo

http://orcid.org/0000-0002-8074-8232

Elia

Michele

² Dascalescu

Sorin

University of Trento, 38123 Povo, Trento

Italy

unitn.it

Polytechnic of Turin, 10129 Torino

Italy

polito.it

2014

2842014

2014 23 01 2014 09 04 2014 28 4 2014

2014

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Let ℌ be a class of n × n Hankel matrices H A whose entries, depending on a given matrix A , are linear forms in n variables with coefficients in a finite field 𝔽 q . For every matrix in ℌ , it is shown that the varieties specified by the leading minors of orders from 1 to n - 1 have the same number q n - 1 of points in 𝔽 q n . Further properties are derived, which show that sets of varieties, tied to a given Hankel matrix, resemble a set of hyperplanes as regards the number of points of their intersections.

1. Introduction

The representation of hypersurfaces of small degree as determinants is a classical subject. For instance, Hesse [1] discussed the representation of the plane quartic by symmetric determinants, and many different problems have been tackled over the years; see, for example, [2, 3]. An important question, when hypersurfaces are defined over finite fields, is the computation of the number of points. In general this is very difficult, for example, [4], and most frequently only bounds are given. This paper considers hypersurfaces over finite fields, which are defined by determinants of Hankel matrices whose entries are linear forms in the variables. These Hankel matrices are encountered in the proof of certain properties of finite state automata whose state change is governed by tridiagonal matrices [5, 6]. They also occur in the study of some decoding algorithms for error-correcting codes [7, 8].

It is remarkable that, for these determinantal varieties, the exact number of points can in many instances be explicitly found, in terms of the size of the field and the number of variables.

Let p ( z ) = z n + a 1 z n - 1 + a 2 z n - 2 + ⋯ + a n - 1 z + a n be an irreducible polynomial of degree n over 𝔽 q with root α ∈ 𝔽 q n , which is thus an eigenvalue of the companion matrix A which is assumed to have the coefficients of p ( z ) in the last column, all 1 s in the first subdiagonal, and the remaining entries are 0 s [9].

The definition of Hankel matrices that we are dealing with uses the Krylov matrices (1) K ( A , x ) = ( x , A x , A 2 x , … , A n - 1 x ) , K ( A T , y T ) = ( y T , A T y T , ( A T ) 2 y T , … , ( A T ) n - 1 y T ) , where y = ( y 1 , … , y n ) is a row vector of n independent variables and x T = ( x 1 , … , x n ) is a column vector of n independent variables. Every Krylov matrix is nonsingular unless x and y are all-zero vectors, as will be proved later.

Definition 1.

The class ℌ consists of n × n matrices defined as (2) H A = K ( A T , y T ) T K ( A , x ) . These are Hankel matrices, because the entries (3) ( H A ) i j = y A i A j x = y A i + j x are clearly the same whenever the index sum i + j = h is constant. When the vector y is a fixed element y o of 𝔽 q n , the corresponding subclass of ℌ is denoted by ℌ ( y o ) .

Given a polynomial f in the ring 𝔽 q [ x 1 , … , x n ] , the variety 𝒱 ( f ) is defined as the set of points in the affine space 𝔽 q n that annihilate f ; that is, (4) 𝒱 ( f ) = { ( a 1 , a 2 , … , a n ) ∈ 𝔽 p n ∣ f ( a 1 , … , a n ) = 0 } ⊆ 𝔽 p n .

More generally, given s polynomials f 1 , … , f s ∈ 𝔽 p [ x 1 , … , x n ] the variety 𝒱 ( f 1 , … , f s ) is the set of solutions of the system (5) f 1 = 0 , … , f s = 0 .

Note that 𝒱 ( f 1 , … , f s ) = ⋂ i = 1 s ‍ 𝒱 ( f i ) is the intersection and 𝒱 ( f 1 f 2 , … , f s ) = ⋃ i = 1 s ‍ 𝒱 ( f i ) is the union of the varieties 𝒱 ( f 1 ) , … , 𝒱 ( f s ) .

The entries in H A are bilinear forms of the entries in y and x . Let D j ( x 1 , … , x n ) denote the leading minor of order j of a given Hankel matrix H A obtained fixing y = ( b 1 , … , b n ) ∈ 𝔽 q n , and define the determinantal varieties as 𝒱 ( D j ( x 1 , … , x n ) ) ≔ { ( a 1 , … , a n ) ∈ 𝔽 q n : D j ( a 1 , … , a n ) = 0 } . Then, we prove that every polynomial D j ( x 1 , … , x n ) is irreducible over 𝔽 ¯ q (Proposition 10), and obtain the following general result.

Theorem 2.

We have | 𝒱 ( D j ( x 1 , … , x n ) ) | = q n - 1 if j = 1 , … , n - 1 and | 𝒱 ( D n ( x 1 , … x n ) ) | = 1 .

While proving this theorem, the cardinality of certain subsets S ( i , j , n ) ⊂ 𝔽 q n is also computed. The sets S ( i , j , n ) are the zero-loci of all D h ( x 1 , … , x n ) ’s with i ≤ h ≤ j (Theorem 18). That is, every S ( i , j , n ) is specified by j + 1 - i equations of degree higher than 1 ; nevertheless its cardinality q n + i - 1 - j is the same as in the case of the intersections in 𝔽 q n of j + 1 - i distinct hyperplanes. In the next section, preliminary notions, properties, and useful lemmas are collected, while the main results are proved in Section 3.

2. Preliminaries

It is direct to check that v α = ( 1 , α , … , α n - 1 ) is a row eigenvector of A , associated with the eigenvalue α ; that is, v α A = α v α .

Let σ : 𝔽 ¯ q → 𝔽 ¯ q denote the q -Frobenius; that is, set σ ( x ) ≔ x q for all x . The action of σ is extended to vectors and matrices component-wise. Since σ ( A ) = A , because the entries of this matrix are in 𝔽 q , we have (6) σ ( v α A ) = σ ( α v α ) ⟹ σ ( v α ) A = σ ( α ) σ ( v α ) ; that is, all eigenvectors of A are conjugate vectors under σ . Hence the matrix (7) B = [ v α σ ( v α ) σ 2 ( v α ) ⋮ σ n - 1 ( v α ) ] reduces A to diagonal form over 𝔽 q n ; that is, (8) D = B A B - 1 = diag ⁡ ( α , … , σ n - 1 ( α ) ) , D being the diagonal matrix of the eigenvalues of A .

Observe that, writing (8) as A B - 1 = B - 1 D , the columns of B - 1 are column eigenvectors of A . Thus there is a column vector u that allows us to write B - 1 in the form (9) B - 1 = ( u α , σ ( u α ) , … , σ n - 1 ( u α ) ) .

The following lemma is useful to show that every matrix similar to A gives the same class ℌ . Let GL ( n , 𝔽 q ) denote the general linear group of n × n nonsingular matrices with entries in 𝔽 q .

Lemma 3.

Matrices of G L ( n , 𝔽 q ) that have the same characteristic irreducible polynomial p ( z ) are 𝔽 q -similar.

Proof.

Let α be a root of p ( z ) . To prove the lemma it is sufficient to show that any two matrices A and E of GL ( n , 𝔽 q ) , having the same characteristic polynomial p ( z ) , are similar. The previous arguments indicate that there are two 𝔽 q n -matrices B and S of form (7) such that (10) B A B - 1 = D , S E S - 1 = D .

Multiplying the first equation by S - 1 on the left, and by S on the right, we have ( S - 1 B ) A ( S - 1 B ) - 1 = E . Thus, the lemma is proved by showing that S - 1 B is a 𝔽 q -matrix. Since we may always assume that (11) S - 1 = ( s α , σ ( s α ) , … , σ n - 1 ( s α ) ) , where s α is a convenient column eigenvector of E and B is of form (7), we have (12) S - 1 B = s α v α + σ ( s α ) σ ( v α ) + ⋯ + σ n - 1 ( s α ) σ n - 1 ( v α ) = s α v α + σ ( s α v α ) + ⋯ + σ n - 1 ( s α v α ) , which is patently invariant under the action of the automorphism σ ; thus S - 1 B is a 𝔽 q -matrix.

Corollary 4.

A and A T are 𝔽 q -similar.

2.1. <inline-formula> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M133"> <mml:mrow> <mml:mi>B</mml:mi></mml:mrow> </mml:math></inline-formula> and <inline-formula> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M134"> <mml:mrow> <mml:mi>z</mml:mi></mml:mrow> </mml:math></inline-formula>

The equation w = y B - 1 defines an 𝔽 q -linear mapping ψ from 𝔽 q n into 𝔽 q n n . Taking the vector y to be the element of 𝔽 q n (13) y o = ( Tr ⁡ ( 1 ) , Tr ⁡ ( α ) , Tr ⁡ ( α 2 ) , … , Tr ⁡ ( α n - 1 ) ) ∈ 𝔽 q n , we have w o = ( 1,1 , … , 1 ) = ψ ( y o ) . The image Im ⁡ ( ψ ) is the 𝔽 q n -linear span of ( 1,1 … , 1 ) ; hence it is a one-dimensional 𝔽 q n -linear subspace of 𝔽 q n n .

Equation (8) implies that B A = D B ; then, introducing the vector (14) z T = B ( x 1 , x 2 , … , x n ) T = ( z 1 , z 2 , … , z n ) , it is immediate to see that z i = σ i - 1 ( z 1 ) for every i , whenever x T ∈ 𝔽 q n . The linear forms y o A j x T are transformed into linear forms w o D j z T , and matrix H A can be written as (15) H A = [ w o T z w o T D z w o T D 2 z ⋯ w o T D n - 1 z w o T D z w o T D 2 z w o T D 3 z ⋯ w o T D n z w o T D 2 z w o T D 3 z w o T D 4 z ⋯ w o T D n + 1 z ⋮ ⋮ ⋮ ⋯ ⋮ w o T D n - 1 z w o T D n z ⋯ w o T D 2 n - 2 z ] .

Definition 5.

Let D j ( x 1 , … , x n ) denote the leading minor of order j of a given Hankel matrix H A in ℌ . When there is no ambiguity surrounding the variables, this minor is in brief denoted by D j ( 1 ) . The determinant of H A is D n ( x 1 , … , x n ) , or D n ( 1 ) .

Lemma 6.

Let x be a vector of n variables, and let y be a constant vector in 𝔽 q n ; then we have the following. (1)

The determinant D n ( x 1 , … , x n ) of H A is zero if and only if all variables are set equal to zero.

(2)

The matrix H A is a linear combination of n nonsingular matrices, the coefficients of the linear combination being the entries of x .

(3)

Any linear combination of the rows of H A is a set of n linearly independent linear forms.

Proof.

D n ( x 1 , … , x n ) is not zero, because it is the product of two determinants that are different from zero (16) D n ( x 1 , … , x n ) = det ⁡ ( K ( A T , y T ) T K ( A , x ) ) = det ⁡ ( K ( A T , y T ) T ) det ⁡ ( K ( A , x ) ) . In particular, det ⁡ ( K ( A , x ) ) = 0 if and only if x is the all-zero vector; the same observation holds for y . This proves point (1).

Point (2) is proved by writing (17) K ( A T , y T ) T K ( A , x ) = ∑ i = 1 n ‍ x i M i , where the matrices M i have constant entries that depend on y , and taking x j = 1 and x i = 0 for every i ≠ j , that is, x = e j . When x = e j , we have (18) D n ( 0 , … , 1 , … , 0 ) = det ⁡ ( K ( A T , y T ) T ) det ⁡ ( K ( A , e j ) ) = det ⁡ ( M j ) . This implies that det ⁡ ( M j ) ≠ 0 .

Point (3) is proved by noting that D n ( x 1 , … , x n ) = 0 has only one solution, namely, x 1 = ⋯ = x n = 0 , and D n ( x 1 , … , x n ) = 0 identifies linear combinations of the rows of H A . It follows that every linear combination of the rows should have only the all-zero solution; therefore the n entries in every row must be linearly independent, by a theorem of Rouché-Capelli.

By correspondence (14), every D j ( x 1 , x 2 , … , x n ) is transformed into a polynomial Q j ( z 1 , … , z n ) in the variables z i s with the coefficients in 𝔽 q n .

2.2. Auxiliary Results

Let V ( a 1 , a 2 , … , a j ) be a Vandermonde determinant of order j identified by the j -tuple ( a 1 , a 2 , … , a j ) .

Definition 7.

For every triple of integers j , i , and t such that t ≥ 2 i - 1 ≥ 2 j - 1 > 0 , the subset S ( j , i , t ) of 𝔽 q t is defined as (19) S ( j , i , t ) ≔ { ( a 1 , … , a t ) ∈ 𝔽 q t : D h ( a 1 , … , a t ) = 0 } mmmmmmmmmmmmmmmmm ∀ h ∈ { j , … , i } .

Definition 8.

The set ℭ j n is defined to be the collection of ( n j ) subsets, where each subset consists of the unordered collection of j distinct integers from the set { 1,2 , … , n } .

Every subset 𝔥 i = { h 1 , … , h j } defines a mapping τ i ( ℓ ) = h ℓ from the set { 1,2 , … , j } into { 1,2 , … , n } .

Lemma 9.

Consider a Hankel matrix H A , as defined in (2) with y = y o ∈ 𝔽 q n ; the leading minors D j ( x 1 , … , x n ) , j = 1 , … , n are multivariate homogeneous polynomials of degree j , which may be written over 𝔽 q n , in the form (20) D j ( x 1 , … , x n ) = Q j ( z 1 , … , z n ) = ∑ { h 1 , h 2 , … , h j } ∈ C j n ‍ V ( σ h 1 - 1 ( α ) , … , σ h j - 1 ( α ) ) 2 z h 2 ⋯ z h j , where the summation is extended to all combinations of the n integers { 1,2 , … , n } , taking j at a time, and the coefficients of the monomials z h 1 ⋯ z h j are squares of Vandermonde determinants.

Proof.

In matrix (15), the bilinear forms w D i + j - 2 z have the explicit expression (21) ∑ h = 1 n ‍ z h σ h - 1 ( α i + j - 2 ) = ∑ h = 1 n ‍ z h σ h - 1 ( α i - 1 ) σ h - 1 ( α j - 1 ) , where i is the row index and j is the column index. Each column is a linear combination of columns with coefficients z h such that all columns with the same coefficient z h are proportional. Matrix (15) can be written as a sum of the form (22) H A = ∑ h = 1 n ‍ z h σ h - 1 ( Γ ) , where Γ is the n × n matrix (23) [ 1 α ⋯ α n - 1 α α 2 ⋯ α n ⋮ ⋮ ⋮ ⋮ α n - 1 α n ⋯ α 2 n - 1 ] , which has rank 1 , since every row is proportional to the first row, and the same holds for the columns. The leading minor D j ( x 1 , … , x n ) is computed by writing the determinant as a sum of n j determinants, which contain a single variable z k in every column, determinants with repeated variables are 0 , because of the previous observation that their corresponding columns are proportional, and in the remaining determinants the corresponding variable is collected from each column.

The coefficient of the monomial z h 1 z h 2 ⋯ z h j is obtained as follows. Let u i = σ h i - 1 ( α ) . Then the coefficient of z h 1 , … , z h j is equal to (24) ∑ τ ∈ S j ‍ | 1 u τ ( 2 ) ⋯ u τ ( j ) u τ ( 1 ) u τ ( 2 ) 2 ⋯ u τ ( j ) j ⋮ ⋮ ⋮ ⋮ u τ ( 1 ) j - 1 u τ ( 2 ) j ⋯ u τ ( j ) 2 j - 1 | = ∑ τ ∈ S j ‍ sgn ⁡ ( τ ) ∏ ℓ = 1 j ‍ u τ ( ℓ ) ℓ - 1 | 1 1 ⋯ 1 u 1 u 2 2 ⋯ u j j ⋮ ⋮ ⋮ ⋮ u 1 j - 1 u 2 j ⋯ u j 2 j - 1 | . Collecting the common factor, the remaining summation is exactly the same determinant; thus we have (25) | 1 1 ⋯ 1 u 1 u 2 2 ⋯ u j j ⋮ ⋮ ⋮ ⋮ u 1 j - 1 u 2 j ⋯ u j 2 j - 1 | 2 = V ( σ h 1 - 1 ( α ) , … , σ h j - 1 ( α ) ) 2 , which gives (26) Q j ( z 1 , … , z n ) = ∑ 𝔥 ∈ ℭ j n ‍ V ( σ h 1 - 1 ( α ) , … , σ h j - 1 ( α ) ) 2 z h 1 z h 2 ⋯ z h j , with the summation extended to every subset 𝔥 = { h 1 , … , h j } of ℭ j n , and this concludes the proof.

Proposition 10.

The product ∏ i = 1 n - 1 ‍ Q i ( z 1 , … , z n ) is not identically zero over 𝔽 q n .

Furthermore, the leading minors D j ( x 1 , … , x n ) , j = 1 , … n , are irreducible degree- j polynomials over 𝔽 ¯ q .

Proof.

As a consequence of (26), every Q j ( z 1 , … , z n ) is irreducible over 𝔽 q n . Further, observing that each variable z i occurs at degree 1 in any Q j ( z 1 , … , z n ) , it has maximum degree n - 1 in the product polynomial 𝔇 = ∏ j = 1 n - 1 ‍ Q j ( z 1 , … , z n ) . Therefore 𝔇 is not identically zero in 𝔽 q n n , because n - 1 is certainly less than q n for any q .

To prove the second statement, fix j ∈ { 1 , … , n - 1 } . In this step it is checked that g ≔ D j ( x 1 , … , x n ) ∈ 𝔽 q [ x 1 , … , x n ] is irreducible over 𝔽 ¯ q . It is only necessary to use the fact that g ∈ 𝔽 q [ x 1 , … , x n ] is a homogeneous polynomial of degree j < n which is irreducible over 𝔽 q n . Assume that g is reducible over 𝔽 ¯ q and call h ∈ 𝔽 ¯ q an irreducible factor of minimal degree x < j . Let 𝔽 q e be the minimal extension of 𝔽 q in which h is defined. Since g is irreducible, the polynomials σ i ( h ) , 1 ≤ i < e , obtained by applying the Frobenius σ i to h are nonproportional irreducible factors of g . Hence deg ⁡ ( g ) ≥ ( e - 1 ) x ≥ n x ≥ n , which is a contradiction.

Remark 11.

The determinant D n ( x 1 , … , x n ) is found to be (27) D n ( x 1 , … , x n ) = Δ 2 ∏ u = 1 n ‍ z u = p ( 1 ) 2 δ 2 ∏ u = 1 n ‍ [ σ u - 1 ( v α ) x ] , where δ is the discriminant of p ( z ) , and the product involving x can be seen as a norm in the field 𝔽 q n ; therefore D n ( x 1 , … , x n ) is irreducible over 𝔽 q .

Lemma 12.

The variety 𝒱 ( D 1 ( 1 ) ) ∩ 𝒱 ( D 2 ( 1 ) ) ∩ ⋯ ∩ 𝒱 ( D n - 1 ( 1 ) ) has cardinality q over 𝔽 q .

Proof.

Equation (17) shows that any D j ( 1 ) is a polynomial of degree j with coefficients in 𝔽 q ; furthermore, every entry is a linear form with coefficients in 𝔽 q . Hence D 1 ( 1 ) = 0 implies u 1 = 0 ; in turn D 2 ( 1 ) = 0 implies u 2 = 0 , given that u 1 = 0 and arguing recursively; finally, D n - 1 ( 1 ) = 0 implies u n - 1 = 0 , while the variable u n is free and may assume q values. The conclusion follows.

Lemma 13.

Let b i ≠ 0 , 1 ≤ i ≤ j , and assume n ≥ 2 j - 1 . Then | 𝒱 ( D 1 ( 1 ) - b 1 ) ∩ ⋯ ∩ 𝒱 ( D j ( 1 ) - b j ) | = q n - j .

Proof.

We use induction on j , the case j = 1 being obvious. The inductive assumption in 𝔽 q 2 j - 3 gives | 𝒱 ( D 1 ( 1 ) - b 1 ) ∩ ⋯ ∩ 𝒱 ( D j - 1 ( 1 ) - b j - 1 ) | = q j - 2 . Fix ( a 1 , … , a 2 j - 3 ) ∈ 𝔽 q 2 j - 3 with D i ( 1 ) = b i for all 1 ≤ i ≤ j - 1 . Since D j - 1 ( a 1 , a 2 , … , a 2 j - 3 ) ≠ 0 , for all a 2 j - 2 , c ∈ 𝔽 q there is a unique a 2 j - 1 ∈ 𝔽 q such that D j ( a 1 , … , a 2 j - 1 ) = c . Take c = b j . This completes the proof.

3. Main Results Proposition 14.

The equality | V ( D j ( 1 ) ) | = | V ( D n - j ( 1 ) ) | holds for every 1 ≤ j ≤ n - 1 .

Proof.

In the proof of Lemma 9, it was shown that (28) D j ( 1 ) = D j ( x 1 , … , x n ) = Q j ( z 1 , … , z n ) , with z i = ∑ j = 1 n ‍ σ j - 1 ( α ) x j . Further, it was noted in that lemma that z i = σ i - 1 ( z 1 ) for every i , whenever every x j ∈ 𝔽 q .

The relation z = B - 1 x establishes a one-to-one correspondence between 𝔽 q n and a subspace of dimension 1 of 𝔽 q n n , and further x = B z . There thus exists a one-to-one correspondence between the zeros of D j ( 1 ) in 𝔽 q n and the zeros of Q j ( z 1 , … , z n ) in the one-dimensional subspace of 𝔽 q n n , which is the image of 𝔽 q n . Referring to (26), which yields the representation Q j ( z 1 , … , z n ) of D j ( 1 ) , assuming z 1 ≠ 0 , considering the change of variables (29) z i = 1 t i ∏ ℓ ≠ i - 1 n - 1 ‍ ( σ i - 1 ( α ) - σ ℓ ( α ) ) 2 = 1 𝔡 i t i , and recalling that the coefficients of the monomials are squares of Vandermonde determinants V ( σ h 1 - 1 ( α ) , σ h 2 - 1 ( α ) , … , σ h j - 1 ( α ) ) 2 , we obtain (30) Q j ( z 1 , … , z n ) = 1 δ 2 ∏ i = 1 n ‍ t i Q n - j ( t 1 , … , t n ) , where δ is the discriminant of the polynomial with root α . The variety 𝒱 ( D j ( 1 ) ) is obtained by considering t 1 = v ( α ) x T and the other variables as t i = σ i - 1 ( t 1 ) , i = 2 , … , n ; thus ∏ t i = 0 only when every t i = 0 , ∀ i ; further δ ≠ 0 . Finally, we have the chain of bijections (31) x ⟷ z T = B - 1 x T ⟷ z 1 ⟷ t 1 = { 1 𝔡 1 z 1 if z 1 ≠ 0 0 if z 1 = 0 ⟷ y T ⟷ x ~ T = B t T . In conclusion, this equation shows an explicit one-to-one mapping between the zeros ( a 1 , … , a n ) of D j ( 1 ) and the zeros ( a ~ 1 , … , a ~ n ) of D n - j ( 1 ) , which implies | 𝒱 ( D j ( 1 ) ) | = | 𝒱 ( D n - j ( 1 ) ) | .

In the following example, the procedure for obtaining a point of 𝒱 ( D n - j ( 1 ) ) from a point of 𝒱 ( D j ( 1 ) ) is explicitly illustrated.

Example 15.

Consider the irreducible polynomial p 7 ( z ) = z 7 + z 4 + z 3 + z 2 + 1 of degree n = 7 over 𝔽 3 with the transpose companion matrix (32) A 7 = [ 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 - 1 0 - 1 - 1 - 1 0 0 ] . Taking y = [ 1,0 , 0,0 , 0,0 , 0 ] , and x T = [ x 1 , x 2 , x 3 , x 4 , x 5 , x 6 , x 7 ] , the Hankel matrix (17) becomes (33) H A 7 = [ x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 2 x 3 x 4 x 5 x 6 x 7 ξ 8 x 3 x 4 x 5 x 6 x 7 ξ 8 ξ 9 x 4 x 5 x 6 x 7 ξ 8 ξ 9 ξ 10 x 5 x 6 x 7 ξ 8 ξ 9 ξ 10 ξ 11 x 6 x 7 ξ 8 ξ 9 ξ 10 ξ 11 ξ 12 x 7 ξ 8 ξ 9 ξ 10 ξ 11 ξ 12 ξ 13 ] , where (34) ξ 8 = - x 1 - x 3 - x 4 - x 5 , ξ 9 = - x 2 - x 4 - x 5 - x 6 , ξ 10 = - x 3 - x 5 - x 6 - x 7 , ξ 11 = x 1 + x 3 + x 5 - x 6 - x 7 , ξ 12 = x 1 + x 2 + x 3 - x 4 + x 5 + x 6 - x 7 , ξ 13 = x 1 + x 2 - x 3 - x 4 + x 5 + x 6 - x 7 . The forms D 3 ( 1 ) and D 4 ( 1 ) of degrees 3 and 4 , respectively, are (35) D 3 ( 1 ) = x 1 x 3 x 5 - x 1 x 4 2 - x 2 2 x 5 - x 2 x 3 x 4 - x 3 3 , D 4 ( 1 ) = x 1 x 3 x 5 x 7 - x 1 x 3 x 6 2 - x 1 x 4 2 x 7 - x 1 x 4 x 5 x 6 - x 1 x 5 3 D 4 ( 1 ) - x 2 2 x 5 x 7 + x 2 2 x 6 2 - x 2 x 4 x 3 x 7 + x 2 x 4 2 x 6 + x 2 x 5 x 3 x 6 D 4 ( 1 ) - x 2 x 4 x 5 2 - x 3 3 x 7 - x 3 2 x 4 x 6 + x 3 2 x 5 2 + x 4 4 . Given a point x T = [ 0,1 , - 1 , - 1,0 , 0,1 ] ∈ 𝔽 3 7 which is a zero of D 3 ( 1 ) , a zero of D 4 ( 1 ) is obtained as follows.

Compute the vector z = B 7 - 1 x T whose first component is z 1 = 1 + α 2 + α 3 + α 6 ∈ 𝔽 3 7 7 , and the remaining entries are obtained as σ ℓ ( z 1 ) = 1 + α 3 ℓ - 1 2 + α 3 ℓ - 1 3 + α 3 ℓ - 1 6 , ℓ = 1 , … , 6 ; then compute (36) 𝔡 2 = ∏ i = 1 6 ‍ ( α - α 3 i ) 2 = α + α 3 + α 4 - α 5 , t 1 = 1 z 1 𝔡 2 = - 1 - α + α 2 + α 4 - α 5 , and construct the vector t = [ t 1 , σ ( t 1 ) , … , σ 6 ( t 1 ) ] ∈ 𝔽 3 7 7 . Finally, a zero of D 4 ( 1 ) is obtained as (37) B 7 t T = [ 1 , - 1,0 , 1,1 , - 1 , - 1 ] T ∈ 𝔽 3 7 , where B 7 is the matrix whose columns are the eigenvectors of A 7 in 𝔽 3 7 7 (38) B 7 = [ 1 1 1 1 1 1 1 α α 3 α 3 2 α 3 3 α 3 4 α 3 5 α 3 6 α 2 α 2 · 3 α 2 · 3 2 α 2 · 3 3 α 2 · 3 4 α 2 · 3 5 α 2 · 3 6 α 3 α 3 · 3 α 3 · 3 2 α 3 · 3 3 α 3 · 3 4 α 3 · 3 5 α 3 · 3 6 α 4 α 4 · 3 α 4 · 3 2 α 4 · 3 3 α 4 · 3 4 α 4 · 3 5 α 4 · 3 6 α 5 α 5 · 3 α 5 · 3 2 α 5 · 3 3 α 5 · 3 4 α 5 · 3 5 α 5 · 3 6 α 6 α 6 · 3 α 6 · 3 2 α 6 · 3 3 α 6 · 3 4 α 6 · 3 5 α 6 · 3 6 ] .

Remark 16.

Since the n forms in the first row of (17) are linearly independent, by Lemma 6, a change of variables from x 1 , … , x n to u 1 , … , u n takes a matrix H A to the form (39) H A = [ u 1 u 2 u 3 ⋯ u n u 2 u 3 ⋯ u n ℓ 1 u 3 ⋯ u n ℓ 1 ℓ 2 ⋮ u n ℓ 1 ℓ 2 ⋯ ℓ n - 1 ] , where the n variables u i s are free, and every ℓ h is a linear form in the u i s.

Fix the integers k ≥ 1 and j ≥ 1 , and let D j ( k ) denote the j × j determinant of a Hankel matrix with free variable entries u i , i = k , … , 2 j - 2 + k (40) | u k u k + 1 ⋯ u k + j - 2 u k + j - 1 u k + 1 u k + 2 ⋯ u k + j - 1 u k + j ⋮ ⋮ ⋮ u k + j - 1 u k + j ⋯ u k + 2 j - 3 u k + 2 j - 2 | . And set D 0 ( 1 ) = 1 by definition.

Proposition 17.

Let m , k be natural integers. Let H ( 2 k + 2 m - 1 ) be a ( 2 k + 2 m - 1 ) × ( 2 k + 2 m - 1 ) Hankel matrix with first row ( u 1 , … , u 2 k + 2 m - 1 ) . Let E ( k , m ) be the set of points ( b 1 , … , b 2 k + 2 m - 1 ) ∈ 𝔽 q 2 k + 2 m - 1 with the same first 2 k - 1 coordinates b i = a i , i = 1 , … , 2 k - 1 such that the minor D k ( b 1 , … , b 2 k + 2 m - 1 ) ≠ 0 , and the minors D i ( b 1 , … , b 2 k + 2 m - 1 ) = 0 for all i ∈ { k + 1 , … , k + m } . Then E ( k , m ) has cardinality q m .

Proof.

Observe that the first row ( u 1 , … , u 2 k + 2 m - 1 ) of the Hankel matrix H ( 2 k + 2 m - 1 ) completely specifies the leading ( k + m ) × ( k + m ) Hankel submatrix H ( k + m ) , and consequently also every minor D i ( 1 ) for i = 1 , … , k + m .

Let R j ( u 1 , … , u 2 k + 2 m - 1 ) denote the j th row of H ( k + m ) . Let A ( k + 1 , m ) be the subset of 𝔽 q 2 m consisting of all ( b 2 k , … , b 2 k + 2 m - 1 ) ∈ 𝔽 q 2 m such that R k + 1 ( a 1 , … , a 2 k - 1 , b 2 k , … , b 2 k + 2 m - 1 ) is linearly dependent on R 1 ( b 1 , … , a 2 k - 1 , b 2 k , … , b 2 k + 2 m - 1 ) , … , R k ( b 1 , … , a 2 k - 1 , b 2 k , … , b 2 k + 2 m - 1 ) .

The case m = 1 is easily settled. Consider the identity (41) D k + 1 ( a 1 , … , a 2 k - 1 , u 2 k , u 2 k + 1 , … , u 2 k + 2 m - 1 ) = u 2 k + 1 D k ( a 1 , … , a 2 k - 1 , u 2 k , u 2 k + 1 , … , u 2 k + 2 m - 1 ) + B ( a 1 , … , a 2 k - 1 , u 2 k ) , for some B ( u 1 , … , u 2 k ) ∈ 𝔽 q [ u 1 , … , u 2 k ] , and take u 2 k = a 2 k ∈ 𝔽 q ; it follows that (42) D k + 1 ( a 1 , … , a 2 k - 1 , a 2 k , a 2 k + 1 , u 2 k + 2 , … , u 2 k + 2 m - 1 ) = 0 , for a unique a 2 k + 1 ∈ 𝔽 q because D k ( 1 ) ≠ 0 by hypothesis. Since u 2 k is any element a 2 k ∈ 𝔽 q (i.e., it may assume q values in 𝔽 q , while u 2 k + 1 is uniquely specified), the assertion | E ( k , 1 ) | = q is proved.

Now, assume m ≥ 2 , and note that row k + 1 is uniquely determined up to position k + 1 as a linear combination of the above rows up to the same position k + 1 . Extend this linear combination to uniquely determine the remaining elements of the H ( k + m ) Hankel matrix.

The assertion | E ( k , m ) | = q m is a consequence of the following claims.

Claim 1. One has | A ( k + 1 , m ) | = q m .

Consider a vector ( b 2 k , … , b 2 k + 2 m - 1 ) ∈ 𝔽 q 2 m , which belongs to A ( k + 1 , m ) if and only if there are c i ∈ 𝔽 q , 1 ≤ i ≤ k , such that (43) R k + 1 ( a 1 , … , a 2 k - 1 , b 2 k , … , b 2 k + 2 m - 1 ) = ∑ i = 1 k ‍ c i R i ( a 1 , … , a 2 k - 1 , b 2 k , … , b 2 k + 2 m - 1 ) , since D k ( a 1 , … , a 2 k - 1 , b 2 k , … , b 2 k + 2 m - 1 ) ≠ 0 , and this same condition implies that the coefficients c 1 , … , c k are uniquely determined by the entries of the vector ( a 1 , … , a 2 k - 1 ) and by the entry b 2 k = a 2 k in row R k + 1 ( a 1 , … , a 2 k - 1 , b 2 k , … , b 2 k + 2 m - 1 ) .

We know that for each a 2 k ∈ 𝔽 q there is a unique a 2 k + 1 such that D k + 1 ( a 1 , … , a 2 k - 1 , a 2 k , a 2 k + 1 ) = 0 .

Fix u 2 k = a 2 k and hence fix c 1 , … , c k , and u 2 k + 1 = a 2 k + 1 . The values of u 2 k + i = a 2 k + i , i = 2 , … , m are uniquely specified by the linear combination condition, jointly with the Hankel matrix properties. Since the remaining u 2 k + m + i , i = 1 , … , m - 1 are free, the cardinality of A ( k + 1 , m ) is precisely q m .

Claim 2. Since the first k + 1 rows of the Hankel matrix H ( k + m ) are linearly dependent, it follows that D j ( a 1 , … , a 2 k + m , b 2 k + m + 1 , … , b 2 k + 2 m - 1 ) = 0 for every j ∈ { k + 2 , … , k + m } .

To conclude the proof, it remains to show that the ( k + 1 ) th row, constructed as above, is the only possible ( k + 1 ) th row that leads to a Hankel matrix satisfying the hypotheses of the proposition. This property is the third claim.

Claim 3. If b 2 k + i ≠ a 2 k + i , for every i = 2 , … , m , then D x + k ( a 1 , … , a 2 k - 1 , a 2 k , a 2 k + 1 , b 2 k + 2 , … , b 2 k + 2 m - 1 ) ≠ 0 for every x ∈ { 2 , … , m } .

Let x ≥ 2 be the smallest integer such that the ( k + x ) × ( k + x ) Hankel matrix H ( k + x ) , with leading minor D k ( 1 ) ≠ 0 , has the whole ( k + 1 ) th row that is not a linear combination of the above rows: this means that the entry b k + x is different from a k + x .

Let c 1 , … , c k be the coefficients of the linear combination of the first k rows of H ( k + 1 + x ) yielding the row ( a k , … , a k + 1 + x ) .

From every h th row of the matrix H ( k + x ) , with h ≥ k + 1 , the linear combination of the first k rows may be subtracted to get a row whose entries with index y are zero for every y = 1 , … , h + k + 1 . The counter-diagonal entries between row k + 1 and the bottom row are b 2 k + x - a 2 k + x . The determinant of H ( k + x ) and that of the modified matrix are the same; using the generalized Laplace formula for the expansion of a determinant with respect to the last x + 1 rows, we get D k + 1 + x ( 1 ) = D k ( 1 ) ( a 2 k + x - b 2 k + x ) x ≠ 0 .

The contradiction forces b 2 k + x = a 2 k + x , which concludes the proof.

Theorem 18.

For all integers n , j , and i such that n ≥ 2 j - 1 ≥ 2 i - 1 > 0 we have (44) | S ( j , i , n ) | = q n - 1 - j + i .

Proof.

For all integers n ≥ t ≥ 2 j - 1 we have | S ( j , i , n ) | = | S ( j , i , t ) | · q n - t , because in each determinant D h ( 1 ) , h ≤ j , the variables x e , e ≥ 2 j , do not occur; hence Lemma 12 gives the case i = 1 for all j . We may thus assume i ≥ 2 . Induction will be applied to j , the case j = 2 being obvious. The inductive assumption gives (45) | S ( h , h , 2 h - 1 ) ∖ S ( h , h - 1,2 h - 1 ) | = ( q - 1 ) q 2 h - 3 , for all h < j . Notice that S ( j , i , n ) = S ( j , 1 , n ) ⊔ { ⊔ h = 2 i ( S ( j , h , n ) ∖ S ( j , h - 1 , n ) ) } . Lemma 9 gives | S ( j , h , n ) ∖ S ( j , h - 1 , n ) | = ( q - 1 ) q 2 h - 3 q n - 2 j + 1 + j - h . Hence (46) | S ( j , i , n ) | = q n - j + ( q - 1 ) ∑ h = 2 i ‍ q n - j + h - 2 = q n - j - 1 + i .

Remark 19.

Take integers n , j , and i such that n ≥ 2 j - 1 ≥ 2 i - 1 ≥ 3 . Applying Theorem 18, first for ( n , j , i ) and then for ( n , j , i - 1 ) , gives | S ( j , i , n ) ∖ S ( j , i + 1 , n ) | = q n - j - 2 + i ( q - 1 ) .

Proof of Theorem <xref ref-type="statement" rid="thm1">2</xref>.

We know by Lemma 6 that | 𝒱 ( D j ( 1 ) ) | = | 𝒱 ( D n - j ( 1 ) ) | , for every 1 ≤ j ≤ n - 1 ; the proof is completed by showing that | 𝒱 ( D j ( 1 ) ) | = q n - 1 for every 1 ≤ j ≤ ⌊ n / 2 ⌋ . This is true by the case i = j of Theorem 18. 𝒱 ( D n ( 1 ) ) has only one point, because D n ( 1 ) is an irreducible polynomial over 𝔽 q .

Corollary 20.

Given j < n , if g c d { j , q - 1 } = 1 , the varieties 𝒱 ( D j ( 1 ) - b ) , ∀ b ∈ 𝔽 q have cardinality q n - 1 .

Proof.

Performing the substitution x i = t x i ′ gives the equation t j D j ( x 1 ′ , … , x n ′ ) - b = 0 . By hypothesis gcd { j , q - 1 } = 1 , the equation t j = b always has a solution in 𝔽 q , since we have t = b μ with μ j = 1 mod q - 1 . Thus all varieties with b ≠ 0 have the same cardinality, say n b = | 𝒱 ( D j ( 1 ) - 1 ) | , and the equation (47) q n - 1 + ( q - 1 ) n b = q n implies n b = q n - 1 .

Note that, when j has some factor in common with q - 1 , the cardinalities of 𝒱 ( D j ( 1 ) - b ) are close to q n - 1 but depend on b . It is an interesting problem to determine how close these cardinalities are to q ( n - 1 ) .

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Hesse

Ueber determinanten und ihre anwendung in der geometrie, insbesondere auf Curven vieter ordung

Journal für die Reine und Angewandte Mathematik 1855 49 243 264

Beauville

Determinantal hypersurfaces

The Michigan Mathematical Journal 2000 48 39 64

10.1307/mmj/1030132707

MR1786479

ZBL1076.14534

Zaare-Nahandi

Usefi

A note on minors of a generalized Hankel matrix

International Mathematical Journal 2003 3 11 1197 1201

MR2006580

ZBL1173.13307

Weil

Courbes Algébriques et Variétés Abéliennes 1971

Paris, France

Hermann

Cattell

Muzio

J. C.

Analysis of one-dimensional linear hybrid cellular automata over 𝔽 q

IEEE Transactions on Computers 1996 45 7 782 792

10.1109/12.508317

MR1423911

ZBL1055.68548

Cattell

Muzio

J. C.

Synthesis of one-dimensional linear hybrid cellular automata

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 1996 15 3 325 335

2-s2.0-0030104631

10.1109/43.489103

Comer

M. T.

Kaltofen

E. L.

On the Berlekamp/Massey algorithm and counting singular Hankel matrices over a finite field

Journal of Symbolic Computation 2012 47 4 480 491

10.1016/j.jsc.2011.09.008

MR2890883

ZBL1242.65074

Imamura

Yoshida

A simple derivation of the Berlekamp-Massey algorithm and some applications

IEEE Transactions on Information Theory 1987 33 1 146 150

10.1109/TIT.1987.1057261

MR875544

ZBL0624.94014

Horn

R. A.

Johnson

C. R.

Matrix Analysis 1980

New York, NY, USA

Dover