Symmetric Polynomials

We introduced 𝕂[x] the ring of polynomials in one indeterminate with coefficients over 𝕂. This definition can be extended to a case in which the coefficients vary over a commutative ring (e.g: β„€) rather than a field. In this case we have a ring of polynomials with coefficients in a ring R and will be indicated by R[x].

5.10.1 Definition. The ring R of polynomials in n indeterminates and coefficients over the field 𝕂 is defined inductively in the following way

R1 ≑ 𝕂[x],   Rn ≑ Rn βˆ’ 1[x]

we write Rn ≑ 𝕂[x1, x2, ..., xn].  β–‘

So a polynomials with n indeterminates is thought as a polynomial in one variable, with coefficients in the ring of the polynomials with n-1 variables. An element of 𝕂[x,y] = (𝕂[x])[y] is of the kind

βˆ‘nj=0 fjyj,   fj ∈ 𝕂[x]

like for example

βˆ‘2j=0 (βˆ‘1i=0 aijxi)yj = (a00 + a10x) + (a01 + a11x)y + (a02 + a12 x)y2 =
a00 + a10x + a01y + a02y2 + a11 xy + a12 xy2.

We can represent in general the elements of 𝕂[x1, x2, ..., xn] in the form

f(x1, x2, ..., xn) = βˆ‘ ai1i2...in x1i1 x2i2 ... xnin

They are added and multiplied in the familiar way, making use of the associative, commutative, and distributive laws of addition and multiplication.

The degree of the indeterminate xi is the highest exponent that occurs in the polynomial with a nonzero coefficient.

Given the monomial

x1h1, x2h2, ..., xnhn

the degree of the polynomials is the integer h1 + h2 + ... + hn. The degree of the polynomials f(x1, x2, ..., xn) is defined as the highest degree of its monomials.

A polynomial in general can have different highest degree monomials. It is thus relevant to order the monomials of a polynomial. The common order is the lexicographic ordering: In practice, given an ordered alphabet of variables x1, x2,..., xn you can sort all the monomials by first considering the exponent of x1, x2,..., xn, then the exponent of x2 and so on, until a difference is found between the exponents. At this point, the monomial is considered minor for which the exponent is minor. In symbols

αx1h1x2h1 β‹…β‹…β‹… x3hn,   α ∈ 𝕂,   hi ≥ 0

and

βx1k1x2k1 β‹…β‹…β‹… x3kn,   β ∈ 𝕂,   ki ≥0

two mononomers of the polynomial f(x1, x2,.., xn). We say that

αx1h1x2h1 β‹…β‹…β‹… xnhn < βx1k1x2k1 β‹…β‹…β‹… xnkn

if the least integer m, for which hiki, such that hm < km.

For example

x14x22x3 ≻ x14x2x33   (same total degree, equal x1 exponent, greater x2 exponent)

x14x22x33 β‰Ί x14x23x33   (smaller total degree)

We introduce now an important class of polynomials: symmetric polynomials. Given the definition of permutation we have the following

5.10.2 Definition. A polynomial f(x1,x2,..., xn) ∈ 𝕂[x1, x2, ..., xn] is said symmetric if for any permutation of its indeterminate it remains invariant. β–‘

For example the polynomials in 𝕂[x1, x2, ..., x3]

2x1 + 2x2 + 2x3 βˆ’3x12 βˆ’3x22 βˆ’3x32

x1x23 + x2x13 + x1x3 + x3x13 + x2x33 + x3x23

instead the following are not symmetric polynomials

x12 + x22 βˆ’ x32,   x1 + 3x2 + 7x3

The following symmetric polynomial are said elementary symmetric polynomials (or functions) in 𝕂[x1, x2, ..., xn]

σ1(x1, x2,..., xn) = x1 + x2 + ... + xn = βˆ‘ni=1 xi

σ1(x1, x2,..., xn) = x1 x2 + x1 x3 + ... + xnβˆ’1 xn = βˆ‘nii < jn xixj

σ1(x1,x2,..., xn) = x1 x2x3 + x1 x2 x4 + ...+ xnβˆ’2 xnβˆ’1 xn = βˆ‘nii < j < kn xixjxk

...

σ1(x1,x2,...,xn) = x1x2β‹…β‹…β‹…xn

We shall show, these functions connenct coefficients and roots of a polynomial equation in one variable.

Consider the equation

f(t) = tn + a1tnβˆ’1 + ... + anβˆ’1t + an = 0   (5.10.1)

Let r1, r2, rn be its roots, we can write

f(t) = (t βˆ’ r1) (t βˆ’ r2) β‹…β‹…β‹… (t βˆ’ rn)   (5.10.2)

expanding the factored form of 5.10.2 and setting the coefficients equal to those of (5.10.2), we obtain the following relations

a1 = βˆ’(r1 + r2 + rn)

a2 = r1r2 + r1x3 + ... + r1xn + r2x3 + ... + rnβˆ’1 rn

a3 = βˆ’(r1r2x3 + r1r2x4 + ... + rnβˆ’2xnβˆ’1 xn)

...

an = (βˆ’1)n r1x2β‹…β‹…β‹…rn

The above relations are known as the Viète formulas: they relate the coefficients of a polynomial to sums and products of its roots. We examine the case n = 2.

Vieta's Formula. Quadratic Equations. Let r1 and r2 be the roots of the quadratic equation ax2 + bx + c = 0. Then the two identities

βˆ’b/a = r1 + r2    c/a = r1r2

both hold.

Proof. Write ax2 + bx + c = a(x βˆ’ r1) (x βˆ’ r2) = ax2 βˆ’ax(r1 + r2) + ar1r2. From which the relations above must be satisfied.  β–‘

In general we have σk = (βˆ’1)k σk (r1, r2, ..., rn), hence the coefficients of every monic polynomial of one indeterminate with coefficients in a field, are the elementary symmetric functions of its roots.

The set S of all symmetric polynomials in n variables with coefficients in 𝕂[x1, x2, ..., xn]. Every polynomial f(σ1, σ2, ..., σn) of 𝕂[x1, x2, ..., xn] can be expressed in terms of elementary symmetric polynomials and thus contained in S. We have the following inclusions

We make this precise in the next theorem, showing that the first inclusion is an equality.

𝕂[σ1, σ2, ..., σn] βŠ† S βŠ‚ 𝕂[x1, x2, ..., xn]

5.10.3 Theorem. (Fundamental theorem of symmetric polynomials). Every symmetric polynomial f(x1,x2,..., xn) of 𝕂[x1, x2, ..., xn] can be expressed uniquely in terms of elementary symmetric polynomials with coefficients in 𝕂.

Proof. Let

Ξ±x1h1x2h1 β‹…β‹…β‹… xnhn

the highest degree monomial function occurring in the polynomial function using the lexicographic ordering. Then necessarily h1h2, ≥ ..., ≥ hn; If for example we had h1 < h2, we could exchange x1 with x2, obtaining the monomial

Ξ±x2h1x1h1 β‹…β‹…β‹… xnhn = Ξ±x1h1x2h1 β‹…β‹…β‹… xnhn

which is again a monomial of f(x1,x2, ..., xn), being f by hypothesis symmetric.

Consider now the symmetric polynomial (in x1, x2, ..., xn)

φ1 = ασ1h1βˆ’h2σ2h2βˆ’h3 β‹…β‹…β‹… σnβˆ’1hnβˆ’1βˆ’hn σnhn

The leading term of φ1 is given by the product of α for the leading terms of each σi raised to the power h1 βˆ’ h2, h2βˆ’h3, .., ecc. respectively.

αx1h1βˆ’h2(x1x2)h2βˆ’h3 β‹…β‹…β‹…(x1x2β‹…β‹…β‹…xn)hn = αx1h1x2h2 β‹…β‹…β‹… xnhn

This shows that f and φ1 have the same leading term. Hence f1 = f β€” φ1 has a strictly smaller leading term according to the lexicographic ordering. Note that f1 is symmetric, since f and g are. Now repeat this process, starting with f1 instead of f. Since f1 is symmetric, it has a leading term with coefficient α1 and exponents b1 > Β·Β·Β· > bn. As above, this will give an expression φ2 in the elementary symmetric polynomials such that f1 and φ2 have the same leading term.

This process will terminate if we find some m with fm = 0, for the zero polynomial has no leading term. If, on the other hand, we never had fm = 0, then the above would give an infinite sequence of nonzero polynomials with strictly decreasing leading terms. But we showed above that there are only finitely many monomials strictly smaller than the leading term off. Hence the above process must terminate.

So we've proven that f βˆ’ φ1βˆ’φ2 βˆ’ ... βˆ’φm is the zero polynomial, hence

f = φ1 + φ2 + ... + φm

Each φi, is a product of the Οƒ, to various powers, which proves that f is a polynomial in the elementary symmetric polynomials.

Uniqueness. The uniqueness of the expression follows from the algebraic independence of the elementary symmetric functions.

To prove the uniqueness of the expression we have to prove that if a polynomial (symmetric) can be written in two forms as

βˆ‘ αi1i2...in σ1i1 σ2i2 ... σnin = βˆ‘ βi1i2...in σ1i1 σ2i2 ... σnin

then αi1i2...in = βi1i2...in for all index ij, j=1,...,n. It is sufficient to prove that if φ(σ1,...,σn) = 0 then all coefficients of φ(σ1,...,σn) are zero i.e. the polynomial φ(z1,...,zn) in the indeterminates zi is the zero polynomial. This is equivalent to say that the σi are algebrically independent. In other words we have to prove that

φ(z1,...,zn) ≠ 0   β‡’   φ(σ1,...,σn) ≠ 0

Let Ξ±z1h1z2h1 β‹…β‹…β‹… znhn be the leading term (using the lexicographic order), of φ(z1,...,zn). Substitute zi = σi in φ(z1,..., zn). By expanding σi in terms of the xi (since σi is a function of x) the monomial Ξ±z1h1z2h1 β‹…β‹…β‹… znhn becomes the following polynomial in the xi:

αi1i2...in σ1i1 σ2i2 ... σnin = a(x1 + x2 + ... + xn)h1(x1x2 + ...)h2 β‹…β‹…β‹… (x1β‹…β‹…β‹…xn)hn

The leading term of this polynomial is

ax1h1 + h2 + ... + hn x2h1 + h2 + ... + hnxnhn

which has a non-zero coefficient and cannot vanish by combination with any other monomial. This completes the proof.β–‘

This theorem asserts that the ring of all symmetric polynomials S, in n indeterminates is

S = 𝕂[x1, x2, ..., xn]

Example 5.10.4 Write the symmetric polynomial in 𝕂[x1, x2, x3]

f = βˆ‘ij xi2xj = x12x2 + x12x3 + x22x1 + x22x3 + x32x1 + x32x2

in terms of elementary symmetric polynomials σk.

Solution. We use the same method applied to prove theorem 5.10.3. The highest degree monomial is x12x2 with h1 = 2, h2 = 1, h3 = 0. φ1 = σ12βˆ’1 σ21 = σ1 σ2

f1 = f βˆ’ φ1 = βˆ‘ij xi2xj βˆ’ (x1 + x2 + x3)(x1 x2 + x1x3 + x2x3) = βˆ‘ij xi2xj βˆ’ βˆ‘ij xi2xj βˆ’ 3(x1x2x3)

The highest monomial of f1 is βˆ’3(x1x2x3) so h1 = 1, h2 = 1, h3 = 1. We have

φ2 = βˆ’3σ11βˆ’1 σ21βˆ’1 σ3 = βˆ’3σ3 = βˆ’3(x1x2x3)

and

f2 = f1 βˆ’ φ2 = βˆ’3(x1x2x3) +3(x1x2x3) = 0

thus f = φ1 + φ2 = σ1σ2 βˆ’3σ3.

The ring of polynomials in n indeterminates in a field 𝕂, is an integral domain (see exercise 1).

«Solutions of Cubic and Quartic equations Index Rational functions »