Symmetric Polynomials

We introduced 𝕂[x] the ring of polynomials in one indeterminate with coefficients over 𝕂. This definition can be extended to a case in which the coefficients vary over a commutative ring (e.g: β„€) rather than a field. In this case we have a ring of polynomials with coefficients in a ring R and will be indicated by R[x].

5.10.1 Definition. The ring R of polynomials in n indeterminates and coefficients over the field 𝕂 is defined inductively in the following way

R1 ≑ 𝕂[x],   Rn ≑ Rn βˆ’ 1[x]

we write Rn ≑ 𝕂[x1, x2, ..., xn].  ░

So a polynomials with n indeterminates is thought as a polynomial in one variable, with coefficients in the ring of the polynomials with n-1 variables. An element of 𝕂[x,y] = (𝕂[x])[y] is of the kind

βˆ‘nj=0 fjyj,   fj ∈ 𝕂[x]

like for example

βˆ‘2j=0 (βˆ‘1i=0 aijxi)yj = (a00 + a10x) + (a01 + a11x)y + (a02 + a12 x)y2 =
a00 + a10x + a01y + a02y2 + a11 xy + a12 xy2.

We can represent in general the elements of 𝕂[x1, x2, ..., xn] in the form

f(x1, x2, ..., xn) = βˆ‘ ai1i2...in x1i1 x2i2 ... xnin

They are added and multiplied in the familiar way, making use of the associative, commutative, and distributive laws of addition and multiplication.

The degree of the indeterminate xi is the highest exponent that occurs in the polynomial with a nonzero coefficient.

Given the monomial

x1h1, x2h2, ..., xnhn

the degree of the polynomials is the integer h1 + h2 + ... + hn. The degree of the polynomials f(x1, x2, ..., xn) is defined as the highest degree of its monomials.

A polynomial in general can have different highest degree monomials. It is thus relevant to order the monomials of a polynomial. The common order is the lexicographic ordering: In practice, given an ordered alphabet of variables x1, x2,..., xn you can sort all the monomials by first considering the exponent of x1, x2,..., xn, then the exponent of x2 and so on, until a difference is found between the exponents. At this point, the monomial is considered minor for which the exponent is minor. In symbols

Ξ±x1h1x2h1 β‹…β‹…β‹… x3hn,   Ξ± ∈ 𝕂,   hi β‰₯ 0

and

Ξ²x1k1x2k1 β‹…β‹…β‹… x3kn,   Ξ² ∈ 𝕂,   ki β‰₯0

two mononomers of the polynomial f(x1, x2,.., xn). We say that

Ξ±x1h1x2h1 β‹…β‹…β‹… xnhn < Ξ²x1k1x2k1 β‹…β‹…β‹… xnkn

if the least integer m, for which hi β‰  ki, such that hm < km.

For example

x14x22x3 ≻ x14x2x33   (same total degree, equal x1 exponent, greater x2 exponent)

x14x22x33 β‰Ί x14x23x33   (smaller total degree)

We introduce now an important class of polynomials: symmetric polynomials. Given the definition of permutation we have the following

5.10.2 Definition. A polynomial f(x1,x2,..., xn) ∈ 𝕂[x1, x2, ..., xn] is said symmetric if for any permutation of its indeterminate it remains invariant. ░

For example the polynomials in 𝕂[x1, x2, ..., x3]

2x1 + 2x2 + 2x3 βˆ’3x12 βˆ’3x22 βˆ’3x32

x1x23 + x2x13 + x1x3 + x3x13 + x2x33 + x3x23

instead the following are not symmetric polynomials

x12 + x22 βˆ’ x32,   x1 + 3x2 + 7x3

The following symmetric polynomial are said elementary symmetric polynomials (or functions) in 𝕂[x1, x2, ..., xn]

Οƒ1(x1, x2,..., xn) = x1 + x2 + ... + xn = βˆ‘ni=1 xi

Οƒ1(x1, x2,..., xn) = x1 x2 + x1 x3 + ... + xnβˆ’1 xn = βˆ‘ni≀i < j ≀ n xixj

Οƒ1(x1,x2,..., xn) = x1 x2x3 + x1 x2 x4 + ...+ xnβˆ’2 xnβˆ’1 xn = βˆ‘ni ≀ i < j < k ≀ n xixjxk

...

Οƒ1(x1,x2,...,xn) = x1x2β‹…β‹…β‹…xn

We shall show, these functions connenct coefficients and roots of a polynomial equation in one variable.

Consider the equation

f(t) = tn + a1tnβˆ’1 + ... + anβˆ’1t + an = 0   (5.10.1)

Let r1, r2, rn be its roots, we can write

f(t) = (t βˆ’ r1) (t βˆ’ r2) β‹…β‹…β‹… (t βˆ’ rn)   (5.10.2)

expanding the factored form of 5.10.2 and setting the coefficients equal to those of (5.10.2), we obtain the following relations

a1 = βˆ’(r1 + r2 + rn)

a2 = r1r2 + r1x3 + ... + r1xn + r2x3 + ... + rnβˆ’1 rn

a3 = βˆ’(r1r2x3 + r1r2x4 + ... + rnβˆ’2xnβˆ’1 xn)

...

an = (βˆ’1)n r1x2β‹…β‹…β‹…rn

The above relations are known as the Viète formulas: they relate the coefficients of a polynomial to sums and products of its roots. We examine the case n = 2.

Vieta's Formula. Quadratic Equations. Let r1 and r2 be the roots of the quadratic equation ax2 + bx + c = 0. Then the two identities

βˆ’b/a = r1 + r2    c/a = r1r2

both hold.

Proof. Write ax2 + bx + c = a(x βˆ’ r1) (x βˆ’ r2) = ax2 βˆ’ax(r1 + r2) + ar1r2. From which the relations above must be satisfied.  ░

In general we have Οƒk = (βˆ’1)k Οƒk (r1, r2, ..., rn), hence the coefficients of every monic polynomial of one indeterminate with coefficients in a field, are the elementary symmetric functions of its roots.

The set S of all symmetric polynomials in n variables with coefficients in 𝕂[x1, x2, ..., xn]. Every polynomial f(Οƒ1, Οƒ2, ..., Οƒn) of 𝕂[x1, x2, ..., xn] can be expressed in terms of elementary symmetric polynomials and thus contained in S. We have the following inclusions

We make this precise in the next theorem, showing that the first inclusion is an equality.

𝕂[Οƒ1, Οƒ2, ..., Οƒn] βŠ† S βŠ‚ 𝕂[x1, x2, ..., xn]

5.10.3 Theorem. (Fundamental theorem of symmetric polynomials). Every symmetric polynomial f(x1,x2,..., xn) of 𝕂[x1, x2, ..., xn] can be expressed uniquely in terms of elementary symmetric polynomials with coefficients in 𝕂.

Proof. Let

Ξ±x1h1x2h1 β‹…β‹…β‹… xnhn

the highest degree monomial function occurring in the polynomial function using the lexicographic ordering. Then necessarily h1 β‰₯ h2, β‰₯ ..., β‰₯ hn; If for example we had h1 < h2, we could exchange x1 with x2, obtaining the monomial

Ξ±x2h1x1h1 β‹…β‹…β‹… xnhn = Ξ±x1h1x2h1 β‹…β‹…β‹… xnhn

which is again a monomial of f(x1,x2, ..., xn), being f by hypothesis symmetric.

Consider now the symmetric polynomial (in x1, x2, ..., xn)

Ο†1 = Ξ±Οƒ1h1βˆ’h2Οƒ2h2βˆ’h3 β‹…β‹…β‹… Οƒnβˆ’1hnβˆ’1βˆ’hn Οƒnhn

The leading term of Ο†1 is given by the product of Ξ± for the leading terms of each Οƒi raised to the power h1 βˆ’ h2, h2βˆ’h3, .., ecc. respectively.

Ξ±x1h1βˆ’h2(x1x2)h2βˆ’h3 β‹…β‹…β‹…(x1x2β‹…β‹…β‹…xn)hn = Ξ±x1h1x2h2 β‹…β‹…β‹… xnhn

This shows that f and Ο†1 have the same leading term. Hence f1 = f β€” Ο†1 has a strictly smaller leading term according to the lexicographic ordering. Note that f1 is symmetric, since f and g are. Now repeat this process, starting with f1 instead of f. Since f1 is symmetric, it has a leading term with coefficient Ξ±1 and exponents b1 > Β·Β·Β· > bn. As above, this will give an expression Ο†2 in the elementary symmetric polynomials such that f1 and Ο†2 have the same leading term.

This process will terminate if we find some m with fm = 0, for the zero polynomial has no leading term. If, on the other hand, we never had fm = 0, then the above would give an infinite sequence of nonzero polynomials with strictly decreasing leading terms. But we showed above that there are only finitely many monomials strictly smaller than the leading term off. Hence the above process must terminate.

So we've proven that f βˆ’ Ο†1βˆ’Ο†2 βˆ’ ... βˆ’Ο†m is the zero polynomial, hence

f = Ο†1 + Ο†2 + ... + Ο†m

Each Ο†i, is a product of the Οƒ, to various powers, which proves that f is a polynomial in the elementary symmetric polynomials.

Uniqueness. The uniqueness of the expression follows from the algebraic independence of the elementary symmetric functions.

To prove the uniqueness of the expression we have to prove that if a polynomial (symmetric) can be written in two forms as

βˆ‘ Ξ±i1i2...in Οƒ1i1 Οƒ2i2 ... Οƒnin = βˆ‘ Ξ²i1i2...in Οƒ1i1 Οƒ2i2 ... Οƒnin

then Ξ±i1i2...in = Ξ²i1i2...in for all index ij, j=1,...,n. It is sufficient to prove that if Ο†(Οƒ1,...,Οƒn) = 0 then all coefficients of Ο†(Οƒ1,...,Οƒn) are zero i.e. the polynomial Ο†(z1,...,zn) in the indeterminates zi is the zero polynomial. This is equivalent to say that the Οƒi are algebrically independent. In other words we have to prove that

Ο†(z1,...,zn) β‰  0   β‡’   Ο†(Οƒ1,...,Οƒn) β‰  0

Let Ξ±z1h1z2h1 β‹…β‹…β‹… znhn be the leading term (using the lexicographic order), of Ο†(z1,...,zn). Substitute zi = Οƒi in Ο†(z1,..., zn). By expanding Οƒi in terms of the xi (since Οƒi is a function of x) the monomial Ξ±z1h1z2h1 β‹…β‹…β‹… znhn becomes the following polynomial in the xi:

Ξ±i1i2...in Οƒ1i1 Οƒ2i2 ... Οƒnin = a(x1 + x2 + ... + xn)h1(x1x2 + ...)h2 β‹…β‹…β‹… (x1β‹…β‹…β‹…xn)hn

The leading term of this polynomial is

ax1h1 + h2 + ... + hn x2h1 + h2 + ... + hnxnhn

which has a non-zero coefficient and cannot vanish by combination with any other monomial. This completes the proof.β–‘

This theorem asserts that the ring of all symmetric polynomials S, in n indeterminates is

S = 𝕂[x1, x2, ..., xn]

Example 5.10.4 Write the symmetric polynomial in 𝕂[x1, x2, x3]

f = βˆ‘i β‰  j xi2xj = x12x2 + x12x3 + x22x1 + x22x3 + x32x1 + x32x2

in terms of elementary symmetric polynomials Οƒk.

Solution. We use the same method applied to prove theorem 5.10.3. The highest degree monomial is x12x2 with h1 = 2, h2 = 1, h3 = 0. Ο†1 = Οƒ12βˆ’1 Οƒ21 = Οƒ1 Οƒ2

f1 = f βˆ’ Ο†1 = βˆ‘i β‰  j xi2xj βˆ’ (x1 + x2 + x3)(x1 x2 + x1x3 + x2x3) = βˆ‘i β‰  j xi2xj βˆ’ βˆ‘i β‰  j xi2xj βˆ’ 3(x1x2x3)

The highest monomial of f1 is βˆ’3(x1x2x3) so h1 = 1, h2 = 1, h3 = 1. We have

Ο†2 = βˆ’3Οƒ11βˆ’1 Οƒ21βˆ’1 Οƒ3 = βˆ’3Οƒ3 = βˆ’3(x1x2x3)

and

f2 = f1 βˆ’ Ο†2 = βˆ’3(x1x2x3) +3(x1x2x3) = 0

thus f = Ο†1 + Ο†2 = Οƒ1Οƒ2 βˆ’3Οƒ3.

The ring of polynomials in n indeterminates in a field 𝕂, is an integral domain (see exercise 1).

Β«Solutions of Cubic and Quartic equations Index Rational functions Β»