Symmetric Polynomials
We introduced π[x] the ring of polynomials in one indeterminate with coefficients over π. This definition can be extended to a case in which the coefficients vary over a commutative ring (e.g: β€) rather than a field. In this case we have a ring of polynomials with coefficients in a ring R and will be indicated by R[x].
5.10.1 Definition. The ring R of polynomials in n indeterminates and coefficients over the field π is defined inductively in the following way
R1 β‘ π[x], Rn β‘ Rn β 1[x]
we write Rn β‘ π[x1, x2, ..., xn]. β‘
So a polynomials with n indeterminates is thought as a polynomial in one variable, with coefficients in the ring of the polynomials with n-1 variables. An element of π[x,y] = (π[x])[y] is of the kind
βnj=0 fjyj, fj ∈ π[x]
like for example
β2j=0 (β1i=0 aijxi)yj = (a00 + a10x) + (a01 + a11x)y + (a02 + a12 x)y2 =
a00 + a10x + a01y + a02y2 + a11 xy + a12 xy2.
We can represent in general the elements of π[x1, x2, ..., xn] in the form
f(x1, x2, ..., xn) = β ai1i2...in x1i1 x2i2 ... xnin
They are added and multiplied in the familiar way, making use of the associative, commutative, and distributive laws of addition and multiplication.
The degree of the indeterminate xi is the highest exponent that occurs in the polynomial with a nonzero coefficient.
Given the monomial
x1h1, x2h2, ..., xnhn
the degree of the polynomials is the integer h1 + h2 + ... + hn. The degree of the polynomials f(x1, x2, ..., xn) is defined as the highest degree of its monomials.
A polynomial in general can have different highest degree monomials. It is thus relevant to order the monomials of a polynomial. The common order is the lexicographic ordering: In practice, given an ordered alphabet of variables x1, x2,..., xn you can sort all the monomials by first considering the exponent of x1, x2,..., xn, then the exponent of x2 and so on, until a difference is found between the exponents. At this point, the monomial is considered minor for which the exponent is minor. In symbols
αx1h1x2h1 β β β x3hn, α ∈ π, hi ≥ 0
and
βx1k1x2k1 β β β x3kn, β ∈ π, ki ≥0
two mononomers of the polynomial f(x1, x2,.., xn). We say that
αx1h1x2h1 β β β xnhn < βx1k1x2k1 β β β xnkn
if the least integer m, for which hi ≠ ki, such that hm < km.
For example
x14x22x3 β» x14x2x33 (same total degree, equal x1 exponent, greater x2 exponent)
x14x22x33 βΊ x14x23x33 (smaller total degree)
We introduce now an important class of polynomials: symmetric polynomials. Given the definition of permutation we have the following
5.10.2 Definition. A polynomial f(x1,x2,..., xn) ∈ π[x1, x2, ..., xn] is said symmetric if for any permutation of its indeterminate it remains invariant. β‘
For example the polynomials in π[x1, x2, ..., x3]
2x1 + 2x2 + 2x3 β3x12 β3x22 β3x32
x1x23 + x2x13 + x1x3 + x3x13 + x2x33 + x3x23
instead the following are not symmetric polynomials
x12 + x22 β x32, x1 + 3x2 + 7x3
The following symmetric polynomial are said elementary symmetric polynomials (or functions) in π[x1, x2, ..., xn]
σ1(x1, x2,..., xn) = x1 + x2 + ... + xn = βni=1 xi
σ1(x1, x2,..., xn) = x1 x2 + x1 x3 + ... + xnβ1 xn = βni≤i < j ≤ n xixj
σ1(x1,x2,..., xn) = x1 x2x3 + x1 x2 x4 + ...+ xnβ2 xnβ1 xn = βni ≤ i < j < k ≤ n xixjxk
...
σ1(x1,x2,...,xn) = x1x2β β β xn
We shall show, these functions connenct coefficients and roots of a polynomial equation in one variable.
Consider the equation
f(t) = tn + a1tnβ1 + ... + anβ1t + an = 0 (5.10.1)
Let r1, r2, rn be its roots, we can write
f(t) = (t β r1) (t β r2) β β β (t β rn) (5.10.2)
expanding the factored form of 5.10.2 and setting the coefficients equal to those of (5.10.2), we obtain the following relations
a1 = β(r1 + r2 + rn)
a2 = r1r2 + r1x3 + ... + r1xn + r2x3 + ... + rnβ1 rn
a3 = β(r1r2x3 + r1r2x4 + ... + rnβ2xnβ1 xn)
...
an = (β1)n r1x2β β β rn
The above relations are known as the Viète formulas: they relate the coefficients of a polynomial to sums and products of its roots. We examine the case n = 2.
Vieta's Formula. Quadratic Equations. Let r1 and r2 be the roots of the quadratic equation ax2 + bx + c = 0. Then the two identities
βb/a = r1 + r2 c/a = r1r2
both hold.
Proof. Write ax2 + bx + c = a(x β r1) (x β r2) = ax2 βax(r1 + r2) + ar1r2. From which the relations above must be satisfied. β‘
In general we have σk = (β1)k σk (r1, r2, ..., rn), hence the coefficients of every monic polynomial of one indeterminate with coefficients in a field, are the elementary symmetric functions of its roots.
The set S of all symmetric polynomials in n variables with coefficients in π[x1, x2, ..., xn]. Every polynomial f(σ1, σ2, ..., σn) of π[x1, x2, ..., xn] can be expressed in terms of elementary symmetric polynomials and thus contained in S. We have the following inclusions
We make this precise in the next theorem, showing that the first inclusion is an equality.
π[σ1, σ2, ..., σn] β S β π[x1, x2, ..., xn]
5.10.3 Theorem. (Fundamental theorem of symmetric polynomials). Every symmetric polynomial f(x1,x2,..., xn) of π[x1, x2, ..., xn] can be expressed uniquely in terms of elementary symmetric polynomials with coefficients in π.
Proof. Let
Ξ±x1h1x2h1 β β β xnhn
the highest degree monomial function occurring in the polynomial function using the lexicographic ordering. Then necessarily h1 ≥ h2, ≥ ..., ≥ hn; If for example we had h1 < h2, we could exchange x1 with x2, obtaining the monomial
Ξ±x2h1x1h1 β β β xnhn = Ξ±x1h1x2h1 β β β xnhn
which is again a monomial of f(x1,x2, ..., xn), being f by hypothesis symmetric.
Consider now the symmetric polynomial (in x1, x2, ..., xn)
φ1 = ασ1h1βh2σ2h2βh3 β β β σnβ1hnβ1βhn σnhn
The leading term of φ1 is given by the product of α for the leading terms of each σi raised to the power h1 β h2, h2βh3, .., ecc. respectively.
αx1h1βh2(x1x2)h2βh3 β β β (x1x2β β β xn)hn = αx1h1x2h2 β β β xnhn
This shows that f and φ1 have the same leading term. Hence f1 = f β φ1 has a strictly smaller leading term according to the lexicographic ordering. Note that f1 is symmetric, since f and g are. Now repeat this process, starting with f1 instead of f. Since f1 is symmetric, it has a leading term with coefficient α1 and exponents b1 > Β·Β·Β· > bn. As above, this will give an expression φ2 in the elementary symmetric polynomials such that f1 and φ2 have the same leading term.
This process will terminate if we find some m with fm = 0, for the zero polynomial has no leading term. If, on the other hand, we never had fm = 0, then the above would give an infinite sequence of nonzero polynomials with strictly decreasing leading terms. But we showed above that there are only finitely many monomials strictly smaller than the leading term off. Hence the above process must terminate.
So we've proven that f β φ1βφ2 β ... βφm is the zero polynomial, hence
f = φ1 + φ2 + ... + φm
Each φi, is a product of the Ο, to various powers, which proves that f is a polynomial in the elementary symmetric polynomials.
Uniqueness. The uniqueness of the expression follows from the algebraic independence of the elementary symmetric functions.
To prove the uniqueness of the expression we have to prove that if a polynomial (symmetric) can be written in two forms as
β αi1i2...in σ1i1 σ2i2 ... σnin = β βi1i2...in σ1i1 σ2i2 ... σnin
then αi1i2...in = βi1i2...in for all index ij, j=1,...,n. It is sufficient to prove that if φ(σ1,...,σn) = 0 then all coefficients of φ(σ1,...,σn) are zero i.e. the polynomial φ(z1,...,zn) in the indeterminates zi is the zero polynomial. This is equivalent to say that the σi are algebrically independent. In other words we have to prove that
φ(z1,...,zn) ≠ 0 β φ(σ1,...,σn) ≠ 0
Let Ξ±z1h1z2h1 β β β znhn be the leading term (using the lexicographic order), of φ(z1,...,zn). Substitute zi = σi in φ(z1,..., zn). By expanding σi in terms of the xi (since σi is a function of x) the monomial Ξ±z1h1z2h1 β β β znhn becomes the following polynomial in the xi:
αi1i2...in σ1i1 σ2i2 ... σnin = a(x1 + x2 + ... + xn)h1(x1x2 + ...)h2 β β β (x1β β β xn)hn
The leading term of this polynomial is
ax1h1 + h2 + ... + hn x2h1 + h2 + ... + hnxnhn
which has a non-zero coefficient and cannot vanish by combination with any other monomial. This completes the proof.β‘
This theorem asserts that the ring of all symmetric polynomials S, in n indeterminates is
S = π[x1, x2, ..., xn]
Example 5.10.4 Write the symmetric polynomial in π[x1, x2, x3]
f = βi ≠ j xi2xj = x12x2 + x12x3 + x22x1 + x22x3 + x32x1 + x32x2
in terms of elementary symmetric polynomials σk.
Solution. We use the same method applied to prove theorem 5.10.3. The highest degree monomial is x12x2 with h1 = 2, h2 = 1, h3 = 0. φ1 = σ12β1 σ21 = σ1 σ2
f1 = f β φ1 = βi ≠ j xi2xj β (x1 + x2 + x3)(x1 x2 + x1x3 + x2x3) = βi ≠ j xi2xj β βi ≠ j xi2xj β 3(x1x2x3)
The highest monomial of f1 is β3(x1x2x3) so h1 = 1, h2 = 1, h3 = 1. We have
φ2 = β3σ11β1 σ21β1 σ3 = β3σ3 = β3(x1x2x3)
and
f2 = f1 β φ2 = β3(x1x2x3) +3(x1x2x3) = 0
thus f = φ1 + φ2 = σ1σ2 β3σ3.
The ring of polynomials in n indeterminates in a field π, is an integral domain (see exercise 1).