Discriminants, Resultants, and Multidimensional Determinants

I. M. Gelfand, M. M. Kapranov, and A. V. Zelevinsky

Publisher:

Birkhäuser

Publication Date:

2008

Number of Pages:

523

Format:

Paperback

Series:

Modern Birkhäuser Classics

Price:

39.95

ISBN:

9780817647704

Category:

Monograph

BLL Rating:

BLL*

The Basic Library List Committee recommends this book for acquisition by undergraduate mathematics libraries.

MAA Review
Table of Contents

[Reviewed by

David P. Roberts

, on

10/24/2009

]

Israel M. Gelfand, perhaps more than any other twentieth-century mathematician, was known for the school he led. He was one of the great mathematicians who kept Soviet mathematics thriving through very difficult circumstances. MathSciNet records that Gelfand wrote more than 500 works with 150 coauthors covering 40 categories. A common theme of the Gelfand school is great depth presented in deceptively elementary language. A related theme is the use of algebraic techniques in settings such as differential equations where one might expect analytic techniques instead.

Discriminants, Resultants, and Multidimensional Determinants, joint with Mikhail M. Kapranov and Andrei V. Zelevinksy, first appeared in 1994. It is very much representative of the Gelfand school style. Discriminants and resultants of one-variable polynomials are well-known concepts, computable in a number of ways, including by ordinary determinants of square matrices. But what about the case of polynomials of more than one variable? The answer to this seemingly naive question is the elaborate and elegant theory presented in the book.

The authors are at great pains to be gentle to readers, but nonetheless I am sure that many potential readers would find the book itself intimidating. Accordingly in this review, I will highlight some important background, emphasize some of the more accessible topics, and indicate how the book fits into a larger program. I will focus on discriminants only, leaving resultants and determinants completely aside. Also the book uses "three main approaches" to its material: "a geometric approach via projective duality and associated hypersurfaces; an algebraic approach via homological algebra and determinants of complexes (Whitehead torsion); and a combinatorical approach via Newton polytopes and triangulation." The material in this review represents the third approach only.

Degree. When dealing with polynomials in one variable, an important notion is degree. For example, always assuming that printed coefficients are nonzero complex numbers, the polynomial f₁(x) = ax³ + bx² + cx + d has degree three. As a warm-up to the material in the book, it is useful to reflect on how the notion of degree should be extended to polynomials in more variables. What would be the degree of say

f₂(x,y) = ay² + bx³ + cxy + d or f₃(x,y) = axy² + bx² + cy + d?

A common response to this question would be to give the x-degree, y-degree and total degree. For f₂ and f₃ respectively, these numbers are (3,2,3) and (2,2,3).

This response is inadequate, as it does not fully capture the multidimensionality of the situation. To give a better response, let's first give ourselves extra flexiblity by allowing negative exponents as well. Given a Laurent polynomial f(x₁,...,x_m), let A ⊂ Z^m be the set of exponents for which the corresponding coefficient is non-zero. A better response focuses on the Newton polytope associated to the polynomial, namely the convex hull Q of the set A of contributing exponents. Our three quadrinomial examples are illustrated by the following figure:

In this figure, the Q_i are drawn in turn, with each A_i indicated by a set of four small disks. The lower left disk of each Q_i is the origin of the ambient exponent space containing Q_i. Each disk is drawn accompanied by its associated monomial. Thus, for example, the monomial a x y² of f₃(x,y) accompanies the top vertex of Q₃ which is located at the point (1,2) of the Euclidean plane containing Q₃.

In general, it is best to view the polytope Q itself as the proper higher-dimensional analog of degree. One numerical aspect of Q that is particularly important is the volume Vol(Q), with volume being normalized by Vol([0,1]^m) = m! Another important numerical aspect of Q is the count #(Q), defined as the number of lattice points interior to Q. In the one-dimensional case, there is not much different between Q, Vol(Q), and #(Q): the first is an interval [u,v], the second is its length d=v–u, and the third is d–1. In higher dimensions, Q is a complicated object and Vol(Q) and #(Q) capture two of its simpler aspects. In the two-dimensional examples pictured above, one has Vol(Q₂) = 6, Vol(Q₃) = 5, and #(Q₂) = #(Q₃) = 1.

Geometry. The authors refer to the Newton polytope Q of a given Laurent polynomial f(x₁,...,x_m) as a "magic crystal" which sheds light on f. So before getting to the book itself, it's best to give the reader of this review some feel for this remarkable "magic."

So given f(x₁,...,x_m), let X be its zero-locus in the torus (C^×)^m. As long as the coefficients (a,b,c,...) are chosen outside of a certain discriminant locus, then X is a smooth manifold with homeomorphism type independent of the coefficients. Its real topological dimension is 2(m–1). Since it is an affine variety, its Betti numbers b_i = dim_Q Hⁱ(X,Q) can be non-zero only for 0 ≤ i ≤ m–1. Its Euler characteristic χ(X) = b₀ – b₁ + b₂ – ... + (–1)^m–1b_m–1 is given by the extremely simple formula

χ(X) = (–1)^m–1 Vol(Q).

More subtly, but very importantly in algebraic geometry, each Betti number b_i decomposes as a sum of Hodge-Deligne numbers h_i^p,q∈ Z_≥0 satisfying the symmetry h_i^p,q = h_i^q,p; here the indices p and q run over non-negative integers satisfying p+q ≤ i. The Hodge-Deligne number h_m–1^m–1,0 is particularly well-behaved because it doesn't change when one replaces X by any of its compactifications. This Hodge number is also known as the geometric genus g(X) of X and one has the formula

g(X) = #(Q),

valid if m > 1. Further information about Hodge-Deligne numbers can be extracted from more subtle invariants of Q.

The last paragraph can be brought thoroughly down-to-earth in the cases m = 1 and m = 2. In the case m = 1, X consists of Vol(Q) distinct points, and so χ(X) = Vol(Q) is completely evident. In the case m = 2, X is a smooth genus #(Q) curve whose canonical smooth compactification is obtained by adding one point for each segment between lattice points on the boundary of Q. Thus X₂ is a genus one curve with k₂ = 2 + 3 + 1 = 6 punctures while X₃ is also a genus one curve, but now one with k₃ = 2 + 1 + 1 + 1 = 5 punctures. In each case, the general formula relating Euler characteristic, genus, and number of punctures is satisfied: χ = 2 – 2g – k. In general for m = 2, this geometric relation holds exactly because it is the translation of Pick's theorem for convex lattice polygons. In this setting m = 2, knowledge of Hodge-Deligne numbers comes down to knowledge of (g,k). The polytope Q contains this information and very much more. So too in higher dimensions: one can expect that the combinatorial object Q goes a long way towards describing the geometric object X.

Discriminants. Finally, with some comfort with and appreciation for Q, let's move to a central focus of the book. Our starting point now is A, a finite subset of Z^m. The polynomials giving rise to A as their set of contributing exponents are exactly the polynomials of the form f(x) = Σ_a∈A c_ax^a for c_a non-zero. Here we are using multi-index notation x=(x₁,...,x_m), a=(a₁,...,a_m), and x^a = x₁^a₁... x_m^a_m. One can ask which of these give rise to a singular X. The answer is given by an explicit polynomial Δ_A({c_a}) ∈ Z[{c_a}], well-defined up to sign. The singular X then come from exactly those polynomials f(x) = Σ_a∈A c_ax^a with Δ_A({c_a})=0.

In the case m=1, the discriminant is computable as the resultant of f with its derivative f_x. In our cubic example,

Res_x(f₁,f_1x) = c²b² – 4ac³ – 4b³d + 18abcd – 27a²d² = Δ_A₁(a,b,c,d).

In higher-dimensional cases, a tempting approach is to iterate the one-variable process. For example, eliminating y and then x in our second example yields

Res_x(Res_y(f₂,f_2x), Res_y(f₂,f_2y)) = –3ab²c⁴d(c⁶ – 432a³b²d) ≈ Δ_A₂(a,b,c,d).

Similarly, eliminating x and then y in our third example yields

Res_y(Res_x(f₃,f_3x), Res_x(f₃,f_3y)) = 16a⁶b⁶ (27bc⁴ + 64a²d³) ≈ Δ_A₃(a,b,c,d).

As indicated by the last two displayed lines, the result of this iterative process is close to the desired discriminant. However this iteration is too naive: typically eliminating the variables in any order gets the right "main factor." However the "secondary factors" obtained from this process depend on the ordering of the variables and are often not right.

To properly define and then compute the correct Δ_A, one has to be much more sophisticated in the setting m>1. Some of the ideas in the passage from A to Δ_A go back to an 1848 paper of Cayley which "laid out the foundations of modern homological algebra." This "breathtaking" paper is reprinted in the back of the book and the authors say "We were happy to enter into spiritual contact with this great mathematician." Another complication, not suggested by our three examples, is that even very modest A can yield Δ_A having millions of terms.

The secondary polytope. We've discussed already that the study of any given multivariate polynomial should begin with a study of its Newton polytope. Applying this principle to the polynomial Δ_A, one gets its Newton polytope S_A, called the secondary polytope of A. The authors write that "It was a very surprising discovery for us" when they realized that secondary polytopes "admit a very nice combinatorial description."

To describe S_A, the authors describe its vertices, these being by definition in bijection with extreme monomials of Δ_A. The authors find that extreme monomials are indexed by "coherent" triangulations of Q with all vertices in A. Going a step beyond S_A toward Δ_A itself, the authors find that the extreme monomial corresponding to a coherent triangulation T is

± ∏_σ∈T Vol(σ)^Vol(σ) ∏_{a ∈A} c_a^φ_T(a).

Here φ_T(a) is the sum of the volumes of the simplices σ incident on a.

The book presents a number of situations where the extreme monomial formula just displayed can be made much more explicit. We will present two of these situations here. The classical setting is the case when Q is one-dimensional. The setting of circuits is the case when S is one-dimensional. The one-dimensionality of one of the key polytopes, Q or S, keeps these two situations particularly simple.

The classical case. The classical case of degree d univariate polynomials has m=1 and A = {0,1,...,d}. In the case d=3 considered above, note first that all five terms in Δ₁(a,b,c,d) have total degree 4. Also if one gives a, b, c, and d weights 0, 1, 2, and 3 respectively, then all five terms have total weight 6. So the exponents i and j on the last two variables c and d determine the exponents on the first two variables a and b. We can therefore draw the secondary polytope S₁ in the i–j plane as on the right below. The four A₁-triangulations of the primary polytope Q₁ are all coherent and drawn to the left.

Note that the extreme-monomial formula gives four of the five terms, up to sign. Coefficients of interior terms are not covered by the theory presented in the book, and remain a mystery in general. Note also that the secondary polytope S₁ is combinatorically equivalent to a square but not affinely equivalent to a square.

The case of general degree d with coefficients c₀, ..., c_d is similar. All terms have total degree 2d –2 . The sum of the indices on the 2d–2 factors is d(d–1). A-triangulations of Q = [0,d] are indexed by subsets of {1,...,d–1} and are all coherent. Accordingly, extreme monomials are indexed by subsets of {1,...,d–1} according to which a_i appear. The monomial indexed by {1,...,d–1} itself is c₁² ... c_d–1² . In general, listing out the exponents which appear in increasing order, to pass from the extreme monomial indexed by a set S = {...,u,v,w,...} to the extreme monomial indexed by S – {v} = {...,u,w,...} one replaces the factor c_v^w–u by the factor c_u^w–vc_w^v–u. The secondary polytope is thus combinatorially an (m–1)-dimensional cube. As d = 3, 4, 5, 6, 7, ... increases, the number of terms in the discriminant increases rapidly: 5, 16, 59, 246, 1103, ...

The case of circuits. If A lies on a j-dimensional hyperplane of Z^m, then one can reduce to the case of j variables. For A to not lie on a proper hyperplane one needs |A| ≥ m+1. The first case |A|=m+1 is highly non-trivial in some respects such as Hodge-Deligne numbers. However it is degenerate with respect to the discriminants and secondary polytopes, as discriminants are just 1.

The next case |A|=m+2 with A being an affine generating set for Z^m is particularly interesting; in this case A is called a circuit. The vertices of the secondary polytope inside its ambient lattice can be identified with simply {0,1} inside Z. Accordingly, the discriminant Δ_A then has just two terms, both given up to sign by the extreme monomial formula. The triangulations giving these terms in our second and third examples are as follows.

Δ_A₂	=	1¹2²3³a¹⁺²b¹⁺³c¹⁺²⁺³d²⁺³ – 6⁶a⁶b⁶d⁶	Δ_A₃	=	1¹4⁴a¹⁺⁴b⁴c¹d¹⁺⁴ + 2²3³a³b²⁺³c²⁺³d²
	=	108a³b⁴c⁶d⁵ – 46656 a⁶b⁶d⁶		=	256a⁵b⁴c d⁵ + 108a³b⁵c⁵d²
	=	108a³b⁴d⁵ (c⁶ – 432a³b²d).		=	4 a³b⁴c d²(27bc⁴ + 64a²d³).

Thus in each case the main factor agrees with the main factor as naively calculated previously by the iterated resultant procedure.

Via simple monomial coordinates changes, the isomorphism type of the curve defined by f₂(a,b,c,d;x,y)=0 depends only on t = –432a³b²c^–6d. Similarly, the isomorphism type of the curve defined by f₃(a,b,c,d;x,y)=0 depends only on t = –27a^–1bc²d^–2 The resulting one-parameter families of curves X_t are smooth for t ∈ C^× except for t=1. These families are both very closely related to classical hypergeometric functions of the type ₂F₁(t). Similarly, a circuit in dimension m is closely related to a hypergeometric function of type _mF_m–1(t). The discriminant, as a function of the single variable t, is of the extremely simple form C t^j(t–1).

In general, the entire theory in the book can be "quantized": to a given A, one has a system of differential equations and Δ_A is only its "classical approximation." The authors are wise not to pursue this next level in the text itself, but readers should be aware that this larger theory is a primary motivation of the authors.

Conclusion. Of Gelfand's 500+ works, Discriminants, Resultants, and Multidimensional Determinants is currently the most cited on MathSciNet. A perusal of the long list of citations indicates the enormous influence of the book. Many of the citations are clearly in the framework set up by the book. Others go well beyond the book into fields as diverse as tropical geometry and mirror symmetry. Birkhäuser is not exaggerating when it calls Discriminants, Resultants, and Multidimensional Determinants a "modern classic."

David Roberts is a professor of mathematics at the University of Minnesota, Morris.

Preface.- Introduction.- General Discriminants and Resultants.- Projective Dual Varieties and General Discriminants.- The Cayley Method of Studying Discriminants.- Associated Varieties and General Resultants.- Chow Varieties.- Toric Varieties.- Newton Polytopes and Chow Polytopes.- Triangulations and Secondary Polytopes.- A-Resultants and Chow Polytopes of Toric Varieties.- A-Discriminants.- Principal A-Discriminants.- Regular A-Determinants and A-Discriminants.- Classical Discriminants and Resultants.- Discriminants and Resultants for Polynomials in One Variable.- Discriminants and Resultants for Forms in Several Variables.- Hyperdeterminants.- Appendix A. Determinants.- Appendix B. A. Cayley: On the Theory of Elimination.- Bibliography.- Notes and References.- List of Notation.- Index

Tags:

Linear Algebra