# Geometry and Computational Complexity Theory

Mathematics flourishes when ideas from one area of mathematics can be used in another area.  For a long time (certainly since Descartes), algebra has been a great asset to geometry.  In return, geometry has been helpful to algebra, for instance through representation theory.  Here are two very fine reviews of papers that bring algebra and geometry to the question of computational complexity.  Both reviews are by Peter Bürgisser.

The first article is a survey by Volker Strassen of his work on the complexity of matrix operations, and its growth into a larger application of geometry to the theory of bilinear maps.  The “well-known discovery” mentioned in the review is Strassen’s paper where he showed how one can compute the product of two $n\times n$ matrices with fewer than $4.7n^{\log_2(7)}\approx 4.7n^{2.8}$ arithmetical operations. This then implies that one can invert a non-singular matrix of order $n$ in less than $5.64 n^{\log_2(7)} \approx 6n^{2.8}$ arithmetical operations.  The traditional result is that if you only work with rows and columns, the lower bound on solving an $n\times n$ system of linear equations is $\frac{1}{3}n^3$, which is achieved  by Gaussian elimination.  Strassen’s survey puts on display a variety of mathematical tools, such as geometric invariant theory and the notion of asymptotic degenerations of bilinear maps.   The appendix makes use of representation theory.

So Strassen gives us an upper bound on the minimum number of operations needed for matrix multiplication.  What about a lower bound?  The second article is by J.M. Landsberg wherein he gives a lower bound on the computational complexity of multiplication of $2\times 2$ matrices.  (In Strassen’s result, plug in $n=2$ to see why there is a 7 in the title.)  Landsberg uses techniques from representation theory and geometry, in particular the secant variety.  Bürgisser mentions in his review that there are some gaps in Landsberg’s original paper.  On the arXiv, you can find an updated version where some overlooked cases are treated.

I hope you enjoy these reviews.  If you want to dig further into the subject, Landsberg has a book, Tensors: Geometry and Applications, that sets up the background, describes the work on complexity of matrix multiplication, and applies the general techniques to other areas, such as P versus NP, signal processing, phylogenetics, and algebraic statistics.  It’s available from the AMS.  And there is a nice review of it on MathSciNet.

MR2138544 (2006d:68289)
Strassen, Volker(D-KNST)
Komplexität und Geometrie bilinearer Abbildungen. (German. English summary) [Complexity and geometry of bilinear maps]
Jahresber. Deutsch. Math.-Verein. 107 (2005), no. 1, 3–31.
68W30 (14L24 68Q17)

This survey (in German) is a summary of Volker Strassen’s work at the interface of complexity and geometry published in the Journal für die Reine und Angewandte Mathematik [375/376 (1987), 406–443; MR0882307 (88h:11026); 384 (1988), 102–152; MR0929980 (89c:11056); 413 (1991), 127–180; MR1089800 (92m:11038)]. It is a pleasure to read and should be accessible to a wide mathematical audience. The main part is similar to Strassen’s overview given in [First European Congress of Mathematics, Vol. II (Paris, 1992), 429–446, Progr. Math., 120, Birkhäuser, Basel, 1994; MR1341854 (96e:68067)]. The appendix, however, which covers one third of the article, contains a wealth of new material.

The starting point, triggered by Strassen’s well-known 1969 discovery [Numer. Math. 13 (1969), 354–356; MR0248973 (40 #2223)], is to determine the minimum number of arithmetic operations sufficient to multiply two square matrices. Hereby, attention is confined to the asymptotic behaviour of this function captured by the exponent $\omega$ of matrix multiplication, which is defined as the infimum of the real numbers $\tau$ such that two $n\times n$ matrices can be multiplied with $O(n^\tau)$ arithmetic operations. It is known that $2\le\omega<2.38$ [D. Coppersmith and S. Winograd, J. Symbolic Comput. 9 (1990), no. 3, 251–280;MR1056627 (91i:68058)] and most experts nowadays believe that $\omega=2$.

Strassen has created a geometrical theory of bilinear maps with a considerably larger scope than the original questions of computational complexity. Geometric invariant theory provides a natural framework for addressing these more general questions. The concept of asymptotic degeneration makes it possible to take an asymptotic viewpoint when comparing any two bilinear maps. A bilinear map $f$ is called an asymptotic degeneration of a bilinear map $g$ iff, for all $\nu$$f^{\otimes\nu} is a degeneration of a direct sum of 2^{o(\nu)} many copies of g^{\otimes\nu}. (There are natural notions of direct sum and tensor product of bilinear maps.) The following spectral interpretation of this notion is crucial: For any semiring S of bilinear maps there is an essentially unique compact space\Delta(S) and a homomorphism \varphi\colon S\to C^+(\Delta(S)) from S to the semiring of nonnegative continuous functions on \Delta(S) such that\varphi(S) separates the points of \Delta(S) and the following property holds: f is an asymptotic degeneration of g iff \varphi(f)\le\varphi(g) pointwise on \Delta(S). If S is generated by q elements one may assume that \Delta(S)\subseteq\Bbb{R}^q. For semirings S of bilinear maps of a certain type, a singular simplex in the spectrum \Delta(S), the so-called support simplex, can be explicitly computed. This is achieved by exhibiting, for a probability vector \theta\in\Bbb{R}^3, a homomorphism S\to\Bbb{R},\, f\mapsto\zeta(\theta, f) of semirings that is monotonic with respect to asymptotic degeneration. In order to define this, suppose f\colon\Bbb{C}^{m_1}\times \Bbb{C}^{m_2}\to\Bbb{C}^{m_3} is a tight (straff) bilinear map. This means that there are injective real valued maps \alpha(i),\beta(j),\gamma(k) such that \alpha(i)+\beta(j)+\gamma(k)=0 for all (i,j,k) in the support {\rm supp }(f)\colon =\{(i,j,k)\mid f_{ijk}\ne 0\} of f. One defines$$ \zeta(\theta,f) \colon = \max_P 2^{\sum_{\kappa=1}^3 \theta_\kappa H(P_k)},\tag1$$where$P$runs over all probability distributions on${\rm supp}\, f$,$P_\kappa$denotes the$\kappa$-th marginal distribution of$P$, and$H$is the entropy function. It is known that for tight bilinear maps the support simplex and the spectrum have the same minimal points with respect to the product of the natural orders on$\Bbb{R}^q$. A main conjecture by Strassen (Vermutung 6) states that for tight bilinear maps, the image of the singular simplex coincides with the asymptotic spectrum. This conjecture would imply$\omega=2$. To fully appreciate the appendix, some familiarity with the representation theory of reductive linear algebraic groups is required. Some elegant proofs different from the ones in the literature are given. Moreover, the main conjecture is shown to be equivalent to the following more down-to-earth version (Vermutung 15): for “perfect” bilinear maps$f\colon\Bbb{C}^{m_1}\times \Bbb{C}^{m_2}\to\Bbb{C}^{m_3}$and$g\colon\Bbb{C}^{n_1}\times \Bbb{C}^{n_2}\to\Bbb{C}^{n_3}$the inequalities of the formats$m_i\le n_i$for$i=1,2,3$imply that$f$is an asymptotic degeneration of$g$. (Perfect means that the marginal distributions of the uniform distribution on the support are uniform.) Finally, an interesting connection of the support simplex of$f$to the moment polytope is made. It states that for a tight bilinear map$f$, Equation (1) is also true when$(P_1,P_2,P_3)$runs over all elements of the moment polytope of the orbit closure of$f$. {Reviewer’s remark: In the proof of Satz 16 there are a few typos confusing dimensions and their logarithms.} Reviewed by Peter Bürgisser MR2188132 (2006j:68034) Landsberg, J. M.(1-TXAM) The border rank of the multiplication of$2\times2$matrices is seven. J. Amer. Math. Soc. 19 (2006), no. 2, 447–459. 68Q17 (65F05 68W30) V. Strassen presented in 1969 [Numer. Math. 13 (1969), 354–356; MR0248973 (40 #2223)] a bilinear algorithm for multiplying$2\times2$matrices using seven multiplications (instead of eight). This implied that virtually all tasks of numerical linear algebra can be solved in less than cubic time. In subsequent improvements of Strassen’s algorithm, the concept of approximate bilinear algorithms played a crucial role. However, it remained unknown whether there is an approximate bilinear algorithm for the multiplication of$2\times 2$matrices using fewer than seven multiplications. This paper solves this long-standing open problem in algebraic complexity theory (cf. Problem 12.1 in [V. Strassen, in Handbook of theoretical computer science, Vol. A, 633–672, Elsevier, Amsterdam, 1990; MR1127177]). Mathematically, one has to show that the tensor of matrix multiplication is not contained in the secant variety$\sigma_6(\Bbb{P}^3\times\Bbb{P}^3\times\Bbb{P}^3)$. The key idea is to use a local differential-geometric description of limits of curves in the secant variety. This makes it possible to oversee all possible cancellations and thus to classify the points in$\sigma_6(\Bbb{P}^3\times\Bbb{P}^3\times\Bbb{P}^3)- \sigma_5(\Bbb{P}^3\times\Bbb{P}^3\times\Bbb{P}^3)\$ into fifteen cases. Once this is achieved, the proof proceeds by excluding all the possible cases using the known substitution technique.

I was recently informed by the author that the proof contains a gap. Apparently, it can be fixed and this will appear as an erratum.

Reviewed by Peter Bürgisser

Notes:

1. There is an interesting biography of Volker Strassen on the MacTutor website.
2. The review of Strassen’s ground-breaking result is by Graeme Fairweather, long before he became an executive editor of Mathematical Reviews. 