Amortized multi-point evaluationof multivariate polynomials

Let

be an effective field, so that we have algorithms for the field operations. Given a polynomial

and a tuple

of points, the computation of

is called the problem of multi-point evaluation.

This problem naturally occurs in several areas of applied algebra. When solving a polynomial system, multi-point evaluation can for instance be used to check whether all points in a given set are indeed solutions of the system. In [16], we have shown that efficient algorithms for multi-point evaluation actually lead to efficient algorithms for polynomial system solving. Bivariate polynomial evaluation has also been used to compute generator matrices of algebraic geometry error correcting codes [20].

In the univariate case when

, it is well known that one may use so-called “remainder trees” to compute multi-point evaluations in quasi-optimal time [1, 2, 4, 8, 22]. More precisely, if

stands for the cost to multiply two univariate polynomials of degree

(in terms of the number of field operations in

), then the multi-point evaluation of a polynomial of degree

points can be computed in time

. Using a variant of the Schönhage–Strassen algorithm [3, 27, 28], it is well known that

. If we restrict our attention to fields

of positive characteristic, then we may take

[5] and conjecturally even

[6]. In the remainder of this paper, we make the customary hypothesis that

is a non-decreasing function in

Currently, the fastest general purpose algorithm for multivariate multi-point evaluation is based on the “baby-step giant-step” method; see e.g. [12, section 3]. For a fixed dimension

and

such that

, this method requires

operations in

(e.g. by taking

and

in [12, Proposition 3.3]). Here

is a common abbreviation for

, and

is a constant such that two

and

matrices with coefficients in

can be multiplied using

operations in

. The best known theoretical bound is

[21, Table 2, half of the upper bound for

] (combined with the tensor permutation lemma [17, Corollary 7]). In the special case when

, and

, Nüsken and Ziegler proved the sharper bound

[24, particular case of Result 4].

In 2008, Kedlaya and Umans achieved a major breakthrough [18, 19]. On the one hand, they gave an algorithm of complexity

for the case when

has positive characteristic. On the other hand, they gave algorithms for multi-point evaluations over

with quasi-optimal bit-complexity exponents. Unfortunately, to the best of our knowledge, these algorithms do not seem suitable for practical purposes, as observed in [13, Conclusion]. Even in the case when the dimension

is fixed, we also note that the algorithms by Kedlaya and Umans do not achieve a smoothly linear complexity of the form

Recently, several algorithms have been proposed for multi-point evaluation in the case when

is a fixed tuple of points [15, 23]. These algorithms are amortized in the sense that we allow for potentially expensive precomputations as a function of

. The algorithms from [15, 23] are both restricted to the case when

is sufficiently generic, whereas [14, 23] focus on the bivariate case

. In the present paper, we deal with the general case when both

and

are arbitrary. Our main result is the following:

Remark 1.2. In characteristic zero, any integer different from

, and

has infinite order. For finite fields

, working in an algebraic extension

yields elements of order up to

. Replacing

by such an extension induces an additional overhead of

in our complexity bound.

As in the generic case from [15], our algorithm heavily relies on the relaxed algorithm for polynomial reduction from [9]. Given polynomials

, the reduction algorithm computes

with

where

is reduced with respect to some admissible ordering

on monomials. The relaxed algorithm from [9] essentially performs this reduction with the same complexity (up to a logarithmic factor) as checking the relation.

For our application to multi-point evaluation, the idea is to pick the polynomials

in the vanishing ideal

, so

. We may next recursively split-up the tuple of points

into two halves and adopt a similar strategy as in the univariate case.

However, there are several technical problems with this idea. First of all, it would be too costly to work with a full Gröbner basis

, because the mere storage of such a basis generally requires more than

terms. In [15], we solved this issue by working with respect to a so called “axial basis” instead. In the case when

is not generic, we have the additional problem that the shape of a Gröbner basis may become very irregular. This again has the consequence that the mere size of all products

in (1.1) may exceed

In this paper, we address these problems by introducing the new technique of quasi-reduction. The idea is to work simultaneously with several orderings on the monomials; in particular, each

belongs to a Gröbner basis for

with respect to a different admissible ordering. The result of a quasi-reduction (1.1) is usually not reduced with respect to any usual Gröbner basis of

, but we will still be able to control the size of

. Moreover, during the quasi-reduction process, we will be able to ensure that the ratios

are similar for every

, which further allows us to control the size of the products

The constant factor in our complexity bound of Theorem 1.1 grows exponentially as a function of

. For the time being, we therefore expect our methods to be of practical use only in low dimensions like

. Of course, the present paper focuses on worst case bounds for potentially highly non-generic cases. There is hope to drastically lower the constant factors for more common use cases.

One major technical contribution of this paper is the introduction of quasi-reduction and heterogeneous orderings. An interesting question for future work is whether these concepts can be applied to other problems. For instance, consider a zero-dimensional ideal

and two Gröbner bases for

with respect to different monomial orderings. Each Gröbner basis induces a representation for elements of the quotient algebra

. Does there exist a smoothly linear time algorithm to convert between these two representations?

2.Polynomial reduction

In this section we recall and extend some results from [9] to the context of this paper.

2.1.Admissible orderings

Let

be the set of monomials

with

. Any polynomial

can uniquely be written as a linear combination

Given a total ordering

, the support of any non-zero polynomial

admits a unique maximal element

that is called the leading monomial of

; the corresponding coefficient

is called the leading coefficient of

2.2.Sparse polynomial products

A sparse representation of a polynomial

is a data structure that stores the set of the non-zero terms of

. Each such term is a pair made of a coefficient and a degree vector. In an algebraic complexity model the bit size of the exponents counts for free, so the relevant size of such a polynomial is the cardinality of its support.

Consider two polynomials

and

in sparse representation. An extensive literature exists on the general problem of multiplying

and

; see [26] for a recent survey. For the purposes of this paper, a superset

for the support of

will always be known. Then we define

to be the cost to compute

, where

is the maximum of the sizes of

and the supports of

and

. We will assume that

is a non-decreasing function in

. Under suitable assumptions, the following proposition will allow us to take

in our multivariate evaluation algorithm.

Proof. The first statement is straightforward. The second one is an adapted version of [11, Proposition 6].

2.3.Relaxed multivariate series

We assume that the reader is familiar with the technique of relaxed power series evaluations [7], which is an efficient way to solve so-called recursive power series equations. In [9], this technique was generalized to multivariate Laurent series that are expanded according to some admissible ordering

Given a general admissible ordering

, we know from [25] that there exist real vectors

, such that

For clarity we sometimes denote this ordering

. In [9], it is always assumed that

and

for all

Example 2.2. The graded lexicographical ordering

is obtained by taking

Example 2.3. In section 4.1, we will only need orderings for which

for certain

and

. In fact, we note that

for any

, so modulo the replacement of

, we may assume without loss of generality that

, whenever we wish to apply the theory from [9].

In order to analyze the complexity of relaxed products in this multivariate context, we need to introduce the following quantities for finite subsets

2.4.Quasi-reduction

Let

be a total ordering on

and consider a finite family

of non-zero polynomials in

. Each

comes with a distinguished monomial

(that will play the role of the leading monomial) and we define

. The family

is equipped with a selection strategy, that is a function

For each

, we also assume that we have fixed the set

of monomials that will be selected for reduction with respect to

, for

. We assume that the

are pairwise disjoint and we set

Note that this corresponds to fixing a selection strategy, although we do not require that

(which explains the terminology “quasi-reduction” below). For our application later in this paper, the complement

will be a “rather relatively small” finite set.

We say that

is quasi-reduced if

. We say that a total ordering

is quasi-admissible (with respect to our choices of

and the selection strategy

) if

for all

and if

Polynomials that are not quasi-reduced are said to be quasi-reducible. In order to quasi-reduce

with respect to

we may use Algorithm 2.1 below. This yields the relation

such that

is quasi-reduced with respect to

. We call

the extended quasi-reduction of

with respect to

Algorithm 2.1

Input
Output: The extended quasi-reduction of by .

Initialize and for all .
While do:
1. Let be the largest monomial of ;
2. Replace by ;
3. Replace by .
Return .

2.5.Relaxed quasi-reduction

The main contribution of [9] is a relaxed algorithm for the computation of extended reductions. This algorithm generalizes to our setting as follows and provides an alternative for Algorithm 2.1 with a better computational complexity.

This operator is “recursive” in the sense of [9, section 4.2] with respect to the ordering

. Consequently

can be computed efficiently using relaxed power series evaluation. The complexity of this relaxed quasi-reduction is stated in Theorem 4.4 below for the specific ordering used by our multi-point evaluation algorithm. This definition and study of this ordering is the purpose of the next section.

3.Heterogeneous orderings

3.1.Weighted degree orderings

An admissible weight is an

-tuple

with

. Given a monomial

, we define its -degree by

3.2.Simplest elements of ideals

Given an admissible weight

, there exists a unique non-zero polynomial

in the reduced Gröbner basis of

whose leading monomial is minimal for

and whose leading coefficient is one. We call

the -simplest element of

. Note that there are at most

monomials below the leading monomial of

for

Proof. We say that

is an initial segment

for

for all

and

. By definition, the set

is an initial segment for

and it contains at least

monomials. Consequently, the set of linear combinations of monomials in

contains at least one non-zero element of

and therefore

. Now consider a monomial

. Then

for

, whence

3.3.Heterogeneous orderings

Now consider a finite non-empty set

of admissible weights. Given a monomial

, we define its -degree

but this union is not necessarily disjoint. Given

, we note that

for any

with

; this explains the terminology “cone”.

3.4.Quasi-reduction

Let

again be a zero-dimensional ideal of

and let

. Let

be a finite non-empty set of weights that will play the role of the index set

from section 2.4. For

, let

be the

-simplest element of

and let

We endow

with the lexicographical ordering

and fix the following selection strategy: for increasing

, we set

for all

. We call

a shifted heterogeneous ordering. The shift of all exponents by one (through multiplication by

) is motivated by the fact that the product of the partial degrees of a monomial in

never vanishes. This will be important in the next section in order to establish certain degree bounds.

4.On the complexity of quasi-reduction

In this section we instantiate the weight family

considered in the previous section, and analyze the complexity of the corresponding quasi-reduction.

4.1.Selecting the weights

Consider a zero-dimensional ideal

and let

. In order to devise an efficient algorithm to quasi-reduce polynomials with respect to

, our next task is to specify a suitable set of weights

. We take

, with

Proof. Consider

with

and

. We have

for

and

is determined uniquely in terms of

. There are at most

ways to pick

in this manner.

Assume that

for some

. We associate a selection procedure and a quasi-admissible ordering to

as in section 3.4.

Proof. Assume the contrary and let

be the index for which

is maximal. Similarly, let

be the index for which

is minimal, so that

. Since

Since

, this yields

. If

then

, since

. In both cases we consider the weight

with

, and

for all

. By what precedes, we have

4.2.Complexity of quasi-reduction

We define

to be the set of polynomials

such that

for all

. Given

, let us now examine the complexity of extended quasi-reduction.

Proof. We solve (4.1) using the relaxed approach from [9], recalled in section 2. However, contrary to the situation in [9], each individual product

is computed in a relaxed manner with respect to a different ordering (namely,

). More precisely, for each individual product

, we recall from (2.1) that

So we can actually compute

using the relaxed product algorithm from Theorem 2.4 with respect to the ordering

. Here we note that the supports of

and

are contained in dense blocks of similar proportions: for

, Corollary 3.2 yields

Indeed, for any

, we have

, whence

and

. Let

be the set of monomials

with

for

, so that

and

It particular, as a side remark, we note that the dense multiplication of

and

can be done using

operations in

As to the relaxed product of

and

, we first note that

, with the notation from Example 2.3, where

with

and

. Consequently,

We need to do

such relaxed multiplications. Now

by Lemma 4.1, so the complete computation takes

operations in

4.3.A degree bound for reduced polynomials

This shows that

divides

. In combination with our assumption that

, this means that

, a contradiction.

5.Reduction of the border

One limitation of Proposition 4.5 is that it does not apply to monomials

such that

for some index

. In this section, we show how to treat such lower dimensional “border” monomials by adapting our reduction process. For any fixed subset

, we will use the fact that the reduction process from the previous subsections can be applied to polynomials in the variables from

and the ideal

. We next combine the reduction processes for all such subsets

5.1.Generalized weights

its set of active coordinates. We say that

is admissible if

. For any monomial

, we define its

-degree by

for all

. We may naturally extend the notions of

-degree, leading monomials, etc. to polynomials in

. We also define the

-simplest element of

to be the unique monic polynomial

whose leading monomial is minimal for

5.2.Quasi-reduction for generalized weights

We next extend the definitions from section 3.4 to the case when

is a set of generalized weights with

for every subset

. For each

, we take

to be the

-simplest element of

and let

We will sometimes write

instead of

when we need to emphasize the dependence on

. After declaring that

, the set

can still be endowed with the lexicographical ordering. For increasing

, we now set

Sometimes, we will write

instead of

when we need to emphasize the dependence on

. Finally, we define our total ordering

Proof. It is straightforward to check that

is indeed a total ordering such that

5.3.Complexity analysis

Given a general weight

and a subset

such that

for all

, we define

to be the generalized weight

with

and

. If

is admissible, then we note that

is again admissible. Given

, we define

Using the same arguments as in the proof of Theorem 4.4, we obtain that the relaxed product

can be computed in time

. Since there are

such products to be computed, the complexity bound follows. The inequality (5.1) is also obtained in a similar way as for Theorem 4.4.

Proof. The monomial

is reduced with respect to

if and only if

is reduced with respect to

. By Proposition 4.5, this implies

6.Multi-point evaluation

Let

be a tuple of pairwise distinct points and consider the problem of fast multi-point evaluation of a polynomial

. For simplicity of the exposition, it is convenient to restrict ourselves to the case when

is a power of two (without loss of generality, we may reduce to this case by adding extra points). We define

to be the vanishing ideal of

. We assume that we precomputed

. For any

and

, we also assume that we precomputed

, where

and

Algorithm 6.1

Input
Output: .
Note: We assume the precomputations that are stated above.

If , then return the naive evaluation .
Compute the quasi-reduction of w.r.t. .
Recursively apply the algorithm to and .
Recursively apply the algorithm to and .
Return the concatenations of the results of the recursive evaluations.

Proof. The algorithm is clearly correct if

. For any recursive call of the algorithm with arguments

and

, Proposition 5.3 ensures that we indeed have

with

. The correctness of the general case easily follows from this.

As to the complexity bound, let us first assume that

. Then the same condition is satisfied for all recursive calls. Now the computation of

takes

By unrolling this recurrence inequality, it follows that

. If

, then the computation of

at the top-level requires

operations in

and we have just shown that the complexity of all recursive computations is

Proof of Theorem 1.1. The proof follows from Theorem 6.1 and Proposition 2.1, by analyzing the cardinalities of the supports involved in the quasi-reductions.

Let us revisit the quasi-reduction computed in step 2 of Algorithm 6.1. From (5.1) in Theorem 5.2 and (5.2) in Proposition 5.3, it follows that we may apply Proposition 2.1 for

. For the recursive quasi-reductions, we may use even smaller values for

. This justifies that we may indeed take

in Theorem 6.1.

Bibliography

[1]: A. Borodin and R. T. Moenck. Fast modular transforms. J. Comput. System Sci., 8:366–386, 1974.
[2]: A. Bostan, G. Lecerf, and É. Schost. Tellegen's principle into practice. In Proceedings of the 2003 International Symposium on Symbolic and Algebraic Computation, ISSAC '03, pages 37–44. New York, NY, USA, 2003. ACM.
[3]: D. G. Cantor and E. Kaltofen. On fast multiplication of polynomials over arbitrary algebras. Acta Inform., 28:693–701, 1991.
[4]: C. M. Fiduccia. Polynomial evaluation via the division algorithm: the fast Fourier transform revisited. In Proceedings of the Fourth Annual ACM Symposium on Theory of Computing, STOC '72, pages 88–93. New York, NY, USA, 1972. ACM.
[5]: D. Harvey and J. van der Hoeven. Faster polynomial multiplication over finite fields using cyclotomic coefficient rings. J. Complexity, 54:101404, 2019.
[6]: D. Harvey and J. van der Hoeven. Polynomial multiplication over finite fields in time . Technical Report, HAL, 2019. https://hal.archives-ouvertes.fr/hal-02070816, accepted for publication in J. ACM.
[7]: J. van der Hoeven. Relax, but don't be too lazy. J. Symbolic Comput., 34:479–542, 2002.
[8]: J. van der Hoeven. Faster Chinese remaindering. Technical Report, HAL, 2016. https://hal.archives-ouvertes.fr/hal-01403810.
[9]: J. van der Hoeven. On the complexity of multivariate polynomial division. In I. S. Kotsireas and E. Martínez-Moro, editors, Applications of Computer Algebra. Kalamata, Greece, July 20–23, 2015, volume 198 of Springer Proceedings in Mathematics & Statistics, pages 447–458. Cham, 2017. Springer International Publishing.
[10]: J. van der Hoeven. The Jolly Writer. Your Guide to GNU TeXmacs. Scypress, 2020.
[11]: J. van der Hoeven and G. Lecerf. On the bit-complexity of sparse polynomial multiplication. J. Symbolic Comput., 50:227–254, 2013.
[12]: J. van der Hoeven and G. Lecerf. Accelerated tower arithmetic. J. Complexity, 55:101402, 2019.
[13]: J. van der Hoeven and G. Lecerf. Fast multivariate multi-point evaluation revisited. J. Complexity, 56:101405, 2020.
[14]: J. van der Hoeven and G. Lecerf. Amortized bivariate multi-point evaluation. In M. Mezzarobba, editor, Proceedings of the 2021 International Symposium on Symbolic and Algebraic Computation, ISSAC '21, pages 179–185. New York, NY, USA, 2021. ACM.
[15]: J. van der Hoeven and G. Lecerf. Fast amortized multi-point evaluation. J. Complexity, 67:101574, 2021.
[16]: J. van der Hoeven and G. Lecerf. On the complexity exponent of polynomial system solving. Found. Comput. Math., 21:1–57, 2021.
[17]: J. Hopcroft and J. Musinski. Duality applied to the complexity of matrix multiplication and other bilinear forms. SIAM J. Comput., 2(3):159–173, 1973.
[18]: K. S. Kedlaya and C. Umans. Fast modular composition in any characteristic. In Proceedings of the 49th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2008), pages 146–155. Los Alamitos, CA, USA, 2008. IEEE Computer Society.
[19]: K. S. Kedlaya and C. Umans. Fast polynomial factorization and modular composition. SIAM J. Comput., 40(6):1767–1802, 2011.
[20]: D. Le Brigand and J.-J. Risler. Algorithme de Brill–Noether et codes de Goppa. Bulletin de la société mathématique de France, 116(2):231–253, 1988.
[21]: F. Le Gall and F. Urrutia. Improved rectangular matrix multiplication using powers of the Coppersmith–Winograd tensor. In A. Czumaj, editor, Proceedings of the 2018 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1029–1046. Philadelphia, PA, USA, 2018. SIAM.
[22]: R. T. Moenck and A. Borodin. Fast modular transforms via division. In 13th Annual Symposium on Switching and Automata Theory, pages 90–96. USA, 1972. IEEE.
[23]: V. Neiger, J. Rosenkilde, and G. Solomatov. Generic bivariate multi-point evaluation, interpolation and modular composition with precomputation. In A. Mantzaflaris, editor, Proceedings of the 45th International Symposium on Symbolic and Algebraic Computation, ISSAC '20, pages 388–395. New York, NY, USA, 2020. ACM.
[24]: M. Nüsken and M. Ziegler. Fast multipoint evaluation of bivariate polynomials. In S. Albers and T. Radzik, editors, Algorithms – ESA 2004. 12th Annual European Symposium, Bergen, Norway, September 14-17, 2004, volume 3221 of Lect. Notes Comput. Sci., pages 544–555. Springer Berlin Heidelberg, 2004.
[25]: L. Robbiano. Term orderings on the polynomial ring. In B. F. Caviness, editor, EUROCAL '85. European Conference on Computer Algebra. Linz, Austria, April 1-3, 1985. Proceedings. Volume 2: Research Contributions, volume 204 of Lect. Notes Comput. Sci., pages 513–517. Springer-Verlag Berlin Heidelberg, 1985.
[26]: D. S. Roche. What can (and can't) we do with sparse polynomials? In C. Arreche, editor, Proceedings of the 2018 ACM International Symposium on Symbolic and Algebraic Computation, ISSAC '18, pages 25–30. New York, NY, USA, 2018. ACM.
[27]: A. Schönhage. Schnelle Multiplikation von Polynomen über Körpern der Charakteristik 2. Acta Inform., 7:395–398, 1977.
[28]: A. Schönhage and V. Strassen. Schnelle Multiplikation großer Zahlen. Computing, 7:281–292, 1971.

1.Introduction

2.Polynomial reduction

2.1.Admissible orderings

2.2.Sparse polynomial products

2.3.Relaxed multivariate series

2.4.Quasi-reduction

2.5.Relaxed quasi-reduction

3.Heterogeneous orderings

3.1.Weighted degree orderings

3.2.Simplest elements of ideals

3.3.Heterogeneous orderings

3.4.Quasi-reduction

4.On the complexity of quasi-reduction

4.1.Selecting the weights

4.2.Complexity of quasi-reduction

4.3.A degree bound for reduced polynomials

5.Reduction of the border

5.1.Generalized weights

5.2.Quasi-reduction for generalized weights

5.3.Complexity analysis

6.Multi-point evaluation

Bibliography