10 comments

  • rnhmjoj 1 hour ago
    > My concern is that the word “elementary” in the title carries a much broader meaning in standard mathematical usage, and in this meaning, the paper’s title does not hold.

    > Elementary functions typically include arbitrary polynomial roots, and EML terms cannot express them.

    If you take a real analysis class, the elementary functions will be defined exactly as the author of the EML paper does.

    I've actually just learnt that some consider roots of arbitrary polynomials being part of the elementary functions before, but I'm a physicist and only ever took some undergraduate mathematics classes. Nonetheless, calling these elementary feels a bit of stretch considering that the word literally means basic stuff, something that a beginner will learn first.

    • chii 5 minutes ago
      jargon are words being used that don't carry the typical laymen definition, but a specific one from the domain of said jargon.

      If a written piece is intended for an audience who knows the jargon, then it's fine to use jargon - in fact it's appropriate and succinct. If it was intended for the laymen, then jargon is inappropriate.

      But it seems you're lamenting that this jargon is wrong and that it shouldn't be jargon!?

    • reikonomusha 1 hour ago
      The definition of "elementary function" typically includes functions which solve polynomials, like the Bring radical. The definition was developed and is most fitting in algebraic contexts where algebraic structure is meaningful, like Liouvillian structure theorems, algorithmic integration, and computer algebra. See e.g.

      - Page 2 and the following example of https://billcookmath.com/courses/math4010-spring2016/math401... (2016)

      - Ritt's Integration in Finite Terms: Liouville's Theory of Elementary Methods (1948)

      It's not frequent that analysis books will define the class of elementary functions rigorously, but instead refer to examples of them informally.

    • burnished 46 minutes ago
      All I know is that when a class starts with 'elementary' or 'fundamentals of' you had best buckle up.
      • TeMPOraL 26 minutes ago
        Introduction to ...
  • SabrinaJewson 1 hour ago
    Related is the paper [What is a closed-form number?], which explores the field E, defined as the smallest subfield of ℂ closed under exp and log. I believe the set of numbers that can be generated using exp-minus-log is a strict subset of this.

    In a similar vein to this post, the paper points out that general polynomials do not have solutions in E, so of course exp-minus-log is similarly incomplete.

    What is intriguing is that we don’t even know whether many simple equations like exp(-x) = x (i.e. the [omega constant]) have solutions in E. We of course suspect they don’t, but this conjecture is not proven: https://en.wikipedia.org/wiki/Schanuel%27s_conjecture

    What is a closed-form number?: http://timothychow.net/closedform.pdf omega constant: https://en.wikipedia.org/wiki/Omega_constant

  • saithound 2 hours ago
    The original article explicitly acknowledged this limitation, that while in "the classical differential-algebraic setting, one often works with a broader notion of elementary function, defined relative to a chosen field of constants and allowing algebraic adjunctions, i.e., adjoining roots of polynomial equations," the author works with the less general definition.

    Neither the present article, nor the original one has much mathematical originality, though: Odrzywolek's result is immediately obvious, while this blog post is a rehash of Arnold's proof of the unsolvability of the quintic.

    • reikonomusha 40 minutes ago
      Arnold (as reported by Goldmakher [1]) does prove the unsolvability of the quintic in finite terms of arithmetic and single-valued continuous functions (which does not include the complex logarithm). TFA's result is stronger, which is something about the solvability of the monodromy groups of all EML-derived functions. So it doesn't seem to be a "rehash", even if their specific counterexample could have been achieved either in fewer steps or with less machinery.

      [1] https://web.williams.edu/Mathematics/lg5/394/ArnoldQuintic.p...

      • saithound 17 minutes ago
        Arnold's proof can be used to show that certain classes of functions are insufficient to express a quintic formula.

        These classes can always safely include all single-valued continuous functions (you cannot even write the _quadratic_ formula in terms of arithmetic and single-valued continuous functions!), but also plenty of non-single-valued functions (e.g. the +-sqrt function which appears in the well-known quadratic formula).

        Applying Arnold's proof to the class given by arithmetic and all complex nth root functions (also multivalued) gives the usual Abel-Ruffini theorem. But Arnold's proof applies to the class "all elm-expressible functions" without modification.

    • vintermann 1 hour ago
      Yes, this article is kicking in open doors, the original article was quite clear about the scope.

      The present article could rather have spent time arguing why this isn't like NAND gate functional completeness.

      I would have thought the differences lie in the other direction: not that trees of EML and 1 can describe too little, but that they can describe too much already. It's decidable whether two NAND circuits implement the same function, I'm pretty sure it's not decidable if two EML trees describe the same function.

  • derriz 1 hour ago
    When I first read the exp-minus-log paper, I found it extremely surprising - even shocking that such a function could exist.

    But the fact that a single function can represent a large number of other functions isn't that surprising at all.

    It's probably obvious to anyone (it wasn't initially to me), but given enough arguments I can represent any arbitrary set of n+1 functions (they don't even have to be functions on the reals - just as long as the domain has a multiplicative zero available) as a sort of "selector":

    g(x_0, c_0, x_1, c_1, ... , x_n, c_n) = c_0 * f_0(x_0) + ... + c_n * f_n(x_n)

    The trick is to minimize the number of arguments and complexity of the RHS - but that there's a trivial upper-bound (in terms of number of arguments).

    • kqr 31 minutes ago
      This is similar to the idea of generating functions, if you would like more to read!
    • cyberax 18 minutes ago
      Why would it be surprising?

      And if you want something truly surprising, Riemann's zeta function can approximate any holomorphic function arbitrarily well on the critical strip. So technically you need only _one_ argument.

  • lotaezenwa 2 hours ago
    The author essentially says that the quintic has no closed form solution which is true regardless of the exp-minus-log function. The purpose of this blog post is lost on me.

    Can anyone please explain this further? It seems like he’s moving the goalposts.

    • reikonomusha 1 hour ago
      "The quintic has no closed form solution" is a theorem that is more precisely stated (in the usual capstone Galois proof) as follows: The quintic has no closed form solution in terms of arbitrary compositions of rational numbers, arithmetic, and Nth roots. We can absolutely express closed form solutions to the quintic if we broaden our repertoire of functions, such as with the Bring radical.

      The post's argument is different than the usual Galois theory result about the unsolvability of the quintic, in that it shows a property that must be true about all EML(x,y)-derived functions, and a hypothetical quintic-solver-function does not have that property, so no function we add to our repertoire via EML will solve it (or any other function, elementary or not, that lacks this property).

      • lotaezenwa 48 minutes ago
        Cool explanation, thanks!
      • cyberax 15 minutes ago
        Bring radicals are just cheating.

        You can't solve an equation? Why not just introduce a function that is equal to the solution of the equation! Problem solved.

        • reikonomusha 3 minutes ago
          This fundamental "cheat" gave rise to some of the most important pure and applied mathematics known.

          Can't solve the differential equation y'' = -y? Why not just introduce a function sin(x) as its solution! Problem solved.

          A lot of 19th century mathematics was essentially this: discover which equations had solutions in terms of things we already knew about, and if they didn't, make a new name. This is the whole field of so-called "special functions". It's where we also get the elliptic functions, Bessel functions, etc.

          The definition of "elementary function" comes exactly from this line in inquiry: define a set of functions we think are nice and algebraically tractable, and answer what we can express with them. The biggest classical question was:

              Do integrals of elementary functions give us elementary functions?
          
          The answer is "no" and Liouville gave us a result which tells us what the answer does look like when the result is elementary.

          Risch gave us an algorithm to compute the answer, even it exists.

    • markgall 1 hour ago
      Can anyone provide a link that "Some are going as far as to suggest that the entire foundations of computer engineering and machine learning should be re-built as a result of this", or anything similarly grandiose?

      I am a professional mathematician, though nowhere near this kind of thing. The result seems amusing enough, but it doesn't really strike me as something that would be surprising. I confess that this thread is the first I've heard of it...

      • saithound 1 hour ago
        It's a fun, but unsurprising undergrad-level result. It got picked up and overhyped on HN [1] and /r/math [2] earlier this week.

        Some of my favorites:

        DoctorOetker: "I'm still reading this, but if this checks out, this is one of the most significant discoveries in years."

        cryptonektor: "Given this amazing work, an efficient EML operator HW implementation could revolutionize a bunch of things."

        zephen: "This is about continuous math, not ones and zeroes. Assuming peer review proves it out, this is outstanding."

        [1] https://news.ycombinator.com/item?id=47746610

        [2] https://www.reddit.com/r/math/comments/1sk63n5/all_elementar...

        • DoctorOetker 36 minutes ago
          :)

          I still consider the article important, as it demonstrates techniques to conduct searches, and emphasizes the very early stage of the research (establishes non-uniqueness for example), openly wonders which other binary operators exist and which would have more desirable properties, etc.

          Sometimes articles are important not for their immediate result, but for the tools and techniques developed to solve (often artificial or constrained) problems. The history of mathematics is filled with mathematicians studying at-the-time-rather-useless-constructions which centuries or millennia later become profound to human interaction. Think of the "value" of Euclid's greatest common divisor algorithm. What starts out as a curiosity with 0 immediate relevance for society, is now routinely used by everyone who enjoys the world wide web without their government or others MitM'ing a webpage.

          If the result was the main claimed importance for the article, there would be more emphasis on it than on the methodology used to find and verify candidates, but the emphasis throughout the article is on the methodology.

          It is far from obvious that the tricks used would have converged at all. Before this result, a lot of people would have been skeptical that it is even possible to do search candidates this way. While the gradual early-out tightening in verification could speed up the results, many might have argued that the approach to be used doesn't contain an assurance that the false positive rate wouldn't be excessively high (i.e. many would have said "verifying candidates does not ensure finding a solution, reality may turn out that 99.99999999999999999% of candidates turn out not to pass deeper inspection").

          It is certainly noteworthy to publish these results as they establish the machinery for automated search of such operations.

        • renewiltord 1 hour ago
          This result itself is being described in those terms[1]:

          > If this is true, then this blog post debunking EML is going to up-end all of mathematics for the next century.

          This is very concerning for mathematics in general.

          1: https://news.ycombinator.com/item?id=47775105

    • AlotOfReading 1 hour ago
      The argument is that a universal basis would be capable of solving arbitrary polynomial roots. The rest is an argument that the group constructed by eml is solveable, and hence not all the standard elementary functions.

      It wouldn't be a math discussion without people using at least two wildly different definitions.

    • DevelopingElk 2 hours ago
      His claim is that we exp-minus-log cannot compute the root of an arbitrary quintic. If you consider the root of an arbitrary quintic "elementary" the exp-minus-log can't represent all elementary functions.

      I think it really comes down to what set of functions you are calling "elementary".

      • throwanem 1 hour ago
        The author discusses this in his third paragraph, and states explicitly in his fourth that he considers the result faulty for its unrealistically narrow definition of elementarity.

        (I'm not a mathematician, so don't expect me to have an opinion as far as that goes. But the author also writes well in English, and that language we do share.)

        • bawolff 1 hour ago
          Well the author saysin that paragraph:

          > In layman’s terms, I do not consider the “Exp-Minus-Log” function to be the continuous analog of the Boolean NAND gate or the universal quantum CCNOT/CSWAP gates.

          But is there actually a combination of NANDs that find the roots of an arbitrary quintic? I always thought the answer was no but admittedly this is above my math level.

          • reikonomusha 1 hour ago
            Combinations of the NAND gate can express any Boolean function. The Toffoli (CCNOT) or Fredkin (CSWAP) can express any reversible Boolean function, which is important in quantum computing where all gates must be unitary (and therefore reversible). The posited analog is that EML would be the "universal operator" for continuous functions.
    • petters 2 hours ago
      Yes, that blog post could have been much shorter….
  • bawolff 1 hour ago
    > Elementary functions typically include arbitrary polynomial roots

    Admittedly this may be above my math level, but this just seems like a bad definition of elementary functions, given the context.

    • js8 16 minutes ago
      I would agree, it makes them anything but elementary. I am honestly not even sure if there is a finite constructible basis of the functions that can express any solution of single-variable integer polynomials.

      And for multivariate polynomials, the roots are uncomputable due to MRDP theorem.

  • aixpert 18 minutes ago
    tangenially it would be interesting to analyze what infinite structures can be represented with infinite EML trees
  • avmich 1 hour ago
    I'd really like more details on the terminology used.

    Also I'd be glad to see a specific example of a function, considered elementary, which is not representable by EML.

    It could be hard, and in any case, thanks for the article. I wish it would be more accessible to me.

    • markgall 1 hour ago
      I only skimmed the article, but I think the idea is to use some variation on:

      f(a,b,c,d,e) = the largest real solution x of the quintic equation x^5 + ax^4 + bx^3 + cx^2 + dx + e = 0

      There's not a simple formula for this function (which is the basic point), but certainly it is a function: you feed it five real numbers as input, and it spits out one number as output. The proof that you can't generate this function using the single one given looks like some fairly routine Galois theory.

      Whether this function is "considered elementary" depends on who you ask. Most people would not say this is elementary, but the author would like to redefine the term to include it, which would make the theorem not true anymore.

      Why any of this would shake the foundations of computer engineering I do not know.

      • avmich 1 hour ago
        I've thought something like that, but I'm interested more in details of the argument.

        As for why this could be important... we sometimes find new ways of solving old problems, when we formulate them in a different language. I remember how i was surprised to learn how representation of numbers as a tuple (ordered list of numbers), where each element is the remainder for mutually prime dividers - as many dividers as there are elements in the tuple - reduces the size of tables of division operation, and so the hardware which does the operation using thise tables may use significantly less memory. Here we might have some other interesting advantages.

      • teo_zero 1 hour ago
        I feel that saying that EML can't generate all the elementary functions because it can't express the solution of the quintic is like saying that NAND gates can't be the basis of modern computing because they can't be used to solve Turing's halting problem.
        • reikonomusha 55 minutes ago
          As is usual with these kinds of "structure theorems" (as they're often called), we need to precisely define what set of things we seek to express.

          A function which solves a quintic is reasonably ordinary. We can readily compute it to arbitrary precision using any number of methods, just as we can do with square roots or cosines. Not just the quintic, but any polynomial with rational coefficients can be solved. But the solutions can't be expressed with a finite number of draws from a small repertoire of functions like {+, -, *, /}.

          So the question is, does admitting a new function into our "repertoire" allow us to express new things? That's what a structure theorem might tell us.

          The blog post is exploring this question: Does a repertoire of just the EML function, which has been shown by the original author to be able to express a great variety of functions (like + or cosine or ...) also allow us to express polynomial roots?

      • Aardwolf 1 hour ago
        But can you even express this function with the elementary operator symbols, exp, log, power and trig functions? It seems to me like no, you can't express "largest real solution" with those (and what's the intended result for complex inputs?)

        At least eml can express the quintic itself, just like the above mentioned operators can

        • tliltocatl 57 minutes ago
          Author and EML are using different definitions of elementary functions, EML's definition being the school textbooks' one (polynomials, sin, exp, log, arcsin, arctan, closed under multiplication, division and composition). The author's definition I've never met before, it apparently includes some multi-valued functions, which are quite unusual.
          • reikonomusha 52 minutes ago
            Wikipedia says:

            > More generally, in modern mathematics, elementary functions comprise the set of functions previously enumerated, all algebraic functions (not often encountered by beginners), and all functions obtained by roots of a polynomial whose coefficients are elementary. [...] This list of elementary functions was originally set forth by Joseph Liouville in 1833.

            which seems to be what the blog post references.

  • zarzavat 1 hour ago
    This is a bit like invalidating a result based on 0^0 := 1 because you work in a field of mathematics where 0^0 is an indeterminate form. Not very interesting.

    AFAIU the original paper is a result in the field of symbolic regression. What definition of elementary function do they use?

  • renewiltord 1 hour ago
    If this is true, then this blog post debunking EML is going to up-end all of mathematics for the next century.