Ernest Ryu's Twitter Thread

This is really exciting and impressive, and this stuff is in my area of mathematics research (convex optimization). I have a nuanced take. 🧵 (1/9)

There are 3 proofs in discussion: v1. ( η ≤ 1/L, discovered by human ) v2. ( η ≤ 1.75/L, discovered by human ) v.GTP5 ( η ≤ 1.5/L, discovered by AI ) Sebastien argues that the v.GPT5 proof is impressive, even though it is weaker than the v2 proof. (2/9)

The proof itself is arguably not very difficult for an expert in convex optimization, if the problem is given. Knowing that the key inequality to use is [Nesterov Theorem 2.1.5], I could prove v2 in a few hours by searching through the set of relevant combinations. (3/9)

(And for reasons that I won’t elaborate here, the search for the proof is precisely a 6-dimensional search problem. The author of the v2 proof, Moslem Zamani, also knows this. I know Zamani’s work enough to know that he knows.) (4/9)

(In research, the key challenge is often in finding problems that are both interesting and solvable. This paper is an example of an interesting problem definition that admits a simple solution.) (5/9)

When proving bounds (inequalities) in math, there are 2 challenges: (i) Curating the correct set of base/ingredient inequalities. (This is the part that often requires more creativity.) (ii) Combining the set of base inequalities. (Calculations can be quite arduous.) (6/9)

In this problem, that [Nesterov Theorem 2.1.5] should be the key inequality to be used for (i) is known to those working in this subfield. (7/9)

So, the choice of base inequalities (i) is clear/known to me, ChatGPT, and Zamani. Having (i) figured out significantly simplifies this problem. The remaining step (ii) becomes mostly calculations. (8/9)

The proof is something an experienced PhD student could work out in a few hours. That GPT-5 can do it with just ~30 sec of human input is impressive and potentially very useful to the right user. However, GPT5 is by no means exceeding the capabilities of human experts. (9/9)

@ErnestRyu @grok is the GPT 5 proof a derivative of the human proof or completely novel?

@ErnestRyu Thank you for the analysis!

@ErnestRyu Can you let us know if gpt5 is useful in your research?

@ErnestRyu You are going to get flamed for this, but even people with modest mathematical sophistication could see that, while impressive, this new result was just a matter of search and permutation and not discovery of new mathematics.

@ErnestRyu we're still stuck in iid

@ErnestRyu OpenAI loves inflated claims

@ErnestRyu Your thread is creating a buzz! #TopUnroll https://threadreaderapp.com/th... 🙏🏼@Velowit for 🥇unroll

@ErnestRyu But why didn’t any human solved this before GPT?

@ErnestRyu Intriguing. I used to believe computers excel at Mathematical reasoning only via computerized algebra and the Schwartz–Zippel lemma, not by statistical reasoning via N-gram or GPT. Is SZ lemma applicable to convex optimization convergence proving? https://link.springer.com/book...

@ErnestRyu convex optimization is almost done research is just small twiks simplex was since 1947

@ErnestRyu Sir I don't think anybody claimed that GPT5-pro exceeds capabilities of human. Claim is that models are legit PhD level, and getting close to the point of ability to discover new science, this is foreshadowing of that. And LLMs are still scaling intelligence.

@ErnestRyu Thank you for explaining so clearly. Everything makes sense now 🙏

@ErnestRyu Broken clock brother

@ErnestRyu Thank you for the interesting insight and perspective @ErnestRyu

@ErnestRyu @threadreaderapp unroll

Share this thread

Read on Twitter

Navigate thread