News
Researchers tested OpenAI’s GPT-4 Code Interpreter on the challenging MATH benchmark and achieved a new state-of-the-art accuracy of 69.7 percent, far surpassing GPT-4’s 42.2 percent.