It showed improved performance on all of them, even exceeding OpenAI’s previously most advanced model on the third-party MATH (word-problem solving) benchmark of 12,500 questions covering ...
Researchers also gave children a math test and found that children who scored higher tended to have parents who talked about math more during the observation period.