return to table of content
30% drop in O1-preview accuracy when Putnam problems are slightly variated