🤖 OpenAI tested GPT-5, Claude Opus 4.1, and other models on 1,320 practical tasks across 44 professions, including developers, lawyers, financial consultants, sales managers, and doctors.
GPT-5 completed 40% of the tasks at or above human level. Claude Opus 4.1 reached 49%.