SRMED
METR benchmark shows AI models now complete tasks requiring 14 hours of human labor.