Reasoning
The ability to think logically, solve problems, and draw valid conclusions from available information.
Reasoning is weighted at 25% in our capability assessment, the highest weight of any capability. It encompasses mathematical reasoning, logical deduction, causal reasoning, and abstract problem-solving. Current frontier models perform strongly on many reasoning benchmarks but still struggle with novel or highly complex reasoning tasks.
Related: AGI, Coding, Science
Agency
The capacity for autonomous action, planning, and goal-directed behavior in the world.
Agency is weighted at 20% in our assessment. It includes the ability to break down goals into subtasks, use tools, interact with external systems, and operate autonomously over extended periods. Agentic AI systems are a major focus of current research and raise significant safety considerations.
Related: AGI, AI Safety, Reasoning
Coding
The ability to write, understand, debug, and reason about computer code.
Coding is weighted at 15% in our assessment. Modern AI systems have achieved remarkable coding capabilities, often matching or exceeding human programmers on standard benchmarks. This capability is particularly significant because it could enable AI systems to improve themselves or build other AI systems.
Related: Reasoning, Agency, AGI
Multimodal
The ability to process and generate content across multiple modalities, such as text, images, audio, and video.
Multimodal capability is weighted at 10% in our assessment. Modern frontier models increasingly handle multiple input and output modalities, moving beyond text-only systems to understand images, generate audio, and work with video content.
Related: Foundation Model, Language, AGI
Science
The ability to understand scientific concepts, generate hypotheses, and contribute to research.
Scientific capability is weighted at 10% in our assessment. This includes understanding research papers, generating novel hypotheses, designing experiments, and potentially accelerating scientific discovery. AI systems are increasingly being used as research assistants and may eventually make independent scientific contributions.
Related: Reasoning, AGI, Coding
Robotics
The ability to perceive, understand, and interact with the physical world through embodied systems.
Robotics capability is weighted at 10% in our assessment. While AI has made remarkable progress in digital domains, physical-world interaction remains challenging. Advances in robotics could enable AI systems to directly manipulate the physical world, with significant implications for automation and AI safety.
Related: Agency, Multimodal, AGI
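As a rough illustration of how the weights stated in the entries above might combine into one number, the sketch below computes a weighted average of per-capability scores. It is not the assessment's actual method. The 0-to-1 score scale, the name weighted_score, and the example scores are all assumptions made for illustration. The six weights listed here total 90%, so the sketch normalizes by the weights of whichever capabilities are supplied and makes no guess about how the remaining 10% is allocated.

```python
# Weights as stated in the entries above. They total 0.90 in this section;
# how the remaining 0.10 is allocated is not specified here.
WEIGHTS = {
    "reasoning": 0.25,
    "agency": 0.20,
    "coding": 0.15,
    "multimodal": 0.10,
    "science": 0.10,
    "robotics": 0.10,
}


def weighted_score(scores):
    """Return the weighted average of per-capability scores.

    `scores` maps capability names to values on an assumed 0-to-1 scale.
    The result is normalized by the total weight of the capabilities
    supplied, so a partial set of scores still yields a value in [0, 1].
    """
    unknown = set(scores) - set(WEIGHTS)
    if unknown:
        raise KeyError(f"no weight defined for: {sorted(unknown)}")
    total_weight = sum(WEIGHTS[name] for name in scores)
    if total_weight == 0:
        raise ValueError("at least one capability score is required")
    weighted_sum = sum(WEIGHTS[name] * value for name, value in scores.items())
    return weighted_sum / total_weight


# Hypothetical scores, made up for illustration only.
example_scores = {"reasoning": 0.8, "agency": 0.5, "coding": 0.9, "robotics": 0.2}
print(round(weighted_score(example_scores), 3))
```

Normalizing by the supplied weights keeps the result comparable when only some capabilities have been scored. A fixed denominator of 1.0 would instead treat every missing capability as a score of zero.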