Post by NVIDIA AI

1,909,816 followers

Artificial Analysis just dropped a brand new leaderboard called AA-Briefcase for evaluating realistic tasks in complex projects. Nemotron 3 Ultra ranks among the top open models, with strong performance across a wide range of long-running agentic tasks, even when encountering them for the first time. πŸ”— https://nvda.ws/4vAFN6A

Post content