Post by NVIDIA AI
Artificial Analysis just dropped a brand-new leaderboard, AA-Briefcase, for evaluating models on realistic tasks in complex projects. Nemotron 3 Ultra ranks among the top open models, with strong performance across a wide range of long-running agentic tasks, even when encountering them for the first time. https://nvda.ws/4vAFN6A