Claude Code Skills 2.0 adds evals and benchmark test sets; the changes aim to keep skills reliable as models are updated over time.
The new Mercury 2 AI model uses diffusion reasoning to generate 1,000 tokens per second, about 5x faster than Haiku; speed limits are ...