Performance analysis based on user votes in model battles.
Total Votes Analyzed: 64
| Model Name | Wins | Appearances | Win Rate | Avg. Duration (s) | Avg. Cost ($) | Error Rate |
|---|---|---|---|---|---|---|
| anthropic/claude-sonnet-4.5 | 5 | 6 | 83.33% | 30.79 | $0.05064 | 66.67% |
| openai/gpt-5 | 18 | 30 | 60.00% | 54.08 | $0.05085 | 40.91% |
| anthropic/claude-opus-4.1 | 6 | 10 | 60.00% | 40.15 | $0.05620 | 16.67% |
| x-ai/grok-4-fast:free | 6 | 10 | 60.00% | 25.69 | $0.04481 | 50.00% |
| gemini-2.5-pro-preview-05-06 | 13 | 28 | 46.43% | 36.96 | $0.07252 | 40.91% |
| anthropic/claude-sonnet-4 | 13 | 34 | 38.24% | 24.42 | $0.04188 | 22.73% |
| openai/gpt-5-codex | 3 | 10 | 30.00% | 50.54 | $0.06113 | 50.00% |
| System Prompt Version | Wins | Appearances | Win Rate |
|---|---|---|---|
| v1.0.0 | 64 | 128 | 50.00% |