← mecheval / task / a1-sphere-01

Centered sphere A · A1 · a1-sphere-01

primitives · sphere · sanity

Expected

Prompt

Make a sphere of diameter 30mm (radius 15mm) centered at the origin. Output a single solid.

Checks

0
valid_solid
{
  "type": "valid_solid"
}
1
bbox
{
  "type": "bbox",
  "min": [
    -15,
    -15,
    -15
  ],
  "max": [
    15,
    15,
    15
  ],
  "tolerance_mm": 0.1
}
2
mass_props
{
  "type": "mass_props",
  "volume_mm3": 14137.17,
  "center_of_mass": [
    0,
    0,
    0
  ],
  "tolerance_pct": 2
}
3
step_roundtrip
{
  "type": "step_roundtrip",
  "tolerance_pct": 2
}

Anti-cheese

{
  "max_solid_count": 1
}

Limits

{
  "max_tokens": 20000,
  "max_wallclock_sec": 120,
  "max_tool_calls": 20
}

Recent attempts

Runs (40)

modelrun statusscorefirst failtokenswall
claude-mcp-claude-opus-4-7 20260611T172137Z-8476 PASS 1.00 45.4k 7.8s
claude-mcp-claude-opus-4-7 20260611T172129Z-5732 PASS 1.00 45.4k 7.3s
claude-mcp-claude-opus-4-7 20260611T172120Z-31ac PASS 1.00 45.4k 9.0s
claude-mcp-claude-opus-4-7 20260611T172113Z-8805 PASS 1.00 45.4k 7.6s
claude-mcp-claude-opus-4-7 20260611T172104Z-3401 PASS 1.00 45.4k 8.1s
openai-direct-gpt-5-mini 20260428T212908Z-8ba2 PASS 1.00 917 6.6s
openai-direct-gpt-5-mini 20260428T212903Z-6914 PASS 1.00 789 4.9s
openai-direct-gpt-5-mini 20260428T212856Z-9ffd PASS 1.00 917 6.4s
openai-direct-gpt-5-mini 20260428T212850Z-ddc0 PASS 1.00 856 5.6s
openai-direct-gpt-5-mini 20260428T212844Z-d9f4 PASS 1.00 917 6.5s
openai-direct-gpt-5 20260428T212739Z-046a PASS 1.00 853 6.2s
openai-direct-gpt-5 20260428T212734Z-a064 PASS 1.00 920 5.3s
openai-direct-gpt-5 20260428T212728Z-03bb PASS 1.00 908 6.1s
openai-direct-gpt-5 20260428T212723Z-4f50 PASS 1.00 856 5.1s
openai-direct-gpt-5 20260428T212717Z-650a PASS 1.00 901 5.2s
openai-direct-gpt-4o-mini 20260428T212548Z-faa7 fail 0.00 valid_solid · solid invalid 653 2.3s
openai-direct-gpt-4o-mini 20260428T212546Z-6860 PASS 1.00 649 1.9s
openai-direct-gpt-4o-mini 20260428T212543Z-6b4c fail 0.00 valid_solid · solid invalid 653 3.0s
openai-direct-gpt-4o-mini 20260428T212541Z-344a PASS 1.00 636 2.0s
openai-direct-gpt-4o-mini 20260428T212539Z-ae93 PASS 1.00 653 2.1s
claude-direct-claude-sonnet-4-6 20260428T211411Z-d412 PASS 1.00 736 2.5s
claude-direct-claude-sonnet-4-6 20260428T211409Z-4abd PASS 1.00 714 2.4s
claude-direct-claude-sonnet-4-6 20260428T211406Z-4358 PASS 1.00 715 2.4s
claude-direct-claude-sonnet-4-6 20260428T211404Z-7d05 PASS 1.00 715 2.3s
claude-direct-claude-sonnet-4-6 20260428T211402Z-4882 PASS 1.00 736 2.4s
claude-direct-claude-opus-4-7 20260428T211401Z-07af PASS 1.00 910 2.1s
claude-direct-claude-opus-4-7 20260428T211358Z-6de9 PASS 1.00 911 2.7s
claude-direct-claude-opus-4-7 20260428T211356Z-302b PASS 1.00 910 2.3s
claude-direct-claude-opus-4-7 20260428T211353Z-56c7 PASS 1.00 910 2.1s
claude-direct-claude-opus-4-7 20260428T211351Z-bb1a PASS 1.00 912 2.8s
claude-direct-claude-haiku-4-5-20251001 20260428T211347Z-dc35 fail 0.00 valid_solid · solid invalid 735 1.6s
claude-direct-claude-haiku-4-5-20251001 20260428T211346Z-3258 fail 0.00 valid_solid · solid invalid 734 1.3s
claude-direct-claude-haiku-4-5-20251001 20260428T211345Z-9e55 fail 0.00 valid_solid · solid invalid 735 1.0s
claude-direct-claude-haiku-4-5-20251001 20260428T211344Z-91f2 fail 0.00 valid_solid · solid invalid 734 1.0s
claude-direct-claude-haiku-4-5-20251001 20260428T211342Z-2ffc fail 0.00 valid_solid · solid invalid 735 1.2s
claude-direct-claude-sonnet-4-6 20260428T211221Z-3d63 PASS 1.00 726 2.3s
claude-direct-claude-sonnet-4-6 20260428T211219Z-60d5 PASS 1.00 726 2.3s
claude-direct-claude-sonnet-4-6 20260428T211217Z-4141 PASS 1.00 715 2.3s
claude-direct-claude-sonnet-4-6 20260428T211214Z-c11b PASS 1.00 714 2.3s
claude-direct-claude-sonnet-4-6 20260428T211212Z-7e5a PASS 1.00 715 2.2s

generated 2026-06-17T03:16:07.197Z · static site, regenerate with npm run build -w @mecheval/leaderboard