← mecheval / task / a1-pipe-01

Hollow pipe A · A1 · a1-pipe-01

primitives · boolean · pipe · hollow

Expected

Prompt

Make a hollow cylindrical pipe with outer diameter 30mm and inner diameter 20mm. Length 40mm, axis along Z, base on the XY plane (z = 0 to z = 40). The bore is a single through-hole on the central axis. Output a single solid.

Checks

0
valid_solid
{
  "type": "valid_solid"
}
1
bbox
{
  "type": "bbox",
  "min": [
    -15,
    -15,
    0
  ],
  "max": [
    15,
    15,
    40
  ],
  "tolerance_mm": 0.1
}
2
mass_props
{
  "type": "mass_props",
  "volume_mm3": 15707.96,
  "center_of_mass": [
    0,
    0,
    20
  ],
  "tolerance_pct": 1
}
3
hole_count
{
  "type": "hole_count",
  "diameter_mm": 20,
  "expected": 1,
  "diameter_tolerance_mm": 0.1
}
4
hole_positions
{
  "type": "hole_positions",
  "diameter_mm": 20,
  "positions": [
    [
      0,
      0,
      0
    ]
  ],
  "tolerance_mm": 0.15
}
5
step_roundtrip
{
  "type": "step_roundtrip",
  "tolerance_pct": 1
}

Anti-cheese

{
  "max_solid_count": 1
}

Limits

{
  "max_tokens": 20000,
  "max_wallclock_sec": 120,
  "max_tool_calls": 20
}

Recent attempts

Runs (35)

modelrun statusscorefirst failtokenswall
claude-mcp-claude-opus-4-7 20260611T171754Z-2295 PASS 1.00 393.5k 71.5s
claude-mcp-claude-opus-4-7 20260611T171744Z-15fc PASS 1.00 260.0k 48.0s
claude-mcp-claude-opus-4-7 20260611T171744Z-0e64 PASS 1.00 289.0k 51.0s
claude-mcp-claude-opus-4-7 20260611T171713Z-f4e5 PASS 1.00 164.4k 31.3s
claude-mcp-claude-opus-4-7 20260611T171713Z-678c PASS 1.00 235.9k 41.5s
openai-direct-gpt-5-mini 20260428T213138Z-12c7 PASS 1.00 1.1k 9.2s
openai-direct-gpt-5-mini 20260428T213129Z-e64e PASS 1.00 1.3k 9.2s
openai-direct-gpt-5-mini 20260428T213116Z-644c PASS 1.00 1.5k 12.1s
openai-direct-gpt-5-mini 20260428T213105Z-ac87 PASS 1.00 1.2k 10.8s
openai-direct-gpt-5-mini 20260428T213053Z-932c PASS 1.00 1.3k 12.0s
openai-direct-gpt-5 20260428T213012Z-a95c PASS 1.00 1.3k 12.1s
openai-direct-gpt-5 20260428T212943Z-e9ff PASS 1.00 1.2k 28.3s
openai-direct-gpt-5 20260428T212934Z-de22 PASS 1.00 1.2k 9.2s
openai-direct-gpt-5 20260428T212921Z-d353 PASS 1.00 1.3k 13.2s
openai-direct-gpt-5 20260428T212913Z-8e1c PASS 1.00 1.0k 7.7s
openai-direct-gpt-4o-mini 20260428T212632Z-8e8c fail 0.00 valid_solid · solid invalid 805 3.2s
openai-direct-gpt-4o-mini 20260428T212629Z-9270 fail 0.00 valid_solid · solid invalid 817 3.7s
openai-direct-gpt-4o-mini 20260428T212625Z-1b5f PASS 1.00 807 3.7s
openai-direct-gpt-4o-mini 20260428T212621Z-971c fail 0.00 valid_solid · solid invalid 805 3.5s
openai-direct-gpt-4o-mini 20260428T212618Z-9c33 fail 0.00 valid_solid · solid invalid 766 3.2s
claude-direct-claude-sonnet-4-6 20260428T211458Z-1ea1 PASS 1.00 846 3.0s
claude-direct-claude-sonnet-4-6 20260428T211454Z-4af9 PASS 1.00 846 3.3s
claude-direct-claude-sonnet-4-6 20260428T211451Z-8c54 PASS 1.00 846 3.0s
claude-direct-claude-sonnet-4-6 20260428T211448Z-a580 PASS 1.00 852 3.3s
claude-direct-claude-opus-4-7 20260428T211447Z-9eed PASS 1.00 1.1k 4.2s
claude-direct-claude-sonnet-4-6 20260428T211444Z-d58c PASS 1.00 846 3.0s
claude-direct-claude-opus-4-7 20260428T211444Z-9246 PASS 1.00 1.1k 3.0s
claude-direct-claude-opus-4-7 20260428T211440Z-3216 PASS 1.00 1.1k 4.0s
claude-direct-claude-opus-4-7 20260428T211437Z-121a PASS 1.00 1.1k 3.1s
claude-direct-claude-opus-4-7 20260428T211433Z-3d3a PASS 1.00 1.1k 3.2s
claude-direct-claude-haiku-4-5-20251001 20260428T211411Z-1951 fail 0.00 valid_solid · solid invalid 914 2.0s
claude-direct-claude-haiku-4-5-20251001 20260428T211409Z-0d37 PASS 1.00 910 1.7s
claude-direct-claude-haiku-4-5-20251001 20260428T211407Z-5313 fail 0.00 valid_solid · solid invalid 927 1.6s
claude-direct-claude-haiku-4-5-20251001 20260428T211405Z-7853 PASS 1.00 910 1.7s
claude-direct-claude-haiku-4-5-20251001 20260428T211403Z-21af fail 0.00 valid_solid · solid invalid 914 1.8s

generated 2026-06-17T03:16:07.193Z · static site, regenerate with npm run build -w @mecheval/leaderboard