← All papers
First page of AXLE: A Cloud Infrastructure for Lean 4 Theorem Proving Utilities

AXLE: A Cloud Infrastructure for Lean 4 Theorem Proving Utilities

Jimmy Xin, Alex Schneidman, Chris Cummins, Karun Ram, Srihari Ganesh, Jannis Limperg

cs.LO Jun 24, 2026 · v1 cs.AI
Presents AXLE, a scalable cloud service providing 14 Lean 4 metaprogramming tools for proof verification, manipulation, and extraction.
We present AXLE (Axiom Lean Engine), a cloud service for Lean 4 proof manipulation, extraction, and verification. Recent progress in AI for mathematics -- reinforcement learning pipelines, agentic proving workflows, dataset curation -- demands Lean 4 tooling that scales to millions of requests while remaining correct and robust; existing infrastructure offers parallel compilation but not scalable proof verification, higher-level proof manipulation, multi-version support, or per-request isolation at the throughput modern AI workflows require. AXLE provides 14 Lean 4 metaprogramming tools spanning strict proof verification, declaration metadata extraction, semantic source manipulation, deterministic proof repair and simplification, and lemma extraction. The service runs as a multi-tenant cloud deployment with per-request isolation and concurrent support for multiple Lean 4 and Mathlib versions, accessible via a Python SDK, command-line interface, web UI, MCP server, and raw HTTP API. AXLE is publicly available and free to use at https://axle.axiommath.ai and via the axiom-axle PyPI package, with no local Lean 4 installation required. It has served over 500 million requests to date and is the underlying infrastructure for Axiom Math's proving efforts, including its 12/12 score on the 2025 Putnam competition.

AI-driven Lean 4 workflows (RL training, agentic proving, dataset curation) need fast, correct, and scalable tooling with per-request isolation and multi-version support, which existing infrastructure does not provide.

AXLE implements 14 Lean 4 metaprogramming tools (strict verification, metadata extraction, proof repair, simplification, lemma extraction) served from a multi-tenant cloud deployment. Each request runs in isolation with concurrent support for multiple Lean/Mathlib versions. Access is via Python SDK, CLI, web UI, MCP server, or HTTP API.

Median verify_proof latency of 0.97s, 7.27x near-linear scaling to 8 concurrent requests, 100% agreement with alternative strict checkers, and over 500 million requests served. Underlies Axiom Math's 12/12 Putnam 2025 result.

systemmedian latencythroughputtotal wallsuccess
Axle verify_proof0.97 s0.43 req/s9,205 s100.0%
SafeVerify10.1 s0.107 req/s37,258 s99.9%
Comparator95.7 s0.026 req/s151,267 s93.8%
Strict-checker comparison