(Auto)formalization is supposed to be easy: Trellis process semantics for spelling out rigorous proofs

Wesley Pegden

cs.AI Jun 8, 2026 · v1

autoformalization ai-agents theorem-proving-ml combinatorics

Read PDF arXiv abstract

TL;DR

Trellis is an LLM-agent autoformalization system that produces end-to-end Lean proofs via a process-semantics-constrained workflow.

Abstract

We present Trellis: an autoformalization system that leverages LLM agents in a deterministically constrained workflow to enforce incremental progress in Lean autoformalization tasks through iterative refinement of natural language proofs. Our approach is motivated by the common mathematician's notion of what it means to have a rigorous proof in the first place: namely, that it would be routine to elaborate any part of the proof in further detail. The result is a system which aims to achieve reliable autoformalization on a modest budget and with generalist agents, with specialization to autoformalization coming not from any task-specific agent training but instead from a meaning-of-rigor inspired workflow enforced by process semantics. We link to an end-to-end Lean formalization of a recent Ramsey theory breakthrough produced by the process.

Problem

Autoformalization of mathematical proofs remains unreliable, and existing systems often require expensive task-specific training or large compute budgets. The core difficulty is ensuring that an LLM agent makes incremental progress rather than getting stuck or producing invalid output.

Approach

Trellis uses LLM agents constrained by a deterministic workflow (process semantics) that enforces incremental progress in Lean autoformalization. The system iteratively refines natural language proofs, guided by the mathematical principle that a rigorous proof should be routine to elaborate at any point. Specialization comes from the workflow structure rather than task-specific fine-tuning, allowing generalist agents to perform autoformalization on a modest budget.

Results

The system produced an end-to-end Lean formalization of a recent Ramsey theory breakthrough. The authors report that reliable autoformalization is achievable with generalist agents when constrained by meaning-of-rigor inspired process semantics.

Papers With

(Auto)formalization is supposed to be easy: Trellis process semantics for spelling out rigorous proofs