Conflict-Aware Fusion: Mitigating Logic Inertia in Large Language Models via Structured Cognitive Priors

Qiming Bao, Xiaoxuan Fu, Michael Witbrock

cs.AI Dec 6, 2025 · v1

Read PDF arXiv abstract Code

TL;DR

Uses the Lean 4 kernel as a deterministic verification oracle for reinforcement-learning reward in an LLM reasoning pipeline.

Abstract

Large language models (LLMs) achieve high accuracy on many reasoning benchmarks but remain brittle under structural perturbations of rule-based systems. We introduce a diagnostic framework with four stress tests -- redundant vs. essential rule deletion, contradictory-rule injection, logic-preserving rewrites, and multi-law stacking -- and use it to expose Logic Inertia: the tendency of generative LLMs (Qwen2/3, TinyLlama, GPT-4o, Gemma-3-4B-IT) and the encoder-only BERT baseline to persist along learned deductive trajectories under inconsistent premises. The collapse is sharp: untreated baselines fall from accuracy 1.00 on the base task to 0.00 on contradiction injection (instance-level exact match), and GPT-4o resolves only 56.0% of contradiction cases. We propose Conflict-Aware Fusion, a four-stage training pipeline that enforces verification-before-deduction as a learned structural prior: (i) SFT establishes the verification preamble; (ii) DPO sharpens the halt-on-contradiction decision boundary; (iii) Logical Invariance REgularisation (LIRE) penalises divergence between logically equivalent rule formulations via symmetric KL; (iv) Reinforcement Learning from Verification Feedback (RLVF) uses a symbolic forward-chaining engine as a deterministic oracle reward, jointly optimising invariance and sensitivity. The pipeline saturates all four primary stress tests for both 1.5B and 8B backbones. We further validate a Phase 2 extension that replaces the propositional oracle with a Lean 4 kernel, attaining 99.0% kernel agreement on the 105 classically-derivable (T) questions within a stratified 187-question Lean-translated sample (overall 71.7% across both polarities), providing a sound upgrade path to formally verified RL training. Code and benchmark: https://github.com/14H034160212/lemo

Problem

Large language models achieve high accuracy on standard reasoning benchmarks but remain brittle under structural perturbations of rule-based systems, persisting along learned deductive trajectories even when premises become inconsistent (a phenomenon the authors term Logic Inertia).

Approach

The authors introduce a diagnostic framework with four stress tests (redundant vs. essential rule deletion, contradictory-rule injection, logic-preserving rewrites, and multi-law stacking) to expose Logic Inertia. They propose Conflict-Aware Fusion, a four-stage optimization pipeline combining SFT, DPO, structure-aligned training (LIRE loss), and RLVF with Lean-verified rewards. The RLVF stage uses Lean 4 as the verifier to provide binary correctness signals for reinforcement learning.

Results

Untreated baselines fall from accuracy 1.00 on the base task to 0.00 on contradiction injection. GPT-4o resolves only 56.0% of contradiction cases. After the full Conflict-Aware Fusion pipeline (including RLVF with Lean verification), Qwen2-1.5B and Qwen3-8B maintain near-perfect performance across all four stress tests with accuracies of 1.000 on base, V2, V3, and V4-multi conditions.

Papers With

Conflict-Aware Fusion: Mitigating Logic Inertia in Large Language Models via Structured Cognitive Priors