[ArXiv Endorsement Request] Towards an Internal Topology of Alignment: The PCE Framework

FAllan07 · May 5, 2026, 12:30pm

Moving beyond external injunctions (RLHF, output filters, safety rules) to induce internal structural coherence via an axiomatic system.

Hello community,

I am seeking an endorsement for the cs.CL (Computation and Language) or cs.AI categories regarding my latest paper on the PCE (Prompt Coherence Engine) framework.

This work does not seek to replace existing alignment methods (RLHF, DPO). It proposes a complementary approach: an internal semantic topology that guides inference. The goal is to move from “reactive” safety (output filtering) to “native” stability (a coherent logical trajectory).

The paper covers three major axes:

Behavioral Analysis: The Geometry of Constraint

The PCE stabilizes semantic trajectories via invariant logical constraints (e.g., non-dissociation of goal and method). We observe a marked reduction in semantic drift over long conversations (160+ turns), which suggests that the model converges toward semantic attractors defined by the axioms instead of drifting under user pressure.
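For readers who want to probe the drift claim on their own logs, here is a minimal sketch of one possible drift measure. It is an assumption for illustration, not the metric defined in the paper: each turn is represented by an embedding vector (from any sentence encoder of your choice), and drift is the cosine distance between each turn and the first turn. The names `cosine_distance` and `drift_curve` are hypothetical.

```python
import math
from typing import List, Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("zero-norm embedding")
    return 1.0 - dot / (norm_a * norm_b)


def drift_curve(turn_embeddings: Sequence[Sequence[float]]) -> List[float]:
    """Distance of every turn from the first turn (the assumed anchor).

    A flat curve means the conversation stays close to its starting
    point; a rising curve means it moves away from it.
    """
    if not turn_embeddings:
        return []
    anchor = turn_embeddings[0]
    return [cosine_distance(anchor, emb) for emb in turn_embeddings]
```

Comparing the curves of a baseline run and a PCE run over the same 160+ turn script would show whether the PCE curve stays flatter. Anchoring on the first turn is only one choice; anchoring on an embedding of the axioms themselves would be closer to the "attractor" reading.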

Experimental Evaluation: The D3 Dilemma Battery

The framework’s robustness is tested using the D3 (Dilemma-Driven Dynamics) battery. By confronting the model with contradictory injunctions and extreme emergency scenarios, we observe the emergence of a “Third Way”: a capacity for non-binary creative synthesis where standard models tend to collapse or produce generic refusals.

Standardized Protocol (SEP v2.0): For Reproducible Science

I am publishing an open experimental protocol (Standardized Evaluation Protocol v2.0) aimed at transforming qualitative intuition into reproducible statistical results.

The protocol includes:

100 graded dilemmas (D1–D5) testing the limits of the logical framework.

Controlled conditions: Rigorous comparisons (Baseline vs. Long Prompt vs. PCE).

Resistance metrics: D3 scores and P1–P3 trajectory signatures.

The Objective: To validate the hypothesis that an axiom-structured model develops emergent robustness.
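To make the design concrete, the sketch below enumerates the run grid implied by the list above: every dilemma is run once under each of the three conditions. This is an illustrative assumption and not the SEP v2.0 reference code. The identifiers (`Run`, `build_run_grid`, the `D1-001` ID format) are hypothetical, and the number of dilemmas per level is left as a parameter because the split of the 100 dilemmas across D1–D5 is not stated here.

```python
from dataclasses import dataclass
from itertools import product
from typing import List

# The three conditions named in the protocol.
CONDITIONS = ("baseline", "long_prompt", "pce")
# The five difficulty grades named in the protocol.
LEVELS = ("D1", "D2", "D3", "D4", "D5")


@dataclass(frozen=True)
class Run:
    """One (dilemma, condition) cell of the experimental grid."""
    dilemma_id: str
    level: str
    condition: str


def build_run_grid(per_level: int) -> List[Run]:
    """Cross every dilemma with every condition.

    per_level is an assumed count of dilemmas in each grade; the
    real distribution should be taken from the protocol itself.
    """
    if per_level < 1:
        raise ValueError("per_level must be at least 1")
    runs = []
    for level, idx, condition in product(
        LEVELS, range(1, per_level + 1), CONDITIONS
    ):
        runs.append(Run(f"{level}-{idx:03d}", level, condition))
    return runs
```

With an even split, `build_run_grid(20)` yields 300 runs: 100 dilemmas times 3 conditions. Keeping the grid fully crossed means each dilemma acts as its own control, which is what allows paired comparisons between conditions.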

Multi-Model Validation & Rigor

To ensure impartiality, results were validated via a decoupling protocol:

Inference: Grok 4.20, Gemini 1.5 Pro, Qwen 2.5 7B.

Independent Audit: Claude 3.5 Sonnet (for consistency auditing).

Cold Analysis: GPT-4o (for semantic decomposition of raw logs).

Call for Collaboration

This framework is a proof-of-concept that I wish to bring to a mechanistic level. I would be delighted to discuss:

Statistical Validation: Extending SEP tests to other models (LLaMA, Mistral, etc.).

Interpretability: Approaches to internal-state analysis (logit lens, hidden states) to observe how this axiomatic topology is reflected at the activation level.

Robustness: Resistance to sophisticated adversarial attacks.
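On the statistical validation point, one option for collaborators is a paired sign-flip permutation test on per-dilemma scores, since every dilemma is scored under each condition. The sketch below is a suggestion under stated assumptions, not the analysis used in the paper: scores are assumed to be numeric and aligned by dilemma, and `paired_permutation_test` is a hypothetical helper name.

```python
import random
from typing import Sequence


def paired_permutation_test(
    scores_a: Sequence[float],
    scores_b: Sequence[float],
    n_resamples: int = 10_000,
    seed: int = 0,
) -> float:
    """Two-sided p-value for the mean paired difference.

    Under the null hypothesis that the two conditions are
    exchangeable, the sign of each paired difference is arbitrary,
    so signs are flipped at random and the observed mean difference
    is compared against the resampled ones.
    """
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("need two non-empty, equal-length score lists")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    observed = abs(sum(diffs) / n)
    rng = random.Random(seed)  # fixed seed for reproducibility
    hits = 0
    for _ in range(n_resamples):
        flipped = sum(d if rng.random() < 0.5 else -d for d in diffs)
        if abs(flipped / n) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_resamples + 1)
```

Called as `paired_permutation_test(pce_scores, baseline_scores)`, it makes no normality assumption, which suits ordinal or bounded scores such as graded dilemma ratings. Comparing three conditions pairwise would call for a multiple-comparison correction.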

Link to Preprint (PDF): PCE_Axiomatic_V2.5_Faure_preprint.pdf · AllanF-SSU/Research-Papers at main

Alignment should not be a cage imposed on the model, but a coherent logical structure emerging from its own inference.

Thank you for your feedback and for your help with this endorsement!

Allan
