The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling

Original Source

ArXiv AI (cs.AI)

by Al Kari

Read Full Article

arXiv:2605.28864v1 Announce Type: new Abstract: The Cognitive Categorical Transformer (CCT) is a 306M-parameter architecture that augments a pretrained GPT-2 Small backbone with cognitively grounded components derived from category theory and several inspirations from cognitive science. Under a matched-step protocol (215,000 optimizer steps, matched data, matched optimizer and schedule) on WikiText-103, CCT reaches 21.27 validation perplexity, compared with 24.19 for an identically fine-tuned GPT-2 Small baseline. The architecture therefore contributes a 2.92 PPL (12% relative) reduction beyond what in-domain fine-tuning alone provides. A retrain-from-scratch ablation that holds GT-Full simplicial message passing bypassed across the entire seven-phase activation schedule reaches 23.72 PPL, localizing 84% of the architectural improvement (2.45 of 2.92 PPL) to GT-Full. We present the first ablation-validated evidence that simplicial message passing improves language-model perplexity at the 306M-parameter scale on WikiText-103. Published GPT-2 Large reaches 22.05 zero-shot PPL on WikiText-103 with 6.2x more parameters than GPT-2 Small; this paper treats that number as an external published reference, not as the architectural benchmark. Three negative results on consistency-style categorical priors (sheaf smoothing, adjunction round-trip, curvature regularization) and the joint structural-prior result for GT-Full and PrecisionWeightedPP together support an empirical pattern termed the *structure/consistency distinction*, in which categorical priors that add new topology improve language modeling and those that enforce a consistency identity do not.

Tags:GPTAI

Original Content Credit

This summary is sourced from ArXiv AI (cs.AI). For the complete article with full details, research data, and author insights, please visit the original source.

Visit ArXiv AI (cs.AI)

What happens when companies become too AI-pilled?

TechCrunch AI

Industry News1m

What happens when companies become too AI-pilled?

The people deciding that AI can replace your job are also the ones least likely to understand what your job truly involves, according to Box founder Aaron Levie, who pointed to this as an example of “AI psychosis.” Indeed, ClickUp recently cut 22% of its workforce for AI ag

May 29, 2026

After Nvidia’s $20B not-aqui-hire, AI chip startup Groq reportedly raising $650M

TechCrunch AI

Business AI1m

After Nvidia’s $20B not-aqui-hire, AI chip startup Groq reportedly raising $650M

Chipmaker Groq is looking to raise $650 million in internal funding as it pivots from hardware to focus more on AI inference, the process of refining the way AI models respond to prompted requests, per Axios.

May 29, 2026

We Asked the ‘Future of Truth’ Author to Explain How He Used AI. It Didn’t Go Well

Wired AI

Industry News1m

We Asked the ‘Future of Truth’ Author to Explain How He Used AI. It Didn’t Go Well

A book about how AI shapes perceptions of reality came under fire for using AI-generated quotes. Its problems go beyond that.

May 29, 2026

The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling

Related Articles

What happens when companies become too AI-pilled?

After Nvidia&#8217;s $20B not-aqui-hire, AI chip startup Groq reportedly raising $650M

We Asked the ‘Future of Truth’ Author to Explain How He Used AI. It Didn’t Go Well

After Nvidia’s $20B not-aqui-hire, AI chip startup Groq reportedly raising $650M