Evidence

Delta Analysis

Blind evaluation by GPT-5 across 30 analytical dimensions. Configured output, using inference-time cognitive configuration, against default output from the same model on the same prompt.

Overall Score

9.2vs7.8

Average across all dimensions

Semantic Density

0.106vs0.055

Concepts per word

Meta-Reasoning

+8point advantage

On meta-cognitive execution

Dimension-by-Dimension Comparison

Selected dimensions. Full 30-dimension matrix in the delta analysis.

DimensionConfiguredDefaultDelta

Analytical Depth9.57.2+2.3

Structural Coherence9.38.0+1.3

Novel Insight Generation9.46.8+2.6

Meta-Cognitive Awareness9.65.5+4.1

Constraint Satisfaction9.17.9+1.2

Semantic Density9.07.0+2.0

Epistemic Precision9.37.5+1.8

Framework Originality9.56.2+3.3

Counter-Argument Handling9.27.8+1.4

Practical Actionability8.88.1+0.7

Reasoning Transparency9.47.3+2.1

Information Prioritization9.66.9+2.7

Semantic Density Comparison

Configured Output0.106

Default Output0.055

Unit: concepts per word. Higher = more information per token.

See the full evidence

The raw responses and the blind judge analysis

The full response from a standard, fresh Gemini 3 Deep Think instance. The full response from Gemini 3 Deep Think running the complete NovaThink Cognitive Seed stack of eight meta-cognitive priors that govern global reasoning. And the full delta analysis from a GPT-5 instance acting as a blind judge with no knowledge of which output came from which configuration.

Want to see how your own AI outputs compare?

Try The Inference Auditor

Delta Analysis

Dimension-by-Dimension Comparison

Semantic Density Comparison

The raw responses and the blind judge analysis

A fresh, unconfigured Gemini 3 Deep Think instance answers the synthesis prompt.

Gemini 3 Deep Think with the full NovaThink Cognitive Seed stack answers the same prompt.

A blind GPT-5 instance compares both outputs across 30 analytical dimensions.