feat: causal diff engine — reverse-engineer reasoning from weight-space by AdaWorldAPI · Pull Request #56 · AdaWorldAPI/ndarray

AdaWorldAPI · 2026-03-30T08:19:47Z

What

New causal_diff.rs module + CompressedTensor::read_from deserializer.

Diff two bgz7 model indexes by matching tensor names, comparing per-row Base17 fingerprints, and emitting NARS-truthed causal edges for every row that structurally shifted.

Pipeline

base.bgz7 ──┐
            ├─→ causal_diff(threshold=100) ─→ Vec<WeightEdge>
dist.bgz7 ──┘                                    │
                                                  ├─→ cluster_by_head()
                                                  ├─→ find_reasoning_scaffold()
                                                  └─→ revise_across_diffs()

Key types

WeightEdge: tensor_name + row + block + projection(Q/K/V/O/Gate/FFN) + verb(BECOMES/SUPPORTS) + L1 + NarsTruth
DiffStats: per-projection shift counts + mean L1
classify_projection(): maps tensor names to Q/K/V/O/Gate/FFN/Embedding
find_reasoning_scaffold(): blocks where Q+O shifted but K stayed stable
revise_across_diffs(): NARS revision integrating evidence from multiple model pairs

Integration test

test_full_reasoning_reverse_eng indexes 5 Qwen3.5 Q8_0 models (105 GB total), diffs 4 pairs, finds reasoning scaffold, revises truth across all:

Pair	What it reveals
base 27B → distilled v1	What Claude reasoning looks like in weight space
base 27B → distilled v2	What improved distillation adds
v1 → v2	Iteration delta
base 9B → distilled 9B	Scale-invariant reasoning atoms

Files

src/hpc/causal_diff.rs — new (319 lines)
src/hpc/gguf_indexer.rs — added CompressedTensor::read_from + read_bgz7_file
src/hpc/mod.rs — registered causal_diff

Why

First real observed evidence for the NARS stack. Every truth value so far was manufactured by construction. This pipeline generates truth values from actual weight-space transformations between a base model and its reasoning-distilled variant. The NARS stack goes from theoretical to empirical.

Diff two bgz7 indexes by tensor name + row, emit CausalEdges with NARS truth values. Classifies edges by attention projection (Q/K/V/O/Gate/FFN), clusters by block, finds reasoning scaffold circuits (Q+O shift, K stable). Includes NARS revision across multiple diff runs for integrated evidence about structural reasoning patterns. Test: test_full_reasoning_reverse_eng indexes 5 Qwen3.5 Q8_0 models (105 GB), diffs 4 pairs, revises across all.

Deserializer for bgz7 files — reads back the per-tensor Base17 rows written by stream_index_gguf/stream_index_gguf_bf16. Required by causal_diff to load and compare indexed models.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 11fffd123c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-30T08:22:43Z

+    let n_tensors = u32::from_le_bytes(u32_buf) as usize;
+
+    let mut tensors = Vec::with_capacity(n_tensors);
+    for _ in 0..n_tensors {
+        tensors.push(CompressedTensor::read_from(&mut reader)?);


Align BGZ7 tensor count with serialized entries

read_bgz7_file trusts the header tensor count and unconditionally reads that many CompressedTensor records, but this repository’s writers (stream_index_gguf / stream_index_gguf_bf16) write gguf.tensors.len() before skipping Norm/Skip tensors, so many .bgz7 files contain fewer records than the header claims. In that common case, this loop will hit EOF and causal_diff fails even on valid indexes produced by our own indexer; either the writer must store indexed count or the reader must tolerate shorter payloads.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-30T08:22:43Z

+            let key = (block, format!("{:?}", e.projection));
+            let entry = clusters.entry(key).or_insert((0, 0, 0));
+            entry.0 += 1;
+            entry.1 = entry.1.max(e.row_idx + 1);


Compute shift ratios from real row totals

This uses max(shifted_row_idx)+1 as total_rows, but cluster_by_head only receives shifted edges, so sparse shifts undercount the true denominator and inflate q_pct/k_pct/o_pct in find_reasoning_scaffold. For example, if only early rows shift, percentages can cross shift_threshold even when most rows are stable, producing false scaffold detections.

Useful? React with 👍 / 👎.

AdaWorldAPI added 3 commits March 30, 2026 10:18

mod: register causal_diff module

3769a31

feat: CompressedTensor::read_from + read_bgz7_file

11fffd1

Deserializer for bgz7 files — reads back the per-tensor Base17 rows written by stream_index_gguf/stream_index_gguf_bf16. Required by causal_diff to load and compare indexed models.

AdaWorldAPI merged commit 85f8d48 into master Mar 30, 2026
6 of 15 checks passed

AdaWorldAPI deleted the claude/causal-diff-engine branch March 30, 2026 08:22

chatgpt-codex-connector Bot reviewed Mar 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: causal diff engine — reverse-engineer reasoning from weight-space#56

feat: causal diff engine — reverse-engineer reasoning from weight-space#56
AdaWorldAPI merged 3 commits into
masterfrom
claude/causal-diff-engine

AdaWorldAPI commented Mar 30, 2026

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Mar 30, 2026

Uh oh!

chatgpt-codex-connector Bot Mar 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

AdaWorldAPI commented Mar 30, 2026

What

Pipeline

Key types

Integration test

Files

Why

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Mar 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant