Claude/transcode deepnsm rust o na1 z#51

Merged
AdaWorldAPI merged 2 commits into master from claude/transcode-deepnsm-rust-oNa1Z
Mar 30, 2026

Conversation

@AdaWorldAPI
Owner

No description provided.

claude added 2 commits March 30, 2026 06:58
BF16 optimizations for Maverick-scale (801 GB):
- BF16-direct: skip f32 Vec allocation (saves 283 MB per tensor)
- Strided octave + halftone: 97% fewer BF16→f64 conversions (stride=16)
- Reusable u16 buffer: one alloc for entire shard
- stream_index_gguf_bf16(): fast path for BF16, fallback for other dtypes
- 4 new tests: halftone coverage, bf16 accuracy, stride agreement, f32 parity
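The first three bullets can be illustrated with a minimal sketch. This is not the code from this PR: the function names `bf16_to_f64`, `fill_u16_buffer`, and `strided_energy` are hypothetical, the sum-of-squares statistic is a stand-in for whatever the octave pass actually computes, and the halftone pattern is not reproduced. It only shows the general idea of reading BF16 bits directly, reusing one `u16` buffer, and converting every `stride`-th element instead of all of them.

```rust
/// Widen one BF16 value (raw u16 bits) to f64.
/// BF16 is the upper 16 bits of an IEEE-754 f32, so shifting the bits
/// into the high half of a u32 reconstructs the value exactly.
fn bf16_to_f64(bits: u16) -> f64 {
    f32::from_bits((bits as u32) << 16) as f64
}

/// Decode little-endian BF16 bytes into a caller-owned buffer.
/// `clear()` keeps the existing capacity, so after the first (largest)
/// tensor the buffer is reused without reallocating.
/// Assumption: the raw tensor bytes are little-endian, 2 bytes per element.
fn fill_u16_buffer(raw: &[u8], buf: &mut Vec<u16>) {
    buf.clear();
    buf.extend(raw.chunks_exact(2).map(|b| u16::from_le_bytes([b[0], b[1]])));
}

/// Hypothetical strided statistic: sum of squares over every `stride`-th
/// element. Returns (sum, number of samples converted).
/// `stride` must be greater than zero (`step_by(0)` panics).
fn strided_energy(buf: &[u16], stride: usize) -> (f64, usize) {
    let mut sum = 0.0;
    let mut samples = 0;
    for &bits in buf.iter().step_by(stride) {
        let v = bf16_to_f64(bits);
        sum += v * v;
        samples += 1;
    }
    (sum, samples)
}
```

In this sketch no intermediate `Vec<f32>` is ever built: values are widened one at a time, only at the sampled positions. A plain stride of 16 converts one element in sixteen; how the halftone pass in the PR combines with the stride is not shown here.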

Shard 4/5: 4.1 MB. All 5 Scout shards now committed (~43 MB total).

https://claude.ai/code/session_01Y69Vnw751w75iVSBRws7o7
@AdaWorldAPI merged commit 66e4b60 into master Mar 30, 2026
4 of 10 checks passed

2 participants