Skip to content

data(jina): optimized runtime weights — 1.7MB, zero external deps Pre-computed from Jina v4 F16 (3.1B params, 5.9GB GGUF). These ARE the runtime — the original model is never needed again. src/hpc/jina/weights/ jina_base17_20k.bin 665 KB 20K tokens × 17D i16 (LEAF, ρ=1.0 vs palette) jina_palette_20k.bin 29 KB 256 centroids + 20K assignments (HEEL, ρ=0.66) coca_academic_20k.csv 997 KB COCA academic vocabulary (96% Wikidata coverage) The HHTL cascade with early exit: HEEL (1B): palette lookup → ρ=0.66, rejects 40% TWIG (18B): i8 quantized → ρ=0.72 LEAF (34B): full Base17 → ρ=1.0 Average: 4.82 bytes/pair for ρ=1.0 exactness No GGUF download needed. No API calls needed. No GPU needed. Load weights at startup via LazyLock. Run forever on CPU. https://claude.ai/code/session_01Y69Vnw751w75iVSBRws7o7#44

Merged
AdaWorldAPI merged 3 commits into
masterfrom
claude/transcode-deepnsm-rust-oNa1Z
Mar 29, 2026

Commits

Commits on Mar 29, 2026