From 72e9f28621eb3f2732007b490c239e6289f2fb75 Mon Sep 17 00:00:00 2001 From: Michal Harakal Date: Mon, 8 Jun 2026 15:26:54 +0200 Subject: [PATCH] docs: refresh eager-kernels mindmap after Panama Q6_K (#720) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit The hand-authored mindmap drifted: Q6_K Panama is now a first-class SPI kernel (PanamaVectorQ6_KMatmulKernel, #720), not the legacy "🚧 legacy SIMD path". Flip Q6_K Panama to ✅ in the mindmap + table, drop the now-done "Panama SPI Q6_K" gap entry, and point the linux-native-SIMD gap at its tracker (#722). The auto-generated kernel-support-matrix.md already reflected this (it's CI-gated); this fixes the hand-authored companion. Co-Authored-By: Claude Opus 4.8 --- docs/eager-execution-backends-and-kernels.md | 10 ++++------ 1 file changed, 4 insertions(+), 6 deletions(-) diff --git a/docs/eager-execution-backends-and-kernels.md b/docs/eager-execution-backends-and-kernels.md index 8f2aa33a..5133cbd1 100644 --- a/docs/eager-execution-backends-and-kernels.md +++ b/docs/eager-execution-backends-and-kernels.md @@ -26,7 +26,7 @@ mindmap FP32 BF16 Q8_0 Q4_0 ✅ Q4_K ✅ Q5_1 Q5_0 ✅ new - Q6_K 🚧 legacy SIMD path + Q6_K ✅ new Native FFM ✅ JVM only — C kernels via CMake FP32 BF16 Q8_0 Q4_0 Q4_K ✅ @@ -44,8 +44,7 @@ mindmap JS and WASM ✅ scalar only Gaps and roadmap Native FFM Q5 and Q6_K ❌ issue 708 - Native SIMD on linux ❌ - Panama SPI Q6_K kernel 🚧 + Native SIMD on linux ❌ issue 722 Q5_K Q2_K Q3_K IQ4 packed ❌ dequant only GPU backends IREE Metal ❌ future ``` @@ -59,7 +58,7 @@ mindmap | Q8_0 | ✅ | ✅ | ✅ | | Q4_0 | ✅ | ✅ | ✅ | | Q4_K | ✅ | ✅ | ✅ | -| Q6_K | ✅ | 🚧 legacy `JvmQuantizedVectorKernels` (no SPI kernel) | ❌ | +| Q6_K | ✅ | ✅ | ❌ | | Q5_1 | ✅ | ✅ | ❌ | | Q5_0 | ✅ | ✅ | ❌ | | Q5_K / Q2_K / Q3_K / Q8_K / IQ4 | ❌ (dequant-to-FP32 only) | ❌ | ❌ | @@ -82,9 +81,8 @@ those formats were JVM-only and broke on Native. ## In progress / missing (with trackers) -- 🚧 **Q6_K Panama SPI kernel** — Q6_K is SIMD on JVM via the legacy `JvmQuantizedVectorKernels.matmulQ6_KVec`, but has no `PanamaVectorQ6KMatmulKernel`/`KernelProvider.matmulQ6K()` SPI entry yet. - ❌ **Native FFM Q5_1/Q5_0/Q6_K** — the C kernel set covers FP32/BF16/Q8_0/Q4_0/Q4_K only. Tracked by **SKaiNET#708** (core kernel) and **SKaiNET-transformers#170** (converter wiring). -- ❌ **Native SIMD on linux** — Kotlin/Native linux targets run the scalar floor; no cinterop/OpenBLAS or SIMD path. (Apple has Accelerate for dense ops.) +- ❌ **Native SIMD on linux** — Kotlin/Native linux targets run the scalar floor; no cinterop/OpenBLAS or SIMD path (Apple has Accelerate for dense ops). Tracked by **SKaiNET#722**. - ❌ **Other GGML quant formats** (Q5_K, Q2_K, Q3_K, Q8_K, IQ4_NL/XS) — loadable via dequant-to-FP32, but no packed matmul kernel. - ❌ **Non-CPU eager backends** (IREE, Metal, GPU) — the `KernelProvider` SPI anticipates them, but none are implemented for the eager path today.