Extract transformer-core: NN primitives reusable on all targets (incl. androidNative) by michalharakal · Pull Request #185 · SKaiNET-developers/SKaiNET-transformers

michalharakal · 2026-06-17T15:00:17Z

Closes #183.

Extracts llm-core's lang-core-only NN primitives (KV-cache family, MultiHeadAttention, Embedding,
RMSNormalization, RoPE, SwiGLU/GeGLU FFN, ResidualAdd, LinearProjection, TransformerDsl) into a
new transformer-core module that depends only on skainet-lang-core and declares the full target
matrix including androidNativeArm32/androidNativeArm64. llm-core api-depends on it (re-exports),
so existing consumers are unaffected; ARM-native consumers (e.g. skainet-whisper-kmp) can reuse the
primitives instead of reimplementing.

Why

The primitives only need lang-core (which has androidNative), but were trapped in llm-core, whose other
deps (io-gguf/io-core/compile-*/backend-cpu) lack androidNative. They're dtype-agnostic (just call
ops.*), so this target generalization is orthogonal to the quant/dtype generalization (#178) and meets
it cleanly at these primitives. See transformer-core/README.md.

What stayed / decoupled

dsl/decoder/* stays in llm-core (DecoderTransformerNetwork needs apps.llm.HybridTransformerBlock,
which is compile-opt-coupled).
MultiHeadAttention's diagnostic dumpStats back-reference → a settable mhaStatSink (default no-op)
that HybridTransformerBlock wires to llm-core's platform dumpStats — no behaviour lost.

Verified

:transformer-core: compiles for jvm + androidNativeArm32 + androidNativeArm64.
:llm-core:jvmTest green (5/5) via the re-export.
Branch is off release/0.31.0; merge-base with develop is the fork point → clean, no conflicts with
Eager NATIVE_OPTIMIZED: keep Q8_0 matmul weights packed (pre-transpose marker) so gemma fits + runs fast on the SL2610 #178's merged quant work (which is in the model/engine layers, not these primitives).

Follow-up (noted in the README)

The pre-transpose marker (#178 "Solution C") will land in LinearProjection.kt, now here; and
RowDequantSource + packing (today in sk.ainet.models.gemma) are the next hoist candidates — tracked in #184.

… androidNative llm-core's transformer primitives (KV-cache family, MultiHeadAttention, Embedding, RMSNormalization, RoPE, SwiGLU/GeGLU FFN, ResidualAdd, LinearProjection, …) only need skainet-lang-core (which has androidNative), but were trapped in llm-core, whose other deps (io-gguf/io-core/compile-*/backend-cpu) have no androidNative — so ARM-native consumers (the Amlogic box) couldn't reuse them and had to reimplement. Move the 15 lang-core-only NN files (transformer/, layers/, normalization/, dsl/TransformerDsl.kt) into a new transformer-core module that depends ONLY on skainet-lang-core and declares the full matrix INCLUDING androidNativeArm32/Arm64. llm-core api-depends on transformer-core (re-exports), so existing consumers are unaffected. dsl/decoder/* stays in llm-core (DecoderTransformerNetwork needs apps.llm.HybridTransformerBlock, which is compile-opt-coupled). Decoupled the one back-reference: MultiHeadAttention's diagnostic dumpStats call now goes through a settable `mhaStatSink` (default no-op) that HybridTransformerBlock wires to llm-core's platform dumpStats — no functionality lost. Verified: transformer-core compiles for jvm + androidNativeArm32 + arm64; llm-core builds + jvmTest green (5/5). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…onflict assessment) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

michalharakal and others added 2 commits June 17, 2026 11:32

docs: transformer-core README + landing notes (rebase onto develop, c…

5baae89

…onflict assessment) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

michalharakal merged commit 6c548d7 into develop Jun 17, 2026
2 checks passed

michalharakal deleted the feature/transformer-core branch June 17, 2026 16:54

michalharakal mentioned this pull request Jun 17, 2026

Release 0.31.1 — transformer-core + publish guardrail #186

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract transformer-core: NN primitives reusable on all targets (incl. androidNative)#185

Extract transformer-core: NN primitives reusable on all targets (incl. androidNative)#185
michalharakal merged 2 commits into
developfrom
feature/transformer-core

michalharakal commented Jun 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

michalharakal commented Jun 17, 2026

Why

What stayed / decoupled

Verified

Follow-up (noted in the README)

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant