
feat(constants)!: switch URLs to v0.9.0 layout + add MODEL_REGISTRY#1148

Open
msluszniak wants to merge 7 commits into main from @ms/model-registry

Conversation


@msluszniak msluszniak commented May 13, 2026

Description

Switches every URL constant in the library to the restructured HF layout
under resolve/v0.9.0 and adds the typed MODEL_REGISTRY accessor.

URL refresh

All file names now follow <model>_<size>_<backend>_<precision>.pte, and
files sit under per-size and per-backend directories on HF:

  • modelUrls.ts — 170 URL refs rewritten. The lfm2_5_350m_xnnpack_8w4da.pte typo is corrected to _8da4w.pte.
  • ocr/models.ts — CRAFT detector URL + CRNN per-language URL switch to the new <lang>/xnnpack/crnn_<lang>_xnnpack_fp32.pte shape.
  • tts/models.ts — Kokoro consts re-rooted to <size>/xnnpack/kokoro_<size>_<component>_xnnpack_fp32.pte.
  • tts/voices.ts — voices/ and phonemizer/ asset paths kept in place; only the tag value bumps.
  • versions.ts — VERSION_TAG → resolve/v0.9.0. NEXT_VERSION_TAG removed. PREVIOUS_VERSION_TAG = resolve/v0.8.0 retained for the @deprecated Llama QLoRA aliases.

MODEL_REGISTRY

New constants/modelRegistry.ts exports a typed accessor grouped by
capability (LLM / VLM / CLASSIFICATION / OBJECT_DETECTION /
SEMANTIC_SEGMENTATION / INSTANCE_SEGMENTATION / STYLE_TRANSFER /
SPEECH_TO_TEXT / TEXT_EMBEDDING / IMAGE_EMBEDDING / IMAGE_GENERATION /
VAD). Each entry is callable with { quant, backend }:

MODEL_REGISTRY.LLM.LLAMA3_2_3B                   // default (quantized)
MODEL_REGISTRY.LLM.LLAMA3_2_3B({ quant: false })  // base

Object access returns the default config; calling resolves the
requested variant. backend is accepted in the type signature for
forward-compat — the library still picks via Platform.OS at module
load. Per-backend selection lands in a follow-up.
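A value-plus-function accessor of this kind can be built with `Object.assign`. The sketch below is illustrative only — `makeAccessor`, the inline variant data, and the URLs are assumptions, not the library's internals:

```typescript
// Illustrative only: makeAccessor and the inline variant data are assumptions.
type ModelConfig = { modelName: string; modelSource: string };
type ModelOpts = { quant?: boolean; backend?: string };

function makeAccessor(
  resolve: (opts?: ModelOpts) => ModelConfig
): ModelConfig & ((opts?: ModelOpts) => ModelConfig) {
  // Copy the default config's properties onto the function itself, so plain
  // property access (accessor.modelSource) works without calling it.
  return Object.assign(resolve, resolve());
}

const LLAMA3_2_3B = makeAccessor((opts) =>
  opts?.quant === false
    ? { modelName: 'llama3_2_3b', modelSource: 'https://hf.example/llama3_2_3b.pte' }
    : {
        modelName: 'llama3_2_3b_spinquant',
        modelSource: 'https://hf.example/llama3_2_3b_spinquant.pte',
      }
);

LLAMA3_2_3B.modelName;                   // 'llama3_2_3b_spinquant' (default)
LLAMA3_2_3B({ quant: false }).modelName; // 'llama3_2_3b'
```

Assigning the default config onto the resolver function is what makes both access styles share one export.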

Deprecations

  • LLAMA3_2_3B_QLORA, LLAMA3_2_1B_QLORA — @deprecated; the .pte
    files stay at v0.8.0, and the constants still resolve those URLs.
    Use LLAMA3_2_*_SPINQUANT going forward.

Introduces a breaking change?

  • Yes

URL paths under ${VERSION_TAG} change. Code that reads the constants at
runtime picks up the new URLs automatically (the constant values
themselves are updated); only code that hardcoded copied
resolve/v0.8.0 URLs needs changes. Historical tags continue to resolve
old paths, so apps pinned to a previous library version are unaffected.

The flat MODEL_REGISTRY = { ALL_MODELS: {...} } export in
modelUrls.ts is removed; the new MODEL_REGISTRY from
constants/modelRegistry.ts is the replacement. The internal
URL→name lookup (getModelNameForUrl) is preserved.
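The preserved lookup reduces to a scan over a private model list. This is a sketch under assumed data shapes — the real `_ALL_MODELS` entries and the exact `getModelNameForUrl` signature may differ:

```typescript
// Sketch only: the entry shape and URL here are illustrative, not real data.
const _ALL_MODELS = [
  {
    modelName: 'llama3_2_3b_spinquant',
    modelSource: 'https://hf.example/resolve/v0.9.0/llama3_2_3b_spinquant.pte',
  },
] as const;

// Reverse lookup: find the model name a given URL belongs to.
function getModelNameForUrl(url: string): string | undefined {
  return _ALL_MODELS.find((m) => m.modelSource === url)?.modelName;
}
```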

Type of change

  • New feature (change which adds functionality)
  • Other (chores, tests, code style improvements etc.)

Tested on

  • iOS
  • Android

yarn typecheck clean across the monorepo. Runtime behaviour validated
against the migrated HF state (every URL resolves at v0.9.0).

Testing instructions

yarn typecheck
# Or in the package:
cd packages/react-native-executorch && yarn typecheck

In application code:

import { MODEL_REGISTRY } from 'react-native-executorch';
console.log(MODEL_REGISTRY.LLM.LLAMA3_2_3B.modelSource);
console.log(MODEL_REGISTRY.LLM.LLAMA3_2_3B({ quant: true }).modelSource);

Related issues

#431
#612

Checklist

  • I have performed a self-review of my code
  • I have commented my code, particularly in hard-to-understand areas
  • I have updated the documentation accordingly
  • My changes generate no new warnings

@msluszniak msluszniak assigned msluszniak and unassigned msluszniak May 13, 2026
@msluszniak msluszniak added feature PRs that implement a new feature labels May 13, 2026
@msluszniak msluszniak marked this pull request as ready for review May 13, 2026 15:16
@barhanc barhanc self-requested a review May 13, 2026 15:22
URL refresh
-----------
Every URL constant in the library now points at the restructured HF
layout under `resolve/v0.9.0`. File names follow
`<model>_<size>_<backend>_<precision>.pte`, files sit under per-size
and per-backend directories. Affects:

- modelUrls.ts: 170 URL refs rewritten to new paths. The 8da4w-typo
  file `lfm2_5_350m_xnnpack_8w4da.pte` is corrected to `..._8da4w.pte`.
- ocr/models.ts: CRAFT detector URL + CRNN per-language URL template
  switch to the new `<lang>/xnnpack/crnn_<lang>_xnnpack_fp32.pte` shape.
- tts/models.ts: Kokoro consts re-rooted to
  `<size>/xnnpack/kokoro_<size>_<component>_xnnpack_fp32.pte`.
- tts/voices.ts: voices/ and phonemizer/ asset paths kept in place;
  only the `${VERSION_TAG}` value bumps.
- versions.ts: VERSION_TAG -> resolve/v0.9.0. NEXT_VERSION_TAG
  collapsed into VERSION_TAG. PREVIOUS_VERSION_TAG=resolve/v0.8.0
  retained for the two @deprecated Llama QLoRA aliases (LLAMA3_2_*_QLORA)
  that continue to resolve their v0.8.0 file. SpinQuant is the canonical
  quantized Llama 3.2 variant going forward.
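The described versions.ts shape reduces to roughly the following — a sketch derived from the bullet above; only the two constant names mentioned there are assumed to exist:

```typescript
// Sketch of the versions.ts constants described above.
export const VERSION_TAG = 'resolve/v0.9.0';

/** Retained only for the @deprecated LLAMA3_2_*_QLORA aliases. */
export const PREVIOUS_VERSION_TAG = 'resolve/v0.8.0';
```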

MODEL_REGISTRY
--------------
Adds `constants/modelRegistry.ts` — a typed accessor grouped by
capability (LLM, VLM, CLASSIFICATION, OBJECT_DETECTION,
SEMANTIC_SEGMENTATION, INSTANCE_SEGMENTATION, STYLE_TRANSFER,
SPEECH_TO_TEXT, TEXT_EMBEDDING, IMAGE_EMBEDDING, IMAGE_GENERATION,
VAD). Each entry is callable with `{ quant, backend }`:

  MODEL_REGISTRY.LLM.LLAMA3_2_3B                  // default (base)
  MODEL_REGISTRY.LLM.LLAMA3_2_3B({ quant: true }) // SpinQuant

When read as a value (object access), returns the default config; when
called, resolves the requested variant. `backend` is accepted in the
signature for forward-compat but the library still picks via
`Platform.OS` at module load.

The previous flat `MODEL_REGISTRY = { ALL_MODELS: {...} }` export in
modelUrls.ts is removed; its internal-only consumer (the urlToModelName
lookup) now reads from a private `_ALL_MODELS` array.

Resolves the JS-API side of the HF naming convention migration.
The umbrella lfm-2.5 HF repo hosts two distinct models — the text LLM
(1.2B + 350M) and the vision-language model (1.6B + 450M). The
migrator collapsed the VL size tokens (`vl_1_6b`, `vl_450m`) to bare
numeric sizes, making VL 1.6B indistinguishable from a hypothetical
text 1.6B variant. It also left the four per-variant tokenizers at
their legacy `lfm2.5-*/` paths instead of moving them next to the new
backend dirs.

HF state (separate commits on the repo):
  - VL .pte files renamed to `vl_<size>/xnnpack/lfm_2_5_vl_<size>_*.pte`
  - tokenizers moved into `<size>/` and `vl_<size>/` next to each cell
  - legacy `lfm2.5-*-instruct/` and `lfm2.5-VL-*/` dirs cleaned out
  - config.json files refreshed (vl_* configs now carry
    `model: lfm_2_5_vl` + `capabilities: [vision, text-generation]`)

This commit refreshes the matching URL constants in modelUrls.ts so
every LFM2.5 model points at its new HF path.
Covers the new grouped MODEL_REGISTRY shape (capability groups with
callable accessors), the `{ quant, backend }` options, default vs
quantized resolution, the still-supported direct-import pattern, and a
short migration note from the previous flat `ALL_MODELS` dict.
22 files updated across apps/llm, apps/computer-vision, apps/speech,
apps/text-embeddings, and apps/bare-rn. Each flat model-constant import
is replaced with the corresponding `MODEL_REGISTRY.<GROUP>.<NAME>` (or
`(...)({ quant: true })` for quantized variants). Llama QLoRA aliases
remain imported under their flat names — they're deprecated and not
part of the registry.

Net effect: -242 / +158 lines (collapsed imports, terser callsites).
Apps now serve as the canonical usage example for the typed registry.
…ctions

useState auto-invokes function-typed initial values as lazy initializers, so
passing a MODEL_REGISTRY accessor unwraps it into a plain config — breaking
reference equality against the accessor stored in MODELS. Compare by modelName
(falling back to === for picker users without one, e.g. VoiceConfig).
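The pitfall can be reproduced without React. Below, `useStateLike` is a mock of React's lazy-initializer rule, not React's API — it shows why storing a callable accessor unwraps it:

```typescript
// Mock of React's rule: function-typed initial values are invoked lazily.
function useStateLike<T>(initial: T | (() => T)): T {
  return typeof initial === 'function' ? (initial as () => T)() : initial;
}

type Config = { modelName: string };

// A callable accessor like the registry's: value and function at once.
const accessor: Config & (() => Config) = Object.assign(
  () => ({ modelName: 'llama3_2_3b' }),
  { modelName: 'llama3_2_3b' }
);

const stored = useStateLike(accessor);
// stored is the *unwrapped* plain config, so reference equality breaks:
stored === accessor;                     // false
// ...while comparing by modelName still works:
stored.modelName === accessor.modelName; // true
```

In real React code, wrapping in an extra thunk (`useState(() => accessor)`) would also keep the accessor intact; the PR instead compares by modelName.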
Each accessor's `backend` parameter is now typed to exactly the backends the
model ships with — passing an unsupported one is a compile-time error.
`Platform.OS` still picks the default when `backend` is omitted. The per-
backend (quant × backend) variant matrix lives in modelRegistry.ts so
modelUrls.ts stays flat-per-model.
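Constraining each accessor to its shipped backends can be sketched with a per-model type parameter. `makeModel` and the backend lists here are illustrative assumptions, not the registry's actual helpers:

```typescript
type Backend = 'xnnpack' | 'coreml' | 'qnn';

// Each model is created with exactly the backends it ships with; B narrows
// the accepted `backend` option to that subset at compile time.
function makeModel<B extends Backend>(name: string, backends: readonly B[]) {
  return (opts?: { quant?: boolean; backend?: B }) => ({
    modelName: name,
    backend: opts?.backend ?? backends[0],
  });
}

const distiluse = makeModel('distiluse_base_multilingual_cased_v2', [
  'xnnpack',
  'coreml',
] as const);

distiluse({ backend: 'coreml' }).backend; // 'coreml'
// distiluse({ backend: 'qnn' });         // compile-time error: not a shipped backend
```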

Unifies DISTILUSE_BASE_MULTILINGUAL_CASED_V2 to one accessor with
xnnpack + coreml; the _8DA4W and _COREML named constants stay as
deprecated aliases.
…ariant

Bare accessors (and undefined `quant`) now resolve to the quantized
variant when one is published; pass `{ quant: false }` to opt out. Docs
and example apps are updated to match — dual pickers keep both rows by
making the FP32 entry the explicit opt-out.
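The new default rule can be expressed as a small resolver — a sketch with assumed names (`resolveVariant`, the `Variant` shape), not the registry's actual code:

```typescript
type Variant = { modelName: string; quantized: boolean };

// Bare access / undefined `quant` prefers a quantized variant when one is
// published; `{ quant: false }` is the explicit opt-out.
function resolveVariant(variants: Variant[], opts?: { quant?: boolean }): Variant {
  const wantQuant = opts?.quant ?? variants.some((v) => v.quantized);
  return variants.find((v) => v.quantized === wantQuant) ?? variants[0];
}

const llama: Variant[] = [
  { modelName: 'llama3_2_3b', quantized: false },
  { modelName: 'llama3_2_3b_spinquant', quantized: true },
];

resolveVariant(llama).modelName;                   // 'llama3_2_3b_spinquant'
resolveVariant(llama, { quant: false }).modelName; // 'llama3_2_3b'
```

For a model published only in FP32, `variants.some(...)` is false, so bare access falls through to the base variant without special-casing.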
@msluszniak msluszniak force-pushed the @ms/model-registry branch from 667d6b3 to fc5eeb0 Compare May 14, 2026 09:36
// Compare picker entries by modelName when both carry one; fall back to
// reference equality (e.g. VoiceConfig entries without a modelName).
function sameValue(a: { modelName?: string }, b: { modelName?: string }): boolean {
  const am = a?.modelName;
  const bm = b?.modelName;
  if (typeof am === 'string' && typeof bm === 'string') return am === bm;
  return a === b;
}

Contributor
This code is duplicated in several files. It should probably be factored out to some common utilities file. However, in general I would be in favour of not using the model accessor as both a value and a function.

Comment on lines +29 to +37
/**
 * An accessor that behaves as the platform-default config when read as a value
 * (e.g. `MODEL_REGISTRY.LLM.LLAMA3_2_3B.modelName`) and as a function when
 * called (e.g. `MODEL_REGISTRY.LLM.LLAMA3_2_3B({ quant: false })`).
 */
type Accessor<
  C extends { modelName: string },
  B extends Backend = Backend,
> = C & ((opts?: ModelOpts<B>) => C);
@barhanc barhanc May 14, 2026

I don't think we should make the accessor behave like both a value and a function and just make the user explicitly call e.g. MODEL_REGISTRY.LLM.LLAMA3_2_3B() for default config (perhaps with some stylistic changes like changing the names to lowercase to indicate these are getters and not constants; the use pattern in the user code would be something like const LLAMA3_2_3B = models.llm.llama3_2_3b()). I feel the current approach might generate some problems e.g. when comparing models as in example apps using sameValue workaround.
