Model Config: DB-driven validator, seed sarvamai/elevenlabs/google by vprashrex · Pull Request #859 · ProjectTech4DevAI/kaapi-backend

vprashrex · 2026-05-18T21:10:19Z

Target issue is: #693

Summary

Previously, config/version creation accepted any completion.params.model without validating it against model_config, allowing unsupported or misspelled models to fail only at inference time. STT/TTS providers and models for Google, Sarvamai, and ElevenLabs were also missing from model_config.

This update adds model validation during config/version creation, seeds missing STT/TTS provider configs, and centralizes provider/model validation within the CRUD layer.

Checklist

Before submitting a pull request, please ensure that you mark these task.

Ran fastapi run --reload app/main.py or docker compose up in the repository root and test.
If you've fixed a bug or added code that is tested and has test cases.

Notes

Please add here if any other information is required for the reviewer.

Summary by CodeRabbit

New Features
- Added support for Sarvamai and ElevenLabs as STT and TTS providers.
- Seeded new provider model configurations into the database.
Improvements
- Stronger validation of model selection and TTS voice parameters to prevent invalid configs.
Performance
- Added database indexes to speed up pending job monitoring and related queries.

…ons for STT/TTS providers

coderabbitai · 2026-05-18T21:10:25Z

Warning

Rate limit exceeded

@vprashrex has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 14 minutes and 44 seconds before requesting another review.

You’ve run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: d4eeeeec-8113-4606-99e4-db5082d0bcb1

📥 Commits

Reviewing files that changed from the base of the PR and between 45e6b6a and e00df2e.

📒 Files selected for processing (4)

backend/app/alembic/versions/063_seed_stt_tts_model_configs.py
backend/app/crud/model_config.py
backend/app/models/model_config.py
backend/app/tests/crud/test_model_config.py

📝 Walkthrough

Walkthrough

Adds migrations and seed data, expands ModelConfig provider values, removes static model/voice whitelists, implements modality-aware DB-backed model/voice validation, integrates validation into config CRUD and job execution, and updates tests to exercise the new validation.

Changes

Model Provider Expansion and Database-Driven Validation

Layer / File(s)	Summary
Migrations, seeds, and provider schema `backend/app/alembic/versions/062_add_pending_job_monitoring_indexes.py`, `backend/app/alembic/versions/063_seed_stt_tts_model_configs.py`, `backend/app/models/model_config.py`	Adds migration 062 (concurrent indexes), migration 063 (seed STT/TTS models), and expands `ModelConfigBase.provider` to include `sarvamai` and `elevenlabs`.
ModelConfig CRUD: types, modality filters, and helpers `backend/app/crud/model_config.py`	Adds `Provider`/`CompletionType` types, `_normalize_provider`, `_modality_filter`, and `list_supported_models`/`is_model_supported`; widens signatures to accept `Provider`.
Blob model validation implementation `backend/app/crud/model_config.py`	`validate_blob_model_or_raise()` requires `completion.params.model` for non-`*-native` providers, verifies model exists and supports the requested completion type, and validates TTS `voice` against model-config voice options; raises HTTP 400 on failures.
Config CRUD integration `backend/app/crud/config/config.py`, `backend/app/crud/config/version.py`	Calls `validate_blob_model_or_raise` during `create_or_raise` flows to validate config blobs before persisting configs and versions.
Remove static model and voice whitelists `backend/app/models/llm/constants.py`, `backend/app/models/llm/request.py`	Removes `SUPPORTED_MODELS` and `SUPPORTED_VOICES`; simplifies `KaapiCompletionConfig.validate_params` to skip whitelist checks and auto-default provider for `stt`/`tts`.
Job execution integration `backend/app/services/llm/jobs.py`	Validates inline/stored config blobs before executing jobs; converts validation failures to structured errors returned from job execution paths.
Unit tests and test updates `backend/app/tests/crud/test_model_config.py`, `backend/app/tests/api/routes/configs/test_version.py`, `backend/app/tests/models/llm/test_request.py`	Adds comprehensive tests for `validate_blob_model_or_raise`, updates version creation tests to assert type-change constraints, and removes obsolete whitelist-based tests.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

ProjectTech4DevAI/kaapi-backend#846: Related pending-job monitoring indexing changes in Alembic.
ProjectTech4DevAI/kaapi-backend#826: Also modifies job execution paths in services/llm/jobs.py.

Suggested reviewers

Prajna1999
AkhileshNegi
Ayush8923

🐰 Four providers join the database cheer,
Validation hops in, precise and clear,
Whitelists shelved, checks now from the store,
Models and voices sing: accept or no more,
Hooray — configs validated, jobs run without fear!

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 62.79% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Model Config: DB-driven validator, seed sarvamai/elevenlabs/google' directly and accurately summarizes the main changes: it introduces a database-driven validator for model configs and seeds configurations for three providers (sarvamai, elevenlabs, google).
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch chore/model-config-evals

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

…ai, and ElevenLabs

…der_success

codecov · 2026-05-19T11:37:27Z

Codecov Report

❌ Patch coverage is 95.60440% with 8 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
backend/app/services/llm/jobs.py	70.58%	5 Missing ⚠️
backend/app/crud/model_config.py	94.59%	2 Missing ⚠️
backend/app/tests/crud/test_model_config.py	98.75%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

…test_request.py

coderabbitai

Actionable comments posted: 3

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

backend/app/models/llm/request.py (1)
265-282: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add a return type annotation to validate_params.

The method at line 265 is missing a return type hint. Since it returns self (a KaapiCompletionConfig instance), add the return type annotation.
Proposed fix
-    def validate_params(self):
+    def validate_params(self) -> "KaapiCompletionConfig":
As per coding guidelines: **/*.py requires type hints on all function parameters and return values; Use Python 3.11+ with type hints throughout the codebase.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@backend/app/models/llm/request.py` around lines 265 - 282, The method
validate_params currently lacks a return type annotation; update its signature
to annotate the return as the containing class (KaapiCompletionConfig) — i.e.,
change def validate_params(self) to def validate_params(self) ->
"KaapiCompletionConfig" (use a forward reference string if the class is defined
later or add from __future__ import annotations) so that callers know it returns
self; keep the implementation unchanged and ensure imports/annotations comply
with Python 3.11 typing rules.

🧹 Nitpick comments (4)

backend/app/tests/crud/test_model_config.py (1)
214-214: ⚡ Quick win

Remove unused monkeypatch fixture parameters in these tests.

Both functions currently trigger Ruff ARG001 and can be simplified safely.
♻️ Proposed fix
-def test_validate_blob_none_provider_skips(monkeypatch: pytest.MonkeyPatch) -> None:
+def test_validate_blob_none_provider_skips() -> None:
@@
-def test_validate_blob_missing_model_raises(monkeypatch: pytest.MonkeyPatch) -> None:
+def test_validate_blob_missing_model_raises() -> None:
Also applies to: 220-220
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@backend/app/tests/crud/test_model_config.py` at line 214, The two tests
test_validate_blob_none_provider_skips and the other test at the nearby block
currently accept an unused monkeypatch fixture causing Ruff ARG001; remove the
unused monkeypatch parameter from their function signatures (e.g., change def
test_validate_blob_none_provider_skips(monkeypatch: pytest.MonkeyPatch) -> None:
to def test_validate_blob_none_provider_skips() -> None:) and run tests to
ensure no remaining references to monkeypatch exist.
backend/app/alembic/versions/063_seed_stt_tts_model_configs.py (1)
32-67: ⚡ Quick win

Add return type hints for upgrade and downgrade.

Please annotate both functions with -> None for consistency with the project typing rule.

As per coding guidelines: "**/*.py: Always add type hints to all function parameters and return values in Python code".
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@backend/app/alembic/versions/063_seed_stt_tts_model_configs.py` around lines
32 - 67, The upgrade and downgrade functions lack return type annotations; add
"-> None" to both function definitions (upgrade and downgrade) so their
signatures read with a None return type, keeping the bodies unchanged and
conforming to the project's typing rule for Python functions.
backend/app/alembic/versions/062_add_pending_job_monitoring_indexes.py (1)
35-56: ⚡ Quick win

Add explicit return type hints to migration functions.

upgrade/downgrade should be annotated as -> None to match repo typing standards.

As per coding guidelines: "**/*.py: Always add type hints to all function parameters and return values in Python code".
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@backend/app/alembic/versions/062_add_pending_job_monitoring_indexes.py`
around lines 35 - 56, Annotate the migration entry points with explicit return
type hints: change the function signatures of upgrade and downgrade to include
-> None (i.e., def upgrade() -> None: and def downgrade() -> None:) so they
match the repository typing standards and satisfy the rule to add return type
hints for all functions like the ones that call
op.get_context().autocommit_block() and op.create_index/op.drop_index.
backend/app/crud/model_config.py (1)
66-87: ⚡ Quick win

Type-annotate _modality_filter signature.

stmt and return type are currently implicit. Add explicit types (e.g., Select-based typing) to satisfy repo-wide typing requirements.

As per coding guidelines: "**/*.py: Always add type hints to all function parameters and return values in Python code".
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@backend/app/crud/model_config.py` around lines 66 - 87, Update the
_modality_filter signature to add explicit SQLAlchemy Select typing: change def
_modality_filter(stmt, completion_type: CompletionType): to def
_modality_filter(stmt: Select, completion_type: CompletionType) -> Select: and
add the appropriate import for Select (e.g. from sqlalchemy.sql import Select or
from sqlalchemy.sql.selectable import Select) at the top of the module; keep all
existing logic using ModelConfig, ARRAY, and sqltypes.String unchanged so the
function still returns the filtered Select object.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@backend/app/alembic/versions/063_seed_stt_tts_model_configs.py`:
- Around line 66-83: The current downgrade() uses op.execute(...) to
unconditionally DELETE rows matching provider/model_name which may remove
pre-existing data; either make downgrade() a no-op or modify both upgrade() and
downgrade(): in upgrade() (the INSERT ... ON CONFLICT ... DO NOTHING block) add
a deterministic marker (e.g., set a seeded_by or migration_tag column or INSERT
and capture RETURNING id into a temp table) so you can identify which rows this
revision actually created, and then change downgrade() to only DELETE rows that
match those inserted records (matching the marker or the captured ids) rather
than deleting by provider/model_name alone.

In `@backend/app/crud/model_config.py`:
- Around line 119-132: The docstring for the validation block is inconsistent
with the code: it says native configs must include completion.params.model but
the function returns early for raw_provider.endswith("-native") (see
blob.completion, raw_provider, completion_type) and thus skips that validation;
fix by making them consistent—either remove the early return and enforce the
model presence for all providers (delete the raw_provider.endswith("-native")
return and validate completion.params.model against model_config) or update the
docstring to explicitly state that native providers
(raw_provider.endswith("-native")) are exempt from the model requirement so
callers know native-provider behavior is allowed to omit
completion.params.model.

In `@backend/app/tests/crud/test_model_config.py`:
- Around line 166-168: Add explicit type hints for the helper functions:
annotate _make_blob parameters (e.g., provider: str, completion_type: str,
params: Mapping[str, Any]) and its return type (e.g., SimpleNamespace) and do
the same for the inner function boom (declare its parameters and return type).
Also rename boom's varargs to underscored names (e.g., _args, _kwargs) to
silence unused-arg lint warnings. Apply the same parameter/return annotation
updates to the other helper on lines ~201-203 as well.

---

Outside diff comments:
In `@backend/app/models/llm/request.py`:
- Around line 265-282: The method validate_params currently lacks a return type
annotation; update its signature to annotate the return as the containing class
(KaapiCompletionConfig) — i.e., change def validate_params(self) to def
validate_params(self) -> "KaapiCompletionConfig" (use a forward reference string
if the class is defined later or add from __future__ import annotations) so that
callers know it returns self; keep the implementation unchanged and ensure
imports/annotations comply with Python 3.11 typing rules.

---

Nitpick comments:
In `@backend/app/alembic/versions/062_add_pending_job_monitoring_indexes.py`:
- Around line 35-56: Annotate the migration entry points with explicit return
type hints: change the function signatures of upgrade and downgrade to include
-> None (i.e., def upgrade() -> None: and def downgrade() -> None:) so they
match the repository typing standards and satisfy the rule to add return type
hints for all functions like the ones that call
op.get_context().autocommit_block() and op.create_index/op.drop_index.

In `@backend/app/alembic/versions/063_seed_stt_tts_model_configs.py`:
- Around line 32-67: The upgrade and downgrade functions lack return type
annotations; add "-> None" to both function definitions (upgrade and downgrade)
so their signatures read with a None return type, keeping the bodies unchanged
and conforming to the project's typing rule for Python functions.

In `@backend/app/crud/model_config.py`:
- Around line 66-87: Update the _modality_filter signature to add explicit
SQLAlchemy Select typing: change def _modality_filter(stmt, completion_type:
CompletionType): to def _modality_filter(stmt: Select, completion_type:
CompletionType) -> Select: and add the appropriate import for Select (e.g. from
sqlalchemy.sql import Select or from sqlalchemy.sql.selectable import Select) at
the top of the module; keep all existing logic using ModelConfig, ARRAY, and
sqltypes.String unchanged so the function still returns the filtered Select
object.

In `@backend/app/tests/crud/test_model_config.py`:
- Line 214: The two tests test_validate_blob_none_provider_skips and the other
test at the nearby block currently accept an unused monkeypatch fixture causing
Ruff ARG001; remove the unused monkeypatch parameter from their function
signatures (e.g., change def test_validate_blob_none_provider_skips(monkeypatch:
pytest.MonkeyPatch) -> None: to def test_validate_blob_none_provider_skips() ->
None:) and run tests to ensure no remaining references to monkeypatch exist.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: adea4ba3-9ad8-449f-9bb1-55caa06b92ef

📥 Commits

Reviewing files that changed from the base of the PR and between 68369a8 and 4be5b8c.

📒 Files selected for processing (11)

backend/app/alembic/versions/062_add_pending_job_monitoring_indexes.py
backend/app/alembic/versions/063_seed_stt_tts_model_configs.py
backend/app/crud/config/config.py
backend/app/crud/config/version.py
backend/app/crud/model_config.py
backend/app/models/llm/constants.py
backend/app/models/llm/request.py
backend/app/models/model_config.py
backend/app/tests/api/routes/configs/test_version.py
backend/app/tests/crud/test_model_config.py
backend/app/tests/models/llm/test_request.py

💤 Files with no reviewable changes (3)

backend/app/tests/api/routes/configs/test_version.py
backend/app/tests/models/llm/test_request.py
backend/app/models/llm/constants.py

AkhileshNegi · 2026-05-21T07:50:06Z

+    return session.exec(stmt).first() is not None
+
+
+def validate_blob_model_or_raise(session: Session, blob: Any) -> None:


inline-blob path on /llm/call and /llm/chain no longer validates model/voice. Before this PR, model/voice checks lived in KaapiCompletionConfig.validate_params, so FastAPI ran them on every request body that contained a ConfigBlob — including ad-hoc inline blobs sent to /llm/call. This PR moves the check into validate_blob_model_or_raise (which needs a DB session) and only wires it into ConfigCrud.create_or_raise and ConfigVersionCrud.create_or_raise. The inline-blob branch in services/llm/jobs.py:525 (config_blob = config.blob) never calls it, so a client can now POST {"provider": "google", "type": "tts", "params": {"model": "gemini-99-ultra", "voice": "Nonexistent"}} and the request will be accepted, a job row created, and the failure surfaces deep in the worker instead of as a 4xx at request time

AkhileshNegi · 2026-05-21T07:59:00Z

    assert data["data"]["version"]["config_blob"]["completion"]["type"] == "text"
-
-
-def test_create_version_with_kaapi_stt_provider_success(


can you add these coverage testcases as well
- test_create_version_cannot_change_type_from_stt_to_tts — start with a google + gemini-2.5-pro STT config, try to bump it to google + gemini-2.5-flash-preview-tts TTS, expect 400 immutability error.
- test_create_version_cannot_change_type_from_tts_to_text — start with a google + gemini-2.5-flash-preview-tts TTS config, try to bump it to openai + gpt-4o text, expect 400 immutability error.
- test_create_version_with_kaapi_stt_provider_success — create a google + gemini-2.5-pro STT config, post a new version that just tweaks instructions/temperature, expect 201 with version == 2 and type == "stt".
- test_create_version_with_kaapi_tts_provider_success — create a google + gemini-2.5-flash-preview-tts (voice Kore) TTS config, post a new version switching to e.g. gemini-2.5-pro-preview-tts with a different seeded voice, expect 201 with version == 2 and type == "tts".

AkhileshNegi · 2026-05-21T08:07:15Z

    DEFAULT_TTS_VOICE,
-    SUPPORTED_MODELS,
-    SUPPORTED_VOICES,
 )


here we need to add check that STT/TTS is not supported for OpenAI and send appropriate error message

in model config there is already an column for input and output modalities which is set .. that has AUDIO, TEXT AND IMAGE ... from that TTS/STT capabilities can be checked

{
"success": false,
"data": null,
"error": "Provider 'openai' does not support completion type='tts'.",
"errors": null,
"metadata": null
}

…trictions

…utput costs

…ng database schema for STT/TTS support

…nd update type hints in test functions

…ity and structure

Refactor model configuration validation and add new model configurati…

9b40427

…ons for STT/TTS providers

vprashrex added 4 commits May 19, 2026 15:44

Remove redundant tests for changing config types in version creation

d7e7169

Merge branch 'main' into chore/model-config-evals

85d31ec

Add migration to seed STT/TTS model configurations for Google, Sarvam…

06f94ef

…ai, and ElevenLabs

Remove unnecessary blank lines in test_create_config_with_kaapi_provi…

3bc791a

…der_success

vprashrex self-assigned this May 20, 2026

vprashrex requested review from AkhileshNegi and Prajna1999 May 20, 2026 11:15

vprashrex added the bug Something isn't working label May 20, 2026

vprashrex linked an issue May 20, 2026 that may be closed by this pull request

Evaluation: Clear error message #693

Open

Remove commented-out code regarding model-allowlist enforcement from …

4be5b8c

…test_request.py

vprashrex changed the title ~~Model Config: Model Config: DB-driven validator, seed sarvamai/elevenlabs/google~~ Model Config: DB-driven validator, seed sarvamai/elevenlabs/google May 20, 2026

vprashrex added the ready-for-review label May 20, 2026

coderabbitai Bot reviewed May 20, 2026

View reviewed changes

Comment thread backend/app/alembic/versions/063_seed_stt_tts_model_configs.py

Comment thread backend/app/crud/model_config.py

Comment thread backend/app/tests/crud/test_model_config.py Outdated

Prajna1999 approved these changes May 21, 2026

View reviewed changes

AkhileshNegi requested changes May 21, 2026

View reviewed changes

vprashrex added 5 commits May 21, 2026 20:21

Refactor model validation logic and enhance tests for config type res…

45e6b6a

…trictions

Update STT/TTS model configurations with detailed pricing and input/o…

5fa40d2

…utput costs

Enhance model configuration by adding completion type enum and updati…

48d56ef

…ng database schema for STT/TTS support

Refactor downgrade function to remove obsolete model deletion logic a…

944efdd

…nd update type hints in test functions

Refactor model configuration and test functions for improved readabil…

e00df2e

…ity and structure

		return session.exec(stmt).first() is not None


		def validate_blob_model_or_raise(session: Session, blob: Any) -> None:

		assert data["data"]["version"]["config_blob"]["completion"]["type"] == "text"


		def test_create_version_with_kaapi_stt_provider_success(

Conversation

vprashrex commented May 18, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Checklist

Notes

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

❌ Failed checks (1 warning)

Uh oh!

codecov Bot commented May 19, 2026 • edited by sentry Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AkhileshNegi May 21, 2026

Choose a reason for hiding this comment

Uh oh!

AkhileshNegi May 21, 2026

Choose a reason for hiding this comment

Uh oh!

AkhileshNegi May 21, 2026

Choose a reason for hiding this comment

Uh oh!

vprashrex May 21, 2026

Choose a reason for hiding this comment

Uh oh!

vprashrex May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vprashrex commented May 18, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 18, 2026 •

edited

Loading

codecov Bot commented May 19, 2026 •

edited by sentry Bot

Loading