feat: Making feast vector store with open ai search api compatible by patelchaitany · Pull Request #6121 · feast-dev/feast

patelchaitany · 2026-03-17T11:38:02Z

What this PR does / why we need it:

This PR making the feast vector store api with open ai search api compatible so.

This are the changes are made.

New OpenAI-compatible endpoint

Added POST /v1/vector_stores/{vector_store_id}/search that matches the OpenAI vector store search API
vector_store_id just maps to a feature view name
Takes a query string, embeds it via LiteLLM, calls retrieve_online_documents_v2, and returns results in OpenAI's vector_store.search_results.page format
LiteLLM embedding config goes in feature_store.yaml under a new embedding_model section (model, api_key, api_base, api_version, dimensions

Metadata filtering

New filter_models.py with two Pydantic models: ComparisonFilter (eq, ne, gt, gte, lt, lte, in, nin) and CompoundFilter (and/or, nestable)
Threaded filters through the entire retrieval stack down to each online store
Each store translates filters into its native query language:
- Elasticsearch: Query DSL clauses (term, range, terms, bool)
- Milvus: boolean expressions like field == 'value'
- Postgres: parameterized SQL subqueries with entity_key IN (SELECT ...)
- SQLite: same approach as Postgres, SQLite syntax

Numeric storage fix

Without this change, all values are stored as text, so '9' > '100' is true in filters
New enable_openai_compatible_store config flag on every store backend
When enabled, adds a value_num column that stores int, float, double, and bool values natively alongside the existing `value_text

Bug fixes picked up along the way

SQLite BM25 search was reading raw value instead of value_text
SQLite's query param renamed to embedding since that's what it actually is
Added input escaping for Milvus query strings

Tests

160 lines of unit tests for filter models (valid/invalid operators, value types, nested compounds)
~320 lines of integration tests covering filtered vector search, filtered text search, OpenAI response shape, and error cases (no embedding config, nonexistent feature view, empty results)

Which issue(s) this PR fixes:

#5615

Misc

ntkathole · 2026-05-14T08:29:16Z

+                ``vector_store_id`` path parameter).
+            query: Natural language query string, or list of strings.
+            max_num_results: Maximum number of results to return.
+            filters: OpenAI-compatible filters (accepted but not yet


filters are accepted and applied as well

ntkathole · 2026-05-14T08:35:14Z

+    value: Union[str, int, float, bool, List[Union[str, int]]]
+
+
+class OpenAICompoundFilter(BaseModel):


I think these all are not needed now, it's duplicate of filter_models.py definitions

ntkathole · 2026-05-14T08:39:18Z

+        ...
+
+
+class LiteLLMEmbeddingProvider:


Suggested change

class LiteLLMEmbeddingProvider:

class LiteLLMEmbeddingProvider(EmbeddingProvider):

ntkathole · 2026-05-14T08:48:19Z

+            "/v1/vector_stores/{vector_store_id}/search"
+        ):
+            try:
+                result = await run_in_threadpool(


The entire call including the embedding network I/O is wrapped in run_in_threadpool, which blocks a thread waiting for the OpenAI/Ollama HTTP response. LiteLLMEmbeddingProvider.aembed() exists precisely to avoid this. Make retrieve_online_documents_openai async, call await self.embedding_provider.aembed(...) inside it, and then await run_in_threadpool only for the DB retrieval step. The endpoint can then await store.retrieve_online_documents_openai(...) directly without an outer run_in_threadpool.

Done. retrieve_online_documents_openai is now async with await aembed(...). The retrieve_online_documents_v2 call is still in run_in_threadpool since it has no async variant.

ntkathole · 2026-05-14T08:58:32Z

+                        query=request.query,
+                        max_num_results=request.max_num_results or 10,
+                        filters=(
+                            request.filters.model_dump() if request.filters else None


request.filters.model_dump() can be removed once line 146 uses ComparisonFilter/CompoundFilter directly

ntkathole · 2026-05-15T05:34:16Z

+---
+title: "Making Feast Speak OpenAI: Vector Search Without the Glue Code"
+description: "Feast now exposes an OpenAI-compatible vector store search endpoint. Send a plain text query, get results back in the standard OpenAI format. No client-side embeddings required."
+date: 2026-04-28


update date before merge

ntkathole · 2026-05-15T05:35:11Z

Also add documentation for OpenAI-compatible Vector search endpoint in docs/reference/alpha-vector-database.md and docs/reference/feature-servers/python-feature-server.md

Also, missing docs for how user can use different embedding provider other than litellm

ntkathole

looks good to me!

@franciscojavierarceo will you be able to take a look once?

patelchaitany · 2026-05-19T07:32:14Z

@ntkathole, The CI for the mcp-feature-server-runtime was failing because fastapi_mcp does not handle circular references properly. To fix this, I made changes in mcp_server.py where we inserted a custom schema resolver that handles how reference schemas are resolved. I also verified its logic against the original fastapi_mcp logic -- both produce the same output when there are no self-referencing schemas.

Signed-off-by: Chaitany patel <patelchaitany93@gmail.com>

…gistration fastapi_mcp 0.4.0 resolve_schema_references() has no cycle detection. Feast's OpenAPI schema contains self-referential protobuf types (Value -> Struct -> Value) which trigger a RecursionError. The error is silently caught, so the /mcp route never gets registered and CI gets a 404. Add _resolve_schema_references_safe() that tracks a seen-refs set to break circular chains, and monkey-patch it into fastapi_mcp before FastApiMCP processes the schema. Non-circular schemas produce identical output to the original. Signed-off-by: Chaitany patel <patelchaitany93@gmail.com>

ntkathole changed the title ~~feat: making feast vector store with open ai search api compatible~~ feat: Making feast vector store with open ai search api compatible Mar 17, 2026

patelchaitany force-pushed the enh/openai-compatibel-store-api branch 4 times, most recently from e45f167 to c8392a9 Compare March 23, 2026 11:17

patelchaitany changed the title ~~feat: Making feast vector store with open ai search api compatible~~ feat: Making feast vector store with open ai search api compatible Mar 23, 2026

patelchaitany force-pushed the enh/openai-compatibel-store-api branch 5 times, most recently from 7e8adfb to 3f541ad Compare March 24, 2026 11:29

patelchaitany marked this pull request as ready for review March 24, 2026 11:29

patelchaitany requested review from a team as code owners March 24, 2026 11:29

patelchaitany requested review from HaoXuAI, nquinn408 and redhatHameed and removed request for a team March 24, 2026 11:29