Flexible LLM providers + manage Ollama with 4CAT by dale-wahl · Pull Request #576 · digitalmethodsinitiative/4cat

dale-wahl · 2026-03-05T16:00:36Z

That's right, boys and girls, now you can spin up an Ollama container right beside your 4CAT containers. Lil admin UI action to pull and delete models on it (should work with other Ollama servers as well) as well as enable and disable models (should work with tags as well, but did not test that).

The gist: docker compose -f docker-compose.yml -f docker-compose_ollama.yml up -d

It's that simple! (Or almost that simple; you do need to un-comment some lines if you want it to use GPU, but a) works without GPU--albeit slowly--and b) doesn't crash for those GPU-less users.)

You're welcome.

Fixes #564, fixes #563

Copilot

Pull request overview

This PR adds Ollama LLM container support to 4CAT's Docker stack, along with an admin UI to manage LLM models (pull, delete, enable/disable). The LLM model refresh logic is extracted from the old refresh_items worker into a dedicated OllamaManager worker. A new llm.enabled_models configuration setting allows admins to control which available models are exposed to users.

Changes:

New OllamaManager backend worker for refreshing, pulling, and deleting Ollama models via the Ollama HTTP API
New /admin/llm/ admin panel (views_llm.py + llm-server.html) for managing LLM models, gated by both admin privileges and llm.access
New docker-compose_ollama.yml override for running Ollama as a Docker sidecar, with auto-configuration in docker_setup.py

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`backend/workers/ollama_manager.py`	New worker for Ollama model refresh/pull/delete operations
`backend/workers/refresh_items.py`	LLM refresh logic removed; worker now does nothing
`webtool/views/views_llm.py`	New Flask blueprint for the admin LLM management panel
`webtool/templates/controlpanel/llm-server.html`	New admin panel template for model listing and actions
`webtool/templates/controlpanel/layout.html`	Adds "LLM Server" nav link when llm.access is enabled
`webtool/__init__.py`	Registers the new `views_llm` blueprint
`processors/machine_learning/llm_prompter.py`	Filters available models by enabled list before showing to users
`common/lib/config_definition.py`	Adds `llm.enabled_models` config definition
`docker/docker_setup.py`	Auto-configures LLM settings when Ollama is detected on Docker network
`docker-compose_ollama.yml`	New Docker Compose override for the Ollama sidecar service
`docker/README.md`	Documents the Ollama Docker setup and usage

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-10T11:11:02Z

+        elif task == "delete":
+            success = self.delete_model(model_name)
+            if success:
+                self.refresh_models()


When a model is successfully deleted from the Ollama server, refresh_models() updates llm.available_models to remove the deleted model, but llm.enabled_models is never cleaned up. This means deleted models accumulate as stale entries in llm.enabled_models. While this doesn't cause an immediate runtime error (since llm_prompter.py intersects the two lists), it's misleading: after a delete-and-refresh cycle, the model would disappear from the available models table in the UI, but it remains in the enabled list. If the model is later re-pulled, it would reappear as already enabled, which could be surprising.

The delete_model() method (or the work() method after a successful delete) should remove the model from llm.enabled_models, or at minimum refresh_models() should reconcile llm.enabled_models to remove entries no longer present in llm.available_models.

Copilot · 2026-03-10T11:11:03Z

+### Configuring 4CAT to use Ollama
+
+1. Log in as admin and open **Control Panel → Settings**.
+2. Set the following LLM fields:
+
+   | Setting | Value |
+   |---|---|
+   | LLM Provider Type | `ollama` |
+   | LLM Server URL | `http://ollama:11434` |
+   | LLM Access | enabled |
+
+3. Save settings.
+4. Open **Control Panel → LLM Server** (visible once *LLM Access* is enabled).
+5. Use the **Refresh** button to load available models, then **Pull** a model
+   (e.g. `llama3.2:3b`) to download it from the Ollama library.
+6. Enable the models you want to make available to users.


The docker/README.md section "Configuring 4CAT to use Ollama" (steps 1–3) instructs users to manually set the LLM Provider Type, LLM Server URL, and LLM Access fields in the Control Panel Settings. However, docker/docker_setup.py now automatically detects the Ollama sidecar on first startup and configures these settings without user intervention. The README should mention this auto-configuration so users know they can skip steps 1–3 on a fresh install with the Ollama override.

dale-wahl · 2026-03-10T11:19:25Z

@copilot open a new pull request to apply changes based on the comments in this thread:

remove enabled models via refresh_models, if they have been deleted.
update docs to reflex docker auto setting those variables, but ensure it is also clear how to manually set them to for another Ollama server
keep the refresh_items worker for later use, but comment out the ensure job or have it prematurely return since it is at the moment doing nothing.

Copilot · 2026-03-10T11:19:32Z

@dale-wahl I've opened a new pull request, #581, to work on those changes. Once the pull request is ready, I'll request review from you.

…config docs (#581) * Initial plan * Fix stale enabled models, disable refresh_items scheduling, update README docs Co-authored-by: dale-wahl <32108944+dale-wahl@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: dale-wahl <32108944+dale-wahl@users.noreply.github.com>

# Conflicts: # backend/workers/refresh_items.py

stijn-uva · 2026-05-21T14:54:14Z

Add support for multiple LLM providers, via config setting llm.providers
LLM manager worker checks all providers and loads list of available models
Providers may or may not implement pull_model and delete_model methods, if implemented models can be added/removed via control panel
Third-party APIs (via llms.json) are also available via such a 'provider' (a special case)
LLM prompter processor simply lists all enabled models, no need to choose from local/api/etc
LLM classes now in common/lib/llm

Todo:

further testing (probably broke a few things refactoring this all)
llm.access config setting doesn't really work at the moment
Investigate if other providers can also get a pull/delete_model method (not really)
Migrate script
Ensure Docker Ollama is added as a provider on install when Dockering

stijn-uva · 2026-06-02T11:02:15Z

@dale-wahl this is now mostly ready to merge, I think - need to test the migrate script (which I plan to do after merge, when we can test it on our own 4CAT) and the Docker setup. Can you do the latter? I tried it but it pulled the stable image, which doesn't contain the relevant updates to the LLM code yet, so that would fail. Maybe that also needs to be tested after merging?

sal-uva · 2026-06-04T12:02:21Z

@stijn-uva running migrate from 1.53 results in the following error


Checking if llm.providers setting exists...
    ...does not exist, filling with currently configured proviers

  Unexpected error while running migrate-1.54-1.55.py. Migration halted.
  The following exception occurred:

Traceback (most recent call last):
  File "C:\Users\shagen\repos\4cat\helper-scripts\migrate\migrate-1.54-1.55.py", line 51, in <module>
    provider_type = {"ollama": "ollama"}.get(provider_type, "openai-like")
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: unhashable type: 'RealDictRow'

stijn-uva · 2026-06-08T13:56:44Z

@sal-uva should be fixed now, thanks!

…on main)

dale-wahl · 2026-06-09T15:13:15Z

startswith("api-") / provider type == "api" is doing a lot of work.

So far, it seems to mean:

models come from the static llms.json menu,
the user supplies the API key,
data leaves the instance to a commercial service. (for our comments/tooltips and such)

I am still reviewing, but I need to drop this someplace because it feels off. I think there are multiple axis here (e.g. the wrapper we use (Ollama, Anthropic, OpenAI etc.), where it is hosted, who's credentials are used).

Possibly related (?), third party (the static file llms) are not showing as available to actually select. I think it is semi related because they have the api- type, but the processor itself needs to know what looks like it was moved to provider_key. Also, I haven't tested, but a new build might fail as I think _id only appears from the migrate script.

Reminder for me: look at docker_setup.py; it did not detect Ollama or at least it does not appear as a provider, also might overwrite existing providers...

`provider_key` was not used; made a general `wrapper` key for the model.

…d "Status" with action buttons

…y one provider; do not accidently delete enabled models when connection fails

…errors printing on fail)

dale-wahl · 2026-06-10T10:49:10Z

+
+            # if we have a categorised set of options, look deeper to get
+            # valid option values
+            is_categorised = all([type(o) is dict for o in options.values()])


@stijn-uva this looks weird to me. maybe options = settings.get("options", {}) if it really is supposed to be a dict. list.values() is going to fail.

Also the if choice not in match_options uses the chain iterator to check for choice so you'll only be left with the remaining items in match_options.

dale-wahl

This one took me a bit. I do think some renaming would help as "providers" collides with OpenAI, Anthorpic, etc. and our "providers" is more connections to LLMs... services 😅 like Ollama vs LiteLLM etc. I also think renaming api to thirdparty may be beneficial.

Once I understood the providers/clients were connections, I think we could expand on them to help with the other "axis" I mentioned in my earlier comment. I added wrapper to fix the Third Party class and allow you to connect which LangChain wrapper to use. We could also add an egress key so you could denote which connections are external vs internal (e.g. do we warn if a user is sending data to UvA via LiteLLM or whatever setup others come up with). You could also add key_source and allow users to provide the key. That axis is perhaps less important, but we are conflating it now with the api- means thirdparty "provider". And I wouldn't mind adding my own keys to providers for my own instance. (Plus we could then have keys available to groups of users by making providers/connections available by tag...).

All that said, I tested out some configurations and the docker setup and think we are pretty good.

dale-wahl added 7 commits March 5, 2026 15:43

add ollama to docker-compose

6c27094

give me a proper worker who can do neat stuff.

8a8427c

ruff you mean

89824e2

add docker setup if ollama present

e7aa9af

a useful frontend setting panel

74e01b6

only show enabled models

baec03a

update docker readme so people can use ollama

36fe0ed

dale-wahl requested a review from Copilot March 10, 2026 11:03

Copilot started reviewing on behalf of dale-wahl March 10, 2026 11:04 View session

Copilot AI reviewed Mar 10, 2026

View reviewed changes

Copilot AI mentioned this pull request Mar 10, 2026

Cleanup: stale enabled models, refresh_items scheduling, README auto-config docs #581

Merged

Copilot AI and others added 13 commits March 10, 2026 12:56

Merge branch 'master' into ollama_management

58dd587

Merge branch 'master' into ollama_management

74f5600

ollama_manager: get additional info from ollama including capabilities

26f33f5

Merge branch 'master' into ollama_management

73b9536

ollama_manager: display names / ollama get your api together

f2501b9

Create OllamaClient to collect model info

c72d043

list capabilities in admin panel

a79657b

Merge branch 'master' into ollama_management

29f1433

ollama_manager: check for connection first, ollama_client: accept logger

43de49b

Merge branch 'master' into ollama_management

1e8cb36

# Conflicts: # backend/workers/refresh_items.py

Multi-form!

c8da75f

Merge branch 'master' into ollama_management

fda48b1

stijn-uva changed the title ~~Ollama setup via Docker PLUS manage your LLM models UI~~ Flexible LLM providers + manage Ollama with 4CAT May 21, 2026

Refactor everything

4c429df

Formatting

a6ecbc2

stijn-uva added 5 commits June 2, 2026 12:25

Add filename and line no to test error output

1a147e8

Fix init issues in LLM processors

fc91a44

Fix "add model" panel show/hide on LLM page

8c4b34a

the

46fee9f

Update Docker setup

ff0f215

stijn-uva added 3 commits June 2, 2026 16:37

Clean up docstrings, etc

ac2ba30

Address #563

d378f8b

Merge branch 'master' into ollama_management

5afa8b5

Fix migrate of provider settings

cdb5e2e

stijn-uva and others added 7 commits June 8, 2026 15:58

Merge branch 'master' into ollama_management

eda635f

Better test error stack trace

b440536

Merge branch 'master' into ollama_management

fec39dc

git fix: no tracking extensions syslink (already added to .gitignore …

ab9c7ae

…on main)

fix allowed_models validation.

866765b

fix: missing user supplied api_key (?)

17e1bee

fix migrate script; also db.upsert needs constraints

f3f8d9c

dale-wahl added 6 commits June 10, 2026 10:48

fix third party SDK routing

b729c00

`provider_key` was not used; made a general `wrapper` key for the model.

llm-server.html add status column because I am too dense to understan…

f29ea14

…d "Status" with action buttons

llm_manager fixes: do not delete available models when refreshing onl…

af7f9b4

…y one provider; do not accidently delete enabled models when connection fails

docker_setup.py: fix no-op on ollama_url compare (and add connection …

5f8a711

…errors printing on fail)

llm-server.html: value name mismatch w/ endpoint

1d14f43

ItemUpdater placeholder: delete existing interval job if running.

e28235f

dale-wahl commented Jun 10, 2026

View reviewed changes

regex fix ints

5382403

dale-wahl commented Jun 10, 2026

View reviewed changes

Merge branch 'master' into ollama_management

6dfca47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flexible LLM providers + manage Ollama with 4CAT#576

Flexible LLM providers + manage Ollama with 4CAT#576
dale-wahl wants to merge 71 commits into
masterfrom
ollama_management

dale-wahl commented Mar 5, 2026 •

edited by stijn-uva

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 10, 2026

Uh oh!

Copilot AI Mar 10, 2026

Uh oh!

Uh oh!

dale-wahl commented Mar 10, 2026

Uh oh!

Copilot AI commented Mar 10, 2026

Uh oh!

stijn-uva commented May 21, 2026 •

edited

Loading

Uh oh!

stijn-uva commented Jun 2, 2026 •

edited

Loading

Uh oh!

sal-uva commented Jun 4, 2026

Uh oh!

stijn-uva commented Jun 8, 2026

Uh oh!

dale-wahl commented Jun 9, 2026

Uh oh!

dale-wahl Jun 10, 2026

Uh oh!

dale-wahl left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

dale-wahl commented Mar 5, 2026 • edited by stijn-uva Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dale-wahl commented Mar 10, 2026

Uh oh!

Copilot AI commented Mar 10, 2026

Uh oh!

stijn-uva commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stijn-uva commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sal-uva commented Jun 4, 2026

Uh oh!

stijn-uva commented Jun 8, 2026

Uh oh!

dale-wahl commented Jun 9, 2026

Uh oh!

dale-wahl Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

dale-wahl left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

dale-wahl commented Mar 5, 2026 •

edited by stijn-uva

Loading

stijn-uva commented May 21, 2026 •

edited

Loading

stijn-uva commented Jun 2, 2026 •

edited

Loading