AI-Notes

Useful references in this area

Search

Perplexity - https://www.perplexity.ai/ - Given google's descent into mediocrity
Duck AI - https://duckduckgo.com/?q=DuckDuckGo+AI+Chat&ia=chat&duckai=1
Scira - https://scira.ai/

Robots

ChatBot Arena - https://lmarena.ai/ - ManyBots?
ChatGPT - https://openai.com/index/chatgpt/
Claude - https://claude.ai/login?returnTo=%2F%3F - You have to try Mr Shannon at least once, right?
Deepseek - https://chat.deepseek.com/ - Anyone not like 10x cheaper?
Gemini - https://gemini.google.com/app/bf9ff53a9cd3ff1e?hl=en-AU - Just like the old ad
- https://www.google.com/search?udm=50&aep=11
Genspark - https://www.genspark.ai/
Huggingchat - point of interesting being Cohere Command R - https://huggingface.co/chat/
Kimi - https://www.kimi.com
Le Chat - https://chat.mistral.ai/chat
Mercury Playground - https://chat.inceptionlabs.ai/ -> Diffusion LLM test
z.ai - https://chat.z.ai/
- https://docs.z.ai/api-reference/introduction [API]
Qwen - https://chat.qwen.ai/
DeepWiki - https://deepwiki.org/ - automagic github wiki overviews
https://chatjimmy.ai/ - Llama 3 8B on a chip - blazing fast

Robot Aggregators

OpenRouter - https://openrouter.ai/ - Real ManyBots

UI

https://github.com/oobabooga/text-generation-webui

Domained

Geology Oracle - https://geologyoracle.com/

Research Analysis

Notebook LM

https://notebooklm.google/
- note says 300 sources with the paid version but seems to stop working at some stage with total content - e.g. if you put books in

alphaXiv

https://www.alphaxiv.org/ - Chat with arxiv

sciarena

https://sciarena.allen.ai/ - compare models for a research question

Research

Models

https://huggingface.co/unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit

Tools

ollama

https://ollama.com/search

claude code

if git bash installed for a user then setx CLAUDE_CODE_GIT_BASH_PATH "C:\Users\rscott\AppData\Local\Programs\Git\bin\bash.exe"

so claude works

increase context create 64K Modelfile

FROM glm-4.7-flash

PARAMETER num_ctx 65536

command

ollama create glm-4.7-flash-64k -f Modelfile

launch and choose model

ollama launch claude --config

Llama.cpp https://github.com/ggml-org/llama.cpp

llama-server

pip install llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu128

Go to the llama.cpp releases page
Download the package matching your GPU:
- NVIDIA: llama--bin-win-cuda-cu12.2.0-x64.zip (or whichever CUDA version matches your driver)
- AMD: llama--bin-win-vulkan-x64.zip
Extract it — llama-server.exe is right there, no install needed
Run it:

llama-server.exe -m your-model.gguf -ngl 99 --host 0.0.0.0 --port 8080

-ngl 99 offloads all layers to GPU.

llama-server.exe -m path\to\zai-org_GLM-4.6V-Flash-Q6_K_L.gguf --mmproj path\to\mmproj-zai-org_GLM-4.6V-Flash-f.gguf --port 8080 -ngl 99

llama-server -m "D:\llama\Qwen3.6-35B-A3B-UD-Q4_K_M.gguf" --alias qwen36-35b-a3b --host 127.0.0.1 --port 8080 -c 131072 -ngl 999 -fa on --jinja --no-mmap --cache-type-k q8_0 --cache-type-v q8_0 --n-cpu-moe 30

Prerequisites:

NVIDIA: Make sure you have the CUDA toolkit (or at least the CUDA runtime DLLs) matching the release you downloaded. Usually having up-to-date NVIDIA drivers is enough since they bundle the runtime.
AMD: Vulkan drivers (typically included with AMD Adrenalin drivers).

That's it — no compilation, no WSL needed. Just extract and run. ▸ Time: 10s

LLM https://github.com/simonw/llm [from Datasette]
Opencode https://github.com/sst/opencode
- anomalyco/opencode#1669 - using opencode and ollama
- opencode go https://opencode.ai/go
Claude Code router - https://github.com/musistudio/claude-code-router

Tools - Need Signup

Gemini cli [current decent free use level - but slow as a consequence]
- no longer useful and also deprecated
- antigravity migration, antigravity interface apparently being a cluster https://www.antigravity.google/docs/gcli-migration
Cursor - see Composer 2.5 variant
Copilot
Copilot cli

Amazon

Amazon Q Developer

kiro-cli

now has native windows version - which is of course buggy as would appear to be the usual js wrapper around other things
https://docs.aws.amazon.com/amazonq/latest/qdeveloper-ug/what-is.html
https://kiro.dev/docs/cli/
- curl -fsSL https://cli.kiro.dev/install | bash

Important! Before you can continue, you must update your PATH to include: /home/rscott/.local/bin

Add it to your PATH by adding this line to your shell configuration file: export PATH="$HOME/.local/bin:$PATH"

Use the command "kiro-cli" to get started!

rscott@bananasplits:/mnt/c/Users/rscott$ nano ~/.bashrc rscott@bananasplits:/mnt/c/Users/rscott$ source ~/.bashrc

pi

plugins

now native to Ollama as well
https://github.com/tmustier/pi-extensions/tree/main/pi-ralph-wiggum
- seems to accumulate context in chat session not clear per iteration
https://github.com/rahulmutt/pi-ralph - is a simple looper apparently

Qwen 3.6

the 4B KM_M quants seem useable locally
the 22.1GB 35B_3AB MOE with some expert offloading to the cpu works on 16GB - e.g. an ancient Tesla can run it - test settings getting closer to 20 T/S
https://insiderllm.com/guides/best-way-run-qwen-3-6-35b-moe-locally/
https://medium.com/@tolgaeren/running-pi-with-local-llms-c596aa14b062
- C:\Users\rscott>C:\Users\rscott\llama\llama-b9673-bin-win-cuda-12.4-x64\llama-server -m "C:\Users\rscott.cache\huggingface\hub\models--unsloth--Qwen3.6-35B-A3B-GGUF\snapshots\a483e9e6cbd595906af30beda3187c2663a1118c\Qwen3.6-35B-A3B-UD-Q4_K_M.gguf" --host 127.0.0.1 --port 8080 -c 131072 -ngl 999 -fa on --jinja --no-mmap --n-cpu-moe 30

Small Models

https://huggingface.co/Nanbeige/Nanbeige4.1-3B - find out why this one is interesting

Info

https://latentpatterns.com/
- from the OG Ralph

Data

https://github.com/abhigyanpatwari/GitNexus

Science

https://openclaws.io/blog/science-cluster-experiment/

Models

Gear

https://www.dell.com/en-au/lp/dt/nvidia-ai AI-Factory

Utilities

Rust Token Killer https://github.com/rtk-ai/rtk

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
pi		pi
.gitignore		.gitignore
README.md		README.md
deep-research-example.md		deep-research-example.md
open_deep_research_baseline.ipynb		open_deep_research_baseline.ipynb

Folders and files

Latest commit

History

Repository files navigation

AI-Notes

Search

Robots

Robot Aggregators

UI

Domained

Research Analysis

Notebook LM

alphaXiv

sciarena

Research

Models

Tools

ollama

claude code

increase context create 64K Modelfile

command

launch and choose model

llama-server

Tools - Need Signup

Amazon

kiro-cli

pi

plugins

Qwen 3.6

Orchestrators

Max Headroom

Tool calling hack

Papers

Small Models

Info

Data

Science

Models

Gear

Utilities

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages