ai-tools, llm, ollama, llama, context, vram, local
Local Inference Context
Context management for self-hosted LLM backends (llama.cpp, Ollama). Prevents mid-task 503 errors and context overflows caused by VRAM-limited KV caches.
30 downloads · 5/6/2026
Install Command
npx clawhub@latest install joekravelli/local-inference-context
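
To illustrate why a VRAM-limited KV cache caps the usable context, here is a minimal sketch of the standard KV cache size estimate for a transformer. It is not taken from this skill; the function names and the example model shape (32 layers, 8 KV heads, head dimension 128, 16-bit cache) are assumptions chosen for illustration, and real backends add overhead beyond this figure.

```python
def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   n_ctx: int, bytes_per_elem: int = 2) -> int:
    """Approximate KV cache size for one sequence of n_ctx tokens.

    The leading factor 2 accounts for storing both keys and values.
    bytes_per_elem=2 assumes a 16-bit (f16) cache.
    """
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem


def max_context_tokens(vram_budget_bytes: int, n_layers: int, n_kv_heads: int,
                       head_dim: int, bytes_per_elem: int = 2) -> int:
    """Largest context length whose KV cache fits in the given VRAM budget."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem
    return vram_budget_bytes // per_token


if __name__ == "__main__":
    GIB = 1024 ** 3
    # Hypothetical model shape, for illustration only.
    size = kv_cache_bytes(n_layers=32, n_kv_heads=8, head_dim=128, n_ctx=8192)
    print(f"KV cache at 8192 tokens: {size / GIB:.2f} GiB")
    fit = max_context_tokens(2 * GIB, n_layers=32, n_kv_heads=8, head_dim=128)
    print(f"Tokens that fit in a 2 GiB cache budget: {fit}")
```

Under these assumptions each token costs 128 KiB of cache, so the VRAM left over after loading the model weights sets a hard ceiling on context length. A request that exceeds that ceiling is the kind of overflow the description refers to.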