ai-toolsApple Siliconlocal inferenceLLMhigh performanceOpenAI compatible

Rapid-MLX

The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Drop-in OpenAI replacement.

5 views834 stars5/4/2026

Visit GitHub