* [Jan - Open-source ChatGPT replacement v0.8.0](https://github.com/janhq/jan) – Runs local and cloud AI models with privacy-focused control and customizable assistants.

* [yzma v1.15.0](https://github.com/hybridgroup/yzma) – Go-based library for hardware-accelerated local inference with llama.cpp integration.

* [vLLM Studio v1.24.0](https://github.com/sybil-solutions/vllm-studio) – Unified local AI workstation for model lifecycle, chat/agent workflows, orchestration, observability, and remote deployment.

* [vLLM Studio v1.20.0](https://github.com/0xSero/vllm-studio) – Unified local AI workstation for model lifecycle, chat/agent workflows, orchestration, observability, and remote deployment.

* [llama-swap v219](https://github.com/mostlygeek/llama-swap) – Reliable on-demand model switching between local OpenAI-compatible inference servers without restarting applications.