llm-inference
| Date | Project Name | 🎉 | Language / Stars / Age | Tags |
|---|---|---|---|---|
| 01/26 | [The LeaderWorkerSet API (LWS) v0.8.0](https://github.com/kubernetes-sigs/lws) – API for deploying and managing groups of pods as a single unit with leader and worker roles for multi-host inference workloads. | 7 | Go · 654 ⭐ · 706 days old | golang, go, sig-apps, llm-inference |
| 01/14 | [llamactl v0.14.2](https://github.com/lordmathis/llamactl) – Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard. | 5 | Go · 69 ⭐ · 186 days old | llm, go, llama-cpp, llamacpp, llm-inference, llama-server |