* [NVIDIA AI Cluster Runtime v0.14.0](https://github.com/NVIDIA/aicr) – Tooling for optimized, validated, reproducible GPU-accelerated Kubernetes clusters via version-locked recipes and deployment-ready bundles.

* [GoGPU v0.41.0](https://github.com/gogpu/gogpu) – Pure Go GPU computing ecosystem offering WebGPU-compatible APIs with selectable Rust or native backends and zero CGO.

* [RapidFire AI v0.16.0](https://github.com/RapidFireAI/rapidfireai) – Hyperparallelized, shard-based experiment framework for rapid LLM customization, RAG, context engineering, and fine-tuning with real-time control.

* [TypeGPU v0.11.8](https://github.com/software-mansion/TypeGPU) – TypeScript library enhancing the WebGPU API for type-safe resource management.

* [LLMKube v0.8.0](https://github.com/defilantech/LLMKube) – Kubernetes operator managing self-hosted LLM inference on NVIDIA GPUs and Apple Silicon, with autoscaling, model routing, and an OpenAI-compatible API.