* [NVIDIA AI Cluster Runtime v0.14.0](https://github.com/NVIDIA/aicr) – Tooling for optimized, validated, reproducible GPU-accelerated Kubernetes clusters via version-locked recipes and deployment-ready bundles.
* [GoGPU v0.41.0](https://github.com/gogpu/gogpu) – Pure Go GPU computing ecosystem offering WebGPU-compatible APIs with selectable Rust or native backends and zero CGO.
* [RapidFire AI v0.16.0](https://github.com/RapidFireAI/rapidfireai) – Hyperparallelized, shard-based experiment framework for rapid LLM customization, RAG, context engineering, and fine-tuning with real-time control.
* [TypeGPU v0.11.8](https://github.com/software-mansion/TypeGPU) – TypeScript library enhancing the WebGPU API for type-safe resource management.
* [LLMKube llmkube-0.8.0](https://github.com/defilantech/LLMKube) – Kubernetes operator managing self-hosted LLM inference on NVIDIA GPUs and Apple Silicon, with autoscaling, model routing, and OpenAI-compatible API.