Companies Mentioned
Why It Matters
The combined stack shortens time‑to‑market for AI services, cuts operational complexity, and provides validated reliability, giving cloud operators a competitive edge in the rapidly growing AI infrastructure market.
Key Takeaways
- •Rafay embeds NVIDIA DSX OS to automate GPU cloud provisioning.
- •NICo and AICR deliver hardware‑enforced tenant isolation and version‑locked runtimes.
- •Integrated KAI Scheduler and Run:ai enable fractional GPU allocation.
- •NVIDIA Cloud Functions provide unified APIs for inference, fine‑tuning, batch.
- •First ISV validated under NVIDIA AI Cloud‑Ready, ensuring end‑to‑end reliability.
Pulse Analysis
AI factories have moved beyond the question of raw GPU capacity; the bottleneck now lies in turning that horsepower into reliable, multi‑tenant services. NVIDIA’s DSX OS, an open‑source framework derived from DGX Cloud experience, supplies the modular building blocks for lifecycle automation, runtime consistency, and tenant isolation. By exposing these components as APIs, DSX OS enables operators to stitch together provisioning, scheduling, and serving layers without reinventing the wheel, a crucial advantage as enterprises demand on‑demand AI capabilities at scale.
Rafay’s platform leverages DSX OS across two critical domains. First, NICo and AICR automate bare‑metal lifecycle management and lock runtime configurations, using BlueField DPUs and the DOCA framework to enforce zero‑trust isolation. Second, the integration of KAI Scheduler, Run:ai, Dynamo, Grove, and NVIDIA Cloud Functions turns provisioned hardware into production‑grade AI services with fractional GPU allocation, hierarchical quotas, and unified inference APIs. The solution’s validation under NVIDIA’s AI Cloud‑Ready ISV program guarantees that the stack meets NVIDIA’s end‑to‑end performance and reliability standards, reducing risk for operators.
For the broader market, Rafay’s end‑to‑end offering accelerates the path from infrastructure investment to revenue‑generating AI services. Operators can now launch inference, fine‑tuning, and batch workloads with a single consumable platform, slashing engineering costs and shortening deployment cycles. This streamlined approach is likely to spur wider adoption of AI factories, especially among hyperscalers and telco edge providers seeking to monetize GPU assets. As open‑source AI infrastructure matures, partnerships that bundle validated components—like Rafay and NVIDIA—will become the blueprint for scalable, cost‑effective AI cloud services.
Rafay Operationalizes NVIDIA DSX OS for AI Factories

Comments
Want to join the conversation?
Loading comments...