
How Rafay & NVIDIA Help NeoClouds Monetize AI with Token Factories
The AI surge has spurred a new class of GPU‑first cloud providers, called neoclouds, that initially sold raw GPU capacity. Rafay’s Token Factory now lets these providers expose models as token‑metered APIs, turning infrastructure into a consumable AI service. Deep integration with NVIDIA’s NIM and Dynamo stacks accelerates model deployment and optimizes inference performance. This creates a marketplace where enterprises, developers, and model creators transact on a pay‑as‑you‑go basis, unlocking higher GPU utilization and new revenue streams.

Rafay Launches AI Grid Orchestration Solution to Help Telcos Intelligently Deploy Distributed AI Infrastructure
Rafay, an NVIDIA Inception startup, unveiled an AI Grid orchestration platform that turns existing telco edge infrastructure into a self‑service, multi‑tenant AI factory. The solution lets operators express intent—such as latency, cost, or security requirements—and automatically places GPU workloads across...

From Infrastructure Validation to Market Validation: Rafay and NVIDIA DSX Air
NVIDIA DSX Air provides a full‑stack simulation that lets cloud providers validate networking, GPU servers, storage and connectivity before any rack is shipped. Rafay layers a self‑service orchestration platform on top, enabling multi‑tenant, governance and workflow testing alongside the hardware...
.png)
AI Assistants for Kubernetes: Secure Cluster Operations with MCP and Rafay ZTKA
The Model Context Protocol (MCP) lets AI assistants run Kubernetes commands through a local server while Rafay’s Zero Trust Kubectl Access (ZTKA) supplies a secure, token‑less kubeconfig. This architecture places the MCP server on the admin workstation, routes traffic via...

Run GPU Hackathons at Scale: How Rafay Enables GPU Cloud Providers
Rafay’s platform lets GPU cloud operators provision and manage thousands of GPU‑backed Jupyter notebooks for hackathons through a declarative API and templated SKUs. By batching parallel API calls and using an inventory‑aware scheduler, operators can spin up 1,000 environments in...
.png)
Validate GPU Health in Kubernetes with Rafay Zero Trust Kubectl Access
Rafay’s zero‑trust kubectl lets operators run commands inside pods on remote GPU‑enabled Kubernetes clusters without exposing the API or using bastion hosts. Using this workflow, they open an exec session to the nvidia‑dcgm‑exporter pod and execute nvidia‑smi to verify driver,...

Rafay Joins VAST Cosmos to Enable Governed GPU-Powered AI Services
Rafay has joined the VAST Cosmos Community as a Technology Partner, aligning its AI‑native cloud control plane with VAST Data’s AI Operating System. The collaboration integrates Rafay’s orchestration platform with VAST’s governed storage services, creating a unified, multi‑tenant AI service...

What Is an AI Factory? Enterprise & Cloud Guide
An AI factory is an operational model that industrializes artificial‑intelligence development by linking high‑performance compute, data pipelines, orchestration, governance and deployment into a continuous production system. The concept, popularized by NVIDIA, moves AI from isolated experiments to repeatable, scalable outputs....

From Tickets to Self-Service AI Infrastructure
Many enterprises still provision AI resources through ticket systems, causing delays and underutilized GPUs. Modern developers now expect instant, self‑service access similar to hyperscaler offerings, making manual approval a competitive risk. The shift to automated, governed platforms improves utilization, speeds...

What Is Amazon EKS? EKS & EKS Anywhere Explained | Rafay
Amazon Elastic Kubernetes Service (EKS) dominates the managed Kubernetes market with roughly 50% share, offering a fully managed control plane, deep AWS integration, and serverless compute via Fargate. EKS Anywhere, launched in 2020, extends the same open‑source distro to on‑premise...
.png)
Migrating Existing Amazon EKS Clusters to EKS Auto Mode | Rafay
Amazon EKS Auto Mode automates node scaling, patching, and add‑on management, but AWS does not provide an automated path for migrating applications, storage, or ingress. Rafay offers a guided, cluster‑level migration process that includes converting to access entries, enabling Auto...