Entrepreneurship Leadership AI CEO Pulse

CEO Interview with Dr. Mohammad Rastegari of Elastix.AI

•March 15, 2026

SemiWiki•Mar 15, 2026

Key Takeaways

•Elastix.AI uses FPGA reconfigurability for AI inference.
•Claims up to 10× lower inference cost versus GPUs.
•Power efficiency improves up to five times.
•Supports dynamic model-to-hardware mapping.
•Offers token-as-a-service, subscription, and leasing options.

Summary

Elastix.AI, led by former Meta and Apple AI scientist Dr. Mohammad Rastegari, is building a reconfigurable FPGA‑based inference platform that promises dramatically lower cost and power consumption for large‑scale AI models. The company claims up to ten‑fold reductions in inference spend and five‑fold gains in energy efficiency compared with traditional GPU clusters. By coupling advanced model optimization with hardware that can be re‑programmed after deployment, Elastix.AI aims to eliminate the rigid, costly refresh cycles that currently dominate AI infrastructure. Its business model spans token‑as‑a‑service, software subscriptions, and hardware leasing or sales.

Pulse Analysis

The rapid expansion of large language models and generative AI has turned inference into a major cost driver for enterprises. While GPUs have powered the training boom, their high capital outlay, power hunger, and inflexibility make them ill‑suited for the sustained, high‑throughput workloads of production AI services. Data‑center operators are therefore seeking alternatives that can deliver comparable performance without the escalating electricity bills and frequent hardware refreshes that accompany GPU‑only stacks.

Elastix.AI’s strategy leverages field‑programmable gate arrays, which combine the parallelism of custom silicon with the adaptability of software. By integrating model‑specific optimization pipelines directly into the FPGA fabric, the company can tailor compute pathways to each workload, extracting efficiency gains that traditional GPUs cannot match. This reconfigurability also sidesteps the long development cycles of ASICs, allowing customers to pivot to newer model architectures without ordering new silicon, a distinct advantage in a market where model updates occur every few months.

From a business perspective, Elastix.AI’s multi‑tiered engagement model lowers the barrier to entry for firms of all sizes. Token‑as‑a‑service offers immediate access to low‑cost inference, while subscription and leasing options provide predictable OPEX for larger deployments. If the promised cost and power reductions materialize at scale, the company could capture a significant share of the AI infrastructure market, prompting cloud providers and hyperscalers to reconsider their reliance on GPU‑centric designs and potentially driving a broader shift toward reconfigurable hardware solutions.