Ardan Labs

Ardan Labs

Creator
0 followers

Deep‑dive Go courses and systems/performance engineering topics relevant to building reliable services.

Invest in Yourself & Master AI Agents
VideoApr 30, 2026

Invest in Yourself & Master AI Agents

The speaker urges professionals to invest in themselves by mastering AI agents, dedicating a portion of weekly work time to develop the skill set needed to harness these tools effectively. Key recommendations include allocating roughly 20 % of one’s weekly schedule to...

By Ardan Labs
Kronk AI: Hugging Face & Vision Model File Formats
VideoApr 27, 2026

Kronk AI: Hugging Face & Vision Model File Formats

The video walks through the file structure required for vision‑oriented models hosted on Hugging Face, emphasizing that unlike pure‑text models they ship two distinct artifacts: the core model binary and a companion projection file. The projection file is consumed by Llama‑CPP’s...

By Ardan Labs
Kronk AI: Understanding GGUF & Jinja Chat Templates
VideoApr 23, 2026

Kronk AI: Understanding GGUF & Jinja Chat Templates

The video walks viewers through the GGUF model format and Jinja‑based chat templates, showing how to locate, download, and run large language models from Hugging Face. It highlights Unsloth as the go‑to provider for GGUF files and advises checking each...

By Ardan Labs
VLLM Vs. Kronk: Choosing the Best AI Engine for Your App
VideoApr 17, 2026

VLLM Vs. Kronk: Choosing the Best AI Engine for Your App

The video contrasts two local model inference engines—VLLM and Kron—explaining their distinct design philosophies and target use‑cases. VLLM is presented as the leading production‑grade server for deploying large language models at scale, engineered to handle thousands of concurrent users and...

By Ardan Labs
Understanding Context Limits in Kronk: GGUF, Truncation, & Summarization
VideoApr 9, 2026

Understanding Context Limits in Kronk: GGUF, Truncation, & Summarization

The video explains how Kronk determines a model’s context window using the GGUF file and why respecting that limit is essential for stable operation. For example, the presenter points to a model with a 256 K token limit, noting that the limit...

By Ardan Labs
Optimizing Local AI: Kronk + Metrics for Gauging Performance
VideoApr 2, 2026

Optimizing Local AI: Kronk + Metrics for Gauging Performance

The video introduces Kronk’s new “playground” tool for locally running AI models, showing how it automatically evaluates multiple configuration combos to identify optimal settings for a given machine. The presenter argues that traditional tokens-per-second (TPS) numbers are misleading, emphasizing that the...

By Ardan Labs
Rethinking AI Deployment: Self Contained AI with Go and Kronk
VideoMar 19, 2026

Rethinking AI Deployment: Self Contained AI with Go and Kronk

The video introduces Kron SDK, a Go‑based toolkit that lets developers embed the model serving logic directly into their applications, removing the traditional separate model server. By compiling the entire RAG stack—including a vector database—into one Go binary, developers can deploy...

By Ardan Labs
Turn Plain English Into SQL Queries with Go and LLMs
VideoMar 16, 2026

Turn Plain English Into SQL Queries with Go and LLMs

The video walks through a Go‑based prototype that lets users ask plain‑English questions about a DuckDB database and have a large language model generate the corresponding SQL, execute it, and return a natural‑language answer. The implementation follows a two‑prompt workflow: the...

By Ardan Labs
Bill Kennedy at FOSDEM'26: Directly Integrating LLM Models Into Go Applications
VideoMar 6, 2026

Bill Kennedy at FOSDEM'26: Directly Integrating LLM Models Into Go Applications

At FOSDEM ‘26 Bill Kennedy unveiled a new approach for integrating large‑language‑model inference directly into Go applications, bypassing traditional model‑server architectures. He explained how licensing costs and the need to run separate C‑or‑Python services have hampered Go developers. Ron Evans’ pure‑Go FFI...

By Ardan Labs
Ardan Labs | Pulse