Hamel Husain - Latest News and Information
  • All Technology
  • AI
  • Autonomy
  • B2B Growth
  • Big Data
  • BioTech
  • ClimateTech
  • Consumer Tech
  • Cybersecurity
  • DevOps
  • Digital Marketing
  • Ecommerce
  • EdTech
  • Enterprise
  • FinTech
  • GovTech
  • Hardware
  • HealthTech
  • HRTech
  • LegalTech
  • Nanotech
  • PropTech
  • Quantum
  • Robotics
  • SaaS
  • SpaceTech
AllNewsDealsSocialBlogsVideosPodcastsDigests

Technology Pulse

EMAIL DIGESTS

Daily

Every morning

Weekly

Tuesday recap

Top Publishers

  • The Verge AI

    The Verge AI

    21 followers

  • TechCrunch AI

    TechCrunch AI

    19 followers

  • Crunchbase News AI

    Crunchbase News AI

    15 followers

  • TechRadar

    TechRadar

    15 followers

  • Hacker News

    Hacker News

    13 followers

See More →

Top Creators

  • Ryan Allis

    Ryan Allis

    194 followers

  • Elon Musk

    Elon Musk

    78 followers

  • Sam Altman

    Sam Altman

    68 followers

  • Mark Cuban

    Mark Cuban

    56 followers

  • Jack Dorsey

    Jack Dorsey

    39 followers

See More →

Top Companies

  • SaasRise

    SaasRise

    196 followers

  • Anthropic

    Anthropic

    39 followers

  • OpenAI

    OpenAI

    21 followers

  • Hugging Face

    Hugging Face

    15 followers

  • xAI

    xAI

    12 followers

See More →

Top Investors

  • Andreessen Horowitz

    Andreessen Horowitz

    16 followers

  • Y Combinator

    Y Combinator

    15 followers

  • Sequoia Capital

    Sequoia Capital

    12 followers

  • General Catalyst

    General Catalyst

    8 followers

  • A16Z Crypto

    A16Z Crypto

    5 followers

See More →
NewsDealsSocialBlogsVideosPodcasts
Hamel Husain

Hamel Husain

Publication
0 followers

Applied AI engineering and tooling; patterns for shipping AI systems, evals, and agents.

Evals Skills for Coding Agents
News•Mar 2, 2026

Evals Skills for Coding Agents

Ham­el Husain released evals‑skills, an open‑source plugin that equips AI coding agents with a toolbox for product‑specific evaluation. The package introduces an eval‑audit skill that inspects six diagnostic areas of an evaluation pipeline and a suite of targeted skills for error analysis, synthetic data generation, judge prompt creation, evaluator validation, RAG assessment, and review‑interface building. It is built to complement existing MCP servers from vendors such as Braintrust, LangSmith, and Phoenix, allowing agents to both run experiments and interpret outcomes. By providing these reusable components, developers can accelerate reliable AI product monitoring and extend the framework with custom, domain‑specific skills.

By Hamel Husain