MarkTechPost

Publication

Showcases the hottest research trends in AI from around the world

News•Feb 24, 2026

Alibaba Qwen Team Releases Qwen 3.5 Medium Model Series: A Production Powerhouse Proving that Smaller AI Models Are Smarter

Alibaba’s Qwen team unveiled the Qwen 3.5 Medium series, including 35B‑A3B, 27B, and 122B‑A10B models that rely on Mixture‑of‑Experts and reinforcement learning. The 35B‑A3B model activates only 3 billion of its 35 billion parameters per token, yet it outperforms the older 235 billion‑parameter Qwen‑3, pushing the efficiency frontier forward. Qwen 3.5‑Flash adds a 1‑million‑token context window and built‑in tool‑calling for low‑latency production workloads. The lineup targets the “Goldilocks” sweet spot, offering frontier reasoning on modest hardware.

Composio Open Sources Agent Orchestrator to Help AI Developers Build Scalable Multi-Agent Workflows Beyond the Traditional ReAct Loops

Beyond Simple API Requests: How OpenAI’s WebSocket Mode Changes the Game for Low Latency Voice Powered AI Experiences

How to Build a Production-Grade Customer Support Automation Pipeline with Griptape Using Deterministic Tools and Agentic Reasoning

VectifyAI Launches Mafin 2.5 and PageIndex: Achieving 98.7% Financial RAG Accuracy with a New Open-Source Vectorless Tree Indexing

A Coding Guide to Instrumenting, Tracing, and Evaluating LLM Applications Using TruLens and OpenAI Models

A New Google AI Research Proposes Deep-Thinking Ratio to Improve LLM Accuracy While Cutting Total Inference Costs by Half

Is There a Community Edition of Palantir? Meet OpenPlanter: An Open Source Recursive AI Agent for Your Micro Surveillance Use...

NVIDIA Releases Dynamo v0.9.0: A Massive Infrastructure Overhaul Featuring FlashIndexer, Multi-Modal Support, and Removed NATS and ETCD

Zyphra Releases ZUNA: A 380M-Parameter BCI Foundation Model for EEG Data, Advancing Noninvasive Thought-to-Text Development

[Tutorial] Building a Visual Document Retrieval Pipeline with ColPali and Late Interaction Scoring

How to Build an Advanced, Interactive Exploratory Data Analysis Workflow Using PyGWalker and Feature-Engineered Data

Cloudflare Releases Agents SDK v0.5.0 with Rewritten @Cloudflare/Ai-Chat and New Rust-Powered Infire Engine for Optimized Edge Inference Performance

Agoda Open Sources APIAgent to Convert Any REST or GraphQL API Into an MCP Server with Zero Code

Moonshot AI Launches Kimi Claw: Native OpenClaw on Kimi.com with 5,000 Community Skills and 40GB Cloud Storage Now

How to Build a Self-Organizing Agent Memory System for Long-Term AI Reasoning

[In-Depth Guide] The Complete CTGAN + SDV Pipeline for High-Fidelity Synthetic Data

Kyutai Releases Hibiki-Zero: A 3B Parameter Simultaneous Speech-to-Speech Translation Model Using GRPO Reinforcement Learning Without Any Word-Level Aligned Data

How to Design Complex Deep Learning Tensor Pipelines Using Einops with Vision, Attention, and Multimodal Examples

Alibaba Open-Sources Zvec: An Embedded Vector Database Bringing SQLite-Like Simplicity and High-Performance On-Device RAG to Edge Applications

A Coding Implementation to Establish Rigorous Prompt Versioning and Regression Testing Workflows for Large Language Models Using MLflow

ByteDance Releases Protenix-V1: A New Open-Source Model Achieving AF3-Level Performance in Biomolecular Structure Prediction

NVIDIA AI Releases VibeTensor: An AI Generated Deep Learning Runtime Built End to End by Coding Agents Programmatically

How to Build Efficient Agentic Reasoning Systems by Dynamically Pruning Multiple Chain-of-Thought Paths Without Losing Accuracy

Google Introduces Agentic Vision in Gemini 3 Flash for Active Image Understanding

Google Releases Conductor: A Context Driven Gemini CLI Extension that Stores Knowledge as Markdown and Orchestrates Agentic Workflows

NVIDIA AI Brings Nemotron-3-Nano-30B to NVFP4 with Quantization Aware Distillation (QAD) for Efficient Reasoning Inference

How to Build Memory-Driven AI Agents with Short-Term, Long-Term, and Episodic Memory

A Coding and Experimental Analysis of Decentralized Federated Learning with Gossip Protocols and Differential Privacy

A Coding Deep Dive Into Differentiable Computer Vision with Kornia Using Geometry Optimization, LoFTR Matching, and GPU Augmentations

MBZUAI Releases K2 Think V2: A Fully Sovereign 70B Reasoning Model For Math, Code, And Science

Tencent Hunyuan Releases HPC-Ops: A High Performance LLM Inference Operator Library

DSGym Offers a Reusable Container Based Substrate for Building and Benchmarking Data Science Agents

How a Haystack-Powered Multi-Agent System Detects Incidents, Investigates Metrics and Logs, and Produces Production-Grade Incident Reviews End-to-End

StepFun AI Introduces Step-DeepResearch: A Cost-Effective Deep Research Agent Model Built Around Atomic Capabilities

How an AI Agent Chooses What to Do Under Token, Latency, and Tool-Call Budget Constraints

Microsoft Releases VibeVoice-ASR: A Unified Speech-to-Text Model Designed to Handle 60-Minute Long-Form Audio in a Single Pass

Inworld AI Releases TTS-1.5 For Realtime, Production Grade Voice Agents

How AutoGluon Enables Modern AutoML Pipelines for Production-Grade Tabular Models with Ensembling and Distillation

Liquid AI Releases LFM2.5-1.2B-Thinking: A 1.2B Parameter Reasoning Model That Fits Under 1 GB On-Device

A Coding Guide to Anemoi-Style Semi-Centralized Agentic Systems Using Peer-to-Peer Critic Loops in LangGraph

Nous Research Releases NousCoder-14B: A Competitive Olympiad Programming Model Post-Trained on Qwen3-14B via Reinforcement Learning

Vercel Releases Agent Skills: A Package Manager For AI Coding Agents With 10 Years of React and Next.js Optimisation Rules

NVIDIA Releases PersonaPlex-7B-V1: A Real-Time Speech-to-Speech Model Designed for Natural and Full-Duplex Conversations

How to Build a Self-Evaluating Agentic AI System with LlamaIndex and OpenAI Using Retrieval, Tool Use, and Automated Quality Checks

Black Forest Labs Releases FLUX.2 [Klein]: Compact Flow Models for Interactive Visual Intelligence

How to Build a Stateless, Secure, and Asynchronous MCP-Style Protocol for Scalable Agent Workflows

Understanding the Layers of AI Observability in the Age of LLMs

How to Build a Multi-Turn Crescendo Red-Teaming Pipeline to Evaluate and Stress-Test LLM Safety Using Garak

How This Agentic Memory Research Unifies Long Term and Short Term Memory for LLM Agents