Mindful Modeler

Creator

0 followers

Better machine learning by thinking like a statistician. About model interpretation, paying attention to data, and always staying critical.

Blog•Jun 9, 2026

How TabICL and TabPFN Handle Missing Values

TabICL and TabPFN, two leading tabular foundation models, both accept datasets with missing entries, but they use distinct internal mechanisms. TabICL applies a simple preprocessing step that creates a missing‑value category for categoricals and fills numeric gaps with the column mean. TabPFN, by contrast, encodes NaNs with a binary indicator and sentinel values, concatenating this missingness channel to the feature vector before embedding. The differing approaches affect bias, interpretability, and suitability for complex missing‑data patterns.

By Mindful Modeler

Blog•May 19, 2026

Tabular ML Is Entering a New Benchmark Era

Tabular machine learning is moving from static paper‑based benchmarks to live, capability‑oriented leaderboards such as TabArena, ScoringBench, and MulTaBench. The rise of tabular foundation models like TabICL and TabPFN‑3.0 enables a single pre‑trained model to be evaluated across tasks—classification, regression,...

By Mindful Modeler

Blog•May 5, 2026

Time Series Forecasting with Tabular Foundation Models

The post demonstrates how tabular foundation models (TFMs) can be repurposed for time‑series forecasting by treating the problem as a regression task. By automatically enriching a simple timestamp‑target table with temporal features, models like TabICL and TabPFN‑TS generate one‑shot forecasts....

By Mindful Modeler

Blog•Apr 21, 2026

Context Is the New Training

Tabular foundation models such as TabPFN and TabICL replace classic training with in‑context learning, where the so‑called training data becomes context supplied at prediction time. Because model weights remain static after pre‑training, only the context data changes the embeddings and...

By Mindful Modeler

Blog•Apr 14, 2026

Regression Should Predict Full Distributions

The post argues that regression models should output full predictive distributions rather than single-point estimates. It highlights tabular foundation models such as TabPFN and TabICL, which internally generate discretized distributions and can return quantiles with a simple parameter change, without...

By Mindful Modeler

Blog•Mar 31, 2026

TabArena Explained

TabArena is a living benchmark for tabular machine‑learning models hosted on HuggingFace, featuring a strict preprocessing and evaluation protocol. It evaluates 51 curated tasks—13 regression and 38 classification datasets—using an Elo rating system to compare algorithms pairwise. Recent Prior Labs’...

By Mindful Modeler

Blog•Mar 24, 2026

The Interpretability Tax on Tabular Foundation Models

The post examines how classic model‑agnostic interpretability tools, such as permutation feature importance (PFI) and LOCO, operate on tabular foundation models (TFMs). While these methods function out‑of‑the‑box, TFMs flip the traditional cost balance: training is cheap but inference is expensive,...

By Mindful Modeler

Blog•Mar 3, 2026

The Random Forest of the 2030s?

Tabular foundation models (TFMs) are transformer‑based systems that perform in‑context learning on combined training and test data without parameter updates. The author outlines three adoption scenarios: as another algorithm (Level 1), as the go‑to quick‑and‑dirty baseline replacing Random Forests (Level 2), and...

By Mindful Modeler

Mindful Modeler

How TabICL and TabPFN Handle Missing Values

Tabular ML Is Entering a New Benchmark Era

Time Series Forecasting with Tabular Foundation Models

Context Is the New Training

Regression Should Predict Full Distributions

TabArena Explained

The Interpretability Tax on Tabular Foundation Models

The Random Forest of the 2030s?

Technology Pulse