Karina | Python | Excel | Stats | DataScience | DataAnalytics

Karina | Python | Excel | Stats | DataScience | DataAnalytics

Creator
0 followers

Data analytics/science educator; active, high‑engagement posts on pandas/Python and big‑data‑adjacent workflows for analytics.

Beeswarm Plots Reveal Hidden Data Clusters Beyond Box Plots
SocialMar 6, 2026

Beeswarm Plots Reveal Hidden Data Clusters Beyond Box Plots

Part 3 of 3 underused chart types worth knowing. A box plot with 15 points looks identical to one with 1,500. You lose all sense of where measurements actually cluster. Beeswarm plots fix this. Every data point is visible. Nothing gets absorbed into...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Bump Charts Simplify Ranking Changes over Time
SocialMar 4, 2026

Bump Charts Simplify Ranking Changes over Time

Part 1 of 3 underused chart types worth knowing You reach for a line plot to show ranking changes over time. The lines cross. It turns into spaghetti. Bump charts fix this. When you care about relative position — not raw values —...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Boost Pandas Performance with Modin, Dask, Polars
SocialMar 2, 2026

Boost Pandas Performance with Modin, Dask, Polars

Python Tip When pandas is too slow, there are other libraries to rescue: - Modin - Easiest switch from pandas Change one line: import modin.pandas as pd Same syntax. Uses all CPU cores - Dask - When data > RAM Processes data in chunks across CPU...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Replace UNION ALL with GROUPING SETS for Faster Aggregations
SocialMar 1, 2026

Replace UNION ALL with GROUPING SETS for Faster Aggregations

Stop Writing UNION ALL for Multi-Level Aggregations You need regional totals AND product totals AND grand totals. So you write three separate queries with UNION ALL. There's a better way: GROUPING SETS. UNION ALL - Scans the table 3 times. Slow. GROUPING SETS - One...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Query CSVs Directly with DuckDB—No Load, Faster
SocialFeb 26, 2026

Query CSVs Directly with DuckDB—No Load, Faster

Instead of loading CSVs into pandas just to run one query, you can use DuckDB to run SQL directly on files. No loading. No waiting. Just query the file and get results. It’s also 20x faster and uses way less memory. Here’s how...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Choose the Right SQL Ranking Function to Avoid Misleading Gaps
SocialFeb 22, 2026

Choose the Right SQL Ranking Function to Avoid Misleading Gaps

ROW_NUMBER(), RANK(), DENSE_RANK(). Three functions, three different behaviors. Pick the wrong one and your rankings mislead. Here are 4 patterns to get it right: - ranking with gaps vs without - top-N per category - deduplication - running totals 1. ROW_NUMBER() vs RANK() vs DENSE_RANK() Three functions, three behaviors...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Analyst Ads Overpromise Python; Excel, SQL Dominate Daily
SocialFeb 21, 2026

Analyst Ads Overpromise Python; Excel, SQL Dominate Daily

Unpopular opinion: Data analyst job postings ask for Python. Data analyst jobs don't actually use Python. What you'll use daily: Excel — every single day SQL — every single day Power BI or Tableau — multiple times per week Python — maybe once a month This pattern holds...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Replace If‑elif Chains with Clean Python Dispatch Patterns
SocialFeb 20, 2026

Replace If‑elif Chains with Clean Python Dispatch Patterns

The more if-elif chains you write, the harder your code gets to change. Python has cleaner patterns for this. Here are 4 worth knowing: - dictionary dispatch - guard clauses - match/case - conditional expressions 1. Dictionary dispatch. Replace long equality checks with a dict. Constant-time lookup. No branching....

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Free Resources Replace Costly Bootcamps—Discipline Is Key
SocialFeb 19, 2026

Free Resources Replace Costly Bootcamps—Discipline Is Key

You don't need a bootcamp to become a data analyst. Everything you need is free: Excel/SQL/Python/Power BI tutorials — YouTube SQL practice — SQLZoo, LeetCode, HackerRank Datasets for portfolio projects — Kaggle, data.gov, Google Dataset Search Resume feedback — Reddit (r/datascience, r/resumes), LinkedIn communities Interview prep...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Master Six Core Concepts to Decode Regression Results
SocialFeb 17, 2026

Master Six Core Concepts to Decode Regression Results

Most analysts can run a regression. Very few can explain what the output actually means. That gap is a statistics fundamentals problem. Not a tools problem. Not a Python problem. Not a years-of-experience problem. If you can't explain what your numbers mean, you...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Decode Common SQL Errors and Their Real Fixes
SocialFeb 16, 2026

Decode Common SQL Errors and Their Real Fixes

Common SQL errors and what they REALLY mean "Column ambiguously defined" You joined tables with the same column name. Fix: Add table aliases (customers.id not just id) "Not a single-group group function" You mixed aggregated + non-aggregated columns. Fix: Add all non-aggregated columns to GROUP BY "Division...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Great Tables Turns DataFrames Into Presentation‑ready Tables
SocialFeb 13, 2026

Great Tables Turns DataFrames Into Presentation‑ready Tables

Turning a DataFrame into a presentation-ready table in Python. Recently I tried a library called Great Tables and it makes formatting tables very easy. - Works with Pandas & Polars - 19 formatting methods (currency, percentages, dates, scientific notation) - Export to HTML, LaTeX,...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Decompose Sales Data to Distinguish Real Growth
SocialFeb 10, 2026

Decompose Sales Data to Distinguish Real Growth

Your manager asks: "Sales jumped in December — is that a real growth or seasonality?" You pull up a line chart. Sure, it shows the spike. But it doesn't tell you why. And that's what they actually want to know. This is where time...

By Karina | Python | Excel | Stats | DataScience | DataAnalytics
Visualize Churn Timing with Python Cohort Heatmaps
SocialFeb 8, 2026

Visualize Churn Timing with Python Cohort Heatmaps

At my work every month we look at customer retention. A single churn rate doesn't tell the whole story. You need to show when customers are leaving. Here's how you can do it with a cohort analysis heatmap in python

By Karina | Python | Excel | Stats | DataScience | DataAnalytics