Sitemap - 2025 - Data Tinkerer

What is Data Governance? A Practical Guide to Building Trustworthy Data in the Age of AI

What the Data Crowd Was Reading in November 2025

How Snap Rebuilt Its ML Platform to Handle 10,000+ Daily Spark Jobs

From Data Analyst to Senior DS Manager at Skyscanner

What the Data Crowd Was Reading in October 2025

From Dental Cleaning to Data Cleaning: How I Pivoted to Healthcare Analytics

From Marketing to Data Engineering: How I Made the Switch

How Dropbox Made AI Evaluation Work at Scale

What the Data Crowd Was Reading in September 2025

The Data Analyst’s Dilemma: Accuracy vs Speed

How Shopify Uses Change Data Capture to Serve Millions of Merchants

How Netflix Used Deep Learning to Slash Video Quality Control Time by 90%

What the Data Crowd Was Reading in August 2025

How Grab Shrunk Real-Time Queries from 5 Minutes to 1 with FlinkSQL and Kafka

How Uber Built an AI Agent That Answers Financial Questions in Slack

What the Data Crowd Was Reading in July 2025

How to Show Impact as a Data Analyst

How Expedia Monitors 1000+ A/B Tests in Real Time with Flink and Kafka

How Target Used GenAI to Lift Sales by 9% Across 100K+ Products

The Alignment Trap: When Stakeholders Want Data but Not the Truth

How Bolt Reconciles €2B in Revenue Using Airflow, Spark and dbt

How DoorDash Used LLMs to Trigger 30% More Relevant Results

When the Ratio Lies: The Denominator Problem Explained

Meta Buys Half of Scale AI, Apple Polishes Glass and Veo 3 Crashes the NBA Finals

How Flipkart Scaled Delivery Date Calculation 10x While Slashing Latency by 90%

ChatGPT Codex, Lawsuit Drama and A 52 Dollar Festival

How Uber Cut Invoice Handling Time by 70% with GenAI (Without Ditching Humans)

Veo's Viral Videos, Upgraded Agents and Dashboards in 10 Minutes

Freelancing in 2025 as a Data Analyst (Guest Post)

Google Launches AI Everything, Claude Claps Back and OpenAI Goes Full Hardware

How Notion Brought Order to Its Data Chaos (And Why Their First Catalog Failed)

Google’s AI Solves 56-Year-Old Math Problem, Codex Writes Code and Manus Makes Art That "Reads" Your Mind

How Reddit Scans 1M+ Images a Day to Flag NSFW Content Using Deep Learning

Gemini Builds, Claude Browses and ChatGPT Reads Your GitHub

When the Metric Becomes the Monster

Bots That Overshare, Suck Up Less and Sync Mouths Like Pros

How Canva Rebuilt Its Data Pipelines for Billions of Events per Month

One AI to Paint, One to File, One to ... Burn Your House Down (Maybe)

How Walmart Automated 400+ Forecasts and Cut Runtime by Half

The Dashboard Fallacy: When Seeing Everything Means Understanding Nothing

AI Agents "Chat", Copilot Remembers Your Personal Stuff and Another Vibe-Coding App

How Airtable Made Archive Validation Work at Petabyte Scale

Zuck is Watching You, Agents Get Smarter and AI Videos Get Less Weird

Inside the Mind of an LLM

ChatGPT Gets Visual, Gemini Gets Smarter, and Perplexity Gets a Shake-Up

P-Hacking: How to Make Anything “True” with Enough Data Tricks

More Free Resources, Nvidia's Cute Little Robot, Claude "Thinking Harder" and ChatGPT Prompts Playground

How HubSpot Optimized Logging to Save Millions

🚨Free Resource: 120+ Data Science/Data Engineering Articles from 70+ Major Companies

Google's Big Moves, Fake "Real" Influencers and Self-Driving Goes Open Source

eBay’s e-Llama: AI Trained on 1 Trillion Tokens, Boosting E-Commerce Accuracy by 25%

China Strikes Again, Sesame's Human-Like Voice and OpenAI's $20K PhD Bot

Numbers Don’t Tell the Whole Story: The McNamara Fallacy

Claude & ChatGPT 4.5 Drop, A Robot Goes Full Kung Fu, and AI Chatbots Start Speaking in Beeps

The Future is Here – And It’s a Little Unsettling

The Base Rate Fallacy: When Ignoring the Big Picture Leads to Bad Decisions

Scaling Apache Flink: How Reddit Cut Memory Usage by 60%

From Pins to Personalization: Inside Pinterest's Retrieval System for 500 Million Users

When Everyone "Wins" But Nothing Changes: Will Rogers Phenomenon Explained

ML Training Too Slow? Yelp’s 1,400x Speed Boost Fixes That

Improving Search for 1B+ LinkedIn Users with GenAI

Are Your Stats Betraying You? Simpson’s Paradox Explained

Inside Meta's Data Flow Discovery

No Cookies, No Problem: Grammarly’s Ad Experiment

What I Wish I Knew as a Data Analyst

Scaling Real-Time Analytics: How Expedia Cut Costs by 40% While Supporting 450+ Concurrent Users

To Build or not to Build AI Agents

2% Vulnerability, 100% Risk: The Hidden Dangers of AI Feedback Loops

How Datadog Achieved 99% Timeout Reduction with 20x Scalability Boost

How Uber Scaled Incentive Optimisation by 40x

52 AI Models Tested: Which Align Best with Human Ethics?

1.2 GB/sec Throughput: How Atlassian Scales ETL Pipelines

The Art of Substitution: Instacart’s ML Model for Better Shopping Choices

AI Cuts Freelance Writing Jobs by 30%

How Canva Saved $3.6 Million by Optimizing Amazon S3 Storage

Cracking the ETA Code: How Lyft Improves its Predictions

Ipsos Predictions 2025: A Global Snapshot of Hope, Fear, and Transformation