Sitemap - 2025 - Data Tinkerer
What is Data Governance? A Practical Guide to Building Trustworthy Data in the Age of AI
What the Data Crowd Was Reading in November 2025
How Snap Rebuilt Its ML Platform to Handle 10,000+ Daily Spark Jobs
From Data Analyst to Senior DS Manager at Skyscanner
What the Data Crowd Was Reading in October 2025
From Dental Cleaning to Data Cleaning: How I Pivoted to Healthcare Analytics
From Marketing to Data Engineering: How I Made the Switch
How Dropbox Made AI Evaluation Work at Scale
What the Data Crowd Was Reading in September 2025
The Data Analyst’s Dilemma: Accuracy vs Speed
How Shopify Uses Change Data Capture to Serve Millions of Merchants
How Netflix Used Deep Learning to Slash Video Quality Control Time by 90%
What the Data Crowd Was Reading in August 2025
How Grab Shrunk Real-Time Queries from 5 Minutes to 1 with FlinkSQL and Kafka
How Uber Built an AI Agent That Answers Financial Questions in Slack
What the Data Crowd Was Reading in July 2025
How to Show Impact as a Data Analyst
How Expedia Monitors 1000+ A/B Tests in Real Time with Flink and Kafka
How Target Used GenAI to Lift Sales by 9% Across 100K+ Products
The Alignment Trap: When Stakeholders Want Data but Not the Truth
How Bolt Reconciles €2B in Revenue Using Airflow, Spark and dbt
How DoorDash Used LLMs to Trigger 30% More Relevant Results
When the Ratio Lies: The Denominator Problem Explained
Meta Buys Half of Scale AI, Apple Polishes Glass and Veo 3 Crashes the NBA Finals
How Flipkart Scaled Delivery Date Calculation 10x While Slashing Latency by 90%
ChatGPT Codex, Lawsuit Drama and A 52 Dollar Festival
How Uber Cut Invoice Handling Time by 70% with GenAI (Without Ditching Humans)
Veo's Viral Videos, Upgraded Agents and Dashboards in 10 Minutes
Freelancing in 2025 as a Data Analyst (Guest Post)
Google Launches AI Everything, Claude Claps Back and OpenAI Goes Full Hardware
How Notion Brought Order to Its Data Chaos (And Why Their First Catalog Failed)
How Reddit Scans 1M+ Images a Day to Flag NSFW Content Using Deep Learning
Gemini Builds, Claude Browses and ChatGPT Reads Your GitHub
When the Metric Becomes the Monster
Bots That Overshare, Suck Up Less and Sync Mouths Like Pros
How Canva Rebuilt Its Data Pipelines for Billions of Events per Month
One AI to Paint, One to File, One to ... Burn Your House Down (Maybe)
How Walmart Automated 400+ Forecasts and Cut Runtime by Half
The Dashboard Fallacy: When Seeing Everything Means Understanding Nothing
AI Agents "Chat", Copilot Remembers Your Personal Stuff and Another Vibe-Coding App
How Airtable Made Archive Validation Work at Petabyte Scale
Zuck is Watching You, Agents Get Smarter and AI Videos Get Less Weird
ChatGPT Gets Visual, Gemini Gets Smarter, and Perplexity Gets a Shake-Up
P-Hacking: How to Make Anything “True” with Enough Data Tricks
How HubSpot Optimized Logging to Save Millions
🚨 Free Resource: 120+ Data Science/Data Engineering Articles from 70+ Major Companies
Google's Big Moves, Fake "Real" Influencers and Self-Driving Goes Open Source
eBay’s e-Llama: AI Trained on 1 Trillion Tokens, Boosting E-Commerce Accuracy by 25%
China Strikes Again, Sesame's Human-Like Voice and OpenAI's $20K PhD Bot
Numbers Don’t Tell the Whole Story: The McNamara Fallacy
Claude & ChatGPT 4.5 Drop, A Robot Goes Full Kung Fu, and AI Chatbots Start Speaking in Beeps
The Future is Here – And It’s a Little Unsettling
The Base Rate Fallacy: When Ignoring the Big Picture Leads to Bad Decisions
Scaling Apache Flink: How Reddit Cut Memory Usage by 60%
From Pins to Personalization: Inside Pinterest's Retrieval System for 500 Million Users
When Everyone "Wins" But Nothing Changes: Will Rogers Phenomenon Explained
ML Training Too Slow? Yelp’s 1,400x Speed Boost Fixes That
Improving Search for 1B+ LinkedIn Users with GenAI
Are Your Stats Betraying You? Simpson’s Paradox Explained
Inside Meta's Data Flow Discovery
No Cookies, No Problem: Grammarly’s Ad Experiment
What I Wish I Knew as a Data Analyst
Scaling Real-Time Analytics: How Expedia Cut Costs by 40% While Supporting 450+ Concurrent Users
To Build or not to Build AI Agents
2% Vulnerability, 100% Risk: The Hidden Dangers of AI Feedback Loops
How Datadog Achieved 99% Timeout Reduction with 20x Scalability Boost
How Uber Scaled Incentive Optimisation by 40x
52 AI Models Tested: Which Align Best with Human Ethics?
1.2 GB/sec Throughput: How Atlassian Scales ETL Pipelines
The Art of Substitution: Instacart’s ML Model for Better Shopping Choices
AI Cuts Freelance Writing Jobs by 30%
How Canva Saved $3.6 Million by Optimizing Amazon S3 Storage
Cracking the ETA Code: How Lyft Improves its Predictions
Ipsos Predictions 2025: A Global Snapshot of Hope, Fear, and Transformation
