Data Tinkerer
🚨Free Resource: 120+ Data Science/Data Engineering Articles from 70+ Major Companies
From companies like Netflix, Apple, Google, Microsoft, Meta and others
Mar 17, 2025 • Data Tinkerer
What the Data Crowd Was Reading in March 2026
Tools, techniques and deep dives worth reading that I came across in March 2026.
Apr 2 • Data Tinkerer
How Notion Scaled AI Q&A to Millions of Workspaces
Kafka, Spark and Ray powering low-latency, high-throughput search pipelines
Mar 26 • Data Tinkerer
What the Data Crowd Was Reading in February 2026
Tools, techniques and deep dives worth reading that I came across in February 2026.
Mar 12 • Data Tinkerer
How Shopify Scales Taxonomy Evolution Across 10,000+ Categories With Multi-Agent AI
From reactive manual curation to continuous taxonomy evolution grounded in merchant reality.
Feb 26 • Data Tinkerer
Most Popular
What the Data Crowd Was Reading in July 2025
Aug 7, 2025 • Data Tinkerer
What the Data Crowd Was Reading in August 2025
Sep 4, 2025 • Data Tinkerer
From Data Analyst to Senior DS Manager at Skyscanner
Nov 13, 2025 • Data Tinkerer and Jose Parreño Garcia
What the Data Crowd Was Reading in October 2025
Nov 6, 2025 • Data Tinkerer
Latest
How LinkedIn Built a Pipeline That Scales to 230M Records/sec Without Breaking SLAs
From partition strategy to adaptive throttling, the playbook behind Venice’s ingestion evolution.
Feb 19 • Data Tinkerer
What the Data Crowd Was Reading in January 2026
Tools, techniques and deep dives worth reading that I came across in January 2026.
Feb 5 • Data Tinkerer
How to Build a Recommendation System at Scale: Insights from Instacart
A Senior ML Engineer on production constraints, rules vs ML and the workflow behind large-scale recommender systems
Jan 29 • Data Tinkerer and Ahsaas Bajaj
How DoorDash Saves Tens of Millions of Dollars Per Year by Detecting Fraud 30× Faster
A daily anomaly detection system that cut discovery time from 100+ days to under three.
Jan 23 • Data Tinkerer
How Grab Detects Data Issues across 100+ Kafka Topics Before They Spread
Real-time stream validation surfaces poison records early and notifies owners with context
Jan 15 • Data Tinkerer
What the Data Crowd Was Reading in December 2025
Tools, techniques and deep dives worth reading that I came across in December 2025.
Jan 8 • Data Tinkerer
How Uber Cut Data Lake Freshness From Hours to Minutes With Flink
Why Uber moved ingestion from Spark batch to Flink streaming and what it took to run thousands of jobs reliably at petabyte scale.
Jan 2 • Data Tinkerer
What is Data Governance? A Practical Guide to Building Trustworthy Data in the Age of AI
From unclear ownership to missing standards, Charlotte Ledoux breaks down the simple governance practices that help organisations trust their data and…
Dec 11, 2025 • Data Tinkerer and Charlotte Ledoux
What the Data Crowd Was Reading in November 2025
Tools, techniques and deep dives worth reading that I came across in November 2025.
Dec 3, 2025 • Data Tinkerer
Data Tinkerer
The latest updates on data science, data engineering and data analysis - for free!