Data Tinkerer
Subscribe
Sign in
Home
Data Roundup
Data Science
Data Engineering
Data Analysis
Resources
About
Latest
Top
Discussions
What the Data Crowd Was Reading in April 2026
Tools, techniques and deep dives worth reading that I came across in April 2026.
May 7
•
Data Tinkerer
17
1
4
April 2026
The Bitter Lesson (of Decision Making)
Why simple rules often beat human judgment over time
Apr 30
•
Data Tinkerer
17
2
How Airtable Saved Millions by Cutting Archive Storage Costs by 100x
Airtable moved petabytes of cold log data out of MySQL and built a cheaper archive layer on S3 and Parquet without sacrificing fast queries.
Apr 23
•
Data Tinkerer
13
4
How Pinterest Used Multimodal AI to Help Millions of Shoppers
Inside the multimodal AI pipeline that converted images, metadata and search behavior into scalable shopping discovery.
Apr 16
•
Data Tinkerer
9
2
What the Data Crowd Was Reading in March 2026
Tools, techniques and deep dives worth reading that I came across in March 2026.
Apr 2
•
Data Tinkerer
13
2
3
March 2026
How Notion Scaled AI Q&A to Millions of Workspaces
Kafka, Spark and Ray powering low-latency, high-throughput search pipelines
Mar 26
•
Data Tinkerer
11
1
What the Data Crowd Was Reading in February 2026
Tools, techniques and deep dives worth reading that I came across in February 2026.
Mar 12
•
Data Tinkerer
15
8
4
February 2026
How Shopify Scales Taxonomy Evolution Across 10,000+ Categories With Multi-Agent AI
From reactive manual curation to continuous taxonomy evolution grounded in merchant reality.
Feb 26
•
Data Tinkerer
14
2
4
How LinkedIn Built a Pipeline That Scales to 230M Records/sec Without Breaking SLAs
From partition strategy to adaptive throttling, the playbook behind Venice’s ingestion evolution.
Feb 19
•
Data Tinkerer
10
1
What the Data Crowd Was Reading in January 2026
Tools, techniques and deep dives worth reading that I came across in January 2026.
Feb 5
•
Data Tinkerer
16
2
6
January 2026
How to Build a Recommendation System at Scale: Insights from Instacart
A Senior ML Engineer on production constraints, rules vs ML and the workflow behind large-scale recommender systems
Jan 29
•
Data Tinkerer
and
Ahsaas Bajaj
12
1
3
How DoorDash Saves Tens of Millions of Dollars Per Year by Detecting Fraud 30× Faster
A daily anomaly detection system that cut discovery time from 100+ days to under three.
Jan 23
•
Data Tinkerer
15
4
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts