<!DOCTYPE html><html lang="en"><head><meta http-equiv="Content-Type" content="text/html charset=UTF-8"><meta charset="UTF-8"><meta name="viewport" content="width=device-width"><meta name="x-apple-disable-message-reformatting"><title>TLDR Data</title><meta name="color-scheme" content="light dark"><meta name="supported-color-schemes" content="light dark"><style type="text/css">
:root {
color-scheme: light dark; supported-color-schemes: light dark;
}
*,
*:after,
*:before {
-webkit-box-sizing: border-box; -moz-box-sizing: border-box; box-sizing: border-box;
}
* {
-ms-text-size-adjust: 100%; -webkit-text-size-adjust: 100%;
}
html,
body,
.document {
width: 100% !important; height: 100% !important; margin: 0; padding: 0;
}
body {
-webkit-font-smoothing: antialiased; -moz-osx-font-smoothing: grayscale; text-rendering: optimizeLegibility;
}
div[style*="margin: 16px 0"] {
margin: 0 !important;
}
table,
td {
mso-table-lspace: 0pt; mso-table-rspace: 0pt;
}
table {
border-spacing: 0; border-collapse: collapse; table-layout: fixed; margin: 0 auto;
}
img {
-ms-interpolation-mode: bicubic; max-width: 100%; border: 0;
}
*[x-apple-data-detectors] {
color: inherit !important; text-decoration: none !important;
}
.x-gmail-data-detectors,
.x-gmail-data-detectors *,
.aBn {
border-bottom: 0 !important; cursor: default !important;
}
.btn {
-webkit-transition: all 200ms ease; transition: all 200ms ease;
}
.btn:hover {
background-color: #f67575; border-color: #f67575;
}
* {
font-family: Arial, Helvetica, sans-serif; font-size: 18px;
}
@media screen and (max-width: 600px) {
.container {
width: 100%; margin: auto;
}
.stack {
display: block!important; width: 100%!important; max-width: 100%!important;
}
.btn {
display: block; width: 100%; text-align: center;
}
}
body,
p,
td,
tr,
.body,
table,
h1,
h2,
h3,
h4,
h5,
h6,
div,
span {
background-color: #FEFEFE !important; color: #010101 !important;
}
@media (prefers-color-scheme: dark) {
body,
p,
td,
tr,
.body,
table,
h1,
h2,
h3,
h4,
h5,
h6,
div,
span {
background-color: #27292D !important; color: #FEFEFE !important;
}
}
a {
color: inherit !important; text-decoration: underline !important;
}
</style><!--[if mso | ie]>
<style type="text/css">
a {
background-color: #FEFEFE !important; color: #010101 !important;
}
@media (prefers-color-scheme: dark) {
a {
background-color: #27292D !important; color: #FEFEFE !important;
}
}
</style>
<![endif]--></head><body class="">
<div style="display: none; max-height: 0px; overflow: hidden;">Cloudflareβs shift to per-tenant retention in a massive ClickHouse βReady-Analyticsβ table exposed an unexpected scaling limit β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β β </div>
<div style="display: none; max-height: 0px; overflow: hidden;">
<br>
</div>
<table align="center" class="document"><tbody><tr><td valign="top">
<table align="center" border="0" cellpadding="0" cellspacing="0" class="container" width="600"><tbody><tr class="inner-body"><td>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr class="header"><td bgcolor="" class="container">
<table width="100%"><tbody><tr><td class="container">
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" style="margin-top: 0px;" width="100%"><tbody><tr><td style="padding: 0px;">
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div style="text-align: center;">
<span style="margin-right: 0px;"><a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Ftldr.tech%2Fdata%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/LkECJU-CU3qAk2LRsf2qQDDUvX7EQmQBDlmzSor-nkI=452" rel="noopener noreferrer" target="_blank"><span>Sign Up</span></a>
|<span style="margin-right: 2px; margin-left: 2px;"><a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fadvertise.tldr.tech%3Futm_source=tldrdata%26utm_medium=newsletter%26utm_campaign=advertisetopnav/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/nH__FXVCypMVRSuUjcMdkVeyotiMmkNjnYeUrqt-wf8=452" rel="noopener noreferrer" target="_blank"><span>Advertise</span></a></span>|<span style="margin-left: 2px;"><a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fa.tldrnewsletter.com%2Fweb-version%3Fep=1%26lc=1670a604-84b7-11f0-bcf5-55fc1d40139c%26p=7cfdb668-528a-11f1-900c-1b05c078f987%26pt=campaign%26t=1779098864%26s=d4a833030808b85799900cd4d485acb2b34fdf1ff4712caffb407dd29a9a5a91/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/mNpn08Rs7_GW2rNEKbnLDAUouX0p1eHqihkyErL6Ejw=452"><span>View Online</span></a></span>
<br>
</span></div>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="text-align: center;"><span data-darkreader-inline-color="" style="--darkreader-inline-color:#3db3ff; color: rgb(51, 175, 255) !important; font-size: 30px;">T</span><span style="font-size: 30px;"><span data-darkreader-inline-color="" style="color: rgb(232, 192, 96) !important; --darkreader-inline-color:#e8c163; font-size:30px;">L</span><span data-darkreader-inline-color="" style="color: rgb(101, 195, 173) !important; --darkreader-inline-color:#6ec7b2; font-size:30px;">D</span></span><span data-darkreader-inline-color="" style="--darkreader-inline-color:#dd6e6e; color: rgb(220, 107, 107) !important; font-size: 30px;">R</span>
<br>
</td></tr></tbody></table>
<br>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr id="together-with"><td align="center" height="20" style="vertical-align:middle !important;" valign="middle" width="100%"><strong style="vertical-align:middle !important; height: 100%;">Together With </strong>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.fivetran.com%2Fresources%2Freports%2Fthe-2026-agentic-ai-readiness-index/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/j1I456VqyvL90Jx4dhf2du8v5FCXUx1tk4w3milReyo=452"><img src="https://images.tldr.tech/fivetran.png" valign="middle" style="vertical-align: middle !important; height: 100%;" alt="Fivetran"></a></td></tr></tbody></table>
<table style="table-layout: fixed; width:100%;" width="100%"><tbody><tr><td style="padding:0;border-collapse:collapse;border-spacing:0;margin:0;">
<div style="text-align: center;">
<h1><strong>TLDR Data <span id="date">2026-05-18</span></strong></h1>
</div>
</td></tr></tbody></table>
<table style="table-layout: fixed; width:100%;" width="100%"><tbody><tr id="sponsy-copy"><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.fivetran.com%2Fresources%2Freports%2Fthe-2026-agentic-ai-readiness-index/2/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/A7p4kcfL9l--ghIPTVYnFVzidlWLoTwMgdYTmYIgH4g=452">
<span>
<strong>5 of 6 companies lack the data foundation for agentic AI. They're spending $$$ anyway (Sponsor)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
AI agents are stuck in pilot, and data is to blame. Yet most orgs are investing 7-8 figures in agentic projects anyway. <p></p><p><a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.fivetran.com%2Fresources%2Freports%2Fthe-2026-agentic-ai-readiness-index/3/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/Bp3Luil62shnK_Y28sa1I2F6HDI1oSj3Em7iaqwBlQo=452" rel="noopener noreferrer nofollow" target="_blank"><span>Fivetran's agentic AI readiness index</span></a> shows why most companies aren't realizing the full value of AI. Read it to learn why:</p>
<ul>
<li>Only 15% of teams are prepared for agentic AI at scale</li>
<li>Governance and compliance issues are stalling AI projects</li>
<li><a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.fivetran.com%2Fblog%2Fwhat-is-open-data-infrastructure/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/hn1G7ci9P8pZJ80sRAsqZ8xqpxZSpPl9EQ0Wu6iaMNY=452" rel="noopener noreferrer nofollow" target="_blank"><span>Open Data Infrastructure</span></a> is emerging as the new agentic standard</li>
</ul>
<p>If you're trying to deliver autonomous AI systems, start with the foundation. <a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Ffivetran.com%2Fsignup%3Futm_medium=paid_listing%26utm_source=tldr%26utm_campaign=2026-May-6-TLDR-AI-sponsorship%26utm_content=newsletter%26utm_term=default/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/vyl212wQnWQvMMwcuOKhY9uSRgRi68sKJ-eOKICX5bA=452" rel="noopener noreferrer nofollow" target="_blank"><span>Try Fivetran free</span></a>
</p>
</span></span></div>
</td></tr></tbody></table>
</td></tr></tbody></table>
</td></tr></tbody></table>
</td></tr>
<tr bgcolor=""><td class="container">
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td style="padding: 0px;">
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;"><span style="font-size: 36px;">π±</span></div></div>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;">
<h1><strong>Deep Dives</strong></h1>
</div>
</div>
</td></tr></tbody></table>
<table style="table-layout: fixed; width: 100%;" width="100%"><tbody><tr><td style="padding:0;border-collapse:collapse;border-spacing:0;margin:0;" valign="top">
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fblog.cloudflare.com%2Fclickhouse-query-plan-contention%2F%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/HS-yELIbPoT4c0Yc4zbz2zzV1JmO1jlSVR9rbTnLJa4=452">
<span>
<strong>Our billing pipeline was suddenly slow. The culprit was a hidden bottleneck in ClickHouse (9 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
Cloudflare's shift to per-tenant retention in a massive ClickHouse βReady-Analyticsβ table exposed an unexpected scaling limit: query planning, not I/O or scan volume, became the bottleneck as parts per replica grew. Tracing showed 45% of leaf query CPU time in part filtering. Switching to a shared lock and then a shared-read cache removed most of the contention and cut query latency sharply.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Flinks.tldrnewsletter.com%2FzcoSPA/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/2UPdF6CBA_To3I8jV5YnIj3_L-m-Y10YlWvS1MnPzaw=452">
<span>
<strong>Viaduct 1.0 and the Future of Airbnb's Data Mesh (5 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
Viaduct 1.0 is Airbnb's open-source data-oriented service mesh built on GraphQL. It provides a single unified schema for accessing any data source across the company while enabling decentralized development through multi-tenant modules as teams contribute their own schema and resolvers without operating separate GraphQL services, striking a balance between a monolithic GraphQL server and full federation.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.singlestore.com%2Fblog%2Faws-outage-may-2026-cross-region-disaster-recovery%2F%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/_xXGs8KHisu9gJyZubIRjyWg6a7pUDnqX-SbACxkJkM=452">
<span>
<strong>AWS Outage May 2026: Lessons for Database Disaster Recovery (10 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
A major AWS US-EAST-1 outage in May was triggered by a data center overheating event in a single availability zone, causing multi-hour disruptions for high-profile services like Coinbase. The incident highlighted the critical difference between Multi-AZ high availability (which failed to protect latency-sensitive workloads) and true cross-region disaster recovery.
</span>
</span>
</div>
</td></tr></tbody></table>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;"><span style="font-size: 36px;">π</span></div>
</div>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;">
<h1><strong>Opinions & Advice</strong></h1>
</div>
</div>
</td></tr></tbody></table>
<table style="table-layout: fixed; width: 100%;" width="100%"><tbody><tr><td style="padding:0;border-collapse:collapse;border-spacing:0;margin:0;" valign="top">
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fdlthub.com%2Fblog%2Fllm-ontology-schema-evolution%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/RUI8hiuGh4INyKcGZE0Ad_gCINy5k_vKmXr8lWdWHIo=452">
<span>
<strong>Exploring schema evolution with ontology-driven propagation (4 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
A plain-English ontology can act as a runtime access policy that survives schema evolution, letting an LLM classify columns column-by-column using row counts, cardinality ratios, and sampled values. The approach keeps policy separate from pipeline code, but it does not cover numeric sensitive inferences or cross-column re-identification.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Flukewhittaker.substack.com%2Fp%2Fthe-modern-data-stack-is-overcomplicated-5ff%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/YF8yYpKCoDjsmixySW2Xuxk_pHviTulM-fYRjpgcfug=452">
<span>
<strong>The Modern Data Stack is Overcomplicated: Data Ingestion (17 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
Data ingestion looks simple, but the wrong choice can create hidden costs through broken connectors, schema drift, over-engineering, and wasted engineering time. The best approach is usually a hybrid: managed connectors for standard SaaS, streaming only when low latency truly matters, and custom pipelines for niche or legacy sources.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fboringsql.com%2Fposts%2Forder-by-jungle%2F%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/rmi7DpRASEPUyK-RvYbmder0Eq3Spt1atCH6CQsIpfE=452">
<span>
<strong>Welcome to ORDER BY Jungle (11 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
PostgreSQL resolves column names and expressions in ORDER BY clauses in inconsistent ways. For example, bare identifiers (e.g. ORDER BY a) first look for aliases in the SELECT list, while any expression (e.g. ORDER BY -a) resolves against the FROM clause, leading to confusing behaviors with aliases, quoting, GROUP BY, window functions, and UNION.
</span>
</span>
</div>
</td></tr></tbody></table>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;"><span style="font-size: 36px;">π»</span></div>
</div>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;">
<h1><strong>Launches & Tools</strong></h1>
</div>
</div>
</td></tr></tbody></table>
<table style="table-layout: fixed; width: 100%;" width="100%"><tbody><tr><td style="padding:0;border-collapse:collapse;border-spacing:0;margin:0;" valign="top">
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Flinks.tldrnewsletter.com%2FtRetW8%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/7nLG4fekXtT0F07uJQ1I9jXK0gK7hWwKGl_N56mKvNY=452">
<span>
<strong>A Data Layer That Won't Make You Wait (Sponsor)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
You can spend your whole morning waiting for that data to land. Or, you can use a data layer that won't make you wait. That's Lakebase. <a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Flinks.tldrnewsletter.com%2FtRetW8/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/NoG4UKIKU4RMNimS3OIN3WAW9gZRYQ423l3VbQFrssk=452" rel="noopener noreferrer nofollow" target="_blank"><span>Learn how Lakebase's fully-managed Postgres database can help you spin up ideas fast</span></a>, and run agents and apps on one platform.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fgithub.com%2Fborchero%2Fducklake-sdk%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/PuQIeII_p-8Zz-PxbEnKR9RDZ-9LGy3okU03v3N-li8=452">
<span>
<strong>ducklake-sdk (GitHub Repo)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
ducklake-sdk is an alpha Rust/Python SDK for reading and writing DuckLake tables without running DuckDB. It implements the DuckLake spec in a Rust core, with Python integrations for Polars, Arrow, and DuckDB, targeting SQL-catalog metadata plus Parquet storage. Useful for embedding DuckLake access into apps, pipelines, or engines directly.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fthenewstack.io%2Fminio-memkv-recompute-tax%2F%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/OcU1ON8OocpMTIjiSIEzwcHjVK84sAInyWcHsj9zjXU=452">
<span>
<strong>MinIO's MemKV promises 95% better GPU utilization by ending AI recompute tax (5 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
MemKV is a petabyte-scale context memory store for AI inference designed to preserve and share session state across GPU clusters. By moving context directly from NVMe into the AI data path over 800 GbE RDMA, it targets the βrecompute taxβ and claims 95%+ better GPU utilization and about 50% lower cost per token on benchmark workloads.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.confessionsofadataguy.com%2Fapache-arrow-as-data-interchange%2F%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/lfb3TRrgQMCVgD4faySDPFOczkBNhLxfwV3vIZK1g3o=452">
<span>
<strong>Apache Arrow as Data Interchange (5 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
Apache Arrow is rapidly becoming the universal in-memory columnar format for data interchange across the modern data stack. Instead of repeatedly serializing, deserializing, and copying data between tools (Pandas β Spark β databases, etc.), Arrow enables zero-copy handoff, where systems share the exact same memory layout, dramatically reducing CPU overhead.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Farpitbhayani.me%2Fblogs%2Frag-production%2F%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/1RDs6f7E46KDFTrEyoIatXrM3F4LGxs81FYIzULHrIM=452">
<span>
<strong>What Matters in Production RAG (8 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
Key requirements for production RAG include smart chunking strategies (recursive, semantic, and structure-aware), robust indexing pipelines with document registries, content hashing for efficient updates, alias-based zero-downtime index switching, careful embedding model management, and strong observability with detailed tracing, chunk attribution, and retrieval quality metrics.
</span>
</span>
</div>
</td></tr></tbody></table>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;"><span style="font-size: 36px;">π</span></div></div>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;"><strong><h1>Miscellaneous</h1></strong></div>
</div>
</td></tr></tbody></table>
<table bgcolor="" style="table-layout: fixed; width: 100%;" width="100%"><tbody><tr><td style="padding:0;border-collapse:collapse;border-spacing:0;margin:0;" valign="top">
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.cio.com%2Farticle%2F4170277%2Fyour-ai-agent-deletes-critical-data-who-is-responsible.html%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/sBJqXOGGo6HIMLDk6BChljDiFXh_LxxNJQrZ9mXUf8Y=452">
<span>
<strong>Your AI agent deletes critical data: Who is responsible? (5 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
AI agents that can write to production systems create a new accountability and recovery problem: a Replit agent once deleted a live database, and the real issue was the absence of clear ownership, guardrails, and rollback. With 86% of IT/security leaders expecting agents to outrun current controls, governance is a shared responsibility across architecture, security, legal, and business. Practical controls like policy boundaries, observability, human-in-the-loop triage, and explicit recovery mechanisms are essential to prevent autonomous tools from becoming enterprise-wide risk.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fredis.io%2Fblog%2Fcontext-pruning-llm-tokens%2F%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/qlmrXVEGUMIpw0sQgJ2koYWDKDidFYhyq-wFTf7xGpA=452">
<span>
<strong>Context pruning: cut LLM tokens without losing quality (9 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
Context Pruning is the practice of selectively removing low-value tokens, sentences, or passages from an LLM's input to reduce cost, latency, and often improve output quality. It includes techniques such as token-level, sentence/chunk-level, attention-based, and dynamic layer-progressive pruning, and works best when paired with semantic caching.
</span>
</span>
</div>
</td></tr></tbody></table>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;"><span style="font-size: 36px;">β‘</span></div></div>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding-top: 0px; padding-bottom: 0px;">
<div class="text-block">
<div style="text-align: center;">
<h1><strong>Quick Links</strong></h1>
</div>
</div>
</td></tr></tbody></table>
<table bgcolor="" style="table-layout: fixed; width: 100%;" width="100%"><tbody><tr><td style="padding:0;border-collapse:collapse;border-spacing:0;margin:0;" valign="top">
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fseattledataguy.substack.com%2Fp%2Fwhat-leading-a-data-team-actually%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/yHYvNwiWFvVBU_gOxUCi_pcgeuY65LvtltUAKUd-4P4=452">
<span>
<strong>What Leading a Data Team Actually Looks Like Right Now (7 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
Data leaders still face the same core challenges despite the AI hype: proving business value, managing stakeholder politics, preventing dashboard/model/tool sprawl, and saying no to low-value requests.
</span>
</span>
</div>
</td></tr></tbody></table>
<table align="center" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block">
<span>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fdavistreybig.substack.com%2Fp%2Fhow-agents-use-systems-differently%3Futm_source=tldrdata/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/oubnAKTBRgoc9y0AQwn6oCEZKemkJRqvLmps6Ygoz7E=452">
<span>
<strong>How Agents Use Systems Differently (15 minute read)</strong>
</span>
</a>
<br>
<br>
<span style="font-family: "Helvetica Neue", Helvetica, Arial, Verdana, sans-serif;">
Agents use software differently than humans, so infrastructure needs to be redesigned around snapshots, branching, elastic scale, high concurrency, isolation, and cheap experimentation.
</span>
</span>
</div>
</td></tr></tbody></table>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td align="left" style="word-break: break-word; vertical-align: top; padding: 5px 10px;">
<p style="padding: 0; margin: 0; font-size: 22px; color: #000000; line-height: 1.6; font-weight: bold;">
Want to advertise in TLDR? π°
</p>
<div class="text-block" style="margin-top: 10px;">
If your company is interested in reaching an audience of data engineering professionals and decision makers, you may want to <a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fadvertise.tldr.tech%2F%3Futm_source=tldrdata%26utm_medium=newsletter%26utm_campaign=advertisecta/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/g-X6Ck5TKmAuKAsSuXhTmioBwjQL7l5qOkYiwg5h7zg=452"><strong><span>advertise with us</span></strong></a>.
</div>
<br>
<!-- New "Want to work at TLDR?" section -->
<p style="padding: 0; margin: 0; font-size: 22px; color: #000000; line-height: 1.6; font-weight: bold;">
Want to work at TLDR? πΌ
</p>
<div class="text-block" style="margin-top: 10px;">
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fjobs.ashbyhq.com%2Ftldr.tech/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/1vTrv4Gtq6PxNliQVmHtu8VNQPYuJSbe6FhAMM8vU-s=452" rel="noopener noreferrer" style="color: #0000EE; text-decoration: underline;" target="_blank"><strong>Apply here</strong></a>,
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fjobs.ashbyhq.com%2Ftldr.tech%2Fc227b917-a6a4-40ce-8950-d3e165357871/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/jUMJG98WI6sq57rXRnX8RzGhkHRJn5gZVmdlgJKuhSs=452" rel="noopener noreferrer" style="color: #0000EE; text-decoration: underline;" target="_blank"><strong>create your own role</strong></a> or send a friend's resume to <a href="mailto:jobs@tldr.tech" style="color: #0000EE; text-decoration: underline;">jobs@tldr.tech</a> and get $1k if we hire them! TLDR is one of <a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.linkedin.com%2Ffeed%2Fupdate%2Furn:li:activity:7401699691039830016%2F/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/gDN9iZEdJKAgqz5o8-OAAe5XWJxO5-sipqDDDvLjYTw=452" rel="noopener noreferrer" style="color: #0000EE; text-decoration: underline;" target="_blank"><strong>Inc.'s Best Bootstrapped businesses</strong></a> of 2025.
</div>
<br>
<div class="text-block">
If you have any comments or feedback, just respond to this email!
<br>
<br> Thanks for reading,
<br>
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.linkedin.com%2Fin%2Fjoelvanveluwen%2F/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/2fP5gXFOdEdanQwd8ShdmufVpVFm4UEEySnanA1kg2g=452"><span>Joel Van Veluwen</span></a>, <a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.linkedin.com%2Fin%2Fjennytzurueyching%2F/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/d_nB8Uc5wbJDjPKrDP_ool-GEaFABQpVkUWI1v2vvH4=452"><span>Tzu-Ruey Ching</span></a> & <a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fwww.linkedin.com%2Fin%2Fremi-turpaud%2F/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/1XcU1Z3ifpI56EJzQcMPCHeHxcDjDbH3at82CguesV8=452"><span>Remi Turpaud</span></a>
<br>
<br>
</div>
<br>
</td></tr></tbody></table>
<table align="center" bgcolor="" border="0" cellpadding="0" cellspacing="0" width="100%"><tbody><tr><td class="container" style="padding: 15px 15px;">
<div class="text-block" id="testing-id">
<a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Ftldr.tech%2Fdata%2Fmanage%3Femail=silk.theater.56%2540fwdnl.com/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/OJXpdDZveVxD12wTyqaFiIWbf51lkpb33_FQor0a0UM=452">Manage your subscriptions</a> to our other newsletters on tech, startups, and programming. Or if TLDR Data isn't for you, please <a href="https://tracking.tldrnewsletter.com/CL0/https:%2F%2Fa.tldrnewsletter.com%2Funsubscribe%3Fep=1%26l=037ede50-92cc-11ee-b0f2-b761aa2217ad%26lc=1670a604-84b7-11f0-bcf5-55fc1d40139c%26p=7cfdb668-528a-11f1-900c-1b05c078f987%26pt=campaign%26pv=4%26spa=1779098426%26t=1779098864%26s=9a3466be4b179c7e0d4e01de3060b6beea7244aa34d71009760caef39e47c725/1/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/Qqv7UdxfPF9Cf9ONyo7MqLqio45dX9eKU5Ex9BsPj5Q=452">unsubscribe</a>.
<br>
</div>
</td></tr></tbody></table>
</td></tr></tbody></table>
</td></tr></tbody></table>
</td></tr></tbody></table>
</td></tr></tbody></table>
<img alt="" src="http://tracking.tldrnewsletter.com/CI0/0100019e3a8e4cf9-04cf8340-b550-42cb-b701-5cb21cc246c6-000000/rLNFhoDEUCwEUJa60rMYtJ1whQyroI_gXI4jg9oPEew=452" style="display: none; width: 1px; height: 1px;">
</body></html>