TLDR AI

TLDR

Together With

TLDR AI 2026-05-08

⚡See how Oura uses AI to listen to their customers (Sponsor)

What does it actually look like to build a product around your customers, not just say you do?

Oura has done it. And on May 27th, they're pulling back the curtain.

Join the Oura team to talk about how they unify member feedback across product, engineering, and leadership, and how real member voices shape every decision they make.

Speakers:

John Moses - VP, Member Experience
Dzevad Kudic - Sr. Support Engineering Manager
Dayu Aryanthi - CX Strategy
Jens Nordgaard - Analytics Manager
Joining Elizabeth O'Bryen - Head of CS, Unwrap

If customer feedback is a part of your role then this one's for you! The talk will be followed by a Q&A with the audience.

📅 Date: Wednesday, May 27th

⏰ Tme: 11am PT / 2pm ET

Save your spot here. If you can't make it on the 27th, no worries! Go ahead and register and you'll automatically get the recording after the session.

🚀

Headlines & Launches

Codex now works directly in Chrome on macOS and Windows (2 minute read)

OpenAI Codex now works directly in Chrome on macOS and Windows. It works in parallel across tabs in the background without taking over the browser. The implementation can quickly move through repetitive browser work like navigating structured pages and complex data flows. It writes code to navigate and complete tasks under the hood. Video demos showing what users can do with the feature are available in the article.

OpenAI Released Realtime Audio Models (8 minute read)

OpenAI released a new set of real-time audio models, including GPT‑Realtime‑2 for conversational reasoning, GPT‑Realtime‑Translate for live multilingual translation, and GPT‑Realtime‑Whisper for streaming transcription.

Meta prepares Hatch AI Agent with waitlist and social skills (2 minute read)

Meta is developing Hatch, an AI agent positioned as a consumer-grade competitor to OpenAI's OpenClaw, with features for image and video generation, shopping, and learning integrated deeply into social platforms like Instagram and Facebook. Internal tests are expected by June, using simulated environments, with a potential wide release gated by a waitlist. Meta also plans a shopping tool for Instagram by Q4, aiming to keep users engaged within their existing social spaces.

🧠

Deep Dives & Analysis

Improving token efficiency in GitHub Agentic Workflows (12 minute read)

GitHub Agent Workflows significantly improve repository hygiene and quality, but costs are becoming a growing concern for developers. AI jobs like agentic workflows are automatically scheduled and triggered, so costs can accumulate out of view. GitHub started systematically optimizing the token usage of many workflows last month. This post describes what the team instrumented, the optimizations it applied, and its preliminary results.

The Six-Hour Codex Run That Survived a Five-Hour Pause (10 minute read)

/goal is a headline feature that shipped in Codex on April 30 that introduced persisted goals, goal states that survive terminal restarts, laptop sleeps, and multi-hour passes without re-prompting. The feature injects a developer message on resume rather than waiting for the user to time anything. It allows users to turn off their devices and pick up exactly where they left off without re-prompting anything.

Good QC for RL Data (18 minute read)

There needs to be a higher bar for what good quality looks like. Any vendor selling into a frontier lab is being judged implicitly during the purchase decision, and most are failing multiple quality control gates at once. Standardizing QC for evaluating how well data tests something on the performance-cost-latency Pareto curve is of the utmost importance. The vendors who don't internalize a higher QC bar will start running into problems this year.

🧑‍💻

Engineering & Research

This is the data architecture your AI's been missing [Microsoft + CData Webinar] (Sponsor)

73% of enterprises say poor data connectivity is preventing them from scaling AI. In this live webinar, Microsoft and CData walk you through the infrastructure that clears the bottleneck: universal data connectivity. Join on May 13th for an architecture blueprint for your AI product. Save your spot

AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields (9 minute read)

AlphaEvolve is a Gemini-powered coding agent that can design advanced algorithms. It can help make new discoveries on open problems across mathematics and computer science. The agent has been upgraded with the ability to help explain the physics of the natural world to help accelerate progress for scientists and businesses across a variety of fields. This post shares the areas AlphaEvolve has had the most significant impacts on to date.

Building Fast & Accurate Agents with Prime-RL Post Training (22 minute read)

Ramp Sheets built Fast Ask to handle its spreadsheet agent's information retrieval loop. It can navigate a workbook, read the relevant ranges, and return a compact answer for the main agent to use. This post presents Fast Ask as a case study for reinforcement learning. It covers where it is worth training a specialized agent, how to design the environment, and how to evaluate whether post-training worked.

ds4.c (GitHub Repo)

ds4.c is an intentionally narrow, small native inference engine for DeepSeek V4 Flash. The project aimed to create one local model that felt finished from end to end. It is Metal-only, but the development team may implement CUDA support in the future. The project is still in alpha, so there are still bugs in the implementation.

Meta's Optimized RecSys Inference (58 minute read)

Meta detailed In-Kernel Broadcast Optimization (IKBO), a co-design approach that removes redundant embedding replication in recommendation inference workloads.

Natural Language Autoencoders (9 minute read)

Anthropic introduces Natural Language Autoencoders (NLAs) to translate AI model activations into human-readable text, aiding in understanding model thoughts. NLAs have been used to detect safety concerns and hidden motivations in AI behavior, improving model alignment auditing. Despite limitations like hallucinations and high costs, NLAs advance AI auditing techniques. Anthropic has released training resources for further development.

🎁

Miscellaneous

Notes from inside China's AI labs (18 minute read)

Chinese and American labs look similar when you look at their outputs and ingredients, but there are major differences in how they are organized and conditioned. Chinese scientists are much more willing to do non-flashy work to improve a model rather than push their own ideas. This results in less gamification of the system and more flexibility in adapting modern techniques. The Chinese AI community feels more like an ecosystem than battling tribes, with all the labs having nothing but respect for their peers. They are also much less focused on the business side of the technology.

Long AI Short AGI (3 minute read)

Silicon Valley's narrative emphasizes AGI as the ultimate scarce resource, but the rapid commoditization of AI models challenges this. Intelligence now follows the same path as compute, bandwidth, and storage, where market forces drive competition and reduce costs. The real winners in AI won't necessarily have superior models but will own customer relationships and proprietary data, much like past tech giants.

⚡

Quick Links

Don't blame the model. Blame the harness (Sponsor)

The most powerful model can be held back by bad execution, orchestration, and observability. See how real teams are getting it right with Temporal + Grid Dynamics. Join the webinar

Google DeepMind partners with EVE Online for AI model testing (3 minute read)

EVE Online is a uniquely rich environment for study, especially for developing AI systems that use long-horizon planning, memory, and continual learning.

TLDR is hiring a Senior Software Engineer, Applied AI ($250k-$350k, Fully Remote)

TLDR's Applied AI team is tasked with making every process at TLDR legible to code, runnable by anyone, and composable into larger workflows. Join a small, fast moving team using the latest AI tools with an unlimited token budget. Learn more.

Trusted Contact for ChatGPT (6 minute read)

OpenAI introduced Trusted Contact, an optional feature that allows adults to nominate someone who may be alerted if severe self-harm risk is detected in conversations.

Perplexity Brings Personal Computer to Mac (3 minute read)

Perplexity released Personal Computer for all Mac users through its desktop app, giving AI agents access to local files, applications, connectors, and the web.

{% if RHREFLINK2 %}

Love TLDR? Tell your friends and get rewards!

Share your referral link below with friends to get free TLDR swag!

Track your referrals here.

{% endif %}

Want to advertise in TLDR? 📰

If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us.

Want to work at TLDR? 💼

Apply here, create your own rol

Codex in Chrome 🤖, inside Chinese labs 🇨🇳, improving token efficiency 🛠️