More Devins in More Places (3 minute read)
Cognition raised over $1B at a $26B valuation, backed by major investors, to expand Devin, its AI software engineer. Devin has cut project times and improved automation for clients like Mercedes-Benz and Itaú. Cognition aims to further streamline software development by matching models to tasks while expanding its engineering capabilities.
|
Biohub releases a world model of protein biology (9 minute read)
Biohub has made its open discovery engine for protein structure prediction, design, and biological discovery available to researchers everywhere. The release includes ESMC, a state-of-the-art language model that has internalized the fundamental properties that govern protein biology; ESMFold2, an engine built to transform ESMC's sequence representations into atomically resolved 3D structures of biomolecular complexes; and ESM Atlas, which makes ESMC's representations navigable across 6.8 billion protein sequences and 1.1 billion predicted structures. All three are freely available to the global scientific community.
|
Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL (4 minute read)
The blog post introduces "Delta Weight Sync," a method for shrinking the weight synchronization payload in async RL. It transmits only the model parameters that changed between RL steps, cutting data transfer from gigabytes to megabytes. A Hugging Face Hub "bucket" serves as object storage for the high-frequency updates, so the trainer and inference engine can run in separate locations without communicating directly.
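The idea of sending only changed parameters can be sketched in a few lines. This is a minimal illustration, not the method from the post: the `compute_delta` and `apply_delta` helpers, the flat-list weight layout, and the parameter names are all hypothetical, and the sketch assumes both sides hold checkpoints with identical parameter names and sizes. How TRL encodes deltas or writes them to a Hub bucket is not shown here.

```python
from typing import Dict, List, Tuple

# Hypothetical layout for illustration: parameter name -> flat list of values.
Weights = Dict[str, List[float]]
# A delta keeps, per parameter, only the (index, new_value) pairs that changed.
Delta = Dict[str, List[Tuple[int, float]]]


def compute_delta(old: Weights, new: Weights) -> Delta:
    """Trainer side: collect only the entries that differ from the last sync."""
    delta: Delta = {}
    for name, new_values in new.items():
        old_values = old[name]
        changed = [
            (i, v)
            for i, (o, v) in enumerate(zip(old_values, new_values))
            if o != v
        ]
        if changed:  # unchanged parameters are omitted entirely
            delta[name] = changed
    return delta


def apply_delta(weights: Weights, delta: Delta) -> Weights:
    """Inference side: rebuild the new weights from the old copy plus the delta."""
    updated = {name: list(values) for name, values in weights.items()}
    for name, changes in delta.items():
        for index, value in changes:
            updated[name][index] = value
    return updated


old = {"layer.weight": [0.1, 0.2, 0.3, 0.4], "layer.bias": [0.0, 0.0]}
new = {"layer.weight": [0.1, 0.25, 0.3, 0.4], "layer.bias": [0.0, 0.0]}

delta = compute_delta(old, new)      # only one of six values is transmitted
restored = apply_delta(old, delta)   # receiver ends up with the new weights
```

In this toy case the payload holds a single index and value instead of all six numbers. The saving grows with model size whenever only a small fraction of parameters changes between steps.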
|
Building self-improving tax agents with Codex (17 minute read)
Real-world systems often behave differently in production than they do in the lab. Teams tend to discover the resulting failures only after launch, then spend weeks fixing them, a feedback loop that is slow and manual. It is now possible to build agents that improve themselves. This post looks at how OpenAI used Codex to build such an agent at Thrive Holdings, resulting in an AI that can prepare increasingly complex tax returns.
|
I think Anthropic and OpenAI have found product-market fit (11 minute read)
Both Anthropic and OpenAI have started aggressively pricing their APIs. This is likely because they have found product-market fit with coding and general-purpose agent products. When companies spend over $200 per month per user, these businesses can cover their costs far better than they can at $10 to $20 per month per user. Coding agents amplify this spending significantly.
|
Secure MCP Tunnel (6 minute read)
Secure MCP Tunnel lets private MCP servers connect to OpenAI products without being exposed to the internet. It uses tunnel-client to establish outbound HTTPS paths for request handling, so the servers stay private. The solution integrates with existing systems, supports enterprise networking requirements, and keeps data flow secure.
|
Introducing Apex: A Fast, Specialized Model for React Native (6 minute read)
Apex is a React Native coding model trained to build apps by analyzing architecture decisions, fixing framework-specific issues, and reasoning about constraints. While it doesn't match frontier models on coding benchmarks, the optimized model significantly alters the performance-to-cost ratio within its specific domain. The model is still in development and is available in a private beta with selected teams.
|
LiteParse v2.0 (1 minute read)
LiteParse is a standalone OSS PDF parsing tool that provides high-quality spatial text parsing with bounding boxes without proprietary LLM features or cloud dependencies. It features fast text parsing, screenshot generation, and support for multiple languages, platforms, and output formats. Everything runs locally on users' machines.
|
Nvidia bets $150B on Taiwan as Trump's plan to make US an AI hub backfires (13 minute read)
Nvidia will invest $150 billion a year to make sure that Taiwan remains at the epicenter of the AI revolution. The investment is aimed at cementing Taiwan's long-term position as the world's tech manufacturing hub. Nvidia will create a new headquarters in Taiwan to expand its partnership with TSMC, benefit from close proximity to advanced packaging technology not yet available at TSMC's factories in the US, and strengthen its alliances with other nearby partners. Expanding the AI ecosystem also helps Nvidia's bottom line.