Harness design for long-running application development
Harness design is key to performance at the frontier of agentic coding. Here’s how we pushed Claude further in frontend design and long-running autonomous software engineering.
OpenAI to acquire Astral
Accelerates Codex growth to power the next generation of Python developer tools
Robots Must Be Ephemeralized
There is a subfield of robotics research called “sim-to-real” (sim2real) whereby one attempts to solve a robotic task in simulation, and then get a real robot to do the same thing in the real world. My team at Google utilizes Sim2Real techniques exten…
ML Mentorship: Some Q/A about RL
One of my ML research mentees is following OpenAI’s Spinning up in RL tutorials (thanks to the nice folks who put that guide together!). She emailed me some good questions about the basics of Reinforcement Learning, and I wanted to share some of m…