Writing

Thinking out loud about engineering and research.

Occasional essays on machine learning, robotics, software craft, and the messy gap between academic research and production systems. Published when I have something worth saying.

Featured · ML Research

The Reproducibility Problem in Deep Reinforcement Learning — and What We Can Actually Do About It

After spending three years trying to reproduce results from top-tier RL papers — and failing more often than I'd like to admit — I've come to believe the field has a reproducibility crisis that goes deeper than most people acknowledge. Here's what I think is actually going wrong, and what a more honest culture of experimentation might look like.

Emre ARI

March 18, 202614 min read

All Writing

Recent articles

AllML ResearchEngineeringRoboticsOpinionTutorials

Robotics

Why Sim-to-Real Transfer Still Fails in the Interesting Cases

The gap between simulation performance and real-world results isn't just about physics accuracy. Here's what I've learned after two years of trying to close it on a collaborative arm.

Feb 4, 202611 minRead

Engineering

Writing C++ for Robotics: Lessons from Building Kepler

Building a motion planning library that researchers actually want to use is harder than getting the algorithms right. Here are the software engineering decisions that made Kepler adopted by three labs.

Jan 20, 20269 minRead

ML Research

The Case for Smaller, Better-Evaluated Models

The race toward scale has produced impressive benchmarks and underwhelming deployment stories. A contrarian argument for why the next decade of ML should go deep rather than large.

Jan 6, 202613 minRead

Tutorial

Setting Up a ROS 2 Development Environment That Won't Make You Miserable

A practical guide to Docker-based ROS 2 development with proper IDE integration, GPU passthrough, and a workflow that survives team handoffs. Everything I wish I'd had three years ago.

Dec 14, 202516 minRead

Opinion

Academia vs. Industry: A False Dichotomy in ML Engineering

After working in both research labs and production engineering teams, I've stopped believing the divide between them is as fundamental as both sides claim. Here's why the conversation is more nuanced.

Nov 28, 20258 minRead

ML Research

What I Learned from Six Months of Benchmarking Energy Forecasting Models

A deep dive into temporal distribution shift, evaluation methodology, and why the model that wins on the leaderboard almost never wins in production. Lessons from the Atlas project.

Oct 10, 202510 minRead

Load more articles