Hacker News · 1 min read
Show HN: PhAIL – Real-robot benchmark for AI model
I built this because I couldn't find honest numbers on how well VLA models [1] actually work on commercial tasks. I come from search ranking at Google where you measure everything, and in robotics nobody seemed to know.

PhAIL runs four models (OpenPI/pi0.5, GR00T, ACT, SmolVLA) on bin-to-bin order picking – one of the most common warehouse operations. Same robot (Franka FR3), same objects, hundreds of blind runs. The operator doesn't know which model is running.

Best model: 64 UPH. Human teleopera
Read original on phail.ai