Having Optimus learn straight from internet videos

6 views
Skip to first unread message

Gmail

unread,
May 22, 2025, 12:58:02 PM5/22/25
to RSSC-List

From LinkedIn:


Milan Kovac • 1st

VP, Head of Engineering - Optimus (Tesla...

21h • 0


One of our goals is to have Optimus learn straight from internet videos of humans doing tasks. Those are often 3rd person views captured by random cameras etc.


We recently had a significant breakthrough along that journey, and can now transfer a big chunk of the learning directly from human videos to the bots (1st person views for now).


This allows us to bootstrap new tasks much faster compared to teleoperated bot data alone (heavier operationally).


Many new skills are emerging through this process, are called for via natural language (voice/text), and are run by a single neural network on the bot (multi-tasking).


Next: expand to 3rd person video transfer (aka random internet), and push reliability via self-play (RL) in the real-, and/or synthetic-(sim / world models) world.




Thomas


-  

Need something prototyped, built or coded? I’ve been building prototypes for companies for 15 years. I am now incorporating generative AI into products.

Contact me directly or through LinkedIn:   

Reply all
Reply to author
Forward
0 new messages