Google's vision-language-action model — robots that reason from web knowledge
Google's vision-language-action model for robotic control