Thursday, June 27, 2024
CONVERSATION AS THE COMMAND
By combining large language models (LLMs) with vision-language models (VLMs, which label images), it is possible for autonomous robots to achieve their tasks by conversing with humans in natural language. This feat makes “conversation the command,” instead of requiring operators to issue robots explicit commands in their own structured language.
FULL TEXT