Open-Vocabulary Manipulation
The ability of a robot to manipulate objects specified by free-form language descriptions, even objects not seen during training. Open-vocabulary manipulation combines VLMs (for identifying objects from language queries) with manipulation policies. Models like CLIPort, VIMA, and RT-2 demonstrate open-vocabulary capabilities by grounding language in visual observations.