Vision-Language Models

Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial-Ground Robotic System