Feature Extraction
The process of computing informative, discriminative representations from raw sensor data. In robotics, visual feature extraction uses CNN or ViT encoders to transform images into compact feature vectors. Feature extraction can be end-to-end (learned jointly with the policy) or use frozen pre-trained encoders (DINOv2, CLIP). The quality of features directly impacts policy performance.