Vision demo

Locate & describe

Take or upload a photo, then ask each model. LocateAnything draws boxes; Cosmos answers in text.

Locate prompts — what to find (one or more)