An arXiv paper shows frozen vision-language-action policies can absorb new tasks at test time by retrieving pool-side demonstrations instead of per-task fine-tuning.
2 results for: vision-language
AllenAI launches vla-eval to unify Vision-Language-Action benchmarking
vla-eval decouples model inference from simulator execution with a WebSocket+msgpack protocol and Docker isolation, supporting 14 benchmarks and six model servers.