UCSC ERIC Lab
Repositories
- awesome-vision-language-navigation Public
A curated list of resources for vision-and-language navigation, accompanying the ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"
- Discffusion Public
Official repo for the paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
- MultipanelVQA Public
Code for the MultipanelVQA benchmark "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA"
- Naivgation-as-wish Public
Official implementation of the NAACL 2024 paper "Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning"
- swap-anything Public
"SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"
- llm_coordination Public
Code repository for the paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models"
- minigpt-5.github.io Public