@eric-ai-lab

UCSC ERIC Lab

UCSC Embodied and Responsible Interaction and Communication (ERIC) Lab

Pinned

  1. MiniGPT-5 Public

    Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

    Python 840 52

  2. photoswap Public

    Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"

    Jupyter Notebook 338 24

  3. awesome-vision-language-navigation Public

    A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

    329 19

  4. PEViT Public

    Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"

    Python 94 5

  5. VLMbench Public

    NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"

    Python 78 8

  6. Aerial-Vision-and-Dialog-Navigation Public

    Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"

    Python 36 6

Repositories

Showing 10 of 25 repositories
  • MMWorld Public

    Official repo of the paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"

    Python 18 MIT 1 0 0 Updated Aug 29, 2024
  • ComCLIP Public

    Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

    Python 30 MIT 2 0 1 Updated Aug 18, 2024
  • Screen-Point-and-Read Public

    Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

    Python 18 2 0 0 Updated Jul 31, 2024
  • ProbMed Public

    "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"

    Python 12 1 1 0 Updated Jun 24, 2024
  • via-video
    22 0 0 0 Updated Jun 20, 2024
  • R2H Public

    Official implementation of the EMNLP 2023 paper "R2H: Building Multimodal Navigation Helpers that Respond to Help Requests"

    Python 4 1 0 0 Updated Jun 19, 2024
  • ViCor Public

    Implementation of the ACL 2024 Findings paper "ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models"

    2 0 0 0 Updated Jun 11, 2024
  • awesome-vision-language-navigation Public

    A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

    329 MIT 19 0 0 Updated May 2, 2024
  • Discffusion Public

    Official repo for the paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"

    Python 26 MIT 3 0 0 Updated Apr 27, 2024
  • MultipanelVQA Public

    Code for the MultipanelVQA benchmark "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA"

    Jupyter Notebook 7 MIT 0 0 0 Updated Apr 11, 2024
