Skip to content
@dvlab-research

DV Lab

Deep Vision Lab

Pinned

  1. LISA LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 1.5k 102

  2. LongLoRA LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2.5k 252

  3. MGM MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 3k 272

  4. LLaMA-VID LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python 598 38

  5. Video-P2P Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python 334 23

  6. LLMGA LLMGA Public

    This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python 259 17

Repositories

Showing 10 of 63 repositories
  • MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 3,018 Apache-2.0 272 45 2 Updated May 4, 2024
  • MR-GSM8K Public

    Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

    Python 34 0 2 0 Updated Apr 25, 2024
  • LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 1,500 Apache-2.0 102 52 1 Updated Apr 8, 2024
  • GroupContrast Public

    [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

    35 MIT 1 2 0 Updated Mar 15, 2024
  • Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python 334 23 5 0 Updated Mar 12, 2024
  • Parametric-Contrastive-Learning Public

    Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)

    Python 226 MIT 29 5 0 Updated Feb 29, 2024
  • LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2,488 Apache-2.0 252 41 1 Updated Feb 11, 2024
  • Prompt-Highlighter Public

    [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

    Python 103 MIT 2 2 0 Updated Jan 25, 2024
  • LLMGA Public

    This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python 259 Apache-2.0 17 3 0 Updated Jan 22, 2024
  • LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python 598 Apache-2.0 38 29 0 Updated Jan 10, 2024

Top languages

Loading…

Most used topics

Loading…