✨✨Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
-
Updated
May 19, 2024
✨✨Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
A collection of visual instruction tuning datasets.
🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)
A Video Chat Agent with Temporal Prior
Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey
Add a description, image, and links to the visual-instruction-tuning topic page so that developers can more easily learn about it.
To associate your repository with the visual-instruction-tuning topic, visit your repo's landing page and select "manage topics."