llava

gokayfem / ComfyUI_VLM_nodes

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

image-captioning nodes vlm custom-nodes img2text llm mllm llava comfyui siglip phi15 joytag img2sfx

Updated Jun 2, 2024
Python

jhc13 / taggui

Tag manager and captioner for image datasets

image-captioning image-tagging tag-manager pyside6 stable-diffusion llava moondream cogvlm

Updated Jun 2, 2024
Python

Faris-abukhader / faris-social

A unique blend of features from your favorite social media platforms like Facebook, Twitter, Reddit, and Instagram, all in one convenient place.

typescript nextjs drizzle prisma tailwindcss trpc next-i18next zustand neondb llava ollama valibot mistral-7b

Updated Jun 2, 2024
TypeScript

modelscope / swift

ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 30+ MLLMs

Updated Jun 2, 2024
Python

eliranwong / freegenius

FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp, Ollama, or the Groq Cloud API, with optional integration with AutoGen agents, the OpenAI API, Google Gemini Pro, and unlimited plugins.

google ai gemini vision openai mistral autogen groq stable-diffusion chatgpt llava llamacpp ollama llama3

Updated Jun 1, 2024
Python

maheshmnj / Image-Captioning-using-llava-and-llama3

Image Caption Generator using llava and llama3 through the ollama library

vision llava ollama llama3

Updated Jun 1, 2024
Python

Victorwz / MLM_Filter

Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".

data-filtering data-quality-assessment large-language-models llava multimodal-large-language-models image-text-data

Updated May 31, 2024
Python

jakobdylanc / discord-llm-chatbot

llmcord.py • Talk to LLMs with your friends!

Updated May 31, 2024
Python

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent chatbot conversational-ai peft baichuan msagent large-language-models llm supervised-finetuning llava llm-training chatglm2 internlm llama2 qwen chatglm3 mixtral llama3 phi3

Updated May 31, 2024
Python

PaddlePaddle / PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

image-to-text clip text-to-image dit multimodal sora text-to-video aigc stable-diffusion controlnet llava blip2 minigpt4 sd-xl ppdiffusers eva-clip stablevideodiffusion qwen-vl

Updated Jun 2, 2024
Python

Blaizzy / mlx-vlm

MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.

mlx vision-framework apple-silicon vision-transformer llm vision-language-model llava local-ai idefics paligemma

Updated May 31, 2024
Python

Seeed-Projects / jetson-examples

jetson-examples: run AI models and applications on NVIDIA Jetson devices with a one-line command.

nvidia llama gpt jetson multimodal llm jetson-orin llava llama3 jetson-examples

Updated May 31, 2024
Shell

modelscope / data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Updated May 31, 2024
Python

open-compass / VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4v, Gemini, QwenVLPlus, 50+ HF models, and 20+ benchmarks

computer-vision evaluation pytorch gemini openai vqa vit gpt multi-modal clip claude openai-api gpt4 large-language-models llm chatgpt llava qwen gpt-4v

Updated May 31, 2024
Python

apocas / restai

RestAI is an AIaaS (AI as a Service) open-source platform built on top of LlamaIndex, Ollama, and HF Pipelines. It supports any public LLM supported by LlamaIndex and any local LLM supported by Ollama, with precise embeddings usage and tuning.

python transformers embeddings openai llama rag fastapi llm stable-diffusion langchain openaiapi llava llamaindex ollama

Updated May 31, 2024
Python

whwu95 / FreeVA

FreeVA: Offline MLLM as Training-Free Video Assistant

chatbot video-understanding zero-shot-video-captioning video-question-answering chatgpt vision-language-model llava training-free multimodal-large-language-models

Updated May 31, 2024
Python

Here are 105 public repositories matching this topic...

spongedsc / pathways

mgonzs13 / llama_ros

SciSharp / LLamaSharp

ollama / ollama

gokayfem / ComfyUI_VLM_nodes

eliranwong / freegenius
