You can upload images, ask questions about images using voice prompts, then listen to the responses in voice
text-to-speech
speech-to-text
whisper
gtts
replicate
large-language-models
llm
blip-2-ai-model
answering-questions
-
Updated
May 31, 2023 - JavaScript