Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for Image, Video, and Audio input into Forge and AutoGPT #7152

Open
1 task done
ntindle opened this issue May 14, 2024 · 0 comments
Open
1 task done

Add support for Image, Video, and Audio input into Forge and AutoGPT #7152

ntindle opened this issue May 14, 2024 · 0 comments
Labels
fridge Items that can't be processed right now but can be of use or inspiration later

Comments

@ntindle
Copy link
Member

ntindle commented May 14, 2024

Duplicates

  • I have searched the existing issues

Summary 馃挕

Adding support for Image, Video, and Audio inputs into the AutoGPT system is more than just supporting it at the fastapi server level, it includes passing them through the MultiProvider for LLMs and checking which LLMs support which features as part of their configs.

Examples 馃寛

No response

Motivation 馃敠

The future of Agents is multimodal

@ntindle ntindle added the fridge Items that can't be processed right now but can be of use or inspiration later label May 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
fridge Items that can't be processed right now but can be of use or inspiration later
Projects
Status: No status
Development

No branches or pull requests

1 participant