Collaborative generation of unique audiovisual experiences using NFC identity cards
-
Updated
Jan 20, 2021 - TypeScript
Collaborative generation of unique audiovisual experiences using NFC identity cards
Todo o conteúdo produzido para a unidade curricular PF (Projeto FEUP), para o curso em Engenharia Informática e Computação na FEUP
Multitasking multimodal AI material that focus on human interaction and assistance
Utilizing a multimodal architecture to predict the appropriate speaker turn in a dialogue.
This repo collects Multi-modal Machine Learning papers.
AMR extension for the spatial domain, with grounded frame of reference tracking
Multi-angle Lip Multimodal Video Data
Accepted at The Web Conference 2024.
🤖 A framework for building AI Agents with LLMs, integrating multimodal generative AI technologies including voice, images, videos, and digital humans 🌈💎✨
A notebook to learn about ML for astronomy through BTSbot.
Visuo-haptic integration during texture exploration
In this course, you’ll select open source models from Hugging Face Hub to perform NLP, audio, image and multimodal tasks using the Hugging Face transformers library.
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Audio, Image, Video, Music and 3D content. 🔥
Comparison of multimodal models for Emotion Detection on IEMOCAP
The first public transport search written in Flutter Web
A repository of Video Language papers, code and datasets.
Multimodales Programmieren mit Processing
Repository to document and advertise our McGill Capstone Group 22 Project
Add a description, image, and links to the multimodal topic page so that developers can more easily learn about it.
To associate your repository with the multimodal topic, visit your repo's landing page and select "manage topics."