FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀
-
Updated
Jun 3, 2024 - Jupyter Notebook
FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀
LAVIS - A One-stop Library for Language-Vision Intelligence
A curated list of awesome vision and language resources for earth observation.
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Audio, Image, Video, Music and 3D content. 🔥
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
A codebase dedicated to exploring multimodal learning approaches by integrating images of host galaxies of supernovae and their corresponding light-curves and spectra.
A curated list of awesome Multimodal studies.
A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
This is my personal news list updates in Information Retrieval domain
Multimodal Computer Vision application leveraging object detections, gesture recognition and speech to text, in order to help user ask questions about their environment.
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
Movie detection application.
Demo for Binding Text, Images, Graphs, and Audio for Music Representation Learning
Code for Neural Plasticity-Inspired Foundation Model for Observing the Earth Crossing Modalities
Pure C 3D Hybrid GAN using Cross attention, attention and convolution
Improving Chest X-Ray Report Generation by Leveraging Warm-Starting
Python framework to extract multimodal features for multimodal recommendation in a highly-customizable way.
A data science project to predict online pet adoption speed using image, natural language, and tabular data with a multi-modal ML framework.
Add a description, image, and links to the multimodal-deep-learning topic page so that developers can more easily learn about it.
To associate your repository with the multimodal-deep-learning topic, visit your repo's landing page and select "manage topics."