Baseline model for multimodal classification based on images and text. Text representations are obtained from a pretrained BERT base model, and image representations from a pretrained VGG16 model.
Updated Aug 26, 2022 - Jupyter Notebook
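As a rough illustration of the late-fusion idea behind a text-plus-image baseline like the one above, here is a minimal sketch in plain Python. The listing does not say how that repository combines the two modalities, so this assumes decision-level fusion: each modality gets its own classifier, and the class probabilities are averaged. The feature vectors are random stand-ins for encoder outputs (768 values matches the hidden size of BERT base, 4096 matches VGG16's fully connected layers), and every name here (`linear_head`, `late_fusion`, `random_head`, `NUM_CLASSES`) is hypothetical.

```python
import math
import random

TEXT_DIM = 768    # hidden size of BERT base
IMAGE_DIM = 4096  # width of VGG16's fully connected layers
NUM_CLASSES = 3   # hypothetical number of target classes


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def linear_head(features, weights, bias):
    """One linear layer followed by softmax: a per-modality classifier."""
    logits = [sum(w * f for w, f in zip(row, features)) + b
              for row, b in zip(weights, bias)]
    return softmax(logits)


def late_fusion(text_probs, image_probs, text_weight=0.5):
    """Weighted average of the two modalities' class probabilities."""
    return [text_weight * t + (1.0 - text_weight) * i
            for t, i in zip(text_probs, image_probs)]


def random_head(in_dim, rng):
    """Untrained placeholder weights; a real model would learn these."""
    weights = [[rng.gauss(0.0, 0.01) for _ in range(in_dim)]
               for _ in range(NUM_CLASSES)]
    return weights, [0.0] * NUM_CLASSES


rng = random.Random(0)
# Stand-ins for encoder outputs; real vectors would come from BERT and VGG16.
text_features = [rng.gauss(0.0, 1.0) for _ in range(TEXT_DIM)]
image_features = [rng.gauss(0.0, 1.0) for _ in range(IMAGE_DIM)]

text_probs = linear_head(text_features, *random_head(TEXT_DIM, rng))
image_probs = linear_head(image_features, *random_head(IMAGE_DIM, rng))
fused = late_fusion(text_probs, image_probs, text_weight=0.5)
predicted_class = max(range(NUM_CLASSES), key=fused.__getitem__)
```

Because each modality is scored independently, either classifier can be retrained or swapped without touching the other; `text_weight` is the only coupling between them and could be tuned on a validation set.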
Visual Fusion of Camera and LiDAR Sensor
This project aims to develop a multimodal deep learning model for fall detection (sleep vs. fall).