Audio models for fact-checking against textual documents. The ImageBind, CLAP, and QFormer implementations each live on a separate branch.
Branches:
clap-mod - CLAP implementation
dev-sachetch - QFormer implementation
dev-john - ImageBind implementation