[Update February 3, 2025] We haven’t shared the model weights with anyone yet, as we’re still carefully working on a responsible release policy to ensure security and prevent any potential misuse. Thank you for your understanding.
This repository contains a collection of advanced machine learning models and deepfake detectors. The models were developed by TrueMedia.org, a non-profit organization committed to detecting political deepfakes and supporting fact-checking efforts for the 2024 election.
Each detector is contained in its own directory with specific documentation and implementation details.
1. DistilDIRE (image)
Distil-DIRE is a lightweight version of DIRE suitable for real-time applications. Instead of computing the DIRE image directly, Distil-DIRE reconstructs the features of the corresponding DIRE image, as extracted by an ImageNet-pretrained classifier, from the one-step noise of a DDIM inversion (Paper Link). A sketch of this distillation objective follows the feature list below.
- Utilizes a knowledge distillation approach from pre-trained diffusion models.
- Achieves 3.2x faster inference than the traditional DIRE framework.
- Capable of detecting diffusion-generated images as well as those produced by GANs.
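For illustration, here is a minimal sketch of the feature-distillation objective, assuming a frozen ResNet-50 teacher and a toy student network (the names, shapes, and student architecture are illustrative, not the repo's actual code):

```python
# Sketch: the student learns to predict the teacher's features of the DIRE
# image from the input image plus one-step DDIM noise (names are illustrative).
import torch
import torch.nn as nn
from torchvision.models import resnet50, ResNet50_Weights

# Frozen ImageNet-pretrained teacher; we use its penultimate features.
teacher = resnet50(weights=ResNet50_Weights.IMAGENET1K_V2)
teacher.fc = nn.Identity()
teacher.eval().requires_grad_(False)

# Hypothetical student: maps (image + one-step DDIM noise) to the teacher's
# 2048-dim feature space.
student = nn.Sequential(
    nn.Conv2d(6, 64, 3, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(128, 2048),
)

def distill_loss(image, ddim_noise, dire_image):
    """image / ddim_noise / dire_image: (B, 3, H, W) tensors."""
    with torch.no_grad():
        target = teacher(dire_image)  # features of the true DIRE image
    pred = student(torch.cat([image, ddim_noise], dim=1))
    return nn.functional.mse_loss(pred, target)
```

At inference time only the student runs, which is why a single cheap forward pass can replace the full multi-step DIRE reconstruction.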
2. UniversalFakeDetectV2 (image)
An updated implementation of the "UniversalFakeDetect" model by Ojha et al. for detecting AI-generated images. Key features:
- Untrained Feature Space Utilization: Uses the feature space of large pre-trained vision-language models like CLIP-ViT, which are not specifically trained for distinguishing real from fake images.
- Enhanced Generalization: Demonstrates superior performance in detecting fake images from a wide variety of unseen generative models, including diffusion and autoregressive models.
- Simple and Efficient Classification: Uses nearest neighbor and linear probing methods in the pre-trained feature space, avoiding extensive model training and reducing computational overhead.
- Streamlined codebase that enables training on large image datasets (100k+ images).
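As a rough sketch of the linear-probing idea, assuming CLIP features from Hugging Face `transformers` and scikit-learn for the probe (`real_paths` and `fake_paths` are hypothetical lists of image files):

```python
# Sketch: freeze CLIP-ViT, extract image features, and fit a simple linear
# probe on top. This is the general technique, not the repo's exact script.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor
from sklearn.linear_model import LogisticRegression

model = CLIPModel.from_pretrained("openai/clip-vit-large-patch14").eval()
processor = CLIPProcessor.from_pretrained("openai/clip-vit-large-patch14")

@torch.no_grad()
def clip_features(paths):
    images = [Image.open(p).convert("RGB") for p in paths]
    inputs = processor(images=images, return_tensors="pt")
    return model.get_image_features(**inputs).numpy()

# real_paths / fake_paths: placeholder lists of labeled image files.
X = clip_features(real_paths + fake_paths)
y = [0] * len(real_paths) + [1] * len(fake_paths)
probe = LogisticRegression(max_iter=1000).fit(X, y)  # the "linear probing" step
```

Because only the small linear head is fit, the backbone is never trained, which is what keeps large datasets computationally tractable.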
3. GenConViT (video)
An updated implementation of the GenConViT model by Wodajo et al. It analyzes both visual artifacts and latent data distributions. Features:
- Hybrid Architecture: Combines ConvNeXt and Swin Transformer for robust feature extraction, leveraging CNNs for local features and Transformers for global context.
- Generative Learning: Uses an autoencoder and a variational autoencoder to capture latent data distributions
- Frame-based video processing capabilities
- Streamlined training and testing scripts
- Docker deployment support
- FP16 precision support
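A minimal sketch of the hybrid-backbone idea using `timm` (the specific model variants, feature handling, and head are assumptions, not GenConViT's exact configuration):

```python
# Sketch: concatenate ConvNeXt (local artifacts) and Swin Transformer (global
# context) features per frame, then classify. Dims and variants are assumed.
import timm
import torch
import torch.nn as nn

class HybridBackbone(nn.Module):
    def __init__(self):
        super().__init__()
        # num_classes=0 makes timm return pooled features instead of logits.
        self.convnext = timm.create_model("convnext_tiny", pretrained=True, num_classes=0)
        self.swin = timm.create_model("swin_tiny_patch4_window7_224", pretrained=True, num_classes=0)
        dim = self.convnext.num_features + self.swin.num_features
        self.head = nn.Linear(dim, 2)  # real vs. fake

    def forward(self, frames):  # frames: (B, 3, 224, 224), one batch of video frames
        local_feats = self.convnext(frames)   # CNN branch: local features
        global_feats = self.swin(frames)      # Transformer branch: global context
        return self.head(torch.cat([local_feats, global_feats], dim=1))
```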
4. StyleFlow (video)
StyleFlow combines a StyleGRU module with a Style Attention Module (SAM) to capture temporal and visual inconsistencies (Paper Link). Features:
- Style Latent Flow Analysis: Captures anomalies in the temporal changes of style latent vectors, highlighting suppressed variance in deepfake videos.
- StyleGRU Module with Contrastive Learning: Encodes the dynamics of style latent flows using a GRU network trained with supervised contrastive learning to extract style-based temporal features.
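A minimal sketch of the StyleGRU component, assuming per-frame style latent vectors have already been extracted (dimensions and names are illustrative):

```python
# Sketch: a GRU encodes the frame-to-frame differences ("flow") of style
# latents; suppressed variance in that flow is the deepfake cue.
import torch
import torch.nn as nn

class StyleGRU(nn.Module):
    def __init__(self, style_dim=512, hidden=256):
        super().__init__()
        self.gru = nn.GRU(style_dim, hidden, batch_first=True)

    def forward(self, style_latents):  # (B, T, style_dim), one latent per frame
        flow = style_latents[:, 1:] - style_latents[:, :-1]  # temporal deltas
        _, h = self.gru(flow)          # final hidden state summarizes the dynamics
        return h.squeeze(0)            # (B, hidden) style-based temporal feature
```

In the paper this feature is trained with supervised contrastive learning and then fed to SAM; both steps are omitted here.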
5. Reverse Research (image)
A reverse image search pipeline for deepfake detection that leverages multiple APIs:
- Google Vision API integration
- Support for Perplexity and GPT-4
- Docker containerization
- Health monitoring endpoints
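A hedged sketch of the Google Vision step using the `google-cloud-vision` client (the Perplexity/GPT-4 analysis and any scoring of the returned matches are omitted):

```python
# Sketch: ask Google Vision's web detection for pages that contain the same
# or a visually similar image, a starting point for provenance checks.
from google.cloud import vision

def web_matches(image_path):
    client = vision.ImageAnnotatorClient()
    with open(image_path, "rb") as f:
        image = vision.Image(content=f.read())
    response = client.web_detection(image=image)
    web = response.web_detection
    # URLs of pages that embed a matching image.
    return [page.url for page in web.pages_with_matching_images]
```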
6. FTCN (video)
An updated implementation of the FTCN (Fully Temporal Convolution Network) model (Paper Link). The goal is to leverage long-term temporal coherence using a Temporal Transformer Network.
- Uses temporal-only convolutions by reducing spatial kernel sizes to focus on temporal features, encouraging the model to learn temporal incoherence.
- End-to-End Training Without Pre-training: The framework can be trained from scratch without any pre-trained models or external datasets.
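The temporal-only trick is easy to see with a 3D convolution whose spatial kernel is collapsed to 1x1 (channel and clip sizes below are arbitrary):

```python
# Sketch: a standard spatio-temporal conv vs. an FTCN-style temporal-only conv.
# With a (3, 1, 1) kernel, information can only mix across time, never space,
# which pushes the model to learn temporal incoherence.
import torch
import torch.nn as nn

spatio_temporal = nn.Conv3d(64, 64, kernel_size=(3, 3, 3), padding=(1, 1, 1))
temporal_only   = nn.Conv3d(64, 64, kernel_size=(3, 1, 1), padding=(1, 0, 0))

clip = torch.randn(2, 64, 16, 56, 56)  # (batch, channels, time, height, width)
out = temporal_only(clip)              # spatial locations never mix: (2, 64, 16, 56, 56)
```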
7. Transcript Based Detector (audio)
- Uses a speech-recognition model and an LLM to make predictions on audio veracity
- Predictions are based on language construction, narrative coherence, and factual alignment
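One plausible shape for such a pipeline, assuming Whisper for transcription and an OpenAI chat model for the judgment step (the model choices and prompt are assumptions, not the detector's actual configuration):

```python
# Sketch: transcribe the audio, then ask an LLM to assess the transcript
# along the dimensions listed above.
import whisper
from openai import OpenAI

def analyze_audio(path):
    transcript = whisper.load_model("base").transcribe(path)["text"]
    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{
            "role": "user",
            "content": "Assess this transcript for signs of fabrication, "
                       "considering language construction, narrative coherence, "
                       f"and factual alignment:\n\n{transcript}",
        }],
    )
    return response.choices[0].message.content
```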
Each model has its own setup requirements and dependencies. Please refer to the individual directories for specific setup instructions.
We believe in advancing responsible AI development through collaboration. To receive the trained weights for our deepfake detection models:
- Email [email protected] with subject "Weight Access Request"
- Include:
  - Your affiliation
  - Intended usage purpose
  - Research/project outline
  - Agreement to ethical guidelines
Access is granted to verified users only. We carefully review each request to ensure responsible usage of these tools.
This project is licensed under the terms of the MIT license. Note that our GenConViT model is licensed differently in a separate repository.
The Machine Learning engineers at TrueMedia.org improved open-source models and also developed new detectors. We're now making these advancements available to the community.