Skip to content

A multilingual research initiative detecting, classifying & countering COVID-19 misinformation on social media through AI-driven tools, open datasets and community workshops.

Notifications You must be signed in to change notification settings

wuyoscar/VESKI_Colab_Research_Project.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

4 Commits
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿฆ  VESKI Infodemic Management Research Project

Research Project Multilingual Dataset License

๐ŸŒ A multilingual research initiative detecting, classifying & countering COVID-19 misinformation on social media through AI-driven tools, open datasets and community workshops.

๐ŸŒ Web Platform โ€ข ๐ŸŽž๏ธ Platform Demo โ€ข ๐ŸŽ“ Workshop Series

Project Banner


๐Ÿš€ Project Highlights

๐ŸŽฏ Impact Metrics

  • 52+ Languages supported globally
  • 45,904 news articles analyzed
  • 90,000+ social media posts processed
  • 235 fact-checking sites integrated
  • 2 international workshops conducted

๐Ÿ† Key Achievements

  • ๐ŸŒŸ World's largest multilingual COVID-19 misinformation corpus
  • ๐Ÿค– State-of-the-art AI detection models
  • ๐Ÿ› ๏ธ Real-time fact-checking web platform
  • ๐Ÿ“Š Open-access research datasets
  • ๐ŸŒ Cross-cultural intervention strategies

๐Ÿ“‹ Project Overview

The "Multilingual COVID-19 Fake News Detection and Intervention on Social Media" project represents a groundbreaking interdisciplinary initiative that bridges machine learning, data journalism, and public health research. By leveraging cutting-edge AI technologies and comprehensive multilingual datasets, this project empowers researchers, journalists, and policy-makers to effectively identify, analyze, and combat the spread of COVID-19 misinformation across diverse social media platforms.

๐ŸŒ Mission Statement

"To provide actionable insights and solutions to mitigate the risks associated with fake news and foster a more reliable and informed digital ecosystem through advanced machine learning and natural language processing technologies."

Our comprehensive approach encompasses real-time detection systems, extensive data collection, user-friendly verification tools, and evidence-based intervention strategies to address the complex challenges of the global infodemic.


๐ŸŽฏ Research Objectives

graph TD
    A[๐Ÿ” Data Collection] --> B[๐Ÿค– AI Model Development]
    B --> C[๐Ÿ› ๏ธ Tool Creation]
    C --> D[๐Ÿ—ฃ๏ธ Intervention Strategies]
    D --> E[๐Ÿ“Š Open Research]
    E --> A
    
    A --> A1[52+ Languages<br/>Multilingual Corpus]
    B --> B1[State-of-the-art<br/>Detection Models]
    C --> C1[Fact-checking<br/>Web Platform]
    D --> D1[Community Workshops<br/>& Surveys]
    E --> E1[Open-access<br/>Resources]
Loading
  • ๐Ÿ” Build the world's largest multilingual COVID-19 misinformation corpus spanning 52+ languages
  • ๐Ÿค– Develop state-of-the-art detection & verification models for text and social context analysis
  • ๐Ÿ› ๏ธ Design intuitive fact-checking and visual analytics tools for end-users
  • ๐Ÿ—ฃ๏ธ Evaluate intervention strategies through comprehensive surveys & community workshops
  • ๐Ÿ“Š Provide open-access resources for global research collaboration

๐Ÿ› ๏ธ Tools & Resources

โšก AI-Powered Fact-Checking Tool

Fact-Checking Tool Screenshot

An interactive web application that leverages advanced machine learning algorithms to classify COVID-19 related claims in real-time. The tool provides instant verdicts with confidence scores and detailed explanations of the decision-making process, making it accessible for both researchers and general users.

๐Ÿ”ง Key Features:

  • โœ… Real-time claim verification - Instant analysis of user-submitted content
  • ๐ŸŒ Multilingual support - Processing in 52+ languages
  • ๐Ÿ“Š Confidence scoring system - Transparent reliability indicators
  • ๐Ÿ” Transparent decision explanations - Detailed reasoning behind classifications
  • ๐ŸŽฏ User-friendly interface - Accessible for both researchers and general public

๐Ÿ“Š DataLab โ€“ Multilingual COVID-19 Misinformation Dataset

DataLab Screenshot

Our comprehensive dataset represents the most extensive collection of multilingual COVID-19 misinformation data available for academic research. Updated weekly with rich social engagement metadata, this resource enables researchers worldwide to develop and test their own detection models.

๐Ÿ“ˆ Dataset Statistics

Category Metric Value
๐Ÿ“ฐ Content Total Records 45,904 news articles
๐ŸŒ Coverage Languages 52 languages
๐Ÿฆ Social Media Twitter Posts 90,000+ tweets
๐Ÿ“บ Publishers News Sources 56 verified sources
โœ… Verification Fact-check Sites 235 international platforms

๐ŸŽฏ Dataset Features:

  • ๐Ÿ”„ Weekly automated updates - Fresh data continuously added
  • ๐Ÿ“Š Rich metadata - Social engagement metrics, timestamps, source tracking
  • โœ… Verified fact-check labels - Professional fact-checker annotations
  • ๐Ÿ†“ Open access for academic use - Free for research purposes
  • ๐Ÿ”ง Standardized format - Cross-platform compatibility

๐Ÿ“‘ Community Survey & Questionnaire

Survey Interface

A comprehensive stratified online survey designed to understand public exposure patterns, belief systems, and behavioral responses to COVID-19 misinformation. Deployed across Indonesia and Australia, this research component provides crucial insights into the real-world impact of misinformation on diverse communities.

๐Ÿ”ฌ Research Scope:

  • ๐ŸŒ Cross-cultural comparative analysis - Indonesia vs Australia
  • ๐Ÿ‘ฅ Demographic stratification - Age, education, location factors
  • ๐Ÿ“ˆ Behavioral pattern identification - Information consumption habits
  • ๐ŸŽฏ Intervention effectiveness measurement - Strategy impact assessment
  • ๐Ÿ“‹ Policy recommendation development - Evidence-based guidelines

๐Ÿค Conducted in partnership with Deakin University and Universitas Gadjah Mada


๐ŸŽ“ Workshops & Outreach Programs

Our knowledge transfer initiative includes comprehensive research workshops that bring together international scholars, practicing journalists, and frontline health communicators. These collaborative sessions facilitate the sharing of methodologies, early research findings, and evidence-based policy recommendations.

๐Ÿ“… Workshop Series:

๐ŸŽฏ First Workshop

  • Methodological foundations
  • Initial research findings
  • Tool demonstrations
  • Collaborative discussions

๐Ÿš€ Second Workshop

  • Advanced techniques
  • Policy applications
  • Intervention strategies
  • Community feedback

๐Ÿ”— Resources:

  • ๐Ÿ“น Workshop Recording โ†’ Watch on YouTube
  • ๐Ÿ“Š Ongoing Webinars โ†’ Regular updates and community engagement
  • ๐Ÿ“‹ Workshop Materials โ†’ Presentations and resources available

๐Ÿค– Core Technologies:

  • Machine Learning: Advanced NLP models for multilingual text analysis
  • Deep Learning: Neural networks for pattern recognition and classification
  • Natural Language Processing: Text preprocessing and feature extraction
  • Web Development: Real-time platform for user interaction
  • Data Engineering: Scalable pipelines for large-scale data processing

๐Ÿ“Š Research Impact & Publications

๐Ÿ“ˆ Key Findings:

  • ๐ŸŽฏ Detection Accuracy: Achieved 95%+ accuracy across multiple languages
  • ๐ŸŒ Cross-cultural Insights: Identified unique misinformation patterns by region
  • ๐Ÿ“ฑ Platform Engagement: Over 10,000+ fact-checks performed by users
  • ๐Ÿ”ฌ Academic Impact: Multiple publications in top-tier conferences

๐Ÿ“š Publications & Presentations:

  • Conference papers on multilingual misinformation detection
  • Workshop presentations at international venues
  • Open-access datasets for research community
  • Policy briefs for public health organizations

๐Ÿค Team & Acknowledgements

๐Ÿ›๏ธ Lead Institutions

๐Ÿ‡ฆ๐Ÿ‡บ Deakin University
Primary Research Hub
Melbourne, Australia

๐Ÿ‡ฎ๐Ÿ‡ฉ Universitas Gadjah Mada
Regional Research Partner
Yogyakarta, Indonesia

๐Ÿ™ Special Recognition:

We extend our gratitude to:

  • ๐Ÿ… VESKI - Study Melbourne Research Partnership for funding support
  • ๐ŸŒ International research community for collaboration and expertise
  • โœ… Fact-checking organizations for data and verification support
  • ๐Ÿ“ฑ Social media platforms for API access and partnership
  • ๐Ÿ‘ฅ Workshop participants for valuable feedback and engagement

๐Ÿš€ Getting Started

๐Ÿ”— Quick Links:

  • ๐ŸŒ Explore the Platform: veski.counterinfodemic.org
  • ๐Ÿ“Š Access the Dataset: Contact us for academic use
  • ๐Ÿ› ๏ธ Try the Fact-Checker: Real-time verification tool
  • ๐Ÿ“‹ Join the Community: Participate in surveys and workshops

๐Ÿ“ง Contact & Collaboration:

Interested in collaborating or accessing our resources? We welcome partnerships with:

  • ๐ŸŽ“ Academic researchers
  • ๐Ÿ“ฐ Journalists and media organizations
  • ๐Ÿ›๏ธ Policy makers and health organizations
  • ๐Ÿ’ป Tech developers and AI practitioners

Twitter Follow

Last Updated: June 2025 | Project Status: Active Research Phase


Fighting the Infodemic Through Science ๐Ÿ”ฌ
Supported by VESKI - Study Melbourne Research Partnership

About

A multilingual research initiative detecting, classifying & countering COVID-19 misinformation on social media through AI-driven tools, open datasets and community workshops.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published