SCUT-DLVCLab

Deep Learning and Vision Computing Lab, South China University of Technology

About Us 🚀

The Deep Learning and Vision Computing Lab is dedicated to theoretical research and innovative applications in artificial intelligence, computer vision, machine learning, and pattern recognition. Our current research focuses on deep learning, text detection and recognition, and document analysis and understanding. In recent years, our team has led more than 30 national and provincial research projects, with significant results in optical character recognition (OCR), handwriting recognition, gesture recognition and interaction technology, and applications of deep learning. We have published over 300 SCI/EI papers, been granted more than 50 invention patents, won 5 provincial and ministerial science and technology awards, and placed first in international academic competitions 4 times.

Pinned

  1. GPT-4V_OCR Public

    Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

    Python, 121 stars, 4 forks

  2. Document-AI-Recommendations Public

    Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

    171 stars, 4 forks

  3. SCUT-EnsExam Public

    SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper images. The dataset is randomly divided into training set and …

    8 stars

  4. RFUND Public

    [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

    17 stars

Repositories

Showing 10 of 13 repositories
  • Document-AI-Recommendations Public

    Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

    171 stars, 4 forks, 0 issues, 0 pull requests. Updated Dec 9, 2024
  • RFUND Public

    [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

    17 stars, 0 forks, 0 issues, 0 pull requests. Updated Dec 4, 2024
  • DOLPHIN Public

    Official repository of "Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach", IEEE TIFS 2024.

    Python, 0 stars, GPL-3.0 license, 0 forks, 0 issues, 0 pull requests. Updated Nov 11, 2024
  • DCOH-120K Public
    0 stars, GPL-3.0 license, 0 forks, 0 issues, 0 pull requests. Updated Nov 7, 2024
  • PAVENet Public
    0 stars, GPL-3.0 license, 0 forks, 0 issues, 0 pull requests. Updated Oct 29, 2024
  • TongGu-LLM Public

    [EMNLP 2024] TongGu, a classical Chinese language model.

    13 stars, 0 forks, 1 issue, 0 pull requests. Updated Sep 28, 2024
  • WenMind Public

    WenMind benchmark.

    Python, 5 stars, 0 forks, 0 issues, 0 pull requests. Updated Sep 26, 2024
  • HisDoc1B Public
    1 star, 0 forks, 0 issues, 0 pull requests. Updated Jul 17, 2024
  • .github Public
    0 stars, 0 forks, 0 issues, 0 pull requests. Updated Jun 4, 2024
  • C3bench Public

    C3 benchmark

    2 stars, 0 forks, 1 issue, 0 pull requests. Updated May 27, 2024