GitHub - zqw5219/Coding-Plagiarism-and-Ghostwriting-Detection-System

Coding Plagiarism and Ghostwriting Detection System

The project follows this directory structure:

|--project_name                           <- Project root level that is checked into GitHub
  |--project                              <- Project folder
    |--README.md                          <- Top-level README for developers
    |--volume
    |   |--data
    |   |   |--external                   <- Data from third party sources
    |   |   |--interim                    <- Intermediate data that has been transformed
    |   |   |--processed                  <- The final model-ready data
    |   |   |--raw                        <- The original data dump
    |   |
    |   |--models                         <- Trained model files that can be read into R or Python
    |
    |--required
    |   |--requirements.txt               <- The required libraries for reproducing the Python environment
    |
    |
    |--src
    |   |--development                    <- Unformatted scripts for testing and development purposes
    |   |   |--remove_template.py
    |   |   |--similarity.ipynb
    |   |   |--string_similarity_check.py
    |   |--features                       <- Scripts for turning raw and external data into model-ready data
    |   |   |--utils                      <- Scripts for pycparser features such as preprocessing
    |   |   |--authorizerPackage.py       <- Functions for all features needed
    |   |   |--codeAndCommentSparator.py
    |   |   |--codeElementExtractor.py
    |   |   |--dataReader.py              
    |   |   |--main.py                    <- Script that contains the instructions for this project
    |   |   |--pycparserApplication.py
    |   |   |--randomForestClassifier.py  <- Script that trains a random forest classifier on the extracted features
    |   |   |--similarityScoreByLine.py   <- Script that extracts features based on similarity by line
    |   |   |--similarityScoreByWord.py   <- Script that extracts features based on similarity by word
    |   |   |--structureExtractor.py      <- Script that extracts features based on structure
    |   |   |--testing.py                 <- Script reserved for testing and development purposes
    |   |
    |   |--models                         <- Scripts for training and saving models
    |   |   |--.gitkeep
    |   |
    |
    |
    |
    |--.gitignore                         <- List of files not to sync with GitHub
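
The tree above lists feature scripts for similarity by line and similarity by word. As a rough illustration of what such features could look like, here is a minimal sketch. It is not the repository's code: the function names `line_similarity` and `word_similarity` are hypothetical, and the choice of `difflib.SequenceMatcher` for lines and Jaccard overlap for words is an assumption made for this example.

```python
from difflib import SequenceMatcher


def line_similarity(source_a: str, source_b: str) -> float:
    """Hypothetical line-level score in [0, 1].

    Compares the two sources as ordered sequences of stripped,
    non-blank lines using difflib's matching-block ratio.
    """
    lines_a = [ln.strip() for ln in source_a.splitlines() if ln.strip()]
    lines_b = [ln.strip() for ln in source_b.splitlines() if ln.strip()]
    if not lines_a and not lines_b:
        return 1.0  # two empty submissions are treated as identical
    return SequenceMatcher(None, lines_a, lines_b, autojunk=False).ratio()


def word_similarity(source_a: str, source_b: str) -> float:
    """Hypothetical word-level score in [0, 1].

    Jaccard overlap of the sets of whitespace-separated tokens.
    """
    words_a = set(source_a.split())
    words_b = set(source_b.split())
    if not words_a and not words_b:
        return 1.0
    return len(words_a & words_b) / len(words_a | words_b)


if __name__ == "__main__":
    submission_a = "int x = 1;\nreturn x;"
    submission_b = "int y = 1;\nreturn x;"
    print(line_similarity(submission_a, submission_b))
    print(word_similarity(submission_a, submission_b))
```

Scores like these could serve as input columns for a classifier such as the one `randomForestClassifier.py` is described as training, though how the project actually combines them is not stated here.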
