ChatGPT-Detection-PR-HPPT

This our dataset and code for the paper: Is ChatGPT Involved in Texts? Measure the Polish Ratio to Detect ChatGPT-Generated Text

Guideline

The detector is a Roberta for classification model with labels (0: human, 1:ChatGPT-involved).

If you want to train it, follow these steps:

install the environment

pip install -r requirements.txt

train a detector

cd Detector

python train.py

It is also all right for you to change some setting in the code.

get the detector

The best_model.pt is the trained detector.

You can test the custom sample in text_test.txt (only three examples in it):

python inference.py

If you do not want to train the model, we provide our trained detector on HPPT: Trained Detector on Google driver and Trained Detector on Huggingface.

train a model to get polish ratio

cd ../PR_reg

python train.py

We also provide the trained PR model: Trained PR model

Citation

You are welcome to use our dataset and models. For citation following BibTex entry:

@article{yang2023chatgpt,
  title={Is chatgpt involved in texts? measure the polish ratio to detect chatgpt-generated text},
  author={Yang, Lingyi and Jiang, Feng and Li, Haizhou},
  journal={APSIPA Transactions on Signal and Information Processing},
  volume={13},
  number={2},
  publisher={Now Publishers, Inc.}
}

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
API_deployment		API_deployment
Dataset		Dataset
Detector		Detector
PR_reg		PR_reg
README.md		README.md
cal_distance.py		cal_distance.py
image.png		image.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChatGPT-Detection-PR-HPPT

Guideline

Citation

About

Contributors 2

Languages

FreedomIntelligence/ChatGPT-Detection-PR-HPPT

Folders and files

Latest commit

History

Repository files navigation

ChatGPT-Detection-PR-HPPT

Guideline

Citation

About

Topics

Resources

Stars

Watchers

Forks

Contributors 2

Languages