Adapter-TST

Code for our EMNLP'23 Findings paper, "Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer".

Steps to run

Download GloVe embeddings for the style classifiers

cd checkpoints_cls

wget https://nlp.stanford.edu/data/glove.840B.300d.zip

unzip glove.840B.300d.zip
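
For orientation, here is a minimal sketch of how GloVe text vectors can be parsed once unzipped. It is not the loader used in `classifier/textcnn.py`; the function name `load_glove` and its parameters are hypothetical. It splits each line from the right, since a few tokens in the 840B file are reported to contain spaces.

```python
import io


def load_glove(lines, dim=300, vocab=None):
    """Parse GloVe text lines of the form `token v1 ... v_dim` into a dict.

    `load_glove` is a hypothetical helper, not part of this repository.
    Splitting from the right keeps tokens that contain spaces intact.
    """
    vectors = {}
    for line in lines:
        parts = line.rstrip("\n").rsplit(" ", dim)
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        word = parts[0]
        if vocab is not None and word not in vocab:
            continue  # keep only words the classifier needs
        vectors[word] = [float(x) for x in parts[1:]]
    return vectors


# Tiny in-memory demo with 3-dimensional vectors instead of the real file.
sample = io.StringIO("good 0.1 0.2 0.3\nbad -0.1 -0.2 -0.3\n. . . 0.0 0.5 1.0\n")
vectors = load_glove(sample, dim=3)
print(sorted(vectors))
```

With the real file, pass an open file handle and a vocabulary set to limit memory use, for example `load_glove(open("checkpoints_cls/glove.840B.300d.txt", encoding="utf-8"), vocab=my_vocab)`. The file name inside the archive is assumed here and `my_vocab` is a placeholder.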

Install the dependencies

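Do not install Hugging Face transformers separately: this repository includes its own copy under src/transformers, and a separately installed version may conflict with it.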

Remember to change the paths in the following files:
```
 1. examples/pytorch/summarization.py L25, L27
 2. PPL_score.py L2
 3. classifier/textcnn.py L9, L18
```

Train the classifier

 # All classifier weights have already been uploaded; to train a new classifier, use the following command:

 CUDA_VISIBLE_DEVICES=0 python classifier/textcnn.py -dataset yelp -num_label 2 -batch_size 8

Train Adapter-TST model

Use T5-large as the base model and train Adapter-TST model for sentiment transfer.

CUDA_VISIBLE_DEVICES=0 python examples/pytorch/summarization/run_summarization.py \
    --model_name_or_path t5-large \
    --do_train \
    --do_eval \
    --do_predict \
    --train_adapter \
    --num_train_epochs 1 \
    --tst_lambda 0.99 \
    --gradient_accumulation_steps 4 \
    --test_file data/datasets/yelpbaseline/test/sentiment_transfer_unsup.json \
    --train_file data/datasets/yelpbaseline/train/sentiment_transfer_unsup.json \
    --validation_file data/datasets/yelpbaseline/test/sentiment_transfer_unsup.json \
    --output_dir trained_models/adapter-tst-yelp-t5/ \
    --overwrite_output_dir \
    --per_device_train_batch_size=32 \
    --per_device_eval_batch_size=32 \
    --text_column sentence \
    --summary_column style_label \
    --evaluation_strategy epoch \
    --predict_with_generate \
    --save_strategy no \
    --tst_task_name sentiment
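
The `--text_column` and `--summary_column` flags name the fields read from each record of the data files. Before a long training run it can help to confirm that every record has both fields. The sketch below assumes the files hold one JSON object per line; that layout, the helper name `missing_columns`, and the demo records are assumptions, not taken from this repository.

```python
import json
import os
import tempfile


def missing_columns(path, columns=("sentence", "style_label")):
    """Return (line_number, missing_keys) for each record lacking a column.

    Hypothetical helper; assumes one JSON object per line.
    """
    problems = []
    with open(path, encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            if not line.strip():
                continue  # ignore blank lines
            record = json.loads(line)
            missing = [c for c in columns if c not in record]
            if missing:
                problems.append((lineno, missing))
    return problems


# Demo on a throwaway file; the sentences and label values are made up.
with tempfile.NamedTemporaryFile(
    "w", suffix=".json", delete=False, encoding="utf-8"
) as tmp:
    tmp.write(json.dumps({"sentence": "the food was great", "style_label": 1}) + "\n")
    tmp.write(json.dumps({"sentence": "the service was slow"}) + "\n")
    demo_path = tmp.name

problems = missing_columns(demo_path)
os.remove(demo_path)
print(problems)
```

An empty result means every record carries both columns; here the second demo record is reported because it has no `style_label`.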

#For Tense-Voice multi-attribute transfer

CUDA_VISIBLE_DEVICES=0 python examples/pytorch/summarization/run_summarization.py \
    --model_name_or_path t5-large \
    --do_train \
    --do_eval \
    --do_predict \
    --train_adapter \
    --num_train_epochs 1 \
    --tst_lambda 0.9 \
    --tst_lambda2 0.97 \
    --gradient_accumulation_steps 4 \
    --test_file data/datasets/StylePTB/adapterTST/tense_voice/test/style_transfer_unsup.json \
    --train_file data/datasets/StylePTB/adapterTST/tense_voice/train/style_transfer_unsup.json \
    --validation_file data/datasets/StylePTB/adapterTST/tense_voice/test/style_transfer_unsup.json \
    --output_dir trained_models/adapter-tst-tense-voice-t5/ \
    --overwrite_output_dir \
    --per_device_train_batch_size=16 \
    --per_device_eval_batch_size=16 \
    --text_column sentence \
    --summary_column style_label \
    --evaluation_strategy epoch \
    --predict_with_generate \
    --save_strategy no \
    --num_label_cls1 3 \
    --num_label_cls2 2 \
    --tst_task_name tense_voice

#For tense_adjadv_removal training

CUDA_VISIBLE_DEVICES=1 python examples/pytorch/summarization/run_summarization.py \
    --model_name_or_path t5-large \
    --do_train \
    --do_eval \
    --do_predict \
    --train_adapter \
    --num_train_epochs 1 \
    --tst_lambda 0.9 \
    --tst_lambda2 0.98 \
    --gradient_accumulation_steps 4 \
    --test_file data/datasets/StylePTB/adapterTST/tense_adjadv_removal/test/style_transfer_unsup.json \
    --train_file data/datasets/StylePTB/adapterTST/tense_adjadv_removal/train/style_transfer_unsup.json \
    --validation_file data/datasets/StylePTB/adapterTST/tense_adjadv_removal/test/style_transfer_unsup.json \
    --output_dir trained_models/adapter-tst-tense-adjadv-removal-t5/ \
    --overwrite_output_dir \
    --per_device_train_batch_size=16 \
    --per_device_eval_batch_size=16 \
    --text_column sentence \
    --summary_column style_label \
    --evaluation_strategy epoch \
    --predict_with_generate \
    --save_strategy no \
    --num_label_cls1 3 \
    --num_label_cls2 2 \
    --tst_task_name tense_adjadv_removal

#For tense_pp_removal training

CUDA_VISIBLE_DEVICES=1 python examples/pytorch/summarization/run_summarization.py \
    --model_name_or_path t5-large \
    --do_train \
    --do_eval \
    --do_predict \
    --train_adapter \
    --num_train_epochs 1 \
    --tst_lambda 0.9 \
    --tst_lambda2 0.99 \
    --gradient_accumulation_steps 4 \
    --test_file data/datasets/StylePTB/adapterTST/tense_pp_removal/test/style_transfer_unsup.json \
    --train_file data/datasets/StylePTB/adapterTST/tense_pp_removal/train/style_transfer_unsup.json \
    --validation_file data/datasets/StylePTB/adapterTST/tense_pp_removal/test/style_transfer_unsup.json \
    --output_dir trained_models/adapter-tst-tense-pp-removal-t5/ \
    --overwrite_output_dir \
    --per_device_train_batch_size=16 \
    --per_device_eval_batch_size=16 \
    --text_column sentence \
    --summary_column style_label \
    --evaluation_strategy epoch \
    --predict_with_generate \
    --save_strategy no \
    --num_label_cls1 3 \
    --num_label_cls2 2 \
    --tst_task_name tense_pp_removal

#For tense_pp_front_back training

CUDA_VISIBLE_DEVICES=1 python examples/pytorch/summarization/run_summarization.py \
    --model_name_or_path t5-large \
    --do_train \
    --do_eval \
    --do_predict \
    --train_adapter \
    --num_train_epochs 1 \
    --tst_lambda 0.9 \
    --tst_lambda2 0.99 \
    --gradient_accumulation_steps 4 \
    --test_file data/datasets/StylePTB/adapterTST/tense_pp_front_back/test/style_transfer_unsup.json \
    --train_file data/datasets/StylePTB/adapterTST/tense_pp_front_back/train/style_transfer_unsup.json \
    --validation_file data/datasets/StylePTB/adapterTST/tense_pp_front_back/test/style_transfer_unsup.json \
    --output_dir trained_models/adapter-tst-tense-pp-front-back-t5/ \
    --overwrite_output_dir \
    --per_device_train_batch_size=32 \
    --per_device_eval_batch_size=32 \
    --text_column sentence \
    --summary_column style_label \
    --evaluation_strategy epoch \
    --predict_with_generate \
    --save_strategy no \
    --num_label_cls1 3 \
    --num_label_cls2 2 \
    --tst_task_name tense_pp_front_back

Tips: You can adjust --tst_lambda and --tst_lambda2 to balance the transfer accuracy and the content preservation.
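
One way to explore this trade-off is to generate one training command per pair of lambda values and run them in turn. The sketch below only builds the command strings. The grid values, the `trained_models/sweep` output layout, and the helper name `sweep_commands` are illustrative assumptions, and the base command is abbreviated, so the remaining flags from the commands above still need to be added.

```python
import itertools
import shlex

# Abbreviated base command: append the data, batch-size and label flags
# from the full training commands before running anything.
BASE = [
    "python", "examples/pytorch/summarization/run_summarization.py",
    "--model_name_or_path", "t5-large",
    "--do_train", "--do_eval", "--do_predict", "--train_adapter",
    "--tst_task_name", "tense_voice",
]


def sweep_commands(lambdas1, lambdas2, out_root="trained_models/sweep"):
    """Yield one shell command per (tst_lambda, tst_lambda2) pair.

    Hypothetical helper; each run gets its own output directory.
    """
    for l1, l2 in itertools.product(lambdas1, lambdas2):
        out_dir = f"{out_root}/l1_{l1}_l2_{l2}/"
        yield shlex.join(BASE + [
            "--tst_lambda", str(l1),
            "--tst_lambda2", str(l2),
            "--output_dir", out_dir,
        ])


commands = list(sweep_commands([0.9, 0.95], [0.97, 0.99]))
for cmd in commands:
    print(cmd)
```

Writing each run to a separate output directory keeps the generated predictions of different settings apart, so they can be evaluated one by one.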

Evaluate the performance

Accuracy

#yelp
CUDA_VISIBLE_DEVICES=1 python classifier/textcnn.py -dataset yelp -num_label 2 -test_only True -gen_path  trained_models/adapter-tst-yelp-t5/generated_predictions.json

# tense_voice
python classifier/textcnn.py -dataset tense_voice -style tense -num_label 3 -test_only True -gen_path  trained_models/adapter-tst-tense-voice-t5/generated_predictions_comp_1.json 

python classifier/textcnn.py -dataset tense_voice -style voice -num_label 2 -test_only True -gen_path  trained_models/adapter-tst-tense-voice-t5/generated_predictions2_comp_1.json

#tense_adjadv_removal
python classifier/textcnn.py -dataset tense_adjadv_removal -style tense -num_label 3 -test_only True -gen_path  trained_models/adapter-tst-tense-adjadv-removal-t5/generated_predictions_comp_1.json

python classifier/textcnn.py -dataset tense_adjadv_removal -style adjadv_removal -num_label 2 -test_only True -gen_path  trained_models/adapter-tst-tense-adjadv-removal-t5/generated_predictions2_comp_1.json

#tense_pp_front_back
python classifier/textcnn.py -dataset tense_pp_front_back -style tense -num_label 3 -test_only True -gen_path  trained_models/adapter-tst-tense-pp-front-back-t5/generated_predictions_comp_1.json

python classifier/textcnn.py -dataset tense_pp_front_back -style pp -num_label 2 -test_only True -gen_path  trained_models/adapter-tst-tense-pp-front-back-t5/generated_predictions2_comp_1.json

#tense_pp_removal
python classifier/textcnn.py -dataset tense_pp_removal -style tense -num_label 3 -test_only True -gen_path  trained_models/adapter-tst-tense-pp-removal-t5/generated_predictions.json

python classifier/textcnn.py -dataset tense_pp_removal -style pp -num_label 2 -test_only True -gen_path  trained_models/adapter-tst-tense-pp-removal-t5/generated_predictions2.json
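
For reference, transfer accuracy in text style transfer is usually the fraction of generated sentences that the style classifier assigns to the target style. The sketch below shows that definition on made-up labels; it is assumed, not confirmed, that `classifier/textcnn.py` reports the same quantity, and `transfer_accuracy` is a hypothetical name.

```python
def transfer_accuracy(predicted_labels, target_labels):
    """Fraction of outputs whose predicted style equals the target style."""
    if len(predicted_labels) != len(target_labels):
        raise ValueError("predicted and target label lists differ in length")
    if not target_labels:
        return 0.0
    correct = sum(p == t for p, t in zip(predicted_labels, target_labels))
    return correct / len(target_labels)


# Made-up classifier outputs and target styles for four generated sentences.
predicted = [1, 0, 1, 1]
target = [1, 1, 1, 0]
print(transfer_accuracy(predicted, target))  # 0.5
```

For the multi-attribute tasks, the same computation would be applied once per attribute, matching the two classifier runs per dataset above.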

BERTScore

python BERTscore_evaluator.py \
    --ref-file-path trained_models/adapter-tst-tense-adjadv-removal-t5/reference_comp_1.txt \
    --gen-file-path trained_models/adapter-tst-tense-adjadv-removal-t5/generated_predictions_comp_1.txt

PPL

python PPL_score.py

Remember to change the file path in the script.
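
As a reminder of what the metric measures, perplexity is the exponential of the average negative log-likelihood per token, so lower values indicate more fluent text under the scoring language model. The sketch below computes it from a list of token log-probabilities. Which language model `PPL_score.py` uses is not stated here, so the input values are made up and `perplexity` is a hypothetical helper.

```python
import math


def perplexity(token_log_probs):
    """exp of the mean negative natural-log probability per token."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_nll)


# Four tokens, each assigned probability 0.25 by a made-up language model.
log_probs = [math.log(0.25)] * 4
print(perplexity(log_probs))  # approximately 4
```

A model that is uniformly unsure among four choices at every step has a perplexity of about 4, which is why the metric is often read as an effective branching factor.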

Citing

If you use Adapter-TST in your publication, please cite it by using the following BibTeX entry.

@article{hu2023adapter,
  title={Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer},
  author={Hu, Zhiqiang and Lee, Roy Ka-Wei and Chen, Nancy F},
  journal={arXiv preprint arXiv:2305.05945},
  year={2023}
}

Acknowledgement

This repo benefits from Adapter-Transformer. Thanks to its authors for their wonderful work.
