Adding MedQA and general QandA finetuning stuff #9

lurosenb · 2023-11-26T04:56:26Z

Working on the Privacy issue #8 , need to start by adding a reasonable QandA from medical domain. Added MedQA here: https://huggingface.co/datasets/lurosenb/medqa .

Added an evaluate_qanda.py script with some QandA specific functions. Piggybacked off of @BeanHam 's finetune_summarization file, as it was my starting point and there was lots of overlap in the implementation. I vote we rename "finetune_summarization" to just "finetune_runner" and use it to run any task (with proper command line customization, as demonstrated.

Still need to improve on the QandA metrics (for MedQA, multiple choice means we should have an accuracy score with better answer parsing, not just the SQUAD style f1 score. I added some work to that end but its incomplete).

Also, didn't run tests beyond finetuning Llama. trying not to get too distracted by experiments, as my goal is to move on quickly to the privacy finetuning task which is non trivial.

Also added a readme which catalogued the process of getting my task going. Hopefully it's useful for someone!

Adding MedQA and general QandA finetuning stuff

947adf5

lurosenb requested review from BeanHam and wolferobert3 November 26, 2023 04:56

lurosenb self-assigned this Nov 26, 2023

wolferobert3 approved these changes Nov 26, 2023

View reviewed changes

wolferobert3 merged commit 9e727e0 into main Nov 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adding MedQA and general QandA finetuning stuff #9

Adding MedQA and general QandA finetuning stuff #9

Uh oh!

lurosenb commented Nov 26, 2023

Uh oh!

Uh oh!

Adding MedQA and general QandA finetuning stuff #9

Adding MedQA and general QandA finetuning stuff #9

Uh oh!

Conversation

lurosenb commented Nov 26, 2023

Uh oh!

Uh oh!