Adding MedQA and general QandA finetuning stuff #9

Merged
merged 1 commit into from
Nov 26, 2023

lurosenb

Working on the privacy issue #8. I need to start by adding a reasonable QandA dataset from the medical domain. Added MedQA here: https://huggingface.co/datasets/lurosenb/medqa

Added an evaluate_qanda.py script with some QandA-specific functions. Piggybacked off of @BeanHam's finetune_summarization file, since it was my starting point and there was a lot of overlap in the implementation. I vote we rename "finetune_summarization" to just "finetune_runner" and use it to run any task (with proper command-line customization, as demonstrated).
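
To make the rename proposal concrete, here is a minimal sketch of what a task-agnostic runner's command line could look like. The flag names (`--task`, `--dataset`, `--model`) and the task list are assumptions for illustration only; they are not necessarily the options the script in this PR exposes.

```python
import argparse

# Hypothetical task names; the real runner may use different ones.
SUPPORTED_TASKS = ("summarization", "qanda")


def build_parser():
    """Build an argument parser for a generic, task-agnostic finetuning runner."""
    parser = argparse.ArgumentParser(
        description="Run finetuning and evaluation for a chosen task (illustrative sketch)."
    )
    # Which task-specific preprocessing and metrics to use.
    parser.add_argument("--task", choices=SUPPORTED_TASKS, required=True)
    # Hugging Face dataset identifier, e.g. lurosenb/medqa.
    parser.add_argument("--dataset", required=True)
    # Model identifier; left without a default so nothing is assumed here.
    parser.add_argument("--model", default=None)
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(f"task={args.task} dataset={args.dataset} model={args.model}")
```

With a single `--task` switch, the summarization and QandA paths can share one entry point and only differ in the prompt formatting and metric functions they dispatch to.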

Still need to improve the QandA metrics. MedQA is multiple choice, so we should report an accuracy score with better answer parsing, not just the SQuAD-style F1 score. I added some work to that end, but it's incomplete.
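
For reference, the two metrics discussed above could be sketched roughly as follows. This is not the code in evaluate_qanda.py: the function names, the A-E option letters, and the answer-parsing regexes are all assumptions, and the parser only handles a few common output shapes ("The answer is B", "Answer: (C)", or a generation that starts with "D)").

```python
import re
import string
from collections import Counter


def normalize(text):
    """SQuAD-style normalization: lowercase, drop punctuation and articles, squeeze spaces."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def token_f1(prediction, reference):
    """Token-overlap F1 between a predicted and a reference answer string."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def parse_choice(generation, letters="ABCDE"):
    """Extract an option letter from a generation, or None if nothing parses."""
    # Case 1: an explicit "answer is X" / "Answer: (X)" statement anywhere in the text.
    match = re.search(rf"(?i:answer(?:\s+is)?)\s*:?\s*\(?([{letters}])\b", generation)
    if match:
        return match.group(1)
    # Case 2: the generation starts with the letter, e.g. "D) ...", "(D)", "D." or just "D".
    match = re.match(rf"\s*\(?([{letters}])\b(?:[\).:]|\s*$)", generation)
    return match.group(1) if match else None


def multiple_choice_accuracy(generations, gold_letters):
    """Fraction of generations whose parsed letter equals the gold letter."""
    if not gold_letters:
        return 0.0
    correct = sum(parse_choice(g) == gold for g, gold in zip(generations, gold_letters))
    return correct / len(gold_letters)
```

Unparseable generations count as wrong here, which keeps the accuracy conservative; tracking the unparsed rate separately would show how much of the error comes from parsing rather than from the model.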

Also, I didn't run tests beyond finetuning Llama. I'm trying not to get too distracted by experiments, as my goal is to move on quickly to the privacy finetuning task, which is nontrivial.

Also added a README that catalogues the process of getting my task going. Hopefully it's useful for someone!

@lurosenb lurosenb self-assigned this Nov 26, 2023
@wolferobert3 wolferobert3 merged commit 9e727e0 into main Nov 26, 2023
2 participants