Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

missing language instructions to covert mobile aloha dataset #73

Open
fayefw opened this issue Jan 20, 2025 · 1 comment
Open

missing language instructions to covert mobile aloha dataset #73

fayefw opened this issue Jan 20, 2025 · 1 comment

Comments

@fayefw
Copy link

fayefw commented Jan 20, 2025

Hi, thank you for your great work! I tried to covert the mobile aloha dataset from hdf5 to tfrecords but the output was empty. I inspected the hdf5 file and found there is no instruction in it. could you tell me how can I find or set the dataset with instructions? thanks a lot!

@thkkk
Copy link

thkkk commented Jan 24, 2025

You can check the expanded_instruction_gpt-4-turbo.json in the dataset, which is the instructions of the tasks. During training, one of the three prompts will be uniformly randomly sampled. If the long prompt is selected, the algorithm will also uniformly randomly select a prompt from the long prompt.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants