Question: Can I turn off the teacher (question generation)? #630

Open
TomasHofman opened this issue Mar 25, 2024 · 1 comment
Labels: stale

Comments

@TomasHofman

What if I have a skill where it's easy to generate synthetic data in an algorithmic way (so I can generate 100 questions easily myself), and I'm not satisfied with the questions generated by the teacher model?

Can I turn it off?

Or is there a way for me to influence the teacher model in some way?

Example (Semantic version skill):

In the qna.yaml, I have questions like:

Sort the following version strings from lowest to highest: 1.2.3, 2.2.1, 1.0.1
Which of these versions belong to major version 1: 2.1.1, 1.2.0, 11.1.0

The generated questions are sometimes way off, like:

Determine the highest peak in the world for each continent.

  • Africa: Kilimanjaro (5,895 m)
  • Antarctica: Vinson Massif (4,892 m)

Other generated questions are about basic arithmetic.
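The algorithmic generation described at the top of this comment could look roughly like the sketch below. It is only an illustration under stated assumptions: the helper names (`random_version`, `version_key`, `make_sort_example`) and the `question`/`answer` dictionary keys are hypothetical, and nothing here shows how such data would be fed into the project or whether the teacher can be bypassed.

```python
import random


def random_version(rng, max_part=12):
    """Return a random MAJOR.MINOR.PATCH string (no pre-release or build tags)."""
    return ".".join(str(rng.randint(0, max_part)) for _ in range(3))


def version_key(version):
    """Turn '1.10.0' into (1, 10, 0) so versions compare numerically, not as text."""
    return tuple(int(part) for part in version.split("."))


def make_sort_example(rng, count=3):
    """Build one question/answer pair for the 'sort versions' task."""
    versions = []
    while len(versions) < count:
        candidate = random_version(rng)
        if candidate not in versions:  # avoid duplicate versions in one question
            versions.append(candidate)
    ordered = sorted(versions, key=version_key)
    return {
        "question": "Sort the following version strings from lowest to highest: "
        + ", ".join(versions),
        "answer": ", ".join(ordered),
    }


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the generated set is reproducible
    for example in (make_sort_example(rng) for _ in range(3)):
        print(example["question"])
        print(example["answer"])
```

Calling `make_sort_example` in a loop 100 times yields the 100 pairs mentioned above. Because the answers are computed rather than written by a model, they are correct by construction.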

@bjhargrave
Contributor

I would expect this kind of behavior to be hard for LLMs, since they are text-based rather than math-oriented. So even if you could generate a lot of Q/A pairs with correct information, I don't think the LLM would ever learn to sort by number reliably.

The "Which of these versions belong to major version" skill is more feasible, since it is closer to text extraction than to math.
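To make the distinction between the two tasks concrete, here is a small sketch of what a program has to do for each. It illustrates the tasks themselves, not what an LLM does internally; the `parse` helper and the sample list are assumptions added for illustration (the list extends the versions from the issue with `1.10.0`).

```python
versions = ["1.2.3", "2.2.1", "1.0.1", "1.10.0", "11.1.0"]


def parse(version):
    """Split 'MAJOR.MINOR.PATCH' into a tuple of integers."""
    return tuple(int(part) for part in version.split("."))


# Sorting as plain text compares character by character, so "1.10.0" lands before "1.2.3".
lexical = sorted(versions)

# Sorting correctly requires comparing each component as a number.
numeric = sorted(versions, key=parse)

# The major-version question only needs the first component to be extracted...
major_1 = [v for v in versions if parse(v)[0] == 1]

# ...although a naive text prefix match would wrongly accept "11.1.0".
prefix_1 = [v for v in versions if v.startswith("1")]

print(lexical)   # ['1.0.1', '1.10.0', '1.2.3', '11.1.0', '2.2.1']
print(numeric)   # ['1.0.1', '1.2.3', '1.10.0', '2.2.1', '11.1.0']
print(major_1)   # ['1.2.3', '1.0.1', '1.10.0']
print(prefix_1)  # ['1.2.3', '1.0.1', '1.10.0', '11.1.0']
```

The sorting task depends on multi-digit numeric comparison at every position, while the major-version task reduces to reading everything before the first dot.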

github-actions bot added the stale label on May 9, 2024