Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to retrieve salience of some specific words? #105

Open
CarhoJohn opened this issue Sep 5, 2023 · 1 comment
Open

How to retrieve salience of some specific words? #105

CarhoJohn opened this issue Sep 5, 2023 · 1 comment

Comments

@CarhoJohn
Copy link

CarhoJohn commented Sep 5, 2023

Hi. To obtain the salience map of previous tokens when generating new tokens, we can use the code/function provided in the example code:

output = lm.generate(prompt, generate=1, do_sample=True, attribution=['ig'])
res = output.primary_attributions(attr_method='ig')

However, in this standard method, I can only get the salience map for the (randomly/uncontrollable) generated word.

Is it possible to obtain the salience map for specific word? For example, in the sentence "I have a dog. He is very ...", I'd like to get the salience map for a specific word cute, rather than other words generated by the model.

Thanks very much!

@BiEchi
Copy link
Contributor

BiEchi commented Sep 11, 2023

From my understanding this is not possible unless you do algorithmic optimization (some math). Salience maps is doing backprop from output to embedding. This process is just chain rule, and if you break it you do get specific words, but unless mathematically grounded, your approach fails.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants