
More robust CounterfactualGenerator #212

Open

dskarbrevik wants to merge 20 commits into cvs-health:release-branch/v0.7.2 from dskarbrevik:develop

Conversation

@dskarbrevik
Contributor

@dskarbrevik commented Sep 24, 2025

Description

  • Improves the boundary detection used when parsing prompts, allowing better identification of protected attributes. For example, "caucasian student" was previously parsed as containing "asian student" because the substring "asian" inside "caucasian" was matched.
  • Adds an "llm_ftu" parameter to CounterfactualGenerator.generate_responses() that lets you pass a langchain BaseChatModel to perform the FTU check, giving a more robust protected-attribute detection mechanism (see the usage sketch after this list).
  • Adds retry logic for cases where the LLM response format is incorrect.
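Below is a minimal usage sketch of the new option. Only the llm_ftu parameter comes from this PR's description; the constructor argument, model choice, and async calling convention are assumptions based on how LangFair generators are typically used and may differ from the final implementation.

```python
# Hedged sketch only: llm_ftu is the new parameter described above; everything
# else (model, constructor args, awaiting the call) is assumed, not verified.
import asyncio
from langchain_openai import ChatOpenAI
from langfair.generator import CounterfactualGenerator

async def main():
    llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)  # any BaseChatModel
    cdg = CounterfactualGenerator(langchain_llm=llm)

    responses = await cdg.generate_responses(
        prompts=["The caucasian man was looking at a tree."],
        attribute="race",
        llm_ftu=llm,  # new in this PR: LLM-based FTU check instead of word lists only
    )
    print(responses)

asyncio.run(main())
```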

Contributor License Agreement

Tests

  • no new tests required
  • new tests added
  • existing tests adjusted

Documentation

  • no documentation changes needed
  • README updated
  • API docs added or updated
  • example notebook added or updated

Screenshots

Improved parsing for word list counterfactuals

Here's an example prompt that previously had parsing issues:
"The caucasian man was looking at a tree." (previously this got hits for both "caucasian" and "asian man" due to improper boundary handling).

Now:
(screenshot: corrected parsing of the prompt)
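The underlying issue can be illustrated with plain word-boundary matching. This is only an illustration of the boundary-detection idea, not the library's actual parsing code:

```python
import re

prompt = "The caucasian man was looking at a tree."

# Naive substring matching finds "asian" inside "caucasian" (the old false positive):
print("asian" in prompt.lower())                          # True

# Matching on word boundaries avoids that false positive:
print(bool(re.search(r"\basian\b", prompt.lower())))      # False
print(bool(re.search(r"\bcaucasian\b", prompt.lower())))  # True
```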

LLM based counterfactuals

Here's an example prompt with multiple race attributes mentioned:
"The asian car mechanic is fixing the white car of that white guy and that other black guy."

We can see that the word-list-based approach catches "white guy" and "black guy" but isn't robust enough to catch "asian mechanic":
(screenshot: word-list-based detection results)

With the LLM-based approach, we get all race attribute mentions for this case:
(screenshot: LLM-based detection results)
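For reference, LLM-based detection with retry-on-bad-format could look roughly like the sketch below. The prompt wording, helper name, and retry limit are illustrative assumptions, not the PR's actual implementation.

```python
# Hedged sketch of LLM-based attribute detection with retry on malformed output.
import json
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)

def find_race_mentions(prompt: str, max_retries: int = 3) -> list[str]:
    instruction = (
        "List every race-related mention in the following text as a JSON "
        f"array of strings. Return only the JSON array.\n\nText: {prompt}"
    )
    for _ in range(max_retries):
        reply = llm.invoke(instruction).content
        try:
            mentions = json.loads(reply)
            if isinstance(mentions, list):
                return mentions
        except json.JSONDecodeError:
            continue  # malformed response: retry
    return []

print(find_race_mentions(
    "The asian car mechanic is fixing the white car of that white guy "
    "and that other black guy."
))
# e.g. ["asian car mechanic", "white guy", "black guy"]
```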

...

Here's a test with a bigger group of prompts:
(screenshot: larger set of test prompts)

Here are the terms picked up:
(screenshot: terms picked up)

And finally, here are the generated counterfactuals from this set of prompts:
(screenshot: generated counterfactual prompts)

Two interesting things to point out from the above prompts:

  1. For the prompt: "That guy is white and she is black and that person over there is asian. They all drive white cars."
    We see the asian_prompt (counterfactual generation):
    "That guy is asian and she is asian and that person over there is asian. They all drive white cars."

So you can see that the LLM correctly found all of the race words, avoided the car-color reference, and substituted them all appropriately.

  2. For the prompt: "The black| flight attendant was looking at a tree."
    We see that the LLM was able to handle this odd punctuation (the | character) and correctly generate counterfactuals.

@dskarbrevik dskarbrevik marked this pull request as ready for review September 24, 2025 22:29
@dylanbouchard dylanbouchard linked an issue Sep 27, 2025 that may be closed by this pull request
@dylanbouchard dylanbouchard changed the base branch from develop to release-branch/v0.7.2 September 30, 2025 20:02

Development

Successfully merging this pull request may close these issues.

Improve robustness of CounterfactualGenerator
