I observed in the training output logs that the captions did not always include the autocaption_prefix or autocaption_suffix. This is important because these affixes contain the unique trigger token.
After looking at caption.py I noticed this block:
if autocaption_prefix:
inp += f"\n\nYou must start the caption with '{autocaption_prefix}'. "
if autocaption_suffix:
inp += f"\n\nYou must end the caption with '{autocaption_suffix}'."
Instead of relying on the LLM to add these, which it clearly fails to do consistently, I suggest appending them to the decoded output manually, for example:
output = self.tokenizer.batch_decode(output_ids, skip_special_tokens=True)[0].strip()
if autocaption_prefix:
output = f"{autocaption_prefix} {output}"
if autocaption_suffix:
output = f"{output} {autocaption_suffix}"
print(f"Caption for {image_path}: {output}")
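One caveat with plain concatenation: when the LLM *does* follow the prompt instructions, the prefix/suffix would end up duplicated. A minimal sketch of a guard against that (the helper name `apply_affixes` is hypothetical, not part of caption.py):

```python
def apply_affixes(caption: str, prefix: str = "", suffix: str = "") -> str:
    """Deterministically wrap a caption with the trigger prefix/suffix,
    skipping each affix if the LLM already emitted it."""
    caption = caption.strip()
    if prefix and not caption.startswith(prefix):
        caption = f"{prefix} {caption}"
    if suffix and not caption.endswith(suffix):
        caption = f"{caption} {suffix}"
    return caption
```

This way the trigger token is guaranteed to appear exactly once at each end, regardless of whether the model honored the prompt.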