Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Permanent DAPT tutorial bug fix #505

Open
sarahyurick opened this issue Jan 31, 2025 · 0 comments
Open

Permanent DAPT tutorial bug fix #505

sarahyurick opened this issue Jan 31, 2025 · 0 comments
Assignees
Labels
bug Something isn't working

Comments

@sarahyurick
Copy link
Collaborator

In #503, I fixed a semantic deduplication bug that was happening in the DAPT tutorial by avoiding writing the embeddings to Parquet files. However, we should implement a more permanent fix.

This is the error that would pop up when doing write_to_disk: rapidsai/cudf#10115.

One way to fix this could be to convert any categorical columns into strings before writing to disk. I have not had a chance to test this yet, though.

@sarahyurick sarahyurick added the bug Something isn't working label Jan 31, 2025
@sarahyurick sarahyurick self-assigned this Jan 31, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant