You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm running triage over [Red Hat 11.3.1-4] on Linux, Python 3.10.6, and using the v8 triage version. My database is in PostgreSQL 15.7 on x86_64-pc-linux-gnu, compiled by GCC (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0, 64-bit.
All the presented code worked fine until I used the feature_group_strategies in leave-one-out or leave-one-in. In both cases, I get the same error (detailed below). However, when I use feature_group_strategies: ['all-combinations'], it works, but it doesn't group the variables as expected, and I get results as if I were using all.
Everything runs smoothly until the matrix building step, where I encounter this error:
2024-09-08 15:17:14 - ERROR Child error
Traceback (most recent call last):
File "/Ccd/-pyenv/versions/tri-hp/lib/python3.10/site-packages/triage/experiments/multicore.py", line 166, in run_task_with_splatted_arguments return task_runner(**task)
File "/Ccd/pyenv/versions/tri-hp/lib/python3.10/site-packages/triage/component/architect/builders.py", line 321, in build_matrix
output, labels = self.stitch_csvs(feature_queries, label_query, matrix_store, matrix_uuid)
File "/Ccd/pyenv/versions/tri-hp/lib/python3.10/site-packages/triage/component/architect/builders.py", line 551, in stitch_csvs
if len(df_pl.get_column('as_of_date').head(1)[0].split)) > 1:
File "/Ccd/.pyenv/versions/tri-hp/lib/python3.10/site-packages/polars/dataframe/frame.py", line 6128, in get_column return self[name]
exceptions.ColumnNotFoundError: as_of_date
It seems like the as_of_date column is missing or not properly generated during matrix building, specifically when using the leave-one-out or leave-one-in strategies.
I expected the leave-one-out strategy to group variables accordingly and generate matrices without this error, but instead, the process halts when it reaches matrix building. I checked the matrices generated in the process and confirmed that the as_of_date column is indeed present.
My questions would be:
Is this a known issue with these feature grouping strategies?
Could this be related to how the as_of_date column is handled with these strategies?
Any guidance or suggestions would be greatly appreciated!
Thank you for your help.
The text was updated successfully, but these errors were encountered:
ElenaVillano
changed the title
Especification for feature_group_strategies is not working with leave-one-out or leave-one-in don't work
Specification for feature_group_strategies is not working with leave-one-out or leave-one-in
Sep 11, 2024
Hi everyone,
I'm running triage over [Red Hat 11.3.1-4] on Linux, Python 3.10.6, and using the v8 triage version. My database is in PostgreSQL 15.7 on x86_64-pc-linux-gnu, compiled by GCC (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0, 64-bit.
Configuration details:
All the presented code worked fine until I used the
feature_group_strategies
inleave-one-out
orleave-one-in
. In both cases, I get the same error (detailed below). However, when I usefeature_group_strategies: ['all-combinations']
, it works, but it doesn't group the variables as expected, and I get results as if I were usingall
.Command used:
Everything runs smoothly until the matrix building step, where I encounter this error:
It seems like the
as_of_date
column is missing or not properly generated during matrix building, specifically when using theleave-one-out
orleave-one-in
strategies.I expected the
leave-one-out
strategy to group variables accordingly and generate matrices without this error, but instead, the process halts when it reaches matrix building. I checked the matrices generated in the process and confirmed that the as_of_date column is indeed present.My questions would be:
as_of_date
column is handled with these strategies?Any guidance or suggestions would be greatly appreciated!
Thank you for your help.
The text was updated successfully, but these errors were encountered: