You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
$ mkdir tmp
$ cd tmp
$ wget -P data/imdb https://github.com/usc-isi-i2/kgtk-notebooks/raw/main/datasets/imdb/IMDB.csv.gz
$ zcat IMDB.csv.gz | head -2
imdb_title_id,title,original_title,year,date_published,genre,duration,country,language,director,writer,production_company,actors,description,avg_vote,votes,budget,usa_gross_income,worlwide_gross_income,metascore,reviews_from_users,reviews_from_critics
tt0000009,Miss Jerry,Miss Jerry,1894,1894-10-09,Romance,45,USA,None,Alexander Black,Alexander Black,Alexander Black Photoplays,"Blanche Bayliss, William Courtenay, Chauncey Depew",The adventures of a female reporter in the 1890s.,5.9,154,,,,,1.0,2.0
First issue:
normalize-nodes does not recognize --id-column option
First issue:
normalize-nodes
does not recognize--id-column
optionhttps://kgtk.readthedocs.io/en/latest/transform/normalize_nodes/
Second issue:
After manually changing the column name
imdb_title_id
toid
, thereis still an error:
-v
shows that the issue is the extra filename component.The code is trying to detect compression suffixes like
foo.csv.gz
.It sees there is no
gz
but then it mistakenly defaults tokgtk
.Workarounds: name the file as
foo.csv
with a single dot or pass--input-format csv
.The text was updated successfully, but these errors were encountered: