VISUM 2021 Summer School Digital Edition - The Competition Repository
Here you will find all information regarding the competition, including:
- General information about the challenge, including the evaluated metrics.
- Detailed technical instructions about the competition.
- Auxiliary source code, including the source code of the baseline solution.
We wish you all good luck and much success in your work :)
Presentation - here
FAQs - here
Submission Platform - here
Final Leaderboard - here
- Access the baseline source code by either:
- Accessing the files already present in your machine OR
- Cloning this repo into your machine by doing and afterwards moving the contents of the created directory into
/home/visum
git clone https://github.com/visum-summerschool/visum-competition2021.git cp -r visum-competition2021/ . rm -r visum-competition2021
- run
python3 split_data_nondisjoint.py
to create the training and validation splits in/home/visum/processed_data
- run
python3 generate_community_prods.py
to generate the louvain communities - Either:
- train the baseline using
python3 train.py
(model weights and checkpoints are saved in/home/visum/results/<timestamp>
) OR - use the baseline weights which are already in
/home/visum/results/
.
- train the baseline using
- IF you trained a new model you need to copy the weights (and the tokenizer) into
/home/visum
by doing:where <timestamp> corresponds to the name of the folder where the trained model is located. Usually this will be a timestamp in the format YYYY-MM-DD_hh-mm-ss.cp -r results/<timestamp>/tokenizer results/. cp results/<timestamp>/best_model_weights.pth results/.
- generate predictions for your model with
python3 test.py
- check the predictions are in the correct format by running
python3 evaluate.py preds.csv /home/master/dataset/test/solutions.csv
- submit your results by accessing here
The steps described in the previous section allow you to train your model and test it on some dummy test queries (generated from the train data) located in home/master/dataset/test
. However, you might want to test the model on the validation split created in step 2. To do so, consider the following steps:
- run
python3 generate_test_queries.py
to generate queries from your validation split - test your model with
python3 test.py -t processed_data/valid
- evaluate your model by running
python3 evaluate.py preds.csv processed_data/valid/solutions.csv
cd /home/visum/
rm preds.csv
python3 test.py
python3 evaluate.py preds.csv /home/master/dataset/test/solutions.csv```