Files

.circleci
Algorithms_and_Hardness_for_Learning_Linear_Thresholds_from_Label_Proportions
CIQA
Domain_Agnostic_Contrastive_Representations_for_Learning_from_Label_Proportions
KNF
On_Combining_Bags_to_Better_Learn_from_Label_Proportions
OpenMSD
STraTA
aav
abps
abstract_nas
action_angle_networks
action_gap_rl
activation_clustering
active_selective_prediction
adaptive_learning_rate_tuner
adaptive_prediction
adaptive_surrogates
adversarial_nets_lr_scheduler
after_kernel
agile_modeling
al_for_fep
albert
algae_dice
aloe
alx
amortized_bo
android_in_the_wild
anthea
aptamers_mlpd
aqt
aquadem
arithmetic_sampling
arxiv_latex_cleaner
assemblenet
assessment_plan_modeling
attentional_adapters
attribution
automatic_structured_vi
automl_zero
autoregressive_diffusion
aux_tasks
axial
bam
bangbang_qaoa
basisnet
batch_science
behavior_regularized_offline_rl
bertseq2seq
better_storylines
bigg
bigger_better_faster
bisimulation_aaai2020
bitempered_loss
blur
bnn_hmc
bonus_based_exploration
building_detection
business_metric_aware_forecasting
bustle
c_learning
cache_replacement
caltrain
capsule_em
caql
cascaded_networks
cate
cbertscore
cell_embedder
cell_mixer
cfq
cfq_pt_vs_sa
charformer
ciw_label_noise
class_balanced_distillation
clay
cluster_gcn
clustering_normalized_cuts
cnn_quantization
cochlear_implant
code_as_policies
codistillation
cognate_inpaint_neighbors
coherent_gradients
cola
cold_posterior_bnn
cold_posterior_flax
collocated_irradiance_network
coltran
combiner
comisr
compgen_d2t
compositional_classification
- modules
- scripts
- README.md
- cls_lstm.sh
- cls_relative_transformer.sh
- cls_relative_transformer_nomask.sh
- cls_transformer.sh
- requirements.txt
- run.sh
- run_cls.py
compositional_rl
compositional_transformers
concept_explanations
concept_marl
conceptor
conqur
constrained_language_typology
contrack
contrails
contrastive_rl
coref_mt5
correct_batch_effects_wdn
correlated_compression
correlation_clustering
covid_epidemiology
covid_vhh_design
cube_unfoldings
cubert
cvl_public
d3pm
dac
darc
data_free_distillation
data_selection
dataset_or_not
dble
ddpm_w_distillation
deciphering_clinical_abbreviations
dedal
deep_homography
deep_representation_one_class
demogen
dense_representations_for_entity_retrieval
deplot
depth_and_motion_learning
depth_from_video_in_the_wild
design_bipartite_experiments
dialogue_ope
dichotomy_of_control
dictionary_learning
didi_dataset
differentiable_data_selection
differentially_private_gnns
diffusion_distillation
dimensions_of_motion
dipper
direction_net
disarm
dissecting_factual_predictions
distinguishing_romanized_hindi_urdu
distracting_control
distribution_embedding_networks
dnn_predict_accuracy
do_wide_and_deep_networks_learn_the_same_things
docent
domain_conditional_predictors
dot_vs_learned_similarity
dp_alternating_minimization
dp_multiq
dp_regression
dp_topk
dp_transfer
dpok
dql_grasping
drawtext
dreamfields
dreg_estimators
drfact
drops
dselect_k_moe
dual_dice
dual_pixels
dvrl
earthquakes_fern
ebp
editable_graph_temporal
eeg_modelling
eim
eli5_retrieval_large_lm
enas_lm
encyclopedic_vqa
entropy_semiring
es_enas
es_maml
es_optimization
etcmodel
etcsum
euphonia_spice
evanet
evolution
experience_replay
explaining_risk_increase
extreme_memorization
f_divergence_estimation_ram_mc
f_net
factoring_sqif
factorize_a_city
factors_of_influence
fair_submodular_matroid
fair_submodular_maximization_2020
fair_survival_analysis
fairness_and_bias_in_online_selection
fairness_teaching
fast_k_means_2020
fastconvnets
fat
federated_vision_datasets
felix
findit
fisher_brc
flare_removal
flax_models
floatseg
flood_forecasting
frechet_audio_distance
frechet_video_distance
frequency_analysis
frmt
frost
fsq
fully_dynamic_facility_location
fully_dynamic_submodular_maximization
func_dist
fvlm
fwl
gaternet
ged_tts
gen_patch_neural_rendering
general-pattern-machines
generalization_representations_rl_aistats22
generalized_rates
generative_trees
genomics_ood
gfsa
ghum
gift
gigamol
goemotions
gon
gradient_based_tuning
gradient_coresets_replay
graph_compression
graph_embedding
graph_sampler
graph_temporal_ai
grbm
group_agnostic_fairness
grouptesting
grow_bert
gumbel_max_causal_gadgets
gwikimatch
hal
hct
hierarchical_foresight
hipi
hist_thresh
hitnet
hmc_swindles
homophonous_logography
hspace
hst_clustering
human_attention
human_object_interaction
hybrid_zero_dynamics
hyperbolic
hyperbolic_discount
hypertransformer
ials
icetea
ieg
igt_optimizer
ime
imghum
implicit_constrained_optimization
implicit_pdf
incontext
incremental_gain
inerf
infinite_nature
infinite_nature_zero
infinite_uncertainty
intent_recognition
interactive_cbms
interpretability_benchmark
invariant_explanations
invariant_slot_attention
investigating_m4
ipagnn
irregular_timeseries_pretraining
isl
isolating_factors
jax_dft
jax_mpc
jax_particles
jaxbarf
jaxnerf
jaxraytrace
jaxsel
jaxstronomy
jrl
jslm
k_norm
keypose
kip
kl_guided_sampling
kobe
ksme
kwikbucks
kws_streaming
l2da
l2tl
label_bias
lamp
language_model_uncertainty
large_margin
large_scale_voting
lasagna_mt
latent_programmer
latent_shift_adaptation
layout-blt
learn_to_forget
learn_to_infer
learning_parameter_allocation
learning_with_little_mixing
learnreg
ledge
lego
light_field_neural_rendering
lighthouse
linear_dynamical_systems
linear_eval
linear_identifiability
linear_vae
lista_design_space
llm4mobile
lm_fact_tracing
lm_memorization
local_forward_gradient
locoprop
logic_inference_dataset
logit_adjustment
loss_functions_transfer
low_rank_local_connectivity
m_layer
m_theory
madlad_400
many_constraints
marot
mave
mbpp
mechanic
meena
memento
memory_efficient_attention
menger_rl
mentormix
merf
meta_augmentation
meta_learning_without_memorization
meta_pseudo_labels
meta_reward_learning
metapose
mico
micronet_challenge
microscope_image_quality
milking_cowmask
minigrid_basics
misinfo_provenance
missing_link
ml_debiaser
mobilebert
model_pruning
moe_models_implicit_bias
moe_mtl
moew
mol_dqn
moment_advice
motion_blur
mpi_extrapolation
mqm_viewer
muNet
mucped22
muller
multi_annotator
multi_game_dt
multi_resolution_rec
multimodalchat
multiple_user_representations
munchausen_rl
musiq
mutual_information_representation_learning
muzero
ncsnv3
negative_cache
nerflets
nested_rhat
neural_additive_models
neural_guided_symbolic_regression
neutra
nf_diffusion
ngrammer
nigt_optimizer
nngp_nas
non_decomp
non_semantic_speech_benchmark
nopad_inception_v3_fcn
norml
npy_array
numbert
occluder_recovery
offline_online_bandits
omnimatte3D
online_belief_propagation
online_correlation_clustering
opencontrails
openscene
opt_list
optimizing_interpretability
osf
pair_ngram
pairwise_fairness
pali
parallel_clustering
pde_preconditioner
performer
persistent-nature
persistent_es
perso_arabic_norm
perturbations
pgdl
playrooms
poem
policy_eval
polish
poly_kernel_sketch
pretrained_conv
prime
primer
privacy_poison
private_covariance_estimation
private_kendall
private_personalized_pagerank
private_sampling
private_text_transformers
procedure_cloning
property_linking
protein_lm
protenn
protnlm
protoattend
protseq
proxy_rewards
pruning_identified_exemplars
pse
psyborgs
psycholab
ptopk_patch_selection
pvn
pwil
q_match
qanet
qsp_quantum_metrology
quantum_sample_learning
r4r
rank_ckpt
rankgen
rankt5
ravens
rcc_algorithms
rce
re_identification_risk
readtwice
realformer
recs_ecosystem_creator_rl
recursive_optimizer
red-ace
regnerf
rembert
remote_sensing_representations
repnet
representation_batch_rl
representation_clustering
representation_similarity
reset_free_learning
resolve_ref_exp_elements_ml
restarting_FOM_for_LP
revisiting_neural_scaling_laws
rico_semantics
rise
rl4circopt
rl_metrics_aaai2021
rl_repr
rllim
robust_count_sketch
robust_loss
robust_loss_jax
robust_optim
robust_retrieval
rouge
routing_transformer
rpc
rrlfd
rs_gnn
saccader
saf
sail_rl
saycan
scalable_shampoo
scaling_transformer_inference_efficiency
scaling_transformers
scann
schema_guided_dst
schptm_benchmark
score_prior
scouts_ml_model_env
screen2words
scrna_benchmark
sd_gym
seq2act
sequential_attention
sgk
shortcut_testing
sign_language_detection
simpdom
simple_probabilistic_programming
simulation_research
single_view_mpi
sketching
sliding_window_clustering
slot_attention
sm3
smart_eval
smith
smu
smug_saliency
smurf
snerg
snlds
sobolev
social_rl
socraticmodels
soft_sort
soft_topk
soil_moisture_retrieval
solver1d
sorb
spaceopt
sparse_data
sparse_mixers
sparse_soft_topk
special_orthogonalization
specinvert
spectral_bias
spectral_graphormer
speech_embedding
spelling_convention_nlm
spin_spherical_cnns
spreadsheet_coder
sql_palm
squiggles
stable_transfer
stacked_capsule_autoencoders
standalone_self_attention_in_vision_models
star_cfq
state_of_sparsity
stochastic_to_deterministic
storm_optimizer
strategic_exploration
stream_s2s
streetview_contrails_dataset
structformer
structured_multihashing
student_mentor_dataset_cleaning
study_recommend
subclass_distillation
sufficient_input_subsets
summae
supcon
supervised_pixel_contrastive_loss
symbolic_functionals
t5_closed_book_qa
tabnet
tag
talk_about_random_splits
taperception
task_set
task_specific_learned_opt
tcc
tf3d
tf_trees
tft
tide
tide_nlp
time_varying_optimization
tiny_video_nets
topological_transformer
towards_gan_benchmarks
trainable_grids
transformer_modifications
trimap
true_teacher
truss_decomposition
tsmixer
tunas
uflow
ugif
ugsl
ul2
uncertainties
understanding_convolutions_on_graphs
universal_embedding_challenge
unprocessing
uq_benchmark_2019
using_dl_to_annotate_protein_universe
vae_ood
value_dice
value_function_polytope
vatt
vbmi
vct
vdvae_flax
video_structure
video_timeline_modeling
vila
visual_relationship
vmsst
vrdu
warmstart_graphcut_image_segmentation
weak_disentangle
widget-caption
widget_caption
wiki_split_bleu_eval
wildfire_conv_lstm
wildfire_perc_sim
wt5
xirl
yeast_transcription_network
yobo
yoto
youtube_asl
zebraix
zero_shot_structured_reflection
.gitignore
CONTRIBUTING.md
LICENSE
README.md
__init__.py
compile_protos.sh

compositional_classification

santiaontanon-google

and

copybara-github

Releasing the source code of the internship work of Juyong Kim (assoc…

Sep 19, 2023

fe8b2b7 · Sep 19, 2023

History

This branch is 497 commits behind google-research/google-research:master.

Name	Name	Last commit message	Last commit date
parent directory ..
modules	modules	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
scripts	scripts	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
README.md	README.md	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
cls_lstm.sh	cls_lstm.sh	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
cls_relative_transformer.sh	cls_relative_transformer.sh	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
cls_relative_transformer_nomask.sh	cls_relative_transformer_nomask.sh	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
cls_transformer.sh	cls_transformer.sh	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
requirements.txt	requirements.txt	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
run.sh	run.sh	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023
run_cls.py	run_cls.py	Releasing the source code of the internship work of Juyong Kim (assoc…	Sep 19, 2023

README.md

CFQ classification task

This repository contains the source code for the paper: https://arxiv.org/abs/2106.10434

@misc{kim2021improving,
      title={Improving Compositional Generalization in Classification Tasks via Structure Annotations},
      author={Juyong Kim and Pradeep Ravikumar and Joshua Ainslie and Santiago Ontañón},
      year={2021},
      eprint={2106.10434},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

The code converts the CFQ dataset into a sentence pair classification task and trains ML models on the task annotated with structural annotation.

1. Create datasets

Place CFQ dataset under scripts directory. A file dataset.json of CFQ dataset should be placed in scripts/cfq.
(For model negative dataset) Run CFQ baseline code with option to print beam score (Open run.sh and append --decode_hparams="return_beams=True,write_beam_scores=True" to t2t-decoder command). Place CFQ baseline outputs under scripts/cfq_model_outputs directory. The directory structure should be

scripts
└─cfq_model_outputs
  └─(cfq_split)
    ├─train_encode.txt
    ├─train_decode_lstm.txt
    ├─train_decode_transformer.txt
    ├─train_decode_universal.txt
    ├─dev_encode.txt
    ...
    ├─dev_decode_universal.txt
    ├─test_encode.txt
    ...
    └─test_decode_universal.txt

Run the generation shell script in scripts (Please see help output for usage).

$ ./create_cls_dataset.sh cfq_split neg_method train_hold_out output_tree

Note: Currently, dataset with structure annotation (or when output_tree is true) can be generated when xlink_mapping.pkl is placed under the dataset output dir (This file can be generated using a jupyter notebook colab/cfq_xlink_mutual_information.ipynb and the dataset of the same config but without structure annotation).

2. Dependency

Checkout the ETC repository at (https://github.com/google-research/google-research/tree/master/etcmodel) under the third_party directory.

3. Run model

Run one of the training shell scripts (cls_*.sh) in the project root (Please see help output for usage). Note that only Relative Transformer can use structure annotations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

compositional_classification

compositional_classification

README.md

CFQ classification task

1. Create datasets

2. Dependency

3. Run model

Files

compositional_classification

Directory actions

More options

Directory actions

More options

Latest commit

History

compositional_classification

Folders and files

parent directory

README.md

CFQ classification task

1. Create datasets

2. Dependency

3. Run model