generalization

Supporting code for my Towards Data Science post on obtaining bounds on test loss using unlabeled data. Running dense_NN.py trains a fully connected two-layer neural network on synthetic data and saves its predictions on both the training set and the test set. Running image_classification.py does the same for a CNN, using code copied from the TensorFlow image classification tutorial. Overfitting is then assessed with the Gen class in gen.py. Basic usage is:

from gen import Gen
g = Gen('training_predictions.txt', 'testing_predictions.txt')
g.summary('training_plot_name.png', 'testing_plot_name.png', bins=20)

The summary() method creates the training and testing histograms, saves them under the given filenames, and reports a variety of bounds and estimates calculated from the input files training_predictions.txt and testing_predictions.txt.
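To make the histogram step concrete, here is a minimal sketch of binning training and testing predictions on a shared set of bin edges so the two histograms are directly comparable. It is not the Gen implementation: the function name shared_histograms, the synthetic Gaussian predictions, and the total variation distance printed at the end are all assumptions chosen for illustration, and the distance is not one of the bounds discussed in the post. It also assumes each prediction is a single scalar.

```python
import random


def shared_histograms(train_preds, test_preds, bins=20):
    """Bin two lists of scalar predictions on the same bin edges.

    Hypothetical helper, not part of gen.py. Returns (edges, train_freq,
    test_freq), where each frequency list sums to 1.
    """
    lo = min(min(train_preds), min(test_preds))
    hi = max(max(train_preds), max(test_preds))
    if hi == lo:
        hi = lo + 1.0  # avoid zero-width bins when all values are equal
    width = (hi - lo) / bins

    def frequencies(values):
        counts = [0] * bins
        for v in values:
            # Clamp so the maximum value falls in the last bin.
            idx = min(int((v - lo) / width), bins - 1)
            counts[idx] += 1
        return [c / len(values) for c in counts]

    edges = [lo + i * width for i in range(bins + 1)]
    return edges, frequencies(train_preds), frequencies(test_preds)


# Synthetic stand-ins for the saved predictions (assumed format: one
# scalar prediction per sample).
rng = random.Random(0)
train = [rng.gauss(0.0, 1.0) for _ in range(1000)]
test = [rng.gauss(0.2, 1.2) for _ in range(1000)]

edges, p_train, p_test = shared_histograms(train, test, bins=20)

# Illustrative discrepancy only: total variation distance between the
# two binned distributions (0 = identical, 1 = disjoint).
tv = 0.5 * sum(abs(a - b) for a, b in zip(p_train, p_test))
print(f"total variation distance between binned predictions: {tv:.3f}")
```

Using one set of edges for both sets matters: if each histogram picked its own range, bar heights in the two plots would not line up bin for bin.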


Folders and files

Latest commit

History

Repository files navigation

generalization

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages