An experiment to see how little training data is needed to train a network on a new feature when a pretrained network already predicts the same quantity without that feature.
Dataset: boston_housing. It has 13 features and 1 predicted value.
Trained 3 ANNs:
- M1 Input 13 - Vanilla training
- M2 Input 12 - Dropped the last column of the feature set, then vanilla training
- M3 Input 13 - Initialized with the weights from M2, then trained
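
One possible way to carry M2's weights into M3 is sketched below. M2's first layer expects 12 inputs and M3's expects 13, so the first-layer weight matrix has to grow by one row before it can be reused. The notes above do not say how the weights for the new 13th input were initialized. Setting them to zero, as done here, is an assumption, chosen so that M3 starts out predicting exactly what M2 predicts. The function names, the layer size of 4 hidden units, and the random values are all hypothetical, and plain Python lists stand in for the real framework tensors.

```python
import random


def widen_first_layer(kernel, n_new=1, init_scale=0.0, seed=0):
    """Return a copy of a first-layer kernel with n_new extra input rows.

    kernel is a list of rows, one row per input feature; each row holds
    one weight per hidden unit. New rows are drawn uniformly from
    [-init_scale, init_scale]; the default of 0.0 gives all-zero rows
    (an assumption, not something the original experiment specifies).
    """
    rng = random.Random(seed)
    n_hidden = len(kernel[0])
    widened = [row[:] for row in kernel]  # copy, so M2's weights stay untouched
    for _ in range(n_new):
        widened.append(
            [rng.uniform(-init_scale, init_scale) for _ in range(n_hidden)]
        )
    return widened


def dense(x, kernel, bias):
    """Linear part of a dense layer: x @ kernel + bias."""
    return [
        sum(xi * kernel[i][j] for i, xi in enumerate(x)) + bias[j]
        for j in range(len(bias))
    ]


# Hypothetical M2 first layer: 12 inputs, 4 hidden units.
rng = random.Random(42)
m2_kernel = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(12)]
m2_bias = [rng.uniform(-1, 1) for _ in range(4)]

# M3 first layer: same weights, plus one zero row for the 13th feature.
m3_kernel = widen_first_layer(m2_kernel)
m3_bias = m2_bias[:]

# A made-up 13-feature sample; M2 only sees the first 12 values.
x13 = [rng.uniform(0, 1) for _ in range(13)]
out_m2 = dense(x13[:12], m2_kernel, m2_bias)
out_m3 = dense(x13, m3_kernel, m3_bias)
```

With the zero row, `out_m3` matches `out_m2` before any further training, so the later training on 13 features only has to learn what the new feature contributes. All layers after the first have the same shape in both models and can be copied as they are. In a Keras-style setup the same idea would presumably be applied to the arrays returned by `get_weights()` and passed back through `set_weights()`.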