DeepBiLSTM #201

efosler · 2018-08-27T20:27:24Z

Is there a reason that DeepBiLSTM has two variants (relu and non-relu)? I was thinking about creating a UniLSTM, and realized that all that really needed to happen were some flags being flipped. It seems like having one model file, DeepLSTM, would be better and have the activations and directionality be options to that model. The diffs between relu and non-relu seem minor.

For backwards compatibility, we could have model_factory just call the new function with the options set appropriately.

ramonsanabria · 2018-08-27T20:43:30Z

Hi Eric,

Yes, "relu and non-relu" is something that I was trying long time ago. I think it was not very important.

Yes correct. This was the idea of model_factory to decouple models and IO infrastructure. We can even model this further and have a layer_factory (?). This was another idea that I had in my head long time ago. What do you think?

Thanks!

efosler · 2018-08-27T20:47:09Z

Should I bother trying to put the two back together? It would not be hard but if it's not on the critical path then I'm not going to bother. It shouldn't take more than 20 minutes to do.

I think I see what you mean, but just to clarify: how do you separate the models from the layers?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DeepBiLSTM #201

DeepBiLSTM #201

efosler commented Aug 27, 2018

ramonsanabria commented Aug 27, 2018

efosler commented Aug 27, 2018

DeepBiLSTM #201

DeepBiLSTM #201

Comments

efosler commented Aug 27, 2018

ramonsanabria commented Aug 27, 2018

efosler commented Aug 27, 2018