My learning rate wasn't tuned properly #164
Comments
The last encoder layer's output is exactly what gets fed into every decoder layer as K and V.

The author's implementation is correct. Go read the paper: the final encoder layer's output is passed to all decoder layers.

The author is not wrong; it's your understanding that's off.

Right, I know. I later tried both approaches: the author's version gives better results, while my version runs faster.
In model.py, inside `def train()` (around lines 140–141):

```python
memory, sents1, src_masks = self.encode(xs)
logits, preds, y, sents2 = self.decode(ys, memory, src_masks)
```

I had assumed that each encoder block's output is fed into the corresponding decoder block. But in this code, `memory` is the output of only the last encoder block, and the author passes it directly into every decoder block. Isn't that a mistake? I think the encoder should store each layer's output in a list, pass the list to the decoder, and feed each decoder block its corresponding entry.

What does everyone think?
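As the comments note, the standard Transformer feeds only the final encoder layer's output (the `memory`) into the cross-attention of every decoder layer. Below is a minimal NumPy sketch of that wiring; it is a toy illustration, not the repo's actual code, and the layer functions are simplified stand-ins for real self-attention + feed-forward sublayers:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 4  # toy model dimension

def cross_attention(q, memory):
    # Scaled dot-product attention: queries come from the decoder,
    # keys and values are both taken from the encoder memory.
    scores = q @ memory.T / np.sqrt(d)
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ memory

def encoder_layer(x, w):
    # Stand-in for self-attention + FFN: a linear map with ReLU.
    return np.maximum(x @ w, 0.0)

def decoder_layer(y, memory, w_self):
    y = np.maximum(y @ w_self, 0.0)        # stand-in for masked self-attention
    return y + cross_attention(y, memory)  # residual over cross-attention

num_layers = 3
x = rng.normal(size=(5, d))  # source sequence, length 5
y = rng.normal(size=(6, d))  # target sequence, length 6

# Encode: stack the layers; only the LAST layer's output becomes the memory.
h = x
for w in [rng.normal(size=(d, d)) for _ in range(num_layers)]:
    h = encoder_layer(h, w)
memory = h

# Decode: the SAME memory tensor is handed to every decoder layer.
out = y
for w in [rng.normal(size=(d, d)) for _ in range(num_layers)]:
    out = decoder_layer(out, memory, w)
```

The per-layer variant proposed in the question (a list of every encoder layer's output, routed layer-to-layer) would also type-check, but it is a different architecture from the one in the paper, which the repo follows.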