- Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context Zihang Dai, Zhilin Yang, Yiming Yang, Jaime Carbonell, Quoc V. Le, Ruslan Salakhutdinov
ACL19
[pdf] [code]
- Star-Transformer Qipeng Guo, Xipeng Qiu, Pengfei Liu, Yunfan Shao, Xiangyang Xue, Zheng Zhang
NAACL19
[pdf] [code]
- BP-Transformer: Modelling Long-Range Context via Binary Partitioning Zihao Ye, Qipeng Guo, Quan Gan, Xipeng Qiu, Zheng Zhang [pdf] [code]
- Reformer: The Efficient Transformer Nikita Kitaev, Łukasz Kaiser, Anselm Levskaya
ICLR20
[pdf] [code]
- Longformer: The Long-Document Transformer Iz Beltagy, Matthew E. Peters, Arman Cohan [pdf] [code]
- Big Bird: Transformers for Longer Sequences Manzil Zaheer, Guru Guruganesh, Avinava Dubey, Joshua Ainslie, Chris Alberti, Santiago Ontanon, Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, Amr Ahmed [pdf]
- tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection Nicole Peinelt, Dong Nguyen, Maria Liakata
ACL20
[pdf] [code]
- Recurrent Hierarchical Topic-Guided RNN for Language Generation Dandan Guo, Bo Chen, Ruiying Lu, Mingyuan Zhou
ICML20
[pdf] [code]
- Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation Runzhe Zhan, Xuebo Liu, Derek F. Wong, Lidia S. Chao
AAAI21
[pdf] [code]
- Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog Libo Qin, Xiao Xu, Wanxiang Che, Yue Zhang, Ting Liu
ACL20
[pdf] [code]
- Revisiting Multi-Domain Machine Translation MinhQuang Pham, Josep Maria Crego, François Yvon
TACL21
[pdf]

Paper | Conference |
---|---|
LAnguage MOdeling for Lifelong Language Learning | ICLR20 |
Episodic Memory in Lifelong Language Learning | NIPS19 |
Toward Continual Learning for Conversational Agents | |

Paper | Conference |
---|---|
Hierarchical Summary-to-Article Generation | ICLR20 under review |
Entity-Relation Extraction as Multi-turn Question Answering | ACL19 |
GSN: A Graph-Structured Network for Multi-Party Dialogues | IJCAI19 |
Growing Story Forest Online from Massive Breaking News | CIKM17 |

Paper | Conference |
---|---|
QADiscourse: Discourse Relations as QA Pairs: Representation, Crowdsourcing and Baselines | EMNLP20 |
A Study of Entity-Grid-Based Discourse Representation Models (基于实体网格的语篇表示模型研究) | |
Disentangling Chat with Local Coherence Models | ACL11 |
Modeling Local Coherence: An Entity-Based Approach | |

Paper | Conference |
---|---|
Non-Monotonic Sequential Text Generation | ICML19 |
Imitation Learning with Recurrent Neural Networks | |
Learning to Search Better than Your Teacher | ICML15 |

Paper | Conference |
---|---|
Multilingual Unsupervised NMT using Shared Encoder and Language-Specific Decoders | ACL19 |
Unsupervised Neural Text Simplification | ACL19 |
Unsupervised Question Answering by Cloze Translation | ACL19 |
Unsupervised Abstractive Meeting Summarization with Multi-Sentence Compression and Budgeted Submodular Maximization | ACL18 |

Paper | Conference | White-box or Black-box |
---|---|---|
Deep Text Classification Can be Fooled | IJCAI18 | Both |

Paper | Conference |
---|---|
Learning to Update Natural Language Comments Based on Code Changes | ACL20 |

- Unsupervised Topic Segmentation of Meetings with BERT Embeddings Alessandro Solbiati, Kevin Heffernan, Georgios Damaskinos, Shivani Poddar, Shubham Modi, Jacques Cali [pdf]