NMT十篇必读论文（一）attention is all you need - 代码天地

NMT十篇必读论文（一）attention is all you need

其他 2019-01-29 20:01:45 阅读次数: 0

清华大学NLP整理的神经机器翻译reading list中提到了十篇必读论文

https://github.com/THUNLP-MT/MT-Reading-List

本文抛弃了惯用的以CNN、RNN作为位置编码的方法，单纯依靠注意力机制以及简单的三角函数进行了位置编码，起到了不错的效果。对应模型为Tensor2Tensor框架下的Transformer模型。

GitHub地址： https://github.com/tensorflow/tensor2tensor

解释的比较好的博客：

https://blog.csdn.net/c9Yv2cf9I06K2A9E/article/details/79023069

https://ask.hellobi.com/blog/wenwen/18695

https://blog.csdn.net/qq_41058526/article/details/80783925

https://www.jianshu.com/p/3f2d4bc126e6

https://blog.csdn.net/mijiaoxiaosan/article/details/73251443

清华大学在此基础上提出了一种改进的文档级Transformer模型

Improving the Transformer Translation Model with Document-Level Context

https://github.com/THUNLP-MT/Document-Transformer

将原来Transformer模型的encoder和decoder结构的self-attention之后的输出作为Q，将经过self-attention之后的context embedding作为K，V，分别进行了一次mulit-head self-attention，并进行了些许优化

实验结果表明bleu值提高了

猜你喜欢

转载自blog.csdn.net/weixin_40240670/article/details/85619899

NMT十篇必读论文（一）attention is all you need

Attention is all you need

Attention all you need

《Attention Is All You Need》

读懂「Attention is All You Need」|

对Attention is all you need 的理解

Transformer【Attention is all you need】

Attention is All You Need -- 浅析

Attention is All You Need 理解

paper:Attention Is All You Need

Transformer：Attention Is All You Need

Transformer —— attention is all you need

Paper | Attention Is All You Need

Attention Is All You Need（Transformer ）

transformer(attention is all you need)

【Transformer】Attention Is All You Need

论文笔记：Attention Is All You Need

论文分享-->Attention is all you need

论文笔记《Attention Is All You Need》

Attention is all you need 论文详解（转）

《Attention Is All You Need》论文总结

attention is all you need 论文笔记

[Attention Is All You Need]论文笔记

Attention is all you need论文翻译

Attention Is All You Need 论文研读

【论文笔记】Attention is all you need

Attention Is All You Need论文详解与理解

论文阅读：Attention is all you need

【论文阅读】Attention is all you need（Transformer）

【论文 01】《Attention is all you need》

今日推荐

周排行

AIZU 2224 Save your cats(并查集)

HTTP响应头状态码详解

Python socket编程（2）

MaxCompute Studio使用心得系列7—作业对比

Supervisor安装使用

LeetCode 164. Maximum Gap

mysql面试题: 一张表里面有ID自增主键，当insert了17条记录之后，删除了第15,16,17条记录，再把mysql重启，再insert一条记录，这条记录的ID是18还是15

nutch1.2 DeleteDuplicates IndexMerger 详解

OC - @property与setter,getter方法

SpringBoot @Transactional的rollbackFor属性

每日归档

更多

2024-09-19(0)

2024-09-18(0)

2024-09-17(0)

2024-09-16(0)

2024-09-15(0)

2024-09-14(0)

2024-09-13(0)

2024-09-12(0)

2024-09-11(0)

2024-09-10(0)