Attention is all you need论文Transformer中的Positional Encoding代码实现及讲解 - 代码天地

Attention is all you need论文Transformer中的Positional Encoding代码实现及讲解

其他 2020-03-11 13:01:13 阅读次数: 0

首先论文中说到因为没有用到RNN也没有用到CNN提取特征，所以句子中没有很好的应用位置信息。所以需要在input embedding后加上Positional Encoding 。所以论文中提出了一种Positional Encoding的实现方式，下面贴出代码的实现以及讲解。

首先看下论文中提出的方式，pos为词的位置信息，dmodel为词向量embedding的维度。

最后得到的向量大小取值范围也在-1到1之间。

代码如下。

# n_position 为句子划分成字符或者词的长度，d_hid为词向量的维度。
def get_sinusoid_encoding_table(n_position, d_hid, padding_idx=None):
''' Sinusoid position encoding table '''

def cal_angle(position, hid_idx):
return position / np.power(10000, 2 * (hid_idx // 2) / d_hid)

def get_posi_angle_vec(position):
return [cal_angle(position, hid_j) for hid_j in range(d_hid)]

sinusoid_table = np.array([get_posi_angle_vec(pos_i) for pos_i in range(n_position)])

sinusoid_table[:, 0::2] = np.sin(sinusoid_table[:, 0::2]) # dim 2i 偶数正弦
sinusoid_table[:, 1::2] = np.cos(sinusoid_table[:, 1::2]) # dim 2i+1 奇数余弦

if padding_idx is not None:
# zero vector for padding dimension
sinusoid_table[padding_idx] = 0.

return torch.FloatTensor(sinusoid_table) # n_position × d_hid 得到每一个词的位置向量

原文链接：https://blog.csdn.net/qq_33278884/article/details/88868808

猜你喜欢

转载自www.cnblogs.com/wisir/p/12461641.html

Attention is all you need论文Transformer中的Positional Encoding代码实现及讲解

【论文阅读】Attention is all you need（Transformer）

Transformer 论文精读——Attention Is All You Need

Transformer【Attention is all you need】

Transformer：Attention Is All You Need

Transformer —— attention is all you need

Attention Is All You Need（Transformer ）

transformer(attention is all you need)

【Transformer】Attention Is All You Need

Attention is all you need中Transformer方法

【论文解读】Attention Is All You Need（Transformer and Self-Attention）

【自然语言处理 | Transformer】Transformer：Attention is All You Need论文讲解

Attention is all you need

Attention all you need

《Attention Is All You Need》

《Attention is All You Need》论文理解Transformer

论文笔记Transformer:Attention is all you need

Attention Is All You Need（Transformer）原理小结

bert之transformer（attention is all you need）

Attention is all you need-详解Transformer

【笔记】Transformer 框架：Attention is all you need

Transformer-《Attention Is All You Need》

Attention is All You Need（Transformer入门）

pytorch求索(4): 跟着论文《 Attention is All You Need》一步一步实现Attention和Transformer

读懂「Attention is All You Need」|

对Attention is all you need 的理解

Attention is All You Need -- 浅析

Attention is All You Need 理解

paper:Attention Is All You Need

Paper | Attention Is All You Need

今日推荐

周排行

python 发送邮件，554问题的一些解决方法

Hadoop集群的组成成份

BZOJ4735 你的生命已如风中残烛【数学】

AlarmManager简单用法记录

程序员接私活的途径以及正确方式。

DAG也许是真正的区块链3.0

【操作系统作业—lab1】linux shell脚本遍历目标文件夹和所有文件 | 包括特殊字符文件名的处理

javaweb：HTTP中GET和POST方法的区别（量大小-安全与否）

Java泛型介绍——HashMap总结

Tornado的使用

每日归档

更多

2024-07-05(0)

2024-07-04(0)

2024-07-03(0)

2024-07-02(0)

2024-07-01(0)

2024-06-30(0)

2024-06-29(0)

2024-06-28(0)

2024-06-27(0)

2024-06-26(0)