Category: Other · Posted: 2020-04-04 10:39:40 · Views: 0

Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification

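Overview

The authors propose a bidirectional LSTM framework with an attention mechanism for relation extraction. The main novelty of the method is the introduction of the attention mechanism.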

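Network Architecture

The architecture is very simple, a point the authors stress repeatedly. It consists of an Embedding Layer, an LSTM Layer, and an Attention Layer. The Embedding Layer is no different from a conventional embedding: it is initialized with pretrained word vectors and then fine-tuned during training.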
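The embedding step described above can be sketched in NumPy as follows. This is a minimal illustration, not the authors' code: the helper names `init_embedding`, `embed`, and `sgd_update`, the random-initialization range, and the plain SGD update are all assumptions made for the example.

```python
import numpy as np

def init_embedding(pretrained, vocab, dim, rng):
    """Build a (vocab_size, dim) table: pretrained rows where available,
    small random rows otherwise (the range is an assumption)."""
    table = rng.uniform(-0.1, 0.1, size=(len(vocab), dim))
    for word, idx in vocab.items():
        if word in pretrained:
            table[idx] = pretrained[word]
    return table

def embed(table, token_ids):
    """Look up one row per token; returns an array of shape (T, dim)."""
    return table[np.asarray(token_ids)]

def sgd_update(table, token_ids, grad, lr=0.1):
    """Fine-tune only the rows that were looked up.
    np.subtract.at accumulates correctly when a token id repeats."""
    np.subtract.at(table, np.asarray(token_ids), lr * grad)

# Hypothetical toy vocabulary and pretrained vectors.
vocab = {"<unk>": 0, "cat": 1, "dog": 2}
pretrained = {"cat": np.ones(4)}
table = init_embedding(pretrained, vocab, dim=4, rng=np.random.default_rng(0))
vectors = embed(table, [1, 2, 1])  # shape (3, 4)
```

In a real framework the same idea is usually expressed by loading the pretrained matrix into a trainable embedding layer, so the optimizer updates it together with the rest of the network.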

LSTM Layer

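The paper uses a variant of the LSTM; the original post illustrates how it differs from the standard LSTM with a figure. The idea is that each gate also takes the previous memory cell into account.

The update equations are as follows.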
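The equation images did not survive in this copy. As a hedged sketch, a peephole-style formulation consistent with the description above (every gate also sees the memory cell) can be written as below. The weight names $W_{x\cdot}$, $W_{h\cdot}$, $W_{c\cdot}$ and biases $b_{\cdot}$ are notation assumed here, and the exact form should be checked against the paper.

$$
\begin{aligned}
i_t &= \sigma(W_{xi} x_t + W_{hi} h_{t-1} + W_{ci} c_{t-1} + b_i) \\
f_t &= \sigma(W_{xf} x_t + W_{hf} h_{t-1} + W_{cf} c_{t-1} + b_f) \\
g_t &= \tanh(W_{xc} x_t + W_{hc} h_{t-1} + W_{cc} c_{t-1} + b_c) \\
c_t &= i_t \odot g_t + f_t \odot c_{t-1} \\
o_t &= \sigma(W_{xo} x_t + W_{ho} h_{t-1} + W_{co} c_t + b_o) \\
h_t &= o_t \odot \tanh(c_t)
\end{aligned}
$$

The $W_{c\cdot}$ terms are what distinguish this variant from a standard LSTM, whose gates depend only on $x_t$ and $h_{t-1}$.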

Attention Layer

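The attention layer can be described entirely by its equations.

H is the BiLSTM output, of size $v\times T$, where v is the word-vector dimension and T is the sequence length. H is first passed through a tanh activation to obtain M:

$$M = \tanh(H)$$

A fully connected layer followed by a softmax layer then gives the attention weights $\alpha$:

$$\alpha = \mathrm{softmax}(w^{\top} M)$$

Here w has size $v\times 1$, so $\alpha$ has size $1\times T$. Finally, H is multiplied by these weights to give the attention output r, of size $v \times 1$:

$$r = H \alpha^{\top}$$

A final tanh activation gives the output h, of size $v \times 1$:

$$h = \tanh(r)$$

h is fed directly into a softmax layer, which produces the predicted label.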
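To make the shapes concrete, the attention pooling described above (tanh, a projection by w, softmax over time steps, a weighted sum, and a final tanh) can be sketched in NumPy. The function name `attention_pool` and the random inputs are assumptions for illustration only, and 1-D arrays stand in for the $v \times 1$ and $1 \times T$ vectors.

```python
import numpy as np

def attention_pool(H, w):
    """H: BiLSTM outputs of shape (v, T); w: attention vector of shape (v,).
    Returns the sentence vector h of shape (v,) and weights alpha of shape (T,)."""
    M = np.tanh(H)                   # (v, T)
    scores = w @ M                   # (T,), one score per time step
    scores = scores - scores.max()   # shift for numerical stability
    alpha = np.exp(scores) / np.exp(scores).sum()  # softmax over time steps
    r = H @ alpha                    # (v,), weighted sum of the columns of H
    h = np.tanh(r)                   # (v,)
    return h, alpha

# Hypothetical sizes: v = 4 hidden dimensions, T = 6 tokens.
rng = np.random.default_rng(0)
H = rng.normal(size=(4, 6))
w = rng.normal(size=4)
h, alpha = attention_pool(H, w)
```

With w set to all zeros every time step gets the same weight, so the layer reduces to tanh of the mean hidden state; training w is what lets the model emphasize the informative tokens.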

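Notes

The authors encode entity position information by modifying the input sequence itself: delimiter tokens are inserted at the start and end of each entity.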
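A minimal sketch of this preprocessing step is shown below. The marker strings `<e1>`, `</e1>`, `<e2>`, `</e2>`, the function name `mark_entities`, and the inclusive `(start, end)` span convention are assumptions chosen for the example, and the sentence is illustrative only.

```python
def mark_entities(tokens, e1_span, e2_span):
    """Insert delimiter tokens around two entities.
    Spans are (start, end) token indices, inclusive, and must not overlap."""
    opens = {e1_span[0]: "<e1>", e2_span[0]: "<e2>"}
    closes = {e1_span[1]: "</e1>", e2_span[1]: "</e2>"}
    out = []
    for i, tok in enumerate(tokens):
        if i in opens:
            out.append(opens[i])   # marker before the first entity token
        out.append(tok)
        if i in closes:
            out.append(closes[i])  # marker after the last entity token
    return out

tokens = "Flowers are carried into the chapel".split()
marked = mark_entities(tokens, (0, 0), (5, 5))
```

The markers are then treated as ordinary vocabulary items, so they receive their own embeddings and the network can learn where the entities sit without separate position features.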

References

基于注意力机制的双向LSTM关系抽取理解 (Understanding Attention-Based Bidirectional LSTM for Relation Extraction)


Reposted from blog.csdn.net/zycxnanwang/article/details/100075218
