Self-attention and multi-head attention in Attention Is All You Need

2023-07-31 01:57:32

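Preface

Attention is becoming increasingly popular in speech recognition, and both soft attention and hard attention are widely used. Starting today, I will reproduce the attention algorithms from top-conference papers one at a time. Today's post covers self-attention.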

self-attention

[Figure: the attention diagram from the paper]

The figure above is the classic diagram from the paper.

The formula used is:

$$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{Q K^{T}}{\sqrt{d_k}}\right) V$$

In other words, three quantities Q, K, and V are introduced, and the formula above applies a series of operations to them.
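To make the formula concrete, here is a minimal sketch in plain Python (no TensorFlow needed). The helper names `matmul`, `softmax`, and `scaled_dot_product_attention` and the toy values are assumptions made for illustration; they are not taken from the paper or from the code in this post.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(q, k, v):
    """Return (output, weights) for softmax(Q K^T / sqrt(d_k)) V."""
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]          # K^T
    scores = [[s / math.sqrt(d_k) for s in row] for row in matmul(q, k_t)]
    weights = [softmax(row) for row in scores]    # one distribution per query
    return matmul(weights, v), weights


# Toy input: 3 frames, d_k = d_v = 2 (values chosen only for illustration).
q = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

output, weights = scaled_dot_product_attention(q, k, v)
print([round(sum(row), 6) for row in weights])  # each row of weights sums to 1
print(len(output), len(output[0]))              # 3 frames, 2 features each
```

Each row of `weights` is a probability distribution over the keys, so every output frame is a weighted average of the rows of V.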

The code:

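import math
import tensorflow as tf  # written against the TensorFlow 1.x API

length = 50     # number of frames
input_dim = 39  # MFCC feature dimension

# Input data: [batch, length, input_dim]
x = tf.placeholder(tf.float32, [None, length, input_dim])


def self_attention(x, hidden_layer):
    # A kernel-size-1 convolution projects every frame to Q, K and V at once.
    qkv = tf.layers.conv1d(x, hidden_layer * 3, 1, strides=1, padding='same')
    Q, K, V = tf.split(qkv, 3, axis=2)  # each [batch, length, hidden_layer]
    # Scaled dot-product scores: [batch, length, length]
    scores = tf.matmul(Q, K, transpose_b=True) / math.sqrt(hidden_layer)
    # Softmax over the last (key) axis, so each frame's weights sum to 1.
    weights = tf.nn.softmax(scores)
    # Weighted sum of the values: [batch, length, hidden_layer]
    return tf.matmul(weights, V)

tf.split separates Q, K, and V. Q is then matrix-multiplied with the transpose of K and scaled by the square root of the hidden size, softmax is applied across the keys, and the resulting weights are multiplied with V. This gives the output of single-head attention. Note that the softmax has to run over the key axis of the full score matrix: summing the scores first and then applying softmax to a tensor whose last axis has size 1 would return all ones and leave V unchanged.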
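The projection-and-split step can be pictured without TensorFlow. The sketch below uses plain Python for a single batch element: a kernel-size-1 convolution with stride 1 applies the same linear map to every frame (bias omitted here), and the result is cut into three equal parts along the feature axis. The helpers `project` and `split_last_axis`, the hidden size, and the weight values are hypothetical and chosen only for illustration.

```python
def project(frames, weight):
    """Apply the same linear map to every frame (a kernel-size-1 conv1d, bias omitted)."""
    return [[sum(f * w for f, w in zip(frame, col)) for col in zip(*weight)]
            for frame in frames]


def split_last_axis(frames, parts):
    """Split each frame's features into `parts` equal chunks (one batch element)."""
    width = len(frames[0])
    if width % parts != 0:
        raise ValueError("feature size must be divisible by the number of parts")
    step = width // parts
    return [[frame[i * step:(i + 1) * step] for frame in frames] for i in range(parts)]


hidden = 2                                      # hypothetical hidden size
frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]   # 3 frames, 2 input features
# Hypothetical weight of shape [input_features, 3 * hidden].
weight = [[1.0, 0.0, 0.0, 1.0, 1.0, 1.0],
          [0.0, 1.0, 1.0, 0.0, 1.0, -1.0]]

fused = project(frames, weight)                 # [3 frames, 3 * hidden]
q, k, v = split_last_axis(fused, 3)             # each [3 frames, hidden]
print(q[0], k[0], v[0])
```

Fusing the three projections into one layer and splitting afterwards is a convenience: it yields the same Q, K, and V as three separate projections whose weights are the corresponding column blocks of `weight`.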

Now that we have a single-head version, how do we turn it into a multi-head one?

Let's first look at what the paper says:

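$$\mathrm{MultiHead}(Q, K, V) = \mathrm{Concat}(\mathrm{head}_1, \ldots, \mathrm{head}_h) W^{O}, \qquad \mathrm{head}_i = \mathrm{Attention}(Q W_i^{Q}, K W_i^{K}, V W_i^{V})$$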

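As we read it, the paper first splits the last dimension of an input into h parts and concatenates the per-head results at the end. This sentence from the paper also supports that reading: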

[Figure: excerpt from the paper]

The paper uses h = 8; here we use 5.

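def multi_head_attention(x, head, output_channel):
    # Split the feature axis into `head` equal slices. tf.split needs the last
    # dimension of x to be divisible by `head` (e.g. 40 for head=5, not 39).
    slices = tf.split(x, head, axis=2)
    # Run an independent self-attention (hidden size 32) on each slice.
    heads = [self_attention(s, 32) for s in slices]
    # Concatenate the heads along the feature axis: [batch, length, 32 * head]
    merged = tf.concat(heads, axis=2)
    # Kernel-size-1 convolution as the final output projection.
    return tf.layers.conv1d(merged, output_channel, 1, strides=1, padding='same')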

With that, the multi-head attention code is done without much effort.
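The split, attend, concatenate pattern can also be checked in plain Python for a single batch element. This is only a sketch under simplifying assumptions: the learned projections are left out (each slice uses Q = K = V), and the function names `attention` and `multi_head` and the toy values are hypothetical. It also shows why the feature size has to be divisible by the number of heads, which would matter if the 39-dimensional MFCC input were split into 5 heads directly.

```python
import math


def attention(x):
    """Self-attention on one slice with Q = K = V = x (learned projections omitted)."""
    d = len(x[0])
    out = []
    for query in x:
        scores = [sum(a * b for a, b in zip(query, key)) / math.sqrt(d) for key in x]
        peak = max(scores)
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        out.append([sum(w * value[j] for w, value in zip(weights, x)) for j in range(d)])
    return out


def multi_head(x, heads):
    """Split features into `heads` slices, attend within each slice, concatenate."""
    width = len(x[0])
    if width % heads != 0:
        raise ValueError(f"feature size {width} is not divisible by {heads} heads")
    step = width // heads
    slices = [[frame[h * step:(h + 1) * step] for frame in x] for h in range(heads)]
    attended = [attention(s) for s in slices]
    # Concatenate the per-head outputs frame by frame along the feature axis.
    return [sum((head[t] for head in attended), []) for t in range(len(x))]


# 4 frames with 10 features each, so 5 heads of width 2 (toy values).
x = [[float(t + f) for f in range(10)] for t in range(4)]
y = multi_head(x, heads=5)
print(len(y), len(y[0]))  # 4 frames, 10 features

try:
    multi_head([[0.0] * 39 for _ in range(4)], heads=5)  # 39 features, 5 heads
except ValueError as err:
    print(err)
```

One way around the divisibility constraint is to project the input to a size that is a multiple of the head count before splitting.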
