
Custom Keras Layers

Implementing a Simple Layer

To implement a simple layer, you first subclass layers.Layer. The example below is from the official documentation:

from keras import backend as K
from keras.engine.topology import Layer
import numpy as np

class MyLayer(Layer):

    def __init__(self, output_dim, **kwargs):
        self.output_dim = output_dim
        super(MyLayer, self).__init__(**kwargs)

    def build(self, input_shape):
        # Create a trainable weight variable for this layer.
        self.kernel = self.add_weight(name='kernel', 
                                      shape=(input_shape[1], self.output_dim),
                                      initializer='uniform',
                                      trainable=True)
        super(MyLayer, self).build(input_shape)  # Be sure to call this somewhere!

    def call(self, x):
        return K.dot(x, self.kernel)

    def compute_output_shape(self, input_shape):
        return (input_shape[0], self.output_dim)

As shown above, there are three methods we need to implement ourselves:

  • build() initializes and defines the weights; here you can use the parent class's self.add_weight() method to create them. This method must set self.built to True to mark the Layer as successfully built, which is usually done, as shown above, by calling super(MyLayer, self).build(input_shape)
  • call() does the layer's actual work: all of the layer's computation happens in this method
  • compute_output_shape() computes the shape of the output tensor
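To make the contract between call() and compute_output_shape() concrete, here is a minimal NumPy sketch (not Keras code; all names and sizes are illustrative) of what MyLayer computes: build() would create a (input_dim, output_dim) kernel, call() performs a dot product, and the resulting shape is exactly what compute_output_shape() predicts.

```python
import numpy as np

rng = np.random.default_rng(0)

batch, input_dim, output_dim = 2, 4, 3

# What build() creates: a kernel of shape (input_shape[1], output_dim),
# drawn here from a small uniform range like the 'uniform' initializer.
kernel = rng.uniform(-0.05, 0.05, size=(input_dim, output_dim))

# A batch of inputs of shape (batch, input_dim).
x = rng.normal(size=(batch, input_dim))

# What call() computes via K.dot(x, self.kernel).
y = x @ kernel

# The output shape matches compute_output_shape():
# (input_shape[0], self.output_dim).
assert y.shape == (batch, output_dim)
```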

A typical deep-learning workflow has three stages: forward, backward, and update. In Keras, for a single Layer, trainable weights should be appended to the list self.trainable_weights inside build(). Other relevant attributes include self.non_trainable_weights (a list) and self.updates (a list of (tensor, new_tensor) tuples that need to be updated). You can refer to the implementation of the BatchNormalization layer to see how these two attributes are used. build() must set self.built = True, which can be done by calling super([Layer], self).build().
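The self.updates mechanism is easiest to see with BatchNormalization-style running statistics: the layer queues (tensor, new_tensor) pairs so the backend can refresh non-trainable state after each forward pass. The following is a hedged NumPy sketch of that idea only; the names and the momentum value are illustrative, not the Keras internals.

```python
import numpy as np

momentum = 0.9
moving_mean = np.zeros(3)                  # non-trainable running statistic
batch = np.array([[1.0, 2.0, 3.0],
                  [3.0, 4.0, 5.0]])

# Statistic computed from the current batch during the forward pass.
batch_mean = batch.mean(axis=0)            # [2., 3., 4.]

# The new value the layer wants the backend to assign.
new_moving_mean = momentum * moving_mean + (1 - momentum) * batch_mean

# What a layer would record in self.updates: a (tensor, new_tensor) pair.
updates = [(moving_mean, new_moving_mean)]
```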

Loss and Parameter Updates

Looking closely at the add_weight implementation (keras/engine/topology.py):

def add_weight(self,
               name,
               shape,
               dtype=None,
               initializer=None,
               regularizer=None,
               trainable=True,
               constraint=None):
    """Adds a weight variable to the layer.
    # Arguments
        name: String, the name for the weight variable.
        shape: The shape tuple of the weight.
        dtype: The dtype of the weight.
        initializer: An Initializer instance (callable).
        regularizer: An optional Regularizer instance.
        trainable: A boolean, whether the weight should
            be trained via backprop or not (assuming
            that the layer itself is also trainable).
        constraint: An optional Constraint instance.
    # Returns
        The created weight variable.
    """
    initializer = initializers.get(initializer)
    if dtype is None:
        dtype = K.floatx()
    weight = K.variable(initializer(shape),
                        dtype=dtype,
                        name=name,
                        constraint=constraint)
    if regularizer is not None:
        self.add_loss(regularizer(weight))
    if trainable:
        self._trainable_weights.append(weight)
    else:
        self._non_trainable_weights.append(weight)
    return weight

As the code above shows, for a parameter created via add_weight, the regularizer (if any) is used to compute a loss term, which is registered through self.add_loss(regularizer(weight)). If trainable is True, the resulting weight is appended to self._trainable_weights; otherwise it goes into self._non_trainable_weights.
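The bookkeeping in add_weight can be sketched in a few lines of plain Python. This is a hypothetical miniature, not the Keras implementation: np.zeros stands in for K.variable(initializer(shape)), and the L2 regularizer is just an example callable.

```python
import numpy as np

class MiniLayer:
    """Toy illustration of add_weight's bookkeeping only."""

    def __init__(self):
        self._trainable_weights = []
        self._non_trainable_weights = []
        self.losses = []

    def add_loss(self, loss):
        self.losses.append(loss)

    def add_weight(self, shape, regularizer=None, trainable=True):
        weight = np.zeros(shape)   # stand-in for K.variable(initializer(shape))
        if regularizer is not None:
            # The regularizer turns the weight into a scalar loss term.
            self.add_loss(regularizer(weight))
        if trainable:
            self._trainable_weights.append(weight)
        else:
            self._non_trainable_weights.append(weight)
        return weight

def l2(w):
    """An example L2 regularizer: 0.01 * sum of squares."""
    return 0.01 * np.sum(w ** 2)

layer = MiniLayer()
w = layer.add_weight((3, 2), regularizer=l2, trainable=True)

assert len(layer._trainable_weights) == 1   # weight registered for backprop
assert len(layer.losses) == 1               # regularizer produced one loss term
```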

For the details of the training process, see: keras/engine/training.py
