Tensorflow Day16 Autoencoder 實作

今日目標

實作 Autoencoder
比較輸入以及輸出

Github Ipython Notebook 好讀完整版

實作

定義 weight 以及 bias 函數

def weight_variable(shape, name):
     
         
      return tf.Variable(tf.truncated_normal(shape = shape, stddev = 
      0.1), name)
     
     
      def bias_variable(shape, name):
     
         
      return tf.Variable(tf.constant(
      0.1, shape = shape), name)

autoencoder 架構

初步的構想是建立一個七層的神經網路，在 encoder 維度會從 784 逐漸變成 300, 100, 5 (code layer)， decoder 的時候再逐漸轉回 100, 300, 784．而這個想法在現在對 tensorflow 比較熟悉的情形下是很容易實現的，以下就是我實現的程式碼片段

input -> 784 -> 300 -> 100 -> 5 (code layer) -> 100 -> 300 -> 784 -> output

x = tf.placeholder(tf.float32, shape = [
      None, 
      784])
     
     
      e_W_1 = weight_variable([
      784, 
      300], 
      "e_W_1")
     
     
      e_b_1 = bias_variable([
      300], 
      "e_b_1")
     
     
      e_layer1 = tf.nn.relu(tf.matmul(x, e_W_1) + e_b_1)
     
     
      e_W_2 = weight_variable([
      300, 
      100], 
      "e_W_2")
     
     
      e_b_2 = bias_variable([
      100], 
      "e_b_2")
     
     
      e_layer2 = tf.nn.relu(tf.matmul(e_layer1, e_W_2) + e_b_2)
     
     
      e_W_3 = weight_variable([
      100, 
      20], 
      "e_W_3")
     
     
      e_b_3 = bias_variable([
      20], 
      "e_b_3")
     
     
      code_layer = tf.nn.relu(tf.matmul(e_layer2, e_W_3) + e_b_3)
     
     
      d_W_1 = weight_variable([
      20, 
      100], 
      "d_W_1")
     
     
      d_b_1 = bias_variable([
      100], 
      "d_b_1")
     
     
      d_layer1 = tf.nn.relu(tf.matmul(code_layer, d_W_1) + d_b_1)
     
     
      d_W_2 = weight_variable([
      100, 
      300], 
      "d_W_2")
     
     
      d_b_2 = bias_variable([
      300], 
      "d_b_2")
     
     
      d_layer2 = tf.nn.relu(tf.matmul(d_layer1, d_W_2) + d_b_2)
     
     
      d_W_3 = weight_variable([
      300, 
      784], 
      "d_W_3")
     
     
      d_b_3 = bias_variable([
      784], 
      "d_b_3")
     
     
      output_layer = tf.nn.relu(tf.matmul(d_layer2, d_W_3) + d_b_3)

loss

loss 函數我使用了

mean square error

，而 optimizer 原本是使用

GradientDescentOptimizer

，但是做出來的 decode 結果變得非常糟糕 (如下)．後來上網搜尋了以後改用

RMSPropOptimizer

，以下會隨機選出數個數字來看看所做出來的結果．

loss = tf.reduce_mean(tf.pow(output_layer - x, 
      2))
     
     
      optimizer = tf.train.RMSPropOptimizer(
      0.01).minimize(loss)
     
     
      init_op = tf.global_variables_initializer()

training

sess = tf.InteractiveSession()
     
     
      sess.run(init_op)
     
     
      for i 
      in range(
      20000):
     
     
          batch = mnist.train.next_batch(
      50)
     
         
      if i%
      100 == 
      0:
     
     
              print(
      "step %d, loss %g"%(i, loss.eval(feed_dict={x:batch[
      0]})))
     
     
          optimizer.run(feed_dict={x: batch[
      0]})
     
         
     
     
      print(
      "final loss %g" % loss.eval(feed_dict={x: mnist.test.images}))

final loss 0.031768

成果

import numpy 
      as np
     
     
      import matplotlib.pyplot 
      as plt
     
     
      %matplotlib inline
     
     
      trainimg = mnist.train.images
     
     
      trainlabel = mnist.train.labels
     
     
      output_nd = output_layer.eval(feed_dict = {x: mnist.train.images})
     
     
      for i 
      in [
      0, 
      1, 
      2, 
      3, 
      4]:
     
     
          curr_img   = np.reshape(trainimg[i, :], (
      28, 
      28)) 
      # 28 by 28 matrix 
     
     
          ae_img = np.reshape(output_nd[i,:], (
      28, 
      28))
     
     
          curr_label = np.argmax(trainlabel[i, :] ) 
      # Label
     
     
          plt.matshow(curr_img, cmap=plt.get_cmap(
      'gray'))
     
     
          plt.matshow(ae_img, cmap=plt.get_cmap(
      'gray'))

Tensorflow Day16 Autoencoder 實作

調整 code layer 維度

而接下來當我想嘗試把 code layer 的維度調整為 2 維來看看

Tensorflow Day16 Autoencoder 實作

我們可以發現完全糊掉了，隻用二維來表示輸入數字是不太夠的．更進一步仔細看 code layer 的值，都已經接近零，表示這個網路已經算是死掉了，是以還要想想為什麼會造成這樣的結果．

今日心得

用 tensorflow 實作了 autoencoder，但是在降成兩維的效果並不好，需要再找找看有什麼方法可以幫助解決這個情形， google 上看到說 sparse autoencoder 可能可以幫助這個情形．

原文： https://blog.c1mone.com.tw/2017/01/02/tensorflow-note-day-18/

Tensorflow Day16 Autoencoder 實作

今日目標

實作

定義 weight 以及 bias 函數

autoencoder 架構

loss

training

成果

調整 code layer 維度

今日心得

繼續閱讀

anaconda下鏡像快速安裝tensorflow和keras

anaconda中科大鏡像

安裝tensorflow1.12出現illegal hardware instruction python錯誤1、問題2、定位問題3、問題解決4、驗證

Linux下Anaconda安裝tensorflow-gpu

tensorflow筆記實踐：正則化優化過拟合

TensorFlow運作模型——會話

【Ubuntu-Tensorflow】TF1.0到TF1.2出現“Key LSTM/basic_lstm_cell/bias not found in checkpoin”問題

linux下的conda安裝tensorflow

Linux環境下 TensorFlow的安裝和使用基于Anaconda的tensorflow安裝

MindSpore儲存模型的格式疑惑

【Tensorflow】Tensorflow介紹

鸢尾花分類

利用tensorflow建構AlexNet模型，實作小數量級的貓狗分類（隻有train）

ImportError: libcublas.so.10.0: cannot open shared object file: No such file解決方法

ImportError: libcublas.so.9.0: cannot open shared object file: No such file or directory（完美解決）

一種解決思路： ImportError: libcublas.so.10.0: cannot open shared object file: No such file