Table of Contents
- Overview
  - ReLU activation function
  - Sigmoid activation function
  - Tanh activation function
- ReLU: main functions
  - Forward_cpu function
  - Backward_cpu function
- Sigmoid: main functions
  - Forward_cpu function
  - Backward_cpu function
- Tanh: main functions
  - Forward_cpu function
  - Backward_cpu function
Overview
ReLU activation function:
ReLU lets the network introduce sparsity on its own; without any pre-training, networks that use ReLU activations generally outperform those built on the other activation functions.
Mathematical expression: y = max(0, x)
Sigmoid activation function:
During neural network learning, the sigmoid activation pushes salient features toward the central region and non-salient features toward the two saturated sides.
Mathematical expression: y = 1 / (1 + exp(−x))
Tanh activation function:
Tanh keeps the output a nonlinear, monotonically rising and falling function of the input; it saturates later than sigmoid, which gives the network better fault tolerance.
Mathematical expression: y = (exp(x) − exp(−x)) / (exp(x) + exp(−x))
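As a quick sanity check on the three expressions above, here is a minimal standalone C++ sketch (our own helper names, not Caffe code) that evaluates each formula at a few sample points:

#include <algorithm>
#include <cmath>
#include <cstdio>

// Plain restatements of the three formulas above; not Caffe code.
double relu(double x)       { return std::max(0.0, x); }
double sigmoid_fn(double x) { return 1.0 / (1.0 + std::exp(-x)); }
double tanh_fn(double x)    { return (std::exp(x) - std::exp(-x))
                                   / (std::exp(x) + std::exp(-x)); }

int main() {
  const double xs[] = {-2.0, -0.5, 0.0, 0.5, 2.0};
  for (double x : xs) {
    std::printf("x=%5.2f  relu=%5.2f  sigmoid=%.4f  tanh=%.4f\n",
                x, relu(x), sigmoid_fn(x), tanh_fn(x));
  }
  return 0;
}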
ReLU: main functions
Forward_cpu function:
template <typename Dtype>
void ReLULayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  const Dtype* bottom_data = bottom[0]->cpu_data();
  Dtype* top_data = top[0]->mutable_cpu_data();
  const int count = bottom[0]->count();
  // Slope used when the input is below 0; defaults to 0.
  Dtype negative_slope = this->layer_param_.relu_param().negative_slope();
  for (int i = 0; i < count; ++i) {
    // Slope is 1 for positive inputs and negative_slope otherwise.
    top_data[i] = std::max(bottom_data[i], Dtype(0))
        + negative_slope * std::min(bottom_data[i], Dtype(0));
  }
}
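Note that the branch-free form std::max(x, 0) + negative_slope * std::min(x, 0) is exactly the piecewise (leaky) ReLU. A minimal sketch, using our own function names rather than Caffe's, that confirms the two forms agree:

#include <algorithm>
#include <cassert>

// Branch-free form used by Forward_cpu vs. the piecewise definition.
double relu_branchfree(double x, double slope) {
  return std::max(x, 0.0) + slope * std::min(x, 0.0);
}
double relu_piecewise(double x, double slope) {
  return x > 0.0 ? x : slope * x;
}

int main() {
  const double slope = 0.1;  // hypothetical negative_slope setting
  for (double x = -3.0; x <= 3.0; x += 0.25) {
    assert(relu_branchfree(x, slope) == relu_piecewise(x, slope));
  }
  return 0;
}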
Backward_cpu function:
template <typename Dtype>
void ReLULayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
    const vector<bool>& propagate_down,
    const vector<Blob<Dtype>*>& bottom) {
  if (propagate_down[0]) {
    const Dtype* bottom_data = bottom[0]->cpu_data();
    const Dtype* top_diff = top[0]->cpu_diff();
    Dtype* bottom_diff = bottom[0]->mutable_cpu_diff();
    const int count = bottom[0]->count();
    Dtype negative_slope = this->layer_param_.relu_param().negative_slope();
    for (int i = 0; i < count; ++i) {
      // Chain rule: the local derivative is 1 when x > 0 and
      // negative_slope when x <= 0 (the comparisons promote to 0/1).
      bottom_diff[i] = top_diff[i] * ((bottom_data[i] > 0)
          + negative_slope * (bottom_data[i] <= 0));
    }
  }
}
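The factor (bottom_data[i] > 0) + negative_slope * (bottom_data[i] <= 0) relies on the comparisons promoting to 0 or 1, yielding a derivative of 1 for positive inputs and negative_slope otherwise. Below is a standalone finite-difference check of that derivative (hypothetical names, not Caffe code; x = 0 is skipped since ReLU is not differentiable there):

#include <algorithm>
#include <cmath>
#include <cstdio>

double leaky_relu(double x, double slope) {
  return std::max(x, 0.0) + slope * std::min(x, 0.0);
}

// The derivative exactly as Backward_cpu writes it.
double leaky_relu_grad(double x, double slope) {
  return (x > 0) + slope * (x <= 0);
}

int main() {
  const double slope = 0.1, eps = 1e-6;
  const double xs[] = {-1.5, -0.3, 0.7, 2.0};
  for (double x : xs) {
    double numeric =
        (leaky_relu(x + eps, slope) - leaky_relu(x - eps, slope)) / (2 * eps);
    std::printf("x=%5.2f  analytic=%.4f  numeric=%.4f\n",
                x, leaky_relu_grad(x, slope), numeric);
  }
  return 0;
}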
Sigmoid: main functions
Forward_cpu function:
template <typename Dtype>
void SigmoidLayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  const Dtype* bottom_data = bottom[0]->cpu_data();
  Dtype* top_data = top[0]->mutable_cpu_data();
  const int count = bottom[0]->count();
  for (int i = 0; i < count; ++i) {
    top_data[i] = sigmoid(bottom_data[i]);
  }
}
The sigmoid helper function is defined as:
template <typename Dtype>
inline Dtype sigmoid(Dtype x) {
  return 1. / (1. + exp(-x));
}
Backward_cpu function:
Derivative:
dy/dx = −(1 / (1 + exp(−x))²) × (−exp(−x))
      = (1 / (1 + exp(−x))) × (1 − 1 / (1 + exp(−x)))
      = y × (1 − y)
template <typename Dtype>
void SigmoidLayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
    const vector<bool>& propagate_down,
    const vector<Blob<Dtype>*>& bottom) {
  if (propagate_down[0]) {
    const Dtype* top_data = top[0]->cpu_data();
    const Dtype* top_diff = top[0]->cpu_diff();
    Dtype* bottom_diff = bottom[0]->mutable_cpu_diff();
    const int count = bottom[0]->count();
    for (int i = 0; i < count; ++i) {
      // top_data already holds sigmoid(x), so no exp is recomputed here.
      const Dtype sigmoid_x = top_data[i];
      bottom_diff[i] = top_diff[i] * sigmoid_x * (1. - sigmoid_x);
    }
  }
}
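Notice that Backward_cpu never recomputes exp: the forward output sigmoid(x) is read back from top_data, and the derivative is formed as y(1 − y). A minimal sketch (standalone, not Caffe code) checking this identity against a finite difference:

#include <cmath>
#include <cstdio>

double sigmoid_fn(double x) { return 1.0 / (1.0 + std::exp(-x)); }

int main() {
  const double eps = 1e-6;
  const double xs[] = {-2.0, -0.5, 0.0, 1.5};
  for (double x : xs) {
    double y = sigmoid_fn(x);          // what Forward_cpu stores in top_data
    double analytic = y * (1.0 - y);   // the factor Backward_cpu applies to top_diff
    double numeric = (sigmoid_fn(x + eps) - sigmoid_fn(x - eps)) / (2.0 * eps);
    std::printf("x=%5.2f  y(1-y)=%.6f  numeric=%.6f\n", x, analytic, numeric);
  }
  return 0;
}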
Tanh: main functions
Forward_cpu function:
template <typename Dtype>
void TanHLayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  const Dtype* bottom_data = bottom[0]->cpu_data();
  Dtype* top_data = top[0]->mutable_cpu_data();
  const int count = bottom[0]->count();
  for (int i = 0; i < count; ++i) {
    top_data[i] = tanh(bottom_data[i]);
  }
}
Backward_cpu function:
Derivative:
dy/dx = ((exp(x) + exp(−x))² − (exp(x) − exp(−x))²) / (exp(x) + exp(−x))²
      = 1 − ((exp(x) − exp(−x)) / (exp(x) + exp(−x)))²
      = 1 − y²
template <typename Dtype>
void TanHLayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
    const vector<bool>& propagate_down,
    const vector<Blob<Dtype>*>& bottom) {
  if (propagate_down[0]) {
    const Dtype* top_data = top[0]->cpu_data();
    const Dtype* top_diff = top[0]->cpu_diff();
    Dtype* bottom_diff = bottom[0]->mutable_cpu_diff();
    const int count = bottom[0]->count();
    Dtype tanhx;
    for (int i = 0; i < count; ++i) {
      // top_data already holds tanh(x) from the forward pass.
      tanhx = top_data[i];
      bottom_diff[i] = top_diff[i] * (1 - tanhx * tanhx);
    }
  }
}
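As with the sigmoid layer, the backward pass reuses the cached forward output: since dy/dx = 1 − tanh²(x), top_data alone determines the local derivative. A short standalone check (not Caffe code):

#include <cmath>
#include <cstdio>

int main() {
  const double eps = 1e-6;
  const double xs[] = {-1.0, 0.0, 0.5, 2.0};
  for (double x : xs) {
    double y = std::tanh(x);          // cached in top_data by Forward_cpu
    double analytic = 1.0 - y * y;    // the factor applied to top_diff
    double numeric = (std::tanh(x + eps) - std::tanh(x - eps)) / (2.0 * eps);
    std::printf("x=%5.2f  1-y^2=%.6f  numeric=%.6f\n", x, analytic, numeric);
  }
  return 0;
}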