Thoughts on the number of output neurons in a neural network

2023-05-08 21:23:27

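The author's question about the number of output neurons in a neural network started from this: "Why does a neural network that recognizes handwritten digits need 10 outputs rather than 4?"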


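In fact, these are two different encoding schemes, and both network architectures are workable. We use ten neurons rather than four to represent the classes because that is the empirical choice: in practice, the network with ten outputs performs better.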

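The specific reasoning is as follows:

With four outputs, each neuron in the output layer has to learn an assertion along the lines of "the difference between a handwritten 1 and a handwritten 2";

With ten outputs, each neuron in the output layer only has to learn an assertion such as "decide whether an image is a 1".

Describing whether an image is a particular digit is much easier than describing the difference between two digits.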
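To make the two encodings concrete, here is a minimal sketch of the target vectors each scheme would use for the digits 0 to 9. The function names (one_hot, binary_code, digits_activating) are hypothetical, and writing the four-bit code with the most significant bit first is an assumption; the post does not fix a bit order.

```python
def one_hot(digit: int, num_classes: int = 10) -> list[int]:
    """Ten-output scheme: a single 1 at index `digit`, 0 everywhere else."""
    if not 0 <= digit < num_classes:
        raise ValueError(f"digit must be in [0, {num_classes})")
    return [1 if i == digit else 0 for i in range(num_classes)]


def binary_code(digit: int, num_bits: int = 4) -> list[int]:
    """Four-output scheme: binary representation of `digit`.

    Assumption: most significant bit first.
    """
    if not 0 <= digit < 2 ** num_bits:
        raise ValueError(f"digit must be in [0, {2 ** num_bits})")
    return [(digit >> shift) & 1 for shift in range(num_bits - 1, -1, -1)]


def digits_activating(bit_index: int) -> list[int]:
    """Digits for which output neuron `bit_index` of the four-output scheme should be 1."""
    return [d for d in range(10) if binary_code(d)[bit_index] == 1]


if __name__ == "__main__":
    for d in range(10):
        print(d, one_hot(d), binary_code(d))
    for bit in range(4):
        print(f"binary output neuron {bit} must fire for digits {digits_activating(bit)}")
```

In the ten-output scheme each neuron answers a question about one digit only. In the four-output scheme a single neuron has to fire for a set of digits that look nothing alike (for example 1, 3, 5, 7 and 9 for the last bit), which is one way to see why its job is harder.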

(The question comes from Neural Networks and Deep Learning.)

You might wonder why we use 10 output neurons. After all, the goal of the network is to tell us which digit (0, 1, 2, …, 9) corresponds to the input image. A seemingly natural way of doing that is to use just 4 output neurons, treating each neuron as taking on a binary value, depending on whether the neuron's output is closer to 0 or to 1. Four neurons are enough to encode the answer, since 2^4 = 16 is more than the 10 possible values for the input digit. Why should our network use 10 neurons instead? Isn't that inefficient? The ultimate justification is empirical: we can try out both network designs, and it turns out that, for this particular problem, the network with 10 output neurons learns to recognize digits better than the network with 4 output neurons. But that leaves us wondering why using 10 output neurons works better. Is there some heuristic that would tell us in advance that we should use the 10-output encoding instead of the 4-output encoding?

……
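The quoted passage describes reading a 4-neuron output by checking whether each value is closer to 0 or to 1. A rough sketch of how the two output layers might be decoded follows. The function names, the 0.5 threshold, the most-significant-bit-first order, and the sample activations are all illustrative assumptions, not taken from the book.

```python
from typing import Optional, Sequence


def decode_one_hot(outputs: Sequence[float]) -> int:
    """Ten-output scheme: the predicted digit is the index of the largest activation."""
    return max(range(len(outputs)), key=lambda i: outputs[i])


def decode_binary(outputs: Sequence[float], threshold: float = 0.5) -> Optional[int]:
    """Four-output scheme: round each activation to a bit, then read the bits as a number.

    Assumption: most significant bit first. Returns None for the codes
    10..15, which do not correspond to any digit.
    """
    value = 0
    for activation in outputs:
        bit = 1 if activation >= threshold else 0
        value = (value << 1) | bit
    return value if value < 10 else None


if __name__ == "__main__":
    ten = [0.02, 0.01, 0.05, 0.90, 0.03, 0.01, 0.02, 0.04, 0.10, 0.01]
    print(decode_one_hot(ten))                   # index of the 0.90 entry
    print(decode_binary([0.1, 0.2, 0.8, 0.9]))   # bits 0011
    print(decode_binary([0.9, 0.8, 0.1, 0.2]))   # bits 1100, not a digit
    print(2 ** 4 - 10, "of the 16 four-bit codes are unused")
```

With four outputs, 6 of the 16 possible bit patterns map to no digit at all, so the decoder needs a rule for invalid codes; with ten outputs every argmax result is a valid digit.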
