天天看點

mxnet mobilenet SSD

mxnet 源碼裡面有SSD的example, 現在支援VGG,RESNET,等網絡,沒有mobilenet的支援, 但是在另外一個repo裡面有,就是這個example的作者自己維護的一個repo。是以這次對代碼稍加修改來采用mobilenet_v2做骨幹網絡來訓練SSD。

同時,照貓畫虎,可以選擇其他新型預訓練網絡來train 你自己的SSD。

wget http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar
           

首先下載下傳VOC資料集到incubator-mxnet/example/ssd/data , 并解壓三個壓縮包,關于VOC資料集的介紹可以自行baidu。

然後運作incubator-mxnet/example/ssd/tools prepare_pascal.sh的shell腳本,可以生成lst,rec和idx, mxnet的dataiter需要。

如果使用VGG作為骨幹網絡來訓練的話,可以從以下位址下載下傳VGG-reduced的json和param,到incubator-mxnet/example/ssd/model 

下載下傳位址:https://github.com/zhreshold/mxnet-ssd/releases/download/v0.2-beta/vgg16_reduced.zip

最簡單的方式 在incubator-mxnet/example/ssd  目錄下執行 python train 就開始訓練了。不需要更改代碼

然後,稍加修改代碼,用mobilenet-v2 來train我們的SSD。

首先要先找一個image-net的預訓練模型,github上有,拿來主義:

直接 git clone https://github.com/KeyKy/mobilenet-mxnet.git

記得給作者星星

然後把clone下來的代碼裡的 mobilenet_v2-0000.params,mobilenet_v2-symbol.json 放到incubator-mxnet/example/ssd/model 下,

把mobilenet_v2.py 放到incubator-mxnet/example/ssd/symbol 下

然後就要追代碼了,

我們先找到關鍵代碼的位置:

incubator-mxnet/example/ssd/symbol/symbol_factory.py

if network == 'vgg16_reduced':
        if data_shape >= 448:
            from_layers = ['relu4_3', 'relu7', '', '', '', '', '']
            num_filters = [512, -1, 512, 256, 256, 256, 256]
            strides = [-1, -1, 2, 2, 2, 2, 1]
            pads = [-1, -1, 1, 1, 1, 1, 1]
            sizes = [[.07, .1025], [.15,.2121], [.3, .3674], [.45, .5196], [.6, .6708], \
                [.75, .8216], [.9, .9721]]
            ratios = [[1,2,.5], [1,2,.5,3,1./3], [1,2,.5,3,1./3], [1,2,.5,3,1./3], \
                [1,2,.5,3,1./3], [1,2,.5], [1,2,.5]]
            normalizations = [20, -1, -1, -1, -1, -1, -1]
            steps = [] if data_shape != 512 else [x / 512.0 for x in
                [8, 16, 32, 64, 128, 256, 512]]
        else:
            from_layers = ['relu4_3', 'relu7', '', '', '', '']
            num_filters = [512, -1, 512, 256, 256, 256]
            strides = [-1, -1, 2, 2, 1, 1]
            pads = [-1, -1, 1, 1, 0, 0]
            sizes = [[.1, .141], [.2,.272], [.37, .447], [.54, .619], [.71, .79], [.88, .961]]
            ratios = [[1,2,.5], [1,2,.5,3,1./3], [1,2,.5,3,1./3], [1,2,.5,3,1./3], \
                [1,2,.5], [1,2,.5]]
            normalizations = [20, -1, -1, -1, -1, -1]
            steps = [] if data_shape != 300 else [x / 300.0 for x in [8, 16, 32, 64, 100, 300]]
        if not (data_shape == 300 or data_shape == 512):
            logging.warn('data_shape %d was not tested, use with caucious.' % data_shape)
        return locals()
    elif network == 'inceptionv3':
        from_layers = ['ch_concat_mixed_7_chconcat', 'ch_concat_mixed_10_chconcat', '', '', '', '']
        num_filters = [-1, -1, 512, 256, 256, 128]
        strides = [-1, -1, 2, 2, 2, 2]
        pads = [-1, -1, 1, 1, 1, 1]
        sizes = [[.1, .141], [.2,.272], [.37, .447], [.54, .619], [.71, .79], [.88, .961]]
        ratios = [[1,2,.5], [1,2,.5,3,1./3], [1,2,.5,3,1./3], [1,2,.5,3,1./3], \
            [1,2,.5], [1,2,.5]]
        normalizations = -1
        steps = []
        return locals()
    elif network == 'resnet50':
        num_layers = 50
        image_shape = '3,224,224'  # resnet require it as shape check
        network = 'resnet'
        from_layers = ['_plus12', '_plus15', '', '', '', '']
        num_filters = [-1, -1, 512, 256, 256, 128]
        strides = [-1, -1, 2, 2, 2, 2]
        pads = [-1, -1, 1, 1, 1, 1]
        sizes = [[.1, .141], [.2,.272], [.37, .447], [.54, .619], [.71, .79], [.88, .961]]
        ratios = [[1,2,.5], [1,2,.5,3,1./3], [1,2,.5,3,1./3], [1,2,.5,3,1./3], \
            [1,2,.5], [1,2,.5]]
        normalizations = -1
        steps = []
        return locals()
    elif network == 'resnet101':
        num_layers = 101
        image_shape = '3,224,224'
        network = 'resnet'
        from_layers = ['_plus29', '_plus32', '', '', '', '']
        num_filters = [-1, -1, 512, 256, 256, 128]
        strides = [-1, -1, 2, 2, 2, 2]
        pads = [-1, -1, 1, 1, 1, 1]
        sizes = [[.1, .141], [.2,.272], [.37, .447], [.54, .619], [.71, .79], [.88, .961]]
        ratios = [[1,2,.5], [1,2,.5,3,1./3], [1,2,.5,3,1./3], [1,2,.5,3,1./3], \
            [1,2,.5], [1,2,.5]]
        normalizations = -1
        steps = []
        return locals()
    elif network=='mobilenet_v2':
        image_shape = '3,224,224'
        network = 'mobilenet_v2'
        from_layers = ['relu6_1_expand', 'relu6_4', '', '', '', '']
        num_filters = [-1, -1, 512, 256, 256, 128]
        strides = [-1, -1, 2, 2, 2, 2]
        pads = [-1, -1, 1, 1, 1, 1]
        sizes = [[.1, .141], [.2, .272], [.37, .447], [.54, .619], [.71, .79], [.88, .961]]
        ratios = [[1, 2, .5], [1, 2, .5, 3, 1. / 3], [1, 2, .5, 3, 1. / 3], [1, 2, .5, 3, 1. / 3], \
                  [1, 2, .5], [1, 2, .5]]
        normalizations = -1
        steps = []
        return locals()
    else:
        msg = 'No configuration found for %s with data_shape %d, but you can make it by yourself, its very easy..' % (network, data_shape)
        raise NotImplementedError(msg)
           

其中mobilenet_v2便是需要加的一段代碼,大體上跟example裡面有的幾個是一樣的,包括size,ratio 采用和前面一樣就可以,當然你也可以更改,具體意義參考SSD論文。

現在,萬事俱備,開始train

cd 到 incubator-mxnet/example/ssd 下:

python train.py --network mobilenet_v2 --pretrained mobilenet_v2 --lr 0.01 --batch-size 64 --epoch 0

train.py 的參數有很多,可以進去自己看根據需要指定就OK。

有可能會有一些錯誤報出來,根據具體情況debug就可以。

一小段LOG,僅供參考

INFO:root:Epoch[46] Validation-aeroplane=0.599531

INFO:root:Epoch[46] Validation-bicycle=0.655078

INFO:root:Epoch[46] Validation-bird=0.566540

INFO:root:Epoch[46] Validation-boat=0.352134

INFO:root:Epoch[46] Validation-bottle=0.198845

INFO:root:Epoch[46] Validation-bus=0.689774

INFO:root:Epoch[46] Validation-car=0.665570

INFO:root:Epoch[46] Validation-cat=0.779215

INFO:root:Epoch[46] Validation-chair=0.343104

INFO:root:Epoch[46] Validation-cow=0.570960

INFO:root:Epoch[46] Validation-diningtable=0.544843

INFO:root:Epoch[46] Validation-dog=0.725205

INFO:root:Epoch[46] Validation-horse=0.776931

INFO:root:Epoch[46] Validation-motorbike=0.699201

INFO:root:Epoch[46] Validation-person=0.587530

INFO:root:Epoch[46] Validation-pottedplant=0.267101

INFO:root:Epoch[46] Validation-sheep=0.541729

INFO:root:Epoch[46] Validation-sofa=0.600170

INFO:root:Epoch[46] Validation-train=0.754583

INFO:root:Epoch[46] Validation-tvmonitor=0.555712

INFO:root:Epoch[46] Validation-mAP=0.573688

後面會進行一些模型壓縮方面的工作,如果效果不過也會繼續放出來。