Code source: https://github.com/eriklindernoren/ML-From-Scratch
Concrete implementation of the Conv2D convolutional layer (with stride and padding): https://www.cnblogs.com/xiximayou/p/12706576.html
Implementation of activation functions (sigmoid, softmax, tanh, relu, leakyrelu, elu, selu, softplus): https://www.cnblogs.com/xiximayou/p/12713081.html
Definition of loss functions (mean squared error, cross-entropy loss): https://www.cnblogs.com/xiximayou/p/12713198.html
Implementation of optimizers (SGD, Nesterov, Adagrad, Adadelta, RMSprop, Adam): https://www.cnblogs.com/xiximayou/p/12713594.html
In this section we continue working through the backward pass of the convolutional layer, following the code.
Only the forward-pass and backward-pass code of Conv2D is shown here:
def forward_pass(self, X, training=True):
    batch_size, channels, height, width = X.shape
    self.layer_input = X
    # Turn image shape into column shape
    # (enables dot product between input and weights)
    self.X_col = image_to_column(X, self.filter_shape, stride=self.stride, output_shape=self.padding)
    # Turn weights into column shape
    self.W_col = self.W.reshape((self.n_filters, -1))
    # Calculate output
    output = self.W_col.dot(self.X_col) + self.w0
    # Reshape into (n_filters, out_height, out_width, batch_size)
    output = output.reshape(self.output_shape() + (batch_size, ))
    # Redistribute axes so that batch size comes first
    return output.transpose(3, 0, 1, 2)

def backward_pass(self, accum_grad):
    # Reshape accumulated gradient into column shape
    accum_grad = accum_grad.transpose(1, 2, 3, 0).reshape(self.n_filters, -1)

    if self.trainable:
        # Take the dot product between the column-shaped accumulated gradient and the
        # column-shaped layer input to get the gradient with respect to the layer weights
        grad_w = accum_grad.dot(self.X_col.T).reshape(self.W.shape)
        # The gradient with respect to the bias terms is a sum, as in the Dense layer
        grad_w0 = np.sum(accum_grad, axis=1, keepdims=True)

        # Update the layer's weights
        self.W = self.W_opt.update(self.W, grad_w)
        self.w0 = self.w0_opt.update(self.w0, grad_w0)

    # Recalculate the gradient which will be propagated back to the previous layer
    accum_grad = self.W_col.T.dot(accum_grad)
    # Reshape from column shape to image shape
    accum_grad = column_to_image(accum_grad,
                                 self.layer_input.shape,
                                 self.filter_shape,
                                 stride=self.stride,
                                 output_shape=self.padding)

    return accum_grad
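To make the column trick concrete, here is a minimal self-contained sketch (a toy im2col of my own for the stride-1, no-padding case, not the repo's image_to_column) showing that flattening each patch into a column and taking one matrix product reproduces a naive loop-based convolution:

import numpy as np

def naive_conv2d(X, W):
    # X: (channels, height, width); W: (n_filters, channels, fh, fw)
    n_filters, channels, fh, fw = W.shape
    out_h, out_w = X.shape[1] - fh + 1, X.shape[2] - fw + 1
    out = np.zeros((n_filters, out_h, out_w))
    for f in range(n_filters):
        for i in range(out_h):
            for j in range(out_w):
                out[f, i, j] = np.sum(X[:, i:i+fh, j:j+fw] * W[f])
    return out

def toy_im2col(X, fh, fw):
    # Stack every (channels, fh, fw) patch of X as one column
    channels, height, width = X.shape
    out_h, out_w = height - fh + 1, width - fw + 1
    cols = np.zeros((channels * fh * fw, out_h * out_w))
    for i in range(out_h):
        for j in range(out_w):
            cols[:, i * out_w + j] = X[:, i:i+fh, j:j+fw].ravel()
    return cols

X = np.random.randn(3, 5, 5)      # one 3-channel 5x5 image
W = np.random.randn(4, 3, 3, 3)   # four 3x3 filters

X_col = toy_im2col(X, 3, 3)       # shape (27, 9)
W_col = W.reshape(4, -1)          # shape (4, 27)
out = W_col.dot(X_col).reshape(4, 3, 3)

assert np.allclose(out, naive_conv2d(X, W))   # identical to the naive loops

The same reshaping is what makes the backward pass cheap: since output = W_col.dot(X_col), the weight gradient is just accum_grad.dot(X_col.T) and the input gradient is W_col.T.dot(accum_grad).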
The network itself is assembled in neural_network.py, where training over one batch is implemented as follows:
def train_on_batch(self, X, y):
    """ Single gradient update over one batch of samples """
    y_pred = self._forward_pass(X)
    loss = np.mean(self.loss_function.loss(y, y_pred))
    acc = self.loss_function.acc(y, y_pred)
    # Calculate the gradient of the loss function wrt y_pred
    loss_grad = self.loss_function.gradient(y, y_pred)
    # Backpropagate. Update weights
    self._backward_pass(loss_grad=loss_grad)
    return loss, acc
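A hypothetical usage sketch of train_on_batch (model, X_train, and y_train are placeholders, assuming model is a NeuralNetwork instance built as in the repo; the slicing-based batching here is my own, not the repo's batch iterator):

import numpy as np

batch_size = 64
for epoch in range(10):
    idx = np.random.permutation(len(X_train))         # reshuffle every epoch
    for start in range(0, len(X_train), batch_size):
        batch = idx[start:start + batch_size]
        loss, acc = model.train_on_batch(X_train[batch], y_train[batch])
    print('epoch %d: loss=%.4f, acc=%.4f' % (epoch, loss, acc))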
We also need to look at self._forward_pass and self._backward_pass:
def _forward_pass(self, X, training=True):
    """ Calculate the output of the NN """
    layer_output = X
    for layer in self.layers:
        layer_output = layer.forward_pass(layer_output, training)
    return layer_output

def _backward_pass(self, loss_grad):
    """ Propagate the gradient 'backwards' and update the weights in each layer """
    for layer in reversed(self.layers):
        loss_grad = layer.backward_pass(loss_grad)
As we can see, the forward pass computes the output of every layer in self.layers, which may include convolution, pooling, activation, and normalization layers; the backward pass then propagates the gradient through each layer from back to front. Take a network consisting of a convolutional layer, a fully connected layer, and a loss function as an example. After the forward pass completes, the first gradient obtained is that of the loss function. It is passed into the fully connected layer, which computes its own gradient and hands it on to the convolutional layer, at which point the convolutional layer's backward_pass() method is called. Inside backward_pass(), if self.trainable is set, the gradients with respect to the weights W and the bias w0 are computed, the optimizers W_opt and w0_opt update the parameters, and the gradient with respect to the previous layer is then computed. Finally, there is the column_to_image() method.
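As a toy illustration of this back-to-front chaining (my own minimal sketch, not the repo's classes), consider two "layers" that each just scale their input; the backward pass multiplies the incoming gradient by each layer's local derivative in reverse order, exactly as _backward_pass does:

import numpy as np

class Scale:
    """ Toy layer: y = a * x, so the local derivative dy/dx is a """
    def __init__(self, a):
        self.a = a
    def forward_pass(self, x):
        return self.a * x
    def backward_pass(self, accum_grad):
        # Chain rule: gradient w.r.t. this layer's input
        return accum_grad * self.a

layers = [Scale(2.0), Scale(3.0)]

x = np.array([1.0, -1.0])
out = x
for layer in layers:                # forward, front to back
    out = layer.forward_pass(out)

loss_grad = 2 * out                 # gradient of loss = sum(out**2)
for layer in reversed(layers):      # backward, back to front
    loss_grad = layer.backward_pass(loss_grad)

# out = 6x and loss = sum((6x)**2), so dloss/dx = 72x
assert np.allclose(loss_grad, 72 * x)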
def column_to_image(cols, images_shape, filter_shape, stride, output_shape='same'):
    batch_size, channels, height, width = images_shape
    pad_h, pad_w = determine_padding(filter_shape, output_shape)

    height_padded = height + np.sum(pad_h)
    width_padded = width + np.sum(pad_w)
    # Must start from zeros: np.add.at below accumulates into this array
    images_padded = np.zeros((batch_size, channels, height_padded, width_padded))

    # Calculate the indices where the dot products were applied between weights
    # and the image
    k, i, j = get_im2col_indices(images_shape, filter_shape, (pad_h, pad_w), stride)

    cols = cols.reshape(channels * np.prod(filter_shape), -1, batch_size)
    cols = cols.transpose(2, 0, 1)
    # Add column content to the images at the indices
    np.add.at(images_padded, (slice(None), k, i, j), cols)

    # Return image without padding
    return images_padded[:, :, pad_h[0]:height+pad_h[0], pad_w[0]:width+pad_w[0]]
This method undoes the reshaping that image_to_column() performed earlier to make the convolution easy to compute, scattering the columns back into the padded image layout before the padding is stripped off.
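The essential step is np.add.at. When the stride is smaller than the filter, neighbouring windows overlap, so a single pixel of the padded image receives gradient contributions from every column that contained it, and these must be accumulated rather than overwritten. A tiny illustration of the difference (my own, independent of the repo):

import numpy as np

idx = np.array([0, 1, 1, 2])    # index 1 appears in two overlapping "windows"

img = np.zeros(4)
img[idx] += 1.0                 # fancy-index assignment: duplicate writes collapse
print(img)                      # [1. 1. 1. 0.] -- one contribution is lost

img = np.zeros(4)
np.add.at(img, idx, 1.0)        # unbuffered add: duplicates accumulate
print(img)                      # [1. 2. 1. 0.]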
The various shape transformations involved in these computations can be quite a headache, and you will keep running into unfamiliar NumPy functions that require looking up the documentation. Grasping the overall process is enough; the details mainly serve to deepen your understanding of the underlying ideas.