當(dāng)前位置：首頁(yè) > 人工智能 > pytorch >内容正文

pytorch

（pytorch-深度学习系列）CNN二维卷积层-学习笔记

發(fā)布時(shí)間：2024/8/23 pytorch 30 豆豆

生活随笔收集整理的這篇文章主要介紹了（pytorch-深度学习系列）CNN二维卷积层-学习笔记小編覺(jué)得挺不錯(cuò)的,現(xiàn)在分享給大家,幫大家做個(gè)參考.

二維卷積層

在二維互相關(guān)運(yùn)算中，卷積窗口從輸入數(shù)組的最左上方開(kāi)始，按從左往右、從上往下的順序，依次在輸入數(shù)組上滑動(dòng)。當(dāng)卷積窗口滑動(dòng)到某一位置時(shí)，窗口中的輸入子數(shù)組與核數(shù)組按元素相乘并求和，得到輸出數(shù)組中相應(yīng)位置的元素。

例如：
輸入為：input:
$[012345678]\begin{bmatrix} 0 & 1 & 2 \\ 3 & 4 &5 \\ 6 & 7 & 8 \end{bmatrix}$

核為：kernel:
$[0123]\begin{bmatrix} 0 & 1 \\ 2 & 3 \end{bmatrix}$

則輸出為：output

$\begin{bmatrix} 0 & 1 & 2 \\ 3 & 4 &5 \\ 6 & 7 & 8 \end{bmatrix} * \begin{bmatrix} 0 & 1 \\ 2 & 3 \end{bmatrix} = \begin{bmatrix} 19 & 25 \\ 37 & 43 \end{bmatrix}$

輸出數(shù)組高和寬分別為2，其中的4個(gè)元素由二維互相關(guān)運(yùn)算得出：

$0×0+1×1+3×2+4×3=19,1×0+2×1+4×2+5×3=25,3×0+4×1+6×2+7×3=37,4×0+5×1+7×2+8×3=43.0\times0+1\times1+3\times2+4\times3=19,\\ 1\times0+2\times1+4\times2+5\times3=25,\\ 3\times0+4\times1+6\times2+7\times3=37,\\ 4\times0+5\times1+7\times2+8\times3=43.$

設(shè)計(jì)函數(shù)實(shí)現(xiàn)卷積層的計(jì)算：

import torch from torch import nndef corr2d(X, K): h, w = K.shapeY = torch.zeros((X.shape[0] - h + 1, X.shape[1] - w + 1))for i in range(Y.shape[0]):for j in range(Y.shape[1]):Y[i, j] = (X[i: i + h, j: j + w] * K).sum()return Y

構(gòu)造輸入數(shù)據(jù)以及卷積核驗(yàn)證上述計(jì)算：

X = torch.tensor([[0, 1, 2], [3, 4, 5], [6, 7, 8]]) K = torch.tensor([[0, 1], [2, 3]]) corr2d(X, K)

輸出：

tensor([[19., 25.],[37., 43.]])

自定義二維卷積層

class conv2d(nn.Module):def __init(self, kernel_size):super(conv2d, self).__init__()self.weight = nn.Parameter(torch.randn(kernel_size))self.bias = nn.Parameter(torch.randn(1))def forward(self, x):return corr2d(x, self.weight) + self.bias

這里我們使用了corr2d函數(shù)來(lái)實(shí)現(xiàn)一個(gè)自定義的二維卷積層，這個(gè)網(wǎng)絡(luò)層有weight、bias兩個(gè)參數(shù)，前向計(jì)算函數(shù)是直接調(diào)用corr2d函數(shù)再加上偏差。

使用卷積層做圖像中物體邊緣的檢測(cè)

檢測(cè)圖像中物體的邊緣，即找到像素變化的位置。首先我們構(gòu)造一張 $6×86\times 8$ 的圖像（即高和寬分別為6像素和8像素的圖像）。它中間4列為黑（0），其余為白（1）。

X = torch.ones(6, 8) X[:, 2:6] = 0

構(gòu)造一個(gè)高和寬分別為1和2的卷積核K。當(dāng)它與輸入做互相關(guān)運(yùn)算時(shí)，如果橫向相鄰元素相同，輸出為0；否則輸出為非0。

K = torch.tensor([[1, -1]])

將輸入X和我們?cè)O(shè)計(jì)的卷積核K做互相關(guān)運(yùn)算。計(jì)算將從白到黑的邊緣和從黑到白的邊緣分別檢測(cè)成了1和-1。其余部分的輸出全是0。

Y = corr2d(X, K) print(Y) tensor([[ 0., 1., 0., 0., 0., -1., 0.],[ 0., 1., 0., 0., 0., -1., 0.],[ 0., 1., 0., 0., 0., -1., 0.],[ 0., 1., 0., 0., 0., -1., 0.],[ 0., 1., 0., 0., 0., -1., 0.],[ 0., 1., 0., 0., 0., -1., 0.]])

通過(guò)這個(gè)例子我們能看出，卷積層可以通過(guò)重復(fù)使用卷積核有效的表示局部空間。

通過(guò)數(shù)據(jù)學(xué)習(xí)出卷積核

使用物體邊緣檢測(cè)中的輸入數(shù)據(jù)X和輸出數(shù)據(jù)Y來(lái)學(xué)習(xí)我們構(gòu)造的核數(shù)組K。
我們首先構(gòu)造一個(gè)卷積層，其卷積核將被初始化成隨機(jī)數(shù)組。接下來(lái)在每一次迭代中，我們使用平方誤差來(lái)比較Y和卷積層的輸出，然后計(jì)算梯度來(lái)更新權(quán)重。

model = conv2d(kernel_size=(1, 2)) step = 20 #訓(xùn)練步數(shù) lr = 0.01 #學(xué)習(xí)率 for i in range(step):y_hat= model(X)l = ((y_hat - y)**2).sum()l.backward()#進(jìn)行梯度下降model.weight.data -= lr * model.wight.gradmodel.bias.data -= lr * model.bias.gradi+1# 梯度清0conv2d.weight.grad.fill_(0)conv2d.bias.grad.fill_(0)if (i+1)%5 == 0:print('step: %d , loss: %.3f' %(i+1, l.item()))

總結(jié)

以上是生活随笔為你收集整理的（pytorch-深度学习系列）CNN二维卷积层-学习笔记的全部?jī)?nèi)容，希望文章能夠幫你解決所遇到的問(wèn)題。

如果覺(jué)得生活随笔網(wǎng)站內(nèi)容還不錯(cuò)，歡迎將生活随笔推薦給好友。

上一篇：贝加尔湖，冰雪奇缘之旅
下一篇：科研这条路：一位数学博士给本科生的建议