當(dāng)前位置：首頁 > 人工智能 > pytorch >内容正文

pytorch

（pytorch-深度学习系列）使用重复元素的网络（VGG）

發(fā)布時間：2024/8/23 pytorch 24 豆豆

生活随笔收集整理的這篇文章主要介紹了（pytorch-深度学习系列）使用重复元素的网络（VGG）小編覺得挺不錯的,現(xiàn)在分享給大家,幫大家做個參考.

使用重復(fù)元素的網(wǎng)絡(luò)（VGG）

VGG的名字來源于論文作者所在的實驗室Visual Geometry Group，VGG提出了可以通過重復(fù)使用簡單的基礎(chǔ)塊來構(gòu)建深度模型的思路。

VGG Block(VGG 塊)

VGG塊的組成規(guī)律是：連續(xù)使用數(shù)個相同的填充為1、窗口形狀為 $3×33\times 3$ 的卷積層后接上一個步幅為2、窗口形狀為 $2×22\times 2$ 的最大池化層。
卷積層保持輸入的高和寬不變，而池化層則對其減半。

用函數(shù)來實現(xiàn)基礎(chǔ)的VGG塊，它可以指定卷積層的數(shù)量和輸入輸出通道數(shù)。

import time import torch from torch import nn, optimdevice = torch.device('cuda' if torch.cuda.is_available() else 'cpu')def vgg_block(num_convs, in_channels, out_channels):blk = []for i in range(num_convs):if i == 0:blk.append(nn.Conv2d(in_channels, out_channels, kernel_size=3, padding=1))else:blk.append(nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1))blk.append(nn.ReLU())blk.append(nn.MaxPool2d(kernel_size=2, stride=2)) # 這里會使寬高減半return nn.Sequential(*blk)

對于給定的感受野（與輸出有關(guān)的輸入圖片的局部大小），采用堆積的小卷積核優(yōu)于采用大的卷積核，因為可以增加網(wǎng)絡(luò)深度來保證學(xué)習(xí)更復(fù)雜的模式，而且代價還比較小（參數(shù)更少）。例如，在VGG中，使用了3個3x3卷積核來代替7x7卷積核，使用了2個3x3卷積核來代替5*5卷積核，這樣做的主要目的是在保證具有相同感知野的條件下，提升了網(wǎng)絡(luò)的深度，在一定程度上提升了神經(jīng)網(wǎng)絡(luò)的效果。

VGG 網(wǎng)絡(luò)

VGG網(wǎng)絡(luò)由卷積層模塊后接全連接層模塊構(gòu)成。卷積層模塊串聯(lián)數(shù)個VGG Block，其超參數(shù)由變量conv_arch定義。該變量指定了每個VGG塊里卷積層個數(shù)和輸入輸出通道數(shù)。全連接模塊則跟AlexNet中的一樣。
構(gòu)造一個VGG網(wǎng)絡(luò)。

它有5個卷積塊，前2塊使用單卷積層，而后3塊使用雙卷積層。
第一塊的輸入輸出通道分別是1和64
之后每次對輸出通道數(shù)翻倍，直到變?yōu)?12

因為這個網(wǎng)絡(luò)使用了8個卷積層和3個全連接層，所以經(jīng)常被稱為VGG-11。

conv_arch = ((1, 1, 64), (1, 64, 128), (2, 128, 256), (2, 256, 512), (2, 512, 512)) # 經(jīng)過5個VGG Block, 寬高會減半5次, 變成 224/32 = 7 fc_features = 512 * 7 * 7 # c * w * h fc_hidden_units = 4096 # 任意設(shè)定

實現(xiàn)VGG-11

def vgg(conv_arch, fc_features, fc_hidden_units=4096):net = nn.Sequential()# 卷積層部分for i, (num_convs, in_channels, out_channels) in enumerate(conv_arch):# 每經(jīng)過一個vgg_block都會使寬高減半net.add_module("vgg_block_" + str(i+1), vgg_block(num_convs, in_channels, out_channels))# 全連接層部分net.add_module("fc", nn.Sequential(d2l.FlattenLayer(),nn.Linear(fc_features, fc_hidden_units),nn.ReLU(),nn.Dropout(0.5),nn.Linear(fc_hidden_units, fc_hidden_units),nn.ReLU(),nn.Dropout(0.5),nn.Linear(fc_hidden_units, 10)))return net

構(gòu)造一個高和寬均為224的單通道數(shù)據(jù)樣本來觀察每一層的輸出形狀。

net = vgg(conv_arch, fc_features, fc_hidden_units) X = torch.rand(1, 1, 224, 224)# named_children獲取一級子模塊及其名字(named_modules會返回所有子模塊,包括子模塊的子模塊) for name, blk in net.named_children(): X = blk(X)print(name, 'output shape: ', X.shape)

輸出：

vgg_block_1 output shape: torch.Size([1, 64, 112, 112]) vgg_block_2 output shape: torch.Size([1, 128, 56, 56]) vgg_block_3 output shape: torch.Size([1, 256, 28, 28]) vgg_block_4 output shape: torch.Size([1, 512, 14, 14]) vgg_block_5 output shape: torch.Size([1, 512, 7, 7]) fc output shape: torch.Size([1, 10])

每次輸入的高和寬減半，直到最終高和寬變成7后傳入全連接層。與此同時，輸出通道數(shù)每次翻倍，直到變成512。因為每個卷積層的窗口大小一樣，所以每層的模型參數(shù)尺寸和計算復(fù)雜度與輸入高、輸入寬、輸入通道數(shù)和輸出通道數(shù)的乘積成正比。VGG這種高和寬減半以及通道翻倍的設(shè)計使得多數(shù)卷積層都有相同的模型參數(shù)尺寸和計算復(fù)雜度。

獲取數(shù)據(jù)

ratio = 8 small_conv_arch = [(1, 1, 64//ratio), (1, 64//ratio, 128//ratio), (2, 128//ratio, 256//ratio), (2, 256//ratio, 512//ratio), (2, 512//ratio, 512//ratio)] net = vgg(small_conv_arch, fc_features // ratio, fc_hidden_units // ratio) print(net)

輸出：

Sequential((vgg_block_1): Sequential((0): Conv2d(1, 8, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))(1): ReLU()(2): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False))(vgg_block_2): Sequential((0): Conv2d(8, 16, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))(1): ReLU()(2): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False))(vgg_block_3): Sequential((0): Conv2d(16, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))(1): ReLU()(2): Conv2d(32, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))(3): ReLU()(4): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False))(vgg_block_4): Sequential((0): Conv2d(32, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))(1): ReLU()(2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))(3): ReLU()(4): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False))(vgg_block_5): Sequential((0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))(1): ReLU()(2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))(3): ReLU()(4): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False))(fc): Sequential((0): FlattenLayer()(1): Linear(in_features=3136, out_features=512, bias=True)(2): ReLU()(3): Dropout(p=0.5)(4): Linear(in_features=512, out_features=512, bias=True)(5): ReLU()(6): Dropout(p=0.5)(7): Linear(in_features=512, out_features=10, bias=True)) )

訓(xùn)練：

def load_data_fashion_mnist(batch_size, resize=None, root='~/Datasets/FashionMNIST'):"""Download the fashion mnist dataset and then load into memory."""trans = []if resize:trans.append(torchvision.transforms.Resize(size=resize))trans.append(torchvision.transforms.ToTensor())transform = torchvision.transforms.Compose(trans)mnist_train = torchvision.datasets.FashionMNIST(root=root, train=True, download=True, transform=transform)mnist_test = torchvision.datasets.FashionMNIST(root=root, train=False, download=True, transform=transform)if sys.platform.startswith('win'):num_workers = 0 # 0表示不用額外的進程來加速讀取數(shù)據(jù)else:num_workers = 4train_iter = torch.utils.data.DataLoader(mnist_train, batch_size=batch_size, shuffle=True, num_workers=num_workers)test_iter = torch.utils.data.DataLoader(mnist_test, batch_size=batch_size, shuffle=False, num_workers=num_workers)return train_iter, test_iter batch_size = 64 # 如出現(xiàn)“out of memory”的報錯信息，可減小batch_size或resize train_iter, test_iter = load_data_fashion_mnist(batch_size, resize=224)lr, num_epochs = 0.001, 5 optimizer = torch.optim.Adam(net.parameters(), lr=lr) d2l.train_ch5(net, train_iter, test_iter, batch_size, optimizer, device, num_epochs)

總結(jié)

以上是生活随笔為你收集整理的（pytorch-深度学习系列）使用重复元素的网络（VGG）的全部內(nèi)容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網(wǎng)站內(nèi)容還不錯，歡迎將生活随笔推薦給好友。