<b id="algme"><dfn id="algme"></dfn></b>

<u id="algme"></u>

<b id="algme"><dfn id="algme"></dfn></b>

溫馨提示×

溫馨提示×

您好，登錄后才能下訂單哦！

密碼登錄×

忘記密碼？

登錄注冊(cè)×

獲取短信驗(yàn)證碼

其他方式登錄

點(diǎn)擊登錄注冊(cè) 即表示同意《億速云用戶服務(wù)條款》

用戶登錄×

賬戶密碼登錄

請(qǐng)使用微信掃描上方二維碼

使用幫助

請(qǐng)求超時(shí)！

請(qǐng)點(diǎn)擊重新獲取二維碼

Pytorch之8層神經(jīng)網(wǎng)絡(luò)怎么實(shí)現(xiàn)Cifar-10圖像分類驗(yàn)證

發(fā)布時(shí)間：2023-03-25 14:04:57 來(lái)源：億速云閱讀：143 作者：iii 欄目：開(kāi)發(fā)技術(shù)

這篇文章主要講解了“Pytorch之8層神經(jīng)網(wǎng)絡(luò)怎么實(shí)現(xiàn)Cifar-10圖像分類驗(yàn)證”，文中的講解內(nèi)容簡(jiǎn)單清晰，易于學(xué)習(xí)與理解，下面請(qǐng)大家跟著小編的思路慢慢深入，一起來(lái)研究和學(xué)習(xí)“Pytorch之8層神經(jīng)網(wǎng)絡(luò)怎么實(shí)現(xiàn)Cifar-10圖像分類驗(yàn)證”吧！

實(shí)驗(yàn)環(huán)境

Pytorch 1.7.0
torchvision 0.8.2
Python 3.8
CUDA10.2 + cuDNN v7.6.5
Win10 + Pycharm
GTX1660, 6G

網(wǎng)絡(luò)結(jié)構(gòu)采用最簡(jiǎn)潔的類VGG結(jié)構(gòu)，即全部由3*3卷積和最大池化組成，后面接一個(gè)全連接層用于分類，網(wǎng)絡(luò)大小僅18M左右。

神經(jīng)網(wǎng)絡(luò)結(jié)構(gòu)圖

Pytorch之8層神經(jīng)網(wǎng)絡(luò)怎么實(shí)現(xiàn)Cifar-10圖像分類驗(yàn)證

Pytorch上搭建網(wǎng)絡(luò)

class Block(nn.Module):
    def __init__(self, inchannel, outchannel, res=True):
        super(Block, self).__init__()
        self.res = res     # 是否帶殘差連接
        self.left = nn.Sequential(
            nn.Conv2d(inchannel, outchannel, kernel_size=3, padding=1, bias=False),
            nn.BatchNorm2d(outchannel),
            nn.ReLU(inplace=True),
            nn.Conv2d(outchannel, outchannel, kernel_size=3, padding=1, bias=False),
            nn.BatchNorm2d(outchannel),
        )
        if stride != 1 or inchannel != outchannel:
            self.shortcut = nn.Sequential(
                nn.Conv2d(inchannel, outchannel, kernel_size=1, bias=False),
                nn.BatchNorm2d(outchannel),
            )
        else:
            self.shortcut = nn.Sequential()

        self.relu = nn.Sequential(
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        out = self.left(x)
        if self.res:
            out += self.shortcut(x)
        out = self.relu(out)
        return out


class myModel(nn.Module):
    def __init__(self, cfg=[64, 'M', 128,  'M', 256, 'M', 512, 'M'], res=True):
        super(myModel, self).__init__()
        self.res = res       # 是否帶殘差連接
        self.cfg = cfg       # 配置列表
        self.inchannel = 3   # 初始輸入通道數(shù)
        self.futures = self.make_layer()
        # 構(gòu)建卷積層之后的全連接層以及分類器：
        self.classifier = nn.Sequential(nn.Dropout(0.4),            # 兩層fc效果還差一些
                                        nn.Linear(4 * 512, 10), )   # fc，最終Cifar10輸出是10類

    def make_layer(self):
        layers = []
        for v in self.cfg:
            if v == 'M':
                layers.append(nn.MaxPool2d(kernel_size=2, stride=2))
            else:
                layers.append(Block(self.inchannel, v, self.res))
                self.inchannel = v    # 輸入通道數(shù)改為上一層的輸出通道數(shù)
        return nn.Sequential(*layers)

    def forward(self, x):
        out = self.futures(x)
        # view(out.size(0), -1): change tensor size from (N ,H , W) to (N, H*W)
        out = out.view(out.size(0), -1)
        out = self.classifier(out)
        return out

該網(wǎng)絡(luò)可以很方便的改造成帶殘差的，只要在初始化網(wǎng)絡(luò)時(shí)，將參數(shù)res設(shè)為T(mén)rue即可，并可改變cfg配置列表來(lái)方便的修改網(wǎng)絡(luò)層數(shù)。

Pytorch上訓(xùn)練

所選數(shù)據(jù)集為Cifar-10，該數(shù)據(jù)集共有60000張帶標(biāo)簽的彩色圖像，這些圖像尺寸32*32，分為10個(gè)類，每類6000張圖。這里面有50000張用于訓(xùn)練，每個(gè)類5000張，另外10000用于測(cè)試，每個(gè)類1000張。
訓(xùn)練策略如下：

1.優(yōu)化器

momentum=0.9 的 optim.SGD，adam在很多情況下能加速收斂，但因?yàn)槭亲赃m應(yīng)學(xué)習(xí)率，在訓(xùn)練后期存在不能收斂到全局極值點(diǎn)的問(wèn)題，所以采用能手動(dòng)調(diào)節(jié)學(xué)習(xí)率的SGD，現(xiàn)在很多比賽和論文中也是采用該策略。設(shè)置weight_decay=5e-3，即設(shè)置較大的L2正則來(lái)降低過(guò)擬合。

# 定義損失函數(shù)和優(yōu)化器
loss_func = nn.CrossEntropyLoss()
optimizer = optim.SGD(model.parameters(), lr=LR, momentum=0.9, weight_decay=5e-3)

2.學(xué)習(xí)率

optim.lr_scheduler.MultiStepLR，參數(shù)設(shè)為：milestones=[int(num_epochs * 0.56), int(num_epochs * 0.78)], gamma=0.1，即在0.56倍epochs和0.78時(shí)分別下降為前一階段學(xué)習(xí)率的0.1倍。

# 學(xué)習(xí)率調(diào)整策略 MultiStep：
scheduler = optim.lr_scheduler.MultiStepLR(optimizer=optimizer,
                   milestones=[int(num_epochs * 0.56), int(num_epochs * 0.78)],
                   gamma=0.1, last_epoch=-1)

在每個(gè)epoch訓(xùn)練完的時(shí)候一定要記得step一下，不然不會(huì)更新學(xué)習(xí)率，可以通過(guò)get_last_lr()來(lái)查看最新的學(xué)習(xí)率

# 更新學(xué)習(xí)率并查看當(dāng)前學(xué)習(xí)率
scheduler.step()
print('\t last_lr:', scheduler.get_last_lr())

3.數(shù)據(jù)策略

實(shí)驗(yàn)表明，針對(duì)cifar10數(shù)據(jù)集，隨機(jī)水平翻轉(zhuǎn)、隨機(jī)遮擋、隨機(jī)中心裁剪能有效提高驗(yàn)證集準(zhǔn)確率，而旋轉(zhuǎn)、顏色抖動(dòng)等則無(wú)效。

     norm_mean = [0.485, 0.456, 0.406]      # 均值
     norm_std = [0.229, 0.224, 0.225]       # 方差      
     transforms.Normalize(norm_mean, norm_std),                    #將[0,1]歸一化到[-1,1]
     transforms.RandomHorizontalFlip(),                            # 隨機(jī)水平鏡像
     transforms.RandomErasing(scale=(0.04, 0.2), ratio=(0.5, 2)),  # 隨機(jī)遮擋
     transforms.RandomCrop(32, padding=4)                          # 隨機(jī)中心裁剪

4.超參數(shù)

batch_size = 512     # 約占用顯存4G
num_epochs = 200     # 訓(xùn)練輪數(shù)
LR = 0.01            # 初始學(xué)習(xí)率

實(shí)驗(yàn)結(jié)果：best_acc= 94.71%

Pytorch之8層神經(jīng)網(wǎng)絡(luò)怎么實(shí)現(xiàn)Cifar-10圖像分類驗(yàn)證

另外，將網(wǎng)絡(luò)改成14層的帶殘差結(jié)構(gòu)后，準(zhǔn)確率上升到了95.56%，但是網(wǎng)絡(luò)大小也從18M到了43M。

以下是14層殘差網(wǎng)絡(luò)的全部代碼，8層的只需修改cfg和初始化時(shí)的res參數(shù)：

cfg=[64, ‘M’, 128, 128, ‘M’, 256, 256, ‘M’, 512, 512,‘M’] 修改為 [64, ‘M’, 128, ‘M’, 256, ‘M’, 512, ‘M’]

# *_* coding : UTF-8 *_*
# 開(kāi)發(fā)人員： csu·pan-_-||
# 開(kāi)發(fā)時(shí)間： 2020/12/29 15:17
# 文件名稱： battey_class.py
# 開(kāi)發(fā)工具： PyCharm
# 功能描述： 自建CNN對(duì)cifar10進(jìn)行分類

import torch
from torchvision import datasets, transforms
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader
import onnx
import time
import numpy as np
import matplotlib.pyplot as plt


class Block(nn.Module):
    def __init__(self, inchannel, outchannel, res=True, stride=1):
        super(Block, self).__init__()
        self.res = res     # 是否帶殘差連接
        self.left = nn.Sequential(
            nn.Conv2d(inchannel, outchannel, kernel_size=3, padding=1, stride=stride, bias=False),
            nn.BatchNorm2d(outchannel),
            nn.ReLU(inplace=True),
            nn.Conv2d(outchannel, outchannel, kernel_size=3, padding=1, stride=1, bias=False),
            nn.BatchNorm2d(outchannel),
        )
        if stride != 1 or inchannel != outchannel:
            self.shortcut = nn.Sequential(
                nn.Conv2d(inchannel, outchannel, kernel_size=1, bias=False),
                nn.BatchNorm2d(outchannel),
            )
        else:
            self.shortcut = nn.Sequential()

        self.relu = nn.Sequential(
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        out = self.left(x)
        if self.res:
            out += self.shortcut(x)
        out = self.relu(out)
        return out


class myModel(nn.Module):
    def __init__(self, cfg=[64, 'M', 128, 128, 'M', 256, 256, 'M', 512, 512,'M'], res=True):
        super(myModel, self).__init__()
        self.res = res       # 是否帶殘差連接
        self.cfg = cfg       # 配置列表
        self.inchannel = 3   # 初始輸入通道數(shù)
        self.futures = self.make_layer()
        # 構(gòu)建卷積層之后的全連接層以及分類器：
        self.classifier = nn.Sequential(nn.Dropout(0.4),           # 兩層fc效果還差一些
                                        nn.Linear(4 * 512, 10), )   # fc，最終Cifar10輸出是10類

    def make_layer(self):
        layers = []
        for v in self.cfg:
            if v == 'M':
                layers.append(nn.MaxPool2d(kernel_size=2, stride=2))
            else:
                layers.append(Block(self.inchannel, v, self.res))
                self.inchannel = v    # 輸入通道數(shù)改為上一層的輸出通道數(shù)
        return nn.Sequential(*layers)

    def forward(self, x):
        out = self.futures(x)
        # view(out.size(0), -1): change tensor size from (N ,H , W) to (N, H*W)
        out = out.view(out.size(0), -1)
        out = self.classifier(out)
        return out

all_start = time.time()
# 使用torchvision可以很方便地下載Cifar10數(shù)據(jù)集，而torchvision下載的數(shù)據(jù)集為[0,1]的PILImage格式
# 我們需要將張量Tensor歸一化到[-1,1]
norm_mean = [0.485, 0.456, 0.406]  # 均值
norm_std = [0.229, 0.224, 0.225]  # 方差
transform_train = transforms.Compose([transforms.ToTensor(),  # 將PILImage轉(zhuǎn)換為張量
                                      # 將[0,1]歸一化到[-1,1]
                                      transforms.Normalize(norm_mean, norm_std),
                                      transforms.RandomHorizontalFlip(),  # 隨機(jī)水平鏡像
                                      transforms.RandomErasing(scale=(0.04, 0.2), ratio=(0.5, 2)),  # 隨機(jī)遮擋
                                      transforms.RandomCrop(32, padding=4)  # 隨機(jī)中心裁剪
                                      ])

transform_test = transforms.Compose([transforms.ToTensor(),
                                     transforms.Normalize(norm_mean, norm_std)])

# 超參數(shù)：
batch_size = 256
num_epochs = 200   # 訓(xùn)練輪數(shù)
LR = 0.01          # 初始學(xué)習(xí)率

# 選擇數(shù)據(jù)集:
trainset = datasets.CIFAR10(root='Datasets', train=True, download=True, transform=transform_train)
testset = datasets.CIFAR10(root='Datasets', train=False, download=True, transform=transform_test)
# 加載數(shù)據(jù):
train_data = DataLoader(dataset=trainset, batch_size=batch_size, shuffle=True)
valid_data = DataLoader(dataset=testset, batch_size=batch_size, shuffle=False)
cifar10_classes = ('plane', 'car', 'bird', 'cat', 'deer', 'dog', 'frog', 'horse', 'ship', 'truck')

train_data_size = len(trainset)
valid_data_size = len(testset)

print('train_size: {:4d}  valid_size:{:4d}'.format(train_data_size, valid_data_size))

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

model = myModel(res=True)

# 定義損失函數(shù)和優(yōu)化器
loss_func = nn.CrossEntropyLoss()
optimizer = optim.SGD(model.parameters(), lr=LR, momentum=0.9, weight_decay=5e-3)

# 學(xué)習(xí)率調(diào)整策略 MultiStep：
scheduler = optim.lr_scheduler.MultiStepLR(optimizer=optimizer,
                                           milestones=[int(num_epochs * 0.56), int(num_epochs * 0.78)],
                                           gamma=0.1, last_epoch=-1)

# 訓(xùn)練和驗(yàn)證：
def train_and_valid(model, loss_function, optimizer, epochs=10):
    model.to(device)
    history = []
    best_acc = 0.0
    best_epoch = 0

    for epoch in range(epochs):
        epoch_start = time.time()
        print("Epoch: {}/{}".format(epoch + 1, epochs))

        model.train()

        train_loss = 0.0
        train_acc = 0.0
        valid_loss = 0.0
        valid_acc = 0.0

        for i, (inputs, labels) in enumerate(train_data):
            inputs = inputs.to(device)
            labels = labels.to(device)

            # 因?yàn)檫@里梯度是累加的，所以每次記得清零
            optimizer.zero_grad()

            outputs = model(inputs)

            loss = loss_function(outputs, labels)

            loss.backward()

            optimizer.step()

            train_loss += loss.item() * inputs.size(0)

            ret, predictions = torch.max(outputs.data, 1)
            correct_counts = predictions.eq(labels.data.view_as(predictions))

            acc = torch.mean(correct_counts.type(torch.FloatTensor))

            train_acc += acc.item() * inputs.size(0)

        with torch.no_grad():
            model.eval()

            for j, (inputs, labels) in enumerate(valid_data):
                inputs = inputs.to(device)
                labels = labels.to(device)

                outputs = model(inputs)

                loss = loss_function(outputs, labels)

                valid_loss += loss.item() * inputs.size(0)

                ret, predictions = torch.max(outputs.data, 1)
                correct_counts = predictions.eq(labels.data.view_as(predictions))

                acc = torch.mean(correct_counts.type(torch.FloatTensor))

                valid_acc += acc.item() * inputs.size(0)
        # 更新學(xué)習(xí)率并查看當(dāng)前學(xué)習(xí)率
        scheduler.step()
        print('\t last_lr:', scheduler.get_last_lr())

        avg_train_loss = train_loss / train_data_size
        avg_train_acc = train_acc / train_data_size

        avg_valid_loss = valid_loss / valid_data_size
        avg_valid_acc = valid_acc / valid_data_size

        history.append([avg_train_loss, avg_valid_loss, avg_train_acc, avg_valid_acc])

        if best_acc < avg_valid_acc:
            best_acc = avg_valid_acc
            best_epoch = epoch + 1

        epoch_end = time.time()

        print(
            "\t Training: Loss: {:.4f}, Accuracy: {:.4f}%, "
            "\n\t Validation: Loss: {:.4f}, Accuracy: {:.4f}%, Time: {:.3f}s".format(
                avg_train_loss, avg_train_acc * 100, avg_valid_loss, avg_valid_acc * 100,
                                epoch_end - epoch_start
            ))
        print("\t Best Accuracy for validation : {:.4f} at epoch {:03d}".format(best_acc, best_epoch))

        torch.save(model, '%s/' % 'cifar10_my' + '%02d' % (epoch + 1) + '.pt')  # 保存模型

        # # 存儲(chǔ)模型為onnx格式：
        # d_cuda = torch.rand(1, 3, 32, 32, dtype=torch.float).to(device='cuda')
        # onnx_path = '%s/' % 'cifar10_shuffle' + '%02d' % (epoch + 1) + '.onnx'
        # torch.onnx.export(model.to('cuda'), d_cuda, onnx_path)
        # shape_path = '%s/' % 'cifar10_shuffle' + '%02d' % (epoch + 1) + '_shape.onnx'
        # onnx.save(onnx.shape_inference.infer_shapes(onnx.load(onnx_path)), shape_path)
        # print('\t export shape success...')

    return model, history


trained_model, history = train_and_valid(model, loss_func, optimizer, num_epochs)

history = np.array(history)
# Loss曲線
plt.figure(figsize=(10, 10))
plt.plot(history[:, 0:2])
plt.legend(['Tr Loss', 'Val Loss'])
plt.xlabel('Epoch Number')
plt.ylabel('Loss')
# 設(shè)置坐標(biāo)軸刻度
plt.xticks(np.arange(0, num_epochs + 1, step=10))
plt.yticks(np.arange(0, 2.05, 0.1))
plt.grid()  # 畫(huà)出網(wǎng)格
plt.savefig('cifar10_shuffle_' + '_loss_curve1.png')

# 精度曲線
plt.figure(figsize=(10, 10))
plt.plot(history[:, 2:4])
plt.legend(['Tr Accuracy', 'Val Accuracy'])
plt.xlabel('Epoch Number')
plt.ylabel('Accuracy')
# 設(shè)置坐標(biāo)軸刻度
plt.xticks(np.arange(0, num_epochs + 1, step=10))
plt.yticks(np.arange(0, 1.05, 0.05))
plt.grid()  # 畫(huà)出網(wǎng)格
plt.savefig('cifar10_shuffle_' + '_accuracy_curve1.png')

all_end = time.time()
all_time = round(all_end - all_start)
print('all time: ', all_time, ' 秒')
print("All Time: {:d} 分 {:d} 秒".format(all_time // 60, all_time % 60))

感謝各位的閱讀，以上就是“Pytorch之8層神經(jīng)網(wǎng)絡(luò)怎么實(shí)現(xiàn)Cifar-10圖像分類驗(yàn)證”的內(nèi)容了，經(jīng)過(guò)本文的學(xué)習(xí)后，相信大家對(duì)Pytorch之8層神經(jīng)網(wǎng)絡(luò)怎么實(shí)現(xiàn)Cifar-10圖像分類驗(yàn)證這一問(wèn)題有了更深刻的體會(huì)，具體使用情況還需要大家實(shí)踐驗(yàn)證。這里是億速云，小編將為大家推送更多相關(guān)知識(shí)點(diǎn)的文章，歡迎關(guān)注！

向AI問(wèn)一下細(xì)節(jié)

推薦閱讀：

免責(zé)聲明：本站發(fā)布的內(nèi)容（圖片、視頻和文字）以原創(chuàng)、轉(zhuǎn)載和分享為主，文章觀點(diǎn)不代表本網(wǎng)站立場(chǎng)，如果涉及侵權(quán)請(qǐng)聯(lián)系站長(zhǎng)郵箱：is@yisu.com進(jìn)行舉報(bào)，并提供相關(guān)證據(jù)，一經(jīng)查實(shí)，將立刻刪除涉嫌侵權(quán)內(nèi)容。

上一篇新聞：
PPOCRLabel標(biāo)注的txt格式怎么轉(zhuǎn)換成labelme能修改的json格式
下一篇新聞：
Vue狀態(tài)管理庫(kù)Vuex如何使用

猜你喜歡

AI
助
手

產(chǎn)品服務(wù)

地區(qū)劃分

專題活動(dòng)

幫助支持

關(guān)于我們

售后咨詢

7*24小時(shí)在線電話：400-100-2938

7*24小時(shí)在線 QQ：800811969

關(guān)注億速云

億速云公眾號(hào)

手機(jī)網(wǎng)站二維碼

<sub id="phcsn"><em id="phcsn"><center id="phcsn"></center></em></sub>

<button id="phcsn"><em id="phcsn"><tt id="phcsn"></tt></em></button>

<table id="phcsn"><blockquote id="phcsn"></blockquote></table>
<cite id="phcsn"><progress id="phcsn"></progress></cite>

<samp id="phcsn"><del id="phcsn"></del></samp>