项目概述
在本教程中,我们将使用PyTorch框架构建一个完整的图像分类模型。你将学习从数据预处理、模型设计、训练到评估的完整流程。
环境准备
首先,确保你已经安装以下依赖:
pip install torch torchvision matplotlib numpy
1. 数据加载与预处理
我们使用CIFAR-10数据集,包含10个类别的60000张32×32彩色图像:
import torch
import torchvision
import torchvision.transforms as transforms
transform = transforms.Compose([
transforms.RandomHorizontalFlip(),
transforms.RandomCrop(32, padding=4),
transforms.ToTensor(),
transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))
])
trainset = torchvision.datasets.CIFAR10(
root='./data', train=True, download=True, transform=transform)
trainloader = torch.utils.data.DataLoader(
trainset, batch_size=64, shuffle=True)
2. 定义模型结构
我们构建一个简单的卷积神经网络:
import torch.nn as nn
import torch.nn.functional as F
class SimpleCNN(nn.Module):
def __init__(self):
super(SimpleCNN, self).__init__()
self.conv1 = nn.Conv2d(3, 32, 3)
self.conv2 = nn.Conv2d(32, 64, 3)
self.pool = nn.MaxPool2d(2, 2)
self.fc1 = nn.Linear(64 * 6 * 6, 256)
self.fc2 = nn.Linear(256, 10)
self.dropout = nn.Dropout(0.5)
def forward(self, x):
x = self.pool(F.relu(self.conv1(x)))
x = self.pool(F.relu(self.conv2(x)))
x = x.view(-1, 64 * 6 * 6)
x = F.relu(self.fc1(x))
x = self.dropout(x)
x = self.fc2(x)
return x
3. 训练模型
设置损失函数和优化器,开始训练:
model = SimpleCNN()
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
for epoch in range(10):
running_loss = 0.0
for images, labels in trainloader:
optimizer.zero_grad()
outputs = model(images)
loss = criterion(outputs, labels)
loss.backward()
optimizer.step()
running_loss += loss.item()
print(f'Epoch {epoch+1}, Loss: {running_loss/len(trainloader):.3f}')
4. 模型评估
在测试集上评估模型性能:
testset = torchvision.datasets.CIFAR10(
root='./data', train=False, download=True, transform=transform)
testloader = torch.utils.data.DataLoader(
testset, batch_size=64, shuffle=False)
correct = 0
total = 0
with torch.no_grad():
for images, labels in testloader:
outputs = model(images)
_, predicted = torch.max(outputs, 1)
total += labels.size(0)
correct += (predicted == labels).sum().item()
print(f'Accuracy: {100 * correct / total:.2f}%')
进阶优化
你可以尝试以下方法来提升模型性能:
- 使用更深的网络结构(如ResNet、DenseNet)
- 数据增强:旋转、缩放、颜色变换
- 学习率调度策略
- 使用预训练模型进行迁移学习
通过本实战教程,你已经掌握了使用PyTorch构建图像分类模型的完整流程。继续深入学习,你还可以探索目标检测、图像分割等更高级的计算机视觉任务!