卷积神经网络 (CNN) 输出计算详解：代码示例与步骤

本文将详细解释一个简单的卷积神经网络 (CNN) 的输出是如何计算的，并提供 Python 代码示例。

代码示例：

class CNN(nn.Module):
    def __init__(self, in_channels, out_channels):
        super(CNN, self).__init__()
        self.conv1 = nn.Conv2d(in_channels, 16, kernel_size=3, stride=1, padding=1)
        self.pool = nn.MaxPool2d(kernel_size=2, stride=2, padding=0)
        self.conv2 = nn.Conv2d(16, out_channels, kernel_size=3, stride=1, padding=1)

    def forward(self, x):
        print (x.shape)
        x = F.relu(self.conv1(x))
        x = self.pool(x)
        x = F.relu(self.conv2(x))
        x = self.pool(x)
        return x

输出计算步骤：

输入 x 的形状为 (batch_size, in_channels, height, width)。
卷积层 conv1：
- 输入通道数为 in_channels，输出通道数为 16。
- 使用 3x3 的卷积核，步长为 1，填充为 1。
- 输出特征图形状为 (batch_size, 16, height, width)，其中 height 和 width 根据输入 x 的大小和填充大小计算得出。
池化层 pool：
- 使用 2x2 的池化核，步长为 2，不进行填充。
- 输出特征图形状为 (batch_size, 16, height/2, width/2)。
卷积层 conv2：
- 输入通道数为 16，输出通道数为 out_channels。
- 使用 3x3 的卷积核，步长为 1，填充为 1。
- 输出特征图形状为 (batch_size, out_channels, height/2, width/2)。
池化层 pool：
- 使用 2x2 的池化核，步长为 2，不进行填充。
- 输出特征图形状为 (batch_size, out_channels, height/4, width/4)。
最终输出： 返回池化层的输出特征图作为最终的输出。

注意： 在计算过程中，通过使用 ReLU 激活函数对卷积层的输出进行非线性变换。

结论：

通过分析每个卷积层和池化层的操作，以及 ReLU 激活函数的作用，我们可以清晰地理解 CNN 的工作原理，并计算出其最终的输出。