CNN Series: AlexNet
Paper
AlexNet
- AlexNet is a convolutional neural network that won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2012. It was designed to recognize objects and faces from over a million different categories, making it one of the most successful applications of deep learning.
- ReLU as its activation function was used, which addresses issue of vanishing gradient problem that is common with sigmoid or tanh.
- DropOut was used to randomly ignore neurons during training to reduce overfitting.
- In Convloutional Layers of Alexnet, overlapping max pooling, as sub-sampling layer, was used to reduce the spatial size (width and height) of the representation at each stage.
- Deep Architecture: It has 8 layers in total: 5 convolutional layers and 3 fully connected layers.
- Designed to utilize GPUs, which significantly sped up the training process.
- AlexNet used data augmentation techniques such as image translations, horizontal reflections, and patch extractions to combat overfitting.
Written on
April
7th,
2024
by
Amar P
Feel free to share!