Date: Wednesday, May 22
Start Time: 1:30 pm
End Time: 2:35 pm
In this presentation, we introduce the essential aspects of training convolutional neural networks (CNNs). We discuss the prerequisites for training, including models, data and training frameworks, with an emphasis on the characteristics of data needed for effective training. We explore the model training process using visuals to explain the error surface and gradient-based learning techniques. Our discussion covers key hyperparameters, loss functions and how to monitor the health of the training process. We also address the common training problems of overfitting and underfitting, and offer practical rules of thumb for mitigating these issues. Finally, we introduce popular training frameworks and provide resources for further learning.