Semantic segmentation of human brain cortex with a U-Net-like architecture using Sony Neural Network Console

Training was performed on an NVIDIA Titan RTX graphics card with 24 GB of memory. With the presented dataset of ~2700 RGB images, each 320 by 320 pixels, training took about 2 minutes per epoch. It should be noted that the NNabla library uses CUDA and cuDNN very efficiently, with GPU utilization reaching 90-95%.
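To put these figures in perspective, the following is a minimal back-of-the-envelope sketch that converts them into throughput. The dataset size and epoch time are the approximate values quoted above. The epoch count of 100 is purely hypothetical and is used only to illustrate how total training time scales. The raw size estimate assumes uncompressed 8-bit RGB storage.

```python
# Rough throughput estimate from the figures quoted in the text.
NUM_IMAGES = 2700            # approximate dataset size (from the text)
IMAGE_SIDE = 320             # images are 320 x 320 pixels
CHANNELS = 3                 # RGB
SECONDS_PER_EPOCH = 2 * 60   # about 2 minutes per epoch (from the text)

# Hypothetical value, not taken from the article: used for illustration only.
HYPOTHETICAL_EPOCHS = 100


def images_per_second(num_images, seconds_per_epoch):
    """Average number of training images processed per second."""
    return num_images / seconds_per_epoch


throughput = images_per_second(NUM_IMAGES, SECONDS_PER_EPOCH)
pixels_per_second = throughput * IMAGE_SIDE * IMAGE_SIDE

# Raw dataset size, assuming uncompressed 8-bit values per channel.
raw_dataset_bytes = NUM_IMAGES * IMAGE_SIDE * IMAGE_SIDE * CHANNELS

# Total wall-clock time for the hypothetical number of epochs.
total_hours = HYPOTHETICAL_EPOCHS * SECONDS_PER_EPOCH / 3600

print(f"Throughput: {throughput:.1f} images/s")
print(f"Pixel rate: {pixels_per_second / 1e6:.2f} Mpixel/s")
print(f"Raw dataset size: {raw_dataset_bytes / 1e9:.2f} GB")
print(f"{HYPOTHETICAL_EPOCHS} epochs: {total_hours:.2f} h")
```

Under these assumptions the card processes roughly 22 images per second, and the raw dataset (under 1 GB) is small compared with the 24 GB of GPU memory.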

Below is an example of the learning curve:


  • Sony Corporation. Neural Network Console: Not just train and evaluate. You can design neural networks with fast and intuitive GUI.
  • Sony Corporation. Neural Network Libraries: An open source software to make research, development and implementation of neural network more efficient.
  • BatchNormalization – Ioffe and Szegedy, Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift.
  • Convolution – Chen et al., DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.
  • Yu et al., Multi-Scale Context Aggregation by Dilated Convolutions.
  • ReLU – Vinod Nair, Geoffrey E. Hinton. Rectified Linear Units Improve Restricted Boltzmann Machines.
  • Momentum – Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method.
  • U-Net: Convolutional Networks for Biomedical Image Segmentation, Olaf Ronneberger, Philipp Fischer, Thomas Brox.