Group Abstract Group Abstract

Message Boards Message Boards

Breast Cancer Diagnosis

POSTED BY: Iman Nazari
3 Replies

enter image description here - Congratulations! This post is now a Staff Pick as distinguished on your profile! Thank you, keep it coming!

POSTED BY: EDITORIAL BOARD

You seem to be using rotations and reversals of the same images in training, validation, and test sets. Better practice is to separate so that all variants of a given image are in one set only. Better still, when the data set is sufficiently large, is to have all images derived from a given patient in one set only. See:

http://www.inf.ufpr.br/lesoliveira/download/IJCNN2016-BC.pdf

The data set on which that work is based may be found here:

https://web.inf.ufpr.br/vri/breast-cancer-database

POSTED BY: Daniel Lichtblau
Posted 8 years ago

I really liked your example. I just tried the same with AlexNet. https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf. Since AlexNet is easier to implement, I thought of giving it a try.

enter image description here

You can also use the pretrained NetModels (VGGNet-16 etc.). I modified the tensor dimensions of AlexNet, (especially reducing the LinearLayers did not cause much difference in accuracy) to reduce the number of parameters (in turn the memory requirements). This leads to better BatchSize and faster training. The AlexNet trained in approximately 6 min as compared to the lenet(30 min), with much less Cancer elements being mis-classified as Normal or Benign. So the Recall of the actual class is much higher (0.64) as compared to the LeNet (0.28) in my machine. enter image description here

================================================================================================== I revisited the problem again. If we are just worried about the Recall (i.e. getting the cancer classified in the bets possible manner), then we can just use data augmentation for the Cancer and Benign class.

enter image description here

This way I would have a better distribution of data in the three classes Normal, Cancer and Benign, as a result, I can get faster and better results even with LeNet:

enter image description here

POSTED BY: Test Account
Reply to this discussion
Community posts can be styled and formatted using the Markdown syntax.
Reply Preview
Attachments
Remove
or Discard