Journal Article
2015 Nov; 63(7):1455-62.
doi: 10.1109/TBME.2015.2496264.

A Dataset for Breast Cancer Histopathological Image Classification

Fabio A Spanhol, Luiz S Oliveira, Caroline Petitjean, Laurent Heutte.
  • PMID: 26540668
  • 40 citations


Today, medical image analysis papers require solid experiments to prove the usefulness of proposed methods. However, experiments are often performed on data selected by the researchers, which may come from different institutions, scanners, and populations. Different evaluation measures may also be used, making it difficult to compare methods. In this paper, we introduce a dataset of 7909 breast cancer histopathology images acquired from 82 patients, which is now publicly available. The dataset includes both benign and malignant images. The task associated with this dataset is the automated classification of these images into two classes, which would be a valuable computer-aided diagnosis tool for the clinician. To assess the difficulty of this task, we show some preliminary results obtained with state-of-the-art image classification systems. The accuracy ranges from 80% to 85%, showing that room for improvement remains. By providing this dataset and a standardized evaluation protocol to the scientific community, we hope to bring together researchers in both the medical and machine learning fields to advance toward this clinical application.

Automatic prediction of tumour malignancy in breast cancer with fractal dimension.
Alan Chan, Jack A Tuszynski.
R Soc Open Sci, 2017 Jan 14; 3(12). PMID: 28083100    Free PMC article.
Breast Cancer Multi-classification from Histopathological Images with Structured Deep Learning Model.
Zhongyi Han, Benzheng Wei, +3 authors, Shuo Li.
Sci Rep, 2017 Jun 25; 7(1). PMID: 28646155    Free PMC article.
Grading of invasive breast carcinoma through Grassmannian VLAD encoding.
Kosmas Dimitropoulos, Panagiotis Barmpoutis, +3 authors, Nikos Grammalidis.
PLoS One, 2017 Sep 22; 12(9). PMID: 28934283    Free PMC article.
Regulation of dual specificity phosphatases in breast cancer during initial treatment with Herceptin: a Boolean model analysis.
Petronela Buiga, Ari Elson, Lydia Tabernero, Jean-Marc Schwartz.
BMC Syst Biol, 2018 Apr 20; 12(Suppl 1). PMID: 29671404    Free PMC article.
Histopathological Breast Cancer Image Classification by Deep Neural Network Techniques Guided by Local Clustering.
Abdullah-Al Nahid, Mohamad Ali Mehrabi, Yinan Kong.
Biomed Res Int, 2018 May 01; 2018. PMID: 29707566    Free PMC article.
Computer Aided Classification of Neuroblastoma Histological Images Using Scale Invariant Feature Transform with Feature Encoding.
Soheila Gheisari, Daniel R Catchpoole, +3 authors, Paul J Kennedy.
Diagnostics (Basel), 2018 Aug 30; 8(3). PMID: 30154334    Free PMC article.
Transfer learning based histopathologic image classification for breast cancer detection.
Erkan Deniz, Abdulkadir Şengür, +3 authors, Ümit Budak.
Health Inf Sci Syst, 2018 Oct 04; 6(1). PMID: 30279988    Free PMC article.
ARA: accurate, reliable and active histopathological image classification framework with Bayesian deep learning.
Łukasz Rączkowski, Marcin Możejko, Joanna Zambonelli, Ewa Szczurek.
Sci Rep, 2019 Oct 06; 9(1). PMID: 31586139    Free PMC article.
Breast cancer histopathology image classification through assembling multiple compact CNNs.
Chuang Zhu, Fangzhou Song, +3 authors, Jun Liu.
BMC Med Inform Decis Mak, 2019 Oct 24; 19(1). PMID: 31640686    Free PMC article.
Cancer diagnosis through a tandem of classifiers for digitized histopathological slides.
Daniel Lichtblau, Catalin Stoean.
PLoS One, 2019 Jan 17; 14(1). PMID: 30650087    Free PMC article.
Breast cancer histopathological image classification using convolutional neural networks with small SE-ResNet module.
Yun Jiang, Li Chen, Hai Zhang, Xiao Xiao.
PLoS One, 2019 Mar 30; 14(3). PMID: 30925170    Free PMC article.
Classification of breast cancer histopathological images using interleaved DenseNet with SENet (IDSNet).
Xia Li, Xi Shen, +2 authors, Tie-Qiang Li.
PLoS One, 2020 May 05; 15(5). PMID: 32365142    Free PMC article.
Breast Cancer Classification from Histopathological Images with Inception Recurrent Residual Convolutional Neural Network.
Md Zahangir Alom, Chris Yakopcic, +2 authors, Vijayan K Asari.
J Digit Imaging, 2019 Feb 14; 32(4). PMID: 30756265    Free PMC article.
Conventional Machine Learning and Deep Learning Approach for Multi-Classification of Breast Cancer Histopathology Images-a Comparative Insight.
Shallu Sharma, Rajesh Mehra.
J Digit Imaging, 2020 Jan 05; 33(3). PMID: 31900812    Free PMC article.
Fusion in Breast Cancer Histology Classification.
Juan Vizcarra, Ryan Place, +2 authors, May D Wang.
ACM BCB, 2020 Jul 09; 2019. PMID: 32637941    Free PMC article.
Artificial intelligence in digital breast pathology: Techniques and applications.
Asmaa Ibrahim, Paul Gamble, +4 authors, Emad A Rakha.
Breast, 2020 Jan 15; 49. PMID: 31935669    Free PMC article.
Histopathological Classification of Breast Cancer Images Using a Multi-Scale Input and Multi-Feature Network.
Taimoor Shakeel Sheikh, Yonghee Lee, Migyung Cho.
Cancers (Basel), 2020 Jul 30; 12(8). PMID: 32722111    Free PMC article.
Interactive thyroid whole slide image diagnostic system using deep representation.
Pingjun Chen, Xiaoshuang Shi, +3 authors, Paul D Gader.
Comput Methods Programs Biomed, 2020 Jul 08; 195. PMID: 32634647    Free PMC article.
Breast Cancer Histopathology Image Classification Using an Ensemble of Deep Learning Models.
Zabit Hameed, Sofia Zahia, +2 authors, Ana María Vanegas.
Sensors (Basel), 2020 Aug 09; 20(16). PMID: 32764398    Free PMC article.
Deep Learning Based Analysis of Histopathological Images of Breast Cancer.
Juanying Xie, Ran Liu, Joseph Luttrell, Chaoyang Zhang.
Front Genet, 2019 Mar 07; 10. PMID: 30838023    Free PMC article.
The transition module: a method for preventing overfitting in convolutional neural networks.
S Akbar, M Peikari, +2 authors, A L Martel.
Comput Methods Biomech Biomed Eng Imaging Vis, 2019 Jun 14; 7(3). PMID: 31192055    Free PMC article.
Spectral-Spatial Features Integrated Convolution Neural Network for Breast Cancer Classification.
Hiren K Mewada, Amit V Patel, +2 authors, Keyur Mahant.
Sensors (Basel), 2020 Aug 28; 20(17). PMID: 32842640    Free PMC article.
Optimization of Deep Learning Network Parameters Using Uniform Experimental Design for Breast Cancer Histopathological Image Classification.
Cheng-Jian Lin, Shiou-Yun Jeng.
Diagnostics (Basel), 2020 Sep 05; 10(9). PMID: 32882935    Free PMC article.
Computer-Aided Histopathological Image Analysis Techniques for Automated Nuclear Atypia Scoring of Breast Cancer: a Review.
Asha Das, Madhu S Nair, S David Peter.
J Digit Imaging, 2020 Jan 29; 33(5). PMID: 31989390    Free PMC article.
How much off-the-shelf knowledge is transferable from natural images to pathology images?
Xingyu Li, Konstantinos N Plataniotis.
PLoS One, 2020 Oct 15; 15(10). PMID: 33052964    Free PMC article.
Fine-Grained Breast Cancer Classification With Bilinear Convolutional Neural Networks (BCNNs).
Weihuang Liu, Mario Juhas, Yang Zhang.
Front Genet, 2020 Oct 27; 11. PMID: 33101377    Free PMC article.
Experimental Assessment of Color Deconvolution and Color Normalization for Automated Classification of Histology Images Stained with Hematoxylin and Eosin.
Francesco Bianconi, Jakob N Kather, Constantino Carlos Reyes-Aldasoro.
Cancers (Basel), 2020 Nov 15; 12(11). PMID: 33187299    Free PMC article.
Kinetic Modeling of DUSP Regulation in Herceptin-Resistant HER2-Positive Breast Cancer.
Petronela Buiga, Ari Elson, Lydia Tabernero, Jean-Marc Schwartz.
Genes (Basel), 2019 Jul 31; 10(8). PMID: 31357550    Free PMC article.
Deep neural network models for computational histopathology: A survey.
Chetan L Srinidhi, Ozan Ciga, Anne L Martel.
Med Image Anal, 2020 Oct 14; 67. PMID: 33049577    Free PMC article.
Super resolution microscopy and deep learning identify Zika virus reorganization of the endoplasmic reticulum.
Rory K M Long, Kathleen P Moriarty, +5 authors, Ivan R Nabi.
Sci Rep, 2020 Dec 03; 10(1). PMID: 33262363    Free PMC article.
Texture features in the Shearlet domain for histopathological image classification.
Sadiq Alinsaif, Jochen Lang.
BMC Med Inform Decis Mak, 2020 Dec 17; 20(Suppl 14). PMID: 33323118    Free PMC article.
Image Descriptors for Weakly Annotated Histopathological Breast Cancer Data.
Panagiotis Stanitsas, Anoop Cherian, +3 authors, Alexander Truskinovsky.
Front Digit Health, 2020 Dec 22; 2. PMID: 33345255    Free PMC article.
A Semisupervised Learning Scheme with Self-Paced Learning for Classifying Breast Cancer Histopathological Images.
Sarpong Kwadwo Asare, Fei You, Obed Tettey Nartey.
Comput Intell Neurosci, 2020 Dec 31; 2020. PMID: 33376479    Free PMC article.
Conventional Machine Learning versus Deep Learning for Magnification Dependent Histopathological Breast Cancer Image Classification: A Comparative Study with Visual Explanation.
Said Boumaraf, Xiabi Liu, +5 authors, Dalal Bardou.
Diagnostics (Basel), 2021 Apr 04; 11(3). PMID: 33809611    Free PMC article.
Novel Transfer Learning Approach for Medical Imaging with Limited Labeled Data.
Laith Alzubaidi, Muthana Al-Amidie, +6 authors, Ye Duan.
Cancers (Basel), 2021 Apr 04; 13(7). PMID: 33808207    Free PMC article.
A review and comparison of breast tumor cell nuclei segmentation performances using deep convolutional neural networks.
Andrew Lagree, Majidreza Mohebpour, +8 authors, William T Tran.
Sci Rep, 2021 Apr 15; 11(1). PMID: 33850222    Free PMC article.
Richer fusion network for breast cancer classification based on multimodal data.
Rui Yan, Fa Zhang, +8 authors, Jun Liang.
BMC Med Inform Decis Mak, 2021 Apr 24; 21(Suppl 1). PMID: 33888098    Free PMC article.
Fractal dimension analysis as an easy computational approach to improve breast cancer histopathological diagnosis.
Lucas Glaucio da Silva, Waleska Rayanne Sizinia da Silva Monteiro, +3 authors, Gustavo Torres de Souza.
Appl Microsc, 2021 May 01; 51(1). PMID: 33929635    Free PMC article.
Histo-CADx: duo cascaded fusion stages for breast cancer diagnosis from histopathological images.
Omneya Attallah, Fatma Anwar, Nagia M Ghanem, Mohamed A Ismail.
PeerJ Comput Sci, 2021 May 15; 7. PMID: 33987459    Free PMC article.
3E-Net: Entropy-Based Elastic Ensemble of Deep Convolutional Neural Networks for Grading of Invasive Breast Carcinoma Histopathological Microscopic Images.
Zakaria Senousy, Mohammed M Abdelsamea, Mona Mostafa Mohamed, Mohamed Medhat Gaber.
Entropy (Basel), 2021 Jun 03; 23(5). PMID: 34065765    Free PMC article.