Journal Article
. 2019 Jul;46().
doi: 10.1016/j.ebiom.2019.07.046.

A 23 gene-based molecular prognostic score precisely predicts overall survival of breast cancer patients

Hideyuki Shimizu 1 Keiichi I Nakayama 2 
  • PMID: 31358476
  •     32 References
  •     8 citations


Background: Although many prognosis-predicting molecular scores for breast cancer have been developed, they are applicable to only limited disease subtypes. We aimed to develop a novel prognostic score that is applicable to a wider range of breast cancer patients.

Methods: We initially examined The Cancer Genome Atlas breast cancer cohort to identify potential prognosis-related genes. We then performed a meta-analysis of 36 international breast cancer cohorts to validate such genes. We trained artificial intelligence models (random forest and neural network) to predict prognosis precisely, and we finally validated our prediction with the log-rank test.

Findings: We identified a comprehensive list of 184 prognosis-related genes, most of which have been not extensively studied to date. We then established a universal molecular prognostic score (mPS) that relies on the expression status of only 23 of these genes. The mPS system is almost universally applicable to breast cancer patients (log-rank P < 0.05) in a manner independent of platform (microarray or RNA sequencing).

Interpretation: The mPS system is simple and cost-effective to apply and yet is able to reveal previously unrecognized heterogeneity among patient subpopulations in a platform-independent manner. The combination of mPS and clinical stage stratifies prognosis even more precisely and should prove of value for avoidance of overtreatment. In addition, the prognosis-related genes uncovered in this study are potential drug targets. FUND: This work was supported by KAKENHI grants from the Ministry of Education, Culture, Sports, Science, and Technology of Japan to H.S. (19K20403) and to K.I·N (18H05215).

Keywords: AI; Breast cancer; Personalized medicine; Prognosis; Scoring system.

Gene expression profiling predicts clinical outcome of breast cancer.
Laura J van 't Veer, Hongyue Dai, +13 authors, Stephen H Friend.
Nature, 2002 Feb 02; 415(6871). PMID: 11823860
Highly Cited.
Molecular prognostic factors for breast cancer metastasis and survival.
Francisco J Esteva, Aysegul A Sahin, +2 authors, Gabriel N Hortobagyi.
Semin Radiat Oncol, 2002 Oct 17; 12(4). PMID: 12382190
Robust estimators for expression analysis.
Earl Hubbell, Wei-Min Liu, Rui Mei.
Bioinformatics, 2002 Dec 20; 18(12). PMID: 12490442
Highly Cited.
A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer.
Soonmyung Paik, Steven Shak, +12 authors, Norman Wolmark.
N Engl J Med, 2004 Dec 14; 351(27). PMID: 15591335
Highly Cited.
Gene expression and benefit of chemotherapy in women with node-negative, estrogen receptor-positive breast cancer.
Soonmyung Paik, Gong Tang, +11 authors, Norman Wolmark.
J Clin Oncol, 2006 May 25; 24(23). PMID: 16720680
Highly Cited.
Distinct clinical and prognostic features of infiltrating lobular carcinoma of the breast: combined results of 15 International Breast Cancer Study Group clinical trials.
Bernhard C Pestalozzi, David Zahrieh, +13 authors, International Breast Cancer Study Group.
J Clin Oncol, 2008 May 07; 26(18). PMID: 18458044
bc-GenExMiner: an easy-to-use online platform for gene prognostic analyses in breast cancer.
Pascal Jézéquel, Mario Campone, +4 authors, Loïc Campion.
Breast Cancer Res Treat, 2011 Apr 01; 131(3). PMID: 21452023
Highly Cited.
FOXM1: From cancer initiation to progression and treatment.
Chuay-Yeng Koo, Kyle W Muir, Eric W-F Lam.
Biochim Biophys Acta, 2011 Oct 08; 1819(1). PMID: 21978825
Highly Cited. Review.
The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.
Christina Curtis, Sohrab P Shah, +29 authors, Samuel Aparicio.
Nature, 2012 Apr 24; 486(7403). PMID: 22522925    Free PMC article.
Highly Cited.
Sequence analysis of mutations and translocations across breast cancer subtypes.
Shantanu Banerji, Kristian Cibulskis, +44 authors, Matthew Meyerson.
Nature, 2012 Jun 23; 486(7403). PMID: 22722202    Free PMC article.
Highly Cited.
Comprehensive molecular portraits of human breast tumours.
Cancer Genome Atlas Network.
Nature, 2012 Sep 25; 490(7418). PMID: 23000897    Free PMC article.
Highly Cited.
Ki67 and proliferation in breast cancer.
Nirmala Pathmanathan, Rosemary L Balleine.
J Clin Pathol, 2013 Feb 26; 66(6). PMID: 23436927
Highly Cited. Review.
Nottingham Prognostic Index Plus (NPI+): a modern clinical decision making tool in breast cancer.
E A Rakha, D Soria, +6 authors, I O Ellis.
Br J Cancer, 2014 Mar 13; 110(7). PMID: 24619074    Free PMC article.
How many etiological subtypes of breast cancer: two, three, four, or more?
William F Anderson, Philip S Rosenberg, +2 authors, Mark E Sherman.
J Natl Cancer Inst, 2014 Aug 15; 106(8). PMID: 25118203    Free PMC article.
Highly Cited. Review.
Machine learning applications in cancer prognosis and prediction.
Konstantina Kourou, Themis P Exarchos, +2 authors, Dimitrios I Fotiadis.
Comput Struct Biotechnol J, 2015 Mar 10; 13. PMID: 25750696    Free PMC article.
Highly Cited. Review.
Gene-Expression-Based Predictors for Breast Cancer.
Arjun Gupta, Miriam Mutebi, Aditya Bardia.
Ann Surg Oncol, 2015 Jul 29; 22(11). PMID: 26215189
Prospective Validation of a 21-Gene Expression Assay in Breast Cancer.
Joseph A Sparano, Robert J Gray, +28 authors, George W Sledge.
N Engl J Med, 2015 Sep 29; 373(21). PMID: 26412349    Free PMC article.
Highly Cited.
Comprehensive Molecular Portraits of Invasive Lobular Breast Cancer.
Giovanni Ciriello, Michael L Gatza, +30 authors, Charles M Perou.
Cell, 2015 Oct 10; 163(2). PMID: 26451490    Free PMC article.
Highly Cited.
The somatic mutation profiles of 2,433 breast cancers refines their genomic and transcriptomic landscapes.
Bernard Pereira, Suet-Feung Chin, +31 authors, Carlos Caldas.
Nat Commun, 2016 May 11; 7. PMID: 27161491    Free PMC article.
Highly Cited.
SPAG5 as a prognostic biomarker and chemotherapy sensitivity predictor in breast cancer: a retrospective, integrated genomic, transcriptomic, and protein analysis.
Tarek M A Abdel-Fatah, Devika Agarwal, +12 authors, Stephen Y T Chan.
Lancet Oncol, 2016 Jun 18; 17(7). PMID: 27312051
A Time for MYC: Metabolism and Therapy.
Chi V Dang.
Cold Spring Harb Symp Quant Biol, 2017 Feb 09; 81. PMID: 28174256
Omicseq: a web-based search engine for exploring omics datasets.
Xiaobo Sun, William S Pittard, +5 authors, Zhaohui S Qin.
Nucleic Acids Res, 2017 Apr 13; 45(W1). PMID: 28402462    Free PMC article.
Prognostic value of FOXM1 in solid tumors: a systematic review and meta-analysis.
Lijun Li, Dang Wu, +2 authors, Pin Wu.
Oncotarget, 2017 Apr 22; 8(19). PMID: 28427178    Free PMC article.
Systematic Review.
Clinical utility of gene-expression signatures in early stage breast cancer.
Maryann Kwa, Andreas Makris, Francisco J Esteva.
Nat Rev Clin Oncol, 2017 Jun 01; 14(10). PMID: 28561071
Highly Cited. Review.
Evaluation of invasive breast cancer samples using a 12-chemokine gene expression score: correlation with clinical outcomes.
Sangeetha Prabhakaran, Victoria T Rizk, +6 authors, Hatem H Soliman.
Breast Cancer Res, 2017 Jun 21; 19(1). PMID: 28629479    Free PMC article.
Use of Biomarkers to Guide Decisions on Adjuvant Systemic Therapy for Women With Early-Stage Invasive Breast Cancer: American Society of Clinical Oncology Clinical Practice Guideline Focused Update.
Ian Krop, Nofisat Ismaila, +10 authors, Vered Stearns.
J Clin Oncol, 2017 Jul 12; 35(24). PMID: 28692382    Free PMC article.
Highly Cited.
Prognostic and predictive biomarkers in breast cancer: Past, present and future.
Andrea Nicolini, Paola Ferrari, Michael J Duffy.
Semin Cancer Biol, 2017 Sep 09; 52(Pt 1). PMID: 28882552
Highly Cited. Review.
Relation of tumor size, lymph node status, and survival in 24,740 breast cancer cases.
C L Carter, C Allen, D E Henson.
Cancer, 1989 Jan 01; 63(1). PMID: 2910416
Highly Cited.
Breast Cancer: Multiple Subtypes within a Tumor?
Syn Kok Yeo, Jun-Lin Guan.
Trends Cancer, 2017 Nov 10; 3(11). PMID: 29120751    Free PMC article.
Cancer statistics, 2018.
Rebecca L Siegel, Kimberly D Miller, Ahmedin Jemal.
CA Cancer J Clin, 2018 Jan 10; 68(1). PMID: 29313949
Highly Cited.
A Deep Neural Network Model using Random Forest to Extract Feature Representation for Gene Expression Data Classification.
Yunchuan Kong, Tianwei Yu.
Sci Rep, 2018 Nov 09; 8(1). PMID: 30405137    Free PMC article.
A prognostic index in primary breast cancer.
J L Haybittle, R W Blamey, +5 authors, K Griffiths.
Br J Cancer, 1982 Mar 01; 45(3). PMID: 7073932    Free PMC article.
Highly Cited.
The Application of Deep Learning in Cancer Prognosis Prediction.
Wan Zhu, Longxiang Xie, Jianye Han, Xiangqian Guo.
Cancers (Basel), 2020 Mar 11; 12(3). PMID: 32150991    Free PMC article.
Artificial intelligence in oncology.
Hideyuki Shimizu, Keiichi I Nakayama.
Cancer Sci, 2020 Mar 07; 111(5). PMID: 32133724    Free PMC article.
Identification of common and divergent gene expression signatures in patients with venous and arterial thrombosis using data from public repositories.
Bidossessi Wilfried Hounkpe, Rafaela de Oliveira Benatti, Benilton de Sá Carvalho, Erich Vinicius De Paula.
PLoS One, 2020 Aug 12; 15(8). PMID: 32780732    Free PMC article.
The Construction of Bone Metastasis-Specific Prognostic Model and Co-expressed Network of Alternative Splicing in Breast Cancer.
Runzhi Huang, Juanru Guo, +11 authors, Zongqiang Huang.
Front Cell Dev Biol, 2020 Sep 29; 8. PMID: 32984314    Free PMC article.
Development of a susceptibility gene based novel predictive model for the diagnosis of ulcerative colitis using random forest and artificial neural network.
Hanyang Li, Lijie Lai, Jun Shen.
Aging (Albany NY), 2020 Oct 26; 12(20). PMID: 33099536    Free PMC article.
Artificial intelligence-directed prognostication of breast cancer.
Azadeh Nasrazadani, Adam M Brufsky.
EBioMedicine, 2019 Aug 06; 46. PMID: 31378696    Free PMC article.
Modeling Heterogeneity of Triple-Negative Breast Cancer Uncovers a Novel Combinatorial Treatment Overcoming Primary Drug Resistance.
Fabienne Lamballe, Fahmida Ahmad, +14 authors, Flavio Maina.
Adv Sci (Weinh), 2021 Feb 09; 8(3). PMID: 33552868    Free PMC article.
A universal molecular prognostic score for gastrointestinal tumors.
Hideyuki Shimizu, Keiichi I Nakayama.
NPJ Genom Med, 2021 Feb 06; 6(1). PMID: 33542224    Free PMC article.