Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns

OpenBU

Show simple item record

dc.contributor.author Alexe, G. en_US
dc.contributor.author Dalgin, G.S. en_US
dc.contributor.author Ramaswamy, R. en_US
dc.contributor.author DeLisi, C. en_US
dc.contributor.author Bhanot, G. en_US
dc.date.accessioned 2011-12-29T22:56:31Z
dc.date.available 2011-12-29T22:56:31Z
dc.date.issued 2007-2-19 en_US
dc.identifier.citation Alexe, G., G.S. Dalgin, R. Ramaswamy, C. DeLisi, G. Bhanot. "Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns" Cancer Informatics 2:243-274. (2007) en_US
dc.identifier.issn 1176-9351 en_US
dc.identifier.uri http://hdl.handle.net/2144/2625
dc.description.abstract Molecular stratification of disease based on expression levels of sets of genes can help guide therapeutic decisions if such classifications can be shown to be stable against variations in sample source and data perturbation. Classifications inferred from one set of samples in one lab should be able to consistently stratify a different set of samples in another lab. We present a method for assessing such stability and apply it to the breast cancer (BCA) datasets of Sorlie et al. 2003 and Ma et al. 2003. We find that within the now commonly accepted BCA categories identified by Sorlie et al. Luminal A and Basal are robust, but Luminal B and ERBB2+ are not. In particular, 36% of the samples identified as Luminal B and 55% identified as ERBB2+ cannot be assigned an accurate category because the classification is sensitive to data perturbation. We identify a "core cluster" of samples for each category, and from these we determine "patterns" of gene expression that distinguish the core clusters from each other. We find that the best markers for Luminal A and Basal are (ESR1, LIV1, GATA-3) and (CCNE1, LAD1, KRT5), respectively. Pathways enriched in the patterns regulate apoptosis, tissue remodeling and the immune response. We use a different dataset (Ma et al. 2003) to test the accuracy with which samples can be allocated to the four disease subtypes. We find, as expected, that the classification of samples identified as Luminal A and Basal is robust but classification into the other two subtypes is not. en_US
dc.description.sponsorship New Jersey Comission on Cancer Research (CCR-703054-03); Institute for Advanced Study through the David and Lucile Packard Foundation; Shelby White and Leon Levy Initiative Fund en_US
dc.language.iso en en_US
dc.publisher Libertas Academica en_US
dc.subject Breast cancer en_US
dc.subject Clusters en_US
dc.subject Patterns en_US
dc.subject Multi-gene Biomarkers en_US
dc.subject Diagnosis en_US
dc.title Data Perturbation Independent Diagnosis and Validation of Breast Cancer Subtypes Using Clustering and Patterns en_US
dc.type article en_US
dc.identifier.pubmedid 19458770 en_US
dc.identifier.pmcid 2675483 en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search OpenBU


Advanced Search

Browse

Deposit Materials