The Cancer Genome Atlas (TCGA) has been one of the clearinghouses of genome-wide array data for understanding the molecular basis of cancer in large cohorts. These data are intrinsically bulk measurements of mixed cell types, derived from frozen biopsy sections that include tissues with mixed histopathology and/or microanatomy (e.g., tumor, stroma). While bulk array profiling may provide insights into molecular aberrations, it provides only an average genome-wide measurement for a biopsy and fails to reveal the inherent cellular composition and heterogeneity of a tumor. Histology sections, on the other hand, do not provide standardized measurements, but they are rich in content and continue to be the gold standard for the assessment of tissue neoplasm.

We are developing a platform to facilitate the management and analysis of data provided by the NCI’s TCGA project. The significance of this platform lies in its robustness and scalability in data processing. The potential results of this initiative are: (I) an efficient and effective platform for the representation and characterization of tumor histology, as well as its integrative analysis with clinical outcome; and (II) an atlas that identifies morphometric subtypes, responses to therapies, and molecular correlates. With such an atlas, any clinical sample can be cross-referenced against it for precision medicine and personalized therapy.

Visualizations of whole slide images (WSIs) and the computed nuclear architectures are available at the Berkeley Cancer Morphometric Data Portal.

Resource Releases

All resources, including data and source code related to this project, have been released on BMIHub for public use.