Breast Invasive Carcinoma Research Project

Genomics & Clinical Data Analysis | TCGA Dataset Study (BMEG 310)

Background & Overview

This study investigates the genetic and clinical characteristics of breast cancer, focusing on mutations within BRCA genes. Utilizing data from The Cancer Genome Atlas (TCGA), the research employs unsupervised learning to identify highly transcribed genes and to analyze factors affecting patient survival outcomes. Key insights are gained into pathways critical for tumor growth, such as cell cycle regulation and NK cell-mediated cytotoxicity.

1.0 Study Objectives

2.0 Methodology

2.1 Data Collection & Preprocessing

Three TCGA datasets (mutation, clinical, and RNA sequencing) were used. Preprocessing filtered for high-impact mutations. Analyses included:

Cbioportal
Img 1: cBioPortal mutation view

2.2 Clustering & Pathway Analysis

Patients were clustered into four groups by mutation profiles. Clusters were compared across demographics, clinical outcomes, and pathway activity.

3.0 Results

3.1 Clinical Summary

Survival outcomes
Img 4: Kaplan-Meier survival curves

3.2 Mutation Analysis

Frequent mutations occurred in PIK3CA, TP53, and TTN, with missense mutations most common. Oncoplots highlighted four mutation-based clusters.

Oncoplot
Img 5: Oncoplot of mutation clusters

3.3 Pathway Findings

4.0 Discussion

5.0 Limitations

Hierarchical clustering struggled with overlapping gene activity between subtypes. Future studies should refine clustering methods for clearer subtype resolution.

6.0 Conclusion

This study highlights genetic and pathway differences in breast invasive carcinoma. Findings suggest targeted therapies should consider subtype-specific pathway dysregulation.

📄 Download Full Report