Background: Current prognostic gene signatures for breast cancer mainly reflect proliferation status and have limited value in triple-negative (TNBC) cancers. The identification of prognostic signatures from TNBC cohorts was limited in the past due to small sample sizes.
Methodology/Principal Findings: We assembled all currently publically available TNBC gene expression datasets generated on Affymetrix gene chips. Inter-laboratory variation was minimized by filtering methods for both samples and genes. Supervised analysis was performed to identify prognostic signatures from 394 cases which were subsequently tested on an independent validation cohort (n = 261 cases).
Conclusions/Significance: Using two distinct false discovery rate thresholds, 25% and <3.5%, a larger (n = 264 probesets) and a smaller (n = 26 probesets) prognostic gene sets were identified and used as prognostic predictors. Most of these genes were positively associated with poor prognosis and correlated to metagenes for inflammation and angiogenesis. No correlation to other previously published prognostic signatures (recurrence score, genomic grade index, 70-gene signature, wound response signature, 7-gene immune response module, stroma derived prognostic predictor, and a medullary like signature) was observed. In multivariate analyses in the validation cohort the two signatures showed hazard ratios of 4.03 (95% confidence interval [CI] 1.71–9.48; P = 0.001) and 4.08 (95% CI 1.79–9.28; P = 0.001), respectively. The 10-year event-free survival was 70% for the good risk and 20% for the high risk group. The 26-gene signatures had modest predictive value (AUC = 0.588) to predict response to neoadjuvant chemotherapy, however, the combination of a B-cell metagene with the prognostic signatures increased its response predictive value. We identified a 264-gene prognostic signature for TNBC which is unrelated to previously known prognostic signatures.

INTRODUCTION: Current prognostic gene expression profiles for breast cancer mainly reflect proliferation status and are most useful in ER-positive cancers. Triple negative breast cancers (TNBC) are clinically heterogeneous and prognostic markers and biology-based therapies are needed to better treat this disease.
METHODS: We assembled Affymetrix gene expression data for 579 TNBC and performed unsupervised analysis to define metagenes that distinguish molecular subsets within TNBC. We used n = 394 cases for discovery and n = 185 cases for validation. Sixteen metagenes emerged that identified basal-like, apocrine and claudin-low molecular subtypes, or reflected various non-neoplastic cell populations, including immune cells, blood, adipocytes, stroma, angiogenesis and inflammation within the cancer. The expressions of these metagenes were correlated with survival and multivariate analysis was performed, including routine clinical and pathological variables.
RESULTS: Seventy-three percent of TNBC displayed basal-like molecular subtype that correlated with high histological grade and younger age. Survival of basal-like TNBC was not different from non basal-like TNBC. High expression of immune cell metagenes was associated with good and high expression of inflammation and angiogenesis-related metagenes were associated with poor prognosis. A ratio of high B-cell and low IL-8 metagenes identified 32% of TNBC with good prognosis (hazard ratio (HR) 0.37, 95% CI 0.22 to 0.61; P < 0.001) and was the only significant predictor in multivariate analysis including routine clinicopathological variables.
CONCLUSIONS: We describe a ratio of high B-cell presence and low IL-8 activity as a powerful new prognostic marker for TNBC. Inhibition of the IL-8 pathway also represents an attractive novel therapeutic target for this disease.

Introduction Lymphocyte infiltration (LI) is often seen in breast cancer but its importance remains controversial. A positive correlation of human epidermal growth factor receptor 2 (HER2) amplification and LI has been described, which was associated with a more favorable outcome. However, specific lymphocytes might also promote tumor progression by shifting the cytokine milieu in the tumor. Methods Affymetrix HG-U133A microarray data of 1,781 primary breast cancer samples from 12 datasets were included. The correlation of immune system-related metagenes with different immune cells, clinical parameters, and survival was analyzed. Results A large cluster of nearly 600 genes with functions in immune cells was consistently obtained in all datasets. Seven robust metagenes from this cluster can act as surrogate markers for the amount of different immune cell types in the breast cancer sample. An IgG metagene as a marker for B cells had no significant prognostic value. In contrast, a strong positive prognostic value for the T-cell surrogate marker (lymphocyte-specific kinase (LCK) metagene) was observed among all estrogen receptor (ER)-negative tumors and those ER-positive tumors with a HER2 overexpression. Moreover ER-negative tumors with high expression of both IgG and LCK metagenes seem to respond better to neoadjuvant chemotherapy. Conclusions Precise definitions of the specific subtypes of immune cells in the tumor can be accomplished from microarray data. These surrogate markers define subgroups of tumors with different prognosis. Importantly, all known prognostic gene signatures uniformly assign poor prognosis to all ER-negative tumors. In contrast, the LCK metagene actually separates the ER-negative group into better or worse prognosis.