Background Breast cancer is a heterogeneous disease at the clinical and molecular level. In this study we integrate classifications extracted from five different molecular levels in order to identify integrated subtypes. Methods Tumor tissue from 425 patients with primary breast cancer from the Oslo2 study was cut and blended, and divided into fractions for DNA, RNA and protein isolation and metabolomics, allowing the acquisition of representative and comparable molecular data. Patients were stratified into groups based on their tumor characteristics from five different molecular levels, using various clustering methods. Finally, all previously identified and newly determined subgroups were combined in a multilevel classification using a “cluster-of-clusters” approach with consensus clustering. Results Based on DNA copy number data, tumors were categorized into three groups according to the complex arm aberration index. mRNA expression profiles divided tumors into five molecular subgroups according to PAM50 subtyping, and clustering based on microRNA expression revealed four subgroups. Reverse-phase protein array data divided tumors into five subgroups. Hierarchical clustering of tumor metabolic profiles revealed three clusters. Combining DNA copy number and mRNA expression classified tumors into seven clusters based on pathway activity levels, and tumors were classified into ten subtypes using integrative clustering. The final consensus clustering that incorporated all aforementioned subtypes revealed six major groups. Five corresponded well with the mRNA subtypes, while a sixth group resulted from a split of the luminal A subtype; these tumors belonged to distinct microRNA clusters. Gain-of-function studies using MCF-7 cells showed that microRNAs differentially expressed between the luminal A clusters were important for cancer cell survival. These microRNAs were used to validate the split in luminal A tumors in four independent breast cancer cohorts. In two cohorts the microRNAs divided tumors into subgroups with significantly different outcomes, and in another a trend was observed. Conclusions The six integrated subtypes identified confirm the heterogeneity of breast cancer and show that finer subdivisions of subtypes are evident. Increasing knowledge of the heterogeneity of the luminal A subtype may add pivotal information to guide therapeutic choices, evidently bringing us closer to improved treatment for this largest subgroup of breast cancer.

In the microenvironment of a malignancy, tumor cells do not exist in isolation, but rather in a diverse ecosystem consisting not only of heterogeneous tumor-cell clones, but also normal cell types such as fibroblasts, vasculature, and an extensive pool of immune cells at numerous possible stages of activation and differentiation. This results in a complex interplay of diverse cellular signaling systems, where the immune cell component is now established to influence cancer progression and therapeutic response. It is experimentally difficult and laborious to comprehensively and systematically profile these distinct cell types from heterogeneous tumor samples in order to capitalize on potential therapeutic and biomarker discoveries. One emerging solution to address this challenge is to computationally extract cell-type specific information directly from bulk tumors. Such in silico approaches are advantageous because they can capture both the cell-type specific profiles and the tissue systems level of cell-cell interactions. Accurately and comprehensively predicting these patterns in tumors is an important challenge to overcome, not least given the success of immunotherapeutic drug treatment of several human cancers. This is especially challenging for subsets of closely related immune cell phenotypes with relatively small gene expression differences, which have critical functional distinctions. Here, we outline the existing and emerging novel bioinformatics strategies that can be used to profile the tumor immune landscape.

The tumor microenvironment is now widely recognized for its role in tumor progression, treatment response, and clinical outcome. The intratumoral immunological landscape, in particular, has been shown to exert both pro-tumorigenic and anti-tumorigenic effects. Identifying immunologically active or silent tumors may be an important indication for administration of therapy, and detecting early infiltration patterns may uncover factors that contribute to early risk. Thus far, direct detailed studies of the cell composition of tumor infiltration have been limited; with some studies giving approximate quantifications using immunohistochemistry and other small studies obtaining detailed measurements by isolating cells from excised tumors and sorting them using flow cytometry. Herein we utilize a machine learning based approach to identify lymphocyte markers with which we can quantify the presence of B cells, cytotoxic T-lymphocytes, T-helper 1, and T-helper 2 cells in any gene expression data set and apply it to studies of breast tissue. By leveraging over 2,100 samples from existing large scale studies, we are able to find an inherent cell heterogeneity in clinically characterized immune infiltrates, a strong link between estrogen receptor activity and infiltration in normal and tumor tissues, changes with genomic complexity, and identify characteristic differences in lymphocyte expression among molecular groupings. With our extendable methodology for capturing cell type specific signal we systematically studied immune infiltration in breast cancer, finding an inverse correlation between beneficial lymphocyte infiltration and estrogen receptor activity in normal breast tissue and reduced infiltration in estrogen receptor negative tumors with high genomic complexity.

Background<br/> There is increasing evidence that elevated body mass index (BMI) is associated with reduced survival for women with breast cancer. However, the underlying reasons remain unclear. We conducted a Mendelian randomization analysis to investigate a possible causal role of BMI in survival from breast cancer.<br/><br/> Methods<br/> We used individual-level data from six large breast cancer case-cohorts including a total of 36 210 individuals (2475 events) of European ancestry. We created a BMI genetic risk score (GRS) based on genotypes at 94 known BMI-associated genetic variants. Association between the BMI genetic score and breast cancer survival was analysed by Cox regression for each study separately. Study-specific hazard ratios were pooled using fixed-effect meta-analysis.<br/><br/> Results<br/> BMI genetic score was found to be associated with reduced breast cancer-specific survival for estrogen receptor (ER)-positive cases [hazard ratio (HR) = 1.11, per one-unit increment of GRS, 95% confidence interval (CI) 1.01–1.22, P = 0.03). We observed no association for ER-negative cases (HR = 1.00, per one-unit increment of GRS, 95% CI 0.89–1.13, P = 0.95).<br/><br/> Conclusions<br/> Our findings suggest a causal effect of increased BMI on reduced breast cancer survival for ER-positive breast cancer. There is no evidence of a causal effect of higher BMI on survival for ER-negative breast cancer cases.

Most non-BRCA1/2 breast cancer families have no identified genetic cause. We used linkage and haplotype analyses in familial and sporadic breast cancer cases to identify a susceptibility locus on chromosome 6q. Two independent genome-wide linkage analysis studies suggested a 3 Mb locus on chromosome 6q and two unrelated Swedish families with a LOD >2 together seemed to share a haplotype in 6q14.1. We hypothesized that this region harbored a rare high-risk founder allele contributing to breast cancer in these two families. Sequencing of DNA and RNA from the two families did not detect any pathogenic mutations. Finally, 29 SNPs in the region were analyzed in 44,214 cases and 43,532 controls from BCAC, and the original haplotypes in the two families were suggested as low-risk alleles for European and Swedish women specifically. There was also some support for one additional independent moderate-risk allele in Swedish familial samples. The results were consistent with our previous findings in familial breast cancer and supported a breast cancer susceptibility locus at 6q14.1 around the PHIP gene.

Breast cancer risk is influenced by rare coding variants in susceptibility genes, such as BRCA1, and many common, mostly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. Here we report the results of a genome-wide association study of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry1. We identified 65 new loci that are associated with overall breast cancer risk at P < 5 × 10−8. The majority of credible risk single-nucleotide polymorphisms in these loci fall in distal regulatory elements, and by integrating in silico data to predict target genes in breast cells at each locus, we demonstrate a strong overlap between candidate target genes and somatic driver genes in breast tumours. We also find that heritability of breast cancer due to all single-nucleotide polymorphisms in regulatory features was 2–5-fold enriched relative to the genome-wide average, with strong enrichment for particular transcription factor binding sites. These results provide further insight into genetic susceptibility to breast cancer and will improve the use of genetic risk scores for individualized screening and prevention.

Most common breast cancer susceptibility variants have been identified through genome-wide association studies (GWAS) of predominantly estrogen receptor (ER)-positive disease1. We conducted a GWAS using 21,468 ER-negative cases and 100,594 controls combined with 18,908 BRCA1 mutation carriers (9,414 with breast cancer), all of European origin. We identified independent associations at P < 5 × 10−8 with ten variants at nine new loci. At P < 0.05, we replicated associations with 10 of 11 variants previously reported in ER-negative disease or BRCA1 mutation carrier GWAS and observed consistent associations with ER-negative disease for 105 susceptibility variants identified by other studies. These 125 variants explain approximately 16% of the familial risk of this breast cancer subtype. There was high genetic correlation (0.72) between risk of ER-negative breast cancer and breast cancer risk for BRCA1 mutation carriers. These findings may lead to improved risk prediction and inform further fine-mapping and functional work to better understand the biological basis of ER-negative breast cancer.

Chronic inflammation promotes breast tumor growth and invasion by accelerating angiogenesis and tissue remodeling in the tumor microenvironment. There is a complex relationship between inflammation and estrogen, which drives the growth of 70 percent of breast tumors. While low levels of estrogen exposure stimulate macrophages and other inflammatory cell populations, very high levels are immune suppressive. Breast tumor incidence is increased by obesity and age, which interact to influence inflammatory cell populations in normal breast tissue. To characterize the impact of these factors on tumors and the tumor microenvironment, we measured gene expression in 195 breast adenocarcinomas and matched adjacent normal breast tissue samples collected at Akershus University Hospital (AHUS). Age and Body Mass Index (BMI) were independently associated with inflammation in adjacent normal tissue but not tumors. Estrogen Receptor (ER)-negative tumors had elevated macrophage expression compared with matched normal tissue, but ER-positive tumors showed an unexpected decrease in macrophage expression. We found an inverse relationship between the increase in tumor estrogen pathway expression compared with adjacent normal tissue and tumor macrophage score. We validated this finding in 126 breast tumor-normal pairs from the previously published METABRIC cohort. We developed a novel statistic, the Rewiring Coefficient, to quantify the rewiring of gene co-expression networks at the level of individual genes. Differential correlation analysis demonstrated distinct pathways were rewired during tumorigenesis. Our data support an immune suppressive effect of high doses of estrogen signaling in breast tumor microenvironment, suggesting that this effect contributes to the greater presence of prognostic and therapeutically relevant immune cells in ER-negative tumors.

We conducted a meta-analysis of three endometrial cancer genome-wide association studies (GWAS) and two follow-up phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five new risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1, near SIVA1). We also found a second independent 8q24.21 signal (rs17232730). Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r2 = 0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103[T] allele that is protective in endometrial cancer suppressed gene expression in vitro, suggesting that regulation of the expression of KLF5, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer.

Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90–0.94; P = 8.96 × 10−15)) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10−09, r2 = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10−11, r2 = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus.

We analyzed 3,872 common genetic variants across the ESR1 locus (encoding estrogen receptor α) in 118,816 subjects from three international consortia. We found evidence for at least five independent causal variants, each associated with different phenotype sets, including estrogen receptor (ER+ or ER−) and human ERBB2 (HER2+ or HER2−) tumor subtypes, mammographic density and tumor grade. The best candidate causal variants for ER− tumors lie in four separate enhancer elements, and their risk alleles reduce expression of ESR1, RMND1 and CCDC170, whereas the risk alleles of the strongest candidates for the remaining independent causal variant disrupt a silencer element and putatively increase ESR1 and RMND1 expression

Breast, ovarian, and prostate cancers are hormone-related and may have a shared genetic basis, but this has not been investigated systematically by genome-wide association (GWA) studies. Meta-analyses combining the largest GWA meta-analysis data sets for these cancers totaling 112,349 cases and 116,421 controls of European ancestry, all together and in pairs, identified at P < 10−8 seven new cross-cancer loci: three associated with susceptibility to all three cancers (rs17041869/2q13/BCL2L11; rs7937840/11q12/INCENP; rs1469713/19p13/GATAD2A), two breast and ovarian cancer risk loci (rs200182588/9q31/SMC2; rs8037137/15q26/RCCD1), and two breast and prostate cancer risk loci (rs5013329/1p34/NSUN4; rs9375701/6q23/L3MBTL3). Index variants in five additional regions previously associated with only one cancer also showed clear association with a second cancer type. Cell-type–specific expression quantitative trait locus and enhancer–gene interaction annotations suggested target genes with potential cross-cancer roles at the new loci. Pathway analysis revealed significant enrichment of death receptor signaling genes near loci with P < 10−5 in the three-cancer meta-analysis.

The presence of breast cancer in any first-degree female relative in general nearly doubles the risk for a proband and the risk gradually increases with the number of affected relatives. Current advances in molecular oncology and oncogenetics may enable the identification of high-risk individuals with breast-cancer predisposition. The best-known forms of hereditary breast cancer (HBC) are caused by mutations in the high-penetrance genes BRCA1 and BRCA2. Other genes, including PTEN, TP53, STK11/LKB1, CDH1, PALB2, CHEK2, ATM, MRE11, RAD50, NBS1, BRIP1, FANCA, FANCC, FANCM, RAD51, RAD51B, RAD51C, RAD51D, and XRCC2 have been described as high- or moderate-penetrance breast cancer-susceptibility genes. The majority of breast cancer-susceptibility genes code for tumor suppressor proteins that are involved in critical processes of DNA repair pathways. This is of particular importance for those women who, due to their increased risk of breast cancer, may be subjected to more frequent screening but due to their repair deficiency might be at the risk of developing radiation-induced malignancies. It has been proven that cancers arising from the most frequent BRCA1 gene mutation carriers differ significantly from the sporadic disease of age-matched controls in their histopathological appearances and molecular characteristics. The increased depth of mutation detection brought by next-generation sequencing and a better understanding of the mechanisms through which these mutations cause the disease will bring novel insights in terms of oncological prevention, diagnostics, and therapeutic options for HBC patients.

Background: The K3326X variant in BRCA2 ( BRCA2 *c.9976A>T; p.Lys3326*; rs11571833) has been found to be associated with small increased risks of breast cancer. However, it is not clear to what extent linkage disequilibrium with fully pathogenic mutations might account for this association. There is scant information about the effect of K3326X in other hormone-related cancers. Methods: Using weighted logistic regression, we analyzed data from the large iCOGS study including 76 637 cancer case patients and 83 796 control patients to estimate odds ratios (OR w ) and 95% confidence intervals (CIs) for K3326X variant carriers in relation to breast, ovarian, and prostate cancer risks, with weights defined as probability of not having a pathogenic BRCA2 variant. Using Cox proportional hazards modeling, we also examined the associations of K3326X with breast and ovarian cancer risks among 7183 BRCA1 variant carriers. All statistical tests were two-sided. Results: The K3326X variant was associated with breast (OR w = 1.28, 95% CI = 1.17 to 1.40, P = 5.9x10 -6 ) and invasive ovarian cancer (OR w = 1.26, 95% CI = 1.10 to 1.43, P = 3.8x10 -3 ). These associations were stronger for serous ovarian cancer and for estrogen receptor–negative breast cancer (OR w = 1.46, 95% CI = 1.2 to 1.70, P = 3.4x10 -5 and OR w = 1.50, 95% CI = 1.28 to 1.76, P = 4.1x10 -5 , respectively). For BRCA1 mutation carriers, there was a statistically significant inverse association of the K3326X variant with risk of ovarian cancer (HR = 0.43, 95% CI = 0.22 to 0.84, P = .013) but no association with breast cancer. No association with prostate cancer was observed. Conclusions: Our study provides evidence that the K3326X variant is associated with risk of developing breast and ovarian cancers independent of other pathogenic variants in BRCA2. Further studies are needed to determine the biological mechanism of action responsible for these associations.

NBS1, also known as NBN, plays an important role in maintaining genomic stability. Interestingly, rs2735383 G > C, located in a microRNA binding site in the 3′-untranslated region (UTR) of NBS1, was shown to be associated with increased susceptibility to lung and colorectal cancer. However, the relation between rs2735383 and susceptibility to breast cancer is not yet clear. Therefore, we genotyped rs2735383 in 1,170 familial non-BRCA1/2 breast cancer cases and 1,077 controls using PCR-based restriction fragment length polymorphism (RFLP-PCR) analysis, but found no association between rs2735383CC and breast cancer risk (OR = 1.214, 95% CI = 0.936–1.574, P = 0.144). Because we could not exclude a small effect size due to a limited sample size, we further analyzed imputed rs2735383 genotypes (r2 > 0.999) of 47,640 breast cancer cases and 46,656 controls from the Breast Cancer Association Consortium (BCAC). However, rs2735383CC was not associated with overall breast cancer risk in European (OR = 1.014, 95% CI = 0.969–1.060, P = 0.556) nor in Asian women (OR = 0.998, 95% CI = 0.905–1.100, P = 0.961). Subgroup analyses by age, age at menarche, age at menopause, menopausal status, number of pregnancies, breast feeding, family history and receptor status also did not reveal a significant association. This study therefore does not support the involvement of the genotype at NBS1 rs2735383 in breast cancer susceptibility.

Common variation on 14q24.1, close to RAD51B, has been associated with breast cancer: rs999737 and rs2588809 with the risk of female breast cancer and rs1314913 with the risk of male breast cancer. The aim of this study was to investigate the role of RAD51B variants in breast cancer predisposition, particularly in the context of familial breast cancer in Finland. We sequenced the coding region of RAD51B in 168 Finnish breast cancer patients from the Helsinki region for identification of possible recurrent founder mutations. In addition, we studied the known rs999737, rs2588809, and rs1314913 SNPs and RAD51B haplotypes in 44,791 breast cancer cases and 43,583 controls from 40 studies participating in the Breast Cancer Association Consortium (BCAC) that were genotyped on a custom chip (iCOGS). We identified one putatively pathogenic missense mutation c.541C>T among the Finnish cancer patients and subsequently genotyped the mutation in additional breast cancer cases (n = 5259) and population controls (n = 3586) from Finland and Belarus. No significant association with breast cancer risk was seen in the meta-analysis of the Finnish datasets or in the large BCAC dataset. The association with previously identified risk variants rs999737, rs2588809, and rs1314913 was replicated among all breast cancer cases and also among familial cases in the BCAC dataset. The most significant association was observed for the haplotype carrying the risk-alleles of all the three SNPs both among all cases (odds ratio (OR): 1.15, 95% confidence interval (CI): 1.11–1.19, P = 8.88 x 10−16) and among familial cases (OR: 1.24, 95% CI: 1.16–1.32, P = 6.19 x 10−11), compared to the haplotype with the respective protective alleles. Our results suggest that loss-of-function mutations in RAD51B are rare, but common variation at the RAD51B region is significantly associated with familial breast cancer risk.

Background Ductal carcinoma in situ (DCIS) is a non-invasive form of breast cancer. It is often associated with invasive ductal carcinoma (IDC), and is considered to be a non-obligate precursor of IDC. It is not clear to what extent these two forms of cancer share low-risk susceptibility loci, or whether there are differences in the strength of association for shared loci. Methods To identify genetic polymorphisms that predispose to DCIS, we pooled data from 38 studies comprising 5,067 cases of DCIS, 24,584 cases of IDC and 37,467 controls, all genotyped using the iCOGS chip. Results Most (67 %) of the 76 known breast cancer predisposition loci showed an association with DCIS in the same direction as previously reported for invasive breast cancer. Case-only analysis showed no evidence for differences between associations for IDC and DCIS after considering multiple testing. Analysis by estrogen receptor (ER) status confirmed that loci associated with ER positive IDC were also associated with ER positive DCIS. Analysis of DCIS by grade suggested that two independent SNPs at 11q13.3 near CCND1 were specific to low/intermediate grade DCIS (rs75915166, rs554219). These associations with grade remained after adjusting for ER status and were also found in IDC. We found no novel DCIS-specific loci at a genome wide significance level of P < 5.0x10-8. Conclusion In conclusion, this study provides the strongest evidence to date of a shared genetic susceptibility for IDC and DCIS. Studies with larger numbers of DCIS are needed to determine if IDC or DCIS specific loci exist.

Purpose CHEK2*1100delC is a well-established breast cancer risk variant that is most prevalent in European populations; however, there are limited data on risk of breast cancer by age and tumor subtype, which limits its usefulness in breast cancer risk prediction. We aimed to generate tumor subtype- and age-specific risk estimates by using data from the Breast Cancer Association Consortium, including 44,777 patients with breast cancer and 42,997 controls from 33 studies genotyped for CHEK2*1100delC. Patients and Methods CHEK2*1100delC genotyping was mostly done by a custom Taqman assay. Breast cancer odds ratios (ORs) for CHEK2*1100delC carriers versus noncarriers were estimated by using logistic regression and adjusted for study (categorical) and age. Main analyses included patients with invasive breast cancer from population- and hospital-based studies. Results Proportions of heterozygous CHEK2*1100delC carriers in controls, in patients with breast cancer from population- and hospital-based studies, and in patients with breast cancer from familial- and clinical genetics center–based studies were 0.5%, 1.3%, and 3.0%, respectively. The estimated OR for invasive breast cancer was 2.26 (95%CI, 1.90 to 2.69; P = 2.3 × 10−20). The OR was higher for estrogen receptor (ER)–positive disease (2.55 [95%CI, 2.10 to 3.10; P = 4.9 × 10−21]) than it was for ER-negative disease (1.32 [95%CI, 0.93 to 1.88; P = .12]; P interaction = 9.9 × 10−4). The OR significantly declined with attained age for breast cancer overall (P = .001) and for ER-positive tumors (P = .001). Estimated cumulative risks for development of ER-positive and ER-negative tumors by age 80 in CHEK2*1100delC carriers were 20% and 3%, respectively, compared with 9% and 2%, respectively, in the general population of the United Kingdom. Conclusion These CHEK2*1100delC breast cancer risk estimates provide a basis for incorporating CHEK2*1100delC into breast cancer risk prediction models and into guidelines for intensified screening and follow-up.

Background The rarity of mutations in PALB2, CHEK2 and ATM make it difficult to estimate precisely associated cancer risks. Population-based family studies have provided evidence that at least some of these mutations are associated with breast cancer risk as high as those associated with rare BRCA2 mutations. We aimed to estimate the relative risks associated with specific rare variants in PALB2, CHEK2 and ATM via a multicentre case-control study. Methods We genotyped 10 rare mutations using the custom iCOGS array: PALB2 c.1592delT, c.2816T>G and c.3113G>A, CHEK2 c.349A>G, c.538C>T, c.715G>A, c.1036C>T, c.1312G>T, and c.1343T>G and ATM c.7271T>G. We assessed associations with breast cancer risk (42 671 cases and 42 164 controls), as well as prostate (22 301 cases and 22 320 controls) and ovarian (14 542 cases and 23 491 controls) cancer risk, for each variant. Results For European women, strong evidence of association with breast cancer risk was observed for PALB2 c.1592delT OR 3.44 (95% CI 1.39 to 8.52, p=7.1×10−5), PALB2 c.3113G>A OR 4.21 (95% CI 1.84 to 9.60, p=6.9×10−8) and ATM c.7271T>G OR 11.0 (95% CI 1.42 to 85.7, p=0.0012). We also found evidence of association with breast cancer risk for three variants in CHEK2, c.349A>G OR 2.26 (95% CI 1.29 to 3.95), c.1036C>T OR 5.06 (95% CI 1.09 to 23.5) and c.538C>T OR 1.33 (95% CI 1.05 to 1.67) (p≤0.017). Evidence for prostate cancer risk was observed for CHEK2 c.1343T>G OR 3.03 (95% CI 1.53 to 6.03, p=0.0006) for African men and CHEK2 c.1312G>T OR 2.21 (95% CI 1.06 to 4.63, p=0.030) for European men. No evidence of association with ovarian cancer was found for any of these variants. Conclusions This report adds to accumulating evidence that at least some variants in these genes are associated with an increased risk of breast cancer that is clinically important.

Breast cancer is the most diagnosed malignancy and the second leading cause of cancer mortality in females. Previous association studies have identified variants on 2q35 associated with the risk of breast cancer. To identify functional susceptibility loci for breast cancer, we interrogated the 2q35 gene desert for chromatin architecture and functional variation correlated with gene expression. We report a novel intergenic breast cancer risk locus containing an enhancer copy number variation (enCNV; deletion) located approximately 400Kb upstream to IGFBP5, which overlaps an intergenic ERα-bound enhancer that loops to the IGFBP5 promoter. The enCNV is correlated with modified ERα binding and monoallelic-repression of IGFBP5 following oestrogen treatment. We investigated the association of enCNV genotype with breast cancer in 1,182 cases and 1,362 controls, and replicate our findings in an independent set of 62,533 cases and 60,966 controls from 41 case control studies and 11 GWAS. We report a dose-dependent inverse association of 2q35 enCNV genotype (percopy OR = 0.68 95%CI 0.55–0.83, P = 0.0002; replication OR = 0.77 95% CI 0.73-0.82, P = 2.1 × 10−19) and identify 13 additional linked variants (r2 > 0.8) in the 20Kb linkage block containing the enCNV (P = 3.2 × 10−15 − 5.6 × 10−17). These associations were independent of previously reported 2q35 variants, rs13387042/rs4442975 and rs16857609, and were stronger for ER-positive than ER-negative disease. Together, these results suggest that 2q35 breast cancer risk loci may be mediating their effect through IGFBP5.

may be because of germline genetic variation. We aimed to identify genetic markers associated with breast cancer–specific survival. Methods: We conducted a large meta-analysis of studies in populations of European ancestry, including 37 954 patients with 2900 deaths from breast cancer. Each study had been genotyped for between 200 000 and 900 000 single nucleotide polymorphisms (SNPs) across the genome; genotypes for nine million common variants were imputed using a common reference panel from the 1000 Genomes Project. We also carried out subtype-specific analyses based on 6881 estrogen receptor (ER)–negative patients (920 events) and 23 059 ER-positive patients (1333 events). All statistical tests were two-sided. Results: We identified one new locus (rs2059614 at 11q24.2) associated with survival in ER-negative breast cancer cases (hazard ratio [HR] = 1.95, 95% confidence interval [CI] = 1.55 to 2.47, P = 1.91 x 10–8). Genotyping a subset of 2113 case patients, of which 300 were ER negative, provided supporting evidence for the quality of the imputation. The association in this set of case patients was stronger for the observed genotypes than for the imputed genotypes. A second locus (rs148760487 at 2q24.2) was associated at genome-wide statistical significance in initial analyses; the association was similar in ER-positive and ER-negative case patients. Here the results of genotyping suggested that the finding was less robust. Conclusions: This is currently the largest study investigating genetic variation associated with breast cancer survival. Our results have potential clinical implications, as they confirm that germline genotype can provide prognostic information in addition to standard tumor prognostic factors.

Background Ductal carcinoma in situ (DCIS) of the breast is a precursor of invasive breast carcinoma. DNA methylation alterations are thought to be an early event in progression of cancer, and may prove valuable as a tool in clinical decision making and for understanding neoplastic development. Results We generate genome-wide DNA methylation profiles of 285 breast tissue samples representing progression of cancer, and validate methylation changes between normal and DCIS in an independent dataset of 15 normal and 40 DCIS samples. We also validate a prognostic signature on 583 breast cancer samples from The Cancer Genome Atlas. Our analysis reveals that DNA methylation profiles of DCIS are radically altered compared to normal breast tissue, involving more than 5,000 genes. Changes between DCIS and invasive breast carcinoma involve around 1,000 genes. In tumors, DNA methylation is associated with gene expression of almost 3,000 genes, including both negative and positive correlations. A prognostic signature based on methylation level of 18 CpGs is associated with survival of breast cancer patients with invasive tumors, as well as with survival of patients with DCIS and mixed lesions of DCIS and invasive breast carcinoma. Conclusions This work demonstrates that changes in the epigenome occur early in the neoplastic progression, provides evidence for the possible utilization of DNA methylation-based markers of progression in the clinic, and highlights the importance of epigenetic changes in carcinogenesis.