Cancer Prevention Institute of California, Fremont, California.Department of Health Research and Policy (Epidemiology) and Stanford Cancer Institute, Stanford University School of Medicine, Stanford, California.

Chronic Disease Research Centre, Tropical Medicine Research Institute, University of the West Indies, Bridgetown, Barbados.Department of Preventive Medicine, State University of New York at Stony Brook, Stony Brook, New York.

Abstract

Background: Genome-wide association studies have identified approximately 100 common genetic variants associated with breast cancer risk, the majority of which were discovered in women of European ancestry. Because of different patterns of linkage disequilibrium, many of these genetic markers may not represent signals in populations of African ancestry.

Results: Fifty-four of the 74 variants (73%) were found to have ORs that were directionally consistent with those previously reported, of which 12 were nominally statistically significant (P < 0.05). Through fine-mapping, in six regions (3p24, 12p11, 14q13, 16q12/FTO, 16q23, 19p13), we observed seven markers that better represent the underlying risk variant for overall breast cancer or breast cancer subtypes, whereas in another two regions (11q13, 16q12/TOX3), we identified suggestive evidence of signals that are independent of the reported index variant. Overlapping chromatin features and regulatory elements suggest that many of the risk alleles lie in regions with biological functionality.

Conclusions: Through fine-mapping of known susceptibility regions, we have revealed alleles that better characterize breast cancer risk in women of African ancestry.

Introduction

Genome-wide association (GWAS) and large-scale fine-mapping studies have led to the identification of >100 breast cancer susceptibility loci that are estimated to explain approximately 20% of the 2-fold familial risk of breast cancer in women of European descendant (1–13). For populations of African ancestry, where the span of linkage disequilibrium (LD) has been shortened by recombination events (more generations), a weaker correlation between an “index” marker (from GWAS in Asian and European ancestry populations) and biologically relevant risk variants is expected. As a consequence, the index marker might not accurately capture the risk associated with the biologically functional variant in African ancestry populations. Comprehensive testing in a large African ancestry sample is needed to identify a set of markers that better capture risk associated with the functional allele at known risk regions, which is an important prerequisite for constructing genetic risk models for this population. Previous fine-mapping investigations in women of African ancestry have been limited in size, with the largest study including 3,016 breast cancer cases and 2,745 controls and having 80% power to detect reported effect sizes for only 10 of 72 variants examined (1–15).

To obtain greater statistical power for fine-mapping of known breast cancer susceptibility regions, we combined genotype and imputed data for 6,522 breast cancer cases and 7,643 controls from three large consortia of African ancestry breast cancer—the African American Breast Cancer GWAS Consortium (AABC; ref. 16), the African American Breast Cancer Epidemiology and Risk Consortium (AMBER; refs. 17, 18), and the Genome-Wide Association Study of Breast Cancer in the African Diaspora Consortium (ROOT; ref. 19). In addition to testing the reported index variants from previous GWAS, we conducted association analyses and functional annotation across each region in search of markers that might best define breast cancer risk in women of African ancestry.

Materials and Methods

Studies

The genetic data included in this analysis were from three consortia of breast cancer in women of African ancestry (AABC, AMBER, and ROOT). For this analysis, the African American Breast Cancer Consortium (AABC) included seven epidemiologic studies: The Multiethnic Cohort study (MEC), 734/1,003; The Los Angeles component of The Women's Contraceptive and Reproductive Experiences (CARE) Study, 380/224; The San Francisco Bay Area Breast Cancer Study (SFBCS), 172/23; The Northern California site of the Breast Cancer Family Registry (NC-BCFR), 440/53; The Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO) Cohort, 64/133; The Nashville Breast Health Study (NBHS), 310/186; and The Wake Forest University Breast Cancer Study (WFBC), 125/153). The current analysis includes GWAS data for 2,225 invasive cases and 1,983 controls from AABC (14). Although the Women's Circle of Health Study (WCHS) and The Carolina Breast Cancer Study (CBCS) participated in AABC, samples from those studies are included as part of the AMBER consortium described below.

Genotyping and quality control

Genotyping in AABC was conducted using the Illumina Human 1M-Duo BeadChip as described in Chen and colleagues (14). The ROOT samples were genotyped using the Illumina 2.5 M array (23). Samples in AMBER were genotyped using an Illumina Infinium custom ∼160 K SNP array which included approximately 45,000 SNPs selected primarily for fine-mapping of known breast cancer susceptibility regions.

Statistical analysis

Imputation in AABC and AMBER was conducted using IMPUTE2 (24) to a cosmopolitan panel of all 1000 Genome Project subjects (March 2012 release). IMPUTE2 (24) was also used to impute the untyped SNPs in ROOT using a cosmopolitan panel of all 1000 Genome Project subjects (October 2011 release). Imputed SNPs with imputation quality score > 0.7 and a minor allele frequency > 0.01 in each study were used in the fine-mapping analysis. We examined 74 risk variants for breast cancer in 72 regions that had been reported at the time this study was initiated (1–11). One additional variant, rs11571833 at chromosome 13q13, was not genotyped and could not be imputed in all three studies; this variant had a minor allele frequency of 0.006 in the 1000 Genomes AFR population. These 74 risk variants include stronger markers than the index SNP found in GWAS as well as independent signals discovered through subsequent fine-mapping studies (Supplementary Table S1; refs. 1, 3, 4, 8, 10, 11).

A total of 6,522 breast cancer cases (2,933 ER+ and 1,876 ER−) and 7,643 controls were included in the analysis. For each typed and imputed SNP, ORs and 95% confidence intervals (95% CI) were estimated using unconditional logistic regression adjusting for age (at diagnosis for cases and age at the reference date for controls), study, and the first 10 eigenvectors from a principal components analysis (25). For each SNP that existed in all three studies, we tested for allele dosage effects separately in each of the three studies, applying a 1-degree-of-freedom Wald χ2 trend test. Results were then combined using inverse variance–weighted fixed-effects meta-analysis, as implemented in METAL (26). We tested for effect heterogeneity between studies using Cochran Q-test as implemented in METAL. Power calculations were conducted using Quanto (http://hydra.usc.edu/gxe/) using the OR in previous GWAS and the allele frequency in African Americans.

To identify alleles that might capture the biologically functional variant at 70 of the known breast cancer risk regions, we searched and tested LD proxies among the genotyped and imputed SNPs that were correlated (r2 ≥ 0.4) with the index SNP [within 250 kb or larger if the index signal was contained within an LD block (based on the D' statistic) of >250 kb] in European ancestry populations, resulting in a total of 157,920 SNPs included in the analysis. Two regions, 5p15 and 20q11, were excluded from fine-mapping because the AABC sample was involved in the discovery of the risk loci in these regions (27, 28). The GWAS arrays and imputation in AAPC, AMBER, and ROOT provided good coverage of common variation (>5%) in the fine-mapped regions in African ancestry populations. For AABC, an average of 96% of common SNPs with a MAF >5% in the phase III 1000 Genome AFR population were tagged (at r2 > 0.8) by the genotyped and imputed SNPs. For ROOT and AMBER, these averages were each 97%. For each study, the coverage was >90% for all regions, with the exception of chromosome 1p11 (45% in each) and chromosome 6q25 in AMBER (82%).

Locus-specific significance levels were calculated as described in Feng and colleagues, 2014 (ref. 15; Supplementary Table S2). More specifically, locus-specific significance levels were calculated as 0.05 divided by the number of tag SNPs in the African population (1000 Genomes, AFR, March 2012 Release) that capture (r2 ≥ 0.8) all SNPs correlated with the index signal in the European population (1000 Genomes, EUR, March 2012 Release). To reduce false-positive signals, we required the P value of all better markers to be less than 0.01. In an attempt to eliminate minor fluctuations in P values for correlated SNPs, we also required the P value to decrease by more than one order of magnitude compared with the association with the index signal. If multiple variants satisfied the above criterion in each region, only the most statistically significant variant was reported.

We also tested for novel independent associations, focusing on all genotyped and imputed SNPs in each region that were uncorrelated with the index signal in European ancestry populations (r2 < 0.1), and applied a significance criterion of α = 5 × 10−6 for defining suggestive novel associations, as used in prior studies (14). This α is not as conservative as genome-wide significance and is an approximation of the number of tests to capture (at r2 ≥ 0.8) common risk alleles across all regions. To confirm independent associations, conditional analyses were performed that included the index SNP or better marker plus the most significant uncorrelated allele. The analysis was first conducted in each separate study and then combined using fixed-effects meta-analysis. Haplotype analysis on 16q12 was conducted applying the “haplo.stats” package in R (http://www.mayo.edu/research/labs/statistical-genetics-genetic-epidemiology/software).

The procedures described above were applied to the analysis of overall breast cancer as well as in secondary analyses stratified by ER status.

Functional annotations

We assessed whether any of the signals colocalized with 65 chromatin features that capture open chromatin regions and regulatory elements across the genome in ER+ breast cancer (MCF7, T47D, HCC1954), ER− breast cancer (MDAMB231) and normal breast (HMEC, Myoepithelial, Fibroblast, Luminal epithelial) cells identified by the Coetzee Laboratory (29–31) or obtained from the Encyclopedia of DNA Elements (ENCODE) project (32) or NIH Roadmap Epigenomics Mapping Consortium (REMC; refs. 33, 34). Enriched regions of chromatin features were either called by using the Sole-search program (35) or obtained from GEO databases (GSE35583, GSE32970, GSE35239, GSE46074, GSE49651, GSE78913). To refine the genomic regulatory regions, chromatin state segmentation information built by using a Hidden Markov Model (HMM) in MCF7 and HMEC were also included (36, 37).

We used motifbreakR (38) to search for transcription factor motifs that bind to each variant (39–43). Chromatin features that overlapped variants and motifs that significantly altered binding (using the default setting with the score threshold, 0.9) are summarized in Supplementary Table S3. We also included key transcription factors for breast cancer such as FOXA1, GATA3, and ESR1 ChIP-seq data in ER+ breast cancer cells (MCF7, T47D) from the ENCODE project to examine the occupancy of transcription factors in vitro at regulatory elements where variants reside (32).

Results

Of the 74 breast cancer risk variants, 68 were also common in women of African ancestry, with minor allele frequencies greater than 0.05 in all three studies. Of these 68 variants, we had ≥50% and ≥80% power (at P < 0.05) to detect previously reported effect sizes for 51 and 36 variants, respectively. The ORs observed for 54 (73%) of the 74 SNPs were directionally consistent with those previously reported (i.e., ORs were in the same direction), with 12 variants nominally statistically significant at P < 0.05 (Table 1; Supplementary Table S1). Of the 61 SNPs that were directionally consistent with previous GWAS findings and not replicated at P < 0.05 in this study, statistical power to detect the previously reported effect size for overall breast cancer was ≥80% for 29 (48%) SNPs (Supplementary Table S1). Fifty-three (72%) variants were positively associated with ER+ breast cancer (8 statistically significant at P < 0.05) and 37 (50%) variants were positively associated with ER− disease (10 statistically significant at P < 0.05) (Supplementary Table S4). Of the 7 variants that were reported to be more strongly associated with ER− than ER+ disease in European ancestry populations (rs6678914/1q32, rs4245739/1q32, rs12710696/2p24, rs10069690/5p15, rs11075995/16q12, rs67397200/19p13, rs2284378/20q11; refs. 2, 27, 28, 44), all were positively associated with the risk of ER− disease (3 at P < 0.05; rs4245739, rs10069690, and rs67397200). Statistical power was ≥80% to detect the reported effect size with ER− disease for 4 of the 7 variants (Supplementary Table S4).

To identify markers at known risk regions that might better define the index signals or serve as secondary, independent signals, fine-mapping analysis was conducted at each of the 70 regions (excluding 5p15 and 20q11, see Materials and Methods). Using region-specific thresholds, we observed associations of 7 markers with overall breast cancer or breast cancer subtypes at 6 regions (3p24, 12p11, 14q13, 16q12/FTO, 16q23, 19p13), while in two regions (11q13 and 16q12), we observed suggestive evidence of signals independent of the reported index variant (Supplementary Table S5). These regions are discussed below.

At 3p24, the index variant, rs4973768, was more strongly associated with ER− than ER+ disease in the initial GWAS (ER+: OR = 1.06, ER−: OR = 1.12, Phet = 0.022; ref. 7). Variant rs2370946, located in the intron of the NEK10 gene, with enhancer histone marks in ER+ breast cancer cells (i.e., HCC1954) and 155 kb from the index variant, rs4973768, was found statistically significantly associated with ER+ breast cancer in women of African ancestry (ER+: OR = 1.17, P = 7.8 × 10−4; ER−: OR = 1.11, P = 0.058; Supplementary Figs. S1 and S2). Variant rs2370946 is correlated with the index in European populations, but not in African populations (EUR: r2 = 0.66; AFR: r2 = 0.01).

At 11q13, the same variant reported by Chen and colleagues (rs609275: OR = 1.20, P = 1.0 × 10−5; ref. 14) was identified as an independent secondary signal in this region (Supplementary Table S5). This variant was statistically significantly associated with overall breast cancer in women of African ancestry [OR = 1.13, P = 4.5 × 10−6; r2 with the index variant: 0.022 (EUR), 0.003 (AFR); Fig. 1; Supplementary Fig. S1]. The variant rs609275, which resides in a gene desert region at 11q13, is located in a breast-specific active enhancer found not only in normal breast cells, but also breast cancer cells (both ER+ and ER−). We observed that the motif of NR3C1 (a.k.a. GR, glucocorticoid receptor) is disrupted by the SNP; NR3C1 is known to inhibit MAPK activation by inducing MAPK1, possibly influencing breast cancer cell survival (ref. 45; Fig. 1; Supplementary Fig. S2).

Regional plot and genome browser view of 11q13. The chromosomal position (based on GRCh37) of SNPs on 11q13 against −log10P values for overall breast cancer is shown on the top plot. The blue arrow denotes the secondary signal rs609275. The purple circle denotes the index variant rs614367. SNPs surrounding the index variant are colored to indicate the LD structure using pairwise r2 in reference to rs614367 from the May 2012 EUR panel of 1000 Genomes. The plots were generated using LocusZoom (55). Genome browser views with epigenetic chromatin features in breast cells (MCF7, HCC1954, MDAMB231, HMEC) on 11q13 are generated using the UCSC genome browser (56). Below is a magnified view of rs609275 with selected enhancer chromatin marks and DNA sequence of a response element. The gray shading indicates the location of the variant rs609275.

At 14q13, the index variant, rs2236007, was reported to be more strongly associated with ER+ than ER− breast cancer in the initial GWAS (ER+: OR = 1.10, P = 1.9 × 10−10; ER−: OR = 1.04, P = 0.081, Phet = 0.015; Supplementary Table S4; ref. 7). No association with the index variant could be detected in women of African ancestry (ER+: OR = 0.98, P = 0.72; ER−: OR = 0.94, P = 0.39). Through fine-mapping, the association with the most statistically significant P value was observed with rs73258644 and ER+ disease (ER+: OR = 1.43, P = 1.0 × 10−6; ER−: OR = 1.02, P = 0.82). rs73258644 is a perfect proxy for rs17104923, which we previously reported in AABC as a potential independent signal (r2 = 1 in EUR and AFR), and shows no correlation with the index variant rs2236007 (EUR r2 = 0.008; AFR r2 = 0.002). Among markers correlated with the index variant, the strongest association was observed with rs12883049 and ER+ disease (OR = 1.19, P = 5.6 × 10−5; Supplementary Fig. S1). This variant, rs12883049, is located in the intron of PAX9 with enhancer histone marks and open chromatin marks in all breast cell lines, suggesting an important role of this variant (Fig. 2; Supplementary Fig. S2). We also found that the motif of TFAP4 (a.k.a. AP4) is disrupted by the SNP. AP4 is involved in the cell cycle and also activates cell migration and epithelial–mesenchymal transition in breast cancer (47, 48). This variant is well correlated with the index variant in Europeans (r2 = 0.82), but not in women of African ancestry (r2 = 0.01). Variants rs73258644 and rs12883049 are modestly correlated (r2 = 0.35) and only rs73258644 remains statistically significant in conditional analyses with rs12883049 (P = 8.8 × 10−4) which suggests that rs73258644 is the best marker in the region relevant to women of African ancestry (Supplementary Table S6).

Regional plot and genome browser view of 14q13. The chromosomal position (based on GRCh37) of SNPs on 14q13 against −log10P values for ER+ breast cancer is shown. The blue arrow denotes the signal rs73258644 and the red arrow denotes rs12883049, which is a better marker of the index signal. The purple circle denotes the index variant rs2236007. SNPs surrounding the index variant are colored to indicate the LD structure using pairwise r2 in reference to rs2236007 from the May 2012 EUR panel of 1000 Genomes. The plots were generated using LocusZoom (55). Genome browser views with epigenetic chromatin features in breast cells (MCF7, HCC1954, MDAMB231, HMEC) on 14q13 are generated using the UCSC genome browser (56). Below is a magnified view of rs12883049 with selected enhancer chromatin marks and DNA sequence of a response element. The gray shading indicates the location of the variant rs12883049.

At 16q12/TOX3, the index variant rs3803662 was identified initially in association with ER+ disease (12). This variant was not associated with breast cancer subtypes in women of African ancestry (Supplementary Table S4). Our fine-mapping analysis of this region revealed a risk variant in the intron of TOX3, rs35850695 (r2 = 0.89 in EUR), that was more strongly associated with ER+ breast cancer (ER+: OR = 1.25, P = 2.4 × 10−5; ER–: OR = 1.07, P = 0.33; Phet = 0.033; Supplementary Table S5). However, the most statistically significantly associated risk variant in this region was rs3104791, which is located in the intron of long noncoding RNA (lncRNA), LINC00918 (OR = 1.18 for ER+ disease, P = 1.8 × 10−6; Supplementary Fig. S2). This variant is moderately correlated with the index (rs3803662) in both women of European and African ancestry (EUR: r2 = 0.28; AFR r2 = 0.20) and is also moderately correlated with rs35850695 in Europeans, but not in women of African ancestry (EUR: r2 = 0.24; AFR r2 = 0.018). A second potentially independent signal, rs3112565, was also noted (OR = 1.19, P = 2.3 × 10−5), which is a perfect proxy (r2 = 1 in AFR) for rs3112572 (14) and rs3104746 reported previously (ref. 49; Supplementary Table S6). In conditional analyses of these three signals, rs35850695 (P = 5.2 × 10−5) and rs3112565 (P = 0.0011) remained as independent signals for ER+ disease, but not rs3104791 (P = 0.054; Supplementary Table S6). Haplotypes containing the risk variant for rs3104791 were statistically significantly associated with risk together with either the risk alleles of rs3112565 and/or rs35850695, but not alone (OR = 1.03; P = 0.54; Supplementary Table S6). The variant rs35850695 is located in the intron of TOX3 gene, whereas the variants rs3112565 and rs310479 are located in the intron of LINC00918. The variant rs3112565 is also found in ER+ cancer–specific enhancer regions, annotated by histone marks, H3K4me1 and H3K27Ac (Supplementary Fig. S2).

At 16q12/FTO, two independent signals (rs17817449 and rs11075995) were discovered to be associated with breast cancer risk in previous GWAS and rs11075995 was identified as an ER− specific variant (7). In women of African ancestry, rs17817449 showed a statistically significant association with both overall breast cancer and ER+ disease (overall: OR = 1.07, P = 0.012; ER+: OR = 1.08, P = 0.029). We observed an association with rs62048370, that was statistically significantly and more strongly associated with ER+ breast cancer (overall: OR = 1.29, P = 0.00032; ER+: OR = 1.59, P = 3.0 × 10−6; ER−: OR = 1.04, P = 0.72). Variant rs62048370 is not correlated with either of the index variants in European or African populations (rs17817449: EUR r2 < 0.001, AFR r2 = 0.004; rs11075995: EUR r2 < 0.001, AFR r2 = 0.007; Supplementary Fig. S1; Supplementary Table S5). This variant also overlaps with enhancer histone marks in ER+ breast cancer and normal breast cell lines, which are in close proximity to open chromatin regions in which transcription factors such as FOXA1, GATA3, and ESR1 bind (Supplementary Fig. S2).

At 16q23, the index variant rs13329835 was reported to be more strongly associated with ER+ disease in the initial GWAS (7). Through fine-mapping, we identified another variant, rs9940301, which is highly correlated with the index variant in Europeans (r2 = 0.84), and was statistically significantly associated with ER+ breast cancer in women of African ancestry (OR = 1.13, P = 8.5 × 10−4; Supplementary Table S5; Supplementary Fig. S1). The variant rs9940301 is in the intron of the CDYL2 gene, and encodes a chromodomain protein, which interacts with histone H3K9me3 (Supplementary Fig. S2; ref. 50).

At the ER− risk region 19p13, variant rs11668840, which is correlated with the index SNP in Europeans (rs67397200; r2 = 0.49), was the most statistically significantly associated marker for ER− breast cancer (OR = 1.25, P = 3.1 × 10−8; Supplementary Fig. S1; Supplementary Tables S4 and S5). The variant rs11668840 is 1.2 kb downstream of the transcription termination site of the ANKLE1 gene and is not located within any regions of open chromatin (Supplementary Fig. S2).

Discussion

The majority of GWAS-identified risk variants for breast cancer are common in women of African ancestry with directions of effect that are consistent with the discovery populations. However, in this sample, which is the largest breast cancer genetics study ever conducted in the women of African ancestry (6,522 cases and 7,643 controls), only 12 variants were directionally consistent with previous GWAS and nominally statistically significant at P < 0.05. In fine-mapping, we were successful in identifying seven markers for overall breast cancer or breast cancer subtype in six regions (3p24, 12p11, 14q13, 16q12/FTO, 16q23, 19p13) that were more likely (than the index variant) to capture the breast cancer association in this population. In another two regions (11q13 and 16q12/TOX3) we identified risk variants independent of the index signal. Among these regions harboring better markers or independent signals, only at 19p13 was the index variant also significantly associated with breast cancer risk.

The 74 variants analyzed in this study were reported to have an average OR of 1.09, with only 17 (23%) having ORs >1.10. Of the 61 SNPs that were directionally consistent with previous GWAS findings but not statistically significant in African Americans, statistical power to detect the previously reported effect sizes for overall breast cancer was ≥80% for 29 SNPs (48%). While reasonable statistical power was noted for roughly 50% of these regions, the inability to achieve statistical significance for the majority of these loci is likely due to differences in LD structure between populations of European and African ancestry. Statistical power in fine-mapping analyses is even more severely limited as we employed conservative locus-specific alpha levels to limit the number of false-positive associations. Statistical power to detect associations as large as those of the index signals while adjusting for multiple comparisons in the fine mapping was ≥80% at only 13 of the 70 regions (Supplementary Table S2). It is important to note that the markers we highlighted in each region only indicate whether the region replicates in African ancestry populations. There is a high degree of variability in the association statistics (ORs, P values and SEs) due to many factors including genotyping success rate and imputation quality, which has an impact on the ranking of associated correlated SNPs.

To further prioritize variants for functional follow-up testing, we mapped the most strongly associated variants relative to epigenomic datasets (see Materials and Methods). For the better markers or independent signals, we identified in this study, 7 overlapped with enhancer histone marks (Supplementary Table S3). In addition, we discovered that some of the better marker/independent signals more strongly associated with ER+ breast cancer were found in ER+ breast cancer-specific enhancers (e.g., rs2370946 at 3p24). On the other hand, some of the better markers or independent signals associated with overall breast cancer risk (both ER+ and ER−) were found in putative breast enhancers common in both ER+ and ER− breast cancer cells (e.g., rs609275 in 11q13). The underlying risk variants may play different roles and have unique mechanisms to increase breast cancer risk; however, we may deduce that subtype-specific enhancer activity might be tightly linked with some of these risk regions.

The most statistically significant associations in women of African ancestry identified in both previous studies as well as the current investigation were with variants on 11q13, 14q13, 16q12/TOX3, and 19p13 (14, 15). At 11q13, the putative novel signal locates 53 kb upstream of CCND1 (Cyclin D1). Cyclin D1 plays a key role in cell-cycle regulation and is one of the most commonly overexpressed proteins in breast tumors (51). At 14q13, variants were located in the gene PAX9 (paired box 9), which has been shown to be required for the growth and survival of breast cancer cells (52). At 16q12, the signals are located within the intron of a lncRNA, LINC00918, and the TOX3 (TOX-high mobility group box family member 3) gene, which may be involved in the bending and unwinding of DNA and altering chromatin structure (53). At 19p13, the risk variant is located near the genes BABAM1 (BRISC and BRCA1-A complex member 1), ANKLE1 (Ankyrin Repeat And LEM Domain-Containing Protein 1), and ABHD8 (abhydrolase domain containing 8). BABAM1 is the best candidate that may be influenced by genetic variation in the region given its interaction with BRCA1 (54).

In conclusion, 54 (73%) of the 74 breast cancer risk variants examined in women of African ancestry had effects that were directionally consistent with those previously reported, with 12 being nominally statistically significant. These findings support prior studies indicating that the majority of established breast cancer risk loci found in populations of European and Asian ancestry are also likely to be susceptibility regions for women of African ancestry. In six regions, we observed suggestive evidence of common alleles that may better characterize the association with breast cancer in women of African ancestry. Despite the sample size of the current effort, which includes all existing genetic studies of breast cancer in women of African ancestry globally, substantially larger studies, including multiethnic studies, will be needed to fully understand the genetic architecture of breast cancer in women of African ancestry.

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

The Editor-in-Chief (Timothy R. Rebbeck) is an author on this article. In keeping with the AACR's editorial policy, the peer review of this submission was managed by a senior member of Cancer Epidemiology, Biomarkers & Prevention's editorial team; a member of the AACR Publications Committee rendered the final decision concerning acceptability.

Disclaimer

The content of this manuscript does not necessarily reflect the views or policies of the National Cancer Institute or any of the collaborating centers in the BCFR, nor does mention of trade names, commercial products, or organizations imply endorsement by the United States Government or the BCFR.

Other (supervision of initial preparation of DNA, storage, and shipping): A.G. Falusi

Acknowledgments

This research was funded by the NIH and Foundation grants: P01 CA151135, R01 CA058420, UM1 CA164974, R01 CA098663, R01 CA100598, R01 CA185623, UM1 CA164973, R01 CA54281, R01 CA063464, R01 CA190182, P50 CA58223, U01 CA179715, R01 CA142996, P50 CA125183, and R01 CA89085, the Department of Defense Breast Cancer Research Program, Era of Hope Scholar Award Program W81XWH-08-1-0383 (AABC); the Susan G. Komen for the Cure Foundation; the Breast Cancer Research Foundation; and the University Cancer Research Fund of North Carolina. Pathology data were obtained from numerous state cancer registries (Arizona, California, Colorado, Connecticut, Delaware, District of Columbia, Florida, Georgia, Hawaii, Illinois, Indiana, Kentucky, Louisiana, Maryland, Massachusetts, Michigan, New Jersey, New York, North Carolina, Oklahoma, Pennsylvania, South Carolina, Tennessee, Texas, Virginia). Studies in AABC were supported by National Institute for Child Health and Development contract NO1-HD-3-3175 (CARE), NIH grant CA100374, and the Biospecimen Core Lab that is supported in part by the Vanderbilt-Ingram Cancer Center (CA68485; NBHS), by NIH grant CA73629 (WFBC), NIH grant CA77305 and United States Army Medical Research Program grant DAMD17-96-6071 (SFBCS), by NIH grant CA164920 (NC-BCFR). The Breast Cancer Family Registry (BCFR) was supported by grant UM1 CA164920 from the U.S. National Cancer Institute. Genotyping of the PLCO samples was funded by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, NCI, NIH.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Footnotes

Note: Supplementary data for this article are available at Cancer Epidemiology, Biomarkers & Prevention Online (http://cebp.aacrjournals.org/).