Fine-mapping of prostate cancer susceptibility loci in a large meta-analysis identifies candidate causal variants

Dadaev, Tokhir; Saunders, Edward J.; Newcombe, Paul J.; Anokian, Ezequiel; Leongamornlert, Daniel A.; Brook, Mark N.; Cieza-Borrella, Clara; Mijuskovic, Martina; Wakerell, Sarah; Olama, Ali Amin Al; Schumacher, Fredrick R.; Berndt, Sonja I.; Benlloch, Sara; Ahmed, Mahbubl; Goh, Chee; Sheng, Xin; Zhang, Zhuo; Muir, Kenneth; Govindasami, Koveela; Lophatananon, Artitaya; Stevens, Victoria L.; Gapstur, Susan M.; Carter, Brian D.; Tangen, Catherine M.; Goodman, Phyllis; Thompson, Ian M.; Batra, Jyotsna; Chambers, Suzanne; Moya, Leire; Clements, Judith; Horvath, Lisa; Tilley, Wayne; Risbridger, Gail; Gronberg, Henrik; Aly, Markus; Nordström, Tobias; Pharoah, Paul; Pashayan, Nora; Schleutker, Johanna; Tammela, Teuvo L. J.; Sipeky, Csilla; Auvinen, Anssi; Albanes, Demetrius; Weinstein, Stephanie; Wolk, Alicja; Hakansson, Niclas; West, Catharine; Dunning, Alison M.; Burnet, Neil; Mucci, Lorelei; Giovannucci, Edward; Andriole, Gerald; Cussenot, Olivier; Cancel-Tassin, Géraldine; Koutros, Stella; Freeman, Laura E. Beane; Sorensen, Karina Dalsgaard; Orntoft, Torben Falck; Borre, Michael; Maehle, Lovise; Grindedal, Eli Marie; Neal, David E.; Donovan, Jenny L.; Hamdy, Freddie C.; Martin, Richard M.; Travis, Ruth C.; Key, Tim J.; Hamilton, Robert J.; Fleshner, Neil E.; Finelli, Antonio; Ingles, Sue Ann; Stern, Mariana C.; Rosenstein, Barry; Kerns, Sarah; Ostrer, Harry; Lu, Yong-Jie; Zhang, Hong-Wei; Feng, Ninghan; Mao, Xueying; Guo, Xin; Wang, Guomin; Sun, Zan; Giles, Graham G.; Southey, Melissa C.; MacInnis, Robert J.; FitzGerald, Liesel M.; Kibel, Adam S.; Drake, Bettina F.; Vega, Ana; Gómez-Caamaño, Antonio; Fachal, Laura; Szulkin, Robert; Eklund, Martin; Kogevinas, Manolis; Llorca, Javier; Castaño-Vinyals, Gemma; Penney, Kathryn L.; Stampfer, Meir; Park, Jong Y.; Sellers, Thomas A.; Lin, Hui-Yi; Stanford, Janet L.; Cybulski, Cezary; Wokolorczyk, Dominika; Lubinski, Jan; Ostrander, Elaine A.; Geybels, Milan S.; Nordestgaard, Børge G.; Nielsen, Sune F.; Weisher, Maren; Bisbjerg, Rasmus; Røder, Martin Andreas; Iversen, Peter; Brenner, Hermann; Cuk, Katarina; Holleczek, Bernd; Maier, Christiane; Luedeke, Manuel; Schnoeller, Thomas; Kim, Jeri; Logothetis, Christopher J.; John, Esther M.; Teixeira, Manuel R.; Paulo, Paula; Cardoso, Marta; Neuhausen, Susan L.; Steele, Linda; Ding, Yuan Chun; De Ruyck, Kim; De Meerleer, Gert; Ost, Piet; Razack, Azad; Lim, Jasmine; Teo, Soo-Hwang; Lin, Daniel W.; Newcomb, Lisa F.; Lessel, Davor; Gamulin, Marija; Kulis, Tomislav; Kaneva, Radka; Usmani, Nawaid; Slavov, Chavdar; Mitev, Vanio; Parliament, Matthew; Singhal, Sandeep; Claessens, Frank; Joniau, Steven; Van den Broeck, Thomas; Larkin, Samantha; Townsend, Paul A.; Aukim-Hastie, Claire; Gago-Dominguez, Manuela; Castelao, Jose Esteban; Martinez, Maria Elena; Roobol, Monique J.; Jenster, Guido; van Schaik, Ron H. N.; Menegaux, Florence; Truong, Thérèse; Koudou, Yves Akoli; Xu, Jianfeng; Khaw, Kay-Tee; Cannon-Albright, Lisa; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Thibodeau, Stephen N.; McDonnell, Shannon K.; Schaid, Daniel J.; Lindstrom, Sara; Turman, Constance; Ma, Jing; Hunter, David J.; Riboli, Elio; Siddiq, Afshan; Canzian, Federico; Kolonel, Laurence N.; Le Marchand, Loic; Hoover, Robert N.; Machiela, Mitchell J.; Kraft, Peter; Freedman, Matthew; Wiklund, Fredrik; Chanock, Stephen; Henderson, Brian E.; Easton, Douglas F.; Haiman, Christopher A.; Eeles, Rosalind A.; Conti, David V.; Kote-Jarai, Zsofia

doi:10.1038/s41467-018-04109-8

Download PDF

Article
Open access
Published: 11 June 2018

Fine-mapping of prostate cancer susceptibility loci in a large meta-analysis identifies candidate causal variants

Tokhir Dadaev ORCID: orcid.org/0000-0002-8268-0438¹^na1,
Edward J. Saunders¹^na1,
Paul J. Newcombe²,
Ezequiel Anokian¹,
Daniel A. Leongamornlert ORCID: orcid.org/0000-0002-3486-3168^1,3,
Mark N. Brook ORCID: orcid.org/0000-0002-8969-2378¹,
Clara Cieza-Borrella¹,
Martina Mijuskovic¹,
Sarah Wakerell¹,
Ali Amin Al Olama ORCID: orcid.org/0000-0002-7178-3431^4,5,
Fredrick R. Schumacher ORCID: orcid.org/0000-0002-3073-7463^6,7,
Sonja I. Berndt⁸,
Sara Benlloch^1,4,
Mahbubl Ahmed¹,
Chee Goh¹,
Xin Sheng⁹,
Zhuo Zhang⁹,
Kenneth Muir ORCID: orcid.org/0000-0001-6429-988X^10,11,
Koveela Govindasami¹,
Artitaya Lophatananon^10,11,
Victoria L. Stevens¹²,
Susan M. Gapstur¹²,
Brian D. Carter¹²,
Catherine M. Tangen¹³,
Phyllis Goodman¹³,
Ian M. Thompson Jr.¹⁴,
Jyotsna Batra ORCID: orcid.org/0000-0003-4646-6247^15,16,
Suzanne Chambers^17,18,
Leire Moya^15,16,
Judith Clements^15,16,
Lisa Horvath^19,20,
Wayne Tilley²¹,
Gail Risbridger^22,23,
Henrik Gronberg²⁴,
Markus Aly^24,25,
Tobias Nordström ORCID: orcid.org/0000-0003-4915-7546^24,26,
Paul Pharoah ORCID: orcid.org/0000-0001-8494-732X^4,27,
Nora Pashayan ORCID: orcid.org/0000-0003-0843-2468^27,28,
Johanna Schleutker ORCID: orcid.org/0000-0002-1863-0305^29,30,
Teuvo L. J. Tammela³¹,
Csilla Sipeky ORCID: orcid.org/0000-0002-8853-4722²⁹,
Anssi Auvinen³²,
Demetrius Albanes⁸,
Stephanie Weinstein⁸,
Alicja Wolk ORCID: orcid.org/0000-0001-7387-6845³³,
Niclas Hakansson³³,
Catharine West³⁴,
Alison M. Dunning²⁷,
Neil Burnet³⁵,
Lorelei Mucci³⁶,
Edward Giovannucci³⁶,
Gerald Andriole³⁷,
Olivier Cussenot^38,39,
Géraldine Cancel-Tassin ORCID: orcid.org/0000-0002-9583-6382^38,39,
Stella Koutros⁸,
Laura E. Beane Freeman⁸,
Karina Dalsgaard Sorensen ORCID: orcid.org/0000-0002-4902-5490^40,41,
Torben Falck Orntoft^40,41,
Michael Borre^41,42,
Lovise Maehle⁴³,
Eli Marie Grindedal⁴³,
David E. Neal^44,45,46,
Jenny L. Donovan⁴⁷,
Freddie C. Hamdy^46,48,
Richard M. Martin^47,49,50,
Ruth C. Travis⁵¹,
Tim J. Key⁵¹,
Robert J. Hamilton⁵²,
Neil E. Fleshner⁵²,
Antonio Finelli⁵²,
Sue Ann Ingles⁹,
Mariana C. Stern⁹,
Barry Rosenstein^53,54,
Sarah Kerns ORCID: orcid.org/0000-0002-6503-0011⁵⁵,
Harry Ostrer ORCID: orcid.org/0000-0002-2209-5376⁵⁶,
Yong-Jie Lu⁵⁷,
Hong-Wei Zhang⁵⁸,
Ninghan Feng⁵⁹,
Xueying Mao⁵⁷,
Xin Guo^60,61,
Guomin Wang⁶²,
Zan Sun⁶¹,
Graham G. Giles^63,64,
Melissa C. Southey⁶⁵,
Robert J. MacInnis^63,64,
Liesel M. FitzGerald^64,66,
Adam S. Kibel⁶⁷,
Bettina F. Drake³⁷,
Ana Vega⁶⁸,
Antonio Gómez-Caamaño⁶⁹,
Laura Fachal ORCID: orcid.org/0000-0002-7256-9752^4,68,
Robert Szulkin^70,71,
Martin Eklund²⁴,
Manolis Kogevinas ORCID: orcid.org/0000-0002-9605-0461^72,73,74,75,
Javier Llorca^73,76,
Gemma Castaño-Vinyals^72,73,74,75,
Kathryn L. Penney⁷⁷,
Meir Stampfer⁷⁷,
Jong Y. Park⁷⁸,
Thomas A. Sellers⁷⁸,
Hui-Yi Lin⁷⁹,
Janet L. Stanford^80,81,
Cezary Cybulski⁸²,
Dominika Wokolorczyk⁸²,
Jan Lubinski⁸²,
Elaine A. Ostrander⁸³,
Milan S. Geybels⁸⁰,
Børge G. Nordestgaard ORCID: orcid.org/0000-0002-1954-7220^84,85,
Sune F. Nielsen^84,85,
Maren Weisher⁸⁵,
Rasmus Bisbjerg⁸⁶,
Martin Andreas Røder⁸⁷,
Peter Iversen^84,87,
Hermann Brenner^88,89,90,
Katarina Cuk⁸⁸,
Bernd Holleczek⁹¹,
Christiane Maier⁹²,
Manuel Luedeke⁹²,
Thomas Schnoeller⁹³,
Jeri Kim⁹⁴,
Christopher J. Logothetis⁹⁴,
Esther M. John^95,96,
Manuel R. Teixeira^97,98,
Paula Paulo⁹⁷,
Marta Cardoso⁹⁷,
Susan L. Neuhausen⁹⁹,
Linda Steele⁹⁹,
Yuan Chun Ding⁹⁹,
Kim De Ruyck¹⁰⁰,
Gert De Meerleer¹⁰⁰,
Piet Ost¹⁰¹,
Azad Razack¹⁰²,
Jasmine Lim ORCID: orcid.org/0000-0002-7501-1834¹⁰²,
Soo-Hwang Teo¹⁰³,
Daniel W. Lin^80,104,
Lisa F. Newcomb^80,104,
Davor Lessel¹⁰⁵,
Marija Gamulin¹⁰⁶,
Tomislav Kulis¹⁰⁷,
Radka Kaneva¹⁰⁸,
Nawaid Usmani^109,110,
Chavdar Slavov¹¹¹,
Vanio Mitev¹⁰⁸,
Matthew Parliament^109,110,
Sandeep Singhal¹⁰⁹,
Frank Claessens¹¹²,
Steven Joniau¹¹³,
Thomas Van den Broeck^112,113,
Samantha Larkin¹¹⁴,
Paul A. Townsend¹¹⁵,
Claire Aukim-Hastie¹¹⁶,
Manuela Gago-Dominguez^117,118,
Jose Esteban Castelao¹¹⁹,
Maria Elena Martinez¹²⁰,
Monique J. Roobol¹²¹,
Guido Jenster¹²¹,
Ron H. N. van Schaik¹²²,
Florence Menegaux¹²³,
Thérèse Truong ORCID: orcid.org/0000-0002-2943-6786¹²³,
Yves Akoli Koudou¹²³,
Jianfeng Xu ORCID: orcid.org/0000-0002-1343-8752¹²⁴,
Kay-Tee Khaw¹²⁵,
Lisa Cannon-Albright^126,127,
Hardev Pandha¹¹⁶,
Agnieszka Michael¹¹⁶,
Andrzej Kierzek¹¹⁶,
Stephen N. Thibodeau¹²⁸,
Shannon K. McDonnell¹²⁹,
Daniel J. Schaid¹²⁹,
Sara Lindstrom¹³⁰,
Constance Turman¹³¹,
Jing Ma⁷⁷,
David J. Hunter¹³¹,
Elio Riboli¹³²,
Afshan Siddiq¹³³,
Federico Canzian¹³⁴,
Laurence N. Kolonel¹³⁵,
Loic Le Marchand¹³⁵,
Robert N. Hoover⁸,
Mitchell J. Machiela⁸,
Peter Kraft¹³¹,
The PRACTICAL (Prostate Cancer Association Group to Investigate Cancer-Associated Alterations in the Genome) Consortium,
Matthew Freedman¹³⁶,
Fredrik Wiklund²⁴,
Stephen Chanock⁸,
Brian E. Henderson⁹^na3,
Douglas F. Easton ORCID: orcid.org/0000-0003-2444-3247^4,27,
Christopher A. Haiman⁹^na2,
Rosalind A. Eeles ORCID: orcid.org/0000-0002-3698-6241^1,137^na2,
David V. Conti⁹^na2 &
…
Zsofia Kote-Jarai¹^na2

Nature Communications volume 9, Article number: 2256 (2018) Cite this article

11k Accesses
72 Citations
36 Altmetric
Metrics details

Subjects

Abstract

Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increase the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling.

Characterizing prostate cancer risk through multi-ancestry genome-wide discovery of 187 novel risk variants

Article 09 November 2023

Anqi Wang, Jiayi Shen, … Christopher A. Haiman

Trans-ancestry genome-wide association meta-analysis of prostate cancer identifies new susceptibility loci and informs genetic risk prediction

Article 04 January 2021

David V. Conti, Burcu F. Darst, … Christopher A. Haiman

8q24 genetic variation and comprehensive haplotypes altering familial risk of prostate cancer

Article Open access 23 March 2020

William D. Dupont, Joan P. Breyer, … Jeffrey R. Smith

Introduction

Prostate cancer (PrCa) is the most common cancer among males in developed countries. As there is evidence for a large heritable component for PrCa, the identification of genetic variation that increases susceptibility may help to inform screening strategies and clinical management of patients in the future. Currently, only a handful of rare genetic variants with larger effect sizes have been reported that increase the risk of PrCa (e.g., BRCA2 and ATM)^1,2. By comparison, genome-wide association studies (GWAS) have reported >100 low-penetrance PrCa risk signals with small odds ratios (ORs)³. Individually, these GWAS loci only modestly influence risk. However, because the risk alleles are relatively common within the general population their cumulative impact is substantial.

When an initial GWAS identifies a susceptibility locus, any one (or more) of a large number of variants within the region may underlie the molecular mechanism that modulates risk. This includes correlated variants in linkage disequilibrium (LD) that may capture the same association signal and additional variants with independent associations. Genotyping a denser set of variants in the region facilitates characterisation of the underlying genetic architecture and makes subsequent imputation more precise and complete. Although forward stepwise selection is frequently used for fine-mapping, it has severe limitations, particularly the way LD can lead to misleading results. In this manuscript, we report the findings of a PrCa fine-mapping study in a European ancestry meta-analysis sample set that is the largest to date and utilise the well-established stochastic search and model selection framework, which more accurately represents the uncertainty in determining both the number of signals and the set of single-nucleotide polymorphisms (SNPs) that best describe the association in each region^4,5,6,7. To leverage the large sample size from the overall meta-analysis, we use a novel multivariate Bayesian variable selection approach, which takes marginal SNP summary statistics as input and accounts for LD, to jointly analyse all SNPs in a region. We identify a catalogue of variants and further prioritise within this set through functional annotation, to assist identification of putative causal variants. This refined credible set of variants explains a substantially larger proportion of the estimated familial relative risk (FRR) of PrCa compared with the original GWAS tags.

Results

Replication of reported associations prior to fine-mapping

In this study, we examined 92 PrCa GWAS risk associations within 85 distinct genomic regions reported prior to the recent meta-analysis using the OncoArray experiment⁸; due to their complexity, two regions (Chr8q24 and Chr6p21/MHC) were excluded and are subject to separate studies. Some regions contained more than one signal due to close proximity between the reported index SNPs. Summary results from the large European ancestry meta-analysis comprising 82,591 PrCa cases and 61,213 controls from eight GWAS sub-cohorts (OncoArray, iCOGS, UK stage 1 and 2, CaPS 1 and 2, BPC3 and NCI PEGASUS), imputed to the 1000 Genomes phase 3 reference panel, were used for our fine-mapping analysis.

We first assessed whether all 92 original associations had replicated with at least one variant in the region at a genome-wide significant level (marginal P-value <5 × 10⁻⁸). Five regions had not replicated and were excluded from downstream fine-mapping analyses accordingly (Supplementary Table 1). An additional 3 associations previously reported in different ancestral populations also had not replicated in our European sample set; however, these original lead variants were each situated within the region boundary of another replicated GWAS association and therefore the expanded region boundary was retained during fine-mapping for logistical purposes, although only the associations replicated in Europeans were considered as index variants. Fine-mapping was therefore conducted for 84 replicated, previously reported GWAS signals, within 80 distinct regions (Fig. 1). This included the region encompassing the moderate penetrance risk SNP rs138213197 in HOXB13, which although originally identified through sequencing⁹ was included due to its relatively close proximity to the GWAS association rs11650494. The HOXB13 region therefore also served as a useful positive control during mapping, since the known causal variant exerts a relatively large effect size (OR 3.85) and has low minor allele frequency (MAF), but the signal is also detectable through a cluster of more common variants as a ‘synthetic association’¹⁰.

The eight signals that did not replicate in our European meta-analysis may remain risk loci for PrCa in other ancestral populations or specific disease phenotypes rather than overall PrCa risk, although we cannot completely exclude the possibility that some were false positives. Two of these variants were originally reported in a multi-ethnic meta-analysis (rs7153648 and rs12051443), one failed quality control (QC) due to strongly discordant MAF between individual sub-studies within the meta-analysis (rs6625711) and is also reported as having extremely discordant MAF between 1000 Genomes phase 1 and phase 3 cohorts (MAF in EUR 0.45 vs. 0.16), one was associated with young-onset disease only (rs636291), one only for aggressive PrCa (rs1571801) and the final three were reported in populations of Chinese (rs103294), Japanese (rs2055109) or African (rs7210100) ancestry and had not been confirmed in Europeans to date^{11,12,13,14,15}.

Multivariate fine-mapping from univariate summary statistics

We utilised Joint Analysis of Marginal summary statistics (JAM)¹⁶, a novel fine-mapping framework that uses summary statistics and explores multi-SNP models while accounting for LD. JAM provides inference of two important measures; (1) the most likely number of independent risk variants in the region and (2) a 95% credible set of variants that drive these signal(s). This credible set includes all variants from regression models that cumulatively reach at least 95% posterior probability in JAM’s stochastic search. Prior to running JAM, the variants were pruned to eliminate high LD (initially set at r² > 0.9, decreased in r² = 0.05 increments if required, Fig. 1). JAM was run twice for each region using independent seeds of 10 million iterations each. Final credible sets for each region included the set of tag variants identified by JAM and the pruned SNPs in high LD with these tags. Region-wide Bayes factors were used to provide evidence for the minimum number of independent signals. For 75 regions JAM successfully inferred credible sets of associated variants from the meta-analysis summary statistics, with 91% concordance of variants selected between two independent runs. For the final 5 regions, JAM did not infer a strong posterior probability for any variant, therefore was unable to select candidate variants.

Overall, we identified 99 independent PrCa risk signals within the 80 replicated regions (Tables 1–3). In all, 68 regions contained a single PrCa risk association, whilst we detected evidence for multiple independent risk signals within 12 regions (15% of replicated loci). In the initial meta-analysis data set, the 80 replicated regions contained a total of 213,728 SNPs, of which 14,463 were genome-wide significant and 25,186 marginally associated with PrCa at P < 5 × 10⁻⁵. From this variant set, JAM identified a catalogue of 3700 SNPs as the final 95% credible set of candidate causal variants for the 75 regions successfully fine-mapped (Supplementary Data 1), whilst in the 5 regions in which JAM could not identify candidate variants, a total of 175 variants had reached genome-wide significance in the univariate meta-GWAS results, including a novel more strongly associated lead variant in 4 of the 5 regions (Supplementary Data 1). The majority of variants within the JAM credible set were common (Supplementary Fig. 1a), with only 2 variants having MAF < 1% and 48 variants MAF < 5%; lower MAF variants do however represent the most likely candidate causal variants within certain regions. We also observed a slight increase in the distribution of univariate ORs for the novel lead variants we have identified in comparison to the original GWAS tag SNPs (Supplementary Fig. 1b). Only 15 original GWAS tag SNPs remained within the catalogue of candidate variants, with all other signals being replaced by more likely candidates. As expected, fine-mapping performance varied by region, with 95% credible set sizes ranging from 1 to 606 variants. We did however observe strong refinement of variants within the majority of regions (median 24 variants per region overall and 21 for single-signal regions). Indeed, among the 63 single-signal regions, 30 returned a 95% credible set containing ≤20 variants, of which 20 comprised ≤10 variants and 4 returned a credible set containing a single variant. These represent the putative causal PrCa susceptibility variant within that locus and include the well-established HOXB13 causal variant rs138213197 at Chr17q21⁹, as well as rs10993994 in the promoter of MSMB, which modulates gene expression in prostate tissue^17,18,19. These two regions serve as proof of principle; our methodology selected the presumed causal variants and therefore the remaining two single candidate variants are very likely to be causal and are strong candidates to test in functional studies. These two variants are an intronic SNP in TBX1, and a low MAF frameshift insertion in the final exon of FAM111A; which confirms for the first time in Europeans the GWAS hit at this locus previously reported in Japanese¹¹, although the European and Japanese variants are not in LD. The 12 regions with multiple independent risk signals contained 31 independent signals in total, represented by a 95% credible set of 626 variants (median 33.5 variants per region, average 20.2 variants per association signal). Prioritisation also performed well in these complex regions. In the TERT region at Chr5p15 we observed the highest number of independent signals, 5, and the credible set comprised only 30 SNPs. Similarly, 3 regions each containing 3 signals (Chr2q37:FARP2/ANO7, Chr17q12:HNF1B and Chr19q13:KLK3) returned a combined credible set of 61 variants representing these 9 PrCa associations. Notably, we observed that the regions found to contain multiple independent signals generally had P-values and marginal ORs towards the upper end of the distribution of original GWAS hits in the univariate meta-GWAS (Supplementary Fig. 2).

Table 1 Overview of fine-mapping results by region for regions 1–27 of the 80 regions fine-mapped

Full size table

Table 2 Overview of fine-mapping results by region for regions 28–54 of the 80 regions fine-mapped

Full size table

Table 3 Overview of fine-mapping results by region for regions 55–80 of the 80 regions fine-mapped, and summary results across all 80 regions

Full size table

Integration of annotation

We annotated variants for indicators of putative biological functionality using data from publically available databases. Intragenic variants were ascribed to genes relative to GENCODEv19, miRNA variants using MirBasev20 and variants situated within segments of the genome under evolutionary conservation were annotated using conserved element outputs generated by four algorithms (GERP++, SiPhy Omega, SiPhy Pi and Phastcons)^20,21,22. For information derived from tissue-based experimental data sets, we focused primarily on those conducted in prostate cell lines; specifically DNaseI hypersensitivity sites in three prostate cell types from seven experiments in the ENCODE project, chromatin-state characterisations by ChromHMM from Taberlay et al.²³, ChIP-seq peak locations for a variety of transcription factor (AR, CTCF, ERG, FOXA1, GABPA, GATA2, HOXB13 and NKX3.1) and histone mark (H3K27Ac, H3K27Me3 and H3K4Me3) data sets retrieved through the Cistrome Data Browser²⁴, and expression quantitative trait loci (eQTLs) from a set of 359 PrCa samples in the Cancer Genome Atlas (TCGA).

To formally incorporate these annotations into the prioritisation of SNPs, for the 75 regions in which JAM selected candidate variants, we investigated posterior estimates from JAM for all 37 863 pruned tags against annotation features using a conditional quantile regression (QR) analysis^25,26 at multiple quantiles (99.2, 99.4, 99.6, 99.8 and 99.95%). These correspond to posterior probabilities ranging from 0.01 to 0.99, with the exact values conditional on the linear combination of the annotations. At each quantile, we used the fitted model to calculate a predicted posterior probability given the SNP’s annotation features. A single expected posterior probability was then calculated from a weighted average of these quantile-specific expected posterior probabilities with the weight reflecting both the fit (i.e., a function of the likelihood) and variance of the predicted values from the quantile-specific model to the data. We selected a single data set for each annotation category for the QR analysis to minimise correlation between variables. Whilst the majority of tag probabilities were not notably adjusted during QR, an appreciable subset of variants were up- or downgraded based upon their annotations (ΔPosterior probability_QR ranged between −0.304 and 0.254; 63 of the 37,863 tags had a ΔPosterior probability_QR of magnitude ±0.005 or greater) (Supplementary Fig. 3). The conditional QR also facilitates identification of the annotations that demonstrate an association across the extreme quantiles of the posterior probabilities. Specifically, several annotations (eQTLs within TCGA PrCa tissue, AR and GATA2 transcription factor-binding sites, LNCaP DNase1, H3K27Ac and H3K4Me3 histone marks, enhancer and repressed chromatin states by ChromHMM, conservation according to GERP++, higher CADD scores and protein altering variants) had statistically significant associations (P < 1.0 × 10⁻³) for at least one quantile (Supplementary Data 2). That is, the upper quantiles of the posterior probability distribution for variants with any of these annotations were larger when compared with SNPs without those annotations.

For comparison to the conditional QR approach, we also used Fisher’s exact test to examine the representation of individual annotation features across variants included in the 95% credible set of prospective PrCa causal variants relative to variants not selected. Independent tests were conducted for each annotation upon the set of 37,863 tag variants analysed by JAM, of which 343 tags represented the 95% credible set of 3700 SNPs and annotations for all proxy SNPs were inherited by the tag variant. We observed significant enrichment of a number of annotations among variants in the credible set (Fig. 2, Supplementary Data 2). In particular, enrichment was found for eQTLs in the TCGA data set (P = 1.15 × 10⁻²³); intragenic variants within protein-coding genes (P = 8.15 × 10⁻¹¹; P = 6.03 × 10⁻⁵ for protein altering variants exclusively) but not non-coding transcripts (P = 0.29); promoter (P = 1.66 × 10⁻⁸), enhancer (P = 3.42 × 10⁻⁶) and transcribed (P = 3.07 × 10⁻⁷) ChromHMM states in prostate epithelial cells; DNaseI hypersensitivity sites from all seven ENCODE prostate data sets (P = 1.28 × 10⁻⁷ to 7.61 × 10⁻¹⁷); for AR (P = 2.33 × 10⁻¹⁵ to 2.86 × 10⁻²⁰), ERG (P = 5.33 × 10⁻¹² to 1.00 × 10⁻²⁰), FOXA1 (P = 9.18 × 10⁻¹⁸ to 1.14 × 10⁻¹⁸), GABPA (P = 8.53 × 10⁻¹²), GATA2 (P = 1.24 × 10⁻¹²), HOXB13 (P = 8.25 × 10⁻⁹) and NKX3.1 (P = 9.44 × 10⁻⁵ to 1.43 × 10⁻¹⁵) transcription factor-binding sites from one or more experimental data set; for H3K27Ac (P = 5.34 × 10⁻¹⁹ to 1.39 × 10⁻²¹) and H3K4Me3 (P = 1.30 × 10⁻⁹ to 8.27 × 10⁻¹⁴) histone marks; and conserved elements within the human genome according to all four algorithms (P = 1.89 × 10⁻⁷ to 4.04 × 10⁻¹¹). Of particular interest, in over half of the regions fine-mapped, at least one variant within our credible set intersected a significantly associated eQTL with a colocalisation score >0.9 (overlap between eQTL and GWAS signal) in the TCGA PrCa data set. In all, 40 of the 75 regions contained an eQTL variant among the credible set, with 91 distinct genes represented (Tables 1–3, Supplementary Data 3). In total, 127 of the 343 tags representing the credible set inherited an eQTL annotation (37%), compared with 5711 of the total 37,863 tags within these regions (17.8%). This corresponds to 1027 prostate eQTL variants among the 3700 credible set variants represented by the 343 JAM tags (27.8%), compared with 37,331 eQTLs from the 203,211 total variants within these 75 regions (18.4%).

Intuitively, some degree of correlation between the annotation features we examined would be expected, since regulatory regions of DNA may be indicated through various experimental techniques. Although annotations were jointly modelled in QR, any partial correlation could potentially inflate the extent of enrichment observed during independent Fisher’s tests. To preclude this outcome, we examined the level of correlation between separate annotations. Correlation between replicate data sets representing the same annotation category was usually moderate to high as would be expected, with more modest levels of correlation observed between different markers and information types (Supplementary Fig. 4). The level of correlation increased slightly when individual SNP annotations were collapsed onto tags, as the tag variants can inherit different annotations from separate SNPs. We performed logistic regression of the annotations used in the QR analysis in a single model, to evaluate their informativeness after adjustment for other annotation categories. In this regression, the TCGA eQTL, coding transcript and ERG transcription factor annotations were all highly significant after adjusting for multiple testing, whilst the AR transcription factor annotation was also nominally significant (Supplementary Fig. 5). The remaining annotations were not significant after adjustment for other annotations; however, within the range of information types selected, separate data sets represent broader or greater resolution functional information relative to one another and therefore may partially overlap with other markers whilst remaining instructive individually.

Fine-mapping resolution

At several regions our catalogue of variants highlighted putative biological mechanisms that may be responsible for the differential risk of PrCa development, as well as credible sets sufficiently small to enable subsequent laboratory follow-up. One example is the Chr2q37 region described by rs3771570 in the original publication²⁷. The original lead variant is intronic in FARP2, but multiple genes are located within the region. During fine-mapping, we observed evidence for three independent signals, one more than we previously detected²⁸. These signals are represented by a credible set of 14 variants from 7 tags, demonstrating highly successful refinement of the original signal (Fig. 3a, Tables 1–3, Supplementary Data 1). The majority of these prospective causal variants are centred on the ANO7 gene, approximately 100 kb centromeric of FARP2. ANO7 is expressed predominantly in the prostate (http://www.proteinatlas.org/ENSG00000146205-ANO7/tissue), unlike FARP2, which is ubiquitously expressed across tissue types. Within the credible set 3 tags are selected with particularly high confidence (posterior probabilities 0.72–1); all 3 represent only themselves with no additional proxy variants to consider, and are therefore the most likely causal variants underlying the 3 signals detected. Two of these 3 candidate causal variants (rs77559646 and rs77482050) are non-synonymous SNPs in ANO7 that are uncommon among European ancestry populations, whilst the third (rs62187431) is intronic in ANO7. The 11 remaining variants in the credible set include one more missense SNP within ANO7 (rs76832527), 2 intronic variants in ANO7 (rs111770284 and rs56091437), a synonymous variant in ANO7 (rs2074840) and 7 variants that are all intronic within other genes (FARP2, PPP1R7, HDLBP and SEPT2). Our fine-mapping results therefore strongly implicate the ANO7 gene as a prospective biological effector modulating susceptibility for PrCa.

The region at Chr6q22 described by rs339331 in the original publication²⁹ presents a good example of how variant annotations can assist further prioritisation of the most likely candidate variants even within regions where the credible set remains comparatively large after fine-mapping (Fig. 3b, Tables 1–3, Supplementary Data 1). rs339331 is intronic in RFX6, a member of the regulatory factor X transcription factor family. We observed a single signal during fine-mapping, but due to high LD between variants the credible set comprises 102 variants from 3 tags (the top tag with posterior probability 0.76 tagging 35 proxy SNPs, another with posterior probability 0.15 tagging 40 SNPs and the last with posterior probability 0.08 tagging 27 SNPs). Only 14 of these variants demonstrate any plausible biological evidence however, therefore the credible set can be filtered to prioritise this subset of variants. Four of these are proxies of the tag with the greatest statistical evidence, including the variant that demonstrates the greatest biological evidence for functionality; the original index SNP rs339331, which resides within a DNaseI peak, intersects binding sites for multiple transcription factors, including AR, FOXA1, GATA2, HOXB13 and NKX3.1, and is situated within a conserved element. rs339331 would therefore be ranked highest for follow-up based on combined statistical information and biological annotations, and has been demonstrated to alter HOXB13 transcription factor binding and RFX6 transcription during a previous functional investigation of this region³⁰.

At the TMPRSS2 region on Chr21q22, we detected a single PrCa risk signal with a credible set of 31 SNPs from 8 tags, all of which are situated within the promoter region or first intron of TMPRSS2 (Fig. 3c, Tables 1–3, Supplementary Data 1). In all, 20 of these variants are eQTLs for TMPRSS2 in prostate tissue, whilst 2 variants intersect transcription factor-binding sites in multiple data sets, including for AR, ERG, FOXA1, GABPA, GATA2, HOXB13 and NKX3.1. In this region, the tag selected by JAM with the highest posterior probability is substantially downgraded after QR (ΔPosterior probability_QR −0.18) due to lack of overlap with informative biological annotations, therefore it and its proxies may not in fact represent the most likely candidate causal variants. An early and common event in prostate tumour development involves a translocation that forms a TMPRSS2:ERG fusion, bringing the ERG transcription factor under transcriptional control of the more active TMPRSS2 promoter. Our fine-mapping results and biological annotations therefore allude to the possibility that subtle, heritable differences in TMPRSS2 expression could potentially operate in conjunction with a common somatic alteration to influence development of PrCa. Intriguingly, we also observed significant enrichment for variants intersecting ERG transcription factor-binding sites among our combined credible set of candidate variants across all regions using Fisher’s exact test (Supplementary Data 2, Fig. 2).

Comparison with African Ancestry meta-analysis results

Since LD patterns and allele frequencies of variants frequently differ among ancestral populations, as an additional prioritisation strategy we cross-checked meta-analysis results for variants in our 95% credible set against data from a meta-analysis of 10,202 cases and 10,810 controls with African Ancestry (AA)³¹. A total of 3633 of the 3700 SNPs in our credible set were available in the AA cohort, 1155 (31.8%) of which were nominally significant at P < 0.05 in the AA meta-analysis. In addition, of the 175 variants that reached genome-wide significance within the five regions in which JAM did not resolve candidate variants, 111 were nominally significant in the AA data. We would hypothesise that variants demonstrating no evidence of association in the AA data set would generally represent less likely candidate causal variants than any nominally significant variants within their region specific credible set and should be assigned lower priority when considering variants for functional confirmation studies. This extra prioritisation step does not enable us to formally exclude any variants from our credible set however, as the AA analysis may be underpowered to detect association with PrCa at specific SNPs, and additional variants within the regions fine-mapped in Europeans but not included in our credible set were not examined for association in AA data.

Estimating the GWAS loci contribution to FRR of PrCa

The proportion of FRR of PrCa explained by these risk loci before and after fine-mapping were calculated using conditional effect estimates and standard errors derived from the OncoArray sample sub-cohort. The post fine-mapping calculation was performed separately for the full set of 99 signals identified and a restricted subset of 84 variants (matching the number of original associations), in order to investigate the relative importance between replacement of GWAS tag SNPs and addition of extra novel signals. Single lead variants representing the independent signals were selected for this calculation. In regions containing a single signal, the JAM tag in the credible set with the highest Bayes factor was designated as the new lead variant, or for the five regions in which JAM did not resolve candidates the most strongly associated SNP in the meta-GWAS was taken instead. Within regions containing multiple independent hits, signals were represented by the combination of tags given the greatest posterior support by JAM. Our FRR calculations use conditional risk estimates incorporating uncertainty for each variant, plus a correction for potential bias due to risk estimation in the same sample as discovery and uncertainty in the specification of the FRR. This novel but more conservative method of risk calculation estimated that: (1) inclusion of only single ‘best’ replacement variants for each tag SNP contributes 26.5% (95% credible interval, CI, 22.7–31.5) of the known FRR of PrCa compared to 23.2% (95% CI 19.4–27.9) for the 84 previously known GWAS tag SNPs; and (2) inclusion of lead SNPs representing all of the 99 independent signals contributes 30.3% (95% CI 26.0–35.9) (Supplementary Data 4). This substantial enhancement demonstrates that the variant catalogue identified through fine-mapping explains a greater proportion of the FRR of PrCa compared to the original GWAS index SNPs, with replacement of the 84 original GWAS tag SNPs conferring a similar magnitude of increase as addition of the 15 novel independent signals we identified. We additionally calculated the contribution to FRR of PrCa for each region individually, to highlight regions that make the greatest contributions towards PrCa susceptibility (Tables 1–3). Whilst the majority of the fine-mapped GWAS loci individually contribute a small proportion towards the FRR, six regions confer in excess of 1% each. These include the moderate penetrance HOXB13 rs138213197 variant, which demonstrated the greatest contribution at 6.87%, and the multi-signal TERT locus, which explained the next highest level at 2.57%. Each of the remaining regions of higher FRR contribution contained multiple independent signals, with the exception of the single-signal MSMB locus. The magnitude of increase in proportion of FRR explained by each locus after fine-mapping was also generally greater for regions where additional independent signals were identified; for example, the ANO7 region increased 6.5 fold (from 0.1% for the original GWAS tag SNP to 0.65% after fine-mapping) and the KLK region 1.9 fold (from 0.45 to 0.86%), partly due the identification of 2 novel signals within each.

Discussion

Prior to the recent OncoArray study, approximately 100 PrCa susceptibility loci identified through GWAS had been reported. Limited information was however known about the precise identity of the causal variants and functional mechanisms behind these loci despite several having been fine-mapped individually or collectively using logistic regression^{28,32,33,34,35}. Here we present the largest genetic fine-mapping study for PrCa to date based on a meta-analysis of 82,591 cases and 61,213 controls of European ancestry, and employ a state-of-the-art multivariate Bayesian variable selection technique to prioritise candidate variants. We further refined results by incorporating functional annotation information using a novel QR approach, to assist prioritisation of candidate causal variants for downstream functional validation.

Since the meta-analysis comprised marginal summary effect estimates, we applied JAM, a joint Bayesian fine-mapping algorithm that accounts for LD in a multivariate analysis of univariate summary statistics, to identify credible candidate PrCa susceptibility variants. A stochastic variable selection approach provided posterior probabilities of association for each variant and combinations of variants within each region, as determined by a set of best models. This framework is preferred over alternative approaches, such as forward stepwise selection, which tend to underrepresent the uncertainty in the analysis and yield false levels of confidence for the final set of SNPs and number of signals represented by the single ‘best’ model. JAM also has advantages over similar Bayesian variable selection algorithms as it incorporates an extremely computationally efficient formal reversible jump Markov Chain Monte Carlo (MCMC) stochastic model search, which allows application to very large regions and does not require a prior assumption on the maximum number of causal SNPs within each region, making it more applicable to regions with larger or unknown numbers of causal variants. Linear model-based summary data methods such as JAM represent the current state of the art and have demonstrated good performance when applied to transformed logistic ORs from binary traits as opposed to linear effects for continuous traits^36,37. The effectiveness of logistic/linear mapping will however vary between different genomic architectures and is dependent on factors including the number of variants and correlation structure between them within each region. In general however, the approximation should work well provided no individual variants exert large effects, as expected for GWAS loci. For 5 of the 80 regions that had replicated at genome-wide significance, JAM was unable to fit a model to the summary data and consequently we could not resolve candidate variants beyond the catalogue of genome-wide significant variants within these regions. Four of these regions were not densely genotyped on the OncoArray genotyping chip, as their discovery in a multi-ethnic meta-analysis occurred only late during chip design. In addition, the top hit within these 5 regions ranked towards the weaker end of the P-value and effect size distributions in the univariate meta-analysis prior to fine-mapping. The inability of JAM to resolve candidate causal variants within these regions therefore most likely results from mismatch between the reference correlation structure and meta-GWAS effect patterns, issues with the logistic/linear mapping in the presence of complex correlation structure, or possibly simply low signal to noise ratio within the data.

Use of multivariate models prioritised a 95% credible set of 3700 candidate variants from the 203,211 variants analysed within the 75 regions in which candidate variants were resolved; thereby markedly reducing the number of variants for further consideration. In addition, previous reports of multiple independent signals at several PrCa risk loci were confirmed, with evidence for multiple signals at 12 regions; of which 7 regions contained 2 signals, 4 demonstrated evidence for 3 signals and 5 signals were observed at the Chr5p15 TERT gene locus, which is known to contain susceptibility variants for many cancer types³⁸. We observed no consistent pattern of LD relationship between the original GWAS tag SNPs and the independent signals identified through fine-mapping in the regions containing multiple independent signals (Supplementary Fig. 6). For example, at the ANO7 locus, the original index SNP (rs3771570) is not selected in the credible set and correlated with only 1 of the 3 independent signals detected (rs62187431, r² = 0.61). In contrast, at the TERT region, the original index SNP (rs2242652) is in moderate or modest LD (r² 0.08–0.43) with 4 of the variants selected by JAM as representative of the 5 independent signals. Previous smaller fine-mapping studies using stepwise selection approaches had also identified evidence for independent association signals within several regions. However, these are potentially more sensitive towards subjective measures such as the P-value threshold chosen for secondary signal inclusion and LD level used to define the final list of candidate variants represented by the selected marker(s). Due to our substantially larger sample size and variant density available and the well-established superiority of Bayesian search procedures over stepwise selection in high-dimensional settings, we therefore consider this the most detailed fine-mapping study to date for variant prioritisation. Comparing our results to the previous iCOGS fine-mapping study²⁸, in which refinement of 64 GWAS loci was attempted in a smaller European ancestry cohort of 25,723 PrCa cases and 26,274 controls, 48 regions corresponding to 52 original index SNPs replicated at genome-wide significance in both studies, of which only 21 regions had been densely genotyped on the iCOGS chip (Supplementary Data 5). Within these comparable regions, 70% of the ‘best candidate SNPs’ established using the iCOGS sample set were also included in the credible set we have identified in this study. This indicates broad stability of the results from fine-mapping studies conducted in the same ancestral population. The additional power and more dense genotyping across all regions in this study has however facilitated further refinement of potential candidate variants, identification of additional candidate variants within several regions and refinement for the first time of a number of regions in which fine-mapping had not previously been performed or had been unsuccessful. We have confirmed the existence of multiple independent risk signals at 10 loci previously reported, including identifying extra signals at the TERT (Chr5p15), ANO7 (Chr2q37) and SLC22A3 (Chr6q25) loci, and identified multiple independent association signals for the first time at two further loci, including KLK3 (Chr19q13). Eight regions demonstrating evidence for multiple independent signals in the iCOGS fine-mapping study were however not corroborated in this larger study. Notably, the conditional P-values for these secondary signals in the iCOGS fine-mapping study were below genome-wide significance in all but one of these regions. This may suggest that contrary to general assumptions that a lower burden of evidence is valid for uncorrelated variants in loci for which a genome-wide significant association has previously been observed, instead equally stringent significance thresholds should be applied for both secondary signals and initial primary signals. It is also notable that in this well-powered study, the vast majority of regions containing multiple independent signals were first reported as associated with PrCa in early GWAS using relatively modest sample sizes. This may indicate that regions with lower effect sizes and weaker evidence for association, which require larger sample sizes for their detection, are less likely to contain additional independent risk variants. Alternatively however, it could reflect lower power for the detection of additional independently associated variants within the regions that contain weaker signals, despite the large sample cohort utilised in this study.

As would be expected, refinement of putative causal risk variants varied between regions, with credible sets ranging from a single variant or handful of variants to >100 variants for a small number of regions. The regions retaining large credible set sizes appear to result primarily from large numbers of variants in high LD with the actual causal variant as opposed to low power within the region however, rendering further refinement of these signals to facilitate functional validation studies more complicated. One approach to further prioritise candidate variants could be to leverage the different LD patterns among different ancestral populations, provided that the underlying casual variants are shared and present at sufficient frequency between populations. Cross-referencing the 3700 variants within our 95% credible set with data for an African American PrCa meta-analysis from the African Ancestry Prostate Cancer GWAS Consortium highlighted a subset of 1155 variants with nominal or genome-wide significant evidence for association in this additional population. An alternative prioritisation approach is to consider pre-existing biological information, as we have described for the RFX6 (Chr6q22) region. We annotated variants against a number of publically available data sets, observing enrichment of several plausible markers of biological function active in prostate cell lines within our credible set, including intersection with prospective promoter and enhancer elements, DNaseI hypersensitivity sites, histone modification or transcription factor-binding peaks, and variants residing within protein-coding transcripts and conserved regions of the genome. Of particular interest, more than a quarter of the variants within our credible set were also eQTLs within the TCGA prostate adenocarcinoma data set. Given their statistical selection independent of this annotation and demonstrated effects upon gene expression, these eQTL variants should be considered high priority when selecting candidate causal variants for functional confirmation, alongside variants that modify the coding sequence of genes, or appear to reside within reliably annotated promoters or enhancers. Another important discovery of this study is that an appreciable number of highly ranked variants within the credible set are non-synonymous SNPs. This provides evidence that subtle alterations to structure and activity of specific proteins may give rise to the functional mechanisms behind a proportion of GWAS associations.

Some alternative fine-mapping algorithms integrate functional annotations during the statistical analysis when considering evidence for causality for each variant^39,40,41,42. These methods can prove useful for enhancing variant prioritisation, provided that the annotation information is reliably indicative of causal variants. We preferred to perform statistical analysis separately from annotation and compare statistical and functional evidence for causality afterwards using conditional QR. We believe this more clearly allows the most informative annotations, and the variants that are characterised by those annotations to be highlighted within the data set, whilst also reducing the potential for penalisation of strong candidate variants due to localised artefacts or cell line-specific effects within the whole-genome biological data sets used for annotation. Our conditional QR analysis resulted in adjustment of posterior probability for a small proportion of variants and may further assist prioritisation of the most likely functional variants among the credible set selected for each region.

Fine-mapping studies are important to reveal information on the biological mechanisms underlying disease predisposition by pinpointing potential candidate genes, signalling pathways and networks that account for differences in disease risk between individuals. In addition, these studies may help to refine the contribution of GWAS loci to PrCa risk by incorporating more likely candidate variants. This study evaluated almost all previously reported PrCa GWAS regions, apart from the highly complex Chr8q24 and major histocompatibility complex (MHC) regions and associations that did not replicate in the largest European meta-analysis to date. We then subsequently re-evaluated the contribution to FRR of these known PrCa risk loci using an enhanced method in which the overall FRR of PrCa was revised upwards from 2.0 to 2.5 to reflect the most recent estimates and we also accounted for uncertainty of various estimates that can introduce bias in these calculations. Our approach therefore provides more conservative estimates than in previous publications. We demonstrated a substantial increase in the proportion of FRR explained through fine-mapping these GWAS regions (from 23.2 to 30.3%), with detailed investigation showing that a similar proportion of this enhancement was conferred by replacement of the original tag SNPs and discovery of secondary signals. It is also noteworthy that the 7.1% magnitude of increase in FRR explained after fine-mapping known loci is substantially greater than the 4.4% increase achieved through identification of 62 novel PrCa loci⁸. This highlights the invaluable importance of fine-mapping studies for risk prediction and their potential utility in helping to inform clinical screening studies.

Fine-mapping of GWAS loci requires comprehensive examination of variation within the region. Logistical constraints generally preclude resequencing of disease-associated loci to achieve complete variant coverage in large sample cohorts and instead mandate the use of genotype array data followed by imputation, in order to achieve sufficient sample sizes. To ensure the accuracy of downstream fine-mapping analyses, stringent variant QC must be applied to imputed data, to exclude low-quality variants that may be indicative of imputation artefacts. In this study, initial pre-imputation QC of the meta-analysis data set was first performed to exclude potential genotyping errors, followed by post-imputation QC in which variants with low MAF or imputation information score, or divergent MAF consistency between dosage and ‘best guess’-derived MAF estimates were excluded. The MAF estimate consistency check was performed to highlight additional variants for which reliability of imputation may be reduced and evaluation of variants excluded in this step revealed that the majority were situated within segments of the genome flagged as repetitive or otherwise ambiguous. Whilst we cannot guarantee that no causal variants at GWAS loci would be located within repetitive elements, we believe that the high proportion of variants filtered during QC that are located within potentially difficult to impute segments indicates an appropriate balance between controlling against both type I and II errors during the subsequent fine-mapping analyses. The inability to directly interrogate this category of variants during this study could however reflect a potential limitation.

The multivariate fine-mapping strategy we employed enabled identification of small numbers of prospective causal variants amenable to functional follow-up at many known PrCa susceptibility regions. Within this credible set of variants, we found evidence of enrichment for a number of biologically plausible mechanisms through which PrCa risk could potentially be modulated. We observed multiple independent PrCa associations at 15% of the loci fine-mapped, and several candidate genes were indicated for consideration through functional annotation. As rare variants with MAF < 0.005 were not included in our analyses, we cannot exclude a contribution of rare casual variants exerting a greater effect size giving rise to synthetic associations at any GWAS loci, although our findings indicate that these are unlikely to be widespread. Importantly, replacement of the original GWAS tag SNPs with more likely candidate variants and identification of additional independent signals resulted in a substantial increase in the proportion of the FRR of PrCa explained by these loci. This finding accounts for a portion of the ‘missing heritability’ of PrCa and has important implications for clinical risk profiling and management of patients.

Methods

Identification of PrCa risk loci to fine-map

We identified 101 independent PrCa GWAS risk associations within the literature that had been reported at genome-wide significance prior to the start of this study, the majority of which had previously been replicated within a European ancestry population^3,12. Six of these lead variants were located within the Chr8q24 region that is associated with multiple cancer types in a highly complex manner, and three within the MHC Chr6p21 region. Due to the large numbers of variants, high levels of correlation and greater complexity within these regions, they are the subject of separate fine-mapping and risk stratification studies and were excluded from consideration in this analysis; the remaining 92 previously reported GWAS SNPs were selected for fine-mapping in this study. For 5 of these originally reported GWAS SNPs, no variant within ±500 kb replicated at genome-wide significance in our larger European meta-analysis and these loci were subsequently excluded from downstream Bayesian analyses and FRR calculations. An additional 2 GWAS SNPs originally reported in non-European ancestral populations and 1 reported in a previous meta-analysis did not replicate, but were situated <500 kb from an independent, replicated European risk association and were therefore still considered within the region boundaries of signals that were fine-mapped.

Selection of SNPs for fine-mapping on the OncoArray

A total of 78 PrCa risk associations that had been reported prior to the design of the OncoArray genotyping platform⁴³ were densely genotyped within the OncoArray sample cohort. Region boundaries for dense genotyping were defined as the greater of ±500 kb from the index SNPs or the maximum distance of any variant with r² > 0.3 to the index SNP in 1KG (phase 1 version 3, March 2012 release). All SNPs within these regions with MAF > 0.01 in any ancestral population were extracted and then we obtained Illumina Design Scores for all variants from the 1000 Genomes Project (phase I version 3, March 2012 release). From designable variants with a Design Score ≥ 0.8, we used Snagger⁴⁴ to select (a) all variants correlated with the known hits at r² > 0.6 and P < 0.05 in the iCOGS study, (b) all variants from lists of potentially functional variants, defined through ENCODE and RegulomeDB and (c) a set of SNPs to tag all remaining variants at r² > 0.9. The 23 risk loci reported in a recent multi-ethnic meta-analysis study¹² were not densely genotyped as these loci were reported after the OncoArray design; however, these regions were also fine-mapped in this study.

Meta-analysis and imputation

Genotype data for a combined 82,591 PrCa cases and 61,213 controls of European ancestry from eight GWAS (OncoArray, iCOGS, UK stage 1 and 2, CaPS 1 and 2, BPC3 and NCI PEGASUS) were used for the meta-analysis⁸. Per-allele ORs and standard errors were generated for the OncoArray and each GWAS, adjusting for principal components (PCs) and study relevant covariates using logistic regression. The OncoArray and iCOGS analyses were additionally stratified by country and study, respectively. We used the first seven PCs for OncoArray and first eight PCs for iCOGS samples, as additional components did not further reduce inflation in the test statistics. OR estimates were derived using either SNPTEST (https://mathgen.stats.ox.ac.uk/genetics_software/snptest/snptest.html) or an in-house software C++ programme. OR estimates and standard errors were combined by a fixed effects inverse variance meta-analysis using METAL⁴⁵. All statistical tests conducted were two-sided.

IMPUTE2 was used to impute non-genotyped SNPs within a boundary flank of ±500 kb or the maximum distance of any variant with r² > 0.3 to the index SNP in 1KG phase 1 from the originally reported GWAS index SNP in the meta-analysis cohort. For the OncoArray data, un-phased imputation was carried out for all the fine-mapping regions. Where the boundaries of adjacent associations to fine-map overlapped, these were merged for imputation; therefore, imputation was performed as 82 discrete chunks. Within 3 of these chunks the separate signals to analyse were sufficiently dispersed to enable clear demarcation of the individual signals and retention of an appropriate flank distance; these 3 imputation chunks were therefore split prior to statistical analysis and the 92 original index SNPs analysed were fine-mapped as 85 separate regions.

We conducted a two-stage post-imputation QC process. During basic QC, imputed genotype data were filtered to retain variants with INFO ≥ 0.4 and MAF ≥ 0.005. We subsequently instituted an additional QC measure to remove imputed variants with greater genotype uncertainty in which separate MAFs were calculated based on ‘dosage’ and ‘best guess’ genotypes. Large deviations between these MAF estimates for a variant would indicate unreliable imputation performance; variants for which these differed by ≥10% were excluded from analysis. An additional benefit of this methodology is that inherently applies progressively greater stringency of QC filtering the rarer a variant is within the study population. During the post-imputation QC process, 288,033 rare variants were excluded, whilst a further 146,088 variants were removed due to low INFO score or divergent MAF consistency. This resulted in a final post-QC set for analysis of 213,728 SNPs within the 80 fine-map regions that had replicated in the initial meta-GWAS, with a minimum variant INFO score within the final data set of 0.63, and the vast majority of variants having INFO > 0.9 (Supplementary Fig. 7).

As an additional safeguard, we investigated the proportion of common variants (MAF ≥ 0.05) in 1000 Genomes European samples that were retained or excluded during our QC procedure. In total, 186,907 of 227,793 common 1000 Genomes European variants (82.1%) were included in our final post-QC data set for analysis. The vast majority of common variants excluded during QC, 37,830, were removed in the MAF consistency check step. In all, 27,070 (71.5%) of these were situated within segments of the genome flagged as repetitive or otherwise ambiguous (either masked as low complexity by RepeatMarker, or excluded by the 1000 Genomes phase 3 Strict Mask), whilst a further 4460 (11.8%) had intermediate INFO score values (0.4–0.8).

Multivariate fine-mapping towards putative causal variants

JAM¹⁶ is a novel Bayesian algorithm that searches multi-SNP models in summary data by imputing the correlation structure according to a reference panel. JAM provides inference on the number of independent signals, as well as the set of potential SNPs driving those signals. Under a standard multivariate linear regression, the vector of trait values y are regressed on a matrix of genotypes, X, under the following model

$${\mathbf{y}}\sim N\left( {{\mathbf{X} \mathbf{\beta }},{\mathbf{I}}_N\sigma ^2} \right)$$

(1)

where σ² represents the residual variance, β represents a vector of effects, which are all adjusted for one another, and I_N is the N × N identity matrix. Multiplying the standard model above through by the transpose of the genotype matrix, JAM makes inference under the resulting multivariate normal (MVN) model:

$${\mathbf{X}\prime \mathbf{y}}\sim N\left( {{\mathbf{X}\prime \mathbf{X}\mathbf{\beta }}},{\mathbf{X}\prime \mathbf{X}\sigma ^2} \right)$$

(2)

The motivation for using the model in (2) rather than (1) is that individual-level data are no longer required; X′y can be derived from one-at-a-time univariate effect estimates of each variant^46,47 and X′X from an estimate of the genetic correlation matrix. Note that in the case of the PrCa summary statistics, we derive X′y after first mapping the univariate log ORs to approximate linear effects via their z-scores, a strategy adopted for binary traits in other linear model-based summary statistic frameworks^37,48. Consequently, the model residuals have the same interpretation as in a linear regression of a binary outcome; they cannot exceed 1 and, under the null model, their variance σ² equals the trait variance, p(1 − p) where p is the proportion of cases. Since each region is unlikely to explain much heritability individually, we specify an inverse gamma (Γ⁻¹) prior that loosely targets the PrCa variance in the meta-GWAS:

$$\sigma ^2\sim {\it{\Gamma }}^{ - 1}(2,0.24)$$

This corresponds to a prior expectation for σ² equal to the PrCa variance in the meta-GWAS, 0.24, and 95% weight over the range (0.05, 0.69). The JAM model is completed by specifying a so-called ‘g-prior’ over the genetic effects, β:

$${\mathbf{\eta }}\sim \mathrm{MVN}\left( {0,\tau {\mathbf{\eta }}\left( {{\mathbf{X}\prime \mathbf{X}}} \right)^{ - 1}} \right)$$

The conjugate g-prior supports effects inversely proportional to the corresponding genetic co-variances and variances, as estimated from the reference matrix X′X, and has been shown to help when modelling highly correlated predictors⁴⁹. There is a substantial literature on choices for the hyper-parameter, τ; we follow recommendations to set a value equal to the maximum of N and P², where P is the number of variants in the region^50,51.

Crucially, both (1) and (2) are parameterised by the same vector of multivariate (i.e., correlation adjusted) effect estimates, β; JAM is therefore able to approximate inference from a multivariate analysis of individual-level data. Optimal performance is achieved when the correlation structure X′X is taken from the original GWAS population, rather than an external reference population. We applied JAM to summary statistics from the meta-analysis data set using LD estimated according to imputed individual-level data from the OncoArray sub-cohort of 53,449 cases and 36,225 controls in which these regions had been densely genotyped.

Similar to other Bayesian stochastic variable selection approaches, JAM models a latent vector of binary indicators, γ, for whether each variant should be included (γ_v = 1 if variant v is associated and included in the model, or 0 otherwise). Any specific configuration of indicators then specifies a specific model, M. Using a Bayesian stochastic search, specifically a Reversible Jump MCMC (RJMCMC) algorithm^16,52, JAM searches over different possible models. By specifying a prior on the probability of including any combination of variants, we induce a prior over the ‘model space’, γ. More formally, JAM’s prior over γ induces sparsity and accounts for the multiple testing burden through use of a ‘beta-binomial’ prior on the number of associated variants or variants included in any given model, which consists of a Beta distribution over the proportion of associated variants in a particular region, conditional on which prior probabilities for each possible number of associated variants follow a binomial distribution. All configurations or combinations, including the same number of variants are given identical prior probabilities. For each region we used a beta-binomial (1, P_r) prior, where P_r is the total number of variants in a region r. This places a constant prior probability for any effect in each region (i.e., one or more causal variants) of 0.5, which is split up over all possible models according to the beta-binomial distribution. Since these are previously discovered regions, this is far more generous than our prior belief would be that a random region of the genome is associated with PrCa but is more conservative for the regions in this analysis, where we estimate the false discovery rate is <10%. The marginal prior odds of any particular SNP being selected is 1/P_r, and decreases with the total number of variants in the region, providing an intrinsic multiplicity correction as a function of region size^53,54,55,56. The prior probabilities for ≥2, or ≥3 associated variants and so on are weakly effected by P_r, however, for all regions in this analysis they are equal to the second decimal place at 0.25 and 0.12, respectively (Supplementary Table 2). More detail on the JAM model and RJMCMC algorithm can be found in the original paper¹⁶. For this analysis, each region was analysed independently, and by running two independent JAM seeds for 10 million iterations each. The JAM output provides posterior probabilities for each variant, Pr(γ_v = 1|data), and for each combination of variants, Pr(M = 1|data). To determine statistical significance for individual variants, combinations of variants and for the possible number of independent signals we use Bayes factors⁵⁷, the ratio of the posterior odds to the prior odds. Specifically, we used the inference of the minimum number of independent signals in the model at a regional Bayes factor threshold of 3 to define the evidence for multiple signals.

Before running JAM, Priority Pruner v0.1.3 (http://prioritypruner.sourceforge.net) was used to LD prune the imputed meta-analysis variant set at a threshold of r² = 0.9 for the 80 regions replicated at genome-wide significance. Pruning was performed agnostic of additional prioritisation criteria (association or annotation data) to ensure unbiased Bayesian model selection. Additional pruning at lower LD levels was performed upon any regions in which the overall Bayes factor for association with PrCa fell below 1. A regional Bayes factor below 1 directly conflicts with our knowledge that these regions are robustly associated with PrCa, and was taken as an indication of collinearity; numerical instability that can occur when fitting multivariate models to highly correlated variables. Where required, the pruning threshold was lowered in r² = 0.05 increments, to a cut-off level of r² = 0.6. The pruned data set used in the final Bayesian analyses comprised a total of 38,745 selected tags.

The pruning thresholds used in the final results are listed in Tables 1–3. For each independent JAM analysis, the top models or combinations of included SNPs within each region as determined by the posterior probabilities of the models, Pr(M = 1|data) that summed to a cumulative posterior probability of 0.95, were used to define a run specific 95% credible set. To filter out any low confidence variants, final 95% credible sets for each region were defined according to the intersection between two independent runs of JAM, with any variants with variant-specific BF < 1 additionally removed from the amalgamated variant list due to having greater standalone evidence against association. Overall, 3761 of the 4142 unique SNPs selected by either JAM run were retained in the combined top models from both runs (90.8%), with a further 61 variants with BF < 1 removed to achieve the final 95% credible set.

Annotation of variants for functional features

Variants were annotated for a number of putative indicators of biological functionality or importance, using a range of publically available data sources. These annotations focussed on either the likely consequence or relevance of the variant resulting from its primary genomic context, or the proximity to annotated regulatory features within cell lines derived from normal prostate or PrCa tissues.

Gene-based annotation of variants was performed using wANNOVAR in relation to GENCODEv19 transcripts⁵⁸. Variants residing within miRNA transcripts were subsequently added in relation to miRBase release 20 (ftp://mirbase.org/pub/mirbase/20/genomes/hsa.gff3)⁵⁹. Annotation of variants that reside within genomic elements demonstrating evidence for evolutionary constraint was performed against conserved element peak outputs from comparative genomics analyses by four algorithms; GERP++ (http://mendel.stanford.edu/SidowLab/downloads/gerp/)²⁰, SiPhy_Omega, SiPhy_Pi (https://www.broadinstitute.org/scientific-community/science/projects/mammals-models/29-mammals-project-supplementary-info)²¹ and PhastCons (ftp://hgdownload.cse.ucsc.edu/goldenPath/hg19/phastCons100way/)²². Variants were also scored for likelihood of prospective pathogenicity using CADDv1.3 (http://cadd.gs.washington.edu/score)⁶⁰.

For annotation against prospective regulatory elements within the genome, which frequently operate in a tissue-specific context, these data sets were primarily retrieved from experiments using prostate-derived cell lines. We annotated variants that intersected DNaseI peaks data in seven individual ENCODE prostate data sets from three cell lines (LNCaP, PrEC and RWPE1; GSM816637, GSM816634, GSM1008595, GSM736565, GSM736603, GSM4742 and GSM4743)^61,62. Peak data from ChIP-seq experiments for transcription factor-binding sites and histone modifications in the LNCaP, PC3, PrEC and VCaP cell lines and human prostate tumour tissue was downloaded from the Cistrome Data Browser (http://cistrome.org/db/); a resource that accumulates publically available ChIP-seq data sets and re-analyses their raw data through a standardised pipeline and QC procedure²⁴. Downloaded CistromeDB data were converted from GRCh38 to GRCh37/hg19 reference assembly co-ordinates for compatibility with our variant data set using the UCSC Genome Browser LiftOver tool (https://genome.ucsc.edu/cgi-bin/hgLiftOver). Transcription factor-binding site data were obtained for the Androgen Receptor (GSM1236922, GSM1328945 and GSM1576447), CTCF (GSM1006874 and GSM1383877), ERG (GSM1193657 and GSM1328978), FOXA1 (GSM1068136, GSM1274873 and GSM1716762), GABPA (GSM1193660), GATA2 (GSM1600544), HOXB13 (GSM1716763 and GSM1716764) and NKX3.1 (GSM699633 and GSM989640). Histone modification data were obtained for H3K27Ac (GSM1249447 and GSM1249448), H3K27me3 (GSM1383866 and GSM1383872) and H3K4me3 (GSM1383874 and GSM945240). Finally, to facilitate deeper categorisation of the genomic context of variants within prospective regulatory features, they were annotated with their chromatin state categorisations by ChromHMM from two prostate cell lines (PrEC and PC3; GSE57498), alongside three ENCODE tier 1&2 cell lines (GM12878, H1HESC and HUVEC) to enable comparison of tissue specificity for prospective regulatory elements^23,63,64.

eQTL analysis

Genotype and gene expression data for 494 samples with PrCa were downloaded from TCGA (https://gdc-portal.nci.nih.gov). For the genotype data set, QC was performed according to the protocol suggested by Anderson et al.⁶⁵, removing samples with heterozygosity >2 standard deviations from the mean, individuals with low genotype call rate (<95%), non-male samples and related or duplicated samples (individuals with identity-by-descent >0.185). Variants with call rate <95% were also excluded from analysis. PC analysis was performed to induce the ancestry of the TCGA samples, using the 494 TCGA samples plus 2504 samples from the 1000 Genomes Project phase 3, with non-European or Finnish samples removed from the analysis. In total, 108 samples and 106 SNPs were removed after performing QC on genotype data. For the expression data set, we observed that samples from two plates (A31K and A30D) exhibited values substantially higher than samples on the remainder of plates, therefore samples on these plates were also excluded (27 additional samples). Out of the 494 samples, 359 therefore passed QC. Genotypes for samples passing QC were subsequently imputed to the 1000 Genomes Project phase 3 reference panel within the region boundaries applied to the fine-mapping data set using IMPUTE2. In all, 227,773 variants within the fine-mapping data set passed QC thresholds in the TCGA imputed data and therefore were available for eQTL analysis. Genes with mean expression across samples of ≤6 counts or with expression variance = 0 were also excluded (4123 and 370 genes removed, respectively). Finally, expression values were quantile-normalised by samples and rank-transformed by genes. In total, 16,038 genes passed QC out of the initial 20,531.

For the eQTL analysis, 35 PEER factors⁶⁶ for the top 10,000 expressed genes were used as covariates, plus 3 genotyping PCs. eQTL analysis was performed for each region individually using FastQTL⁶⁷ with 1000 permutations and a window of 1 megabase from the transcription start site of each gene. Colocalisation tests between the eQTLs and GWAS SNPs were then performed following the approach suggested by Nica et al.⁶⁸. First, for each significant eQTL, we added the imputed SNP to the linear regression to assess if the inclusion better explains the change in expression of the gene.

$${\mathrm{Expression}}\sim {\mathrm{genotype}}\left( {{\mathrm{eQTL}}} \right) + {\mathrm{cov}} + {\mathrm{genotype}}\left( {{\mathrm{imp}}{\mathrm{.}}{\kern 1pt} {\mathrm{SNP}}} \right)$$

We retrieved the P-value of this new linear regression, assigning P-value of 1 if the eQTL and imputed SNP are the same variant. Second, we ranked the P-values in descending order for each eQTL. Finally, we calculated the colocalisation score for each pair of eQTL and imputed SNPs as:

$${\mathrm{Colocalisation}}{\kern 4pt} {\mathrm{score}} = \left( {N - {\mathrm{rank}}} \right){\mathrm{/}}N$$

where N is the total number of imputed SNPs in that region and rank is the rank of the imputed SNP we are including. In general, if an eQTL and an imputed SNP represent the same signal, this will be reflected by the imputed SNP having a high P-value, a low rank and consequently a high colocalisation score.

Quantile regression

Conditional QR across variant annotations was performed for the 75 regions successfully fine-mapped using JAM, with the 5 regions in which JAM was unable to resolve candidate variants excluded from this analysis. To minimise correlation between annotations, single data sets for each transcription factor, histone mark, DNaseI, conserved element and chromatin state by ChromHMM category were selected for investigation with conditional QR. Specifically, the GSM736603 DNaseI, GSM1328945 AR, GSM1383877 CTCF, GSM1193657 ERG, GSM1068136 FOXA1, GSM1193660 GABPA, GSM1600544 GATA2, GSM1716763 HOXB13, GSM989640 NKX3.1, GSM1249447 H3K27Ac, GSM1383872 H3K27me3, GSM945240 H3K4me3 and GERP++ conserved element annotation fields were selected, as these were observed to be most informative for variants within the 95% credible set, whilst the PrEC cell line ChromHMM annotation was selected over the PC3 data set due to its origin from normal prostate rather than cancerous tissue. Similarly, CADD RawScore was selected, with CADD PHRED score excluded prior to the analysis. Information on whether variants were an eQTL in the TCGA data set was included. Finally, new categories were computed to ascertain whether a variant was situated within a protein-coding transcript (intronic, exonic or untranslated region), within a non-coding transcript, and whether the variant altered protein structure (non-synonymous, non-sense, frameshift or non-frameshift insertion/deletion coding variants).

All annotations were converted to binary format for the QR analysis, with the exception of CADD RawScore, which was retained as a continuous variable. Separate variables were created for each possible ChromHMM state during conversion from categorical to binary format. QR analysis was performed upon the priority pruner tag variants that were analysed by JAM, using the statistical results from those analyses. Annotations for all proxy SNPs represented by the tag variant were therefore subsequently inherited by the priority pruner tag. For the binary annotation categories, this meant that if one or more proxies had received a given annotation then the tag would also receive that annotation, whilst for the continuous CADD RawScore, the tag inherited the highest value from all associated proxies.

For a specified quantile, τ, we first fit a conditional QR model to the estimated posterior probabilities from the JAM analysis for each variant. Second, we use the fitted model to calculate an expected posterior probability for each SNP given the annotation profile for that SNP. Since there is uncertainty in the choice of τ, we analyse the data across a range of τ = (99.2, 99.4, 99.6, 99.8 and 99.95%) and calculate a weighted average of these expected posterior probabilities to yield a final estimate. Specifically, the posterior probability from JAM, P, is modelled with a conditional QR with annotation, Z. The model is defined by an asymmetric laplace distribution (ALD):

$$\begin{array}{c}{P}\sim N\left( {v,\sigma ^2} \right)\\ v \sim \mathop {\prod }\limits_i {\kern 1pt} {\mathrm{ALD}}\left( {{\bf{Z}}\theta ,\lambda _i,\tau _i} \right)\end{array}$$

Notice v is affected by both P and Zθ, and it suggests a weighted average of P and fitted regression quantiles can approximate v. The density function of ALD distribution is

$$f\left( {v|{\bf{Z}}\theta ,\lambda ,\tau } \right) = \frac{{\tau \left( {1 - \tau } \right)}}{\lambda }{\mathrm{exp}}\left( { - \rho _\tau \left( {\frac{{v - {\mathbf{Z}}{{\theta }}}}{\lambda }} \right)} \right)$$

For λ, the maximum is achieved at

$$\lambda ^ \ast = \frac{{\mathop {\sum }\nolimits_j {\kern 1pt} \rho _\tau \left( {v - {\bf{Z}}{{\theta }}} \right)}}{N}$$

We fix λ to be

$$\hat \lambda = \frac{{\mathop {\sum }\nolimits_j {\kern 1pt} \rho _\tau \left( {y - {\bf{Z}}\widehat {{\theta }}} \right)}}{N}$$

where $\widehat {{\theta }}$ is coefficient estimates from classical conditional QR. With λ fixed, only the exponential parts of the Gaussian distribution and ALD involve v, to which we assign weight to P and ${\bf{Z}}\widehat {\theta }$. Specifically classical QR yields a prediction for each SNP, i as $\widehat {P_i} = {\bf{Z}}\hat \theta _i$ at $\tau _i$; the larger the penalty $\rho _{\tau _i}\left( {P_i - \widehat {P_i}} \right)$ and $\left( {P_i - \widehat {P_i}} \right)^2$, the less influence $\widehat {P_i}$ should have on P_i. We normalise the weight for P_i to be 1. For $\widehat {P_i}$ we assign weight

$$w_i = {\mathrm{exp}}\left( {\left( { - \frac{{\rho _{\tau _i}\left( {P_i - \widehat {P_i}} \right)}}{{\hat \lambda _i}} - \frac{{\left( {P_i - \widehat {P_i}} \right)^2}}{{2\sigma ^2}}} \right){\mathrm{/}}4} \right)$$

which is approximately the penalty at $v = \left( {{{P}} + \widehat {{{P}}_i}} \right){\mathrm{/}}2$. Our approximate value for v is then

$$\hat v = \frac{{{{P}} + \mathop {\sum }\nolimits_i {\kern 1pt} w_iY_i}}{{1 + \mathop {\sum }\nolimits_i {\kern 1pt} w_i}}$$

Proportion of familial risk explained

The contribution and comparison of the newly identified SNPs and the previously known variants to the familial risk, under a multiplicative model, was computed using the formula

$$\mathop {\sum }\nolimits_M {\kern 1pt} \left( {{\mathrm{log}}\lambda _{m}} \right){\mathrm{/}}\left( {{\mathrm{log}}\lambda _0} \right)$$

Where λ₀ is the observed FRR to first degree relatives of cases and λ_m is the FRR due to locus m, calculated assuming a per-allele effect:

$$\lambda _m = \frac{{p_mr_m^2 + q_m}}{{\left( {p_mr_m + q_m} \right)^2}}$$

where p_m is the frequency of the risk allele for locus m, q_m = 1 − p_m and r_m is the estimated per-allele OR.

This calculation was performed using a Bayesian framework, which allows us to attenuate any ‘winners curse’ bias, and incorporates the uncertainty in estimating the variant-specific per-allele OR, r_k, and the value of the observed familial risk, λ₀. To correct for potential bias in effect estimation from using the same sample to determine the credible set of SNPs (the so-called ‘winner’s curse’), we implemented a hierarchical model similar in spirit to Zhong and Prentice⁶⁹ by placing a normal prior distribution on effect estimates of the form $\upbeta _m\sim N\left( {0,\tau ^2} \right)$. Here β_m is the log OR from the conditional model within each region, and τ is a pre-specified variance of the effect distribution reflecting our prior beliefs. For all variants, we used a conservative value of τ = 0.05, reflecting a 95% prior probability density for a per-allele OR in the range of [0.91, 1.10]. For the FRR calculation, we specified a prior distribution as $\lambda _0\sim N\left( {2.5,0.14^2} \right)$, which places a 95% prior density in the range [2.22, 2.78] on the FRR of PrCa. This calculation was performed using the JAGS software⁷⁰.

To collapse our catalogue of credible variants identified through fine-mapping into a parsimonious set of SNPs matching the observed number of independent signals, we selected single representative lead variants to represent each signal. For the 63 regions in which JAM identified only a single signal, these were designated as the tag with the highest posterior evidence for association, whereas for the 5 regions in which JAM did not resolve candidate variants the variant most strongly associated with PrCa in the original meta-GWAS was designated as the novel lead variant. For the 12 regions containing multiple independent risk signals, to facilitate unbiased selection of variants representing different signals, JAM exhaustively fitted all possible multi-SNP models for the specified number of signals, and the combinations of SNPs with the highest posterior probability were selected to represent the independent signals. Separate models were run to derive the variant list for the full 99 signals identified and also a reduced set of 84 signals, matching the number of original index variants fine-mapped, to enable comparison between the contributions of replacement of the GWAS tag SNPs and addition of novel signals identified. To yield adjusted effect estimates for each lead variant in regions containing multiple signals, conditional effect estimates and standard errors for the selected ‘representative’ variants used for the FRR calculations were derived from the OncoArray sub-cohort of 53,449 cases and 36,225 controls, for which individual-level data were available.

Data availability

The meta-analysis summary data used in this fine-mapping project are available from the PRACTICAL Consortium (http://practical.icr.ac.uk/blog/?page_id=8164) or GitHub (https://github.com/oncogenetics/LocusExplorer/tree/master/Data/ProstateData). Results from the fine-mapping analyses may be explored interactively through Locus Explorer⁷¹ (http://www.oncogenetics.icr.ac.uk/LocusExplorer/).

References

Kote-Jarai, Z. et al. BRCA2 is a moderate penetrance gene contributing to young-onset prostate cancer: implications for genetic testing in prostate cancer patients. Br. J. Cancer 105, 1230–1234 (2011).
Article PubMed PubMed Central CAS Google Scholar
Pritchard, C. C. et al. Inherited DNA-repair gene mutations in men with metastatic prostate cancer. N. Engl. J. Med. 375, 443–453 (2016).
Article PubMed PubMed Central CAS Google Scholar
Mikropoulos, C., Goh, C., Leongamornlert, D., Kote-Jarai, Z. & Eeles, R. Translating genetic risk factors for prostate cancer to the clinic: 2013 and beyond. Future Oncol. 10, 1679–1694 (2014).
Article PubMed CAS Google Scholar
Conti, D. V. & Gauderman, W. J. SNPs, haplotypes, and model selection in a candidate gene region: the SIMPle analysis for multilocus data. Genet. Epidemiol. 27, 429–441 (2004).
Article PubMed Google Scholar
Fridley, B. L. Bayesian variable and model selection methods for genetic association studies. Genet. Epidemiol. 33, 27–37 (2009).
Article PubMed Google Scholar
Viallefont, V., Raftery, A. E. & Richardson, S. Variable selection and Bayesian model averaging in case-control studies. Stat. Med. 20, 3215–3230 (2001).
Article PubMed CAS Google Scholar
Wallace, C. et al. Dissection of a complex disease susceptibility region using a Bayesian stochastic search approach to fine mapping. PLoS Genet. 11, e1005272 (2015).
Article PubMed PubMed Central CAS Google Scholar
Schumacher, F. R. et al. Prostate cancer meta-analysis from more than 140,000 men identifies 63 novel prostate cancer susceptibility loci. Nat. Genet. https://doi.org/10.1038/s41588-018-0142-8 (2018).
Ewing, C. M. et al. Germline mutations in HOXB13 and prostate-cancer risk. N. Engl. J. Med. 366, 141–149 (2012).
Article PubMed PubMed Central CAS Google Scholar
Saunders, E. J. et al. Fine-mapping the HOXB region detects common variants tagging a rare coding allele: evidence for synthetic association in prostate cancer. PLoS Genet. 10, e1004129 (2014).
Article PubMed PubMed Central CAS Google Scholar
Akamatsu, S. et al. Common variants at 11q12, 10q26 and 3p11.2 are associated with prostate cancer susceptibility in Japanese. Nat. Genet. 44, 426–429 (2012).
Article PubMed CAS Google Scholar
Al Olama, A. A. et al. A meta-analysis of 87,040 individuals identifies 23 new susceptibility loci for prostate cancer. Nat. Genet. 46, 1103–1109 (2014).
Article PubMed PubMed Central CAS Google Scholar
Duggan, D. et al. Two genome-wide association studies of aggressive prostate cancer implicate putative prostate tumor suppressor gene DAB2IP. J. Natl. Cancer Inst. 99, 1836–1844 (2007).
Article PubMed CAS Google Scholar
Haiman, C. A. et al. Genome-wide association study of prostate cancer in men of African ancestry identifies a susceptibility locus at 17q21. Nat. Genet. 43, 570–573 (2011).
Article PubMed PubMed Central CAS Google Scholar
Xu, J. et al. Genome-wide association study in Chinese men identifies two new prostate cancer risk loci at 9q31.2 and 19q13.4. Nat. Genet. 44, 1231–1235 (2012).
Article PubMed PubMed Central CAS Google Scholar
Newcombe, P. J., Conti, D. V. & Richardson, S. JAM: a scalable Bayesian framework for joint analysis of marginal SNP effects. Genet. Epidemiol. 40, 188–201 (2016).
Article PubMed PubMed Central Google Scholar
FitzGerald, L. M. et al. Investigation of the relationship between prostate cancer and MSMB and NCOA4 genetic variants and protein expression. Hum. Mutat. 34, 149–156 (2013).
Article PubMed CAS Google Scholar
Pomerantz, M. M. et al. Analysis of the 10q11 cancer risk locus implicates MSMB and NCOA4 in human prostate tumorigenesis. PLoS Genet. 6, e1001204 (2010).
Article PubMed PubMed Central CAS Google Scholar
Whitaker, H. C. et al. The rs10993994 risk allele for prostate cancer results in clinically relevant changes in microseminoprotein-beta expression in tissue and urine. PLoS ONE 5, e13363 (2010).
Article ADS PubMed PubMed Central CAS Google Scholar
Davydov, E. V. et al. Identifying a high fraction of the human genome to be under selective constraint using GERP++. PLoS Comput. Biol. 6, e1001025 (2010).
Article PubMed PubMed Central CAS Google Scholar
Garber, M. et al. Identifying novel constrained elements by exploiting biased substitution patterns. Bioinformatics 25, i54–i62 (2009).
Article ADS PubMed PubMed Central CAS Google Scholar
Siepel, A. et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 15, 1034–1050 (2005).
Article PubMed PubMed Central CAS Google Scholar
Taberlay, P. C., Statham, A. L., Kelly, T. K., Clark, S. J. & Jones, P. A. Reconfiguration of nucleosome-depleted regions at distal regulatory elements accompanies DNA methylation of enhancers and insulators in cancer. Genome Res. 24, 1421–1432 (2014).
Article PubMed PubMed Central CAS Google Scholar
Mei, S. et al. Cistrome Data Browser: a data portal for ChIP-Seq and chromatin accessibility data in human and mouse. Nucleic Acids Res. 45, D658–D662 (2017).
Article PubMed CAS Google Scholar
Koenker, R. Quantile Regression (Cambridge University Press, New York, 2005).
Kozumi, H. & Kobayashi, G. Gibbs sampling methods for Bayesian quantile regression. J. Stat. Comput. Simul. 81, 1565–1578 (2011).
Article MathSciNet MATH Google Scholar
Eeles, R. A. et al. Identification of 23 new prostate cancer susceptibility loci using the iCOGS custom genotyping array. Nat. Genet. 45, 385–391 (2013).
Article PubMed CAS Google Scholar
Amin Al Olama, A. et al. Multiple novel prostate cancer susceptibility signals identified by fine-mapping of known risk loci among Europeans. Hum. Mol. Genet. 24, 5589–5602 (2015).
Article PubMed PubMed Central CAS Google Scholar
Takata, R. et al. Genome-wide association study identifies five new susceptibility loci for prostate cancer in the Japanese population. Nat. Genet. 42, 751–754 (2010).
Article PubMed CAS Google Scholar
Spisak, S. et al. CAUSEL: an epigenome- and genome-editing pipeline for establishing function of noncoding GWAS variants. Nat. Med. 21, 1357–1363 (2015).
Article PubMed PubMed Central CAS Google Scholar
Conti, D. V. et al. Two novel susceptibility loci for prostate cancer in men of African ancestry. J. Natl. Cancer Inst. 109, djx084 (2017).
Article PubMed Central Google Scholar
Chung, C. C. et al. Fine mapping of a region of chromosome 11q13 reveals multiple independent loci associated with risk of prostate cancer. Hum. Mol. Genet. 20, 2869–2878 (2011).
Article PubMed PubMed Central CAS Google Scholar
Han, Y. et al. Integration of multiethnic fine-mapping and genomic annotation to prioritize candidate functional SNPs at prostate cancer susceptibility regions. Hum. Mol. Genet. 24, 5603–5618 (2015).
Article PubMed PubMed Central CAS Google Scholar
Kote-Jarai, Z. et al. Identification of a novel prostate cancer susceptibility variant in the KLK3 gene transcript. Hum. Genet. 129, 687–694 (2011).
Article PubMed PubMed Central CAS Google Scholar
Kote-Jarai, Z. et al. Fine-mapping identifies multiple prostate cancer risk loci at 5p15, one of which associates with TERT expression. Hum. Mol. Genet. 22, 2520–2528 (2013).
Article PubMed PubMed Central CAS Google Scholar
Benner, C. et al. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics 32, 1493–1501 (2016).
Article PubMed PubMed Central CAS Google Scholar
Chen, W. et al. Fine mapping causal variants with an approximate Bayesian method using marginal test statistics. Genetics 200, 719–736 (2015).
Article PubMed PubMed Central CAS Google Scholar
Wang, Z. et al. Imputation and subset-based association analysis across different cancer types identifies multiple independent risk loci in the TERT-CLPTM1L region on chromosome 5p15.33. Hum. Mol. Genet. 23, 6616–6633 (2014).
Article PubMed PubMed Central CAS Google Scholar
Chen, W., McDonnell, S. K., Thibodeau, S. N., Tillmans, L. S. & Schaid, D. J. Incorporating functional annotations for fine-mapping causal variants in a Bayesian framework using summary statistics. Genetics 204, 933–958 (2016).
Article PubMed PubMed Central Google Scholar
Farh, K. K. et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature 518, 337–343 (2015).
Article ADS PubMed CAS Google Scholar
Kichaev, G. et al. Integrating functional data to prioritize causal variants in statistical fine-mapping studies. PLoS Genet. 10, e1004722 (2014).
Article PubMed PubMed Central CAS Google Scholar
Pickrell, J. K. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am. J. Hum. Genet. 94, 559–573 (2014).
Article PubMed PubMed Central CAS Google Scholar
Amos, C. I. et al. The OncoArray consortium: a network for understanding the genetic architecture of common cancers. Cancer Epidemiol. Biomark. Prev. 26, 126–135 (2017).
Article Google Scholar
Edlund, C. K., Lee, W. H., Li, D., Van Den Berg, D. J. & Conti, D. V. Snagger: a user-friendly program for incorporating additional information for tagSNP selection. BMC Bioinformatics 9, 174 (2008).
Article PubMed PubMed Central CAS Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article PubMed PubMed Central CAS Google Scholar
Verzilli, C. et al. Bayesian meta-analysis of genetic association studies with different sets of markers. Am. J. Hum. Genet. 82, 859–872 (2008).
Article PubMed PubMed Central CAS Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–375 (2012).
Article PubMed PubMed Central CAS Google Scholar
Vilhjalmsson, B. J. et al. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet. 97, 576–592 (2015).
Article PubMed PubMed Central CAS Google Scholar
Bottolo, L. & Richardson, S. Evolutionary stochastic search for Bayesian model exploration. Bayesian Anal. 5, 583–618 (2010).
Article MathSciNet MATH Google Scholar
Fernández, C., Ley, E. & Steel, M. F. J. Benchmark priors for Bayesian model averaging. J. Econom. 100, 381–427 (2001).
Article MathSciNet MATH Google Scholar
George, E. I. & McCulloch, R. E. Approaches for Bayesian variable selection. Stat. Sin. 7, 339–373 (1997).
MATH Google Scholar
Green, P. J. Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82, 711–732 (1995).
Article MathSciNet MATH Google Scholar
Cui, W. & George, E. I. Empirical Bayes vs. fully Bayes variable selection. J. Stat. Plan. Inference 138, 888–900 (2008).
Article MathSciNet MATH Google Scholar
Ley, E. & Steel, M. F. J. On the effect of prior assumptions in Bayesian model averaging with applications to growth regression. J. Appl. Econ. 24, 651–674 (2009).
Article MathSciNet Google Scholar
Scott, J. G. & Berger, J. O. Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem. Ann. Statist. 38, 2587–2619 (2010).
Article MathSciNet MATH Google Scholar
Wilson, M. A., Iversen, E. S., Clyde, M. A., Schmidler, S. C. & Schildkraut, J. M. Bayesian Model Search and multilevel inference for Snp association studies. Ann. Appl. Stat. 4, 1342–1364 (2010).
Article MathSciNet PubMed PubMed Central MATH Google Scholar
Kass, R. E. & Raftery, A. E. Bayes factors. J. Am. Stat. Assoc. 90, 773–795 (1995).
Article MathSciNet MATH Google Scholar
Yang, H. & Wang, K. Genomic variant annotation and prioritization with ANNOVAR and wANNOVAR. Nat. Protoc. 10, 1556–1566 (2015).
Article PubMed PubMed Central CAS Google Scholar
Kozomara, A. & Griffiths-Jones, S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 42, D68–D73 (2014).
Article PubMed CAS Google Scholar
Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315 (2014).
Article PubMed PubMed Central CAS Google Scholar
Consortium, E. P. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Article ADS CAS Google Scholar
Thurman, R. E. et al. The accessible chromatin landscape of the human genome. Nature 489, 75–82 (2012).
Article ADS PubMed PubMed Central CAS Google Scholar
Ernst, J. & Kellis, M. ChromHMM: automating chromatin-state discovery and characterization. Nat. Methods 9, 215–216 (2012).
Article PubMed PubMed Central CAS Google Scholar
Hoffman, M. M. et al. Integrative annotation of chromatin elements from ENCODE data. Nucleic Acids Res. 41, 827–841 (2013).
Article PubMed CAS Google Scholar
Anderson, C. A. et al. Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564–1573 (2010).
Article PubMed PubMed Central CAS Google Scholar
Stegle, O., Parts, L., Durbin, R. & Winn, J. A Bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies. PLoS. Comput. Biol. 6, e1000770 (2010).
Article ADS MathSciNet PubMed PubMed Central CAS Google Scholar
Ongen, H., Buil, A., Brown, A. A., Dermitzakis, E. T. & Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479–1485 (2016).
Article PubMed CAS Google Scholar
Nica, A. C. et al. Candidate causal regulatory effects by integration of expression QTLs with complex trait genetic associations. PLoS Genet. 6, e1000895 (2010).
Article PubMed PubMed Central CAS Google Scholar
Zhong, H. & Prentice, R. L. Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies. Biostatistics 9, 621–634 (2008).
Article PubMed PubMed Central Google Scholar
Plummer, M. JAGS: a program for analysis of Bayesian graphical models using Gibbs sampling. In Proc. 3rd International Workshop on Distributed Statistical Computing (DSC 2003), Vienna, Austria (eds Hornik, K. et al.) 1–10 (2003).
Dadaev, T., Leongamornlert, D. A., Saunders, E. J., Eeles, R. & Kote-Jarai, Z. LocusExplorer: a user-friendly tool for integrated visualization of human genetic association data and biological annotations. Bioinformatics 32, 949–951 (2016).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

We would particularly like to thank all the patients and control men who took part in all the studies involved in this work, as well as all the researchers, clinicians, technicians and administrative staff who have enabled this work to be carried out, and the collaborators in the PRACTICAL consortium. We wish to thank all GWAS study groups contributing to the meta-analysis data set from which these fine-mapping analyses were conducted: BPC3 (Breast and Prostate Cancer Cohort Consortium); CAPS (Cancer of the Prostate in Sweden); PEGASUS (Prostate Cancer Genome-wide Association Study of Uncommon Susceptibility Loci); The PRACTICAL (Prostate Cancer Association Group to Investigate Cancer-Associated Alterations in the Genome) Consortium; and The GAME-ON/ELLIPSE Consortium. Detailed acknowledgements and funding information for all GWAS study groups and from all the individual studies involved in the PRACTICAL Consortium are included in Supplementary Note 1.

Author information

These authors contributed equally: Tokhir Dadaev, Edward J. Saunders.
These authors jointly supervised this work: Christopher A. Haiman, Rosalind A. Eeles, David V. Conti, Zsofia Kote-Jarai.
Deceased: Brian E. Henderson.

Authors and Affiliations

The Institute of Cancer Research, London, SW7 3RP, UK
Tokhir Dadaev, Edward J. Saunders, Ezequiel Anokian, Daniel A. Leongamornlert, Mark N. Brook, Clara Cieza-Borrella, Martina Mijuskovic, Sarah Wakerell, Sara Benlloch, Mahbubl Ahmed, Chee Goh, Koveela Govindasami, Alison Thwaites, Michelle Guy, Ian Whitmore, Angela Morgan, Cyril Fisher, Steve Hazel, Naomi Livni, David Dearnaley, Rosalind A. Eeles & Zsofia Kote-Jarai
MRC Biostatistics Unit, University of Cambridge, Robinson Way, Cambridge, CB2 0SR, UK
Paul J. Newcombe
Cancer Genome Project, Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
Daniel A. Leongamornlert
Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, Strangeways Research Laboratory, University of Cambridge, Cambridge, CB1 8RN, UK
Ali Amin Al Olama, Sara Benlloch, Paul Pharoah, Laura Fachal, Margaret Cook & Douglas F. Easton
Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
Ali Amin Al Olama
Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, 44106-7219, USA
Fredrick R. Schumacher
Seidman Cancer Center, University Hospitals, Cleveland, OH, 44106, USA
Fredrick R. Schumacher
Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD, 20892, USA
Sonja I. Berndt, Demetrius Albanes, Stephanie Weinstein, Stella Koutros, Laura E. Beane Freeman, Robert N. Hoover, Mitchell J. Machiela & Stephen Chanock
Department of Preventive Medicine, Keck School of Medicine, University of Southern California/Norris Comprehensive Cancer Center, Los Angeles, CA, 90015, USA
Xin Sheng, Zhuo Zhang, Sue Ann Ingles, Mariana C. Stern, Brian E. Henderson, Christopher A. Haiman & David V. Conti
Institute of Population Health, University of Manchester, Manchester, M13 9PL, UK
Kenneth Muir & Artitaya Lophatananon
Warwick Medical School, University of Warwick, Coventry, CV4 7AL, UK
Kenneth Muir & Artitaya Lophatananon
Epidemiology Research Program, American Cancer Society, 250 Williams Street, Atlanta, GA, 30303, USA
Victoria L. Stevens, Susan M. Gapstur & Brian D. Carter
SWOG Statistical Center, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
Catherine M. Tangen & Phyllis Goodman
CHRISTUS Santa Rosa Hospital - Medical Center, San Antonio, TX, 78229, USA
Ian M. Thompson Jr.
Australian Prostate Cancer Research Centre-Qld, Institute of Health and Biomedical Innovation and School of Biomedical Science, Queensland University of Technology, Brisbane, QLD, 4059, Australia
Jyotsna Batra, Leire Moya, Judith Clements, Srilakshmi Srinivasan, Mary-Anne Kedda, Trina Yeadon & Allison Eckert
Translational Research Institute, Brisbane, QLD, 4102, Australia
Jyotsna Batra, Leire Moya, Judith Clements, Srilakshmi Srinivasan, Mary-Anne Kedda, Trina Yeadon & Allison Eckert
Menzies Health Institute Queensland, Griffith University, Gold Coast, QLD, 4222, Australia
Suzanne Chambers & Joanne Aitken
Cancer Council Queensland, Fortitude Valley, QLD, 4006, Australia
Suzanne Chambers & Joanne Aitken
Chris O’Brien Lifehouse (COBLH), Camperdown, Sydney, NSW, 2010, Australia
Lisa Horvath
Garvan Institute of Medical Research, Sydney, NSW, 2010, Australia
Lisa Horvath & Anne-Maree Haynes
Dame Roma Mitchell Cancer Research Centre, University of Adelaide, Adelaide, SA, 5005, Australia
Wayne Tilley
Department of Anatomy and Developmental Biology, Biomedicine Discovery Institute, Monash University, Melbourne, VIC, 3800, Australia
Gail Risbridger
Prostate Cancer Translational Research Program, Cancer Research Division, Peter MacCallum Cancer Centre, Melbourne, VIC, 3000, Australia
Gail Risbridger
Department of Medical Epidemiology and Biostatistics, Karolinska Institute, SE-171 77, Stockholm, Sweden
Henrik Gronberg, Markus Aly, Tobias Nordström, Martin Eklund, Carin Cavalli-Bjoerkman & Fredrik Wiklund
Department of Molecular Medicine and Surgery, Karolinska Institutet, and Department of Urology, Karolinska University Hospital, 171 76, Stockholm, Sweden
Markus Aly
Department of Clinical Sciences at Danderyd Hospital, Karolinska Institutet, 182 88, Stockholm, Sweden
Tobias Nordström
Centre for Cancer Genetic Epidemiology, Department of Oncology, Strangeways Laboratory, University of Cambridge, Cambridge, CB1 8RN, UK
Paul Pharoah, Nora Pashayan, Alison M. Dunning & Douglas F. Easton
Department of Applied Health Research, University College London, London, WC1E 7HB, UK
Nora Pashayan
Institute of Biomedicine, University of Turku, FI-20014, Turku, Finland
Johanna Schleutker & Csilla Sipeky
Tyks Microbiology and Genetics, Department of Medical Genetics, Turku University Hospital, 20521, Turku, Finland
Johanna Schleutker
Department of Urology, Tampere University Hospital, University of Tampere, Kalevantie 4, FI-33014, Tampere, Finland
Teuvo L. J. Tammela & Teemu Murtola
Department of Epidemiology, School of Health Sciences, University of Tampere, FI-33014, Tampere, Finland
Anssi Auvinen
Division of Nutritional Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, SE-171 77, Stockholm, Sweden
Alicja Wolk & Niclas Hakansson
Division of Cancer Sciences, Manchester Academic Health Science Centre, Radiotherapy Related Research, Manchester NIHR Biomedical Research Centre, The Christie Hospital NHS Foundation Trust, University of Manchester, Manchester, M13 9PL, UK
Catharine West & Rebecca Elliott
University of Cambridge Department of Oncology, Oncology Centre, Cambridge University Hospitals NHS Foundation Trust, Cambridge, CB1 8RN, UK
Neil Burnet & Gill Barnett
Department of Epidemiology, Harvard School of Public Health, Boston, MA, 02115, USA
Lorelei Mucci, Edward Giovannucci & Hardeep Ranu
Washington University School of Medicine, St. Louis, MO, 63110, USA
Gerald Andriole, Bettina F. Drake & Aleksandra Klim
GRC N°5 ONCOTYPE-URO, UPMC Univ Paris 06, Tenon Hospital, F-75020, Paris, France
Olivier Cussenot & Géraldine Cancel-Tassin
CeRePP, Tenon Hospital, F-75020, Paris, France
Olivier Cussenot & Géraldine Cancel-Tassin
Department of Molecular Medicine, Aarhus University Hospital, 8200, Aarhus N, Denmark
Karina Dalsgaard Sorensen & Torben Falck Orntoft
Department of Clinical Medicine, Aarhus University, 8200, Aarhus N, Denmark
Karina Dalsgaard Sorensen, Torben Falck Orntoft & Michael Borre
Department of Urology, Aarhus University Hospital, 8200, Aarhus N, Denmark
Michael Borre
Department of Medical Genetics, Oslo University Hospital, 0424, Oslo, Norway
Lovise Maehle & Eli Marie Grindedal
Department of Oncology, Addenbrooke’s Hospital, University of Cambridge, Cambridge, CB2 0QQ, UK
David E. Neal
Cancer Research UK Cambridge Research Institute, Li Ka Shing Centre, Cambridge, CB2 0RE, UK
David E. Neal
Nuffield Department of Surgical Sciences, University of Oxford, Oxford, OX1 2JD, UK
David E. Neal, Freddie C. Hamdy & Gemma Marsden
School of Social and Community Medicine, University of Bristol, Canynge Hall, 39 Whatley Road, Bristol, BS8 2PS, UK
Jenny L. Donovan, Richard M. Martin, Michael Davis, Athene Lane & Sarah J. Lewis
Faculty of Medical Science, John Radcliffe Hospital, University of Oxford, Oxford, OX1 2JD, UK
Freddie C. Hamdy & Gemma Marsden
Medical Research Council (MRC) Integrative Epidemiology Unit, University of Bristol, Bristol, BS8 2BN, UK
Richard M. Martin
National Institute for Health Research (NIHR) Biomedical Research Centre, University of Bristol, Bristol, BS8 1TH, UK
Richard M. Martin
Cancer Epidemiology, Nuffield Department of Population Health, University of Oxford, Oxford, OX3 7LF, UK
Ruth C. Travis & Tim J. Key
Department of Surgical Oncology, Princess Margaret Cancer Centre, Toronto, ON, M5G 2M9, Canada
Robert J. Hamilton, Neil E. Fleshner, Antonio Finelli, Paul Brown, Girish S. Kulkarni & Alexandre R. Zlotta
Department of Radiation Oncology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Barry Rosenstein
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029-5674, USA
Barry Rosenstein
Department of Radiation Oncology, University of Rochester Medical Center, Rochester, NY, 14620, USA
Sarah Kerns
Professor of Pathology and Pediatrics, Albert Einstein College of Medicine, Bronx, NY, 10461, USA
Harry Ostrer
Centre for Molecular Oncology, Barts Cancer Institute, John Vane Science Centre, Queen Mary University of London, London, EC1M 6BQ, UK
Yong-Jie Lu, Xueying Mao & Jacek Marzec
Second Military Medical University, Shanghai, 200433, P. R. China
Hong-Wei Zhang, Guangwen Cao, Ji Lin, Jin Ling & Meiling Li
Wuxi Second Hospital, Nanjing Medical University, Wuxi, Jiangzhu, 214003, China
Ninghan Feng
Department of Urology, The First Affiliated Hospital, Chongqing Medical University, Chongqing, 200032, China
Xin Guo, Jie Li & Weiyang He
The People’s Hospital of Liaoning Province and The People’s Hospital of China Medical University, Shenyang, 110001, China
Xin Guo & Zan Sun
Department of Urology, Zhongshan Hospital, Fudan University Medical College, Shanghai, 200032, China
Guomin Wang & Jianming Guo
Cancer Epidemiology & Intelligence Division, Cancer Council Victoria, Melbourne, VIC, 3004, Australia
Graham G. Giles, Robert J. MacInnis & Roger Milne
Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC, 3010, Australia
Graham G. Giles, Robert J. MacInnis, Liesel M. FitzGerald, John L. Hopper & Roger Milne
Precision Medicine, School and Clinical Sciences at Monash Health, Monash University, Clayton, VIC, 3168, Australia
Melissa C. Southey
Menzies Institute for Medical Research, University of Tasmania, Hobart, TAS, 7000, Australia
Liesel M. FitzGerald
Division of Urologic Surgery, Brigham and Womens Hospital, Boston, MA, 02115, USA
Adam S. Kibel
Fundación Pública Galega de Medicina Xenómica-SERGAS, Grupo de Medicina Xenómica, CIBERER, IDIS, Santiago de Compostela, 15706, Spain
Ana Vega, Laura Fachal, Miguel Aguado & Angel Carracedo
Department of Radiation Oncology, Complexo Hospitalario Universitario de Santiago, SERGAS, 15706, Santiago de Compostela, Spain
Antonio Gómez-Caamaño, Ana Carballo, Paula Peleteiro & Patricia Calvo
Division of Family Medicine, Department of Neurobiology, Care Science and Society, Karolinska Institutet, Huddinge, SE-171 77, Stockholm, Sweden
Robert Szulkin
Scandinavian Development Services, 182 33, Danderyd, Sweden
Robert Szulkin
Centre for Research in Environmental Epidemiology (CREAL), Barcelona Institute for Global Health (ISGlobal), 08003, Barcelona, Spain
Manolis Kogevinas, Gemma Castaño-Vinyals, Mariona Bustamante & Esther Gracia-Lavedan
CIBER Epidemiología y Salud Pública (CIBERESP), 28029, Madrid, Spain
Manolis Kogevinas, Javier Llorca, Gemma Castaño-Vinyals, Esther Gracia-Lavedan, Trinidad Dierssen-Sotos & Ines Gomez-Acebo
IMIM (Hospital del Mar Research Institute), 08003, Barcelona, Spain
Manolis Kogevinas, Gemma Castaño-Vinyals, Lluís Cecchini & Esther Gracia-Lavedan
Universitat Pompeu Fabra (UPF), 08002, Barcelona, Spain
Manolis Kogevinas, Gemma Castaño-Vinyals & Esther Gracia-Lavedan
University of Cantabria-IDIVAL, 39005, Santander, Spain
Javier Llorca, Trinidad Dierssen-Sotos & Ines Gomez-Acebo
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital/Harvard Medical School, Boston, MA, 02184, USA
Kathryn L. Penney, Meir Stampfer & Jing Ma
Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, 33612, USA
Jong Y. Park, Thomas A. Sellers, Hyun Park & Babu Zachariah
School of Public Health, Louisiana State University Health Sciences Center, New Orleans, LA, 70112, USA
Hui-Yi Lin
Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109-1024, USA
Janet L. Stanford, Milan S. Geybels, Daniel W. Lin, Lisa F. Newcomb & Suzanne Kolb
Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA, 98195, USA
Janet L. Stanford
International Hereditary Cancer Center, Department of Genetics and Pathology, Pomeranian Medical University, 70-115, Szczecin, Poland
Cezary Cybulski, Dominika Wokolorczyk, Jan Lubinski & Wojciech Kluzniak
National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Elaine A. Ostrander
Faculty of Health and Medical Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
Børge G. Nordestgaard, Sune F. Nielsen & Peter Iversen
Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, 2200, Copenhagen, Denmark
Børge G. Nordestgaard, Sune F. Nielsen & Maren Weisher
Department of Urology, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, 2200, Copenhagen, Denmark
Rasmus Bisbjerg & Peter Klarskov
Copenhagen Prostate Cancer Center, Department of Urology, Rigshospitalet, Copenhagen University Hospital, DK-2730, Herlev, Denmark
Martin Andreas Røder & Peter Iversen
Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), D-69120, Heidelberg, Germany
Hermann Brenner & Katarina Cuk
German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), D-69120, Heidelberg, Germany
Hermann Brenner
Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), 69120, Heidelberg, Germany
Hermann Brenner
Saarland Cancer Registry, 66119, Saarbrücken, Germany
Bernd Holleczek & Christa Stegmaier
Institute for Human Genetics, University Hospital Ulm, 89075, Ulm, Germany
Christiane Maier, Manuel Luedeke & Walther Vogel
Department of Urology, University Hospital Ulm, 89075, Ulm, Germany
Thomas Schnoeller & Philipp Bohnert
Department of Genitourinary Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
Jeri Kim & Christopher J. Logothetis
Cancer Prevention Institute of California, Fremont, CA, 94538, USA
Esther M. John
Department of Health Research & Policy (Epidemiology) and Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, 94305-5101, USA
Esther M. John
Department of Genetics, Portuguese Oncology Institute of Porto, 4200-072, Porto, Portugal
Manuel R. Teixeira, Paula Paulo, Marta Cardoso, Sofia Maia & Maria P. Silva
Biomedical Sciences Institute (ICBAS), University of Porto, 4050-313, Porto, Portugal
Manuel R. Teixeira
Department of Population Sciences, Beckman Research Institute of the City of Hope, Duarte, CA, 91010, USA
Susan L. Neuhausen, Linda Steele & Yuan Chun Ding
Ghent University, Faculty of Medicine and Health Sciences, Basic Medical Sciences, B-9000, Gent, Belgium
Kim De Ruyck, Gert De Meerleer, Sofie De Langhe & Hubert Thierens
Department of Radiotherapy, Ghent University Hospital, B-9000, Gent, Belgium
Piet Ost
Department of Surgery, Faculty of Medicine, University of Malaya, 50603, Kuala Lumpur, Malaysia
Azad Razack, Jasmine Lim, Meng H. Tan & Aik T. Ong
Cancer Research Malaysia (CRM), Outpatient Centre, Subang Jaya Medical Centre, 47500, Subang Jaya, Selangor, Malaysia
Soo-Hwang Teo
Department of Urology, University of Washington, Seattle, WA, 98195, USA
Daniel W. Lin & Lisa F. Newcomb
Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, D-20246, Hamburg, Germany
Davor Lessel
Division of Medical Oncology, Urogenital Unit, Department of Oncology at the University Hospital Centre Zagreb, Šalata 2, 10000, Zagreb, Croatia
Marija Gamulin
Department of Urology, University Hospital Center Zagreb, University of Zagreb School of Medicine, Šalata 2, 10000, Zagreb, Croatia
Tomislav Kulis & Zeljko Kastelan
Molecular Medicine Center, Department of Medical Chemistry and Biochemistry, Medical University of Sofia, 1431, Sofia, Bulgaria
Radka Kaneva, Vanio Mitev, Darina Kachakova & Atanaska Mitkova
Department of Oncology, Cross Cancer Institute, University of Alberta, Edmonton, AB, T6G 1Z2, Canada
Nawaid Usmani, Matthew Parliament & Sandeep Singhal
Division of Radiation Oncology, Cross Cancer Institute, Edmonton, AB, T6G 1Z2, Canada
Nawaid Usmani & Matthew Parliament
Department of Urology and Alexandrovska University Hospital, Medical University of Sofia, 1431, Sofia, Bulgaria
Chavdar Slavov & Elenko Popov
Molecular Endocrinology Laboratory, Department of Cellular and Molecular Medicine, KU Leuven, BE-3000, Leuven, Belgium
Frank Claessens & Thomas Van den Broeck
Department of Urology, University Hospitals Leuven, BE-3000, Leuven, Belgium
Steven Joniau & Thomas Van den Broeck
Southampton General Hospital, The University of Southampton, Southampton, SO16 6YD, UK
Samantha Larkin
Manchester Cancer Research Centre, Faculty of Biology Medicine & Health, Manchester Academic Health Science Centre, NIHR Manchester Biomedical Research Centre, Health Innovation Manchester, University of Manchester, Manchester, M13 9WL, UK
Paul A. Townsend
The University of Surrey, Guildford, Surrey, GU2 7XH, UK
Claire Aukim-Hastie, Hardev Pandha, Agnieszka Michael, Andrzej Kierzek, Ami Karlsson, Michael Broms & Huihai Wu
Genomic Medicine Group, Galician Foundation of Genomic Medicine, Instituto de Investigacion Sanitaria de Santiago de Compostela (IDIS), Complejo Hospitalario Universitario de Santiago, Servicio Galego de Saúde, SERGAS, 15706, Santiago de Compostela, Spain
Manuela Gago-Dominguez
Moores Cancer Center, University of California San Diego, La Jolla, CA, 92037, USA
Manuela Gago-Dominguez
Genetic Oncology Unit, CHUVI Hospital, Complexo Hospitalario Universitario de Vigo, Instituto de Investigación Biomédica Galicia Sur (IISGS), 36204, Vigo (Pontevedra), Spain
Jose Esteban Castelao
Moores Cancer Center, Department of Family Medicine and Public Health, University of California San Diego, La Jolla, CA, 92093-0012, USA
Maria Elena Martinez
Department of Urology, Erasmus University Medical Center, 3015 CE, Rotterdam, The Netherlands
Monique J. Roobol, Guido Jenster, Christopher Bangma & F. H. Schroder
Department of Clinical Chemistry, Erasmus University Medical Center, 3015 CE, Rotterdam, The Netherlands
Ron H. N. van Schaik
Cancer & Environment Group, Center for Research in Epidemiology and Population Health (CESP), INSERM, University Paris-Sud, University Paris-Saclay, 94807, Villejuif Cédex, France
Florence Menegaux, Thérèse Truong, Yves Akoli Koudou, Sylvie Cenee & Marie Sanchez
Program for Personalized Cancer Care, NorthShore University HealthSystem, Evanston, IL, 60201, USA
Jianfeng Xu
Clinical Gerontology Unit, University of Cambridge, Cambridge, CB2 2QQ, UK
Kay-Tee Khaw
Division of Genetic Epidemiology, Department of Medicine, University of Utah School of Medicine, Salt Lake City, UT, 84112, USA
Lisa Cannon-Albright
George E. Wahlen Department of Veterans Affairs Medical Center, Salt Lake City, UT, 84148, USA
Lisa Cannon-Albright
Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
Stephen N. Thibodeau, Lori Tillmans & Shaun Riska
Division of Biomedical Statistics & Informatics, Mayo Clinic, Rochester, MN, 55905, USA
Shannon K. McDonnell & Daniel J. Schaid
Department of Epidemiology, University of Washington, Seattle, WA, 98195, USA
Sara Lindstrom
Program in Genetic Epidemiology and Statistical Genetics, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
Constance Turman, David J. Hunter & Peter Kraft
Department of Epidemiology and Biostatistics, School of Public Health, Imperial College, London, SW7 2AZ, UK
Elio Riboli & Clare Berry
Genomics England, Queen Mary University of London, Dawson Hall, Charterhouse Square, London, EC1M 6BQ, UK
Afshan Siddiq
Genomic Epidemiology Group, German Cancer Research Center (DKFZ), D-69120, Heidelberg, Germany
Federico Canzian
Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, 96813, USA
Laurence N. Kolonel & Loic Le Marchand
Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Matthew Freedman
Royal Marsden NHS Foundation Trust, London, SW3 6JJ, UK
David Dearnaley & Rosalind A. Eeles
Molecular Cancer Epidemiology Laboratory, QIMR Berghofer Institute of Medical Research, Herston, QLD, 4006, Australia
Amanda Spurdle
School of Medicine, University of Queensland, Herston, QLD, 4006, Australia
Robert Gardiner
Royal Brisbane & Women’s Hospital, Herston, QLD, 4029, Australia
Robert Gardiner
The Kinghorn Cancer Centre (TKCC), Victoria, NSW, 2010, Australia
Vanessa Hayes & Anne-Maree Haynes
Prostate Cancer Research Group, South Australian Health & Medical Research Institute, Adelaide, SA, 5000, Australia
Lisa Butler
Department of Physiology, Biomedicine Discovery Institute, Cancer Program, Monash University, Melbourne, VIC, 3800, Australia
Renea Taylor & Melissa Papargiris
University of Adelaide, North Terrace, Adelaide, SA, 5005, Australia
Pamela Saunders
Fimlab Laboratories, Tampere University Hospital, FI-33520, Tampere, Finland
Paula Kujala
Finnish Cancer Registry, FI-00130, Helsinki, Finland
Kirsi Talala
Faculty of Medicine and Life Sciences, University of Tampere, FI-33014, Tampere, Finland
Teemu Murtola
Department of Urology, Helsinki University Central Hospital and University of Helsinki, FI-00014, Helsinki, Finland
Kimmo Taari
Division of Biostatistics and Bioinformatics, University of Maryland Greenebaum Cancer Center, and Department of Epidemiology and Public Health, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
Søren Bentzen
Cancer Genomics Research Laboratory (CGR), Division of Cancer Epidemiology and Genetics, FNLCR Leidos Biomedical Research, National Cancer Institute, Frederick, MD, 21701, USA
Belynda Hicks & Aurelie Vogt
DNA Extraction and Staging Laboratory (DESL), Cancer Genomics Research Laboratory (CGR), Division of Cancer Epidemiology and Genetics, FNLCR Leidos Biomedical Research, National Cancer Institute, Frederick, MD, 21701, USA
Amy Hutchinson
Sheffield Institute for Nucleic Acids, University of Sheffield, Sheffield, S10 2TN, UK
Angela Cox
Cambridge Cancer Trials Centre, Cambridge Clinical Trials Unit - Cancer Theme, Cambridge University Hospitals NHS Foundation Trust, Cambridge, CB2 0QQ, UK
Anne George
Department of Medical Imaging, University Health Network, Toronto, ON, M5G 2C4, Canada
Ants Toi
Department of Pathology, University Health Network, Toronto, ON, M5G 2C4, Canada
Andrew Evans & Theodorus H. van der Kwast
Advanced Radiation Biology Research Program, Research Center for Charged Particle Therapy, National Institute of Radiological Sciences, Chiba, 263-8555, Japan
Takashi Imai
Department of Urology, National Hospital Organization Tokyo Medical Center, Tokyo, 152-8902, Japan
Shiro Saito
Department of Urology, Nanfang Hospital, Southern Medical University, 510515, Guangzhou, China
Shan-Chao Zhao
Department of Pathology, The First Affiliated Hospital, Zhejiang University Medical College, 310009, Hangzhou, China
Guoping Ren & Yangling Zhang
Department of Pathology, Changhai Hospital, The Second Military Medical University, 200433, Shanghai, China
Yongwei Yu
Department of Urology, First Affiliated Hospital, Medical College, Zhengzhou University, 450003, Zhengzhou, China
Yudong Wu
Department of Urology, North Sichuan Medical College, 637000, Nanchong, China
Ji Wu
Department of Nutrition Science, Shenyang Medical College, 110034, Shenyang, China
Bo Zhou
Tissupath Pty Ltd., Melbourne, VIC, 3122, Australia
John Pedersen
Department of Medical Physics, Complexo Hospitalario Universitario de Santiago, SERGAS, 15706, Santiago de Compostela, Spain
Ramón Lobato-Busto
Urology Department, Hospital Germans Trias I Pujol, 08916, Barcelona, Spain
José Manuel Ruiz-Dominguez
Laboratory and Department of Urology, Hospital Clínic, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Universitat de Barcelona, 08036, Barcelona, Spain
Lourdes Mengual
Centre de Recerca Biomèdica CELLEX, 08036, Barcelona, Spain
Lourdes Mengual
Department and Laboratory of Urology, Hospital Clínic, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Universitat de Barcelona, 08036, Barcelona, Spain
Antonio Alcaraz
Genitourinary Program, Moffitt Cancer Center, Tampa, FL, 33612, USA
Julio Pow-Sang
Department of Urology, Klinikum rechts der Isar der Technischen Universitaet Muenchen, 81675, Munich, Germany
Kathleen Herkommer
Department of General and Clinical Pathology and Alexandrovska University Hospital, Medical University, 1431, Sofia, Bulgaria
Aleksandrina Vlahova, Tihomir Dikov & Svetlana Christova
Center of Excellence in Genomic Medicine Research, King Abdulaziz University, Jeddah, 2252 3270, Saudi Arabia
Angel Carracedo
Grupo de Medicina Xenómica, CIBERER, CIMUS, Universidad de Santiago de Compostela, Avenida de Barcelona, 15782, Santiago de Compostela, Spain
Angel Carracedo
Paris-Sud University, UMRS 1018, Cedex, 94807, Villejuif, France
Sylvie Cenee & Marie Sanchez
Hérault Cancer Registry, Montpellier cedex 5, Montpellier, 34298, France
Brigitte Tretarre
Urology Department, Clinique Beau Soleil, 34070, Montpellier, France
Xavier Rebillard
INSERM U1147, 75013, Paris, France
Claire Mulot
Department of Clinical Science, Intervention and Technology, Karolinska Institutet, SE-171 77, Stockholm, Sweden
Jan Adolfsson
Swedish Agency for Health Technology Assessment and Assessment of Social Services, SE-102 33, Stockholm, Sweden
Jan Adolfsson
Department of Surgical and Perioperative Sciences, Urology and Andrology, Umeå University, SE-901 85, Umeå, Sweden
Par Stattin
Department of Surgical Sciences, Uppsala University, SE-751 85, Uppsala, Sweden
Par Stattin
Department of Urology, Faculty of Medicine and Health, Örebro University, SE-701 82, Örebro, Sweden
Jan-Erik Johansson

Authors

Tokhir Dadaev
View author publications
You can also search for this author in PubMed Google Scholar
Edward J. Saunders
View author publications
You can also search for this author in PubMed Google Scholar
Paul J. Newcombe
View author publications
You can also search for this author in PubMed Google Scholar
Ezequiel Anokian
View author publications
You can also search for this author in PubMed Google Scholar
Daniel A. Leongamornlert
View author publications
You can also search for this author in PubMed Google Scholar
Mark N. Brook
View author publications
You can also search for this author in PubMed Google Scholar
Clara Cieza-Borrella
View author publications
You can also search for this author in PubMed Google Scholar
Martina Mijuskovic
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Wakerell
View author publications
You can also search for this author in PubMed Google Scholar
Ali Amin Al Olama
View author publications
You can also search for this author in PubMed Google Scholar
Fredrick R. Schumacher
View author publications
You can also search for this author in PubMed Google Scholar
Sonja I. Berndt
View author publications
You can also search for this author in PubMed Google Scholar
Sara Benlloch
View author publications
You can also search for this author in PubMed Google Scholar
Mahbubl Ahmed
View author publications
You can also search for this author in PubMed Google Scholar
Chee Goh
View author publications
You can also search for this author in PubMed Google Scholar
Xin Sheng
View author publications
You can also search for this author in PubMed Google Scholar
Zhuo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth Muir
View author publications
You can also search for this author in PubMed Google Scholar
Koveela Govindasami
View author publications
You can also search for this author in PubMed Google Scholar
Artitaya Lophatananon
View author publications
You can also search for this author in PubMed Google Scholar
Victoria L. Stevens
View author publications
You can also search for this author in PubMed Google Scholar
Susan M. Gapstur
View author publications
You can also search for this author in PubMed Google Scholar
Brian D. Carter
View author publications
You can also search for this author in PubMed Google Scholar
Catherine M. Tangen
View author publications
You can also search for this author in PubMed Google Scholar
Phyllis Goodman
View author publications
You can also search for this author in PubMed Google Scholar
Ian M. Thompson Jr.
View author publications
You can also search for this author in PubMed Google Scholar
Jyotsna Batra
View author publications
You can also search for this author in PubMed Google Scholar
Suzanne Chambers
View author publications
You can also search for this author in PubMed Google Scholar
Leire Moya
View author publications
You can also search for this author in PubMed Google Scholar
Judith Clements
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Horvath
View author publications
You can also search for this author in PubMed Google Scholar
Wayne Tilley
View author publications
You can also search for this author in PubMed Google Scholar
Gail Risbridger
View author publications
You can also search for this author in PubMed Google Scholar
Henrik Gronberg
View author publications
You can also search for this author in PubMed Google Scholar
Markus Aly
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Nordström
View author publications
You can also search for this author in PubMed Google Scholar
Paul Pharoah
View author publications
You can also search for this author in PubMed Google Scholar
Nora Pashayan
View author publications
You can also search for this author in PubMed Google Scholar
Johanna Schleutker
View author publications
You can also search for this author in PubMed Google Scholar
Teuvo L. J. Tammela
View author publications
You can also search for this author in PubMed Google Scholar
Csilla Sipeky
View author publications
You can also search for this author in PubMed Google Scholar
Anssi Auvinen
View author publications
You can also search for this author in PubMed Google Scholar
Demetrius Albanes
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Weinstein
View author publications
You can also search for this author in PubMed Google Scholar
Alicja Wolk
View author publications
You can also search for this author in PubMed Google Scholar
Niclas Hakansson
View author publications
You can also search for this author in PubMed Google Scholar
Catharine West
View author publications
You can also search for this author in PubMed Google Scholar
Alison M. Dunning
View author publications
You can also search for this author in PubMed Google Scholar
Neil Burnet
View author publications
You can also search for this author in PubMed Google Scholar
Lorelei Mucci
View author publications
You can also search for this author in PubMed Google Scholar
Edward Giovannucci
View author publications
You can also search for this author in PubMed Google Scholar
Gerald Andriole
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Cussenot
View author publications
You can also search for this author in PubMed Google Scholar
Géraldine Cancel-Tassin
View author publications
You can also search for this author in PubMed Google Scholar
Stella Koutros
View author publications
You can also search for this author in PubMed Google Scholar
Laura E. Beane Freeman
View author publications
You can also search for this author in PubMed Google Scholar
Karina Dalsgaard Sorensen
View author publications
You can also search for this author in PubMed Google Scholar
Torben Falck Orntoft
View author publications
You can also search for this author in PubMed Google Scholar
Michael Borre
View author publications
You can also search for this author in PubMed Google Scholar
Lovise Maehle
View author publications
You can also search for this author in PubMed Google Scholar
Eli Marie Grindedal
View author publications
You can also search for this author in PubMed Google Scholar
David E. Neal
View author publications
You can also search for this author in PubMed Google Scholar
Jenny L. Donovan
View author publications
You can also search for this author in PubMed Google Scholar
Freddie C. Hamdy
View author publications
You can also search for this author in PubMed Google Scholar
Richard M. Martin
View author publications
You can also search for this author in PubMed Google Scholar
Ruth C. Travis
View author publications
You can also search for this author in PubMed Google Scholar
Tim J. Key
View author publications
You can also search for this author in PubMed Google Scholar
Robert J. Hamilton
View author publications
You can also search for this author in PubMed Google Scholar
Neil E. Fleshner
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Finelli
View author publications
You can also search for this author in PubMed Google Scholar
Sue Ann Ingles
View author publications
You can also search for this author in PubMed Google Scholar
Mariana C. Stern
View author publications
You can also search for this author in PubMed Google Scholar
Barry Rosenstein
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Kerns
View author publications
You can also search for this author in PubMed Google Scholar
Harry Ostrer
View author publications
You can also search for this author in PubMed Google Scholar
Yong-Jie Lu
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Wei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ninghan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Xueying Mao
View author publications
You can also search for this author in PubMed Google Scholar
Xin Guo
View author publications
You can also search for this author in PubMed Google Scholar
Guomin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zan Sun
View author publications
You can also search for this author in PubMed Google Scholar
Graham G. Giles
View author publications
You can also search for this author in PubMed Google Scholar
Melissa C. Southey
View author publications
You can also search for this author in PubMed Google Scholar
Robert J. MacInnis
View author publications
You can also search for this author in PubMed Google Scholar
Liesel M. FitzGerald
View author publications
You can also search for this author in PubMed Google Scholar
Adam S. Kibel
View author publications
You can also search for this author in PubMed Google Scholar
Bettina F. Drake
View author publications
You can also search for this author in PubMed Google Scholar
Ana Vega
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Gómez-Caamaño
View author publications
You can also search for this author in PubMed Google Scholar
Laura Fachal
View author publications
You can also search for this author in PubMed Google Scholar
Robert Szulkin
View author publications
You can also search for this author in PubMed Google Scholar
Martin Eklund
View author publications
You can also search for this author in PubMed Google Scholar
Manolis Kogevinas
View author publications
You can also search for this author in PubMed Google Scholar
Javier Llorca
View author publications
You can also search for this author in PubMed Google Scholar
Gemma Castaño-Vinyals
View author publications
You can also search for this author in PubMed Google Scholar
Kathryn L. Penney
View author publications
You can also search for this author in PubMed Google Scholar
Meir Stampfer
View author publications
You can also search for this author in PubMed Google Scholar
Jong Y. Park
View author publications
You can also search for this author in PubMed Google Scholar
Thomas A. Sellers
View author publications
You can also search for this author in PubMed Google Scholar
Hui-Yi Lin
View author publications
You can also search for this author in PubMed Google Scholar
Janet L. Stanford
View author publications
You can also search for this author in PubMed Google Scholar
Cezary Cybulski
View author publications
You can also search for this author in PubMed Google Scholar
Dominika Wokolorczyk
View author publications
You can also search for this author in PubMed Google Scholar
Jan Lubinski
View author publications
You can also search for this author in PubMed Google Scholar
Elaine A. Ostrander
View author publications
You can also search for this author in PubMed Google Scholar
Milan S. Geybels
View author publications
You can also search for this author in PubMed Google Scholar
Børge G. Nordestgaard
View author publications
You can also search for this author in PubMed Google Scholar
Sune F. Nielsen
View author publications
You can also search for this author in PubMed Google Scholar
Maren Weisher
View author publications
You can also search for this author in PubMed Google Scholar
Rasmus Bisbjerg
View author publications
You can also search for this author in PubMed Google Scholar
Martin Andreas Røder
View author publications
You can also search for this author in PubMed Google Scholar
Peter Iversen
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Brenner
View author publications
You can also search for this author in PubMed Google Scholar
Katarina Cuk
View author publications
You can also search for this author in PubMed Google Scholar
Bernd Holleczek
View author publications
You can also search for this author in PubMed Google Scholar
Christiane Maier
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Luedeke
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Schnoeller
View author publications
You can also search for this author in PubMed Google Scholar
Jeri Kim
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. Logothetis
View author publications
You can also search for this author in PubMed Google Scholar
Esther M. John
View author publications
You can also search for this author in PubMed Google Scholar
Manuel R. Teixeira
View author publications
You can also search for this author in PubMed Google Scholar
Paula Paulo
View author publications
You can also search for this author in PubMed Google Scholar
Marta Cardoso
View author publications
You can also search for this author in PubMed Google Scholar
Susan L. Neuhausen
View author publications
You can also search for this author in PubMed Google Scholar
Linda Steele
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Chun Ding
View author publications
You can also search for this author in PubMed Google Scholar
Kim De Ruyck
View author publications
You can also search for this author in PubMed Google Scholar
Gert De Meerleer
View author publications
You can also search for this author in PubMed Google Scholar
Piet Ost
View author publications
You can also search for this author in PubMed Google Scholar
Azad Razack
View author publications
You can also search for this author in PubMed Google Scholar
Jasmine Lim
View author publications
You can also search for this author in PubMed Google Scholar
Soo-Hwang Teo
View author publications
You can also search for this author in PubMed Google Scholar
Daniel W. Lin
View author publications
You can also search for this author in PubMed Google Scholar
Lisa F. Newcomb
View author publications
You can also search for this author in PubMed Google Scholar
Davor Lessel
View author publications
You can also search for this author in PubMed Google Scholar
Marija Gamulin
View author publications
You can also search for this author in PubMed Google Scholar
Tomislav Kulis
View author publications
You can also search for this author in PubMed Google Scholar
Radka Kaneva
View author publications
You can also search for this author in PubMed Google Scholar
Nawaid Usmani
View author publications
You can also search for this author in PubMed Google Scholar
Chavdar Slavov
View author publications
You can also search for this author in PubMed Google Scholar
Vanio Mitev
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Parliament
View author publications
You can also search for this author in PubMed Google Scholar
Sandeep Singhal
View author publications
You can also search for this author in PubMed Google Scholar
Frank Claessens
View author publications
You can also search for this author in PubMed Google Scholar
Steven Joniau
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Van den Broeck
View author publications
You can also search for this author in PubMed Google Scholar
Samantha Larkin
View author publications
You can also search for this author in PubMed Google Scholar
Paul A. Townsend
View author publications
You can also search for this author in PubMed Google Scholar
Claire Aukim-Hastie
View author publications
You can also search for this author in PubMed Google Scholar
Manuela Gago-Dominguez
View author publications
You can also search for this author in PubMed Google Scholar
Jose Esteban Castelao
View author publications
You can also search for this author in PubMed Google Scholar
Maria Elena Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Monique J. Roobol
View author publications
You can also search for this author in PubMed Google Scholar
Guido Jenster
View author publications
You can also search for this author in PubMed Google Scholar
Ron H. N. van Schaik
View author publications
You can also search for this author in PubMed Google Scholar
Florence Menegaux
View author publications
You can also search for this author in PubMed Google Scholar
Thérèse Truong
View author publications
You can also search for this author in PubMed Google Scholar
Yves Akoli Koudou
View author publications
You can also search for this author in PubMed Google Scholar
Jianfeng Xu
View author publications
You can also search for this author in PubMed Google Scholar
Kay-Tee Khaw
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Cannon-Albright
View author publications
You can also search for this author in PubMed Google Scholar
Hardev Pandha
View author publications
You can also search for this author in PubMed Google Scholar
Agnieszka Michael
View author publications
You can also search for this author in PubMed Google Scholar
Andrzej Kierzek
View author publications
You can also search for this author in PubMed Google Scholar
Stephen N. Thibodeau
View author publications
You can also search for this author in PubMed Google Scholar
Shannon K. McDonnell
View author publications
You can also search for this author in PubMed Google Scholar
Daniel J. Schaid
View author publications
You can also search for this author in PubMed Google Scholar
Sara Lindstrom
View author publications
You can also search for this author in PubMed Google Scholar
Constance Turman
View author publications
You can also search for this author in PubMed Google Scholar
Jing Ma
View author publications
You can also search for this author in PubMed Google Scholar
David J. Hunter
View author publications
You can also search for this author in PubMed Google Scholar
Elio Riboli
View author publications
You can also search for this author in PubMed Google Scholar
Afshan Siddiq
View author publications
You can also search for this author in PubMed Google Scholar
Federico Canzian
View author publications
You can also search for this author in PubMed Google Scholar
Laurence N. Kolonel
View author publications
You can also search for this author in PubMed Google Scholar
Loic Le Marchand
View author publications
You can also search for this author in PubMed Google Scholar
Robert N. Hoover
View author publications
You can also search for this author in PubMed Google Scholar
Mitchell J. Machiela
View author publications
You can also search for this author in PubMed Google Scholar
Peter Kraft
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Freedman
View author publications
You can also search for this author in PubMed Google Scholar
Fredrik Wiklund
View author publications
You can also search for this author in PubMed Google Scholar
Stephen Chanock
View author publications
You can also search for this author in PubMed Google Scholar
Brian E. Henderson
View author publications
You can also search for this author in PubMed Google Scholar
Douglas F. Easton
View author publications
You can also search for this author in PubMed Google Scholar
Christopher A. Haiman
View author publications
You can also search for this author in PubMed Google Scholar
Rosalind A. Eeles
View author publications
You can also search for this author in PubMed Google Scholar
David V. Conti
View author publications
You can also search for this author in PubMed Google Scholar
Zsofia Kote-Jarai
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

The PRACTICAL (Prostate Cancer Association Group to Investigate Cancer-Associated Alterations in the Genome) Consortium

Margaret Cook
, Alison Thwaites
, Michelle Guy
, Ian Whitmore
, Angela Morgan
, Cyril Fisher
, Steve Hazel
, Naomi Livni
, Amanda Spurdle
, Srilakshmi Srinivasan
, Mary-Anne Kedda
, Joanne Aitken
, Robert Gardiner
, Vanessa Hayes
, Lisa Butler
, Renea Taylor
, Trina Yeadon
, Allison Eckert
, Pamela Saunders
, Anne-Maree Haynes
, Melissa Papargiris
, Paula Kujala
, Kirsi Talala
, Teemu Murtola
, Kimmo Taari
, David Dearnaley
, Gill Barnett
, Søren Bentzen
, Rebecca Elliott
, Hardeep Ranu
, Belynda Hicks
, Aurelie Vogt
, Amy Hutchinson
, Angela Cox
, Michael Davis
, Paul Brown
, Anne George
, Gemma Marsden
, Athene Lane
, Sarah J. Lewis
, Clare Berry
, Girish S. Kulkarni
, Ants Toi
, Andrew Evans
, Alexandre R. Zlotta
, Theodorus H. van der Kwast
, Takashi Imai
, Shiro Saito
, Jacek Marzec
, Guangwen Cao
, Ji Lin
, Jin Ling
, Meiling Li
, Shan-Chao Zhao
, Guoping Ren
, Yongwei Yu
, Yudong Wu
, Ji Wu
, Bo Zhou
, Yangling Zhang
, Jie Li
, Weiyang He
, Jianming Guo
, John Pedersen
, John L. Hopper
, Roger Milne
, Aleksandra Klim
, Ana Carballo
, Ramón Lobato-Busto
, Paula Peleteiro
, Patricia Calvo
, Miguel Aguado
, José Manuel Ruiz-Dominguez
, Lluís Cecchini
, Lourdes Mengual
, Antonio Alcaraz
, Mariona Bustamante
, Esther Gracia-Lavedan
, Trinidad Dierssen-Sotos
, Ines Gomez-Acebo
, Julio Pow-Sang
, Hyun Park
, Babu Zachariah
, Wojciech Kluzniak
, Suzanne Kolb
, Peter Klarskov
, Christa Stegmaier
, Walther Vogel
, Kathleen Herkommer
, Philipp Bohnert
, Sofia Maia
, Maria P. Silva
, Sofie De Langhe
, Hubert Thierens
, Meng H. Tan
, Aik T. Ong
, Zeljko Kastelan
, Elenko Popov
, Darina Kachakova
, Atanaska Mitkova
, Aleksandrina Vlahova
, Tihomir Dikov
, Svetlana Christova
, Angel Carracedo
, Christopher Bangma
, F. H. Schroder
, Sylvie Cenee
, Brigitte Tretarre
, Xavier Rebillard
, Claire Mulot
, Marie Sanchez
, Jan Adolfsson
, Par Stattin
, Jan-Erik Johansson
, Carin Cavalli-Bjoerkman
, Ami Karlsson
, Michael Broms
, Huihai Wu
, Lori Tillmans
& Shaun Riska

Contributions

The contributions of each author to this study are as follows. Performance of fine-map analyses: T.D., E.J.S., P.J.N., E.A., D.A.L., M.N.B., C.C.B., M.M., S.Wa., Z.Z., D.V.C. and Z.K.J. Performance of variant annotation: E.J.S., E.A., M.M., C.C.B. and S.Wa. OncoArray chip design (PrCa content): T.D., E.J.S., D.A.L., A.A.O., F.R.S., X.S., P.K., F.W., Ste.C., B.E.H., D.F.E, C.A.Hai., R.A.E. and Z.K.J. Provision of DNA samples and/or phenotypic data: T.D., E.J.S., D.A.L., K.M., K.G., A.L., V.L.S., S.M.G., B.D.C., C.M.T., P.G., I.M.T., J.B., Suz.C., L.Mo., J.C., L.H., W.T., G.R., H.G., M.Al., T.N., P.Ph., N.P., J.S., T.L.T., C.Si., A.A., D.A., S.We., A.W., N.H., C.W., A.M.D., N.B., L.Mu., E.G., G.A., O.C., G.C.T., S.Ko., L.E.B., K.D.S., T.F.O., M.B., L.Ma., E.M.G., D.E.N., J.L.D., F.C.H., R.M.M., R.C.T., T.J.K., R.J.H., N.E.F., A.F., S.A.I., M.C.St., B.R., S.Ke., H.O., Y.J.L., H.W.Z., N.F., X.M., X.G., G.W., Z.S., G.G.G., M.C.So., R.J.M., L.M.F., A.S.K., B.F.D., A.V., A.G.C., L.F., R.S., M.E., M.K., J.Ll., G.C.V., K.L.P., M.S., J.Y.P., T.A.S., H.Y.L., J.L.S., C.Cy., D.W., J.Lu., E.A.O., M.S.G., B.G.N., S.F.N., M.W., R.B., M.A.R., P.I., H.B., K.C., B.H., C.M., M.L., T.S., J.K., C.J.L., E.M.J., M.R.T., P.Pa., M.C., S.L.N., L.S., Y.C.D., K.D.R., G.D.M., P.O., A.R., J.Li., S.H.T., D.W.L., L.F.N., D.L., M.G., T.K., R.K., N.U., C.Sl., V.M., M.P., S.S., F.Cl., S.J., T.Vd., S.La., P.A.T., C.A.Has., M.G.D., J.E.C., M.E.M., M.J.R., G.J., R.Hv., F.M., T.T., Y.A.K., J.X., K.T.K., L.C., H.P., A.M., A.K., S.N.T., S.K.M., D.J.S., S.Li., C.T., J.M., D.J.H., E.R., A.S., F.Ca., L.N.K., L.L.M., R.N.H., M.J.M., P.K., The PRACTICAL Consortium, F.W. and Z.K.J. Data management for meta-analysis: S.B. and X.S. Data QC for meta-analysis: T.D., E.J.S., A.A.O., F.R.S., S.I.B. and T.T. Imputation for meta-analysis: A.A.O., F.R.S., S.I.B. and L.F. Coordination of meta-analysis project: S.B., M.Ah., C.G., M.F., Ste.C., B.E.H., D.F.E., C.A.Hai., R.A.E. and Z.K.J. Provision of genotype data for individual GWAS sub-studies in meta-analysis: P.K., F.W., Ste.C., B.E.H., D.F.E., C.A.Hai., R.A.E. and Z.K.J. Writing of manuscript: E.J.S., T.D., P.J.N., D.V.C. and Z.K.J.

Corresponding author

Correspondence to Zsofia Kote-Jarai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A full list of consortium members appears at the end of the paper.

Electronic supplementary material

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dadaev, T., Saunders, E.J., Newcombe, P.J. et al. Fine-mapping of prostate cancer susceptibility loci in a large meta-analysis identifies candidate causal variants. Nat Commun 9, 2256 (2018). https://doi.org/10.1038/s41467-018-04109-8

Download citation

Received: 28 April 2017
Accepted: 05 April 2018
Published: 11 June 2018
DOI: https://doi.org/10.1038/s41467-018-04109-8

This article is cited by

ANO7 African-ancestral genomic diversity and advanced prostate cancer
- Jue Jiang
- Pamela X. Y. Soh
- Weerachai Jaratlerdsiri
Prostate Cancer and Prostatic Diseases (2023)
A biallelic multiple nucleotide length polymorphism explains functional causality at 5p15.33 prostate cancer risk locus
- Sandor Spisak
- Viktoria Tisza
- Matthew L. Freedman
Nature Communications (2023)
Androgen receptor binding sites enabling genetic prediction of mortality due to prostate cancer in cancer-free subjects
- Shuji Ito
- Xiaoxi Liu
- Chikashi Terao
Nature Communications (2023)
Characterizing prostate cancer risk through multi-ancestry genome-wide discovery of 187 novel risk variants
- Anqi Wang
- Jiayi Shen
- Christopher A. Haiman
Nature Genetics (2023)
Integrating transcription factor occupancy with transcriptome-wide association analysis identifies susceptibility genes in human cancers
- Jingni He
- Wanqing Wen
- Xingyi Guo
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Replication of reported associations prior to fine-mapping

Multivariate fine-mapping from univariate summary statistics

Integration of annotation

Fine-mapping resolution

Comparison with African Ancestry meta-analysis results

Estimating the GWAS loci contribution to FRR of PrCa

Discussion

Methods

Identification of PrCa risk loci to fine-map

Selection of SNPs for fine-mapping on the OncoArray

Meta-analysis and imputation

Multivariate fine-mapping towards putative causal variants

Annotation of variants for functional features

eQTL analysis

Quantile regression

Proportion of familial risk explained

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

The PRACTICAL (Prostate Cancer Association Group to Investigate Cancer-Associated Alterations in the Genome) Consortium

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links