Genome-Wide Association of Bipolar Disorder in European American and African American

Bipolar GWAStudy

SUPPLEMENTAL MATERIAL

Study Subjects

Bipolar cases. Cases were selected from those collected and characterized by the Bipolar Consortium over the past 18 years. These subjects were collected in 5 waves. Waves 1-4 were families collected for linkage studies, while wave 5 is a large set of primarily unrelated cases collected for large-scale association studies.

Waves 1-4 comprise the Bipolar Family Dataset (BFD), which includes 2,936 subjects from 646 pedigrees, each ascertained via a proband with a Bipolar I (BPI) diagnosis and an additional first-degree relative with a BPI or Schizoaffective Disorder, Bipolar Type (SABP) diagnosis. All subjects were diagnosed with a standard best estimate (BEFD) procedure (see below). For the BiGS GWA study we selected unrelated Diagnostic and Statistical Manual (DSM) IV-defined BPI subjects (1) from these families. Waves 1 and 2 involved recruitment of primarily large pedigrees through four sites (Indiana, Johns Hopkins, the NIMH Intramural Program, and Washington University at St. Louis). In Waves 3 and 4, smaller pedigrees with a minimal ascertainment criterion of a BPI-BPI affected sibling pair were recruited through an expanded series of sites (i.e., including those above, as well as University of Pennsylvania, University of California at San Diego, University of California at Irvine, University of California at San Francisco, University of Iowa, University of Chicago, and Rush University). BiGS subjects include 175 unrelated European American (EA) cases from Waves 1 and 2 and 396 unrelated cases from Waves 3 and 4. In addition, the BiGS sample included 430 cases from the ‘Wave 5’ data collection, which included 4,089 DNA samples from 3,655 families ascertained through probands with DSM IV-defined BPI disorder.

The BiGS cases abstracted from Waves 1-5 of the BiGS consortium dataset totaled 1,041 EA individuals. EA status was determined based on the subject’s self-report that all four grandparents were of EA heritage. Forty EA subjects were removed due to a non-BPI/SABP best estimate diagnosis or low diagnostic confidence. The final BiGS sample thus included 1,001 EA cases, of which 951 had a diagnosis of BPI and 50 had a diagnosis of SABP. African American (AA) status was based on self-report of at least one grandparent being of AA heritage. A total of 345 of these subjects were ultimately included in the BiGS analyses after review of best estimate diagnoses; these included 315 BPI AA cases and 30 SABP AA cases.

Controls. Controls were ascertained separately through a NIMH-supported contract mechanism between Dr. Pablo Gejman and Knowledge Networks, Inc.; this mechanism allowed the ascertainment of 4,586 subjects across the U.S. who agreed to donate a blood sample for transformation into lymphoblastoid cell lines and to respond to a medical questionnaire. All participating subjects, including 3,303 EA and 1,283 AA, were given the questionnaire. Only individuals with complete or near-complete psychiatric questionnaire data who did not fulfill diagnostic criteria for major depression and denied a history of psychosis or bipolar disorder (BD) were included as controls for the BiGS analyses. Potential controls were matched for gender and ethnicity with the BiGS EA cases, and the control counts were 1,034 EA subjects and 716 AA subjects.

Clinical Assessment. All case subjects were interviewed with the Diagnostic Interview for Genetic Studies (DIGS) (2). The DIGS was revised (DIGS 4.0) between Waves 4 and 5 to allow collection of additional data on posttraumatic stress disorder and adult attention deficit disorder, as well as additional phenotypic information on BD. A change was also made in the ‘Best Estimate Final Diagnosis’ (BEFD) process at the start of Wave 5 to incorporate clinician judgment of multiple phenotypic indicators. These included diagnosis by DSM IV, DSM IIIR, and the Research Diagnostic Criteria (RDC) (3), as well as age of onset, number of episodes for depression, hypomania and mania, temporal relationship of mood disorder to substance abuse and psychosis, evidence of mixed episodes and rapid cycling, and a summary of the family history information. All of these indicators were scored independently by a senior clinician (generally a psychiatrist) based on all available information, including medical records, interviewer observations, the coded DIGS, and the Family Instrument for Genetic Studies (‘FIGS,’ developed for the NIMH Genetics Initiative). The FIGS incorporates clinician judgment on family patterns of illness, including presence or absence of BD, unipolar disorder, and/or other psychiatric disorders in first- and second-degree relatives. Final BEFD judgments based on all available criteria, including level of interviewer agreement and certainty, are available for all BiGS subjects. In addition, all genotypic and phenotypic data for the BiGS subjects, as detailed above, are available through the GAIN dbGaP database and the NIMH data repository. This resource for the study of BD has been widely utilized by academic groups both within and outside of the United States, as well as by pharmaceutical and biotechnology companies.
Additional subject self-report data, including the Akiskal Temperament Scale (4), the Basic Language Morningness Scale (5), the Childhood Life Events Scale (Lawson & Gershon, unpublished), the Lifetime History of Aggression measure (6), the Questionnaire on Genetic Risk (Nurnberger & Lawson, unpublished), the Temperament and Character Inventory (7), a Visual Analogue Scale on Mood, the Wender Attention Deficit Scale (8), and the Zuckerman-Kuhlman Personality Questionnaire (9), are not yet publicly posted but are available from the authors upon request.

Genotyping and Quality Control

Genotyping and quality control of data available on dbGaP. Genotyping was carried out by The Broad Institute Center for Genotyping and Analysis. DNA quantity was checked using PicoGreen fluorometry, and sample quality was initially assessed by genotyping a 24-SNP panel on the Sequenom iPLEX platform, which contains a sex-determining assay. Samples were plated at 50 ng/µl in 96-well plates at the Rutgers University Cell and DNA Repository. In addition, the Centre d'Etude du Polymorphisme Humain (CEPH) sample NA12144 was placed on each production plate at the Broad Institute. Because this Bipolar project and the Genome-Wide Association Study of Schizophrenia (dbGaP; Study Accession: phs000021.v2.p1) shared controls, bipolar cases were often genotyped on different plates than controls. This unbalanced distribution of BD cases and controls produced systematic differences in the processing that may have affected allele calling. One plate, containing only 16 EA BD samples, was not included in the analysis because allele calling for many single nucleotide polymorphisms (SNPs) was aberrant. Genotyping of the EA and AA samples was carried out separately, using the Affymetrix Genome-Wide Human SNP Array 6.0 (10). Samples for which fewer than 86% of the quality control (FQC) SNPs produced genotypes were rerun. Allele calling was performed using the BirdSeed algorithm (11) in Affymetrix Power Tools version apt-1.8.6 with a cluster models (‘priors’) file. Scans from the same production plate were clustered together. Concordance between genotypes from the array and those from the initial QC panel was evaluated to confirm sample ID.

Further quality controls were carried out separately for the EA and AA samples. Samples were not used in the analysis if they failed any of several quality metrics: low call rate (below 98.5% for EA and 97.8% for AA), excessively high or low heterozygosity (outside the range 0.344 to 0.363 for EA and 0.29 to 0.324 for AA), or incompatibility between reported gender and genetically determined gender. Samples were also checked for unexpected familial relationships using pairwise IBD estimation in PLINK (12). SNPs were not analyzed if the minor allele frequency was <0.01, if the call rate was <95%, if the SNP violated Hardy-Weinberg equilibrium (p < 1 x 10^-6) in control samples within an ancestry group, if there were 3 or more Mendelian errors, or if there was more than one discrepancy among duplicate samples. Each plate in the study, including those in the GAIN Schizophrenia GWA study, was compared to all other plates. SNP allele frequency variation between the plates was examined using a chi-squared association test. This was performed with PLINK by setting all the individuals on one plate to “case” and all other individuals to “control”. If the association p-value was less than 10^-8 for any one plate or less than 10^-4 for two or more plates, the SNP was removed. This resulted in 39,817 SNPs being removed from EA and 8,026 SNPs being removed from AA. The total number of SNPs passing all initial QC tests was 729,454 for EA and 845,814 for AA.
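
The plate-based removal rule can be sketched as follows. This is a minimal illustration rather than the pipeline actually used: it assumes the per-plate chi-squared p-values have already been computed (for example with PLINK, coding one plate at a time as “case”), and the function name `fails_plate_test`, the SNP names, and the p-values are hypothetical.

```python
def fails_plate_test(plate_pvalues, single_threshold=1e-8, multi_threshold=1e-4):
    """Return True if a SNP should be removed for plate effects.

    plate_pvalues holds one plate-vs-all-other-plates association
    p-value per plate. Rule from the text: remove the SNP if
    p < 1e-8 for any one plate, or p < 1e-4 for two or more plates.
    """
    any_extreme = any(p < single_threshold for p in plate_pvalues)
    n_moderate = sum(1 for p in plate_pvalues if p < multi_threshold)
    return any_extreme or n_moderate >= 2


# Hypothetical p-values for three SNPs across four plates.
snps = {
    "snp_a": [0.5, 0.2, 3e-9, 0.9],    # one plate below 1e-8
    "snp_b": [5e-5, 0.3, 2e-5, 0.7],   # two plates below 1e-4
    "snp_c": [5e-5, 0.3, 0.4, 0.7],    # only one plate below 1e-4
}
removed = sorted(name for name, pvals in snps.items() if fails_plate_test(pvals))
print(removed)
```

In this toy input, only `snp_c` survives, because a single moderately deviant plate does not meet either threshold.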

Additional Quality Control after downloading of data from dbGaP

SNP quality assessment. Data were downloaded from the database of Genotypes and Phenotypes (dbGaP). Each dataset (i.e., EA and AA) underwent a second round of quality control limited to the samples of interest for our BD GWA study. Since we analyzed only a subset of the genotyped samples (i.e., the BD and control samples), we reapplied similar QC thresholds to the final set of individuals, retaining SNPs that met the following criteria: < 1 homozygote to homozygote error in all duplicates; < 2 homozygote to heterozygote errors in all duplicates; > 90% SNP call rate; minor allele frequency > 0.01; and no Hardy-Weinberg disequilibrium in control samples within an ancestry group (defined as p < 1 x 10^-6). We tested whether plate effects were due to problems with genotype calling that were driven by a handful of poorly genotyped individual samples (i.e., individuals that may have altered genotype clustering based on allelic intensity values). We also merged our data with data obtained for the GAIN Schizophrenia GWA study, removed all individuals that had been removed for any QC reason, and recalled all the genotypes by plate using Birdseed version 2 in Affymetrix Power Tools (version 1.10.0). This procedure did not remove the observed plate effect, and so the genotypes as originally called were used in the final analysis.

The final dataset consisted of 724,067 SNPs in the EA dataset and 840,730 SNPs in the AA dataset. The two datasets had 702,044 SNPs in common, and these were used to perform analyses addressing SNP and SNP x genetic background interactions in the combined sample.

Analysis of SNP genotypes for individuals. Individual ancestry and admixture levels were assessed by the program Local Ancestry in Admixed Populations (LAMP) (13). Admixture was assessed for both the EA and AA samples. LAMP uses a sliding window approach to estimate ancestry of local regions of the genome, which can then be averaged to obtain genome-wide levels of admixture. It is most effective when analyzing populations of recent admixture, such as African Americans, and has been shown to be faster and more accurate than STRUCTURE (13). We used the HapMap (version 2, release 23a) CEU and YRI collections as ancestral populations.
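
The step from local to genome-wide ancestry can be illustrated with a short sketch. It assumes that the per-window ancestry proportions for one individual are already available as numbers between 0 and 1; the input layout and the function name `genome_wide_ancestry` are hypothetical and do not reflect LAMP's actual output format. An unweighted mean is used here; weighting windows by length would be an equally plausible choice.

```python
def genome_wide_ancestry(window_estimates):
    """Average per-window ancestry proportions (each in [0, 1])
    into a single genome-wide admixture estimate."""
    if not window_estimates:
        raise ValueError("need at least one window estimate")
    if any(not 0.0 <= w <= 1.0 for w in window_estimates):
        raise ValueError("window estimates must lie in [0, 1]")
    return sum(window_estimates) / len(window_estimates)


# Hypothetical local African-ancestry proportions across eight windows.
windows = [1.0, 1.0, 0.5, 0.5, 1.0, 0.5, 1.0, 0.5]
admixture = genome_wide_ancestry(windows)
print(admixture)
```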

Population stratification. We performed analyses to control for population stratification using several available methods, including genomic control-adjusted p-values, logistic regression analyses using a LAMP-generated ancestry estimate as a covariate, and logistic regression analyses using multidimensional scaling values (i.e., the top 4 coordinates) as covariates. We also compared LAMP admixture estimates to admixture estimates generated with STRUCTURE v2.2 (14, 15) based on the use of either the same ancestral populations (i.e., HapMap subjects) or based on an alternative set consisting of subjects from the Human Genome Diversity Panel (HGDP) (e.g., see 16). Specifically, a random panel of 3,505 SNPs present in all datasets was chosen from chromosome 1 with intermarker distances of >15,000 bp. In order to avoid strand issues between markers genotyped on different platforms, CG and AT SNPs were excluded. Ancestral allele frequencies were estimated using either genotypes from 60 CEPH and 60 Yoruban subjects from HapMap (version 2, release 23a) or genotypes from 158 European and 102 African subjects from the HGDP. All STRUCTURE runs were performed using a burn-in period of 5,000 iterations, followed by 5,000 MCMC repeats, and were based on the admixture model and the F model of correlation in allele frequencies across clusters. A supervised analysis with K=2 clusters was performed with individuals from Europe and Africa forced into separate clusters. Individual admixture was estimated as the average membership coefficients across five replicate runs. LAMP and STRUCTURE ancestry estimates were similar (Figure S4). Ancestry estimates were not affected by the choice of HapMap or HGDP populations.
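
The two marker-selection constraints described for the STRUCTURE panel (no C/G or A/T SNPs, and more than 15,000 bp between neighboring markers) can be sketched as below. This is an illustration under stated assumptions, not the selection procedure actually used: the panel in the study was drawn at random, whereas this sketch uses a simple greedy left-to-right pass, and the tuple layout, SNP names, and function names are hypothetical.

```python
# Allele pairs whose strand cannot be resolved from the alleles alone.
AMBIGUOUS_PAIRS = {frozenset("CG"), frozenset("AT")}


def is_strand_ambiguous(allele1, allele2):
    """True for C/G and A/T SNPs, which read the same on both strands."""
    return frozenset((allele1.upper(), allele2.upper())) in AMBIGUOUS_PAIRS


def thin_panel(snps, min_gap=15_000):
    """Drop strand-ambiguous SNPs, then keep SNPs whose distance from
    the previously kept SNP exceeds min_gap base pairs.

    snps: iterable of (name, position, allele1, allele2) on one chromosome.
    """
    kept = []
    last_pos = None
    for snp in sorted(snps, key=lambda s: s[1]):
        name, pos, a1, a2 = snp
        if is_strand_ambiguous(a1, a2):
            continue
        if last_pos is None or pos - last_pos > min_gap:
            kept.append(snp)
            last_pos = pos
    return kept


# Hypothetical chromosome 1 markers.
candidates = [
    ("rs_a", 1_000, "A", "G"),
    ("rs_b", 9_000, "C", "T"),    # too close to rs_a
    ("rs_c", 20_000, "C", "G"),   # strand-ambiguous
    ("rs_d", 30_000, "A", "C"),
    ("rs_e", 46_001, "T", "A"),   # strand-ambiguous
    ("rs_f", 50_000, "G", "T"),
]
panel = thin_panel(candidates)
print([snp[0] for snp in panel])
```

A random subsample of the thinned list (for example with `random.sample`) would bring the sketch closer to the random panel described in the text.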

Replication Genotyping

A subset of 85 SNPs from our BD GWA study was selected for replication genotyping based on several criteria, with a primary focus on the allelic association p-value in the larger EA sample. Other factors considered in this prioritization were statistical support for association from neighboring SNPs and evidence for association in the AA GWA study sample. The 85 SNPs were genotyped using Sequenom mass spectrometry technology in 1,749 EA subjects from 250 multiplex bipolar families which have been previously described (17). There was a modest amount of overlap between individuals in this familial sample and individuals in our BD GWA study EA sample, with 199 EA cases drawn from the familial cohort. The SNPs in this set were also genotyped using the same technique in an independent EA cohort consisting of 1,263 BPI cases and 431 controls. Family-based transmission analysis was performed in the family sample as previously described (17), using both a strict (affecteds-only) and a more permissive definition (individuals not meeting criteria for BPI, BPII, or recurrent depression) of unaffected status. The case-control cohort was analyzed in the same fashion as our BD GWA study described above. Meta-analysis was performed to evaluate evidence for association across the primary EA sample and the two replication genotyping cohorts using the METAL method (Willer and Abecasis). Effective sample sizes for each of the three studies were obtained using a published method (18).
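
For orientation, a sample-size-weighted Z-score meta-analysis of the kind METAL offers can be written as Z = sum(sqrt(N_i) * Z_i) / sqrt(sum(N_i)), where Z_i is the signed Z-score of study i and N_i its effective sample size. The sketch below assumes this weighting scheme was the one applied; the text does not state which METAL scheme was used, and the computation of effective sample sizes (reference 18) is not reproduced here. The function names and the example numbers are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def z_from_p(p_value, direction):
    """Convert a two-sided p-value (0 < p <= 1) and an effect
    direction (+1 or -1) into a signed Z-score."""
    return direction * _STD_NORMAL.inv_cdf(1.0 - p_value / 2.0)


def sample_size_weighted_meta(studies):
    """Combine studies given as (p_value, direction, effective_n).

    Each study's Z-score is weighted by the square root of its
    effective sample size. Returns (combined Z, two-sided p-value).
    """
    weighted_sum = 0.0
    total_n = 0.0
    for p_value, direction, effective_n in studies:
        weighted_sum += sqrt(effective_n) * z_from_p(p_value, direction)
        total_n += effective_n
    z = weighted_sum / sqrt(total_n)
    p_combined = 2.0 * _STD_NORMAL.cdf(-abs(z))
    return z, p_combined


# Hypothetical inputs: a primary sample and two replication cohorts.
z, p = sample_size_weighted_meta([
    (1e-4, +1, 2000),
    (0.03, +1, 900),
    (0.20, -1, 600),
])
print(f"Z = {z:.3f}, p = {p:.3g}")
```

Studies pointing in opposite directions partially cancel, which is why the direction of effect must be supplied alongside each p-value.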

Statistical Analysis

Imputation. Imputation was performed for the cleaned EA dataset using MACH v.1.0.16 with HapMap rel21 phased haplotypes as a reference. Model parameters were estimated with a random subset of 200 individuals before imputation on the entire dataset. Association between estimated genotype expectations and BD was tested in R using logistic regression, with the top 4 MDS components as covariates.
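
An estimated genotype expectation (often called a dosage) is the expected number of copies of one allele given the imputed genotype probabilities. The sketch below shows that calculation only; the logistic regression step is not reproduced. The input layout (three posterior probabilities per individual) and the function name `expected_dosage` are assumptions made for illustration and are not MACH's file format.

```python
def expected_dosage(p_hom_ref, p_het, p_hom_alt, tol=1e-6):
    """Expected count (0 to 2) of the alternate allele, given the
    posterior probabilities of the three possible genotypes."""
    total = p_hom_ref + p_het + p_hom_alt
    if abs(total - 1.0) > tol:
        raise ValueError("genotype probabilities must sum to 1")
    # Heterozygotes carry one copy, alternate homozygotes carry two.
    return p_het + 2.0 * p_hom_alt


# Hypothetical posterior probabilities for three individuals.
individuals = [
    (0.70, 0.20, 0.10),
    (0.25, 0.50, 0.25),
    (0.00, 0.00, 1.00),
]
dosages = [expected_dosage(*probs) for probs in individuals]
print(dosages)
```

Using the expectation rather than the single most likely genotype carries imputation uncertainty into the regression as a continuous predictor.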

Supplemental Table 1 (S1): Subject demographics.

Demographic / Category / African Ancestry Cases (N = 345) / African Ancestry Controls (N = 670) / European Ancestry Cases (N = 1,001) / European Ancestry Controls (N = 1,033)
Gender / Female / 239 (69%) / 398 (59%) / 501 (50%) / 501 (48%)
Gender / Male / 106 (31%) / 272 (41%) / 500 (50%) / 532 (52%)
Diagnosis / Bipolar I (BPI) / 315 (91%) / -- / 951 (95%) / --
Diagnosis / Schizoaffective disorder, bipolar type (SABP) / 30 (9%) / -- / 50 (5%) / --
Age at onset or study entry / ≤ 18 years / 201 (58%) / 3 (0.4%) / 541 (54%) / 10 (1%)
Age at onset or study entry / > 18 years / 131 (38%) / 667 (99.6%) / 426 (43%) / 1023 (99%)
Age at onset or study entry / Unknown / 13 (4%) / 0 (0%) / 34 (3%) / 0 (0%)
Age at onset or study entry / Average/median / 18.0/16 / 45.8/46 / 19.3/18 / 52.2/53

Supplemental Table 2 (S2): Consistency of top hits across the different study samples.

Analysis / SNP/Allele / Location / MAF (AA/EA) / AA LAMP OR / AA LAMP p-value / EA MDS OR / EA MDS p-value / EA/AA Combined Main Effect MDS OR / EA/AA Combined Main Effect MDS p-value
AA LAMP adjusted / rs2111504/T / DPY19L3 (19q13.11) / 0.23/0.17 / 1.73 / 1.7 x 10^-6 / 1.12 / 0.20 / 1.31 / 8.6 x 10^-5
AA LAMP adjusted / rs2769605/T / NTRK2 (9q21.33) / 0.22/0.57 / 0.61 / 4.5 x 10^-5 / 0.92 / 0.20 / 0.84 / 1.5 x 10^-3
EA MDS adjusted / rs1825828/C / 3q11.2 / 0.58/0.26 / 0.91 / 0.33 / 0.70 / 7.0 x 10^-7 / 0.77 / 5.3 x 10^-6
EA MDS adjusted / rs5907577/T / Xq27.1 / 0.10/0.31 / 1.08 / 0.67 / 1.48 / 1.6 x 10^-6 / 1.27 / 9.7 x 10^-5
EA MDS adjusted / rs10193871/G / NAP5 (2q21.2) / 0.06/0.13 / 0.75 / 0.18 / 0.65 / 9.8 x 10^-6 / 0.67 / 4.2 x 10^-6
EA/AA Combined Main Effect MDS adjusted / rs4825220/C / Xq27.1 / 0.1/0.52 / 0.62 / 0.01 / 0.74 / 2.8 x 10^-5 / 0.71 / 2.6 x 10^-7

We investigated the association for our top SNPs in each sub-sample within the other sub-samples studied. This table shows, for each SNP identified, the association score in each of our three sub-samples.

Supplemental Table 3 (S3): Annotation of regions showing high haplotype heterogeneity.

Chromosome / Start / End / Genes within/nearby region
1 / 196366461 / 196672977 / NEK7
1 / 225792670 / 226014057 / ZNF678, C1orf142, JMJD4
6 / 64735225 / 64943243 / PHF3
6 / 98667547 / 98821885 / POU3F2
7 / 46118553 / 46178058 / IGFBP3
8 / 109360409 / 109478088 / EIF3E, TTC35
10 / 61748209 / 61913110 / ANK3
11 / 104024629 / 104164746 / PDGFD, CASP12
12 / 46696784 / 47173860 / SENP1, PFKM, ASB8, C12orf68, OR10AD1, H1FNT, ZNF641, ANP32D, C12orf54
17 / 36697627 / 36746143 / KRTAP cluster
23 / 78870379 / 79016438 / ITM2A, TBX22

Supplemental Table 4 (S4): Results for SNPs reported in Baum et al. (2008).

Of the 88 replicated SNPs reported in the previous study, 29 were directly genotyped in the present study. All allele-wise results are shown, with SNPs in physical order (UCSC hg18, dbSNP build 125).

SUPPLEMENTAL FIGURE LEGENDS

Supplemental Figure 1 (S1): QQ and multidimensional scaling (MDS) plots for each study population.

Unadjusted –log(p-values) are shown in black, with MDS (top 4 components) adjustment in red and LAMP adjustment in blue. In the top left-hand corner of each plot, the top 2 MDS components for each analysis are plotted against each other. Genomic control λ values are shown both for the current study size and corrected to a study size of 1000 cases and 1000 controls. In the case where EA and AA individuals are analyzed together, unadjusted λ levels are elevated because of different ratios of cases and controls in the two populations. All individuals shown in the MDS plots in the upper left corners were included in the analyses.
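
As a worked illustration of the genomic control values, the sketch below computes λ as the median observed chi-squared statistic (1 df) divided by its expected median, and then rescales it to 1000 cases and 1000 controls with the commonly used formula λ1000 = 1 + (λ - 1) x (1/n_cases + 1/n_controls) / (1/1000 + 1/1000). The legend does not say which rescaling formula was applied, so this choice is an assumption, and the function names and example p-values are hypothetical.

```python
from statistics import NormalDist, median

_STD_NORMAL = NormalDist()
# Median of a chi-squared distribution with 1 df (about 0.455).
CHI2_1DF_MEDIAN = _STD_NORMAL.inv_cdf(0.75) ** 2


def genomic_control_lambda(p_values):
    """Median chi-squared (1 df) statistic implied by two-sided
    p-values (0 < p <= 1), divided by its expected median."""
    chi2 = [_STD_NORMAL.inv_cdf(1.0 - p / 2.0) ** 2 for p in p_values]
    return median(chi2) / CHI2_1DF_MEDIAN


def lambda_1000(lam, n_cases, n_controls):
    """Rescale lambda to a study of 1000 cases and 1000 controls."""
    scale = (1.0 / n_cases + 1.0 / n_controls) / (1.0 / 1000 + 1.0 / 1000)
    return 1.0 + (lam - 1.0) * scale


# Hypothetical p-values, rescaled using the EA sample sizes in Table S1.
lam = genomic_control_lambda([0.9, 0.6, 0.45, 0.3, 0.02])
print(lam, lambda_1000(lam, n_cases=1001, n_controls=1033))
```

Because inflation grows with sample size, the rescaled value lets studies of different sizes be compared on a common footing.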

Supplemental Figure 2 (S2): Regions near top hits and areas of interest.

Regions +/- 250 kb around each SNP listed in Table 1 are shown. P-values are from the analysis where the SNP was identified. Genotyped SNPs are shown as circles, while imputed SNPs are shown as smaller diamonds. The primary SNP of interest is large and colored in black. Other SNPs are colored according to linkage disequilibrium levels with the primary SNP (r^2), as calculated from Phase 3 HapMap data using CEU (for EA and EA + AA combined) or YRI (for AA) populations. Recombination rate (HapMap) is shown on the second y-axis in blue. RefSeq genes are shown with all possible exons; arrows indicate transcript direction. In the upper left-hand corner of each graph, the genotype intensity plots are shown, with each color indicating the final genotype call (blue and red for homozygotes and purple for the heterozygote).

Supplemental Figure 3 (S3): Top regions characterized in Ferreira et al., +/- 250 kb.

Each point represents a SNP, with the first y-axis indicating the p-value for BD in EA individuals (MDS adjusted) for this study. Genotyped SNPs are indicated with circles, while imputed SNPs are indicated with diamonds. The most strongly previously associated SNP is indicated by a large black circle. SNPs are colored shades of red depending on their linkage disequilibrium with the most strongly previously associated SNP (r^2, calculated from HapMap CEU Phase 3 using Haploview). Recombination rate (HapMap) is shown on the second y-axis in blue. Green horizontal lines indicate haplotype association p-values from a 10-SNP sliding window. Below the plot are genotyped SNPs from the WTCCC Bipolar Disorder and STEP-BP studies, with p-value indicated by color. RefSeq genes are shown with all possible exons; arrows indicate transcript direction.