Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases

Kanai, Masahiro; Akiyama, Masato; Takahashi, Atsushi; Matoba, Nana; Momozawa, Yukihide; Ikeda, Masashi; Iwata, Nakao; Ikegawa, Shiro; Hirata, Makoto; Matsuda, Koichi; Kubo, Michiaki; Okada, Yukinori; Kamatani, Yoichiro

doi:10.1038/s41588-018-0047-6

Download PDF

Article
Open access
Published: 05 February 2018

Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases

Nature Genetics volume 50, pages 390–400 (2018)Cite this article

49k Accesses
138 Altmetric
Metrics details

Subjects

Abstract

Clinical measurements can be viewed as useful intermediate phenotypes to promote understanding of complex human diseases. To acquire comprehensive insights into the underlying genetics, here we conducted a genome-wide association study (GWAS) of 58 quantitative traits in 162,255 Japanese individuals. Overall, we identified 1,407 trait-associated loci (P < 5.0 × 10⁻⁸), 679 of which were novel. By incorporating 32 additional GWAS results for complex diseases and traits in Japanese individuals, we further highlighted pleiotropy, genetic correlations, and cell-type specificity across quantitative traits and diseases, which substantially expands the current understanding of the associated genetics and biology. This study identified both shared polygenic effects and cell-type specificity, represented by the genetic links among clinical measurements, complex diseases, and relevant cell types. Our findings demonstrate that even without prior biological knowledge of cross-phenotype relationships, genetics corresponding to clinical measurements successfully recapture those measurements’ relevance to diseases, and thus can contribute to the elucidation of unknown etiology and pathogenesis.

Multi-omics study for interpretation of genome-wide association study

Article 18 September 2020

A cross-population atlas of genetic associations for 220 human phenotypes

Article 30 September 2021

Multi-ancestry genetic study of type 2 diabetes highlights the power of diverse populations for discovery and translation

Article 12 May 2022

Main

Clinical laboratory measurements (e.g., blood test results) are powerful intermediate phenotypes that can be used to diagnose and monitor human diseases. Elucidation of the underlying genetics, as well as inference of genetic relationships to diseases and implicated cell types, can provide clues about disease biology. To this end, GWASs have been conducted to investigate various quantitative traits, including anthropometric^1,2,3, metabolic^4,5, kidney-related^6,7, hematological^8,9, and blood pressure traits^10,11,12. The interplay between the genetics of quantitative traits and diseases has been assessed by several approaches, such as pleiotropy^13,14, genetic correlation^15,16, and Mendelian randomization¹⁷. For example, recent large-scale studies of body mass index (BMI), a key measure for assessing obesity, revealed shared genetic effects on metabolic traits and the involvement of the central nervous system² and immune cells³ in obesity susceptibility. However, previous studies primarily examined subjects of European ancestry, and each study separately focused on few quantitative traits. For the creation of a comprehensive landscape, additional studies of non-European populations are warranted that simultaneously investigate a wide range of clinical measurements and extensively interrogate their relevance to complex diseases.

Here we report a GWAS of 58 quantitative traits in 162,255 Japanese individuals from the BioBank Japan Project (BBJ)^18,19, one of the largest non-European single-descent biobanks with detailed phenotypes, to broaden the current knowledge and understanding of the genetics and biology of these traits. Moreover, we incorporated additional GWASs of complex diseases and traits in Japanese subjects, and evaluated pleiotropy, genetic correlation, and cell-type specificity with respect to the quantitative traits. Our study provides many insights into the genetic basis of various quantitative traits and illuminates the complex genetic links among clinical measurements, complex diseases, and relevant cell types.

Results

Genome-wide association analysis of 58 quantitative traits

We tested 5,961,600 autosomal variants and 147,353 X-chromosome variants (imputed with 1000 Genomes Project Phase 1²⁰; Methods) for association with 58 quantitative traits in 162,255 Japanese individuals. The studied traits covered a wide range of clinical measurements, grouped into nine distinct categories (Table 1): metabolic (n = 6), serum protein (n = 4), kidney-related (n = 4), electrolyte (n = 5), liver-related (n = 6), other biochemical (n = 6), hematological (n = 13), blood pressure (n = 4), and echocardiographic (n = 9). The study design is illustrated in Supplementary Fig. 1, and the detailed characteristics of the subjects, phenotype source, and exclusion criteria are described in Supplementary Tables 1 and 2.

Table 1 Overview of the studied quantitative traits

Full size table

Overall, we identified 1,407 trait-associated loci for 53 quantitative traits that satisfied a genome-wide significance threshold of P = 5.0 × 10⁻⁸ (Methods). Of these, 679 loci were novel (Table 1 and Supplementary Table 3). When we applied multiple-testing correction to the number of the studied traits, 943 trait-associated loci for 51 traits showed significant associations (P < 5.0 × 10⁻⁸/58 = 8.6 × 10⁻¹⁰), of which 372 loci were novel. Stepwise conditional analysis for each trait-associated locus further identified 267 additional independent signals at 158 trait-associated loci for 39 traits (Supplementary Table 4). We observed multiple additional independent signals at 49 trait-associated loci, with the maximum number of 11 independent signals at 11q13.1 for uric acid (the top associated signal was rs57633992 on NRXN2; P = 7.30 × 10⁻⁸⁴⁵) (Supplementary Fig. 2). Although the genomic inflation factors (λ_GC) of several traits showed considerable inflation (mean λ_GC = 1.11), linkage disequilibrium (LD) score regression²¹ analysis confirmed no existence of substantial confounding biases for all traits (mean intercept: 1.04), as shown in Supplementary Table 5. Given the substantial sample sizes in our GWASs, these statistics suggest that a majority of the inflation was due to polygenic effects, and population stratification and other potential biases were strictly controlled^3,21. Manhattan, quantile–quantile, and LD score plots are provided in Supplementary Dataset 1. Detailed regional plots for each locus are provided in Supplementary Dataset 2.

Trans-ethnic comparison of the allele frequencies of the identified loci between East Asians and Europeans showed an overall shared allelic spectrum across populations (r = 0.687; Supplementary Fig. 3). The novel loci tended to have higher allele frequencies in East Asians than in Europeans, as 60 novel loci (8.8%) were common (minor allele frequency ≥ 5%) in East Asians but rare (≤1%) in Europeans. Of note, the associated single-nucleotide polymorphisms (SNPs) in 15 unique loci (for example, ALDH2, EGF, and SUFU) were monomorphic in Europeans but had higher frequencies in East Asians (≥10%). These observations show the contribution of population-specific factors, such as evolutionary selection pressure, to the identified loci. The percentage of mean heritability of the traits explained by the significant loci was 2.84% (Supplementary Table 6). On average, the known loci in Europeans explained 1.92%, the overall known loci explained 2.03%, and the newly identified loci explained 0.84%. The percentage of global heritability explained by the genome-wide common SNPs was on average 8.60%, which is comparable to previous findings in Europeans (Supplementary Table 5).

Pleiotropy of top associated quantitative trait loci

Pleiotropy, defined here as the sharing of risk alleles across multiple traits, is a key concept in investigations of cross-phenotype relationships across human traits, leading us to decipher a shared genetic etiology underlying a complex genetic architecture^13,14. To identify major pleiotropic loci, we assessed pleiotropy at the single-locus level across 763 unique loci (derived from the 1,407 trait-associated loci for 53 quantitative traits mentioned above; Methods). We identified numerous pleiotropic loci among the quantitative traits (n = 313), representing approximately 41% of the unique loci (Fig. 1 and Supplementary Table 7). Of these, 88 loci showed pleiotropy across traits in multiple trait categories (intercategorical pleiotropy), whereas the other 225 loci showed pleiotropy across traits in a single category (intracategorical pleiotropy).

**Fig. 1: Overview of the identified loci and their pleiotropy.**

We observed the most abundant intercategorical pleiotropy at ALDH2 (12q24.12), associated with 21 traits in seven categories (Supplementary Fig. 4). The most significant associations were at rs79105258 (the top associated signal was γ-glutamyl transferase (GGT); P = 9.98 × 10⁻¹⁰⁰), which shows high minor allele frequency in East Asians (0.24) but is monomorphic in other ancestral populations²⁰. Other pleiotropic loci that showed intercategorical pleiotropy included GCKR (2p23.3), associated with 18 traits in seven categories (rs1260326 for triglyceride; P = 1.69 × 10⁻⁹⁴); ABO (9q34.2), associated with 15 traits in six categories (rs2519093 for alkaline phosphatase; P = 2.02 × 10⁻⁸⁸⁷); and RGS12 (4p16.3), associated with nine traits in six categories (rs4690095 for albumin; P = 1.63 × 10⁻²²). Although RGS12 (4p16.3) has received little attention as a pleiotropic locus compared with the other loci mentioned¹³, this locus has shown associations with several traits and diseases, including serum lipids⁴ and inflammatory bowel disease²². Our results expand its associations with additional traits, including kidney function, serum calcium, GGT, and platelet count (Plt).

Polygenic correlations across quantitative traits

Another approach to infer genetic overlap across traits is to estimate a genetic correlation, that is, a correlation of causal effect sizes at a genome-wide level^15,16. Rather than using a single-locus-level analysis, we evaluated genetic correlations under a polygenic model that could take into account the consistency of effect directions, unlike pleiotropy analysis, to disentangle the polygenic architecture of the studied traits. We incorporated additional GWAS results for the anthropometric traits BMI³ and adult height, obtained from ongoing studies under the BBJ (Supplementary Note 1), to gain a broader perspective on quantitative traits. We carried out bivariate LD score regression¹⁵ to estimate pairwise genetic correlations across the 59 quantitative traits (we excluded the E/A ratio, a marker of heart function, owing to small sample size; Methods). We found 173 significant genetic correlations (false discovery rate (FDR) < 0.05), 100 (58%) of which were intercategorical (Supplementary Fig. 5 and Supplementary Table 8).

We observed the greatest number of significant intercategorical genetic correlations with BMI, which showed significant correlations with 22 quantitative traits in seven trait categories (the most significant correlation (P = 9.83 × 10⁻¹⁷) was with mean arterial pressure). Total protein and height had the second highest numbers of correlated categories (n = 6), followed by triglycerides, non-albumin protein, and Plt (n = 5). Although some of the significant intercategorical genetic correlations had been suggested previously (for example, BMI and serum lipids in Europeans¹⁵), most were newly identified. Notably, most of these links were consistent with observations in epidemiological studies, thus demonstrating the robustness and potential of the genetics-based studies to elucidate novel biological and medical architectures of human traits without prior knowledge (Supplementary Table 8). For example, the observed negative correlation between white blood cell (WBC) count and total bilirubin was suggested in an epidemiological study²³, but our study corroborated this correlation on the basis of genetics, thus providing empirical support for the hypothesis of the anti-inflammatory activities of bilirubin²⁴.

Genetic correlations among quantitative traits and diseases

Given that clinical measurements are informative as intermediate phenotypes for the assessment of complex human diseases, we reasoned that additional exploration of genetic correlations between quantitative traits and diseases would provide more empirical corroboration of shared genetic architecture, which could illuminate the underlying etiology and pathogenesis. To this end, we additionally incorporated 30 case–control GWAS results for complex diseases in Japanese individuals (Table 2 and Supplementary Note 1)^{25,26,27,28,29,30}, including cardiometabolic (n = 6), immune-related (n = 6), hematologic (n = 1), psychiatric (n = 2), and musculoskeletal diseases (n = 2); cancer (n = 7); and other diseases (n = 6).

Table 2 Summary of the additional case–control GWASs of the 30 complex diseases

Full size table

We then estimated pairwise genetic correlations across the 59 quantitative traits and 30 diseases. We identified 68 significant genetic correlations (FDR < 0.05), which supported the biological relevance of associations between clinical measurements and complex diseases (Fig. 2 and Supplementary Table 8; the full results are presented in Supplementary Fig. 6 and Supplementary Table 9). Among the 68 significant correlations, 52 (76.5%) involved cardiometabolic diseases, correlating with quantitative traits in seven categories. Indeed, type 2 diabetes showed the greatest number of significant correlations with quantitative traits (n = 15), and demonstrated the most significant genetic correlation with hemoglobin A1c (r_g = 0.724; P = 2.54 × 10⁻²²). We also observed other significant correlations, such as those between ischemic stroke and uric acid (r_g = 0.254; P = 5.74 × 10⁻⁵), and between myocardial infarction and albumin/globulin ratio (r_g = −0.174; P = 1.06 × 10⁻³). Among the remaining 16 significant genetic correlations (other than for cardiometabolic diseases), the most significant correlation was between asthma and eosinophil count (r_g = 0.348; P = 3.76 × 10⁻⁴). Other significant correlations included those between urolithiasis and systolic blood pressure (r_g = 0.272; P = 7.22 × 10⁻⁴), asthma and systolic blood pressure (r_g = 0.214; P = 8.84 × 10⁻⁴), and colorectal cancer and height (r_g = 0.164; P = 2.92 × 10⁻³).

**Fig. 2: Genetic correlations between the 59 quantitative traits and 30 diseases.**

In addition to the suggested genetic correlations in Europeans (type 2 diabetes and BMI; triglycerides, blood sugar, and hemoglobin A1c; coronary artery disease and BMI; and high-density-lipoprotein cholesterol and triglycerides)¹⁵, we empirically corroborated novel genetic correlations that have been implicated in Mendelian randomization analyses (e.g., type 2 diabetes and alanine aminotransferase³¹; atrial fibrillation and height³²; asthma and eosinophil count⁹; and colorectal cancer and height³³) and epidemiological studies (e.g., ischemic stroke and uric acid³⁴; myocardial infarction and albumin/globulin ratio³⁵; peripheral artery disease and total bilirubin³⁶; and urolithiasis and blood pressure³⁷) (Supplementary Table 8). Thus, we further investigated causal relationships between the significant pairs of quantitative traits and diseases by using a Mendelian randomization approach (Methods). We identified 24 significant causal associations (P < 9.43 × 10⁻⁴ (= 0.05/53)), 15 of which had not been previously suggested by genetic causal relationships (Supplementary Fig. 7 and Supplementary Table 10). To distinguish bias due to pleiotropy, we further applied MR-Egger regression³⁸ as a sensitivity test, and confirmed the robustness of the identified causal relationships (P > 0.05 for intercept after Bonferroni correction).

To facilitate understanding of the complex inter-relations between clinical measurements and diseases, we constructed a network from the genetic correlation matrix (Fig. 3). In the network, the distance between correlated phenotypes is determined by weighting of the magnitudes of their correlations (Methods). Although we constructed our genetic correlation network without prior biological knowledge of cross-phenotype relationships, we observed distinctive clusters of biologically related phenotypes. The largest cluster was composed of cardiometabolic diseases and their biomarkers, interconnected with various clinical measurements, such as kidney-related, liver-related, and hematological traits. The network also depicted cross-disease interplay, including the positive correlation of autoimmune diseases (rheumatoid arthritis and Graves’ disease) and chronic inflammatory diseases (asthma and chronic obstructive pulmonary disease), as well as the negative correlation of glaucoma and Graves’ disease. These results suggest that the polygenic landscapes of traits reflect their biological backgrounds, and thus could be used to elucidate the unknown etiology of diseases.

**Fig. 3: Genetic correlation network across the 59 quantitative traits and 30 diseases.**

Shared cell-type specificity among human complex traits

The identification of trait-relevant cell types is essential for fine-mapping of candidate causal variants, the identification of potent therapeutic targets, and, ultimately, full understanding of disease biology^39,40,41. To assess the cell-type specificity of human traits and diseases on the basis of heritability enrichment, we applied stratified LD score regression³⁹ to the GWAS results for the 59 quantitative traits and 30 diseases using 220 cell-type-specific annotations for histone modifications (H3K4me1, H3K4me3, H3K9ac, and H3K27ac) constructed from the Roadmap Epigenomics Project dataset^39,42.

To create a broad picture of trait-relevant cell types, we first assessed heritability enrichment in ten major cell-type groups, defined as unions of 220 cell-type-specific annotations, representing their system- or organ-level structure³⁹. We observed 72 significant heritability enrichments (FDR < 0.05) in the cell-type groups for 44 quantitative traits and diseases (Fig. 4a and Supplementary Table 11). The top significant enrichments in each quantitative trait category included connective or bone for height (P = 4.89 × 10⁻⁹), kidney for estimated glomerular filtration rate (P = 2.59 × 10⁻⁷), liver for GGT (P = 2.54 × 10⁻⁶), immune or hematopoietic for mean corpuscular volume (P = 6.46 × 10⁻⁶), and skeletal muscle for creatine kinase (P = 2.77 × 10⁻⁵), consistent with known biology (Fig. 4b and Supplementary Fig. 8). The same held true for the diseases—for example, significant enrichments in immune or hematopoietic for rheumatoid arthritis (P = 9.19 × 10⁻⁶) and Graves’ disease (P = 3.81 × 10⁻⁵).

**Fig. 4: Heritability enrichment in the ten cell-type groups.**

Although the cell-type-group-level analysis successfully identified a trait-relevant group for most of the quantitative traits and diseases, we hypothesized that more detailed assessment at the cell-type level would differentiate a trait-relevant cell type within the group. We thus assessed heritability enrichment in each of the 220 cell-type-specific annotations. We identified 384 significant heritability enrichments (FDR < 0.05) for 50 quantitative traits and diseases (Supplementary Table 12). To further explore the complex systems of trait-relevant cell types, we carried out hierarchical clustering based on the earned profile of heritability enrichment for the 59 quantitative traits and 30 diseases in the 220 cell-type annotations (Fig. 5a).

**Fig. 5: Hierarchical clustering of heritability enrichments in the 220 cell-type-specific annotations.**

We observed several distinct clusters that specifically comprised related traits and cell types. The most distinct cluster involved a great majority of immune or hematopoietic cell types enriched in hematological traits and in autoimmune, allergic, and infectious diseases, representing a wide range of immune-related diseases and traits (Fig. 5b). The most significant heritability enrichment was for mean corpuscular hemoglobin in mobilized CD34 (P = 2.01 × 10⁻⁹; H3K4me1). All CD34-related epigenetic annotations also showed significant enrichments for red blood cell, WBC, and Plt-related hematological traits. Because CD34 is recognized as a marker of hematopoietic progenitor cells⁴³, our findings suggest that variants in the regulatory region of CD34⁺ primary cells affect hematopoietic cell differentiation and the number of hematopoietic cells.

Finally, to highlight shared cell types involved in human diseases and traits, we constructed a directed network matrix of cell-type-specific heritability annotations (Fig. 6; details are also presented in the Methods section). We identified several independent networks of cell-type specificity. The largest network was composed of three major clusters connected via the significant enrichment of adipose nuclei for (i) WBC count, (ii) lymphocyte count, and (iii) height. In addition to the contribution of CD34, we observed heritability enrichments in regulatory regions of CD14⁺ and CD15⁺ primary cells for WBC counts and WBC subtypes (i.e., monocytes and neutrophils), representing their specificity in myeloid lineages (CD14 for monocytes and macrophages⁴⁴, and CD15 for granulocyte series cells⁴⁵). Primary cells expressing CD19 and CD20, surface markers of B cells⁴⁶, also showed enrichment for non-albumin protein and albumin/globulin ratio, potentially reflecting immunoglobulin-synthesis functions of B cells. Moreover, various CD4⁺ and CD8⁺ T cells showed enrichment for autoimmune diseases such as Graves’ disease and rheumatoid arthritis. We note that the enrichment of regulatory T cells (T_reg cells) in Graves’ disease, a human autoimmune thyroiditis, is concordant with the biological finding that T_reg-cell-depleted mice develop thyroiditis⁴⁷. Other observed links between allergic diseases (atopic dermatitis and asthma) and helper T cells, or about the contribution of fetal or chondrogenic tissues to height, also supported biological and medical findings.

**Fig. 6: Cell-type-specificity network across the 59 quantitative traits and 30 diseases in the 220 cell-type-specific annotations.**

These results demonstrate that ‘individual cell-type level’ analysis can successfully recapture the biology of human traits, without prior knowledge of ‘consolidated cell-type group-level’ analysis. The cell-type-specificity networks pinpoint potent causal cell types that cooperatively affect human phenotypes, providing promising resources for novel therapeutic targets. Nevertheless, integration of cell-type specificity in addition to polygenic genetic correlations clearly expanded the current knowledge of cross-phenotype relationships and underlying genetic mechanisms of diseases.

Discussion

We have presented one of the largest non-European GWASs of quantitative traits so far, identifying 1,407 trait-associated loci for 53 traits in 162,255 Japanese individuals. By incorporating additional GWAS results for 32 complex diseases and traits in Japanese individuals, we further identified numerous pleiotropic loci, wide-ranging genetic correlations, and distinct cell-type specificity among the quantitative traits and diseases that confirmed or expanded our current understanding of biology.

Our findings suggest that there are complex inter-relations between clinical measurements and diseases, demonstrating the value of GWASs for a variety of traits in a single large-scale cohort with detailed clinical information. We report novel genetic correlations, some of which are consistent with the results of epidemiological studies. These findings substantially expand the knowledge of genetic relationships across clinical measurements and diseases. We also highlight shared cell-type specificity by linking cell types to diseases. These results shed light on the underlying genetic mechanisms, revealing shared etiology and pathogenesis of complex diseases by using clinical measurements as an intermediate phenotype.

Although our work provided various insights into the genetics corresponding to clinical measurements in Japanese subjects, we should address several limitations of this study. First, we did not have a replication cohort for validation of the identified loci, but the majority of the trait-associated loci were previously reported (n = 728; 51.7%). This issue partly reflects a dilemma in the present study, namely, that extensive phenotypes were covered simultaneously, which makes replication more challenging. Second, our subjects for each trait mostly overlapped. Although bivariate LD score regression has elegantly modeled overlapping samples and their phenotypic correlation¹⁵, such sample overlap might exert an upward bias in interpretation of the genetic overlaps. Third, although we adopted a linear regression model for unrelated subjects, the application of a linear mixed model for both related and unrelated subjects could potentially have increased the statistical power of the study⁴⁸. Fourth, the causal inference of clinical measurements for complex diseases in the present study could be limited because of the handling of the single cohort. Further application of Mendelian randomization¹⁷ in independent validation cohorts is warranted. Finally, our cell-type analysis was inevitably limited by the availability of the cell-type-specific annotations regarding the variety of cell types and epigenetic markers. More acquisition and integration of cell-type-specific resources would further facilitate the exploration of causal cell types in human diseases.

In conclusion, we conducted a large-scale GWAS of 58 quantitative traits in Japanese individuals and demonstrated complex inter-relations with human diseases via pleiotropy, genetic correlation, and cell-type-specificity analyses. We further visualized the results as networks, depicting the genetic links among clinical measurements, human diseases, and relevant cell types. Our findings will contribute to future studies and serve as a fundamental resource for understanding the genetics and biology underlying clinical measurements and human diseases.

URLs

BBJ, https://biobankjp.org/english/index.html; JENGER, http://jenger.riken.jp/en/; 1000 Genomes Project, http://www.1000genomes.org/; GWAS catalog, https://www.ebi.ac.uk/gwas/; PLINK 1.9, https://www.cog-genomics.org/plink2; ldsc, https://github.com/bulik/ldsc/; LD score, http://data.broadinstitute.org/alkesgroup/LDSCORE/; MACH, http://csg.sph.umich.edu//abecasis/MaCH/; Minimac, https://genome.sph.umich.edu/wiki/Minimac; ANNOVAR, http://annovar.openbioinformatics.org/en/latest/; R, https://www.r-project.org/; Locuszoom, http://locuszoom.sph.umich.edu/locuszoom/; Circos, http://circos.ca/; NBDC Human Database, https://humandbs.biosciencedbc.jp/en/.

Methods

Subjects

All the subjects enrolled in this study were collected under the BioBank Japan Project (BBJ). The BBJ is a multi-institutional hospital-based registry that collected DNA, serum, and clinical information of approximately 200,000 patients from 66 hospitals affiliated with 12 medical institutes between fiscal years 2003 and 2007. All study participants had been diagnosed with one or more of 47 target diseases by physicians at the cooperating hospitals as described in the previous reports^18,19. Written informed consent was obtained from all participants, as approved by the ethics committees of RIKEN Center for Integrative Medical Sciences and the Institute of Medical Sciences, the University of Tokyo. Detailed characteristics of the subjects for each trait are shown in Supplementary Table 1.

Phenotype

BBJ collected baseline clinical information through interviews and reviews of medical records using a standardized questionnaire. Among the quantitative traits included in this study, age, height, and weight were retrieved from the self-reported questionnaire for all participants. Laboratory measurements were retrieved from medical records of routine laboratory examination for all participants. Because dyslipidemia and diabetes were the most common diseases registered in the BBJ, around half of the study participants (41.8%) had these two diseases. Echocardiographic traits were retrieved from medical records only for the subjects with cardiovascular diseases, dyslipidemia, and diabetes. The measured values of each quantitative trait (or common log-transformed values if required, to achieve normality) were adjusted for age, sex, top ten principal components of genetic ancestry, disease status (affected versus non-affected) for the 47 target diseases in the BBJ, and any necessary trait-specific covariates in a linear regression model. We then normalized the resulting residuals by applying an appropriate trait-specific transformation (Z-score or rank-based inverse normal transformation) as detailed in Supplementary Table 2.

Genotyping and imputation

We genotyped samples with the Illumina HumanOmniExpressExome BeadChip or a combination of the Illumina HumanOmniExpress and HumanExome BeadChips. We excluded samples with (i) sample call rate < 0.98, (ii) closely related individuals identified by identity-by-descent analysis, and (iii) non–East Asian outliers identified by principal component analysis of the studied samples and the three major reference populations (Africans, Europeans, and East Asians) in the International HapMap Project⁴⁹. We then applied standard quality-control criteria for variants, excluding those with (i) SNP call rate < 0.99, (ii) minor allele frequency < 1%, and (iii) Hardy–Weinberg equilibrium P value ≤ 1.0 × 10⁻⁶. We prephased the genotypes with MACH⁵⁰ and imputed dosages with minimac and the 1000 Genomes Project Phase 1 (version 3) East Asian reference haplotypes²⁰. For the X chromosome, we performed prephasing and imputation separately for females and males. Imputed SNPs with an imputation quality Rsq < 0.7 were excluded from the subsequent association analysis.

Genome-wide association analysis

For each quantitative trait, we conducted a GWAS using a linear regression model under the assumption of additive allelic effects of the SNP dosages via mach2qtl⁵⁰. We set a genome-wide significance threshold at the level of P = 5.0 × 10⁻⁸ (ref. 51) and a study-wide significance threshold at the level of P = 8.6 × 10⁻¹⁰ (= 5.0 × 10⁻⁸/58) by applying Bonferroni correction based on the number of studied traits. We defined independent associated loci on the basis of genomic positions at least 1 Mb apart from each other. We call such independent associated loci for each trait ‘trait-associated loci’, and these could overlap other trait-associated loci (i.e., multiple trait-associated loci could be mapped to one unique locus). We considered a trait-associated locus as novel when it was (i) located at a distance of >500 kb from the nearest locus and (ii) not in LD (r² < 0.1) in both East Asians and Europeans with the previously reported loci of the same quantitative trait. For the X chromosome, we conducted GWASs separately for females and males, and meta-analyzed association results. We performed stepwise conditional analysis to identify additional independent signals around associated loci (each region ± 500 kb) by adjusting the most significant variant of the region in each step until none met the genome-wide significance threshold. For extremely significant variants showing P < 1.0 × 10⁻³⁰⁰, we calculated P values in R (ver. 3.3.1) with the Rmpfr package. We calculated the genomic inflation factor λ_GC in R. The variance explained by the significantly associated SNPs was estimated with the formula 2 f (1−f) β², where f represents the allele frequency and β represents the additive effect. We then summed the resulting values to calculate the total variance explained by the significant SNPs for each of the 53 quantitative traits that showed at least one genome-wide significant locus. We carried out LD score regression²¹ with ldsc (v. 1.0.0; commit 23a94fc) to estimate confounding bias and heritability explained by the genome-wide high-quality common SNPs present in the HapMap 3 reference panel. We generated regional plots with LocusZoom⁵² (v. 1.3) and R.

Pleiotropy analysis

We assessed pleiotropy at a unique locus using the following criteria: top-associated variants of different quantitative traits were (i) in LD (r² ≥ 0.5) or (ii) closely located (physical distance within 25 kb). We calculated r² of two variants using PLINK 1.90⁵³ and the 1000 Genomes Project Phase 3 (version 5) East Asian dataset²⁰. We used Circos⁵⁴ to visualize the results.

Additional GWAS results for anthropometric traits and diseases in Japanese subjects

We additionally obtained two quantitative trait GWAS results for anthropometric traits (BMI³ and height), and 30 case–control GWAS results for complex diseases in the Japanese population from both published^{25,26,27,28,29,30} and unpublished studies in the BioBank Japan Project (Table 2). For the two anthropometric traits, results for most of the subjects overlapped with those from the present study (n = 152,667 (94.1%) and 153,456 (94.6%) for BMI and height, respectively). For the 30 complex diseases, the 26 disease cases were recruited through BBJ, whereas subjects with rheumatoid arthritis, bipolar disorder, schizophrenia, and adolescent idiopathic scoliosis were recruited by collaborators as described elsewhere^26,28,29. The controls were constructed from three population-based cohorts (the Tohoku Medical Megabank organization, the Japan Public Health Center–based Prospective study, and the Japan Multi-Institutional Collaborative Cohort Study) or a mixture of the cases in BBJ as detailed in Supplementary Note 1. We incorporated these additional GWAS results into the original GWAS results for the 58 quantitative traits in the subsequent analyses.

Genetic correlation

We conducted bivariate LD score regression¹⁵ to quantify genetic correlations across the 59 traits and 30 complex diseases in the Japanese population. To maintain sufficient statistical power¹⁵, we excluded one GWAS result (E/A ratio of echocardiographic trait) for which the sample size was far less than 10,000. For the regression, we used the East Asian LD score and summary statistics of high-quality common SNPs present in the HapMap 3 reference panel for each available trait or disease. We excluded SNPs found in the major histocompatibility complex (MHC) region (chromosome 6: 25–34 Mb) from the analysis because of its complex LD structure^27,39,55,56. We defined significant genetic correlations as those with FDR < 0.05, calculated via the Benjamini–Hochberg method to correct multiple testing of all 3,916 pairwise correlations among the 59 quantitative traits and 30 diseases.

For network visualization, we constructed a network from the genetic correlation matrix of the 59 traits and 30 diseases. Specifically, each phenotype was represented as a node, and the nodes were connected by edges if they were genetically correlated. We assigned a weight to each edge based on the magnitude of the corresponding genetic correlation. To highlight biological patterns in the network and to prevent it from becoming too dense, we used only significant genetic correlations (FDR < 0.05). Node layout was determined by the Fruchterman–Reingold algorithm given edge weights, with strongly correlated phenotypes placed closer together. We used R (ver. 3.3.1) with the igraph package for this network analysis.

Mendelian randomization

Given the 68 significant genetic correlations between clinical measurements and complex diseases, we carried out a Mendelian randomization analysis for each pair of them to evaluate potential causal effects of clinical measurements on complex diseases. Because most of the samples overlapped in the present study and the disease GWAS, we excluded overlapping samples from disease cases with clinical measurements available for each pair, to avoid potential bias. We selected 53 pairs on the basis of the following criteria: (i) raw genotypes of disease cases were available (i.e., the cases were recruited through BBJ (Table 2 and Supplementary Note 1)), (ii) more than three loci were identified in clinical measurement GWASs, and (iii) unique samples remained after the removal of overlapping samples. We note that this sample exclusion might have led to decreased statistical power compared with that of the original disease GWAS. For each pair, we calculated a weighted genetic risk score by summing the product of risk allele dosage and the effect sizes of the identified alleles influencing each clinical measurement. Associations between the genetic risk score and disease were quantified via a logistic regression model. To further test pleiotropy, we applied MR-Egger regression³⁸ as sensitivity analysis. We used R (ver. 3.3.1) with the MendelianRandomization package⁵⁷.

Partitioning heritability

We carried out stratified LD score regression³⁹ to partition heritability into multiple functional categories. We used the 220 cell-type-specific and the 10 cell-type-group-specific annotations constructed based on the Roadmap Epigenomics Project⁴² available at the authors’ website (see “URLs”). Because only European references are provided for partitioning heritability analysis, we generated the East Asian LD Score reference for each annotation using the 1000 Genomes Project Phase 3 (version 5) East Asian reference haplotypes²⁰ according to standard procedures. For each annotation, we calculated the P value of the regression coefficient τ_c of the annotation. We defined significant heritability enrichments as those with FDR < 0.05, calculated via the Benjamini–Hochberg method.

We performed hierarchical clustering on the matrix of enrichment significance for the 59 quantitative traits and 30 diseases in the 220 cell-type-specific annotations, using Spearman’s correlation distance and the group average method. We also constructed a network from the matrix to represent the heritability enrichment of cell types to phenotypes. We assigned each phenotype and cell type to a node, and linked a pair of them with an arrow if a cell type was enriched for a phenotype. We assigned a weight to each arrow on the basis of the corresponding enrichment significance. For the sake of clarity, we used only highly significant enrichments (FDR < 0.01). Node layout was determined with the Fruchterman–Reingold algorithm given edge weights, with significantly enriched pairs of phenotypes and cell types placed closer together.

Life Sciences Reporting Summary

Further information on experimental design is available in the Life Sciences Reporting Summary.

Data availability

GWAS summary statistics of the 58 quantitative traits are publically available at our website (JENGER; see “URLs”) and the National Bioscience Database Center (NBDC) Human Database (Research ID: hum0014) as open data without any access restrictions. GWAS genotype data from the subjects was deposited at the NBDC Human Database (Research ID: hum0014).

References

Wood, A. R. et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 46, 1173–1186 (2014).
Article CAS PubMed PubMed Central Google Scholar
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
Article CAS PubMed PubMed Central Google Scholar
Akiyama, M. et al. Genome-wide association study identifies 112 new loci for body mass index in the Japanese population. Nat. Genet. 49, 1458–1467 (2017).
Article CAS PubMed Google Scholar
Willer, C. J. et al. Discovery and refinement of loci associated with lipid levels. Nat. Genet. 45, 1274–1283 (2013).
Article CAS PubMed PubMed Central Google Scholar
Surakka, I. et al. The impact of low-frequency and rare variants on lipid levels. Nat. Genet. 47, 589–597 (2015).
Article CAS PubMed PubMed Central Google Scholar
Okada, Y. et al. Meta-analysis identifies multiple loci associated with kidney function-related traits in east Asian populations. Nat. Genet. 44, 904–909 (2012).
Article CAS PubMed PubMed Central Google Scholar
Pattaro, C. et al. Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function. Nat. Commun. 7, 10023 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kamatani, Y. et al. Genome-wide association study of hematological and biochemical traits in a Japanese population. Nat. Genet. 42, 210–215 (2010).
Article CAS PubMed Google Scholar
Astle, W. J. et al. The allelic landscape of human blood cell trait variation and links to common complex disease. Cell 167, 1415–1429 (2016).
Article CAS PubMed PubMed Central Google Scholar
Surendran, P. et al. Trans-ancestry meta-analyses identify rare and common variants associated with blood pressure and hypertension. Nat. Genet. 48, 1151–1161 (2016).
Article CAS PubMed PubMed Central Google Scholar
Liu, C. et al. Meta-analysis identifies common and rare variants influencing blood pressure and overlapping with metabolic trait loci. Nat. Genet. 48, 1162–1170 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ehret, G. B. et al. The genetics of blood pressure regulation and its target organs from association studies in 342,415 individuals. Nat. Genet. 48, 1171–1184 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sivakumaran, S. et al. Abundant pleiotropy in human complex diseases and traits. Am. J. Hum. Genet. 89, 607–618 (2011).
Article CAS PubMed PubMed Central Google Scholar
Han, B. et al. A method to decipher pleiotropy by detecting underlying heterogeneity driven by hidden subgroups applied to autoimmune and neuropsychiatric diseases. Nat. Genet. 48, 803–810 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lee, S. H., Yang, J., Goddard, M. E., Visscher, P. M. & Wray, N. R. Estimation of pleiotropy between complex diseases using single-nucleotide polymorphism-derived genomic relationships and restricted maximum likelihood. Bioinformatics 28, 2540–2542 (2012).
Article CAS PubMed PubMed Central Google Scholar
Davey Smith, G. & Hemani, G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum. Mol. Genet. 23, R89–R98 (2014).
Article CAS PubMed PubMed Central Google Scholar
Nagai, A. et al. Overview of the BioBank Japan Project: study design and profile. J. Epidemiol. 27, S2–S8 (2017).
Article PubMed PubMed Central Google Scholar
Hirata, M. et al. Cross-sectional analysis of BioBank Japan clinical data: a large cohort of 200,000 patients with 47 common diseases. J. Epidemiol. 27, S9–S21 (2017).
Article PubMed PubMed Central Google Scholar
1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article Google Scholar
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. Z. et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat. Genet. 47, 979–986 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tsai, W.-N. et al. Serum total bilirubin concentrations are inversely associated with total white blood cell counts in an adult population. Ann. Clin. Biochem. 52, 251–258 (2015).
Article CAS PubMed Google Scholar
Liu, Y. et al. Bilirubin possesses powerful immunomodulatory activity and suppresses experimental autoimmune encephalomyelitis. J. Immunol. 181, 1887–1897 (2008).
Article CAS PubMed Google Scholar
Hirota, T. et al. Genome-wide association study identifies eight new susceptibility loci for atopic dermatitis in the Japanese population. Nat. Genet. 44, 1222–1226 (2012).
Article CAS PubMed Google Scholar
Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).
Article CAS PubMed Google Scholar
Okada, Y. et al. Construction of a population-specific HLA imputation reference panel and its application to Graves’ disease risk in Japanese. Nat. Genet. 47, 798–802 (2015).
Article CAS PubMed Google Scholar
Ogura, Y. et al. A functional SNP in BNC2 is associated with adolescent idiopathic scoliosis. Am. J. Hum. Genet. 97, 337–342 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ikeda, M. et al. A genome-wide association study identifies two novel susceptibility loci and trans population polygenicity associated with bipolar disorder. Mol. Psychiatry https://doi.org/10.1038/mp.2016.259 (2017).
Low, S.-K. et al. Identification of six new genetic loci associated with atrial fibrillation in the Japanese population. Nat. Genet. 49, 953–958 (2017).
Article CAS PubMed Google Scholar
Liu, J., Au Yeung, S. L., Lin, S. L., Leung, G. M. & Schooling, C. M. Liver enzymes and risk of ischemic heart disease and type 2 diabetes mellitus: a Mendelian randomization study. Sci. Rep. 6, 38813 (2016).
Article CAS PubMed PubMed Central Google Scholar
Rosenberg, M. A. et al. Genetic variants related to height and risk of atrial fibrillation: the cardiovascular health study. Am. J. Epidemiol. 180, 215–222 (2014).
Article PubMed PubMed Central Google Scholar
Khankari, N. K. et al. Association between adult height and risk of colorectal, lung, and prostate cancer: results from meta-analyses of prospective studies and Mendelian randomization analyses. PLoS Med. 13, e1002118 (2016).
Article PubMed PubMed Central Google Scholar
Wu, A. H., Gladden, J. D., Ahmed, M., Ahmed, A. & Filippatos, G. Relation of serum uric acid to cardiovascular disease. Int. J. Cardiol. 213, 4–7 (2016).
Article PubMed Google Scholar
Azab, B. et al. Value of albumin-globulin ratio as a predictor of all-cause mortality after non-ST elevation myocardial infarction. Angiology 64, 137–145 (2013).
Article CAS PubMed Google Scholar
Perlstein, T. S., Pande, R. L., Beckman, J. A. & Creager, M. A. Serum total bilirubin level and prevalent lower-extremity peripheral arterial disease: National Health and Nutrition Examination Survey (NHANES) 1999 to 2004. Arterioscler. Thromb. Vasc. Biol. 28, 166–172 (2008).
Article CAS PubMed Google Scholar
Timio, F., Kerry, S. M., Anson, K. M., Eastwood, J. B. & Cappuccio, F. P. Calcium urolithiasis, blood pressure and salt intake. Blood Press. 12, 122–127 (2003).
Article CAS PubMed Google Scholar
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 44, 512–525 (2015).
Article PubMed PubMed Central Google Scholar
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 47, 1228–1235 (2015).
Article CAS PubMed PubMed Central Google Scholar
Pers, T. H. et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 6, 5890 (2015).
Article CAS PubMed PubMed Central Google Scholar
Trynka, G. et al. Disentangling the effects of colocalizing genomic annotations to functionally prioritize non-coding variants within complex-trait loci. Am. J. Hum. Genet. 97, 139–152 (2015).
Article CAS PubMed PubMed Central Google Scholar
Roadmap Epigenomics Consortium et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
Article PubMed Central Google Scholar
Sidney, L. E., Branch, M. J., Dunphy, S. E., Dua, H. S. & Hopkinson, A. Concise review: evidence for CD34 as a common marker for diverse progenitors. Stem Cells 32, 1380–1389 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ziegler-Heitbrock, H. W. L. & Ulevitch, R. J. CD14: cell surface receptor and differentiation marker. Immunol. Today 14, 121–125 (1993).
Article CAS PubMed Google Scholar
Gadhoum, S. Z. & Sackstein, R. CD15 expression in human myeloid cell differentiation is regulated by sialidase activity. Nat. Chem. Biol. 4, 751–757 (2008).
Article CAS PubMed PubMed Central Google Scholar
Clark, E. A. & Lane, P. J. L. Regulation of human B-cell activation and adhesion. Annu. Rev. Immunol. 9, 97–127 (1991).
Article CAS PubMed Google Scholar
Sakaguchi, S., Sakaguchi, N., Asano, M., Itoh, M. & Toda, M. Immunologic self-tolerance maintained by activated T cells expressing IL-2 receptor alpha-chains (CD25). Breakdown of a single mechanism of self-tolerance causes various autoimmune diseases. J. Immunol. 155, 1151–1164 (1995).
CAS PubMed Google Scholar
Loh, P.-R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 47, 284–290 (2015).
Article CAS PubMed PubMed Central Google Scholar
International HapMap 3 Consortium. Integrating common and rare genetic variation in diverse human populations. Nature 467, 52–58 (2010).
Article Google Scholar
Li, Y., Willer, C. J., Ding, J., Scheet, P. & Abecasis, G. R. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet. Epidemiol. 34, 816–834 (2010).
Article PubMed PubMed Central Google Scholar
Kanai, M., Tanaka, T. & Okada, Y. Empirical estimation of genome-wide significance thresholds based on the 1000 Genomes Project data set. J. Hum. Genet. 61, 861–866 (2016).
Article CAS PubMed PubMed Central Google Scholar
Pruim, R. J. et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central Google Scholar
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645 (2009).
Article CAS PubMed PubMed Central Google Scholar
Zheng, J. et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics 33, 272–279 (2017).
Article PubMed Google Scholar
Hirata, J. et al. Variants at HLA-A, HLA-C, and HLA-DQB1 confer risk of psoriasis vulgaris in Japanese. J. Invest. Dermatol. https://doi.org/10.1016/j.jid.2017.10.001 (2017).
Yavorska, O. O. & Burgess, S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data. Int. J. Epidemiol. 46, 1734–1739 (2017).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We acknowledge the staff of BBJ for their outstanding assistance in collecting samples and clinical information. We also thank the Tohoku Medical Megabank Project, the Japan Public Health Center–based Prospective (JPHC) Study, and the Japan Multi-Institutional Collaborative Cohort (J-MICC) Study for their invaluable contributions to the case-control studies used in this study. We thank the staff of the Japan Scoliosis Clinical Research Group (JSCRG) for their support in recruiting patients to the AIS GWAS used in this study. We are grateful to H. Finucane for helpful discussions and assistance with LD score regression analysis. This research was supported by the Tailor-Made Medical Treatment Program (the BioBank Japan Project) of the Ministry of Education, Culture, Sports, Science, and Technology (MEXT) and the Japan Agency for Medical Research and Development (AMED). The study of psychiatric disorders was supported by the Strategic Research Program for Brain Sciences (SRPBS) of AMED. M. Kanai was supported by a Nakajima Foundation Fellowship. Y.O. was supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI (grants 15H05670, 15H05907, 15H05911, 15K14429, 16H03269, and 16K15738), AMED (grants 16km0405206h0001, 16gm6010001h0001, and 17ek0410041h0001), Takeda Science Foundation, the Uehara Memorial Foundation, the Naito Foundation, Daiichi Sankyo Foundation of Life Science, and Senri Life Science Foundation.

Author information

These authors jointly supervised this work: Yukinori Okada and Yoichiro Kamatani.

Authors and Affiliations

Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka, Japan
Masahiro Kanai & Yukinori Okada
Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Masahiro Kanai, Masato Akiyama, Atsushi Takahashi, Nana Matoba, Yukinori Okada & Yoichiro Kamatani
Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Masahiro Kanai
Department of Genomic Medicine, Research Institute, National Cerebral and Cardiovascular Center, Osaka, Japan
Atsushi Takahashi
Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Yukihide Momozawa
Department of Psychiatry, Fujita Health University School of Medicine, Aichi, Japan
Masashi Ikeda & Nakao Iwata
Laboratory for Bone and Joint Diseases, RIKEN Center for Integrative Medical Sciences, Tokyo, Japan
Shiro Ikegawa
Institute of Medical Science, The University of Tokyo, Tokyo, Japan
Makoto Hirata
Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
Koichi Matsuda
RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Michiaki Kubo
Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Osaka, Japan
Yukinori Okada
Center for Genomic Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
Yoichiro Kamatani

Authors

Masahiro Kanai
View author publications
You can also search for this author inPubMed Google Scholar
Masato Akiyama
View author publications
You can also search for this author inPubMed Google Scholar
Atsushi Takahashi
View author publications
You can also search for this author inPubMed Google Scholar
Nana Matoba
View author publications
You can also search for this author inPubMed Google Scholar
Yukihide Momozawa
View author publications
You can also search for this author inPubMed Google Scholar
Masashi Ikeda
View author publications
You can also search for this author inPubMed Google Scholar
Nakao Iwata
View author publications
You can also search for this author inPubMed Google Scholar
Shiro Ikegawa
View author publications
You can also search for this author inPubMed Google Scholar
Makoto Hirata
View author publications
You can also search for this author inPubMed Google Scholar
Koichi Matsuda
View author publications
You can also search for this author inPubMed Google Scholar
Michiaki Kubo
View author publications
You can also search for this author inPubMed Google Scholar
Yukinori Okada
View author publications
You can also search for this author inPubMed Google Scholar
Yoichiro Kamatani
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

M. Kanai, M.A., M. Kubo, Y.O., and Y.K. designed the study and wrote the manuscript. K.M., M.H., and M. Kubo collected and managed the BBJ samples. Y.M. and M. Kubo performed genotyping. M. Kanai, M.A., A.T., and N.M. performed statistical analysis. S.I., M.I., and N.I. contributed to data acquisition. Y.O. and Y.K. supervised the study. All authors contributed to and approved the final version of the manuscript.

Corresponding authors

Correspondence to Yukinori Okada or Yoichiro Kamatani.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–9 and Supplementary Note 1

Life Sciences Reporting Summary

Supplementary Tables

Supplementary Tables 1–12

Supplementary Dataset 1

Manhattan, quantile–quantile, and LD score plots for the 58 quantitative traits

Supplementary Dataset 2

Regional plots for all identified trait-associated loci

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Kanai, M., Akiyama, M., Takahashi, A. et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat Genet 50, 390–400 (2018). https://doi.org/10.1038/s41588-018-0047-6

Download citation

Received: 05 June 2017
Accepted: 21 December 2017
Published: 05 February 2018
Issue Date: March 2018
DOI: https://doi.org/10.1038/s41588-018-0047-6

Subjects

Abstract

Similar content being viewed by others

Main

Results

Genome-wide association analysis of 58 quantitative traits

Pleiotropy of top associated quantitative trait loci

Polygenic correlations across quantitative traits

Genetic correlations among quantitative traits and diseases

Shared cell-type specificity among human complex traits

Discussion

URLs

Methods

Subjects

Phenotype

Genotyping and imputation

Genome-wide association analysis

Pleiotropy analysis

Additional GWAS results for anthropometric traits and diseases in Japanese subjects

Genetic correlation

Mendelian randomization

Partitioning heritability

Life Sciences Reporting Summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links