Discovery and Fine Mapping of Serum Protein Loci through Transethnic Meta-analysis

Nora Franceschini; Frank JA van Rooij; Bram P Prins; Mary F Feitosa; Mahir Karakas; John H Eckfeldt; Aaron R Folsom; Jeffrey Kopp; Ahmad Vaez; Jeanette S Andrews; Jens Baumert; Vesna Boraska; Linda Broer; Caroline Hayward; Julius S Ngwa; Yukinori Okada; Ozren Polasek; Harm-Jan Westra; Ying A Wang; Fabiola Del Greco M; Nicole L Glazer; Karen Kapur; Ido P Kema; Lorna M Lopez; Arne Schillert; Albert V Smith; Cheryl A Winkler; Lina Zgaga; The LifeLines Cohort Study; Stefania Bandinelli; Sven Bergmann; Mladen Boban; Murielle Bochud; YD Chen; Gail Davies; Abbas Dehghan; Jingzhong Ding; Angela Doering; J Peter Durda; Luigi Ferrucci; Oscar H Franco; Lude Franke; Grog Gunjaca; Albert Hofman; Fang-Chi Hsu; Ivana Kolcic; Aldi Kraja; Michiaki Kubo; Karl J Lackner; Lenore Launer; Laura R Loehr; Guo Li; Christa Meisinger; Yusuke Nakamura; Christine Schwienbacher; John M Starr; Atsushi Takahashi; Vesela Torlak; André G Uitterlinden; Veronique Vitart; Melanie Waldenberger; Philipp S Wild; Mirna Kirin; Tanja Zeller; Tatijana Zemunik; Qunyuan Zhang; Andreas Ziegler; Stefan Blankenberg; Eric Boerwinkle; Ingrid B Borecki; Harry Campbell; Ian J Deary; Timothy M Frayling; Christian Gieger; Tamara B Harris; Andrew A Hicks; Wolfgang Koenig; Christopher J O’Donnell; Caroline S Fox; Peter P Pramstaller; Bruce M Psaty; Alex P Reiner; Jerome I Rotter; Igor Rudan; Harold Snieder; Toshihiro Tanaka; Cornelia M van Duijn; Peter Vollenweider; Gerard Waeber; James F Wilson; Jacqueline CM Witteman; Bruce HR Wolffenbuttel; Alan F Wright; Qingyu Wu; Yongmei Liu; Nancy S Jenny; Kari E North; Janine F Felix; Behrooz Z Alizadeh; L Adrienne Cupples

doi:10.1016/j.ajhg.2012.08.021

. 2012 Oct 5;91(4):744–753. doi: 10.1016/j.ajhg.2012.08.021

Discovery and Fine Mapping of Serum Protein Loci through Transethnic Meta-analysis

Nora Franceschini ^1,^68,^∗, Frank JA van Rooij ^2,^3,⁶⁸, Bram P Prins ^4,⁶⁸, Mary F Feitosa ^5,⁶⁸, Mahir Karakas ^6,⁶⁸, John H Eckfeldt ⁷, Aaron R Folsom ⁸, Jeffrey Kopp ⁹, Ahmad Vaez ⁴, Jeanette S Andrews ¹⁰, Jens Baumert ¹¹, Vesna Boraska ¹², Linda Broer ^2,³, Caroline Hayward ¹³, Julius S Ngwa ¹⁴, Yukinori Okada ^15,¹⁶, Ozren Polasek ¹⁷, Harm-Jan Westra ¹⁸, Ying A Wang ^14,¹⁹, Fabiola Del Greco M ²⁰, Nicole L Glazer ²¹, Karen Kapur ^22,³⁴, Ido P Kema ²³, Lorna M Lopez ^24,²⁵, Arne Schillert ²⁶, Albert V Smith ^27,²⁸, Cheryl A Winkler ²⁹, Lina Zgaga ^30,³¹; The LifeLines Cohort Study³², Stefania Bandinelli ³³, Sven Bergmann ^22,³⁴, Mladen Boban ³⁵, Murielle Bochud ³⁶, YD Chen ³⁷, Gail Davies ²⁵, Abbas Dehghan ^2,³, Jingzhong Ding ³⁸, Angela Doering ¹¹, J Peter Durda ³⁹, Luigi Ferrucci ⁴⁰, Oscar H Franco ^2,³, Lude Franke ¹⁸, Grog Gunjaca ³³, Albert Hofman ^2,³, Fang-Chi Hsu ¹⁰, Ivana Kolcic ¹⁷, Aldi Kraja ⁵, Michiaki Kubo ⁴¹, Karl J Lackner ⁴², Lenore Launer ⁴³, Laura R Loehr ¹, Guo Li ⁴⁴, Christa Meisinger ¹¹, Yusuke Nakamura ⁴⁵, Christine Schwienbacher ^20,⁴⁶, John M Starr ^24,⁴⁷, Atsushi Takahashi ¹⁵, Vesela Torlak ⁴⁸, André G Uitterlinden ^2,³, Veronique Vitart ¹³, Melanie Waldenberger ⁴⁹, Philipp S Wild ⁵⁰, Mirna Kirin ³⁰, Tanja Zeller ⁵¹, Tatijana Zemunik ¹², Qunyuan Zhang ⁵, Andreas Ziegler ²⁶, Stefan Blankenberg ⁵¹, Eric Boerwinkle ⁵², Ingrid B Borecki ⁵, Harry Campbell ³⁰, Ian J Deary ^24,²⁵, Timothy M Frayling ⁵³, Christian Gieger ¹¹, Tamara B Harris ⁴³, Andrew A Hicks ²⁰, Wolfgang Koenig ⁶, Christopher J O’Donnell ^54,⁵⁵, Caroline S Fox ^54,⁵⁶, Peter P Pramstaller ^20,⁵⁷, Bruce M Psaty ^58,⁵⁹, Alex P Reiner ⁶⁰, Jerome I Rotter ⁶¹, Igor Rudan ³⁰, Harold Snieder ⁴, Toshihiro Tanaka ⁶², Cornelia M van Duijn ^2,³, Peter Vollenweider ⁶³, Gerard Waeber ⁶³, James F Wilson ³⁰, Jacqueline CM Witteman ^2,³, Bruce HR Wolffenbuttel ⁶⁴, Alan F Wright ¹³, Qingyu Wu ⁶⁵, Yongmei Liu ⁶⁶, Nancy S Jenny ³⁹, Kari E North ¹, Janine F Felix ^2,^3,⁶⁸, Behrooz Z Alizadeh ^4,⁶⁸, L Adrienne Cupples ^14,^54,⁶⁸, John RB Perry ^53,⁶⁸, Andrew P Morris ^67,^68,^∗∗

¹Department of Epidemiology, University of North Carolina, Chapel Hill, NC 27599, USA

²ErasmusAGE and Department of Epidemiology, Erasmus University Medical Center, P.O. Box 2040, 3000 CA Rotterdam, the Netherlands

³Netherlands Consortium for Healthy Aging, Netherlands Genomics Initiative, P.O. Box 9600, 2300 RC Leiden, the Netherlands

⁴Department of Epidemiology, University Medical Center Groningen, University of Groningen, P.O. Box 30006, 9700 RB Groningen, the Netherlands

⁵Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, MO 63108, USA

⁶Department of Internal Medicine II—Cardiology, University of Ulm Medical Center, Albert-Einstein-Allee 23, 89081 Ulm, Germany

⁷Department of Laboratory Medicine and Pathology, University of Minnesota, 420 Delaware Street SE, Minneapolis, MN 55455, USA

⁸Division of Epidemiology and Community Health, University of Minnesota, West Bank Office Building, 1300 S Second Street, Suite 300, Minneapolis, MN 55454-1015 MN, USA

⁹National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 31 Center Drive, MSC 2560, Bethesda, MD 20892-2560, USA

¹⁰Department of Biostatistical Sciences, Wake Forest University School of Medicine, Medical Center Boulevard, Winston-Salem, NC 27157-1063, USA

¹¹Institute of Epidemiology II, Helmholtz Zentrum München, German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764 Neuherberg, Germany

¹²Department of Medical Biology, University of Split School of Medicine, Šoltanska 2, 21000 Split, Croatia

¹³Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, UK

¹⁴Department of Biostatistics, Boston University School of Public Health, Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115, USA

¹⁵Laboratory for Statistical Analysis, Center for Genomic Medicine (CGM), RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan

¹⁶Department of Allergy and Rheumatology, Graduate School of Medicine, University of Tokyo, 7-3-1, Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

¹⁷Department of Public Health, University of Split School of Medicine, Šoltanska 2, 21000 Split, Croatia

¹⁸Department of Genetics, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713 GZ Groningen, the Netherlands

¹⁹Novartis Institutes for BioMedical Research, 250 Massachusetts Avenue, Cambridge, MA 02139, USA

²⁰Center for Biomedicine, European Academy Bozen/Bolzano (EURAC, affiliated institute of the University of Lübeck, Ratzeburger Allee 160, 23562 Lübeck, Germany), Viale Druso, 1 / Drususallee 1, 39100 Bolzano/Bozen, Italy

²¹Section of Preventive Medicine and Epidemiology, Boston University School of Medicine, 801 Massachusetts Ave, Suite 470, Boston, MA 02118, USA

²²Department of Medical Genetics, University Hospital Center, University of Lausanne, Rue du Bugnon 27, 1005 Lausanne, Switzerland

²³Department of Laboratory Medicine, University Medical Center Groningen, University of Groningen, P.O. Box 30001, 9700 RB Groningen, the Netherlands

²⁴Centre for Cognitive Ageing and Cognitive Epidemiology, The University of Edinburgh, 7 George Square, Edinburgh EH8 9JZ, UK

²⁵Department of Psychology, The University of Edinburgh, Dugald Stewart Building, 3 Charles Street, Edinburgh EH8 9AD, UK

²⁶Institute of Medical Biometry and Statistics, University Hospital Schleswig-Holstein, Ratzeburger Allee 160, Haus 4, 23538 Lübeck, Germany

²⁷Icelandic Heart Association, Holtasmari 1, IS-201Kopavogur, Iceland

²⁸Faculty of Medicine, University of Iceland, Menntavegi 1, 101 Reykjavík, Iceland

²⁹Molecular Genetics Epidemiology Section, National Institute of Cancer, National Institutes of Health, Building 560, Room 11-64, Frederick National Lab, Frederick, MD 21702-1201, USA

³⁰Centre for Population Health Sciences and Institute of Genetics and Molecular Medicine, College of Medicine and Veterinary Medicine, The University of Edinburgh, The Queen’s Medical Research Institute, 47 Little France Crescent, Edinburgh EH16 4TJ, UK

³¹Department of Medical Statistics, Epidemiology and Medical Informatics, Medical School, University of Zagreb, 10 000 ZAGREB, Šalata 3, Croatia

³²University Medical Center Groningen, University of Groningen, P.O. Box 30006, 9700 RB Groningen, the Netherlands

³³Geriatric Unit, Azienda Sanitaria di Firenze, 50125 Florence, Italy

³⁴Swiss Institute of Bioinformatics, Quartier Sorge, Batiment Genopode, 1015 Lausanne, Switzerland

³⁵Department of Pharmacology, University of Split School of Medicine, Šoltanska 2, 21000 Split, Croatia

³⁶Community Prevention Unit, University Institute of Social and Preventive Medicine, Rue du Bugnon 17, 1005 Lausanne, Switzerland

³⁷Medical Genetics Institute and Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, 8700 Beverly Boulevard, Los Angeles, CA 90048, USA

³⁸Section on Gerontology and Geriatric Medicine, Department of Internal Medicine, Wake Forest School of Medicine, Winston-Salem, NC 27157, USA

³⁹Laboratory for Clinical Biochemistry Research, Department of Pathology, University of Vermont College of Medicine, 89 Beaumont Avenue, Courtyard at Given S269, Burlington, VT 05405, USA

⁴⁰Longitudinal Studies Section, Clinical Research Branch, National Institute on Aging, 3001 Hanover Street, Baltimore, MD 21225, USA

⁴¹Laboratory for Genotyping Development, CGM, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa 230-0045, Japan

⁴²Institute for Clinical Chemistry and Laboratory Medicine, University Medical Center Mainz, Langenbeckstraße 1, 55131 Mainz, Germany

⁴³Laboratory of Epidemiology, Demography, and Biometry, National Institutes of Aging, Gateway Building, 7201 Wisconsin Avenue, Bethesda, MD 20892-9205, USA

⁴⁴Cardiovascular Health Research Unit, University of Washington, Metropolitan Park East Tower, 1730 Minor Ave, Suite 1360, Seattle, WA 98101, USA

⁴⁵Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan

⁴⁶Department of Experimental and Diagnostic Medicine, University of Ferrara, 44121 Ferrara, Italy

⁴⁷Alzheimer Scotland Dementia Research Centre, Department of Psychology, The University of Edinburgh, Room G24, 7 George Square, Edinburgh EH8 9JZ, UK

⁴⁸University Hospital Split, Šoltanska 2, 21000 Split, Croatia

⁴⁹Research Unit of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764 Neuherberg, Germany

⁵⁰Department of Medicine II, University Medical Center Mainz, Langenbeckstraße 1, 55131 Mainz, Germany

⁵¹University Heart Center Hamburg, Clinic for General and Interventional Cardiology, Martinistraße 52, 20246 Hamburg, Germany

⁵²Center for Human Genetics and Division of Epidemiology, School of Public Health, University of Houston, 1200 Herman Pressler, Houston, TX 77030, USA

⁵³Genetics of Complex Traits, Peninsula Medical School, University of Exeter, Barrack Road, Exeter EX2 5DW, UK

⁵⁴National Heart, Lung, and Blood Institute (NHLBI) Framingham Heart Study, Division of Intramural Research, 73 Mt. Wayte Avenue, Suite 2, Framingham, MA 01702-5827, USA

⁵⁵Cardiology Division, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, 55 Fruit Street, Boston, MA 02114, USA

⁵⁶Center for Populations Studies, NHLBI, 73 Mt. Wayte Ave, Suite 2, Framingham, MA 01702-5827, USA

⁵⁷Department of Neurology, General Central Hospital, 39100 Bolzano, Italy

⁵⁸Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Services, University of Washington, Metropolitan Park East Tower, 1730 Minor Ave, Suite 1360, Seattle, WA 98101, USA

⁵⁹Group Health Research Institute, Group Health Cooperative, 1730 Minor Ave, Suite 1600, Seattle, WA 98101-1448, USA

⁶⁰Department of Epidemiology, School of Public Health, University of Washington, Box 357236, Seattle, WA 98195, USA

⁶¹Medical Genetics Institute, Cedars-Sinai Medical Center, 8700 Beverly Boulevard, Suite 400, Los Angeles, CA 90048, USA

⁶²Laboratory for Cardiovascular Diseases, CGM, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa 230-0045, Japan

⁶³Internal Medicine Department, Lausanne University Hospital, Rue du Bugnon 46, 1011 Lausanne, Switzerland

⁶⁴Department of Endocrinology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9713 GZ Groningen, the Netherlands

⁶⁵Molecular Cardiology/Nephrology & Hypertension Lerner Research Institute, Cleveland Clinic, 9500 Euclid Avenue, Cleveland, OH 44195, USA

⁶⁶Department of Epidemiology and Prevention, Wake Forest University School of Medicine, Medical Center Boulevard, Winston-Salem, NC 27157-1063, USA

⁶⁷Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK

^∗

Corresponding author noraf@unc.edu

^∗∗

Corresponding author amorris@well.ox.ac.uk

⁶⁸

These authors contributed equally to this work

PMCID: PMC3484648 PMID: 23022100

Abstract

Many disorders are associated with altered serum protein concentrations, including malnutrition, cancer, and cardiovascular, kidney, and inflammatory diseases. Although these protein concentrations are highly heritable, relatively little is known about their underlying genetic determinants. Through transethnic meta-analysis of European-ancestry and Japanese genome-wide association studies, we identified six loci at genome-wide significance (p < 5 × 10⁻⁸) for serum albumin (HPN-SCN1B, GCKR-FNDC4, SERPINF2-WDR81, TNFRSF11A-ZCCHC2, FRMD5-WDR76, and RPS11-FCGRT, in up to 53,190 European-ancestry and 9,380 Japanese individuals) and three loci for total protein (TNFRS13B, 6q21.3, and ELL2, in up to 25,539 European-ancestry and 10,168 Japanese individuals). We observed little evidence of heterogeneity in allelic effects at these loci between groups of European and Japanese ancestry but obtained substantial improvements in the resolution of fine mapping of potential causal variants by leveraging transethnic differences in the distribution of linkage disequilibrium. We demonstrated a functional role for the most strongly associated serum albumin locus, HPN, for which Hpn knockout mice manifest low plasma albumin concentrations. Other loci associated with serum albumin harbor genes related to ribosome function, protein translation, and proteasomal degradation, whereas those associated with serum total protein include genes related to immune function. Our results highlight the advantages of transethnic meta-analysis for the discovery and fine mapping of complex trait loci and have provided initial insights into the underlying genetic architecture of serum protein concentrations and their association with human disease.

Main Text

Albumin, the major plasma protein, transports endogenous and exogenous compounds such as nutrients, hormones, metabolic catabolites, and drugs and maintains intravascular volume by generating oncotic pressure. Diverse conditions, including cancer, liver and kidney diseases, and acute and chronic inflammatory states, manifest reduced plasma albumin concentrations. Low plasma albumin is associated with increased risk of cardiovascular disease¹ and mortality.² Gamma globulins, the second most abundant type of plasma protein, are composed primarily of immunoglobulins (Ig), the effector arm of humoral immunity. Dysregulation of Ig may result from altered production in infectious and autoimmune diseases and in immunodeficiency syndromes and from increased loss in kidney disease.³ To date, little is known about the genetic regulation of plasma proteins, and the pathophysiologic mechanisms leading to low albumin concentrations in many acute and chronic disease conditions remain obscure. Genetic tools may allow for the discovery of pathways in the metabolism, regulation, and/or disease processes associated with changes in these proteins and may provide insights into the immune system, cancer, inflammatory diseases, and malnutrition.

Serum albumin heritability estimates range from 0.36 to 0.77 in family and twin studies.^4–7 Recent genome-wide association studies (GWASs) of populations of eastern Asian ancestry have revealed genetic loci contributing to variation in blood protein concentrations: GCKR (MIM 600842)-FNDC4 (MIM 611905) to serum albumin,⁸ TNFRSF13B (MIM 604907) to total protein,⁸ and RPS11 (MIM 180471)-FCGRT (MIM 601437) to both traits.^8,9 These associations have not been previously examined in other ancestry groups, and much of the heritability of blood protein concentrations remains unexplained. To bridge this gap in our understanding of the genetic architecture of serum protein concentrations, we began by performing a meta-analysis of European-ancestry GWASs for albumin and total protein. Subsequently, we combined the European meta-analysis with data from a GWAS of Japanese ancestry with the aim of (1) identifying additional loci through increased sample size, (2) assessing the evidence of heterogeneity in allelic effects between ethnic groups, and (3) improving the resolution of fine mapping in associated regions by leveraging the expected differences in the structure of linkage disequilibrium (LD) between diverse populations.

The European-ancestry meta-analysis consisted of 53,190 individuals (from 20 GWASs) for serum albumin and 25,539 individuals (from six GWASs) for total protein (Tables S1–S3 and Supplemental Data available online). The procedures followed were approved by the institutional review board committees and are in accordance with the ethical standards of the institutional committees on human experimentation. All participants have given informed consent. Sample and SNP quality control (QC) were undertaken within each study. Each GWAS was then imputed at up to 2.5M autosomal SNPs with the use of CEU (Utah residents with ancestry from northern and western Europe from the CEPH collection) samples from phase II of the International HapMap Project.¹⁰ Each SNP with minor allele frequency (MAF) >1% that passed QC was tested for association with serum albumin and total protein under an additive model after adjustment for study-specific covariates. The results of each GWAS were corrected for residual population structure using the genomic control inflation factor¹¹ and were combined via fixed-effect inverse-variance-weighted meta-analysis. The results of the meta-analysis were subsequently corrected by a second round of genomic control (λ_GC = 1.04 for serum albumin and λ_GC = 1.02 for total protein) to allow for population differences between studies.

The European-ancestry meta-analysis identified six genome-wide significant loci (p < 5 × 10⁻⁸) for serum albumin and two for total protein (Table 1, Figure S1). These included association signals for serum albumin at HPN (MIM 142440)-SCN1B (MIM 600235) (p = 3.3 × 10⁻¹⁵), SERPINF2 (MIM 613168)-WDR81 (MIM 614218) (p = 6.8 × 10⁻¹³), TNFRSF11A (MIM 603499)-ZCCHC2 (p = 3.9 × 10⁻⁹), and FRMD5-WDR76 (p = 2.0 × 10⁻⁸) and for total protein on chromosome 6q21.3 (p = 3.4 × 10⁻⁹). We also confirmed the associations previously reported in eastern Asian populations at GCKR-FNDC4 (p = 2.9 × 10⁻¹⁴) and RPS11-FCGRT (p = 3.2 × 10⁻⁸) for serum albumin and TNFRSF13B (p = 1.3 × 10⁻¹⁰) for total protein. Inspection of the HPN-SCN1B locus (Figure 1) provided evidence for two independent association signals for serum albumin (rs4806073, p = 3.3 × 10⁻¹⁵; rs11671010, p = 1.9 × 10⁻¹³; CEU r² = 0.02, 4.3 kb apart). To perform conditional analyses at this locus, we applied genome-wide complex trait analysis (GCTA)¹² to the results of the European-ancestry meta-analysis and individual-level genotype data from the Atherosclerosis Risk in Communities Study (8,127 European American individuals, Table S1) and confirmed that both SNPs remained genome-wide significant after adjustment for the effect of the other (rs4806073, p = 1.6 × 10⁻¹²; rs11671010, p = 1.5 × 10⁻¹¹).

Table 1.

Loci Achieving Genome-wide Significance for Serum Albumin and Total Protein in European-Ancestry Populations

Lead SNP	Chr	Position (Build 36)	Alleles		EAF	Beta	SE	p Value	Sample Size	Locus
Lead SNP	Chr	Position (Build 36)	Effect	Other	EAF	Beta	SE	p Value	Sample Size	Locus
Serum Albumin

rs4806073	19	40,247,030	C	T	0.93	0.0257	0.0033	3.3 × 10⁻¹⁵	53,187	HPN-SCN1B
rs1260326	2	27,584,444	T	C	0.41	0.0124	0.0016	2.9 × 10⁻¹⁴	53,189	GCKR-FNDC4
rs11078597	17	1,565,113	C	T	0.18	0.0205	0.0029	6.8 × 10⁻¹³	38,231	SERPINF2-WDR81
rs13381710	18	58,304,309	G	A	0.30	0.0108	0.0018	3.9 × 10⁻⁹	53,189	TNFRSF11A-ZCCHC2
rs16948098	15	42,006,899	A	G	0.06	0.0229	0.0041	1.9 × 10⁻⁸	53,189	FRMD5-WDR76
rs739347	19	54,693,197	T	C	0.89	0.0186	0.0034	3.2 × 10⁻⁸	38,231	RPS11-FCGRT

Total Protein

rs3751991	17	16,776,011	A	C	0.11	0.0377	0.0059	1.3 × 10⁻¹⁰	25,537	TNFRSF13B
rs204999	6	32,217,957	A	G	0.74	0.0251	0.0042	3.4 × 10⁻⁹	25,537	6q21.3

Open in a new tab

Genome-wide significance is defined as p < 5 × 10⁻⁸. The following abbreviations are used: Chr, chromosome; and EAF, effect allele frequency.

Signal Plot of the *HPN-SCN1B* Locus for Serum Albumin

The two panels present signal plots from the fixed-effect meta-analyses of European-ancestry individuals showing evidence for two independent associations in the region only: (A) SNPs tagged by rs4806073 and (B) SNPs tagged by rs11671010. In each panel, the lead SNP is represented by the purple circle. Each point represents a SNP plotted with their p (on a log₁₀ scale) as a function of genomic position (build 36). The color coding of all other SNPs indicates LD with the lead SNP (estimated by CEU r² from phase II HapMap): red, r² ≥ 0.8; gold, 0.6 ≤ r²<0.8; green, 0.4 ≤ r²<0.6; cyan, 0.2 ≤ r²<0.4; blue, r²<0.2; and gray, r² unknown. Recombination rates are estimated from the International HapMap Project, and gene annotations are taken from the University of California Santa Cruz genome browser.

The results of the European meta-analysis were then combined with a Japanese-ancestry GWAS (BioBank Japan Project) consisting of 9,380 individuals for serum albumin and 10,168 individuals for total protein (Tables S1–S3, Supplemental Data). Sample and SNP QC were undertaken within the BioBank Japan Project. The GWAS was imputed at up to 2.5 M autosomal SNPs with the use of Han Chinese in Beijing and Japanese in Tokyo (CHB+JPT) samples from phase II of the International HapMap Project.¹⁰ Each SNP with MAF >1% that passed QC was then tested for association with serum albumin and total protein in the BioBank Japan Project under an additive model after adjustment for age and sex, and the results were corrected for residual population structure with the genomic control inflation factor (λ_GC = 0.98 for serum albumin and λ_GC = 1.08 for total protein). The European meta-analysis and the BioBank Japan Project GWAS were then combined via transethnic meta-analysis implemented with MANTRA (meta-analysis of transethnic association studies).¹³ This approach has the advantage of allowing for heterogeneity in allelic effects between ancestry groups by assigning studies to clusters according to a Bayesian partition model of similarity in terms of their allele frequency profile. Studies assigned to the same cluster have the same allelic effect. However, each cluster can have different allelic effects. Fixed-effect meta-analysis is thus equivalent to a Bayesian partition model with a single cluster of studies.

We observed strong evidence of association, as defined by a log₁₀ Bayes factor (BF) of 5 (equivalent to prior odds of association of any SNP with either trait of 1:100,000),¹⁴ at all identified loci for both traits (Table 2, Figure S2). These loci included RPS11-FCGRT for total protein, which was previously observed in eastern Asian populations but not at genome-wide significance in our European-ancestry meta-analysis. The only exception was at the FRMD5-WDR76 locus (log₁₀BF = 4.79), where the lead SNP from the European meta-analysis (rs16948098) was not observed in the Japanese GWAS and is monomorphic in eastern Asian (CHB and JPT) HapMap populations.¹⁰ Using the threshold of log₁₀BF > 5 for strong evidence of association, we identified two additional “potential” loci for serum albumin and four for total protein (Table 2).

Table 2.

Loci with Strong Evidence of Association with Serum Albumin and Total Protein after Transethnic MANTRA Analysis of European-Ancestry and Japanese GWASs

Lead SNP	Chr	Position (Build 36)	Alleles		European Ancestry GWAS Meta-analysis					Japanese GWAS					MANTRA Transethnic Meta-analysis		Locus
Lead SNP	Chr	Position (Build 36)	Effect	Other	EAF	Beta	SE	p Value	Sample Size	EAF	Beta	SE	p Value	Sample Size	log₁₀BF Association	log₁₀BF Heterogeneity	Locus
Serum Albumin Established Loci

rs1260326	2	27,584,444	T	C	0.41	0.0124	0.0016	2.9 × 10⁻¹⁴	53,189	0.56	0.0270	0.0050	2.2 × 10⁻⁸	9,380	17.01	0.28	GCKR-FNDC4
rs4806073	19	40,247,030	C	T	0.93	0.0257	0.0033	3.3 × 10⁻¹⁵	53,187	0.92	0.0380	0.0090	1.3 × 10⁻⁵	9,380	15.81	−0.08	HPN-SCN1B
rs11078597	17	1,565,113	C	T	0.18	0.0205	0.0029	6.8 × 10⁻¹³	38,231	0.18	0.0200	0.0060	1.8 × 10⁻³	9,380	12.51	0.20	SERPINF2-WDR81
rs694419	18	58,277,092	T	C	0.52	0.0093	0.0016	1.2 × 10⁻⁸	53,189	0.05	0.0180	0.0110	9.2 × 10⁻²	9,380	7.05	−0.11	TNFRSF11A-ZCCHC2
rs2280401	19	54,691,821	A	G	0.17	0.0121	0.0024	7.6 × 10⁻⁷	53,189	0.16	0.0240	0.0070	3.1 × 10⁻⁴	9,380	6.96	0.13	RPS11-FCGRT
rs16948098	15	42,006,899	A	C	0.06	0.0229	0.0041	1.9 × 10⁻⁸	53,189	0.00					4.79		FRMD5-WDR76

Serum Albumin Additional Potential Loci

rs2293579	11	47,397,334	A	G	0.40	0.0093	0.0017	8.0 × 10⁻⁸	53,189	0.26	0.0030	0.0050	5.7 × 10⁻¹	9,380	5.61	−0.05	PSMC3
rs12914385	15	76,685,778	C	T	0.61	0.0064	0.0016	8.7 × 10⁻⁵	53,189	0.70	0.0200	0.0050	1.3 × 10⁻⁴	9,380	5.07	0.12	CHRNA3- CHRNA5

Total Protein Established Loci

rs4561508	17	16,789,475	T	C	0.11	0.0360	0.0060	1.3 × 10⁻⁹	25,534	0.37	0.0470	0.0700	2.0 × 10⁻¹¹	10,168	16.25	−0.00	TNFRSF13B
rs204999	6	32,217,957	A	G	0.74	0.0250	0.0040	3.4 × 10⁻⁹	25,537	0.94	0.0420	0.0130	1.7 × 10⁻³	10,168	9.01	0.10	6q21.3
rs2280401	19	54,691,821	A	G	0.16	0.0120	0.0050	1.5 × 10⁻²	25,537	0.16	0.0500	0.0090	6.5 × 10⁻⁸	10,168	5.19	0.96	RPS11-FCGRT

Total Protein Additional Potential Loci

rs3777200	5	95,260,547	T	C	0.27	0.0180	0.0040	1.9 × 10⁻⁵	25,524	0.30	0.0290	0.0070	1.1 × 10⁻⁴	10,168	6.57	−0.14	ELL2
rs1260326	2	27,584,444	T	C	0.44	0.0150	0.0040	1.1 × 10⁻⁴	25,537	0.56	0.0310	0.0070	3.7 × 10⁻⁶	10,168	5.93	−0.02	GCKR-FNDC4
rs2675609	10	63,306,537	T	C	0.39	0.0090	0.0040	1.4 × 10⁻²	25,537	0.43	−0.0360	0.0070	1.7 × 10⁻⁶	10,168	5.81	6.79	ARID5B
rs10097731	8	82,190,227	T	G	0.15	0.0230	0.0050	1.1 × 10⁻⁵	25,537	0.17	0.0310	0.0090	5.9 × 10⁻⁴	10,168	5.67	0.19	PAG1

Open in a new tab

Strong evidence is defined as log₁₀ BF > 5. The following abbreviations are used: Chr, chromosome; GWAS, genome-wide association study; MANTRA, meta-analysis of transethnic association studies; EAF, effect allele frequency; and BF, Bayes factor.

MANTRA revealed little evidence of heterogeneity in allelic effects between European-ancestry and Japanese studies at the majority of the serum albumin and total protein loci (Table 2). The extent of heterogeneity was assessed through comparison of association BF under a Bayesian partition model wherein the number of clusters of studies is unrestricted to that wherein there is a single cluster, the latter corresponding to homogeneous allelic effects across all ancestry groups. Subsequent fixed-effect inverse-variance-weighted meta-analysis across groups of European and Japanese ancestry (Table S4) revealed one additional signal for total-protein mapping to ELL2 (MIM 601874, p = 1.1 × 10⁻⁸), although none of the other potential MANTRA loci showed genome-wide significance (p < 5 × 10⁻⁸). Among these potential loci, however, there was strong evidence of heterogeneity at ARID5B (MIM 608538) for total protein (MANTRA log₁₀BF in favor of heterogeneity of 6.79). The lead SNP at this locus (rs2675609) was strongly associated with total protein only in the Japanese GWAS (p = 1.7 × 10⁻⁶, compared with p = 0.014 in the European meta-analysis), and the allelic effects were in opposite directions in the two ancestry groups (Table 2). Interestingly, the effect-allele frequency is similar in European-ancestry and Japanese GWASs, and there is little evidence of variation in LD structure between CEU and CHB+JPT reference haplotypes from the 1000 Genomes Project¹⁵ (Figure S3). Although intrastudy phenotypic variation in total protein concentrations (such as Ig, which is not available for analyses here) might contribute to these apparent transethnic differences in allelic effects, further investigation is required to fully elucidate the source of heterogeneity between ancestry groups.

To assess the improvement in fine-mapping resolution due to transethnic meta-analysis in serum albumin and total protein loci, we defined “credible sets” of SNPs (J.B. Maller, personal communication) with the strongest signals of association and, hence, most likely to be causal (or tagging an unobserved causal variant), on the basis of European-ancestry GWASs only and then after inclusion of the Japanese study. At each locus, defined by the genomic region 500 kb up and downstream of the lead SNP, we calculated the posterior probability that the j^th SNP is “causal” (or tags an unobserved causal variant) by

ϕ_{j} = \frac{B F_{j}}{\sum_{k} B F_{k}} .

In this expression, BF_j denotes the BF in favor of association of the j^th SNP from the transethnic MANTRA analysis, and the summation in the denominator is over all SNPs passing QC across the locus (J.B. Maller, personal communication). A 100ω% credible set at the locus was then constructed through (1) ranking all SNPs according to their BF and (2) combining ranked SNPs until their cumulative posterior probability exceeded ω. Using this definition, we observed improved resolution, in terms of the number of SNPs and the genomic interval covered by the credible set, at HPN-SCN1B, TNFRSF11A-ZCCHC2, and RPS11-FCGRT for serum albumin and at TNFRSF13B and the 6q21.3 locus for total protein (Figure 2, Table S5). The most striking improvements in resolution were observed at the 6q21.3 locus for total protein, wherein the 99% credible set was reduced from 14 SNPs (covering 346 kb), to just three (covering 37 kb). Furthermore, after transethnic meta-analysis, the posterior probability that the lead SNP was causal (or tagged an unobserved causal variant) was more than 95% at GCKR-FNDC4 and SERPINF2-WDR81 for serum albumin and at TNFRSF13B and the 6q21.3 locus for total protein.

Fine Mapping of the 6q21.3 Locus for Total Protein

The two panels present signal plots for the MANTRA association signal (A) after transethnic meta-analysis of European-ancestry and Japanese GWASs and (B) after meta-analysis of the European ancestry GWAS only. Each point represents a SNP passing QC in our MANTRA analysis, plotted with their BF (on a log₁₀ scale) as a function of genomic position (build 36). In each panel, the lead SNP is represented by the purple circle. The color coding of all other SNPs indicates LD with the lead SNP (estimated by CEU r² from phase II HapMap): red, r² ≥ 0.8; gold, 0.6 ≤ r²<0.8; green, 0.4 ≤ r²<0.6; cyan, 0.2 ≤ r²<0.4; blue, r²<0.2; and gray, r² unknown. Recombination rates are estimated from the International HapMap Project and gene annotations are taken from the University of California Santa Cruz genome browser. In each panel, the gray-shaded regions correspond to the genomic interval covered by a 99% credible set of SNPs.

Two of the serum albumin loci, HPN-SCN1B and RPS11-FCGRT, can be validated by existing mouse models. The lead SNP at the HPN-SCN1B locus maps to an intron of HPN, a gene encoding hepsin, a membrane-bound serine protease that has substrate specificity for basic amino acids similar to that of proalbumin processing, suggesting a physiologic role of hepsin in the cleavage of proalbumin to albumin. We compared serum protein concentrations between hepsin knockout (KO) mice and wild-type litter mates¹⁶ (Figure 3) using blood samples collected from the inferior vena cava and analyzed by Consolidated Veterinary Diagnostics (West Sacramento, CA, USA). In hepsin^−/− mice, we observed overwhelming evidence of reduced serum albumin (p = 9.1 × 10⁻¹²) and, to a lesser extent, reduced total protein (p = 1.5 × 10⁻⁵), but not Ig. At the RPS11-FCGRT locus, KO Fcgrt mice have been previously demonstrated to manifest low serum albumin and low serum gamma Ig concentrations.^17,18 Furthermore, in humans, two siblings with genetic deficiency of FcRn due to lack of the β2 microglobumin component have manifested reduced serum albumin and gamma Ig concentrations.¹⁹ FCGRT encodes the heavy alpha chain of the FcRn, which prevents lysosomal degradation of albumin and Ig in lysosomes and thereby extends their serum half-life.¹⁷

Serum Albumin, Total Protein, and Serum Globulin Concentrations and Albumin-to-Ig Ratio in Wild-Type and *hepsin*^−/− Mice

Data are presented as mean ± SD, and the number of mice in each experimental group is shown in parentheses. Results for serum albumin are shown in (A); for total protein in (B); for serum globulin in (C); and the albumin-to-Ig ratio (A/Ig ratio) in wild-type (WT) and *hepsin*^−/− (KO) mice in (D). Statistical differences are shown by p values. Note the significantly lower serum albumin concentrations, but not Ig concentrations, in KO mice compared to WT mice.

To gain insights into the possible functional role of other serum albumin and total protein loci, we began by performing expression quantitative trait locus (eQTL) mapping using data derived from 1,469 whole blood samples.²⁰ Gene expression levels were measured from peripheral blood and assayed in 1,240 individuals with Illumina HT12 v3 and in 229 individuals with Illumina H8 v2 BeadChip arrays. Both sets were independently quantile-normalized after log₂ transformation and subsequently corrected for 50 principal components obtained from the gene expression probe covariance matrix. To integrate both data sets, genotype data were imputed up to 2.5M autosomal SNPs with the use of CEU samples from phase II of the International HapMap Project.¹⁰ SNPs of low frequency (MAF < 5%) or with deviation from Hardy-Weinberg equilibrium (p < 10⁻⁴) were excluded from the subsequent analysis. Cis-eQTL effects (within 1 Mb of the probe) were determined with Spearman’s ranked correlation, and meta-analysis between the two data sets was performed with the use of weighted Z scores. The false discovery rate (FDR) was then assessed by permutation. Using this approach, we mapped lead SNPs at four of the identified loci (GCKR-FNDC4, SERPINF2-WDR81, and RPS11-FCGRT for serum albumin; 6q21.3 for total protein) to cis expression levels of 18 genes (Table S6). The strongest associations were observed for expression of NOSIP (p = 2.4 × 10⁻¹⁷ at RPS11-FCGRT) and HLA-DQA1 (MIM 146880) /HLA-DQA2 (MIM 613503, p = 9.1 × 10⁻³⁶ at 6q21.3). NOSIP (nitric oxide synthase interacting protein) inhibits endothelial nitric oxide synthesis, whereas HLA-DQA1/2 is a human leukocyte antigen (HLA) class II antigen with an immune system role related to processing and presentation of antigen peptides.

We noticed that the lead SNP at the GCKR-FNDC4 locus (rs1260326, c.1337T>C [p. Leu446Pro]; RefSeq NM_001486.3) is a GCKR missense mutation with moderate predicted functional impact by snpEff, and has been previously associated with several metabolic traits, as well as kidney, liver, and hematologic phenotypes (Table S7). We used data from the 1000 Genomes Project¹⁵ to search for additional coding variants with predicted function in strong LD (r² > 0.5 in 283 individuals of European ancestry) with lead SNPs at our identified serum albumin and total protein loci (Table S8). Across serum albumin loci, two nonsynonymous SNPs mapped to SERPINF2, a gene encoding an inhibitor of plasmin which degrades plasma fibrin and other proteins, and one nonsynonymous SNP mapped to CHRNA5 (MIM 118505), a nicotinic acetylcholine receptor gene associated with smoking behavior^21,22 and lung cancer.²³ For total protein, nonsynonymous SNPs mapped to TNFRSF13B and ELL2, and to PPT2 (MIM 603298) and EGFL8 (MIM 609897) at the 6q21.3 locus. Mutations in TNFRSF13B cause immunodeficiency common variable type 2 (MIM 240500), characterized by hypogammaglobulinemia and recurrent bacterial infections due to failure of β cell differentiation and impaired production of Ig. They also cause selective IgA deficiency 2 (MIM 609529), the most common primary immunodeficiency, affecting 1 in 600 individuals in the western world. ELL2 product directs Ig secretion in plasma cells, and the 6q21.3 major histocompatibility complex class III region encompasses a number of genes involved in autoimmunity, inflammation, and complement proteins. Interrogation of the National Human Genome Research Institute (NHGRI) GWAS catalog²⁴ highlighted that lead SNPs at ten of the identified loci have themselves been reported or are in LD (r² > 0.5 in 283 individuals of European ancestry) with those disclosed, for a diverse range of human complex traits (Table S7) but are enriched for metabolic phenotypes that are associated with, or are direct products of, protein metabolism.

Finally, we used the human interactome database (Cytoscape) to construct an interaction network consisting of 250 proteins that directly interact with genes in the identified serum albumin and total protein loci reported in Table 2. For identifying molecular complexes within this first-degree interaction network, cluster analyses were performed with the FAG-EC algorithm, implemented in the ClusterViz plug-in, with standard settings applied. In total, 16 distinct clusters were identified, including three large complexes (Figure S4) that were carried forward for further analysis. Functional enrichment analyses within these clusters were performed following defined pathways from BioCarta, KEGG, PANTHER, and Reactome via the Database for Annotation, Visualization, and Integrated Discovery (DAVID). The most significantly enriched clusters from protein interaction network analyses incorporated ribosomal functioning and protein translation, proteasomal protein degradation, and immune-response signaling (Table S9). As a complementary approach, we applied an implementation of gene-set enrichment analysis (MAGENTA²⁵) to identify whether defined biological pathways from BioCarta, Gene Ontology, Ingenuity, KEGG, PANTHER, and Reactome were enriched in the leading-edge fraction of the meta-analysis. In brief, gene association p values were calculated on the basis of meta-analysis summary statistics for SNPs within a 110-kb-upstream and 40-kb-downstream window. These gene scores were then corrected for gene size, number of SNPs, and the LD between them, and subsequently ranked by p value. Enrichment in the 75^th and 95^th percentiles was assessed for significance by comparison with 10,000 randomly generated pathways. Using an FDR threshold of 5%, we observed significant overrepresentation of genes assigned to three pathways: RNA- (FDR = 0.044), sensory-perception- (FDR = 0.027), and protein-trafficking-related pathways (FDR = 0.043-0.044) (Table S10).

In conclusion, we have identified six loci for serum albumin concentration and three for total protein at genome-wide significance. These loci harbor genes that fall across a diverse range of biological pathways, including those involved in biomarkers, immune regulation, and disease, but are enriched for those relevant to the synthesis and degradation of serum protein. By combining GWAS data from European and Japanese populations, we observed some evidence of heterogeneity in allelic effects between ancestry groups and have demonstrated substantial improvements in the localization of potential causal variants. Taken together, our results highlight the advantages of transethnic meta-analysis for the discovery and fine mapping of complex trait loci and provide initial insights into the underlying genetic architecture of serum protein concentrations and their association with human disease.

Acknowledgments

A.P.M acknowledges financial support from the Wellcome Trust, grant numbers WT081682, WT064890, WT098017, and WT090532. N.F. acknowledges financial support from AHA 0675001N, R01HL089651, and U01HG004803.

Contributor Information

Nora Franceschini, Email: noraf@unc.edu.

Andrew P. Morris, Email: amorris@well.ox.ac.uk.

Supplemental Data

Document S1. Figures S1–S4, Tables S1–S10, and Supplemental Acknowledgments

mmc1.pdf^{(5.2MB, pdf)}

Web Resources

The URLs for data presented herein are as follows:

1000 Genomes Project, http://www.1000genomes.org/
ClusterViz plug-in, http://code.google.com/p/clusterviz-cytoscape
DAVID, http://david.abcc.ncifcrf.gov
GCTA, http://www.complextraitgenomics.com/software/gcta/
NHGRI Catalog of Published Genome-wide Association Studies, http://www.genome.gov/gwastudies/
Online Mendelian Inheritance in Man (OMIM), http://www.omim.org
SNIPPER, http://csg.sph.umich.edu/boehnke/snipper/
snpEff, http://snpeff.sourceforge.net

References

1.Nelson J.J., Liao D., Sharrett A.R., Folsom A.R., Chambless L.E., Shahar E., Szklo M., Eckfeldt J., Heiss G. Serum albumin level as a predictor of incident coronary heart disease: the Atherosclerosis Risk in Communities (ARIC) study. Am. J. Epidemiol. 2000;151:468–477. doi: 10.1093/oxfordjournals.aje.a010232. [DOI] [PubMed] [Google Scholar]
2.Goldwasser P., Feldman J. Association of serum albumin and mortality risk. J. Clin. Epidemiol. 1997;50:693–703. doi: 10.1016/s0895-4356(97)00015-2. [DOI] [PubMed] [Google Scholar]
3.Johnson A., Rohlfs E., Silverman L. WB Saunders Co.; Philadelphia, PA: 1999. Tietz Textbook of Clinical Chemistry. [Google Scholar]
4.Dal Colletto G.M., Krieger H., Magalhães J.R. Genetic and environmental determinants of 17 serum biochemical traits in Brazilian twins. Acta Genet. Med. Gemellol. (Roma) 1983;32:23–29. doi: 10.1017/s0001566000008126. [DOI] [PubMed] [Google Scholar]
5.Whitfield J.B., Martin N.G. The effects of inheritance on constituents of plasma: a twin study on some biochemical variables. Ann. Clin. Biochem. 1984;21:176–183. doi: 10.1177/000456328402100303. [DOI] [PubMed] [Google Scholar]
6.Kalousdian S., Fabsitz R., Havlik R., Christian J., Rosenman R. Heritability of clinical chemistries in an older twin cohort: the NHLBI Twin Study. Genet. Epidemiol. 1987;4:1–11. doi: 10.1002/gepi.1370040102. [DOI] [PubMed] [Google Scholar]
7.Pankow J.S., Folsom A.R., Cushman M., Borecki I.B., Hopkins P.N., Eckfeldt J.H., Tracy R.P. Familial and genetic determinants of systemic markers of inflammation: the NHLBI family heart study. Atherosclerosis. 2001;154:681–689. doi: 10.1016/s0021-9150(00)00586-4. [DOI] [PubMed] [Google Scholar]
8.Kamatani Y., Matsuda K., Okada Y., Kubo M., Hosono N., Daigo Y., Nakamura Y., Kamatani N. Genome-wide association study of hematological and biochemical traits in a Japanese population. Nat. Genet. 2010;42:210–215. doi: 10.1038/ng.531. [DOI] [PubMed] [Google Scholar]
9.Kim Y.J., Go M.J., Hu C., Hong C.B., Kim Y.K., Lee J.Y., Hwang J.Y., Oh J.H., Kim D.J., Kim N.H., MAGIC consortium Large-scale genome-wide association studies in East Asians identify new genetic loci influencing metabolic traits. Nat. Genet. 2011;43:990–995. doi: 10.1038/ng.939. [DOI] [PubMed] [Google Scholar]
10.Frazer K.A., Ballinger D.G., Cox D.R., Hinds D.A., Stuve L.L., Gibbs R.A., Belmont J.W., Boudreau A., Hardenbol P., Leal S.M., International HapMap Consortium A second generation human haplotype map of over 3.1 million SNPs. Nature. 2007;449:851–861. doi: 10.1038/nature06258. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Devlin B., Roeder K. Genomic control for association studies. Biometrics. 1999;55:997–1004. doi: 10.1111/j.0006-341x.1999.00997.x. [DOI] [PubMed] [Google Scholar]
12.Yang J., Lee S.H., Goddard M.E., Visscher P.M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Morris A.P. Transethnic meta-analysis of genomewide association studies. Genet. Epidemiol. 2011;35:809–822. doi: 10.1002/gepi.20630. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Stephens M., Balding D.J. Bayesian statistical methods for genetic association studies. Nat. Rev. Genet. 2009;10:681–690. doi: 10.1038/nrg2615. [DOI] [PubMed] [Google Scholar]
15.1000 Genomes Project Consortium A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Wu Q., Yu D., Post J., Halks-Miller M., Sadler J.E., Morser J. Generation and characterization of mice deficient in hepsin, a hepatic transmembrane serine protease. J. Clin. Invest. 1998;101:321–326. doi: 10.1172/JCI1617. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Chaudhury C., Mehnaz S., Robinson J.M., Hayton W.L., Pearl D.K., Roopenian D.C., Anderson C.L. The major histocompatibility complex-related Fc receptor for IgG (FcRn) binds albumin and prolongs its lifespan. J. Exp. Med. 2003;197:315–322. doi: 10.1084/jem.20021829. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Roopenian D.C., Christianson G.J., Sproule T.J., Brown A.C., Akilesh S., Jung N., Petkova S., Avanessian L., Choi E.Y., Shaffer D.J. The MHC class I-like IgG receptor controls perinatal IgG transport, IgG homeostasis, and fate of IgG-Fc-coupled drugs. J. Immunol. 2003;170:3528–3533. doi: 10.4049/jimmunol.170.7.3528. [DOI] [PubMed] [Google Scholar]
19.Wani M.A., Haynes L.D., Kim J., Bronson C.L., Chaudhury C., Mohanty S., Waldmann T.A., Robinson J.M., Anderson C.L. Familial hypercatabolic hypoproteinemia caused by deficiency of the neonatal Fc receptor, FcRn, due to a mutant beta2-microglobulin gene. Proc. Natl. Acad. Sci. USA. 2006;103:5084–5089. doi: 10.1073/pnas.0600548103. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Fehrmann R.S., Jansen R.C., Veldink J.H., Westra H.J., Arends D., Bonder M.J., Fu J., Deelen P., Groen H.J., Smolonska A. Trans-eQTLs reveal that independent genetic variants associated with a complex phenotype converge on intermediate genes, with a major role for the HLA. PLoS Genet. 2011;7:e1002197. doi: 10.1371/journal.pgen.1002197. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Thorgeirsson T.E., Gudbjartsson D.F., Surakka I., Vink J.M., Amin N., Geller F., Sulem P., Rafnar T., Esko T., Walter S., ENGAGE Consortium Sequence variants at CHRNB3-CHRNA6 and CYP2A6 affect smoking behavior. Nat. Genet. 2010;42:448–453. doi: 10.1038/ng.573. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Tobacco and Genetics Consortium Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nat. Genet. 2010;42:441–447. doi: 10.1038/ng.571. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Amos C.I., Wu X., Broderick P., Gorlov I.P., Gu J., Eisen T., Dong Q., Zhang Q., Gu X., Vijayakrishnan J. Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1. Nat. Genet. 2008;40:616–622. doi: 10.1038/ng.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Hindorff L.A., Sethupathy P., Junkins H.A., Ramos E.M., Mehta J.P., Collins F.S., Manolio T.A. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc. Natl. Acad. Sci. USA. 2009;106:9362–9367. doi: 10.1073/pnas.0903103106. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Segrè A.V., Groop L., Mootha V.K., Daly M.J., Altshuler D., DIAGRAM Consortium. MAGIC investigators Common inherited variation in mitochondrial genes is not enriched for associations with type 2 diabetes or related glycemic traits. PLoS Genet. 2010;6:e1001058. doi: 10.1371/journal.pgen.1001058. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S4, Tables S1–S10, and Supplemental Acknowledgments

mmc1.pdf^{(5.2MB, pdf)}

[bib1] 1.Nelson J.J., Liao D., Sharrett A.R., Folsom A.R., Chambless L.E., Shahar E., Szklo M., Eckfeldt J., Heiss G. Serum albumin level as a predictor of incident coronary heart disease: the Atherosclerosis Risk in Communities (ARIC) study. Am. J. Epidemiol. 2000;151:468–477. doi: 10.1093/oxfordjournals.aje.a010232. [DOI] [PubMed] [Google Scholar]

[bib2] 2.Goldwasser P., Feldman J. Association of serum albumin and mortality risk. J. Clin. Epidemiol. 1997;50:693–703. doi: 10.1016/s0895-4356(97)00015-2. [DOI] [PubMed] [Google Scholar]

[bib3] 3.Johnson A., Rohlfs E., Silverman L. WB Saunders Co.; Philadelphia, PA: 1999. Tietz Textbook of Clinical Chemistry. [Google Scholar]

[bib4] 4.Dal Colletto G.M., Krieger H., Magalhães J.R. Genetic and environmental determinants of 17 serum biochemical traits in Brazilian twins. Acta Genet. Med. Gemellol. (Roma) 1983;32:23–29. doi: 10.1017/s0001566000008126. [DOI] [PubMed] [Google Scholar]

[bib5] 5.Whitfield J.B., Martin N.G. The effects of inheritance on constituents of plasma: a twin study on some biochemical variables. Ann. Clin. Biochem. 1984;21:176–183. doi: 10.1177/000456328402100303. [DOI] [PubMed] [Google Scholar]

[bib6] 6.Kalousdian S., Fabsitz R., Havlik R., Christian J., Rosenman R. Heritability of clinical chemistries in an older twin cohort: the NHLBI Twin Study. Genet. Epidemiol. 1987;4:1–11. doi: 10.1002/gepi.1370040102. [DOI] [PubMed] [Google Scholar]

[bib7] 7.Pankow J.S., Folsom A.R., Cushman M., Borecki I.B., Hopkins P.N., Eckfeldt J.H., Tracy R.P. Familial and genetic determinants of systemic markers of inflammation: the NHLBI family heart study. Atherosclerosis. 2001;154:681–689. doi: 10.1016/s0021-9150(00)00586-4. [DOI] [PubMed] [Google Scholar]

[bib8] 8.Kamatani Y., Matsuda K., Okada Y., Kubo M., Hosono N., Daigo Y., Nakamura Y., Kamatani N. Genome-wide association study of hematological and biochemical traits in a Japanese population. Nat. Genet. 2010;42:210–215. doi: 10.1038/ng.531. [DOI] [PubMed] [Google Scholar]

[bib9] 9.Kim Y.J., Go M.J., Hu C., Hong C.B., Kim Y.K., Lee J.Y., Hwang J.Y., Oh J.H., Kim D.J., Kim N.H., MAGIC consortium Large-scale genome-wide association studies in East Asians identify new genetic loci influencing metabolic traits. Nat. Genet. 2011;43:990–995. doi: 10.1038/ng.939. [DOI] [PubMed] [Google Scholar]

[bib10] 10.Frazer K.A., Ballinger D.G., Cox D.R., Hinds D.A., Stuve L.L., Gibbs R.A., Belmont J.W., Boudreau A., Hardenbol P., Leal S.M., International HapMap Consortium A second generation human haplotype map of over 3.1 million SNPs. Nature. 2007;449:851–861. doi: 10.1038/nature06258. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] 11.Devlin B., Roeder K. Genomic control for association studies. Biometrics. 1999;55:997–1004. doi: 10.1111/j.0006-341x.1999.00997.x. [DOI] [PubMed] [Google Scholar]

[bib12] 12.Yang J., Lee S.H., Goddard M.E., Visscher P.M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] 13.Morris A.P. Transethnic meta-analysis of genomewide association studies. Genet. Epidemiol. 2011;35:809–822. doi: 10.1002/gepi.20630. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] 14.Stephens M., Balding D.J. Bayesian statistical methods for genetic association studies. Nat. Rev. Genet. 2009;10:681–690. doi: 10.1038/nrg2615. [DOI] [PubMed] [Google Scholar]

[bib15] 15.1000 Genomes Project Consortium A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] 16.Wu Q., Yu D., Post J., Halks-Miller M., Sadler J.E., Morser J. Generation and characterization of mice deficient in hepsin, a hepatic transmembrane serine protease. J. Clin. Invest. 1998;101:321–326. doi: 10.1172/JCI1617. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] 17.Chaudhury C., Mehnaz S., Robinson J.M., Hayton W.L., Pearl D.K., Roopenian D.C., Anderson C.L. The major histocompatibility complex-related Fc receptor for IgG (FcRn) binds albumin and prolongs its lifespan. J. Exp. Med. 2003;197:315–322. doi: 10.1084/jem.20021829. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib18] 18.Roopenian D.C., Christianson G.J., Sproule T.J., Brown A.C., Akilesh S., Jung N., Petkova S., Avanessian L., Choi E.Y., Shaffer D.J. The MHC class I-like IgG receptor controls perinatal IgG transport, IgG homeostasis, and fate of IgG-Fc-coupled drugs. J. Immunol. 2003;170:3528–3533. doi: 10.4049/jimmunol.170.7.3528. [DOI] [PubMed] [Google Scholar]

[bib19] 19.Wani M.A., Haynes L.D., Kim J., Bronson C.L., Chaudhury C., Mohanty S., Waldmann T.A., Robinson J.M., Anderson C.L. Familial hypercatabolic hypoproteinemia caused by deficiency of the neonatal Fc receptor, FcRn, due to a mutant beta2-microglobulin gene. Proc. Natl. Acad. Sci. USA. 2006;103:5084–5089. doi: 10.1073/pnas.0600548103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] 20.Fehrmann R.S., Jansen R.C., Veldink J.H., Westra H.J., Arends D., Bonder M.J., Fu J., Deelen P., Groen H.J., Smolonska A. Trans-eQTLs reveal that independent genetic variants associated with a complex phenotype converge on intermediate genes, with a major role for the HLA. PLoS Genet. 2011;7:e1002197. doi: 10.1371/journal.pgen.1002197. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] 21.Thorgeirsson T.E., Gudbjartsson D.F., Surakka I., Vink J.M., Amin N., Geller F., Sulem P., Rafnar T., Esko T., Walter S., ENGAGE Consortium Sequence variants at CHRNB3-CHRNA6 and CYP2A6 affect smoking behavior. Nat. Genet. 2010;42:448–453. doi: 10.1038/ng.573. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] 22.Tobacco and Genetics Consortium Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nat. Genet. 2010;42:441–447. doi: 10.1038/ng.571. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib23] 23.Amos C.I., Wu X., Broderick P., Gorlov I.P., Gu J., Eisen T., Dong Q., Zhang Q., Gu X., Vijayakrishnan J. Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1. Nat. Genet. 2008;40:616–622. doi: 10.1038/ng.109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib24] 24.Hindorff L.A., Sethupathy P., Junkins H.A., Ramos E.M., Mehta J.P., Collins F.S., Manolio T.A. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc. Natl. Acad. Sci. USA. 2009;106:9362–9367. doi: 10.1073/pnas.0903103106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] 25.Segrè A.V., Groop L., Mootha V.K., Daly M.J., Altshuler D., DIAGRAM Consortium. MAGIC investigators Common inherited variation in mitochondrial genes is not enriched for associations with type 2 diabetes or related glycemic traits. PLoS Genet. 2010;6:e1001058. doi: 10.1371/journal.pgen.1001058. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Discovery and Fine Mapping of Serum Protein Loci through Transethnic Meta-analysis

Nora Franceschini

Frank JA van Rooij

Bram P Prins

Mary F Feitosa

Mahir Karakas

John H Eckfeldt

Aaron R Folsom

Jeffrey Kopp

Ahmad Vaez

Jeanette S Andrews

Jens Baumert

Vesna Boraska

Linda Broer

Caroline Hayward

Julius S Ngwa

Yukinori Okada

Ozren Polasek

Harm-Jan Westra

Ying A Wang

Fabiola Del Greco M

Nicole L Glazer

Karen Kapur

Ido P Kema

Lorna M Lopez

Arne Schillert

Albert V Smith

Cheryl A Winkler

Lina Zgaga

Stefania Bandinelli

Sven Bergmann

Mladen Boban

Murielle Bochud

YD Chen

Gail Davies

Abbas Dehghan

Jingzhong Ding

Angela Doering

J Peter Durda

Luigi Ferrucci

Oscar H Franco

Lude Franke

Grog Gunjaca

Albert Hofman

Fang-Chi Hsu

Ivana Kolcic

Aldi Kraja

Michiaki Kubo

Karl J Lackner

Lenore Launer

Laura R Loehr

Guo Li

Christa Meisinger

Yusuke Nakamura

Christine Schwienbacher

John M Starr

Atsushi Takahashi

Vesela Torlak

André G Uitterlinden

Veronique Vitart

Melanie Waldenberger

Philipp S Wild

Mirna Kirin

Tanja Zeller

Tatijana Zemunik

Qunyuan Zhang

Andreas Ziegler

Stefan Blankenberg

Eric Boerwinkle

Ingrid B Borecki

Harry Campbell

Ian J Deary

Timothy M Frayling

Christian Gieger

Tamara B Harris

Andrew A Hicks

Wolfgang Koenig

Christopher J O’Donnell

Caroline S Fox