研究内容の概要

目的： 多層的オミクス解析による疾患病態の解明、日本人集団におけるGWASおよび複数集団におけるGWASメタ解析

方法： メタゲノムシークエンス、ゲノムワイド関連解析

対象： 日本人集団（95＋103名）の腸内細菌叢のメタゲノムシークエンスデータ

肺胞蛋白症患者：198名、対照者：395名のゲノムワイド関連解析データ

バイオバンク・ジャパン（179,000名）、UKバイオバンク（361,000名）、FinnGen（136,000名）の215形質のゲノムワイド関連解析データ

データID	内容	制限	公開日
JGAS000205	メタゲノム	制限公開（Type I）	2019/11/15
hum0197.v2.gwas.v1	肺胞蛋白症のGWAS	非制限公開	2020/11/27
JGAS000260	メタゲノム	制限公開（Type I）	2020/11/27
hum0197.v3.gwas.v1	215形質のGWAS	非制限公開	2021/03/22

※リリース情報はこちら

※制限公開データの利用にあたっては、利用申請が必要です。申請方法はこちら。

※論文等でデータベースからダウンロードしたデータを含む結果を公表する際には、下記文献を引用いただくか、NBDCヒトデータベースに登録されたデータを利用した旨について謝辞（Acknowledgement）に記載して下さい。記載例はこちら。

分子データ

JGAS000205


対象	日本人集団：95名
規模	メタゲノム
対象領域（Target Captureの場合）	-
Platform	Illumina [HiSeq 3000]
ライブラリソース	便より抽出したDNA
検体情報（購入の場合）	-
ライブラリ作製方法（キット名）	KAPA Hyper Prep Kit
断片化の方法	超音波断片化（Covaris）
ライブラリ構築方法	Paired-end
リード長（除：バーコード、アダプタ、プライマー、リンカー）	150 bp
Japanese Genotype-phenotype Archive Data set ID	JGAD000290
総データ量	477 GB（fastq）
コメント（利用にあたっての制限事項）	NBDC policy

hum0197.v2.gwas.v1


対象	肺胞蛋白症（ICD10：J840）：198症例対照者：395名
規模	genome wide SNPs
対象領域（Target Captureの場合）	-
Platform	Illumina [Infinium Asian Screening Array]
ソース	末梢血から抽出したDNA
検体情報（購入の場合）	-
調整試薬（キット名、バージョン）	Infinium Asian Screening Array
遺伝子型決定アルゴリズム（ソフトウェア）	GenomeStudio for genotyping, shapeit2 for haplotype phasing, and minimac3 for imputation
関連解析（ソフトウェア）	PLINK2
フィルタリング	Sample QC: We excluded samples with low genotyping call rates (call rate < 98%) and in close genetic relation (PI_HAT > 0.175). We included samples of the estimated East Asian ancestry. Variant QC: We excluded variants with (1) genotyping call rate < 98%, (2) P value for Hardy–Weinberg equilibrium < 1.0 × 10⁻⁶, and (3) minor allele count < 5, or (4) > 10% frequency difference with the imputation reference panel.
マーカー数（QC後）	12,153,232 autosomal variants and 242,876 X-chromosomal variants after QC.
NBDC Data Set ID	hum0197.v2.gwas.v1 （データのダウンロードは上記Data IDをクリックしてください） Dictionary file
総データ量	390MB for autosome (txt.gz) and 19MB for X chromosome (txt.gz)
コメント（利用にあたっての制限事項）	NBDC policy

JGAS000260


対象	日本人集団：103名
規模	メタゲノム
対象領域（Target Captureの場合）	-
Platform	Illumina [HiSeq 3000]
ライブラリソース	便より抽出したDNA
検体情報（購入の場合）	-
ライブラリ作製方法（キット名）	KAPA Hyper Prep Kit
断片化の方法	超音波断片化（Covaris）
ライブラリ構築方法	Paired-end
リード長（除：バーコード、アダプタ、プライマー、リンカー）	150 bp
Japanese Genotype-phenotype Archive Data set ID	JGAD000363
総データ量	408 GB（fastq）
コメント（利用にあたっての制限事項）	NBDC policy

hum0197.v3.gwas.v1


対象	バイオバンク・ジャパン（179,000名）、UKバイオバンク（361,000名）、FinnGen（136,000名）形質数：215
規模	genome wide SNPs
対象領域（Target Captureの場合）	-
Platform	BBJ：Illumina [HumanOmniExpressExome BeadChip、HumanOmniExpress BeadChip、HumanExome BeadChip] UK Biobank：Applied Biosystems [UK BiLEVE Axiom Array、UK Biobank Axiom Array] FinnGen：Thermo Fisher Scientific [FinnGen1 ThermoFisher Arrayなど]
ソース	末梢血から抽出したDNA
検体情報（購入の場合）	-
調整試薬（キット名、バージョン）	BBJ：HumanOmniExpressExome BeadChip、HumanOmniExpress BeadChip、HumanExome BeadChip UK Biobank：UK BiLEVE Axiom Array、UK Biobank Axiom Array FinnGen：FinnGen1 ThermoFisher Arrayなど
遺伝子型決定アルゴリズム（ソフトウェア）	BBJ：Eagle、Minimac3 UK Biobank：IMPUTE4 FinnGen：beagle4.1
関連解析（ソフトウェア）	For binary traits, SAIGE software was used with age, age2, sex, age×sex, age2×sex, and top 20 principal components as covariates. For quantitative traits (biomarkers), BOLT-LMM or plink software was used with the same covariates.
フィルタリング	BBJ：We included imputed variants with Rsq > 0.7. UK Biobank：We excluded the variants with (i) INFO score ≤ 0.8, (ii) MAF ≤ 0.0001 (except for missense and protein-truncating variants annotated by VEP, which were excluded if MAF ≤ 1 × 10-6), and (iii) PHWE ≤ 1 × 10-10. FinnGen：We excluded variants with an imputation INFO score < 0.8 or MAF < 0.0001.
マーカー数（QC後）	BBJ：13,530,797 variants UK Biobank：13,791,467 variants FinnGen：16,859,359 variants
NBDC Data Set ID	hum0197.v3.gwas.v1 （データのダウンロードは上記データIDをクリックし、遷移先のサイトの各Data set IDをクリックしてください） Dictionary file（BBJ、EUR、META）
総データ量	BBJ：~1.5G for autosome and ~33M for chrX UK Biobank：~1.5G for autosome and ~15M for chrX FinnGen：~740M for autosome and ~20M for chrX
コメント（利用にあたっての制限事項）	NBDC policy

提供者情報

研究代表者： 岡田随象

所属機関： 大阪大学大学院医学系研究科遺伝統計学

プロジェクト/研究グループ名： -

科研費/助成金（Research Project Number）：

科研費・助成金名	タイトル	研究課題番号
日本医療研究開発機構（AMED）革新的先端研究開発支援事業ソロタイプ（PRIME）	遺伝統計学が紐解く微生物叢・宿主・疾患・創薬のクロストーク	JP19gm6010001
⽇本医療研究開発機構（AMED）⾰新的先端研究開発⽀援事業ステップタイプ（FORCE）	メタゲノムワイド関連解析による疾患特異的微生物叢解明と個別化医療実装	JP20gm4010006
⽇本医療研究開発機構（AMED）難治性疾患実用化研究事業	横断的オミクス解析を駆使した肺胞蛋白症の病態解明とインシリコ・リポジショニング創薬	JP20ek0109413

	タイトル	DOI	データID
1	Metagenome-wide association study of gut microbiome revealed novel aetiology of rheumatoid arthritis in the Japanese population.	doi: 10.1136/annrheumdis-2019-215743	JGAD000290
2	Genetic determinants of risk in autoimmune pulmonary alveolar proteinosis.	doi: 10.1038/s41467-021-21011-y	hum0197.v2.gwas.v1
3	A metagenome-wide association study of gut microbiome in patients with multiple sclerosis revealed novel disease pathology.	doi: 10.3389/fcimb.2020.585973	JGAD000363
4	A global atlas of genetic associations of 220 deep phenotypes	doi: 10.1101/2020.10.23.20213652	hum0197.v3.gwas.v1

制限公開データの利用者一覧

研究代表者	所属機関	利用データID	利用期間

研究内容の概要

分子データ

提供者情報

関連論文

制限公開データの利用者一覧