NBDC Research ID: hum0103.v4

 

SUMMARY

Aims: To investigate genomic alterations of Japanese biliary tract cancers

Methods: DNAs/RNA were extracted from biliary tract cancer and paired non-cancer (normal) tissues. NGS libaraies were prepared by using TruSeq DNA Sample Prep kit (Illumina) for whole genome sequencing (WGS), Nextera Rapid Capture kit (Illumina) / Agilent SureSelect Exome V5 kit for whole exome sequencing (WES) and KAPA RNA HyperPrep Kit / TruSeq Stranded mRNA for RNA sequencing. Sequencing was perfomred by Illumina HiSeq 2000/2500 or NovaSeq 6000.

Participants/Materials: Japanese biliary tract cancer patients, normal gallbladder tissue

 

Dataset IDType of DataCriteriaRelease Date
JGAS000109

NGS (WGS)

NGS (Exome)

Controlled-access (Type I) 2018/02/27
JGAS000109 (Data addition) NGS (WGS) Controlled-access (Type I) 2019/06/21
JGAS000109 (Data addition) bam/gvcf data of NGS(WGS) Controlled-access (Type I) 2021/07/13
JGAS000389

NGS (WGS)

NGS (Exome)

NGS (RNA-seq)

Controlled-access (Type I) 2022/01/18

*Release Note

*Data users need to apply an application for Using NBDC Human Data to reach the Controlled-access Data. Learn more

 

MOLECULAR DATA

WGS (JGAS000109)

Participants/Materials

biliary tract cancer (ICD10: C22, 23, 24): 14 cases + 3 cases

          cancer tissues: 23 samples + 6 samples

          paired non-cancer tissues: 14 samples + 3 samples

Targets WGS
Target Loci for Capture Methods -
Platform Illumina [HiSeq 2000/2500, NovaSeq 6000]
Library Source DNAs extracted from cancer and paired non-cancer (normal) tissues from biliary tract cancer patients
Cell Lines -
Library Construction (kit name) TruSeq DNA Sample Prep kit
Fragmentation Methods Ultrasonic fragmentation (Covaris)
Spot Type Paired-end
Read Length (without Barcodes, Adaptors, Primers, and Linkers) 100 bp
QC

Data with bad base quality and high %GC content were removed.

Aligment:

Data matched for the following condition were removed.

- Low mapping rate

- Different insert size

- Gender information mismatch between meta-data and genotype data

- Suspected sex chromosome aberration

Genotyping:

GATK’s best practices includes a variant filtering step following Variant Quality Score Recalibration (VQSR)

- DP/GP (DP < 5, GQ < 20, DP > 60, GQ < 95 )

- Heterozygosity (F>=0.05)

- Hardy-Weinberg equilibrium (p < 10^-6)

- Repeat & Low Complexity

Principal Component Analysis (PCA):

PCA was performed with individuals included in the 1000 genomes project and outliers from Japanese cluster were removed.

 

After these filtering steps, variants located in the regions listed as the HighConfidenceRegion (Genome-In-A-Bottle project) were flagged.

Deduplication Picard 2.10.6
Calibration for re-alignment and base quality GATK 3.7
Mapping Methods BWA mem 0.7.12
Mapping Quality Reads with MAPQ< 20 were excluded at variant calling with GATK 3.7 HaplotypeCaller
Reference Genome Sequence GRCh37/hg19 (hs37d5)
Coverage (Depth) HiSeq 2000/2500: 31.8x, NovaSeq 6000: 28.0x
Detecting Methods for Variation GATK 3.7 HaplotypeCaller
SNV Numbers (after QC)

76,768,387 (Autosomal Chromosomes)

2,898,518 (X Chromosome)

INDEL Numbers (after QC)

10,202,908 (Autosomal Chromosomes)

410,435 (X Chromosome)

Japanese Genotype-phenotype Archive Dataset ID

JGAD000117 (fastq) (included in EGAS00001000678 [EGAD00001000809])

JGAD000403 (bam, vcf): Whole genome sequencing analyzed data included in the JGAD000117 were mapped to the GRCh37 reference genome sequence, and variant detection was carried out using the GATK (Genome Analysis Toolkit) standards. This project is an initiative of the GEnome Medical alliance Japan (GEM Japan, GEM-J). Lean more..

Total Data Volume 2.4 TB (fastq) + 375 GB (fastq) + 1.6 TB (bam, vcf)
Comments (Policies) NBDC policy

 

Exome (JGAS000109, JGAS000389)

Participants/Materials

biliary tract cancer (ICD10: C22, 23, 24): 81 cases

          cancer tissues: 138 samples

          paired non-cancer 128 samples

biliary tract cancer (ICD10: C221, C240, C248, C23): 29 cases (including 3 cases used for WGS)

          cancer tissues: 29 samples

          paired non-cancer tissues: 29 samples

Targets Exome
Target Loci for Capture Methods -
Platform Illumina [HiSeq 2000/2500]
Library Source DNAs extracted from cancer and paired non-cancer (normal) tissues from biliary tract cancer patients
Cell Lines -
Library Construction (kit name) Nextera Rapid Capture kit or SureSelect Exome V5 kit
Fragmentation Methods Ultrasonic fragmentation (Covaris)
Spot Type Paired-end
Read Length (without Barcodes, Adaptors, Primers, and Linkers) 93 bp or 126 bp
Japanese Genotype-phenotype Archive Dataset ID

JGAD000118

JGAD000505

Total Data Volume 1.7 + 0.6 TB (fastq)
Comments (Policies) NBDC policy

 

WGS (JGAS000389)

Participants/Materials

biliary tract cancer (ICD10: C221, C240, C248, C23): 17 cases

          cancer tissues: 17 samples

          paired non-cancer tissues: 17 samples

Targets WGS
Target Loci for Capture Methods -
Platform Illumina [HiSeq 2000/2500, NovaSeq 6000]
Library Source DNAs extracted from cancer and paired non-cancer (normal) tissues from biliary tract cancer patients
Cell Lines -
Library Construction (kit name) TruSeq DNA Sample Prep kit
Fragmentation Methods Ultrasonic fragmentation (Covaris)
Spot Type Paired-end
Read Length (without Barcodes, Adaptors, Primers, and Linkers) 126 bp or 151 bp
Japanese Genotype-phenotype Archive Dataset ID JGAD000504
Total Data Volume 2.0 TB (fastq)
Comments (Policies) NBDC policy

 

RNA-seq (JGAS000389)

Participants/Materials

biliary tract cancer (ICD10: C221, C240, C248, C23): 100 cases (KAPA: 44 cases, TruSeq: 73 cases, both protocols: 17 cases)

          cancer tissues: 100 samples (KAPA: 44 samples, TruSeq: 73 samples)

normal gallbladder: 4 cases (collected from pancreatic cancer surgery cases)

          non-cancer tissues: 4 samples

Targets RNA-seq
Target Loci for Capture Methods -
Platform Illumina [HiSeq 25000]
Library Source RNAs extracted from cancer tissues from biliary tract cancer patients and non-cancer tissues from normal gallbladders
Cell Lines -
Library Construction (kit name)

KAPA RNA HyperPrep Kit

TruSeq Stranded mRNA Sample Prep kit

Fragmentation Methods Heat treatment
Spot Type Paired-end
Read Length (without Barcodes, Adaptors, Primers, and Linkers) 101 bp or 126 bp
Japanese Genotype-phenotype Archive Dataset ID JGAD000506
Total Data Volume 640 GB (fastq)
Comments (Policies) NBDC policy

 

DATA PROVIDER

Principal Investigator: Hidewaki Nakagawa

Affiliation: RIKEN Center for Integrative Medical Sciences

Project / Group Name: -

Funds / Grants (Research Project Number):

Name Title Project Number
KAKENHI Grant-in-Aid for Scientific Research (A) Elucidation of molecular mechanisms of hepatobiliary and pancreatic cancer by multiomics analysis 18H04049
Project for Cancer Research and Therapeutic Evolution (P-CREATE), Japan Agency for Medical Research and Development (AMED) Search for seeds for the development of novel immunotherapies and combined immunotherapies by cancer genome analysis JP20cm0106552

 

PUBLICATIONS

TitleDOIDataset ID
1 Genomic characterization of biliary tract cancers identifies their driver genes and predisposing mutations. doi: 10.1016/j.jhep.2018.01.009

JGAD000117

JGAD000118

2 Molecular Classification and Tumor Microenvironment Characterization of Gallbladder Cancer by Comprehensive Genomic and Transcriptomic Analysis doi: 10.3390/cancers13040733

JGAD000504

JGAD000505

JGAD000506

3 Actionability evaluation of biliary tract cancer by genome transcriptome analysis and Asian cancer knowledgebase doi: 10.18632/oncotarget.28021

JGAD000504

JGAD000505

JGAD000506

 

USERS (Controlled-access Data)

Principal InvestigatorAffiliationCountry/RegionResearch TitleData in Use (Dataset ID)Period of Data Use
Tatsuhiro Shibata Division of Cancer Genomics, National Cancer Center Research Institute JGAD000117, JGAD000118 2018/09/12-2022/09/30
Kengo Kinoshita Tohoku Medical Megabank Organization Construction of Japanese whole genome database JGAD000117 2019/06/24-2022/03/31
Justo Lorenzo Bermejo Statistical Genetics Research Group, Institute of Medical Biometry, Heidelberg University Hospital Germany Genomic characterization of gallbladder tumors from patients worldwide JGAD000118, JGAD000505 2022/10/13-2025/08/17