Data availability
Raw sequencing data have been deposited in the Genome Sequence Archive in National Genomics Data Center, China National Center for Bioinformation / Beijing Institute of Genomics, Chinese Academy of Sciences, under accession number HRA001385 that are publicly accessible at (https://ngdc.cncb.ac.cn/gsa-human).
The VCF is annotated with rsIDs from dbSNP151, and the following INFO fields:
AC:
Allele count in called genotypes in WBBC
AF:
Allele frequency in called genotypes in WBBC
AN:
Total number of alleles in called genotypes in WBBC
NS:
Total number of samples in called genotypes in WBBC
North_AF:
Allele frequency in North Han Chinese
North_AN:
Total number of alleles in North Han Chinese
Central_AF:
Allele frequency in Central Han Chinese
Central_AN:
Total number of alleles in Central Han Chinese
South_AF:
Allele frequency in South Han Chinese
South_AN:
Total number of alleles in South Han Chinese
Lingnan_AF:
Allele frequency in Lingnan Han Chinese
Lingnan_AN:
Total number of alleles in Lingnan Han Chinese
VQSLOD:
Variant Recalibration Score from GATK