KVar metadata

Study

Key Terms and Definitions

SNP Single Nucleotide Polymorphism. In Kvar, this category refers to variations in the genome that include both single nucleotide variants (SNVs) and short insertions or deletions (indels) of 50 bp or less.

SV Structural Variation. Large-scale genomic alterations, typically 50 base pairs or larger, including insertions, deletions, inversions, and translocations

Metadata

Field

Description

Study variant type

Select one of the following study variant types to submit. - SNP - SV

Study title

A brief title of the study. Example: Analysis of Single Nucleotide Polymorphisms in the Korean Population

Study description

A description of the study. Example: We analyzed whole genome sequencing data generated on the Illumina platform from more than 100 normal Korean individuals to identify single nucleotide polymorphisms (SNPs) in the human genome.

Study type

Select one of the following study types to submit. - Case-Control - Case-Set - Collection - Control Set - Somatic - Tumor vs. Matched-Normal Note: Check only if ‘sv’ is selected in Study variant type field

SampleSet

Metadata

Field

Description

SampleSet ID

An appropriate ID for this sampleset. Example: Normal_1, Case_1

SampleSet size

Number of samples in sampleset. Example: 4

SampleSet type

Select one of the following sampleset types. - Case - Control Note: Check only if ‘sv’ is selected in Study variant type field

SampleSet description

Description of sampleset. Example: Korean normal population

SampleSet population

Population of Sampleset. For the subjects represented in the sampleset, indicate the: ethnicity (if human), population (if non-human primate), strain (if mouse), breed (if cattle or dog), cultivar (if plant), etc.

Example: Korean

Sample

Metadata

※ The BioSample Accession ID and Sample Name fields form a grouped requirement, where only one of the two is mandatory.

If a new BioSample is being registered as part of this submission, enter a value in the Sample Name field.
If an existing BioSample is being used, enter the appropriate BioSample Accession ID.

Field

Description

SampleSet ID

The SampleSet ID (from Part 2. SampleSet) to which the sample belongs.

BioSample accession ID

The accession ID of an already registered BioSample corresponding to this sample. Note: If a new BioSample is being registered, leave this field blank and fill out the "BioSample attribute" field instead.

Sample name

A unique name given by the researcher to distinguish each sample within the study. Note: This field must be completed when registering a new BioSample. If an existing BioSample accession ID is provided, this field can be left blank

NCBI taxonomy ID

NCBI’s taxonomy identifier of the organism for this sample. The NCBI taxonomy ID can be found at https://www.ncbi.nlm.nih.gov/taxonomy/. Enter 32644 (which is a taxonomy ID for unidentified organisms) for the following or similar cases: (1) when NCBI taxonomy ID is not available because NCBI taxonomy does not yet cover the organism, (2) when metagenome or environmental sample was used, whose organismal composition is unknown in advance Example: 9606 (for Homo sapiens), 452680 (for Pseudomonas sp. UK4)

Organism

The most descriptive organism name for this sample (to the species, if possible). In the case of a new species, provide the desired organism name. In the case of unidentified species, choose the appropriate Genus and include ‘sp.’, e.g. “Escherichia sp.”. When sequencing a genome from a non-metagenomic source, include a strain or isolate name too, e.g. “Pseudomonas sp. UK4” Example: Homo sapiens, Pseudomonas sp. UK4

Sample description

Description of sample

Sample population

Population of Sample

Example: Korean

Experiment

Key Terms and Definitions

Discovery Initial detection of variants using sequencing methods such as WGS or WES

Genotyping Determination of variant genotypes using sequencing-based methods such as targeted sequencing

Validation Independent confirmation of variants using alternative methods.

Metadata

Field

Description

Experiment ID

A unique ID to identify each of your experiments within the study. Example: snp1

Experiment type

Select one of the following experiment types. - Discovery - Genotyping - Validation

Method type

Select one of the following mehod types. - Sequencing - Oligo aCGH - SNP array - BAC aCGH - Curated - Digital array - FISH - Gene expression array - Karyotyping - MAPH - MassSpec - Merging - Multiple complete digestion - MLPA - Optical mapping - PCR - qPCR - ROMA Note: Only 'Sequencing' is available if 'Discovery' or 'Genotyping' is selected for the Experiment type field

Analysis type

Select one of the following analysis types. See the Analysis type from for details. Note: The possible values for the Analysis type field change depending on the selection in the Method type field. However, 'Manual Observation' and 'Other' are always available, regardless of the Method type selected

Reference value

The Name or accession ID of the reference sequence used for the analysis. Example: GRCh38

Method platform

Select one of the following mehod platform. See the Method platform from for details.

Method description

Description of the technique or technology used to generate the data. Example1: Whole-genome sequencing of the sample was performed using the Illumina NovaSeq 6000 platform with paired-end 150 bp reads, followed by library preparation using the TruSeq DNA PCR-Free kit.

Example2: Targeted sequencing of asthma-associated genes was conducted using a custom capture panel and the Illumina MiSeq platform with paired-end 250 bp reads.

Detection method

The specific method or assay (software package, algorithms, etc)used to detect a biological entity or feature. Example1: Single nucleotide polymorphisms (SNPs) were detected using the Genome Analysis Toolkit v.4.3.0 (GATK) HaplotypeCaller following best practices.

Example2: Structural variants (SVs) were identified using Manta Structural Variant Caller v.1.6.0 with default parameters.

Analysis type

Value (Method type)

Value (Anlaysis type)

Sequencing

Paired-end Mapping

Sequencing

Read Depth

Sequencing

One End Anchored Assembly

Sequencing

Split Read Mapping

Sequencing

Sequence Alignment

Sequencing

Serial Analysis of Gene Expression

SNP array, Oligo aCGH, BAC aCGH, FISH, qPCR

Probe Signal Intensity

SNP array

SNP Genotyping Analysis

Multiple Complete Digestion

MCD Analysis

Merging

Optical Mapping

Curated

Manual Observation

Other

Method plaotform

Value

AB5500GeneticAnalyzer

AB 5500xl Genetic Analyzer

AB 5500x-Wl Genetic Analyzer System

AB SOLiD 3 Plus System

AB SOLiD 4 System

AB SOLiD 4hq System

AB SOLiD PI System

AB SOLiD System

AB SOLiD System 2.0

AB SOLiD System 3.0

BGISEQ-50

BGISEQ-500

MGISEQ-2000RS

AB 310 Genetic Analyzer

AB 3130 Genetic Analyzer

AB 3130xL Genetic Analyzer

AB 3500 Genetic Analyzer

AB 3500xL Genetic Analyzer

AB 3730 Genetic Analyzer

AB 3730xL Genetic Analyzer

Complete Genomics

DNBSEQ-G400

DNBSEQ-G400 FAST

DNBSEQ-G50

DNBSEQ-T7

Element AVITI

GS111

FASTASeq 300

GenoCare 1600

GenoLab M

Helicos HeliScope

HiSeq X Five

HiSeq X Ten

Illumina Genome Analyzer

Illumina Genome Analyzer II

Illumina Genome Analyzer IIx

Illumina HiScanSQ

Illumina HiSeq 1000

Illumina HiSeq 1500

Illumina HiSeq 2000

Illumina HiSeq 2500

Illumina HiSeq 3000

Illumina HiSeq 4000

Illumina HiSeq X

Illumina MiSeq

Illumina MiniSeq

Illumina NovaSeq 6000

Illumina NovaSeq X

Illumina NovaSeq X Plus

Illumina iSeq 100

NextSeq 1000

NextSeq 2000

NextSeq 500

NextSeq 550

Ion GeneStudio S5

Ion GeneStudio S5 Plus

Ion GeneStudio S5 Prime

Ion Torrent Genexus

Ion Torrent PGM

Ion Torrent Proton

Ion Torrent S5

Ion Torrent S5 XL

454 GS

454 GS 20

454 GS FLX

454 GS FLX Titanium

454 GS FLX+

454 GS Junior

GridION

MinION

PromethION

PacBio RS

PacBio RS II

Revio

Sequel

Sequel II

Sequel IIe

Onso

Tapestri

UG 100

Sentosa SQ301

DataSet

Metadata

Field

Description

Variant type

Select one of the following variant types to submit. - SNP - Variant call - Variant region

SampleSet ID

The SampleSet ID (from Part 2. SampleSet) associated with the dataset

Experiment ID

The Experiment ID (from Part 4. Experiment) associated with the dataset

Moltype

Select one of the following molecular type - Genomic - cDNA - Mito - Chloro

DataSet description

Brief description of dataset Example: Structural variations of Normal_1

Variant filename

Exact file name of the variant data. VCF is required for SNP data; VCF or Excel file (xlsx) can be used for SV data. Example: Kdata.vcf, Kdata.vcf.gz, Kdata.xlsx

Previous2.2 KVar Submission Next2.3 Variation File Format Guide

Last updated 2 months ago

hashtagStudy

hashtagKey Terms and Definitions

hashtagMetadata

hashtagSampleSet

hashtagMetadata

hashtagSample

hashtagMetadata

hashtagExperiment

hashtagKey Terms and Definitions

hashtagMetadata

hashtagAnalysis type

hashtagMethod plaotform

hashtagDataSet

hashtagMetadata

Study

Key Terms and Definitions

Metadata

SampleSet

Metadata

Sample

Metadata

Experiment

Key Terms and Definitions

Metadata

Analysis type

Method plaotform

DataSet

Metadata