2. How to submit

KRA submission : BioProject - BioSample - KRA

  • BioProject : Summary of the conducted research project, which is mandatory to complete

  • BioSample : Select sample type of the biological specimen used in the research and insert the corresponding biological information

  • KRA : The form for uploading the raw data and metadata generated through NGS instruments, including the option to specify the data release date

Please refer to the Standard of KRA metadata for a detailed explanation

Schematic overview of KRA

Accepted data

The KRA accepts genetic data and the associated quality scores produced by next generation sequencing technologies. Please refer to the File Format Guide for more specific information.

Please refer to

  • KArray - submit functional genomic studies

  • KNA - submit genome/assembly studies

  • KHBDB - submit human data that require controlled access

Human data

Metagenomic Data

Human metagenomic studies may contain human sequences and require that the donor provide consent to archive their data in an unprotected database. If you would like to archive human metagenomic sequences in the public KRA database, please contact the KRA.

When submitting human metagenomic data, please ensure that all human sequence contaminants have been removed.

Prerequisites

  • KRA accepts reads from high throughput sequencing platforms in specific formats (see the File Format Guide for details). KRA does NOT accept assembled/consensus data or contigs (see KNA)

  • For submissions of 2,000 samples or less. If you have more than 2,000 samples, please create multiple submissions with the same BioProject reference

  • Raw data files can be compressed using gzip or bzip2. Do not use zip! but Packaging or compressed files (e.g., BAM, HDF5, FAST5) do not require compression

  • If there are two or more definitions for the Analysis type, or if you are submitting multiple Analysis data files, such as single-cell platform Cellranger output files (e.g., barcode, feature, matrix, visium image, etc.), please compress them into .tar file

  • Data can be uploaded to GBox up to a maximum of 10TB (however, only up to 1TB is possible on the web-based GBox). Therefore, a submission with more than 10TB of data should be split across multiple submissions, keeping each set of uploads under 10TB, and waiting for completion of each submission before uploading the next set of files

  • Submissions can be linked to the same BioProject to ensure all data are searchable with a single accession

  • Once you apply certificate for R&D outcomes, you can NOT add submission under that BioProject. Create a new BioProject and submit data

  • KRA does not accept submission of duplicate files and submitting duplicate data will lead to significant delays in processing of the submission.

  • To update an existing record, do not resubmit the data files, instead contact the KRA staff (kra@kribb.re.kr) for assistance

Contact KRA staff

Before contacting KRA staff for assistance please see our Troubleshooting Guide. Please provide your submission's temporary ID in the form of DSUB#in your messages.

Email kra@kribb.re.kr for help.

Last updated