# 2. How to submit

## KRA submission : BioProject - BioSample - KRA

* **BioProject** : Overview of the research project (mandatory for all submissions)
* **BioSample** : Biological sample information including organism, tissue type, and experimental attributes
* **KRA** : Raw sequencing data and metadata from NGS platforms, with options to specify data release date

> <mark style="color:red;">With the latest update, the KRA submission process allows you to register BioProject, BioSample, and KRA information in a single integrated workflow.</mark>
>
> Please refer to the [<mark style="color:blue;">**Standard of KRA metadata**</mark>](https://app.gitbook.com/s/Ut3xQ5vaW7nKPK4KBV0E/) for a detailed explanation

<figure><img src="https://2173271362-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F7yxS8L6oyg8lt6DC3l24%2Fuploads%2FOZ1U1cOzHUjK3obYDpkk%2Fimage.png?alt=media&#x26;token=ed226434-352b-4f61-95a0-4332baf128e3" alt=""><figcaption><p>Schematic overview of KRA</p></figcaption></figure>

## Accepted data

The KRA accepts **raw sequencing reads and associated quality scores** produced by next-generation sequencing technologies. Please refer to the [<mark style="color:blue;">File Format Guide</mark>](https://kobic.gitbook.io/kra-docs/kra-user-guide/2.-how-to-submit/2.2-file-format-guide) for more specific information.

For other data types, please refer to:&#x20;

* [**KEA**](https://kbds.re.kr/KArray) - submit functional genomic data
* [**KNA**](https://kbds.re.kr/KNA) - submit genome or assembled sequences
* [**KVar**](https://app.gitbook.com/o/NXmN4HGB76Ro5VGnPx9X/s/oKaWS6kXjZyazWw10K8s/) - submit genomic variants
* [**KHBDB**](https://kbds.re.kr/khbdb/service/main) - submit human data requiring controlled access

## Human data

{% hint style="danger" %}

### **If the data to be registered satisfies ALL of the following criteria, it must be submitted through the Korea Human Biological Data Bank (**[**KHBDB**](https://kbds.re.kr/khbdb/service/main)**):**

* **Human research\*** or **human-derived materials research\*\***
* Contains personal information
* Written consent has been obtained from research subjects (donors) for the third-party provision of research data

**\* Human research** : Research conducted on humans, or using identifiable information that involvs physical intervention, communication, interpersonal contact, or interaction with individuals, based on Article 2-1 of the Bioethics and Safety Act ([Korean](https://www.law.go.kr/LSW/lsInfoP.do?lsId=009628\&ancYnChk=0#0000), [English](https://www.law.go.kr/LSW/lsInfoP.do?lsiSeq=199534\&viewCls=engLsInfoR\&urlMode=engLsInfoR#0000)), as designated by regulations of the Ministry of Health and Welfare.

**\*\* Human-derived materials research** : Research conducted by directly examining or analyzing biological materials collected or sampled from the human body, including tissues, cells, blood, body fluids , and molecular components (e.g., serum, plasma, chromosomes, DNA, RNA, proteins)

<mark style="background-color:$primary;">**However, human organoid samples and cell lines without privacy concerns are eligible for submission to KRA**</mark>
{% endhint %}

## Human Metagenomic Data

Human metagenomic studies may contain human sequences. All human sequences must be removed from metagenomic data prior to submission to KRA

If you have questions about human sequence removal or require assistance, please contact KRA staff at <kra@kribb.re.kr>

## Prerequisites

#### Accepted Data Types

* KRA accepts reads from high-throughput sequencing platforms in specific formats (see the [File Format Guide](https://kobic.gitbook.io/kra-docs/kra-user-guide/2.-how-to-submit/2.2-file-format-guide) for details). KRA does NOT accept assembled/consensus sequences or contigs (see [KNA](https://kbds.re.kr/KNA))

#### Submission Size Limits

* For submissions of 2,000 samples or fewer. For larger datasets, create multiple submissions linked to the same BioProject
* Data upload limits via GBox:

  * Web-based: 1TB maximum
  * CLI/APP: 10TB maximum per submission

  For datasets exceeding 10TB, split across multiple submissions (under 10TB each), link to the same BioProject, and complete each submission before starting the next
* Submissions can be linked to the same BioProject to ensure all data are searchable with a single accession

#### File Preparation

* Raw data files can be compressed using gzip or bzip2 (do not use zip). Pre-packaged or binary formats (e.g., BAM, HDF5, FAST5) do not require additional compression
* If submitting multiple analysis files for a single sample, please bundle them into a single .tar archive

#### Important Policies

* KRA does not accept duplicate files. Submitting duplicate data will lead to significant delays in processing
* To update an existing record, do not resubmit data files. Instead, contact KRA staff (<kra@kribb.re.kr>) for assistance

## Contact KRA staff

Before contacting KRA staff for assistance, please see our [Troubleshooting Guide](https://kobic.gitbook.io/kra-docs/kra-user-guide/2.-how-to-submit/2.5-trouble-shooting).\
Please provide your submission's temporary ID (**`DSUB########)`** in your messages.&#x20;

> Email <mark style="color:blue;"><kra@kribb.re.kr></mark> for help.&#x20;
