7. Best Practices & Examples

This section provides real-world examples, common patterns, and practical recommendations to help you create high-quality BioProject submissions. Following these guidelines will make your submissions more discoverable, easier to reuse, and faster to approve.


1. Write a clear and comprehensive description

A well-written project description is one of the most important elements of a BioProject. It should explain what the research is about, why it matters, and what types of data are generated.

✅ Good example:

"This project investigates the genomic diversity and adaptive mechanisms of soil microbiomes under extreme environmental conditions. Multi-omics approaches including metagenomics, metatranscriptomics, and metabolomics are used to profile microbial community structure and function. The resulting datasets will support comparative analysis and enable the development of predictive ecosystem models."

❌ Poor example:

"This is a sequencing project to study bacteria in soil."


2. Use consistent metadata and naming conventions

Ensure that project titles, sample IDs, dataset names, and accession numbers are consistent across BioProject, BioSample, KRA, KNA, and other submissions. Inconsistent metadata is one of the main causes of delayed approval.

✅ Good Example:

  • BioProject title: Plant Root Microbiome Dynamics under Drought Stress

  • BioSample IDs: Root001, Root002, Root003

  • Raw data files: Root001_R1.fastq, Root002_R1.fastq

❌ Poor Example:

  • BioProject title: Root Stress Study

  • BioSample IDs: Sample1, SampleA, S3

  • Raw data files: test.fastq, sample.fastq, final.fastq


3. Provide relevant publications and references

Including DOIs, PubMed IDs, or preprints increases the visibility of your BioProject and helps other researchers understand its scientific context.

💡 Tip: If a manuscript is in preparation, include a brief note in the project description:

"A manuscript describing this dataset is currently under review and will be linked here upon publication."


Always ensure that related data — such as BioSample metadata, raw reads (KRA), assembled sequences (KNA), expression data (KArray), chemical profiling, or bioimaging results — are correctly linked to the BioProject. A well-linked project enhances discoverability and makes downstream analysis much easier.


Example BioProject Scenarios

1. Multi-Omics Plant Research

  • Title: Comparative Multi-Omics Analysis of Rice Response to Heat Stress

  • Data Types: Genome sequencing (KRA, KNA), transcriptome profiling (KEA), metabolomics (KMAP)

  • Goal: Identify genetic and metabolic pathways involved in thermotolerance.

  • Outcome: Data support the development of stress-resistant crop varieties.


2. Metagenomics and Functional Analysis

  • Title: Soil Microbiome Metagenomic Survey in Industrial Contaminated Sites

  • Data Types: Metagenomic reads (KRA), MAG assemblies (KNA), metatranscriptomics (KEA)

  • Goal: Understand microbial community adaptation to heavy metal stress.

  • Outcome: Provides reference data for bioremediation research.


3. Drug Discovery and Bioassay Profiling

  • Title: High-Throughput Screening of Chemical Compounds for Cancer Therapeutics

  • Data Types: Chemical structure data (KCODE), bioassay results (KCODE), transcriptomic response data (KEA)

  • Goal: Identify and validate potential small-molecule inhibitors of oncogenic pathways.

  • Outcome: Supports lead compound selection for preclinical development.


Frequently Asked Questions (FAQ)

Q1. Can I update my BioProject after approval? Yes. You can add new datasets and link additional publications or patents at any time. However, core project metadata (e.g., title, description, submitter information) cannot be directly modified by users after approval - such chanes requir curator assistance.

Q2. Do I need to register a new BioProject for additional experiments? If the experiments are part of the same research scope and objectives, add them to the existing BioProject. If they represent a new, independent study, register a new BioProject.

Q3. How many BioSamples can be linked to one BioProject? There is no strict limit. A single BioProject can include hundreds or even thousands of BioSamples.

Q4. Can I submit data before the BioProject is approved? Yes. BioSample, raw data (KRA), and other datasets can be registered after the BioProject is created but before final approval. They will be linked once the BioProject is approved.


Final Advice

  • Think of a BioProject as the “hub” for your research. All data, metadata, and results should point back to it.

  • Use descriptive titles and rich metadata — they are key to making your data discoverable and citable.

  • Plan your submission timeline in advance, especially if you intend to coordinate data release with manuscript publication.

  • Treat metadata as a first-class research output: accurate and complete metadata dramatically increases the scientific value of your data.

Remember: A well-structured, well-documented BioProject is more than a submission — it becomes a long-term scientific resource that supports discovery, collaboration, and innovation.

Last updated