Integrated attributes
1
Age
Age at the time of sampling; relevant scale depends on species and study, e.g. could be seconds for amoebae or centuries for trees. For samples such as cell lines in which age is not applicable, enter not applicable. For samples where an exact age is unknown, such as when only the range of age values is known, enter missing:not collected in this field, then use the “other characteristics“ field to enter the range of age values Example: 28 years, 2.5 months, 13 weeks, 3 days, 10 embryo age
2
Altitude
The altitude of the sample is the vertical distance between Earth's surface above sea level and the sampled position in the air Example: 1,000 m
3
Biomaterial Provider
Name and address of the lab or PI, or a culture collection identifierr. If a value cannot be provided, submitters are asked to use one of the accepted missing value reporting terms Example: Korean Collection for Type Cultures(KCTC), Korean Cell Line Bank(KCBL), Dr. Gildong Hong from Seoul National University
4
Birth date
Birth date of the subject
Format: YYYY-MM-DD, YYYY-MM, YYYY
Example: 1990-10-30, 1990-10, 1990
5
Birth location
Birth location of the subject
Example: Laboratory Animal Resource & Research Center - KRIBB (LARRC), Johns Hopkins Hospital Gynecology & Obstetrics
6
Breed
Breed name - chiefly used in domesticated animals or plants
Example: BALB/c, C57BL/6, White (w1118)
7
Breeding history
Breeding history
8
Breeding method
Breeding method
9
Broad-scale environmental context
Add terms that identify the major environment type(s) where your sample was collected. Recommend subclasses of biome [ENVO:00000428]. Multiple terms can be separated by one or more pipes (|)
Format: http://purl.obolibrary.org/obo/ENVO_00000428 or ENVO_00000428 in EBI Ontologies
Example: mangrove biome [ENVO:01000181]|estuarine biome [ENVO:01000020]
10
Cell line
Name of the cell line
11
Cell subtype
Subtype of a cell
12
Cell type
Type of cell of the sample or from which the sample was obtained
13
Collected by
Name of persons or institute who collected the sample
14
Collection date
The date (or date and time) on which the sample was collected. If a value of an expected format cannot be provided, submitters are asked to use one of the accepted missing value reporting terms.
Format: Use ISO 8601 standard on date and time. For date, use the formats “YYYY”, “YYYY-MM”, “YYYY-MM-DD”. For date and time, use the format “YYYY-mm-ddThh:mm:ssZ”. In this format, time is in Coordinated Universal Time (UTC), otherwise known as Zulu time, the letter “T” is added between date and time, and the letter “Z” (indicating Zulu) is added after time. For a range of two date and time values, use the forward-slash character “/” as the delimiter
Example: 1990-10-30, 1990-10, 1990, 1952-10-21/1953-02-15, 2015-10-11T17:53:03Z
15
Cultivar
Cultivar name - cultivated variety of plant
16
Culture collection
Name of source institute and unique culture identifier
Format: See https://www.insdc.org/submitting-standards/controlled-vocabulary-culturecollection-qualifier/ for the description for the proper format and list of allowed institutes
17
Death date
Death date of the subject
Format: YYYY-MM-DD, YYYY-MM, YYYY
Example: 1990-10-30, 1990-10, 1990
18
Depth
Depth is defined as the vertical distance below surface, e.g. for sediment or soil samples depth is measured from sediment or soil surface, respectively. Depth can be reported as an interval for subsurface samples
Example: 10 m
19
Derived from
Indicates when one BioSample was derived from another BioSample. Value should include BioSample accession number(s)
Example: SAMN00000001, KAS24095074
20
Description
Description of the sample
21
Development stage
Developmental stage at the time of sampling
Example: adult, embryo, fetal, hatching stage, infant, juvenile, larva, newborn, pluripotency, pupa, R3 growth stage, seedling, vegetative stage
22
Disease
Name of diseases diagnosed
Format: Can include multiple diagnoses. Controlled vocabulary from (1) Human Disease Ontology or (2) MeSH
23
Disease stage
Stage of disease at the time of sampling
24
Ecotype
A population within a given species displaying genetically based, phenotypic traits that reflect adaptation to a local habitat
Example: Korea, Columbia
25
Ethnicity
Ethnicity of the subject
26
Genotype
Observed genotype
27
Geographic location
Geographical origin of the sample. If a value of an expected format cannot be provided, submitters are asked to use one of the accepted missing value reporting terms.
Format: Use the appropriate name from the list shown in https://www.insdc.org/submitting-standards/geo_loc_name-qualifier-vocabulary/. Use a colon (:) to separate the country or ocean from more detailed information about the location
Example: “South Korea: Seoul”, “Canada: Vancouver”, “Germany: halfway down Zugspitze, Alps”
28
Growth protocol
Protocol for growth Example: 30 min egg collections of OreR and yw flies at 25 degree Celsius were aged at room temperature (RT) according to the different temporal classes T0-T4.
29
Health state
Health or disease status of sample at time of collection
30
Height or length
Measurement of height or length
31
Host
The natural (as opposed to laboratory) host to the organism from which the sample was obtained. Use the full taxonomic name
Example: Homo sapiens
32
Host age
Age of host at the time of sampling
33
Host description
Additional information not included in other defined vocabulary fields
34
Host disease
Name of relevant disease
Format: Controlled vocabulary from (1) Human Disease Ontology or (2) MeSH (Medical Subject Heading)
Example: Salmonella gastroenteritis
35
Host disease outcome
Final outcome of disease
Example: death, chronic disease, recovery
36
Host disease stage
Stage of disease at the time of sampling
37
Host health state
Information regarding health state of the individual sampled at the time of sampling
38
Host sex
Gender or physical sex of the host Controlled Term: "male", "female", "pooled male and female", "neuter", "hermaphrodite", "intersex", "not determined", "missing", "not applicable", "not collected"
39
Host subject id
A unique identifier by which each subject can be referred to, de-identified
Example: #131
40
Host tissue sampled
Type of tissue the initial sample was taken from
Format: Controlled vocabulary from BRENDA Tissue and Enzyme Source Ontology
41
Identified by
Name of the taxonomist who identified the specimen
42
Isolate
Identification or description of the specific individual from which this sample was obtained
43
Isolation source
Describes the physical, environmental and/or local geographical source of the biological sample from which the sample was derived
44
Karyotype
Karyotype
45
Lab host
Scientific name and description of the laboratory host used to propagate the source organism or material from which the sample was obtained
Example: Escherichia coli DH5a, or Homo sapiens HeLa cells
46
Latitude and longitude
The geographical coordinates of the location where the sample was collected. When the information is lacking or specification of location is not appropriate, submitters are asked to use one of the accepted missing value reporting terms.
Format: Specify as degrees latitude and longitude in format “d[d.dddd] N|S d[dd.dddd] W|E”
Example: 38.98 N 77.11 W
47
Mating type
Mating type of microbe
48
NCBI taxonomy ID
NCBI’s taxonomy identifier of the organism for this sample. The NCBI taxonomy ID can be found at NCBI taxonomy. Enter 32644 (which is a taxonomy ID for unidentified organisms) for the following or similar cases: (1) when NCBI taxonomy ID is not available because NCBI taxonomy does not yet cover the organism, (2) when metagenome or environmental sample was used, whose organismal composition is unknown in advance Example: 9606 for Homo sapiens and 452680 for Pseudomonas sp. UK4
49
Organism
The most descriptive organism name for this sample (to the species, if possible). In the case of a new species, provide the desired organism name. In the case of unidentified species, choose the appropriate Genus and include ‘sp.’, e.g. “Escherichia sp.”. When sequencing a genome from a non-metagenomic source, include a strain or isolate name too, e.g. “Pseudomonas sp. UK4” Example: Homo sapiens, Pseudomonas sp. UK4
50
Passage history
Number of passages and passage method
51
Pathotype
Some bacterial specific pathotypes
Example: Escherichia coli - STEC, UPEC
52
Phenotype
Phenotype of sampled organism
Format: Controlled vocabulary from Phenotypic Quality Ontology (PATO)
53
Population
Applicable for human and plants. Population is a summation of all the organisms of the same group or species, which live in a particular geographical area, and it can also indicate filial generation, number of progeny, or genetic structure
54
Race
Race of the subject
55
Reference for biomaterial
Primary publication or genome report
Format: PubMed ID, or DOI (Digital Object Identifier), or URL
56
Relationship to oxygen
Aerobic or anaerobic
Controlled Term: "aerobe", "anaerobe", "facultative", "microaerophilic", "microanaerobe", "obligate aerobe", "obligate anaerobe"
57
Sample collection device or method
Method or device employed for collecting sample
58
Sample material processing
Processing applied to the sample during or after isolation
59
Sample name
A name that you choose for the sample. It can have any format, but we suggest that you make it concise, unique and consistent within your lab, and as informative as possible. Every sample name from a single submitter must be unique within a single BioProject
60
Sample size
Amount or size of sample (volume, mass or area) that was collected
61
Sample type
Sample type, such as cell culture, mixed culture, tissue sample, whole organism, single cell, metagenomic assembly
62
Serotype
Taxonomy below subspecies; a variety (in bacteria, fungi or virus) usually based on its antigenic properties
Example: serotype=“H1N1” in Influenza A virus CY098518.
63
Serovar
Taxonomy below subspecies; a variety (in bacteria, fungi or virus) usually based on its antigenic properties. Sometimes used as species identifier in bacteria with shaky taxonomy
Example: Leptospira, serovar saopaolo S76607 (65357 in Entrez).
64
Sex
Physical sex of sampled organism Controlled Term: "male", "female", "pooled male and female", "neuter", "hermaphrodite", "intersex", "not determined", "missing", "not applicable", "not collected"
65
Source material identifiers
Unique identifier assigned to a material sample used for extracting nucleic acids, and subsequent sequencing. The identifier can refer either to the original material collected or to any derived sub-samples
66
Specimen voucher
Identifier for the physical specimen. When the information is not available or not applicable, submitters are asked to use one of the accepted missing value reporting terms.
Format: Use the format “[<institution-code>:[<collection-code>:]]<specimen_id>”, in which collection-code is optional, along with a controlled vocabulary provided at http://www.insdc.org/submitting-standards/controlled-vocabulary-specimenvoucher-qualifier/
Example: KCLB:30082, KCTC:6910, KACC:44679
Note: The institution-code above will be found at https://ftp.ncbi.nlm.nih.gov/pub/taxonomy/coll_dump.txt, an institution-code list file provided by NCBI
67
Storage conditions
Explain how and for how long the soil sample was stored before DNA extraction
68
Strain
Microbial or eukaryotic strain name
69
Stud book number
A number of the captive animal individual in stud book
70
Subgroup
Taxonomy below subspecies; sometimes used in viruses to denote subgroups taken from a single isolate
71
Subtype
Used as classifier in viruses
Example: HIV type 1, Group M, Subtype A
72
Temperature
Temperature of the sample at time of sampling
Format: The temperature in Celsius is expressed in degrees C, in Fahrenheit in degrees F, and in absolute temperature in K. Please input the temperature value with the corresponding unit (C, F, or K)
Example: 37 degrees C, 98.6 degrees F, 310.15 K
73
Tissue
Type of tissue the sample was taken from
Example: liver, plasma, urine, leaf, root
74
Treatment
Describes how the sample was treated
Example1: Embryos were dechorionated with 50% bleach, put on a cover slip and covered with Halocarbon oil 27 (Sigma). Embryos of the appropriate stage were manually selected under the dissecting scope. Selected embryos were transferred to a basket, rinsed with PBS with 0.7% NaCl, 0.04% triton-X100 and placed on ice in the Trizol solution (GibcoBRL)
Example2: The rats were treated with 150 mg phthalate per bodyweight kilogram for each day for 13 weeks.
75
Other characteristics
Other characteristics needed to describe sample characteristics, which can be useful to enter the information on experimental factors. Do not enter this field if there are no other characteristics that are needed to describe the sample
Format: This field consists of a pair of two sub-fields: otherCharacteristics_key and otherCharacteristics_value, which correspond to the name of the factor and its value, respectively. Fill out the two sub-fields in a pair. For multiple characteristics, use “;” as a delimiter to enter in the Excel template. Example: BRCA 1 mutation - Yes, BRCA1 mutation status;Chemotherapy dosage - Yes;High, BRCA1 mutation status;Chemotherapy dosage;Weight in kg - Yes;High;65
Last updated