Integrated attributes

No
Attribute
Description

1

Age

Age at the time of sampling; relevant scale depends on species and study, e.g. could be seconds for amoebae or centuries for trees. For samples such as cell lines in which age is not applicable, enter not applicable. For samples where an exact age is unknown, such as when only the range of age values is known, enter missing:not collected in this field, then use the “other characteristics“ field to enter the range of age values Example: 28 years, 2.5 months, 13 weeks, 3 days, 10 embryo age

2

Altitude

The altitude of the sample is the vertical distance between Earth's surface above sea level and the sampled position in the air Example: 1,000 m

3

Biomaterial Provider

Name and address of the lab or PI, or a culture collection identifierr. If a value cannot be provided, submitters are asked to use one of the accepted missing value reporting terms Example: Korean Collection for Type Cultures(KCTC), Korean Cell Line Bank(KCBL), Dr. Gildong Hong from Seoul National University

4

Birth date

Birth date of the subject

Format: YYYY-MM-DD, YYYY-MM, YYYY

Example: 1990-10-30, 1990-10, 1990

5

Birth location

Birth location of the subject

Example: Laboratory Animal Resource & Research Center - KRIBB (LARRC), Johns Hopkins Hospital Gynecology & Obstetrics

6

Breed

Breed name - chiefly used in domesticated animals or plants

Example: BALB/c, C57BL/6, White (w1118)

7

Breeding history

Breeding history

8

Breeding method

Breeding method

9

Broad-scale environmental context

Add terms that identify the major environment type(s) where your sample was collected. Recommend subclasses of biome [ENVO:00000428]. Multiple terms can be separated by one or more pipes (|)

Format: http://purl.obolibrary.org/obo/ENVO_00000428arrow-up-right or ENVO_00000428 in EBI Ontologiesarrow-up-right

Example: mangrove biome [ENVO:01000181]|estuarine biome [ENVO:01000020]

10

Cell line

Name of the cell line

11

Cell subtype

Subtype of a cell

12

Cell type

Type of cell of the sample or from which the sample was obtained

13

Collected by

Name of persons or institute who collected the sample

14

Collection date

The date (or date and time) on which the sample was collected. If a value of an expected format cannot be provided, submitters are asked to use one of the accepted missing value reporting terms.

Format: Use ISO 8601 standard on date and time. For date, use the formats “YYYY”, “YYYY-MM”, “YYYY-MM-DD”. For date and time, use the format “YYYY-mm-ddThh:mm:ssZ”. In this format, time is in Coordinated Universal Time (UTC), otherwise known as Zulu time, the letter “T” is added between date and time, and the letter “Z” (indicating Zulu) is added after time. For a range of two date and time values, use the forward-slash character “/” as the delimiter

Example: 1990-10-30, 1990-10, 1990, 1952-10-21/1953-02-15, 2015-10-11T17:53:03Z

15

Cultivar

Cultivar name - cultivated variety of plant

16

Culture collection

Name of source institute and unique culture identifier

Format: See https://www.insdc.org/submitting-standards/controlled-vocabulary-culturecollection-qualifier/arrow-up-right for the description for the proper format and list of allowed institutes

17

Death date

Death date of the subject

Format: YYYY-MM-DD, YYYY-MM, YYYY

Example: 1990-10-30, 1990-10, 1990

18

Depth

Depth is defined as the vertical distance below surface, e.g. for sediment or soil samples depth is measured from sediment or soil surface, respectively. Depth can be reported as an interval for subsurface samples

Example: 10 m

19

Derived from

Indicates when one BioSample was derived from another BioSample. Value should include BioSample accession number(s)

Example: SAMN00000001, KAS24095074

20

Description

Description of the sample

21

Development stage

Developmental stage at the time of sampling

Example: adult, embryo, fetal, hatching stage, infant, juvenile, larva, newborn, pluripotency, pupa, R3 growth stage, seedling, vegetative stage

22

Disease

Name of diseases diagnosed

Format: Can include multiple diagnoses. Controlled vocabulary from (1) Human Disease Ontologyarrow-up-right or (2) MeSHarrow-up-right

23

Disease stage

Stage of disease at the time of sampling

24

Ecotype

A population within a given species displaying genetically based, phenotypic traits that reflect adaptation to a local habitat

Example: Korea, Columbia

25

Ethnicity

Ethnicity of the subject

26

Genotype

Observed genotype

27

Geographic location

Geographical origin of the sample. If a value of an expected format cannot be provided, submitters are asked to use one of the accepted missing value reporting terms.

Format: Use the appropriate name from the list shown in https://www.insdc.org/submitting-standards/geo_loc_name-qualifier-vocabulary/arrow-up-right. Use a colon (:) to separate the country or ocean from more detailed information about the location

Example: “South Korea: Seoul”, “Canada: Vancouver”, “Germany: halfway down Zugspitze, Alps”

28

Growth protocol

Protocol for growth Example: 30 min egg collections of OreR and yw flies at 25 degree Celsius were aged at room temperature (RT) according to the different temporal classes T0-T4.

29

Health state

Health or disease status of sample at time of collection

30

Height or length

Measurement of height or length

31

Host

The natural (as opposed to laboratory) host to the organism from which the sample was obtained. Use the full taxonomic name

Example: Homo sapiens

32

Host age

Age of host at the time of sampling

33

Host description

Additional information not included in other defined vocabulary fields

34

Host disease

Name of relevant disease

Format: Controlled vocabulary from (1) Human Disease Ontologyarrow-up-right or (2) MeSHarrow-up-right (Medical Subject Heading)

Example: Salmonella gastroenteritis

35

Host disease outcome

Final outcome of disease

Example: death, chronic disease, recovery

36

Host disease stage

Stage of disease at the time of sampling

37

Host health state

Information regarding health state of the individual sampled at the time of sampling

38

Host sex

Gender or physical sex of the host Controlled Term: "male", "female", "pooled male and female", "neuter", "hermaphrodite", "intersex", "not determined", "missing", "not applicable", "not collected"

39

Host subject id

A unique identifier by which each subject can be referred to, de-identified

Example: #131

40

Host tissue sampled

Type of tissue the initial sample was taken from

Format: Controlled vocabulary from BRENDA Tissue and Enzyme Source Ontologyarrow-up-right

41

Identified by

Name of the taxonomist who identified the specimen

42

Isolate

Identification or description of the specific individual from which this sample was obtained

43

Isolation source

Describes the physical, environmental and/or local geographical source of the biological sample from which the sample was derived

44

Karyotype

Karyotype

45

Lab host

Scientific name and description of the laboratory host used to propagate the source organism or material from which the sample was obtained

Example: Escherichia coli DH5a, or Homo sapiens HeLa cells

46

Latitude and longitude

The geographical coordinates of the location where the sample was collected. When the information is lacking or specification of location is not appropriate, submitters are asked to use one of the accepted missing value reporting terms.

Format: Specify as degrees latitude and longitude in format “d[d.dddd] N|S d[dd.dddd] W|E”

Example: 38.98 N 77.11 W

47

Mating type

Mating type of microbe

48

NCBI taxonomy ID

NCBI’s taxonomy identifier of the organism for this sample. The NCBI taxonomy ID can be found at NCBI taxonomyarrow-up-right. Enter 32644 (which is a taxonomy ID for unidentified organisms) for the following or similar cases: (1) when NCBI taxonomy ID is not available because NCBI taxonomy does not yet cover the organism, (2) when metagenome or environmental sample was used, whose organismal composition is unknown in advance Example: 9606 for Homo sapiens and 452680 for Pseudomonas sp. UK4

49

Organism

The most descriptive organism name for this sample (to the species, if possible). In the case of a new species, provide the desired organism name. In the case of unidentified species, choose the appropriate Genus and include ‘sp.’, e.g. “Escherichia sp.”. When sequencing a genome from a non-metagenomic source, include a strain or isolate name too, e.g. “Pseudomonas sp. UK4” Example: Homo sapiens, Pseudomonas sp. UK4

50

Passage history

Number of passages and passage method

51

Pathotype

Some bacterial specific pathotypes

Example: Escherichia coli - STEC, UPEC

52

Phenotype

Phenotype of sampled organism

Format: Controlled vocabulary from Phenotypic Quality Ontology (PATO)arrow-up-right

53

Population

Applicable for human and plants. Population is a summation of all the organisms of the same group or species, which live in a particular geographical area, and it can also indicate filial generation, number of progeny, or genetic structure

54

Race

Race of the subject

55

Reference for biomaterial

Primary publication or genome report

Format: PubMed ID, or DOI (Digital Object Identifier), or URL

56

Relationship to oxygen

Aerobic or anaerobic

Controlled Term: "aerobe", "anaerobe", "facultative", "microaerophilic", "microanaerobe", "obligate aerobe", "obligate anaerobe"

57

Sample collection device or method

Method or device employed for collecting sample

58

Sample material processing

Processing applied to the sample during or after isolation

59

Sample name

A name that you choose for the sample. It can have any format, but we suggest that you make it concise, unique and consistent within your lab, and as informative as possible. Every sample name from a single submitter must be unique within a single BioProject

60

Sample size

Amount or size of sample (volume, mass or area) that was collected

61

Sample type

Sample type, such as cell culture, mixed culture, tissue sample, whole organism, single cell, metagenomic assembly

62

Serotype

Taxonomy below subspecies; a variety (in bacteria, fungi or virus) usually based on its antigenic properties

Example: serotype=“H1N1” in Influenza A virus CY098518.

63

Serovar

Taxonomy below subspecies; a variety (in bacteria, fungi or virus) usually based on its antigenic properties. Sometimes used as species identifier in bacteria with shaky taxonomy

Example: Leptospira, serovar saopaolo S76607 (65357 in Entrez).

64

Sex

Physical sex of sampled organism Controlled Term: "male", "female", "pooled male and female", "neuter", "hermaphrodite", "intersex", "not determined", "missing", "not applicable", "not collected"

65

Source material identifiers

Unique identifier assigned to a material sample used for extracting nucleic acids, and subsequent sequencing. The identifier can refer either to the original material collected or to any derived sub-samples

66

Specimen voucher

Identifier for the physical specimen. When the information is not available or not applicable, submitters are asked to use one of the accepted missing value reporting terms.

Format: Use the format “[<institution-code>:[<collection-code>:]]<specimen_id>”, in which collection-code is optional, along with a controlled vocabulary provided at http://www.insdc.org/submitting-standards/controlled-vocabulary-specimenvoucher-qualifier/arrow-up-right

Example: KCLB:30082, KCTC:6910, KACC:44679

Note: The institution-code above will be found at https://ftp.ncbi.nlm.nih.gov/pub/taxonomy/coll_dump.txtarrow-up-right, an institution-code list file provided by NCBI

67

Storage conditions

Explain how and for how long the soil sample was stored before DNA extraction

68

Strain

Microbial or eukaryotic strain name

69

Stud book number

A number of the captive animal individual in stud book

70

Subgroup

Taxonomy below subspecies; sometimes used in viruses to denote subgroups taken from a single isolate

71

Subtype

Used as classifier in viruses

Example: HIV type 1, Group M, Subtype A

72

Temperature

Temperature of the sample at time of sampling

Format: The temperature in Celsius is expressed in degrees C, in Fahrenheit in degrees F, and in absolute temperature in K. Please input the temperature value with the corresponding unit (C, F, or K)

Example: 37 degrees C, 98.6 degrees F, 310.15 K

73

Tissue

Type of tissue the sample was taken from

Example: liver, plasma, urine, leaf, root

74

Treatment

Describes how the sample was treated

Example1: Embryos were dechorionated with 50% bleach, put on a cover slip and covered with Halocarbon oil 27 (Sigma). Embryos of the appropriate stage were manually selected under the dissecting scope. Selected embryos were transferred to a basket, rinsed with PBS with 0.7% NaCl, 0.04% triton-X100 and placed on ice in the Trizol solution (GibcoBRL)

Example2: The rats were treated with 150 mg phthalate per bodyweight kilogram for each day for 13 weeks.

75

Other characteristics

Other characteristics needed to describe sample characteristics, which can be useful to enter the information on experimental factors. Do not enter this field if there are no other characteristics that are needed to describe the sample

Format: This field consists of a pair of two sub-fields: otherCharacteristics_key and otherCharacteristics_value, which correspond to the name of the factor and its value, respectively. Fill out the two sub-fields in a pair. For multiple characteristics, use “;” as a delimiter to enter in the Excel template. Example: BRCA 1 mutation - Yes, BRCA1 mutation status;Chemotherapy dosage - Yes;High, BRCA1 mutation status;Chemotherapy dosage;Weight in kg - Yes;High;65

Last updated