SAGE data : Serial Analysis
of Gene Expression
SAGE (Serial Analysis of Gene Expression) libraries and accompanying
biosource descriptive information may be submitted to GEO.
SAGE data
submitted to GEO (Gene Expression Omnibus) are later incorporated into
the
SAGEmap website.
Gene Expression Omnibus
The Gene Expression Omnibus
is a high-throughput gene expression / molecular abundance data
repository, as well as a curated, online resource for gene expression
data browsing, query and retrieval.
GEO web site : http://www.ncbi.nlm.nih.gov/geo/.
SAGE map
SAGEmap is a SAGE data
resource for the query and retrieval and analysis of SAGE data from any
organism.
SAGE map data resource : http://www.ncbi.nlm.nih.gov/projects/SAGE/
Types de données
SAGE data type:
- SAGE: 10 bp tags (minus anchor sequence)
- SAGE 14: 14 bp tags (minus anchor sequence)
- longSAGE 17: 17 bp tags (minus anchor sequence)
Possible enzyme protocols employed :
SAGE data table guidelines
- A valid SAGE Sample data table is a tab-delimited text file.
- SAGE data are represented by a paired list of oligomer "tags" and
a measure of abundance.
- The first row in the file must be a header line that identifies
the content of each column.
Standard SAGE column headers and their content are as follows:
- TAG: (Required) oligomer tag sequence - identifies the tag
sequence that is being counted. Each tag must be unique in any given
data table. Do not include the anchor enzyme sequence, e.g., "GATC" for
NlaIII. This header may be used only once in the table.
- COUNT: (Required) tag count - specifies the number of times each
tag is detected in that Sample. The contents of this column must be a
whole number. This header may be used only once in the table.
A typical SAGE Sample data table may look as follows:
TAG |
COUNT |
CCATCGTCC |
1380 |
ATTTGAGAAG |
599 |
GCAAACCCT |
566 |
CACCTAATTC |
558 |
CCTCCAGCTA |
472 |
CCTCCAGCTA |
458 |