SAGE data short description

SAGE data: Serial Analysis of Gene Expression

SAGE (Serial Analysis of Gene Expression) libraries and accompanying biosource descriptive information may be submitted to GEO (Gene Expression Omnibus).
SAGE data submitted to GEO are later incorporated into the SAGEmap website.

Gene Expression Omnibus

The Gene Expression Omnibus is a high-throughput gene expression / molecular abundance data repository, as well as a curated, online resource for gene expression data browsing, query and retrieval.
GEO web site: http://www.ncbi.nlm.nih.gov/geo/

SAGEmap

SAGEmap is a data resource for the query, retrieval, and analysis of SAGE data from any organism.
SAGEmap data resource: http://www.ncbi.nlm.nih.gov/projects/SAGE/

Data types

SAGE data types:

SAGE 10: 10 bp tags (minus anchor sequence)
SAGE 14: 14 bp tags (minus anchor sequence)
longSAGE 17: 17 bp tags (minus anchor sequence)

Possible enzyme protocols employed:

NlaIII
Sau3A
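
To illustrate what "minus anchor sequence" means for the tag lengths and enzymes listed above, here is a minimal Python sketch. The names ANCHOR_SITES, TAG_LENGTHS, and extract_tag are hypothetical and not part of any GEO or SAGEmap tooling. The sketch assumes the raw sequence starts with the anchor enzyme recognition site (CATG for NlaIII, GATC for Sau3A) and that the tag is the stretch immediately following it.

```python
# Hypothetical helper for illustration only; not part of GEO or SAGEmap.
# Recognition sites of the two anchor enzymes listed above.
ANCHOR_SITES = {"NlaIII": "CATG", "Sau3A": "GATC"}
# Tag lengths (in bp) of the SAGE data types listed above.
TAG_LENGTHS = (10, 14, 17)


def extract_tag(sequence, enzyme, tag_length):
    """Return the tag that follows the anchor site, without the anchor itself."""
    if tag_length not in TAG_LENGTHS:
        raise ValueError(f"unsupported tag length: {tag_length}")
    site = ANCHOR_SITES[enzyme]
    sequence = sequence.upper()
    if not sequence.startswith(site):
        raise ValueError(f"sequence does not start with the {enzyme} site {site}")
    # Skip the anchor site, then take exactly tag_length bases.
    tag = sequence[len(site):len(site) + tag_length]
    if len(tag) < tag_length:
        raise ValueError("sequence too short for the requested tag length")
    return tag
```

For example, extract_tag("CATGATTTGAGAAGTT", "NlaIII", 10) returns "ATTTGAGAAG", the 10 bp tag with the CATG anchor removed.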

SAGE data table guidelines

A valid SAGE Sample data table is a tab-delimited text file.
SAGE data are represented by a paired list of oligomer "tags" and a measure of abundance.
The first row in the file must be a header line that identifies the content of each column.

Standard SAGE column headers and their content are as follows:

TAG: (Required) oligomer tag sequence - identifies the tag sequence that is being counted. Each tag must be unique within a given data table. Do not include the anchor enzyme sequence, e.g., "CATG" for NlaIII. This header may be used only once in the table.
COUNT: (Required) tag count - specifies the number of times each tag is detected in that Sample. The contents of this column must be a whole number. This header may be used only once in the table.
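
The column rules above can be checked mechanically before submission. The following Python sketch is one possible way to do so; validate_sage_table is a hypothetical name, not a GEO utility, and it checks only the rules stated here (one TAG and one COUNT header, unique tags, whole-number counts).

```python
import csv
import io


def validate_sage_table(text):
    """Return a list of problems found in a tab-delimited SAGE data table."""
    rows = list(csv.reader(io.StringIO(text), delimiter="\t"))
    if not rows:
        return ["table is empty"]

    # The first row must be a header with TAG and COUNT used exactly once.
    header = rows[0]
    problems = [
        f"header {name} must appear exactly once"
        for name in ("TAG", "COUNT")
        if header.count(name) != 1
    ]
    if problems:
        return problems

    tag_col, count_col = header.index("TAG"), header.index("COUNT")
    seen = set()
    for line_no, row in enumerate(rows[1:], start=2):
        if len(row) <= max(tag_col, count_col):
            problems.append(f"line {line_no}: missing column")
            continue
        tag, count = row[tag_col], row[count_col]
        # Each tag may appear only once per table.
        if tag in seen:
            problems.append(f"line {line_no}: duplicate tag {tag}")
        seen.add(tag)
        # Counts must be whole numbers written with plain digits.
        if not (count.isascii() and count.isdigit()):
            problems.append(f"line {line_no}: count {count!r} is not a whole number")
    return problems
```

An empty list means no problems were found. A table that repeats a tag, such as "TAG\tCOUNT\nATTTGAGAAG\t599\nATTTGAGAAG\t12\n", yields a single message reporting the duplicate on line 3.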

A typical SAGE Sample data table may look as follows:

TAG	COUNT
CCATCGTCC	1380
ATTTGAGAAG	599
GCAAACCCT	566
CACCTAATTC	558
CCTCCAGCTA	472
CCTCCAGCTA	458
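
A table in this layout could be generated from a list of observed tags along the following lines. This is only a sketch: build_sage_table is a hypothetical name, and sorting by descending count is an assumption modeled on the example, not a stated requirement.

```python
import csv
import io
from collections import Counter


def build_sage_table(observed_tags):
    """Count tag occurrences and return a tab-delimited TAG/COUNT table."""
    # Counter keys are unique, so each tag ends up on exactly one row.
    counts = Counter(observed_tags)
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(["TAG", "COUNT"])
    # most_common() lists tags from highest to lowest count.
    for tag, count in counts.most_common():
        writer.writerow([tag, count])
    return out.getvalue()
```

Because the counts are keyed by tag, the output satisfies the requirement that each tag appear only once, and every count is a whole number by construction.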