SelenoDB
Release 1.0

Chart

Advanced

Faq

What is the FASTA sequence format?

A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. It is recommended that all lines of text be shorter than 80 characters in length. Example of nucleotide and protein sequences in FASTA format are:

>SPT00000014_1.0 # Selenoprotein H (SelH) # Homo sapiens
ATGGCTCCCCGCGGGAGGAAGCGTAAGGCTGAGGCCGCGGTGGTCGCCGTAGCCGAGAAG
CGAGAGAAGCTGGCGAACGGCGGGGAGGGAATGGAGGAGGCGACCGTTGTTATCGAGCAT
TGCACTAGCTGACGCGTCTATGGGCGCAACGCCGCGGCCCTGAGCCAGGCGCTGCGCCTG
GAGGCCCCAGAGCTTCCAGTAAAGGTGAACCCGACGAAGCCCCGGAGGGGCAGCTTCGAG
GTGACGCTGCTGCGCCCGGACGGCAGCAGTGCGGAGCTCTGGACTGGGATTAAGAAGGGG
CCCCCACGCAAACTCAAATTCCCTGAGCCTCAAGAGGTGGTGGAAGAGTTGAAGAAGTAC
CTGTCGTAG

SPP00000014_1.0 # Selenoprotein H (SelH) # Homo sapiens
MAPRGRKRKAEAAVVAVAEKREKLANGGEGMEEATVVIEHCTSURVYGRNAAALSQALRL
EAPELPVKVNPTKPRRGSFEVTLLRPDGSSAELWTGIKKGPPRKLKFPEPQEVVEELKKY
LS%
Note the correspondece between the nucleotide and protein codes. In particular, between the in-frame TGA and U and the termination codon TAG and %. Learn more on the encoding of termination codons in SelenoDB here.

Chart

Advanced