|
|
|
|
|
|
[1]
|
NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Copeland A.,
Lucas S.,
Lapidus A.,
Glavina del Rio T.,
Dalin E.,
Tice H.,
Bruce D.,
Goodwin L.,
Pitluck S.,
Kiss H.,
Brettin T.,
Detter J.C.,
Han C.,
Kuske C.R.,
Schmutz J.,
Larimer F.,
Land M.,
Hauser L.,
Kyrpides N.,
Mikhailova N.,
Ingram L.,
Richardson P.;
"Complete sequence of Escherichia coli C str. ATCC 8739.";
Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
|
|
|
|
|
|
|
|
|
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms.
Distributed under the Creative Commons Attribution-NoDerivs License.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Length: 566 AA [This is the length of the unprocessed precursor] |
Molecular weight: 61114 Da [This is the MW of the unprocessed precursor] |
CRC64: 6A0C37B0D17B03FE [This is a checksum on the sequence] |
|
10 20 30 40 50 60
MAIAIGLDFG SDSVRALAVD CASGEEIATS VEWYPRWQKG QFCDAPNNQF RHHPRDYIES
70 80 90 100 110 120
MEAALKTVLA ELSVEQRAAV VGIGVDSTGS TPAPIDADGN VLALRPEFAE NPNAMFVLWK
130 140 150 160 170 180
DHTAVEEAEE ITRLCHAPGN VDYSRYIGGI YSSEWFWAKI LHVTRQDSAV AQSAASWIEL
190 200 210 220 230 240
CDWVPALLSG TTRPQDIRRG RCSAGHKSLW HESWGGLPPA SFFDELDPIL NRHLPSPLFT
250 260 270 280 290 300
DTWTADIPVG TLCPEWAQRL GLPESVVISG GAFDCHMGAV GAGAQPNALV KVIGTSTCDI
310 320 330 340 350 360
LIADKQSVGE RAVKGICGQV DGSVVPGFIG LEAGQSAFGD IYAWFGRVLS WPLEQLAAQH
370 380 390 400 410 420
PELKAQINAS QKQLLPALTE AWAKNPSLDH LPVVLDWFNG RRSPNANQRL KGVITDLNLA
430 440 450 460 470 480
TDAPLLFGGL IAATAFGARA IMECFTDQGI AVNNVMALGG IARKNQVIMQ ACCDVLNRPL
490 500 510 520 530 540
QIVASDQCCA LGAAIFAAVA AKVHADIPSA QQKMASAVEK TLQPRSEQAQ RFEQLYRRYQ
550 560
QWAMSAEQHY LPTSAPAQAA QAVATL
|
B1IRB5 in FASTA format |
|