|
|
|
|
|
|
[1]
|
NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
STRAIN=O6:H1 / CFT073 / ATCC 700928 / UPEC;
DOI=10.1073/pnas.252529799; PubMed=12471157 [NCBI, ExPASy, EBI, Israel, Japan]
Welch R.A.,
Burland V.,
Plunkett G. III,
Redford P.,
Roesch P.,
Rasko D.,
Buckles E.L.,
Liou S.-R.,
Boutin A.,
Hackett J.,
Stroud D.,
Mayhew G.F.,
Rose D.J.,
Zhou S.,
Schwartz D.C.,
Perna N.T.,
Mobley H.L.T.,
Donnenberg M.S.,
Blattner F.R.;
"Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli.";
Proc. Natl. Acad. Sci. U.S.A. 99:17020-17024(2002).
|
|
|
|
|
|
|
|
|
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms.
Distributed under the Creative Commons Attribution-NoDerivs License.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Length: 962 AA [This is the length of the unprocessed precursor] |
Molecular weight: 107892 Da [This is the MW of the unprocessed precursor] |
CRC64: DEDD2CA2A9AADF8D [This is a checksum on the sequence] |
|
10 20 30 40 50 60
MPRSIWFKAL LLFVALWAPL SQAETGWQPI QETIRKSDKD NRQYQAIRLD NGMVVLLVSD
70 80 90 100 110 120
PQAVKSLSAL VVPVGSLEDP EAYQGLAHYL EHMSLMGSKK YPQADSLAEY LKMHGGSHNA
130 140 150 160 170 180
STAPYRTAFY LEVENDALPG AVDRLADAIA EPLLDKKYAE RERNAVNAEL TMARTRDGMR
190 200 210 220 230 240
MAQVSAETIN PAHPGSKFSG GNLETLSDKP GNPVQQALKD FHEKYYSANL MKAVIYSNKP
250 260 270 280 290 300
LPELAKMAAD TFGRVPNKES KKPEITVPVV TDAQKGIIIH YVPALPRKVL RVEFRIDNNS
310 320 330 340 350 360
AKFRSKTDEL ITYLIGNRSP GTLSDWLQKQ GLVEGISANS DPIVNGNSGV LAISASLTDK
370 380 390 400 410 420
GLANRDQVVA AIFSYLNLLR EKGIDKQYFD ELANVLDIDF RYPSITRDMD YVEWLADTMI
430 440 450 460 470 480
RVPVEHTLDA VNIADRYDAK AVKERLAMMT PQNARIWYIS PKEPHNKTAY FVDAPYQVDK
490 500 510 520 530 540
ISEQTFADWQ KKAANIALSL PELNPYIPDD FSLIKSEKKY DHPELIVDES NLRVVYAPSR
550 560 570 580 590 600
YFASEPKADV SLILRNPKAM DSARNQVMFA LNDYLAGLAL DQLSNQASVG GISFSTNANN
610 620 630 640 650 660
GLMVNANGYT QRLPQLFQAL LEGYFSYTAT EDQLEQAKSW YNQMMDSAEK GKAFEQAIMP
670 680 690 700 710 720
AQMLSQVPYF SRDERRKILP SITLKEVLAY RDALKSGARP EFMVIGNMTE AQATTLARHV
730 740 750 760 770 780
QKQLGADGSE WCRNKDVVVD KKQSVIFEKA GNSTDSALAA IFVPTGYDEY TSSAYSSLLG
790 800 810 820 830 840
QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSFLWE RYKAFFPTAE
850 860 870 880 890 900
AKLRTMKPEE FAQIQQAVIT QMLQAPQTLG EEASKLSKDF DRGNMRFDSR DKIVAQIKLL
910 920 930 940 950 960
TPQKLADFFH QAVVEPQGMA ILSQISGSQN GKAEYVHPEG WKVWENVSAL QQTMPLMSEK
NE
|
Q8CVS2 in FASTA format |
|