[1]
NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / IFO 6347 / NRRL 1970;
Birren B.W., Lander E.S., Galagan J.E., Devon K., Nusbaum C., Ma L.-J.,
Jaffe D.B., Butler J., Alvarez P., Gnerre S., Grabherr M., Kleber M.,
Mauceli E.W., Brockman W., Rounsley S., Young S.K., LaButti K.,
Pushparaj V., DeCaprio D., Crawford M., Koehrsen M., Engels R.,
Montgomery P., Pearson M., Howarth C., Kodira C.D., Yandava C., Zeng Q.,
Alvarado L., Oleary S., Untereiner W.;
"Annotation of the Chaetomium globosum CBS 148.51 genome.";
Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases.
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms.
Distributed under the Creative Commons Attribution-NoDerivs License.
Length: 943 AA [This is the length of the unprocessed precursor]
Molecular weight: 104057 Da [This is the MW of the unprocessed precursor]
CRC64: 40AA99FA21695DAC [This is a checksum on the sequence]
10 20 30 40 50 60
MSERTSSSRR SKPASDDTIG NFVIDKEIGK GSFAQVYSGR HKVTGALVAI KSVELARLNT
70 80 90 100 110 120
KLKDNLYGEI EILKRLRHPH IVALHDCVES RTHINLIMEY CELGDLSLFI KKRDKLITNP
130 140 150 160 170 180
GTHELARKYP VAPNSGLNEV VIRHFLKQLT SAIRFLREAN LIHRDVKPQN LLLLPSPQYR
190 200 210 220 230 240
EANKMHKQIL SASHDSFTPA AGLPSLPMLK LADFGFARVL PSTSLAETLC GSPLYMAPEI
250 260 270 280 290 300
LRYEKYDAKA DLWSVGTVLY EMATGRPPFR AVNHVDLLRK IEASGDVIRF SRECVVSSEV
310 320 330 340 350 360
KGLVRALLKR NPVERISFED FFHHPVITGP IPGLVEDDIP KPEKPVLAET KSRIRRANPE
370 380 390 400 410 420
LSHTRRSRAG PHLLATPIEP KPNPLEQVAS PRLSYSPRQE ADEGLGIRRP LAQPSTSAPV
430 440 450 460 470 480
RPISYVDRSR RYSNASTKVP ARDTPPPPHE GSNRSRPKSA VSRPLTDEDK AAQDVAFERD
490 500 510 520 530 540
YIIIDKTAVE VNALADQISL YPQQGQPKSG GQIVRRATQQ GHPTSTTGAV PSHPGRNAQG
550 560 570 580 590 600
GRNDHYRKAS HDKTLSGSPG ATTSVISKAI QDASLRLFGF KYSPQMLSKG QSPPQIYSPF
610 620 630 640 650 660
PAYPTPSTPA GLIMDGKQSA PVDEDSRVAQ CIEDYATRSD VVYGFAEVKY KQLVPLAPSV
670 680 690 700 710 720
EHGLGGVPTD RMGEEEEGLT MDAVVSLSEE ALVLYVKALS LLAKSMDIAS LWWSRKSRPE
730 740 750 760 770 780
SSNNVHSATR DSVNTQALVL KINAAVQWIR SRFNEVLEKA EIVRLRLVEA QNQLPEEHPS
790 800 810 820 830 840
HPSNRPPETS ALGGSSGGQA TFPSVGISAE KLMYDRAVEM SRTAAINEIA SEDLPGCEIS
850 860 870 880 890 900
YVTAVRMLEA VLDSDDDHLP KRRVSTSSKE EQSVAAQDAS DDMSSDDKQA VQKMVQMINT
910 920 930 940
RLTYLRKRMH TIAAASKAQQ QQQQQQVVVR RRSGDVTPRS VPT
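
The sequence above is laid out in groups of ten residues under position rulers. As a minimal sketch (not part of the UniProt record), the following Python shows one way to recover the contiguous sequence from such a block and wrap it as FASTA. The function names `join_sequence_block` and `to_fasta` are hypothetical helpers introduced for illustration, and the header text passed to `to_fasta` is up to the caller. Applied to the full block, the recovered length should match the stated 943 AA.

```python
import re


def join_sequence_block(block: str) -> str:
    """Return the contiguous sequence from a ruler-numbered, grouped block."""
    parts = []
    for line in block.splitlines():
        stripped = line.strip()
        # Skip blank lines and position rulers such as "10 20 30 40 50 60".
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        # Keep letters only; the spaces between groups of ten are layout.
        parts.append(re.sub(r"[^A-Za-z]", "", stripped))
    return "".join(parts).upper()


def to_fasta(header: str, sequence: str, width: int = 60) -> str:
    """Format a sequence as FASTA text with lines of at most `width` residues."""
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + header + "\n" + "\n".join(lines) + "\n"


# The last ruler and sequence line of the block above, used as a small sample.
tail = """910 920 930 940
RLTYLRKRMH TIAAASKAQQ QQQQQQVVVR RRSGDVTPRS VPT"""
tail_sequence = join_sequence_block(tail)
print(len(tail_sequence))  # 43 residues on the final line
```

Comparing `len(join_sequence_block(...))` for the whole block against the "Length" field is a quick sanity check that no residues were lost when copying the sequence.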
Q2H6X2 in FASTA format