|
|
|
|
|
|
[1]
|
NUCLEOTIDE SEQUENCE OF 1-776.
DOI=10.1016/0042-6822(91)90567-U; PubMed=1720591 [NCBI, ExPASy, EBI, Israel, Japan]
Mandl C.W.,
Iacono-Connors L.,
Wallner G.,
Holzmann H.,
Kunz C.,
Heinz F.X.;
"Sequence of the genes encoding the structural proteins of the low-virulence tick-borne flaviviruses Langat TP21 and Yelantsev.";
Virology 185:891-895(1991).
|
[2]
|
NUCLEOTIDE SEQUENCE OF 777-3414.
DOI=10.1016/0042-6822(92)90545-Z; PubMed=1316684 [NCBI, ExPASy, EBI, Israel, Japan]
Iacono-Connors L.C.,
Schmaljohn C.S.;
"Cloning and sequence analysis of the genes encoding the nonstructural proteins of Langat virus and comparative analysis with other flaviviruses.";
Virology 188:875-880(1992).
|
|
|
|
|
|
|
|
|
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms.
Distributed under the Creative Commons Attribution-NoDerivs License.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Length: 3414 AA [This is the length of the unprocessed precursor] |
Molecular weight: 378023 Da [This is the MW of the unprocessed precursor] |
CRC64: 59CB7E95DD70D82E [This is a checksum on the sequence] |
|
10 20 30 40 50 60
MAGKAVLKGK GGGPPRRASK VAPKKTRQLR VQMPNGLVLM RMLGVLWHAL TGTARSPVLK
70 80 90 100 110 120
AFWKVVPLKQ ATLALRKIKR TVSTLMVGLH RRGSRRTTID WMTPLLITVM LGMCLTATVR
130 140 150 160 170 180
RERDGSMVIR AEGRDAATQV RVENGTCVIL ATDMGSWCDD SLAYECVTID QGEEPVDVDC
190 200 210 220 230 240
FCRGVEKVTL EYGRCGRREG SRSRRSVLIP SHAQRDLTGR GHQWLEGEAV KAHLTRVEGW
250 260 270 280 290 300
VWKNKLFTLS LVMVAWLMVD GLLPRILIVV VALALVPAYA SRCTHLENRD FVTGVQGTTR
310 320 330 340 350 360
LTLVLELGGC VTVTADGKPS LDVWLDSIYQ ESPAQTREYC LHAKLTGTKV AARCPTMGPA
370 380 390 400 410 420
TLPEEHQSGT VCKRDQSDRG WGNHCGLFGK GSIVTCVKFT CEDKKKATGH VYDVNKITYT
430 440 450 460 470 480
IKVEPHTGEF VAANETHSGR KSASFTVSSE KTILTLGDYG DVSLLCRVAS GVDLAQTVVL
490 500 510 520 530 540
ALDKTHEHLP TAWQVHRDWF NDLALPWKHD GAEAWNEAGR LVEFGTPHAV KMDVFNLGDQ
550 560 570 580 590 600
TGVLLKSLAG VPVASIEGTK YHLKSGHVTC EVGLEKLKMK GLTYTVCDKT KFTWKRAPTD
610 620 630 640 650 660
SGHDTVVMEV GFSGTRPCRI PVRAVAHGVP EVNVAMLITP NPTMENNGGG FIEMQLPPGD
670 680 690 700 710 720
NIIYVGDLNH QWFQKGSSIG RVLQKTRKGI ERLTVLGEHA WDFGSVGGVM TSIGRAMHTV
730 740 750 760 770 780
LGGAFNTLLG GVGFLPKILL GVAMAWLGLN MRNPTLSMGF LLSGGLVLAM TLGVGADVGC
790 800 810 820 830 840
AVDTERMELR CGEGLVVWRE VSEWYDNYVF HPETPAVLAS AVQRAYEEEI CGIVPQNRLE
850 860 870 880 890 900
MAMWRSSLVE LNLALAEGEA NLTVVVDKAD PSDYRGGVPG LLNKGKDIKV SWRSWGRSML
910 920 930 940 950 960
WSVPEAPRRF MIGVEGGREC PFARRKTGVM TVAEFGIGLR TKVFMDLRQE LTTECDTGVM
970 980 990 1000 1010 1020
GAAVKNGMAV HTDQSLWMKS IKNDTTVTIV ELIVTDLRNC TWPASHTIDN AGVVNSKLFL
1030 1040 1050 1060 1070 1080
PASLAGPRST YNVIPGYAEQ VRGPWAHTPV RIKREECPGT RVTIDKACDK RGASVRSTTE
1090 1100 1110 1120 1130 1140
SGKVIPEWCC RTCELPPVTY RTGTDCWYAM EIRPVHTQGG LVRSMVVADN GALLSEGGVP
1150 1160 1170 1180 1190 1200
GVVALFVVLE LVIRRRPATG GTVIWGGIAI LALLVTGLVS VESLFRYLVA VGLVFQLELG
1210 1220 1230 1240 1250 1260
PEAVAMVLLQ AVFEMRTCLL SGFVLRRSIT TREIVTVYFL LLVLEMGIPV KGLEHLWRWT
1270 1280 1290 1300 1310 1320
DALAMGAIIF RACTAEGKTG IGLLLAAFMT QSDMNIIHDG LTAFLCVATT MAIWRYIRGQ
1330 1340 1350 1360 1370 1380
GERKGLTWIV PLAGILGGEG SGVRLLAFWE LAASRGRRSF NEPMTVIGVM LTLASGMMRH
1390 1400 1410 1420 1430 1440
TSQEAVCAMA LAAFLLLMLT LGTRKMQLLA EWSGNIEWNP ELTSEGGEVS LRVRQDALGN
1450 1460 1470 1480 1490 1500
LHLTELEKEE RMMAFWLVVG LIASAFHWSG ILIVMGLWTI SEMLGSPRRT DLVFSGCSEG
1510 1520 1530 1540 1550 1560
RSDSRPLDVK NGVYRIYTPG LLWGQRQIGV GYGAKGVLHT MWHVTRGAAL LVDGVAVGPY
1570 1580 1590 1600 1610 1620
WADVREDVVC YGGAWSLESR WRGETVQVHA FPPGRAHETH QCQPGELILE NGRKMGAIPI
1630 1640 1650 1660 1670 1680
DLAKGTSGSP IMNSQGEVVG LYGNGLKTND TYVSSIAQGE VEKSRPNLPQ SVVGTGWTAK
1690 1700 1710 1720 1730 1740
GQITVLDMHP GSGKTHRVLP ELIRQCVERR LRTLVLAPTR VVLREMERAL SGKNVRFHSP
1750 1760 1770 1780 1790 1800
AVTEQHANGA IVDVMCHATY VNRRLLPQGR QNWEVAIMDE AHWTDPHSIA ARGHLYSLAK
1810 1820 1830 1840 1850 1860
ENRCAFVLMT ATPPGKSEPF PESNGAIASE ERQIPDGEWR DGFDWITEYE GRTAWFVPSI
1870 1880 1890 1900 1910 1920
ARGGAIARAL RQRGKSVICL NSKTFDKEYS RVKDEKPDFV VTTDISEMGA NLDVTRVIDG
1930 1940 1950 1960 1970 1980
RTNIKPEEVD GRIELTGTRR VTTASAAQRR GRVGRQGGRT DEYIYSGQCD DDDSGLVQWK
1990 2000 2010 2020 2030 2040
EAQILLDNIT TARGPVATFY GPEQERMTET AGHYRLPEEK RKHFRHLLAQ CDFTPWLAWH
2050 2060 2070 2080 2090 2100
VAANVASVTD RSWTWEGPEE NAVDENNGEL VTFRSPNGAE RTLRPVWRDA RMFREGRDIR
2110 2120 2130 2140 2150 2160
EFVSYASGRR SVGDVLMGMS GVPALLRQRC TSAMDVFYTL MHEEPGSRAM RIGERDAPEA
2170 2180 2190 2200 2210 2220
FLTAVEMLVL GLATLGVVWC FVVRTSVSRM VLGTLVLATS LIFLWAGGVG YGNMAGVALV
2230 2240 2250 2260 2270 2280
FYTLLTVLQP ETGKQRSSDD NKLAYFLLTL CGLAGMVAAN EMGLLEKTKA DLAALFARDQ
2290 2300 2310 2320 2330 2340
GETVRWGEWT NLDIQPARSW GTYVLVVSLF TPYMLHQLQT RIQQLVNSAV ASGAQAMRDL
2350 2360 2370 2380 2390 2400
GGGTPFFGVA GHVLALGVAS LVGATPTSLI LGVGLAAFHL AIVVSGLEAE LTQRAHKVFF
2410 2420 2430 2440 2450 2460
SAMVRNPMVD GDVINPFGDG EAKPALYERK LSLILALVLC LASVVMNRTF VAVTEAGAVG
2470 2480 2490 2500 2510 2520
VAAAMQLLRP EMDVLWTMPV ACGMSGVVRG SLWGLLPLGH RLWLRTTGTR RGGSEGDTLG
2530 2540 2550 2560 2570 2580
DMWKARLNSC TKEEFFAYRR AGVMETDREK ARELLKRGET NMGLAVSRGT SKLAWMEERG
2590 2600 2610 2620 2630 2640
YVTLKGEVVD LGCGRGGWSY YAASRPAVMS VRAYTIGGKG HESPRMVTSL GWNLIKFRAG
2650 2660 2670 2680 2690 2700
MDVFSMEPHR ADAILCDIGE SNPDAVVEGE RSRRVILLME QWKNRNPTAT CVFKVLAPYR
2710 2720 2730 2740 2750 2760
PEVIEALHRF QLQWGGGLVR TPFSRNSTHE MYFSTAITGN IVNSVNIQSR KLLARFGDQR
2770 2780 2790 2800 2810 2820
GPTRVPEIDL GVGTRSVVLA EDKVKEKDVM ERIQALKDQY CDTWHEDHEH PYRTWQYWGS
2830 2840 2850 2860 2870 2880
YKTAATGSSA SLLNGVVKLL SWPWNAREDV VRMAMTDTTA FGQQRVFKDK VDTKAQEPQP
2890 2900 2910 2920 2930 2940
GTKIIMRAVN DWLLERLVKK SRPRMCSREE FIAKVRSNAA LAAWSDEQNK WKSAREAVED
2950 2960 2970 2980 2990 3000
PEFWSLVEAE RERHLQGRCA HCVYNMMGKR EKKLGEFGVA KGSRAIWYMW LGSRFLEFEA
3010 3020 3030 3040 3050 3060
LGFLNEDHWA SRASSGAGVE GISLNYLGWH LKKLASLSGG LFYADDTAGW DTRITNADLD
3070 3080 3090 3100 3110 3120
DEEQILRYMD GDHKKLAATV LRKAYHAKVV RVARPSREGG CVMDIITRRD QRGSGQVVTY
3130 3140 3150 3160 3170 3180
ALNTITNIKV QLVRMMEGEG VIEVADSHNP RLLRVEKCVE EHGEERLSRM LVSGDDCVVR
3190 3200 3210 3220 3230 3240
PVDDRFSKAL YFLNDMAKTR KDTGEWEPST GFASWEEVPF CSHHFHELVM KDGRALVVPC
3250 3260 3270 3280 3290 3300
RDQDELVGRA RVSPGCGWSV RETACLSKAY GQMWLLSYFH RRDLRTLGFA ICSAVPVDWV
3310 3320 3330 3340 3350 3360
PTGRTTWSIH ASGAWMTTED MLEVWNRVWI YDNPFMEDKT RVDEWRDTPY LPKSQDILCS
3370 3380 3390 3400 3410
SLVGRGERAE WAKNIWGAVE KVRRMIGPEH YRDYLSSMDR HDLHWELKLE SSIF
|
P29837 in FASTA format |
|