ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!
Search for

UniProtKB/Swiss-Prot entry P06935


[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] [Features] [Sequence] [Tools]

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents.
Entry information
Entry name POLG_WNV
Primary accession number P06935
Secondary accession numbers None
Integrated into Swiss-Prot on January 1, 1988
Sequence was last modified on October 24, 2003 (Sequence version 2)
Annotations were last modified on    September 2, 2008 (Entry version 94)
Name and origin of the protein
Protein name Genome polyprotein
Synonyms None
Contains Protein C
     (Core protein)
     (Capsid protein)
Small envelope protein M
     (Matrix protein)
Envelope protein E
Non-structural protein 1
     (NS1)
Non-structural protein 2A
     (NS2A)
Flavivirin protease NS2B regulatory subunit
Flavivirin protease NS3 catalytic subunit
     (EC 3.4.21.91)
Non-structural protein 4A
     (NS4A)
Non-structural protein 4B
     (NS4B)
RNA-directed RNA polymerase
     (EC 2.7.7.48)
     (NS5)
Gene name None
From
West Nile virus (WNV) [TaxID: 11082] 
Taxonomy Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; Flavivirus; Japanese encephalitis virus group.
Virus hosts Aedes [TaxID: 7158]
Amblyomma variegatum (Tropical bont tick) [TaxID: 34610]
Aves [TaxID: 8782]
Culex [TaxID: 53527]
Homo sapiens (Human) [TaxID: 9606]
Hyalomma marginatum [TaxID: 34627]
Mansonia uniformis [TaxID: 308735]
Mimomyia [TaxID: 308737]
Rhipicephalus [TaxID: 34630]
Protein existence 1: Evidence at protein level;
References
[1]
NUCLEOTIDE SEQUENCE [GENOMIC RNA].
DOI=10.1016/0042-6822(86)90082-6; PubMed=3753811 [NCBI, ExPASy, EBI, Israel, Japan]
Castle E., Leidner U., Nowak T., Wengler G., Wengler G.;
"Primary structure of the West Nile flavivirus genome region coding for all nonstructural proteins.";
Virology 149:10-26(1986).
[2]
SEQUENCE REVISION TO 1908; 2018-2036; 2242 AND 2859-2860.
DOI=10.1006/viro.2000.0795; PubMed=11277701 [NCBI, ExPASy, EBI, Israel, Japan]
Yamshchikov V.F., Wengler G., Perelygin A.A., Brinton M.A., Compans R.W.;
"An infectious clone of the West Nile flavivirus.";
Virology 281:294-304(2001).
[3]
NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 1-291.
DOI=10.1016/0042-6822(85)90156-4; PubMed=2992152 [NCBI, ExPASy, EBI, Israel, Japan]
Castle E., Nowak T., Leidner U., Wengler G., Wengler G.;
"Sequence analysis of the viral core protein and the membrane-associated proteins V1 and NV2 of the flavivirus West Nile virus and of the genome sequence for these proteins.";
Virology 145:227-236(1985).
[4]
NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 255-854.
DOI=10.1016/0042-6822(85)90129-1; PubMed=3855247 [NCBI, ExPASy, EBI, Israel, Japan]
Wengler G., Castle E., Leidner U., Nowak T., Wengler G.;
"Sequence analysis of the membrane protein V3 of the flavivirus West Nile virus and of its gene.";
Virology 147:264-274(1985).
[5]
DISULFIDE BONDS IN E PROTEIN.
DOI=10.1016/0042-6822(87)90443-0; PubMed=3811228 [NCBI, ExPASy, EBI, Israel, Japan]
Nowak T., Wengler G.;
"Analysis of disulfides present in the membrane proteins of the West Nile flavivirus.";
Virology 156:127-137(1987).
Comments
Copyright
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms. Distributed under the Creative Commons Attribution-NoDerivs License.
Cross-references
Sequence databases
EMBL
M12294; AAA48498.2; -; Genomic_RNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
PIR A25256; GNWVWV.
RefSeq NP_041724.2; -.
3D structure databases
PDB
2FP7; X-ray; 1.68 A; A=1420-1466, B=1517-1688.[ExPASy / RCSB / EBI]
2G05; Model; -; D=1675-2120.[ExPASy / RCSB / EBI]
2G2G; Model; -; D=1675-2120.[ExPASy / RCSB / EBI]
2GGV; X-ray; 1.80 A; A=1419-1463, B=1503-1679.[ExPASy / RCSB / EBI]
2IJO; X-ray; 2.30 A; A=1423-1469, B=1502-1686.[ExPASy / RCSB / EBI]
2P5P; X-ray; 2.80 A; A/B/C=585-701.[ExPASy / RCSB / EBI]
Detailed list of linked structures.
PDBsum 2FP7; -.
2G05; -.
2G2G; -.
2GGV; -.
2IJO; -.
2P5P; -.
SMR P06935; 25-97, 291-686, 1511-1679, 2531-2792.
ModBase P06935.
Protein-protein interaction databases
IntAct P06935; -.
Ontologies
GO
GO:0005515; Molecular function: protein binding (inferred from physical interaction from IntAct).
QuickGo view.
Family and domain databases
InterPro IPR014001; DEAD-like_N.
IPR011492; DEAD_Flavivir.
IPR001650; DNA/RNA_helicase_C.
IPR002464; DNA/RNA_helicase_DEAH_CS.
IPR013756; Flav_glyE_cen_2.
IPR011999; Flav_glyE_cen_dm.
IPR013754; Flav_glyE_dim.
IPR001122; Flavi_capsidC.
IPR000069; Flavi_M.
IPR001157; Flavi_NS1.
IPR000752; Flavi_NS2A.
IPR000487; Flavi_NS2B.
IPR000404; Flavi_NS4A.
IPR001528; Flavi_NS4B.
IPR002535; Flavi_propep.
IPR000336; Flv_glyE_Ig-like.
IPR014412; Gen_Poly_FLV.
IPR014021; Helicase_SF1/SF2_ATP-bd.
IPR001850; Peptidase_S7.
IPR000208; RNA_pol_flaviviral.
IPR007094; RNA_pol_PSvir.
IPR002877; RrmJFtsJ_MeTrfase.
Graphical view of domain structure.
Gene3D G3DSA:3.30.67.10; Flav_glyE_cen_2; 1.
G3DSA:2.60.98.10; Flav_glyE_dim; 1.
G3DSA:2.60.40.350; Flv_glyE_Ig-like; 1.
Pfam PF01003; Flavi_capsid; 1.
PF07652; Flavi_DEAD; 1.
PF02832; Flavi_glycop_C; 1.
PF00869; Flavi_glycoprot; 1.
PF01004; Flavi_M; 1.
PF00948; Flavi_NS1; 1.
PF01005; Flavi_NS2A; 1.
PF01002; Flavi_NS2B; 1.
PF01350; Flavi_NS4A; 1.
PF01349; Flavi_NS4B; 1.
PF00972; Flavi_NS5; 1.
PF01570; Flavi_propep; 1.
PF01728; FtsJ; 1.
PF00271; Helicase_C; 1.
PF00949; Peptidase_S7; 1.
Pfam graphical view of domain structure.
PIRSF PIRSF003817; Gen_Poly_FLV; 1.
ProDom PD001496; Flavi_NS1; 1.
[Domain structure / List of seq. sharing at least 1 domain]
SMART SM00487; DEXDc; 1.
SM00490; HELICc; 1.
SMART graphical view of domain structure.
PROSITE PS00690; DEAH_ATP_HELICASE; FALSE_NEG.
PS51192; HELICASE_ATP_BIND_1; 1.
PS51194; HELICASE_CTER; 1.
PS50507; RDRP_SSRNA_POS; 1.
PROSITE graphical view of domain structure (profiles).
BLOCKS P06935.
ProtoNet P06935.
Genome annotation databases
GeneID 912267; -.
Other
UniRef View cluster of proteins with at least 50% / 90% / 100% identity.
Keywords
3D-structure; ATP-binding; Capsid protein; Cleavage on pair of basic residues; Complete proteome; Core protein; Envelope protein; Glycoprotein; Helicase; Hydrolase; Membrane; Nucleotide-binding; Nucleotidyltransferase; RNA replication; RNA-directed RNA polymerase; Transferase; Transmembrane; Virion.
Features
SEVIEWER logo Feature table viewer FT aligner logo Feature aligner
KeyFrom    To Length Description FTId
INIT_MET   1      1        Removed; by host. 
CHAIN   2    123  122     Protein C. PRO_0000037743
PROPEP   124    215  92      PRO_0000037744
CHAIN   216    290  75     Small envelope protein M. PRO_0000037745
CHAIN   291    787  497     Envelope protein E. PRO_0000037746
CHAIN   788   1139  352     Non-structural protein 1. PRO_0000037747
CHAIN   1140   1370  231     Non-structural protein 2A. PRO_0000037748
CHAIN   1371   1501  131     Flavivirin protease NS2B regulatory subunit. PRO_0000037749
CHAIN   1502   2120  619     Flavivirin protease NS3 catalytic subunit. PRO_0000037750
CHAIN   2121   2269  149     Non-structural protein 4A. PRO_0000037751
CHAIN   2270   2525  256     Non-structural protein 4B. PRO_0000037752
CHAIN   2526   3430  905     RNA-directed RNA polymerase. PRO_0000037753
TRANSMEM   46     66  21     Potential. 
TRANSMEM   106    126  21     Potential. 
TRANSMEM   249    269  21     Potential. 
TRANSMEM   276    292  17     Potential. 
TRANSMEM   740    760  21     Potential. 
TRANSMEM   767    787  21     Potential. 
TRANSMEM   1139   1159  21     Potential. 
TRANSMEM   1171   1191  21     Potential. 
TRANSMEM   1213   1233  21     Potential. 
TRANSMEM   1244   1264  21     Potential. 
TRANSMEM   1279   1301  23     Potential. 
TRANSMEM   1341   1361  21     Potential. 
TRANSMEM   1372   1392  21     Potential. 
TRANSMEM   1396   1416  21     Potential. 
TRANSMEM   1474   1494  21     Potential. 
TRANSMEM   2171   2191  21     Potential. 
TRANSMEM   2197   2217  21     Potential. 
TRANSMEM   2219   2239  21     Potential. 
TRANSMEM   2255   2275  21     Potential. 
TRANSMEM   2310   2330  21     Potential. 
TRANSMEM   2356   2376  21     Potential. 
TRANSMEM   2378   2398  21     Potential. 
TRANSMEM   2442   2462  21     Potential. 
DOMAIN   1508   1679  172     Peptidase S7. 
DOMAIN   1682   1838  157     Helicase ATP-binding. 
DOMAIN   1849   2014  166     Helicase C-terminal. 
DOMAIN   3055   3207  153     RdRp catalytic. 
NP_BIND   1695   1702  8     ATP (Potential). 
REGION   388    401  14     Involved in fusion. 
MOTIF   1786   1789  4     DEAH box. 
ACT_SITE   1552   1552        Charge relay system (By similarity). 
ACT_SITE   1576   1576        Charge relay system (By similarity). 
ACT_SITE   1636   1636        Charge relay system (By similarity). 
CARBOHYD   138    138        N-linked (GlcNAc...) (Potential). 
CARBOHYD   917    917        N-linked (GlcNAc...) (Potential). 
CARBOHYD   962    962        N-linked (GlcNAc...) (Potential). 
CARBOHYD   994    994        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2336   2336        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2489   2489        N-linked (GlcNAc...) (Potential). 
DISULFID   293    320         
DISULFID   350    406         
DISULFID   364    395         
DISULFID   382    411         
DISULFID   476    574         
DISULFID   591    622         
STRAND   1423   1428  6      
STRAND   1444   1449  6      
STRAND   1455   1457  3      
STRAND   1522   1527  6      
STRAND   1536   1543  8      
STRAND   1546   1550  5      
HELIX   1551   1554  4      
STRAND   1559   1561  3      
STRAND   1564   1566  3      
STRAND   1568   1572  5      
TURN   1573   1576  4      
STRAND   1577   1583  7      
STRAND   1592   1594  3      
STRAND   1596   1600  5      
STRAND   1608   1612  5      
STRAND   1615   1619  5      
STRAND   1622   1627  6      
HELIX   1633   1635  3      
STRAND   1639   1641  3      
STRAND   1647   1651  5      
STRAND   1654   1656  3      
STRAND   1662   1665  4      
Sequence information
Length: 3430 AA [This is the length of the unprocessed precursor] Molecular weight: 380110 Da [This is the MW of the unprocessed precursor] CRC64: 42D71B7CB12DC45B [This is a checksum on the sequence]
        10         20         30         40         50         60 
MSKKPGGPGK NRAVNMLKRG MPRGLSLIGL KRAMLSLIDG KGPIRFVLAL LAFFRFTAIA 

        70         80         90        100        110        120 
PTRAVLDRWR GVNKQTAMKH LLSFKKELGT LTSAINRRST KQKKRGGTAG FTILLGLIAC 

       130        140        150        160        170        180 
AGAVTLSNFQ GKVMMTVNAT DVTDVITIPT AAGKNLCIVR AMDVGYLCED TITYECPVLA 

       190        200        210        220        230        240 
AGNDPEDIDC WCTKSSVYVR YGRCTKTRHS RRSRRSLTVQ THGESTLANK KGAWLDSTKA 

       250        260        270        280        290        300 
TRYLVKTESW ILRNPGYALV AAVIGWMLGS NTMQRVVFAI LLLLVAPAYS FNCLGMSNRD 

       310        320        330        340        350        360 
FLEGVSGATW VDLVLEGDSC VTIMSKDKPT IDVKMMNMEA ANLADVRSYC YLASVSDLST 

       370        380        390        400        410        420 
RAACPTMGEA HNEKRADPAF VCKQGVVDRG WGNGCGLFGK GSIDTCAKFA CTTKATGWII 

       430        440        450        460        470        480 
QKENIKYEVA IFVHGPTTVE SHGKIGATQA GRFSITPSAP SYTLKLGEYG EVTVDCEPRS 

       490        500        510        520        530        540 
GIDTSAYYVM SVGEKSFLVH REWFMDLNLP WSSAGSTTWR NRETLMEFEE PHATKQSVVA 

       550        560        570        580        590        600 
LGSQEGALHQ ALAGAIPVEF SSNTVKLTSG HLKCRVKMEK LQLKGTTYGV CSKAFKFART 

       610        620        630        640        650        660 
PADTGHGTVV LELQYTGTDG PCKVPISSVA SLNDLTPVGR LVTVNPFVSV ATANSKVLIE 

       670        680        690        700        710        720 
LEPPFGDSYI VVGRGEQQIN HHWHKSGSSI GKAFTTTLRG AQRLAALGDT AWDFGSVGGV 

       730        740        750        760        770        780 
FTSVGKAIHQ VFGGAFRSLF GGMSWITQGL LGALLLWMGI NARDRSIAMT FLAVGGVLLF 

       790        800        810        820        830        840 
LSVNVHADTG CAIDIGRQEL RCGSGVFIHN DVEAWMDRYK FYPETPQGLA KIIQKAHAEG 

       850        860        870        880        890        900 
VCGLRSVSRL EHQMWEAIKD ELNTLLKENG VDLSVVVEKQ NGMYKAAPKR LAATTEKLEM 

       910        920        930        940        950        960 
GWKAWGKSII FAPELANNTF VIDGPETEEC PTANRAWNSM EVEDFGFGLT STRMFLRIRE 

       970        980        990       1000       1010       1020 
TNTTECDSKI IGTAVKNNMA VHSDLSYWIE SGLNDTWKLE RAVLGEVKSC TWPETHTLWG 

      1030       1040       1050       1060       1070       1080 
DGVLESDLII PITLAGPRSN HNRRPGYKTQ NQGPWDEGRV EIDFDYCPGT TVTISDSCEH 

      1090       1100       1110       1120       1130       1140 
RGPAARTTTE SGKLITDWCC RSCTLPPLRF QTENGCWYGM EIRPTRHDEK TLVQSRVNAY 

      1150       1160       1170       1180       1190       1200 
NADMIDPFQL GLMVVFLATQ EVLRKRWTAK ISIPAIMLAL LVLVFGGITY TDVLRYVILV 

      1210       1220       1230       1240       1250       1260 
GAAFAEANSG GDVVHLALMA TFKIQPVFLV ASFLKARWTN QESILLMLAA AFFQMAYYDA 

      1270       1280       1290       1300       1310       1320 
KNVLSWEVPD VLNSLSVAWM ILRAISFTNT SNVVVPLLAL LTPGLKCLNL DVYRILLLMV 

      1330       1340       1350       1360       1370       1380 
GVGSLIKEKR SSAAKKKGAC LICLALASTG VFNPMILAAG LMACDPNRKR GWPATEVMTA 

      1390       1400       1410       1420       1430       1440 
VGLMFAIVGG LAELDIDSMA IPMTIAGLMF AAFVISGKST DMWIERTADI TWESDAEITG 

      1450       1460       1470       1480       1490       1500 
SSERVDVRLD DDGNFQLMND PGAPWKIWML RMACLAISAY TPWAILPSVI GFWITLQYTK 

      1510       1520       1530       1540       1550       1560 
RGGVLWDTPS PKEYKKGDTT TGVYRIMTRG LLGSYQAGAG VMVEGVFHTL WHTTKGAALM 

      1570       1580       1590       1600       1610       1620 
SGEGRLDPYW GSVKEDRLCY GGPWKLQHKW NGHDEVQMIV VEPGKNVKNV QTKPGVFKTP 

      1630       1640       1650       1660       1670       1680 
EGEIGAVTLD YPTGTSGSPI VDKNGDVIGL YGNGVIMPNG SYISAIVQGE RMEEPAPAGF 

      1690       1700       1710       1720       1730       1740 
EPEMLRKKQI TVLDLHPGAG KTRKILPQII KEAINKRLRT AVLAPTRVVA AEMSEALRGL 

      1750       1760       1770       1780       1790       1800 
PIRYQTSAVH REHSGNEIVD VMCHATLTHR LMSPHRVPNY NLFIMDEAHF TDPASIAARG 

      1810       1820       1830       1840       1850       1860 
YIATKVELGE AAAIFMTATP PGTSDPFPES NAPISDMQTE IPDRAWNTGY EWITEYVGKT 

      1870       1880       1890       1900       1910       1920 
VWFVPSVKMG NEIALCLQRA GKKVIQLNRK SYETEYPKCK NDDWDFVITT DISEMGANFK 

      1930       1940       1950       1960       1970       1980 
ASRVIDSRKS VKPTIIEEGD GRVILGEPSA ITAASAAQRR GRIGRNPSQV GDEYCYGGHT 

      1990       2000       2010       2020       2030       2040 
NEDDSNFAHW TEARIMLDNI NMPNGLVAQL YQPEREKVYT MDGEYRLRGE ERKNFLEFLR 

      2050       2060       2070       2080       2090       2100 
TADLPVWLAY KVAAAGISYH DRKWCFDGPR TNTILEDNNE VEVITKLGER KILRPRWADA 

      2110       2120       2130       2140       2150       2160 
RVYSDHQALK SFKDFASGKR SQIGLVEVLG RMPEHFMVKT WEALDTMYVV ATAEKGGRAH 

      2170       2180       2190       2200       2210       2220 
RMALEELPDA LQTIVLIALL SVMSLGVFFL LMQRKGIGKI GLGGVILGAA TFFCWMAEVP 

      2230       2240       2250       2260       2270       2280 
GTKIAGMLLL SLLLMIVLIP EPEKQRSQTD NQLAVFLICV LTLVGAVAAN EMGWLDKTKN 

      2290       2300       2310       2320       2330       2340 
DIGSLLGHRP EARETTLGVE SFLLDLRPAT AWSLYAVTTA VLTPLLKHLI TSDYINTSLT 

      2350       2360       2370       2380       2390       2400 
SINVQASALF TLARGFPFVD VGVSALLLAV GCWGQVTLTV TVTAAALLFC HYAYMVPGWQ 

      2410       2420       2430       2440       2450       2460 
AEAMRSAQRR TAAGIMKNVV VDGIVATDVP ELERTTPVMQ KKVGQIILIL VSMAAVVVNP 

      2470       2480       2490       2500       2510       2520 
SVRTVREAGI LTTAAAVTLW ENGASSVWNA TTAIGLCHIM RGGWLSCLSI MWTLIKNMEK 

      2530       2540       2550       2560       2570       2580 
PGLKRGGAKG RTLGEVWKER LNHMTKEEFT RYRKEAITEV DRSAAKHARR EGNITGGHPV 

      2590       2600       2610       2620       2630       2640 
SRGTAKLRWL VERRFLEPVG KVVDLGCGRG GWCYYMATQK RVQEVKGYTK GGPGHEEPQL 

      2650       2660       2670       2680       2690       2700 
VQSYGWNIVT MKSGVDVFYR PSEASDTLLC DIGESSSSAE VEEHRTVRVL EMVEDWLHRG 

      2710       2720       2730       2740       2750       2760 
PKEFCIKVLC PYMPKVIEKM ETLQRRYGGG LIRNPLSRNS THEMYWVSHA SGNIVHSVNM 

      2770       2780       2790       2800       2810       2820 
TSQVLLGRME KKTWKGPQFE EDVNLGSGTR AVGKPLLNSD TSKIKNRIER LKKEYSSTWH 

      2830       2840       2850       2860       2870       2880 
QDANHPYRTW NYHGSYEVKP TGSASSLVNG VVRLLSKPWD TITNVTTMAM TDTTPFGQQR 

      2890       2900       2910       2920       2930       2940 
VFKEKVDTKA PEPPEGVKYV LNETTNWLWA FLARDKKPRM CSREEFIGKV NSNAALGAMF 

      2950       2960       2970       2980       2990       3000 
EEQNQWKNAR EAVEDPKFWE MVDEEREAHL RGECNTCIYN MMGKREKKPG EFGKAKGSRA 

      3010       3020       3030       3040       3050       3060 
IWFMWLGARF LEFEALGFLN EDHWLGRKNS GGGVEGLGLQ KLGYILKEVG TKPGGKVYAD 

      3070       3080       3090       3100       3110       3120 
DTAGWDTRIT KADLENEAKV LELLDGEHRR LARSIIELTY RHKVVKVMRP AADGKTVMDV 

      3130       3140       3150       3160       3170       3180 
ISREDQRGSG QVVTYALNTF TNLAVQLVRM MEGEGVIGPD DVEKLGKGKG PKVRTWLFEN 

      3190       3200       3210       3220       3230       3240 
GEERLSRMAV SGDDCVVKPL DDRFATSLHF LNAMSKVRKD IQEWKPSTGW YDWQQVPFCS 

      3250       3260       3270       3280       3290       3300 
NHFTELIMKD GRTLVVPCRG QDELIGRARI SPGAGWNVRD TACLAKSYAQ MWLLLYFHRR 

      3310       3320       3330       3340       3350       3360 
DLRLMANAIC SAVPANWVPT GRTTWSIHAK GEWMTTEDML AVWNRVWIEE NEWMEDKTPV 

      3370       3380       3390       3400       3410       3420 
ERWSDVPYSG KREDIWCGSL IGTRTRATWA ENIHVAINQV RSVIGEEKYV DYMSSLRRYE 

      3430 
DTIVVEDTVL 

P06935 in FASTA format

View entry in original UniProtKB/Swiss-Prot format
View entry in raw text format (no links)
Report form for errors/updates in this UniProtKB/Swiss-Prot entry

BLAST logo BLAST submission on ExPASy/SIB
or at NCBI (USA)
Tools Sequence analysis tools: ProtParam, ProtScale, Compute pI/Mw, PeptideMass, PeptideCutter, Dotlet (Java)
PROSITE logo ScanProsite, MotifScan SWISS-MODEL Submit a homology modeling request to SWISS-MODEL
NPSA logo NPSA Sequence analysis tools

ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
 Hosted by ch flag SIB Switzerland Mirror sites: Australia  Brazil  Canada  China  Korea
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!