|
|
|
|
|
|
|
|
|
- FUNCTION: P123 and P123' are short-lived polyproteins, accumulating during early stage of infection. P123 is directly translated from the genome, whereas P123' is a product of the cleavage of P1234. They localize the viral replication complex to the cytoplasmic surface of modified endosomes and lysosomes. By interacting with nsP4, they start viral genome replication into antigenome. After these early events, P123 and P123' are cleaved sequentially into nsP1, nsP2 and nsP3/nsP3'. This sequence of delayed processing would allow correct assembly and membrane association of the RNA polymerase complex (By similarity).
- FUNCTION: nsP1 is a cytoplasmic capping enzyme. This function is necessary since all viral RNAs are synthesized in the cytoplasm, and host capping enzymes are restricted to the nucleus. The enzymatic reaction involves a covalent link between 7-methyl-GMP and nsP1, whereas eukaryotic capping enzymes form a covalent complex only with GMP. nsP1 capping would consist in the following reactions: GTP is first methylated and then forms the m7GMp-nsP1 complex, from which 7-methyl-GMP complex is transferred to the mRNA to create the cap structure. Palmitoylated nsP1 is remodeling host cell cytoskeleton, and induces filopodium-like structure formation at the surface of the host cell (By similarity).
- FUNCTION: nsP2 has two separate domain with different biological activities. The N-terminal section is part of the RNA polymerase complex and has RNA trisphosphatase and RNA helicase activity. The C-terminal section harbors a protease that specifically cleaves and releases the four mature proteins (By similarity).
- FUNCTION: nsP3 and nsP3' are essential for minus strand and subgenomic 26S mRNA synthesis (By similarity).
- FUNCTION: nsP4 is a RNA dependent RNA polymerase. It replicates genomic and antigenomic RNA by recognizing replications specific signals. Transcribes also a 26S subgenomic mRNA by initiating RNA synthesis internally on antigenomic RNA. This 26S mRNA encodes for structural proteins. nsP4 is a short-lived protein regulated by several ways: the opal codon readthrough and degradation by ubiquitin pathway (By similarity).
- CATALYTIC ACTIVITY: S-adenosyl-L-methionine + GTP = m7GTP.
- CATALYTIC ACTIVITY: m7GTP + (5')pp-Pur-mRNA = diphosphate + m7G(5')ppp-Pur-mRNA.
- CATALYTIC ACTIVITY: (5')ppp-mRNA + H2O = (5')pp-mRNA + phosphate.
- CATALYTIC ACTIVITY: A 5'-phosphopolynucleotide + H2O = a polynucleotide + phosphate.
- CATALYTIC ACTIVITY: NTP + H2O = NDP + phosphate.
- CATALYTIC ACTIVITY: Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).
- SUBUNIT: P123 interacts with nsP4; nsP1, nsP2, nsP3 and nsP4 interact with each other, and with uncharacterized host factors.
- SUBCELLULAR LOCATION: Non-structural polyprotein: Endosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Lysosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Note=Located on the cytoplasmic surface of modified endosomes and lysosomes, also called cytopathic vacuoles type I (CPVI). These vacuoles contain numerous small circular invaginations (spherules) which may be the sites of RNA synthesis.
- SUBCELLULAR LOCATION: P123: Endosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Lysosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity).
- SUBCELLULAR LOCATION: P123': Endosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Lysosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity).
- SUBCELLULAR LOCATION: mRNA-capping enzyme nsP1: Endosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Lysosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Cell membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Cell projection, filopodium (By similarity). Note=In the late phase of infection, the polyprotein is quickly cleaved before localization to cellular membranes. Then a fraction of nsP1 localizes to the inner surface of the plasma membrane and its filopodial extensions (By similarity).
- SUBCELLULAR LOCATION: Protease/triphosphatase/NTPase/helicase nsP2: Endosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Lysosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Nucleus (By similarity). Note=In the late phase of infection, the polyprotein is quickly cleaved before localization to cellular membranes. Then approximately half of nsP2 is found in the nucleus (By similarity).
- SUBCELLULAR LOCATION: Non-structural protein 3: Endosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Lysosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Cytoplasm (By similarity). Note=In the late phase of infection, the polyprotein is quickly cleaved before localization to cellular membranes. Then nsP3 and nsP3' seems to aggregate in cytoplasm (By similarity).
- SUBCELLULAR LOCATION: Non-structural protein 3': Endosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Lysosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Cytoplasm (By similarity). Note=In the late phase of infection, the polyprotein is quickly cleaved before localization to cellular membranes. Then nsP3 and nsP3' seems to aggregate in cytoplasm (By similarity).
- SUBCELLULAR LOCATION: RNA-directed RNA polymerase nsP4: Endosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity). Lysosome membrane; Peripheral membrane protein; Cytoplasmic side (By similarity).
- INDUCTION: Viral replication produces dsRNA in the late phase of infection, resulting in a strong activation of host EIF2AK2/PKR, leading to almost complete phosphorylation of EIF2A. This inactivates completely cellular translation initiation, resulting in a dramatic shutoff of proteins synthesis. Translation of viral non-structural polyprotein and all cellular proteins are stopped in infected cell between 2 and 4 hours post infection. Only the 26S mRNA is still translated into viral structural proteins, presumably through a unique mechanism of enhancer element which counteract the translation inhibition mediated by EIF2A. By doing this, the virus uses the cellular defense for its own advantage: shutoff of cellular translation allows to produce big amounts of structural proteins needed for the virus to bud out of the doomed cell.
- PTM: Specific enzymatic cleavages in vivo yield mature proteins. The polyprotein is synthesized as P123, or P1234 by stop codon readthrough. These polyproteins are processed differently depending on the stage of infection. In early stages, P1234 is first cleaved in trans, through its nsP2 protease activity, releasing P123' and nsP4. P123/P123' and nsP4 start to replicate the viral genome into its antigenome. After these early events, nsP1 is cleaved in cis by nsP2 protease, releasing the P23/P23' polyprotein. Cleavage of nsP1 exposes an 'activator' at the N-terminus of P23/P23' which induces its cleavage into nsP2 and nsP3 by the viral protease. This sequence of delayed processing would allow correct assembly and membrane association of the RNA-polymerase complex. In the late stage of infection, the presence of free nsP2 in the cytoplasm cleaves P1234 quickly into P12 and P34, then into the four nsP (By similarity).
- PTM: nsP1 is palmitoylated by host (By similarity).
- PTM: nsP4 is ubiquitinated; targets the protein for rapid degradation via the ubiquitin system (By similarity).
- MISCELLANEOUS: The genome encodes for P123, but readthrough of a terminator codon UGA occurs between the codons for Asn-1859 and Arg-1860. This readthrough produces P1234, cleaved quickly by nsP2 into P123' and nsP4. Further processing of p123' gives nsP1, nsP2 and nsP3' which is 7 amino-acids longer than nsP3 since the cleavage site is after the readthrough. This unusual molecular mechanism ensures that few nsP4 are produced compared to other non-structural proteins. Mutant viruses with no alternative termination site grow significantly slower than wild-type virus (By similarity).
- SIMILARITY: Contains 1 Macro domain.
- SIMILARITY: Contains 1 peptidase C9 domain [view classification].
- SIMILARITY: Contains 1 RdRp catalytic domain.
|
|
|
|
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms.
Distributed under the Creative Commons Attribution-NoDerivs License.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Length: 2473 AA [This is the length of the unprocessed precursor] |
Molecular weight: 276313 Da [This is the MW of the unprocessed precursor] |
CRC64: C220E9F9B51DA183 [This is a checksum on the sequence] |
|
10 20 30 40 50 60
MEKVHVDLDA DSPYVKLLQK CFPHFEIEAT QVTDNDHANA RAFSHLATKL IESEVDPDQV
70 80 90 100 110 120
ILDIGSAPVR HTHSKHKYHC ICPMISAEDP DRLQRYADKL RKSDVTDRFI ASKAADLLTV
130 140 150 160 170 180
MSTPDVETPS LCMHTDSTCR YHGTVAVYQD VYAVHAPTSI YHQALKGVRT IYWIGFDTTP
190 200 210 220 230 240
FMYKNMAGAY PTYNTNWADE SVLEARNIGL CSSDLHEKRL GKISIMRKKK LQPTNKVVFS
250 260 270 280 290 300
VGSTIYTEER ILLRSWHLPN VFHLKGKTSF TGRCNTIVSC DGYVVKKITI SPGIYGKVDN
310 320 330 340 350 360
LASTLHREGF LSCKVTDTLR GERVSFPVCT YVPATLCDQM TGILATDVSV DDAQKLLVGL
370 380 390 400 410 420
NQRIVVNGRT QRNTNTMQNY LLPVVAQAFS RWAREYRADL EDEKDLGVRE RSLVMGCCWA
430 440 450 460 470 480
FKTHKITSIY KKPGTQTIKK VPAVFNSFVI PQFNSYGLNI GLRRRIKMLL EEKRKPAPII
490 500 510 520 530 540
TEADVAHLKG MQEEAEAVAE AEAVRAALPP LLPEVERETI EADIDLIMQE AGAGSVETPR
550 560 570 580 590 600
RHIKVTTYPG EETIGSYAVL SPQAVLNSEK LACIHPLAEQ VLVMTHKGRA GRYKVEPYHG
610 620 630 640 650 660
RVVVPSGTAI PIPDFQALSE SATIVYNERE FVNRYLHHIA INGGAINTDE EYYKVLRSSE
670 680 690 700 710 720
ADSEYVFDID ARKCVKKADA GPMCLVGELV DPPFHEFAYE SLKTRPAAPH KVPTIGVYGV
730 740 750 760 770 780
PGSGKSGIIK SAVTKRDLVV SAKKENCTEI IKDVKRMRGM DIAARTVDSV LLNGVKHPVD
790 800 810 820 830 840
TLYIDEAFAC HAGTLLALIA IVKPKKVVLC GDPKQCGFFN MMCLKVHFNH EICTEVYHKS
850 860 870 880 890 900
ISRRCTKTVT AIVSTLFYDK RMRTVNPCSD KIIIDTTSTT KPQRDDIILT CFRGWVKQLQ
910 920 930 940 950 960
IDYKNHEIMT AAASQGLTRK GVYAVRYKVN ENPLYAQTSE HVNVLLTRTE KRIVWKTLAG
970 980 990 1000 1010 1020
DPWIKTLTAH YPGEFSATLE EWQAEHDAIM ERILETPASS DVYQNKVHVC WAKALEPVLA
1030 1040 1050 1060 1070 1080
TANITLTRSQ WETIPAFKDD KAFSPEMALN FLCTRFFGVD IDSGLFSAPT VPLTYTNEHW
1090 1100 1110 1120 1130 1140
DNSPGPNRYG LCMRTAKELA RRYPCILKAV DTGRLADVRT NTIKDYSPLI NVVPLNRRLP
1150 1160 1170 1180 1190 1200
HSLVVSHRYT GDGNYSQLLS KLIGKTVLVI GTPISVPGKR VETLGPGPQC TYKADLDLGI
1210 1220 1230 1240 1250 1260
PSTIGKYDII FVNVRTPYKH HHYQQCEDHA IHHSMLTRKA VDHLNKGGTC VALGYGTADR
1270 1280 1290 1300 1310 1320
ATENIISAVA RSFRFSRVCQ PKCAWENTEV AFVFFGKDNG NHLRDQDQLS IVLNNIYQGS
1330 1340 1350 1360 1370 1380
TQYEAGRAPA YRVIRGDISK STDEAIVNAA NNKGQPGAGV CGALYKKWPG AFDKVPIATG
1390 1400 1410 1420 1430 1440
TAHLVKHTPN IIHAVGPNFS RVSEVEGNQK LSEVYMDIAK IINRERYNKV SIPLLSTGIY
1450 1460 1470 1480 1490 1500
AGGKDRVMQS LNHLFTAMDT TDADVTIYCL DKQWEARIKD AIARKESVEE LVEDDKPVDI
1510 1520 1530 1540 1550 1560
ELVRVHPLSS LVGRPGYSTD EGKVHSYLEG TRFHQTAKDI AEIYAMWPNK QEANEQICLY
1570 1580 1590 1600 1610 1620
VLGESMTSIR SKCPVEDSEA SSPPHTIPCL CNYAMTAERV YRLRMAKNEQ FAVCSSFQLP
1630 1640 1650 1660 1670 1680
KYRITGVQKI QCNKPVIFSG VVPPAIHPRK FSAIEETVPV TIERLVPRRP APPVPVPARI
1690 1700 1710 1720 1730 1740
PSPRCSPAVS MQSLGGSSTS DVVISEAEVH DSDSDCSIPP MPFVVEAEVH ASQGSHWSIP
1750 1760 1770 1780 1790 1800
SASGFEIREL PEDRSISGSP TRTSVISDHS VNLITFDSVT DIFENFKQAP FQFLSEIRPI
1810 1820 1830 1840 1850 1860
PAPRRRVGGF ETDTKRYDKT EEKPIPKPRT RTTKYKQPPG VARSISEAEL DEFIRRHSNR
1870 1880 1890 1900 1910 1920
YEAGAYIFSS ETGQGHLQQK STRQCKLQNP ILERSVHEKF YAPRLDLERE KLLQKKLQLC
1930 1940 1950 1960 1970 1980
ASEGNRSRYQ SRKVENMKAI TAERLLQGIG AYLSAESQPV ECYKVNYPVP IYSTTRSNRF
1990 2000 2010 2020 2030 2040
SSADVAVRVC NLVLQENFPT VASYTITDEY DAYLDMVDGA SCCLDTATFC PAKLRSFPKK
2050 2060 2070 2080 2090 2100
HSYLRPEIRS AVPSPIQNTL QNVLAAATKR NCNVTQMREL PVLDSAAFNV ECFKKYACND
2110 2120 2130 2140 2150 2160
EYWDTFKNNP IRLTTENVTQ YVTKLKGPKA AALFAKTHNL QPLHEIPMDR FVMDLKRDVK
2170 2180 2190 2200 2210 2220
VTPGTKHTEE RPKVQVIQAA EPLATAYLCG IHRELVRRLN AVLLPNIHTL FDMSAEDFDA
2230 2240 2250 2260 2270 2280
IIAEHFQFGD AVLETDIASF DKSEDDAIAM SALMILEDLG VDQALLDLIE AAFGNITSVH
2290 2300 2310 2320 2330 2340
LPTGTRFKFG AMMKSGMFLT LFINTVVNIM IASRVLRERL TNSPCAAFIG DDNIVKGVKS
2350 2360 2370 2380 2390 2400
DALMAERCAT WLNMEVKIID ATVGVKAPYF CGGFIVVDQV TGTACRVADP LKRLFKLGKP
2410 2420 2430 2440 2450 2460
LPLDDDQDGD RRRALYDEAL RWNRIGITDE LIKAVESRYE VFYISLVITA LTTLAATVSN
2470
FKYIRGNPIT LYG
|
Q306W8 in FASTA format |
|