|
|
|
|
|
|
[1]
|
NUCLEOTIDE SEQUENCE [GENOMIC RNA].
PubMed=2986127 [NCBI, ExPASy, EBI, Israel, Japan]
Najarian R.,
Caput D.,
Gee W.W.,
Potter S.J.,
Renard A.,
Merryweather J.,
van Nest G.,
Dina D.;
"Primary structure and gene organization of human hepatitis A virus.";
Proc. Natl. Acad. Sci. U.S.A. 82:2627-2631(1985).
|
|
|
|
- FUNCTION: Capsid proteins VP1, VP2, and VP3 form a closed capsid enclosing the viral positive strand RNA genome. All these proteins contain a beta-sheet structure called beta-barrel jelly roll. Together they form an icosahedral capsid (T=3) composed of 60 copies of each VP1, VP2, and VP3, with a diameter of approximately 300 Angstroms. VP1 is situated at the 12 fivefold axes, whereas VP2 and VP3 are located at the quasi-sixfold axes. The capsid interacts with HAVCR1 to provide virion attachment to target cell (By similarity).
- FUNCTION: VP0 precursor is a component of immature procapsids. The N-terminal domain of VP0, protein VP4, is needed for the assembly of 12 pentamers into the icosahedral structure. Unlike other picornaviruses, HAV VP4 does not seem to be myristoylated and has not been detected in mature virions, supposedly owing to its small size (By similarity).
- FUNCTION: VP1-2A precursor is a component of immature procapsids and corresponds to an extended form of the structural protein VP1. The C-terminal domain of VP1-2A, protein 2A, acts as an assembly signal that allows multimerization of VP1-2A and formation of pentamers of VP1-VP2-VP3 trimers. It is proteolytically removed from the precursor by a host protease and does not seem to be found in mature particles (By similarity).
- FUNCTION: Protein 2B and 2BC precursor affect membrane integrity and cause an increase in membrane permeability (By similarity).
- FUNCTION: Protein 2C associates with and induces structural rearrangements of intracellular membranes. It displays RNA-binding, nucleotide binding and NTPase activities (By similarity).
- FUNCTION: Protein 3A, via its hydrophobic domain, serves as membrane anchor to the 3AB and 3ABC precursors (By similarity).
- FUNCTION: The 3AB precursor interacts with the 3CD precursor and with RNA structures found at both the 5'- and 3'-termini of the viral genome. Since the 3AB precursor contains the hydrophobic domain 3A, it probably anchors the whole viral replicase complex to intracellular membranes on which viral RNA synthesis occurs (By similarity).
- FUNCTION: The 3ABC precursor is targeted to the mitochondrial membrane where protease 3C activity cleaves and inhibits the host antiviral protein MAVS, thereby disrupting activation of IRF3 through the IFIH1/MDA5 pathway. In vivo, the protease activity of 3ABC precursor is more efficient in cleaving the 2BC precursor than that of protein 3C. The 3ABC precursor may therefore play a role in the proteolytic processing of the polyprotein (By similarity).
- FUNCTION: Protein 3B is covalently linked to the 5'-end of both the positive-strand and negative-strand genomic RNAs. It acts as a genome-linked replication primer (By similarity).
- FUNCTION: Protein 3C is a cysteine protease that generates mature viral proteins from the precursor polyprotein. In addition to its proteolytic activity, it binds to viral RNA, and thus influences viral genome replication. RNA and substrate bind co-operatively to the protease. Also cleaves host proteins such as PCBP2 (By similarity).
- FUNCTION: RNA-directed RNA polymerase 3D-POL replicates genomic and antigenomic RNA by recognizing replications specific signals (By similarity).
- CATALYTIC ACTIVITY: Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).
- CATALYTIC ACTIVITY: Selective cleavage of Gln-|-Gly bond in the poliovirus polyprotein. In other picornavirus reactions Glu may be substituted for Gln, and Ser or Thr for Gly.
- CATALYTIC ACTIVITY: NTP + H2O = NDP + phosphate.
- SUBUNIT: 3AB precursor is a homodimer. 3AB precursor interacts with 3CD precursor (By similarity).
- SUBCELLULAR LOCATION: Protein VP2: Virion (By similarity). Cytoplasm (Potential).
- SUBCELLULAR LOCATION: Protein VP3: Virion (By similarity). Cytoplasm (Potential).
- SUBCELLULAR LOCATION: Protein VP1: Virion (By similarity). Cytoplasm (Potential).
- SUBCELLULAR LOCATION: Protein VP1-2A: Virion (By similarity). Cytoplasm (Potential).
- SUBCELLULAR LOCATION: Protein 2B: Cytoplasmic vesicle membrane; Peripheral membrane protein; Cytoplasmic side (Potential). Note=Probably localizes to the surface of intracellular membrane vesicles that are induced after virus infection as the site for viral RNA replication. These vesicles are derived from the endoplasmic reticulum (By similarity).
- SUBCELLULAR LOCATION: Protein 2C: Cytoplasmic vesicle membrane; Peripheral membrane protein; Cytoplasmic side (Potential). Note=Probably localizes to the surface of intracellular membrane vesicles that are induced after virus infection as the site for viral RNA replication. These vesicles are derived from the endoplasmic reticulum. May associate with membranes through a N-terminal amphipathic helix (By similarity).
- SUBCELLULAR LOCATION: Protein 3ABC: Cytoplasmic vesicle membrane; Peripheral membrane protein; Cytoplasmic side (Potential). Mitochondrion outer membrane; Peripheral membrane protein; Cytoplasmic side (Potential). Note=Probably localizes to the surface of intracellular membrane vesicles that are induced after virus infection as the site for viral RNA replication. These vesicles are derived from the endoplasmic reticulum (By similarity).
- SUBCELLULAR LOCATION: Protein 3AB: Cytoplasmic vesicle membrane; Peripheral membrane protein; Cytoplasmic side (Potential). Note=Probably localizes to the surface of intracellular membrane vesicles that are induced after virus infection as the site for viral RNA replication. These vesicles are derived from the endoplasmic reticulum (By similarity).
- SUBCELLULAR LOCATION: Protein 3A: Cytoplasmic vesicle membrane; Peripheral membrane protein; Cytoplasmic side (Potential). Note=Probably localizes to the surface of intracellular membrane vesicles that are induced after virus infection as the site for viral RNA replication. These vesicles are derived from the endoplasmic reticulum (By similarity).
- SUBCELLULAR LOCATION: Protein 3B: Virion (Potential).
- SUBCELLULAR LOCATION: Picornain 3C: Cytoplasm (Potential).
- SUBCELLULAR LOCATION: RNA-directed RNA polymerase 3D-POL: Cytoplasmic vesicle membrane; Peripheral membrane protein; Cytoplasmic side (Potential). Note=Interacts with membranes in a complex with viral protein 3AB. Probably localizes to the surface of intracellular membrane vesicles that are induced after virus infection as the site for viral RNA replication. These vesicles are derived from the endoplasmic reticulum (By similarity).
- PTM: Specific enzymatic cleavages by the viral protease in vivo yield a variety of precursors and mature proteins. Polyprotein processing intermediates are produced, such as P1-2A which is a functional precursor of the structural proteins, VP0 which is a VP4-VP2 precursor, VP1-2A precursor, 3ABC precursor which is a stable and catalytically active precursor of 3A, 3B and 3C proteins, 3AB and 3CD precursors. The assembly signal 2A is removed from VP1-2A by a host protease. During virion maturation, non-infectious particles are rendered infectious following cleavage of VP0. This maturation cleavage is followed by a conformational change of the particle (By similarity).
- SIMILARITY: Belongs to the picornaviridae polyprotein family.
- SIMILARITY: Contains 1 peptidase C3 domain [view classification].
- SIMILARITY: Contains 1 RdRp catalytic domain.
- SIMILARITY: Contains 1 SF3 helicase domain.
- CAUTION: It is uncertain whether Met-1 or Met-3 is the initiator.
- CAUTION: Protein VP1 seems to have a heterogeneous C-terminus in cell culture. It may be reduced by a few amino acids compared to the sequence shown.
|
|
|
|
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms.
Distributed under the Creative Commons Attribution-NoDerivs License.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Length: 2227 AA [This is the length of the unprocessed precursor] |
Molecular weight: 251900 Da [This is the MW of the unprocessed precursor] |
CRC64: 99A7354B4CD2799C [This is a checksum on the sequence] |
|
10 20 30 40 50 60
MNMSKQGIFQ TVGSGLDHIL SLADIEEEQM IQSVDRTAVT GASYFTSVDQ SSVHTAEVGS
70 80 90 100 110 120
HQIEPLKTSV DKPGSKKTQG EKFFLIHSAD WLTTHALFHE VAKLDVVKLL YNEQFAVQGL
130 140 150 160 170 180
LRYHTYARFG IEIQVQINPT PFQQGGLICA MVPGDQSYGS IASLTVYPHG LLNCNINNVV
190 200 210 220 230 240
RIKVPFIYTR GAYHFKDPQY PVWELTIRVW SELNIGTGTS AYTSLNVLAR FTDLELHGLT
250 260 270 280 290 300
PLSTQMMRNE FRVSTTENVV NLSNYEDARA KMSFALDQED WKSDPSQGGG IKITHFTTWT
310 320 330 340 350 360
SIPTLAAQFP FNASDSVGQQ IKVIPVDPYF FQMTNTNPDQ KCITALASIC QMFCFWRGDL
370 380 390 400 410 420
VFDFQVFPTK YHSGRLLFCF VPGNELIDVT GITLKQATTA PCAVMDITGV QSTLRFRVPW
430 440 450 460 470 480
ISDTPYRVNR YTKSAHQKGE YTAIGKLIVY CYNRLTSPSN VASHVRVNVY LSAINLECFA
490 500 510 520 530 540
PLYHAMDVTT QVGDDSGGFS TTVSTEQNVP DPQVGITTMR DLKGKANRGK MDVSGVQAPR
550 560 570 580 590 600
GSYQQQLNDP VLAKKVPETF PELKPGESRH TSDHMSIYKF MGRSHFLCTF TFNSNNKEYT
610 620 630 640 650 660
FPITLSSTSN PPHGLPSTLR WFFNLFQLYR GPLDLTIIIT GATDVDGMAW FTPVGLAVDP
670 680 690 700 710 720
WVEKESALSI DYKTALGAVR FNTRRTGNIQ IRLPWYSYLY AVSGALDGLG DKTDSTFGLF
730 740 750 760 770 780
LFEIANYNHS DEYLSFSCYL SVTEQSEFYF PRAPLNSNAM LSTESMMSRI AAGDLESSVD
790 800 810 820 830 840
DPRSEEDRRF ESHIECRKPY KELRLEVGKQ RLKYAQEELS NEVLPPPRKM KGLFSQAKIS
850 860 870 880 890 900
LFYTEEHEIM KFSWRGVTAD TRALRRFGFS LAAGRSVWTL EMDAGVLTGR LIRLNDEKWT
910 920 930 940 950 960
EMKDDKIVSL IEKFTSNKYW SKVNFPHGML DLEEIAANSK DFPNMSETDL CFLLHWLNPK
970 980 990 1000 1010 1020
KINLADRMLG LSGVQEIKEQ GVGLIAECRT FLDSIAGTLK SMMFGFHHSV TVEIINTVLC
1030 1040 1050 1060 1070 1080
FVKSGILLYV IQQLNQDEHS HIIGLLRVMN YADIGCSVIS CGKVFSKMLE TVFNWQMDSR
1090 1100 1110 1120 1130 1140
MMELRTQSFS NWLRDICSGI TIFKSFKDAI YWLYTKLKDF YEVNYGKKKD ILNILKDNQQ
1150 1160 1170 1180 1190 1200
KIEKAIEEAD NFCILQIQDV EKFDQYQKGV DLIQKLRTVH SMAQVDPNLG VHLSPLRDCI
1210 1220 1230 1240 1250 1260
ARVHQKLKNL GSINQAMVTR CEPVVCYLYG KRGGGKSLTS IALATKICKH YGVEPEKNIY
1270 1280 1290 1300 1310 1320
TKPVASDYWD GYSGQLVCII DDIGQNTTDE DWSDFCQLVS GCPMRLNMAS LEEKGRHFSS
1330 1340 1350 1360 1370 1380
PFIIATSNWS NPSPKTVYVK EAIDRRLHFK VEVKPASFFK NPHNDMLNVN LAKTNDAIKD
1390 1400 1410 1420 1430 1440
MSCVDLIMDG HNISLMDLLS SLVMTVEIRK QNMSEFMELW SQGISDDDND SAVAEFFQSF
1450 1460 1470 1480 1490 1500
PSGEPSNWKL SSFFQSVTNH KWVAVGAAVG ILGVLVGGWF VYKHFSRKEE EPIPAEGVYH
1510 1520 1530 1540 1550 1560
GVTKPKQVIK LDADPVESQS TLEIAGLVRK NLVQFGVGEK NGCVRWVMNA LGVKDDWLLV
1570 1580 1590 1600 1610 1620
PSHAYKFEKD YEMMEFYFNR GGTYYSISAG NVVIQSLDVG FQDVVLMKVP TIPKFRDITQ
1630 1640 1650 1660 1670 1680
HFIKKGDVPR ALNRLATLVT TVNGTPMLIS EGPLKMEEKA TYVHKKNDGT TVDLTVDQAW
1690 1700 1710 1720 1730 1740
RGKGEGLPGM CGGALVSSNQ SIQNAILGIH VAGGNSILVA KLVTQEMFQN IDKKIESQRI
1750 1760 1770 1780 1790 1800
MKVEFTQCSM NVVSKTLFRK SPIHHHIDKT MINFPAAMPF SKAEIDPMAM MLSKYSLPIV
1810 1820 1830 1840 1850 1860
EEPEDYKEAS VFYQNKIVGK TQLVDDFLDL DMAITGAPGI DAINMDSSPG FPYVQEKLTK
1870 1880 1890 1900 1910 1920
RDLIWLDENG LLLGVHPRLA QRILFNTVMM ENCSDLDVVF TTCPKDELRP LEKVLESKTR
1930 1940 1950 1960 1970 1980
AIDACPLDYT ILCRMYWGPA ISYFHLNPGF HTGVAIGIDP DRQWDELFKT MIRFGDVGLD
1990 2000 2010 2020 2030 2040
LDFSAFDASL SPFMIREAGR IMSELSGTPS HFGTALINTI IYSKHLLYNC CYHVCGSMPS
2050 2060 2070 2080 2090 2100
GSPCTALLNS IINNINLYYV FSKIFGKSPV FFCQALRILC YGDDVLIVFS RDVQIDNLDL
2110 2120 2130 2140 2150 2160
IGQKIVDEFK KLGMTATSAD KNVPQLKPVS ELTFLKRSFN LVEDRIRPAI SEKTIWSLMA
2170 2180 2190 2200 2210 2220
WQRSNAEFEQ NLENAQWFAF MHGYEFYQKF YYFVQSCLEK EMIEYRLKSY DWWRMRFYDQ
CFICDLS
|
P06441 in FASTA format |
|