ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!
Search for

UniProtKB/Swiss-Prot entry P26663


[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] [Features] [Sequence] [Tools]

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents.
Entry information
Entry name POLG_HCVBK
Primary accession number P26663
Secondary accession numbers None
Integrated into Swiss-Prot on August 1, 1992
Sequence was last modified on January 23, 2007 (Sequence version 3)
Annotations were last modified on    June 16, 2009 (Entry version 110)
Name and origin of the protein
Protein name Genome polyprotein
Synonyms None
Contains Core protein p21
     (Capsid protein C)
     (p21)
Core protein p19
Envelope glycoprotein E1
     (gp32)
     (gp35)
Envelope glycoprotein E2
     (NS1)
     (gp68)
     (gp70)
p7
Protease NS2-3
     (p23)
     (EC 3.4.22.-)
Serine protease/NTPase/helicase NS3
     (EC 3.4.21.98)
     (EC 3.6.1.15)
     (EC 3.6.1.-)
     (Hepacivirin)
     (NS3P)
     (p70)
Non-structural protein 4A
     (NS4A)
     (p8)
Non-structural protein 4B
     (NS4B)
     (p27)
Non-structural protein 5A
     (NS5A)
     (p56)
RNA-directed RNA polymerase
     (EC 2.7.7.48)
     (NS5B)
     (p68)
Gene name None
From
Hepatitis C virus genotype 1b (isolate BK) (HCV) [TaxID: 11105] 
Taxonomy Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; Hepacivirus.
Virus host Homo sapiens (Human) [TaxID: 9606]
Protein existence 1: Evidence at protein level;
References
[1]
NUCLEOTIDE SEQUENCE [GENOMIC RNA].
PubMed=1847440 [NCBI, ExPASy, EBI, Israel, Japan]
Takamizawa A., Mori C., Fuke I., Manabe S., Murakami S., Fujita J., Onishi E., Andoh T., Yoshida I., Okayama H.;
"Structure and organization of the hepatitis C virus genome isolated from human carriers.";
J. Virol. 65:1105-1113(1991).
[2]
PROTEIN SEQUENCE OF 1487-1500.
DOI=10.1111/j.1432-1033.1996.0611p.x; PubMed=8647104 [NCBI, ExPASy, EBI, Israel, Japan]
Borowski P., Heiland M., Oehlmann K., Becker B., Korneteky L.;
"Non-structural protein 3 of hepatitis C virus inhibits phosphorylation mediated by cAMP-dependent protein kinase.";
Eur. J. Biochem. 237:611-618(1996).
[3]
SUBCELLULAR LOCATION, AND RNA BINDING ACTIVITY OF CORE PROTEIN.
PubMed=8189501 [NCBI, ExPASy, EBI, Israel, Japan]
Santolini E., Migliaccio G., La Monica N.;
"Biosynthesis and biochemical properties of the hepatitis C virus core protein.";
J. Virol. 68:3631-3641(1994).
[4]
CHARACTERIZATION OF PROTEASE NS2-3.
PubMed=9261354 [NCBI, ExPASy, EBI, Israel, Japan]
Pieroni L., Santolini E., Fipaldini C., Pacini L., Migliaccio G., La Monica N.;
"In vitro study of the NS2-3 protease of hepatitis C virus.";
J. Virol. 71:6373-6380(1997).
[5]
FUNCTION OF NS5A.
PubMed=9710605 [NCBI, ExPASy, EBI, Israel, Japan]
Gale M.J. Jr., Blakely C.M., Kwieciszewski B., Tan S.-L., Dossett M., Tang N.M., Korth M.J., Polyak S.J., Gretch D.R., Katze M.G.;
"Control of PKR protein kinase by hepatitis C virus nonstructural 5A protein: molecular mechanisms of kinase regulation.";
Mol. Cell. Biol. 18:5208-5218(1998).
[6]
INTERACTION OF NS5A WITH HUMAN GRB2, AND MUTAGENESIS OF PRO-2322; PRO-2323 AND PRO-2326.
DOI=10.1073/pnas.96.10.5533; PubMed=10318918 [NCBI, ExPASy, EBI, Israel, Japan]
Tan S.-L., Nakao H., He Y., Vijaysri S., Neddermann P., Jacobs B.L., Mayer B.J., Katze M.G.;
"NS5A, a nonstructural protein of hepatitis C virus, binds growth factor receptor-bound protein 2 adaptor protein in a Src homology 3 domain/ligand-dependent manner and perturbs mitogenic signaling.";
Proc. Natl. Acad. Sci. U.S.A. 96:5533-5538(1999).
[7]
MUTAGENESIS OF SER-2194.
DOI=10.1006/viro.2000.0662; PubMed=11118372 [NCBI, ExPASy, EBI, Israel, Japan]
Katze M.G., Kwieciszewski B., Goodlett D.R., Blakely C.M., Neddermann P., Tan S.-L., Aebersold R.;
"Ser(2194) is a highly conserved major phosphorylation site of the hepatitis C virus nonstructural protein NS5A.";
Virology 278:501-513(2000).
[8]
CHARACTERIZATION OF PROTEASE NS2-3.
DOI=10.1074/jbc.M108266200; PubMed=11591719 [NCBI, ExPASy, EBI, Israel, Japan]
Thibeault D., Maurice R., Pilote L., Lamarre D., Pause A.;
"In vitro characterization of a purified NS2/3 protease variant of hepatitis C virus.";
J. Biol. Chem. 276:46678-46684(2001).
[9]
INTERACTION OF NS5A WITH HUMAN PIK3R1.
DOI=10.1128/JVI.76.18.9207-9217.2002; PubMed=12186904 [NCBI, ExPASy, EBI, Israel, Japan]
He Y., Nakao H., Tan S.-L., Polyak S.J., Neddermann P., Vijaysri S., Jacobs B.L., Katze M.G.;
"Subversion of cell signaling pathways by hepatitis C virus nonstructural 5A protein via interaction with Grb2 and P85 phosphatidylinositol 3-kinase.";
J. Virol. 76:9207-9217(2002).
[10]
DOMAINS CD81-BINDING AND HVR2.
DOI=10.1086/368221; PubMed=12660945 [NCBI, ExPASy, EBI, Israel, Japan]
Hofmann W.P., Sarrazin C., Kronenberger B., Schonberger B., Bruch K., Zeuzem S.;
"Mutations within the CD81-binding sites and hypervariable region 2 of the envelope 2 protein: correlation with treatment response in hepatitis C virus-infected patients.";
J. Infect. Dis. 187:982-987(2003).
[11]
PHOSPHORYLATION OF NS5A.
DOI=10.1128/JVI.78.7.3502-3513.2004; PubMed=15016873 [NCBI, ExPASy, EBI, Israel, Japan]
Coito C., Diamond D.L., Neddermann P., Korth M.J., Katze M.G.;
"High-throughput screening of the yeast kinome: identification of human serine/threonine protein kinases that phosphorylate the hepatitis C virus NS5A protein.";
J. Virol. 78:3502-3513(2004).
[12]
DOMAINS ISDR AND V3 REGIONS.
DOI=10.1002/jmv.20144; PubMed=15258967 [NCBI, ExPASy, EBI, Israel, Japan]
Vuillermoz I., Khattab E., Sablon E., Ottevaere I., Durantel D., Vieux C., Trepo C., Zoulim F.;
"Genetic variability of hepatitis C virus in chronically infected patients with viral breakthrough during interferon-ribavirin therapy.";
J. Med. Virol. 74:41-53(2004).
[13]
INTERACTION OF NS5A WITH CELLULAR PROTEINS.
PubMed=15607035 [NCBI, ExPASy, EBI, Israel, Japan]
Ahn J., Chung K.-S., Kim D.-U., Won M., Kim L., Kim K.-S., Nam M., Choi S.-J., Kim H.-C., Yoon M., Chae S.-K., Hoe K.-L.;
"Systematic identification of hepatocellular proteins interacting with NS5A of the hepatitis C virus.";
J. Biochem. Mol. Biol. 37:741-748(2004).
[14]
INTERACTION OF NS5A WITH HUMAN BIN1, AND FUNCTION OF NS5A.
DOI=10.1053/j.gastro.2005.12.030; PubMed=16530520 [NCBI, ExPASy, EBI, Israel, Japan]
Nanda S.K., Herion D., Liang T.J.;
"The SH3 binding motif of HCV NS5A protein interacts with Bin1 and is important for apoptosis and infectivity.";
Gastroenterology 130:794-809(2006).
[15]
SUBCELLULAR LOCATION OF CORE PROTEIN, AND FUNCTION OF CORE PROTEIN.
DOI=10.1016/j.jhep.2006.10.019; PubMed=17188392 [NCBI, ExPASy, EBI, Israel, Japan]
Jackel-Cram C., Babiuk L.A., Liu Q.;
"Up-regulation of fatty acid synthase promoter by hepatitis C virus core protein: genotype-3a core has a stronger effect than genotype-1b core.";
J. Hepatol. 46:999-1008(2007).
[16]
REVIEW.
DOI=10.1046/j.1365-2893.2000.00201.x; PubMed=10718937 [NCBI, ExPASy, EBI, Israel, Japan]
McLauchlan J.;
"Properties of the hepatitis C virus core protein: a structural protein that modulates cellular processes.";
J. Viral Hepat. 7:2-14(2000).
[17]
REVIEW, AND SUBCELLULAR LOCATION.
DOI=10.1002/hep.20032; PubMed=14752815 [NCBI, ExPASy, EBI, Israel, Japan]
Penin F., Dubuisson J., Rey F.A., Moradpour D., Pawlotsky J.-M.;
"Structural biology of hepatitis C virus.";
Hepatology 39:5-19(2004).
[18]
X-RAY CRYSTALLOGRAPHY (2.4 ANGSTROMS) OF 1027-1215.
DOI=10.1016/S0092-8674(00)81350-1; PubMed=8861916 [NCBI, ExPASy, EBI, Israel, Japan]
Love R.A., Parge H.E., Wickersham J.A., Hostomsky Z., Habuka N., Moomaw E.W., Adachi T., Hostomska Z.;
"The crystal structure of hepatitis C virus NS3 proteinase reveals a trypsin-like fold and a structural zinc binding site.";
Cell 87:331-342(1996).
[19]
X-RAY CRYSTALLOGRAPHY (2.2 ANGSTROMS) OF 1027-1206 AND 1678-1691.
PubMed=9568891 [NCBI, ExPASy, EBI, Israel, Japan]
Yan Y., Li Y., Munshi S., Sardana V., Cole J.L., Sardana M., Steinkuehler C., Tomei L., de Francesco R., Kuo L.C., Chen Z.;
"Complex of NS3 protease and NS4A peptide of BK strain hepatitis C virus: a 2.2-A resolution structure in a hexagonal crystal form.";
Protein Sci. 7:837-847(1998).
[20]
X-RAY CRYSTALLOGRAPHY (2.3 ANGSTROMS) OF 1216-1650.
DOI=10.1074/jbc.273.24.15045; PubMed=9614113 [NCBI, ExPASy, EBI, Israel, Japan]
Cho H.-S., Ha N.-C., Kang L.-W., Chung K.M., Back S.H., Jang S.K., Oh B.-H.;
"Crystal structure of RNA helicase from genotype 1b hepatitis C virus. A feasible mechanism of unwinding duplex RNA.";
J. Biol. Chem. 273:15045-15052(1998).
[21]
X-RAY CRYSTALLOGRAPHY (2.5 ANGSTROMS) OF 1013-1657.
DOI=10.1016/S0969-2126(00)80025-8; PubMed=10574797 [NCBI, ExPASy, EBI, Israel, Japan]
Yao N., Reichert P., Taremi S.S., Prosise W.W., Weber P.C.;
"Molecular views of viral polyprotein processing revealed by the crystal structure of the hepatitis C virus bifunctional protease-helicase.";
Structure 7:1353-1363(1999).
[22]
STRUCTURE BY NMR OF 1027-1206.
DOI=10.1006/jmbi.1999.2745; PubMed=10366511 [NCBI, ExPASy, EBI, Israel, Japan]
Barbato G., Cicero D.O., Nardi M.C., Steinkuehler C., Cortese R., De Francesco R., Bazzo R.;
"The solution structure of the N-terminal proteinase domain of the hepatitis C virus (HCV) NS3 protein provides new insights into its activation and catalytic mechanism.";
J. Mol. Biol. 289:371-384(1999).
[23]
X-RAY CRYSTALLOGRAPHY (2.8 ANGSTROMS) OF 2420-2950.
DOI=10.1073/pnas.96.23.13034; PubMed=10557268 [NCBI, ExPASy, EBI, Israel, Japan]
Bressanelli S., Tomei L., Roussel A., Incitti I., Vitale R.L., Mathieu M., De Francesco R., Rey F.A.;
"Crystal structure of the RNA-dependent RNA polymerase of hepatitis C virus.";
Proc. Natl. Acad. Sci. U.S.A. 96:13034-13039(1999).
[24]
X-RAY CRYSTALLOGRAPHY (1.9 ANGSTROMS) OF 2414-2989.
DOI=10.1038/13305; PubMed=10504728 [NCBI, ExPASy, EBI, Israel, Japan]
Lesburg C.A., Cable M.B., Ferrari E., Hong Z., Mannarino A.F., Weber P.C.;
"Crystal structure of the RNA-dependent RNA polymerase from hepatitis C virus reveals a fully encircled active site.";
Nat. Struct. Biol. 6:937-943(1999).
[25]
X-RAY CRYSTALLOGRAPHY (2.5 ANGSTROMS) OF 2420-2999.
DOI=10.1016/S0969-2126(00)80031-3; PubMed=10574802 [NCBI, ExPASy, EBI, Israel, Japan]
Ago H., Adachi T., Yoshida A., Yamamoto M., Habuka N., Yatsunami K., Miyano M.;
"Crystal structure of the RNA-dependent RNA polymerase of hepatitis C virus.";
Structure 7:1417-1426(1999).
[26]
X-RAY CRYSTALLOGRAPHY (1.7 ANGSTROMS) OF 2420-2955.
DOI=10.1128/JVI.76.7.3482-3492.2002; PubMed=11884572 [NCBI, ExPASy, EBI, Israel, Japan]
Bressanelli S., Tomei L., Rey F.A., De Francesco R.;
"Structural analysis of the hepatitis C virus RNA polymerase in complex with ribonucleotides.";
J. Virol. 76:3482-3492(2002).
[27]
X-RAY CRYSTALLOGRAPHY (2.0 ANGSTROMS) OF 2420-2989 IN COMPLEX WITH A NON-NUCLEOSIDE INHIBITOR.
DOI=10.1074/jbc.M209397200; PubMed=12509436 [NCBI, ExPASy, EBI, Israel, Japan]
Wang M., Ng K.K.-S., Cherney M.M., Chan L., Yannopoulos C.G., Bedard J., Morin N., Nguyen-Ba N., Alaoui-Ismaili M.H., Bethell R.C., James M.N.G.;
"Non-nucleoside analogue inhibitors bind to an allosteric site on HCV NS5B polymerase. Crystal structures and mechanism of inhibition.";
J. Biol. Chem. 278:9489-9495(2003).
[28]
X-RAY CRYSTALLOGRAPHY (2.2 ANGSTROMS) OF 2420-2995 IN COMPLEX WITH AN INHIBITOR.
DOI=10.1128/JVI.77.13.7575-7581.2003; PubMed=12805457 [NCBI, ExPASy, EBI, Israel, Japan]
Love R.A., Parge H.E., Yu X., Hickey M.J., Diehl W., Gao J., Wriggers H., Ekker A., Wang L., Thomson J.A., Dragovich P.S., Fuhrman S.A.;
"Crystallographic identification of a noncompetitive inhibitor binding site on the hepatitis C virus NS5B RNA polymerase enzyme.";
J. Virol. 77:7575-7581(2003).
Comments
  • FUNCTION: Core protein packages viral RNA to form a viral nucleocapsid, and promotes virion budding. Modulates viral translation initiation by interacting with HCV IRES and 40S ribosomal subunit. Also regulates many host cellular functions such as signaling pathways and apoptosis. Prevents the establishment of cellular antiviral state by blocking the interferon-alpha/beta (IFN-alpha/beta) and IFN-gamma signaling pathways and by inducing human STAT1 degradation. Thought to play a role in virus-mediated cell transformation leading to hepatocellular carcinomas. Interacts with, and activates STAT3 leading to cellular transformation. May repress the promoter of p53, and sequester CREB3 and SP110 isoform 3/Sp110b in the cytoplasm. Also represses cell cycle negative regulating factor CDKN1A, thereby interrupting an important check point of normal cell cycle regulation. Targets transcription factors involved in the regulation of inflammatory responses and in the immune response: suppresses NK-kappaB activation, and activates AP-1. Could mediate apoptotic pathways through association with TNF-type receptors TNFRSF1A and LTBR, although its effect on death receptor-induced apoptosis remains controversial. Enhances TRAIL mediated apoptosis, suggesting that it might play a role in immune-mediated liver cell injury. Seric core protein is able to bind C1QR1 at the T-cell surface, resulting in down-regulation of T-lymphocytes proliferation. May transactivate human MYC, Rous sarcoma virus LTR, and SV40 promoters. May suppress the human FOS and HIV-1 LTR activity. Alters lipid metabolism by interacting with hepatocellular proteins involved in lipid accumulation and storage. Core protein induces up-regulation of FAS promoter activity, and thereby probably contributes to the increased triglyceride accumulation in hepatocytes (steatosis) (By similarity).
  • FUNCTION: E1 and E2 glycoproteins form a heterodimer that is involved in virus attachment to the host cell, virion internalization through clathrin-dependent endocytosis and fusion with host membrane. E1/E2 heterodimer binds to human LDLR, CD81 and SCARB1/SR-BI receptors, but this binding is not sufficient for infection, some additional liver specific cofactors may be needed. The fusion function may possibly be carried by E1. E2 inhibits human EIF2AK2/PKR activation, preventing the establishment of an antiviral state. E2 is a viral ligand for CD209/DC-SIGN and CLEC4M/DC-SIGNR, which are respectively found on dendritic cells (DCs), and on liver sinusoidal endothelial cells and macrophage-like cells of lymph node sinuses. These interactions allow capture of circulating HCV particles by these cells and subsequent transmission to permissive cells. DCs act as sentinels in various tissues where they entrap pathogens and convey them to local lymphoid tissue or lymph node for establishment of immunity. Capture of circulating HCV particles by these SIGN+ cells may facilitate virus infection of proximal hepatocytes and lymphocyte subpopulations and may be essential for the establishment of persistent infection (By similarity).
  • FUNCTION: P7 seems to be a heptameric ion channel protein (viroporin) and is inhibited by the antiviral drug amantadine. Also inhibited by long-alkyl-chain iminosugar derivatives. Essential for infectivity (By similarity).
  • FUNCTION: Protease NS2-3 is a cysteine protease responsible for the autocatalytic cleavage of NS2-NS3. Seems to undergo self-inactivation following maturation (By similarity).
  • FUNCTION: NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase. NS3 serine protease, in association with NS4A, is responsible for the cleavages of NS3-NS4A, NS4A-NS4B, NS4B-NS5A and NS5A-NS5B. NS3/NS4A complex also prevents phosphorylation of human IRF3, thus preventing the establishment of dsRNA induced antiviral state. NS3 RNA helicase binds to RNA and unwinds dsRNA in the 3' to 5' direction, and likely RNA stable secondary structure in the template strand. Cleaves and inhibits the host antiviral protein MAVS (By similarity).
  • FUNCTION: NS4B induces a specific membrane alteration that serves as a scaffold for the virus replication complex. This membrane alteration gives rise to the so-called ER-derived membranous web that contains the replication complex (By similarity).
  • FUNCTION: NS5A is a component of the replication complex involved in RNA-binding. Its interaction with Human VAPB may target the viral replication complex to vesicles. Down-regulates viral IRES translation initiation. Mediates interferon resistance, presumably by interacting with and inhibiting human EIF2AK2/PKR. Seems to inhibit apoptosis by interacting with BIN1 and FKBP8. The hyperphosphorylated form of NS5A is an inhibitor of viral replication (By similarity).
  • FUNCTION: NS5B is a RNA-dependent RNA polymerase that plays an essential role in the virus replication (By similarity).
  • CATALYTIC ACTIVITY: Hydrolysis of four peptide bonds in the viral precursor polyprotein, commonly with Asp or Glu in the P6 position, Cys or Thr in P1 and Ser or Ala in P1'.
  • CATALYTIC ACTIVITY: Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).
  • CATALYTIC ACTIVITY: NTP + H2O = NDP + phosphate.
  • COFACTOR: Binds 1 zinc ion per NS3 protease domain (By similarity).
  • COFACTOR: Binds 1 zinc ion per NS5A N-terminal domain (By similarity).
  • ENZYME REGULATION: Activity of auto-protease NS2-3 is dependent on zinc ions and completely inhibited by EDTA, 1,10-phenanthroline, iodocetamide and N-ethylmaleimide. According to PubMed:9261354, completely inhibited by the serine protease inhibitors TLCK and TPCK. According to PubMed:8189501, almost completely inhibited by TPCK and slightly inhibited by TLCK. Not inhibited by antipain, aprotinin, E64, PMSF and pepstatin. Also inhibited by NS2-3 and NS4A derived peptides. Serine protease NS3 is also activated by zinc ions.
  • SUBUNIT: Core protein is a homomultimer that binds the C-terminal part of E1 and interacts with numerous cellular proteins. Interaction with human STAT1 SH2 domain seems to result in decreased STAT1 phosphorylation, leading to decreased IFN-stimulated gene transcription. In addition to blocking the formation of phosphorylated STAT1, the core protein also promotes ubiquitin-mediated proteasome-dependent degradation of STAT1. Interacts with, and constitutively activates human STAT3. Associates with human LTBR and TNFRSF1A receptors and possibly induces apoptosis. Binds to human SP110 isoform 3/Sp110b, HNRPK, C1QR1, YWHAE, UBE3A/E6AP, DDX3X, APOA2 and RXRA proteins. Interacts with human CREB3 nuclear transcription protein, triggering cell transformation. May interact with human p53. Also binds human cytokeratins KRT8, KRT18, KRT19 and VIM (vimentin). E1 and E2 glycoproteins form a heterodimer that binds to human LDLR, CLDN1, CD81 and SCARB1 receptors. E2 binds and inhibits human EIF2AK2/PKR. Also binds human CD209/DC-SIGN and CLEC4M/DC-SIGNR. p7 forms a homoheptamer in vitro. NS2 forms a homodimer containing a pair of composite active sites at the dimerization interface. NS2 seems to interact with all other non-structural (NS) proteins. NS4A interacts with NS3 serine protease and stabilizes its folding. NS3-NS4A complex is essential for the activation of the latter and allows membrane anchorage of NS3. NS3 interacts with human TANK-binding kinase TBK1 and MAVS. NS4B and NS5A form homodimers and seem to interact with all other non-structural (NS) proteins. NS5A also interacts with human EIF2AK2/PKR, FKBP8, GRB2, BIN1, PIK3R1, SRCAP, VAPB and with most Src-family kinases. NS5B is a homooligomer and interacts with human VAPB (By similarity).
  • SUBCELLULAR LOCATION: Core protein p21: Host endoplasmic reticulum membrane; Single-pass membrane protein. Host mitochondrion membrane; Single-pass type I membrane protein. Host lipid droplet membrane; Single-pass membrane protein. Note=The C-terminal transmembrane domain of core protein p21 contains an ER signal leading the nascent polyprotein to the ER membrane. Only a minor proportion of core protein is present in the nucleus and an unknown proportion is secreted.
  • SUBCELLULAR LOCATION: Core protein p19: Virion (By similarity). Host cytoplasm (By similarity). Host nucleus (By similarity). Secreted (By similarity).
  • SUBCELLULAR LOCATION: Envelope glycoprotein E1: Virion membrane; Single-pass type I membrane protein (Potential). Host endoplasmic reticulum membrane; Single-pass type I membrane protein (By similarity). Note=The C-terminal transmembrane domain acts as a signal sequence and forms a hairpin structure before cleavage by host signal peptidase. After cleavage, the membrane sequence is retained at the C-terminus of the protein, serving as ER membrane anchor. A reorientation of the second hydrophobic stretch occurs after cleavage producing a single reoriented transmembrane domain. These events explain the final topology of the protein. ER retention of E1 is leaky and, in overexpression conditions, only a small fraction reaches the plasma membrane.
  • SUBCELLULAR LOCATION: Envelope glycoprotein E2: Virion membrane; Single-pass type I membrane protein (Potential). Host endoplasmic reticulum membrane; Single-pass type I membrane protein (By similarity). Note=The C-terminal transmembrane domain acts as a signal sequence and forms a hairpin structure before cleavage by host signal peptidase. After cleavage, the membrane sequence is retained at the C-terminus of the protein, serving as ER membrane anchor. A reorientation of the second hydrophobic stretch occurs after cleavage producing a single reoriented transmembrane domain. These events explain the final topology of the protein. ER retention of E2 is leaky and, in overexpression conditions, only a small fraction reaches the plasma membrane.
  • SUBCELLULAR LOCATION: p7: Host endoplasmic reticulum membrane; Multi-pass membrane protein (By similarity). Host cell membrane (By similarity). Note=The C-terminus of p7 membrane domain acts as a signal sequence. After cleavage by host signal peptidase, the membrane sequence is retained at the C-terminus of the protein, serving as ER membrane anchor. Only a fraction localizes to the plasma membrane.
  • SUBCELLULAR LOCATION: Protease NS2-3: Host endoplasmic reticulum membrane; Multi-pass membrane protein (Potential).
  • SUBCELLULAR LOCATION: Serine protease/NTPase/helicase NS3: Host endoplasmic reticulum membrane; Peripheral membrane protein (By similarity). Note=NS3 is associated to the ER membrane through its binding to NS4A.
  • SUBCELLULAR LOCATION: Non-structural protein 4A: Host endoplasmic reticulum membrane; Single-pass type I membrane protein (Potential). Note=Host membrane insertion occurs after processing by the NS3 protease.
  • SUBCELLULAR LOCATION: Non-structural protein 4B: Host endoplasmic reticulum membrane; Multi-pass membrane protein (By similarity).
  • SUBCELLULAR LOCATION: Non-structural protein 5A: Host endoplasmic reticulum membrane; Peripheral membrane protein (By similarity). Host cytoplasm, host perinuclear region (By similarity). Host mitochondrion (By similarity). Note=Host membrane insertion occurs after processing by the NS3 protease.
  • SUBCELLULAR LOCATION: RNA-directed RNA polymerase: Host endoplasmic reticulum membrane; Single-pass type I membrane protein (Potential). Note=Host membrane insertion occurs after processing by the NS3 protease.
  • DOMAIN: The transmembrane regions of envelope E1 and E2 glycoproteins are involved in heterodimer formation, ER localization, and assembly of these proteins. Envelope E2 glycoprotein contain two highly variable regions called hypervariable region 1 and 2 (HVR1 and HVR2). E2 also contain two segments involved in CD81-binding. HVR1 is implicated in the SCARB1-mediated cell entry. HVR2 and CD81-binding regions may be involved in sensitivity and/or resistance to IFN-alpha therapy (By similarity).
  • DOMAIN: The N-terminus of NS5A acts as membrane anchor. The central part of NS5A contains a variable region called interferon sensitivity determining region (ISDR) and seems to be intrinsically disordered and interacts with NS5B and host PKR (By similarity). The C-terminus of NS5A contains a variable region called variable region 3 (V3). ISDR and V3 may be involved in sensitivity and/or resistance to IFN-alpha therapy.
  • DOMAIN: The SH3-binding domain of NS5A is involved in the interaction with human Bin1, GRB2 and Src-family kinases.
  • DOMAIN: The N-terminal one-third of serine protease NS3 contains the protease activity. This region contains a zinc atom that does not belong to the active site, but may play a structural rather than a catalytic role. This region is essential for the activity of protease NS2-3, maybe by contributing to the folding of the latter. The helicase activity is located in the C-terminus of NS3.
  • PTM: Specific enzymatic cleavages in vivo yield mature proteins. The structural proteins, core, E1, E2 and p7 are produced by proteolytic processing by host signal peptidases. The core protein is synthesized as a 21 kDa precursor which is retained in the ER membrane through the hydrophobic signal peptide. Cleavage by the signal peptidase releases the 19 kDa mature core protein. The other proteins (p7, NS2-3, NS3, NS4A, NS4B, NS5A and NS5B) are cleaved by the viral proteases (By similarity).
  • PTM: Envelope E1 and E2 glycoproteins are highly N-glycosylated (By similarity).
  • PTM: Core protein is phosphorylated by host PKC and PKA (By similarity).
  • PTM: NS5A is phosphorylated in a basal form termed p56. p58 is an hyperphosphorylated form of p56. p56 and p58 coexist in the cell in roughly equivalent amounts. Hyperphosphorylation is dependent on the presence of NS4A. Human AKT1, RPS6KB1/p70S6K, MAP2K1/MEK1, MAP2K6/MKK6 and CSNK1A1/CKI-alpha kinases may be responsible for NS5A phosphorylation.
  • PTM: NS4B is palmitoylated. This modification may play a role in its polymerization or in protein-protein interactions (By similarity).
  • PTM: The N-terminus of a fraction of NS4B molecules seems to be relocated post-translationally from the cytoplasm to the ER lumen, with a 5th transmembrane segment. The C-terminus of NS2 may be lumenal with a fourth transmembrane segment (By similarity).
  • PTM: Core protein is ubiquitinated; mediated by UBE3A and leading to core protein subsequent proteasomal degradation (By similarity).
  • MISCELLANEOUS: Cell culture adaptation of the virus leads to mutations in NS5A, reducing its inhibitory effect on replication (By similarity).
  • MISCELLANEOUS: Core protein exerts viral interference on hepatitis B virus when HCV and HBV coinfect the same cell, by suppressing HBV gene expression, RNA encapsidation and budding (By similarity).
  • SIMILARITY: Belongs to the hepaciviruses polyprotein family.
  • SIMILARITY: Contains 1 helicase ATP-binding domain.
  • SIMILARITY: Contains 1 peptidase C18 domain [view classification].
  • SIMILARITY: Contains 1 peptidase S29 domain [view classification].
  • SIMILARITY: Contains 1 RdRp catalytic domain.
  • CAUTION: The core gene probably also codes for alternative reading frame proteins (ARFPs). Many functions depicted for the core protein might belong to the ARFPs.
  • WEB RESOURCE: Name=euHCVdb; Note=The European HCV database; URL="http://euhcvdb.ibcp.fr";.
  • WEB RESOURCE: Name=HCV database; URL="http://www.hcvdb.org/";.
Copyright
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms. Distributed under the Creative Commons Attribution-NoDerivs License.
Cross-references
Sequence databases
EMBL
M58335; AAA72945.1; -; Genomic_RNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
PIR A38465; GNWVTC.
3D structure databases
PDB
1A1Q; X-ray; 2.40 A; A/B/C=1027-1215.[ExPASy / RCSB / EBI]
1BT7; NMR; -; A=1027-1206.[ExPASy / RCSB / EBI]
1C2P; X-ray; 1.90 A; A/B=2422-2989.[ExPASy / RCSB / EBI]
1CSJ; X-ray; 2.80 A; A/B=2420-2950.[ExPASy / RCSB / EBI]
1CU1; X-ray; 2.50 A; A/B=1029-1657.[ExPASy / RCSB / EBI]
1GX5; X-ray; 1.70 A; A=2420-2955.[ExPASy / RCSB / EBI]
1GX6; X-ray; 1.85 A; A=2420-2950.[ExPASy / RCSB / EBI]
1JXP; X-ray; 2.20 A; A/B=1027-1206, C/D=1678-1691.[ExPASy / RCSB / EBI]
1NHU; X-ray; 2.00 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
1NHV; X-ray; 2.90 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
1NS3; X-ray; 2.80 A; A/B=1029-1206, C/D=1678-1689.[ExPASy / RCSB / EBI]
1OS5; X-ray; 2.20 A; A=2420-2989.[ExPASy / RCSB / EBI]
1QUV; X-ray; 2.50 A; A=2420-2989.[ExPASy / RCSB / EBI]
2AWZ; X-ray; 2.15 A; A/B=2420-2981.[ExPASy / RCSB / EBI]
2AX0; X-ray; 2.00 A; A/B=2420-2981.[ExPASy / RCSB / EBI]
2AX1; X-ray; 2.10 A; A/B=2420-2981.[ExPASy / RCSB / EBI]
2BRK; X-ray; 2.30 A; A=2420-2955.[ExPASy / RCSB / EBI]
2BRL; X-ray; 2.40 A; A=2420-2955.[ExPASy / RCSB / EBI]
2DXS; X-ray; 2.20 A; A/B=2420-2963.[ExPASy / RCSB / EBI]
2GIQ; X-ray; 1.65 A; A/B=2421-2981.[ExPASy / RCSB / EBI]
2GIR; X-ray; 1.90 A; A/B=2421-2981.[ExPASy / RCSB / EBI]
2HAI; X-ray; 1.58 A; A=2420-2988.[ExPASy / RCSB / EBI]
2HWH; X-ray; 2.30 A; A/B=2422-2989.[ExPASy / RCSB / EBI]
2HWI; X-ray; 2.00 A; A/B=2422-2989.[ExPASy / RCSB / EBI]
2I1R; X-ray; 2.20 A; A/B=2422-2989.[ExPASy / RCSB / EBI]
2JC0; X-ray; 2.20 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
2JC1; X-ray; 2.00 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
2O5D; X-ray; 2.20 A; A/B=2422-2989.[ExPASy / RCSB / EBI]
2ZKU; X-ray; 1.95 A; A/B/C/D=2420-2989.[ExPASy / RCSB / EBI]
3BR9; X-ray; 2.30 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
3BSA; X-ray; 2.30 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
3BSC; X-ray; 2.65 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
3CDE; X-ray; 2.10 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
3CIZ; X-ray; 1.87 A; A/B=2421-2989.[ExPASy / RCSB / EBI]
3CJ0; X-ray; 1.90 A; A/B=2421-2989.[ExPASy / RCSB / EBI]
3CJ2; X-ray; 1.75 A; A/B=2421-2989.[ExPASy / RCSB / EBI]
3CJ3; X-ray; 1.87 A; A/B=2421-2989.[ExPASy / RCSB / EBI]
3CJ4; X-ray; 2.07 A; A/B=2421-2989.[ExPASy / RCSB / EBI]
3CJ5; X-ray; 1.92 A; A/B=2421-2989.[ExPASy / RCSB / EBI]
3CO9; X-ray; 2.10 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
3CVK; X-ray; 2.31 A; A/B=2420-2989.[ExPASy / RCSB / EBI]
3FQK; X-ray; 2.20 A; A/B=2421-2989.[ExPASy / RCSB / EBI]
3FRZ; X-ray; 1.86 A; A=2420-2989.[ExPASy / RCSB / EBI]
8OHM; X-ray; 2.30 A; A=1216-1650.[ExPASy / RCSB / EBI]
Detailed list of linked structures.
PDBsum 1A1Q; -.
1BT7; -.
1C2P; -.
1CSJ; -.
1CU1; -.
1GX5; -.
1GX6; -.
1JXP; -.
1NHU; -.
1NHV; -.
1NS3; -.
1OS5; -.
1QUV; -.
2AWZ; -.
2AX0; -.
2AX1; -.
2BRK; -.
2BRL; -.
2DXS; -.
2GIQ; -.
2GIR; -.
2HAI; -.
2HWH; -.
2HWI; -.
2I1R; -.
2JC0; -.
2JC1; -.
2O5D; -.
2ZKU; -.
3BR9; -.
3BSA; -.
3BSC; -.
3CDE; -.
3CIZ; -.
3CJ0; -.
3CJ2; -.
3CJ3; -.
3CJ4; -.
3CJ5; -.
3CO9; -.
3CVK; -.
3FQK; -.
3FRZ; -.
8OHM; -.
ModBase P26663.
Organism-specific databases
euHCVdb M58335; -.
Ontologies
GO
GO:0005789; Cellular component: endoplasmic reticulum membrane (inferred from electronic annotation from InterPro).
GO:0033650; Cellular component: host cell mitochondrion (inferred from electronic annotation from UniProtKB-SubCell).
GO:0042025; Cellular component: host cell nucleus (inferred from electronic annotation from UniProtKB-SubCell).
GO:0020002; Cellular component: host cell plasma membrane (inferred from electronic annotation from UniProtKB-SubCell).
GO:0016021; Cellular component: integral to membrane (inferred from electronic annotation from UniProtKB-KW).
GO:0005811; Cellular component: lipid particle (inferred from electronic annotation from UniProtKB-KW).
GO:0005739; Cellular component: mitochondrion (inferred from electronic annotation from UniProtKB-KW).
GO:0005634; Cellular component: nucleus (inferred from electronic annotation from UniProtKB-KW).
GO:0005886; Cellular component: plasma membrane (inferred from electronic annotation from UniProtKB-KW).
GO:0030529; Cellular component: ribonucleoprotein complex (inferred from electronic annotation from UniProtKB-KW).
GO:0019028; Cellular component: viral capsid (inferred from electronic annotation from UniProtKB-KW).
GO:0019031; Cellular component: viral envelope (inferred from electronic annotation from UniProtKB-KW).
GO:0005524; Molecular function: ATP binding (inferred from electronic annotation from UniProtKB-KW).
GO:0004197; Molecular function: cysteine-type endopeptidase activity (inferred from electronic annotation from InterPro).
GO:0004386; Molecular function: helicase activity (inferred from electronic annotation from UniProtKB-KW).
GO:0003723; Molecular function: RNA binding (inferred from electronic annotation from UniProtKB-KW).
GO:0003968; Molecular function: RNA-directed RNA polymerase activity (inferred from electronic annotation from UniProtKB-KW).
GO:0004252; Molecular function: serine-type endopeptidase activity (inferred from electronic annotation from InterPro).
GO:0017124; Molecular function: SH3 domain binding (inferred from electronic annotation from UniProtKB-KW).
GO:0005198; Molecular function: structural molecule activity (inferred from electronic annotation from UniProtKB-KW).
GO:0008270; Molecular function: zinc ion binding (inferred from electronic annotation from UniProtKB-KW).
GO:0006915; Biological process: apoptosis (inferred from electronic annotation from UniProtKB-KW).
GO:0030683; Biological process: evasion by virus of host immune response (inferred from electronic annotation from UniProtKB-KW).
GO:0006508; Biological process: proteolysis (inferred from electronic annotation from InterPro).
GO:0006355; Biological process: regulation of transcription, DNA-dependent (inferred from electronic annotation from UniProtKB-KW).
GO:0006410; Biological process: transcription, RNA-dependent (inferred from electronic annotation from UniProtKB-KW).
GO:0019087; Biological process: transformation of host cell by virus (inferred from electronic annotation from InterPro).
GO:0019079; Biological process: viral genome replication (inferred from electronic annotation from InterPro).
QuickGo view.
Family and domain databases
InterPro IPR014001; DEAD-like_N.
IPR001650; DNA/RNA_helicase_C.
IPR002522; HCV_capsid.
IPR002521; HCV_core.
IPR002519; HCV_env.
IPR002531; HCV_NS1.
IPR000745; HCV_NS4a.
IPR001490; HCV_NS4b.
IPR002868; HCV_NS5a.
IPR013193; HCV_NS5a_1b.
IPR014021; Helicase_SF1/SF2_ATP-bd.
IPR002518; Pept_C18_HCV_NS2.
IPR004109; Peptidase_S29.
IPR002166; RNA_pol_HCV.
IPR007094; RNA_pol_PSvir.
IPR013192; Znf_HCV_NS5a.
Graphical view of domain structure.
Pfam PF01543; HCV_capsid; 1.
PF01542; HCV_core; 1.
PF01539; HCV_env; 1.
PF01560; HCV_NS1; 1.
PF01538; HCV_NS2; 1.
PF01006; HCV_NS4a; 1.
PF01001; HCV_NS4b; 1.
PF01506; HCV_NS5a; 1.
PF08300; HCV_NS5a_1a; 1.
PF08301; HCV_NS5a_1b; 1.
PF02907; Peptidase_S29; 1.
PF00998; RdRP_3; 1.
Pfam graphical view of domain structure.
SMART SM00487; DEXDc; 1.
SMART graphical view of domain structure.
PROSITE PS51192; HELICASE_ATP_BIND_1; 1.
PS51194; HELICASE_CTER; FALSE_NEG.
PS50507; RDRP_SSRNA_POS; 1.
PROSITE graphical view of domain structure (profiles).
Other
ProtoNet P26663.
UniRef View cluster of proteins with at least 50% / 90% / 100% identity.
Keywords
3D-structure; Acetylation; Apoptosis; ATP-binding; Capsid protein; Cell membrane; Complete proteome; Cytoplasm; Direct protein sequencing; Disulfide bond; Endoplasmic reticulum; Envelope protein; Fusion protein; Glycoprotein; Helicase; Host-virus interaction; Hydrolase; Interferon antiviral system evasion; Lipid droplet; Lipoprotein; Membrane; Metal-binding; Mitochondrion; Multifunctional enzyme; Nucleotide-binding; Nucleotidyltransferase; Nucleus; Oncogene; Palmitate; Phosphoprotein; Protease; Ribonucleoprotein; RNA replication; RNA-binding; RNA-directed RNA polymerase; Secreted; Serine protease; SH3-binding; Thiol protease; Transcription; Transcription regulation; Transferase; Transmembrane; Ubl conjugation; Viral nucleoprotein; Virion; Zinc.
Features
SEVIEWER logo Feature table viewer FT aligner logo Feature aligner
KeyFrom    To Length Description FTId
INIT_MET   1      1        Removed; by host (By similarity). 
CHAIN   2    191  190     Core protein p21 (Potential). PRO_0000037529
CHAIN   2    177  176     Core protein p19 (By similarity). PRO_0000037530
PROPEP   178    191  14     ER anchor for the core protein, removed in mature form by host signal peptidase (By similarity). PRO_0000037531
CHAIN   192    383  192     Envelope glycoprotein E1 (Potential). PRO_0000037532
CHAIN   384    746  363     Envelope glycoprotein E2 (Potential). PRO_0000037533
CHAIN   747    809  63     p7 (By similarity). PRO_0000037534
CHAIN   810   1026  217     Protease NS2-3 (Potential). PRO_0000037535
CHAIN   1027   1657  631     Serine protease/NTPase/helicase NS3. PRO_0000037536
CHAIN   1658   1711  54     Non-structural protein 4A. PRO_0000037537
CHAIN   1712   1972  261     Non-structural protein 4B. PRO_0000037538
CHAIN   1973   2419  447     Non-structural protein 5A. PRO_0000037539
CHAIN   2420   3010  591     RNA-directed RNA polymerase. PRO_0000037540
TOPO_DOM   2    168  167     Cytoplasmic (Potential). 
TRANSMEM   169    189  21     Potential. 
TOPO_DOM   190    358  169     Lumenal (Potential). 
TRANSMEM   359    379  21     Potential. 
TOPO_DOM   380    725  346     Lumenal (Potential). 
TRANSMEM   726    746  21     Potential. 
TOPO_DOM   747    757  11     Lumenal (Potential). 
TRANSMEM   758    778  21     Potential. 
TOPO_DOM   779    782  4     Cytoplasmic (Potential). 
TRANSMEM   783    803  21     Potential. 
TOPO_DOM   804    813  10     Lumenal (Potential). 
TRANSMEM   814    834  21     Potential. 
TOPO_DOM   835    881  47     Cytoplasmic (Potential). 
TRANSMEM   882    902  21     Potential. 
TOPO_DOM   903    928  26     Lumenal (Potential). 
TRANSMEM   929    949  21     Potential. 
TOPO_DOM   950   1657  708     Cytoplasmic (Potential). 
TRANSMEM   1658   1678  21     Potential. 
TOPO_DOM   1679   1805  127     Cytoplasmic (Potential). 
TRANSMEM   1806   1826  21     Potential. 
TOPO_DOM   1827   1828  2     Lumenal (Potential). 
TRANSMEM   1829   1849  21     Potential. 
TOPO_DOM   1850   1850  1     Cytoplasmic (Potential). 
TRANSMEM   1851   1871  21     Potential. 
TOPO_DOM   1872   1881  10     Lumenal (Potential). 
TRANSMEM   1882   1902  21     Potential. 
TOPO_DOM   1903   1972  70     Cytoplasmic (Potential). 
TOPO_DOM   1973   2002  30     In membrane (By similarity). 
TOPO_DOM   2003   2989  987     Cytoplasmic (Potential). 
TRANSMEM   2990   3010  21     By similarity. 
DOMAIN   1217   1369  153     Helicase ATP-binding. 
DOMAIN   2633   2751  119     RdRp catalytic. 
NP_BIND   1230   1237  8     ATP (Potential). 
REGION   2     59  58     Interaction with DDX3X (By similarity). 
REGION   2     23  22     Interaction with STAT1 (By similarity). 
REGION   122    173  52     Interaction with APOA2 (By similarity). 
REGION   148    236  89     Interaction with FKBP8. 
REGION   150    159  10     Mitochondrial targeting signal (By similarity). 
REGION   164    167  4     Important for lipid droplets localization (By similarity). 
REGION   265    296  32     Fusion peptide (Potential). 
REGION   385    411  27     HVR1. 
REGION   475    481  7     HVR2. 
REGION   482    494  13     CD81-binding 1 (Potential). 
REGION   522    553  32     CD81-binding 2 (Potential). 
REGION   660    671  12     PKR/eIF2-alpha phosphorylation homology domain (PePHD). 
REGION   1679   1690  12     NS3-binding (by NS4A) (Potential). 
REGION   2120   2332  213     Transcriptional activation (Potential). 
REGION   2120   2208  89     FKBP8-binding (Potential). 
REGION   2200   2250  51     Basal phosphorylation (By similarity). 
REGION   2210   2275  66     PKR-binding (Potential). 
REGION   2210   2249  40     ISDR. 
REGION   2249   2306  58     NS4B-binding (Potential). 
REGION   2351   2419  69     Basal phosphorylation (By similarity). 
REGION   2354   2377  24     V3. 
MOTIF   5     13  9     Nuclear localization signal (Potential). 
MOTIF   38     43  6     Nuclear localization signal (Potential). 
MOTIF   58     64  7     Nuclear localization signal (Potential). 
MOTIF   66     71  6     Nuclear localization signal (Potential). 
MOTIF   1316   1319  4     DECH box. 
MOTIF   2322   2325  4     SH3-binding (Potential). 
MOTIF   2327   2335  9     Nuclear localization signal (Potential). 
COMPBIAS   796    803  8     Poly-Leu. 
COMPBIAS   1432   1435  4     Poly-Val. 
COMPBIAS   2282   2327  46     Pro-rich. 
COMPBIAS   2995   2998  4     Poly-Leu. 
ACT_SITE   952    952        For protease NS2-3 activity; shared with dimeric partner (By similarity). 
ACT_SITE   972    972        For protease NS2-3 activity; shared with dimeric partner (By similarity). 
ACT_SITE   993    993        For protease NS2-3 activity; shared with dimeric partner (By similarity). 
ACT_SITE   1083   1083        Charge relay system; for serine protease NS3 activity (By similarity). 
ACT_SITE   1107   1107        Charge relay system; for serine protease NS3 activity (By similarity). 
ACT_SITE   1165   1165        Charge relay system; for serine protease NS3 activity (By similarity). 
METAL   1123   1123        Zinc. 
METAL   1125   1125        Zinc. 
METAL   1171   1171        Zinc. 
METAL   1175   1175        Zinc. 
METAL   2011   2011        Zinc (By similarity). 
METAL   2029   2029        Zinc (By similarity). 
METAL   2031   2031        Zinc (By similarity). 
METAL   2052   2052        Zinc (By similarity). 
SITE   177    178  2     Cleavage; by host signal peptidase (By similarity). 
SITE   191    192  2     Cleavage; by host signal peptidase (Potential). 
SITE   383    384  2     Cleavage; by host signal peptidase (Potential). 
SITE   746    747  2     Cleavage; by host signal peptidase (By similarity). 
SITE   809    810  2     Cleavage; by host signal peptidase (By similarity). 
SITE   1026   1027  2     Cleavage; by protease NS2-3 (Potential). 
SITE   1657   1658  2     Cleavage; by serine protease NS3 (Potential). 
SITE   1711   1712  2     Cleavage; by serine protease NS3 (Potential). 
SITE   1972   1973  2     Cleavage; by serine protease NS3 (Potential). 
SITE   2419   2420  2     Cleavage; by serine protease NS3 (Potential). 
MOD_RES   2      2        N-acetylserine; by host (By similarity). 
MOD_RES   53     53        Phosphoserine; by host (By similarity). 
MOD_RES   99     99        Phosphoserine; by host (By similarity). 
MOD_RES   116    116        Phosphoserine; by host PKA (By similarity). 
MOD_RES   2194   2194        Phosphoserine; by host; in p56 (Probable). 
MOD_RES   2197   2197        Phosphoserine; by host; in p58 (By similarity). 
MOD_RES   2201   2201        Phosphoserine; by host; in p58 (By similarity). 
MOD_RES   2204   2204        Phosphoserine; by host; in p58 (By similarity). 
LIPID   1968   1968        S-palmitoyl cysteine; by host (By similarity). 
LIPID   1972   1972        S-palmitoyl cysteine; by host (By similarity). 
CARBOHYD   196    196        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   209    209        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   234    234        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   250    250        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   305    305        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   417    417        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   423    423        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   430    430        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   448    448        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   532    532        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   540    540        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   556    556        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   576    576        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   623    623        N-linked (GlcNAc...); by host (Potential). 
CARBOHYD   645    645        N-linked (GlcNAc...); by host (Potential). 
DISULFID   2114   2162        By similarity. 
MUTAGEN   2194   2194        S->A: Loss of phosphorylation. 
MUTAGEN   2322   2322        P->A: Complete loss of binding to GRB2. 
MUTAGEN   2323   2323        P->A: Complete loss of binding to GRB2. 
MUTAGEN   2326   2326        P->A: Complete loss of binding to GRB2. 
STRAND   1031   1035  5      
HELIX   1039   1047  9      
STRAND   1057   1063  7      
STRAND   1068   1074  7      
STRAND   1077   1081  5      
HELIX   1082   1085  4      
STRAND   1090   1092  3      
STRAND   1095   1097  3      
STRAND   1100   1103  4      
TURN   1104   1107  4      
STRAND   1108   1112  5      
STRAND   1129   1133  5      
STRAND   1139   1144  6      
STRAND   1146   1157  12      
HELIX   1158   1161  4      
STRAND   1168   1171  4      
TURN   1172   1174  3      
STRAND   1175   1186  12      
STRAND   1189   1197  9      
HELIX   1198   1202  5      
STRAND   1224   1229  6      
STRAND   1232   1235  4      
TURN   1236   1238  3      
HELIX   1239   1245  7      
TURN   1246   1248  3      
STRAND   1251   1256  6      
HELIX   1258   1271  14      
STRAND   1277   1279  3      
STRAND   1290   1295  6      
HELIX   1296   1302  7      
STRAND   1311   1316  6      
TURN   1317   1319  3      
HELIX   1323   1335  13      
TURN   1336   1340  5      
STRAND   1342   1350  9      
STRAND   1362   1366  5      
STRAND   1371   1375  5      
STRAND   1378   1380  3      
HELIX   1382   1384  3      
STRAND   1386   1393  8      
STRAND   1395   1397  3      
HELIX   1398   1409  12      
STRAND   1414   1417  4      
STRAND   1431   1436  6      
HELIX   1440   1443  4      
STRAND   1448   1453  6      
STRAND   1456   1463  8      
STRAND   1467   1469  3      
STRAND   1471   1478  8      
HELIX   1481   1488  8      
STRAND   1493   1495  3      
STRAND   1497   1502  6      
STRAND   1509   1511  3      
HELIX   1514   1526  13      
HELIX   1532   1544  13      
HELIX   1555   1563  9      
HELIX   1570   1579  10      
HELIX   1584   1596  13      
HELIX   1606   1617  12      
STRAND   1627   1629  3      
HELIX   1640   1648  9      
STRAND   1653   1656  4      
STRAND   1680   1688  9      
STRAND   2422   2425  4      
HELIX   2446   2449  4      
HELIX   2453   2455  3      
STRAND   2456   2458  3      
HELIX   2461   2463  3      
HELIX   2464   2471  8      
HELIX   2481   2494  14      
HELIX   2504   2509  6      
HELIX   2524   2528  5      
HELIX   2532   2547  16      
STRAND   2549   2551  3      
STRAND   2555   2559  5      
STRAND   2563   2565  3      
STRAND   2578   2581  4      
HELIX   2584   2606  23      
HELIX   2607   2609  3      
HELIX   2611   2613  3      
HELIX   2616   2629  14      
STRAND   2630   2638  9      
HELIX   2643   2646  4      
HELIX   2649   2661  13      
HELIX   2666   2678  13      
STRAND   2683   2686  4      
STRAND   2692   2696  5      
HELIX   2706   2725  20      
STRAND   2728   2735  8      
STRAND   2738   2744  7      
HELIX   2748   2764  17      
STRAND   2769   2771  3      
HELIX   2779   2781  3      
STRAND   2787   2793  7      
STRAND   2799   2804  6      
HELIX   2808   2819  12      
HELIX   2826   2834  9      
HELIX   2838   2842  5      
HELIX   2844   2854  11      
STRAND   2862   2866  5      
STRAND   2869   2873  5      
HELIX   2875   2877  3      
HELIX   2878   2886  9      
HELIX   2888   2891  4      
HELIX   2898   2911  14      
HELIX   2916   2933  18      
HELIX   2935   2944  10      
HELIX   2946   2948  3      
STRAND   2949   2951  3      
HELIX   2959   2963  5      
TURN   2967   2970  4      
Sequence information
Length: 3010 AA [This is the length of the unprocessed precursor] Molecular weight: 327194 Da [This is the MW of the unprocessed precursor] CRC64: F8422D5ECCFDFD9C [This is a checksum on the sequence]
        10         20         30         40         50         60 
MSTNPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYLLPRR GPRLGVRAPR KTSERSQPRG 

        70         80         90        100        110        120 
RRQPIPKARR PEGRTWAQPG YPWPLYGNEG LGWAGWLLSP RGSRPSWGPT DPRRRSRNLG 

       130        140        150        160        170        180 
KVIDTLTCGF ADLMGYIPLV GAPLGGAARA LAHGVRVLED GVNYATGNLP GCSFSIFLLA 

       190        200        210        220        230        240 
LLSCLTTPAS AYEVHNVSGI YHVTNDCSNA SIVYEAADLI MHTPGCVPCV REGNSSRCWV 

       250        260        270        280        290        300 
ALTPTLAARN VTIPTTTIRR HVDLLVGAAA FCSAMYVGDL CGSVFLVSQL FTFSPRRHVT 

       310        320        330        340        350        360 
LQDCNCSIYP GHVSGHRMAW DMMMNWSPTT ALVVSQLLRI PQAVVDMVAG AHWGVLAGLA 

       370        380        390        400        410        420 
YYSMAGNWAK VLIVMLLFAG VDGDTHVTGG AQAKTTNRLV SMFASGPSQK IQLINTNGSW 

       430        440        450        460        470        480 
HINRTALNCN DSLQTGFLAA LFYTHSFNSS GCPERMAQCR TIDKFDQGWG PITYAESSRS 

       490        500        510        520        530        540 
DQRPYCWHYP PPQCTIVPAS EVCGPVYCFT PSPVVVGTTD RFGVPTYRWG ENETDVLLLN 

       550        560        570        580        590        600 
NTRPPQGNWF GCTWMNSTGF TKTCGGPPCN IGGVGNNTLT CPTDCFRKHP EATYTKCGSG 

       610        620        630        640        650        660 
PWLTPRCMVD YPYRLWHYPC TVNFTIFKVR MYVGGVEHRL NAACNWTRGE RCDLEDRDRP 

       670        680        690        700        710        720 
ELSPLLLSTT EWQVLPCSFT TLPALSTGLI HLHQNIVDVQ YLYGIGSAVV SFAIKWEYVL 

       730        740        750        760        770        780 
LLFLLLADAR VCACLWMMLL IAQAEAALEN LVVLNSASVA GAHGILSFLV FFCAAWYIKG 

       790        800        810        820        830        840 
RLVPGATYAL YGVWPLLLLL LALPPRAYAM DREMAASCGG AVFVGLVLLT LSPYYKVFLA 

       850        860        870        880        890        900 
RLIWWLQYFT TRAEADLHVW IPPLNARGGR DAIILLMCAV HPELIFDITK LLIAILGPLM 

       910        920        930        940        950        960 
VLQAGITRVP YFVRAQGLIH ACMLVRKVAG GHYVQMAFMK LGALTGTYIY NHLTPLRDWP 

       970        980        990       1000       1010       1020 
RAGLRDLAVA VEPVVFSDME TKIITWGADT AACGDIILGL PVSARRGKEI LLGPADSLEG 

      1030       1040       1050       1060       1070       1080 
RGLRLLAPIT AYSQQTRGLL GCIITSLTGR DKNQVEGEVQ VVSTATQSFL ATCVNGVCWT 

      1090       1100       1110       1120       1130       1140 
VYHGAGSKTL AAPKGPITQM YTNVDQDLVG WPKPPGARSL TPCTCGSSDL YLVTRHADVI 

      1150       1160       1170       1180       1190       1200 
PVRRRGDSRG SLLSPRPVSY LKGSSGGPLL CPFGHAVGIF RAAVCTRGVA KAVDFVPVES 

      1210       1220       1230       1240       1250       1260 
METTMRSPVF TDNSSPPAVP QSFQVAHLHA PTGSGKSTKV PAAYAAQGYK VLVLNPSVAA 

      1270       1280       1290       1300       1310       1320 
TLGFGAYMSK AHGIDPNIRT GVRTITTGAP VTYSTYGKFL ADGGCSGGAY DIIICDECHS 

      1330       1340       1350       1360       1370       1380 
TDSTTILGIG TVLDQAETAG ARLVVLATAT PPGSVTVPHP NIEEVALSNT GEIPFYGKAI 

      1390       1400       1410       1420       1430       1440 
PIEAIRGGRH LIFCHSKKKC DELAAKLSGL GINAVAYYRG LDVSVIPTIG DVVVVATDAL 

      1450       1460       1470       1480       1490       1500 
MTGYTGDFDS VIDCNTCVTQ TVDFSLDPTF TIETTTVPQD AVSRSQRRGR TGRGRRGIYR 

      1510       1520       1530       1540       1550       1560 
FVTPGERPSG MFDSSVLCEC YDAGCAWYEL TPAETSVRLR AYLNTPGLPV CQDHLEFWES 

      1570       1580       1590       1600       1610       1620 
VFTGLTHIDA HFLSQTKQAG DNFPYLVAYQ ATVCARAQAP PPSWDQMWKC LIRLKPTLHG 

      1630       1640       1650       1660       1670       1680 
PTPLLYRLGA VQNEVTLTHP ITKYIMACMS ADLEVVTSTW VLVGGVLAAL AAYCLTTGSV 

      1690       1700       1710       1720       1730       1740 
VIVGRIILSG RPAIVPDREL LYQEFDEMEE CASHLPYIEQ GMQLAEQFKQ KALGLLQTAT 

      1750       1760       1770       1780       1790       1800 
KQAEAAAPVV ESKWRALETF WAKHMWNFIS GIQYLAGLST LPGNPAIASL MAFTASITSP 

      1810       1820       1830       1840       1850       1860 
LTTQSTLLFN ILGGWVAAQL APPSAASAFV GAGIAGAAVG SIGLGKVLVD ILAGYGAGVA 

      1870       1880       1890       1900       1910       1920 
GALVAFKVMS GEMPSTEDLV NLLPAILSPG ALVVGVVCAA ILRRHVGPGE GAVQWMNRLI 

      1930       1940       1950       1960       1970       1980 
AFASRGNHVS PTHYVPESDA AARVTQILSS LTITQLLKRL HQWINEDCST PCSGSWLRDV 

      1990       2000       2010       2020       2030       2040 
WDWICTVLTD FKTWLQSKLL PQLPGVPFFS CQRGYKGVWR GDGIMQTTCP CGAQITGHVK 

      2050       2060       2070       2080       2090       2100 
NGSMRIVGPK TCSNTWHGTF PINAYTTGPC TPSPAPNYSR ALWRVAAEEY VEVTRVGDFH 

      2110       2120       2130       2140       2150       2160 
YVTGMTTDNV KCPCQVPAPE FFSEVDGVRL HRYAPACRPL LREEVTFQVG LNQYLVGSQL 

      2170       2180       2190       2200       2210       2220 
PCEPEPDVAV LTSMLTDPSH ITAETAKRRL ARGSPPSLAS SSASQLSAPS LKATCTTHHV 

      2230       2240       2250       2260       2270       2280 
SPDADLIEAN LLWRQEMGGN ITRVESENKV VVLDSFDPLR AEEDEREVSV PAEILRKSKK 

      2290       2300       2310       2320       2330       2340 
FPAAMPIWAR PDYNPPLLES WKDPDYVPPV VHGCPLPPIK APPIPPPRRK RTVVLTESSV 

      2350       2360       2370       2380       2390       2400 
SSALAELATK TFGSSESSAV DSGTATALPD QASDDGDKGS DVESYSSMPP LEGEPGDPDL 

      2410       2420       2430       2440       2450       2460 
SDGSWSTVSE EASEDVVCCS MSYTWTGALI TPCAAEESKL PINALSNSLL RHHNMVYATT 

      2470       2480       2490       2500       2510       2520 
SRSAGLRQKK VTFDRLQVLD DHYRDVLKEM KAKASTVKAK LLSVEEACKL TPPHSAKSKF 

      2530       2540       2550       2560       2570       2580 
GYGAKDVRNL SSKAVNHIHS VWKDLLEDTV TPIDTTIMAK NEVFCVQPEK GGRKPARLIV 

      2590       2600       2610       2620       2630       2640 
FPDLGVRVCE KMALYDVVST LPQVVMGSSY GFQYSPGQRV EFLVNTWKSK KNPMGFSYDT 

      2650       2660       2670       2680       2690       2700 
RCFDSTVTEN DIRVEESIYQ CCDLAPEARQ AIKSLTERLY IGGPLTNSKG QNCGYRRCRA 

      2710       2720       2730       2740       2750       2760 
SGVLTTSCGN TLTCYLKASA ACRAAKLQDC TMLVNGDDLV VICESAGTQE DAASLRVFTE 

      2770       2780       2790       2800       2810       2820 
AMTRYSAPPG DPPQPEYDLE LITSCSSNVS VAHDASGKRV YYLTRDPTTP LARAAWETAR 

      2830       2840       2850       2860       2870       2880 
HTPVNSWLGN IIMYAPTLWA RMILMTHFFS ILLAQEQLEK ALDCQIYGAC YSIEPLDLPQ 

      2890       2900       2910       2920       2930       2940 
IIERLHGLSA FSLHSYSPGE INRVASCLRK LGVPPLRVWR HRARSVRARL LSQGGRAATC 

      2950       2960       2970       2980       2990       3000 
GKYLFNWAVK TKLKLTPIPA ASRLDLSGWF VAGYSGGDIY HSLSRARPRW FMLCLLLLSV 

      3010 
GVGIYLLPNR 

P26663 in FASTA format

View entry in raw text format (no links)
Report form for errors/updates in this UniProtKB/Swiss-Prot entry

BLAST logo BLAST submission on ExPASy/SIB
or at NCBI (USA)
Tools Sequence analysis tools: ProtParam, ProtScale, Compute pI/Mw, PeptideMass, PeptideCutter, Dotlet (Java)
PROSITE logo ScanProsite, MotifScan SWISS-MODEL Submit a homology modeling request to SWISS-MODEL
NPSA logo NPSA Sequence analysis tools

ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
 Hosted by ch flag SIB Switzerland Mirror sites: Australia  Brazil  Canada  China  Korea
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!