ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!
Search for

UniProtKB/Swiss-Prot entry P17140


[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] [Features] [Sequence] [Tools]

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents.
Entry information
Entry name CO4A2_CAEEL
Primary accession number P17140
Secondary accession numbers Q19098 Q19099
Integrated into Swiss-Prot on August 1, 1990
Sequence was last modified on October 1, 1994 (Sequence version 2)
Annotations were last modified on    June 16, 2009 (Entry version 95)
Name and origin of the protein
Protein name Collagen alpha-2(IV) chain [Precursor]
Synonym Lethal protein 2
Gene name
Name: let-2
Synonyms: clb-1
ORFNames: F01G12.5
From
Caenorhabditis elegans [TaxID: 6239] 
Taxonomy Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis.
Protein existence 1: Evidence at protein level;
References
[1]
NUCLEOTIDE SEQUENCE [GENOMIC DNA], ALTERNATIVE SPLICING, AND FUNCTION.
STRAIN=Bristol N2;
DOI=10.1083/jcb.123.1.255; PubMed=7691828 [NCBI, ExPASy, EBI, Israel, Japan]
Sibley M.H., Johnson J.J., Mello C.C., Kramer J.M.;
"Genetic identification, sequence, and alternative splicing of the Caenorhabditis elegans alpha 2(IV) collagen gene.";
J. Cell Biol. 123:255-264(1993).
[2]
NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND ALTERNATIVE SPLICING.
STRAIN=Bristol N2;
DOI=10.1126/science.282.5396.2012; PubMed=9851916 [NCBI, ExPASy, EBI, Israel, Japan]
The C. elegans sequencing consortium;
"Genome sequence of the nematode C. elegans: a platform for investigating biology.";
Science 282:2012-2018(1998).
[3]
PRELIMINARY NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1495-1758.
STRAIN=Bristol N2;
PubMed=2793871 [NCBI, ExPASy, EBI, Israel, Japan]
Guo X., Kramer J.M.;
"The two Caenorhabditis elegans basement membrane (type IV) collagen genes are located on separate chromosomes.";
J. Biol. Chem. 264:17574-17582(1989).
[4]
MUTAGENESIS.
PubMed=8045258 [NCBI, ExPASy, EBI, Israel, Japan]
Sibley M.H., Graham P.L., von Mende N., Kramer J.M.;
"Mutations in the alpha 2(IV) basement membrane collagen gene of Caenorhabditis elegans produce phenotypes of differing severities.";
EMBO J. 13:3278-3285(1994).
Comments
  • FUNCTION: Collagen type IV is specific for basement membranes. Vital for embryonic development.
  • SUBUNIT: Trimers of two alpha 1(IV) and one alpha 2(IV) chain. Type IV collagen forms a mesh-like network linked through intermolecular interactions between 7S domains and between NC1 domains.
  • SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular matrix, basement membrane.
  • ALTERNATIVE PRODUCTS: 2 named isoforms [FASTA] produced by alternative splicing.
    NameI
    Synonymsa
    Isoform IDP17140-1
    This is the isoform sequence displayed in this entry.
    NameII
    Synonymsb
    Isoform IDP17140-2
    Features which should be applied to build the isoform sequence: VSP_001160.
  • DEVELOPMENTAL STAGE: Isoform I is predominant in embryos and isoform II is predominant in the larvae and adults.
  • DOMAIN: Alpha chains of type IV collagen have a non-collagenous domain (NC1) at their C-terminus, frequent interruptions of the G-X-Y repeats in the long central triple-helical domain (which may cause flexibility in the triple helix), and a short N-terminal triple-helical 7S domain.
  • PTM: Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains.
  • PTM: Type IV collagens contain numerous cysteine residues which are involved in inter- and intramolecular disulfide bonding. 12 of these, located in the NC1 domain, are conserved in all known type IV collagens.
  • SIMILARITY: Belongs to the type IV collagen family.
  • SIMILARITY: Contains 1 collagen IV NC1 (C-terminal non-collagenous) domain.
Copyright
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms. Distributed under the Creative Commons Attribution-NoDerivs License.
Cross-references
Sequence databases
EMBL
Z22964; CAA80536.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
Z22964; CAA80537.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
U22327; AAA64312.1; ALT_SEQ; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
U53342; AAA96215.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
U53342; AAA96216.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
J05066; AAA27989.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
PIR A34476; A34476.
T29350; T29350.
T29351; T29351.
UniGene Cel.17195
3D structure databases
HSSP P08572; 1LI1. [HSSP ENTRY / PDB]
SMR P17140; 1531-1753.
ModBase P17140.
Organism-specific databases
WormBase WBGene00002280; let-2.
WormPep F01G12.5a; CE04334. [WormPep / WorfDB]
F01G12.5b; CE04335. [WormPep / WorfDB]
Gene expression databases
ArrayExpress P17140; -.
Ontologies
GO
GO:0005587; Cellular component: collagen type IV (inferred from mutant phenotype from UniProtKB).
GO:0005615; Cellular component: extracellular space (inferred from electronic annotation from UniProtKB-SubCell).
GO:0005488; Molecular function: binding (inferred from electronic annotation from InterPro).
GO:0030020; Molecular function: extracellular matrix structural constituent conferring tensile strength (inferred from mutant phenotype from UniProtKB).
GO:0016043; Biological process: cellular component organization (non-traceable author statement from UniProtKB).
GO:0009792; Biological process: embryonic development ending in birth or egg hatching (inferred from mutant phenotype from WormBase).
GO:0040007; Biological process: growth (inferred from mutant phenotype from WormBase).
GO:0040011; Biological process: locomotion (inferred from mutant phenotype from WormBase).
GO:0002119; Biological process: nematode larval development (inferred from mutant phenotype from WormBase).
GO:0040018; Biological process: positive regulation of multicellular organism growth (inferred from mutant phenotype from WormBase).
GO:0000003; Biological process: reproduction (inferred from mutant phenotype from WormBase).
QuickGo view.
Family and domain databases
InterPro IPR008160; Collagen.
IPR001442; Procollagn4_C.
Graphical view of domain structure.
Gene3D G3DSA:2.170.240.10; Procollagn4_C; 1.
Pfam PF01413; C4; 2.
PF01391; Collagen; 22.
Pfam graphical view of domain structure.
ProDom PD000007; Clg_helix; 14.
PD003923; Procollagn4_C; 2.
PD003992; XGLTT_domain; 2.
[Domain structure / List of seq. sharing at least 1 domain]
SMART SM00111; C4; 2.
SMART graphical view of domain structure.
PROSITE PS51403; NC1_IV; 1.
PROSITE graphical view of domain structure (profiles).
Genome annotation databases
Ensembl F01G12.5; Caenorhabditis elegans. [Contig view]
Other
ProtoNet P17140.
UniRef View cluster of proteins with at least 50% / 90% / 100% identity.
Keywords
Alternative splicing; Basement membrane; Collagen; Complete proteome; Disulfide bond; Extracellular matrix; Glycoprotein; Hydroxylation; Repeat; Secreted; Signal.
Features
SEVIEWER logo Feature table viewer FT aligner logo Feature aligner
KeyFrom    To Length Description FTId
SIGNAL   1     26  26     Potential. 
CHAIN   27   1758  1732     Collagen alpha-2(IV) chain. PRO_0000005829
DOMAIN   1531   1754  224     Collagen IV NC1. 
REGION   27     42  16     7S domain. 
REGION   42   1527  1486     Triple-helical region. 
CARBOHYD   248    248        O-linked (Xyl...) (glycosaminoglycan) (Potential). 
DISULFID   1546   1635        By similarity. 
DISULFID   1579   1632        By similarity. 
DISULFID   1591   1597        By similarity. 
DISULFID   1654   1750        By similarity. 
DISULFID   1688   1747        By similarity. 
DISULFID   1700   1707        By similarity. 
VAR_SEQ   229    264        GDLGSVGPPGPPGPREFTGSGSIVGPRGNPGEKGDK -> GDIGAMGPAGPPGPIASTMSKGTIIGPKGDLGEKGEK (in isoform II). VSP_001160
MUTAGEN   48     48        G->E: In MN114; 73% lethal. 
MUTAGEN   366    366        A->T: In MN126; 100% lethal. 
MUTAGEN   570    570        G->E: In MN109; 37% lethal. 
MUTAGEN   588    588        G->R: In MN103 and MN151; 96% lethal. 
MUTAGEN   597    597        G->R: In MN152; 50% lethal. 
MUTAGEN   690    690        G->E: In MN129; 100% lethal. 
MUTAGEN   690    690        G->R: In MN101; 100% lethal. 
MUTAGEN   737    737        G->E: In MN143; 100% lethal. 
MUTAGEN   877    877        G->R: In G30; 90% lethal. 
MUTAGEN   904    904        G->R: In E1470; 94% lethal. 
MUTAGEN   1003   1003        G->E: In MN139; 20% lethal. 
MUTAGEN   1125   1125        G->D: In G25; 2% lethal. 
MUTAGEN   1152   1152        G->D: In MN147; 7% lethal. 
MUTAGEN   1286   1286        G->D: In G37 and B246; 9% lethal. 
CONFLICT   1604   1604        E -> D (in Ref. 2; AAA96215/AAA96216). 
CONFLICT   1682   1682        P -> L (in Ref. 2; AAA96215/AAA96216). 
Sequence information
Length: 1758 AA [This is the length of the unprocessed precursor] Molecular weight: 167751 Da [This is the MW of the unprocessed precursor] CRC64: 97EE3F3DBB2D2AC5 [This is a checksum on the sequence]
        10         20         30         40         50         60 
MKQRAALGPV LRLAILALLA VSYVQSQATC RDCSNRGCFC VGEKGSMGAP GPQGPPGTQG 

        70         80         90        100        110        120 
IRGFPGPEGL AGPKGLKGAQ GPPGPVGIKG DRGAVGVPGF PGNDGGNGRP GEPGPPGAPG 

       130        140        150        160        170        180 
WDGCNGTDGA PGIPGRPGPP GMPGFPGPPG MDGLKGEPAI GYAGAPGEKG DGGMPGMPGL 

       190        200        210        220        230        240 
PGPSGRDGYP GEKGDRGDTG NAGPRGPPGE AGSPGNPGIG SIGPKGDPGD LGSVGPPGPP 

       250        260        270        280        290        300 
GPREFTGSGS IVGPRGNPGE KGDKGEPGEG GQRGYPGNGG LSGQPGLPGM KGEKGLSGPA 

       310        320        330        340        350        360 
GPRGKEGRPG NAGPPGFKGD RGLDGLGGIP GLPGQKGEAG YPGRDGPKGN SGPPGPPGGG 

       370        380        390        400        410        420 
TFNDGAPGPP GLPGRPGNPG PPGTDGYPGA PGPAGPIGNT GGPGLPGYPG NEGLPGPKGD 

       430        440        450        460        470        480 
KGDGGIPGAP GVSGPSGIPG LPGPKGEPGY RGTPGQSIPG LPGKDGKPGL DGAPGRKGEN 

       490        500        510        520        530        540 
GLPGVRGPPG DSLNGLPGAP GQRGAPGPNG YDGRDGVNGL PGAPGTKGDR GGTCSACAPG 

       550        560        570        580        590        600 
TKGEKGLPGY SGQPGPQGDR GLPGMPGPVG DAGDDGLPGP AGRPGSPGPP GQDGFPGLPG 

       610        620        630        640        650        660 
QKGEPTQLTL RPGPPGYPGL KGENGFPGQP GVDGLPGPSG PVGPPGAPGY PGEKGDAGLP 

       670        680        690        700        710        720 
GLSGKPGQDG LPGLPGNKGE AGYGQPGQPG FPGAKGDGGL PGLPGTPGLQ GMPGEPAPEN 

       730        740        750        760        770        780 
QVNPAPPGQP GLPGLPGTKG EGGYPGRPGE VGQPGFPGLP GMKGDSGLPG PPGLPGHPGV 

       790        800        810        820        830        840 
PGDKGFGGVP GLPGIPGPKG DVGNPGLPGL NGQKGEPGVG VPGQPGSPGF PGLKGDAGLP 

       850        860        870        880        890        900 
GLPGTPGLEG QRGFPGAPGL KGGDGLPGLS GQPGYPGEKG DAGLPGVPGR EGSPGFPGQD 

       910        920        930        940        950        960 
GLPGVPGMKG EDGLPGLPGV TGLKGDLGAP GQSGAPGLPG APGYPGMKGN AGIPGVPGFK 

       970        980        990       1000       1010       1020 
GDGGLPGLPG LNGPKGEPGV PGMPGTPGMK GNGGLPGLPG RDGLSGVPGM KGDRGFNGLP 

      1030       1040       1050       1060       1070       1080 
GEKGEAGPAA RDGQKGDAGL PGQPGLRGPQ GPSGLPGVPG FKGETGLPGY GQPGQPGEKG 

      1090       1100       1110       1120       1130       1140 
LPGIPGKAGR QGAPGSPGQD GLPGFPGMKG ESGYPGQDGL PGRDGLPGVP GQKGDLGQSG 

      1150       1160       1170       1180       1190       1200 
QPGLSGAPGL DGQPGVPGIR GDKGQGGLPG IPGDRGMDGY PGQKGENGYP GQPGLPGLGG 

      1210       1220       1230       1240       1250       1260 
EKGFAGTPGF PGLKGSPGYP GQDGLPGIPG LKGDSGFPGQ PGQEGLPGLS GEKGMGGLPG 

      1270       1280       1290       1300       1310       1320 
MPGQPGQSIA GPVGPPGAPG LQGKDGFPGL PGQKGESGLS GLPGAPGLKG ESGMPGFPGA 

      1330       1340       1350       1360       1370       1380 
KGDLGANGIP GKRGEDGLPG VPGRDGQPGI PGLKGEVGGA GLPGQPGFPG IPGLKGEGGL 

      1390       1400       1410       1420       1430       1440 
PGFPGAKGEA GFPGTPGVPG YAGEKGDGGL PGLPGRDGLP GADGPVGPPG PSGPQNLVEP 

      1450       1460       1470       1480       1490       1500 
GEKGLPGLPG APGLRGEKGM PGLDGPPGND GPPGLPGQRG NDGYPGAPGL SGEKGMGGLP 

      1510       1520       1530       1540       1550       1560 
GFPGLDGQPG GPGAPGLPGA PGAAGPAYRD GFVLVKHSQT TEVPRCPEGQ TKLWDGYSLL 

      1570       1580       1590       1600       1610       1620 
YIEGNEKSHN QDLGHAGSCL QRFSTMPFLF CDFNNVCNYA SRNEKSYWLS TSEAIPMMPV 

      1630       1640       1650       1660       1670       1680 
NEREIEPYIS RCAVCEAPAN TIAVHSQTIQ IPNCPAGWSS LWIGYSFAMH TGAGAEGGGQ 

      1690       1700       1710       1720       1730       1740 
SPSSPGSCLE DFRATPFIEC NGARGSCHYF ANKFSFWLTT IDNDSEFKVP ESQTLKSGNL 

      1750 
RTRVSRCQVC VKSTDGRH 

P17140 in FASTA format

View entry in raw text format (no links)
Report form for errors/updates in this UniProtKB/Swiss-Prot entry

BLAST logo BLAST submission on ExPASy/SIB
or at NCBI (USA)
Tools Sequence analysis tools: ProtParam, ProtScale, Compute pI/Mw, PeptideMass, PeptideCutter, Dotlet (Java)
PROSITE logo ScanProsite, MotifScan SWISS-MODEL Submit a homology modeling request to SWISS-MODEL
NPSA logo NPSA Sequence analysis tools

ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
 Hosted by ch flag SIB Switzerland Mirror sites: Australia  Brazil  Canada  China  Korea
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!