ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!
Search for

UniProtKB/Swiss-Prot entry Q9NYQ7


[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] [Features] [Sequence] [Tools]

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents.
Entry information
Entry name CELR3_HUMAN
Primary accession number Q9NYQ7
Secondary accession number O75092
Integrated into Swiss-Prot on August 2, 2002
Sequence was last modified on October 1, 2000 (Sequence version 1)
Annotations were last modified on    November 25, 2008 (Entry version 80)
Name and origin of the protein
Protein name Cadherin EGF LAG seven-pass G-type receptor 3 [Precursor]
Synonyms Flamingo homolog 1
hFmi1
Multiple epidermal growth factor-like domains 2
Epidermal growth factor-like 1
Gene name
Name: CELSR3
Synonyms: CDHF11, EGFL1, FMI1, KIAA0812, MEGF2
From
Homo sapiens (Human) [TaxID: 9606] 
Taxonomy Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo.
Protein existence 1: Evidence at protein level;
References
[1]
NUCLEOTIDE SEQUENCE [MRNA].
DOI=10.1073/pnas.060027397; PubMed=10716726 [NCBI, ExPASy, EBI, Israel, Japan]
Wu Q., Maniatis T.;
"Large exons encoding multiple ectodomains are a characteristic feature of protocadherin genes.";
Proc. Natl. Acad. Sci. U.S.A. 97:3124-3129(2000).
[2]
NUCLEOTIDE SEQUENCE [MRNA] OF 1954-3312.
TISSUE=Brain;
DOI=10.1006/geno.1998.5341; PubMed=9693030 [NCBI, ExPASy, EBI, Israel, Japan]
Nakayama M., Nakajima D., Nagase T., Nomura N., Seki N., Ohara O.;
"Identification of high-molecular-weight proteins with multiple EGF-like motifs by motif-trap screening.";
Genomics 51:27-34(1998).
Comments
Copyright
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms. Distributed under the Creative Commons Attribution-NoDerivs License.
Cross-references
Sequence databases
EMBL
AF231023; AAF61929.1; -; mRNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AB011536; BAA32464.1; -; mRNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
PIR T00250; T00250.
RefSeq NP_001398.2; -.
UniGene Hs.631926
3D structure databases
HSSP P01132; 1GK5. [HSSP ENTRY / PDB]
ModBase Q9NYQ7.
Protein-protein interaction databases
IntAct Q9NYQ7; -.
Protein family/group databases
GPCRDB Q9NYQ7; CELR3_HUMAN.
PTM databases
PhosphoSite Q9NYQ7; -.
Organism-specific databases
H-InvDB HIX0003291; -.
HGNC HGNC:3230; CELSR3.
GenAtlas CELSR3.
MIM 604264; gene. [NCBI / EBI]
PharmGKB PA26395; -.
GeneCards Q9NYQ7.
HUGE KIAA0812.
Gene expression databases
ArrayExpress Q9NYQ7; -.
CleanEx HS_CELSR3; -.
GermOnline ENSG00000008300; Homo sapiens.
Ontologies
GO
GO:0016021; Cellular component: integral to membrane (inferred from electronic annotation from UniProtKB-KW).
GO:0005886; Cellular component: plasma membrane (inferred from electronic annotation from UniProtKB-KW).
GO:0005509; Molecular function: calcium ion binding (inferred from electronic annotation from InterPro).
GO:0004930; Molecular function: G-protein coupled receptor activity (inferred from electronic annotation from InterPro).
GO:0007156; Biological process: homophilic cell adhesion (inferred from electronic annotation from InterPro).
GO:0007275; Biological process: multicellular organismal development (inferred from electronic annotation from UniProtKB-KW).
GO:0007218; Biological process: neuropeptide signaling pathway (inferred from electronic annotation from InterPro).
QuickGo view.
Family and domain databases
InterPro IPR002126; Cadherin.
IPR013320; ConA_like_subgrp.
IPR006210; EGF.
IPR000152; EGF-type_Asp/Asn_hydroxyl_CS.
IPR000742; EGF_3.
IPR002049; EGF_laminin.
IPR006209; EGF_like.
IPR013032; EGF_like_reg_CS.
IPR001879; GPCR_2_extracellular.
IPR000832; GPCR_2_secretin-like.
IPR001791; Laminin_G.
IPR012680; Laminin_G_2.
IPR000203; PKD_cys_rich.
Graphical view of domain structure.
Gene3D G3DSA:2.60.40.60; Cadherin; 8.
G3DSA:2.60.120.200; ConA_like_subgrp; 2.
Pfam PF00002; 7tm_2; 1.
PF00028; Cadherin; 9.
PF00008; EGF; 6.
PF01825; GPS; 1.
PF02793; HRM; 1.
PF00053; Laminin_EGF; 1.
PF02210; Laminin_G_2; 2.
Pfam graphical view of domain structure.
PRINTS PR00205; CADHERIN.
PR00249; GPCRSECRETIN.
SMART SM00112; CA; 9.
SM00181; EGF; 6.
SM00180; EGF_Lam; 1.
SM00303; GPS; 1.
SM00008; HormR; 1.
SM00282; LamG; 2.
SMART graphical view of domain structure.
PROSITE PS00010; ASX_HYDROXYL; 1.
PS00232; CADHERIN_1; 7.
PS50268; CADHERIN_2; 8.
PS00022; EGF_1; 6.
PS01186; EGF_2; 4.
PS50026; EGF_3; 6.
PS01248; EGF_LAM_1; 1.
PS50027; EGF_LAM_2; 1.
PS00649; G_PROTEIN_RECEP_F2_1; FALSE_NEG.
PS00650; G_PROTEIN_RECEP_F2_2; 1.
PS50227; G_PROTEIN_RECEP_F2_3; 1.
PS50261; G_PROTEIN_RECEP_F2_4; 1.
PS50221; GPS; 1.
PS50025; LAM_G_DOMAIN; 2.
PROSITE graphical view of domain structure (profiles).
ProtoNet Q9NYQ7.
Genome annotation databases
Ensembl ENSG00000008300; Homo sapiens. [Contig view]
GeneID 1951; -.
KEGG hsa:1951; -.
Phylogenomic databases
HOVERGEN Q9NYQ7; -.
Other
LinkHub Q9NYQ7; -.
NextBio 7907; -.
SOURCE CELSR3; Homo sapiens.
GPCRDB-Snakes Q9NYQ7.
UniRef View cluster of proteins with at least 50% / 90% / 100% identity.
Keywords
Calcium; Cell membrane; Developmental protein; EGF-like domain; G-protein coupled receptor; Glycoprotein; Hydroxylation; Laminin EGF-like domain; Membrane; Polymorphism; Receptor; Repeat; Signal; Transducer; Transmembrane.
Features
SEVIEWER logo Feature table viewer FT aligner logo Feature aligner
KeyFrom    To Length Description FTId
SIGNAL   1     32  32     Potential. 
CHAIN   33   3312  3280     Cadherin EGF LAG seven-pass G-type receptor 3. PRO_0000012918
TOPO_DOM   33   2540  2508     Extracellular (Potential). 
TRANSMEM   2541   2561  21     1 (Potential). 
TOPO_DOM   2562   2572  11     Cytoplasmic (Potential). 
TRANSMEM   2573   2593  21     2 (Potential). 
TOPO_DOM   2594   2601  8     Extracellular (Potential). 
TRANSMEM   2602   2622  21     3 (Potential). 
TOPO_DOM   2623   2643  21     Cytoplasmic (Potential). 
TRANSMEM   2644   2664  21     4 (Potential). 
TOPO_DOM   2665   2681  17     Extracellular (Potential). 
TRANSMEM   2682   2702  21     5 (Potential). 
TOPO_DOM   2703   2725  23     Cytoplasmic (Potential). 
TRANSMEM   2726   2746  21     6 (Potential). 
TOPO_DOM   2747   2753  7     Extracellular (Potential). 
TRANSMEM   2754   2774  21     7 (Potential). 
TOPO_DOM   2775   3312  538     Cytoplasmic (Potential). 
DOMAIN   326    433  108     Cadherin 1. 
DOMAIN   434    545  112     Cadherin 2. 
DOMAIN   546    651  106     Cadherin 3. 
DOMAIN   652    756  105     Cadherin 4. 
DOMAIN   757    858  102     Cadherin 5. 
DOMAIN   859    961  103     Cadherin 6. 
DOMAIN   962   1067  106     Cadherin 7. 
DOMAIN   1068   1169  102     Cadherin 8. 
DOMAIN   1170   1265  96     Cadherin 9. 
DOMAIN   1375   1433  59     EGF-like 1; calcium-binding. 
DOMAIN   1435   1471  37     EGF-like 2; calcium-binding. 
DOMAIN   1475   1514  40     EGF-like 3; calcium-binding. 
DOMAIN   1515   1719  205     Laminin G-like 1. 
DOMAIN   1722   1758  37     EGF-like 4; calcium-binding. 
DOMAIN   1764   1944  181     Laminin G-like 2. 
DOMAIN   1946   1982  37     EGF-like 5; calcium-binding. 
DOMAIN   1983   2020  38     EGF-like 6; calcium-binding. 
DOMAIN   2021   2053  33     EGF-like 7; calcium-binding. 
DOMAIN   2055   2090  36     EGF-like 8; calcium-binding. 
DOMAIN   2077   2124  48     Laminin EGF-like. 
DOMAIN   2477   2529  53     GPS. 
MOD_RES   1963   1963        3-hydroxyaspartate (Potential). 
CARBOHYD   632    632        N-linked (GlcNAc...) (Potential). 
CARBOHYD   847    847        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1182   1182        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1222   1222        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1317   1317        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1327   1327        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1649   1649        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1713   1713        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1770   1770        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2053   2053        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2177   2177        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2196   2196        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2386   2386        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2474   2474        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2506   2506        N-linked (GlcNAc...) (Potential). 
DISULFID   1379   1390        By similarity. 
DISULFID   1384   1421        By similarity. 
DISULFID   1423   1432        By similarity. 
DISULFID   1439   1450        By similarity. 
DISULFID   1444   1459        By similarity. 
DISULFID   1461   1470        By similarity. 
DISULFID   1479   1490        By similarity. 
DISULFID   1484   1500        By similarity. 
DISULFID   1502   1513        By similarity. 
DISULFID   1693   1719        By similarity. 
DISULFID   1726   1737        By similarity. 
DISULFID   1731   1746        By similarity. 
DISULFID   1748   1757        By similarity. 
DISULFID   1915   1944        By similarity. 
DISULFID   1950   1961        By similarity. 
DISULFID   1955   1970        By similarity. 
DISULFID   1972   1981        By similarity. 
DISULFID   1985   1996        By similarity. 
DISULFID   1990   2008        By similarity. 
DISULFID   2010   2019        By similarity. 
DISULFID   2027   2040        By similarity. 
DISULFID   2042   2052        By similarity. 
DISULFID   2059   2074        By similarity. 
DISULFID   2061   2077        By similarity. 
DISULFID   2079   2089        By similarity. 
DISULFID   2098   2107        By similarity. 
DISULFID   2110   2122        By similarity. 
VARIANT   157    157  1     A -> P (in dbSNP:rs3733085 [NCBI]). VAR_020022 
VARIANT   805    805  1     S -> T (in dbSNP:rs3821875 [NCBI]). VAR_020023 
CONFLICT   2158   2158        G -> GLRGAG (in Ref. 2). 
Sequence information
Length: 3312 AA [This is the length of the unprocessed precursor] Molecular weight: 358257 Da [This is the MW of the unprocessed precursor] CRC64: BEC208703651A4A5 [This is a checksum on the sequence]
        10         20         30         40         50         60 
MMARRPPWRG LGERSTPILL LLLLSLFPLS QEELGGGGHQ GWDPGLAATT GPRAHIGGGA 

        70         80         90        100        110        120 
LALCPESSGV REDGGPGLGV REPIFVGLRG RRQSARNSRG PPEQPNEELG IEHGVQPLGS 

       130        140        150        160        170        180 
RERETGQGPG SVLYWRPEVS SCGRTGPLQR GSLSPGALSS GVPGSGNSSP LPSDFLIRHH 

       190        200        210        220        230        240 
GPKPVSSQRN AGTGSRKRVG TARCCGELWA TGSKGQGERA TTSGAERTAP RRNCLPGASG 

       250        260        270        280        290        300 
SGPELDSAPR TARTAPASGS APRESRTAPE PAPKRMRSRG LFRCRFLPQR PGPRPPGLPA 

       310        320        330        340        350        360 
RPEARKVTSA NRARFRRAAN RHPQFPQYNY QTLVPENEAA GTAVLRVVAQ DPDAGEAGRL 

       370        380        390        400        410        420 
VYSLAALMNS RSLELFSIDP QSGLIRTAAA LDRESMERHY LRVTAQDHGS PRLSATTMVA 

       430        440        450        460        470        480 
VTVADRNDHS PVFEQAQYRE TLRENVEEGY PILQLRATDG DAPPNANLRY RFVGPPAARA 

       490        500        510        520        530        540 
AAAAAFEIDP RSGLISTSGR VDREHMESYE LVVEASDQGQ EPGPRSATVR VHITVLDEND 

       550        560        570        580        590        600 
NAPQFSEKRY VAQVREDVRP HTVVLRVTAT DRDKDANGLV HYNIISGNSR GHFAIDSLTG 

       610        620        630        640        650        660 
EIQVVAPLDF EAEREYALRI RAQDAGRPPL SNNTGLASIQ VVDINDHIPI FVSTPFQVSV 

       670        680        690        700        710        720 
LENAPLGHSV IHIQAVDADH GENARLEYSL TGVAPDTPFV INSATGWVSV SGPLDRESVE 

       730        740        750        760        770        780 
HYFFGVEARD HGSPPLSASA SVTVTVLDVN DNRPEFTMKE YHLRLNEDAA VGTSVVSVTA 

       790        800        810        820        830        840 
VDRDANSAIS YQITGGNTRN RFAISTQGGV GLVTLALPLD YKQERYFKLV LTASDRALHD 

       850        860        870        880        890        900 
HCYVHINITD ANTHRPVFQS AHYSVSVNED RPMGSTIVVI SASDDDVGEN ARITYLLEDN 

       910        920        930        940        950        960 
LPQFRIDADS GAITLQAPLD YEDQVTYTLA ITARDNGIPQ KADTTYVEVM VNDVNDNAPQ 

       970        980        990       1000       1010       1020 
FVASHYTGLV SEDAPPFTSV LQISATDRDA HANGRVQYTF QNGEDGDGDF TIEPTSGIVR 

      1030       1040       1050       1060       1070       1080 
TVRRLDREAV SVYELTAYAV DRGVPPLRTP VSIQVMVQDV NDNAPVFPAE EFEVRVKENS 

      1090       1100       1110       1120       1130       1140 
IVGSVVAQIT AVDPDEGPNA HIMYQIVEGN IPELFQMDIF SGELTALIDL DYEARQEYVI 

      1150       1160       1170       1180       1190       1200 
VVQATSAPLV SRATVHVRLV DQNDNSPVLN NFQILFNNYV SNRSDTFPSG IIGRIPAYDP 

      1210       1220       1230       1240       1250       1260 
DVSDHLFYSF ERGNELQLLV VNQTSGELRL SRKLDNNRPL VASMLVTVTD GLHSVTAQCV 

      1270       1280       1290       1300       1310       1320 
LRVVIITEEL LANSLTVRLE NMWQERFLSP LLGRFLEGVA AVLATPAEDV FIFNIQNDTD 

      1330       1340       1350       1360       1370       1380 
VGGTVLNVSF SALAPRGAGA GAAGPWFSSE ELQEQLYVRR AALAARSLLD VLPFDDNVCL 

      1390       1400       1410       1420       1430       1440 
REPCENYMKC VSVLRFDSSA PFLASASTLF RPIQPIAGLR CRCPPGFTGD FCETELDLCY 

      1450       1460       1470       1480       1490       1500 
SNPCRNGGAC ARREGGYTCV CRPRFTGEDC ELDTEAGRCV PGVCRNGGTC TDAPNGGFRC 

      1510       1520       1530       1540       1550       1560 
QCPAGGAFEG PRCEVAARSF PPSSFVMFRG LRQRFHLTLS LSFATVQQSG LLFYNGRLNE 

      1570       1580       1590       1600       1610       1620 
KHDFLALELV AGQVRLTYST GESNTVVSPT VPGGLSDGQW HTVHLRYYNK PRTDALGGAQ 

      1630       1640       1650       1660       1670       1680 
GPSKDKVAVL SVDDCDVAVA LQFGAEIGNY SCAAAGVQTS SKKSLDLTGP LLLGGVPNLP 

      1690       1700       1710       1720       1730       1740 
ENFPVSHKDF IGCMRDLHID GRRVDMAAFV ANNGTMAGCQ AKLHFCDSGP CKNSGFCSER 

      1750       1760       1770       1780       1790       1800 
WGSFSCDCPV GFGGKDCQLT MAHPHHFRGN GTLSWNFGSD MAVSVPWYLG LAFRTRATQG 

      1810       1820       1830       1840       1850       1860 
VLMQVQAGPH STLLCQLDRG LLSVTVTRGS GRASHLLLDQ VTVSDGRWHD LRLELQEEPG 

      1870       1880       1890       1900       1910       1920 
GRRGHHVLMV SLDFSLFQDT MAVGSELQGL KVKQLHVGGL PPGSAEEAPQ GLVGCIQGVW 

      1930       1940       1950       1960       1970       1980 
LGSTPSGSPA LLPPSHRVNA EPGCVVTNAC ASGPCPPHAD CRDLWQTFSC TCQPGYYGPG 

      1990       2000       2010       2020       2030       2040 
CVDACLLNPC QNQGSCRHLP GAPHGYTCDC VGGYFGHHCE HRMDQQCPRG WWGSPTCGPC 

      2050       2060       2070       2080       2090       2100 
NCDVHKGFDP NCNKTNGQCH CKEFHYRPRG SDSCLPCDCY PVGSTSRSCA PHSGQCPCRP 

      2110       2120       2130       2140       2150       2160 
GALGRQCNSC DSPFAEVTAS GCRVLYDACP KSLRSGVWWP QTKFGVLATV PCPRGALGAA 

      2170       2180       2190       2200       2210       2220 
VRLCDEAQGW LEPDLFNCTS PAFRELSLLL DGLELNKTAL DTMEAKKLAQ RLREVTGHTD 

      2230       2240       2250       2260       2270       2280 
HYFSQDVRVT ARLLAHLLAF ESHQQGFGLT ATQDAHFNEN LLWAGSALLA PETGDLWAAL 

      2290       2300       2310       2320       2330       2340 
GQRAPGGSPG SAGLVRHLEE YAATLARNME LTYLNPMGLV TPNIMLSIDR MEHPSSPRGA 

      2350       2360       2370       2380       2390       2400 
RRYPRYHSNL FRGQDAWDPH THVLLPSQSP RPSPSEVLPT SSSIENSTTS SVVPPPAPPE 

      2410       2420       2430       2440       2450       2460 
PEPGISIIIL LVYRTLGGLL PAQFQAERRG ARLPQNPVMN SPVVSVAVFH GRNFLRGILE 

      2470       2480       2490       2500       2510       2520 
SPISLEFRLL QTANRSKAIC VQWDPPGLAE QHGVWTARDC ELVHRNGSHA RCRCSRTGTF 

      2530       2540       2550       2560       2570       2580 
GVLMDASPRE RLEGDLELLA VFTHVVVAVS VAALVLTAAI LLSLRSLKSN VRGIHANVAA 

      2590       2600       2610       2620       2630       2640 
ALGVAELLFL LGIHRTHNQL VCTAVAILLH YFFLSTFAWL FVQGLHLYRM QVEPRNVDRG 

      2650       2660       2670       2680       2690       2700 
AMRFYHALGW GVPAVLLGLA VGLDPEGYGN PDFCWISVHE PLIWSFAGPV VLVIVMNGTM 

      2710       2720       2730       2740       2750       2760 
FLLAARTSCS TGQREAKKTS ALTLRSSFLL LLLVSASWLF GLLAVNHSIL AFHYLHAGLC 

      2770       2780       2790       2800       2810       2820 
GLQGLAVLLL FCVLNADARA AWMPACLGRK AAPEEARPAP GLGPGAYNNT ALFEESGLIR 

      2830       2840       2850       2860       2870       2880 
ITLGASTVSS VSSARSGRTQ DQDSQRGRSY LRDNVLVRHG SAADHTDHSL QAHAGPTDLD 

      2890       2900       2910       2920       2930       2940 
VAMFHRDAGA DSDSDSDLSL EEERSLSIPS SESEDNGRTR GRFQRPLCRA AQSERLLTHP 

      2950       2960       2970       2980       2990       3000 
KDVDGNDLLS YWPALGECEA APCALQTWGS ERRLGLDTSK DAANNNQPDP ALTSGDETSL 

      3010       3020       3030       3040       3050       3060 
GRAQRQRKGI LKNRLQYPLV PQTRGAPELS WCRAATLGHR AVPAASYGRI YAGGGTGSLS 

      3070       3080       3090       3100       3110       3120 
QPASRYSSRE QLDLLLRRQL SRERLEEAPA PVLRPLSRPG SQECMDAAPG RLEPKDRGST 

      3130       3