ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!

UniProtKB/Swiss-Prot entry O08550


Entry information
Entry name MLL4_MOUSE
Primary accession number O08550
Secondary accession number Q5NU09
Integrated into Swiss-Prot on December 1, 2000
Sequence was last modified on January 9, 2007 (Sequence version 2)
Annotations were last modified on November 25, 2008 (Entry version 66)
Name and origin of the protein
Protein name Histone-lysine N-methyltransferase MLL4
Synonyms EC 2.1.1.43
Myeloid/lymphoid or mixed-lineage leukemia protein 4 homolog
WW domain-binding protein 7
WBP-7
Trithorax homolog 2
Gene name
Name: Wbp7
Synonyms: Mll2, Trx2
From
Mus musculus (Mouse) [TaxID: 10090] 
Taxonomy Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea; Muridae; Murinae; Mus.
Protein existence 1: Evidence at protein level;
References
[1]
NUCLEOTIDE SEQUENCE [MRNA].
Yoshida K.;
"Murine MLL2 gene and its expression.";
Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases.
[2]
NUCLEOTIDE SEQUENCE [MRNA] OF 379-657.
DOI=10.1093/emboj/16.9.2376; PubMed=9171351
Bedford M.T., Chan D.C., Leder P.;
"FBP WW domains and the Abl SH3 domain bind to a specific class of proline-rich ligands.";
EMBO J. 16:2376-2383(1997).
[3]
PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1927, AND MASS SPECTROMETRY.
TISSUE=Liver;
DOI=10.1073/pnas.0609836104; PubMed=17242355
Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.;
"Large-scale phosphorylation analysis of mouse liver.";
Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007).
[4]
PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-320, AND MASS SPECTROMETRY.
DOI=10.1126/science.1140321; PubMed=17525332
Matsuoka S., Ballif B.A., Smogorzewska A., McDonald E.R. III, Hurov K.E., Luo J., Bakalarski C.E., Zhao Z., Solimini N., Lerenthal Y., Shiloh Y., Gygi S.P., Elledge S.J.;
"ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage.";
Science 316:1160-1166(2007).
Comments
Copyright
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms. Distributed under the Creative Commons Attribution-NoDerivs License.
Cross-references
Sequence databases
EMBL
AB182318; BAD81031.1; -; mRNA.
U92455; AAC53192.1; ALT_SEQ; mRNA.
RefSeq NP_083550.2; -.
UniGene Mm.168688
3D structure databases
ModBase O08550.
Organism-specific databases
MGI MGI:109565; Wbp7.
Gene expression databases
ArrayExpress O08550; -.
CleanEx MM_WBP7; -.
GermOnline ENSMUSG00000006307; Mus musculus.
Ontologies
GO
GO:0005634; Cellular component: nucleus (inferred from electronic annotation from InterPro).
GO:0003677; Molecular function: DNA binding (inferred from electronic annotation from InterPro).
GO:0018024; Molecular function: histone-lysine N-methyltransferase activity (inferred from electronic annotation from EC).
GO:0005515; Molecular function: protein binding (inferred from electronic annotation from InterPro).
GO:0008270; Molecular function: zinc ion binding (inferred from electronic annotation from InterPro).
GO:0016568; Biological process: chromatin modification (inferred from electronic annotation from UniProtKB-KW).
GO:0006355; Biological process: regulation of transcription, DNA-dependent (inferred from electronic annotation from UniProtKB-KW).
Family and domain databases
InterPro IPR000637; AT_hook_DNA_bd.
IPR003889; FYrich_C.
IPR003888; FYrich_N.
IPR015722; MLL.
IPR003616; Post-SET_Zn_bd.
IPR001214; SET.
IPR002857; Znf_CXXC.
IPR001965; Znf_PHD.
IPR013083; Znf_RING/FYVE/PHD.
Gene3D G3DSA:3.30.40.10; Znf_RING/FYVE/PHD; 1.
PANTHER PTHR22884:SF10; MLL; 1.
Pfam PF05965; FYRC; 1.
PF05964; FYRN; 1.
PF00628; PHD; 3.
PF00856; SET; 1.
PF02008; zf-CXXC; 1.
SMART SM00384; AT_hook; 3.
SM00542; FYRC; 1.
SM00541; FYRN; 1.
SM00249; PHD; 4.
SM00508; PostSET; 1.
SM00317; SET; 1.
PROSITE PS50868; POST_SET; 1.
PS50280; SET; 1.
PS51058; ZF_CXXC; 1.
PS01359; ZF_PHD_1; 3.
PS50016; ZF_PHD_2; 3.
Genome annotation databases
Ensembl ENSMUSG00000006307; Mus musculus.
GeneID 75410; -.
KEGG mmu:75410; -.
Phylogenomic databases
HOVERGEN O08550; -.
Other
NextBio 342938; -.
SOURCE Wbp7; Mus musculus.
ProtoNet O08550.
UniRef View cluster of proteins with at least 50% / 90% / 100% identity.
Keywords
Chromatin regulator; DNA-binding; Metal-binding; Methyltransferase; Nucleus; Phosphoprotein; Repeat; S-adenosyl-L-methionine; Transcription; Transcription regulation; Transferase; Zinc; Zinc-finger.
Features
Key       From    To  Length  Description                               FTId
CHAIN        1  2713    2713  Histone-lysine N-methyltransferase MLL4.  PRO_0000124882
DOMAIN    2572  2693     122  SET.
DOMAIN    2697  2713      17  Post-SET.
DNA_BIND    37    44       8  A.T hook 1.
DNA_BIND   110   117       8  A.T hook 2.
DNA_BIND   357   365       9  A.T hook 3.
ZN_FING    964  1011      48  CXXC-type.
ZN_FING   1207  1258      52  PHD-type 1.
ZN_FING   1255  1309      55  PHD-type 2.
ZN_FING   1341  1402      62  PHD-type 3.
COMPBIAS     6   106     101  Gly-rich.
COMPBIAS   272   300      29  Gly-rich.
COMPBIAS   347   410      64  Glu-rich.
COMPBIAS   414   776     363  Pro-rich.
COMPBIAS  1814  2296     483  Pro-rich.
MOD_RES    159   159          Phosphothreonine (By similarity).
MOD_RES    320   320          Phosphoserine.
MOD_RES    826   826          Phosphoserine (By similarity).
MOD_RES    849   849          Phosphoserine (By similarity).
MOD_RES    866   866          Phosphoserine (By similarity).
MOD_RES   1037  1037          Phosphoserine (By similarity).
MOD_RES   1040  1040          Phosphoserine (By similarity).
MOD_RES   1170  1170          Phosphoserine (By similarity).
MOD_RES   1926  1926          Phosphoserine (By similarity).
MOD_RES   1927  1927          Phosphoserine.
MOD_RES   1931  1931          Phosphothreonine (By similarity).
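Note: the Length column follows from the 1-based, inclusive From/To coordinates as To - From + 1. A minimal sketch of that check, using four rows copied from the feature table; the `Feature` tuple and `feature_length` helper are illustrative names only, not part of any UniProt or ExPASy tool.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    key: str
    start: int  # "From" column, 1-based and inclusive
    end: int    # "To" column, 1-based and inclusive
    description: str


def feature_length(feature: Feature) -> int:
    """Number of residues covered by an inclusive From/To range."""
    return feature.end - feature.start + 1


# Rows copied from the feature table of this entry.
FEATURES = [
    Feature("CHAIN", 1, 2713, "Histone-lysine N-methyltransferase MLL4"),
    Feature("DOMAIN", 2572, 2693, "SET"),
    Feature("DOMAIN", 2697, 2713, "Post-SET"),
    Feature("ZN_FING", 964, 1011, "CXXC-type"),
]

for f in FEATURES:
    print(f"{f.key:<9}{f.start:>5}{f.end:>6}{feature_length(f):>8}  {f.description}")
```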
Sequence information
Length: 2713 AA [This is the length of the unprocessed precursor]
Molecular weight: 294836 Da [This is the MW of the unprocessed precursor]
CRC64: B24BCA2D019EB055 [This is a checksum on the sequence]
        10         20         30         40         50         60 
MAAAAGGGSC PGPGSARVRF PGRPLGCGGG GGRGGRGNGA ERVRVALRRG GGAAGPGGAE 

        70         80         90        100        110        120 
PGEDTALLRL LGLRRGLRRL RRLWAGARVQ RGRGRGRGRG WGPNRGCMPE EESSDGESEE 

       130        140        150        160        170        180 
EEFQGFHSDE DVAPSSLRSA LRSQRGRAPR GRGRKHKTTP LPPRLADVTP VPPKAPTRKR 

       190        200        210        220        230        240 
GEEGTERMVQ ALTELLRRSQ APQPPRSRAR AREPSTPRRS RGRPPGRPAG PCRKKQQAVV 

       250        260        270        280        290        300 
LAEAAVTIPK PEPPPPVVPV KNKAGSWKCK EGPGPGPGTP KRGGQPGRGG RGGRGRGRGG 

       310        320        330        340        350        360 
LPLMIKFVSK AKKVKMGQLS QELESGQGHG QRGESWQDAP QRKDGDEPER GSCRKKQEQK 

       370        380        390        400        410        420 
LEEEEEEEEK EGEEKEEKDD NEDNNKQEEE EETERAVAEE EAMLAKEKEE AKLPSPPLTP 

       430        440        450        460        470        480 
PVPSPPPPLP PPSTSPPPPA SPLPPPVSPP PPLSPPPYPA PEKQEESPPL VPATCSRKRG 

       490        500        510        520        530        540 
RPPLTPSQRA EREAARSGPE GTLSPTPNPS TTTGSPLEDS PTVVPKSTTF LKNIRQFIMP 

       550        560        570        580        590        600 
VVSARSSRVI KTPRRFMDED PPKPPKVEAS IVRPPVATSP PAPQEPVPVS SPPRVPTPPS 

       610        620        630        640        650        660 
TPVPLPEKRR SILREPTFRW TSLTRELPPP PPAPPPAPSP PPAPATPSRR PLLLRAPQFT 

       670        680        690        700        710        720 
PSEAHLKIYE SVLTPPPLGA LETPEPELPP ADDSPAEPEP RAVGRTNHLS LPRFVPVVTS 

       730        740        750        760        770        780 
PVKVEVPPHG APALSEGQQL QLQQPPQALQ TQLLPQALPP QQPQAQPPPS PQHTPPLEKA 

       790        800        810        820        830        840 
RVASLGSLPL SGVEEKMFSL LKRAKVQLFK IDQQQQQKVA ASMPLSPAVQ TEEAVGTVKQ 

       850        860        870        880        890        900 
TPDRGCVRSE DESMEAKRDR ASGPESPLQG PRIKHVCRHA AVALGQARAM VPEDVPRLSA 

       910        920        930        940        950        960 
LPLRDRQDLA TEDTSSASET ESVPSRSQRE KVESAGPGGD SEPTGSTGAL AHTPRRSLPS 

       970        980        990       1000       1010       1020 
HHGKKMRMAR CGHCRGCLRV QDCGSCVNCL DKPKFGGPNT KKQCCVYRKC DKIEARKMER 

      1030       1040       1050       1060       1070       1080 
LAKKGRTIVK TLLPWDSDES PEASPGPPGP RRGAGAGGSR EEVGATPGPE EQDSLLLQRK 

      1090       1100       1110       1120       1130       1140 
SARRCVKQRP SYDVFEDSDD SEPGGPPAPR RRTPREHELP VLEPEEQSRP RKPTLQPVLQ 

      1150       1160       1170       1180       1190       1200 
LKARRRLDKD ALAPGPFASF PNGWTGKQKS PDGVHRVRVD FKEDCDLENV WLMGGLSVLT 

      1210       1220       1230       1240       1250       1260 
SVPGGPPMVC LLCASKGLHE LVFCQVCCDP FHPFCLEEAE RPSPQHRDTW CCRRCKFCHV 

      1270       1280       1290       1300       1310       1320 
CGRKGRGSKH LLECERCRHA YHPACLGPSY PTRATRRRRH WICSACVRCK SCGATPGKNW 

      1330       1340       1350       1360       1370       1380 
DVEWSGDYSL CPRCTELYEK GNYCPICTRC YEDNDYESKM MQCAQCDHWV HAKCEGLSDE 

      1390       1400       1410       1420       1430       1440 
DYEILSGLPD SVLYTCGPCA GATQPRWREA LSGALQGGLR QVLQGLLSSK VAGPLLLCTQ 

      1450       1460       1470       1480       1490       1500 
CGQDGKQLHP GPCDLQAVGK RFEEGLYKSV HSFMEDVVAI LMRHSEEGET PERRAGSQMK 

      1510       1520       1530       1540       1550       1560 
GLLLKLLESA FCWFDAHDPK YWRRSTRLPN GVLPNAVLPP SLDHVYAQWR QQESETPESG 

      1570       1580       1590       1600       1610       1620 
QPPGDPSAAF QSKDPAAFSH LDDPRQCALC LKYGDADSKE AGRLLYIGQN EWTHVNCAIW 

      1630       1640       1650       1660       1670       1680 
SAEVFEENDG SLKNVHAAVA RGRQMRCELC LKPGATVGCC LSSCLSNFHF MCARASYCIF 

      1690       1700       1710       1720       1730       1740 
QDDKKVFCQK HTDLLDGKEI VTPDGFDVLR RVYVDFEGIN FKRKFLTGLE PDVINVLIGS 

      1750       1760       1770       1780       1790       1800 
IRINSLGTLS DLSDCEGRLF PIGYQCSRLY WSTVDARRRC WYRCRILEYR PWGPREEPVH 

      1810       1820       1830       1840       1850       1860 
LEAAEENQTI VHSPTPSSDT DSLIPGDPVH HSPIQNLDPP LRTDSSNGPP PTPRSFSGAR 

      1870       1880       1890       1900       1910       1920 
IKVPNYSPSR RPLGGVSFGP LPSPGSPSSL THHIPTVGDS DFPAPPRRSR RPSPLATRPP 

      1930       1940       1950       1960       1970       1980 
PSRRTSSPLR TSPQLRVPLS TSVTALTPTS GELAPPDLAP SPLPPSEDLG PDFEDMEVVS 

      1990       2000       2010       2020       2030       2040 
GLSAADLDFA ASLLGTEPFQ EEIVAAGAVG SSQGGPGDSS EEEASPTTHY VHFPVTVVSG 

      2050       2060       2070       2080       2090       2100 
PALAPSSLAG APRIEQLDGV DDGTDSEAEA VQQPRGQGTP PSGPGVGRGG VLGAAGDRAQ 

      2110       2120       2130       2140       2150       2160 
PPEDLPSEIV DFVLKNLGGP GEGAAGPRED SLPSAPPLAN GSQPPQSLST SPADPTRTFA 

      2170       2180       2190       2200       2210       2220 
WLPGAPGVRV LSLGPAPEPP KPATSKIILV NKLGQVFVKM AGEGEPVAPP VKQPPLPPII 

      2230       2240       2250       2260       2270       2280 
PPTAPTSWTL PPGPLLSVLP VVGVGVVRPA PPPPPPPLTL VFSSGPPSPP RQAIRVKRVS 

      2290       2300       2310       2320       2330       2340 
TFSGRSPPVP PPNKTPRLDE DGESLEDAHH VPGISGSGFS RVRMKTPTVR GVLDLNNPGE 

      2350       2360       2370       2380       2390       2400 
QPEEESPGRP QDRCPLLPLA EAPSQALDGS SDLLFESQWH HYSAGEASSS EEEPPSPEDK 

      2410       2420       2430       2440       2450       2460 
ENQVPKRVGP HLRFEISSDD GFSVEAESLE VAWRTLIEKV QEARGHARLR HLSFSGMSGA 

      2470       2480       2490       2500       2510       2520 
RLLGIHHDAV IFLAEQLPGA QRCQHYKFRY HQQGEGQEEP PLNPHGAARA EVYLRKCTFD 

      2530       2540       2550       2560       2570       2580 
MFNFLASQHR VLPEGATCDE EEDEVQLRST RRATSLELPM AMRFRHLKKT SKEAVGVYRS 

      2590       2600       2610       2620       2630       2640 
AIHGRGLFCK RNIDAGEMVI EYSGIVIRSV LTDKREKFYD GKGIGCYMFR MDDFDVVDAT 

      2650       2660       2670       2680       2690       2700 
MHGNAARFIN HSCEPNCFSR VIHVEGQKHI VIFALRRILR GEELTYDYKF PIEDASNKLP 

      2710 
CNCGAKRCRR FLN 
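
Note: the sequence is displayed in groups of 10 residues under a ruler line of position numbers. A minimal sketch of turning that layout back into one contiguous string and cutting out a feature by its 1-based, inclusive coordinates; `parse_sequence_block` and `feature_residues` are hypothetical helper names, and only the first displayed row of this entry is used as input.

```python
def parse_sequence_block(text: str) -> str:
    """Join the 10-residue groups of a displayed sequence block into one string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines that hold only position numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


def feature_residues(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, where both bounds are 1-based and inclusive."""
    return sequence[start - 1:end]


# First ruler line and first sequence row of this entry.
BLOCK = """
        10         20         30         40         50         60
MAAAAGGGSC PGPGSARVRF PGRPLGCGGG GGRGGRGNGA ERVRVALRRG GGAAGPGGAE
"""

sequence = parse_sequence_block(BLOCK)
print(len(sequence))
# Residues 37-44 are annotated as "A.T hook 1" in the feature table.
print(feature_residues(sequence, 37, 44))
```

Applied to the full block, the parsed string should have the length reported under Sequence information (2713 AA).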

