ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!
Search for

UniProtKB/Swiss-Prot entry P01266


[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] [Features] [Sequence] [Tools]

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents.
Entry information
Entry name THYG_HUMAN
Primary accession number P01266
Secondary accession numbers O15274 O43899 Q15593 Q15948 Q9NYR1 Q9NYR2 Q9UMZ0 Q9UNY3
Integrated into Swiss-Prot on July 21, 1986
Sequence was last modified on February 20, 2007 (Sequence version 5)
Annotations were last modified on    June 16, 2009 (Entry version 113)
Name and origin of the protein
Protein name Thyroglobulin [Precursor]
Synonym Tg
Gene name
Name: TG
From
Homo sapiens (Human) [TaxID: 9606] 
Taxonomy Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo.
Protein existence 1: Evidence at protein level;
References
[1]
NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND VARIANTS ASP-604; ASP-653; GLN-985 DEL; TYR-1043; THR-1059; GLY-1312; ARG-1437; HIS-1463; THR-1936; GLU-2091; LEU-2149; ARG-2170 AND HIS-2242.
DOI=10.1111/j.1432-1033.1987.tb11466.x; PubMed=3595599 [NCBI, ExPASy, EBI, Israel, Japan]
Malthiery Y., Lissitzky S.;
"Primary structure of human thyroglobulin deduced from the sequence of its 8448-base complementary DNA.";
Eur. J. Biochem. 165:491-498(1987).
[2]
NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND VARIANT GLY-1312.
TISSUE=Thyroid;
PubMed=9186272 [NCBI, ExPASy, EBI, Israel, Japan]
van de Graaf S.A.R., Pauws E., de Vijlder J.J.M., Ris-Stalpers C.;
"The revised 8307 base pair coding sequence of human thyroglobulin transiently expressed in eukaryotic cells.";
Eur. J. Endocrinol. 136:508-515(1997).
[3]
NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANT GLU-515.
DOI=10.1038/nature04406; PubMed=16421571 [NCBI, ExPASy, EBI, Israel, Japan]
Nusbaum C., Mikkelsen T.S., Zody M.C., Asakawa S., Taudien S., Garber M., Kodira C.D., Schueler M.G., Shimizu A., Whittaker C.A., Chang J.L., Cuomo C.A., Dewar K., FitzGerald M.G., Yang X., Allen N.R., Anderson S., Asakawa T., Blechschmidt K., Bloom T., Borowsky M.L., Butler J., Cook A., Corum B., DeArellano K., DeCaprio D., Dooley K.T., Dorris L. III, Engels R., Gloeckner G., Hafez N., Hagopian D.S., Hall J.L., Ishikawa S.K., Jaffe D.B., Kamat A., Kudoh J., Lehmann R., Lokitsang T., Macdonald P., Major J.E., Matthews C.D., Mauceli E., Menzel U., Mihalev A.H., Minoshima S., Murayama Y., Naylor J.W., Nicol R., Nguyen C., O'Leary S.B., O'Neill K., Parker S.C.J., Polley A., Raymond C.K., Reichwald K., Rodriguez J., Sasaki T., Schilhabel M., Siddiqui R., Smith C.L., Sneddon T.P., Talamas J.A., Tenzin P., Topham K., Venkataraman V., Wen G., Yamazaki S., Young S.K., Zeng Q., Zimmer A.R., Rosenthal A., Birren B.W., Platzer M., Shimizu N., Lander E.S.;
"DNA sequence and analysis of human chromosome 8.";
Nature 439:331-335(2006).
[4]
NUCLEOTIDE SEQUENCE [MRNA] OF 1-730, AND VARIANTS ASP-604 AND ASP-653.
DOI=10.1111/j.1432-1033.1985.tb08717.x; PubMed=3971976 [NCBI, ExPASy, EBI, Israel, Japan]
Malthiery Y., Lissitzky S.;
"Sequence of the 5'-end quarter of the human-thyroglobulin messenger ribonucleic acid and of its deduced amino-acid sequence.";
Eur. J. Biochem. 147:53-58(1985).
[5]
NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-415; 640-737 AND 880-1000, AND VARIANT ALA-734.
DOI=10.1016/0022-2836(87)90403-7; PubMed=3681978 [NCBI, ExPASy, EBI, Israel, Japan]
Parma J., Christophe D., Pohl V., Vassart G.;
"Structural organization of the 5' region of the thyroglobulin gene. Evidence for intron loss and 'exonization' during evolution.";
J. Mol. Biol. 196:769-779(1987).
[6]
NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-25.
DOI=10.1093/nar/13.14.5127; PubMed=2991855 [NCBI, ExPASy, EBI, Israel, Japan]
Christophe D., Cabrer B., Bacolla A., Targovnik H.M., Pohl V., Vassart G.;
"An unusually long poly(purine)-poly(pyrimidine) sequence is located upstream from the human thyroglobulin gene.";
Nucleic Acids Res. 13:5127-5144(1985).
[7]
NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1002-1566 (ISOFORM 1), AND VARIANT GLY-1312.
DOI=10.1530/eje.0.1430789; PubMed=11124863 [NCBI, ExPASy, EBI, Israel, Japan]
Moya C.M., Mendive F.M., Rivolta C.M., Vassart G., Targovnik H.M.;
"Genomic organization of the 5' region of the human thyroglobulin gene.";
Eur. J. Endocrinol. 143:789-798(2000).
[8]
NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1645-2768, AND VARIANTS LEU-2149 AND ARG-2170.
PubMed=10524569 [NCBI, ExPASy, EBI, Israel, Japan]
Mendive F.M., Rivolta C.M., Vassart G., Targovnik H.M.;
"Genomic organization of the 3' region of the human thyroglobulin gene.";
Thyroid 9:903-912(1999).
[9]
NUCLEOTIDE SEQUENCE [MRNA] OF 1504-1602 (ISOFORM 2).
DOI=10.1016/0303-7207(92)90087-M; PubMed=1639210 [NCBI, ExPASy, EBI, Israel, Japan]
Targovnik H.M., Cochaux P., Corach D., Vassart G.;
"Identification of a minor Tg mRNA transcript in RNA from normal and goitrous thyroids.";
Mol. Cell. Endocrinol. 84:R23-R26(1992).
[10]
PARTIAL PROTEIN SEQUENCE.
DOI=10.1016/0014-5793(89)80513-7; PubMed=2914619 [NCBI, ExPASy, EBI, Israel, Japan]
Marriq C., Lejeune P.J., Venot N., Vinet L.;
"Hormone synthesis in human thyroglobulin: possible cleavage of the polypeptide chain at the tyrosine donor site.";
FEBS Lett. 242:414-418(1989).
[11]
PARTIAL PROTEIN SEQUENCE.
DOI=10.1111/j.1432-1033.1993.tb18414.x; PubMed=8269951 [NCBI, ExPASy, EBI, Israel, Japan]
Gentile F., Salvatore G.;
"Preferential sites of proteolytic cleavage of bovine, human and rat thyroglobulin. The use of limited proteolysis to detect solvent-exposed regions of the primary structure.";
Eur. J. Biochem. 218:603-621(1993).
[12]
PARTIAL PROTEIN SEQUENCE.
DOI=10.1006/abbi.1995.1346; PubMed=7793989 [NCBI, ExPASy, EBI, Israel, Japan]
Xiao S., Pollock H.G., Taurog A., Rawitch A.B.;
"Characterization of hormonogenic sites in an N-terminal, cyanogen bromide fragment of human thyroglobulin.";
Arch. Biochem. Biophys. 320:96-105(1995).
[13]
PARTIAL PROTEIN SEQUENCE.
DOI=10.1006/abbi.1996.0093; PubMed=8615697 [NCBI, ExPASy, EBI, Israel, Japan]
Yang S.X., Pollock H.G., Rawitch A.B.;
"Glycosylation in human thyroglobulin: location of the N-linked oligosaccharide units and comparison with bovine thyroglobulin.";
Arch. Biochem. Biophys. 327:61-70(1996).
[14]
PRESENCE OF A 11TH TYROGLOBULIN TYPE-1 REPEAT.
DOI=10.1111/j.1432-1033.1996.0125h.x; PubMed=8797845 [NCBI, ExPASy, EBI, Israel, Japan]
Molina F., Bouanani M., Pau B., Granier C.;
"Characterization of the type-1 repeat from thyroglobulin, a cysteine-rich module found in proteins from different families.";
Eur. J. Biochem. 240:125-133(1996).
[15]
IODINATION AT TYR-24; TYR-1310; TYR-2573; TYR-2587 AND TYR-2766.
PubMed=2760035 [NCBI, ExPASy, EBI, Israel, Japan]
Lamas L., Anderson P.C., Fox J.W., Dunn J.T.;
"Consensus sequences for early iodination and hormonogenesis in human thyroglobulin.";
J. Biol. Chem. 264:13541-13545(1989).
[16]
SULFATION.
DOI=10.1006/bbrc.1999.1173; PubMed=10448091 [NCBI, ExPASy, EBI, Israel, Japan]
Nlend M.-C., Cauvi D., Venot N., Chabaud O.;
"Sulfated tyrosines of thyroglobulin are involved in thyroid hormone synthesis.";
Biochem. Biophys. Res. Commun. 262:193-197(1999).
[17]
SULFATION AT TYR-24.
DOI=10.1016/S0006-291X(02)02425-7; PubMed=12387814 [NCBI, ExPASy, EBI, Israel, Japan]
Venot N., Nlend M.-C., Cauvi D., Chabaud O.;
"The hormonogenic tyrosine 5 of porcine thyroglobulin is sulfated.";
Biochem. Biophys. Res. Commun. 298:193-197(2002).
[18]
VARIANT GOITER HIS-870.
DOI=10.1016/0140-6736(93)90209-Y; PubMed=8094490 [NCBI, ExPASy, EBI, Israel, Japan]
Corral J., Martin C., Perez R., Sanchez I., Mories M.T., San Millan J.L., Miralles J.M., Gonzalez-Sarmiento R.;
"Thyroglobulin gene point mutation associated with non-endemic simple goitre.";
Lancet 341:462-464(1993).
[19]
VARIANTS GOITER ARG-1264 AND SER-1996, AND VARIANTS HIS-135; ASP-604; ASP-653; ALA-734; GLU-830; GLN-985 DEL; VAL-1028; TYR-1043; THR-1059; ARG-1437; HIS-1463; ASN-1838; THR-1936; TRP-1999; GLU-2091; LEU-2149; ARG-2170; HIS-2242; ARG-2501 AND GLN-2530.
DOI=10.1210/jc.84.4.1438; PubMed=10199792 [NCBI, ExPASy, EBI, Israel, Japan]
Hishinuma A., Takamatsu J., Ohyama Y., Yokozawa T., Kanno Y., Kuma K., Yoshida S., Matsuura N., Ieiri T.;
"Two novel cysteine substitutions (C1263R and C1995S) of thyroglobulin cause a defect in intracellular transport of thyroglobulin in patients with congenital goiter and the variant type of adenomatous goiter.";
J. Clin. Endocrinol. Metab. 84:1438-1444(1999).
[20]
VARIANTS ALA-734; VAL-1028 AND TRP-1979, AND INVOLVEMENT IN AITD3.
DOI=10.1073/pnas.2434175100; PubMed=14657345 [NCBI, ExPASy, EBI, Israel, Japan]
Ban Y., Greenberg D.A., Concepcion E., Skrabanek L., Villanueva R., Tomer Y.;
"Amino acid substitutions in the thyroglobulin gene are associated with susceptibility to human and murine autoimmune thyroid disease.";
Proc. Natl. Acad. Sci. U.S.A. 100:15119-15124(2003).
Comments
Copyright
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms. Distributed under the Creative Commons Attribution-NoDerivs License.
Cross-references
Sequence databases
EMBL
X05615; CAA29104.1; -; mRNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
U93033; AAC51924.1; -; mRNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF230667; -; NOT_ANNOTATED_CDS; Genomic_DNA.[EMBL / GenBank / DDBJ]
AF235100; -; NOT_ANNOTATED_CDS; Genomic_DNA.[EMBL / GenBank / DDBJ]
AF230666; -; NOT_ANNOTATED_CDS; Genomic_DNA.[EMBL / GenBank / DDBJ]
AF305872; -; NOT_ANNOTATED_CDS; Genomic_DNA.[EMBL / GenBank / DDBJ]
X02154; CAA26089.1; -; mRNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06059; CAA29454.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06060; CAA29454.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06061; CAA29454.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06062; CAA29454.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06063; CAA29454.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06064; CAA29454.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06065; CAA29454.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06066; CAA29454.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06067; CAA29455.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06068; CAA29455.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06069; CAA29456.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X06070; CAA29456.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
X02749; CAA26527.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF170489; AAD51647.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF170486; AAD51647.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF170487; AAD51647.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF170488; AAD51647.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF105687; AAC95473.1; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF105681; AAC95473.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF105682; AAC95473.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF105683; AAC95473.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF105684; AAC95473.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF105685; AAC95473.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF105686; AAC95473.1; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080484; AAD50912.2; -; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169654; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169655; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169656; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169657; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169658; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169659; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169661; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169662; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169663; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF169664; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080472; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080473; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080474; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080475; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080476; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080477; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080478; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080479; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080480; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080481; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080482; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
AF080483; AAD50912.2; JOINED; Genomic_DNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
S40807; AAB22685.1; -; mRNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
IPI IPI00306129; -.
IPI00549199; -.
PIR A59110; UIHU.
RefSeq NP_003226.4; -.
UniGene Hs.654591
3D structure databases
HSSP P21836; 1N5M. [HSSP ENTRY / PDB]
ModBase P01266.
Protein family/group databases
MEROPS I31.950; -.
S09.978; -.
PTM databases
PhosphoSite P01266; -.
Organism-specific databases
GeneCards GC08P133948; -.
H-InvDB HIX0034371; -.
HGNC HGNC:11764; TG.
GenAtlas TG.
HPA CAB000077; -.
HPA002740; -.
MIM 188450; gene+phenotype. [NCBI / EBI]
608175; phenotype. [NCBI / EBI]
PharmGKB PA28623; -.
Gene expression databases
ArrayExpress P01266; -.
Bgee P01266; -.
Ontologies
GO
GO:0005576; Cellular component: extracellular region (non-traceable author statement from UniProtKB).
GO:0005179; Molecular function: hormone activity (inferred from electronic annotation from UniProtKB-KW).
GO:0042446; Biological process: hormone biosynthetic process (inferred from electronic annotation from UniProtKB-KW).
GO:0007165; Biological process: signal transduction (non-traceable author statement from ProtInc).
GO:0006590; Biological process: thyroid hormone generation (inferred from electronic annotation from UniProtKB-KW).
QuickGo view.
Family and domain databases
InterPro IPR002018; CarbesteraseB.
IPR019819; Carboxylesterase_B_CS.
IPR011641; GCC2_GCC3.
IPR016324; Thyroglobulin.
IPR000716; Thyroglobulin_1.
Graphical view of domain structure.
Gene3D G3DSA:4.10.800.10; Thyroglobulin_1; 7.
PANTHER PTHR11559; CarbesteraseB; 1.
Pfam PF00135; COesterase; 1.
PF07699; GCC2_GCC3; 1.
PF00086; Thyroglobulin_1; 9.
Pfam graphical view of domain structure.
PIRSF PIRSF001831; Thyroglobulin; 1.
SMART SM00211; TY; 10.
SMART graphical view of domain structure.
PROSITE PS00941; CARBOXYLESTERASE_B_2; 1.
PS00484; THYROGLOBULIN_1_1; 9.
PS51162; THYROGLOBULIN_1_2; 11.
PROSITE graphical view of domain structure (profiles).
Proteomic databases
PRIDE P01266; -.
Genome annotation databases
Ensembl ENSG00000042832; Homo sapiens. [Contig view]
GeneID 7038; -.
KEGG hsa:7038; -.
NMPDR fig|9606.3.peg.30756; -.
Phylogenomic databases
HOVERGEN P01266; -.
OMA P01266; ARLALQF.
Other
NextBio 27497; -.
SOURCE TG; Homo sapiens.
ProtoNet P01266.
UniRef View cluster of proteins with at least 50% / 90% / 100% identity.
Keywords
Alternative splicing; Direct protein sequencing; Disease mutation; Disulfide bond; Glycoprotein; Hormone; Iodination; Polymorphism; Repeat; Secreted; Signal; Sulfation; Thyroid hormone; Thyroid hormones biosynthesis.
Features
SEVIEWER logo Feature table viewer FT aligner logo Feature aligner
KeyFrom    To Length Description FTId
SIGNAL   1     19  19      
CHAIN   20   2768  2749     Thyroglobulin. PRO_0000008636
DOMAIN   31     92  62     Thyroglobulin type-1 1. 
DOMAIN   93    160  68     Thyroglobulin type-1 2. 
DOMAIN   161    297  137     Thyroglobulin type-1 3. 
DOMAIN   298    358  61     Thyroglobulin type-1 4. 
DOMAIN   605    658  54     Thyroglobulin type-1 5. 
DOMAIN   659    726  68     Thyroglobulin type-1 6. 
DOMAIN   727    921  195     Thyroglobulin type-1 7. 
DOMAIN   922   1073  152     Thyroglobulin type-1 8. 
DOMAIN   1074   1145  72     Thyroglobulin type-1 9. 
DOMAIN   1146   1210  65     Thyroglobulin type-1 10. 
REPEAT   1456   1469  14     Type II. 
REPEAT   1470   1486  17     Type II. 
REPEAT   1487   1503  17     Type II. 
DOMAIN   1511   1565  55     Thyroglobulin type-1 11. 
REPEAT   1603   1723  121     Type IIIA. 
REPEAT   1724   1892  169     Type IIIB. 
REPEAT   1893   1995  103     Type IIIA. 
REPEAT   1996   2129  134     Type IIIB. 
REPEAT   2130   2187  58     Type IIIA. 
MOD_RES   24     24        Sulfotyrosine. 
MOD_RES   24     24        Thyroxine. 
MOD_RES   1310   1310        Thyroxine. 
MOD_RES   2573   2573        Thyroxine. 
MOD_RES   2587   2587        Thyroxine. 
MOD_RES   2766   2766        Triiodothyronine. 
CARBOHYD   76     76        N-linked (GlcNAc...) (Potential). 
CARBOHYD   110    110        N-linked (GlcNAc...) (Potential). 
CARBOHYD   198    198        N-linked (GlcNAc...) (Potential). 
CARBOHYD   484    484        N-linked (GlcNAc...) (Potential). 
CARBOHYD   496    496        N-linked (GlcNAc...) (Potential). 
CARBOHYD   529    529        N-linked (GlcNAc...) (Potential). 
CARBOHYD   748    748        N-linked (GlcNAc...) (Potential). 
CARBOHYD   816    816        N-linked (GlcNAc...) (Potential). 
CARBOHYD   947    947        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1220   1220        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1348   1348        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1349   1349        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1365   1365        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1716   1716        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1774   1774        N-linked (GlcNAc...) (Potential). 
CARBOHYD   1869   1869        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2013   2013        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2122   2122        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2250   2250        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2295   2295        N-linked (GlcNAc...) (Potential). 
CARBOHYD   2582   2582        N-linked (GlcNAc...) (Potential). 
DISULFID   34     52        By similarity. 
DISULFID   63     70        By similarity. 
DISULFID   72     92        By similarity. 
DISULFID   96    120        By similarity. 
DISULFID   131    138        By similarity. 
DISULFID   140    160        By similarity. 
DISULFID   164    183        By similarity. 
DISULFID   194    235        By similarity. 
DISULFID   301    319        By similarity. 
DISULFID   330    336        By similarity. 
DISULFID   338    358        By similarity. 
DISULFID   608    620        By similarity. 
DISULFID   631    636        By similarity. 
DISULFID   638    658        By similarity. 
DISULFID   662    687        By similarity. 
DISULFID   698    703        By similarity. 
DISULFID   705    726        By similarity. 
DISULFID   730    763        By similarity. 
DISULFID   774    898        By similarity. 
DISULFID   900    921        By similarity. 
DISULFID   1042   1049        By similarity. 
DISULFID   1051   1073        By similarity. 
DISULFID   1077   1108        By similarity. 
DISULFID   1126   1145        By similarity. 
DISULFID   1149   1169        By similarity. 
DISULFID   1181   1188        By similarity. 
DISULFID   1190   1210        By similarity. 
DISULFID   1514   1523        By similarity. 
DISULFID   1543   1565        By similarity. 
DISULFID   2264   2281        Potential. 
VAR_SEQ   1510   1567        CVTDCQRNEAGLQCDQNGQYRASQKDRGSGKAFCVDGEGR RLPWWETEAPLEDSQCLM -> L (in isoform 2). VSP_012655
VARIANT   135    135  1     Q -> H (in dbSNP:rs2069546 [NCBI]). VAR_010212 
VARIANT   515    515  1     Q -> E (in dbSNP:rs180222 [NCBI]). VAR_016190 
VARIANT   604    604  1     S -> D (requires 2 nucleotide substitutions). VAR_016852 
VARIANT   653    653  1     G -> D (in dbSNP:rs2069548 [NCBI]). VAR_016853 
VARIANT   734    734  1     S -> A (polymorphism associated with AITD3; dbSNP:rs180223 [NCBI]). VAR_010213 
VARIANT   777    777  1     P -> L (in dbSNP:rs3739274 [NCBI]). VAR_049077 
VARIANT   815    815  1     G -> R (in dbSNP:rs16904774 [NCBI]). VAR_049078 
VARIANT   830    830  1     Q -> E (in dbSNP:rs2076737 [NCBI]). VAR_010214 
VARIANT   870    870  1     Q -> H (in goiter; simple; dbSNP:rs2229843 [NCBI]). VAR_002365 
VARIANT   985    985  1     Missing. VAR_016854
VARIANT   988    988  1     R -> P (in dbSNP:rs16893332 [NCBI]). VAR_049079 
VARIANT   1028   1028  1     M -> V (polymorphism associated with AITD3; dbSNP:rs853326 [NCBI]). VAR_010215 
VARIANT   1043   1043  1     H -> Y. VAR_016855 
VARIANT   1059   1059  1     I -> T. VAR_016856 
VARIANT   1063   1063  1     L -> M (in dbSNP:rs11992497 [NCBI]). VAR_049080 
VARIANT   1222   1222  1     S -> L (in dbSNP:rs12549018 [NCBI]). VAR_049081 
VARIANT   1264   1264  1     C -> R (in goiter; autosomal recessive; dbSNP:rs2076738 [NCBI]). VAR_010216 
VARIANT   1312   1312  1     D -> G (in dbSNP:rs2069556 [NCBI]). VAR_010217 
VARIANT   1437   1437  1     W -> R (in dbSNP:rs2069558 [NCBI]). VAR_016857 
VARIANT   1463   1463  1     P -> H. VAR_016858 
VARIANT   1740   1740  1     T -> K (in dbSNP:rs16904791 [NCBI]). VAR_049082 
VARIANT   1838   1838  1     D -> N (in dbSNP:rs2069561 [NCBI]). VAR_010218 
VARIANT   1936   1936  1     A -> T (in dbSNP:rs2069562 [NCBI]). VAR_016859 
VARIANT   1979   1979  1     R -> W (polymorphism associated with AITD3). VAR_032013 
VARIANT   1996   1996  1     C -> S (in goiter; autosomal recessive). VAR_010219 
VARIANT   1999   1999  1     R -> W (in dbSNP:rs2076740 [NCBI]). VAR_010220 
VARIANT   2091   2091  1     D -> E. VAR_016860 
VARIANT   2149   2149  1     P -> L. VAR_016861 
VARIANT   2170   2170  1     Q -> R (in dbSNP:rs2069565 [NCBI]). VAR_016862 
VARIANT   2242   2242  1     R -> H (in dbSNP:rs2069566 [NCBI]). VAR_016863 
VARIANT   2455   2455  1     R -> H (in dbSNP:rs2272707 [NCBI]). VAR_049083 
VARIANT   2469   2469  1     L -> P (in dbSNP:rs2069568 [NCBI]). VAR_049084 
VARIANT   2501   2501  1     W -> R (in dbSNP:rs2069569 [NCBI]). VAR_010221 
VARIANT   2526   2526  1     F -> L (in dbSNP:rs12114109 [NCBI]). VAR_049085 
VARIANT   2530   2530  1     R -> Q (in dbSNP:rs1133076 [NCBI]). VAR_010222 
VARIANT   2616   2616  1     N -> S (in dbSNP:rs10091530 [NCBI]). VAR_049086 
CONFLICT   23     25        EYQ -> GKF (in Ref. 6). 
CONFLICT   848    848        Missing (in Ref. 13; AA sequence). 
CONFLICT   984    985        EQ -> DR (in Ref. 5). 
CONFLICT   1359   1360        Missing (in Ref. 13; AA sequence). 
CONFLICT   1717   1717        L -> A (in Ref. 13; AA sequence). 
CONFLICT   1776   1776        T -> S (in Ref. 13; AA sequence). 
CONFLICT   2019   2019        G -> H (in Ref. 13; AA sequence). 
CONFLICT   2287   2287        F -> P (in Ref. 13; AA sequence). 
Sequence information
Length: 2768 AA [This is the length of the unprocessed precursor] Molecular weight: 304790 Da [This is the MW of the unprocessed precursor] CRC64: 69A87D935F1BAA72 [This is a checksum on the sequence]
        10         20         30         40         50         60 
MALVLEIFTL LASICWVSAN IFEYQVDAQP LRPCELQRET AFLKQADYVP QCAEDGSFQT 

        70         80         90        100        110        120 
VQCQNDGRSC WCVGANGSEV LGSRQPGRPV ACLSFCQLQK QQILLSGYIN STDTSYLPQC 

       130        140        150        160        170        180 
QDSGDYAPVQ CDVQQVQCWC VDAEGMEVYG TRQLGRPKRC PRSCEIRNRR LLHGVGDKSP 

       190        200        210        220        230        240 
PQCSAEGEFM PVQCKFVNTT DMMIFDLVHS YNRFPDAFVT FSSFQRRFPE VSGYCHCADS 

       250        260        270        280        290        300 
QGRELAETGL ELLLDEIYDT IFAGLDLPST FTETTLYRIL QRRFLAVQSV ISGRFRCPTK 

       310        320        330        340        350        360 
CEVERFTATS FGHPYVPSCR RNGDYQAVQC QTEGPCWCVD AQGKEMHGTR QQGEPPSCAE 

       370        380        390        400        410        420 
GQSCASERQQ ALSRLYFGTS GYFSQHDLFS SPEKRWASPR VARFATSCPP TIKELFVDSG 

       430        440        450        460        470        480 
LLRPMVEGQS QQFSVSENLL KEAIRAIFPS RGLARLALQF TTNPKRLQQN LFGGKFLVNV 

       490        500        510        520        530        540 
GQFNLSGALG TRGTFNFSQF FQQLGLASFL NGGRQEDLAK PLSVGLDSNS STGTPEAAKK 

       550        560        570        580        590        600 
DGTMNKPTVG SFGFEINLQE NQNALKFLAS LLELPEFLLF LQHAISVPED VARDLGDVME 

       610        620        630        640        650        660 
TVLSSQTCEQ TPERLFVPSC TTEGSYEDVQ CFSGECWCVN SWGKELPGSR VRGGQPRCPT 

       670        680        690        700        710        720 
DCEKQRARMQ SLMGSQPAGS TLFVPACTSE GHFLPVQCFN SECYCVDAEG QAIPGTRSAI 

       730        740        750        760        770        780 
GKPKKCPTPC QLQSEQAFLR TVQALLSNSS MLPTLSDTYI PQCSTDGQWR QVQCNGPPEQ 

       790        800        810        820        830        840 
VFELYQRWEA QNKGQDLTPA KLLVKIMSYR EAASGNFSLF IQSLYEAGQQ DVFPVLSQYP 

       850        860        870        880        890        900 
SLQDVPLAAL EGKRPQPREN ILLEPYLFWQ ILNGQLSQYP GSYSDFSTPL AHFDLRNCWC 

       910        920        930        940        950        960 
VDEAGQELEG MRSEPSKLPT CPGSCEEAKL RVLQFIRETE EIVSASNSSR FPLGESFLVA 

       970        980        990       1000       1010       1020 
KGIRLRNEDL GLPPLFPPRE AFAEQFLRGS DYAIRLAAQS TLSFYQRRRF SPDDSAGASA 

      1030       1040       1050       1060       1070       1080 
LLRSGPYMPQ CDAFGSWEPV QCHAGTGHCW CVDEKGGFIP GSLTARSLQI PQCPTTCEKS 

      1090       1100       1110       1120       1130       1140 
RTSGLLSSWK QARSQENPSP KDLFVPACLE TGEYARLQAS GAGTWCVDPA SGEELRPGSS 

      1150       1160       1170       1180       1190       1200 
SSAQCPSLCN VLKSGVLSRR VSPGYVPACR AEDGGFSPVQ CDQAQGSCWC VMDSGEEVPG 

      1210       1220       1230       1240       1250       1260 
TRVTGGQPAC ESPRCPLPFN ASEVVGGTIL CETISGPTGS AMQQCQLLCR QGSWSVFPPG 

      1270       1280       1290       1300       1310       1320 
PLICSLESGR WESQLPQPRA CQRPQLWQTI QTQGHFQLQL PPGKMCSADY ADLLQTFQVF 

      1330       1340       1350       1360       1370       1380 
ILDELTARGF CQIQVKTFGT LVSIPVCNNS SVQVGCLTRE RLGVNVTWKS RLEDIPVASL 

      1390       1400       1410       1420       1430       1440 
PDLHDIERAL VGKDLLGRFT DLIQSGSFQL HLDSKTFPAE TIRFLQGDHF GTSPRTWFGC 

      1450       1460       1470       1480       1490       1500 
SEGFYQVLTS EASQDGLGCV KCPEGSYSQD EECIPCPVGF YQEQAGSLAC VPCPVGRTTI 

      1510       1520       1530       1540       1550       1560 
SAGAFSQTHC VTDCQRNEAG LQCDQNGQYR ASQKDRGSGK AFCVDGEGRR LPWWETEAPL 

      1570       1580       1590       1600       1610       1620 
EDSQCLMMQK FEKVPESKVI FDANAPVAVR SKVPDSEFPV MQCLTDCTED EACSFFTVST 

      1630       1640       1650       1660       1670       1680 
TEPEISCDFY AWTSDNVACM TSDQKRDALG NSKATSFGSL RCQVKVRSHG QDSPAVYLKK 

      1690       1700       1710       1720       1730       1740 
GQGSTTTLQK RFEPTGFQNM LSGLYNPIVF SASGANLTDA HLFCLLACDR DLCCDGFVLT 

      1750       1760       1770       1780       1790       1800 
QVQGGAIICG LLSSPSVLLC NVKDWMDPSE AWANATCPGV TYDQESHQVI LRLGDQEFIK 

      1810       1820       1830       1840       1850       1860 
SLTPLEGTQD TFTNFQQVYL WKDSDMGSRP ESMGCRKDTV PRPASPTEAG LTTELFSPVD 

      1870       1880       1890       1900       1910       1920 
LNQVIVNGNQ SLSSQKHWLF KHLFSAQQAN LWCLSRCVQE HSFCQLAEIT ESASLYFTCT 

      1930       1940       1950       1960       1970       1980 
LYPEAQVCDD IMESNAQGCR LILPQMPKAL FRKKVILEDK VKNFYTRLPF QKLMGISIRN 

      1990       2000       2010       2020       2030       2040 
KVPMSEKSIS NGFFECERRC DADPCCTGFG FLNVSQLKGG EVTCLTLNSL GIQMCSEENG 

      2050       2060       2070       2080       2090       2100 
GAWRILDCGS PDIEVHTYPF GWYQKPIAQN NAPSFCPLVV LPSLTEKVSL DSWQSLALSS 

      2110       2120       2130       2140       2150       2160 
VVVDPSIRHF DVAHVSTAAT SNFSAVRDLC LSECSQHEAC LITTLQTQPG AVRCMFYADT 

      2170       2180       2190       2200       2210       2220 
QSCTHSLQGQ NCRLLLREEA THIYRKPGIS LLSYEASVPS VPISTHGRLL GRSQAIQVGT 

      2230       2240       2250       2260       2270       2280 
SWKQVDQFLG VPYAAPPLAE RRFQAPEPLN WTGSWDASKP RASCWQPGTR TSTSPGVSED 

      2290       2300       2310       2320       2330       2340 
CLYLNVFIPQ NVAPNASVLV FFHNTMDREE SEGWPAIDGS FLAAVGNLIV VTASYRVGVF 

      2350       2360       2370       2380       2390       2400 
GFLSSGSGEV SGNWGLLDQV AALTWVQTHI RGFGGDPRRV SLAADRGGAD VASIHLLTAR 

      2410       2420       2430       2440       2450       2460 
ATNSQLFRRA VLMGGSALSP AAVISHERAQ QQAIALAKEV SCPMSSSQEV VSCLRQKPAN 

      2470       2480       2490       2500       2510       2520 
VLNDAQTKLL AVSGPFHYWG PVIDGHFLRE PPARALKRSL WVEVDLLIGS SQDDGLINRA 

      2530       2540       2550       2560       2570       2580 
KAVKQFEESR GRTSSKTAFY QALQNSLGGE DSDARVEAAA TWYYSLEHST DDYASFSRAL 

      2590       2600       2610       2620       2630       2640 
ENATRDYFII CPIIDMASAW AKRARGNVFM YHAPENYGHG SLELLADVQF ALGLPFYPAY 

      2650       2660       2670       2680       2690       2700 
EGQFSLEEKS LSLKIMQYFS HFIRSGNPNY PYEFSRKVPT FATPWPDFVP RAGGENYKEF 

      2710       2720       2730       2740       2750       2760 
SELLPNRQGL KKADCSFWSK YISSLKTSAD GAKGGQSAES EEEELTAGSG LREDLLSLQE 


PGSKTYSK 

P01266 in FASTA format

View entry in raw text format (no links)
Report form for errors/updates in this UniProtKB/Swiss-Prot entry

BLAST logo BLAST submission on ExPASy/SIB
or at NCBI (USA)
Tools Sequence analysis tools: ProtParam, ProtScale, Compute pI/Mw, PeptideMass, PeptideCutter, Dotlet (Java)
PROSITE logo ScanProsite, MotifScan SWISS-MODEL Submit a homology modeling request to SWISS-MODEL
NPSA logo NPSA Sequence analysis tools

ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
 Hosted by ch flag SIB Switzerland Mirror sites: Australia  Brazil  Canada  China  Korea
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!