MOD_SUMO_for_1

Accession:	ELME000002
Functional site class:	Sumoylation site
Functional site description:	Sumoylation is a common PTM of nuclear proteins that affects their functional status. SUMO belongs to the large multiprotein family of Ubiquitin-like proteins. The sumoylation modification is achieved by a typical E1-, E2- and E3-ligase based system. Many transcription factors, chromatin proteins and proteins involved in other nuclear functions as well as the nuclear pores are sumoylated. Sumoylation is known to cause dramatic rearrangements of the subnuclear location of modified proteins.
ELMs with same tags:	Sumo: LIG_KEPE_1 LIG_KEPE_2 LIG_KEPE_3 LIG_SUMO_SIM_anti_2 LIG_SUMO_SIM_par_1
ELMs with same func. site:	LIG_KEPE_1 LIG_KEPE_2 LIG_KEPE_3 MOD_SUMO_for_1 MOD_SUMO_rev_2
ELM Description:	Motif recognised for modification by SUMO-1
Pattern:	`[VILMAFP](K).E`
Pattern Probability:	`0.0019140`
Present in taxon:	Eukaryota
Interaction Domain:	UQ_con (PF00179) Ubiquitin-conjugating enzyme (Stochiometry: 1 : 1) PDB Structure: 1KPS

o See 46 Instances for MOD_SUMO_for_1

o Abstract

The SUMO proteins are Small Ubiquitin-related MOdifiers that are covalently conjugated onto lysine residues within target proteins (Tang,2008, Anckar,2007, Geiss-Friedlander,2007). Invertebrates have a single SUMO gene, whereas the SUMO family in vertebrates has three members; SUMO-1, SUMO-2, and SUMO-3. The SUMO proteins are synthesized as inactive precursors, which are processed by SUMO-specific carboxy-terminal hydrolases, resulting in novel double-glycine C-termini. The mature SUMO proteins are then activated by the Aos1/Uba2 activating enzyme (E1) and transferred to the Ubc9 conjugating enzyme (E2). Eventually, the SUMO protein is covalently linked to the target protein by the formation of an isopeptide bond between the carboxyl terminus of SUMO and an epsilon-amino group of a lysine residue of the target protein. The reaction is aided by an E3 ligase, e.g. mammalian PIAS1. This process, termed sumoylation, is reversible by certain SENP family desumoylating proteases.
Most sumoylated proteins are nuclear, and three main functional roles of SUMO have been proposed. (i) Protein targeting: sumoylation has been shown to be important for nuclear import of the RanGAP1 protein, and for recruiting proteins to subnuclear protein complexes (e.g. promyelocytic leukemia protein (PML) to PML nuclear bodies). (ii) Enhancement of protein stability by potential competition with (and inhibition of) ubiquitination. (iii) Transcriptional control (e.g. negative regulation of transcription from the androgen receptor).
A core motif (PhiKxE) has been identified as the sumoylation target for SUMO-1 (Endter,2001, Poukka,2000, Sternsdorf,1999). SUMO-2/3 themselves contain a PhiKxE site, in contrast to SUMO-1, and can thus form polymeric chains (shown in vitro, and in vivo for SUMO-2) (Tatham,2001). The Crystal structure of a complex between SUMO-1, E2 Ligase UBC9, E3 Ligase RanGAP1 and the target protein RanBP2 (3UIP, Gareau,2012) shows, that the hydrophobic residue (leucine) and glutamic acid bind to the E2 ligase UBC9 and stabilize lysine, which fits into the catalytic pocket of UBC9. The E3 ligase RanGAP1 binds SUMO-1 with a SIM (SUMO interacting motif) LIG_SUMO_SIM_par_1 motif and stabilizes the N-terminus of SUMO-1 which also fits into the catalytic pocket of UBC9 where the covalent binding of SUMO-1 to RanBP2 takes place.
A number of reports in the literature suggest modified versions of the core motif (PhiKxE):
(i) KEPE motif (LIG_KEPE_1, LIG_KEPE_2 and LIG_KEPE_3): A bioinformatics survey of nuclear proteins revealed a common extended SUMO site, termed the KEPE motif (Diella,2008) in transcriptional and chromatin proteins. The function of the KEPE motif remains to be determined.
(ii) PDSM motif: A Phosphorylation-Dependent Sumo Motif (PhiKxExxSP) has been found (Hietakangas,2006) which consists of the core motif with a phosphorylation site. It was found to regulate sumoylation of the transcription regulators GATA-1, MEF2a and HFSs (Hietakangas,2006).
(iii) NDSM motif: Another extended version of the core motif is the NDSM (Negatively charged amino acid-Dependent Sumoylation Motif), which has clusters of acidic residues downstream from the core motif (Yang,2006). These acidic clusters help to increase the efficiency of sumoylation, for example for the transcription factor ELK-1.
Beside these modified versions of the core motif, multiple high throughput mass spectrometry studies (Matic,2010, Tammsalu,2014, Impens,2014, Hendriks,2014) have shown that an inverted version (E/DxK) of the motif is also used, but less commonly. Yung-Kang Lee et al. (Lee,2007) confirmed the inverted SUMOylation site in TRIM28 by a combination of proteomic screening and site- directed mutagenesis. These finding have also been confirmed by another study (Ivanov,2007). Currently there is no crystal structure of the inverted motif available. There are also some sumoylation sites that do not fit to either the canonical or the reverse motif. ELM annotators have noted that these seem to always be at sites in folded globular domains, raising the possibility that there is a structural motif at these sites, in contrast to the more typical short linear motifs.

o 9 selected references:

The nuclear dot protein sp100, characterization of domains necessary for dimerization, subcellular localization, and modification by small ubiquitin-like modifiers.
Sternsdorf T, Jensen K, Reich B, Will H
J Biol Chem 1999 Apr 30; 274 (18), 12555-66
PMID: 10212234
SUMO--nonclassical ubiquitin.
Melchior F
Annu Rev Cell Dev Biol 2000; 16, 591-626
PMID: 11031248
Covalent modification of the androgen receptor by small ubiquitin-like modifier 1 (SUMO-1).
Poukka H, Karvonen U, Janne OA, Palvimo JJ
Proc Natl Acad Sci U S A 2000 Dec 19; 97 (26), 14145-50
PMID: 11121022
SUMO, ubiquitin's mysterious cousin.
Muller S, Hoege C, Pyrowolakis G, Jentsch S
Nat Rev Mol Cell Biol 2001 Mar; 2 (3), 202-10
PMID: 11265250
SUMO-1 modification required for transformation by adenovirus type 5 early region 1B 55-kDa oncoprotein.
Endter C, Kzhyshkowska J, Stauber R, Dobner T
Proc Natl Acad Sci U S A 2001 Sep 25; 98 (20), 11312-7
PMID: 11553772
An E3-like factor that promotes SUMO conjugation to the yeast septins.
Johnson ES, Gupta AA
Cell 2001 Sep 21; 106 (6), 735-44
PMID: 11572779
Concepts in sumoylation: a decade on.
Geiss-Friedlander R, Melchior F
Nat Rev Mol Cell Biol 2007 Dec; 8 (12), 947-56
PMID: 18000527
SUMO: getting it on.
Anckar J, Sistonen L
Biochem Soc Trans 2007 Dec; 35, 1409-13
PMID: 18031233
Protein interactions in the sumoylation cascade: lessons from X-ray structures.
Tang Z, Hecker CM, Scheschonka A, Betz H
FEBS J 2008 Jun; 275 (12), 3003-15
PMID: 18492068

o 8 GO-Terms:

Biological Process:
Sumo-Dependent Protein Catabolism (also annotated in class: )
Protein Sumoylation (also annotated in these classes: MOD_SUMO_rev_2 )
Regulation Of Gene Expression (also annotated in these classes: MOD_SUMO_rev_2 )
Sumoylation (also annotated in class: )
Protein-Nucleus Import (also annotated in these classes: MOD_PRMT_GGRGG_1 MOD_SUMO_rev_2 TRG_NLS_Bipartite_1 TRG_NLS_MonoCore_2 TRG_NLS_MonoExtC_3 TRG_NLS_MonoExtN_4 )
Negative Regulation Of Transcription (also annotated in these classes: LIG_CORNRBOX LIG_CtBP_PxDLS_1 LIG_EH1_1 LIG_WRPW_1 LIG_WRPW_2 )
Cellular Compartment:
Nucleus (also annotated in these classes: CLV_C14_Caspase3-7 CLV_Separin_Fungi CLV_Separin_Metazoa CLV_TASPASE1 DEG_APCC_DBOX_1 DEG_APCC_KENBOX_2 DEG_APCC_TPR_1 DEG_Cend_DCAF12_1 DEG_Cend_FEM1AC_1 DEG_Cend_FEM1B_2 DEG_Cend_KLHDC2_1 DEG_Cend_TRIM7_1 DEG_COP1 DEG_COP1_1 DEG_CRL4_CDT2_1 DEG_CRL4_CDT2_2 DEG_Kelch_Keap1_1 DEG_Kelch_Keap1_2 DEG_MDM2_SWIB_1 DEG_ODPH_VHL_1 DEG_SCF_COI1_1 DEG_SCF_FBW7_1 DEG_SCF_FBW7_2 DEG_SCF_FBXO31_1 DEG_SCF_SKP2-CKS1_1 DEG_SCF_TIR1_1 DEG_SCF_TRCP1_1 DEG_SIAH_1 DEG_SPOP_SBC_1 DOC_ANK_TNKS_1 DOC_CDC14_PxL_1 DOC_CKS1_1 DOC_CYCLIN_D_Helix_1 DOC_CYCLIN_RevRxL_6 DOC_CYCLIN_RxL_1 DOC_CYCLIN_yClb1_LxF_4 DOC_CYCLIN_yClb3_PxF_3 DOC_CYCLIN_yClb5_NLxxxL_5 DOC_CYCLIN_yCln2_LP_2 DOC_MAPK_DCC_7 DOC_MAPK_FxFP_2 DOC_MAPK_gen_1 DOC_MAPK_GRA24_9 DOC_MAPK_HePTP_8 DOC_MAPK_JIP1_4 DOC_MAPK_MEF2A_6 DOC_MAPK_NFAT4_5 DOC_MAPK_RevD_3 DOC_PIKK_1 DOC_PP1_MyPhoNE_1 DOC_PP1_RVXF_1 DOC_PP1_SILK_1 DOC_PP2A_B56_1 DOC_PP2A_KARD_1 DOC_PP2B_LxvP_1 DOC_PP2B_PxIxIT_1 DOC_PP4_FxxP_1 DOC_PP4_MxPP_1 DOC_USP7_MATH_1 DOC_USP7_MATH_2 DOC_USP7_UBL2_3 DOC_WW_Pin1_4 LIG_14-3-3_CanoR_1 LIG_14-3-3_ChREBP_3 LIG_14-3-3_CterR_2 LIG_ANK_PxLPxL_1 LIG_APCC_ABBA_1 LIG_APCC_Cbox_1 LIG_APCC_Cbox_2 LIG_ARL_BART_1 LIG_ARS2_EDGEI_1 LIG_BRCT_BRCA1_1 LIG_BRCT_BRCA1_2 LIG_BRCT_MDC1_1 LIG_CaM_1-14-15-16_REV_1 LIG_CaMK_CASK_1 LIG_CORNRBOX LIG_CSL_BTD_1 LIG_CtBP_PxDLS_1 LIG_CtBP_RRT_2 LIG_DCNL_PONY_1 LIG_EF_ALG2_ABM_1 LIG_EF_ALG2_ABM_2 LIG_EH1_1 LIG_FHA_1 LIG_FHA_2 LIG_GLEBS_BUB3_1 LIG_HCF-1_HBM_1 LIG_HOMEOBOX LIG_HP1_1 LIG_IRF7_LxLS_2 LIG_IRFs_LxIS_1 LIG_KEPE_1 LIG_KEPE_2 LIG_KEPE_3 LIG_LEDGF_IBM_1 LIG_LSD1_SNAG_1 LIG_MAD2 LIG_Menin_MBM1_1 LIG_MLH1_MIPbox_1 LIG_MSH2_SHIPbox_1 LIG_MTR4_AIM_1 LIG_Mtr4_Air2_1 LIG_Mtr4_Trf4_1 LIG_Mtr4_Trf4_2 LIG_MYND_1 LIG_MYND_2 LIG_MYND_3 LIG_NBox_RRM_1 LIG_NRBOX LIG_Nrd1CID_NIM_1 LIG_PALB2_WD40_1 LIG_PCNA_APIM_2 LIG_PCNA_PIPBox_1 LIG_PCNA_TLS_4 LIG_PCNA_yPIPBox_3 LIG_PTAP_UEV_1 LIG_RBL1_LxSxE_2 LIG_RB_LxCxE_1 LIG_RB_pABgroove_1 LIG_REV1ctd_RIR_1 LIG_RPA_C_Plants LIG_RPA_C_Vert LIG_RRM_PRI_1 LIG_Rrp6Rrp47_Mtr4_1 LIG_Sin3_1 LIG_Sin3_2 LIG_Sin3_3 LIG_SUFU_1 LIG_SUMO_SIM_anti_2 LIG_SUMO_SIM_par_1 LIG_TPR LIG_Trf4_IWRxY_1 LIG_TRFH_1 LIG_UBA3_1 LIG_ULM_U2AF65_1 LIG_VCP_SHPBox_1 LIG_VCP_VBM_3 LIG_VCP_VIM_2 LIG_WD40_WDR5_VDV_1 LIG_WD40_WDR5_VDV_2 LIG_WD40_WDR5_WIN_1 LIG_WD40_WDR5_WIN_2 LIG_WD40_WDR5_WIN_3 LIG_WRPW_1 LIG_WRPW_2 LIG_WW_2 MOD_AAK1BIKe_LxxQxTG_1 MOD_CDC14_SPxK_1 MOD_CDK_SPK_2 MOD_CDK_SPxK_1 MOD_CDK_SPxxK_3 MOD_CK1_1 MOD_CK2_1 MOD_DYRK1A_RPxSP_1 MOD_GSK3_1 MOD_NEK2_1 MOD_NEK2_2 MOD_PIKK_1 MOD_PKA_1 MOD_PKA_2 MOD_PKB_1 MOD_PLK MOD_Plk_1 MOD_Plk_2-3 MOD_Plk_4 MOD_PRMT_GGRGG_1 MOD_ProDKin_1 MOD_SUMO_rev_2 ELM:old_LIG_14-3-3_1 ELM:old_LIG_14-3-3_2 ELM:old_LIG_14-3-3_3 TRG_NES_CRM1_1 TRG_NESrev_CRM1_2 TRG_NLS_Bipartite_1 TRG_NLS_MonoCore_2 TRG_NLS_MonoExtC_3 TRG_NLS_MonoExtN_4 )
Pml Body (also annotated in these classes: LIG_RPA_C_Plants LIG_RPA_C_Vert LIG_SUMO_SIM_anti_2 LIG_SUMO_SIM_par_1 MOD_SUMO_rev_2 )

o 46 Instances for MOD_SUMO_for_1
(click table headers for sorting; Notes column: =Number of Switches, =Number of Interactions)

Acc., Gene-, Name	Start	End	Subsequence	Logic	#Ev.	Organism	Notes
Q2GIB5 ampA AMPA_ANAPZ	134	137	PEKAVVRFFKIEKSAAEEPQ	TP	3	Anaplasma phagocytophilum str. HZ
Q16665 HIF1A HIF1A_HUMAN	531	534	VDSDMVNEFKLELVEKLFAE	TN	4	Homo sapiens (Human)
P29590 PML PML_HUMAN	489	492	QTQCPRKVIKMESEEGKEAR	TP	9	Homo sapiens (Human)
Q16665 HIF1A HIF1A_HUMAN	476	479	PALNQEVALKLEPNPESLEL	TP	7	Homo sapiens (Human)
Q16665 HIF1A HIF1A_HUMAN	390	393	DTSSLFDKLKKEPDALTLLA	TP	7	Homo sapiens (Human)
P37238 Pparg PPARG_MOUSE	106	109	KLQEYQSAIKVEPASPPYYS	TP	5	Mus musculus (House mouse)
P37231 PPARG PPARG_HUMAN	106	109	KLQEYQSAIKVEPASPPYYS	TP	3	Homo sapiens (Human)
Q02078 MEF2A MEF2A_HUMAN	402	405	INTNQNISIKSEPISPPRDR	TP	3	Homo sapiens (Human)
P32457 CDC3 CDC3_YEAST	3	6	MSLKEEQVSIKQDPEQEERQ	TP	2	Saccharomyces cerevisiae (Baker"s yeast)
Q9NSC2 SALL1 SALL1_HUMAN	1085	1088	PANSLSSLIKTEVNGFVHVS	TP	1	Homo sapiens (Human)
Q14526 HIC1 HIC1_HUMAN	332	335	GPSLLYRWMKHEPGLGSYGD	TP	3	Homo sapiens (Human)
P46061 Rangap1 RAGP1_MOUSE	525	528	RLLIHMGLLKSEDKIKAIPS	TP	1	Mus musculus (House mouse)	1KPS 1KPS
P61956 SUMO2 SUMO2_HUMAN	10	13	ADEKPKEGVKTENNDHINLK	TP	1	Homo sapiens (Human)
P10242 MYB MYB_HUMAN	502	505	LVEDLQDVIKQESDESGIVA	TP	1	Homo sapiens (Human)
P10242 MYB MYB_HUMAN	526	529	NGPPLLKKIKQEVESPTDKS	TP	1	Homo sapiens (Human)
Q15596 NCOA2 NCOA2_HUMAN	784	787	ASNTKLIAMKTEKEEMSFEP	FP	1	Homo sapiens (Human)
P15873 POL30 PCNA_YEAST	126	129	LMDIDADFLKIEELQYDSTL	TP	1	Saccharomyces cerevisiae (Baker"s yeast)
Q07657 SHS1 SHS1_YEAST	425	428	PVRQLGREIKQENENLIRSI	TP	1	Saccharomyces cerevisiae (Baker"s yeast)
Q9UPW6 SATB2 SATB2_HUMAN	232	235	RWYKKYKKIKVERVERENLS	TP	2	Homo sapiens (Human)
P56524 HDAC4 HDAC4_HUMAN	558	561	AHAQAGVQVKQEPIESDEEE	TP	1	Homo sapiens (Human)
Q07657 SHS1 SHS1_YEAST	436	439	ENENLIRSIKTESSPKFLNS	TP	1	Saccharomyces cerevisiae (Baker"s yeast)
P55854 SUMO3 SUMO3_HUMAN	10	13	SEEKPKEGVKTENDHINLKV	TP	1	Homo sapiens (Human)
P23497 SP100 SP100_HUMAN	296	299	SCSVRLVDIKKEKPFSNSKV	TP	1	Homo sapiens (Human)
Q02447 SP3 SP3_HUMAN	550	553	ADSPADIRIKEEEPDPEEWQ	TP	1	Homo sapiens (Human)
Q13569 TDG TDG_HUMAN	329	332	QEDAKKMAVKEEKYDPGYEA	TP	1	Homo sapiens (Human)
P13202 UL123 VIE1_HCMVA	449	452	EEREDTVSVKSEPVSEIEEV	TP	1	Human herpesvirus 5 strain AD169
Q9UPW6 SATB2 SATB2_HUMAN	349	352	NHPPIPRAVKPEPTNSSVEV	TP	2	Homo sapiens (Human)
P04637 TP53 P53_HUMAN	385	388	GQSTSRHKKLMFKTEGPDSD	TP	1	Homo sapiens (Human)
Q15596 NCOA2 NCOA2_HUMAN	238	241	FAVSQPKSIKEEGEDLQSCL	TP	1	Homo sapiens (Human)
P49716 CEBPD CEBPD_HUMAN	119	122	PGPAAPRLLKREPDWGDGDA	TP	1	Homo sapiens (Human)
Q15744 CEBPE CEBPE_HUMAN	120	123	SYDPRAVAVKEEPRGPEGSR	TP	1	Homo sapiens (Human)
P29590 PML PML_HUMAN	159	162	CFEAHQWFLKHEARPLAELR	TP	1	Homo sapiens (Human)
P06401 PGR PRGR_HUMAN	387	390	DFQPPALKIKEEEEGAEASA	TP	1	Homo sapiens (Human)
P49715 CEBPA CEBPA_HUMAN	160	163	APALRPLVIKQEPREEDEAK	TP	1	Homo sapiens (Human)
Q15596 NCOA2 NCOA2_HUMAN	730	733	TAPGSEVTIKQEPVSPKKKE	TP	1	Homo sapiens (Human)
Q92754 TFAP2C AP2C_HUMAN	9	12	MLWKITDNVKYEEDCEDRHD	TP	1	Homo sapiens (Human)
P27540 ARNT ARNT_HUMAN	244	247	ILDLKTGTVKKEGQQSSMRM	TP	1	Homo sapiens (Human)
P32458 CDC11 CDC11_YEAST	411	414	KELREIEARLEKEAKIKQEE	TP	1	Saccharomyces cerevisiae (Baker"s yeast)
P17676 CEBPB CEBPB_HUMAN	173	176	PPPPPPAELKAEPGFEPADC	TP	1	Homo sapiens (Human)
P04150 NR3C1 GCR_HUMAN	276	279	PSNVTLPQVKTEKEDFIELC	TP	1	Homo sapiens (Human)
P04150 NR3C1 GCR_HUMAN	292	295	IELCTPGVIKQEKLGTVYCQ	TP	1	Homo sapiens (Human)
Q13547 HDAC1 HDAC1_HUMAN	443	446	SNFKKAKRVKTEDEKEKDPE	TP	1	Homo sapiens (Human)
Q13547 HDAC1 HDAC1_HUMAN	475	478	EKTKEEKPEAKGVKEEVKLA	TP	1	Homo sapiens (Human)
Q00613 HSF1 HSF1_HUMAN	297	300	LSSSPLVRVKEEPPSPPQSP	TP	1	Homo sapiens (Human)
P27782 Lef1 LEF1_MOUSE	24	27	CATDEMIPFKDEGDPQKEKI	TP	1	Mus musculus (House mouse)
P27782 Lef1 LEF1_MOUSE	266	269	HPAIVTPQVKQEHPHTDSDL	TP	1	Mus musculus (House mouse)

Please cite: ELM-the Eukaryotic Linear Motif resource-2024 update. (PMID:37962385)

ELM data can be downloaded & distributed for non-commercial use according to the ELM Software License Agreement

feedback@elm.eu.org