DRSC/TRiP Functional Genomics Resources




Protein Alignment SETD1A and Setd1a

DIOPT Version: 10

Sequence 1: NP_055527.1 Gene: SETD1A / 9739 HGNC ID: 29010 Length: 1707 Species: Homo sapiens
Sequence 2: NP_821172.2 Gene: Setd1a / 233904 MGI ID: 2446244 Length: 1716 Species: Mus musculus


Alignment Length: 1725
Identity: 1547/1725 (89%)
Similarity: 1590/1725 (92%)
Gaps: 27/1725 (1%)
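
The summary figures above can be re-derived with a little arithmetic. The following is a minimal sketch, not part of DIOPT: the helper `percent_floor` is a hypothetical name, and truncating (rather than rounding) is an assumption inferred from the reported values, since 1547/1725 is about 89.7% and 27/1725 is about 1.6%. It also checks that the gap count is consistent with the two sequence lengths.

```python
# Values copied from the alignment header above.
HUMAN_LENGTH = 1707      # NP_055527.1
MOUSE_LENGTH = 1716      # NP_821172.2
ALIGNMENT_LENGTH = 1725
COUNTS = {"identity": 1547, "similarity": 1590, "gaps": 27}


def percent_floor(count, total):
    """Integer percentage, truncated (assumption: the page truncates, not rounds)."""
    return (100 * count) // total


percentages = {name: percent_floor(n, ALIGNMENT_LENGTH) for name, n in COUNTS.items()}

# Each alignment column holds either a residue or a gap for a given sequence,
# so the number of gap columns in each row follows from the lengths.
gaps_in_human_row = ALIGNMENT_LENGTH - HUMAN_LENGTH
gaps_in_mouse_row = ALIGNMENT_LENGTH - MOUSE_LENGTH

print(percentages)
print(gaps_in_human_row, gaps_in_mouse_row, gaps_in_human_row + gaps_in_mouse_row)
```

The two per-row gap counts sum to the reported total of 27, which matches the dashes visible in the Human and Mouse rows of the alignment.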


- Residues shown in green on the original page have known domain annotations, which are detailed below.


Human     1 MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVNDSKYIPVEDLQDPRC 65
            |||||||||||||||||||||||||||||||||||||||||||||||||:||||.||||||||||
Mouse     1 MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVSDSKYTPVEDLQDPRC 65

Human    66 HVRSKNRDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEVEEVEILLHPR 130
            |||||.|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Mouse    66 HVRSKARDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEVEEVEILLHPR 130

Human   131 TRKHLGLARVLFTSTRGAKETVKNLHLTSVMGNIIHAQLDIKGQQRMKYYELIVNGSYTPQTVPT 195
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Mouse   131 TRKHLGLARVLFTSTRGAKETVKNLHLTSVMGNIIHAQLDIKGQQRMKYYELIVNGSYTPQTVPT 195

Human   196 GGKALSEKFQGSGAATETAESRRRSSSDTAAYPAGTTAVGTPGNGTPCSQDTSFSSSRQDTPSSF 260
            |||||||||||||||.||.|:||||||||||||||||..|||||||||||||:||||||||||||
Mouse   196 GGKALSEKFQGSGAAAETTEARRRSSSDTAAYPAGTTVGGTPGNGTPCSQDTNFSSSRQDTPSSF 260

Human   261 GQFTPQSSQGTPYTSRGSTPYSQDSAYSSSTTSTSFKPRRSENSYQDAFSRRHFSASSASTTAST 325
            |||||||||||||||||||||||||||||||||||||||||||||||:|||||||.|||..|.:|
Mouse   261 GQFTPQSSQGTPYTSRGSTPYSQDSAYSSSTTSTSFKPRRSENSYQDSFSRRHFSTSSAPATTAT 325

Human   326 AIAATTAATASSSASSSSLSSSSSSSSSSSSSQFRSSDANYPAYYESWNRYQRHTSYPPRRATRE 390
            |.:||.||||:||:||||.|||||||||||:||||.||::|||||||||||||||||||||||||
Mouse   326 ATSATAAATAASSSSSSSSSSSSSSSSSSSASQFRGSDSSYPAYYESWNRYQRHTSYPPRRATRE 390

Human   391 EPPGAPFAENTAERFPPSYTSYLPPEPSRPTDQDYRPPASEAPPPEPPEP-------------GG 442
            :|.||.|||||||||||||||||.|||:|.||||||||||||||||||||             ||
Mouse   391 DPSGASFAENTAERFPPSYTSYLAPEPNRSTDQDYRPPASEAPPPEPPEPGGGGGGSGGGGGGGG 455

Human   443 GGGGGGPSPEREEVRTSPRPASPARSGSPAPETTNESVPFAQHSSLDSRIEMLLKEQRSKFSFLA 507
            |||||.|||||||.||.||||||||||||||||||||||||||||||||||||||||||||||||
Mouse   456 GGGGGAPSPEREEARTPPRPASPARSGSPAPETTNESVPFAQHSSLDSRIEMLLKEQRSKFSFLA 520

Human   508 SDTEEEEENSSMVLGARDTGSEVPSGSGHGPCTPPPAPANFEDVAPTGSGEPGATRESPKANGQN 572
            |||||||||||...||||.|:|||||:|||||||||||||||||||||||||||.||||||||||
Mouse   521 SDTEEEEENSSAGPGARDAGAEVPSGAGHGPCTPPPAPANFEDVAPTGSGEPGAARESPKANGQN 585

Human   573 QASPCSSGDDMEISDDDRGGSPPPAPTPPQQ-PPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPP 636
            ||||||||:|||||||||||||||||||||| |||||||||||||||||||||||||||||||||
Mouse   586 QASPCSSGEDMEISDDDRGGSPPPAPTPPQQPPPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPP 650

Human   637 RPDGPPPPEY-PPPPPPPPHIYDFVNSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLI 700
            |||||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||||.
Mouse   651 RPDGPPPPEYPPPPPPPPPHIYDFVNSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLT 715

Human   701 AASAGPPGGAFGEAFLPFPPPQEAAYGLPYALYAQGQEGRGAYSREAYHLPMPMAAEPLPSSSVS 765
            |||||||||||||||||||||||||||||||||.|||||||:|||||||||:|||||||||||||
Mouse   716 AASAGPPGGAFGEAFLPFPPPQEAAYGLPYALYTQGQEGRGSYSREAYHLPLPMAAEPLPSSSVS 780

Human   766 GEEARLPPREEAELAEGKTLPTAGTVGRVLAMLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESK 830
            |||||||.|||||:||.|.||:|||||||||.|||||||||||||||||||||||||||||||||
Mouse   781 GEEARLPHREEAEIAESKVLPSAGTVGRVLATLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESK 845

Human   831 EEKAKPFQNAAKQQAKEEDKEKTKLKEPGLLSLVDWAKSGGTTGIEAFAFGSGLRGALRLPSFKV 895
            ||||||||||||||||||||||.||||||:|||||||||||.|||||||||||||||||||||||
Mouse   846 EEKAKPFQNAAKQQAKEEDKEKMKLKEPGMLSLVDWAKSGGITGIEAFAFGSGLRGALRLPSFKV 910

Human   896 KRKEPSEISEASEEKRPRPSTPAEEDEDDPEQEKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFA 960
            |||||||||||||||||||||||||||||||:||||||||||||||||||||||||||||||||.
Mouse   911 KRKEPSEISEASEEKRPRPSTPAEEDEDDPEREKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFT 975

Human   961 LDSEGEEASQESSSEKDEEDDEEDEEDEDREEAVDTTKKETEVSDGEDEESDSSSKCSLYADSDG 1025
            ||||||||||||||||||:||:||||||::|||||.||||.|.||||||:|||||:|||||||||
Mouse   976 LDSEGEEASQESSSEKDEDDDDEDEEDEEQEEAVDATKKEAEASDGEDEDSDSSSQCSLYADSDG 1040

Human  1026 ENDSTSDSESSSSSSSSSSSSSSSSSSSSSSSSESSSEDEEEEERPAALPSASPPPREVPVPTPA 1090
            ||.|||||||.||||||||||||||||||.|||       ||||:.|.:|||| ||||||.|.||
Mouse  1041 ENGSTSDSESGSSSSSSSSSSSSSSSSSSESSS-------EEEEQSAVIPSAS-PPREVPEPLPA 1097

Human  1091 PVEVPVPERVAGSPVTPLPEQEASPARPAGPTEESPPSAPLRPPEPPAGPPAPAPRPDERPSSPI 1155
            |.|.|..:.:..|||.||.|:|..|.:||||.||.|||.|..|.|||||||..|||.||||||||
Mouse  1098 PDEKPETDGLVDSPVMPLSEKETLPTQPAGPAEEPPPSVPQPPAEPPAGPPDAAPRLDERPSSPI 1162

Human  1156 PLLPPPKKRRKTVSFSAIEVVPAPEPPPATPPQAKFPGPASRKAPRGVERTIRNLPLDHASLVKS 1220
            |||||||||||||||||.|..|.|||..|.|.|||..||.|||.||.||||||||||||||||||
Mouse  1163 PLLPPPKKRRKTVSFSAAEEAPVPEPSTAAPLQAKSSGPVSRKVPRVVERTIRNLPLDHASLVKS 1227

Human  1221 WPEEVSRGGRSRAGGRGRLTEEEEA-EPGTEVDLAVLADLALTPARRGLPALPAVEDSEATETSD 1284
            |||||:||||:|||||.|.|||||| |.|||||||||||||||||||||..||..:|||||||||
Mouse  1228 WPEEVARGGRNRAGGRVRSTEEEEATESGTEVDLAVLADLALTPARRGLATLPTGDDSEATETSD 1292

Human  1285 EAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEPVPAPAALFSSPADEVLEAPEVVVAEAEEP 1349
            |||||.|||||||||||||||:||.|..||.||.||.||.|||||||||||||||||||||||||
Mouse  1293 EAERPSPLLSHILLEHNYALAIKPPPTTPAPRPLEPAPALAALFSSPADEVLEAPEVVVAEAEEP 1357

Human  1350 KPQQLQQQREEGEEEGEEEGEEEEEESSD--SSSSSDGEGALRRRSLRSHARRRRPPPPPPPPPP 1412
            | ||||||..|.|.|.|||.||||.|||:  ||||||.|||:||||||||.||||||.|||||||
Mouse  1358 K-QQLQQQHPEQEGEEEEEDEEEESESSESSSSSSSDEEGAIRRRSLRSHTRRRRPPLPPPPPPP 1421

Human  1413 RAYEPRSEFEQMTILYDIWNSGLDSEDMSYLRLTYERLLQQTSGADWLNDTHWVHHTITNLTTPK 1477
            .::|||||||||||||||||||||.|||||||||||||||||||||||||||||.||||||:|||
Mouse  1422 PSFEPRSEFEQMTILYDIWNSGLDLEDMSYLRLTYERLLQQTSGADWLNDTHWVQHTITNLSTPK 1486

Human  1478 RKRRPQDGPREHQTGSARSEGYYPISKKEKDKYLDVCPVSARQLEGVDTQGTNRVLSERRSEQRR 1542
            ||||||||||||||||||||||||||||||||||||||||||||||.||||||||||||||||||
Mouse  1487 RKRRPQDGPREHQTGSARSEGYYPISKKEKDKYLDVCPVSARQLEGGDTQGTNRVLSERRSEQRR 1551

Human  1543 LLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVA 1607
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Mouse  1552 LLSAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVA 1616

Human  1608 DMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQ 1672
            |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Mouse  1617 DMREKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQ 1681

Human  1673 PIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707
            |||||||||||||||||||||||||||||||||||
Mouse  1682 PIGVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1716

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence  Domain  Region  External ID  Identity
SETD1A NP_055527.1 Interaction with WDR82. /evidence=ECO:0000269|PubMed:37030068 60..89 27/28 (96%)
RRM_Set1A 92..186 CDD:409964 93/93 (100%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 194..308 107/113 (95%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 331..363 25/31 (81%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 381..486 95/117 (81%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 506..655 140/150 (93%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 834..854 19/19 (100%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 891..1251 288/360 (80%)
PRK12323 <1070..>1130 CDD:481241 34/59 (58%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1264..1293 22/28 (79%)
HCFC1-binding motif (HBM) 1299..1303 3/3 (100%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1307..1417 86/111 (77%)
Interaction with CFP1 1415..1450 32/34 (94%)
N-SET 1425..1562 CDD:463344 132/136 (97%)
Interaction with ASH2L, RBBP5 and WDR5. /evidence=ECO:0000269|PubMed:17998332 1450..1537 83/86 (97%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1472..1499 25/26 (96%)
WDR5 interaction motif (WIN). /evidence=ECO:0000269|PubMed:22266653, ECO:0000269|PubMed:22665483 1492..1497 4/4 (100%)
RxxxRR motif. /evidence=ECO:0000250|UniProtKB:P38827 1537..1542 4/4 (100%)
SET_SETD1 1556..1703 CDD:380946 146/146 (100%)
Setd1a NP_821172.2 Interaction with WDR82. /evidence=ECO:0000250|UniProtKB:O15047 60..89 27/28 (96%)
RRM_Set1A 92..186 CDD:409964 93/93 (100%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 194..367 150/172 (87%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 380..499 96/118 (81%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 516..670 143/153 (93%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 849..869 19/19 (100%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 911..1206 238/302 (79%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1230..1259 22/28 (79%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1275..1297 16/21 (76%)
HCFC1-binding motif (HBM). /evidence=ECO:0000250|UniProtKB:O15047 1307..1311 3/3 (100%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1355..1427 54/72 (75%)
Interaction with CFP1. /evidence=ECO:0000250|UniProtKB:O15047 1424..1459 32/34 (94%)
N-SET 1434..1571 CDD:463344 132/136 (97%)
Interaction with ASH2L, RBBP5 and WDR5. /evidence=ECO:0000250|UniProtKB:O15047 1459..1546 83/86 (97%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1480..1508 26/27 (96%)
WDR5 interaction motif (WIN). /evidence=ECO:0000250|UniProtKB:O15047 1501..1506 4/4 (100%)
RxxxRR motif. /evidence=ECO:0000250|UniProtKB:P38827 1546..1551 4/4 (100%)
SET_SETD1 1565..1712 CDD:380946 146/146 (100%)
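
Because the table above is flattened into one row per line, a small parser can help recover its columns. The following is a rough sketch under stated assumptions, not a DIOPT tool: `parse_domain_row` and `ROW_PATTERN` are hypothetical names, and the row layout (domain text, `start..end` region, optional `CDD:` identifier, `matches/compared (percent%)`) is inferred only from the rows shown here. The `<` and `>` markers on some region bounds are kept as text. The first row for each gene also carries the gene symbol and accession, which this sketch leaves inside the domain field.

```python
import re

# Layout inferred from the rows above (an assumption, not a documented format):
#   <domain text> <start>..<end> [CDD:<id>] <matches>/<compared> (<percent>%)
ROW_PATTERN = re.compile(
    r"^(?P<domain>.+?)\s+"
    r"(?P<start>[<>]?\d+)\.\.(?P<end>[<>]?\d+)\s+"
    r"(?:(?P<external_id>CDD:\d+)\s+)?"
    r"(?P<matches>\d+)/(?P<compared>\d+)\s+"
    r"\((?P<percent>\d+)%\)$"
)


def parse_domain_row(row):
    """Split one flattened table row into named fields, or return None."""
    match = ROW_PATTERN.match(row.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Convert the numeric identity fields; region bounds stay as strings
    # because they may carry a leading '<' or '>'.
    for key in ("matches", "compared", "percent"):
        fields[key] = int(fields[key])
    return fields


rrm = parse_domain_row("RRM_Set1A 92..186 CDD:409964 93/93 (100%)")
partial = parse_domain_row("PRK12323 <1070..>1130 CDD:481241 34/59 (58%)")
disordered = parse_domain_row(
    "Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 194..308 107/113 (95%)"
)
print(rrm)
```

Rows without an external identifier, such as the MobiDB-lite entries, come back with `external_id` set to `None`.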
