DRSC/TRiP Functional Genomics Resources


Protein Alignment SETD1A and Set1

DIOPT Version: 10

Sequence 1: NP_055527.1  Gene: SETD1A / 9739  HGNC ID: 29010  Length: 1707  Species: Homo sapiens
Sequence 2: NP_001015221.1  Gene: Set1 / 3354971  FlyBase ID: FBgn0040022  Length: 1641  Species: Drosophila melanogaster


Alignment Length: 1983
Identity: 490/1983 (24%)
Similarity: 738/1983 (37%)
Gaps: 642/1983 (32%)
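
The summary figures above are simple ratios over the alignment length. As a minimal sketch, the snippet below recomputes them to one decimal place from the counts reported on this page. The names `percent`, `counts`, and `percentages` are illustrative and not part of DIOPT, and the page does not state how it reduces each ratio to the whole-number percentage it displays.

```python
def percent(count: int, total: int) -> float:
    """Return count/total as a percentage (hypothetical helper)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * count / total


# Counts copied from the alignment summary on this page.
ALIGNMENT_LENGTH = 1983
counts = {"identity": 490, "similarity": 738, "gaps": 642}

# Recompute each ratio at higher precision than the page displays.
percentages = {name: percent(n, ALIGNMENT_LENGTH) for name, n in counts.items()}

for name, value in percentages.items():
    print(f"{name}: {counts[name]}/{ALIGNMENT_LENGTH} = {value:.1f}%")
```

Printing one decimal place makes it easier to compare these ratios with the per-domain identity values listed further down.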


- Green residues have known domain annotations, which are detailed below.


Human     8 DGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVNDSKY--IPVEDLQDPRCHVRSK 70
            |...|.|...||:||:    .||.|.:...::|||||:  ...|..|  |...|.::|...:|::
  Fly    18 DSSLANSKMPRNFKLL----SDPQLVKCGTRLYRYDGL--MPGDPSYPTITPRDPRNPLIRIRAR 76

Human    71 N-RDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEVEEVEILLHPRTRKH 134
            . ....|.:|:|.:|..|:||.|..|||...||||:.:.||..|..|.|..:|:.|..||.|.||
  Fly    77 AVEPLMLLIPRFVIDSDYVGQPPAVEVTIVNLNDNIDKQFLASMLDKCGTSDEINIYHHPITNKH 141

Human   135 LGLARVLFTSTRGAKETVKNLHLTSVMGNIIHAQLDIKGQQRMKYYELIVNGSYTPQTVPTGGKA 199
            ||:||::|.||:||::.|:..:..||||.|:....|..|....|..|.:.|.....|.:   |..
  Fly   142 LGIARIVFDSTKGARQFVEKYNQKSVMGKILDVFCDPFGATLKKSLESLTNSVAGKQLI---GPK 203

Human   200 LSEKFQGSGAATETAESRRRSSSDTAAYPAGTTAVGTPGNGTPCSQDTSFSSSRQDTPSSFGQ-- 262
            ::.::....||.|                                 ||.|.   ...|...|:  
  Fly   204 VTPQWTFQQAALE---------------------------------DTEFI---HGYPEKNGEHI 232

Human   263 ---FTPQSSQGTPYTSRG-----------STPYSQDSAYSS-----------STTSTSFKPRRSE 302
               :|.|::...|..||.           ...:.:.|.:||           ....||.:.||: 
  Fly   233 KDIYTTQTNHEIPNRSRDRNWNRDKERERDRHFKERSRHSSERSYDRDRGMRENVGTSIRRRRT- 296

Human   303 NSYQDAFSRRHFSASSASTTASTAIAATTAATASSSASSSSLSSSSSSS---------------- 351
                  |.||..|..|...:....|.....:..|.|.......|....|                
  Fly   297 ------FYRRRSSDISPEDSRDILIMTRERSRDSDSRPRDYCRSRERESFRDRKRSHEKGRDQPR 355

Human   352 -------SSSSSSQFRSSDANYPAYYES--------WNRYQRH--------------TSYPPRRA 387
                   :||...::|..|.:..|..:.        .:||..|              :||    .
  Fly   356 EKREHYYNSSKDREYRGRDRDRSAEIDQRDRGSLKYCSRYSLHEYIETDVRRSSNTISSY----Y 416

Human   388 TREEPPGAPFAENTAERFPP-----------SYTSYLPPEPSRPTDQDYRPPASEAPPPEPPEPG 441
            :....|.|....|:.. ||.           ::|::.|         |:.|   ..|||.|||..
  Fly   417 SASSLPIASHGFNSCS-FPSIENIKTWSDRRAWTAFQP---------DFHP---VQPPPPPPEEI 468

Human   442 GGGGGGGPSPEREEVRTSPRP---ASPARSGSPAPETTN-----ESVPFAQHSS------LDSRI 492
            ...      .|.|..:.|..|   ...|:...|.|...|     :||  .|.:|      ||:||
  Fly   469 DNW------DEEEHDKNSIVPTHYGCMAKLQPPVPSNVNFATKLQSV--TQPNSDPGTVDLDTRI 525

Human   493 EMLLKEQR--SKFSFLASDTEEEEENSSMVLGARDTGSEVPSGSGHGPCTPPPAPANFEDVAPTG 555
            .::.|.:.  :...||..|:.:.|         .|.|.                |..|.||....
  Fly   526 ALIFKGKTFGNAPPFLQMDSSDSE---------TDQGK----------------PEVFSDVNSDS 565

Human   556 SGEPGATRESPKAN----GQNQASPCSSGDDM---------------EISDDDRGGSPPPAPTPP 601
            :......|...|.|    ..|:||..||.:::               |::||:..          
  Fly   566 NNSENKKRSCEKNNKVLHQPNEASDISSDEELIGKKDKSKLSLICEKEVNDDNMS---------- 620

Human   602 QQPPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPHIYDFVNSLELM 666
                            |:||.....|                                     :.
  Fly   621 ----------------LSSLSSQEDP-------------------------------------IQ 632

Human   667 DRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLIAASAGPPGGAFGEAFLPFPPPQEAAYGLPYA 731
            .:.||::..:..|:..........:....|.|               .:|...|.:.|:     .
  Fly   633 TKEGAEYKSIMSSYMYSHSNQNPFYYHASGYG---------------HYLSGIPSESAS-----R 677

Human   732 LYAQGQEGRGAYSREAYHLPMPMAAEPLPSSSVSGEEARLPPREEAELAEGKTLPTAGTVGRVLA 796
            |::.|......|.:..........::|...:..:..:.....|::              |.:|:.
  Fly   678 LFSNGAYVHSEYLKAVASFNFDSFSKPYDYNKGALSDQNDGIRQK--------------VKQVIG 728

Human   797 MLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKA--KP-FQ----------NAAKQQAKEE 848
            .:|:|:|.|::||:|::|:|..||..|:.||:....||  || |:          |..|..:..|
  Fly   729 YIVEELKQILKRDVNKRMIEITAFKHFETWWDEHTSKARSKPLFEKADSTVNTPLNCIKDTSYNE 793

Human   849 DKEKTKLKEPGLLSLVDWAKSGGTTGIEAFAFGS----GLRGAL-RLPSFKVKRKEPSEISEASE 908
                   |.|.:..|::..:       |...|.|    |||.|: :||||:..||.||.|    .
  Fly   794 -------KNPDINLLINAHR-------EVADFQSYSSIGLRAAMPKLPSFRRIRKHPSPI----P 840

Human   909 EKR---PRPSTPAEE-----DEDDPEQEKEAGEPGR---PGTKPPKRDEERGKTQGKHRKSFALD 962
            .||   .|..:..||     |.|..:...|..:..|   .|..|.:..:.:..|.|.:.|.....
  Fly   841 TKRNFLERDLSDQEEMVQRSDSDKEDSNVEISDTARSKIKGPVPIQESDSKSHTSGLNSKRKGSA 905

Human   963 SEGEEASQESSSEKDE-------EDDEEDEEDEDR-------EEAVDTTKKETEV---------- 1003
            |....:|..|:|.:.|       |.....|||..|       .:...|.:....|          
  Fly   906 SSFFSSSSSSTSSEAEYEAIDCVEKARTSEEDSPRGYGQRNLNQRTTTIRNRNLVGTMDVINVRN 970

Human  1004 -----SDGEDEESDSSSKCSLYADSDGENDSTSDSESSSSSSSSSSSSSSSSSSSSSSSSESSSE 1063
                 ::.:.|.....:|.::|:|:|.:||.|           ...:....:.|:..|..|..|:
  Fly   971 LCSGSNEFKKENVTKRTKKNIYSDTDEDNDRT-----------LFPALKEKNISTILSDLEEISK 1024

Human  1064 D-----EEEEERPAALPSASPPPREVPVPTPAPVEVPVPERVAGSPVTPLPEQEASPARPAGPTE 1123
            |     :|....|..|       |::|       ..|.........:||:|        |.|..|
  Fly  1025 DSCIGLDENGIEPTIL-------RKIP-------NTPKLNEECRRSLTPVP--------PPGYNE 1067

Human  1124 ESPPSAPLRPPEPPAGPPAPAPRPD--ERPSSPIPLL--------PPPKKRRKTVSFSAIEVVPA 1178
            |....                 :.|  ::||.....:        ...::|::...:.|      
  Fly  1068 EEIKK-----------------KVDCKQKPSFEYDRIYSDSEEEKEYQERRKRNTEYMA------ 1109

Human  1179 PEPPPATPPQAKFPGPASRKAPRGVERTIRNLPLDHASLVKSWPEEVSRGGRSRAGGRGRLTEEE 1243
                   ..:.:|.....::..:.:::.:::            |..:.:...|   .|.:..|..
  Fly  1110 -------QMEREFLEEQEKRIEKSLDKNLQS------------PNNIVKNNNS---PRNKNDETR 1152

Human  1244 EA---------EPGTEVDLAVLADLALT-------PARRG-------------------LPALPA 1273
            :.         |..::||..::..:::.       |...|                   ....|.
  Fly  1153 KTAISQTRSCFESASKVDTTLVNIISVENDINEFGPHEEGDVLTNGCNKMYTNSKGKTKRTQSPV 1217

Human  1274 VEDSEATETSDEAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEPVPAPAALFSSPADEVLEA 1338
            ..:..:::.|.        .|.:.|||.|:|            ||..|    :|...|:.:|.|.
  Fly  1218 YSEGGSSQASQ--------ASQVALEHCYSL------------PPHSV----SLGDYPSGKVNET 1258

Human  1339 PEVVVAEAE--------------EPKPQQLQQQREEGEEEGEEEGEEEEEESSDSSSSSDGEGAL 1389
            ..::..|||              .|:...:..|:                               
  Fly  1259 KNILKREAENIAIVSQMTRTGPGRPRKDPICIQK------------------------------- 1292

Human  1390 RRRSL---RSHARRRRPPPPPPPP----------PPRAYEPRSEFEQMTILYDIWNSGLDSEDMS 1441
            ::|.|   .|:.:.:..|.....|          |...|:.|.:.|:|.|||.....|:|:||::
  Fly  1293 KKRDLAPRMSNVKSKMTPNGDEWPDLAHKNVHFVPCDMYKTRDQNEEMVILYTFLTKGIDAEDIN 1357

Human  1442 YLRLTYERLLQQTSGADWLNDTHWVHHTITNLT---TPKRKRRPQDGPREHQTGSARSEGYYPIS 1503
            :::::|...|.:...|.:||:||||.|..|:..   .|.:|||..|....|:||.||:||:|.:.
  Fly  1358 FIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAFWPPPSKKRRKDDELIRHKTGCARTEGFYKLD 1422

Human  1504 KKEKDKYLDVCPVSARQLEGVDTQGT-----------------NRVLS-------ERRSEQRRLL 1544
            .:||.|:       .......:|:.:                 |:::|       |.||.|||||
  Fly  1423 VREKAKH-------KYHYAKANTEDSFNEDRSDEPTALTNHHHNKLISKMQGISREARSNQRRLL 1480

Human  1545 SAIGTSAIMDSDLLKLNQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADM 1609
            :|.|:..  :|:|||.||||||||:|:|.:|.||:|||||||||||||||||||||.||.:|||:
  Fly  1481 TAFGSMG--ESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADL 1543

Human  1610 REKRYVQEGIGSSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPI 1674
            ||.:|...|||||||||:|.:|||||||||||||||||.|.||||||||||||:|||||||||||
  Fly  1544 RETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPI 1608

Human  1675 GVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN 1707
            |::|||||||||||||.|||||||.:.|||:||
  Fly  1609 GINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1641
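
Each block above pairs a Human row with a Fly row, with `-` marking gap columns and the numbers giving the first and last residue position in that row. As a rough sketch, the snippet below tallies identical and gapped columns for one block and checks that the number of non-gap residues matches the printed coordinates. It uses the final block (Human 1675..1707, Fly 1609..1641) as input. The function names `tally_columns` and `residues_spanned` are hypothetical, and the identity count here comes from comparing the two rows directly, not from DIOPT's own scoring.

```python
def tally_columns(row_a: str, row_b: str) -> dict:
    """Count identical and gapped columns in one aligned block."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    gapped = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":
            gapped += 1          # a gap in either sequence
        elif a == b:
            identical += 1       # same residue in both sequences
    return {"length": len(row_a), "identical": identical, "gapped": gapped}


def residues_spanned(row: str) -> int:
    """Number of real residues in a row, ignoring gap characters."""
    return sum(1 for c in row if c != "-")


# Final alignment block, copied from the page.
human = "GVDEEITYDYKFPLEDNKIPCLCGTESCRGSLN"  # Human 1675..1707
fly = "GINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN"    # Fly 1609..1641

stats = tally_columns(human, fly)
print(stats)

# The residue count should equal end - start + 1 for each row.
print(residues_spanned(human) == 1707 - 1675 + 1)
print(residues_spanned(fly) == 1641 - 1609 + 1)
```

Applied block by block, the same tally gives a quick consistency check on the start and end coordinates after copying the alignment out of the page.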

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Region        External ID Identity       Domain
SETD1A  NP_055527.1     60..89        -           7/29 (24%)     Interaction with WDR82. /evidence=ECO:0000269|PubMed:37030068
                        92..186       CDD:409964  39/93 (42%)    RRM_Set1A
                        194..308      -           21/140 (15%)   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        331..363      -           7/54 (13%)     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        381..486      -           28/123 (23%)   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        506..655      -           24/167 (14%)   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        834..854      -           7/32 (22%)     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        891..1251     -           71/423 (17%)   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        <1070..>1130  CDD:481241  12/59 (20%)    PRK12323
                        1264..1293    -           3/47 (6%)      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        1299..1303    -           2/3 (67%)      HCFC1-binding motif (HBM)
                        1307..1417    -           19/136 (14%)   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        1415..1450    -           12/34 (35%)    Interaction with CFP1
                        1425..1562    CDD:463344  51/163 (31%)   N-SET
                        1450..1537    -           30/113 (27%)   Interaction with ASH2L, RBBP5 and WDR5. /evidence=ECO:0000269|PubMed:17998332
                        1472..1499    -           11/29 (38%)    Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite
                        1492..1497    -           3/4 (75%)      WDR5 interaction motif (WIN). /evidence=ECO:0000269|PubMed:22266653, ECO:0000269|PubMed:22665483
                        1537..1542    -           3/4 (75%)      RxxxRR motif. /evidence=ECO:0000250|UniProtKB:P38827
                        1556..1703    CDD:380946  117/146 (80%)  SET_SETD1
Set1    NP_001015221.1  99..191       CDD:409745  39/91 (43%)    RRM_Set1
                        255..>386     CDD:273727  22/137 (16%)   U2AF_lg
                        1352..1496    CDD:463344  47/152 (31%)   N-SET
                        1490..1637    CDD:380946  117/146 (80%)  SET_SETD1
