DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set2 and KMT2C

DIOPT Version :10

Sequence 1:NP_572888.2 Gene:Set2 / 32301 FlyBaseID:FBgn0030486 Length:2362 Species:Drosophila melanogaster
Sequence 2:NP_733751.2 Gene:KMT2C / 58508 HGNCID:13726 Length:4911 Species:Homo sapiens


Alignment Length:1977 Identity:388/1977 - (19%)
Similarity:614/1977 - (31%) Gaps:714/1977 - (36%)


- Green bases have known domain annotations that are detailed below.


  Fly    19 GRGRPPKVALSALGNTPPHINPSLKHADAEASPTAPEDQDSGQSECRRSSRKKIIKFDVRDLLNK 83
            |..|..|.||||...|...........|||......|.|...|.:..:              :.|
Human  3203 GAHRKSKKALSAKQRTAKKAGREFPEEDAEQLKHVTEQQSMVQKQLEQ--------------IRK 3253

  Fly    84 NRKAHKIQIE-ARIDSNPSTGHSQSGTTAASTSMSTATASAASASSAATVSRLFSMFEMSHQSLP 147
            .:|.|...|| .||.      ..|....|..|.|                           .|:.
Human  3254 QQKEHAELIEDYRIK------QQQQCAMAPPTMM---------------------------PSVQ 3285

  Fly   148 PPPP------PPTALEIFAKPRPTQSLIVAQVTSEPSAV---GGAHPVQTMAGLPPVTPRKRGRP 203
            |.||      |||      ..:||..::..|:..:....   |...||: |..||...|  ...|
Human  3286 PQPPLIPGATPPT------MSQPTFPMVPQQLQHQQHTTVISGHTSPVR-MPSLPGWQP--NSAP 3341

  Fly   204 RKSQLADAAIIPTVI---VPSCSDSDTNSTSTTTSNMSSDSGELP------------GFPIQKPK 253
            ....|....|.|.:.   :.:|:.:     ..|.||.:..||..|            .|..::.|
Human  3342 AHLPLNPPRIQPPIAQLPIKTCTPA-----PGTVSNANPQSGPPPRVEFDDNNPFSESFQERERK 3401

  Fly   254 SKLRVSLKRLKLGGRLESSDSGNSPSSSSPEVEPPALQDENAMDERPKQEQNLSRMVDAEENSD- 317
            .:||...:|.::              ....||:     .:.|:.:|.:.||:  .||.:|.:|. 
Human  3402 ERLREQQERQRI--------------QLMQEVD-----RQRALQQRMEMEQH--GMVGSEISSSR 3445

  Fly   318 -SDSQIIF----IEIETESPKG------EEEQEEGRPVEVE---------PQDLIDIDMELAKQE 362
             |.|||.|    :..:...|.|      :.:|:.|:.::.:         |.....:.....:|.
Human  3446 TSVSQIPFYSSDLPCDFMQPLGPLQQSPQHQQQMGQVLQQQNIQQGSINSPSTQTFMQTNERRQV 3510

  Fly   363 PTPDPEEDLDEIMVEVLSGPPSLWSADDEAEEEEDATVQRA------TP--PGKEPAADSCSSAP 419
            ..|....|...|.|    |.|:..|...........:.|::      ||  |...|.|:  ||.|
Human  3511 GPPSFVPDSPSIPV----GSPNFSSVKQGHGNLSGTSFQQSPVRPSFTPALPAAPPVAN--SSLP 3569

  Fly   420 RRSRRSAPLSGSSRQGKT--LEETFAEIAAE---SSKQILEAEESQDQEEQHILIDLIEDTLSES 479
             ..:.|....|.|..|.|  |.:.:::|..|   ..|:..:.:...|.|...      ..:...|
Human  3570 -CGQDSTITHGHSYPGSTQSLIQLYSDIIPEEKGKKKRTRKKKRDDDAESTK------APSTPHS 3627

  Fly   480 EVTSSVSPTIEHMVVEEVVVEENQLVDEADEILDSKQEFVIKKVFSESDNIAASLNKDIFEPKVE 544
            ::|:..:|.|........|...::|..:||      ||.|                    ||   
Human  3628 DITAPPTPGISETTSTPAVSTPSELPQQAD------QESV--------------------EP--- 3663

  Fly   545 TKATCGEVVPRPEMVTEDVYITEGIAATLEKSAVVTKPTTEMIAETKLSDEVVIEPPLKDESDP- 608
                                               ..|:|..:|..:|..|:..:.|..|.|.. 
Human  3664 -----------------------------------VGPSTPNMAAGQLCTELENKLPNSDFSQAT 3693

  Fly   609 --KQTEVELPESKPAVNIP-KSERILSAEVETTSSPLVPPECCTLESVSGPVLLETSLSTEEKSN 670
              :||.......|.::..| |:|.|...:.||.|.|          ....|.|       ||::.
Human  3694 PNQQTYANSEVDKLSMETPAKTEEIKLEKAETESCP----------GQEEPKL-------EEQNG 3741

  Fly   671 ENVETTPLKTEAAKEDSPP---AAPEEEASNSSEEPNFLLEDYES----NQEQ----VAEDEMMK 724
            ..||...:....:...|||   .||..:..:.:|....||::.:|    ||:.    .:||:..|
Human  3742 SKVEGNAVACPVSSAQSPPHSAGAPAAKGDSGNELLKHLLKNKKSSSLLNQKPEGSICSEDDCTK 3806

  Fly   725 CNNQKGQKQTP--------------------LPEMKEPEKPVAETVSKKEKAMENPARSSPAIVD 769
             :|:..:||.|                    ||:.....:...:...:.::..|..|..|     
Human  3807 -DNKLVEKQNPAEGLQTLGAQMQGGFGCGNQLPKTDGGSETKKQRSKRTQRTGEKAAPRS----- 3865

  Fly   770 KKVRAGEMEKKVVKSTKGTVPEKKMDSKKSCAAVTPAK----------QKESGKSAKEAILKKET 824
            ||.:..|.||:.:.|:..|....|..:..|.....||.          ||.:...|        |
Human  3866 KKRKKDEEEKQAMYSSTDTFTHLKQQNNLSNPPTPPASLPPTPPPMACQKMANGFA--------T 3922

  Fly   825 EKE---KSSAKLDSSSPNTLDKK-------------GKDTAQWSPQLQTLPKSSTKPPQESAPSV 873
            .:|   |:...:......||..|             .:..|| .|:...:|.|...||.      
Human  3923 TEELAGKAGVLVSHEVTKTLGPKPFQLPFRPQDDLLARALAQ-GPKTVDVPASLPTPPH------ 3980

  Fly   874 ISKTTSNQPAPKEEQHAAKKGLSDNSPPSVLKAKEKAVSGFVECDAMFKAMDLANAQLRLDEKNK 938
                 :||...:.:.|...:...|:..||   :..::|.| ||   :.:..||:..:        
Human  3981 -----NNQEELRIQDHCGDRDTPDSFVPS---SSPESVVG-VE---VSRYPDLSLVK-------- 4025

  Fly   939 KKLKKVPTKVEAPPKVEPPTAVPVPGQKKSLSGKTS-LRRN-------TVYEDSPNLERNSSPSS 995
                      |.||:..|...:|:   ..|.:||:| .|||       |:|..||   ...||:.
Human  4026 ----------EEPPEPVPSPIIPI---LPSTAGKSSESRRNDIKTEPGTLYFASP---FGPSPNG 4074

  Fly   996 DSAQANTSAGKLKPSKVKKKINPRRSTICEAAKDL---RSSSSSSTPTREVAASSPVSTSSDSSS 1057
            ..:...:.|..|.|:..:     ..|::..|..||   |..:|     .||:::..|.:....||
Human  4075 PRSGLISVAITLHPTAAE-----NISSVVAAFSDLLHVRIPNS-----YEVSSAPDVPSMGLVSS 4129

  Fly  1058 KRNGSKRTTSDLDGGSKLDQRRYTICEDRQPETA---------------IPVPLTKRRFSMHPKA 1107
            .|         ::.|  |:.|::.:.....|.:|               :|.|.|....|.:..:
Human  4130 HR---------INPG--LEYRQHLLLRGPPPGSANPPRLVSSYRLKQPNVPFPPTSNGLSGYKDS 4183

  Fly  1108 SANPLHDTLLQT------------AGKKRG-------RKEGKESLSRQNS-----------LDSS 1142
            |........|:.            :|.::.       .|:.:||..|...           |.||
Human  4184 SHGIAESAALRPQWCCHCKVVILGSGVRKSFKDLTLLNKDSRESTKRVEKDIVFCSNNCFILYSS 4248

  Fly  1143 SSASQGAPKKKALKSAEILSAALLETESSESTSSGSKMSRWDVQTSPEL-EAANPFGD------- 1199
            ::.::.:..|:::.|  :..:.:.||.|.......:.:|..||...|:| |.|:|...       
Human  4249 TAQAKNSENKESIPS--LPQSPMRETPSKAFHQYSNNISTLDVHCLPQLPEKASPPASPPIAFPP 4311

  Fly  1200 -------IAKFIEDGVNLLKRDKVD------EDQR-----------KEGQDEV-------KREAD 1233
                   .||..|..|.:..:.::.      ||.|           |:....:       |...:
Human  4312 AFEAAQVEAKPDELKVTVKLKPRLRAVHGGFEDCRPLNKKWRGMKWKKWSIHIVIPKGTFKPPCE 4376

  Fly  1234 PEEDEFAQRVANMETPATTP----------------TPSPTQSNPED--------SASTTTVLKE 1274
            .|.|||.:::.....|...|                |..|.:....|        .|..:|.:.|
Human  4377 DEIDEFLKKLGTSLKPDPVPKDYRKCCFCHEEGDGLTDGPARLLNLDLDLWVHLNCALWSTEVYE 4441

  Fly  1275 LETG-----------------------GGVRRSHRIK---------------------------Q 1289
            .:.|                       |.....||.:                           .
Human  4442 TQAGALINVELALRRGLQMKCVFCHKTGATSGCHRFRCTNIYHFTCAIKAQCMFFKDKTMLCPMH 4506

  Fly  1290 KPQGPRASQGRGVASVALAPISMDE--QLAELANIEAINEQFLRSEGLNTF-----------QLL 1341
            ||:|....:....|......:..||  |:|.:..         |.|..:||           |||
Human  4507 KPKGIHEQELSYFAVFRRVYVQRDEVRQIASIVQ---------RGERDHTFRVGSLIFHTIGQLL 4562

  Fly  1342 KENF-------------YRCARQV-SQENAEMQCDCFLTGDEE------------AQGH------ 1374
            .:..             |..:|.. |...|..:|. :|...||            .|||      
Human  4563 PQQMQAFHSPKALFPVGYEASRLYWSTRYANRRCR-YLCSIEEKDGRPVFVIRIVEQGHEDLVLS 4626

  Fly  1375 ---------------------------------------LSCGA------------GCIN----- 1383
                                                   |:..|            .|.|     
Human  4627 DISPKGVWDKILEPVACVRKKSEMLQLFPAYLKGEDLFGLTVSAVARIAESLPGVEACENYTFRY 4691

  Fly  1384 -RMLMIECGPLCSNGARC-------------------------TNKRFQ---------------- 1406
             |..::|. ||..|...|                         |:|.||                
Human  4692 GRNPLMEL-PLAVNPTGCARSEPKMSAHVKRFVLRPHTLNSTSTSKSFQSTVTGELNAPYSKQFV 4755

  Fly  1407 -----QHQC----WPCRVF--RTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHLYSKD 1460
                 |::.    |...|:  |:..:|.|:.|...|.....::||:|.:|.:|...|::.|| :.
Human  4756 HSKSSQYRKMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLY-ES 4819

  Fly  1461 RNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEITF 1525
            :||..|...:..:.|||||..|..:|||||||.||...:..|.....:|...|.:.||.|||:.:
Human  4820 QNRGVYMFRMDNDHVIDATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCY 4884

  Fly  1526 DYQYLRYGRDAQR--CYCEAANCRGWI 1550
            ||:: .:..|..:  |:|.|.|||.|:
Human  4885 DYKF-DFEDDQHKIPCHCGAVNCRKWM 4910

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set2NP_572888.2 valS <709..832 CDD:237855 32/163 (20%)
AWS 1358..1410 CDD:197795 22/172 (13%)
SET_SETD2 1410..1551 CDD:380949 51/149 (34%)
WW 2014..2043 CDD:459800
SRI 2266..2355 CDD:462404
KMT2CNP_733751.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..101
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 164..203
ePHD1_KMT2C 247..330 CDD:277166
PHD1_KMT2C_like 343..388 CDD:276984
PHD2_KMT2C 390..435 CDD:277069
PHD3_KMT2C 466..517 CDD:276986
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 721..742
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 763..798
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 828..864
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 885..912
PHD4_KMT2C 952..1008 CDD:277071
PHD5_KMT2C_like 1009..1055 CDD:276988
RanBP2-type Zn finger 1084..1106 CDD:275375
PHD6_KMT2C 1086..1136 CDD:277073
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1215..1324
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1406..1431
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1458..1485
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1604..1630
HMG-box_KMT2C 1633..1713 CDD:438835
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1709..2448
PHA03247 <1876..2353 CDD:223021
Atrophin-1 2219..>2547 CDD:460830
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2589..2694
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2793..2887
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2925..2954
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2989..3029
TPH <3176..>3272 CDD:464007 21/88 (24%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3205..3241 11/35 (31%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3353..3409 12/60 (20%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3527..3583 14/58 (24%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 3596..3919 79/415 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4024..4053 10/49 (20%)
ePHD2_KMT2C 4402..4506 CDD:277167 10/103 (10%)
FYRN 4551..4602 CDD:461787 9/51 (18%)
FYRC 4610..4697 CDD:197781 8/86 (9%)
WDR5 interaction motif (WIN). /evidence=ECO:0000269|PubMed:22266653, ECO:0000269|PubMed:22665483 4707..4712 1/4 (25%)
SET_KMT2C_2D 4758..4910 CDD:380948 51/153 (33%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.