DRSC/TRiP Functional Genomics Resources

Protein Alignment Set2 and kmt2a

DIOPT Version: 10

Sequence 1: NP_572888.2  Gene: Set2 / 32301  FlyBaseID: FBgn0030486  Length: 2362  Species: Drosophila melanogaster
Sequence 2: XP_005157640.1  Gene: kmt2a / 557048  ZFINID: ZDB-GENE-080521-3  Length: 4219  Species: Danio rerio


Alignment Length: 1952  Identity: 381/1952 (19%)
Similarity: 631/1952 (32%)  Gaps: 677/1952 (34%)
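
The summary percentages can be rechecked from the raw counts. The following is a minimal sketch in Python, not part of the DIOPT output. The function name `truncated_percent` is hypothetical, and the use of truncation rather than rounding is an assumption inferred only from the three figures on this page (381/1952 is roughly 19.5%, yet it is displayed as 19%).

```python
def truncated_percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated.

    Truncation is an assumed display convention, inferred from the
    figures on this page; it is not documented behavior.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return (100 * count) // total


# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 1952
summary = {"Identity": 381, "Similarity": 631, "Gaps": 677}

percentages = {
    name: truncated_percent(count, ALIGNMENT_LENGTH)
    for name, count in summary.items()
}

for name, pct in percentages.items():
    print(f"{name}: {summary[name]}/{ALIGNMENT_LENGTH} ({pct}%)")
```

With the counts above, the sketch reproduces the three percentages shown in the summary.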


- Green bases have known domain annotations that are detailed below.


  Fly    12 PVASRGRGRGRPPK----VALSALGNT-----PPHI---NPSLKHADAEASPTA--------PED 56
            |:|...:.:..|.|    .:::||.:|     |..|   :.|.|..|..:.|.|        ...
Zfish  2531 PLAMLPKDKANPNKEGNMTSMAALKDTVKTGSPQRIYNKSGSRKSHDYASGPAAVVAMKPLWSSG 2595

  Fly    57 QDSGQSECRR---------------SSRKK--IIKFDV-RD--------------LLNKNRKAHK 89
            ...|:.:.:|               |:::|  .:|.:| ||              :||.|.|:..
Zfish  2596 AKLGEEDIKRGFQASAGITGSHGTSSTKEKHSKVKMNVSRDVSKERKETPQNRNAVLNSNSKSSN 2660

  Fly    90 IQIEARIDSNPSTGHSQSGTTAASTSMSTATASAASASSAATVSRLFS----MFEMSHQSL---- 146
            ::.:.::   |...:..:..||.|::..:.|..............|.|    .||..|.|.    
Zfish  2661 VKTQGQV---PPPHNISNKATALSSNTGSGTVEVNKFDQKEVEKPLKSKERFSFEKKHTSAMDAI 2722

  Fly   147 -----------PPPPPPPTALEI-FAKPRPTQSLIVAQVTSEPS---AVGGAHPVQTMAGLPPVT 196
                       ||...|.::.|: ....:.|:.|.:.....:|:   ||..:...||...   ||
Zfish  2723 QPKAGSERSIRPPQVHPKSSKEVPLVGKKHTERLSLMSQKMDPNRTKAVSISPNTQTYTS---VT 2784

  Fly   197 PRKRGRPRKSQLADAAIIPTVIVPSCSDSDTNSTSTTTSNMSSDSGELPGFPIQKPKSKLRVSLK 261
            |..:|..|:|..|      .|..||.               ||:|.|                  
Zfish  2785 PSNQGPQRRSSRA------MVFSPSA---------------SSESSE------------------ 2810

  Fly   262 RLKLGGRLESSDSGNSPSSSSPEV---------EPPALQDENAMDERPKQEQNLS---------- 307
                      |||...|..|...:         |...|:||.::|:..:::.:.|          
Zfish  2811 ----------SDSHIHPDDSEEHLMDHQCADDGEDNNLEDEGSVDKHHEEDSDGSAGSAKRRYPR 2865

  Fly   308 RMVDAEEN------------SDSDSQIIFI---EIETESPKGEEEQE-EGRPVEVEPQDLIDIDM 356
            |...|..|            |..:..|.|.   ||..:...|..::. ||   :|:..|    ||
Zfish  2866 RSARARSNMFFGLTPFYGVRSYGEEDIPFYRSGEISMKKRTGSSKRSAEG---QVDGAD----DM 2923

  Fly   357 ELAKQEPTPDPEE------------DLDEIMVEVLSGPPSLWSAD--------------DEAEEE 395
            ..:....:.:.||            :....::...||.||:...|              |:|:|.
Zfish  2924 STSSSADSGEDEEGGIGSNKDTYYYNFTRTIINPSSGLPSIAGIDQCLGRGSQIHRFLRDQAKEH 2988

  Fly   396 EDAT--VQRATP----------PGKEPAADSCSSAPRRSRRSAPLSGSSRQGKT--------LEE 440
            ||.:  |..||.          .|.:..::|..|....|..:|..| |:::|.|        .|:
Zfish  2989 EDDSDEVSTATKNLELQQIGQLDGVDDGSESDISISTSSTTTATTS-STQKGSTKRKGRESRTEK 3052

  Fly   441 TFAEIAAESSKQILEAEESQDQEEQHILIDLIEDTLSESEVTSSVSPTIEHMVVEEVVVEENQLV 505
            :..:...|:......:.:|:..::.:.|......|..:..:.:.:|.|.:.:..:    .:|...
Zfish  3053 SNVDSGKEAVNTTSNSRDSRKNQKDNCLPLGSVKTQGQDPLETQLSLTTDLLKSD----SDNNNS 3113

  Fly   506 DEADEILDSK-QEFVIKKVFSESDNIAASLNKDIFEPKVE----TKATCGEVVPRPEMVTEDVYI 565
            |:...||.|. .|||:.....:      :|.:....|..|    .::...:|..|.:|:.||   
Zfish  3114 DDCGNILPSDIMEFVLNTPSMQ------ALGQQAEAPSAEQFSLDESYGVDVNQRKDMLFED--- 3169

  Fly   566 TEGIAATLEKSAVVTKPTTEM-IAETKLSDEVVIEP----PLKDESDPKQTEVELPESKPAVNIP 625
                         .|:|.... ..|:.:|..:.:|.    ||           |||.        
Zfish  3170 -------------FTQPLANAESGESGVSTTIAVEESYGLPL-----------ELPS-------- 3202

  Fly   626 KSERILSAEVETTSSPLVPPECCTLESVSGPVLLETS------LSTEE----KSNENVET-TPLK 679
                  ...|.||.||.|.      ....||::.|||      |:|||    ||.:...| :.:.
Zfish  3203 ------DLSVLTTRSPTVS------NQNHGPLISETSERTMLALATEESEAGKSKKKTRTGSTVS 3255

  Fly   680 TEAAKE---------------------------DSPPAAPEEEASNSS-----------EEPNFL 706
            :::.:|                           .||..||..|..|..           ..|...
Zfish  3256 SKSPQEGCADSQVPEGHMTPEHFIPPSVDGDHITSPGVAPVGETGNQDMTRTSSTPVLPSSPTLP 3320

  Fly   707 LEDYE---------------SNQEQVAEDEMMKCNNQK----GQKQTPLPEMK-------EPEKP 745
            |::.:               |:..|.|..: :|...:|    .|...||..::       .|..|
Zfish  3321 LQNQKFIPATTVTSGPAPITSSAVQAAASQ-LKPGPEKLIVLNQHLQPLYVLQTVPNGVMNPNAP 3384

  Fly   746 VAETVSKKEKAMEN--PARSS---PAIVDKKVRA--GEME---KKVVKSTKGTV----------- 789
            |...:|......::  ||.|.   |.....::.|  |..:   :.|:.||...:           
Zfish  3385 VLTGLSGGISTSQSIFPAGSKGLVPVSHHPQIHAFTGTTQTGFQPVIPSTTSGLLIGVTSHDPQI 3449

  Fly   790 ------------PEKKMDSKKSCAAVTPAKQK-ESGKSAKEAILKKETEKEKSSAK--------- 832
                        |...|.|  |.:.:|||... .||...|..|.:.::.|.|..|:         
Zfish  3450 GVTEAGHRHDHAPNVAMVS--SASTITPAPSMIPSGHGKKRLISRLQSPKSKKQARPKTQPTLAP 3512

  Fly   833 --------LDSSSPNTLDKKGKDTAQWSPQLQTLPKSSTKPPQESAPSVISK----------TTS 879
                    |.:.||:.: ..|........:|.|:    |..|....|::|.:          |..
Zfish  3513 SDVGPNMTLINLSPSQI-AAGIPAQTGLMELGTI----TATPHRKIPNIIKRPKQGVMYLEPTIL 3572

  Fly   880 NQPAPKEEQHAAKKGLSDNSPPSVLKAKEK------AVSGFVECDAMFKAM-------------- 924
            .||.|          :|..:.|.:|.....      .|||.....::...:              
Zfish  3573 PQPMP----------ISTTTQPGILGHDSSTHLLPCTVSGLNTSQSVLNVVSVPSSAPGNFLGGS 3627

  Fly   925 -----------------DLANAQLRLDEKNKKKLKKVP------------------TKVEAPPKV 954
                             .|:|..::.:..| ..|.:.|                  |.:.:...|
Zfish  3628 SVSLSAPGLISSTEITGSLSNLLIKANPHN-LSLSEQPMVLHPGTPMMSHLTNPAQTSIASSICV 3691

  Fly   955 EPPT-AVPVP-GQKKSLSGKTSLRR--NTVYED---SPNLERNSSPSSDSAQANTSAGK--LKPS 1010
            .||. ::.|| .|:....|...|:.  :.|..|   .||:              :|||:  |.|:
Zfish  3692 FPPNQSITVPVNQQVEKEGTVHLQHAVSRVLADKTLDPNV--------------SSAGQVALAPN 3742

  Fly  1011 KVKKKINPRRSTICEAAKDLRSSSSSSTPTREVAASSPVSTSSDSSSKRNGSKRTTSDLDGGSKL 1075
            .:.:::| :...:....:..|:|..|....:..|:..|...||.:..|              .|.
Zfish  3743 PISQELN-KGHVVSVLTQSSRTSPISRPQHQHQASKLPAGASSVAFGK--------------GKH 3792

  Fly  1076 DQRRYTICEDRQPETAIPVPLTKRRFSMHPKASANPLHDT-LLQTAGKKRGRKEGKESLSRQNSL 1139
            ..:|...|.|:..        .|:...:|   |..|..|| .:|.:..|     |.:.||....:
Zfish  3793 KAKRPRPCPDKSS--------GKKHKGLH---SDTPTVDTSAIQLSYIK-----GDQELSSPEPM 3841

  Fly  1140 DSSSSASQGAPKKKALKSAEILSAALLETESSESTSSGSKMSRWDVQTSPELEAANPFGDIAKFI 1204
            |:..|...|:.|:                :|:..|::.|.:.|..|....  |..:..|..:|  
Zfish  3842 DTGQSNETGSKKR----------------DSTTMTTNSSALKRKTVDAVD--EKPSTAGLPSK-- 3886

  Fly  1205 EDGVNLLKRDKVD-EDQRKEGQDEVKREADPE---------EDEFAQRVANMET--PATTPTPSP 1257
            .||.. .|...|| .|||..|:|. ..:..|:         :|.|..|..::|.  .:.|.....
Zfish  3887 GDGTG-NKAFSVDTPDQRDSGRDS-SLDHKPKKGLIFEICSDDGFQIRCESIEEAWKSLTDKVQE 3949

  Fly  1258 TQSNPEDSAST--------------TTVLKELETGGGVR--RSHRIK-QKPQ-------GPRASQ 1298
            .:||....|.:              ..|:..||...|.|  |::|.: .||:       .|..| 
Zfish  3950 ARSNARLKALSFDGVNGLKMLGVVHDAVVFLLEQLYGARHCRNYRFRFHKPEETDYLPVNPHGS- 4013

  Fly  1299 GRGVASVALAPISMDEQLAELANIEAINEQFLRSEGLNTFQLLKENFYRCARQVSQENAEMQCDC 1363
                                     |..|.:.|...|:.|..|....    ||....|.:.:   
Zfish  4014 -------------------------ARAEVYHRKSVLDMFNFLASKH----RQPPVYNPQEE--- 4046

  Fly  1364 FLTGDEEAQGHLSCGAGCINRMLMIECGPLCSNGARCTN------KRFQQHQCW---PCRVFRTE 1419
                |||.....|                    ..|.|:      ::|:|.:..   ...|:|:.
Zfish  4047 ----DEEEMQQKS--------------------ARRATSTDLPLPEKFRQLKKASRDAVGVYRSA 4087

  Fly  1420 KKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHLYSKDRNRHYYFMALRGEAVIDATSKGNI 1484
            ..|.|:.....|.|||.::||.|.||.|...::|:..|. |:....|...:....|:|||..||.
Zfish  4088 IHGRGLFCRKNIEPGEMVIEYSGNVIRSVLTDKREKYYD-DKGIGCYMFRIDDYEVVDATIHGNS 4151

  Fly  1485 SRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEEITFDYQY-LRYGRDAQRCYCEAANCRG 1548
            :|:|||||:||..::...|:|:..|..|:.:.|..|||:|:||:: :....:...|.|.|..||.
Zfish  4152 ARFINHSCEPNCYSRVVNVDGQKHIVIFATRKIYKGEELTYDYKFPIEEPGNKLPCNCGAKKCRK 4216

  Fly  1549 WI 1550
            ::
Zfish  4217 FL 4218
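
Each alignment row lists a start coordinate, the aligned residues (with `-` for gaps), and an end coordinate. As a rough consistency check, the end coordinate should equal the start coordinate plus the number of non-gap residues, minus one. The sketch below illustrates this on the first alignment block; the helper name `end_coordinate` is hypothetical, and the rows are copied by hand from the block starting at Fly residue 12.

```python
def end_coordinate(start: int, aligned: str) -> int:
    """Compute the expected end coordinate of one alignment row.

    Gap characters ('-') do not consume a residue, so only the
    remaining characters advance the coordinate.
    """
    residues = sum(1 for ch in aligned if ch != "-")
    return start + residues - 1


# First alignment block, copied from the rows above.
fly_row = "PVASRGRGRGRPPK----VALSALGNT-----PPHI---NPSLKHADAEASPTA--------PED"
zfish_row = "PLAMLPKDKANPNKEGNMTSMAALKDTVKTGSPQRIYNKSGSRKSHDYASGPAAVVAMKPLWSSG"

fly_end = end_coordinate(12, fly_row)
zfish_end = end_coordinate(2531, zfish_row)

print("Fly end:", fly_end)
print("Zfish end:", zfish_end)
```

For this block the computed ends agree with the printed coordinates, 56 for Fly and 2595 for Zfish. The same check can be applied to any other block to catch copy or wrapping errors.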

Known Domains:


Indicated by green bases in alignment.

Gene   Sequence        Domain       Region       External ID  Identity
Set2   NP_572888.2     valS         <709..832    CDD:237855   34/182 (19%)
                       AWS          1358..1410   CDD:197795   8/57 (14%)
                       SET_SETD2    1410..1551   CDD:380949   48/145 (33%)
                       WW           2014..2043   CDD:459800
                       SRI          2266..2355   CDD:462404
kmt2a  XP_005157640.1  zf-CXXC      1312..1359   CDD:366873
                       PHD1_KMT2A   1627..1673   CDD:277063
                       PHD2_KMT2A   1675..1724   CDD:277065
                       PHD3_KMT2A   1762..1818   CDD:277067
                       Bromo_ALL-1  1841..1971   CDD:99925
                       ePHD_KMT2A   2066..2178   CDD:277163
                       FYRN         2219..2266   CDD:461787
                       Atrophin-1   3221..>3709  CDD:460830   87/506 (17%)
                       FYRC         3919..4002   CDD:197781   17/82 (21%)
                       SET_KMT2A    4066..4219   CDD:380983   50/154 (32%)
Blue background indicates that the domain is not in the aligned region.