DRSC/TRiP Functional Genomics Resources


Protein Alignment CG5098 and Kmt2a

DIOPT Version: 10

Sequence 1: NP_001261066.1  Gene: CG5098 / 37063  FlyBaseID: FBgn0034300  Length: 1339  Species: Drosophila melanogaster
Sequence 2: XP_063121507.1  Gene: Kmt2a / 315606  RGDID: 1586165  Length: 4017  Species: Rattus norvegicus


Alignment Length: 1601  Identity: 300/1601 (18%)
Similarity: 481/1601 (30%)  Gaps: 583/1601 (36%)
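
The percentages above can be reproduced from the raw counts. The sketch below is illustrative only: the function name `percent_floor` is hypothetical, and the use of truncation (rather than rounding) is an inference from the reported figures, since 300/1601 is about 18.7% yet is shown as 18%. It is not documented DIOPT behavior.

```python
# Reproduce the summary percentages from the counts reported on this page.
# Assumption: percentages are truncated toward zero, inferred from
# 300/1601 (about 18.7%) being displayed as 18%.

ALIGNMENT_LENGTH = 1601  # total alignment columns, as reported

# Counts as reported in the summary lines.
counts = {"identity": 300, "similarity": 481, "gaps": 583}


def percent_floor(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated."""
    return (100 * count) // total


percentages = {
    name: percent_floor(count, ALIGNMENT_LENGTH)
    for name, count in counts.items()
}

for name, value in percentages.items():
    print(f"{name}: {counts[name]}/{ALIGNMENT_LENGTH} ({value}%)")
```

Rounding to the nearest integer instead would give 19% identity, which does not match the reported value.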


- Green bases have known domain annotations that are detailed below.


  Fly   109 QSARSAAQPLAKQPPNQQPHQTQQQQQSLIHAPNYPSIQNLTTNATPTSTQLQQQQQQEHLAAMA 173
            |::.|...||...||..||..........:..|..|    |.:...|.|....|.:::..|....
  Rat   589 QTSSSPPPPLLTPPPPLQPASGISDHTPWLMPPTIP----LASPFLPASAAPMQGKRKSILREPT 649

  Fly   174 AAHVSLLQSSRQNQGAPSGNLSNGGDCESLLPPP------PPTSVSGNTNHTGSNSSSNSGSNNH 232
            ....||..|..:.|...|...:.    |.|:..|      ||.....:.......|:|.|.::..
  Rat   650 FRWTSLKHSRSEPQYFSSAKYAK----EGLIRKPIFDNFRPPPLTPEDVGFASGFSASGSAASAR 710

  Fly   233 IASPHYMQSRDENFKLTQLKRS--FEPDLSGKNPQKEKDFGYPSASSASKLPTHNVQQQHANKKP 295
            :.||.:..:|.:..|.:.:.|:  |.|  |..:.:..:....||..::|...:..|..:...:|.
  Rat   711 LFSPLHSGTRFDIHKRSPILRAPRFTP--SEAHSRIFESVTLPSNRTSSGASSSGVSNRKRKRKV 773

  Fly   296 -SPLRNYHQQQQPPYNLTPKYN-------GPQTPPT--------PQSPLAANPHQMLSPTMDYNQ 344
             ||:|:  :.:.|.:::..:..       .|.|||:        |.|||||:   .|:||..:..
  Rat   774 FSPIRS--EPRSPSHSMRTRSGRLSTSELSPLTPPSSVSSSLSIPVSPLAAS---ALNPTFTFPS 833

  Fly   345 LHLHHQLNSSSGGSYQHMQQDQTQSQSHPQHLHYHNQHATSQTAPP-----PLLPPLLTSGQFHA 404
                |.| :.||.|.:..|:.:.|               ||..|.|     |.|.|..|.|    
  Rat   834 ----HSL-TQSGESTEKSQRARKQ---------------TSAPAEPFSSNSPALFPWFTPG---- 874

  Fly   405 QPQDASQQQTASSSQHQTHHSRTAQLTNLDQAVKHKPESEEQPVITDLSYRNSETDKTAANPVPE 469
                   .||....:..|.....::..:.|::|: |.:|.|         |:.|.:|        
  Rat   875 -------SQTEKGRKKDTAPEELSKDRDADKSVE-KDKSRE---------RDREREK-------- 914

  Fly   470 APESPYLTTSNEESLESNSNSSNSRKRRKRKASMVMRVTPNENAPEGENSKPQHPQQAANLNNSC 534
                              .|...|||.:::|.|.:.  :.:...|.|..||.:...:....::|.
  Rat   915 ------------------ENKRESRKEKRKKGSDIQ--SSSALYPVGRVSKEKVAGEDVGTSSSA 959

  Fly   535 S-----PKKSPKNGGGEFQPFSTQKQSQTENEKTTQENGRGG----------------------- 571
            .     .|.|..:.|.:..|. |...:.....|...:.|||.                       
  Rat   960 KKATGRKKSSSLDSGADIAPV-TLGDTTAVKAKILIKKGRGNLEKNNLDLGPTAPSLEKEKTLCL 1023

  Fly   572 ---SPAPAENNSNSNSSTLYNDNENPKTKKQRQALLQR------NLTEQHRMQQDDEPPKNHTSP 627
               ||:..:::::|..|.|...::.|.|.|:..:||::      .:.:...::|.|:|       
  Rat  1024 STPSPSTVKHSTSSIGSMLAQADKLPMTDKRVASLLKKAKAQLCKIEKSKSLKQTDQP------- 1081

  Fly   628 AMPPPSPQSNSSSSSSSSSSANTHSSQSSHAVNNIPKPEINNKAT--TDTPASPAL--VEQGDI- 687
                     .:....|.||..:....:..|............:|.  .|.|...||  .|:..| 
  Rat  1082 ---------KAQGQESDSSETSVRGPRIKHVCRRAAVALGRKRAVFPDDMPTLSALPWEEREKIL 1137

  Fly   688 -----DAKPAVSVHECDEEEEPAVNKVSP-----AHPDPPT-----TAAVAAPPATESPKKSSPA 737
                 |.|.:::..|..|...|.:..:.|     |..:||.     :......|..:.|:.....
  Rat  1138 SSMGNDDKSSIAGSEDAEPLAPPIKPIKPVTRNKAPQEPPVKKGRRSRRCGQCPGCQVPEDCGVC 1202

  Fly   738 ANSESCP-FGEVEDK--------------------LEQMFAGIEEETERISSPEKPAEESAAMVA 781
            .|....| ||....|                    |::....::::.::..:.||...:.:.:|.
  Rat  1203 TNCLDKPKFGGRNIKKQCCKMRKCQNLQWMPSKAYLQKQTKAVKKKEKKSKATEKKESKESTVVK 1267

  Fly   782 HNL-TAQLALDPSKTLDT------------PAENQTSVLAVLAPNQTPTPEIRPVATKAAMKSTM 833
            .:| :||.|..|.:....            |.|.:|......||...|.|...|        :..
  Rat  1268 SSLESAQKAAPPVREEPAPKKSSSEPPPRKPVEEKTEEGGAPAPAPAPAPAPAP--------APA 1324

  Fly   834 PSPVHSPIPQSRSTSTPLVAGDDSKSNTPVPAKAPAPRRPPPRRLSMGMDASLLRFMIDDPPAKK 898
            |:|..:|.|:.:..|||  |...|......|| |..|.:||...|   ......:.:..:|..|:
  Rat  1325 PAPAPAPAPEPKQASTP--ASRKSSKQVSQPA-AVVPPQPPSTAL---QKKEAPKAIPSEPKKKQ 1383

  Fly   899 P---------GRKKKVTKEPDFEDDDKPSTSAAAAAALAARQLSEAASATKSKPAAGAKKKNAGV 954
            |         .::|||...|......||.                    .|.||...:|::|||.
  Rat  1384 PPPPESGPEQSKQKKVAPRPSIPVKQKPK--------------------DKEKPPPVSKQENAGT 1428

  Fly   955 KGKKGSAGKGNAKNAKQNGKKSARK-PA---------FTTDEDSTPAPTNGG----GSVPELRFK 1005
            .         |..|...||..|.:| ||         |..|.::......||    .||| :..:
  Rat  1429 L---------NILNPLLNGISSKQKIPADGVHRIRVDFKEDCEAENVWEMGGLGILTSVP-ITPR 1483

  Fly  1006 SPFILIKPDGSVSI----------------KNTHSAEDVNE------------------------ 1030
            ....|....|.|..                :|....||..|                        
  Rat  1484 VVCFLCASSGHVEFVYCQVCCEPFHKFCLEENERPLEDQLENWCCRRCKFCHVCGRQHQATKQLL 1548

  Fly  1031 --------------------KQTKVKK----------------APHERKNLRGMHSSTLSNR--- 1056
                                |.||.||                .|.:..:.:..|..:|.:.   
  Rat  1549 ECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAK 1613

  Fly  1057 --------------YDADTTDSTWI-CVFCKRGPHKL--GLGDLFGPYLVTS-----------DC 1093
                          ||.|..:|..: |..|.|..|..  ||.|..  |.:.|           :|
  Rat  1614 LFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCEGLSDEM--YEILSNLPESVAYTCVNC 1676

  Fly  1094 DE------------------------------------YRAAVQTPGAQ---------------- 1106
            .|                                    ||.|.:.|...                
  Rat  1677 TERHPAEWRLALEKELQASLKQVLTALLNSRTTSHLLRYRQAAKPPDLNPETEESIPSRSSPEGP 1741

  Fly  1107 -----------------DIDGMFVNKRR-----------REDMVK----------GQ-------- 1125
                             |::|  |.|:.           .:|:||          ||        
  Rat  1742 DPPVLTEVSKQDEQQPLDLEG--VKKKMDQGNYVSVLEFSDDIVKIIQAAINSDGGQPEIKKANS 1804

  Fly  1126 ----------ERNLPAVPATLANIMQAPKISM---------------HKRKRKQTHDSSISYSDD 1165
                      ||..|......:...:..|:|.               |...:.|..:.| |:::.
  Rat  1805 MVKSFFIRQMERVFPWFSVKKSRFWEPNKVSNNSGMLPNAVLPPSLDHNYAQWQEREES-SHTEQ 1868

  Fly  1166 P--------------------------------NESRSQCSSVDL-----LDCSTESKFVETFRG 1193
            |                                :..||:..|.:|     :|.:.:......:  
  Rat  1869 PPLMKKIIPAPKPKGPGEPDSPTPLHPPTPPILSTDRSREDSPELHPPPGIDDNRQCALCLMY-- 1931

  Fly  1194 MGKTSEN--------GFEVWLHEDCAVWS------NDIHLIGAHVNGLDAAVWDSTRYQCVLCQQ 1244
             |..|.|        |...|.|.:||:||      :|..|...|:     ||....:.:|..||:
  Rat  1932 -GDDSANDAGRLLYIGQNEWTHVNCALWSAEVFEDDDGSLKNVHM-----AVIRGKQLRCEFCQK 1990

  Fly  1245 TGASICCFQRCCKAAAHVPCGRSANWSLSEEDRKVYCHLHR 1285
            .||::.|....|.:..|..|.|:.| .:..:|:||||..||
  Rat  1991 PGATVGCCLTSCTSNYHFMCSRAKN-CVFLDDKKVYCQRHR 2030
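
Per-block counts can be checked by comparing the two aligned rows directly. The sketch below does this for the final block (Fly 1245-1285 against Rat 1991-2030). The helper name `count_columns` is hypothetical, and the code compares residues itself rather than relying on the symbols in the match line, whose exact scoring meaning is not stated on this page.

```python
# Count identical and gapped columns in one block of a pairwise alignment.
# The two strings are the aligned rows of the last block above;
# '-' marks a gap position.

def count_columns(row_a: str, row_b: str) -> tuple[int, int, int]:
    """Return (identical columns, gapped columns, total columns)."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    gapped = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":
            gapped += 1      # a gap in either sequence
        elif a == b:
            identical += 1   # same residue in both sequences
    return identical, gapped, len(row_a)


fly = "TGASICCFQRCCKAAAHVPCGRSANWSLSEEDRKVYCHLHR"
rat = "PGATVGCCLTSCTSNYHFMCSRAKN-CVFLDDKKVYCQRHR"

identical, gapped, columns = count_columns(fly, rat)
print(f"identical={identical} gapped={gapped} columns={columns}")
```

Summing these counts over every block should reproduce the Identity and Gaps totals in the summary, assuming the page shows the complete alignment.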

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence        Domain  Region      External ID  Identity
CG5098  NP_001261066.1  PHD_SF  1068..1284  CDD:473978   70/402 (17%)
Kmt2a   XP_063121507.1  None
