DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG4049 and Smarcad1

DIOPT Version :10

Sequence 1:NP_611885.3 Gene:CG4049 / 37860 FlyBaseID:FBgn0034976 Length:1669 Species:Drosophila melanogaster
Sequence 2:NP_031984.1 Gene:Smarcad1 / 13990 MGIID:95453 Length:1021 Species:Mus musculus


Alignment Length:1227 Identity:264/1227 - (21%)
Similarity:432/1227 - (35%) Gaps:370/1227 - (30%)


- Green bases have known domain annotations that are detailed below.


  Fly     2 DNPEANMEESN-PNYDVNQRLQK---PEGWPHVNEFHPSMM----------YPD---DPRRYFHN 49
            :|.|.....:| |:.||.::.:.   ||  |..||...|:.          |.|   |......|
Mouse    43 ENAEGEGSRANTPDSDVTEKTEDSSVPE--PPDNERKASLSCFQNQRAIQEYIDLSSDTEDVSPN 105

  Fly    50 ASTDPQHQPQQHPQ----QQPEQHPEQHPQHPL---HSQQQPQQHPQQQPQQHPQQQTQQPPQLQ 107
            .|:..|.:......    .:|.:..|.|....:   :...:.:...:.:..:..:.||.:  :|.
Mouse   106 CSSTVQEKKFSKDTVIIVSEPSEDEESHDLPSVTRRNDSSELEDLSELEDLKDAKLQTLK--ELF 168

  Fly   108 PQQHPQAQIQPPIPVPISTPMEGTMEVPVKM-------PEYCPETSQPVEPDRVINPLAEVAKNT 165
            ||:.....::   .:..::.|:|.:...:.|       |.....:|...|.|  :|....|.:..
Mouse   169 PQRSDSDLLK---LIESTSTMDGAIAAALLMFGDAGGGPRKRKLSSSSEEDD--VNDDQSVKQPR 228

  Fly   166 VPYGHKRKNSAGEDQAANKKLSMAAAEEKKNSHL-RKNIRDVMNENNLDTTTLAAQREESERLAR 229
            ...|.:...||.......|:.|:....:|:..:. ::.:|:|:.|:....|      |..|.|..
Mouse   229 GDRGEESNESAEASSNWEKQESIVLKLQKEFPNFDKQELREVLKEHEWMYT------EALESLKV 287

  Fly   230 VAGQQKSMREIQKQVVH-KQIFRILQLDESEGIESCAVANPVEHHPPVDF----------PEEEI 283
            .|..|......|.:|.: |::.|  ..:.|:......:...:...|...|          |::.:
Mouse   288 FAEDQDVQCASQSEVTNGKEVAR--NQNYSKNATKIKMKQKISVKPQNGFNKKRKKNVFNPKKAV 350

  Fly   284 ASDTFDDSN--SSSLSG--GSIEDVLHKG-----------ASVQPSEVVTIDDSSDDDCILLSEE 333
            ....:|..:  .|||..  .|.|:|:..|           :|:  :|:..|...|......::|.
Mouse   351 EDSEYDSGSDAGSSLDEDYSSCEEVMEDGYKGKILHFLQVSSI--AELTLIPKCSQKKAQKITEL 413

  Fly   334 EEEEDDEDLNESDDATNSGMHVKDIYNVPD--ENGQVVVNMAHPEGEETLYLAPQIAKV------ 390
            ....:.|.|.......| |:....|:|...  :...||:.:.:...:.:..|..|:..:      
Mouse   414 RPFNNWEALFTKMSKIN-GLSEDLIWNCKTVIQERDVVIRLMNKCEDISNKLTKQVTMLTGNGGG 477

  Fly   391 --------------IKPHQIGGVRFLYDNIIESTRRYNKSSGFGCILAHSMGLGKTLQVVSFCDI 441
                          :||:|..|:.:|         ......|...|||..||||||:|.::|...
Mouse   478 WNREQPSLLNQSLSLKPYQKVGLNWL---------ALVHKHGLNGILADEMGLGKTIQAIAFLAY 533

  Fly   442 FLRHTSAKTVLCVMPINTLQNWLSEFNMWIPR------YSTDSNVRPRNFDIFVLNDQQKTLTAR 500
            ..:..:....|.|:|.:|:.|||.|.|:|.|.      |.:....:...|:|.            
Mouse   534 LFQEGNKGPHLIVVPASTIDNWLREVNLWCPSLNVLCYYGSQEERKQIRFNIH------------ 586

  Fly   501 AKVILNWVHDGGVLLIGY----------ELFRLLALKLVKTRKRKGSVIRPDGMDSSSDLMNLVY 555
                 |...|..|::..|          .|||.|.|                             
Mouse   587 -----NKYEDYNVIVTTYNCAISSSDDRSLFRRLKL----------------------------- 617

  Fly   556 EALVKPGPDLVICDEGHRIKNSHAGISLALKEIRTRRRIVLTGYPLQNNLLEYWCMVDFVRPNYL 620
                    :..|.||||.:||..:.....|..|..|.|::|||.|:||||||...:::||.|:..
Mouse   618 --------NYAIFDEGHMLKNMGSIRYQHLMTINARNRLLLTGTPVQNNLLELMSLLNFVMPHMF 674

  Fly   621 GTRT-EFCNMFERPIQNGQCVDSTPDDIKLMRYRAHVLHS--LLLGFVQRR-SHTVLQLTLPQKY 681
            .:.| |...||..        .:.|.|.:.:..:..:.|:  ::..|:.|| ...||:| ||.|.
Mouse   675 SSSTSEIRRMFSS--------KTKPADEQSIYEKERIAHAKQIIKPFILRRVKEEVLKL-LPPKK 730

  Fly   682 EYVILVKMTAFQRKLYDTFMTDVVRTKAFPNPLKAFAVCCKIWNHPDVLYNFLKKCETDLDLEID 746
            :.:.|..|:..|.:||..                              |:|.|||          
Mouse   731 DRIELCAMSEKQEQLYSG------------------------------LFNRLKK---------- 755

  Fly   747 EEVTKGAATPIVEPSADSSLSLASPLEKKINGSGDPINSIETFSKAENQTLFNIPASSDLNAKYL 811
                                               .||::|     :|..:.|:.......|.:.
Mouse   756 -----------------------------------SINNLE-----KNTEMCNVMMQLRKMANHP 780

  Fly   812 NKSPSFYDEKPEPLNYGSFGSEGKNNYWMDSSILPKPGCVEVIKQTDTNMSSNFESITGSSEIVD 876
            .....:|  .||.|..            |...:|.:|          |:..:|.:.|....|:: 
Mouse   781 LLHRQYY--TPEKLKE------------MSQLMLKEP----------THCEANPDLIFEDMEVM- 820

  Fly   877 LDTNEIKTVETTIQAPCSNNQLDNGCNAGKPSEWNAAGSKNSSGVAAAEPFKKLLKSKQRNEEFS 941
                    .:..:...|...|..|                                         
Mouse   821 --------TDFELHVLCKQYQHIN----------------------------------------- 836

  Fly   942 CSWAVDLMKNYVSGLISNSPKMEIFFCILKESLNLGDRILLFSQSLLTLNLLEVYLKSSYVPGSN 1006
             |:.:|:      .||.:|.|.....|||.|....|||::||||..:.|::|||.||.       
Mouse   837 -SYQLDM------DLILDSGKFRALGCILSELKQKGDRVVLFSQFTMMLDILEVLLKH------- 887

  Fly  1007 QLWTKNSSYFRLDGSTSSQERERLVNEFNANSNVKLFLISTRAGSLGINLTGANRVIIFDASWNP 1071
                ....|.||||.|...||..|::|||.:.::.:||:||:||.||||||.||.||:.|...||
Mouse   888 ----HQHRYLRLDGKTQISERIHLIDEFNTDMDIFVFLLSTKAGGLGINLTSANVVILHDIDCNP 948

  Fly  1072 CHDTQAVYRIYRYGQTKPCFVYRIVMDRCLEK---KIYDRQIK-KQGMSDRIVDECN 1124
            .:|.||..|.:|.||||...|.:::....:|:   ||..:::| :|.|:  .|||.:
Mouse   949 YNDKQAEDRCHRVGQTKEVLVIKLISQGTIEESMLKINQQKLKLEQDMT--TVDEAD 1003

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG4049NP_611885.3 DEXHc_ARIP4 391..653 CDD:350827 70/278 (25%)
HepA <392..1121 CDD:440319 180/752 (24%)
Smarcad1NP_031984.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..82 12/40 (30%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 124..151 3/26 (12%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 201..246 9/46 (20%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 329..366 6/36 (17%)
HepA <480..992 CDD:440319 177/755 (23%)
DEGH box 623..626 2/2 (100%)
Nuclear localization signal. /evidence=ECO:0000255 716..733 8/17 (47%)
DEAD box 1000..1003 2/2 (100%)

Return to query results.
Submit another query.