DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Mhc and CG31638

DIOPT Version :10

Sequence 1:NP_523587.4 Gene:Mhc / 35007 FlyBaseID:FBgn0264695 Length:1962 Species:Drosophila melanogaster
Sequence 2:NP_723186.2 Gene:CG31638 / 33910 FlyBaseID:FBgn0051638 Length:704 Species:Drosophila melanogaster


Alignment Length:684 Identity:148/684 - (21%)
Similarity:283/684 - (41%) Gaps:144/684 - (21%)


- Green bases have known domain annotations that are detailed below.


  Fly  1345 QVEEEAEGKADLQRQLSKANAEAQV--------------WRSKYESDGVARSEELEEAKRKLQAR 1395
            :.|.||. ::..||:|.:|.|.|..              ||.|:......|::..||:|:     
  Fly    40 ETEWEAR-ESQRQRELHEARARAAQMEKTMKWWSDCTANWREKWSKVRNERNKAREESKQ----- 98

  Fly  1396 LAEAEETIESLNQKCIGLEKTKQRLSTEVEDLQLEVDRANAIANAAEKKQKAFDKIIGEWKLKVD 1460
                      |:.|..|:.|....|..|..||:|::.:   :....||......|..|::...  
  Fly    99 ----------LSLKLDGVMKEAHSLKREKNDLELQITQ---LKKEMEKVHTLMMKHAGQFHRA-- 148

  Fly  1461 DLAAELDASQKECRNYSTELFRLKGAYEEGQEQLEAVRREN---KNLADEVKDL-LDQIGEGG-- 1519
            |.:.:.:|:.::. |.|.::         ..:.|:.:..|:   ..|.::|||| :::....|  
  Fly   149 DTSEDAEANGRDA-NCSPDI---------SSDGLKNINSEDGLVTKLPNDVKDLDIEEFAMKGAM 203

  Fly  1520 -RNIHEIEKA----RKRL--EAEKDE------------LQAALEEAEAALEQEENKVLRAQLELS 1565
             :::.|:::|    .|||  :..||:            ||..|:||:..|:.|.::.|.....:.
  Fly   204 PKHLTELDEAAAAEEKRLIQQLSKDDFDEDYLLQKISMLQLRLDEAQKTLQAERDEKLELHKSIE 268

  Fly  1566 QVRQEIDRRIQEKEEEFENTRKNHQRALDSMQASLEAEAK----GKAEALRMKKKLEADINELEI 1626
            ::..|| :.::.::||..:.::...|.|.::|....||.:    ...|.:..::.||..:.||..
  Fly   269 KLTLEI-QDVRGRQEEMRSAKQEAVRELLTLQEQHRAEMRIVNNSLQEEIAARENLERRLTELRT 332

  Fly  1627 ALDHANKANAEAQKNIKRYQQQLKDIQTALE-EEQRARDDAREQLGISERRANALQ-NELEESRT 1689
            .|:|....||.....    :::|:..:.|:| :.::.|.:.|:....|:|:...:| |::|    
  Fly   333 ELEHLQAENASEWGK----RERLESEKLAMERDNKKLRAELRDYQERSDRKCRPMQANDVE---- 389

  Fly  1690 LLEQADRGRRQAEQELADAHEQLNEVSAQNASIS------------AAKR--KLESELQTLHSDL 1740
                    .|..:|||::.:::::||...:|.:.            |.:|  :.|:|::.|...:
  Fly   390 --------LRALQQELSERNKEISEVKMSHAKLKKLLAETNTELGHAVRRAEQYEAEVKRLRQRV 446

  Fly  1741 DELLNEAKNSEEKAKKAMVDAARLA---DELRAEQDHAQTQEKLRKALEQQIKELQVRLDEAEAN 1802
            :||..|...:|::...|:....||.   |||..:.:          .|:.||:.||.|  .|.:.
  Fly   447 EELKRELAGAEDELDSAVNQVRRLQRSNDELVGQTE----------GLQVQIQHLQNR--RAPSP 499

  Fly  1803 ALKGGKKAIQKLEQRVRELENE----LDGEQRRHADAQKNLRKSERRVKELSFQSEEDRKNHERM 1863
            .|: |...:|...:...||.|:    ::..::...|:|..||.|..........:...:::....
  Fly   500 QLR-GMGGVQLRNKIAVELPNDCLPNINDLRQIFDDSQAGLRSSHNGSDAAMHHAASVKRSSHTE 563

  Fly  1864 QDLVDKLQQKIKTYKRQIEEAEEIAALNLAKFRKAQQELEEAEERADLAEQAISKF--------- 1919
            :.|:.  ||:..........|...||...||....::.:.|::.|....|:|..||         
  Fly   564 RTLLQ--QQQSSAAASAAAAAAAAAAFFDAKPTHLEENIFESKSRNLEFERAKQKFDNPHGQHHH 626

  Fly  1920 ------RAKGRAGSVGRGASPAPRATSVRPQFDG 1947
                  ...||.||.|..||.|..|..::...:|
  Fly   627 HHQRQRSGNGRYGSAGSSASRANLALPLKGTTNG 660

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
MhcNP_523587.4 Myosin_N 33..75 CDD:460670
MYSc_Myh1_insects_crustaceans 100..765 CDD:276874
Myosin_tail_1 842..1922 CDD:460256 138/657 (21%)
CG31638NP_723186.2 Smc <233..>496 CDD:440809 64/291 (22%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.