DRSC/TRiP Functional Genomics Resources

Protein Alignment Mhc and Myh9

DIOPT Version: 9

Sequence 1: NP_001162991.1  Gene: Mhc / 35007  FlyBaseID: FBgn0264695  Length: 1962  Species: Drosophila melanogaster
Sequence 2: NP_037326.2  Gene: Myh9 / 25745  RGDID: 3140  Length: 1960  Species: Rattus norvegicus


Alignment Length: 1915  Identity: 783/1915 (40%)
Similarity: 1172/1915 (61%)  Gaps: 36/1915 (1%)
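
The percentages in the summary can be reproduced from the raw counts. The sketch below assumes the page truncates to a whole percent rather than rounding, which is an inference from the numbers (783/1915 is about 40.9% but is shown as 40%), not something the page states. The helper name `percent_floor` is hypothetical.

```python
# Counts copied from the alignment summary: (numerator, alignment length).
stats = {
    "identity": (783, 1915),
    "similarity": (1172, 1915),
    "gaps": (36, 1915),
}

def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed display convention)."""
    return (100 * count) // total

percents = {name: percent_floor(n, d) for name, (n, d) in stats.items()}

for name, value in percents.items():
    n, d = stats[name]
    print(f"{name}: {n}/{d} ({value}%)")
```

Integer arithmetic is used so the result does not depend on floating-point rounding.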


- Green bases have known domain annotations that are detailed below.


  Fly    34 SKKSCWIPDEKEGYLLGEIKATKGDIVSVGL-QGGEVRDIKSEKVEKVNPPKFEKIEDMADMTVL 97
            :||..|:|..|.|:....:|...|:...|.| :.|:...:..:.::|:|||||.|:||||::|.|
  Rat    28 AKKLVWVPSTKNGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELTCL 92

  Fly    98 NTPCVLHNLRQRYYAKLIYTYSGLFCVAINPYKRYPVYTNRCAKMYRGKRRNEVPPHIFAISDGA 162
            |...|||||::|||:.|||||||||||.|||||..|:|:.....||:||:|:|:||||:||:|.|
  Rat    93 NEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVDMYKGKKRHEMPPHIYAITDTA 157

  Fly   163 YVDMLTNHVNQSMLITGESGAGKTENTKKVIAYFATVGASKKTDEAAKSKGSLEDQVVQTNPVLE 227
            |..|:.:..:||:|.||||||||||||||||.|.|.|.:|.|   :.|.:|.||.|::|.||:||
  Rat   158 YRSMMQDREDQSILCTGESGAGKTENTKKVIQYLAHVASSHK---SKKDQGELERQLLQANPILE 219

  Fly   228 AFGNAKTVRNDNSSRFGKFIRIHFGPTGKLAGADIETYLLEKARVISQQSLERSYHIFYQIMSGS 292
            ||||||||:|||||||||||||:|...|.:.||:||||||||:|.|.|...||::||||.::||:
  Rat   220 AFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQAKEERTFHIFYYLLSGA 284

  Fly   293 VPGVKDICLLTDNIYDYHIVSQGKVTVASIDDAEEFSLTDQAFDILGFTKQEKEDVYRITAAVMH 357
            ...:| ..||.:....|..:|.|.||:....|.:.|..|.:|..|:|..:.|:..:.|:.:.|:.
  Rat   285 GEHLK-TDLLLEPYNKYRFLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEDEQMGLLRVISGVLQ 348

  Fly   358 MGGMKFKQRGREEQAEQDGEEEGGRVSKLFGCDTAELYKNLLKPRIKVGNEFVTQGRNVQQVTNS 422
            :|.:.||:....:||.........:||.|.|.:..:..:.:|.||||||.::|.:.:..:|...:
  Rat   349 LGNIVFKKERNTDQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQKAQTKEQADFA 413

  Fly   423 IGALCKGVFDRLFKWLVKKCNETLD-TQQKRQHFIGVLDIAGFEIFEYNGFEQLCINFTNEKLQQ 486
            |.||.|..::|:|:|||.:.|:.|| |:::...|||:|||||||||:.|.|||||||:|||||||
  Rat   414 IEALAKATYERMFRWLVLRINKALDKTKRQGASFIGILDIAGFEIFDLNSFEQLCINYTNEKLQQ 478

  Fly   487 FFNHHMFVLEQEEYKREGIDWAFIDFGMDLLACIDLIEKPM---GILSILEEESMFPKATDQTFS 548
            .|||.||:||||||:||||:|.|||||:||..||||||||.   |||::|:||..||||||::|.
  Rat   479 LFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDEECWFPKATDKSFV 543

  Fly   549 EKLTNTHLGKSAPFQKPKPPKPGQQAAHFAIAHYAGCVSYNITGWLEKNKDPLNDTVVDQFKKSQ 613
            ||:.... |....|||||..|   ..|.|.|.||||.|.|....||.||.|||||.:.....:|.
  Rat   544 EKVVQEQ-GTHPKFQKPKQLK---DKADFCIIHYAGKVDYKADEWLMKNMDPLNDNIATLLHQSS 604

  Fly   614 NKLLIEIFAD---------HAGQSGGGEQAKGGRGK-KGGGFATVSSAYKEQLNSLMTTLRSTQP 668
            :|.:.|::.|         .||.|   |.|..|..| :.|.|.||...|||||..||.|||:|.|
  Rat   605 DKFVSELWKDVDRIIGLDQVAGMS---ETALPGAFKTRKGMFRTVGQLYKEQLAKLMATLRNTNP 666

  Fly   669 HFVRCIIPNEMKQPGVVDAHLVMHQLTCNGVLEGIRICRKGFPNRMMYPDFKMRYKIMCPKLL-Q 732
            :||||||||..|:.|.:|.|||:.||.||||||||||||:|||||:::.:|:.||:|:.|..: :
  Rat   667 NFVRCIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQRYEILTPNSIPK 731

  Fly   733 GVEKDKKATEIIIKFIDLPEDQYRLGNTKVFFRAGVLGQMEEFRDERLGKIMSWMQAWARGYLSR 797
            |....|:|..::||.::|..:.||:|.:|||||||||..:||.||.::..::...||..||||:|
  Rat   732 GFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQACCRGYLAR 796

  Fly   798 KGFKKLQEQRVALKVVQRNLRKYLQLRTWPWYKLWQKVKPLLNVSRIEDEIARLEEKAKKAEELH 862
            |.|.|.|:|..|:||:|||...||:||.|.|::|:.|||||||..|.|||:...|.:..|..|.|
  Rat   797 KAFAKRQQQLTAMKVLQRNCAAYLRLRNWQWWRLFTKVKPLLNSIRHEDELLAKEAELTKVREKH 861

  Fly   863 AAEVKVRKELEALNAKLLAEKTALLDSLSGEKGALQDYQERNAKLTAQKNDLENQLRDIQERLTQ 927
            .|......|:|.:.::|:|||..|.:.|..|.....:.:|..|:|||:|.:||....|::.|:.:
  Rat   862 LAAENRLTEMETMQSQLMAEKLQLQEQLQAETELCAEAEELRARLTAKKQELEEICHDLEARVEE 926

  Fly   928 EEDARNQLFQQKKKADQEISGLKKDIEDLELNVQKAEQDKATKDHQIRNL-NDEIAHQDELINKL 991
            ||:....|..:|||..|.|..|::.:|:.|...||.:.:|.|.:.:::.| .|:|..:|:.. ||
  Rat   927 EEERCQYLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTTEAKLKKLEEDQIIMEDQNC-KL 990

  Fly   992 NKEKKMQGETNQKTGEELQAAEDKINHLNKVKAKLEQTLDELEDSLEREKKVRGDVEKSKRKVEG 1056
            .||||:..:...:....|...|:|...|.|:|.|.|..:.:||:.|.||:|.|.::||::||:||
  Rat   991 AKEKKLLEDRVAEFTTNLMEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEG 1055

  Fly  1057 DLKLTQEAVADLERNKKELEQTIQRKDKELSSITAKLEDEQVVVLKHQRQIKELQARIEELEEEV 1121
            |.....:.:|:|:....||:..:.:|::||.:..|::|:|........::|:||:.:|.||:|::
  Rat  1056 DSTDLSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQKNMALKKIRELETQISELQEDL 1120

  Fly  1122 EAERQARAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAELSKLRRDLEEANIQHES 1186
            |:||..|.|||||:.||..|||.|...||:...:|:||.||..|||.|:|.|::.||:....||:
  Rat  1121 ESERACRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVSILKKTLEDEAKTHEA 1185

  Fly  1187 TLANLRKKHNDAVAEMAEQVDQLNKLKAKAEKEKNEYYGQLNDLRAGVDHITNEKAAQEKIAKQL 1251
            .:..:|:||:.||.|:|||::|..::||..||.|.....:..:|...|..:...|...|...|::
  Rat  1186 QIQEMRQKHSQAVEELAEQLEQTKRVKATLEKAKQTLENERGELANEVKALLQGKGDSEHKRKKV 1250

  Fly  1252 QHTLNEVQSKLDETNRTLNDFDASKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKR 1316
            :..|.|:|.|..|..|...:......||.:|...:...|.:::|:.|:|:|...:|.:||:||:.
  Rat  1251 EAQLQELQVKFSEGERVRTELADKVSKLQVELDSVTGLLNQSDSKSSKLTKDFSALESQLQDTQE 1315

  Fly  1317 LADEESRERATLLGKFRNLEHDLDNLREQVEEEAEGKADLQRQLSKANAEAQVWRSKYESDGVAR 1381
            |..||:|::.:|..|.:.:|.:.::.|||:|||.|.|.:|::|::..:|:....:.|.| |||..
  Rat  1316 LLQEENRQKLSLSTKLKQMEDEKNSFREQLEEEEEAKRNLEKQIATLHAQVTDMKKKME-DGVGC 1379

  Fly  1382 SEELEEAKRKLQARLAEAEETIESLNQKCIGLEKTKQRLSTEVEDLQLEVDRANAIANAAEKKQK 1446
            .|..|||||:||..|....:.:|........|||||.||..|::||.:::|......:..|||||
  Rat  1380 LETAEEAKRRLQKDLEGLSQRLEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSVSNLEKKQK 1444

  Fly  1447 AFDKIIGEWKLKVDDLAAELDASQKECRNYSTELFRLKGAYEEGQEQLEAVRRENKNLADEVKDL 1511
            .||:::.|.|......|.|.|.::.|.|...|:...|..|.||..||...:.|.||....|::||
  Rat  1445 KFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDL 1509

  Fly  1512 LDQIGEGGRNIHEIEKARKRLEAEKDELQAALEEAEAALEQEENKVLRAQLELSQVRQEIDRRIQ 1576
            :....:.|:::||:||:::.||.:.:|::..|||.|..|:..|:..||.::.|..::.:.:|.:|
  Rat  1510 MSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQ 1574

  Fly  1577 EKEEEFENTRKNHQRALDSMQASLEAEAKGKAEALRMKKKLEADINELEIALDHANKANAEAQKN 1641
            .::|:.|..:|...|.:..|:|.||.|.|.::.|:..:||||.|:.:||..:|.|||...||.|.
  Rat  1575 GRDEQSEEKKKQLVRQVREMEAELEDERKQRSIAMAARKKLEMDLKDLEAHIDTANKNREEAIKQ 1639

  Fly  1642 IKRYQQQLKDIQTALEEEQRARDDAREQLGISERRANALQNELEESRTLLEQADRGRRQAEQELA 1706
            :::.|.|:||....|::.:.:|::...|...:|::..:::.|:.:.:..|..|:|.:|||:||..
  Rat  1640 LRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERD 1704

  Fly  1707 DAHEQLNEVSAQNASISAAKRKLESELQTLHSDLDELLNEAKNSE---EKAKKAMVDAARLADEL 1768
            :..:::...|.:.|.....||:||:.:..|..:|:|   |..|:|   ::.|||.:...::..:|
  Rat  1705 ELADEIANSSGKGALALEEKRRLEARIAQLEEELEE---EQGNTELINDRLKKANLQIDQINTDL 1766

  Fly  1769 RAEQDHAQTQEKLRKALEQQIKELQVRLDEAEANALKGGKKAIQKLEQRVRELENELDGEQRRHA 1833
            ..|:.|||..|..|:.||:|.|||:.:|.|.|:......|.:|..||.::.:||.:||.|.:...
  Rat  1767 NLERSHAQKNENARQQLERQNKELKAKLQEMESAVKSKYKASIAALEAKIAQLEEQLDNETKERQ 1831

  Fly  1834 DAQKNLRKSERRVKELSFQSEEDRKNHERMQDLVDKLQQKIKTYKRQIEEAEEIAALNLAKFRKA 1898
            .|.|.:|::|:::|::..|.|::|:|.|:.:|..||...::|..|||:|||||.|....|..||.
  Rat  1832 AASKQVRRAEKKLKDVLLQVEDERRNAEQFKDQADKASTRLKQLKRQLEEAEEEAQRANASRRKL 1896

  Fly  1899 QQELEEAEERADLAEQAISKFRAKGRAGSV 1928
            |:|||:|.|.||...:.:|..:.|.|.|.:
  Rat  1897 QRELEDATETADAMNREVSSLKNKLRRGDM 1926
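
Each alignment block above is a triplet: the Fly row, a markup row, and the Rat row. As a minimal sketch, the block for Fly 34..97 can be tallied column by column. It assumes the common pairwise-alignment convention that `|` marks an identical pair, `:` a similar pair, `.` a weaker match, and a space a mismatch or gap; the page itself does not define these symbols, and `summarize_block` is a hypothetical helper.

```python
from collections import Counter

# First alignment block (Fly 34..97 vs Rat 28..92), with the row labels
# and coordinates stripped so that all three strings share the same columns.
fly = "SKKSCWIPDEKEGYLLGEIKATKGDIVSVGL-QGGEVRDIKSEKVEKVNPPKFEKIEDMADMTVL"
mid = ":||..|:|..|.|:....:|...|:...|.| :.|:...:..:.::|:|||||.|:||||::|.|"
rat = "AKKLVWVPSTKNGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELTCL"

def summarize_block(seq1: str, markup: str, seq2: str) -> dict:
    """Count columns, identities, and gaps in one aligned block."""
    if not (len(seq1) == len(markup) == len(seq2)):
        raise ValueError("rows must have the same number of columns")
    marks = Counter(markup)
    pairs = list(zip(seq1, seq2))
    return {
        "columns": len(markup),
        # Identity computed directly from the residues ...
        "identical": sum(a == b and a != "-" for a, b in pairs),
        # ... and as reported by the markup row (assumed meaning of '|').
        "marked_identical": marks["|"],
        # Assumption: '|' and ':' both count toward "Similarity".
        "marked_similar": marks["|"] + marks[":"],
        "gaps": sum(a == "-" or b == "-" for a, b in pairs),
    }

summary = summarize_block(fly, mid, rat)
print(summary)
```

Slice the rows by column position rather than stripping whitespace: a leading space in the markup row is a real mismatch column.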

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain                         Region       External ID  Identity
Mhc   NP_001162991.1  Myosin_N                       36..73       CDD:280832   10/37 (27%)
                      MYSc                           81..777      CDD:214580   364/710 (51%)
                      MYSc_Myh1_insects_crustaceans  100..765     CDD:276874   343/679 (51%)
                      Myosin_tail_1                  842..1897    CDD:279860   359/1058 (34%)
                      V_Alix_like                    1557..1877   CDD:187408   101/322 (31%)
                      ATP-synt_B                     <1671..1771  CDD:304375   26/102 (25%)
                      TMPIT                          1739..>1825  CDD:285135   30/88 (34%)
Myh9  NP_037326.2     Myosin_N                       29..68       CDD:397036   11/38 (29%)
                      Motor_domain                   95..764      CDD:419868   343/679 (51%)
                      Myosin_tail_1                  842..1921    CDD:396244   370/1083 (34%)


Information from Original Tools:


                                              Original Tool Information
Tool            Simple Score  Weighted Score  BLAST Result Score  Score Type  Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           -                   -           E2759_KOG0161
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           -                   -           D47111at2759
OrthoFinder     1             1.000           -                   -           FOG0000014
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        1             0.900           -                   -           OOG6_100110
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           -                   -
SonicParanoid   1             1.000           -                   -           X58
SwiftOrtho      1             1.000           -                   -
TreeFam         0             0.000           Not matched by this tool.
Total           7             6.720
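
The closing figures of the table are consistent with simple column sums: 7 tools report a match, and their weighted scores add up to 6.720. The sketch below recomputes both from the per-tool rows. Treating the last row as column totals is an inference from the arithmetic, not something the page states. Tools listed as "Not matched by this tool." contribute 0 to both sums and are omitted.

```python
from decimal import Decimal

# (simple score, weighted score) for each tool that reported a match,
# copied from the table. Weighted scores are kept as strings so Decimal
# can sum them exactly.
matched = {
    "eggNOG": (1, "0.900"),
    "OrthoDB": (1, "1.010"),
    "OrthoFinder": (1, "1.000"),
    "orthoMCL": (1, "0.900"),
    "Phylome": (1, "0.910"),
    "SonicParanoid": (1, "1.000"),
    "SwiftOrtho": (1, "1.000"),
}

total_simple = sum(simple for simple, _ in matched.values())
total_weighted = sum(
    (Decimal(weighted) for _, weighted in matched.values()), Decimal("0")
)

print(f"tools matched: {total_simple}")
print(f"weighted total: {total_weighted}")
```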
