DRSC/TRiP Functional Genomics Resources


Protein Alignment Mhc and Myh4

DIOPT Version: 9

Sequence 1: NP_001162991.1  Gene: Mhc / 35007  FlyBaseID: FBgn0264695  Length: 1962  Species: Drosophila melanogaster
Sequence 2: NP_062198.1  Gene: Myh4 / 360543  RGDID: 3139  Length: 1939  Species: Rattus norvegicus


Alignment Length: 1922  Identity: 1070/1922 (55%)
Similarity: 1433/1922 (74%)  Gaps: 13/1922 (0%)
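
The percentages in the summary follow directly from the raw counts. The sketch below is a minimal Python check, assuming the page truncates to whole percents; that is an inference from 1070/1922 = 55.67% being shown as 55%, not documented behavior. The helper name `percent_floor` is hypothetical.

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumption: matches the summary line)."""
    return (100 * count) // total

# Counts copied from the alignment summary.
alignment_length = 1922
counts = {"identity": 1070, "similarity": 1433, "gaps": 13}

percentages = {name: percent_floor(n, alignment_length) for name, n in counts.items()}
print(percentages)  # {'identity': 55, 'similarity': 74, 'gaps': 0}
```

The per-domain identities listed under Known Domains (for example 442/703 shown as 63%) look rounded rather than truncated, so this helper is meant for the summary line only.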


Residues shown in green have known domain annotations, which are detailed below.


  Fly    12 DPTPYLFVSLEQRRIDQSKPYDSKKSCWIPDEKEGYLLGEIKATKGDIVSVGLQGGEVRDIKSEK 76
            :..|||..|.::|...|:||:|:|.|.::.|.||.|:...:::.:|..|:...:||....:|.::
  Rat    12 EAAPYLRKSEKERIEAQNKPFDAKSSVFVVDAKESYVKATVQSREGGKVTAKTEGGATVTVKEDQ 76

  Fly    77 VEKVNPPKFEKIEDMADMTVLNTPCVLHNLRQRYYAKLIYTYSGLFCVAINPYKRYPVYTNRCAK 141
            |..:||||::||||||.||.|:.|.||:||::||.|.:||||||||||.:||||..|||......
  Rat    77 VFSMNPPKYDKIEDMAMMTHLHEPAVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVA 141

  Fly   142 MYRGKRRNEVPPHIFAISDGAYVDMLTNHVNQSMLITGESGAGKTENTKKVIAYFATVGAS--KK 204
            .||||:|.|.|||||:|||.||..|||:..|||:|||||||||||.|||:||.||||:..:  ||
  Rat   142 AYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGESGAGKTVNTKRVIQYFATIAVTGDKK 206

  Fly   205 TDEA--AKSKGSLEDQVVQTNPVLEAFGNAKTVRNDNSSRFGKFIRIHFGPTGKLAGADIETYLL 267
            .:||  .|.:|:||||::..||:|||||||||||||||||||||||||||.|||||.||||||||
  Rat   207 KEEAPSGKMQGTLEDQIISANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGATGKLASADIETYLL 271

  Fly   268 EKARVISQQSLERSYHIFYQIMSGSVPGVKDICLLTDNIYDYHIVSQGKVTVASIDDAEEFSLTD 332
            ||:||..|...|||||||||:||...|.:.::.|:|.|.||:..||||::||.||||.||...||
  Rat   272 EKSRVTFQLKAERSYHIFYQVMSNKKPELIEMLLITTNPYDFAYVSQGEITVPSIDDQEELMATD 336

  Fly   333 QAFDILGFTKQEKEDVYRITAAVMHMGGMKFKQRGREEQAEQDGEEEGGRVSKLFGCDTAELYKN 397
            .|.||||||..||..:|::|.||||.|.|||||:.||||||.||.|...:.:.|...::|:|.|.
  Rat   337 TAVDILGFTADEKVAIYKLTGAVMHYGNMKFKQKQREEQAEPDGTEVADKAAYLTSLNSADLLKA 401

  Fly   398 LLKPRIKVGNEFVTQGRNVQQVTNSIGALCKGVFDRLFKWLVKKCNETLDTQQKRQHFIGVLDIA 462
            |..||:|||||:||:|:.||||.||:|||.|.:::::|.|:|.:.|:.|||:|.||:||||||||
  Rat   402 LCYPRVKVGNEYVTKGQTVQQVYNSVGALAKAMYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIA 466

  Fly   463 GFEIFEYNGFEQLCINFTNEKLQQFFNHHMFVLEQEEYKREGIDWAFIDFGMDLLACIDLIEKPM 527
            |||||::|..|||||||||||||||||||||||||||||:|||:|.||||||||.|||:||||||
  Rat   467 GFEIFDFNTLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEFIDFGMDLAACIELIEKPM 531

  Fly   528 GILSILEEESMFPKATDQTFSEKLTNTHLGKSAPFQKPKPPKPGQQAAHFAIAHYAGCVSYNITG 592
            ||.||||||.|||||||.:|..||...|||||..||||||.| |:..|||::.||||.|.|||.|
  Rat   532 GIFSILEEECMFPKATDTSFKNKLYEQHLGKSNNFQKPKPAK-GKAEAHFSLVHYAGTVDYNIIG 595

  Fly   593 WLEKNKDPLNDTVVDQFKKSQNKLLIEIFA-DHAGQSGGGEQAKGGRGKKGGGFATVSSAYKEQL 656
            ||:|||||||:|||..::||..|.|..:|: ..|.::.||...|||: |||..|.|||:.::|.|
  Rat   596 WLDKNKDPLNETVVGLYQKSGLKTLAFLFSGGQAAEAEGGGGKKGGK-KKGSSFQTVSALFRENL 659

  Fly   657 NSLMTTLRSTQPHFVRCIIPNEMKQPGVVDAHLVMHQLTCNGVLEGIRICRKGFPNRMMYPDFKM 721
            |.|||.|:||.||||||:||||.|.||.::..||:|||.||||||||||||||||:|::|.|||.
  Rat   660 NKLMTNLKSTHPHFVRCLIPNETKTPGAMEHELVLHQLRCNGVLEGIRICRKGFPSRILYADFKQ 724

  Fly   722 RYKIMCPKLL---QGVEKDKKATEIIIKFIDLPEDQYRLGNTKVFFRAGVLGQMEEFRDERLGKI 783
            |||::....:   |.:: .|||:|.::..||:...||:.|:|||||:||:||.:||.|||:|.::
  Rat   725 RYKVLNASAIPEGQFID-SKKASEKLLGSIDIDHTQYKFGHTKVFFKAGLLGTLEEMRDEKLAQL 788

  Fly   784 MSWMQAWARGYLSRKGFKKLQEQRVALKVVQRNLRKYLQLRTWPWYKLWQKVKPLLNVSRIEDEI 848
            ::..||..||||.|..|:|:.|:|.::..:|.|:|.::.::.|||.||:.|:||||..:..|.|:
  Rat   789 ITRTQAVCRGYLMRVEFRKMMERRESIFCIQYNVRAFMNVKHWPWMKLYFKIKPLLKSAETEKEM 853

  Fly   849 ARLEEKAKKA-EELHAAEVKVRKELEALNAKLLAEKTALLDSLSGEKGALQDYQERNAKLTAQKN 912
            |.::|..:|| |:|..:|.| |||||.....|:.||..|...:..|...|.|.:||..:|...|.
  Rat   854 ATMKEDFEKAKEDLAKSEAK-RKELEEKMVALMQEKNDLQLQVQAEADGLADAEERCDQLIKTKI 917

  Fly   913 DLENQLRDIQERLTQEEDARNQLFQQKKKADQEISGLKKDIEDLELNVQKAEQDKATKDHQIRNL 977
            .||.:::::.||...||:...:|..:|:|.:.|.|.|||||:||||.:.|.|::|...:::::||
  Rat   918 QLEAKIKELTERAEDEEEINAELTAKKRKLEDECSELKKDIDDLELTLAKVEKEKHATENKVKNL 982

  Fly   978 NDEIAHQDELINKLNKEKKMQGETNQKTGEELQAAEDKINHLNKVKAKLEQTLDELEDSLEREKK 1042
            .:|:|..||.|.||.||||...|.:|:|.::|||.|||:|.|.|.|.||||.:|:||.|||:|||
  Rat   983 TEEMAGLDENIVKLTKEKKALQEAHQQTLDDLQAEEDKVNTLTKAKTKLEQQVDDLEGSLEQEKK 1047

  Fly  1043 VRGDVEKSKRKVEGDLKLTQEAVADLERNKKELEQTIQRKDKELSSITAKLEDEQVVVLKHQRQI 1107
            :|.|:|::|||:||||||.||:..|:|.:|::|::.:::|:.|:|::.:|:||||.:.::.|::|
  Rat  1048 LRMDLERAKRKLEGDLKLAQESTMDIENDKQQLDEKLKKKEFEMSNLQSKIEDEQALGMQLQKKI 1112

  Fly  1108 KELQARIEELEEEVEAERQARAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAELSK 1172
            |||||||||||||:||||.:||||||||:||:|||||:.|||||||||||||||:|||||||..|
  Rat  1113 KELQARIEELEEEIEAERASRAKAEKQRSDLSRELEEISERLEEAGGATSAQIEMNKKREAEFQK 1177

  Fly  1173 LRRDLEEANIQHESTLANLRKKHNDAVAEMAEQVDQLNKLKAKAEKEKNEYYGQLNDLRAGVDHI 1237
            :|||||||.:|||:|.|.|||||.|:|||:.||:|.|.::|.|.||||:|...:::||.:.::.:
  Rat  1178 MRRDLEEATLQHEATAAALRKKHADSVAELGEQIDNLQRVKQKLEKEKSELKMEIDDLASNMETV 1242

  Fly  1238 TNEKAAQEKIAKQLQHTLNEVQSKLDETNRTLNDFDASKKKLSIENSDLLRQLEEAESQVSQLSK 1302
            :..|...||:.:.|:..|:||::|.:|..|.:|:..|.|.:|..|:.:..|||:|.::.|||||:
  Rat  1243 SKAKGNLEKMCRTLEDQLSEVKTKEEEQQRLINELSAQKARLHTESGEFSRQLDEKDAMVSQLSR 1307

  Fly  1303 IKISLTTQLEDTKRLADEESRERATLLGKFRNLEHDLDNLREQVEEEAEGKADLQRQLSKANAEA 1367
            .|.:.|.|:|:.||..:|||:.:..|....::..||.|.||||.|||.|.||:|||.:||||:|.
  Rat  1308 GKQAFTQQIEELKRQLEEESKAKNALAHALQSARHDCDLLREQYEEEQEAKAELQRAMSKANSEV 1372

  Fly  1368 QVWRSKYESDGVARSEELEEAKRKLQARLAEAEETIESLNQKCIGLEKTKQRLSTEVEDLQLEVD 1432
            ..||:|||:|.:.|:|||||||:||..||.:|||.:|::|.||..||||||||..|||||.::|:
  Rat  1373 AQWRTKYETDAIQRTEELEEAKKKLAQRLQDAEEHVEAVNSKCASLEKTKQRLQNEVEDLMIDVE 1437

  Fly  1433 RANAIANAAEKKQKAFDKIIGEWKLKVDDLAAELDASQKECRNYSTELFRLKGAYEEGQEQLEAV 1497
            |:||...|.:|||:.|||::.|||.|.::..|||:|||||.|:.|||||::|.||||..:|||.:
  Rat  1438 RSNAACAALDKKQRNFDKVLAEWKQKYEETQAELEASQKESRSLSTELFKVKNAYEESLDQLETL 1502

  Fly  1498 RRENKNLADEVKDLLDQIGEGGRNIHEIEKARKRLEAEKDELQAALEEAEAALEQEENKVLRAQL 1562
            :||||||..|:.||.:||.|||::|||:||.:|:::.||.||||:||||||:||.||.|:||.||
  Rat  1503 KRENKNLQQEISDLTEQIAEGGKHIHELEKIKKQIDQEKSELQASLEEAEASLEHEEGKILRIQL 1567

  Fly  1563 ELSQVRQEIDRRIQEKEEEFENTRKNHQRALDSMQASLEAEAKGKAEALRMKKKLEADINELEIA 1627
            ||:||:.||||:|.||:||.:..::||.|.::|||::|:||.:.:.:|||:|||:|.|:||:||.
  Rat  1568 ELNQVKSEIDRKIAEKDEEIDQLKRNHLRVVESMQSTLDAEIRSRNDALRIKKKMEGDLNEMEIQ 1632

  Fly  1628 LDHANKANAEAQKNIKRYQQQLKDIQTALEEEQRARDDAREQLGISERRANALQNELEESRTLLE 1692
            |:|||:..|||.:|::..|..|||.|..|::..|.:||.:|||.:.|||||.:|.|:||.|..||
  Rat  1633 LNHANRQAAEAIRNLRNTQGMLKDTQLHLDDALRGQDDLKEQLAMVERRANLMQAEIEELRASLE 1697

  Fly  1693 QADRGRRQAEQELADAHEQLNEVSAQNASISAAKRKLESELQTLHSDLDELLNEAKNSEEKAKKA 1757
            |.:|.||.|||||.||.|::..:..||.|:...|:|||:::..:..::::::.||:|:|||||||
  Rat  1698 QTERSRRVAEQELLDASERVQLLHTQNTSLINTKKKLETDISQIQGEMEDIVQEARNAEEKAKKA 1762

  Fly  1758 MVDAARLADELRAEQDHAQTQEKLRKALEQQIKELQVRLDEAEANALKGGKKAIQKLEQRVRELE 1822
            :.|||.:|:||:.|||.:...|:::|.:||.:|:||.||||||..|||||||.|||||.||||||
  Rat  1763 ITDAAMMAEELKKEQDTSAHLERMKKNMEQTVKDLQHRLDEAEQLALKGGKKQIQKLEARVRELE 1827

  Fly  1823 NELDGEQRRHADAQKNLRKSERRVKELSFQSEEDRKNHERMQDLVDKLQQKIKTYKRQIEEAEEI 1887
            ||::.||:|:.:|.|.|||.|||||||::|:||||||..|:||||||||.|:|.||||.|||||.
  Rat  1828 NEVENEQKRNIEAVKGLRKHERRVKELTYQTEEDRKNVLRLQDLVDKLQTKVKAYKRQAEEAEEQ 1892

  Fly  1888 AALNLAKFRKAQQELEEAEERADLAEQAISKFRAKGR 1924
            :.:|||||||.|.||||||||||:||..::|.|.|.|
  Rat  1893 SNVNLAKFRKIQHELEEAEERADIAESQVNKLRVKSR 1929

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain                                              Region         External ID  Identity
Mhc   NP_001162991.1  Myosin_N                                            36..73         CDD:280832   9/36 (25%)
                      MYSc                                                81..777        CDD:214580   442/703 (63%)
                      MYSc_Myh1_insects_crustaceans                       100..765       CDD:276874   422/672 (63%)
                      Myosin_tail_1                                       842..1897      CDD:279860   560/1055 (53%)
                      V_Alix_like                                         1557..1877     CDD:187408   174/319 (55%)
                      ATP-synt_B                                          <1671..1771    CDD:304375   47/99 (47%)
                      TMPIT                                               1739..>1825    CDD:285135   51/85 (60%)
Myh4  NP_062198.1     Myosin_N                                            36..73         CDD:280832   9/36 (25%)
                      Myosin_head                                         88..770        CDD:278492   430/684 (63%)
                      MYSc_Myh4                                           100..770       CDD:276879   422/672 (63%)
                      Actin-binding. /evidence=ECO:0000250                659..681       -            16/21 (76%)
                      Actin-binding. /evidence=ECO:0000250                761..775       -            8/13 (62%)
                      Myosin_tail_1                                       850..1927      CDD:279860   577/1077 (54%)
                      RILP-like                                           <994..1098     CDD:304877   53/103 (51%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   1128..1147     -            14/18 (78%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite   1153..1172     -            17/18 (94%)
                      MIT_CorA-like                                       <1299..>1411   CDD:294313   58/111 (52%)
                      COG6                                                1376..>1511    CDD:303003   80/134 (60%)
                      DUF342                                              <1506..1582    CDD:302792   48/75 (64%)
                      DUF342                                              <1670..1767    CDD:302792   46/96 (48%)
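
The Region values mix plain ranges (`36..73`) with ranges carrying a `<` or `>` prefix (`<1671..1771`, `1739..>1825`). The sketch below parses them in Python, assuming `<` and `>` mark a domain hit that is incomplete at that end, as in NCBI feature-location notation; the page itself does not define the symbols. `Region` and `parse_region` are hypothetical names.

```python
import re
from typing import NamedTuple

class Region(NamedTuple):
    start: int
    end: int
    start_partial: bool  # True when the start carries a '<' (assumed: truncated hit)
    end_partial: bool    # True when the end carries a '>' (assumed: truncated hit)

_REGION = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")

def parse_region(text: str) -> Region:
    """Parse a Region cell such as '36..73' or '<1299..>1411'."""
    match = _REGION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized region: {text!r}")
    lt, start, gt, end = match.groups()
    return Region(int(start), int(end), lt == "<", gt == ">")

for raw in ["36..73", "<1671..1771", "1739..>1825", "<1299..>1411"]:
    region = parse_region(raw)
    span = region.end - region.start + 1  # residues covered, inclusive
    print(raw, span, region.start_partial, region.end_partial)
```

The inclusive span can exceed the denominator in the Identity column (38 residues for `36..73` versus 9/36), so the two numbers should not be treated as interchangeable.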


Information from Original Tools:


                                              Original Tool Information
Tool            Simple Score  Weighted Score  BLAST Result  Score  Score Type        Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           1109          1.000  Domainoid score   I19
eggNOG          1             0.900           -             -                        E2759_KOG0161
Hieranoid       1             1.000           -             -
Homologene      0             0.000           Not matched by this tool.
Inparanoid      1             1.050           2097          1.000  Inparanoid score  I38
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           -             -                        D41153at33208
OrthoFinder     1             1.000           -             -                        FOG0000014
OrthoInspector  1             1.000           -             -                        otm45868
orthoMCL        1             0.900           -             -                        OOG6_100110
Panther         1             1.100           -             -      O                 PTHR45615
Phylome         1             0.910           -             -
SonicParanoid   1             1.000           -             -                        X58
SwiftOrtho      1             1.000           -             -
TreeFam         0             0.000           Not matched by this tool.
Total           12            11.870
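
The totals in the last row (simple score 12, weighted score 11.870) can be reproduced by summing the per-tool columns. The sketch below does this in Python; the values are copied from the table, and `Decimal` is used only to keep the three-decimal formatting exact.

```python
from decimal import Decimal

# (tool, simple score, weighted score), copied from the table.
rows = [
    ("Compara", 0, "0.000"), ("Domainoid", 1, "1.000"), ("eggNOG", 1, "0.900"),
    ("Hieranoid", 1, "1.000"), ("Homologene", 0, "0.000"), ("Inparanoid", 1, "1.050"),
    ("OMA", 0, "0.000"), ("OrthoDB", 1, "1.010"), ("OrthoFinder", 1, "1.000"),
    ("OrthoInspector", 1, "1.000"), ("orthoMCL", 1, "0.900"), ("Panther", 1, "1.100"),
    ("Phylome", 1, "0.910"), ("SonicParanoid", 1, "1.000"), ("SwiftOrtho", 1, "1.000"),
    ("TreeFam", 0, "0.000"),
]

simple_total = sum(simple for _, simple, _ in rows)
weighted_total = sum(Decimal(weighted) for _, _, weighted in rows)

print(simple_total, weighted_total)  # 12 11.870
```

Twelve of the sixteen tools report the Mhc/Myh4 pair; the four rows marked "Not matched by this tool." contribute zero to both totals.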
