DRSC/TRiP Functional Genomics Resources


Protein Alignment Mhc and XIA

DIOPT Version: 10

Sequence 1: NP_523587.4  Gene: Mhc / 35007  FlyBaseID: FBgn0264695  Length: 1962  Species: Drosophila melanogaster
Sequence 2: NP_171954.1  Gene: XIA / 839480  AraportID: AT1G04600  Length: 1730  Species: Arabidopsis thaliana


Alignment Length: 1753  Identity: 479/1753 (27%)
Similarity: 823/1753 (46%)  Gaps: 318/1753 (18%)
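
The percentages in the summary follow directly from the counts over the alignment length. The minimal Python sketch below reproduces them; the helper name `percent` is hypothetical and not part of DIOPT. It assumes the summary truncates to a whole percent rather than rounding, because 823/1753 is about 46.95% yet is reported as 46%.

```python
def percent(count: int, length: int, truncate: bool = True) -> int:
    """Return count/length as a whole-number percentage.

    truncate=True drops the fractional part (assumed to match the summary
    line); truncate=False rounds to the nearest whole percent instead.
    """
    if length <= 0:
        raise ValueError("alignment length must be positive")
    if truncate:
        return (100 * count) // length
    return round(100 * count / length)


ALIGNMENT_LENGTH = 1753
# Counts copied from the alignment summary.
stats = {"Identity": 479, "Similarity": 823, "Gaps": 318}

for name, count in stats.items():
    pct = percent(count, ALIGNMENT_LENGTH)
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

Integer arithmetic is used for the truncating case so the result does not depend on floating-point representation.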


- Green residues have known domain annotations that are detailed below.


  Fly    39 WIPDEKEGYLLGEIKATKGDIVSVGLQGGETRDLKKDLLQQVN-----PPKYEK--AEDMSNLTY 96
            |:.|..:.::.||::....:.::|...|       |.::.::|     .|::.:  .:||:.|.|
plant    14 WVEDPDDAWIDGEVEEVNSEEITVNCSG-------KTVVAKLNNVYPKDPEFPELGVDDMTKLAY 71

  Fly    97 LNDASVLHNLRQRYYNKLIYTYSGLFCVAINPYKRYP-VYTNRCAKMYRGKRRNEVPPHIFAISD 160
            |::..||.||:.||....||||:|...:|:||:||.| :|.:...|.|:|....|:.||.||::|
plant    72 LHEPGVLLNLKCRYNANEIYTYTGNILIAVNPFKRLPHLYGSETMKQYKGTAFGELSPHPFAVAD 136

  Fly   161 GAYVDMLTNHVNQSMLITGESGAGKTENTKKVIAYFATVGASKKTDEAAKSKG-SLEDQVVQTNP 224
            .||..|:...|:|::|::|||||||||:||.::.|.|.:|.      .|:|:| |:|.||:::||
plant   137 SAYRKMINEGVSQAILVSGESGAGKTESTKMLMQYLAYMGG------RAESEGRSVEQQVLESNP 195

  Fly   225 VLEAFGNAKTVRNDNSSRFGKFIRIHFGPTGKLAGADIETYLLEKARVISQQSLERSYHIFYQIM 289
            |||||||||||||:||||||||:.|.|...|:::||.|.|||||::||......||:||.||.:.
plant   196 VLEAFGNAKTVRNNNSSRFGKFVEIQFDQRGRISGAAIRTYLLERSRVCQVSDPERNYHCFYMLC 260

  Fly   290 SGSVPGVKDICLLTDNIYDYHIVSQGK-VTVASIDDAEEFSLTDQAFDILGFTKQEKEDVYRITA 353
            :......:...|...:.:.|  ::|.. ..:..:||::|:..|.:|.|::|...:|::.::|:.|
plant   261 AAPEQETERYKLGKPSTFRY--LNQSNCYALDGLDDSKEYLATRKAMDVVGINSEEQDGIFRVVA 323

  Fly   354 AVMHMGGMKFKQRGREEQAEQDGEEEG----GRVSKLFGCDTAELYKNLLKPRIKVGNEFVTQGR 414
            |::|:|.::| .:|.|.:|.:..:|:.    ...::||.||...|..:|.|..:...:|.:|:..
plant   324 AILHLGNIEF-AKGEESEASEPKDEKSRFHLKVAAELFMCDGKALEDSLCKRVMVTRDESITKSL 387

  Fly   415 NVQQVTNSIGALCKGVFDRLFKWLVKKCNETLDTQQKRQHFIGVLDIAGFEIFEYNGFEQLCINF 479
            :.........||.|.|:.:||.|||.|.|.::......:|.||||||.|||.|:.|.|||.|||.
plant   388 DPDSAALGRDALAKIVYSKLFDWLVTKINNSIGQDPNSKHIIGVLDIYGFESFKTNSFEQFCINL 452

  Fly   480 TNEKLQQFFNHHMFVLEQEEYQREGIEWTFIDFGMDLQLCIDLIE-KPMGILSILEEESMFPKAT 543
            |||||||.||.|:|.:|||||.:|.|:|::|:| :|.|..:|||| ||.||:::|:|..|||::|
plant   453 TNEKLQQHFNQHVFKMEQEEYTKEEIDWSYIEF-IDNQDVLDLIEKKPGGIIALLDEACMFPRST 516

  Fly   544 DQTFSEKLTNTHLGKSAPFQKPKPPKPGQQAAHFAIAHYAGCVSYNITGWLEKNKDPLNDTVVDQ 608
            ..||::||..|...... |.|||..:     ..|.|.||||.|:|....:|:||||.:.......
plant   517 HDTFAQKLYQTFKNHKR-FGKPKLAQ-----TDFTICHYAGDVTYQTELFLDKNKDYVVGEHQAL 575

  Fly   609 FKKSQNKLLIEIFADHAGQSGGGEQAKGGRGKKGGGFATVSSAYKEQLNSLMTTLRSTQPHFVRC 673
            ...|....:..:|.....:|           .|...|:::.|.:|:||.||:.:|.:|:||::||
plant   576 LSSSDCSFVSSLFPPLPEES-----------SKTSKFSSIGSQFKQQLQSLLESLSTTEPHYIRC 629

  Fly   674 IIPNEMKQPGVVDAHLVMHQLTCNGVLEGIRICRKGFPNRMMYPDFKMRYKIMCPKLLQGVEKDK 738
            :.||.:.:|.:.:...::|||.|.||:|.|||...|:|.|..:.:|..|::|:.|:..:....:.
plant   630 VKPNNLLKPDIFENINILHQLRCGGVMEAIRISCAGYPTRKPFNEFLTRFRILAPETTKSSYDEV 694

  Fly   739 KATEIIIKFIDLPEDQYRLGNTKVFFRAGVLGQMEEFRDERLGKIMSWMQAWARGYLSRKGFKKL 803
            .|.:.::..:||  ..:::|.||||.|||.:.:|:..|.|.||.....:|.....|.|||.|..|
plant   695 DACKKLLAKVDL--KGFQIGKTKVFLRAGQMAEMDAHRAEVLGHSARIIQRNVLTYQSRKKFLLL 757

  Fly   804 QEQRVALKVVQRNLRKYLQLRTWPWYKLWQKVKPLLNVSRIEDEIARLEEKAKKAEELHAAEVKV 868
            |.....::.:.|.                       .|:|:..|..|.|          ||.:::
plant   758 QAASTEIQALCRG-----------------------QVARVWFETMRRE----------AASLRI 789

  Fly   869 RKELEALNAKLLAEKTALLDSLSGEKGALQDYQERNAKLTAQKNDLENQLRDIQERLT---QEED 930
            :|:......: .|.||....:.|.:.|           :.|:...:|.|||. :.|.|   |.:.
plant   790 QKQARTYICQ-NAYKTLCSSACSIQTG-----------MRAKAARIELQLRK-KRRATIIIQSQI 841

  Fly   931 AR---NQLFQQKKKADQEISGLKKDIEDLELNVQKAEQDKATKDHQIRNLNDEIAHQDELINKLN 992
            .|   :|.:.:.|||              .:..|...:.|..: .::|||               
plant   842 RRCLCHQRYVRTKKA--------------AITTQCGWRVKVAR-RELRNL--------------- 876

  Fly   993 KEKKMQGETNQKTGEELQAAEDKINHLNKVKAKLEQTLDELEDSLEREKKVRGDVEKSKRKVEGD 1057
               ||..   ::||.           |...|.|||..::||..:||.||::|.::|::|.:   :
plant   877 ---KMAA---KETGA-----------LQDAKTKLENQVEELTSNLELEKQMRMEIEEAKSQ---E 921

  Fly  1058 LKLTQEAVADLERNKKELEQTIQRKDKELSSITAKLEDEQVVVLKHQ----RQIKELQARIEELE 1118
            ::..|..:.|:   |.:|..|.:.|.||:|.:.:.|.|.::.:...|    ::|.:||:.:::::
plant   922 IEALQSVLTDI---KLQLRDTQETKSKEISDLQSVLTDIKLQLRDTQETKSKEISDLQSALQDMQ 983

  Fly  1119 EEVEAERQARAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAELSKLRRDLEEANIQ 1183
            .|:|    ..:|..:...|||.|    .|:|:|:..:...:|:.::::..|:||    :.|..|:
plant   984 LEIE----ELSKGLEMTNDLAAE----NEQLKESVSSLQNKIDESERKYEEISK----ISEERIK 1036

  Fly  1184 HESTLANLRKKHNDAVAEMAEQVDQLNKLKAKAEKEKNEYYGQLNDLRAGVDHITNEKAAQEKIA 1248
            .|..:                 :||...:|.:.|.:|         |:|              :.
plant  1037 DEVPV-----------------IDQSAIIKLETENQK---------LKA--------------LV 1061

  Fly  1249 KQLQHTLNEVQSKLDETNRTLNDFDASKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLED 1313
            ..::..::|:..|.|||:..:.  :..|:.:|.: .:::..| |||::  :|..:..||..::.:
plant  1062 SSMEEKIDELDRKHDETSPNIT--EKLKEDVSFD-YEIVSNL-EAENE--RLKALVGSLEKKINE 1120

  Fly  1314 TKRLADEESRERATLLGKFRNLEHD--LDNLREQVEEEAEGKADLQRQLSKANAEAQVWRSKYES 1376
            :...:.:|..|...:| |..:|..|  :||  |:|::.|:...||...:|....:......||  
plant  1121 SGNNSTDEQEEGKYIL-KEESLTEDASIDN--ERVKKLADENKDLNDLVSSLEKKIDETEKKY-- 1180

  Fly  1377 DGVARSEELEEAKRKLQARLAEAEETIESLNQKCIGLEKTKQRLSTEVEDLQL--EVDRANAIAN 1439
                     |||.|..:.||.:|.:....|    |.|:.:.|||..:|.|::.  ::.|..|:.|
plant  1181 ---------EEASRLCEERLKQALDAETGL----IDLKTSMQRLEEKVSDMETAEQIRRQQALVN 1232

  Fly  1440 AAEKKQKAFDKIIGEWKLKVDDLAAELDASQKE------CRNYSTELFRLKGAYEEGQEQLEAVR 1498
            :|.::........|         |..|:...:|      .|.:.||.||......:..|.::.:.
plant  1233 SASRRMSPQVSFTG---------APPLENGHQEPLAPIPSRRFGTESFRRSRIERQPHEFVDVLL 1288

  Fly  1499 R-ENKNLADEVKDLLDQIGEG----GRNIHEIEKARKRLEAEKDEL--------QAALEEAEAAL 1550
            : .:||:.         ...|    ...|::.....|..||||..:        .:|:|.     
plant  1289 KCVSKNIG---------FSHGKPVAALTIYKCLMRWKIFEAEKTSIFDRIVPVFGSAIEN----- 1339

  Fly  1551 EQEENKVLRAQLELSQVRQEIDRRIQEKEEEFENTRKNHQ------------RALDSMQASLEA- 1602
            ::::|.:.......|.:...:.|.::::.....:..|..|            |:..|...|.:. 
plant  1340 QEDDNHLAYWLTNTSTLLFLLQRSLRQQSSTGSSPTKPPQPTSFFGRMTQGFRSTSSPNLSTDVV 1404

  Fly  1603 -EAKGKAEALRMKKKLEADINELEIALDHANKANAEAQKNIKRYQ--------QQLKDIQTALEE 1658
             :...:..||..|::|.|.:..:...:          ::|:||..        |.||:.......
plant  1405 QQVDARYPALLFKQQLTAYVETMYGII----------RENVKREVSSLLSSCIQSLKESSCDSSV 1459

  Fly  1659 EQRARDDAREQLGISERRANALQNELEESRTLLEQADRGRRQAEQELADAHEQLNEVSAQNAS 1721
            .......:.|.|.......|:.:...||:.......|:    :.|:|:|.:....|..|..:|
plant  1460 VNSPSKSSEENLPAKSSEENSPKKSSEENSPKESSGDK----SPQKLSDDNSPSKEGQAVKSS 1518

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence     Domain                         Region       External ID  Identity
Mhc   NP_523587.4  Myosin_N                       33..75       CDD:460670   6/35 (17%)
                   MYSc_Myh1_insects_crustaceans  100..765     CDD:276874   259/672 (39%)
                   Myosin_tail_1                  842..1922    CDD:460256   188/935 (20%)
XIA   NP_171954.1  Myosin_N                       8..51        CDD:460670   8/43 (19%)
                   MYSc_Myo11                     75..719      CDD:276835   259/672 (39%)
                   PRK02224                       <865..>1229  CDD:179385   107/478 (22%)
                   MyosinXI_CBD                   1278..1699   CDD:271259   43/269 (16%)
