DRSC/TRiP Functional Genomics Resources



Protein Alignment Mhc and XIC

DIOPT Version: 10

Sequence 1: NP_523587.4  Gene: Mhc / 35007  FlyBaseID: FBgn0264695  Length: 1962  Species: Drosophila melanogaster
Sequence 2: NP_172349.2  Gene: XIC / 837394  AraportID: AT1G08730  Length: 1538  Species: Arabidopsis thaliana


Alignment Length: 1618  Identity: 455/1618 (28%)
Similarity: 732/1618 (45%)  Gaps: 315/1618 (19%)
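
The percentages in the summary follow directly from the reported counts over the alignment length. The short Python sketch below reproduces them; the helper name `percent` is hypothetical, and rounding to the nearest whole number is an assumption (the page does not say whether it rounds or truncates; for these three values both give the same result).

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage.

    Assumption: values are rounded to the nearest integer.
    """
    return round(100 * count / total)


ALIGNMENT_LENGTH = 1618  # aligned columns, including gap columns

summary = {
    "identity": percent(455, ALIGNMENT_LENGTH),
    "similarity": percent(732, ALIGNMENT_LENGTH),
    "gaps": percent(315, ALIGNMENT_LENGTH),
}
print(summary)
```

Note that the denominator is the alignment length (1618), not the length of either protein (1962 and 1538), so gap columns lower the identity and similarity percentages.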


- Green residues have known domain annotations, which are detailed below.


  Fly    39 WIPDEKEGYLLGEIKATKGDIVSVGLQGGE--TRDLKKDLLQQVNPPKYEKAEDMSNLTYLNDAS 101
            |..|.:..::.||::...|..|.:....|:  |..|.|...:.|..|. ...:||:.|:||::..
plant    22 WFEDPEVAWIDGEVEKINGQEVVIQATTGKKVTAKLSKIYPKDVEAPA-GGVDDMTKLSYLHEPG 85

  Fly   102 VLHNLRQRYYNKLIYTYSGLFCVAINPYKRYP-VYTNRCAKMYRGKRRNEVPPHIFAISDGAYVD 165
            ||.||:.||....||||:|...:||||::|.| :|.....:.|:|....|:.||:||::|.||..
plant    86 VLQNLKIRYELNEIYTYTGNILIAINPFQRLPHIYDAHMMQQYKGAPLGELSPHVFAVADVAYRA 150

  Fly   166 MLTNHVNQSMLITGESGAGKTENTKKVIAYFATVGASKKTDEAAKSKGSLEDQVVQTNPVLEAFG 230
            |:....:.|:|::|||||||||.||.::.|.|.:|....|:..     ::|.||:::||||||||
plant   151 MINEGKSNSILVSGESGAGKTETTKMLMRYLAYLGGRAVTEGR-----TVEQQVLESNPVLEAFG 210

  Fly   231 NAKTVRNDNSSRFGKFIRIHFGPTGKLAGADIETYLLEKARVISQQSLERSYHIFYQIMSGSVPG 295
            |||||||:||||||||:.|.|...|:::||.|.|||||::||......||:||.||.:.:.....
plant   211 NAKTVRNNNSSRFGKFVEIQFDKQGRISGAAIRTYLLERSRVCQISDPERNYHCFYLLCAAPQEE 275

  Fly   296 VKDICLLTDNIYDYHIVSQGK-VTVASIDDAEEFSLTDQAFDILGFTKQEKEDVYRITAAVMHMG 359
            ::...|  .:...:|.::|.| ..:..|.||.::..|.:|.||:|.:::|:|.::|:.||::|:|
plant   276 IEKYKL--GHPKTFHYLNQSKCFELVGISDAHDYLATRRAMDIVGISEKEQEAIFRVVAAILHIG 338

  Fly   360 GMKFKQRGREEQAEQDGEEEG----GRVSKLFGCDTAELYKNLLKPRIKVGNEFVTQGRNVQQVT 420
            .:.| .:|:|..:....:|:.    ...::|..||...|...|.|..:....|.:.:..:.|...
plant   339 NIDF-TKGKEVDSSVPKDEKSKFHLKTAAELLMCDLKALEDALCKRVMITPEEVIKRSLDPQSAV 402

  Fly   421 NSIGALCKGVFDRLFKWLVKKCNETLDTQQKRQHFIGVLDIAGFEIFEYNGFEQLCINFTNEKLQ 485
            .|...|.|.|:.|||.|||.|.|:::......:..||||||.|||.|:.|.|||.||||||||||
plant   403 TSRDGLAKTVYSRLFDWLVDKINKSIGQDANSRSLIGVLDIYGFESFKTNSFEQFCINFTNEKLQ 467

  Fly   486 QFFNHHMFVLEQEEYQREGIEWTFIDFGMDLQLCIDLIE-KPMGILSILEEESMFPKATDQTFSE 549
            |.||.|:|.:|||||.:|.|:|::|:| :|.|..:|||| ||.||:::|:|..||||:|.:||:.
plant   468 QHFNQHVFKMEQEEYTKEAIDWSYIEF-VDNQDVLDLIEKKPGGIVALLDEACMFPKSTHETFAN 531

  Fly   550 KLTNTHLGKSAPFQKPKPPKPGQQAAHFAIAHYAGCVSYNITGWLEKNKDPLNDTVVDQFKKSQN 614
            ||..| ......|.|||..:     ..||:|||||.|.|....:|:||||.:.....|....|:.
plant   532 KLYQT-FKTHKRFIKPKLSR-----TDFAVAHYAGEVLYQSELFLDKNKDYVIPEHQDLLGASKC 590

  Fly   615 KLLIEIFADHAGQSGGGEQAKGGRGKKGGGFATVSSAYKEQLNSLMTTLRSTQPHFVRCIIPNEM 679
            ..::.:|.....::           .|...|:::.|.:|.||..||.||..|:||::||:.||.:
plant   591 PFVVGLFPPLPEET-----------SKSSKFSSIGSRFKLQLQQLMETLNCTEPHYIRCVKPNNL 644

  Fly   680 KQPGVVDAHLVMHQLTCNGVLEGIRICRKGFPNRMMYPDFKMRYKIMCPKLLQGVEKDKKATEII 744
            .:|.:.:...:|.||.|.||||.|||...|:|.|..:.:|..|:.::.|..|:|...:|.|.:.|
plant   645 LKPAIFENVNIMQQLRCGGVLEAIRISCAGYPTRKPFFEFINRFGLLSPAALEGNFDEKVACQKI 709

  Fly   745 IKFIDLPEDQYRLGNTKVFFRAGVLGQMEEFRDERLG-----------------------KIMSW 786
            :..:.|  ..|::|.||||.|||.:.:::..|.|.|.                       |....
plant   710 LDNMGL--KGYQIGKTKVFLRAGQMAELDARRAEVLSSAAKKIQRRIRTHQAQKRFIVLRKATIS 772

  Fly   787 MQAWARGYLSRKGFKKLQEQRVALKVVQRNLRKYLQLRTWPWYKLWQKVKPLLNVSRIEDEIARL 851
            :||..||.||.|.:..|:.:..|:|:.:...|.|                     ||        
plant   773 LQAICRGRLSCKHYDNLRREAAAVKIQKNGRRHY---------------------SR-------- 808

  Fly   852 EEKAKKAEELHAAEVKVRKELEALNAKLLAEKTALLDSLSGEKGALQDYQERNAKLTAQKNDLEN 916
                |..::||.|.:.|:..|.|:.|:.                                     
plant   809 ----KSYKKLHVASLVVQTGLRAMAARK------------------------------------- 832

  Fly   917 QLRDIQERLTQEEDARNQLFQQKKKADQEISGLKKDIEDLELNVQKAEQDKATKDHQIRNLNDEI 981
                 |.|..::..|.. :.|.:.:..:.||..||                      ::|     
plant   833 -----QFRFRKQTKAAT-IVQAQWRCHRAISYYKK----------------------LKN----- 864

  Fly   982 AHQDELINKLNKEKKMQGETNQKTGEELQAAEDKINHLNKVKAKLEQTLDELEDSLEREKKVRGD 1046
                   ..:..:.:.:|...::...:|:.|..:...|.:.|..||:.::||...::.||:.|||
plant   865 -------GVVLSQTRWRGRLAKRELRKLKMAARETGALKEAKDMLEKKVEELTYRVQLEKRSRGD 922

  Fly  1047 VEKSKRKVEGDLKLTQEAV---ADLERNKKELEQT--IQRKDKELSSITAKLEDEQVVVLKHQRQ 1106
            :|::|         |||.:   :..|..:|::::|  :..|::|.:.   |..:|...|:|..:.
plant   923 LEEAK---------TQEILKLKSSFEEMRKKVDETNALLLKEREAAK---KAAEEAPPVIKETQI 975

  Fly  1107 IKELQARIEELEEEVEAERQARAKAEKQRADLA-RELEELGERLEEAGGATSAQIELNKKREAEL 1170
            :.|...:||.:.||:|:.: ...:.||||||.| |:.||..|.||:           .||:..|.
plant   976 LVEDTKKIELMTEELESVK-VTLENEKQRADDAVRKFEEAQESLED-----------KKKKLEET 1028

  Fly  1171 SKLRRDLEEANIQHESTLANLRKKHNDAVAEMAEQVDQLNKL---KAKAEKEKNEYYGQLN-DLR 1231
            .|..:.|:|:..:.|...:|| :..|..:.:.|..:.. ||.   ::::..::....|.|. |.|
plant  1029 EKKGQQLQESLTRMEEKCSNL-ESENKVLRQQAVSMAP-NKFLSGRSRSILQRGSESGHLAVDAR 1091

  Fly  1232 AGVD---HITNEKAAQEKIAKQLQHTLNEVQSKLDET-------------NRTL----------- 1269
            :.:|   |..|.:...| :..:.|.:|||.|.:..:.             ||.:           
plant  1092 SNLDLHSHSINHRDPSE-VEDKPQKSLNEKQQENQDLLIRSIVQHLGFQGNRPITACIIYKCLLQ 1155

  Fly  1270 -NDFDASKKKL----------SIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESR 1323
             ..|:..:..:          :||..|....|....|..|.| .:.:..|.:......:|.:..|
plant  1156 WRSFEVERTSVFDRIIQTIGHAIETQDNNNTLAYWLSNTSTL-LLLLQRTLKASGAAGMAPQRRR 1219

  Fly  1324 -ERATLLGK----FRNLEHDLDNLREQVEEEAEGKADLQRQLSKANAEAQVWRSKYESDGVARSE 1383
             ..|||.|:    ||.....: || ..:...|.|.||..||:          .:||.:  :...:
plant  1220 SSSATLFGRMSQSFRGAPPGV-NL-AMINGAAGGGADTFRQV----------EAKYPA--LLFKQ 1270

  Fly  1384 ELEEAKRKLQARLAE-AEETIESLNQKCIGLEKTKQRLSTEVEDLQLEVDRANAIANAAEKKQKA 1447
            :|.....|:...:.: .::.|..|...||...:|.:        ..|....:.::.|.|     |
plant  1271 QLTAYVEKIYGMIRDNLKKEISPLLGLCIQAPRTSR--------ASLVKGASRSVGNTA-----A 1322

  Fly  1448 FDKIIGEWKLKVDDLAAELDASQKE------CRNYSTELFRL-------------------KGAY 1487
            ...:|..|:..|..|...|:..:..      .|...|::|..                   .|.|
plant  1323 QQALIAHWQGIVKSLTNFLNTLKSNNVPSFLVRKVFTQIFSFINVQLFNSLLLRRECCSFSNGEY 1387

  Fly  1488 -EEGQEQLE-----AVRRENKNLADEVKDLLDQIGEGGRNIHEIEKARKRLEAEKDEL 1539
             :.|..:||     |......:..||:|.:...|  |...:|  :|.:|.|:....:|
plant  1388 VKAGLSELEHWCFKATNEYAGSSWDELKHIRQAI--GFLVVH--QKPKKTLDEISHDL 1441
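
Each block of the alignment can be checked by hand or with a few lines of code. The Python sketch below takes the second block (Fly 102..165, plant 86..150) and counts aligned columns, identical residues, and gap columns, then compares the ungapped lengths with the printed coordinates. The function name `block_stats` is hypothetical, and reading `|` in the middle line as "identical residues" is an assumption based on the usual convention for this layout; it agrees with the column-by-column comparison for this block.

```python
def block_stats(row_a: str, row_b: str, gap: str = "-") -> dict:
    """Count columns, identities, and gaps for two aligned rows."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(row_a, row_b))
    return {
        "columns": len(pairs),
        # identical residues in the same column (a gap never counts)
        "identities": sum(a == b and a != gap for a, b in pairs),
        # columns where either sequence has a gap character
        "gaps": sum(a == gap or b == gap for a, b in pairs),
        # residues actually consumed from each sequence
        "residues_a": sum(a != gap for a in row_a),
        "residues_b": sum(b != gap for b in row_b),
    }


# Second alignment block, copied from the rows above.
FLY = "VLHNLRQRYYNKLIYTYSGLFCVAINPYKRYP-VYTNRCAKMYRGKRRNEVPPHIFAISDGAYVD"
PLANT = "VLQNLKIRYELNEIYTYTGNILIAINPFQRLPHIYDAHMMQQYKGAPLGELSPHVFAVADVAYRA"
MATCH = "||.||:.||....||||:|...:||||::|.| :|.....:.|:|....|:.||:||::|.||.."

stats = block_stats(FLY, PLANT)
print(stats)
```

The ungapped length of each row should equal `end - start + 1` for its printed coordinates, which is a quick way to spot a row that was split or truncated during copying.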

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence     Domain                         Region      External ID  Identity
Mhc   NP_523587.4  Myosin_N                       33..75      CDD:460670   9/37 (24%)
Mhc   NP_523587.4  MYSc_Myh1_insects_crustaceans  100..765    CDD:276874   263/671 (39%)
Mhc   NP_523587.4  Myosin_tail_1                  842..1922   CDD:460256   156/783 (20%)
XIC   NP_172349.2  Myosin_N                       16..60      CDD:460670   9/37 (24%)
XIC   NP_172349.2  MYSc_Myo11                     84..728     CDD:276835   263/671 (39%)
XIC   NP_172349.2  DUF5401                        <747..1058  CDD:375164   89/445 (20%)
XIC   NP_172349.2  MyosinXI_CBD                   1119..1504  CDD:271259   66/355 (19%)
