DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Mhc and zip

DIOPT Version :10

Sequence 1:NP_523587.4 Gene:Mhc / 35007 FlyBaseID:FBgn0264695 Length:1962 Species:Drosophila melanogaster
Sequence 2:NP_523860.2 Gene:zip / 38001 FlyBaseID:FBgn0287873 Length:2056 Species:Drosophila melanogaster


Alignment Length:1994 Identity:757/1994 - (37%)
Similarity:1198/1994 - (60%) Gaps:82/1994 - (4%)


- Green bases have known domain annotations that are detailed below.


  Fly     5 VANQEDEDPTPYLFVSLEQRRIDQ---SKPYDSKKSCWIPDEKEGYLLGEIKATKGDIVSVGL-Q 65
            ::.:.|.:.....::|:|:.:.:.   ...:..|:..|:|.|.:|::...||...||.|.|.| :
  Fly    46 MSEEVDRNDPELKYLSVERNQFNDPATQAEWTQKRLVWVPHENQGFVAASIKREHGDEVEVELAE 110

  Fly    66 GGETRDLKKDLLQQVNPPKYEKAEDMSNLTYLNDASVLHNLRQRYYNKLIYTYSGLFCVAINPYK 130
            .|:...:.:|.:|::||||::|.|||:.||.||:||||||::.|||:.|||||||||||.:||||
  Fly   111 TGKRVMILRDDIQKMNPPKFDKVEDMAELTCLNEASVLHNIKDRYYSGLIYTYSGLFCVVVNPYK 175

  Fly   131 RYPVYTNRCAKMYRGKRRNEVPPHIFAISDGAYVDMLTNHVNQSMLITGESGAGKTENTKKVIAY 195
            :.|:||.:..:.|:|.:|:|||||:|||:|.||.:||.:..:||:|.||||||||||||||||.:
  Fly   176 KLPIYTEKIMERYKGIKRHEVPPHVFAITDSAYRNMLGDREDQSILCTGESGAGKTENTKKVIQF 240

  Fly   196 FATVGASK------------------KTDEAAKSK---------------------------GSL 215
            .|.|.|||                  .|::..|.|                           |.|
  Fly   241 LAYVAASKPKGSGAVPHPAVLINFSVNTNKYIKVKIMAQNQNQTIEVVNGLKMVEVNSNCQEGEL 305

  Fly   216 EDQVVQTNPVLEAFGNAKTVRNDNSSRFGKFIRIHFGPTGKLAGADIETYLLEKARVISQQSLER 280
            |.|::|.||:||||||||||:|||||||||||||:|..:|.::||:||||||||:|.|.|...||
  Fly   306 EQQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDASGFISGANIETYLLEKSRAIRQAKDER 370

  Fly   281 SYHIFYQIMSGSVPGVKDICLLTDNIYDYHIVSQGKVTVASIDDAEEFSLTDQAFDILGFTKQEK 345
            ::|||||:::|:.|..::..:| |::..|..:|.|.:.|..:||..||..|.::.:|:|.|.::.
  Fly   371 TFHIFYQLLAGATPEQREKFIL-DDVKSYAFLSNGSLPVPGVDDYAEFQATVKSMNIMGMTSEDF 434

  Fly   346 EDVYRITAAVMHMGGMKFKQRGREEQAEQDGEEEGGRVSKLFGCDTAELYKNLLKPRIKVGNEFV 410
            ..::||.:||:..|.|||:|....:||.........:::.|.|....::.:..|.||||||.:||
  Fly   435 NSIFRIVSAVLLFGSMKFRQERNNDQATLPDNTVAQKIAHLLGLSVTDMTRAFLTPRIKVGRDFV 499

  Fly   411 TQGRNVQQVTNSIGALCKGVFDRLFKWLVKKCNETLD-TQQKRQHFIGVLDIAGFEIFEYNGFEQ 474
            |:.:..:||..::.|:.|..::|:|||||.:.|.:|| |:::...|||:||:|||||||.|.|||
  Fly   500 TKAQTKEQVEFAVEAIAKACYERMFKWLVNRINRSLDRTKRQGASFIGILDMAGFEIFELNSFEQ 564

  Fly   475 LCINFTNEKLQQFFNHHMFVLEQEEYQREGIEWTFIDFGMDLQLCIDLIEKPMGILSILEEESMF 539
            ||||:|||||||.|||.||:|||||||||||||.|||||:|||..||||:||.||:::|:||..|
  Fly   565 LCINYTNEKLQQLFNHTMFILEQEEYQREGIEWKFIDFGLDLQPTIDLIDKPGGIMALLDEECWF 629

  Fly   540 PKATDQTFSEKLTNTHLGKSAPFQKPKPPKPG-QQAAHFAIAHYAGCVSYNITGWLEKNKDPLND 603
            |||||:||.:||.:.|      ...||..|.. :..|.|||.||||.|.|:...||.||.||||:
  Fly   630 PKATDKTFVDKLVSAH------SMHPKFMKTDFRGVADFAIVHYAGRVDYSAAKWLMKNMDPLNE 688

  Fly   604 TVVDQFKKSQNKLLIEIFADHAGQSGGGEQAK-----GGRGKKGGGFATVSSAYKEQLNSLMTTL 663
            .:|...:.||:..::.|:.| |...|..:||.     |.|.:| |.|.|||..|||||..||.||
  Fly   689 NIVSLLQGSQDPFVVNIWKD-AEIVGMAQQALTDTQFGARTRK-GMFRTVSHLYKEQLAKLMDTL 751

  Fly   664 RSTQPHFVRCIIPNEMKQPGVVDAHLVMHQLTCNGVLEGIRICRKGFPNRMMYPDFKMRYKIMCP 728
            |:|.|:||||||||..|:.|.:||.||:.||.||||||||||||:|||||:.:.:|:.||:::.|
  Fly   752 RNTNPNFVRCIIPNHEKRAGKIDAPLVLDQLRCNGVLEGIRICRQGFPNRIPFQEFRQRYELLTP 816

  Fly   729 KLL-QGVEKDKKATEIIIKFIDLPEDQYRLGNTKVFFRAGVLGQMEEFRDERLGKIMSWMQAWAR 792
            .:: :|....|||.|.:|:.::|..:.||:|.:|:|||||||..:||.||.::..::...||:.|
  Fly   817 NVIPKGFMDGKKACEKMIQALELDSNLYRVGQSKIFFRAGVLAHLEEERDFKISDLIVNFQAFCR 881

  Fly   793 GYLSRKGFKKLQEQRVALKVVQRNLRKYLQLRTWPWYKLWQKVKPLLNVSRIEDEIARLEEKAKK 857
            |:|:|:.::|..:|..|::::|||...||:||.|.|::|:.||||||.|::.|:::.:.|::.|:
  Fly   882 GFLARRNYQKRLQQLNAIRIIQRNCAAYLKLRNWQWWRLYTKVKPLLEVTKQEEKLVQKEDELKQ 946

  Fly   858 AEELHAAEVKVRKELEALNAKLLAEKTALLDSLSGEKGALQDYQERNAKLTAQKNDLENQLRDIQ 922
            ..|......|..:|.|....:.|.|||.|.:.|..|.....:.:|..::|.|:|.:||:.:::::
  Fly   947 VREKLDTLAKNTQEYERKYQQALVEKTTLAEQLQAEIELCAEAEESRSRLMARKQELEDMMQELE 1011

  Fly   923 ERLTQEEDARNQLFQQKKKADQEISGLKKDIEDLELNVQKAEQDKATKDHQIRNLNDEIAHQDEL 987
            .|:.:||:....|..:|||.:..|..|::.:|:.|...||.:.:|...|.:|:...:::|..|:.
  Fly  1012 TRIEEEEERVLALGGEKKKLELNIQDLEEQLEEEEAARQKLQLEKVQLDAKIKKYEEDLALTDDQ 1076

  Fly   988 INKLNKEKKMQGETNQKTGEELQAAEDKINHLNKVKAKLEQTLDELEDSLEREKKVRGDVEKSKR 1052
            ..||.||||:..|......:.|...|:|..||.|:|||.|.|:.|||:.|.::::.|.:.::|||
  Fly  1077 NQKLLKEKKLLEERANDLSQTLAEEEEKAKHLAKLKAKHEATISELEERLHKDQQQRQESDRSKR 1141

  Fly  1053 KVEGDLKLTQEAVADLERNKKELEQTIQRKDKELSSITAKLEDEQVVVLKHQRQIKELQARIEEL 1117
            |:|.::...:|.:.:......|::..:.::::||:....::::|.......|:..:||::::.|:
  Fly  1142 KIETEVADLKEQLNERRVQVDEMQAQLAKREEELTQTLLRIDEESATKATAQKAQRELESQLAEI 1206

  Fly  1118 EEEVEAERQARAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAELSKLRRDLEEANI 1182
            :|::|||:.|||||||.|.||:.|||.|...|.::...|:||.||..|||.||:.|::.|||..:
  Fly  1207 QEDLEAEKAARAKAEKVRRDLSEELEALKNELLDSLDTTAAQQELRSKREQELATLKKSLEEETV 1271

  Fly  1183 QHESTLANLRKKHNDAVAEMAEQVDQLNKLKAKAEKEKNEYYGQLNDLRAGVDHITNEKAAQEKI 1247
            .||..||::|.||:..:..:.:|::.|.|.|...||.|.....:..||...:..:.:.:...::.
  Fly  1272 NHEGVLADMRHKHSQELNSINDQLENLRKAKTVLEKAKGTLEAENADLATELRSVNSSRQENDRR 1336

  Fly  1248 AKQLQHTLNEVQSKLDETNRTLNDFDASKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLE 1312
            .||.:..:.|:|.||.|..|..::......||..|..::..||||||.:.|...|...::.:||.
  Fly  1337 RKQAESQIAELQVKLAEIERARSELQEKCTKLQQEAENITNQLEEAELKASAAVKSASNMESQLT 1401

  Fly  1313 DTKRLADEESRERATLLGKFRNLEHDLDNLREQVEEEAEGKADLQRQLSKANAEAQVWRSKYESD 1377
            :.::|.:||:|::..|..|.|.:|.:.:.|:||:||:.|.|.:.:|:|::...:.|..:.|.|.|
  Fly  1402 EAQQLLEEETRQKLGLSSKLRQIESEKEALQEQLEEDDEAKRNYERKLAEVTTQMQEIKKKAEED 1466

  Fly  1378 GVARSEELEEAKRKLQARLAEAEETIESLNQKCIGLEKTKQRLSTEVEDLQLEVDRANAIANAAE 1442
             ...::||||.|::|...:...|..::.|..:...|:|:|:::.:|:||..:|::.........|
  Fly  1467 -ADLAKELEEGKKRLNKDIEALERQVKELIAQNDRLDKSKKKIQSELEDATIELEAQRTKVLELE 1530

  Fly  1443 KKQKAFDKIIGEWKLKVDDLAAELDASQKECRNYSTELFRLKGAYEEGQEQLEAVRRENKNLADE 1507
            ||||.||||:.|.|...:.:|.|.|.:::|.|...|::..:....:|..:::|.:..:.|.|.:|
  Fly  1531 KKQKNFDKILAEEKAISEQIAQERDTAEREAREKETKVLSVSRELDEAFDKIEDLENKRKTLQNE 1595

  Fly  1508 VKDLLDQIGEGGRNIHEIEKARKRLEAEKDELQAALEEAEAALEQEENKVLRAQLELSQVRQEID 1572
            :.||.:..|...:|:||:|||::.||::..||:|..||.|..|:..|:..||.::.:..:|.:.:
  Fly  1596 LDDLANTQGTADKNVHELEKAKRALESQLAELKAQNEELEDDLQLTEDAKLRLEVNMQALRSQFE 1660

  Fly  1573 RRIQEKEEEFENTRKNHQRALDSMQASLEAEAKGKAEALRMKKKLEADINELEIALDHANKANAE 1637
            |.:..|||..|..|:...:.|..::..|:.|.|.:..|:..|||||.|:.|:|..::..||...:
  Fly  1661 RDLLAKEEGAEEKRRGLVKQLRDLETELDEERKQRTAAVASKKKLEGDLKEIETTMEMHNKVKED 1725

  Fly  1638 AQKNIKRYQQQLKDIQTALEEEQRARDDAREQLGISERRANALQNELEESRTLLEQADRGRRQAE 1702
            |.|:.|:.|.|:||.....||.:.|:::.:.....:||:..||:.|:.:....|..::|.||.||
  Fly  1726 ALKHAKKLQAQVKDALRDAEEAKAAKEELQALSKEAERKVKALEAEVLQLTEDLASSERARRAAE 1790

  Fly  1703 QELADAHEQLNEVSAQNASISA----AKRKLESELQTLHSDLDELLNEAKNSE---EKAKKAMVD 1760
            .|    .::|.|..|.||:..:    .||:||:.:.||..:|:|   |..|||   ::::||.:.
  Fly  1791 TE----RDELAEEIANNANKGSLMIDEKRRLEARIATLEEELEE---EQSNSEVLLDRSRKAQLQ 1848

  Fly  1761 AARLADELRAEQDHAQTQEKLRKALEQQIKELQVRLDEAEANALKGGKKAIQKLEQRVRELENEL 1825
            ..:|..||..|:.::|..|..|..||:|.|||:.:|.|.|.......|..|..||.::..||.:|
  Fly  1849 IEQLTTELANEKSNSQKNENGRALLERQNKELKAKLAEIETAQRTKVKATIATLEAKIANLEEQL 1913

  Fly  1826 DGEQRRHADAQKNLRKSERRVKELSFQSEEDRKNHERMQDLVDKLQQKIKTYKRQIEEAEEIAAL 1890
            :.|.:.....||..||.::::|||:...|::|::.::.::.:|||..:||..||.::|.||....
  Fly  1914 ENEGKERLLQQKANRKMDKKIKELTMNIEDERRHVDQHKEQMDKLNSRIKLLKRNLDETEEELQK 1978

  Fly  1891 NLAKFRKAQQELEEAEERADLAEQAISKFRAK-GRAGSVGRGAS 1933
            ...:.||.|:|.|:..|..:...:.|:..:.| .|.|.:|..:|
  Fly  1979 EKTQKRKYQRECEDMIESQEAMNREINSLKTKLRRTGGIGLSSS 2022

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
MhcNP_523587.4 Myosin_N 33..75 CDD:460670 13/42 (31%)
MYSc_Myh1_insects_crustaceans 100..765 CDD:276874 349/717 (49%)
Myosin_tail_1 842..1922 CDD:460256 337/1086 (31%)
zipNP_523860.2 Myosin_N 79..122 CDD:460670 14/42 (33%)
MYSc_Myh2_insects_mollusks 145..854 CDD:276876 349/717 (49%)
Myosin_tail_1 931..2011 CDD:460256 337/1087 (31%)

Return to query results.
Submit another query.