DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Mhc and CGNL1

DIOPT Version :10

Sequence 1:NP_523587.4 Gene:Mhc / 35007 FlyBaseID:FBgn0264695 Length:1962 Species:Drosophila melanogaster
Sequence 2:XP_005254783.1 Gene:CGNL1 / 84952 HGNCID:25931 Length:1303 Species:Homo sapiens


Alignment Length:1460 Identity:320/1460 - (21%)
Similarity:514/1460 - (35%) Gaps:514/1460 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly   549 EKLTNTHLGKSAPFQKPKPPKPGQQAAHFAIAHYAGCVSYNITGWLEKNKDPLNDTVVDQFKKSQ 613
            :..|...:|:.|.|...:|     ..||...||                         .:.||::
Human   238 QSCTKERVGEEALFTSGRP-----LTAHSPHAH-------------------------PETKKTR 272

  Fly   614 NKLLIEIFADHAGQSGGGEQAKGGRGKKGGGFATVSSAYKEQLNSLMTTLRSTQP---------- 668
            ..:|     ....|...|....|.|.::....:|..::    .|||...|...|.          
Human   273 PDVL-----PFRRQDSAGPVLDGARSRRSSSSSTTPTS----ANSLYRFLLDDQECAIHADNVNR 328

  Fly   669 HFVRCIIPNEMKQPGV---VDAHLVMHQLTCNGVLEGI--------------RICRKGFPNRMMY 716
            |..|..||   ..||.   :|          .|.:.|:              |..|.|..||:..
Human   329 HENRRYIP---FLPGTGRDID----------TGSIPGVDQLIEKFDQKPGLQRRGRSGKRNRINT 380

  Fly   717 PDFKMRYKI--MCPKLLQGVEKDKKATEIIIKFIDLPEDQYRLG-NTKVFFRAGVLGQMEEFRDE 778
            .|.|....:  ..|..|||      .:|.:|:|      ...|| :::...|...:........|
Human   381 DDRKRSRSVDSAFPFGLQG------NSEYLIEF------SRNLGKSSEHLLRPSQVCPQRPLSQE 433

  Fly   779 RLGKIMSWMQAWARGYLSRKG--------FKKLQEQRVALKVVQRNLRKYLQLRTWPWYKLWQKV 835
            |.||     |:..|.:...:|        ..:..:..:..||::..                   
Human   434 RRGK-----QSVGRTFAKLQGAAHGASCAHSRPPQPNIDGKVLETE------------------- 474

  Fly   836 KPLLNVSRIEDEIAR---LEEKAKKAEELHAAEVKVRKELEAL------NAKLLAEKT------- 884
                  ...|..:.|   |..::||.||:..|...:..:..|.      .||.::.||       
Human   475 ------GSQESTVIRAPSLGAQSKKEEEVKTATATLMLQNRATATSPDSGAKKISVKTFPSASNT 533

  Fly   885 -ALLDSLSGEKGALQDYQERNAKLTAQKNDLENQLRDIQERLTQEEDAR----NQLFQQ----KK 940
             |..|.|.|::...|...|..|     |..|.|.|:   |..|..:||.    |.:|::    |.
Human   534 QATPDLLKGQQELTQQTNEETA-----KQILYNYLK---EGSTDNDDATKRKVNLVFEKIQTLKS 590

  Fly   941 KADQEISGLKKDIEDLELNVQKAEQDKATKD--HQIRNLNDEIAHQDELINKLNKEKKMQGETNQ 1003
            :|.....|          |.|........||  .|...|..|:|   ||..:|..|.|.|    |
Human   591 RAAGSAQG----------NNQACNSTSEVKDLLEQKSKLTIEVA---ELQRQLQLEVKNQ----Q 638

  Fly  1004 KTGEELQAAEDKINHLNKVKAKLEQTLDELEDSLEREKKVRGDVEKSKRKVEGDLKLTQEAVADL 1068
            ...||.:          :::|.||:...:..:.:|....::..:|:|    ||:|          
Human   639 NIKEERE----------RMRANLEELRSQHNEKVEENSTLQQRLEES----EGEL---------- 679

  Fly  1069 ERNKKELEQTIQRKDKELSSITAKLEDEQVVVLKHQRQIKELQARIEELEEEVEAERQARAKAEK 1133
               :|.||:..|          .|:|.||     ||.:|::||.::.|:.:|:::   |:...::
Human   680 ---RKNLEELFQ----------VKMEREQ-----HQTEIRDLQDQLSEMHDELDS---AKRSEDR 723

  Fly  1134 QRADLARELEELGERLEEAGGATSAQIELNKKREAELSKLRRDLEEANIQHESTLANLRKKHNDA 1198
            ::..|..||.:..:.|::...|...|.:|.:|||.||:.|:..|:|....|:             
Human   724 EKGALIEELLQAKQDLQDLLIAKEEQEDLLRKRERELTALKGALKEEVSSHD------------- 775

  Fly  1199 VAEMAEQVDQLNKLKAKAEKEKNEYYGQLNDLRAGVDHITN--EKAAQEKIAKQLQHTLNEVQSK 1261
                 :::|:|          |.:|..:|..||..|:..|.  |..|......:......|::.|
Human   776 -----QEMDKL----------KEQYDAELQALRESVEEATKNVEVLASRSNTSEQDQAGTEMRVK 825

  Fly  1262 LDETNRTLNDFDASKKKLSIENSDLLRQLEEAESQVSQLSKIKISLTTQLEDTKRLADEESRERA 1326
            |                |..||..|..:.||.|.:|:||.:       |:||.|   .:|::.:.
Human   826 L----------------LQEENEKLQGRSEELERRVAQLQR-------QIEDLK---GDEAKAKE 864

  Fly  1327 TLL---GKFRNLEHDLDNLREQVEEEAEGKADLQRQLSKANAEAQVWRSKYESDGVARSEELEEA 1388
            ||.   |:.|.||..|.:.|::.:|....:..|:.:|..|....              |:..:|.
Human   865 TLKKYEGEIRQLEEALVHARKEEKEAVSARRALENELEAAQGNL--------------SQTTQEQ 915

  Fly  1389 KRKLQARLAEAEETIESLNQKCIGLEKTKQRLSTEVEDLQLEVDRANAIANAAEKKQKAFDKIIG 1453
            | :|..:|.|..|..|.|           :||..|:|:.:..      :....||.||       
Human   916 K-QLSEKLKEESEQKEQL-----------RRLKNEMENERWH------LGKTIEKLQK------- 955

  Fly  1454 EWKLKVDDLAAELDASQKECRNYSTELFRLKGAYEEGQEQLEAVRRENKNLADEVKDLLDQIGEG 1518
                   ::|..::||    |..:.||          |.||:..:.:|:.          ::.|.
Human   956 -------EMADIVEAS----RTSTLEL----------QNQLDEYKEKNRR----------ELAEM 989

  Fly  1519 GRNIHEIEKARKRLEAEKDELQAALEEAEAALEQEENKVLRAQLELSQVRQEIDRRIQEKEEEFE 1583
            .|.:.|     |.|||||..|.|...:.|..|.:||                             
Human   990 QRQLKE-----KTLEAEKSRLTAMKMQDEMRLMEEE----------------------------- 1020

  Fly  1584 NTRKNHQRALDSMQASLEAEAKGKAEALRMKKKLEADINELEIALDHANKANAEAQKNIKRYQQQ 1648
              .:::|||.|              |||..::.||                            |.
Human  1021 --LRDYQRAQD--------------EALTKRQLLE----------------------------QT 1041

  Fly  1649 LKDIQTALEEEQRARDDAREQLGISERRANALQNELEESRT----LLEQADRGRRQAEQELADAH 1709
            |||::..||.:...:||....:...|.:.:.|:.||||.|.    |.|:..|.|.|.||      
Human  1042 LKDLEYELEAKSHLKDDRSRLVKQMEDKVSQLEMELEEERNNSDLLSERISRSREQMEQ------ 1100

  Fly  1710 EQLNEVSAQNASISAAKRKLESELQTLHSDLDELLNEAKNSEEKAKKAMVDAARLADELRAEQDH 1774
                                                                  |.:||..|:..
Human  1101 ------------------------------------------------------LRNELLQERAA 1111

  Fly  1775 AQTQEKLRKALEQQIKELQVRLDEAEANALKGGKKAIQKLEQRVRELENELDGEQRRHADAQKNL 1839
            .|..|..:.:||:|.|:|:.|:...|.:.....:..:.::|.|:.|||:.|:.|:|..|:.|.:.
Human  1112 RQDLECDKISLERQNKDLKSRIIHLEGSYRSSKEGLVVQMEARIAELEDRLESEERDRANLQLSN 1176

  Fly  1840 RKSERRVKELSFQSEEDRKNHERMQDLVDKLQQKIKTYKRQIEEA-EEIAALNLAKFRKAQQELE 1903
            |:.||:||||..|.:::   |..:.|..|:|..::|..|||:||| |||..|..:| :|.|:|| 
Human  1177 RRLERKVKELVMQVDDE---HLSLTDQKDQLSLRLKAMKRQVEEAEEEIDRLESSK-KKLQREL- 1236

  Fly  1904 EAEERADLAEQAISKFRAKGRAGSVGRGAS 1933
              ||:.|:.|      ..:|:..|:.:..|
Human  1237 --EEQMDMNE------HLQGQLNSMKKDLS 1258

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
MhcNP_523587.4 Myosin_N 33..75 CDD:460670
MYSc_Myh1_insects_crustaceans 100..765 CDD:276874 47/245 (19%)
Myosin_tail_1 842..1922 CDD:460256 260/1116 (23%)
CGNL1XP_005254783.1 Myosin_tail_1 <615..1257 CDD:460256 225/968 (23%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.