DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Hml and Tecta

DIOPT Version :10

Sequence 1:NP_524060.2 Gene:Hml / 39529 FlyBaseID:FBgn0029167 Length:3843 Species:Drosophila melanogaster
Sequence 2:NP_033373.2 Gene:Tecta / 21683 MGIID:109575 Length:2155 Species:Mus musculus


Alignment Length:1761 Identity:409/1761 - (23%)
Similarity:636/1761 - (36%) Gaps:472/1761 - (26%)


- Green bases have known domain annotations that are detailed below.


  Fly   485 CTTWGGINMKTFDGLVFKAPLSCSHTLITDKVSGT----FDIILKACPYG--------------S 531
            |..:|..:..||||.:|....||::.|....:..:    |.:..|....|              :
Mouse   322 CVVFGEPHYHTFDGFLFHFQGSCAYLLARQCLQTSSLPFFSVEAKNEHRGGSAVSWVKELSVEVN 386

  Fly   532 GYGCAHTLKILWQSVLYTFENLNGTMQLTTPIKKLPMPVQVMGMKVMPVAQHVQIDLESVGLKLD 596
            ||      |||.....|      |.:::...:..||:.:::..:|:........::.: .||.:.
Mouse   387 GY------KILIPKGSY------GKVKVNDLVTSLPVTLELGAVKIYQSGMSTAVETD-FGLLVT 438

  Fly   597 WDHRQYVSVQAGPQMWGKVGGLCGTLDGDPNTDLTSRTGKKLATVKAFADAWRVEDRSELCQ--- 658
            :|.:.|.|:...........||||..:.:|..|.....|:...:|....::|||......|.   
Mouse   439 FDGQHYASISIPGSYINSTCGLCGNYNKNPLDDFLRPDGRPAMSVLDLGESWRVYHADWKCGSGC 503

  Fly   659 VENSAEME-------FGMDSCEQSKLQKAVSVCERLLANEKLGDCIKPFNYDALIRTCMADYCNC 716
            |:|..:.:       ||.|.|  ..|.|         .:..|.:|....:..|.:.:|:.|.|:.
Mouse   504 VDNCTQCDAATEALYFGSDYC--GFLNK---------TDGPLWECGTVVDATAFVHSCVYDLCSV 557

  Fly   717 ANREHPESCNCDAIAMLAKECAFKGIKLEHGWRNLEIC--PISCGFGRVYQACGPNVEPTCDSDL 779
            .:.   .:..|.||...|..|...||.: ..||....|  .:.|.....|..|..:...|| |||
Mouse   558 RDN---GTLLCQAIQAYALVCQALGIPI-GDWRIQTGCVSTVRCPSFSHYSVCTSSCPDTC-SDL 617

  Fly   780 ALPASKGA---CNEGCFCPEGTVQYKEACITRELCPCSLRGK-----EFKPESTVKKNCNT-CTC 835
            .  ||:..   |.|||.|.||.|.....|:....|.|...|.     ||   .....||.. |.|
Mouse   618 T--ASQNCATPCTEGCECNEGFVLSTSQCVPLHKCGCDFDGHYYTMGEF---FWATANCTVQCLC 677

  Fly   836 -KNGQWRCTEDKC--GARCGAVGDPHYQ-------------------TFDGKRYDFMGKCSYHLL 878
             :.|...|....|  |..| ||.| .||                   ||||..|.|..:.||.||
Mouse   678 EEGGDVYCFNKTCRSGEVC-AVED-GYQGCFPKRETVCLLSQNQVLHTFDGAAYAFPSELSYTLL 740

  Fly   879 KTQNTSVEAENVACSGAVSESMNFAAPD-DPSCTKAVTIRFILRDGTPSVIKLD--QGLTTIVND 940
            ||.....|...:        .:|...|| .|:..:.|  |.::.|   ..:|:.  ..|...:|.
Mouse   741 KTCPERPEYLEI--------DINKKKPDAGPAWLRGV--RILVAD---QEVKIGGVGALEVKLNG 792

  Fly   941 KPIAKLPKMLGLGEVLIRRASSTFLTVEFADGIRVWWDGVSRVYIDAPPSLRGQTQGLCGTFNSN 1005
            :.: :||.....|.:.|.|..:: .|||....:.|.:..|..:||.........|.||||.||:|
Mouse   793 QDV-ELPFFHPSGRLEIHRNKNS-TTVESKGVVSVQYSDVGLLYIRLSTMYFNCTGGLCGFFNAN 855

  Fly  1006 TQDDFLTPEGDVETAVEPFADKWRT-KDTCQ------FKAETHQGPHPCTLNPE--KKAQAEKFC 1061
            ..|:|..|.|.....:..|.:.|.| ::.|.      .||        |..:.|  |..::...|
Mouse   856 ASDEFCLPNGKCTDNLAVFLESWTTFEEICNGECGDLLKA--------CNNDSELLKFYRSRSRC 912

  Fly  1062 DWILQD----IFQDCHFLVEPEQFYEDCLYDTCACKDEMSKCFCPILSAYGTECMRQGVKTG-WR 1121
            . |:.|    .|.:||.:|....:|..||:..|......|: .|..::.|.:.|....|:.| ||
Mouse   913 G-IINDPSNSSFLECHGVVNVTAYYRTCLFRLCQSGGNESE-LCDSVARYASACKNADVEVGPWR 975

  Fly  1122 MSVKECAVKCPLGQVFDECGDGCALSCDDLPSKGSCKRECVEGCRCPHGEYVNEDGECVPKKMCH 1186
             :...|.::||....|:||.. |..:|:.|.....|...|.|||:|..| |..:..:|||:..|.
Mouse   976 -TYDFCPLECPENSHFEECMT-CTETCETLALGPICVDSCSEGCQCDEG-YALQGSQCVPRSECG 1037

  Fly  1187 CNFDGMSFRPGYKEVRPGEKFLDLCTCTDGVWDCQDAEPGDKDKYPPSSELRSKCAKQPYAEFTK 1251
            |||:|       .::...|.|          |..||                  |....|     
Mouse  1038 CNFEG-------HQLATNETF----------WVDQD------------------CQIFCY----- 1062

  Fly  1252 CAPKEPKTCKNMDKYV-ADSSDCLPGCVCMEGYVYDTSRLACVLPANCSCHHAGKSYDDGEKIKE 1315
                    |...|..| .::..|.....|||                    .:|..|        
Mouse  1063 --------CNGTDNSVHCETIPCRDDEYCME--------------------ESGLYY-------- 1091

  Fly  1316 DCNLCECRAGNWKCSKNGCESTCSVWGDSHFTTFDGHDFDFQGACDYVL----AKGVFDNGDGFS 1376
                |:.|.          :::|.|.|..|:.||||:.||||.:|..:|    ::.:.|:...|.
Mouse  1092 ----CQPRT----------DASCIVSGYGHYLTFDGYPFDFQTSCPLILCTTGSRPISDSFPKFI 1142

  Fly  1377 ITIQNVLCGTMGVTCSKSLEIALTGHA-------EESLLLSADSAYSTDPNKTPIKKLRDSVNSK 1434
            :|.:|...........|.:::.:.|::       :.::|::.:..|      .|:|        .
Mouse  1143 VTAKNEDRDPSLALWVKQVDVNVFGYSIVIHRAYKHTVLVNNERLY------LPLK--------L 1193

  Fly  1435 GHNAFHIYKAGVFVVVEVIPLKLQVKWDEGTRVYVKLGNEWRQKVSGLCGNYNGNSLDDMQTPSM 1499
            |....:|:..|..||||. ...|:|.:|..|.:.:.:....:....||||.||||..||::.| |
Mouse  1194 GQGKINIFSFGFHVVVET-DFGLKVVYDWKTFLSITVPRSMQNGTYGLCGRYNGNPDDDLEMP-M 1256

  Fly  1500 GLETSPML----FGHAW-KLQPHCSAPVAPIDACKKHPERETWAQLK--CGALKSDL--FKECHA 1555
            ||   |.|    ||.:| |....|.  |...|.|....:.|.:::::  |..:.:..  |.:||:
Mouse  1257 GL---PALSINEFGQSWVKRDTFCQ--VGCGDRCPSCAKVEGFSKVQQLCSLIPNQNAGFAKCHS 1316

  Fly  1556 EVPLERFWKRCIFDTCACDQGGDCECLCTAVAAYADACAQKGINIR-WRSQHFCPMQCDPHCSDY 1619
            :|....|:|.|:||:|.  .||..:..|:.:..||..|..:||.:. ||:...|.:.|.|: |.|
Mouse  1317 KVNPTFFYKNCLFDSCI--DGGAVQTACSWLQNYASTCQTQGIAVTGWRNYTSCSVTCPPN-SHY 1378

  Fly  1620 KACT----PACAV----ETCDNFLDQGIAERMCNRENCLEGCHIKPCEDGFIYLNDTYRDCVPKA 1676
            ::|.    |.||.    ..|:::              |:|||.   |:.|:: ||.  :.|:...
Mouse  1379 ESCVSVCQPRCAAIRLKSDCNHY--------------CVEGCQ---CDAGYV-LNG--KSCILPH 1423

  Fly  1677 ECKPVCMVRDGK------TFYEGDITFTDSCATCRCSKRK------------EIC---SGVK-CD 1719
            .|.  | ..|||      .|:.||.|     ..|||.:|.            |.|   |||: |.
Mouse  1424 NCG--C-YSDGKYYEPKQLFWNGDCT-----RRCRCFRRNLIQCDPRQCKSDEECALRSGVRGCF 1480

  Fly  1720 VPATT------GLPAPLVEGTTLPTPLATQNQTKCVKGWTRWCDKDRDTSDKSVRLNDEEKVPRY 1778
            ...|:      |......:|..|..|      ..|....:..|.|..|.|.:.:...|:...|..
Mouse  1481 STKTSYCLAAGGGVFRTFDGAFLRFP------ANCAFVLSTICQKLPDISFQLIINFDKWSSPNL 1539

  Fly  1779 DRMENVYGTCLKQYMTKVECRVKDTHEAP---EQMDENVVCSL------EEGLRCIGKCHDYELR 1834
            ..:..||     .|:.:.:..:.|.:...   .|::...:..|      .||...|....|.:: 
Mouse  1540 TIISPVY-----FYINEEQILINDRNTVKVNGTQVNVPFITGLATKIYSSEGFLVIDTSPDIQI- 1598

  Fly  1835 AFCQCDEELEPELPKPTEKPQLGLACDAAVVEYKEFPGD-CHKFLHCQPKGVEGGWIYVEK---- 1894
             :......::..:.:..:....|| |.       .|.|| ...::..:.|.|....:..:.    
Mouse  1599 -YYNGFNVIKISISERLQNKVCGL-CG-------NFNGDMTDDYVTLRGKPVVSSVVLAQSWKTN 1654

  Fly  1895 ---------TCGEYMMFNPTMLICDHI-----------ATVTEIK----PNCGLKPEPEPEFEPI 1935
                     :|.| :.|:.....||::           ..:|::|    |..||. :|.|.:|  
Mouse  1655 GMQKRPLAPSCNE-LQFSQYAATCDNVHIQAMQGDGYCLKLTDMKGFFQPCYGLL-DPLPFYE-- 1715

  Fly  1936 KQCPPGKIKSECANQCENTCHYYGSILKKR-----GLCQVGEHCKPGCVDELRPDCPKLGKFWRD 1995
                              :|:..|....|:     .|...||.|:...:         |...|.:
Mouse  1716 ------------------SCYLDGCYNHKKFQLCGSLAAYGEACRSFGI---------LSTEWIE 1753

  Fly  1996 --------EDTCVHADECP---C-MDKAEHYVQPHKPVLGEFEVCQCIDNAFTCVPNKPEPVPKD 2048
                    ||.||.|| ||   | :|...             |:|.||:         |.|...:
Mouse  1754 KENCSGVVEDPCVGAD-CPNRTCELDNGG-------------ELCGCIE---------PPPYGNN 1795

  Fly  2049 EDDDLD 2054
            ..|.:|
Mouse  1796 SHDIID 1801

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
HmlNP_524060.2 VWD 485..636 CDD:459671 34/168 (20%)
C8 684..754 CDD:462584 15/69 (22%)
TIL 758..811 CDD:410995 19/55 (35%)
VWD 840..1015 CDD:214566 55/198 (28%)
C8 1054..1121 CDD:214843 17/71 (24%)
TIL 1131..1185 CDD:410995 19/53 (36%)
TIL 1245..1298 CDD:460351 8/53 (15%)
VWD 1327..1498 CDD:214566 45/181 (25%)
C8 1535..1609 CDD:214843 22/78 (28%)
TIL 1938..2005 CDD:473303 15/79 (19%)
FA58C 2089..2223 CDD:238014
FA58C <2299..2403 CDD:238014
VWD 2703..2858 CDD:459671
TIL 2974..3030 CDD:460351
VWD 3035..3198 CDD:459671
C8 3248..3313 CDD:462584
VWC 3397..3451 CDD:450195
GHB_like <3755..3813 CDD:473907
TectaNP_033373.2 NIDO 98..254 CDD:214712
VWD 313..477 CDD:214566 34/167 (20%)
C8 517..591 CDD:214843 21/88 (24%)
TIL 597..650 CDD:460351 19/55 (35%)
VWC 652..706 CDD:450195 18/58 (31%)
VWD 713..866 CDD:459671 46/167 (28%)
C8 905..981 CDD:214843 19/78 (24%)
TIL 984..1036 CDD:410995 19/53 (36%)
VWD 1100..1258 CDD:459671 47/173 (27%)
C8 1300..1368 CDD:462584 21/69 (30%)
TIL 1372..1425 CDD:410995 18/73 (25%)
VWD 1487..1639 CDD:459671 27/172 (16%)
C8 1685..1757 CDD:214843 17/101 (17%)
ZP 1805..2059 CDD:214579
FXa_inhibition 2089..2121 CDD:464251
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.