DRSC/TRiP Functional Genomics Resources


Protein Alignment Hml and Tecta

DIOPT Version: 9

Sequence 1: NP_524060.2  Gene: Hml / 39529  FlyBaseID: FBgn0029167  Length: 3843  Species: Drosophila melanogaster
Sequence 2: NP_001100284.1  Gene: Tecta / 300653  RGDID: 1309824  Length: 2155  Species: Rattus norvegicus


Alignment Length: 1776  Identity: 402/1776 (22%)
Similarity: 628/1776 (35%)  Gaps: 502/1776 (28%)
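
The summary percentages can be re-derived from the raw counts. The sketch below is only an illustration, not DIOPT's own code; the names `ALIGNMENT_LENGTH`, `COUNTS`, and `percent` are hypothetical. It prints one decimal place, which shows that the identity figure is about 22.6%, so the 22% shown on the page looks truncated rather than rounded (an inference from the numbers, not something the page states).

```python
# Counts copied from the alignment summary on this page.
ALIGNMENT_LENGTH = 1776
COUNTS = {"identity": 402, "similarity": 628, "gaps": 502}


def percent(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


for name, count in COUNTS.items():
    value = percent(count, ALIGNMENT_LENGTH)
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} = {value:.1f}%")
```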


- Green bases have known domain annotations that are detailed below.


  Fly   485 CTTWGGINMKTFDGLVFKAPLSCSHTLITDKVSGT----FDIILKACPYG--------------S 531
            |..:|..:..||||.:|....||::.|....:..:    |.:..|....|              :
  Rat   322 CVVFGEPHYHTFDGFLFHFQGSCAYLLARQCLQTSSLPFFSVEAKNEHRGGSAVSWVKELSVEVN 386

  Fly   532 GYGCAHTLKILWQSVLYTFENLNGTMQLTTPIKKLPMPVQVMGMKVMPVAQHVQIDLESVGLKLD 596
            ||      |||.....|      |.:::...:..||:.:::..:|:........::.: .||.:.
  Rat   387 GY------KILIPKGSY------GKVKVNDLVTSLPVTLELGAVKIYQSGMSTAVETD-FGLLVT 438

  Fly   597 WDHRQYVSVQAGPQMWGKVGGLCGTLDGDPNTDLTSRTGKKLATVKAFADAWRVE---------- 651
            :|.:.|.|:...........||||..:.:|..|.....|:...:|....::|||.          
  Rat   439 FDGQHYASISIPGSYINSTCGLCGNYNKNPLDDFLRPDGRPAMSVLDLGESWRVYHTDWKCGSGC 503

  Fly   652 -DRSELCQVENSAEMEFGMDSCEQSKLQKAVSVCERLLANEKLGDCIKPFNYDALIRTCMADYCN 715
             |....|.....| :.||.|.|  ..|.|         .:..|.:|....:..|.:.:|:.|.|:
  Rat   504 IDNCTQCDAATEA-LYFGSDYC--GFLNK---------TDGPLWECGTVVDATAFVHSCVYDLCS 556

  Fly   716 CANREHPESCNCDAIAMLAKECAFKGIKLEHGWRNLEIC--PISCGFGRVYQACGPNVEPTCDSD 778
            ..:.   .:..|.||...|..|...||.: ..||....|  .:.|.....|..|..:...|| ||
  Rat   557 VRDN---GTLLCQAIQAYALVCQALGIPI-GDWRIQTGCVSTVRCPSFSHYSVCTSSCPDTC-SD 616

  Fly   779 LALPASKGA---CNEGCFCPEGTVQYKEACITRELCPCSLRGK-----EFKPESTVKKNCNT-CT 834
            |.  ||:..   |.|||.|.||.|.....|:....|.|...|.     ||   .....||.. |.
  Rat   617 LT--ASQNCATPCTEGCECNEGFVLSTSQCVPLHKCGCDFEGHYYTMGEF---FWATANCTVQCL 676

  Fly   835 C-KNGQWRCTEDKC--GARCGAVGDPHYQ-------------------TFDGKRYDFMGKCSYHL 877
            | :.|...|....|  |..| ||.| .||                   ||||..|.|..:.||.|
  Rat   677 CEEGGDVYCFNKTCRSGEVC-AVED-GYQGCFPKRETVCLLSQNQVLHTFDGAAYAFPSELSYTL 739

  Fly   878 LKTQNTSVEAENVACSGAVSESMNFAAPDDPSCTKAVTIRFILRDGTPSVIKLDQGLTTIVNDKP 942
            |||                       .|:.|...: :.|.....|..|:.::   |:..:|.|:.
  Rat   740 LKT-----------------------CPERPEYLE-IDINKRKPDAGPAWLR---GVRILVADQE 777

  Fly   943 IAKLPKMLGLGEVLI---------------------RRASSTFLTVEFADGIRVWWDGVSRVYID 986
            :    |:.|:|.:.:                     |..:||  |||....:.|.:..|..:||.
  Rat   778 V----KIGGVGALEVKLNGHDVELPFFHPSGRLEIHRNKNST--TVESKGVVSVQYSDVGLLYIR 836

  Fly   987 APPSLRGQTQGLCGTFNSNTQDDFLTPEGDVETAVEPFADKWRT-KDTCQ------FKAETHQGP 1044
            ........|.||||.||:|..|:|..|.|.....:..|.:.|.| ::.|.      .||      
  Rat   837 LSTMYFNCTGGLCGFFNANASDEFCLPNGKCTDNLAVFLESWTTFEEICNGECGDLLKA------ 895

  Fly  1045 HPCTLNPE--KKAQAEKFCDWILQD----IFQDCHFLVEPEQFYEDCLYDTCACKDEMSKCFCPI 1103
              |..:.|  |..::...|. |:.|    .|.:||.:|....:|..||:..|......|: .|..
  Rat   896 --CNNDSELLKFYRSRSRCG-IINDPSNSSFLECHGVVNVTAYYRTCLFRLCQSGGNESE-LCDS 956

  Fly  1104 LSAYGTECMRQGVKTG-WRMSVKECAVKCPLGQVFDECGDGCALSCDDLPSKGSCKRECVEGCRC 1167
            ::.|.:.|....|:.| || :...|.::||....|:||.. |..:|:.|.....|...|.|||:|
  Rat   957 VARYASACKNADVEVGPWR-TYDFCPLECPENSHFEECMT-CTETCETLALGPICVDSCSEGCQC 1019

  Fly  1168 PHGEYVNEDGECVPKKMCHCNFDGMSFRPGYKEVRPGEKFLDLCTCTDGVWDCQDAEPGDKDKYP 1232
            ..| |..:..:||.:..|.|||:|       .::...|.|          |..||          
  Rat  1020 DEG-YALQGSQCVTRSECGCNFEG-------HQLATNETF----------WVDQD---------- 1056

  Fly  1233 PSSELRSKCAKQPYAEFTKCAPKEPKTCKNMDKYV-ADSSDCLPGCVCMEGYVYDTSRLACVLPA 1296
                    |....|             |...|..| .::..|.....|||               
  Rat  1057 --------CQIFCY-------------CNGTDNSVHCETIPCRDDDYCME--------------- 1085

  Fly  1297 NCSCHHAGKSYDDGEKIKEDCNLCECRAGNWKCSKNGCESTCSVWGDSHFTTFDGHDFDFQGACD 1361
                 .:|..|            |:.|.          :::|.|.|..|:.||||:.||||.:|.
  Rat  1086 -----ESGLYY------------CQPRT----------DASCIVSGYGHYLTFDGYPFDFQTSCP 1123

  Fly  1362 YVL----AKGVFDNGDGFSITIQNVLCGTMGVTCSKSLEIALTGHA-------EESLLLSADSAY 1415
            .:|    ::.:.|:...|.:|.:|...........|.:::.:.|::       :.::|::.:..|
  Rat  1124 LILCTTGSRSISDSFPKFIVTAKNEDRDPSLALWVKQVDVTVFGYSIVIHRAYKHTVLVNNERLY 1188

  Fly  1416 STDPNKTPIKKLRDSVNSKGHNAFHIYKAGVFVVVEVIPLKLQVKWDEGTRVYVKLGNEWRQKVS 1480
                  .|:|        .|....:|:..|..||||. ...|:|.:|..|.:.:.:....:....
  Rat  1189 ------LPLK--------LGQGKINIFSFGFHVVVET-DFGLKVVYDWKTFLSITVPRSMQNSTY 1238

  Fly  1481 GLCGNYNGNSLDDMQTPSMGLETSPMLFGHAW-KLQPHCSAPVAPIDACKKHPERETWAQLK--C 1542
            ||||.||||..||::.|...|..|...||.:| |....|.  |...|.|....:.|.:::::  |
  Rat  1239 GLCGRYNGNPGDDLEMPMGLLALSINEFGQSWVKRDTFCQ--VGCGDRCPSCAKVEGFSKVQQLC 1301

  Fly  1543 GALKSDL--FKECHAEVPLERFWKRCIFDTCACDQGGDCECLCTAVAAYADACAQKGINIR-WRS 1604
            ..:.:..  |.:||::|....|:|.|:||:|.  .||..:..|:.:..||..|..:|:.:. ||:
  Rat  1302 SLIPNQNAGFAKCHSKVNPTFFYKNCLFDSCI--DGGAVQTACSWLQNYASTCQTQGVAVTGWRN 1364

  Fly  1605 QHFCPMQCDPHCSDYKACT----PACAV----ETCDNFLDQGIAERMCNRENCLEGCHIKPCEDG 1661
            ...|.:.|.|: |.|::|.    |.||.    ..|:::              |:|||.   |:.|
  Rat  1365 YTSCSVTCPPN-SHYESCVSVCQPRCAAIRLKSDCNHY--------------CVEGCQ---CDAG 1411

  Fly  1662 FIYLNDTYRDCVPKAECKPVCMVRDGK------TFYEGDITFTDSCATCRCSKRK---------- 1710
            :: ||.  :.|:....|.  | ..|||      .|:.||.|     ..|||.:|.          
  Rat  1412 YV-LNG--KSCILPHNCG--C-YSDGKYYEPKQLFWNGDCT-----RRCRCFRRNLIQCDPRQCK 1465

  Fly  1711 --EIC---SGVK-CDVPATT------GLPAPLVEGTTLPTPLATQNQTKCVKGWTRWCDKDRDTS 1763
              |.|   |||: |....|:      |......:|..|..|      ..|....:..|.|..|.|
  Rat  1466 SDEECALRSGVRGCFSTKTSYCLAAGGGVFRTFDGAFLRFP------ANCAFVLSTICQKLPDIS 1524

  Fly  1764 DKSVRLNDEEKVPRYDRMENVYGTCLKQYMTKVECRVKDTHEAP---EQMDENVVCSL------E 1819
            .:.:...|:...|....:..||     .|:.:.:..:.|.:...   .|::...:..|      .
  Rat  1525 FQLIINFDKWSSPNLTIISPVY-----FYINEEQILINDRNTVKVNGTQVNVPFITGLATKIYSS 1584

  Fly  1820 EGLRCIGKCHDYELRAFCQCDEELEPELPKPTEKPQLGLACDAAVVEYKEFPGD-CHKFLHCQPK 1883
            ||...|....|.::  :......::..:.:..:....|| |.       .|.|| ...::..:.|
  Rat  1585 EGFLVIDTSPDIQI--YYNGFNVIKISISERLQNKVCGL-CG-------NFNGDMTDDYVTLRGK 1639

  Fly  1884 GVEGGWIYVEK-------------TCGEYMMFNPTMLICDHI-----------ATVTEIK----P 1920
            .|....:..:.             :|.| :.|:.....||::           ..:|::|    |
  Rat  1640 PVVSSVVLAQSWKTNGMQKRPLTPSCNE-LQFSQYAATCDNVHIQAMQGDGYCLKLTDMKGFFQP 1703

  Fly  1921 NCGLKPEPEPEFEPIKQCPPGKIKSECANQCENTCHYYGSILKKR-----GLCQVGEHCKPGCVD 1980
            ..||. :|.|.:|                    :|:..|....|:     .|...||.|:...: 
  Rat  1704 CYGLL-DPLPFYE--------------------SCYLDGCYNHKKFQLCGSLAAYGEACRSFGI- 1746

  Fly  1981 ELRPDCPKLGKFWRD--------EDTCVHADECP---C-MDKAEHYVQPHKPVLGEFEVCQCIDN 2033
                    |...|.:        ||.||.|| ||   | :|...             |:|.||: 
  Rat  1747 --------LSTEWIEKENCSGVVEDPCVGAD-CPNRTCELDNGG-------------ELCGCIE- 1788

  Fly  2034 AFTCVPNKPEPVPKDEDDDLD 2054
                    |.|...:..|.:|
  Rat  1789 --------PPPYGNNSHDIID 1801
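
Each alignment block above is a Fly row, a match line, and a Rat row, where a row holds a label, a start coordinate, the aligned residues, and an end coordinate. As a rough sketch of how such rows could be read programmatically, the code below parses the last block and counts identical and gapped columns by comparing residues directly. The regular expression and the helper names `parse_row` and `column_stats` are assumptions made for illustration; this is not part of DIOPT.

```python
import re

# One alignment row: label, start coordinate, aligned residues, end coordinate.
ROW = re.compile(r"^\s*(\w+)\s+(\d+)\s+([A-Za-z-]+)\s+(\d+)\s*$")


def parse_row(line):
    """Split an alignment row into (label, start, residues, end)."""
    match = ROW.match(line)
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, residues, end = match.groups()
    return label, int(start), residues, int(end)


def column_stats(seq_a, seq_b):
    """Count identical columns and columns containing a gap ('-')."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have equal length")
    identical = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != "-")
    gaps = sum(1 for a, b in zip(seq_a, seq_b) if a == "-" or b == "-")
    return identical, gaps


# The final block of the alignment, copied from this page.
fly = parse_row("  Fly  2034 AFTCVPNKPEPVPKDEDDDLD 2054")
rat = parse_row("  Rat  1789 --------PPPYGNNSHDIID 1801")

identical, gaps = column_stats(fly[2], rat[2])
print(f"identical columns: {identical}, gapped columns: {gaps}")

# Sanity check: non-gap residues should span the stated coordinates.
rat_residues = len(rat[2].replace("-", ""))
print(rat_residues == rat[3] - rat[1] + 1)
```

Applying the same two functions to every Fly/Rat pair and summing the results would be one way to cross-check the identity and gap totals in the summary, assuming the rows are extracted without loss.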

Known Domains:


Indicated by green bases in alignment.

Gene   Sequence        Domain          Region       External ID Identity
Hml    NP_524060.2     VWD             485..636     CDD:295339  34/168 (20%)
                       C8              680..754     CDD:285899  15/73 (21%)
                       TIL             758..811     CDD:280072  19/55 (35%)
                       VWD             840..1015    CDD:214566  53/216 (25%)
                       C8              1054..1121   CDD:214843  17/71 (24%)
                       TIL             1131..1185   CDD:280072  18/53 (34%)
                       TIL             1245..1298   CDD:280072  8/53 (15%)
                       VWD             1327..1498   CDD:214566  45/181 (25%)
                       C8              1535..1609   CDD:214843  21/78 (27%)
                       Mucin2_WxxW     1751..1837   CDD:290069  16/94 (17%)
                       TIL             1938..2005   CDD:280072  15/79 (19%)
                       FA58C           2089..2223   CDD:238014
                       FA58C           2104..2225   CDD:214572
                       FA58C           <2299..2404  CDD:214572
                       FA58C           <2299..2403  CDD:238014
                       VWD             2703..2858   CDD:295339
                       C8              2893..2970   CDD:285899
                       TIL             2974..3030   CDD:280072
                       VWD             3035..3198   CDD:295339
                       C8              3257..3313   CDD:285899
                       VWC             3397..3451   CDD:302663
                       GHB_like        <3755..3813  CDD:304424
Tecta  NP_001100284.1  NIDO            98..254      CDD:214712
                       VWD             322..478     CDD:278521  34/168 (20%)
                       C8              517..591     CDD:214843  21/88 (24%)
                       TIL             597..650     CDD:280072  19/55 (35%)
                       VWC             652..706     CDD:302663  18/58 (31%)
                       VWD             703..865     CDD:214566  45/194 (23%)
                       C8              905..981     CDD:214843  19/78 (24%)
                       TIL             984..1036    CDD:280072  18/53 (34%)
                       VWD             1100..1258   CDD:278521  46/172 (27%)
                       C8              1298..1368   CDD:285899  20/71 (28%)
                       TIL             1372..1425   CDD:280072  18/73 (25%)
                       VWD             1487..1639   CDD:278521  27/172 (16%)
                       C8              1685..1757   CDD:214843  17/101 (17%)
                       ZP              1805..2059   CDD:214579
                       Zona_pellucida  1937..2057   CDD:278526
                       FXa_inhibition  2089..2121   CDD:291342
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E1_KOG1216
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         1             0.960           - -
Total           3             2.860
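
The final line of the table appears to be the column totals: three tools report a match, and their weighted scores add up to 2.860. A minimal check of that arithmetic follows; the dictionary `matched_tools` is a hypothetical name, and its values are copied from the three non-zero rows of the table.

```python
# (simple score, weighted score) for the tools that matched this pair.
matched_tools = {
    "eggNOG": (1, 0.900),
    "SwiftOrtho": (1, 1.000),
    "TreeFam": (1, 0.960),
}

simple_total = sum(simple for simple, _ in matched_tools.values())
weighted_total = sum(weighted for _, weighted in matched_tools.values())

print(f"simple total: {simple_total}")
print(f"weighted total: {weighted_total:.3f}")
```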
