DRSC/TRiP Functional Genomics Resources



Protein Alignment htl and epha2a

DIOPT Version: 10

Sequence 1: NP_524394.2  Gene: htl / 42160  FlyBaseID: FBgn0010389  Length: 729  Species: Drosophila melanogaster
Sequence 2: NP_571490.1  Gene: epha2a / 30689  ZFINID: ZDB-GENE-990415-63  Length: 984  Species: Danio rerio


Alignment Length: 627   Identity: 162/627 (25%)
Similarity: 263/627 (41%)   Gaps: 142/627 (22%)
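
The percentages above can be reproduced from the raw counts. The following is a minimal Python sketch, not DIOPT's own code. It assumes the page truncates each percentage to a whole number rather than rounding it, which is inferred from the figures shown (162/627 is about 25.8%, yet 25% is reported). The function name `percent_floor` is illustrative.

```python
def percent_floor(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated (not rounded).

    Truncation is an assumption inferred from the reported figures.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return (100 * count) // total


alignment_length = 627
stats = {"Identity": 162, "Similarity": 263, "Gaps": 142}

for name, count in stats.items():
    pct = percent_floor(count, alignment_length)
    print(f"{name}: {count}/{alignment_length} ({pct}%)")
```

With ordinary rounding the three values would come out as 26%, 42% and 23%, so the reported 25%, 41% and 22% are consistent with truncation only.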


- Green residues have known domain annotations, which are detailed below.


  Fly   169 CGNYTCKVCNSLGCIRHD--------TQVIVSDRVNHKPILMT--------------GPLNLTLV 211
            |....|.:|.  |.:|.:        .:|:||:...|.....|              ...::|.|
Zfish   366 CEGGLCTLCG--GRVRFEPAQTALRAPEVVVSELEPHVNYTFTVEAQNGVSQFSRKRAKASITTV 428

  Fly   212 VNSTGSMHCKYL--SDLTSKK---AWIFVPCHGMTNCSNNRSII---AEDKDQLDFVNVRMEQEG 268
            ::.|......||  .|.|:..   :|. |..|.....|....::   .||..:||...       
Zfish   429 LHFTDGPRVLYLRVEDRTTSSLTLSWA-VDHHVQNQPSPRYELMYRKKEDPGELDVTT------- 485

  Fly   269 WYT--CVESNSLGQSNSTAYLRVVRSLHVLEA----------------GVASGSLHSTSFVYI-- 313
             ||  .:|.||:..::.....:.|..:|.|.|                .:|.....::|.|.:  
Zfish   486 -YTVLVLEKNSVPINDLLPGTKYVFRVHTLTAEGHPSSHSAELEFETLPLAESRTQNSSMVVMGA 549

  Fly   314 FVFGGLIFIFMTTLFVFYAIRKMKHEKVLKQRIETVHQWTKKVIIFKPEGGGDSSGSMDTMIMPV 378
            ...||::.:.:..:.:.:..|...|   :::|::                 ||.. |....::|:
Zfish   550 IAGGGVMLLIVVVILLLHKRRLNSH---VRRRVD-----------------GDYF-SCPEKLLPL 593

  Fly   379 VRIQKQRTTVLQNGNEPAPFNEYEFP----LDSNWELPRSHLVLGATLGEGAFGRVVMAEV---- 435
                  :|.:     :|   :.||.|    |....|:...|:.....:|.|.||.|....:    
Zfish   594 ------KTYI-----DP---HTYEDPCAAILKFASEIHPGHITKQKVIGAGEFGEVFRGSLKMPG 644

  Fly   436 -NNAIVAVKMVKEGHTDDDIASLVREMEVMKIIGR--HINIINLLGCCSQNGPLYVIVEYAPHGN 497
             :...||:|.:|.|:|:......:.|..:|   |:  |.|||.|.|..::.....:|.||..:|.
Zfish   645 RSEVAVAIKTLKPGYTEKQRQDFLSEASIM---GQFSHKNIIRLEGVVTKFKDAMIITEYMENGA 706

  Fly   498 LKDFLYKNRPFGRDQDRDSSQPPPSPPAHVITEKDLIKFAHQIARGMDYLASRRCIHRDLAARNV 562
            |..:|       ||.|.|.|            ...|:...:.||.||.||:....:|||||||||
Zfish   707 LDQYL-------RDHDGDFS------------SYQLVGMLNGIAAGMKYLSDMNYVHRDLAARNV 752

  Fly   563 LVSDDYVLKIADFGLAR---DIQSTDYYRKNTNGRLPIKWMAPESLQEKFYDSKSDVWSYGILLW 624
            ||:.:...|::||||:|   |.....|  ..|.|::||:|.|||::..:.:.|.|||||:||::|
Zfish   753 LVNSNLECKVSDFGLSRVLEDFPEGTY--TTTGGKIPIRWTAPEAIAYRKFTSASDVWSFGIVMW 815

  Fly   625 EIMTYGQQPYPTIMSAEELYTYLMSGQRMEKPAKCSMNIYILMRQCWHFNADDRPPFTEIVEYMD 689
            |:|::|::|| ..||.:|:...:..|.|:..|..|...:..||.|||..:...||.|.:||..::
Zfish   816 EVMSFGERPY-WDMSNQEVMKSINDGYRLPAPMGCPSAVNQLMLQCWMQDRSTRPRFVDIVNLLE 879

  Fly   690 KLLQTKEDYLDVDIANLDTP-----PSTSDEEEDETDNLQKW 726
            |||:..|..  ..||::|..     ||||..:.....::.:|
Zfish   880 KLLRNPESL--TSIASIDPRVSIRLPSTSGCDGAPFRSVDEW 919

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence     Domain                             Region      External ID  Identity
htl     NP_524394.2  Ig                                 101..191    CDD:472250   6/29 (21%)
                     Ig strand B                        121..125    CDD:409353
                     Ig strand C                        134..138    CDD:409353
                     Ig strand E                        157..161    CDD:409353
                     Ig strand F                        171..176    CDD:409353   1/4 (25%)
                     Ig strand G                        184..187    CDD:409353   1/10 (10%)
                     Ig                                 200..289    CDD:472250   21/112 (19%)
                     Ig strand B                        216..220    CDD:409353   0/3 (0%)
                     Ig strand C                        229..233    CDD:409353   0/6 (0%)
                     Ig strand E                        255..259    CDD:409353   1/3 (33%)
                     Ig strand F                        269..274    CDD:409353   2/6 (33%)
                     Ig strand G                        282..285    CDD:409353   0/2 (0%)
                     Protein Kinases, catalytic domain  404..692    CDD:473864   101/301 (34%)
epha2a  NP_571490.1  EphR_LBD_A2                        27..202     CDD:198448
                     FN3                                <321..>537  CDD:442628   35/181 (19%)
                     FN3                                440..531    CDD:238020   20/99 (20%)
                     EphA2_TM                           546..618    CDD:464211   16/106 (15%)
                     PTKc_EphR_A2                       615..882    CDD:133194   99/291 (34%)
                     SAM_EPH-A2                         909..978    CDD:188942   1/11 (9%)
Blue background indicates that the domain is not in the aligned region.
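
The Region values in the domain listing can be turned into numeric coordinates for further processing. The following is a minimal Python sketch under two assumptions that this page does not confirm: that coordinates are 1-based and inclusive, and that a leading `<` or `>` marks a boundary extending beyond the stated position, as in the INSDC feature-location convention. The helper names `parse_region` and `region_length` are hypothetical.

```python
import re


def parse_region(region: str):
    """Parse 'start..end', with optional '<' / '>' partial-boundary markers.

    Returns (start, end, partial_start, partial_end).
    Assumes 1-based inclusive coordinates (not confirmed by the source page).
    """
    m = re.fullmatch(r"(<?)(\d+)\.\.(>?)(\d+)", region.strip())
    if m is None:
        raise ValueError(f"unrecognized region: {region!r}")
    start, end = int(m.group(2)), int(m.group(4))
    if end < start:
        raise ValueError(f"end before start in region: {region!r}")
    return start, end, m.group(1) == "<", m.group(3) == ">"


def region_length(region: str) -> int:
    """Number of residues spanned by the region, counting both endpoints."""
    start, end, _, _ = parse_region(region)
    return end - start + 1


print(region_length("404..692"))   # htl kinase catalytic domain
print(parse_region("<321..>537"))  # epha2a FN3, both boundaries partial
```

The Identity denominators in the listing (for example 101/301 for the htl kinase domain) are not the same as these region lengths, presumably because they count alignment columns, including gaps, rather than residues of one sequence.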
