DRSC/TRiP Functional Genomics Resources

Protein Alignment htl and Epha1

DIOPT Version: 10

Sequence 1: NP_524394.2  Gene: htl / 42160  FlyBaseID: FBgn0010389  Length: 729  Species: Drosophila melanogaster
Sequence 2: NP_076069.2  Gene: Epha1 / 13835  MGIID: 107381  Length: 977  Species: Mus musculus


Alignment Length: 577  Identity: 144/577 (24%)
Similarity: 243/577 (42%)  Gaps: 139/577 (24%)
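
The percentages above can be reproduced from the raw counts. The sketch below is illustrative only: the helper name `percent` is hypothetical, and the use of truncation (integer division) is an assumption inferred from the reported values, since rounding 144/577 to the nearest integer would give 25% rather than the 24% shown.

```python
def percent(count: int, length: int) -> int:
    """Whole-number percentage, truncated (assumed to match the report)."""
    return count * 100 // length


alignment_length = 577
counts = {"Identity": 144, "Similarity": 243, "Gaps": 139}

for label, count in counts.items():
    # Prints each statistic in the same count/length (percent) form as the report.
    print(f"{label}: {count}/{alignment_length} ({percent(count, alignment_length)}%)")
```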


- Green residues have known domain annotations, which are detailed below.


  Fly   203 TGPLNLTLVVNSTGSMHCKYLSDLTSKKAWIFVPCHGMTNCSNNRSIIAEDKDQLDF-------- 259
            :.|.:.:|.:|..   |.:.||.|:.|                   ::.::..||:.        
Mouse   432 SSPSSASLSINMG---HAESLSGLSLK-------------------LVKKEPRQLELTWAGSRPR 474

  Fly   260 ---------VNVRMEQEGWYTCV-ESNSL--GQSNSTAYLRVVRSLHVLEAG------------V 300
                     ::|..:.|.|:..| |...|  .....|.|:..||:|..|..|            .
Mouse   475 NPGGNLSYELHVLNQDEEWHQMVLEPRVLLTKLQPDTTYIVRVRTLTPLGPGPFSPDHEFRTSPP 539

  Fly   301 ASGSLHSTSFVYIFVFGGLIFIFMTTLFVFYAIRKMKHEKVLKQRIETVHQWTKKVIIFKPEGGG 365
            .|.||.....|.: :||.|:.|.:  |...|..|..:.::..:||                    
Mouse   540 VSRSLTGGEIVAV-IFGLLLGIAL--LIGIYVFRSRRGQRQRQQR-------------------- 581

  Fly   366 DSSGSMDTMIMPVVRIQKQRTTVLQNGN----EP-APFNEYEFP----LDSNWELPRSHLVLGAT 421
                            |::|||.:...:    :| .....||.|    ||...||..:.|::...
Mouse   582 ----------------QRERTTNVDREDKLWLKPYVDLQAYEDPAQGALDFAQELDPAWLIVDTV 630

  Fly   422 LGEGAFGRVV-----MAEVNNAIVAVKMVKEGHTDDDIASLVREMEVMKIIGR--HINIINLLGC 479
            :|||.||.|.     :...:...||:|.:|:...|....:.:||..:|   |:  |.:|:.|.|.
Mouse   631 IGEGEFGEVYRGALRLPSQDCKTVAIKTLKDTSPDGYWWNFLREATIM---GQFNHPHILRLEGV 692

  Fly   480 CSQNGPLYVIVEYAPHGNLKDFLYKNRPFGRDQDRDSSQPPPSPPAHVITEKDLIKFAHQIARGM 544
            .::..|:.:|.|:..:|.|..||         ::|:....|          ..|:.....||.||
Mouse   693 ITKRKPIMIITEFMENGALDAFL---------KEREDQLAP----------GQLVAMLLGIASGM 738

  Fly   545 DYLASRRCIHRDLAARNVLVSDDYVLKIADFGLARDIQSTDYYRKNTNGRLPIKWMAPESLQEKF 609
            :.|:....:||||||||:||:.:...|::||||.|.:...|...:...|::||:|.|||::..:.
Mouse   739 NCLSGHNYVHRDLAARNILVNQNLCCKVSDFGLTRLLDDFDGTYETQGGKIPIRWTAPEAIAHRI 803

  Fly   610 YDSKSDVWSYGILLWEIMTYGQQPYPTIMSAEELYTYLMSGQRMEKPAKCSMNIYILMRQCWHFN 674
            :.:.|||||:||::||::::|.:||.. ||.:|:...:..|.|:..|..|...:|.||:.||.::
Mouse   804 FTTASDVWSFGIVMWEVLSFGDKPYGE-MSNQEVMKSIEDGYRLPPPVDCPAPLYELMKNCWAYD 867

  Fly   675 ADDRPPFTEIVEYMDKLLQTKEDYLDVDIANLDTP-----PSTSDEEEDETDNLQKW 726
            ...||.|.::..::::||.......  .|||.|..     ||.|..:.....::.:|
Mouse   868 RARRPHFLQLQAHLEQLLTDPHSLR--TIANFDPRVTLRLPSLSGSDGIPYRSVSEW 922
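
Identity and gap counts like those reported for this alignment can be tallied column by column from the two aligned rows. The following is a minimal sketch, not DIOPT's own code: the function name `count_columns` is hypothetical, and it assumes `-` marks a gap and that both rows have equal length. Similarity is not computed here because it depends on a substitution matrix that the page does not specify. The example input is a short fragment of the fly and mouse rows from the transmembrane region.

```python
def count_columns(row_a: str, row_b: str) -> dict:
    """Count identical and gapped columns in two equal-length aligned rows."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    gaps = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":
            gaps += 1          # column where either sequence has a gap
        elif a == b:
            identical += 1     # same residue in both sequences
    return {"length": len(row_a), "identical": identical, "gaps": gaps}


# Fragment of the alignment (fly row, then mouse row).
fly = "VYIFVFGG"
mouse = "VAV-IFGL"
print(count_columns(fly, mouse))
```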

Known Domains:


Indicated by green residues in the alignment.

Gene   Sequence     Domain                              Region     External ID  Identity
htl    NP_524394.2  Ig                                  101..191   CDD:472250
                    Ig strand B                         121..125   CDD:409353
                    Ig strand C                         134..138   CDD:409353
                    Ig strand E                         157..161   CDD:409353
                    Ig strand F                         171..176   CDD:409353
                    Ig strand G                         184..187   CDD:409353
                    Ig                                  200..289   CDD:472250   18/105 (17%)
                    Ig strand B                         216..220   CDD:409353   0/3 (0%)
                    Ig strand C                         229..233   CDD:409353   1/3 (33%)
                    Ig strand E                         255..259   CDD:409353   2/3 (67%)
                    Ig strand F                         269..274   CDD:409353   2/5 (40%)
                    Ig strand G                         282..285   CDD:409353   0/2 (0%)
                    Protein Kinases, catalytic domain   404..692   CDD:473864   91/298 (31%)
Epha1  NP_076069.2  EphR_LBD_A1                         28..204    CDD:198447
                    fn3                                 335..426   CDD:394996
                    FN3                                 454..536   CDD:238020   16/100 (16%)
                    EphA2_TM                            <596..622  CDD:464211   8/25 (32%)
                    Protein Kinases, catalytic domain   620..885   CDD:473864   87/287 (30%)
                    SAM_EPH-A1                          913..975   CDD:188941   1/10 (10%)
                    PDZ-binding. /evidence=ECO:0000255  975..977
Blue background indicates that the domain is not in the aligned region.
