DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment sim and Epas1

DIOPT Version :10

Sequence 1:NP_524340.2 Gene:sim / 41612 FlyBaseID:FBgn0004666 Length:688 Species:Drosophila melanogaster
Sequence 2:NP_034267.3 Gene:Epas1 / 13819 MGIID:109169 Length:874 Species:Mus musculus


Alignment Length:807 Identity:207/807 - (25%)
Similarity:324/807 - (40%) Gaps:190/807 - (23%)


- Green bases have known domain annotations that are detailed below.


  Fly     1 MTNHRRVRKDCYESRLHDIAKTCAMKEKSKNAARTRREKENTEFCELAKLLPLPAAITSQLDKAS 65
            ||..:..::...|.|          ||||::|||.||.||...|.|||..||||.:::|.|||||
Mouse     1 MTADKEKKRSSSELR----------KEKSRDAARCRRSKETEVFYELAHELPLPHSVSSHLDKAS 55

  Fly    66 VIRLTTSYLKMRQVFPDGLGEAWGSSPAMQRGATIKELGSHLLQTLDGFIFVVAPDGKIMYISET 130
            ::||..|:|:..::......|....:.|.|      ::.:..|:.|:|||.||..||.::::||.
Mouse    56 IMRLAISFLRTHKLLSSVCSENESEAEADQ------QMDNLYLKALEGFIAVVTQDGDMIFLSEN 114

  Fly   131 ASVHLGLSQVELTGNSIFEYIHNYDQDEMNAILSLHPHINQHPLAQTHTPIGSPNGVQHPSAYDH 195
            .|..:||:||||||:|||::.|..|.:|:...|:|..              ||..|         
Mouse   115 ISKFMGLTQVELTGHSIFDFTHPCDHEEIRENLTLKN--------------GSGFG--------- 156

  Fly   196 DRGSHTIEIEKTFFLRMKCVLAK--RNAGLTTSGFKVIHCSGYLKARIY---PDRGDGQGSLIQN 255
             :.|..:..|:.||:||||.:..  |...|.::.:||:||:|  :.|:|   |......||....
Mouse   157 -KKSKDVSTERDFFMRMKCTVTNRGRTVNLKSATWKVLHCTG--QVRVYNNCPPHSSLCGSKEPL 218

  Fly   256 LG-LVAVGHSLPSSAITEIKLHQNMFMFRAKLDMKLIFFDARVSQLTGYEPQDLIEKTLYQYIHA 319
            |. |:.:...:...:..:|.|....|:.|..:|||..:.|.|:.:|.||.|::|:.::.|::.||
Mouse   219 LSCLIIMCEPIQHPSHMDIPLDSKTFLSRHSMDMKFTYCDDRILELIGYHPEELLGRSAYEFYHA 283

  Fly   320 ADIMAMRCSHQILLYKGQVTTKYYRFLTKGGGWVWVQSYATLVHNSRSSREVFIVSVNYVLSERE 384
            .|...|..|||.|..||||.:..||.|.|.||:||:::..|:::|.|:.:...|:.|||||||.|
Mouse   284 LDSENMTKSHQNLCTKGQVVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIE 348

  Fly   385 VKDLVLNEIQT-------------------------------GVVKREPISPAAQAAQAAQA--- 415
            ..|:|.:..||                               ..:|.||...|..|.....|   
Mouse   349 KNDVVFSMDQTESLFKPHLMAMNSIFDSSDDVAVTEKSNYLFTKLKEEPEELAQLAPTPGDAIIS 413

  Fly   416 ------------------AQAAQAAQAAQAAHVAQAVQAQV--VVVPQQ----SVVVQPQCAGAT 456
                              ....|...:...:|.||:....:  ..|||.    :.......:.:.
Mouse   414 LDFGSQNFDEPSAYGKAILPPGQPWVSGLRSHSAQSESGSLPAFTVPQADTPGNTTPSASSSSSC 478

  Fly   457 GQPVGP-------GTPVSL-------ALSASPK----------------LDPYFE---------- 481
            ..|..|       ..|:.:       |:...|:                |.||..          
Mouse   479 STPSSPEDYYSSLENPLKIEVIEKLFAMDTEPRDPGSTQTDFSELDLETLAPYIPMDGEDFQLSP 543

  Fly   482 --PELPLQPAVTPVPPTNNSSSSSNN---------NNGVWHHHHVQQQQQSGSMDHDSLSYTQLY 535
              ||.||.|. :|.|...:..|:..:         .:|.:......||.:|...:.:       :
Mouse   544 ICPEEPLMPE-SPQPTPQHCFSTMTSIFQPLTPGATHGPFFLDKYPQQLESRKTESE-------H 600

  Fly   536 PPLNDLVVSSSSSVGGGTASSAGGGSSASASSSGVYSTEMQYPDTTTGNLYYNNNNHYYYDYDA- 599
            .|::.:...:.|.   |:.|...|.:|...||.|..|.....||..   |::........|..| 
Mouse   601 WPMSSIFFDAGSK---GSLSPCCGQASTPLSSMGGRSNTQWPPDPP---LHFGPTKWPVGDQSAE 659

  Fly   600 ---TVDVATSMIRPFSANSNSCSSSSESERQLSTGNASIVNETSPSQTTYSD-LSHNFELSY--- 657
               .:.|.:|.:.|.||..:.......|.:........::   ||:....|: |....:|.|   
Mouse   660 SLGALPVGSSQLEPPSAPPHVSMFKMRSAKDFGARGPYMM---SPAMIALSNKLKLKRQLEYEEQ 721

  Fly   658 -FSDNS-------SQQHQHQQQQQHLM 676
             |.|.|       |..|...::.:.||
Mouse   722 AFQDTSGGDPPGTSSSHLMWKRMKSLM 748

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
simNP_524340.2 bHLH-PAS_dSIM_like 25..86 CDD:381583 29/60 (48%)
PAS 102..215 CDD:395786 38/112 (34%)
PAS_3 291..377 CDD:430001 32/85 (38%)
Epas1NP_034267.3 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..23 8/31 (26%)
bHLH-PAS_HIF2a_PASD2 9..74 CDD:381571 31/74 (42%)
DNA-binding. /evidence=ECO:0000269|PubMed:26245371 26..53 14/26 (54%)
PAS 86..147 CDD:214512 26/60 (43%)
Required for heterodimer formation with ARNT. /evidence=ECO:0000269|PubMed:26245371 171..192 6/20 (30%)
PAS_3 254..341 CDD:430001 32/86 (37%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 438..489 8/50 (16%)
NTAD 495..541 5/45 (11%)
HIF-1 516..548 CDD:463274 4/31 (13%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 777..803
CTAD 834..874
HIF-1a_CTAD 837..873 CDD:430212
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.