DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG31999 and Efemp1

DIOPT Version :10

Sequence 1:NP_726551.2 Gene:CG31999 / 43777 FlyBaseID:FBgn0051999 Length:917 Species:Drosophila melanogaster
Sequence 2:NP_001012039.1 Gene:Efemp1 / 305604 RGDID:1308528 Length:493 Species:Rattus norvegicus


Alignment Length:537 Identity:159/537 - (29%)
Similarity:232/537 - (43%) Gaps:97/537 - (18%)


- Green bases have known domain annotations that are detailed below.


  Fly   406 CGTGYTLNAETGNCDDDDECTLSTHNCPSNYDCHNTRGSFRCYRKISTMLTTRTTSTTVPPLSLE 470
            |..||..:.....|.|.|||.:....|.....|.|..|.:.|..|.:.::..........|.:..
  Rat    29 CTDGYEWDPVRQQCKDIDECDIVPDACKGGMKCVNHYGGYLCLPKTAQIIVNNEQPQQETPAAEA 93

  Fly   471 N--------ARRSFTSRYPYP---------LAVHPEYSQNNDSISTNRRVDCSPGFYRNTLGACI 518
            :        |.||..:....|         ....||       :.|.|                 
  Rat    94 SSGAATGTIAARSMATSGVIPGGGFIASATAVAGPE-------VQTGR----------------- 134

  Fly   519 DTNECMEQNPCGNHERCINTNGHFRCESLLQCSPGYKSTVDGKSCIDIDECDTGEHNCGERQICR 583
             .|..:.:|| .:.:|..:...|     .:||:.||:.: :...|.|||||.:|.|||...|:|.
  Rat   135 -NNFVIRRNP-ADPQRIPSNPSH-----RIQCAAGYEQS-EHNVCQDIDECTSGTHNCRLDQVCI 191

  Fly   584 NRNGGFVCSCPIGHELKRSIGGASTCVDTNECALEQRVCPLNAQ-CFNTIGAYYCECKAGFQKKS 647
            |..|.|.|.|..|:: ||    ...|||.:||:    |.|...| |.||.|::||:|..|||..:
  Rat   192 NLRGSFTCHCLPGYQ-KR----GEQCVDIDECS----VPPYCHQGCVNTPGSFYCQCNPGFQLAA 247

  Fly   648 DGNNSTQCFDIDECQVIPGLCQQKCLNFWGGYRCTCNSGYQLGPDNRTCNDINECEVHKDYKLCM 712
              ||.| |.||:||.. ...|.|:|.|..|.:.|.||.||:|..|...|.||:||.. ..| ||.
  Rat   248 --NNYT-CVDINECDA-SNQCAQQCYNILGSFICQCNQGYELSSDRLNCEDIDECRT-SSY-LCQ 306

  Fly   713 GLCINTPGSYQCSCPRGYILAADMNTCRDVDECATDSINQVCTGRNDICTNIRGSYKCTTVN-CP 776
            ..|:|.||.:.|.||:||.:... .||:|::||.|   ...|. .:::|.|..|.::|...| |.
  Rat   307 YQCVNEPGKFSCMCPQGYQVVRS-RTCQDINECET---TNECR-EDEMCWNYHGGFRCYPQNPCQ 366

  Fly   777 LGYSIDPEQKNRCRQNLNFC--EGEECYTQPSAFTYNFITFVSKLMIPPDGRTIFTLRGPLWYDN 839
            ..|.:..|  |||     .|  ....|...|.:..|.::...|...:|.|   ||.::....|.|
  Rat   367 DPYVLTSE--NRC-----VCPVSNTMCRDVPQSIVYKYMNIRSDRSVPSD---IFQIQATTIYAN 421

  Fly   840 IEFDLKIVRIQATTNIQKATDGSFDTLQNNNQVN--VILKKSLEGPQD--IELELSMTVYTNGMP 900
               .:...||::...     :|.| .|:..:.|:  ::|.|||.||::  ::||: :||.:.|..
  Rat   422 ---TINTFRIKSGNE-----NGEF-YLRQTSPVSAMLVLVKSLTGPREHIVDLEM-LTVSSIGTF 476

  Fly   901 RGKSVAKLFLFVSQHTF 917
            |..||.:|.:.|...:|
  Rat   477 RTSSVLRLTIIVGPFSF 493

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG31999NP_726551.2 vWFA <189..227 CDD:469594
EGF_CA 251..293 CDD:238011
EGF_CA 328..>363 CDD:214542
EGF_CA 421..447 CDD:429571 8/25 (32%)
EGF_CA 519..564 CDD:214542 9/44 (20%)
EGF_CA 565..601 CDD:238011 17/35 (49%)
EGF_CA 611..642 CDD:429571 13/31 (42%)
FXa_inhibition 661..696 CDD:464251 13/34 (38%)
EGF_CA 698..>730 CDD:214542 14/31 (45%)
cEGF 723..744 CDD:463661 8/20 (40%)
Efemp1NP_001012039.1 EGF_CA 44..71 CDD:473889 8/26 (31%)
EGF_CA 173..204 CDD:429571 16/30 (53%)
EGF_CA 214..244 CDD:214542 14/33 (42%)
EGF_CA 254..293 CDD:214542 16/39 (41%)
Mediates interaction with TIMP3. /evidence=ECO:0000250 259..493 80/261 (31%)
EGF_CA 294..333 CDD:214542 17/41 (41%)
EGF_CA 334..378 CDD:214542 16/54 (30%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.