DRSC/TRiP Functional Genomics Resources



Protein Alignment CG31999 and Efemp2

DIOPT Version: 10

Sequence 1: NP_726551.2  Gene: CG31999 / 43777  FlyBaseID: FBgn0051999  Length: 917  Species: Drosophila melanogaster
Sequence 2: XP_006230804.1  Gene: Efemp2 / 293677  RGDID: 1359496  Length: 471  Species: Rattus norvegicus


Alignment Length: 477
Identity: 146/477 (30%)
Similarity: 216/477 (45%)
Gaps: 101/477 (21%)
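
Each percentage above is the corresponding count divided by the alignment length of 477. The following is a minimal Python sketch of that arithmetic. It assumes the page truncates to a whole percent, which is an inference from the three values shown here and not documented behaviour; the function name `percent_of_alignment` is hypothetical.

```python
def percent_of_alignment(count: int, alignment_length: int) -> int:
    """Return count as a whole-number percentage of alignment_length.

    Assumption: the value is truncated, not rounded. This reproduces the
    three figures reported on this page but is not a documented rule.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    # Floor division truncates for non-negative inputs.
    return (100 * count) // alignment_length


# Counts copied from the alignment summary on this page.
alignment_length = 477
stats = {"Identity": 146, "Similarity": 216, "Gaps": 101}

percentages = {
    name: percent_of_alignment(count, alignment_length)
    for name, count in stats.items()
}

for name, pct in percentages.items():
    print(f"{name}: {stats[name]}/{alignment_length} ({pct}%)")
```

Truncation is used here because ordinary rounding would give 31% for 146/477, while the page reports 30%.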


- Green bases have known domain annotations that are detailed below.


  Fly   485 AVHPEYSQNNDSISTNRRVDCSPGF-YRNTLGACIDTNECME-QNPCGNHERCINTNGHFRC--- 544
            |..|:.|:..||.:     :|:.|: :......|.|.|||:. ...|....:|||..|.:.|   
  Rat    52 AASPQDSEEPDSYT-----ECTDGYEWDADSQHCRDVNECLTIPEACKGEMKCINHYGGYLCLPR 111

  Fly   545 --------------------ESLLQCSPGYKSTVDGKSCIDIDECDTGEHNCGERQICRNRNGGF 589
                                :....|.|||:.. :.:||:|:|||....|:|...|.|.|..|.:
  Rat   112 SAAVISDLHSEGPPPPAASAQHPNPCPPGYEPD-EQESCVDVDECTQAIHDCRPSQDCHNLPGSY 175

  Fly   590 VCSCPIGHELKRSIGGASTCVDTNECALEQRVCPLNAQCFNTIGAYYCECKAGFQKKSDGNNSTQ 654
            .|:||.|:   |.||  ..|||.:||  ..|.|  ..:|.|..|::.|:|:.|||.   |.|:..
  Rat   176 QCTCPDGY---RKIG--PECVDIDEC--RYRYC--QHRCVNLPGSFRCQCEPGFQL---GPNNRS 228

  Fly   655 CFDIDECQV-IPGLCQQKCLNFWGGYRCTCNSGYQLGPDNRTCNDINECEVHKDYKLCMGLCINT 718
            |.|::||.: .|  |:|:|.|.:|.:.|.||.||:|..|..:|:||:||. :..| ||...|:|.
  Rat   229 CVDVNECDMGAP--CEQRCFNSYGTFLCRCNQGYELHRDGFSCSDIDECS-YSSY-LCQYRCVNE 289

  Fly   719 PGSYQCSCPRGYILAADMNTCRDVDECATDSINQVCTGRNDICTNIRGSYKCTTVN-CPLGYSID 782
            ||.:.|.||:||.|.| ...|:|:|||.|.:  ..|: ....|.|..|.|:|...| |     ::
  Rat   290 PGRFSCHCPQGYQLLA-TRLCQDIDECETGA--HQCS-EAQTCVNFHGGYRCVDTNRC-----VE 345

  Fly   783 P---EQKNRCRQNLNFCEGEE--CYTQPSAFTYNFITFVSKLMIPPDGRTIFTLRGPLWYDNIEF 842
            |   ...|||     .|....  |..|||:..:.:::..|:..:|.|                  
  Rat   346 PYVQVSDNRC-----LCPASNPLCREQPSSIVHRYMSITSERSVPAD------------------ 387

  Fly   843 DLKIVRIQATTNIQKA----------TDGSFDTLQ-NNNQVNVILKKSLEGPQDIELELSMTVYT 896
               :.:||||:....|          |.|.|...| ||....::|.:.:.||::..|:|.|....
  Rat   388 ---VFQIQATSVYPGAYNAFQIRAGNTQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMN 449

  Fly   897 NGMP-RGKSVAKLFLFVSQHTF 917
            :.|. |..||.:|.:||..:||
  Rat   450 SLMSYRASSVLRLTVFVGAYTF 471

Known Domains:


Indicated by green bases in alignment.

Gene      Sequence        Domain          Region      External ID   Identity
CG31999   NP_726551.2     vWFA            <189..227   CDD:469594
                          EGF_CA          251..293    CDD:238011
                          EGF_CA          328..>363   CDD:214542
                          EGF_CA          421..447    CDD:429571
                          EGF_CA          519..564    CDD:214542    15/68 (22%)
                          EGF_CA          565..601    CDD:238011    14/35 (40%)
                          EGF_CA          611..642    CDD:429571    10/30 (33%)
                          FXa_inhibition  661..696    CDD:464251    14/35 (40%)
                          EGF_CA          698..>730   CDD:214542    14/31 (45%)
                          cEGF            723..744    CDD:463661    9/20 (45%)
Efemp2    XP_006230804.1  EGF_CA          151..191    CDD:214542    17/44 (39%)
                          EGF_CA          192..225    CDD:214542    14/39 (36%)
                          EGF_CA          231..264    CDD:238011    15/34 (44%)
                          EGF_CA          271..302    CDD:214542    15/32 (47%)
                          EGF_CA          311..338    CDD:429571    10/29 (34%)

Domains listed without an Identity value are not in the aligned region (shown with a blue background on the original page).
