DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment arr and Egf

DIOPT Version :10

Sequence 1:NP_524737.2 Gene:arr / 44279 FlyBaseID:FBgn0000119 Length:1678 Species:Drosophila melanogaster
Sequence 2:NP_036974.2 Gene:Egf / 25313 RGDID:2542 Length:1132 Species:Rattus norvegicus


Alignment Length:1310 Identity:290/1310 - (22%)
Similarity:452/1310 - (34%) Gaps:457/1310 - (34%)


- Green bases have known domain annotations that are detailed below.


  Fly    81 AFVNTPATLLFTTRHDIQVANITRPTGGPQIDVIVRDLAEAMAIDFYYAKNLVCWTDSGREIIEC 145
            |....|..|:|...:.|...|    |.|.....:|.|...::.:||:|.:..:.|.|..|::::.
  Rat    41 AAAGPPRFLIFLQGNSIFRIN----TDGTNHQQLVVDAGVSVVMDFHYKEERLYWVDLERQLLQR 101

  Fly   146 AQTNSSALQPLLRAPKQTVISTGLDKPEGLAMDWYTDKIYWTDGEKNRIEVATLDGRYQKVLFWT 210
            ...|.|..:.:.:..|..         .|||::|...:|..||..|..|.|..::|...:||. :
  Rat   102 VFFNGSGQETVCKVDKNV---------SGLAINWIDGEILRTDRWKGVITVTDMNGNNSRVLL-S 156

  Fly   211 DLDQPRAVAVVPARKLLIWTD--WGEYPKIERASMDGDPLSRMTLVKEHVFWPNGLAVDLKNELI 273
            .|.:|..:.|.|..:|:.|:.  .|   .:.||.:.|..:..:....|.:   :.|.:|:.::.:
  Rat   157 SLKRPANILVDPTERLIFWSSVVTG---NLHRADLGGMDVKTLLEAPERI---SVLILDILDKRL 215

  Fly   274 YWT----DGKHHFIDVMRLDGSSRRTIVNNLKYP-FSLTFYDDRLYWTD------W--------- 318
            :|.    :|.|.:|.....||.|...|.:..::. .::..:.|::.::.      |         
  Rat   216 FWAQDGREGSHGYIHSCDYDGGSIHHIRHQARHDLLTMAIFGDKILYSALKEKAIWIADKHTGKN 280

  Fly   319 -QRGSLNALDLQTRELK--ELIDTPKAPNSVRAWDP------------SL---QPYED------- 358
             .|.:|:...:..|||:  .|...|...|..:|.|.            ||   .|..|       
  Rat   281 VVRVNLDPASVPPRELRVVHLHAQPGTENRAQASDSERCKQRRGQCLYSLSERDPNSDSCACAEG 345

  Fly   359 -------------NPCAHNN--------------------------------------GN---CS 369
                         |.||..|                                      ||   ||
  Rat   346 YTLSRDRKYCEDVNECALQNHGCTLGCENIPGSYYCTCPTGFVLLPDGKRCHELVACPGNRSECS 410

  Fly   370 HLCLLATNSQGFSCACPTGVKL-----------------------------------------IS 393
            |.|:|  .|.|..|.||.|..|                                         :.
  Rat   411 HDCIL--TSDGPLCICPAGSVLGKDGKTCTGCSSPDNGGCSQICLPLSLASWECDCFPGYDLQLD 473

  Fly   394 ANTC-ANGSQEMMFIVQRTQISKISLDSPDY-TIFPLPLGKVKYAIAIDYDPVEEHIYWSDVETY 456
            ..:| |:|.|..:.......|..:..|..|| |:....:|.|   .|:||||||..||::.....
  Rat   474 RKSCAASGPQPFLLFANSQDIRHMHFDGTDYKTLLSRQMGMV---FALDYDPVESKIYFAQTALK 535

  Fly   457 TIKRAHADGTGVTDFVTSEVRHPDGLALDWLARNLYWTDTVTDRIEVCRLDGTARKVLIYEHLEE 521
            .|:||:.||:.....:|..|..|:|||:||:.|.:||||:....||...|.|...:::|.|.:..
  Rat   536 WIERANLDGSQRERLITEGVDTPEGLAVDWIGRRIYWTDSGKSVIEGSDLSGKHHQIIIKESISR 600

  Fly   522 PRAIAVAPSLGWMFWSDWNERKPKVERASLDGSERVVLVSENLGWPNGIALDIEAKAIYWCDGKT 586
            ||.|||.|....:||:| ....|::|.:||.||:|.::.|.||..|:|||:|.....:||||.|.
  Rat   601 PRGIAVHPKARRLFWTD-TGMSPRIESSSLQGSDRTLIASSNLLEPSGIAIDYLTDTLYWCDTKL 664

  Fly   587 DKIEVANMDGSGRRVVISDNLKHLFGLSILDDYLYWTDWQRRSIDRAHKITGNNRIVVVDQYPDL 651
            ..||:|::|||.||.:..:::.|.|.|::.:|:::::||...|:.|.:|.||.||:       .|
  Rat   665 SVIEMADLDGSKRRRLTQNDVGHPFSLAVFEDHVWFSDWAIPSVIRVNKRTGQNRV-------RL 722

  Fly   652 MG--LKVTRLREVR-----GQNACAVRNGGCSHLCLNRPRDYVCRCAIDYELANDKRTCVVPAAF 709
            .|  ||.:.|..|.     |.:.|..|||||.|:|........|.|...:..|.|.:.|:     
  Rat   723 RGSMLKPSSLVVVHPLAKPGADPCLHRNGGCEHICQESLGTAQCLCREGFVKAPDGKMCL----- 782

  Fly   710 LLFSRQEHIGRISIEYNEGNHNDERIPFKDVRDAHALDVSVAERRIYWTDQKSKCIFRAFL---- 770
                               ...|::|...|..|   |...||.     .|...|    |::    
  Rat   783 -------------------TRKDDQILAGDNAD---LSKEVAS-----LDNSPK----AYVPDDD 816

  Fly   771 ---NGSYVQRIVDSGL-----IGPDGIAVDWLANNIYWSDAEARRIEVARLDGSSRRVLLWKGVE 827
               :.:.|..|:.|||     .||.|.                         ||....     :.
  Rat   817 RTESSTLVAEIMVSGLNYEDDCGPGGC-------------------------GSHAHC-----IS 851

  Fly   828 EPRSLVLEPRRGYMYWTESPTDSIRRAAMDG---SDLQTIVAGANHAAGLTFDQETRRLYWATQS 889
            |..:.|.:..:|:              |.||   ||:.....|::...             .|.|
  Rat   852 EGEAAVCQCLKGF--------------AGDGNLCSDIDECELGSSDCP-------------PTSS 889

  Fly   890 RPAKIESADWDGKKRQILVGSDMDEPYAVSLYQDYVYWSDWNTGDIERVHKTTGQNRSLVHSGMT 954
            |....|.    |...|...|.:.|..|.:.:                                  
  Rat   890 RCINTEG----GYVCQCSEGYEGDGIYCLDV---------------------------------- 916

  Fly   955 YITSLLVFNDKRQTGVNPCKVN------NGGCSHLCLAQPGRRGMTCACPTHYQ-LAKDGVSCIP 1012
                     |:.|.|.:.|..|      .||.:..|...|...|:.|...|... |.|||  |..
  Rat   917 ---------DECQQGSHGCSENATCTNTEGGYNCTCAGCPSAPGLPCPDSTSPSLLGKDG--CHW 970

  Fly  1013 PRNYIIFSQRNCFGRLLPNTTDCP------------------------NIPLPVSGK-------- 1045
            .||.|               |.||                        |..:...|:        
  Rat   971 VRNSI---------------TGCPPSYDGYCLNGGVCMYVESVDRYVCNCVIGYIGERCQHRDLR 1020

  Fly  1046 --NIRAVDYDPITHHIYWIEGRSHSIKRSLANGTKVSLLANSGQPFDLAIDIIGRLLFWTC---- 1104
              .:|..||           |:.|.|       |.||:..       :|:.::..|..|..    
  Rat  1021 WWKLRHADY-----------GQRHDI-------TVVSVCV-------VALALLLLLGMWGTYYYR 1060

  Fly  1105 ------------SQSNSINVTSFLGESVGV-IDTG---------DSEKPRNIAVHAMKRLLFWTD 1147
                        |:.:|.||:|...:|.|. :.:|         :.::|:|..:.|         
  Rat  1061 TRKQLSESSKKPSEESSSNVSSNGPDSSGAGVSSGPQPWFVVLEEHQQPKNGRLPA--------- 1116

  Fly  1148 VGSHQAIIRA 1157
            .|::.|::.|
  Rat  1117 AGTNGAVVEA 1126

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
arrNP_524737.2 LY 162..201 CDD:214531 10/38 (26%)
NHL 168..503 CDD:302697 107/478 (22%)
NHL repeat 172..237 CDD:271320 20/66 (30%)
LY 252..293 CDD:214531 9/44 (20%)
NHL repeat 261..298 CDD:271320 10/40 (25%)
NHL repeat 303..339 CDD:271320 8/54 (15%)
NHL repeat 341..419 CDD:271320 30/195 (15%)
NHL repeat 428..476 CDD:271320 17/47 (36%)
LY 472..511 CDD:214531 17/38 (45%)
NHL repeat 477..503 CDD:271320 12/25 (48%)
Ldl_recept_b 534..574 CDD:459654 17/39 (44%)
LY 558..599 CDD:214531 19/40 (48%)
LY 600..641 CDD:214531 12/40 (30%)
FXa_inhibition 668..703 CDD:464251 12/34 (35%)
YncE <736..886 CDD:442618 27/164 (16%)
LY 776..818 CDD:214531 9/46 (20%)
LY 905..946 CDD:214531 4/40 (10%)
FXa_inhibition 973..1010 CDD:464251 13/43 (30%)
YncE <1047..1191 CDD:442618 26/137 (19%)
LDLa 1319..1362 CDD:238060
LDLa 1365..1399 CDD:238060
LDLa 1399..1433 CDD:197566
EgfNP_036974.2 LDL-receptor class B 1 87..128 10/49 (20%)
LDL-receptor class B 2 129..170 13/41 (32%)
LY 152..193 CDD:214531 12/44 (27%)
LDL-receptor class B 3 171..212 9/46 (20%)
LDL-receptor class B 4 213..259 8/45 (18%)
cEGF 339..360 CDD:463661 0/20 (0%)
FXa_inhibition 361..396 CDD:464251 3/34 (9%)
FXa_inhibition 409..437 CDD:464251 12/29 (41%)
FXa_inhibition 440..477 CDD:464251 0/36 (0%)
LDL-receptor class B 5 484..524 12/42 (29%)
LY 505..547 CDD:214531 17/44 (39%)
LDL-receptor class B 6 525..567 15/41 (37%)
LY 548..590 CDD:214531 17/41 (41%)
LDL-receptor class B 7 568..610 17/41 (41%)
LY 591..634 CDD:214531 17/43 (40%)
LDL-receptor class B 8 611..654 18/43 (42%)
LY 634..677 CDD:214531 20/42 (48%)
LDL-receptor class B 9 655..697 16/41 (39%)
LY 678..719 CDD:214531 12/40 (30%)
FXa_inhibition 746..781 CDD:464251 12/34 (35%)
EGF_3 839..871 CDD:463759 11/75 (15%)
EGF_CA 873..913 CDD:214542 10/56 (18%)
EGF_CA 915..943 CDD:429571 7/70 (10%)
PHA03099 <974..>1016 CDD:165381 6/56 (11%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1068..1096 7/27 (26%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.