DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment mgl and Egf

DIOPT Version :10

Sequence 1:NP_572563.3 Gene:mgl / 8674055 FlyBaseID:FBgn0261260 Length:4769 Species:Drosophila melanogaster
Sequence 2:XP_011238312.1 Gene:Egf / 13645 MGIID:95290 Length:1221 Species:Mus musculus


Alignment Length:1411 Identity:291/1411 - (20%)
Similarity:492/1411 - (34%) Gaps:508/1411 - (36%)


- Green bases have known domain annotations that are detailed below.


  Fly   499 LIFAHDRAIMRMLPHG-SEPKILANATAAAGVTFHYARNTLYWSDIKTRKVQSLPLDAQN-KAVS 561
            |:|:..::|.|:.|.| :..:::.:|..:|.:..||.:..|||.|::.:.:..:.|:... :.|.
Mouse    55 LVFSQGKSISRIDPDGTNHQQLVVDAGISADMDIHYKKERLYWVDVERQVLLRVFLNGTGLEKVC 119

  Fly   562 PFDQTLPGTWAPVALAVDWVGDKIYVADLVGQKIDVFELSGQWHAVVLGSNLTSPADLALDPTAG 626
            ..::.:.|      ||:||:.|::...|.....|.|.:::|: ::.||.|:|..|:::|:||...
Mouse   120 NVERKVSG------LAIDWIDDEVLWVDQQNGVITVTDMTGK-NSRVLLSSLKHPSNIAVDPIER 177

  Fly   627 LMFVAD--GGQVLRAHMDGTHARSIVSEAAYKASGVTVDIISKRVFWCDSLLD----YIESVDYE 685
            |||.:.  .|.:.|||:.|...::::....  .|.:|:|::.||:||.....:    ||.|.|||
Mouse   178 LMFWSSEVTGSLHRAHLKGVDVKTLLETGG--ISVLTLDVLDKRLFWVQDSGEGSHAYIHSCDYE 240

  Fly   686 GAHRVMVLRGQQVPSPSRLALFENRIYWTDATKQGIMSVDKFEGPTSIQVTYKAKDIREPKGIIA 750
            |. .|.::|.|...|.|.:|.|.:||:::....:.|...:|..|..::::......: .|..::.
Mouse   241 GG-SVRLIRHQARHSLSSMAFFGDRIFYSVLKSKAIWIANKHTGKDTVRINLHPSFV-TPGKLMV 303

  Fly   751 VHALSQPRVS------------------------------------------------------N 761
            ||..:|||..                                                      |
Mouse   304 VHPRAQPRTEDAAKDPGESLDPELLKQRGRPCRFGLCERDPKSHSSACAEGYTLSRDRKYCEDVN 368

  Fly   762 PCGNNNGG-----------------------------------------CNHMCIVTA-----VK 780
            .|...|.|                                         |:|.|::|:     :.
Mouse   369 ECATQNHGCTLGCENTPGSYHCTCPTGFVLLPDGKQCHELVSCPGNVSKCSHGCVLTSDGPRCIC 433

  Fly   781 GAPTGLG----------------------------FRCACSTGYQLETDLKLC-----KPVSEFL 812
            .|.:.||                            :.|.|..||.|::|.|.|     :|:   |
Mouse   434 PAGSVLGRDGKTCTGCSSPDNGGCSQICLPLRPGSWECDCFPGYDLQSDRKSCAASGPQPL---L 495

  Fly   813 MYSQQRFIKG--------KVLEPVIEGFSDAIMPVVSRRARFV-GLDFDARDEFIYYSDVLQDVI 868
            :::..:.|:.        |||              :||:...| .||:|..:..||::......|
Mouse   496 LFANSQDIRHMHFDGTDYKVL--------------LSRQMGMVFALDYDPVESKIYFAQTALKWI 546

  Fly   869 YRVHRNGTGREIVLASQNEGVEGLAVDWASKNLYYIDSRKGTLNVLSTRNVTHRRTLLKNLKRPR 933
            .|.:.:|:.||.::....:.:||||:||..:.:|:.||.|..:.........||..:.:.:.|||
Mouse   547 ERANMDGSQRERLITEGVDTLEGLALDWIGRRIYWTDSGKSVVGGSDLSGKHHRIIIQERISRPR 611

  Fly   934 AIVVHPNRGFIFFSEWDRPANITRANTDGSGLLVFKNVTLGWPNGLSIDFKEDRVYWCDALLDHV 998
            .|.|||....:|:::......|..|:..||..::..:..|..|:|::||:..|.:||||.....:
Mouse   612 GIAVHPRARRLFWTDVGMSPRIESASLQGSDRVLIASSNLLEPSGITIDYLTDTLYWCDTKRSVI 676

  Fly   999 QHANLDGTDIKTVNSRLVRHPFSIVIHNDWMYITDWRLDAIIRLHKLTGEQEEMMVREPQTNRLY 1063
            :.|||||:..:.:....|.||||:.:..|.::::||.:.::||::|.||:...         ||.
Mouse   677 EMANLDGSKRRRLIQNDVGHPFSLAVFEDHLWVSDWAIPSVIRVNKRTGQNRV---------RLQ 732

  Fly  1064 GVK------VYSHEVQRIADTQPCHRNNGGCQKICFAVPIGASNGTDGVTTSSPSFGRLQSRCSC 1122
            |..      |..|.:.: ....||...||||:.||     ..|.||              :||.|
Mouse   733 GSMLKPSSLVVVHPLAK-PGADPCLYRNGGCEHIC-----QESLGT--------------ARCLC 777

  Fly  1123 PYGERLADDQVSCIPDPSAEPPVQPCPNSWDFTCNNQRCIPKSWLCDGDDDCLDNSDEEQNCTKP 1187
            ..|                                    ..|:|  ||.                
Mouse   778 REG------------------------------------FVKAW--DGK---------------- 788

  Fly  1188 TCGSNEFQCRSGRCIPQNFRCDQENDCGDNSDEQECGNVTCGTSQFACANGRCIPNMWKCDSEND 1252
                        .|:||::....    |:|:|..:  .||            .:.|..:.:..:|
Mouse   789 ------------MCLPQDYPILS----GENADLSK--EVT------------SLSNSTQAEVPDD 823

  Fly  1253 CGDSSDEGDFCAEKTCAYFQFTCPRTGHCIPQSWVCDGDDDCFDKQDEKDCPPISCLANQFKCAD 1317
              |.::.....||...:...:                 :|||                       
Mouse   824 --DGTESSTLVAEIMVSGMNY-----------------EDDC----------------------- 846

  Fly  1318 LRQCVEESYKCDGIPDCNDGSDEVGCPSMGPNQCNLEKHFRCKSTGFCIPIAWHCDGSNDCSDHS 1382
                                         ||..|.  .|.||.|.|                   
Mouse   847 -----------------------------GPGGCG--SHARCVSDG------------------- 861

  Fly  1383 DEQDCGQITCAQNFFKCNNTNCVFKAYICDGKDDCG-DNSDEGAEHACVPPPFKCPHGQWQCPGV 1446
               :..:..|.:.|.:..|        :|...|:|. ..||                    ||..
Mouse   862 ---ETAECQCLKGFARDGN--------LCSDIDECVLARSD--------------------CPST 895

  Fly  1447 SERCVNITS--VCDDTPDCPNGSDEGEG--C-DLAECEHQAGQC--SSFCQKTPNGALCVC---- 1500
            |.||:|...  ||    .|..|. ||:|  | |:.||:..|..|  ::.|..|..|..|.|    
Mouse   896 SSRCINTEGGYVC----RCSEGY-EGDGISCFDIDECQRGAHNCGENAACTNTEGGYNCTCAGRP 955

  Fly  1501 -PPG---------SEIGEDGYTCIDSNE-----------CDPPGLCSQQCTNTKGSYFCSCTDGY 1544
             .||         |.:||||:. :|.|.           |...|:|..  ..:..||.|:|..||
Mouse   956 SSPGLSCPDSTAPSLLGEDGHH-LDRNSYPGCPSSYDGYCLNGGVCMH--IESLDSYTCNCVIGY 1017

  Fly  1545 VLEPNKHTCKA-------VNHTAAFLIISNRHSILVADLKEQGLERVPIIVENVVATASNMHTGT 1602
                :...|:.       :.|..    ...:|.|:|..:         .:|..|:.....| .||
Mouse  1018 ----SGDRCQTRDLRWWELRHAG----YGQKHDIMVVAV---------CMVALVLLLVLGM-WGT 1064

  Fly  1603 IFWSDMKLKKISRLDRGM--EPQ-EIINTGLDLVEGLA-----YDWIAQNLYWLDSKLNTIEVSA 1659
            .::...  |::|...:..  ||. .:.::|.:...|.|     ..|........|.|..::....
Mouse  1065 YYYRTR--KQLSNPPKNPCDEPSGSVSSSGPNSSSGAAVASCPQPWFVVLEKHQDPKNGSLPADG 1127

  Fly  1660 ENGSNRLVLVRENITQPRGMCIDPSPGARWIFWTDWGENPRVERIG 1705
            .||:    :|...::        ||.....:..|.|.:.|.::.:|
Mouse  1128 TNGA----VVDAGLS--------PSLQLGSVHLTSWRQKPHIDGMG 1161

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
mglNP_572563.3 LDLa 157..189 CDD:238060
LDLa 204..236 CDD:238060
LDLa 292..327 CDD:238060
LDLa 332..366 CDD:238060
LDLa 371..412 CDD:238060
vWFA <447..488 CDD:469594
YncE <492..634 CDD:442618 37/138 (27%)
NHL 573..>716 CDD:302697 48/148 (32%)
NHL repeat 573..610 CDD:271320 10/36 (28%)
NHL repeat 616..654 CDD:271320 12/39 (31%)
NHL repeat 657..693 CDD:271320 15/39 (38%)
FXa_inhibition 763..805 CDD:464251 17/115 (15%)
YvrE 854..>992 CDD:442613 39/137 (28%)
LY 922..964 CDD:214531 12/41 (29%)
LY 973..1007 CDD:214531 15/33 (45%)
Ldl_recept_a 1188..1223 CDD:395011 6/34 (18%)
LDLa 1227..1259 CDD:197566 4/31 (13%)
LDLa 1268..1303 CDD:238060 3/34 (9%)
LDLa 1308..1343 CDD:238060 0/34 (0%)
LDLa 1356..1387 CDD:238060 5/30 (17%)
LDLa 1391..1423 CDD:197566 7/32 (22%)
LDLa 1435..1469 CDD:197566 10/35 (29%)
FXa_inhibition 1522..1553 CDD:464251 8/30 (27%)
LY 1625..1665 CDD:214531 8/44 (18%)
LY 1667..1711 CDD:214531 7/39 (18%)
Ldl_recept_b 1690..1729 CDD:459654 4/16 (25%)
LY 1712..1754 CDD:214531
LY 1986..2037 CDD:214531
FXa_inhibition 2151..>2179 CDD:464251
LY 2259..2305 CDD:214531
Ldl_recept_b 2326..2366 CDD:459654
LY 2352..2392 CDD:214531
NHL 2579..>2711 CDD:302697
NHL repeat 2593..2622 CDD:271320
NHL repeat 2634..2678 CDD:271320
LY 2671..2713 CDD:214531
LDLa 2867..2901 CDD:238060
LDLa 2906..2941 CDD:238060
LDLa 2950..2983 CDD:238060
LDLa 3034..3066 CDD:197566
LDLa 3080..3115 CDD:197566
LDLa 3128..3159 CDD:238060
LDLa 3170..3204 CDD:238060
LDLa 3210..3242 CDD:197566
FXa_inhibition 3259..3287 CDD:464251
FXa_inhibition 3293..3329 CDD:464251
LY 3362..3394 CDD:214531
LY 3406..3448 CDD:214531
Ldl_recept_b 3468..3511 CDD:459654
LY 3498..3536 CDD:214531
LY 3536..3578 CDD:214531
FXa_inhibition 3607..3659 CDD:464251
LDLa 3663..3696 CDD:238060
LDLa 3705..3739 CDD:238060
LDLa 3743..3775 CDD:197566
LDLa 3784..3817 CDD:197566
LDLa 3827..3861 CDD:197566
LDLa 3871..3901 CDD:197566
LDLa 3913..3947 CDD:238060
LDLa 3952..3986 CDD:238060
LDLa 3996..4027 CDD:197566
LDLa 4043..4071 CDD:238060
LDLa 4086..4117 CDD:238060
EGF_CA 4163..4194 CDD:214542
NHL 4276..>4426 CDD:302697
NHL repeat 4291..4336 CDD:271320
Ldl_recept_b 4350..4391 CDD:459654
LY 4374..4417 CDD:214531
NHL repeat 4376..4422 CDD:271320
EgfXP_011238312.1 LY <86..114 CDD:214531 7/27 (26%)
LY <125..156 CDD:214531 10/37 (27%)
LY 158..199 CDD:214531 16/40 (40%)
FXa_inhibition 370..405 CDD:464251 3/34 (9%)
FXa_inhibition 418..446 CDD:464251 7/27 (26%)
FXa_inhibition 449..486 CDD:464251 7/36 (19%)
LY 514..556 CDD:214531 14/55 (25%)
LY 557..599 CDD:214531 11/41 (27%)
LY 600..643 CDD:214531 13/42 (31%)
LY 643..686 CDD:214531 15/42 (36%)
LY 687..728 CDD:214531 13/40 (33%)
FXa_inhibition 755..790 CDD:464251 18/119 (15%)
EGF_CA 881..913 CDD:429571 13/55 (24%)
EGF_CA 923..951 CDD:429571 9/27 (33%)
PHA02887 <986..1023 CDD:165214 9/42 (21%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.