DRSC/TRiP Functional Genomics Resources



Protein Alignment Mp and Col5a2

DIOPT Version: 10

Sequence 1: NP_001246651.1  Gene: Mp / 38769  FlyBaseID: FBgn0260660  Length: 1039  Species: Drosophila melanogaster
Sequence 2: NP_031763.2  Gene: Col5a2 / 12832  MGIID: 88458  Length: 1497  Species: Mus musculus


Alignment Length: 730  Identity: 217/730 (29%)
Similarity: 268/730 (36%)  Gaps: 249/730 (34%)
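
The summary percentages follow from dividing each count by the alignment length. A minimal Python sketch of that arithmetic is below; the function names are hypothetical, and whole-number truncation is an assumption that happens to reproduce the displayed 29%, 36% and 34% (the page does not state its rounding rule).

```python
# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 730
COUNTS = {"identity": 217, "similarity": 268, "gaps": 249}


def exact_percent(count: int, total: int) -> float:
    """Return count/total as a percentage, without rounding."""
    return 100.0 * count / total


def truncated_percent(count: int, total: int) -> int:
    """Return the percentage with the fractional part dropped (assumed rule)."""
    return (100 * count) // total


for name, count in COUNTS.items():
    print(
        f"{name}: {count}/{ALIGNMENT_LENGTH} = "
        f"{exact_percent(count, ALIGNMENT_LENGTH):.2f}% "
        f"(truncated: {truncated_percent(count, ALIGNMENT_LENGTH)}%)"
    )
```

Rounding to the nearest integer instead would give 30% for identity, which is the figure shown for the same 217/730 fraction in the Known Domains table.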


- Green residues have known domain annotations, which are detailed below.


  Fly   292 PPGQTQYTHERPYRGIKGE---------KGERGPKGD----SIRGPPGPP------GPPGPKGET 337
            |||..........:||:|:         |||.||||:    .|:||.|||      ||.|..|..
Mouse   446 PPGSPGPQGSTGPQGIRGQSGDPGVPGFKGEAGPKGEPGPHGIQGPIGPPGEEGKRGPRGDPGTV 510

  Fly   338 APYPPFVETTSAGAK-YTGECTCNASDILEAIKDNESLRESL-----RGAPGTPGKDGKPGTPGH 396
            .|..|..|..:.|.: :.|      ||.|...|..:..|..:     :|..|.||:.|:||.||.
Mouse   511 GPPGPMGERGAPGNRGFPG------SDGLPGPKGAQGERGPVGSSGPKGGQGDPGRPGEPGLPGA 569

  Fly   397 TGATGVPGARGARGS---------------EGAQGLKGEPGVDGLP------------------- 427
            .|.||.||.:|..|.               .|:.|::|:||..|||                   
Mouse   570 RGLTGNPGVQGPEGKLGPLGAPGEDGRPGPPGSIGIRGQPGSMGLPGPKGSSGDLGKPGEAGNAG 634

  Fly   428 -----------GVMGPPGPPGPPGLPENYDESLMVNSMGAFRGTTQPGAKGVPGEKGDAGQKGER 481
                       |.:||.||.|||||.....|.......| |:|.  ||..|.|||.|.||.:|..
Mouse   635 VPGQRGAPGKDGEVGPSGPVGPPGLAGERGEQGPPGPTG-FQGL--PGPPGPPGEGGKAGDQGVP 696

  Fly   482 GDPGHKGAHGPSGAKGEPGEPGTPGLPGLPGQVGQPGGLD------------------------- 521
            |:||..|..||.|.:|.|||.|.||:.||||:.|..||..                         
Mouse   697 GEPGAVGPLGPRGERGNPGERGEPGITGLPGEKGMAGGHGPDGPKGNPGPTGTIGDTGPPGLQGM 761

  Fly   522 ----GLASANGTKGEKGEKGEKGMRGRRGGTGA---TGPIGPPGKPGPMGDIGHSGRPGMTGPKG 579
                |:|...|.||::|..||||..|..|..||   .||:||||..||.|:.|..|..|:.||.|
Mouse   762 PGERGIAGTPGPKGDRGGIGEKGAEGTAGNDGARGLPGPLGPPGPAGPTGEKGEPGPRGLVGPPG 826

  Fly   580 EMGPKGPKGDSG--------------GREGLKGDKGDRGQDGRDGLPGPPGLPSTGGGDGDSG-- 628
            ..|..|.:|::|              |:.|:||:.|:.||.|..|.|||.||..:.|..|..|  
Mouse   827 SRGNPGSRGENGPTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGPHGPHGVP 891

  Fly   629 ------GVQYIPMPGPP------------GPPGPPGLPGLSISGPKGEPGVDSRSSFFGDASYYG 675
                  |.|     |||            |||||.|.||  .:||.||||.:......||...:|
Mouse   892 GLKGGRGTQ-----GPPGATGFPGSAGRVGPPGPAGAPG--PAGPAGEPGKEGPPGLRGDPGSHG 949

  Fly   676 RPGARSSLDELKALRELQDLRDRPDGTAEPPRQPGHSHKHEETLGLVDGEEPYFSASSSNMNMKI 740
            |.|.|                    |.|.||..||......|     ||:               
Mouse   950 RVGDR--------------------GPAGPPGSPGDKGDPGE-----DGQ--------------- 974

  Fly   741 VPGAVTFQNIDEMTKKSALNPPGTLAYITEEEALLVRVNKGWQYIALGTLVPIATP--------- 796
             ||..              .|||. |..|.:..::....:..:....|...|..||         
Mouse   975 -PGPD--------------GPPGP-AGTTGQRGIVGMPGQRGERGMPGLPGPAGTPGKVGPTGAT 1023

  Fly   797 ---APPTTVAPSMRFDLQSKNLLNSPPPLLNTPTWYPRMLRVAALNEPSTGDLQGIRGADFACYR 858
               .||..|.                ||..|.|...|      ....|:..|  |..|.|.|...
Mouse  1024 GDKGPPGPVG----------------PPGSNGPVGEP------GPEGPAGND--GTPGRDGAVGE 1064

  Fly   859 QGRR-----AGLLGT 868
            :|.|     |||.|:
Mouse  1065 RGDRGDPGPAGLPGS 1079
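
Identity and gap counts can be recomputed from the aligned rows themselves. The sketch below assumes the two sequence rows have already been extracted as equal-length strings with `-` marking gaps; `alignment_stats` is a hypothetical helper, not part of DIOPT. Similarity is left out because it depends on a substitution matrix that the page does not name. The example input is the final block of the alignment (Fly 859..868, Mouse 1065..1079).

```python
def alignment_stats(seq_a: str, seq_b: str, gap: str = "-") -> dict:
    """Count aligned columns, identical residues, and gapped columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(seq_a, seq_b))
    # A column is identical when both rows carry the same non-gap residue.
    identity = sum(1 for a, b in pairs if a == b and a != gap)
    # A column is gapped when either row carries the gap character.
    gaps = sum(1 for a, b in pairs if a == gap or b == gap)
    return {"length": len(pairs), "identity": identity, "gaps": gaps}


# Final alignment block, copied from the rows above.
fly = "QGRR-----AGLLGT"
mouse = "RGDRGDPGPAGLPGS"

stats = alignment_stats(fly, mouse)
print(stats)  # {'length': 15, 'identity': 6, 'gaps': 5}
```

Summing these per-block counts over every block should, under the same assumptions, give totals comparable to the identity and gap figures in the summary.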

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain                                             Region       External ID Identity
Mp      NP_001246651.1  LamG                                               58..222      CDD:473984
                        gly_rich_SclB                                      <292..>421   CDD:468478  52/168 (31%)
                        gly_rich_SclB                                      <379..>616   CDD:468478  114/327 (35%)
                        Collagen_trimer                                    744..792     CDD:466257  6/47 (13%)
                        Endostatin-like                                    831..999     CDD:238151  12/43 (28%)
Col5a2  NP_031763.2     VWC                                                40..95       CDD:214564
                        Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  103..1265                217/730 (30%)
                        gly_rich_SclB                                      <107..>326   CDD:468478
                        Cell attachment site. /evidence=ECO:0000255        141..143
                        Collagen                                           280..336     CDD:460189
                        gly_rich_SclB                                      <316..601    CDD:468478  49/160 (31%)
                        gly_rich_SclB                                      <478..>746   CDD:468478  93/276 (34%)
                        Cell attachment site. /evidence=ECO:0000255        504..506                 0/1 (0%)
                        gly_rich_SclB                                      <659..>914   CDD:468478  91/262 (35%)
                        gly_rich_SclB                                      <832..>1109  CDD:468478  85/335 (25%)
                        Cell attachment site. /evidence=ECO:0000255        942..944                 0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255        1065..1067               0/1 (0%)
                        Cell attachment site. /evidence=ECO:0000255        1068..1070               1/1 (100%)
                        Collagen                                           1093..1144   CDD:460189
                        Cell attachment site. /evidence=ECO:0000255        1125..1127
                        Cell attachment site. /evidence=ECO:0000255        1134..1136
                        COLFI                                              1263..1496   CDD:460199
Blue background indicates that the domain is not in the aligned region.
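
The Region values in the table use a `start..end` notation, sometimes prefixed with `<` or `>`. The sketch below parses that notation into numbers. Reading `<` and `>` as "the domain match is partial at that end" is an assumption borrowed from GenBank-style location syntax, since the page does not define the markers; `Region` and `parse_region` are hypothetical names.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    start: int
    end: int
    start_partial: bool  # True when the start carries a '<' marker (assumed meaning)
    end_partial: bool    # True when the end carries a '>' marker (assumed meaning)


# Matches strings such as "58..222", "<292..>421", or "<316..601".
_REGION_PATTERN = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")


def parse_region(text: str) -> Region:
    """Parse a Region cell from the domain table into a Region tuple."""
    match = _REGION_PATTERN.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized region: {text!r}")
    start_marker, start, end_marker, end = match.groups()
    return Region(int(start), int(end), start_marker == "<", end_marker == ">")


# Region strings copied from the table above.
for cell in ["58..222", "<292..>421", "<316..601"]:
    print(cell, "->", parse_region(cell))
```

Keeping the markers as separate boolean fields lets later code compare coordinates numerically without losing the partial-match information.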
