DRSC/TRiP Functional Genomics Resources


Protein Alignment Mp and Col18a1

DIOPT Version: 9

Sequence 1: NP_001246651.1  Gene: Mp / 38769  FlyBaseID: FBgn0260660  Length: 1039  Species: Drosophila melanogaster
Sequence 2: NP_445941.2  Gene: Col18a1 / 85251  RGDID: 70936  Length: 1311  Species: Rattus norvegicus


Alignment Length: 1359  Identity: 387/1359 (28%)
Similarity: 523/1359 (38%)  Gaps: 473/1359 (34%)
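
The percentages in the alignment summary can be reproduced from the raw counts. The following is a minimal sketch, assuming the page truncates each percentage to a whole number rather than rounding it; that assumption is inferred only from the Gaps value (473/1359 is about 34.8%, yet 34% is shown) and is not documented on the page. The names `COUNTS` and `percent_truncated` are illustrative.

```python
# Counts copied from the alignment summary.
ALIGNMENT_LENGTH = 1359
COUNTS = {"Identity": 387, "Similarity": 523, "Gaps": 473}


def percent_truncated(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed display convention)."""
    return (100 * count) // total


percentages = {
    name: percent_truncated(count, ALIGNMENT_LENGTH)
    for name, count in COUNTS.items()
}

for name, pct in percentages.items():
    print(f"{name}: {COUNTS[name]}/{ALIGNMENT_LENGTH} ({pct}%)")
```

With rounding instead of truncation, the Gaps figure would come out as 35%, so the two conventions can be told apart on this alignment.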


- Green residues have known domain annotations, which are detailed below.


  Fly    22 LGSFELVGQSIKDALAEYTLTDIMNNNQFAGIE--FGEAEDGFPAFRFLQTADVKSPYRMLLPEK 84
            :|..:|:|..:...:::.       .:...|:.  ||      |..:..|.|....|       |
  Rat    34 VGLLQLLGDPLPQKISQV-------EDPHVGLAYVFG------PYSKSSQMAQYHFP-------K 78

  Fly    85 LY--EFAILITFRQSSLKGGYLFSVVNPLDTVVQLGVHLSPVVKNSYNVSLVYTQADQNIGRKLA 147
            |:  :|::|...|.::...|.||::.:....||.|||.||.|.....|:||:||:...:..:..|
  Rat    79 LFFRDFSLLFEVRPTTEAAGVLFAITDAAQVVVSLGVKLSEVRDGQQNISLLYTEPGASQTQTGA 143

  Fly   148 SFGVAHVPDKWNSIALQVLSDKVSFYYDCELRNTTLVTREPIELVFDSASTLYIGQAGSIIGGKF 212
            ||.:.....:|...||.|....|:.|.|||........|.|..|..:..:.|::||||:....||
  Rat   144 SFRLPAFVGQWTHFALSVDGSSVALYVDCEEFQRVPFARSPHGLELERGAGLFVGQAGAADPDKF 208

  Fly   213 EGYLEKINVYGNPDAINVTC-----------------------------------MPPPKATIAP 242
            :|.:.::.|...|....|.|                                   :|.|....:|
  Rat   209 QGMISELRVRKTPRVSPVHCLDEEDDDDDRASGDFGSGLEESSNLHRQETYLRPGLPQPPPVTSP 273

  Fly   243 TTADDGSIFYEGSGENILFEDSTEANILSDDFWNTGDEATDIFDASG-----MQPPGQTQYTHER 302
            ..| .||...:...|    |...||.:.|     .|.:...:.|:||     :|.||.....   
  Rat   274 PLA-GGSATEDSRTE----EKEEEATVDS-----KGADTLPVTDSSGVWDGDVQNPGGGLIK--- 325

  Fly   303 PYRGIKGEKGE------------RGPKGDSI-----------RGPPGPPGPP------------- 331
              .|:||:|||            :||.|.::           :|||||.|||             
  Rat   326 --GGLKGQKGEPGAQGPPGPAGPQGPAGPAVQSPSSQPVPGAQGPPGPQGPPGKDGIPGRDGEPG 388

  Fly   332 -----------------------GPKGETA------------PYPP----------FVETTSAGA 351
                                   |||||..            |.||          |::...:|.
  Rat   389 DPGEDGRPGDTGPQGFPGTPGDVGPKGEKGDPGIGPRGPPGPPGPPGPSFRQDKLTFIDMEGSGG 453

  Fly   352 KYTGECTCNASDILEAIKD-------------------------NESLRESLRGAPGTPGKDGKP 391
             ::|:        ||:::.                         |.|......|.||.|||:|.|
  Rat   454 -FSGD--------LESLRGPRGFPGPPGPPGVPGLPGEPGRFGVNSSYAPGPAGLPGVPGKEGPP 509

  Fly   392 GTP------------------GHTGATGVPGARGARGSEGAQGLKGEPGVDGLPGVMGPPGPPGP 438
            |.|                  |..|..|.||.:|::|..|..|:.|:.|:.||||.:||||||||
  Rat   510 GFPGPPGPPGKEGPPGVAGQKGSVGDAGSPGPKGSKGDLGPIGMPGKSGLPGLPGPVGPPGPPGP 574

  Fly   439 PGLP-----ENYDES--------LMVNSMGAFRG----TTQPGAK-------------------- 466
            ||.|     ..:|:.        ....|....:|    |..||||                    
  Rat   575 PGPPGPGFAAGFDDMEGSGTPLWSTARSSDGLQGDPGVTGPPGAKGEVGADGVQGIPGLPGREGV 639

  Fly   467 -GVPGEKGDAGQKGERGD----------------------------------------PGHKGAH 490
             |.||.||:.|.:||:|:                                        ||:.|..
  Rat   640 AGPPGPKGEKGTQGEKGNPGKDGVGRPGLPGPPGPPGPVIYVSNEDRAVVSTPGPEGKPGYAGFP 704

  Fly   491 GPSGAKGEPGEPGTPGLPGLPGQVGQPGGL---DGLASANGTKGEKGE---------------KG 537
            ||:|.||:.|..|..||||..|:.|:||.:   ||.|.....||.|||               ||
  Rat   705 GPAGPKGDLGSKGEQGLPGPKGEKGEPGSIFSPDGTALGQAQKGAKGEPGFRGPPGPYGRPGYKG 769

  Fly   538 EKGMRGRRG--GT--------------------GATGPIGPPGKPGPMG-------DIGHSGRPG 573
            |.|..||.|  ||                    |..||.||||.|||.|       ....|||||
  Rat   770 EIGFPGRPGRPGTNGLKGEKGEPGEASLGFSMRGLPGPPGPPGPPGPPGVPVYDSNAFVESGRPG 834

  Fly   574 MTGPKGEMGPKGPKGDSG-----GREG------------LKGDKGDRGQDGRDGLPGPPGLPSTG 621
            :.|.:|..||.|||||.|     |..|            :||||||||..||.|..|.||.|  |
  Rat   835 LPGQQGVQGPPGPKGDKGEVGPPGPPGQFPIDLFHLEAEMKGDKGDRGDAGRKGERGEPGAP--G 897

  Fly   622 GGDGDSGGVQYIPMPGPPGPPGPPGLP---GLSISGPKGEPGVDSRSSFFGDASYYGRPGARSSL 683
            ||...|.      :||||||||.||:|   |.||.||.|.||.......    .|.||.|     
  Rat   898 GGFFSSS------VPGPPGPPGYPGIPGPKGESIRGPPGPPGPQGPPGI----GYEGRQG----- 947

  Fly   684 DELKALRELQDLRDRPDGTAEPPRQPGHSHKHEETLGL--------VDGEEPYFSASSSNMNMKI 740
                           |.|...||..|.....|.:|:.:        ..|......||:..:.:  
  Rat   948 ---------------PPGPPGPPGPPSFPGPHRQTVSVPGPPGPPGPPGPPGAMGASAGQVRI-- 995

  Fly   741 VPGAVTFQNIDEMTKKSALNPPGTLAYITEEEALLVRVNKGWQYIAL--------GTLVPIATPA 797
               ..|:|.   |..|....|.|.|.::.|.|.|.|||..|::.:.|        ||...:|...
  Rat   996 ---WATYQT---MLDKIREVPEGWLIFVAEREELYVRVRNGFRKVLLEARTALPHGTDNEVAALQ 1054

  Fly   798 PP-----------------TTVAPSMRFDLQSKNLLNSP-------------------------- 819
            ||                 .|..|....|:    |.|.|                          
  Rat  1055 PPLVQLHEGSSYTRREHSYPTARPWRADDI----LANPPRLPDRQPYPGVPHHHHHHHHHHSSHE 1115

  Fly   820 --PPLLNTPT-------WYPRMLRVAALNEPSTGDLQGIRGADFACYRQGRRAGLLGTFKAFLSS 875
              ||...:|:       ::| :|.:.|||.|.:|.::|||||||.|::|.|..||.|||:|||||
  Rat  1116 HRPPAHPSPSPAHTHQDFHP-VLHLVALNTPLSGGMRGIRGADFQCFQQARAVGLSGTFRAFLSS 1179

  Fly   876 RVQNLDTIVRPADR-DLPVVNTRGDVLFNSWKGIFNGQGGFFSQAPRIYSFSGKNVMTDSTWPMK 939
            |:|:|.:|||.||| .:|:||.:.:||..||..:|:|..|......||:||.|::|:....||.|
  Rat  1180 RLQDLYSIVRRADRSSVPIVNLKDEVLSPSWDTLFSGSQGQLHSGARIFSFDGRDVLRHPAWPQK 1244

  Fly   940 MVWHGSLPNGERSMDTYCDAWHS-GDHLKGSFASNLDGHKLLEQKRQSCDSKLIILCVE 997
            .|||||.|:|.|.|::||:.|.: ...:.|..:|.|.| :|||||.:||.:..|:||:|
  Rat  1245 SVWHGSDPSGRRLMESYCETWRTEATGVTGQASSLLSG-RLLEQKAESCHNSYIVLCIE 1302

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence        Domain           Region      External ID  Identity
Mp       NP_001246651.1  LamG             58..222     CDD:304605   48/165 (29%)
                         Collagen         378..432    CDD:189968   25/71 (35%)
                         Collagen         464..518    CDD:189968   28/114 (25%)
                         Collagen         494..615    CDD:189968   70/184 (38%)
                         Endostatin-like  831..999    CDD:238151   81/169 (48%)
Col18a1  NP_445941.2     TSPN             33..221     CDD:214560   55/206 (27%)
                         Collagen         <381..416   CDD:189968   3/34 (9%)
                         Collagen         605..659    CDD:189968   15/53 (28%)
                         Collagen         <692..733   CDD:189968   17/40 (43%)
                         Endostatin-like  1135..1304  CDD:238151   82/170 (48%)
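
The per-domain identity percentages can be checked against their fractions. The sketch below assumes the domain table rounds halves up to the nearest whole percent; this is inferred from the listed values (for example 17/40 = 42.5% shown as 43%) and is not stated on the page. `DOMAINS` and `percent_half_up` are illustrative names.

```python
# (gene, domain, identical residues, aligned length, percentage shown)
DOMAINS = [
    ("Mp", "LamG", 48, 165, 29),
    ("Mp", "Collagen", 25, 71, 35),
    ("Mp", "Collagen", 28, 114, 25),
    ("Mp", "Collagen", 70, 184, 38),
    ("Mp", "Endostatin-like", 81, 169, 48),
    ("Col18a1", "TSPN", 55, 206, 27),
    ("Col18a1", "Collagen", 3, 34, 9),
    ("Col18a1", "Collagen", 15, 53, 28),
    ("Col18a1", "Collagen", 17, 40, 43),
    ("Col18a1", "Endostatin-like", 82, 170, 48),
]


def percent_half_up(count: int, total: int) -> int:
    """Whole-number percentage with halves rounded up, using integer math."""
    return (200 * count + total) // (2 * total)


recomputed = [percent_half_up(n, d) for _, _, n, d, _ in DOMAINS]
shown = [pct for *_, pct in DOMAINS]

for (gene, domain, n, d, pct), value in zip(DOMAINS, recomputed):
    status = "ok" if value == pct else "differs"
    print(f"{gene:8} {domain:16} {n}/{d} -> {value}% (shown {pct}%) {status}")
```

Integer arithmetic is used here because Python's built-in `round` rounds halves to the nearest even number, which would turn 42.5 into 42 rather than 43.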


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              BLAST Result  Score  Score Type        Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           183           1.000  Domainoid score   I3340
eggNOG          1             0.900           -             -                        E1_KOG3546
Hieranoid       1             1.000           -             -
Homologene      0             0.000           Not matched by this tool.
Inparanoid      1             1.050           485           1.000  Inparanoid score  I1389
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           -             -                        D1362001at2759
OrthoFinder     1             1.000           -             -                        FOG0002295
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        1             0.900           -             -                        OOG6_106129
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           -             -
SonicParanoid   1             1.000           -             -                        X1979
SwiftOrtho      1             1.000           -             -
TreeFam         0             0.000           Not matched by this tool.
Total           10            9.770
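
The last row of the table reads as the column totals: a simple score of 10 and a weighted score of 9.770. A minimal sketch that recomputes both from the per-tool rows is shown here; the `SCORES` list is transcribed from the table, and the use of `Decimal` is a choice made for this example to keep the three-decimal values exact.

```python
from decimal import Decimal

# (tool, simple score, weighted score) transcribed from the table.
SCORES = [
    ("Compara", 0, "0.000"),
    ("Domainoid", 1, "1.000"),
    ("eggNOG", 1, "0.900"),
    ("Hieranoid", 1, "1.000"),
    ("Homologene", 0, "0.000"),
    ("Inparanoid", 1, "1.050"),
    ("OMA", 0, "0.000"),
    ("OrthoDB", 1, "1.010"),
    ("OrthoFinder", 1, "1.000"),
    ("OrthoInspector", 0, "0.000"),
    ("orthoMCL", 1, "0.900"),
    ("Panther", 0, "0.000"),
    ("Phylome", 1, "0.910"),
    ("SonicParanoid", 1, "1.000"),
    ("SwiftOrtho", 1, "1.000"),
    ("TreeFam", 0, "0.000"),
]

# Number of tools that report the pair as orthologs.
simple_total = sum(simple for _, simple, _ in SCORES)

# Sum of the per-tool weighted scores, kept exact with Decimal.
weighted_total = sum(Decimal(weighted) for _, _, weighted in SCORES)

print(f"Simple score total:   {simple_total}")
print(f"Weighted score total: {weighted_total}")
```

Ten of the sixteen listed tools report a match, which accounts for the simple score of 10.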
