DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Mp and COL21A1

DIOPT Version :10

Sequence 1:NP_001246651.1 Gene:Mp / 38769 FlyBaseID:FBgn0260660 Length:1039 Species:Drosophila melanogaster
Sequence 2:NP_110447.2 Gene:COL21A1 / 81578 HGNCID:17025 Length:957 Species:Homo sapiens


Alignment Length:848 Identity:235/848 - (27%)
Similarity:310/848 - (36%) Gaps:281/848 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly    14 ICTLLVPVLGSFE-----LVGQSIKDALAE-----------YTLTDIMNNNQFAGIEFGEAEDGF 62
            :|...:||....|     |:|..:...:.:           |.:|..::.::.....|.|...  
Human   216 VCPTRIPVAARDERGFDILLGLDVNKKVKKRIQLSPKKIKGYEVTSKVDLSELTSNVFPEGLP-- 278

  Fly    63 PAFRFLQTADVKSPYRMLLPEKLYEFAILITFRQSSLKGGYLFSVVNPLDTVVQLGVHLSPVVKN 127
            |::.|:.|...|       .:|:::...::|                 :|...|:.|.|:.|.| 
Human   279 PSYVFVSTQRFK-------VKKIWDLWRILT-----------------IDGRPQIAVTLNGVDK- 318

  Fly   128 SYNVSLVYTQADQNIGRKLASFGVAHV----PDKWNSIALQVLSDKVSFYYDCELRNTTLVTREP 188
                .|::|......|.::.:|....|    .:.|:.|.|.|....|:.|.|.:     .:..:|
Human   319 ----ILLFTTTSVINGSQVVTFANPQVKTLFDEGWHQIRLLVTEQDVTLYIDDQ-----QIENKP 374

  Fly   189 IELVFDSASTLYIGQAGSIIGGKFEGY-------LEKINVYGNPD-----------AINVTCM-- 233
            :..|.   ..|..||...   ||:.|.       ::|:.:|.:|:           ..|..|:  
Human   375 LHPVL---GILINGQTQI---GKYSGKEETVQFDVQKLRIYCDPEQNNRETACEIPGFNGECLNG 433

  Fly   234 -------------PPPKATIAPTTAD---------------DGSIFYEGSGENILFEDSTEANIL 270
                         ||.|..:.....|               ||...|:|                
Human   434 PSDVGSTPAPCICPPGKPGLQGPKGDPGLPGNPGYPGQPGQDGKPGYQG---------------- 482

  Fly   271 SDDFWNTGDEATDIFDASGMQPPGQTQYTHERPYRGIKGEKGERGPKGDSIRGPPGPP---GPPG 332
                         |....|:  ||.......|...|.|||.|..|.|||  ||.||.|   |.||
Human   483 -------------IAGTPGV--PGSPGIQGARGLPGYKGEPGRDGDKGD--RGLPGFPGLHGMPG 530

  Fly   333 PKGETAP-----YPPFVETTSAGAKYTGECTCNASDILEAIKDNESLRESLRGAPGTPGKDGKPG 392
            .|||...     .|.|.  ...|||  ||            |.|       .|.||.||..|:||
Human   531 SKGEMGAKGDKGSPGFY--GKKGAK--GE------------KGN-------AGFPGLPGPAGEPG 572

  Fly   393 TPGHTGATGVPGARGARGSEGA---QGLKGEPGVDGLP---GVMGPPGPPGPPGLPENYDESLMV 451
            ..|..|..|.||.:|..||.||   .|.:||||:.|.|   |:||..|..||||.........|.
Human   573 RHGKDGLMGSPGFKGEAGSPGAPGQDGTRGEPGIPGFPGNRGLMGQKGEIGPPGQQGKKGAPGMP 637

  Fly   452 NSMGAFRGTTQPGAKGVPGEKGDAGQKGERGDPGHKGAHGPSGAKGEPGEPGTPGLPGLPGQVGQ 516
            ..||:   ...||..|.||.||..|:.|.:|.||..|..|..||.|.|||||..||||:.|:.|.
Human   638 GLMGS---NGSPGQPGTPGSKGSKGEPGIQGMPGASGLKGEPGATGSPGEPGYMGLPGIQGKKGD 699

  Fly   517 PG--------------------GLDGLASANGTKGEKGEKGEKGMRGRRGGTGAT------GPIG 555
            .|                    |..|:...:|.|||:|||||.|:||..|..|.:      ||.|
Human   700 KGNQGEKGIQGQKGENGRQGIPGQQGIQGHHGAKGERGEKGEPGVRGAIGSKGESGVDGLMGPAG 764

  Fly   556 P---PGKPGPMGDIG------------------------------HSGR-------------PGM 574
            |   ||.|||.|..|                              .|||             ||:
Human   765 PKGQPGDPGPQGPPGLDGKPGREFSEQFIRQVCTDVIRAQLPVLLQSGRIRNCDHCLSQHGSPGI 829

  Fly   575 TGPKGEMGPKGPKGDSG--GREGLKGDKGDRGQDGRDGLPGPPGLPSTGGGDGDSGGVQYIPMPG 637
            .||.|.:||:||:|..|  ||:|:.|..|..|:.|..||.|.||    ..|:..|.|..|....|
Human   830 PGPPGPIGPEGPRGLPGLPGRDGVPGLVGVPGRPGVRGLKGLPG----RNGEKGSQGFGYPGEQG 890

  Fly   638 PPGPPGPPGLPGLSISGPKGEPGVDSRSSFFGDASYYGRPGARSSLDELKALRELQDLRDRPDGT 702
            |||||||.|.||:|..||.|:||:..:....|.....|:||                    |.|.
Human   891 PPGPPGPEGPPGISKEGPPGDPGLPGKDGDHGKPGIQGQPG--------------------PPGI 935

  Fly   703 AEP 705
            .:|
Human   936 CDP 938

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
MpNP_001246651.1 LamG 58..222 CDD:473984 33/174 (19%)
gly_rich_SclB <292..>421 CDD:468478 52/139 (37%)
gly_rich_SclB <379..>616 CDD:468478 116/316 (37%)
Collagen_trimer 744..792 CDD:466257
Endostatin-like 831..999 CDD:238151
COL21A1NP_110447.2 vWFA 34..254 CDD:469594 6/37 (16%)
LamG 230..412 CDD:473984 40/223 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 448..786 133/396 (34%)
Collagen 452..508 CDD:460189 13/86 (15%)
gly_rich_SclB <511..>746 CDD:468478 101/262 (39%)
gly_rich_SclB <692..>934 CDD:468478 83/265 (31%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 825..938 50/136 (37%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.