DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Mp and COL6A1

DIOPT Version :10

Sequence 1:NP_001246651.1 Gene:Mp / 38769 FlyBaseID:FBgn0260660 Length:1039 Species:Drosophila melanogaster
Sequence 2:NP_001839.2 Gene:COL6A1 / 1291 HGNCID:2211 Length:1028 Species:Homo sapiens


Alignment Length:840 Identity:240/840 - (28%)
Similarity:317/840 - (37%) Gaps:254/840 - (30%)


- Green bases have known domain annotations that are detailed below.


  Fly   104 LFSVVNPLDTVVQLGVHLSPV------VKNSYNVSLV------YTQADQNIGRKLASFGVAHVPD 156
            ||.|   |||...:.:.|.|.      || |:....:      |.:.|:|:   :.:.|..|..|
Human    38 LFFV---LDTSESVALRLKPYGALVDKVK-SFTKRFIDNLRDRYYRCDRNL---VWNAGALHYSD 95

  Fly   157 KWNSI-----------ALQVLSDKVSF-----YYDCELRNTTLVTREPIELVFDSASTLYIGQAG 205
            :...|           ||:...|.|.:     |.||.:       ::.:|.:....|.|...:..
Human    96 EVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAI-------KKGLEQLLVGGSHLKENKYL 153

  Fly   206 SII--GGKFEGYLEKINVYGNPDAIN-----------VTCMP---PPKATIAPT--------TAD 246
            .::  |...|||.|...  |..||:|           |...|   .|:.:|..|        ||.
Human   154 IVVTDGHPLEGYKEPCG--GLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAA 216

  Fly   247 DGSIFYEGSGENILFEDSTEA-----NILSDDFWNTGDEATDIFDASGMQ-PPGQTQYTHERPYR 305
            |.       |::   .|:.||     :.:.|...|..::....|:....: |||         .|
Human   217 DW-------GQS---RDAEEAISQTIDTIVDMIKNNVEQVCCSFECQPARGPPG---------LR 262

  Fly   306 GIKGEKGERGPKGDSIRGPPGPPGPPGPKGETAPYPPFVETTSAGAKYTGECTCNASDILEAIKD 370
            |..|.:||||..|  :.|..|..|.||..|:..|.                              
Human   263 GDPGFEGERGKPG--LPGEKGEAGDPGRPGDLGPV------------------------------ 295

  Fly   371 NESLRESLRGAPGTPGKDGKPGTPGHTGATGVPGARGARGSEGAQGLKGEPGVDGLPGVMGPP-- 433
                     |..|..|:.|..|..|..|..|..|.:|.||.:|..|:|||.|..||||..|.|  
Human   296 ---------GYQGMKGEKGSRGEKGSRGPKGYKGEKGKRGIDGVDGVKGEMGYPGLPGCKGSPGF 351

  Fly   434 ----GPPGPPGLPENYDESLMVNSMGAFRGTTQPGAKGVPGEKGDAGQKGERGDPGHKGAHGPSG 494
                |||||.|                     .|||.|:.||||:.|..||.|.|   |:.||||
Human   352 DGIQGPPGPKG---------------------DPGAFGLKGEKGEPGADGEAGRP---GSSGPSG 392

  Fly   495 AKGEPGEPGTPGLPGLPGQVGQPGGLDGLASANGTKGEKGEKGEKGMRGRRGGTGATGPIGPPGK 559
            .:|:|||||.||..|..|..|.||       .:|..||:|..||   ||.||..|..||.|.||:
Human   393 DEGQPGEPGPPGEKGEAGDEGNPG-------PDGAPGERGGPGE---RGPRGTPGTRGPRGDPGE 447

  Fly   560 PGPMGDIGHSGRPGMTGPKGEMGPKGPKGDSGGREGLKGDKGDRGQDGRDGLPGPPGLPSTGGGD 624
            .||.||.|..|..|:.|..||.||.||||..|. ||..|.:|.||..|..|.||.|||....|.|
Human   448 AGPQGDQGREGPVGVPGDPGEAGPIGPKGYRGD-EGPPGSEGARGAPGPAGPPGDPGLMGERGED 511

  Fly   625 GDSG-GVQYIPMPGPPGPPGPPGLPGLSISGPKGEPGVDSRSSFFGDASYYGRPGARSSLDELKA 688
            |.:| |.:  ..||.||.||..|.||  |:|.||.||:.      ||....|.||..::....:.
Human   512 GPAGNGTE--GFPGFPGYPGNRGAPG--INGTKGYPGLK------GDEGEAGDPGDDNNDIAPRG 566

  Fly   689 LRELQDLRDRPDGTAEPPRQPGHSHKHE------------------------------ETLGLVD 723
            ::..:..|. |:|...||...|.....|                              |::||.:
Human   567 VKGAKGYRG-PEGPQGPPGHQGPPGPDECEILDIIMKMCSCCECKCGPIDLLFVLDSSESIGLQN 630

  Fly   724 GE------EPYFSASSSNMNMKIVPG----AVTFQNIDEMTKKSALNPPGTLAYITEEEALLVRV 778
            .|      .......|.:..:|..||    .|...:..:|.:..:|..|........:||:    
Human   631 FEIAKDFVVKVIDRLSRDELVKFEPGQSYAGVVQYSHSQMQEHVSLRSPSIRNVQELKEAI---- 691

  Fly   779 NKGWQYIALGT------------LVPIATPAPPTTVA---PSMRFDLQSK----NLLNSP 819
             |..|::|.||            |:|   |:|...:|   ...|.|.|..    |:|.||
Human   692 -KSLQWMAGGTFTGEALQYTRDQLLP---PSPNNRIALVITDGRSDTQRDTTPLNVLCSP 747

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
MpNP_001246651.1 LamG 58..222 CDD:473984 33/147 (22%)
gly_rich_SclB <292..>421 CDD:468478 34/128 (27%)
gly_rich_SclB <379..>616 CDD:468478 104/242 (43%)
Collagen_trimer 744..792 CDD:466257 12/59 (20%)
Endostatin-like 831..999 CDD:238151
COL6A1NP_001839.2 N-terminal globular domain 20..256 52/243 (21%)
vWA_collagen_alpha_1-VI-type 34..227 CDD:238757 47/214 (22%)
Collagen <253..289 CDD:460189 16/46 (35%)
triple-helical domain 257..592 156/430 (36%)
gly_rich_SclB <275..>555 CDD:468478 138/365 (38%)
C-terminal globular domain 593..1028 32/163 (20%)
vWA_collagen_alpha_1-VI-type 612..807 CDD:238757 31/144 (22%)
vWA_collagen_alpha_1-VI-type 826..1011 CDD:238757
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.