DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment vkg and COL5A1

DIOPT Version :9

Sequence 1:NP_001260071.1 Gene:vkg / 33726 FlyBaseID:FBgn0016075 Length:1940 Species:Drosophila melanogaster
Sequence 2:NP_000084.3 Gene:COL5A1 / 1289 HGNCID:2209 Length:1838 Species:Homo sapiens


Alignment Length:1680 Identity:616/1680 - (36%)
Similarity:725/1680 - (43%) Gaps:440/1680 - (26%)


- Green bases have known domain annotations that are detailed below.


  Fly    49 APGPIGVPGLEGPAG-DIGPPGRAGPLGEKGDVGEYGEQGEKGHRGDIGPKGEMGYPGIMGKSGE 112
            :|..|| ||:  ||. |....|..||.||||..||                     |.|:    |
Human   426 SPSEIG-PGM--PANQDTIYEGIGGPRGEKGQKGE---------------------PAII----E 462

  Fly   113 P-----GTPGPRGIDGCDGRPGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGINSKGTKGNR 172
            |     |.|||.|..|..|.||..||:|..|..|.|||||:||..|..|..              
Human   463 PGMLIEGPPGPEGPAGLPGPPGTMGPTGQVGDPGERGPPGRPGLPGADGLP-------------- 513

  Fly   173 GETGQPGGVGPPGF----------DGDRGSKG-----DTGYAGLTGEKGDPGLPGPKGDTGAVSE 222
                     ||||.          .||.||||     ....|....::....|.||.|       
Human   514 ---------GPPGTMLMLPFRFGGGGDAGSKGPMVSAQESQAQAILQQARLALRGPAG------- 562

  Fly   223 LPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGDEGPQGPTGEQGAVGRNGLPGARGE 287
             |..|.|.||..|.||   ||.||            |..||.|||||.|.||..|..|.||.||.
Human   563 -PMGLTGRPGPVGPPG---SGGLK------------GEPGDVGPQGPRGVQGPPGPAGKPGRRGR 611

  Fly   288 IGGPGERGKPGKDGEPGRFGDKGMKGAPGWTGADGLDGSPGERGEDGFTGMPGVQGGAGPPGIYD 352
            .|..|.||.||:                        .|..|:||.||..|:||.:|..|.||...
Human   612 AGSDGARGMPGQ------------------------TGPKGDRGFDGLAGLPGEKGHRGDPGPSG 652

  Fly   353 PSLTKSLPGPIGSQGDIGPPGEQGPPGLPGKPGRR---GPIGLAGQSGDPGLNGSRGPPGRSERG 414
            |      |||.|..|:.|..||.||.||||:||.|   ||.|..|..|.||:.|..|.||  .:|
Human   653 P------PGPPGDDGERGDDGEVGPRGLPGEPGPRGLLGPKGPPGPPGPPGVTGMDGQPG--PKG 709

  Fly   415 EAGDYGFIGPPGPQGPPGEAGLPGRYGLHGEPGQNVVGPKGEPGLNGQPGLEGYRGDRGEVGLPG 479
            ..|..|..||||.||.||..||||..|..|.||:.  ||.|:|||.|.||.:|..|..|:.|.||
Human   710 NVGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGEK--GPLGKPGLPGMPGADGPPGHPGKEGPPG 772

  Fly   480 DKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNGLRGLPGEKGLRGDDCPVCNAGPRGPRGQEGD 544
            :||  |:     ||||.|||.|:.|..|..|.:|:|||.|.||.:|:|         |..|.:||
Human   773 EKG--GQ-----GPPGPQGPIGYPGPRGVKGADGIRGLKGTKGEKGED---------GFPGFKGD 821

  Fly   545 TGYPGSHGNRGAIGLTGPRGVQGLQGNPGRAGHKGLPGPAGIPGEPGKVGAAGPDGKAIEVGSLR 609
            .|.   .|:||.||..||||..|.:|..||.|..|.|||.|.|||.||:|..|..|..   |  |
Human   822 MGI---KGDRGEIGPPGPRGEDGPEGPKGRGGPNGDPGPLGPPGEKGKLGVPGLPGYP---G--R 878

  Fly   610 KGEIGDTGDSGHRGDTGDDGEKGRDGSDGSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPG 674
            :|..|..|..|..|..|:.|.:|..|..|.:|:||.||.||:.|..|..|:.|..|..|.||.  
Human   879 QGPKGSIGFPGFPGANGEKGGRGTPGKPGPRGQRGPTGPRGERGPRGITGKPGPKGNSGGDGP-- 941

  Fly   675 RNATTPKVYLIGEPGYDGIKGERGDDGDTGFKGVKGEPNPGQIYDNTGEPGED---GYTGPKGVK 736
                      .|.||.   :|..|..|.|||.|.||.|.|         ||:|   |:.|.:|..
Human   942 ----------AGPPGE---RGPNGPQGPTGFPGPKGPPGP---------PGKDGLPGHPGQRGET 984

  Fly   737 GAKGEQGAIGLRGEIGDRGPAGEVIPGPVGAKGYPGPTGDYGQQGAPGLPGRDGEPGLDGGIGYK 801
            |.:|:.|..|..|.:|.:||.||.  ||:|.:|:|||.|..|:||.|||.|::|..      |..
Human   985 GFQGKTGPPGPPGVVGPQGPTGET--GPMGERGHPGPPGPPGEQGLPGLAGKEGTK------GDP 1041

  Fly   802 GQRGVPGQEVIQGEIGPPGRSGIKGFPGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGN 866
            |..|:||::      |||   |::|||||.|.||..|..      |:||.:||.           
Human  1042 GPAGLPGKD------GPP---GLRGFPGDRGLPGPVGAL------GLKGNEGPP----------- 1080

  Fly   867 KGQRGDFLVGPPGPKGQPGRNGRQAPHGAKGQKGEVGSLGQNGQNGAKGSIGFSGRRGLLGNAGL 931
                     |||||.|.||..|   |.||                                 ||.
Human  1081 ---------GPPGPAGSPGERG---PAGA---------------------------------AGP 1100

  Fly   932 QGLPGSPGIPGLPGMIGEIGERGEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGANGLPG 996
            .|:||.||..|.||..||.|..||      :|..||.|.:|..||.||.|..||.|.||.:   |
Human  1101 IGIPGRPGPQGPPGPAGEKGAPGE------KGPQGPAGRDGLQGPVGLPGPAGPVGPPGED---G 1156

  Fly   997 RKGETGNPGFPGRPGAKGVAAYSGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDSQPALDGVAG 1061
            .|||.|.||            ..|.|||.||.         |.||..|.:||:|  ||      |
Human  1157 DKGEIGEPG------------QKGSKGDKGEQ---------GPPGPTGPQGPIG--QP------G 1192

  Fly  1062 RKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKGLGGYPGRNGINGLKGATGFPGPQ 1126
            ..|..|.|||.   |:.||.||:||.   |.:|.||.||..||.|.||..|..|..|..|..||.
Human  1193 PSGADGEPGPR---GQQGLFGQKGDE---GPRGFPGPPGPVGLQGLPGPPGEKGETGDVGQMGPP 1251

  Fly  1127 GPKGPQGESGVVGLDGRNGQIGDQGPRGLIGEQGEQGEQGDEGEVGIPGRLENLRDRSFYRGFTG 1191
            ||.||:|.||..|.|      |.|||.|.||..|..||:|:.||.|.||          ..|..|
Human  1252 GPPGPRGPSGAPGAD------GPQGPPGGIGNPGAVGEKGEPGEAGEPG----------LPGEGG 1300

  Fly  1192 DQGLQGERGEQGDMGPIGFIGPPGAKGERGDIGYAGQLGFDGAEGLKGFQGDQGPRGPPGITLPA 1256
            ..|.:|||||:|:.||.|..||||.||..||.|.      .|:.|..||.||.||.|.||   ||
Human  1301 PPGPKGERGEKGESGPSGAAGPPGPKGPPGDDGP------KGSPGPVGFPGDPGPPGEPG---PA 1356

  Fly  1257 EKGDEGVAGLDGRAGRPGHFGQKGAPGPPGENGPNGAIGHRGPQIQGPPGPQGDVGFPGAPGHNG 1321
              |.:|..|..|..|.|   ||.|:|||.||.||:|..|.|||  .||.||:             
Human  1357 --GQDGPPGDKGDDGEP---GQTGSPGPTGEPGPSGPPGKRGP--PGPAGPE------------- 1401

  Fly  1322 RHGLIGPKGELGDMGRQGERGESGYA-IVGRQGDIGDIGFQGEPGWDGAKGEQGYPGLPGKNGRV 1385
                          |||||:|..|.| :.|..|..|.||.||.||..|..|.:|.||..|:.|..
Human  1402 --------------GRQGEKGAKGEAGLEGPPGKTGPIGPQGAPGKPGPDGLRGIPGPVGEQGLP 1452

  Fly  1386 GAPGPRGPTGDAGWGGIDGMDGLVGPKGQPGVTYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLI 1450
            |:|||.||.|..|..|:.|:.|..||||:.|        .||..|..|..|.|||:||.|.||..
Human  1453 GSPGPDGPPGPMGPPGLPGLKGDSGPKGEKG--------HPGLIGLIGPPGEQGEKGDRGLPGPQ 1509

  Fly  1451 GFQGQRGAVGYRGDQGEV---GYTGADGPQGQRGDKGYMGLTGAPGLRGLPGPQGEPAPAPPAPK 1512
            |..|.:|..|..|..|.:   |..|..||.|.:|.||..|.||..|..|.|||.|.|.|.....:
Human  1510 GSSGPKGEQGITGPSGPIGPPGPPGLPGPPGPKGAKGSSGPTGPKGEAGHPGPPGPPGPPGEVIQ 1574

  Fly  1513 SRGFIFARHSQSVHVPQCPANTNLLWEGYSLSGNVAASRAVGQD--LGQSGSCMMRFTTM----- 1570
            ......:|..:::...|      ||.:|   :|......|.|.:  .|...|..:....|     
Human  1575 PLPIQASRTRRNIDASQ------LLDDG---NGENYVDYADGMEEIFGSLNSLKLEIEQMKRPLG 1630

  Fly  1571 ----PYMLCDITNVCHFAQNNDDSLWLSTAEPMPMTMTPIQG--RDLMKYI-------SRCVVCE 1622
                |...|....:|| ....|...|:.          |.||  ||..|..       |.||..:
Human  1631 TQQNPARTCKDLQLCH-PDFPDGEYWVD----------PNQGCSRDSFKVYCNFTAGGSTCVFPD 1684

  Fly  1623 TTT---RIIALHSQSMSIPDCPGGWEEMWTGYSYF-----MSTLDNVG---GVGQ 1666
            ..:   ||.:...::      ||.|      :|.|     :|.:|..|   ||.|
Human  1685 KKSEGARITSWPKEN------PGSW------FSEFKRGKLLSYVDAEGNPVGVVQ 1727

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
vkgNP_001260071.1 Collagen 60..119 CDD:189968 19/64 (30%)
Collagen 102..152 CDD:189968 25/54 (46%)
Collagen 277..334 CDD:189968 16/56 (29%)
Collagen 534..593 CDD:189968 28/58 (48%)
Collagen 814..871 CDD:189968 18/56 (32%)
Collagen 957..1014 CDD:189968 22/56 (39%)
Collagen 990..1049 CDD:189968 19/58 (33%)
Collagen 1070..1128 CDD:189968 26/57 (46%)
C4 1515..1624 CDD:128421 26/128 (20%)
C4 1625..1737 CDD:128421 13/53 (25%)
COL5A1NP_000084.3 TSPN 39..230 CDD:214560
Nonhelical region 231..443 8/19 (42%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 242..269
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 281..457 15/33 (45%)
Interrupted collagenous region 444..558 47/161 (29%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 470..520 26/72 (36%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 526..545 6/18 (33%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 559..1574 521/1309 (40%)
Triple-helical region 559..1570 521/1305 (40%)
Collagen 726..>767 CDD:189968 21/42 (50%)
Collagen 871..930 CDD:189968 24/63 (38%)
Collagen 1474..1519 CDD:189968 22/52 (42%)
Nonhelical region 1571..1605 6/42 (14%)
COLFI 1610..1836 CDD:279718 31/141 (22%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 286 1.000 Domainoid score I1603
eggNOG 1 0.900 - - E33208_3BAFZ
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
RoundUp 00.000 Not matched by this tool.
SonicParanoid 00.000 Not matched by this tool.
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 00.000 Not matched by this tool.
User_Submission 00.000 Not matched by this tool.
32.810

Return to query results.
Submit another query.