DRSC/TRiP Functional Genomics Resources



Protein Alignment Col4a1 and COL5A2

DIOPT Version: 9

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_000384.2 Gene:COL5A2 / 1290 HGNCID:2210 Length:1499 Species:Homo sapiens


Alignment Length: 1791
Identity: 621/1791 (34%)
Similarity: 751/1791 (41%)
Gaps: 533/1791 (29%)
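The percentages in the summary can be reproduced from the raw counts. The sketch below assumes the page truncates each percentage to a whole number rather than rounding it; this is inferred only from the figures shown (rounding would give 35%, 42% and 30%). The helper name `percent_floor` is illustrative and not part of DIOPT.

```python
def percent_floor(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, truncated (not rounded)."""
    return (100 * count) // total


alignment_length = 1791
# Counts copied from the alignment summary above.
stats = {"Identity": 621, "Similarity": 751, "Gaps": 533}

for name, count in stats.items():
    pct = percent_floor(count, alignment_length)
    print(f"{name}: {count}/{alignment_length} ({pct}%)")
```

Under that truncation assumption, the three values come out as 34%, 41% and 29%, matching the summary.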


- Green bases have known domain annotations that are detailed below.


  Fly     1 MLPFW---KRLLYAAVIAGALVGADAQ-------------------------FWKTA-------G 30
            |:..|   :.||...|:.|..|...||                         .||.|       .
Human     1 MMANWAEARPLLILIVLLGQFVSIKAQEEDEDEGYGEEIACTQNGQMYLNRDIWKPAPCQICVCD 65

  Fly    31 TAGSIQDSVKHYNRNEPKFPIDDSYDIVDSAGVARGDLPPKNC---------------------T 74
            ....:.|.:             :..|::|.|...   .||..|                     .
Human    66 NGAILCDKI-------------ECQDVLDCADPV---TPPGECCPVCSQTPGGGNTNFGRGRKGQ 114

  Fly    75 AGYAGCVPKCIAEKGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKGDPGPYGQRGDKGERGSP 139
            .|..|.||   ...|.||.|||.||         ||.:||.|::|.||.|||   ||.:|..|.|
Human   115 KGEPGLVP---VVTGIRGRPGPAGP---------PGSQGPRGERGPKGRPGP---RGPQGIDGEP 164

  Fly   140 GLHGQAGVPGVQGPAGNPGAPGINGKD--------GCDGQDGIPGLEGLSGMPGPRGYAGQLGSK 196
            |:.||   ||..||.|:|..||.:|..        |.|.:.|:....||  |||..|        
Human   165 GVPGQ---PGAPGPPGHPSHPGPDGLSRPFSAQMAGLDEKSGLGSQVGL--MPGSVG-------- 216

  Fly   197 GEKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGERGDSGPYGAKGPRGEHGLKGEKGA 261
                 |.        |.:|..|.:|..|.|||.|.|||.|:.|..||.|::||.|          
Human   217 -----PV--------GPRGPQGLQGQQGGAGPTGPPGEPGDPGPMGPIGSRGPEG---------- 258

  Fly   262 SCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDMGQKGEPGLVGRKGEPGPEGDTGLDGQ 326
                  .||.|   ||.|||                   |:.|.||.||..|.|           
Human   259 ------PPGKP---GEDGEP-------------------GRNGNPGEVGFAGSP----------- 284

  Fly   327 KGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLGPPG 391
             |.:|.||.            ||..|.||.||..||.|..|..|..|..|.||.||..|.:||.|
Human   285 -GARGFPGA------------PGLPGLKGHRGHKGLEGPKGEVGAPGSKGEAGPTGPMGAMGPLG 336

  Fly   392 PPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYNGQKGGAGLPGRPGNEGPPGKKGE 456
            |    ||.||..|..||:   ||||.:|.:|:.|.|||.|..|..|.:|.||.|           
Human   337 P----RGMPGERGRLGPQ---GAPGQRGAHGMPGKPGPMGPLGIPGSSGFPGNP----------- 383

  Fly   457 KGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPGYGIQGSKGDAGIPGYPGLKGSKGERGFKGN 521
                   |.||..||.|..||.||:||:|:.|.|        |..|.||.||..|:.|..|.||.
Human   384 -------GMKGEAGPTGARGPEGPQGQRGETGPP--------GPVGSPGLPGAIGTDGTPGAKGP 433

  Fly   522 AGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKGDVGGKCSSCRAGPKGDKGTSGL 586
            .|:||.|     |.||:||.         ||:||.:|.               .||:|.:|..|.
Human   434 TGSPGTS-----GPPGSAGP---------PGSPGPQGS---------------TGPQGIRGQPGD 469

  Fly   587 PGIPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEP 651
            ||:||..|..||.||   ||..   ||.|..|||||:|:.|..|.||..|.||           |
Human   470 PGVPGFKGEAGPKGE---PGPH---GIQGPIGPPGEEGKRGPRGDPGTVGPPG-----------P 517

  Fly   652 LKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIP 716
            : |::|.||.      :||.|::||   |||||..|.:|..|.||..|:.|.|||.|..|.||..
Human   518 V-GERGAPGN------RGFPGSDGL---PGPKGAQGERGPVGSSGPKGSQGDPGRPGEPGLPGAR 572

  Fly   717 GQSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGS 781
            |  :.|.||..|.:|..|..|:.|..|..|.|||.          |.:|:||..|:|||.|..|.
Human   573 G--LTGNPGVQGPEGKLGPLGAPGEDGRPGPPGSI----------GIRGQPGSMGLPGPKGSSGD 625

  Fly   782 PGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRG 846
            ||:                           .||.||   .||||.      ||.||::|:.||  
Human   626 PGK---------------------------PGEAGN---AGVPGQ------RGAPGKDGEVGP-- 652

  Fly   847 EPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEKG 911
                  .||:|||   ||.||:|::||.||.||.|..|..|.||:.|..|..||.|.||.     
Human   653 ------SGPVGPP---GLAGERGEQGPPGPTGFQGLPGPPGPPGEGGKPGDQGVPGDPGA----- 703

  Fly   912 DVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGAP 976
             |||:||.|..|.|               ||.|.||:.|:||.||..|..|.|||||..|.:|.|
Human   704 -VGPLGPRGERGNP---------------GERGEPGITGLPGEKGMAGGHGPDGPKGSPGPSGTP 752

  Fly   977 GKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGE 1041
            |..||                                 |||.|:.|::|:||.||.:|..|..||
Human   753 GDTGP---------------------------------PGLQGMPGERGIAGTPGPKGDRGGIGE 784

  Fly  1042 K------GNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGR 1100
            |      ||.|..||.||.|.||.|...|:||||||.||.|..|..|.||..||.          
Human   785 KGAEGTAGNDGARGLPGPLGPPGPAGPTGEKGEPGPRGLVGPPGSRGNPGSRGEN---------- 839

  Fly  1101 AGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGAPG 1165
             ||.|..|..|..|.||:.|:.||.||         ||:||..|:   ||..|:.|.||..|..|
Human   840 -GPTGAVGFAGPQGPDGQPGVKGEPGE---------PGQKGDAGS---PGPQGLAGSPGPHGPNG 891

  Fly  1166 AVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGS 1230
            ..|..|.||.:|.||.:|.||..|..||.|..|..|..||.||.|..|.|||       |||.||
Human   892 VPGLKGGRGTQGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGEPGKEGPPGL-------RGDPGS 949

  Fly  1231 QGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTI 1295
            .|.                        .||||..||||.      ||.|||   .||.|.|    
Human   950 HGR------------------------VGDRGPAGPPGG------PGDKGD---PGEDGQP---- 977

  Fly  1296 KGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSKGERGLAGSPGQPG 1360
             |..|.||..|..|::|::|.||..||||:|||   ||..|.||.:||.|:.|::      |.||
Human   978 -GPDGPPGPAGTTGQRGIVGMPGQRGERGMPGL---PGPAGTPGKVGPTGATGDK------GPPG 1032

  Fly  1361 QDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGND 1425
            ..|.||:.|..|:.||:|..|..|..|.:|..|::||||..||:||||..|..|..|..|..|:.
Human  1033 PVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGERGDRGDPGPAGLPGSQGAPGTPGPVGAPGDA 1097

  Fly  1426 GPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPKGEPGQPGRNGPKGEPGRP---GE 1487
            |..|.||.||..||.||.|:.|.||..|.:|:.|.....|.:|:.|..|..|.:|.||.|   ||
Human  1098 GQRGDPGSRGPIGPPGRAGKRGLPGPQGPRGDKGDHGDRGDRGQKGHRGFTGLQGLPGPPGPNGE 1162

  Fly  1488 RGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYEGAIGLIGQKGEPGAPAP----- 1547
            :|..||.|..|.:|..|.:|.:|..|.|||.|..|.||.||..|..|..|..||||.|.|     
Human  1163 QGSAGIPGPFGPRGPPGPVGPSGKEGNPGPLGPIGPPGVRGSVGEAGPEGPPGEPGPPGPPGPPG 1227

  Fly  1548 ---AALDYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDLG--SPG------ 1601
               |||..:.|      ...|::|......||             |.|...|..  .||      
Human  1228 HLTAALGDIMG------HYDESMPDPLPEFTE-------------DQAAPDDKNKTDPGVHATLK 1273

  Fly  1602 SCVPRFSTL---------PVLSCGQNNVCNYASRNDKTFWLTTNAAIPMMPVENIEIRQYIS--- 1654
            |...:..|:         |..:|....:|:.|.::.: :|:..|..    .||: .|:.|.:   
Human  1274 SLSSQIETMRSPDGSKKHPARTCDDLKLCHSAKQSGE-YWIDPNQG----SVED-AIKVYCNMET 1332

  Fly  1655 --RCVVCEAPANVIAVHSQT---IEVPDCPNGWEGL 1685
              .|:    .||..:|..:|   .:.||....|.||
Human  1333 GETCI----SANPSSVPRKTWWASKSPDNKPVWYGL 1364

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence    Domain    Region    External ID    Identity
Col4a1 NP_723044.1 Collagen 104..161 CDD:189968 25/56 (45%)
Collagen 322..380 CDD:189968 19/57 (33%)
Collagen 413..465 CDD:189968 17/51 (33%)
Collagen 499..561 CDD:189968 25/61 (41%)
Collagen 574..632 CDD:189968 27/57 (47%)
Collagen 657..714 CDD:189968 25/56 (45%)
Collagen 765..824 CDD:189968 16/58 (28%)
Collagen 854..911 CDD:189968 28/56 (50%)
Collagen 884..943 CDD:189968 20/58 (34%)
Collagen 923..982 CDD:189968 24/58 (41%)
Collagen 1028..1085 CDD:189968 31/62 (50%)
Collagen 1229..1287 CDD:189968 16/57 (28%)
Collagen 1318..1376 CDD:189968 25/57 (44%)
Collagen 1399..1458 CDD:189968 28/58 (48%)
Collagen 1477..1534 CDD:189968 27/59 (46%)
C4 1555..1662 CDD:128421 23/128 (18%)
C4 1663..1777 CDD:128421 9/26 (35%)
COL5A2 NP_000384.2 VWC 41..96 CDD:278520 10/70 (14%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 104..1268 579/1564 (37%)
Collagen 270..328 CDD:189968 29/81 (36%)
Collagen 312..370 CDD:189968 30/64 (47%)
PRK07764 <346..553 CDD:236090 110/291 (38%)
Collagen 495..553 CDD:189968 32/78 (41%)
Cell attachment site. /evidence=ECO:0000255 506..508 0/1 (0%)
Collagen 546..603 CDD:189968 24/58 (41%)
Collagen 591..649 CDD:189968 31/103 (30%)
Collagen 702..759 CDD:189968 32/110 (29%)
Collagen 744..796 CDD:189968 24/84 (29%)
Collagen 819..877 CDD:189968 30/80 (38%)
Collagen 849..907 CDD:189968 28/69 (41%)
Cell attachment site. /evidence=ECO:0000255 944..946 1/1 (100%)
Collagen 1044..1100 CDD:189968 25/55 (45%)
Cell attachment site. /evidence=ECO:0000255 1067..1069 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 1070..1072 1/1 (100%)
Collagen 1074..1131 CDD:189968 27/56 (48%)
Cell attachment site. /evidence=ECO:0000255 1100..1102 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 1127..1129 0/1 (0%)
Cell attachment site. /evidence=ECO:0000255 1136..1138 0/1 (0%)
COLFI 1265..1498 CDD:366624 24/110 (22%)


Information from Original Tools:


Tool             Simple Score  Weighted Score  Original Tool Information
                                               (BLAST Result Score / Score Type / Cluster ID)
Compara          0             0.000           Not matched by this tool.
Domainoid        0             0.000           Not matched by this tool.
eggNOG           1             0.900           - / - / E33208_3BAFZ
Hieranoid        0             0.000           Not matched by this tool.
Homologene       0             0.000           Not matched by this tool.
Inparanoid       0             0.000           Not matched by this tool.
Isobase          0             0.000           Not matched by this tool.
OMA              0             0.000           Not matched by this tool.
OrthoDB          0             0.000           Not matched by this tool.
OrthoFinder      0             0.000           Not matched by this tool.
OrthoInspector   0             0.000           Not matched by this tool.
orthoMCL         0             0.000           Not matched by this tool.
Panther          0             0.000           Not matched by this tool.
Phylome          1             0.910           - / -
RoundUp          0             0.000           Not matched by this tool.
SonicParanoid    0             0.000           Not matched by this tool.
SwiftOrtho       0             0.000           Not matched by this tool.
TreeFam          0             0.000           Not matched by this tool.
User_Submission  0             0.000           Not matched by this tool.
Total            2             1.810
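The last row of the table is consistent with column totals: a simple score of 2 and a weighted score of 1.810. The following minimal sketch checks that reading by summing the two tools that report a match. Treating the final row as a sum is an assumption drawn from the numbers themselves, not something the page states.

```python
# (tool, simple score, weighted score) for the two tools that report a match.
# Every other tool in the table contributes 0 and 0.000.
matched = [
    ("eggNOG", 1, 0.900),
    ("Phylome", 1, 0.910),
]

simple_total = sum(simple for _, simple, _ in matched)
weighted_total = sum(weighted for _, _, weighted in matched)

# Format the weighted total to three decimals, as in the table.
print(f"{simple_total} {weighted_total:.3f}")
```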
