DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment: Col4a1 and Col1a1

Sequence 1:NP_723044.1 Gene:Col4a1 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_445756.1 Gene:Col1a1 RGDID:61817 Length:1453 Species:Rattus norvegicus

Alignment Length:1404 Identity:561/1405 (40%)
Similarity:645/1405 (46%) Gaps:292/1405 (21%)


  Fly    63 VARGDLPPKNCTAGYAGCVPKCIAE--------------KGNRGLPGPLGPTGLKGEMGFPGMEG 113
            :.:.||...|.......|.|.|..|              ||:.|..||.||.|..|:.|.||..|
  Rat    65 LCKEDLDCPNPQKREGECCPFCPEEYVSPDAEVIGVEGPKGDPGPQGPRGPVGPPGQDGIPGQPG 129

  Fly   114 PSGDKGQKGDPGPYGQRGDKGERGSPGLHGQA---GVPGVQGPAGN---PGAPGINGKDGCDGQD 172
            ..|..|..|.|||.|..|:...:.|.|...::   .|||..||:|.   ||.||..|..|..|..
  Rat   130 LPGPPGPPGPPGPPGLGGNFASQMSYGYDEKSAGVSVPGPMGPSGPRGLPGPPGAPGPQGFQGPP 194

  Fly   173 GIPGLEGLSGMPGPRGYAGQLGSKGEKGEPAKENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGE 237
            |.||..|.||..||||..|..|..|:.||..|..   ..||:|.||.:|..||.|..|.||.||.
  Rat   195 GEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPG---RPGERGPPGPQGARGLPGTAGLPGMKGH 256

  Fly   238 RGDSGPYGAK------GPRGEHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGP 296
            ||.||..|||      ||:||.|..||.||    |.:.|..|:.||:|.|             ||
  Rat   257 RGFSGLDGAKGDTGPAGPKGEPGSPGENGA----PGQMGPRGLPGERGRP-------------GP 304

  Fly   297 RGDMGQKGEPGLVGRKGEPGPEGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPG 361
            .|..|.:|..|.||..|.|||.|.||..|..|..|..|..|.:|.:|:.||.|..|:.|..|..|
  Rat   305 PGSAGARGNDGAVGAAGPPGPTGPTGPPGFPGAAGAKGEAGPQGARGSEGPQGVRGEPGPPGPAG 369

  Fly   362 LNGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGL 426
            ..|..||||..|:||..||.|.||:.|.|       |.||..||.||:|..|||||         
  Rat   370 AAGPAGNPGADGQPGAKGANGAPGIAGAP-------GFPGARGPSGPQGPSGAPGP--------- 418

  Fly   427 PGPQGYNGQKGGAGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPG 491
                     ||.:|.||.|||:|..|.|||.|.||:.||         |||.|.||::|..|.| 
  Rat   419 ---------KGNSGEPGAPGNKGDTGAKGEPGPAGVQGP---------PGPAGEEGKRGARGEP- 464

  Fly   492 YGIQGSKGDAGIPGYPGLKGSKGERGFKGNAGAPG-DSKLGRPGTPGAAGAPGQKGDAGRPGTPG 555
                   |.:|:||.||.:|..|.|||.|..|..| ....|..|:||.||..|..|:|||||   
  Rat   465 -------GPSGLPGPPGERGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPG--- 519

  Fly   556 QKGDMGIKGDVGGKCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTGPP 620
                              .||..|.||.:|.||.||.||..||      ||..|.||..|..|||
  Rat   520 ------------------EAGLPGAKGLTGSPGSPGPDGKTGP------PGPAGQDGRPGPAGPP 560

  Fly   621 GEKGEDGRTGLP---GATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGP 682
            |.:|:.|..|.|   |..|||||            .|::|.||.|||.|..|..|..|..|.|||
  Rat   561 GARGQAGVMGFPGPKGTAGEPGK------------AGERGVPGPPGAVGPAGKDGEAGAQGAPGP 613

  Fly   683 KGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIPG-QSIKGEPGFHGRDGAKGDKGSFGRSGEKG 746
            .|..|.:||:|.:|:||..|.||.||..|..|.|| |.:.|:.|..|..||:|::|..|..|.:|
  Rat   614 AGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQG 678

  Fly   747 EPGSCALDEIKMPA--KGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLN 809
            .||         ||  :||.|.||..|..|..|..|:||.:|..||      ||.||..|..||.
  Rat   679 PPG---------PAGPRGNNGAPGNDGAKGDTGAPGAPGSQGAPGL------QGMPGERGAAGLP 728

  Fly   810 GPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPG----ISRPGPMGPPGLNGLQGEKGD 870
            ||:|::|:.|..|..|:|||||:||:.|..|.|||.|.||    ....||.||.|..|..|::|:
  Rat   729 GPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGETGPSGPAGPTGARGAPGDRGE 793

  Fly   871 RGPTGPIGF---PGADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDG 932
            .||.||.||   |||||.            ||..|.||..|.|||.||.||||.|||||      
  Rat   794 PGPPGPAGFAGPPGADGQ------------PGAKGEPGDTGVKGDAGPPGPAGPAGPPG------ 840

  Fly   933 VRGRDGAKGEPGSP-GLVGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGA 996
                         | |.||.||.||.|||.|..|..||.|..|..|..||               
  Rat   841 -------------PIGNVGAPGPKGSRGAAGPPGATGFPGAAGRVGPPGP--------------- 877

  Fly   997 TGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDAS 1061
               :||.||.|..||.|..|..|.:|:.|.||.||:.|..|.||..|.:|.||.|||.|.|    
  Rat   878 ---SGNAGPPGPPGPVGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGSP---- 935

  Fly  1062 EKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKG 1126
                 |.|||.|:.|..|..|.||..||:|.|||     .||.||.|.||.||..|..|..|..|
  Rat   936 -----GTPGPQGIAGQRGVVGLPGQRGERGFPGL-----PGPSGEPGKQGPSGASGERGPPGPMG 990

  Fly  1127 EQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGET 1191
            ..||.   |.|||.|..|:||..|:||.|      |||||.|..|:.|..|.||..|.||..|..
  Rat   991 PPGLA---GPPGESGREGSPGAEGSPGRD------GAPGAKGDRGETGPAGPPGAPGAPGAPGPV 1046

  Fly  1192 GPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAG 1256
            ||.|..|..|..||.|..|..|..|       .||..|.||.||..||.||||:|      |:.|
  Rat  1047 GPAGKNGDRGETGPAGPAGPIGPAG-------ARGPAGPQGPRGDKGETGEQGDR------GIKG 1098

  Fly  1257 AKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIG 1321
            .:|..|||||||:                                   ||..|.||..||.|..|
  Rat  1099 HRGFSGLQGPPGS-----------------------------------PGSPGEQGPSGASGPAG 1128

  Fly  1322 ERGLPGLAGEP---GLVGLPGPIGPAGSKGERGLAGSPGQPGQDGFPGAPGLKGDTGPQGFK--- 1380
            .||.||.||.|   ||.||||||||.|.:|..|.:|..|.||..|.||.||  ..:|...|.   
  Rat  1129 PRGPPGSAGSPGKDGLNGLPGPIGPPGPRGRTGDSGPAGPPGPPGPPGPPG--PPSGGYDFSFLP 1191

  Fly  1381 ---GERGLNGFEGQKGDKG----DRGLQGPSGLPGLVGQ 1412
               .|:..:|....:.|..    ||.|:..:.|..|..|
  Rat  1192 QPPQEKSQDGGRYYRADDANVVRDRDLEVDTTLKSLSQQ 1230

Known Domains:


GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 Collagen 104..161 CDD:189968 24/63 (38%)
Collagen 322..380 CDD:189968 24/58 (41%)
Collagen 413..465 CDD:189968 22/52 (42%)
Collagen 499..561 CDD:189968 27/63 (43%)
Collagen 574..632 CDD:189968 27/58 (47%)
Collagen 657..714 CDD:189968 28/57 (49%)
Collagen 765..824 CDD:189968 25/59 (42%)
Collagen 854..911 CDD:189968 26/60 (43%)
Collagen 884..943 CDD:189968 21/59 (36%)
Collagen 923..982 CDD:189968 24/60 (40%)
Collagen 1028..1085 CDD:189968 25/57 (44%)
Collagen 1229..1287 CDD:189968 22/58 (38%)
Collagen 1318..1376 CDD:189968 32/61 (52%)
Collagen 1399..1458 CDD:189968 4/15 (27%)
Collagen 1477..1534 CDD:189968
C4 1555..1662 CDD:128421
C4 1663..1777 CDD:128421
Col1a1NP_445756.1 VWC 31..86 CDD:214564 5/21 (24%)
Nonhelical region (N-terminal) 152..167 3/15 (20%)
Triple-helical region 168..1181 518/1251 (41%)
Collagen 225..284 CDD:189968 29/62 (47%)
Collagen 264..316 CDD:189968 25/69 (36%)
Collagen 390..449 CDD:189968 36/93 (39%)
Collagen 486..545 CDD:189968 30/86 (35%)
Collagen 525..584 CDD:189968 32/77 (42%)
Collagen 657..714 CDD:189968 25/66 (38%)
Collagen 696..748 CDD:189968 25/58 (43%)
Cell attachment site. {ECO:0000255} 734..736 1/2 (50%)
Collagen 1068..1123 CDD:189968 32/103 (31%)
Cell attachment site. {ECO:0000255} 1082..1084 2/2 (100%)
Major antigenic determinant (of neutral salt-extracted rat skin collagen) 1176..1186 4/12 (33%)
Nonhelical region (C-terminal) 1182..1207 4/25 (16%)
COLFI 1219..1452 CDD:279718 3/13 (23%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
eggNOG 1 0.900 E33208_3BAFZ
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 1 0.900 OOG5_126592
Panther 00.000 Not matched by this tool.
Phylome 00.000 Not matched by this tool.
TreeFam 00.000 Not matched by this tool.
21.800

Return to query results.
Submit another query.