DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and Mp

DIOPT Version :9

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_001246651.1 Gene:Mp / 38769 FlyBaseID:FBgn0260660 Length:1039 Species:Drosophila melanogaster


Alignment Length:823 Identity:230/823 - (27%)
Similarity:282/823 - (34%) Gaps:342/823 - (41%)


- Green bases have known domain annotations that are detailed below.


  Fly   450 PPGK---------KGEKGTAGLNGPKG-SI-GPIGHPGPPGPEGQKGDAGLPGYGIQGSKGDAGI 503
            |||:         :|.||..|..|||| || ||.|.||||||:|:  .|..|.: ::.:...|..
  Fly   292 PPGQTQYTHERPYRGIKGEKGERGPKGDSIRGPPGPPGPPGPKGE--TAPYPPF-VETTSAGAKY 353

  Fly   504 PGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKGDVGG 568
            .|......|......|.|... .:|..|.||||         |..|:|||||.            
  Fly   354 TGECTCNASDILEAIKDNESL-RESLRGAPGTP---------GKDGKPGTPGH------------ 396

  Fly   569 KCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPG 633
                           :|..|:||..||||..|.:|..||.|.||:.|..||            ||
  Fly   397 ---------------TGATGVPGARGARGSEGAQGLKGEPGVDGLPGVMGP------------PG 434

  Fly   634 ATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEFGFKGEKGLSGAP 698
            ..|.||.|...|.||:....|.......||||||            ||.||:.|.|||       
  Fly   435 PPGPPGLPENYDESLMVNSMGAFRGTTQPGAKGV------------PGEKGDAGQKGE------- 480

  Fly   699 GNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEIKMPAKGN 763
                                  :|:|         |.||:.|.||.||||               
  Fly   481 ----------------------RGDP---------GHKGAHGPSGAKGEP--------------- 499

  Fly   764 KGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPG 828
             ||||..|:||.||:.|.||                 |::|....||.:||||.:|.        
  Fly   500 -GEPGTPGLPGLPGQVGQPG-----------------GLDGLASANGTKGEKGEKGE-------- 538

  Fly   829 KDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRG 893
                                             .|::|.:|..|.|||||.||.      ||..|
  Fly   539 ---------------------------------KGMRGRRGGTGATGPIGPPGK------PGPMG 564

  Fly   894 DAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKGDR 958
            |.   |.|||||:.|.||::||.||.|.:|           ||:|.||:            ||||
  Fly   565 DI---GHSGRPGMTGPKGEMGPKGPKGDSG-----------GREGLKGD------------KGDR 603

  Fly   959 GAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGD 1023
            |..|.|         |.|   ||.|:|...|..||.|                            
  Fly   604 GQDGRD---------GLP---GPPGLPSTGGGDGDSG---------------------------- 628

  Fly  1024 QGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPG---PSGLRGDTGPAGTPG 1085
             |:...|       |||.      ||..||||||| .|..|.|||||   .|...||....|.||
  Fly   629 -GVQYIP-------MPGP------PGPPGPPGLPG-LSISGPKGEPGVDSRSSFFGDASYYGRPG 678

  Fly  1086 WPGEKGLPGLAV-----------HGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQ--- 1136
              ....|..|..           .|.|.||.:.|             :..|.|:.|..|.|:   
  Fly   679 --ARSSLDELKALRELQDLRDRPDGTAEPPRQPG-------------HSHKHEETLGLVDGEEPY 728

  Fly  1137 -PGEKGSVGAPGIPGAPGMDGL-----PGAAGAPGAVGYPGD------RGDKGEPGLSGLPGLKG 1189
             .....::....:|||.....:     ..|...||.:.|..:      |.:||...::     .|
  Fly   729 FSASSSNMNMKIVPGAVTFQNIDEMTKKSALNPPGTLAYITEEEALLVRVNKGWQYIA-----LG 788

  Fly  1190 ETGPVGLQGFTGAPGPKGERGIRGQPGLPATV-PDIRGDKGSQ 1231
            ...|:      ..|.|            |.|| |.:|.|..|:
  Fly   789 TLVPI------ATPAP------------PTTVAPSMRFDLQSK 813

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 Collagen 104..161 CDD:189968
Collagen 322..380 CDD:189968
Collagen 413..465 CDD:189968 7/23 (30%)
Collagen 499..561 CDD:189968 18/61 (30%)
Collagen 574..632 CDD:189968 18/57 (32%)
Collagen 657..714 CDD:189968 14/56 (25%)
Collagen 765..824 CDD:189968 21/58 (36%)
Collagen 854..911 CDD:189968 22/56 (39%)
Collagen 884..943 CDD:189968 24/58 (41%)
Collagen 923..982 CDD:189968 16/58 (28%)
Collagen 1028..1085 CDD:189968 24/59 (41%)
Collagen 1229..1287 CDD:189968 1/3 (33%)
Collagen 1318..1376 CDD:189968
Collagen 1399..1458 CDD:189968
Collagen 1477..1534 CDD:189968
C4 1555..1662 CDD:128421
C4 1663..1777 CDD:128421
MpNP_001246651.1 LamG 58..222 CDD:304605
Collagen 378..432 CDD:189968 28/89 (31%)
Collagen 464..518 CDD:189968 35/119 (29%)
Collagen 494..615 CDD:189968 71/238 (30%)
Endostatin-like 831..999 CDD:238151
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C45461340
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
RoundUp 00.000 Not matched by this tool.
SonicParanoid 00.000 Not matched by this tool.
21.840

Return to query results.
Submit another query.