DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and CG42342

DIOPT Version :9

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_001189234.1 Gene:CG42342 / 7354466 FlyBaseID:FBgn0259244 Length:831 Species:Drosophila melanogaster


Alignment Length:813 Identity:270/813 - (33%)
Similarity:329/813 - (40%) Gaps:273/813 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly   532 RPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKGDVGGKCSSCRAGPKGDKGTSGLPGIPGKDGAR 596
            |.....||.|...:|.:|                 ||:| .|:.||.                  
  Fly   217 RQAAEAAAAAASGEGGSG-----------------GGQC-QCQPGPP------------------ 245

  Fly   597 GPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEPLKGDKGYPGA 661
            |||                  ||||::|:.|:                        |||.|..|.
  Fly   246 GPP------------------GPPGKRGKRGK------------------------KGDSGEKGD 268

  Fly   662 PGAKGVQGFKGAEGLPGIPGPKGEFG------FKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQSI 720
            ||..|:.|.|||.|.||..|.||:.|      |:..|||.    ...|....|..||..|.....
  Fly   269 PGLNGISGEKGAAGKPGDKGQKGDVGHPGMDVFQTVKGLK----RSVTTLHGGTLGYAEIVAVKD 329

  Fly   721 KGEPGFHGRDGAKGDKGSFGRSGEKGEPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPGER 785
            ..|.|.:                      ..|...||:     |||||:.|.||||||.|.|   
  Fly   330 LQEAGVN----------------------VSASTVIKL-----KGEPGEPGPPGPPGEAGQP--- 364

  Fly   786 GYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGI 850
                        |.||..||                              ||..|..||:||.| 
  Fly   365 ------------GAPGERGP------------------------------PGEIGAQGPQGEAG- 386

  Fly   851 SRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGP 915
             :||..||||:.|..|.|||:                  |||||.||...     |.|::...|.
  Fly   387 -QPGVAGPPGVAGAPGTKGDK------------------GDRGDRGLTTT-----IKGDEFPTGI 427

  Fly   916 I-GPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKGDRGAPGND--GPKGF-------- 969
            | ||.|.|||||.|      |..||:||||..|..|.||.||.||..|..  ||.|.        
  Fly   428 IEGPPGPAGPPGPP------GEPGARGEPGPIGPAGPPGEKGPRGKRGKRIFGPGGTKIDEDYDD 486

  Fly   970 AGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQG 1034
            ..||..   |||.|.||::|.         .|.||..|.:|.||.||..|..|.:||.|.||:.|
  Fly   487 PPVTLL---RGPPGPPGIAGK---------DGRDGRDGSKGEPGEPGEPGSLGPRGLDGLPGEPG 539

  Fly  1035 LDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHG 1099
            ::|.|      |.||..||||      |||.:|:.||.||.|..|..|.||:||.|         
  Fly   540 IEGPP------GLPGYQGPPG------EKGDRGDIGPPGLMGPPGLPGPPGYPGVK--------- 583

  Fly  1100 RAGPPGEKGDQGRSGIDGR----DGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGA 1160
                 |:|||:|.|....|    ||::.......::.::|.||..|.:|.||..|:.|..||.|.
  Fly   584 -----GDKGDRGDSYRKMRRRQDDGMSDAPHMPTIEYLYGPPGPPGPMGPPGHTGSQGERGLDGR 643

  Fly  1161 AGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIR 1225
            .|.||..|:.||:|..|.|            ||:|::|.:|..||.|:.||.|.|||.       
  Fly   644 KGDPGEKGHKGDQGPMGLP------------GPMGMRGESGPSGPSGKAGIPGPPGLD------- 689

  Fly  1226 GDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGY 1290
            |.||:|||   ||.|||:|:.||.|..|:.|.:|.||.||..|.:|..|..|.|||.|.:||.|.
  Fly   690 GMKGAQGE---TGHKGERGDPGLPGTDGIPGQEGPRGEQGSRGDAGPPGKRGRKGDRGDKGEQGV 751

  Fly  1291 PGVTIK---GEKGLPGRPG---RNGRQGLIGAP 1317
            ||:...   |..||| .||   |..::.:|..|
  Fly   752 PGLDAPCPLGADGLP-LPGCGWRPPKEPIISTP 783

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 Collagen 104..161 CDD:189968
Collagen 322..380 CDD:189968
Collagen 413..465 CDD:189968
Collagen 499..561 CDD:189968 6/28 (21%)
Collagen 574..632 CDD:189968 11/57 (19%)
Collagen 657..714 CDD:189968 24/62 (39%)
Collagen 765..824 CDD:189968 18/58 (31%)
Collagen 854..911 CDD:189968 19/56 (34%)
Collagen 884..943 CDD:189968 24/59 (41%)
Collagen 923..982 CDD:189968 28/68 (41%)
Collagen 1028..1085 CDD:189968 24/56 (43%)
Collagen 1229..1287 CDD:189968 28/57 (49%)
Collagen 1318..1376 CDD:189968 270/813 (33%)
Collagen 1399..1458 CDD:189968
Collagen 1477..1534 CDD:189968
C4 1555..1662 CDD:128421
C4 1663..1777 CDD:128421
CG42342NP_001189234.1 DUF4763 130..>210 CDD:292582
Collagen 366..413 CDD:189968 28/96 (29%)
Collagen 660..718 CDD:189968 32/79 (41%)
Collagen 690..748 CDD:189968 30/60 (50%)
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 00.000 Not matched by this tool.
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 1 1.100 - - P PTHR24023
Phylome 1 0.910 - -
RoundUp 00.000 Not matched by this tool.
SonicParanoid 00.000 Not matched by this tool.
22.010

Return to query results.
Submit another query.