DRSC/TRiP Functional Genomics Resources


Protein Alignment Col4a1 and Col9a2

DIOPT Version: 10

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: NP_001102145.1  Gene: Col9a2 / 362584  RGDID: 1307029  Length: 688  Species: Rattus norvegicus


Alignment Length: 1009  Identity: 350/1009 (34%)
Similarity: 412/1009 (40%)  Gaps: 370/1009 (36%)
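
The percentages above are plain ratios over the alignment length. The sketch below reproduces them from the reported counts. The helper names `percent_floor` and `percent_nearest` are hypothetical, and the rounding behaviour is inferred from the numbers on this page, not taken from DIOPT documentation: the summary values match truncation, while the per-domain identities in the Known Domains table (for example 5/14 shown as 36%) match rounding to the nearest integer.

```python
def percent_floor(count: int, total: int) -> int:
    """Percentage truncated toward zero (integer arithmetic only)."""
    return count * 100 // total


def percent_nearest(count: int, total: int) -> int:
    """Percentage rounded to the nearest integer, halves rounded up."""
    return (count * 200 + total) // (2 * total)


# Counts copied from the alignment summary on this page.
ALIGNMENT_LENGTH = 1009
summary = {"identity": 350, "similarity": 412, "gaps": 370}

summary_percent = {
    name: percent_floor(count, ALIGNMENT_LENGTH)
    for name, count in summary.items()
}
print(summary_percent)

# One row of the domain table: 5 identical positions out of 14.
print(percent_nearest(5, 14))
```

Integer arithmetic is used so that the result does not depend on floating-point rounding.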


- Green residues have known domain annotations that are detailed below.


  Fly   346 GPPGSTGQKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRG 410
            |||         |||||.|.||.||..|..|..|..|.||.:|||    |.:|.||.|||.||. 
  Rat    26 GPP---------GEPGLPGPPGPPGVPGSDGIDGDKGPPGKVGPP----GSKGEPGKPGPDGPD- 76

  Fly   411 YVGAPGPQGLNGVDGLPGPQGYNGQKGGAGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHP 475
              |.||..||.|..|.|||.|..|.||..||||.||..||          |..||         |
  Rat    77 --GKPGIDGLMGAKGEPGPMGTPGIKGQPGLPGPPGLPGP----------GFAGP---------P 120

  Fly   476 GPPGPEGQKGDAGLPGYGIQGSKGDAGIPGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAG 540
            |||||.|..|:.|.|     |.|||                                ||..|.:|
  Rat   121 GPPGPVGLPGEIGTP-----GPKGD--------------------------------PGPEGPSG 148

  Fly   541 APGQKGDAGRPGTPGQKGDMGIKGDVGGKC-SSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGY 604
            .||..|..|||||     ..|::|.....| ::|.||.||.:|..|:.|.|||.|..|.||.:|.
  Rat   149 PPGPPGKPGRPGT-----IQGLEGSADFLCPTNCPAGVKGPQGLQGVKGHPGKRGILGDPGRQGK 208

  Fly   605 PGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQG 669
            ||.:|..|.:                                       |::|.||.||.:|::|
  Rat   209 PGPKGDVGAS---------------------------------------GEQGIPGPPGPQGIRG 234

  Fly   670 FKGAEGLPGIPGPKGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKG 734
            :      ||:.|||||.|.:|.||:.|:.|..|.||..|..|.|                     
  Rat   235 Y------PGMAGPKGEMGPRGYKGMVGSIGAAGPPGEEGPRGPP--------------------- 272

  Fly   735 DKGSFGRSGEKGEPGSCALDEIKMPAKGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGP 799
                 ||:||||:.||       ..|:|.:|..|..|..||||.||..|..|..|:||:.|..|.
  Rat   273 -----GRAGEKGDVGS-------QGARGPQGITGPKGTTGPPGIDGKDGTPGIPGMKGSAGQVGR 325

  Fly   800 PGVEGPRGLNGPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGL 864
            |            |..|:||..||||.|   |.:|.||..|:||.:|.||||     |||   |.
  Rat   326 P------------GSPGHQGLAGVPGQP---GTKGGPGDKGEPGQQGFPGIS-----GPP---GK 367

  Fly   865 QGEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPG 929
            :||.|.||.|||                           .||:|||||.|..||.|..||     
  Rat   368 EGEPGPRGETGP---------------------------QGIMGEKGDQGERGPVGQPGP----- 400

  Fly   930 IDGVRGRDGAKGEPGSPGL---VGMPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAK 991
                :||.|.|||.||||:   .|:||.|||:|:||..||            ||..|.|||:|..
  Rat   401 ----QGRQGPKGEQGSPGIPGPQGLPGIKGDKGSPGKTGP------------RGGVGDPGVAGLP 449

  Fly   992 GDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGL 1056
            |:||..||:|.            |||   ||.||:.|.|      |.||..|:.|.||:.|.|||
  Rat   450 GEKGEKGLSGE------------PGL---KGQQGVRGEP------GYPGPSGDAGAPGVQGYPGL 493

  Fly  1057 PGDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGI 1121
                        |||.||.||.|..|.|                          ||.|:.||...
  Rat   494 ------------PGPRGLVGDRGVPGQP--------------------------GRQGVVGRAAS 520

  Fly  1122 NGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPG 1186
            :....:..|:.:..|..|..........||.||.||||..|.|   ||||.:|..|.||..|:||
  Rat   521 DQHIVDVVLKMIQEQLAEVAVSAKREALGATGMVGLPGPPGPP---GYPGKQGPNGHPGPRGIPG 582

  Fly  1187 LKGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGP 1251
            :.|..|.:      |..||||:|                            ||||:|||.|    
  Rat   583 IVGAVGQI------GNTGPKGKR----------------------------GEKGDQGEMG---- 609

  Fly  1252 AGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGR--NGRQGLI 1314
                  :|..|:.|||      |||                       |||||||:  ||:.|..
  Rat   610 ------RGHPGMPGPP------GIP-----------------------GLPGRPGQAINGKDGDR 639

  Fly  1315 GAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSKG 1348
            |:||..||.|.|   |.||.|||||...||...|
  Rat   640 GSPGAPGEAGRP---GRPGPVGLPGFCEPAACLG 670

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain         Region         External ID  Identity
Col4a1  NP_723044.1     gly_rich_SclB  <107..>361     CDD:468478   5/14 (36%)
                        gly_rich_SclB  <355..>642     CDD:468478   102/287 (36%)
                        gly_rich_SclB  <543..820      CDD:468478   87/277 (31%)
                        gly_rich_SclB  <727..>968     CDD:468478   94/243 (39%)
                        gly_rich_SclB  <969..>1218    CDD:468478   80/248 (32%)
                        gly_rich_SclB  <1186..>1420   CDD:468478   53/165 (32%)
                        gly_rich_SclB  <1321..>1547   CDD:468478   15/28 (54%)
                        C4             1555..1662     CDD:128421
                        C4             1663..1777     CDD:128421
Col9a2  NP_001102145.1  gly_rich_SclB  <192..>437     CDD:468478   134/393 (34%)
                        gly_rich_SclB  <346..>656     CDD:468478   171/503 (34%)
Blue background indicates that the domain is not in the aligned region.