DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Mp and Col4a1

DIOPT Version :10

Sequence 1:NP_001246651.1 Gene:Mp / 38769 FlyBaseID:FBgn0260660 Length:1039 Species:Drosophila melanogaster
Sequence 2:NP_001128481.1 Gene:Col4a1 / 290905 RGDID:1307148 Length:1669 Species:Rattus norvegicus


Alignment Length:607 Identity:213/607 - (35%)
Similarity:252/607 - (41%) Gaps:150/607 - (24%)


- Green bases have known domain annotations that are detailed below.


  Fly   273 DFWNTGD---------EATDIFDASGMQPPGQTQYTHERPYRGIKGEKGERGPKGDSIRGPPGPP 328
            ||..||:         :.|..:...|  .||:.....:....|.|||||..|..|:|  |.||.|
  Rat   253 DFAPTGEKGQKGEPGFQGTPGYGEKG--EPGKPGPRGKPGKDGEKGEKGSPGFPGES--GYPGLP 313

  Fly   329 GPPGPKGETA----PYPP--FVETTSAGAKYTGECTCNASDILEAIKDNESLR--ESLRGAPGTP 385
            |..||:||..    |.||  .|.|...|.|  ||         .....:..||  ...:|.||:|
  Rat   314 GRQGPQGEKGEPGLPGPPGTVVGTRPLGEK--GE---------RGYPGSPGLRGEPGPKGYPGSP 367

  Fly   386 GKDGKPG--TPGHTGATGVPGARGARGSEGAQG--LKGEPGVDGLPGVMGPPGPP---------- 436
            |:.|.||  .||.|||.|.||.||.:|..|:.|  |.|..|.||.||..||||||          
  Rat   368 GQPGPPGFAVPGQTGAPGFPGERGEKGERGSPGVSLPGPSGRDGAPGPPGPPGPPGQPGHTNGIV 432

  Fly   437 ----------GPPGLP------------ENYDESLMVNSMGAFRGTTQPGAKGVPGEKGDAGQ-- 477
                      ||||:|            ....||.:.......||  .||.:|.|||.|..||  
  Rat   433 ECQPGPPGDQGPPGIPGQPGLTGEVGQKGQKGESCLACDTEGLRG--PPGPQGPPGEIGFPGQPG 495

  Fly   478 -KGERGDPGHKGAH------------GPSGAKGEPGE-------------PGTPGLPGLPGQVGQ 516
             ||:||.||..|..            |..||||||||             ||.||.||:||:.|.
  Rat   496 AKGDRGLPGRDGLEGLPGPQGSPGLIGQPGAKGEPGEIFFDMRLKGDKGDPGFPGQPGMPGRAGT 560

  Fly   517 PG--GLDGLASANGTKGEKGEKGEKGMRGRRGGTGATGPIGPPGKP--GPMGDIGHSGR------ 571
            ||  |..||....|:.|..|.|||:|..|..|..|:.|.|||||.|  ||:|.||..|:      
  Rat   561 PGRDGHPGLPGPKGSPGSIGLKGERGPPGGVGFPGSRGDIGPPGPPGVGPIGPIGEKGQAGLPGG 625

  Fly   572 ---PGMTGPKGEMGP-------------------KGPKGDSG-----GREGLKGDKGDRGQD--G 607
               ||:.|||||.|.                   .||:||.|     ||.|..|:||..||.  |
  Rat   626 PGSPGLPGPKGEAGKVVPLPGPPGAAGLPGSPGFPGPQGDRGFPGTPGRPGNPGEKGAVGQPGIG 690

  Fly   608 RDGLPGP---PGLPSTGGGDGDSGGVQYIPMPGPPGPPGPPGLPGLSISGPKGEPGVDSRSSFFG 669
            ..|||||   .|||...|..|..|...:..:||.|||.|..|.||:.:.|.||:||:.......|
  Rat   691 FPGLPGPKGVDGLPGEIGRPGSPGRPGFNGLPGNPGPQGQKGEPGIGLPGLKGQPGLPGIPGTPG 755

  Fly   670 DASYYGRPGARSSLDELKALRELQDLRDRP-----DGTAEPPRQPGHSHKHEETLGLVDGEEPYF 729
            :....|.||.... ..|.....||.:|..|     .|.|.||..||..  ....:|...|:.|  
  Rat   756 EKGSIGGPGVPGE-QGLTGPPGLQGIRGDPGPPGVQGPAGPPGVPGIG--PPGAMGPPGGQGP-- 815

  Fly   730 SASSSNMNMKIVPGAVTFQNID 751
            ..||....:|...|...|..:|
  Rat   816 PGSSGPPGVKGEKGFPGFPGLD 837

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
MpNP_001246651.1 LamG 58..222 CDD:473984
gly_rich_SclB <292..>421 CDD:468478 54/140 (39%)
gly_rich_SclB <379..>616 CDD:468478 132/342 (39%)
Collagen_trimer 744..792 CDD:466257 2/8 (25%)
Endostatin-like 831..999 CDD:238151
Col4a1NP_001128481.1 gly_rich_SclB <42..>320 CDD:468478 24/70 (34%)
gly_rich_SclB <267..>577 CDD:468478 116/326 (36%)
gly_rich_SclB <506..>766 CDD:468478 98/259 (38%)
gly_rich_SclB <717..>992 CDD:468478 38/126 (30%)
gly_rich_SclB <948..>1163 CDD:468478
gly_rich_SclB <1118..>1335 CDD:468478
gly_rich_SclB <1229..>1443 CDD:468478
C4 1446..1553 CDD:460201
C4 1556..1667 CDD:460201
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.