DRSC/TRiP Functional Genomics Resources


Protein Alignment Col4a1 and Col4a4

DIOPT Version: 10

Sequence 1: NP_723044.1  Gene: Col4a1 / 33727  FlyBaseID: FBgn0000299  Length: 1779  Species: Drosophila melanogaster
Sequence 2: NP_031761.1  Gene: Col4a4 / 12829  MGIID: 104687  Length: 1682  Species: Mus musculus


Alignment Length: 1810
Identity: 803/1810 (44%)
Similarity: 944/1810 (52%)
Gaps: 271/1810 (14%)
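The percentages above can be reproduced from the raw counts. The sketch below is illustrative only: the function name `percent` is hypothetical, and the choice to truncate rather than round is an inference from the gap figure (271/1810 is about 14.97%, yet the page shows 14%), not documented DIOPT behavior.

```python
def percent(count: int, total: int) -> int:
    """Whole-number percentage, truncated toward zero (assumed convention)."""
    return count * 100 // total


# Counts copied from the alignment summary above.
alignment_length = 1810
stats = {"Identity": 803, "Similarity": 944, "Gaps": 271}

for name, count in stats.items():
    print(f"{name}: {count}/{alignment_length} ({percent(count, alignment_length)}%)")
```

With rounding instead of truncation, the gap figure would come out as 15%, which is why truncation is assumed here.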


- Green bases have known domain annotations that are detailed below.


  Fly    71 KNCTAGYAGCVPKCIAEKGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKGD---PGPYGQRGD 132
            :||:      |.:|..|||:||.||||||.|..|.:|..|..|..|:||::||   |||.|::||
Mouse    42 RNCS------VCQCFPEKGSRGHPGPLGPQGPIGPLGPLGPIGIPGEKGERGDSGSPGPPGEKGD 100

  Fly   133 KGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPGPRGYAGQLGSKG 197
            ||..|.||..|..||||..||.|..|.||:   ||.:|..|.||..|..|.|||.|..||.|..|
Mouse   101 KGPTGVPGFPGVDGVPGHPGPPGPRGKPGV---DGYNGSRGDPGYPGERGAPGPGGPPGQPGENG 162

  Fly   198 EKGEPAKENGDY--AKGEKGEPGWRGTAGLAGPQGFPGEKGERGDSGPYGAKGPRGEHGLKGEKG 260
            |||......|..  .:|::|:||..|..|..|.||.||..|..|..|..|..|..|..||||.. 
Mouse   163 EKGRSVYITGGVKGIQGDRGDPGPPGLPGSRGAQGSPGPMGHAGAPGLAGPIGHPGSPGLKGNP- 226

  Fly   261 ASCYGPMKPGAPGIKGEKGEP---ASSFPVKPTHTVMGP-----RGDMGQKGEPGLVGRKGEPGP 317
                      |.|:||::|||   ....|..||..|..|     :|:.|.||.||::|..|.||.
Mouse   227 ----------ATGLKGQRGEPGEVGQRGPPGPTLLVQPPDLSIYKGEKGVKGMPGMIGPPGPPGR 281

  Fly   318 EGDTGLDGQKGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNPGQKGEPGRAGATG 382
            :|..|: |.|||||:||.||.||..|:.||||..|.||.:|..|..||.|..|.||:.|..|..|
Mouse   282 KGAPGV-GIKGEKGIPGFPGPRGEPGSHGPPGFPGFKGIQGAAGEPGLFGFLGPKGDLGDRGYPG 345

  Fly   383 KPGLL-----------GPPGPPG-----GGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQG 431
            .||:|           |.|||||     |..|.||||||.|..|.. .||..|..|..|:|||.|
Mouse   346 PPGILLTPAPPLKGVPGDPGPPGYYGEIGDVGLPGPPGPPGRPGET-CPGMMGPPGPPGVPGPPG 409

  Fly   432 YNGQKGGAGLPGR----PGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPG- 491
            :.|:   ||:|||    ||..|.||..|..|..|..||.||......||.|||.|:||..|.|| 
Mouse   410 FPGE---AGVPGRLDCAPGKPGKPGLPGLPGAPGPEGPPGSDVIYCRPGCPGPMGEKGKVGPPGR 471

  Fly   492 YGIQGSKGDAGI-------------PGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGAPG 543
            .|.:|:||:.|:             ||.||.:||||:.|..|..|..||.  |:||..|..|.||
Mouse   472 RGAKGAKGNKGLCTCPPGPMGPPGPPGPPGRQGSKGDLGLPGWHGEKGDP--GQPGAEGPPGPPG 534

  Fly   544 QKGDAGRPGTPGQKGDMGIKGDVGGKCSSCRAGPKGDKGTSGLPGIP---GKDGARGPPGERGYP 605
            :.|..|.||..|:||||.|         |...|.||::|..|.||.|   |:||..|.|||||.|
Mouse   535 RPGAMGPPGHKGEKGDMVI---------SRVKGQKGERGLDGPPGFPGPHGQDGGDGRPGERGDP 590

  Fly   606 GERG--HDGINGQTGPPGEKGEDGRTGL--PGATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKG 666
            |.||  .|...|:.|.||..|..||||.  |...|.||.|            |.:|.||.||..|
Mouse   591 GPRGDHKDAAPGERGLPGLPGPPGRTGPEGPPGLGFPGPP------------GQRGLPGEPGRPG 643

  Fly   667 VQGFKGAE--------------GLPGIPGPKGEFGFKGEKGLSGAPG---NDGTPGRAGRDGYPG 714
            .:||.|.:              |.||:||..|..|.||..|..||||   .||..|:.|:.|..|
Mouse   644 TRGFDGTKGQKGDSILCNVSYPGKPGLPGLDGPPGLKGFPGPPGAPGMRCPDGQKGQRGKPGMSG 708

  Fly   715 IPGQSIKGEPGFHGRDGAKGDKGSFGRS--GEKGEPGSCALDEIKMPAK-GNKGEPGQT--GMPG 774
            ||     |.|||.|..|..|.||..|.|  |..|.|||        |.| |.||.||..  |.||
Mouse   709 IP-----GPPGFRGDMGDPGIKGEKGTSPIGPPGPPGS--------PGKDGQKGIPGDPAFGDPG 760

  Fly   775 PPGEDGSPGERGYTGLKGN---TGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPGKDGLRGIP 836
            ||||.|.||..|..|.||:   .|..||||:.|..||.||:|.:|::|..|:||:||....||.|
Mouse   761 PPGERGLPGAPGMKGQKGHPGCPGAGGPPGIPGSPGLKGPKGREGSRGFPGIPGSPGHSCERGAP 825

  Fly   837 GRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGDRGDAGLPGVS 901
            |..||||        .||..|.||..|.:|:.||.||:||.|..|..|..|.||..|..|.||: 
Mouse   826 GIPGQPG--------LPGTPGDPGAPGWKGQPGDMGPSGPAGMKGLPGLPGLPGADGLRGPPGI- 881

  Fly   902 GRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLV---GMPGNKGD---RGA 960
              ||..||.|..|..|..|:.|.||.||..|.||:.|..||||..|.|   |.||.|||   |||
Mouse   882 --PGPNGEDGLPGLPGLKGLPGLPGFPGFPGERGKPGPDGEPGRKGEVGEKGWPGLKGDLGERGA 944

  Fly   961 PGNDGPKGFAG--VT---GAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGI 1020
            .|:.|..|.||  ||   |.||..||.|..|.||.:||||::|:.      ||||.||..||.|:
Mouse   945 KGDRGLPGDAGEAVTSRKGEPGDAGPPGDGGFSGERGDKGSSGMR------GGRGDPGRDGLPGL 1003

  Fly  1021 -KGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGEPGPSGLRGDTGPAGTP 1084
             :|..|:.|.||.   .|.||..|:.|..|:.|.||.|||   :|..|.|||.|..||.|..|..
Mouse  1004 HRGQPGIDGPPGP---PGPPGPPGSPGLRGVIGFPGFPGD---QGDPGSPGPPGFPGDDGARGPK 1062

  Fly  1085 GWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEK---GEQGLQGVWGQPGEKGSVGAP 1146
            |:.|:..       .:.||||.||:.|..|..||.|:.|||   |::|.:|..|:||:.||.|.|
Mouse  1063 GYKGDPA-------SQCGPPGPKGEPGSPGYQGRTGVPGEKGFPGDEGPRGPPGRPGQPGSFGPP 1120

  Fly  1147 GIPGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGI 1211
            |.||.|||   ||..|.||.||.||.|||.|:   .|.||..|..||:|..|..|..|.|||:|.
Mouse  1121 GCPGDPGM---PGLKGHPGEVGDPGPRGDAGD---FGRPGPAGVKGPLGSPGLNGLHGLKGEKGT 1179

  Fly  1212 RGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIP 1276
            :|..||.                          |.|..||.|:.|.||::|..|.||        
Mouse  1180 KGASGLL--------------------------EMGPPGPMGMPGQKGEKGDPGSPG-------- 1210

  Fly  1277 GAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPI 1341
                 |.|.|        :.||||.||.|||.|..|..||||...:..:|. .|.||..|.|||.
Mouse  1211 -----ISPPG--------LPGEKGFPGPPGRPGPPGPAGAPGRAAKGDIPD-PGPPGDRGPPGPD 1261

  Fly  1342 GPAGSKGERGLAGS----PGQPGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQG 1402
            ||.|..|..|..|:    .|.||..|.||.||.:|..||.|.:|..|.:|.:||||..|..||.|
Mouse  1262 GPRGVPGPPGSPGNVDLLKGDPGDCGLPGPPGSRGPPGPPGCQGPPGCDGKDGQKGPMGLPGLPG 1326

  Fly  1403 PSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGPK 1467
            |.||||..|:|   |.||..|..||||.||.||..||..  ..|..|.:||..|.||...|.|..
Mouse  1327 PPGLPGAPGEK---GLPGPPGRKGPVGPPGCRGEPGPPA--DVDSCPRIPGLPGVPGPRGPEGAM 1386

  Fly  1468 GEPGQPGRNGP--KGEPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYE 1530
            ||||:.|..||  |||||..|.||..||.|..|..|.:|..||.|..|.|||.|..|:||.:|: 
Mouse  1387 GEPGRRGLPGPGCKGEPGPDGRRGQDGIPGSPGPPGRKGDTGEAGCPGAPGPPGPTGDPGPKGF- 1450

  Fly  1531 GAIGLIGQKGEPGAPAPAALDYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQ 1595
                      .||:        |:|.|:..|||::..|||..|...|||||||||::|.:.||||
Mouse  1451 ----------GPGS--------LSGFLLVLHSQTDQEPACPVGMPRLWTGYSLLYMEGQEKAHNQ 1497

  Fly  1596 DLGSPGSCVPRFSTLPVLSCGQNNVCNYASRNDKTFWLTTNAAIPMMPVENIEIRQYISRCVVCE 1660
            |||..|||:|.|||||...|..:.||:||.|||:::||::.|.:||||:...|||.|||||.|||
Mouse  1498 DLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLSSAAPLPMMPLSEEEIRSYISRCAVCE 1562

  Fly  1661 APANVIAVHSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLEDFRATPFIECN 1725
            |||..:|||||...:|.||..|..|||||||||||..|:.||||||.|||||||||||.||:||.
Mouse  1563 APAQAVAVHSQDQSIPPCPRTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFVECQ 1627

  Fly  1726 GAKGTCHFYETMTSFWMYNLESSQPF-ERPQQQTIKAGERQSH-VSRCQVCMKNS 1778
            |.:|||||:....|||:..:.....| ..|...|:|..:.|.. :||||||||:|
Mouse  1628 GRQGTCHFFANEYSFWLTTVNPDLQFASGPSPDTLKEVQAQRRKISRCQVCMKHS 1682
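Each alignment block above consists of three lines: the fly sequence, a line of match symbols, and the mouse sequence. A minimal sketch for tallying one such block follows. It assumes that `|` marks identical residues and that `:` and `.` mark degrees of similarity, which is a common convention that this page does not itself define, so the symbols are counted separately and not merged into a single similarity figure. The names `SEQ_LINE` and `count_block` are hypothetical, and the three-line block in the usage example is a toy input, not taken from the alignment.

```python
import re
from collections import Counter

# A sequence line looks like: "<label> <start> <residues> <end>".
SEQ_LINE = re.compile(r"^\s*(\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def count_block(top: str, match: str, bottom: str) -> dict:
    """Tally match symbols and gap columns for one three-line block."""
    top_m = SEQ_LINE.match(top)
    bottom_m = SEQ_LINE.match(bottom)
    if top_m is None or bottom_m is None:
        raise ValueError("expected '<label> <start> <residues> <end>' lines")
    seq_a, seq_b = top_m.group(3), bottom_m.group(3)
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences differ in length")
    # The match line has no label, so slice it at the residue column offset.
    offset = top_m.start(3)
    symbols = match[offset:offset + len(seq_a)].ljust(len(seq_a))
    tally = Counter(symbols)
    gaps = sum(1 for a, b in zip(seq_a, seq_b) if a == "-" or b == "-")
    return {
        "columns": len(seq_a),
        "pipe": tally["|"],
        "colon": tally[":"],
        "dot": tally["."],
        "gaps": gaps,
    }


# Toy block laid out like the alignment above (12-character left margin).
toy = [
    "  Fly     1 ACD-F 4",
    "            |:. |",
    "Mouse     1 AEDGF 5",
]
print(count_block(*toy))
```

Summing the per-block results over the whole alignment should give totals comparable to the summary counts, provided the symbol interpretation assumed here is correct.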

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence     Domain                                                         Region        External ID  Identity
Col4a1  NP_723044.1  gly_rich_SclB                                                  <107..>361    CDD:468478   116/266 (44%)
                     gly_rich_SclB                                                  <355..>642    CDD:468478   140/327 (43%)
                     gly_rich_SclB                                                  <543..820     CDD:468478   129/308 (42%)
                     gly_rich_SclB                                                  <727..>968    CDD:468478   116/254 (46%)
                     gly_rich_SclB                                                  <969..>1218   CDD:468478   115/257 (45%)
                     gly_rich_SclB                                                  <1186..>1420  CDD:468478   90/237 (38%)
                     gly_rich_SclB                                                  <1321..>1547  CDD:468478   102/231 (44%)
                     C4                                                             1555..1662    CDD:128421   61/106 (58%)
                     C4                                                             1663..1777    CDD:128421   64/115 (56%)
Col4a4  NP_031761.1  7S domain. /evidence=ECO:0000250|UniProtKB:P53420              31..56        -            7/19 (37%)
                     gly_rich_SclB                                                  <54..>356     CDD:468478   140/316 (44%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite              56..255       -            92/212 (43%)
                     Triple-helical region. /evidence=ECO:0000250|UniProtKB:P53420  57..1451      -            663/1548 (43%)
                     Cell attachment site. /evidence=ECO:0000255                    86..88        -            0/1 (0%)
                     Cell attachment site. /evidence=ECO:0000255                    137..139      -            0/1 (0%)
                     Cell attachment site. /evidence=ECO:0000255                    181..183      -            0/1 (0%)
                     gly_rich_SclB                                                  <288..>483    CDD:468478   92/198 (46%)
                     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite              379..1453     -            517/1213 (43%)
                     gly_rich_SclB                                                  <495..>727    CDD:468478   107/259 (41%)
                     Cell attachment site. /evidence=ECO:0000255                    587..589      -            1/1 (100%)
                     Cell attachment site. /evidence=ECO:0000255                    593..595      -            1/1 (100%)
                     gly_rich_SclB                                                  <695..>924    CDD:468478   115/252 (46%)
                     Cell attachment site. /evidence=ECO:0000255                    716..718      -            0/1 (0%)
                     gly_rich_SclB                                                  <908..>1217   CDD:468478   153/380 (40%)
                     Cell attachment site. /evidence=ECO:0000255                    980..982      -            0/1 (0%)
                     Cell attachment site. /evidence=ECO:0000255                    992..994      -            1/1 (100%)
                     gly_rich_SclB                                                  <1142..>1412  CDD:468478   134/325 (41%)
                     Cell attachment site. /evidence=ECO:0000255                    1144..1146    -            1/1 (100%)
                     C4                                                             1458..1563    CDD:460201   59/104 (57%)
                     C4                                                             1566..1680    CDD:460201   62/113 (55%)
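The Region values in the domain list use a `start..end` notation, with an optional `<` before the start and `>` before the end. The sketch below parses that notation into numbers and flags. It assumes, following the usual feature-location convention, that `<` and `>` mark a domain hit that is incomplete at that end; the page does not state this. The names `Region` and `parse_region` are hypothetical.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    start: int
    end: int
    start_partial: bool  # True when the start is prefixed with "<"
    end_partial: bool    # True when the end is prefixed with ">"


REGION = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")


def parse_region(text: str) -> Region:
    """Parse a region such as '<107..>361' or '1555..1662'."""
    m = REGION.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognized region: {text!r}")
    return Region(int(m.group(2)), int(m.group(4)), m.group(1) == "<", m.group(3) == ">")


# Region strings copied from the domain list above.
for text in ["<107..>361", "<543..820", "1555..1662"]:
    print(text, parse_region(text))
```

Keeping the flags separate from the coordinates makes it straightforward to filter for complete hits, such as the two C4 domains in each sequence.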
