DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and Col4a1

DIOPT Version :9

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_034061.2 Gene:Col4a1 / 12826 MGIID:88454 Length:1669 Species:Mus musculus


Alignment Length:1866 Identity:822/1866 - (44%)
Similarity:982/1866 - (52%) Gaps:290/1866 - (15%)


- Green bases have known domain annotations that are detailed below.


  Fly     2 LPFWKRLLYAAVIAGALVGADAQFWKTAGTAGSIQDSVKHYNRNEPKFPIDDSYDIVDSAGVARG 66
            |..|..||:||::.                         |..|               |...|:|
Mouse     5 LSVWLLLLFAALLL-------------------------HEER---------------SRAAAKG 29

  Fly    67 DLPPKNCTAGYAGCVPKC-----IAEKGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKGDPGP 126
            |       .|.:|| .||     ..:||.||||      ||:|.:|||||:||      :|..||
Mouse    30 D-------CGGSGC-GKCDCHGVKGQKGERGLP------GLQGVIGFPGMQGP------EGPHGP 74

  Fly   127 YGQRGDKGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPGPRGYAG 191
            .||:||.||.|.||..|..|.||..|..||||.|||.|:||..|..||||..|..|..||.|..|
Mouse    75 PGQKGDAGEPGLPGTKGTRGPPGAAGYPGNPGLPGIPGQDGPPGPPGIPGCNGTKGERGPLGPPG 139

  Fly   192 QLGSKGEKGEPAKENGDYAKGEKGEPG-----WRGTAGLAGPQGFPGEKGERGDSGPYGAKGPRG 251
            ..|..|..|.|.      ..|.||:||     ..||. |.|.:||||..|..|..|..|.:||.|
Mouse   140 LPGFSGNPGPPG------LPGMKGDPGEILGHVPGTL-LKGERGFPGIPGMPGSPGLPGLQGPVG 197

  Fly   252 EHGLKGEKGASCYGPMKPGAPGIKGEKGEPASSFPVKPTHTVMGPRGDMGQKGEPGLVGRKGEPG 316
            ..|..|..|       .||.||..||||:..|||        .||:||   |||.|:.|..|.||
Mouse   198 PPGFTGPPG-------PPGPPGPPGEKGQMGSSF--------QGPKGD---KGEQGVSGPPGVPG 244

  Fly   317 -----PEGD---TGLDGQKGEKGLPGGP-----GDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGN 368
                 .:||   ||..|||||.|.||.|     |:.|:||..|.||..|:||:||.|   |:||:
Mouse   245 QAQVKEKGDFAPTGEKGQKGEPGFPGVPGYGEKGEPGKQGPRGKPGKDGEKGERGSP---GIPGD 306

  Fly   369 PGQKGEPGRAGATGKPGLLGPPGPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYN 433
            .|..|.|||.|..|:.|..|.|||||...||. |.|.||.|||.|||      |:.|.|||:|: 
Mouse   307 SGYPGLPGRQGPQGEKGEAGLPGPPGTVIGTM-PLGEKGDRGYPGAP------GLRGEPGPKGF- 363

  Fly   434 GQKGGAGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPGYGIQGSK 498
                    ||.||..||||..             :.|..|.||.||..|:|||.|.||..:.|..
Mouse   364 --------PGTPGQPGPPGFP-------------TPGQAGAPGFPGERGEKGDQGFPGVSLPGPS 407

  Fly   499 GDAGIPGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGA-----PGQKGDAGRPGTPGQ-- 556
            |..|.||.|               |.||.     ||.||....     ||..||.|.||||||  
Mouse   408 GRDGAPGPP---------------GPPGP-----PGQPGHTNGIVECQPGPPGDQGPPGTPGQPG 452

  Fly   557 -KGDMGIKGDVGGKCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYP------GERGHDGIN 614
             .|::|.||..|..|.:|        .|.||.|.|   |.:|||||.|:|      |:||..|.:
Mouse   453 LTGEVGQKGQKGESCLAC--------DTEGLRGPP---GPQGPPGEIGFPGQPGAKGDRGLPGRD 506

  Fly   615 GQTGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIEPLKGDKGYPGAPGAKGVQGFKGAEGLPGI 679
            |..|.||.:|..|..|.|||.||||: ...|:.| :..|||.|:||.||..|..|..|.:|.||:
Mouse   507 GLEGLPGPQGSPGLIGQPGAKGEPGE-IFFDMRL-KGDKGDPGFPGQPGMPGRAGTPGRDGHPGL 569

  Fly   680 PGPKGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPGFHGRDGAKGDKGSFGRSGE 744
            |||||..|..|.||..|.||..|.||..|..|.||.||....|..|..|:.|..|..||.|..|.
Mouse   570 PGPKGSPGSIGLKGERGPPGGVGFPGSRGDIGPPGPPGVGPIGPVGEKGQAGFPGGPGSPGLPGP 634

  Fly   745 KGEPGSCALDEIKMPA-KGNKGEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGL 808
            |||.|..    :.:|. .|..|.||..|.|||.|:.|.||..|..|:.|..|..|.||:..| ||
Mouse   635 KGEAGKV----VPLPGPPGAAGLPGSPGFPGPQGDRGFPGTPGRPGIPGEKGAVGQPGIGFP-GL 694

  Fly   809 NGPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGP 873
            .||:|..|..|.:|.||:||:.|..|:||..|..|.:|||||..||..|.|||.|:.|..|::|.
Mouse   695 PGPKGVDGLPGEIGRPGSPGRPGFNGLPGNPGPQGQKGEPGIGLPGLKGQPGLPGIPGTPGEKGS 759

  Fly   874 TGPIGFPGADGSVGYP---GDRGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRG 935
            .|..|.||..|..|.|   |.|||.|.|||.|..|..|..|    |||.|..||||..|..|..|
Mouse   760 IGGPGVPGEQGLTGPPGLQGIRGDPGPPGVQGPAGPPGVPG----IGPPGAMGPPGGQGPPGSSG 820

  Fly   936 RDGAKGEPGSPGLVG--MPGNKGDRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATG 998
            ..|.|||.|.||..|  |||.|||:|:.|..|..|.:|:.|.||::|..|:||..|:||:   .|
Mouse   821 PPGIKGEKGFPGFPGLDMPGPKGDKGSQGLPGLTGQSGLPGLPGQQGTPGVPGFPGSKGE---MG 882

  Fly   999 LTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGP------PGLP 1057
            :.|..|..|..||.|.|||.|.|||.||.|:.|.:|..|..|:||:.|.||:.|.      ..:.
Mouse   883 VMGTPGQPGSPGPAGTPGLPGEKGDHGLPGSSGPRGDPGFKGDKGDVGLPGMPGSMEHVDMGSMK 947

  Fly  1058 GDASEKGQKGEPGPSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGIN 1122
            |...::|:||:.||:|.:|..|..||||.||:.|..     |..|.||.|||.|.||..|..|:.
Mouse   948 GQKGDQGEKGQIGPTGDKGSRGDPGTPGVPGKDGQA-----GHPGQPGPKGDPGLSGTPGSPGLP 1007

  Fly  1123 GEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGAAGAPG--------AVGYPGDRGDKGEP 1179
            |.||..|..|:.|.||||   |.|||||:.|:.|.||..||.|        .:|.||..||||:.
Mouse  1008 GPKGSVGGMGLPGSPGEK---GVPGIPGSQGVPGSPGEKGAKGEKGQSGLPGIGIPGRPGDKGDQ 1069

  Fly  1180 GLSGLPGL---KGETGPVGLQGFTGAPGPKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKG 1241
            ||:|.||.   |||.|..|..|..|:|||:|..|..|.||.|. :|..:||||..|..|..|.||
Mouse  1070 GLAGFPGSPGEKGEKGSAGTPGMPGSPGPRGSPGNIGHPGSPG-LPGEKGDKGLPGLDGVPGVKG 1133

  Fly  1242 EQGERGLTGPAGVAGAKGDRGLQGPPGAS-----------GLNGIPGAKGDIGPRGEIGYPGVTI 1295
            |.|..|..||.|.||.||:.|..|.||::           |..|.||:|||.|.:||:|:||:. 
Mouse  1134 EAGLPGTPGPTGPAGQKGEPGSDGIPGSAGEKGEQGVPGRGFPGFPGSKGDKGSKGEVGFPGLA- 1197

  Fly  1296 KGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGSKGERGLAGSPGQPG 1360
                |.||.||..|.||.:|.||..|:.||||..|.|    :.||.|..|.:|:.||.|.||..|
Mouse  1198 ----GSPGIPGVKGEQGFMGPPGPQGQPGLPGTPGHP----VEGPKGDRGPQGQPGLPGHPGPMG 1254

  Fly  1361 QDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVGQKGDTGYPGLNGND 1425
            ..||||..|.|||.|.||:.|..|:   .|.|||.|.:|:.|..|.||:.|.|||.|.||:.|..
Mouse  1255 PPGFPGINGPKGDKGNQGWPGAPGV---PGPKGDPGFQGMPGIGGSPGITGSKGDMGLPGVPGFQ 1316

  Fly  1426 GPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGMLPPPGP----KGEPGQPGRNGPKGEPGRPG 1486
            |..|.|   |..|.||..|..|.||..|.:|.||   ||||    |||||.|   ||:|.|    
Mouse  1317 GQKGLP---GLQGVKGDQGDQGVPGPKGLQGPPG---PPGPYDVIKGEPGLP---GPEGPP---- 1368

  Fly  1487 ERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYEGAIGLIGQKGEPGAPAPAAL- 1550
              ||.|:||..|.||::|:.|..|..|.||..|..|.||::|..|..|..|.:|.||.|.|..| 
Mouse  1369 --GLKGLQGPPGPKGQQGVTGSVGLPGPPGVPGFDGAPGQKGETGPFGPPGPRGFPGPPGPDGLP 1431

  Fly  1551 -----------DYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDLGSPGSCV 1604
                       |:  |.|:|||||:...|.|..|...|:.|||||||.||:.||.||||:.|||:
Mouse  1432 GSMGPPGTPSVDH--GFLVTRHSQTTDDPLCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCL 1494

  Fly  1605 PRFSTLPVLSCGQNNVCNYASRNDKTFWLTTNAAIP--MMPVENIEIRQYISRCVVCEAPANVIA 1667
            .:|||:|.|.|..|||||:|||||.::||:|...:|  |.|:....||.:||||.||||||.|:|
Mouse  1495 RKFSTMPFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPISGDNIRPFISRCAVCEAPAMVMA 1559

  Fly  1668 VHSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLEDFRATPFIECNGAKGTCH 1732
            ||||||::|.|||||..|||||||:|||:.|..|.||||.|||||||:||:.|||||:| :|||:
Mouse  1560 VHSQTIQIPQCPNGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHG-RGTCN 1623

  Fly  1733 FYETMTSFWMYNLESSQPFERPQQQTIKAGERQSHVSRCQVCMKNS 1778
            :|....|||:..:|.|:.|::|...|:||||.::|||||||||:.:
Mouse  1624 YYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRRT 1669

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 Collagen 104..161 CDD:189968 29/56 (52%)
Collagen 322..380 CDD:189968 31/62 (50%)
Collagen 413..465 CDD:189968 17/51 (33%)
Collagen 499..561 CDD:189968 24/69 (35%)
Collagen 574..632 CDD:189968 24/63 (38%)
Collagen 657..714 CDD:189968 28/56 (50%)
Collagen 765..824 CDD:189968 27/58 (47%)
Collagen 854..911 CDD:189968 27/59 (46%)
Collagen 884..943 CDD:189968 29/61 (48%)
Collagen 923..982 CDD:189968 30/60 (50%)
Collagen 1028..1085 CDD:189968 22/62 (35%)
Collagen 1229..1287 CDD:189968 28/68 (41%)
Collagen 1318..1376 CDD:189968 27/57 (47%)
Collagen 1399..1458 CDD:189968 26/58 (45%)
Collagen 1477..1534 CDD:189968 24/56 (43%)
C4 1555..1662 CDD:128421 60/108 (56%)
C4 1663..1777 CDD:128421 71/113 (63%)
Col4a1NP_034061.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 47..1443 629/1466 (43%)
Triple-helical region 173..1440 575/1332 (43%)
Collagen 275..331 CDD:189968 27/63 (43%)
Collagen 510..585 CDD:189968 26/82 (32%)
Collagen <645..688 CDD:189968 20/42 (48%)
Collagen 690..745 CDD:189968 24/54 (44%)
Collagen 737..788 CDD:189968 22/51 (43%)
Collagen 839..896 CDD:189968 27/59 (46%)
Collagen 876..935 CDD:189968 28/61 (46%)
Collagen 975..1032 CDD:189968 26/56 (46%)
Collagen 1058..1116 CDD:189968 24/57 (42%)
Collagen 1088..1147 CDD:189968 26/58 (45%)
Collagen 1269..1326 CDD:189968 27/67 (40%)
C4 1446..1552 CDD:279721 49/109 (45%)
C4 1556..1666 CDD:279721 61/111 (55%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 164 1.000 Domainoid score I3922
eggNOG 1 0.900 - - E33208_3BAFZ
Hieranoid 1 1.000 - -
Homologene 00.000 Not matched by this tool.
Inparanoid 1 1.050 1346 1.000 Inparanoid score I143
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D41315at33208
OrthoFinder 1 1.000 - - FOG0000443
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 1 0.900 - - OOG6_100768
Panther 1 1.100 - - LDO PTHR24023
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R2460
SonicParanoid 1 1.000 - - X1239
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
1211.900

Return to query results.
Submit another query.