DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and let-2

DIOPT Version :9

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_510663.1 Gene:let-2 / 181708 WormBaseID:WBGene00002280 Length:1759 Species:Caenorhabditis elegans


Alignment Length:1758 Identity:880/1758 - (50%)
Similarity:1021/1758 - (58%) Gaps:97/1758 - (5%)


- Green bases have known domain annotations that are detailed below.


  Fly    79 GCVPKCIAEKGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKGDPGPYGQRGDKGERGSPGLHG 143
            ||.  |:.|||:.|.|||.||.|.:|..||||.||.:|.||.||..||.|..|.||:||:.|:.|
 Worm    37 GCF--CVGEKGSMGAPGPQGPPGTQGIRGFPGPEGLAGPKGLKGAQGPPGPVGIKGDRGAVGVPG 99

  Fly   144 QAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLSGMPGPRGYAGQLGSKGEKGEPAKENGD 208
            ..|..|..|..|.||.||..|.|||:|.||.||:.|..|.||..|:.|..|..|.|||||.....
 Worm   100 FPGNDGGNGRPGEPGPPGAPGWDGCNGTDGAPGIPGRPGPPGMPGFPGPPGMDGLKGEPAIGYAG 164

  Fly   209 YAKGEKGEPGWRGTAGLAGP---QGFPGEKGERGDSGPYGAKGPRGEHGLKGEKGASCYGPM-KP 269
             |.||||:.|..|..||.||   .|:|||||:|||:|..|.:||.||.|..|..|....||. .|
 Worm   165 -APGEKGDGGMPGMPGLPGPSGRDGYPGEKGDRGDTGNAGPRGPPGEAGSPGNPGIGSIGPKGDP 228

  Fly   270 GAPGIKGEKGEPASSFPVKPTHTVMGPRGDM---GQKGEPGLVGRKGEPGP---EGDTGLDGQKG 328
            |..|..|..|.|..........|::||:||:   |:|||||..|::|.||.   .|..||.|.||
 Worm   229 GDIGAMGPAGPPGPIASTMSKGTIIGPKGDLGEKGEKGEPGEGGQRGYPGNGGLSGQPGLPGMKG 293

  Fly   329 EKGL--PGGP-GDRGRQGNFGPPGSTGQKGDRGEPGLNGLPGNPGQKGEPGRAGATGKPGLLGPP 390
            ||||  |.|| |..||.||.||||.   |||||..||.|:||.||||||.|..|..|..|..|||
 Worm   294 EKGLSGPAGPRGKEGRPGNAGPPGF---KGDRGLDGLGGIPGLPGQKGEAGYPGRDGPKGNSGPP 355

  Fly   391 GPPGGGRGTPGPPGPKGPRGYVGAPGPQGLNGVDGLPGPQGYNGQKGGAGLPGRPGNEGPPGKKG 455
            ||||||....|.|||.|..|..|.|||.|.:|..|.|||.|..|..||.||||.|||||.||.||
 Worm   356 GPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGYPGAPGPAGPIGNTGGPGLPGYPGNEGLPGPKG 420

  Fly   456 EKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPGYGIQGSKGDAGIPGYPGLKGSKGERGFKG 520
            :||..|:.|..|..||.|.||.|||:|:.|..|.||..|.|..|..|.||..|..|.|||.|..|
 Worm   421 DKGDGGIPGAPGVSGPSGIPGLPGPKGEPGYRGTPGQSIPGLPGKDGKPGLDGAPGRKGENGLPG 485

  Fly   521 NAGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGTPGQKGDMGIKGDVGGKCSSCRAGPKGDKGTSG 585
            ..|.||||..|.||.||..||||..|..||.|..|..|..|.|||.||.||:|..|.||:||..|
 Worm   486 VRGPPGDSLNGLPGAPGQRGAPGPNGYDGRDGVNGLPGAPGTKGDRGGTCSACAPGTKGEKGLPG 550

  Fly   586 LPGIPGKDGARGPPGERGYPGERGHDGINGQTGPPGEKGEDGRTGLPGATGEPGKPALCDLSLIE 650
            ..|.||..|.||.||..|..|:.|.||:.|..|.||..|..|:.|.||..|:.|:|....|.   
 Worm   551 YSGQPGPQGDRGLPGMPGPVGDAGDDGLPGPAGRPGSPGPPGQDGFPGLPGQKGEPTQLTLR--- 612

  Fly   651 PLKGDKGYPGAPGAKGVQGFKGAEGLPGIPGPKGEF------GFKGEKGLSGAPGNDGTPGRAGR 709
                 .|.||.||.||..||.|..|:.|:|||.|..      |:.||||.:|.||..|.||:.|.
 Worm   613 -----PGPPGYPGLKGENGFPGQPGVDGLPGPSGPVGPPGAPGYPGEKGDAGLPGLSGKPGQDGL 672

  Fly   710 DGYPGIPGQSIKGEPGFHGRDGAKGD---KGSFGRSGEKGEPGSCALDEIKMPA-------KGNK 764
            .|.||..|::..|:||..|..|||||   .|..|..|.:|.||..|.:....||       .|..
 Worm   673 PGLPGNKGEAGYGQPGQPGFPGAKGDGGLPGLPGTPGLQGMPGEPAPENQVNPAPPGQPGLPGLP 737

  Fly   765 GEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQGPPGVEGPRGLNGPRGEKGNQGAVGVPGNPGK 829
            |..|:.|.||.|||.|.||..|..|:||::|..||||:.|..|:.|.:|..|..|..|:||..|.
 Worm   738 GTKGEGGYPGRPGEVGQPGFPGLPGMKGDSGLPGPPGLPGHPGVPGDKGFGGVPGLPGIPGPKGD 802

  Fly   830 DGLRGIPGRNGQPGPRGEPGISRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGADGSVGYPGD--- 891
            .|..|:||.|||   :||||:..||..|.||..||   |||.|..|..|.||.:|..|:||.   
 Worm   803 VGNPGLPGLNGQ---KGEPGVGVPGQPGSPGFPGL---KGDAGLPGLPGTPGLEGQRGFPGAPGL 861

  Fly   892 RGDAGLPGVSGRPGIVGEKGDVGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKG 956
            :|..||||:||:||..|||||.|..|..|..|.||.||.||:.|..|.|||.|.|||.|:.|.||
 Worm   862 KGGDGLPGLSGQPGYPGEKGDAGLPGVPGREGSPGFPGQDGLPGVPGMKGEDGLPGLPGVTGLKG 926

  Fly   957 DRGAPGNDGPKGFAGVTGAPGKRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIK 1021
            |.||||..|..|..|..|.||.:|.||||||.|.|||.|..||.|.:||.|..|.||.||..|:|
 Worm   927 DLGAPGQSGAPGLPGAPGYPGMKGNAGIPGVPGFKGDGGLPGLPGLNGPKGEPGVPGMPGTPGMK 991

  Fly  1022 GDQGLAGAPGQQGLDGMPGEKGNQGFPGLDGPPGLPGDASEKGQKGE---PGPSGLRGDTGPAGT 1083
            |:.||.|.||:.||.|:||.||::||.||.|..|..|.|:..||||:   ||..||||..||:|.
 Worm   992 GNGGLPGLPGRDGLSGVPGMKGDRGFNGLPGEKGEAGPAARDGQKGDAGLPGQPGLRGPQGPSGL 1056

  Fly  1084 PGWPGEKGLPGLAVHGRAGPPGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGI 1148
            ||.||.||..||..:|:.|.|||||..|..|..||.|..|..|:.||.|.   ||.||..|.||.
 Worm  1057 PGVPGFKGETGLPGYGQPGQPGEKGLPGIPGKAGRQGAPGSPGQDGLPGF---PGMKGESGYPGQ 1118

  Fly  1149 PGAPGMDGLPGAAGAPGAVGYPGDRGDKGEPGLS---GLPGLKGETGPVGL------QGFTGAPG 1204
            .|.||.|||||..|..|.:|..|..|..|.|||.   |:||::|:.|..||      :|..|.||
 Worm  1119 DGLPGRDGLPGVPGQKGDLGQSGQPGLSGAPGLDGQPGVPGIRGDKGQGGLPGIPGDRGMDGYPG 1183

  Fly  1205 PKGERGIRGQPGLPATVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGA 1269
            .|||.|..||||||.    :.|:||..|..|:.|.||..|..|..|..|:.|.|||.|..|.||.
 Worm  1184 QKGENGYPGQPGLPG----LGGEKGFAGTPGFPGLKGSPGYPGQDGLPGIPGLKGDSGFPGQPGQ 1244

  Fly  1270 SGLNGIPGAKGDIGPRGEIGYPGVTIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGL 1334
            .||.|:.|.||..|..|..|.||.:|.|..|.||.||..|:.|..|.||..||.||.||.|.|||
 Worm  1245 EGLPGLSGEKGMGGLPGMPGQPGQSIAGPVGPPGAPGLQGKDGFPGLPGQKGESGLSGLPGAPGL 1309

  Fly  1335 VGLPGPIGPAGSKGERGLAGSPGQPGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRG 1399
            .|..|..|..|:||:.|..|.||:.|:||.||.||..|..|..|.|||.|..|..||.|..|..|
 Worm  1310 KGESGMPGFPGAKGDLGANGIPGKRGEDGLPGVPGRDGQPGIPGLKGEVGGAGLPGQPGFPGIPG 1374

  Fly  1400 LQGPSGLPGLVGQKGDTGYPGLNGNDGPVGAPGERGFTGPKGRDGRDGTPGLPGQKGEPGM---L 1461
            |:|..||||..|.||:.|:|         |.||..|:.|.|| ||  |.|||||:.|.||.   :
 Worm  1375 LKGEGGLPGFPGAKGEAGFP---------GTPGVPGYAGEKG-DG--GLPGLPGRDGLPGADGPV 1427

  Fly  1462 PPPGPKG-----EPGQPGRNGPKGEPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDR 1521
            .||||.|     |||:.|..|..|.||..||:|:.|:.|..|..|..||.|:.||.|.||..|..
 Worm  1428 GPPGPSGPQNLVEPGEKGLPGLPGAPGLRGEKGMPGLDGPPGNDGPPGLPGQRGNDGYPGAPGLS 1492

  Fly  1522 GEPGERGYEGAIGLIGQKGEPGAP----APAALD--YLTGILITRHSQSETVPACSAGHTELWTG 1580
            ||.|..|..|..||.||.|.||||    ||.|..  |..|.::.:|||:..||.|..|.|:||.|
 Worm  1493 GEKGMGGLPGFPGLDGQPGGPGAPGLPGAPGAAGPAYRDGFVLVKHSQTTEVPRCPEGQTKLWDG 1557

  Fly  1581 YSLLYVDGNDYAHNQDLGSPGSCVPRFSTLPVLSCGQNNVCNYASRNDKTFWLTTNAAIPMMPVE 1645
            |||||::||:.:||||||..|||:.||||:|.|.|..||||||||||||::||:|:.|||||||.
 Worm  1558 YSLLYIEGNEKSHNQDLGHAGSCLQRFSTMPFLFCDFNNVCNYASRNDKSYWLSTSEAIPMMPVN 1622

  Fly  1646 NIEIRQYISRCVVCEAPANVIAVHSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPG 1710
            ..||..|||||.|||||||.||||||||::|:||.||..|||||||.|||..|..||||:|.|||
 Worm  1623 EREIEPYISRCAVCEAPANTIAVHSQTIQIPNCPAGWSSLWIGYSFAMHTGAGAEGGGQSLSSPG 1687

  Fly  1711 SCLEDFRATPFIECNGAKGTCHFYETMTSFWMYNLESSQPFERPQQQTIKAGERQSHVSRCQVCM 1775
            |||||||||||||||||:|:||::....|||:..:::...|:.|:.||:|:|..::.|||||||:
 Worm  1688 SCLEDFRATPFIECNGARGSCHYFANKFSFWLTTIDNDSEFKVPESQTLKSGNLRTRVSRCQVCV 1752

  Fly  1776 KNS 1778
            |::
 Worm  1753 KST 1755

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 Collagen 104..161 CDD:189968 28/56 (50%)
Collagen 322..380 CDD:189968 38/60 (63%)
Collagen 413..465 CDD:189968 30/51 (59%)
Collagen 499..561 CDD:189968 31/61 (51%)
Collagen 574..632 CDD:189968 26/57 (46%)
Collagen 657..714 CDD:189968 29/62 (47%)
Collagen 765..824 CDD:189968 27/58 (47%)
Collagen 854..911 CDD:189968 29/59 (49%)
Collagen 884..943 CDD:189968 32/61 (52%)
Collagen 923..982 CDD:189968 32/58 (55%)
Collagen 1028..1085 CDD:189968 31/59 (53%)
Collagen 1229..1287 CDD:189968 25/57 (44%)
Collagen 1318..1376 CDD:189968 29/57 (51%)
Collagen 1399..1458 CDD:189968 28/58 (48%)
Collagen 1477..1534 CDD:189968 25/56 (45%)
C4 1555..1662 CDD:128421 67/106 (63%)
C4 1663..1777 CDD:128421 70/113 (62%)
let-2NP_510663.1 Collagen 181..240 CDD:189968 27/58 (47%)
Collagen 429..490 CDD:189968 29/60 (48%)
Collagen 470..531 CDD:189968 31/60 (52%)
Collagen 791..844 CDD:189968 28/58 (48%)
Collagen 821..879 CDD:189968 28/60 (47%)
Collagen 899..958 CDD:189968 33/58 (57%)
Collagen 1004..1061 CDD:189968 30/56 (54%)
Collagen 1072..1131 CDD:189968 32/61 (52%)
Collagen 1117..1176 CDD:189968 25/58 (43%)
Collagen 1202..1253 CDD:189968 22/50 (44%)
Collagen 1308..1367 CDD:189968 28/58 (48%)
Collagen 1347..1405 CDD:189968 28/66 (42%)
C4 1533..1637 CDD:279721 64/103 (62%)
C4 1641..1751 CDD:279721 67/109 (61%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 1 1.000 161 1.000 Domainoid score I2450
eggNOG 00.000 Not matched by this tool.
Hieranoid 1 1.000 - -
Homologene 1 1.000 - - H1390
Inparanoid 1 1.050 1570 1.000 Inparanoid score I39
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D41315at33208
OrthoFinder 1 1.000 - - FOG0000443
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 1 0.900 - - OOG6_100768
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R2460
SonicParanoid 1 1.000 - - X1239
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
1110.900

Return to query results.
Submit another query.