DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Col4a1 and emb-9

DIOPT Version :9

Sequence 1:NP_723044.1 Gene:Col4a1 / 33727 FlyBaseID:FBgn0000299 Length:1779 Species:Drosophila melanogaster
Sequence 2:NP_001022662.2 Gene:emb-9 / 176314 WormBaseID:WBGene00001263 Length:1759 Species:Caenorhabditis elegans


Alignment Length:1813 Identity:810/1813 - (44%)
Similarity:973/1813 - (53%) Gaps:173/1813 - (9%)


- Green bases have known domain annotations that are detailed below.


  Fly    58 VDSAGVARGDLPPKNCTAGYAGCVPKCIAEKGNRGLPGPLGPTGLKGEMGFPGMEGPSGDKGQKG 122
            ||:|...:|..||         ||  |...||.||.|      |..||.|.||..|..|.:|..|
 Worm    26 VDAAAACKGCAPP---------CV--CPGTKGERGNP------GFGGEPGHPGAPGQDGPEGAPG 73

  Fly   123 DPGPYGQRGD------KGERGSPGLHGQAGVPGVQGPAGNPGAPGINGKDGCDGQDGIPGLEGLS 181
            .||.:|..||      ||.||..||.|..|.||:||..|.||..|..|..||:|.||.||:.||:
 Worm    74 APGMFGAEGDFGDMGSKGARGDRGLPGSPGHPGLQGLDGLPGLKGEEGIPGCNGTDGFPGMPGLA 138

  Fly   182 GMPGPRGYAGQLGSKGEKGEPAK--ENGDYAKGEKGEPGWRGTAGLAGPQGFPGEKGERGDSGPY 244
            |.||..|..|..|..|..|.|.:  .|....||.|||.|..|..||.|..|:||.||.:||.|||
 Worm   139 GPPGQSGQNGNPGRPGLSGPPGEGGVNSQGRKGVKGESGRSGVPGLPGNSGYPGLKGAKGDPGPY 203

  Fly   245 GAKGPRGEHGLKGEKGASCYGPMK-----PGAPGIKGEKGE-PASSFPVKPTHTVMGPRGDMGQK 303
            |..|..|..||||..|....| :|     ||.||..|:.|. |.:|.|:: ...:.||.|..|.|
 Worm   204 GLPGFPGVSGLKGRMGVRTSG-VKGEKGLPGPPGPPGQPGSYPWASKPIE-MEVLQGPVGPAGVK 266

  Fly   304 GEPGLVGRKGEPGPEGDTGLDGQ------KGEKGLPGGPGDRGRQGNFGPPGSTGQKGDRGEPGL 362
            ||.   ||.|..||.|..||||.      ||:||..|..|.||::|..|.||:.|:||.:||.||
 Worm   267 GEK---GRDGPVGPPGMLGLDGPPGYPGLKGQKGDLGDAGQRGKRGKDGVPGNYGEKGSQGEQGL 328

  Fly   363 NGLPGNPGQKGEPGRAGATGKPGLLGPPGPPGG-GRGTPGPPGPKGPRGYVGAPGPQGLNGVDGL 426
            .|.||.||.||..|..|..|:||..|..||.|. |.|| |..||.|.:|:.|..|.:||.|.|||
 Worm   329 GGTPGYPGTKGGAGEPGYPGRPGFEGDCGPEGPLGEGT-GEAGPHGAQGFDGVQGGKGLPGHDGL 392

  Fly   427 PGPQGYNGQKGGAGLPGRPGNEGPPGKKGEKGTAGLNGPKGSIGPIGHPGPPGPEGQKGDAGLPG 491
            |||.|..|..|..|.||:||.:|.||.. |||..|.:|..|..|..|.||.||..|..|:.||||
 Worm   393 PGPVGPRGPVGAPGAPGQPGIDGMPGYT-EKGDRGEDGYPGFAGEPGLPGEPGDCGYPGEDGLPG 456

  Fly   492 YGIQGS---KGDAGIPGYPGLKGSKGERGFKGNAGAPGDSKLGRPGTPGAAGAPGQKGDAGRPGT 553
            |.|||.   .|.:|..|:||:.|..|:.|:.|..|.|| :.:.:.|.||..|.||:.|..||.|.
 Worm   457 YDIQGPPGLDGQSGRDGFPGIPGDIGDPGYSGEKGFPG-TGVNKVGPPGMTGLPGEPGMPGRIGV 520

  Fly   554 PGQKGDMGIKGDVGGKCSSCRAGPKGDKGTSGLPGIPGKDGARGPPGERGYPGERGHDGINGQTG 618
            .|..|..|..|:.|..|..|   |.|..|.:|.||.||.:|..||||..   |:.|..|:.|..|
 Worm   521 DGYPGPPGNNGERGEDCGYC---PDGVPGNAGDPGFPGMNGYPGPPGPN---GDHGDCGMPGAPG 579

  Fly   619 PPGEKGEDGRTGLPGATGEPGKPALCDLS--LIEP---------LKGDKGYPGAPGAKGVQGFKG 672
            .||..|.||.:|.||..|.||.|.:...:  ::.|         ||||.|.||.||.      .|
 Worm   580 KPGSAGSDGLSGSPGLPGIPGYPGMKGEAGEIVGPMENPAGIPGLKGDHGLPGLPGR------PG 638

  Fly   673 AEGLPGIPGPKGEFGFKGEKGLSGAPGNDGTPGRAGRDGYPGIPGQSIKGEPG--FHGRDGAKGD 735
            ::||||.||..|:.||   .||.|.||..|..|:.||.|..||||  ::|.||  |.|:.|..|.
 Worm   639 SDGLPGYPGGPGQNGF---PGLQGEPGLAGIDGKRGRQGSLGIPG--LQGPPGDSFPGQPGTPGY 698

  Fly   736 KGSFGRSGEKGEPGSCALDEIKMPAK-GNK--GEPGQTGMPGPPGEDGSPGERGYTGLKGNTGPQ 797
            ||..|..|..|.||:.....|..|.: .|:  |:||..||||.||:.|:.   |..||.|..||.
 Worm   699 KGERGADGLPGLPGAQGPRGIPAPLRIVNQVAGQPGVDGMPGLPGDRGAD---GLPGLPGPVGPD 760

  Fly   798 GPPGVEGPRGLN------------GPRGEKGNQGAVGVPGNPGKDGLRGIPGRNGQPGPRGEPGI 850
            |.||..|.||::            |.||::|..|..|:.|:.|:.||.|.||..|.||..||.|.
 Worm   761 GYPGTPGERGMDGLPGFPGLHGEPGMRGQQGEVGFNGIDGDCGEPGLDGYPGAPGAPGAPGETGF 825

  Fly   851 SRPGPMGPPGLNGLQGEKGDRGPTGPIGFPGAD---GSVGYPGDRGDAGLPGVSGRPGIVGEKGD 912
            ..||.:|.||.|   |:.|..|..||.|:||.|   |:.||||:.|..|..|..|:||..||.|.
 Worm   826 GFPGQVGYPGPN---GDAGAAGLPGPDGYPGRDGLPGTPGYPGEAGMNGQDGAPGQPGSRGESGL 887

  Fly   913 VGPIGPAGVAGPPGVPGIDGVRGRDGAKGEPGSPGLVGMPGNKGDRGAPGNDGPKGFAGVTGAPG 977
            ||..|..|..|.||..|.||..|..|..|.||..|:.|.||..||:|.||:.|..|:.|.:|.||
 Worm   888 VGIDGKKGRDGTPGTRGQDGGPGYSGEAGAPGQNGMDGYPGAPGDQGYPGSPGQDGYPGPSGIPG 952

  Fly   978 KRGPAGIPGVSGAKGDKGATGLTGNDGPVGGRGPPGAPGLMGIKGDQGLAGAPGQQGLDGMPGEK 1042
            :.|..|.||:.|..||.|..||.|..|..|.||..|.||..|..|..||.|.||..|..|..||.
 Worm   953 EDGLVGFPGLRGEHGDNGLPGLEGECGEEGSRGLDGVPGYPGEHGTDGLPGLPGADGQPGFVGEA 1017

  Fly  1043 GNQGFPGLDGPPGLPGDASEKGQKGEPG-PSGLRGDTGPAGTPGWPGEKGLPGLAVHGRAGP--- 1103
            |..|.||..|.||.||:.:..||.|:.| |       ||.|.||.||:.|||||  :|..|.   
 Worm  1018 GEPGTPGYRGQPGEPGNLAYPGQPGDVGYP-------GPDGPPGLPGQDGLPGL--NGERGDNGD 1073

  Fly  1104 --PGEKGDQGRSGIDGRDGINGEKGEQGLQGVWGQPGEKGSVGAPGIPGAPGMDGLPGA------ 1160
              ||..|..|:.|..|.||::|..|..|..|:.|.||.||..|.||:||..|.||:||.      
 Worm  1074 SYPGNPGLSGQPGDAGYDGLDGVPGPPGYPGITGMPGLKGESGLPGLPGRQGNDGIPGQPGLEGE 1138

  Fly  1161 ------AGAPGAVGYPGDRGDKGEPGLSGLPGLKGETGPVGLQGFTGAPGPKGERGIRGQPGLPA 1219
                  .|:||..||||.:|.:||   .|.||:.||.|..||:|..|.||.|||.|:.||||.|.
 Worm  1139 CGEDGFPGSPGQPGYPGQQGREGE---KGYPGIPGENGLPGLRGQDGQPGLKGENGLDGQPGYPG 1200

  Fly  1220 TVPDIRGDKGSQGERGYTGEKGEQGERGLTGPAGVAGAKGDRGLQGPPGASGLNGIPGAKGDIGP 1284
            :.    |..|:.|:.||.|..||.|:.|..|..|..|.:|:.|..|.||..|.:|.|   |.:||
 Worm  1201 SA----GQLGTPGDVGYPGAPGENGDNGNQGRDGQPGLRGESGQPGQPGLPGRDGQP---GPVGP 1258

  Fly  1285 RGEIGYPGV---TIKGEKGLPGRPGRNGRQGLIGAPGLIGERGLPGLAGEPGLVGLPGPIGPAGS 1346
            .|:.||||.   .|.|..|..|:.|..|..||.|||||.||.|.||..|.|||.|.||..|..|.
 Worm  1259 PGDDGYPGAPGQDIYGPPGQAGQDGYPGLDGLPGAPGLNGEPGSPGQYGMPGLPGGPGESGLPGY 1323

  Fly  1347 KGERGLAGSPGQPGQDGFPGAPGLKGDTGPQGFKGERGLNGFEGQKGDKGDRGLQGPSGLPGLVG 1411
            .|||||.|..|:.|.||.|||||:.|..|..|.:|:.|.:|:.|..|..|..|..|..||||:.|
 Worm  1324 PGERGLPGLDGKRGHDGLPGAPGVPGVEGVPGLEGDCGEDGYPGAPGAPGSNGYPGERGLPGVPG 1388

  Fly  1412 QK---GDTGYPGLNGNDGPVGAPGERGFTGPKGRD---GRDGTPGLPGQKGEPGMLPPPGP---- 1466
            |:   ||.|||         ||||:.|..||:|.|   ||||..||||:.|..|:   |||    
 Worm  1389 QQGRSGDNGYP---------GAPGQPGIKGPRGDDGFPGRDGLDGLPGRPGREGL---PGPMAMA 1441

  Fly  1467 -KGEPGQPGRNGPKGEPGRPGERGLIGIQGERGEKGERGLIGETGNVGRPGPKGDRGEPGERGYE 1530
             :..|||||.||..||.|.||..|..|:.|..|:.|..|..|..|..|.||..|..|..|::|::
 Worm  1442 VRNPPGQPGENGYPGEKGYPGLPGDNGLSGPPGKAGYPGAPGTDGYPGPPGLSGMPGHGGDQGFQ 1506

  Fly  1531 GAIGLIGQKGEPGAP----APAALDYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDY 1591
            ||.|..|..|.||.|    :|.......|....:|||:..||.|..|.::||.|||||||.||..
 Worm  1507 GAAGRTGNPGLPGTPGYPGSPGGWAPSRGFTFAKHSQTTAVPQCPPGASQLWEGYSLLYVQGNGR 1571

  Fly  1592 AHNQDLGSPGSCVPRFSTLPVLSCGQNNVCNYASRNDKTFWLTTNAAI-PMM-PVENIEIRQYIS 1654
            |..||||.||||:.:|:|:|.:.|..|:||:.:||||.:|||:|:..: ||| ||....||.|||
 Worm  1572 ASGQDLGQPGSCLSKFNTMPFMFCNMNSVCHVSSRNDYSFWLSTDEPMTPMMNPVTGTAIRPYIS 1636

  Fly  1655 RCVVCEAPANVIAVHSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLEDFRAT 1719
            ||.|||.|..:||||||...||.||.||.|:|.||||:||||.|..|.||:|||||||||:|||.
 Worm  1637 RCAVCEVPTQIIAVHSQDTSVPQCPQGWSGMWTGYSFVMHTAAGAEGTGQSLQSPGSCLEEFRAV 1701

  Fly  1720 PFIECNGAKGTCHFYETMTSFWMYNLESSQPFERPQQQTIKAGERQSHVSRCQVCMKN 1777
            |||||:| :|||::|.|...||:..::..:.|.:|..||:|||..:..|||||||:||
 Worm  1702 PFIECHG-RGTCNYYATNHGFWLSIVDQDKQFRKPMSQTLKAGGLKDRVSRCQVCLKN 1758

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Col4a1NP_723044.1 Collagen 104..161 CDD:189968 29/62 (47%)
Collagen 322..380 CDD:189968 31/63 (49%)
Collagen 413..465 CDD:189968 26/51 (51%)
Collagen 499..561 CDD:189968 24/61 (39%)
Collagen 574..632 CDD:189968 24/57 (42%)
Collagen 657..714 CDD:189968 25/56 (45%)
Collagen 765..824 CDD:189968 28/70 (40%)
Collagen 854..911 CDD:189968 27/59 (46%)
Collagen 884..943 CDD:189968 27/58 (47%)
Collagen 923..982 CDD:189968 27/58 (47%)
Collagen 1028..1085 CDD:189968 24/57 (42%)
Collagen 1229..1287 CDD:189968 23/57 (40%)
Collagen 1318..1376 CDD:189968 32/57 (56%)
Collagen 1399..1458 CDD:189968 31/64 (48%)
Collagen 1477..1534 CDD:189968 23/56 (41%)
C4 1555..1662 CDD:128421 58/108 (54%)
C4 1663..1777 CDD:128421 65/113 (58%)
emb-9NP_001022662.2 Collagen 61..119 CDD:189968 25/57 (44%)
Collagen 97..154 CDD:189968 28/56 (50%)
Collagen 267..325 CDD:189968 27/60 (45%)
Collagen 357..417 CDD:189968 31/60 (52%)
Collagen 391..451 CDD:189968 29/60 (48%)
Collagen 423..485 CDD:189968 27/61 (44%)
Collagen 473..535 CDD:189968 24/62 (39%)
Collagen 655..717 CDD:189968 28/63 (44%)
Collagen 731..789 CDD:189968 24/60 (40%)
Collagen 841..900 CDD:189968 27/58 (47%)
Collagen 895..953 CDD:189968 26/57 (46%)
Collagen 984..1041 CDD:189968 26/56 (46%)
Collagen 1107..1166 CDD:189968 26/61 (43%)
Collagen 1143..1202 CDD:189968 31/61 (51%)
Collagen 1194..1249 CDD:189968 23/58 (40%)
Collagen 1346..1405 CDD:189968 27/67 (40%)
Collagen 1379..1437 CDD:189968 32/69 (46%)
Collagen 1468..1527 CDD:189968 22/58 (38%)
C4 1536..1642 CDD:279721 55/105 (52%)
C4 1645..1758 CDD:128421 65/113 (58%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 00.000 Not matched by this tool.
Domainoid 00.000 Not matched by this tool.
eggNOG 00.000 Not matched by this tool.
Hieranoid 00.000 Not matched by this tool.
Homologene 1 1.000 - - H1390
Inparanoid 00.000 Not matched by this tool.
Isobase 00.000 Not matched by this tool.
OMA 00.000 Not matched by this tool.
OrthoDB 1 1.010 - - D41315at33208
OrthoFinder 1 1.000 - - FOG0000443
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 1 0.900 - - OOG6_100768
Panther 1 1.100 - - LDO PTHR24023
Phylome 1 0.910 - -
RoundUp 00.000 Not matched by this tool.
SonicParanoid 1 1.000 - - X1239
SwiftOrtho 1 1.000 - -
TreeFam 00.000 Not matched by this tool.
87.920

Return to query results.
Submit another query.