DRSC/TRiP Functional Genomics Resources


Protein Alignment vkg and Col4a1

DIOPT Version: 9

Sequence 1: NP_001260071.1  Gene: vkg / 33726  FlyBaseID: FBgn0016075  Length: 1940  Species: Drosophila melanogaster
Sequence 2: NP_034061.2  Gene: Col4a1 / 12826  MGIID: 88454  Length: 1669  Species: Mus musculus


Alignment Length: 1844  Identity: 763/1844 (41%)
Similarity: 908/1844 (49%)  Gaps: 358/1844 (19%)
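
The percentages above are each count divided by the alignment length of 1844 columns. A minimal sketch for recomputing them follows; the helper name `fraction_summary` is hypothetical, and rounding to a whole percent is an assumption inferred from the reported values.

```python
# Recompute the summary percentages from the counts reported on this page.
# Assumption: percentages are rounded to the nearest whole number.

def fraction_summary(count: int, length: int) -> str:
    """Format a count as 'count/length (NN%)'."""
    pct = 100 * count / length
    return f"{count}/{length} ({pct:.0f}%)"

ALIGNMENT_LENGTH = 1844  # alignment columns, including gap columns
stats = {"Identity": 763, "Similarity": 908, "Gaps": 358}

for name, count in stats.items():
    print(f"{name}: {fraction_summary(count, ALIGNMENT_LENGTH)}")
```

The denominator is the alignment length, not the length of either protein (1940 and 1669 residues), which is why the same 1844 appears in all three ratios.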


- Green residues have known domain annotations that are detailed below.


  Fly    38 CDCKGIKGRM------GAPGPIGVPGLEGPAGDIGPPGRAGPLGEKGDVGEYGEQGEKGHRGDIG 96
            |||.|:||:.      |..|.||.||::||.|..|||      |:|||.||.|..|.||.||..|
Mouse    39 CDCHGVKGQKGERGLPGLQGVIGFPGMQGPEGPHGPP------GQKGDAGEPGLPGTKGTRGPPG 97

  Fly    97 PKGEMGYPGIMGKSGEPGTPGPRGIDGCDGR---------------------------------- 127
            ..|..|.||:.|..|:.|.|||.||.||:|.                                  
Mouse    98 AAGYPGNPGLPGIPGQDGPPGPPGIPGCNGTKGERGPLGPPGLPGFSGNPGPPGLPGMKGDPGEI 162

  Fly   128 ---------------------------PGMQGPSGAPGQNGVRGPPGKPGQQGPPGEAGEGGINS 165
                                       ||:|||.|.|   |..||||.||..|||||.|:.|.:.
Mouse   163 LGHVPGTLLKGERGFPGIPGMPGSPGLPGLQGPVGPP---GFTGPPGPPGPPGPPGEKGQMGSSF 224

  Fly   166 KGTKGNRGETGQPGGVGPPGF-----------DGDRGSKGDTGYAGLT--GEKGDPGLPGPKGDT 217
            :|.||::||.|..|..|.||.           .|::|.||:.|:.|:.  ||||:||..||:|..
Mouse   225 QGPKGDKGEQGVSGPPGVPGQAQVKEKGDFAPTGEKGQKGEPGFPGVPGYGEKGEPGKQGPRGKP 289

  Fly   218 GAVSELPYSLIGPPGAKGEPGDSLSGVLKPDDTLKGYKGYVGLQGDEGPQGPTGEQGAVGRNG-- 280
            |...|.     |..|:.|.||||               ||.||.|.:||||..||.|..|..|  
Mouse   290 GKDGEK-----GERGSPGIPGDS---------------GYPGLPGRQGPQGEKGEAGLPGPPGTV 334

  Fly   281 ---LP----GARGEIGGPGERGKPGKDGEPGRFGDKGMKG--APGWTGADGLDGSPGERGEDGFT 336
               :|    |.||..|.||.||:||..|.||..|..|..|  .||..||.|..|..||:|:.||.
Mouse   335 IGTMPLGEKGDRGYPGAPGLRGEPGPKGFPGTPGQPGPPGFPTPGQAGAPGFPGERGEKGDQGFP 399

  Fly   337 G--MPGVQGGAGPPGIYDPSLTKSLPGPIGSQGDI--------GPPGEQGPPGLPGKPGRRGPIG 391
            |  :||..|..|.||...|      |||.|..|..        ||||:|||||.||:||..|.:|
Mouse   400 GVSLPGPSGRDGAPGPPGP------PGPPGQPGHTNGIVECQPGPPGDQGPPGTPGQPGLTGEVG 458

  Fly   392 LAGQSGDPGL----NGSRGPPGRSERGEAGDYGFIGPPGPQGPPGEAGLPGRYGLHGEPGQNVVG 452
            ..||.|:..|    .|.|||||  .:|..|:.||   ||..|..|:.|||||.||.|.|     |
Mouse   459 QKGQKGESCLACDTEGLRGPPG--PQGPPGEIGF---PGQPGAKGDRGLPGRDGLEGLP-----G 513

  Fly   453 PKGEPGLNGQPGLEGYRGDRGEV----GLPGDKGLPGEGYNIVGPPGSQGPPGFRGLPGDDGYNG 513
            |:|.|||.||||.   :|:.||:    .|.||||.||.       ||..|.||..|.||.||:  
Mouse   514 PQGSPGLIGQPGA---KGEPGEIFFDMRLKGDKGDPGF-------PGQPGMPGRAGTPGRDGH-- 566

  Fly   514 LRGLPGEKGLRGDDCPVCNAGPRGPRGQEGDTGYPGSHGNRG-----AIGLTGPRGVQGLQGNPG 573
             .||||.||..|      :.|.:|.||..|..|:|||.|:.|     .:|..||.|.:|..|.||
Mouse   567 -PGLPGPKGSPG------SIGLKGERGPPGGVGFPGSRGDIGPPGPPGVGPIGPVGEKGQAGFPG 624

  Fly   574 RAGHKGLPGPAGIPGEPGK-VGAAGPDGKAIEVGSLRKGEIGDTGDSGHRGDTGDDGEKGRDGSD 637
            ..|..|||||   .||.|| |...||.|.|        |..|..|..|.:||.|..|..||.|..
Mouse   625 GPGSPGLPGP---KGEAGKVVPLPGPPGAA--------GLPGSPGFPGPQGDRGFPGTPGRPGIP 678

  Fly   638 GSKGERGETGQRGDYGDAGYQGRDGEPGRDGRDGAPGRNATTPKVYLIGEPGYDGIKGERGDDGD 702
            |.||..|:.| .|..|..|.:|.||.||..||.|:|||....      |.||..|.:|::|:.| 
Mouse   679 GEKGAVGQPG-IGFPGLPGPKGVDGLPGEIGRPGSPGRPGFN------GLPGNPGPQGQKGEPG- 735

  Fly   703 TGFKGVKGEPN-PGQIYDNTGEPGEDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIPGPVG 766
            .|..|:||:|. ||    ..|.|||.|..|..||.|.:|..|..||:|..||.||.|  :.||.|
Mouse   736 IGLPGLKGQPGLPG----IPGTPGEKGSIGGPGVPGEQGLTGPPGLQGIRGDPGPPG--VQGPAG 794

  Fly   767 AKGYP--GPTGDYGQQGAPGLPGRDGEPGLDGGIGYKGQRG--VPGQEVIQGEIGPPGRSGIKGF 827
            ..|.|  ||.|..|..|..|.||..|.||:.|..|:.|..|  :||.:..:|..|.||.:|..|.
Mouse   795 PPGVPGIGPPGAMGPPGGQGPPGSSGPPGIKGEKGFPGFPGLDMPGPKGDKGSQGLPGLTGQSGL 859

  Fly   828 PGDVGAPGQYGLAGRPGPKGVKGEQGPDGAVGQTGLPGNKGQRGDFLVGPPGPKGQPGRNGRQAP 892
            |   |.|||.|..|.||..|.|||.|..|..||.|.||..|..|  |.|..|..|.||.:|.:..
Mouse   860 P---GLPGQQGTPGVPGFPGSKGEMGVMGTPGQPGSPGPAGTPG--LPGEKGDHGLPGSSGPRGD 919

  Fly   893 HGAKGQKGEVGSLGQNG------QNGAKGSIGFSGRRGLLGNAGLQGLPGSPGIPGLPGMIGEIG 951
            .|.||.||:||..|..|      ....||..|..|.:|.:|..|.:|..|.||.||:||..|:.|
Mouse   920 PGFKGDKGDVGLPGMPGSMEHVDMGSMKGQKGDQGEKGQIGPTGDKGSRGDPGTPGVPGKDGQAG 984

  Fly   952 ERGEIGYNGRQGDIGPRGPNGEFGPKGLSGDDGPDGYPGANGLPGRKGETGNPGFPGRPGAKGVA 1016
            ..|:         .||:|..|..|..|..|..||.|..|..||||..||.|.||.||..|..|..
Mouse   985 HPGQ---------PGPKGDPGLSGTPGSPGLPGPKGSVGGMGLPGSPGEKGVPGIPGSQGVPGSP 1040

  Fly  1017 AYSGIKGDDGESGLTGPIGYPGAPGAKGQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLK 1081
            ...|.||:.|:|||.| ||.||.||.||.:|..|     ..|..|.|||.||.|..|:||..|.:
Mouse  1041 GEKGAKGEKGQSGLPG-IGIPGRPGDKGDQGLAG-----FPGSPGEKGEKGSAGTPGMPGSPGPR 1099

  Fly  1082 GQRGDRGLPGQQGRPGEPGAKGLGGYPGRNGINGLKGATGFPGPQGPKGPQGESGVVGLDGRNGQ 1146
            |..|:.|.||..|.|||.|.|||   ||.:|:.|:||..|.||..||.||.|:.|..|.||..|.
Mouse  1100 GSPGNIGHPGSPGLPGEKGDKGL---PGLDGVPGVKGEAGLPGTPGPTGPAGQKGEPGSDGIPGS 1161

  Fly  1147 IGDQGP-----RGLIGEQGEQGEQGDEGEVGIPGRLENLRDRSFYRGFTGDQGLQGERGEQGDMG 1206
            .|::|.     ||..|..|.:|::|.:||||.|             |..|..|:.|.:|||    
Mouse  1162 AGEKGEQGVPGRGFPGFPGSKGDKGSKGEVGFP-------------GLAGSPGIPGVKGEQ---- 1209

  Fly  1207 PIGFIGPPGAKGERGDIGYAGQLGFDGAEG--LKGFQGDQGPRGPPGITLPAEKGDEGVAGLDGR 1269
              ||:||||.:         ||.|..|..|  ::|.:||:||:|.||:                 
Mouse  1210 --GFMGPPGPQ---------GQPGLPGTPGHPVEGPKGDRGPQGQPGL----------------- 1246

  Fly  1270 AGRPGHFGQKGAPGPPGENGPNGAIGHRG-PQIQGPPGPQGDVGFPGAPGHNGRHGLIGPKGELG 1333
               |||.|..|.||.||.|||.|..|::| |...|.|||:||.||.|.||..|..|:.|.||::|
Mouse  1247 ---PGHPGPMGPPGFPGINGPKGDKGNQGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMG 1308

  Fly  1334 DMGRQGERGESGYAIVGRQGDIGDIGFQGEPGWDGAKGEQGYPGLPGK----NGRVGAPGPRGPT 1394
            ..|..|.:|:.|  :.|.||..||.|.||.|   |.||.||.||.||.    .|..|.|||.||.
Mouse  1309 LPGVPGFQGQKG--LPGLQGVKGDQGDQGVP---GPKGLQGPPGPPGPYDVIKGEPGLPGPEGPP 1368

  Fly  1395 GDAGWGGIDGMDGLVGPKGQPGVTYSYSMARPGDRGEPGLDGFQGEEGDGGAPGLIGFQGQRGAV 1459
                  |:.|:.|..|||||.|||.|..:  ||..|.||.||..|::|:                
Mouse  1369 ------GLKGLQGPPGPKGQQGVTGSVGL--PGPPGVPGFDGAPGQKGE---------------- 1409

  Fly  1460 GYRGDQGEVGYTGADGPQGQRGDKGYMGLTGAPGLRGLPGPQGEPAPAPPAPKS--RGFIFARHS 1522
                       ||..||.|.|      |..|.||..||||..|     ||...|  .||:..|||
Mouse  1410 -----------TGPFGPPGPR------GFPGPPGPDGLPGSMG-----PPGTPSVDHGFLVTRHS 1452

  Fly  1523 QSVHVPQCPANTNLLWEGYSL---SGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCDITNVCHFA 1584
            |:...|.||..|.:|:.||||   .||   .||.|||||.:|||:.:|:|||::.|:|.|||:||
Mouse  1453 QTTDDPLCPPGTKILYHGYSLLYVQGN---ERAHGQDLGTAGSCLRKFSTMPFLFCNINNVCNFA 1514

  Fly  1585 QNNDDSLWLSTAEPMPMTMTPIQGRDLMKYISRCVVCETTTRIIALHSQSMSIPDCPGGWEEMWT 1649
            ..||.|.||||.|||||:|.||.|.::..:||||.|||....::|:|||::.||.||.||..:|.
Mouse  1515 SRNDYSYWLSTPEPMPMSMAPISGDNIRPFISRCAVCEAPAMVMAVHSQTIQIPQCPNGWSSLWI 1579

  Fly  1650 GYSYFMSTLDNVGGVGQNLVSPGSCLEEFRAQPVIECHGHGRCNYYDALASFWLTVIEEQDQFVQ 1714
            |||:.|.|.....|.||.|.||||||||||:.|.|||||.|.||||....||||..||..:.|.:
Mouse  1580 GYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGTCNYYANAYSFWLATIERSEMFKK 1644

  Fly  1715 PRQQTLKA-DFTSKISRCTVCRRR 1737
            |...|||| :..:.:|||.||.||
Mouse  1645 PTPSTLKAGELRTHVSRCQVCMRR 1668
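
In each alignment block, the middle line marks how the Fly and Mouse residues in that column relate: `|` sits under identical residues, a blank sits where one sequence has a gap or the residues do not match, and `:` and `.` mark non-identical pairs. A minimal sketch for tallying those symbols follows, using the first ten columns of the alignment. The function name `tally_match_line` is hypothetical, and reading `:` and `.` as two grades of similarity is an assumption; this page does not define them.

```python
from collections import Counter

def tally_match_line(match_line: str) -> dict:
    """Count the symbols on the middle line of one alignment block."""
    counts = Counter(match_line)
    return {
        "identical": counts["|"],  # same residue in both sequences
        "colon": counts[":"],      # assumed: closely similar residues
        "dot": counts["."],        # assumed: weakly similar residues
        "blank": counts[" "],      # gap or unmarked mismatch
    }

# First ten aligned columns, copied from the first block of the alignment.
fly = "CDCKGIKGRM"
mouse = "CDCHGVKGQK"
match = "|||.|:||:."

print(tally_match_line(match))

# Cross-check: '|' should agree with a direct residue-by-residue comparison.
print(sum(a == b for a, b in zip(fly, mouse)))
```

Applied to every middle line in the alignment, the `identical` tally should correspond to the reported identity count, provided the page is counting in the way assumed here.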

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain                                             Region      External ID  Identity
vkg     NP_001260071.1  Collagen                                           60..119     CDD:189968   28/58 (48%)
                        Collagen                                           102..152    CDD:189968   28/110 (25%)
                        Collagen                                           277..334    CDD:189968   28/67 (42%)
                        Collagen                                           534..593    CDD:189968   29/64 (45%)
                        Collagen                                           814..871    CDD:189968   27/56 (48%)
                        Collagen                                           957..1014   CDD:189968   23/56 (41%)
                        Collagen                                           990..1049   CDD:189968   31/58 (53%)
                        Collagen                                           1070..1128  CDD:189968   27/57 (47%)
                        C4                                                 1515..1624  CDD:128421   61/111 (55%)
                        C4                                                 1625..1737  CDD:128421   58/112 (52%)
Col4a1  NP_034061.2     Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  47..1443                 636/1606 (40%)
                        Triple-helical region                              173..1440                595/1471 (40%)
                        Collagen                                           275..331    CDD:189968   31/75 (41%)
                        Collagen                                           510..585    CDD:189968   41/98 (42%)
                        Collagen                                           <645..688   CDD:189968   19/50 (38%)
                        Collagen                                           690..745    CDD:189968   25/61 (41%)
                        Collagen                                           737..788    CDD:189968   25/54 (46%)
                        Collagen                                           839..896    CDD:189968   28/59 (47%)
                        Collagen                                           876..935    CDD:189968   28/60 (47%)
                        Collagen                                           975..1032   CDD:189968   26/65 (40%)
                        Collagen                                           1058..1116  CDD:189968   28/62 (45%)
                        Collagen                                           1088..1147  CDD:189968   30/61 (49%)
                        Collagen                                           1269..1326  CDD:189968   25/58 (43%)
                        C4                                                 1446..1552  CDD:279721   58/108 (54%)
                        C4                                                 1556..1666  CDD:279721   57/109 (52%)


Information from Original Tools:


                                              Original Tool Information
Tool            Simple Score  Weighted Score  BLAST Result  Score  Score Type       Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           131           1.000  Domainoid score  I5118
eggNOG          1             0.900           -             -                       E33208_3BAFZ
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           -             -                       D41315at33208
OrthoFinder     1             1.000           -             -                       FOG0000443
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        1             0.900           -             -                       OOG6_100768
Panther         1             1.100           -             -      O                PTHR24023
Phylome         1             0.910           -             -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           -             -
TreeFam         1             0.960           -             -
Total           9             8.780
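
The closing figures read as a simple-score total of 9 (the number of tools that matched this pair) followed by a weighted-score total of 8.780. A minimal sketch for checking both totals follows. The dictionary holds the weighted scores listed for the matching tools; treating the totals as plain sums is an assumption that the arithmetic happens to confirm.

```python
from decimal import Decimal

# Weighted scores for the tools that matched vkg / Col4a1, as listed above.
# Tools reported as "Not matched by this tool." contribute 0 and are omitted.
weighted_scores = {
    "Domainoid": "1.000",
    "eggNOG": "0.900",
    "OrthoDB": "1.010",
    "OrthoFinder": "1.000",
    "orthoMCL": "0.900",
    "Panther": "1.100",
    "Phylome": "0.910",
    "SwiftOrtho": "1.000",
    "TreeFam": "0.960",
}

# Each matching tool has a simple score of 1, so the simple total is a count.
simple_total = len(weighted_scores)

# Decimal avoids binary floating-point rounding noise in the sum.
weighted_total = sum((Decimal(s) for s in weighted_scores.values()), Decimal("0"))

print(simple_total, weighted_total)
```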
