DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment kon and Cspg4

DIOPT Version :9

Sequence 1:NP_001260539.1 Gene:kon / 35104 FlyBaseID:FBgn0032683 Length:2381 Species:Drosophila melanogaster
Sequence 2:NP_112284.2 Gene:Cspg4 / 81651 RGDID:619942 Length:2326 Species:Rattus norvegicus


Alignment Length:2559 Identity:599/2559 - (23%)
Similarity:1037/2559 - (40%) Gaps:437/2559 - (17%)


- Green bases have known domain annotations that are detailed below.


  Fly    14 LAWLAILQLQTVAAMKVSLFGDGYVSMPLQEAKMSTNIRVKFRTRQENAFLFLAAGRTDYCLLRL 78
            ||.:..|.|...:....|.||:.::.:|:..|....::.::|.|.|..|.|.||||:||:.||:|
  Rat    14 LALILTLALFVRSTAPASFFGENHLEVPVPSALTRVDLLLQFSTSQPEALLLLAAGQTDHLLLQL 78

  Fly    79 ESGLISFSYKIERDVVQLRSPKKQKLNDLEWHDVAVQRFENNITLQVDGYIMRKQLPGDLAALNI 143
            :||.:.....:.::.:.|::|....|:|...|.|.:....:...|.|||.:....|....:.|.:
  Rat    79 QSGHLQVRLALGQNELSLQTPADTVLSDSTTHTVVLTVSNSWAVLSVDGVLNTSALIPKASHLKV 143

  Fly   144 HFGTFLGGVGDFTAEFLDDVI-GFRGCISDVFYNNINIIKRAKDRTSHTTSTGVAWTCSTEFEGS 207
            .:|.|:|..|.....:|..:. ..|||:.....|..|:::        ..:..|...|:.||  |
  Rat   144 PYGLFVGSSGSLDLPYLKGISRPLRGCLHSAILNGRNLLR--------PLTPDVHEGCAEEF--S 198

  Fly   208 IQDSISFMRNDSYSLMMKESYAMGE--TLSLQFRTMASAGVIFFNGG---YDFILLEIEDQHLKV 267
            ..|.:....:..:||....:::..|  ||.....|.:....:.|..|   .:||.::|.:.||:.
  Rat   199 AGDEVGLGFSGPHSLAAFPAWSTREEGTLEFTLTTRSQQAPLAFQAGDKRGNFIYVDIFEGHLRA 263

  Fly   268 TFNKAGSLVQFMTNEHISDGKWHRVFLRYNAAIAELSLDDATSGYRGTHANETKASINLEKSVFF 332
            ...|....:....:..::||:.|.|.:..:....|:|:|...:   .|......:.:....|:..
  Rat   264 VVEKGQGTMLLRNSVPVADGQPHEVSVHIDVHRLEISVDQYPT---RTFNRGVLSYLEPRGSLLL 325

  Fly   333 GGVQEEMRRRLISKGLRIN----EISFKGCMRDILVNDLPLGFAEMTISRSLALNCLWKYPCVEY 393
            ||:..|..|.|....|.:.    .||..||:.|..||...||..:..::|.:|..|..:      
  Rat   326 GGLDTEASRHLQEHRLGLTPGAANISLVGCIEDFSVNGRRLGLRDAWLTRDMAAGCRPE------ 384

  Fly   394 NPCLKSGICSQHGVDGFICYCDQSYCIKADFQGPFKIFTETSPE--------------------- 437
                                 :..|  :.:..|||:.|:..:||                     
  Rat   385 ---------------------EDEY--EEEVYGPFEAFSTLAPEAWPVMDLPEPCVPEPGLPAVF 426

  Fly   438 ---LELLYVSPMQLLEGGTAFLSPHFIDIILDLRRYPSLNEQSIIFHVVHQPKYGQLLQYSAEKA 499
               .:||.:||:.:.|||||:|....:...|||.. ..|.:..::|.|....::|:|       .
  Rat   427 ANFTQLLTISPLVVAEGGTAWLEWRHVQPTLDLTE-AELRKSQVLFSVSQGARHGEL-------E 483

  Fly   500 IFVP----CRTFNLVDLATDKLKYVHNGQENFNDHATLDMQIFGDVHKIPENILGKHRFLLHANI 560
            :.:|    .:.|.|:|:...|.::||:|.|:.:|...|::.:.... .:|..:.....::|...:
  Rat   484 LDIPGAQTRKMFTLLDVVNRKARFVHDGSEDTSDQLMLEVSVTSRA-PVPSCLRRGQIYILPIQV 547

  Fly   561 TPINDPPQLRLHSHKILRVIEGIERVLDVDLFNIDDPDSEPGNLIYTIL------PTQSPQETFG 619
            .|:||||::......::.::|..::.|..::|...||||....|...:|      |.:...:.  
  Rat   548 NPVNDPPRIVFPHGSLMVILEHTQKPLGPEIFQAYDPDSACEGLTIQLLGVSASVPVEHRDQP-- 610

  Fly   620 CFIVGGATTSAFSQAEVNVGKVSYLYNSTTAESFSYELQLQVSDGIETSETVYLP-VSVHPLELR 683
                 |...:.||..::..|.:.|::....|:    :|..:||||::.|....|. |:|.| .::
  Rat   611 -----GEPVTEFSCRDLEAGNIVYVHRGGPAQ----DLTFRVSDGMQASGPATLKVVAVRP-AIQ 665

  Fly   684 LVNNTGLIMIHKSSLPISTANLSIGTNAVDDHIDIRYDIVKAPQHGVLQRLRQIDGSWVNVDW-- 746
            :::||||.:...|:..|..||||:.||||...:.:.:.:....|.|.||  :|..|.....:|  
  Rat   666 ILHNTGLRLAQGSAAAILPANLSVETNAVGQDVSVLFRVTGTLQFGELQ--KQGAGGVEGTEWWD 728

  Fly   747 ---FSDSQLLLGHIRYL------HSSDFPWQDEFKFIASFGFVTTQTFDFRITFTRLRITSTRPS 802
               |....:..|.:|||      |:.|  ..::.......|..|.....|.:|..|..:...|..
  Rat   729 TLAFHQRDVEQGRVRYLSTDPQHHTQD--TVEDLTLEVQVGQETLSNLSFPVTIQRATVWMLRLE 791

  Fly   803 QISINGSREILLTSDVLS---YETTPIGSFARSVIYKITKPTRYGGIYVEGSRKAAKKLDSFTQQ 864
            .:......:..|||..|.   .|....|.:.....|::.:..|.|.:.::|:|.:..:  ||:|.
  Rat   792 PLHTQNPHQETLTSAHLEASLEEEGEGGPYPHIFHYELVQAPRRGNLLLQGTRLSDGQ--SFSQS 854

  Fly   865 DIEKRRIRYQTHHTSYSSFSDHLEFVVSVAECDDVAGVLEINYRPPDELINKLGYQNHEP----- 924
            |::..|:.|:....:..:..|...|.|:...          ::.|.......:|...:.|     
  Rat   855 DLQAGRVTYRATTRTSEAAEDSFRFRVTSPP----------HFSPLYTFPIHIGGDPNAPVLTNV 909

  Fly   925 -LQVQEGERALITKNHFGIRFNIYESLQFQVSLSPEHGVICKYDEQTGLTTPVEMFTLEQLFRND 988
             |.|.||...:::.:|..::.....|..::|...|.||.:...|.: |..|||..||.|.|....
  Rat   910 LLMVPEGGEGVLSADHLFVKSLNSASYLYEVMEQPHHGSLAWRDPK-GRATPVTSFTNEDLLHGR 973

  Fly   989 IYYCHDDTESTRDTFELLIL-----SGDETDLQFVSNMEVHIKLVNDNEPYRTAIERVFHVVRNG 1048
            :.|.|||:|:..|....:..     |||....:......|.|:.|||:.|.:| |.|||||.|.|
  Rat   974 LVYQHDDSETIEDDIPFVATRQGEGSGDMAWEEVRGVFRVAIQPVNDHAPVQT-ISRVFHVARGG 1037

  Fly  1049 IRTLNPTVLQYLDADVNTNHTDIHYIHVSSTNG---AFYKSGHYIDSFTQDDIANRRIMFQHTGA 1110
            .|.|....:.:.|||...:...:.........|   |..:....|..|||:|:..::::|.|:||
  Rat  1038 QRLLTTDDVAFSDADSGFSDAQLVLTRKDLLFGSIVAMEEPTRPIYRFTQEDLRKKQVLFVHSGA 1102

  Fly  1111 DSGTASFIVTDREHEVNGLLEIRASDPFVSMMATNASIVQEGKFVVLKNKDFILETNLDMKL-DE 1174
            |.|.....|:|.:|:...:||::||:|::.:..:::.:|.:|....:......|:||||::. :|
  Rat  1103 DHGWLQLQVSDGQHQATAMLEVQASEPYLHVANSSSLVVPQGGQGTIDTAVLHLDTNLDIRSGNE 1167

  Fly  1175 IYYEVIKPPSYGILMYLSRANEGENGTTIYKATNYTSLSNFTHLDIERERLVY-WNTEIASMDKV 1238
            ::|.|...|.:|.|:     .:|:            |:::|:..|:....::| .|..::..|.:
  Rat  1168 VHYHVTAGPHWGQLL-----RDGQ------------SVTSFSQRDLLDGAILYSHNGSLSPQDTL 1215

  Fly  1239 RYRVHIKNITAEGEVMFRIYPSAYWEPLQVKQNHTLYVEESTSVLISRDVLEVVHPNISPGDITF 1303
            ...|....:.....:...|.......|||:.|:..:||.:..:..|.||.||||...:.|.||.|
  Rat  1216 ALSVAAGPVHTSTVLQVTIALEGPLAPLQLVQHKRIYVFQGEAAEIRRDQLEVVQEAVLPADIMF 1280

  Fly  1304 LVTSSPMHGYLEMQS--MSFD-----DEYNCKVFDQSSVNTEKMFYIQAGVNQSTDYFVFDVTNG 1361
            .:.|.|..|||.|.|  .|.|     |...|  |.|.::|:.::.|:.:.....:|.|..||.:|
  Rat  1281 SLRSPPNAGYLVMVSHGASADGPPSLDPVQC--FSQEAINSGRVLYLHSRPGAWSDSFSLDVASG 1343

  Fly  1362 I-TWLRQLMIKIVIIPEKLYMHSNIISVVEGKTVQLNPTDIQ---PYSEYYRGKILEYIVTITPS 1422
            : ..|..:.:::.::|..:.:.....||.||.|..|.|..||   ||.....|.:|:  |...|.
  Rat  1344 LGDPLEGISVELEVLPTVIPLDVQNFSVPEGGTRTLAPPLIQITGPYFPTLPGLVLQ--VLEPPQ 1406

  Fly  1423 SGHV----LAGNSKVKRFTQKQLEQGSIQYVHNGSENATDSITLVAMA--RNKESVPFELEFAVV 1481
            .|.:    ...:..:..|:.:::|:..|:|||:|||..||...|:|.|  .:::|.|......::
  Rat  1407 HGALQKEDRPQDGTLSTFSWREVEEQLIRYVHDGSETQTDGFILLANASEMDRQSQPMAFTITIL 1471

  Fly  1482 QVNDEEPMMVTNTGLQVWNGGRYVIKNTDLLAQDYDTPPENLTFVVNHIYGGYLARKSAPHQKIE 1546
            .|||:.|::.||||||:|.|....|....|...|.|:.||:|.:.:.....|.:|.:.||..:..
  Rat  1472 PVNDQPPVITTNTGLQIWEGAIVPIPPEALRGIDSDSGPEDLVYTIEQPSNGRIALRVAPDAEAH 1536

  Fly  1547 HFTQAQINLEEIYFMHDSNSRRNELSFVVTDGLFNTTTQMLNIEI-KPIEILAEHNENLHVFPLT 1610
            .|||||::...:.|.| ..:......|.::||:..:......:.. |.:.:..|.:..|.|.|.:
  Rat  1537 RFTQAQLDSGLVLFSH-RGALEGGFHFDLSDGVHTSPGHFFRVVAQKQVLLSLEGSRKLTVCPES 1600

  Fly  1611 KKQILRDYLHFKCS--DEEREIRYNITVPPSLGRIVNEFIDNGFTKE-VSEFTQNDVDNGHIFYE 1672
            .:.:....|....|  .:.|.:.|.:...|.|||:::  ...|..:| :..|||.:|:.|:|.||
  Rat  1601 VQPLSSQSLSASSSTGSDPRHLLYQVVRGPQLGRLLH--AQQGSAEEALVNFTQAEVNAGNILYE 1663

  Fly  1673 HTAVIMEF-RTNDSF--------------------YFDVVA-ERSDRLLNQKFNIEISVSSGGLL 1715
            |......| ..:|:.                    .||... :|..||...|          ||.
  Rat  1664 HEISSEPFWEAHDTIGLLLSSSPARDLAATLAVTVSFDAACPQRPSRLWRNK----------GLW 1718

  Fly  1716 RFLPVNKLNVDEGGSVPI---KLDFSKILEYLKTKAGINNPELFIEAIQKPTHGNIGLGHEFKHM 1777
                     |.||....|   .||.:.:|..:.......:..|| :..|.||.|.:.:..|..|.
  Rat  1719 ---------VPEGQRAKITVAALDAANLLASVPASQRGRHDVLF-QITQFPTRGQLLVSEEPLHA 1773

  Fly  1778 QRYH--PSDFFTKKVYYIHDHSDTLEDTILMSVYLTQG------------NIFLCNLTIPVSINP 1828
            :|.|  .|:....::.|.|....|.:|......:| ||            ..|:      :::..
  Rat  1774 RRPHFLQSELTAGQLVYAHGGGGTQQDGFRFRAHL-QGPTGASVAGPQTSEAFV------ITVRD 1831

  Fly  1829 INDQPFHLVTHLPQMTVVEGENRTITRNDLLTEDADTPPEEIIYDVMSGPTLGVLRKITNDGRPE 1893
            :|::|......:| :.:..|....::|..|...|.|:.|.||.|:|...|..|.|....::..| 
  Rat  1832 VNERPPQPQASIP-LRITRGSRAPVSRAQLSVVDPDSAPGEIEYEVQRAPHNGFLSLAGDNTGP- 1894

  Fly  1894 DLLAFSNQFTQADINNDRIIYVHFGMPQSTTFCFTVSDGQSNPAYEIFTIKI--DSIHLQPSAMQ 1956
                 ...|||||::..|:.:|..|...:..|..::|||.|.|......:.:  .:|.:|   ::
  Rat  1895 -----VTHFTQADVDAGRLAFVANGSSVAGVFQLSMSDGASPPIPMSLAVDVLPSTIEVQ---LR 1951

  Fly  1957 APVKVQQGATTAPMRLDHIGVSTNVHMERLSYNVTNSPLFGIIVYKHQPTLRFTQQQLETSQISY 2021
            ||::|.|....:.:....:.|.::.....::|.:|..||:|.::...||...|:|.|::...:.:
  Rat  1952 APLEVPQALGRSSLSRQQLQVISDREEPDVAYRLTQGPLYGQVLVGGQPASAFSQLQVDQGDVVF 2016

  Fly  2022 MQTDLNRSNDSFQVSAYVPGTNYVAQVDVVMEVEPVIQINDIVMKEAESSGGKIKLITSLDNDNP 2086
            ..|:.:.|.|.|:|.|...|.|..|.|:|.  |:.::.:         .:||.....|:|..| |
  Rat  2017 AFTNFSSSQDHFKVLALARGVNASATVNVT--VQALLHV---------WAGGPWPQGTTLRLD-P 2069

  Fly  2087 LSLKLNKF------NPKFVITRMPSTGQIRKIIRSTGLTTDGQSDRSTN----IFSYKELRSGVV 2141
            ..|..::.      .|:|.:...|..|::.::       :.|:::..||    .|:.::|..|.:
  Rat  2070 TVLDASELANRTGSMPRFRLLEGPRYGRVVRV-------SQGRAESRTNQLVEDFTQQDLEEGRL 2127

  Fly  2142 YF-VPHESVEETEADHDSFDYQLLIKTVQPAQA------------------TVSIEYRSRVE--- 2184
            .. |.......|....|....:|....|.||.|                  .:|:...:|.|   
  Rat  2128 GLEVGRPEGRSTGPTGDRLTLELQATGVPPAVALLDFATEPYHAAKFYKVTLLSVPEAARTETEK 2192

  Fly  2185 ---ETETVHLGSAGLS----------INYLAIGCAIFIVLICLLIVLLIL---------KIRKLR 2227
               .|.|...|.|..|          :.:|.......|:.:||:::||.|         |..|..
  Rat  2193 TGKSTPTGQPGQAASSPMPTVAKSGFLGFLEANMFSVIIPVCLVLLLLALILPLLFYLRKRNKTG 2257

  Fly  2228 KHKADISKDQPPALPCPPDLTSVSPQHHLHHHHGHHYASSEADSVPATGNSTPLPGFSNIPHCKI 2292
            ||...:             ||                      :.|..|.:.....|..:...:.
  Rat  2258 KHDVQV-------------LT----------------------AKPRNGLAGDTETFRKVEPGQA 2287

  Fly  2293 IPVESYKHEYPDYDPDEEETDDQCDPQQMMLPQRYNPYHMEHDAWSSSCDMANDFAGYASVPQSI 2357
            ||:.:...:.|       ....|.||:.:.                           :...|   
  Rat  2288 IPLTTVPGQGP-------PPGGQPDPELLQ---------------------------FCRTP--- 2315

  Fly  2358 SGSVSSPPSAPPTNPLLRRNQYWV 2381
                         ||.||..||||
  Rat  2316 -------------NPALRNGQYWV 2326

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
konNP_001260539.1 LamG 30..176 CDD:238058 41/146 (28%)
LamG 211..365 CDD:238058 34/162 (21%)
Cadherin_3 507..664 CDD:292802 36/162 (22%)
Cadherin_3 631..776 CDD:292802 43/156 (28%)
Cadherin_3 747..892 CDD:292802 32/153 (21%)
Cadherin_3 861..1005 CDD:292802 36/149 (24%)
Cadherin_3 979..1121 CDD:292802 47/149 (32%)
Cadherin_3 1093..1242 CDD:292802 38/150 (25%)
Cadherin_3 <1264..1361 CDD:292802 36/103 (35%)
Cadherin_3 1330..1464 CDD:292802 39/141 (28%)
Cadherin_3 1436..1578 CDD:292802 47/143 (33%)
Cadherin_3 1789..1932 CDD:292802 35/154 (23%)
Cadherin_3 2009..2163 CDD:292802 35/164 (21%)
Cspg4NP_112284.2 Globular or compact configuration stabilized by disulfide bonds 30..640 149/671 (22%)
Neurite growth inhibition 30..640 149/671 (22%)
Laminin_G_1 55..182 CDD:395008 38/126 (30%)
LamG 204..362 CDD:238058 34/160 (21%)
Cadherin_3 <437..524 CDD:406568 26/94 (28%)
Cadherin_3 537..646 CDD:406568 25/119 (21%)
CSPG 2. /evidence=ECO:0000255|PROSITE-ProRule:PRU01201, ECO:0000269|PubMed:12220645 554..646 20/102 (20%)
Interaction with COL6A2 575..1044 126/498 (25%)
Interaction with COL5A1 632..1450 225/861 (26%)
Cadherin_3 665..744 CDD:406568 23/80 (29%)
Cadherin_3 <826..882 CDD:406568 13/57 (23%)
Cadherin_3 888..993 CDD:406568 28/105 (27%)
Cadherin_3 1007..1114 CDD:406568 34/107 (32%)
Cadherin_3 1132..1219 CDD:406568 20/103 (19%)
Cadherin_3 1244..1340 CDD:406568 32/97 (33%)
Cadherin_3 1360..1453 CDD:406568 28/94 (30%)
CSPG 9. /evidence=ECO:0000255|PROSITE-ProRule:PRU01201, ECO:0000269|PubMed:12220645 1360..1453 28/94 (30%)
Cadherin_3 1460..1567 CDD:406568 33/107 (31%)
CSPG 10. /evidence=ECO:0000255|PROSITE-ProRule:PRU01201, ECO:0000269|PubMed:12220645 1477..1567 28/90 (31%)
CSPG 11. /evidence=ECO:0000255|PROSITE-ProRule:PRU01201, ECO:0000269|PubMed:12220645 1585..1683 25/99 (25%)
Neurite growth inhibition 1590..2225 152/692 (22%)
Cysteine-containing 1591..2225 152/691 (22%)
Cadherin_3 1713..1806 CDD:406568 25/112 (22%)
Cadherin_3 1819..1928 CDD:406568 28/121 (23%)
Cadherin_3 2042..2127 CDD:406568 20/103 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2187..2210 7/22 (32%)
PDZ-binding 2324..2326 1/1 (100%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C166353756
Domainoid 1 1.000 75 1.000 Domainoid score I8835
eggNOG 1 0.900 - - E1_KOG3597
Hieranoid 1 1.000 - -
Homologene 00.000 Not matched by this tool.
Inparanoid 1 1.050 555 1.000 Inparanoid score I1071
OMA 1 1.010 - - QHG46545
OrthoDB 1 1.010 - - D119072at33208
OrthoFinder 1 1.000 - - FOG0003479
OrthoInspector 1 1.000 - - otm46034
orthoMCL 1 0.900 - - OOG6_106870
Panther 00.000 Not matched by this tool.
Phylome 1 0.910 - -
SonicParanoid 1 1.000 - - X2369
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 1 0.960 - -
1312.670

Return to query results.
Submit another query.