DRSC/TRiP Functional Genomics Resources



Protein Alignment NCAN and Ncan

DIOPT Version: 10

Sequence 1:NP_004377.2 Gene:NCAN / 1463 HGNCID:2465 Length:1321 Species:Homo sapiens
Sequence 2:NP_031815.2 Gene:Ncan / 13004 MGIID:104694 Length:1268 Species:Mus musculus


Alignment Length: 1360
Identity: 889/1360 (65%)
Similarity: 993/1360 (73%)
Gaps: 131/1360 (9%)
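
The summary figures above can be recomputed from the alignment rows themselves. The following is a minimal sketch, not DIOPT's own code: the function names `percent` and `alignment_stats` are hypothetical, and it assumes that `|` in the match row marks an identical pair, `:` a similar pair, `.` a dissimilar pair, and a blank a gap, which is consistent with the first alignment block on this page. It also assumes percentages are truncated rather than rounded, inferred only from the gap figure (131/1360 is about 9.6%, reported as 9%). Row labels and coordinates must be stripped before calling it, and gaps in the sequence rows are expected as `-`.

```python
def percent(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumption based on 131/1360 -> 9%)."""
    return count * 100 // total


def alignment_stats(seq1: str, match_line: str, seq2: str) -> dict:
    """Count identities, similarities and gaps for one aligned pair of rows.

    seq1 and seq2 are the gapped sequence rows; match_line is the row
    printed between them.
    """
    # Trailing blanks in the match row are often lost in plain-text copies.
    match_line = match_line.ljust(len(seq1))
    if not (len(seq1) == len(match_line) == len(seq2)):
        raise ValueError("all three rows must have the same length")

    length = len(seq1)
    identical = match_line.count("|")
    # Assumption: similarity includes identical pairs plus ':' pairs.
    similar = identical + match_line.count(":")
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")

    return {
        "length": length,
        "identity": identical,
        "similarity": similar,
        "gaps": gaps,
        "identity_pct": percent(identical, length),
        "similarity_pct": percent(similar, length),
        "gaps_pct": percent(gaps, length),
    }


if __name__ == "__main__":
    # Toy rows (not taken from the alignment) to show the call.
    print(alignment_stats("MGAP-FVW", "|||. : |", "MGAGSV-W"))
    # Reported totals from this page, run through the same percentage rule.
    print(percent(889, 1360), percent(993, 1360), percent(131, 1360))
```

To apply this to the full alignment, the blocks would be concatenated row by row (all Human rows, all match rows, all Mouse rows) before calling `alignment_stats`.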


- Green residues have known domain annotations, which are detailed below.


Human     1 MGAPFVWALGLLMLQMLLFVAGEQGTQDITDASERGLHMQKLGSGSVQAALAELVALPCLFTLQP 65
            |||..|||.|||:|.:||.|||:|.||| |.|:|:||.|.|.|||.|:|||||||||||.|||||
Mouse     1 MGAGSVWASGLLLLWLLLLVAGDQDTQD-TTATEKGLRMLKSGSGPVRAALAELVALPCFFTLQP 64

Human    66 RPSAARDAPRIKWTKVRTASGQRQDLPILVAKDNVVRVAKSWQGRVSLPSYPRRRANATLLLGPL 130
            |.|:.||.||||||||:|||||||||||||||||||||||.||||||||:|||.|||||||||||
Mouse    65 RLSSLRDIPRIKWTKVQTASGQRQDLPILVAKDNVVRVAKGWQGRVSLPAYPRHRANATLLLGPL 129

Human   131 RASDSGLYRCQVVRGIEDEQDLVPLEVTGVVFHYRSARDRYALTFAEAQEACRLSSAIIAAPRHL 195
            |||||||||||||:|||||||||.|||||||||||:|||||||||||||||||||||.|||||||
Mouse   130 RASDSGLYRCQVVKGIEDEQDLVTLEVTGVVFHYRAARDRYALTFAEAQEACRLSSATIAAPRHL 194

Human   196 QAAFEDGFDNCDAGWLSDRTVRYPITQSRPGCYGDRSSLPGVRSYGRRNPQELYDVYCFARELGG 260
            ||||||||||||||||||||||||||||||||||||||||||||||||:||||||||||||||||
Mouse   195 QAAFEDGFDNCDAGWLSDRTVRYPITQSRPGCYGDRSSLPGVRSYGRRDPQELYDVYCFARELGG 259

Human   261 EVFYVGPARRLTLAGARAQCRRQGAALASVGQLHLAWHEGLDQCDPGWLADGSVRYPIQTPRRRC 325
            ||||||||||||||||||||:||||||||||||||||||||||||||||||||||||||||||||
Mouse   260 EVFYVGPARRLTLAGARAQCQRQGAALASVGQLHLAWHEGLDQCDPGWLADGSVRYPIQTPRRRC 324

Human   326 GGPAPGVRTVYRFANRTGFPSPAERFDAYCFRAHHPTSQHGDLETPSSGDEGEILSAEGPPVREL 390
            ||||||||||||||||||||:|..|||||||||||.|:||||.|.|||||||||:||||||.|||
Mouse   325 GGPAPGVRTVYRFANRTGFPAPGARFDAYCFRAHHHTAQHGDSEIPSSGDEGEIVSAEGPPGREL 389

Human   391 EPTLEEEEVVTPDFQEPLVSSGEEETLILEEKQESQQTLSPTPGDPMLASWPTGEVWLSTVAP-- 453
            :|:|.|:||:.|||||||:||||.|...|...|..::||..|||.|.|||||:.|.||.|.||  
Mouse   390 KPSLGEQEVIAPDFQEPLMSSGEGEPPDLTWTQAPEETLGSTPGGPTLASWPSSEKWLFTGAPSS 454

Human   454 ----SPSDMGAGTAASS--HTEVAPTDPMPRRRGRFKGLNGRYFQQQEPEPGLQGGMEASAQPPT 512
                ||||||....|::  .|:||||..|  |||||||||||:||||.||..|....|.||||||
Mouse   455 MGVSSPSDMGVDMEATTPLGTQVAPTPTM--RRGRFKGLNGRHFQQQGPEDQLPEVAEPSAQPPT 517

Human   513 SEAAVNQMEPPLAMAVTEMLGSGQSRSPWADLTNEVDMPGAGSAGGKSSPEPWLWPPTMVPPSIS 577
            ..|..|.|.|   .|.||...|.||.||||.||||||.|||||.|.:|.||..:|.|:::.||:.
Mouse   518 LGATANHMRP---SAATEASESDQSHSPWAILTNEVDEPGAGSLGSRSLPESLMWSPSLISPSVP 579

Human   578 GHSRAPVLELEKAEGPSARPATPDLFWSPLEATVSAPSPAPWEAFPVAT--------SPDLPMMA 634
            .....|..:...||.||.:.|.|.|...|.|.  .||||.|.||....:        |||.|::|
Mouse   580 STDSTPSAKPGAAEAPSVKSAIPHLPRLPSEP--PAPSPGPSEALSAVSLQASSADGSPDFPIVA 642

Human   635 MLRGPKEWMLPHPTPISTEANRVEAHGEATATAPPSPAAETKVYSLPL--------SLTPTGQGG 691
            |||.||.|:||..|.:..           ....|.|||:       ||        ::.|...|.
Mouse   643 MLRAPKLWLLPRSTLVPN-----------MTPVPLSPAS-------PLPSWVPEEQAVRPVSLGA 689

Human   692 EAMPT------------TPESPRADFRETGETSPAQVNKAEHSSSSPWPSV-NRNVAVGFVPTET 743
            |.:.|            :..||.||..|...||..:..|  |..|.||.|: :.||.:..||::.
Mouse   690 EDLETPFQTTIAAPVEASHRSPDADSIEIEGTSSMRATK--HPISGPWASLDSSNVTMNPVPSDA 752

Human   744 ATEPTGLRGIPGSESGVFDTAESPTSGLQATVDEVQDPWPSVYSKGLDASSPSAPLGSPGVFLVP 808
                    ||.|:||||.|...|||||.||||::|...|..:..:|||..|.|.|:.:.||.:  
Mouse   753 --------GILGTESGVLDLPGSPTSGGQATVEKVLATWLPLPGQGLDPGSQSTPMEAHGVAV-- 807

Human   809 KVTPNLEPWVATDEGPTVNPMDSTVTPAPSDASGIWEPGSQVFEEAESTTLSPQVALDTSIVTPL 873
                ::||.||.:.|.|..||::|....||.|...||      .|:.|...|..:|:        
Mouse   808 ----SMEPTVALEGGATEGPMEATREVVPSTADATWE------SESRSAISSTHIAV-------- 854

Human   874 TTLEQGDKVGVPAMSTLGSSSSQPHPEPEDQVETQGTSGA--SVPPHQSSPLGKPAVPPGTPTAA 936
             |:.:..  |:|   ||.|:||:.||||:.|:..|.:...  ::|.|..|.|..|          
Mouse   855 -TMARAQ--GMP---TLTSTSSEGHPEPKGQMVAQESLEPLNTLPSHPWSSLVVP---------- 903

Human   937 SVGESASVSSGEPTVPWDPSSTLLPVTLGIEDFELEVLAGSPGVESFWEEVASGEEPALPGTPMN 1001
             :.|.|||||||||..||..|||:||:||:::.:|.|:|.||.|:.||||||||:|..       
Mouse   904 -MDEVASVSSGEPTGLWDIPSTLIPVSLGLDESDLNVVAESPSVKGFWEEVASGQEDP------- 960

Human  1002 AGAEEVHSDPCENNPCLHGGTCNANGTMYGCSCDQGFAGENCEIDIDDCLCSPCENGGTCIDEVN 1066
                   :||||||||||||||:.|||:||||||||:||||||||||||||||||||||||||||
Mouse   961 -------TDPCENNPCLHGGTCHTNGTVYGCSCDQGYAGENCEIDIDDCLCSPCENGGTCIDEVN 1018

Human  1067 GFVCLCLPSYGGSFCEKDTEGCDRGWHKFQGHCYRYFAHRRAWEDAEKDCRRRSGHLTSVHSPEE 1131
            ||:||||||||||.|||||||||||||||||||||||||||||||||:|||||:|||||||||||
Mouse  1019 GFICLCLPSYGGSLCEKDTEGCDRGWHKFQGHCYRYFAHRRAWEDAERDCRRRAGHLTSVHSPEE 1083

Human  1132 HSFINSFGHENTWIGLNDRIVERDFQWTDNTGLQFENWRENQPDNFFAGGEDCVVMVAHESGRWN 1196
            |.|||||||||:|||||||.||||||||||||||:|||||.||||||||||||||||||||||||
Mouse  1084 HKFINSFGHENSWIGLNDRTVERDFQWTDNTGLQYENWREKQPDNFFAGGEDCVVMVAHESGRWN 1148

Human  1197 DVPCNYNLPYVCKKGTVLCGPPPAVENASLIGARKAKYNVHATVRYQCNEGFAQHHVATIRCRSN 1261
            ||||||||||||||||||||||||||||||:|.||.||||||||||||:|||:||.|||||||:|
Mouse  1149 DVPCNYNLPYVCKKGTVLCGPPPAVENASLVGVRKIKYNVHATVRYQCDEGFSQHRVATIRCRNN 1213

Human  1262 GKWDRPQIVCTKPRRSHRMRRHHHHHQHHHQHHHHKSRKERRKHKKHPTEDWEKDEGNFC 1321
            ||||||||:|.|||||||||||     |||.|.|||.|||.||||:||.||||||||:||
Mouse  1214 GKWDRPQIMCIKPRRSHRMRRH-----HHHPHRHHKPRKEHRKHKRHPAEDWEKDEGDFC 1268

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence  Domain  Region  External ID  Identity
NCAN NP_004377.2 Ig_Neurocan 41..162 CDD:409483 106/120 (88%)
Ig strand A 41..44 CDD:409483 1/2 (50%)
Ig strand A' 47..49 CDD:409483 1/1 (100%)
Ig strand B 53..61 CDD:409483 7/7 (100%)
Ig strand C 75..81 CDD:409483 5/5 (100%)
Ig strand C' 90..93 CDD:409483 2/2 (100%)
Ig strand D 111..116 CDD:409483 4/4 (100%)
Ig strand E 122..128 CDD:409483 5/5 (100%)
Ig strand F 136..143 CDD:409483 6/6 (100%)
Ig strand G 153..162 CDD:409483 7/8 (88%)
Link_domain_CSPGs_modules_1_3 160..254 CDD:239594 90/93 (97%)
Link_domain_CSPGs_modules_2_4 261..356 CDD:239597 90/94 (96%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 362..414 39/51 (76%)
PHA03247 <421..964 CDD:223021 228/581 (39%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 456..479 11/24 (46%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 494..513 10/18 (56%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 532..579 26/46 (57%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 682..731 18/69 (26%)
O-glycosylated at one site 708..712 0/3 (0%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 820..844 8/23 (35%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 876..955 29/80 (36%)
EGF 1012..1042 CDD:394967 25/29 (86%)
EGF_CA 1046..1082 CDD:238011 33/35 (94%)
CLECT 1088..1211 CDD:470576 115/122 (94%)
CCP 1215..1271 CDD:153056 48/55 (87%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1275..1321 33/45 (73%)
Ncan NP_031815.2 Ig 40..161 CDD:472250 106/120 (88%)
Ig strand B 54..58 CDD:409483 3/3 (100%)
Ig strand C 74..78 CDD:409483 3/3 (100%)
Ig strand E 122..126 CDD:409483 3/3 (100%)
Ig strand F 136..141 CDD:409483 4/4 (100%)
Ig strand G 154..157 CDD:409483 2/2 (100%)
Link_domain_CSPGs_modules_1_3 159..253 CDD:239594 90/93 (97%)
Link_domain_CSPGs_modules_2_4 260..355 CDD:239597 90/94 (96%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 363..391 23/27 (85%)
PHA03247 <400..851 CDD:223021 204/497 (41%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 406..442 19/35 (54%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 472..540 40/72 (56%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 574..630 19/57 (33%)
EGF 964..994 CDD:394967 25/29 (86%)
EGF_CA 998..1034 CDD:238011 33/35 (94%)
CLECT 1040..1163 CDD:470576 115/122 (94%)
CCP 1167..1224 CDD:153056 48/56 (86%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1228..1268 32/44 (73%)
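
The domain rows above follow a regular pattern (name, residue range, optional CDD identifier, identity fraction), so they can be read into records for filtering or comparison. This is a minimal sketch under stated assumptions: `DomainRow` and `parse_domain_row` are hypothetical names, the regular expression is inferred from the rows on this page only, and a leading `<` on a range is simply recorded as a flag without interpreting it.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Pattern inferred from the rows on this page (an assumption, not a spec).
ROW = re.compile(
    r"^(?P<name>.+?)\s+"
    r"(?P<partial><)?(?P<start>\d+)\.\.(?P<end>\d+)"
    r"(?:\s+(?P<ext>CDD:\d+))?"
    r"\s+(?P<matched>\d+)/(?P<total>\d+)\s+\((?P<pct>\d+)%\)$"
)


@dataclass
class DomainRow:
    name: str                   # domain or feature label
    start: int                  # first residue of the region
    end: int                    # last residue of the region
    partial: bool               # True when the range is written with '<'
    external_id: Optional[str]  # e.g. "CDD:470576", or None if absent
    matched: int                # numerator of the identity fraction
    total: int                  # denominator of the identity fraction
    percent: int                # percentage as printed on the page


def parse_domain_row(line: str) -> Optional[DomainRow]:
    """Parse one domain row; return None for headers or other text."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return DomainRow(
        name=m.group("name"),
        start=int(m.group("start")),
        end=int(m.group("end")),
        partial=m.group("partial") is not None,
        external_id=m.group("ext"),
        matched=int(m.group("matched")),
        total=int(m.group("total")),
        percent=int(m.group("pct")),
    )


if __name__ == "__main__":
    for text in (
        "CLECT 1088..1211 CDD:470576 115/122 (94%)",
        "PHA03247 <421..964 CDD:223021 228/581 (39%)",
        "Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 362..414 39/51 (76%)",
    ):
        print(parse_domain_row(text))
```

The first row of each gene block also carries the gene symbol and accession in front of the domain name; this sketch leaves them inside `name`, so a caller would need to split them off separately.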
