DRSC/TRiP Functional Genomics Resources

Protein Alignment CRB1 and N

DIOPT Version: 10

Sequence 1: XP_016856341.1  Gene: CRB1 / 23418  HGNC ID: 2343  Length: 1451  Species: Homo sapiens
Sequence 2: NP_476859.2  Gene: N / 31293  FlyBase ID: FBgn0004647  Length: 2703  Species: Drosophila melanogaster


Alignment Length: 1462
Identity: 387/1462 (26%)
Similarity: 538/1462 (36%)
Gaps: 472/1462 (32%)
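The percentages in the summary can be rechecked from the raw counts. The following is a minimal sketch, not part of DIOPT; the function name `parse_summary` and the regular expression are illustrative assumptions. It reads each `count/total` field and recomputes the exact percentage.

```python
import re

# Summary text copied from the report above.
SUMMARY = (
    "Alignment Length:1462 Identity:387/1462 - (26%) "
    "Similarity:538/1462 - (36%) Gaps:472/1462 - (32%)"
)

def parse_summary(text):
    """Return {label: (count, total, exact_percent)} for each 'Label:count/total' field."""
    stats = {}
    for label, count, total in re.findall(r"(\w+):\s*(\d+)/(\d+)", text):
        count, total = int(count), int(total)
        stats[label] = (count, total, 100.0 * count / total)
    return stats

stats = parse_summary(SUMMARY)
for label, (count, total, pct) in stats.items():
    print(f"{label}: {count}/{total} = {pct:.1f}%")
```

The exact values come out at about 26.5%, 36.8%, and 32.3%, so the whole-number percentages shown in the report appear to be truncated rather than rounded.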


- Residues shown in green on the original page have known domain annotations, which are detailed below.


Human    23 KNSFCNKNNTRCLSNSCQNNSTCKDFSKDNDCSCS------------------------------ 57
            ||  |.:|...||.:.|||..||.|...|..|.|.                              
  Fly   288 KN--CEQNYDDCLGHLCQNGGTCIDGISDYTCRCPPNFTGRFCQDDVDECAQRDHPVCQNGATCT 350

Human    58 -----------------DTANNLDKDCD------------------------------NMKDPCF 75
                             |.:||.| ||.                              ::.|.|.
  Fly   351 NTHGSYSCICVNGWAGLDCSNNTD-DCKQAACFYGATCIDGVGSFYCQCTKGKTGLLCHLDDACT 414

Human    76 SNPCQGSATCVNTPGERSFLCKCPPGYSGTICETTIGSCGKNS-CQHGGICHQDPIYPVCICPAG 139
            ||||...|.|..:|...|:.|.|..||.|..|...|..|.:.| |:|.|||...|....|.|..|
  Fly   415 SNPCHADAICDTSPINGSYACSCATGYKGVDCSEDIDECDQGSPCEHNGICVNTPGSYRCNCSQG 479

Human   140 YAGRFCEIDHDECASSPCQNGAVCQDGIDGYSCFCVPGYQGRHCDLEVDECASDPCKNEATCLNE 204
            :.|..||.:.:||.|.||||...|.|....:.|.|:||:.|..|::::|||.|:||.|:.||.::
  Fly   480 FTGPRCETNINECESHPCQNEGSCLDDPGTFRCVCMPGFTGTQCEIDIDECQSNPCLNDGTCHDK 544

Human   205 IGRYTCICPHNYSGVNCELEIDECWSQPCLNGATCQDALGAYFCDCAPGFLGDHCELNTDECASQ 269
            |..:.|.|...::|..|::.||:|.||||.|...|.|::..|.|:|.||:.|..||:|.::|.|.
  Fly   545 INGFKCSCALGFTGARCQINIDDCQSQPCRNRGICHDSIAGYSCECPPGYTGTSCEININDCDSN 609

Human   270 PCLHGGLCVDGENRYSCNCTGSGFTGTHCETLMPLCWSKPCHNNATCEDSVDNYTCHCWPGYTGA 334
            || |.|.|:|..|.:.|.| ..|:||..|:..:..|.|.||..:..|:|.|.:|.|.|..|.:|.
  Fly   610 PC-HRGKCIDDVNSFKCLC-DPGYTGYICQKQINECESNPCQFDGHCQDRVGSYYCQCQAGTSGK 672

Human   335 QCEIDLNECNSNPCQSNGECVELSSEKQYGRITGLPSSFSYHEASGYVCICQPGFTGIHCEEDVN 399
            .||:::|||:||||.:...|::           |:.|         |.|.|.|||||.|||::|:
  Fly   673 NCEVNVNECHSNPCNNGATCID-----------GINS---------YKCQCVPGFTGQHCEKNVD 717

Human   400 ECSSNPCQNGGTCENLPGNYTCHCPFDNLSRTFYGGRDCSDILLGCTHQQCLNNGTCIPHFQDGQ 464
            ||.|:||.|.|.|.:....|.|.||     |.||.....||: ..|....|:|.|.|    :||.
  Fly   718 ECISSPCANNGVCIDQVNGYKCECP-----RGFYDAHCLSDV-DECASNPCVNEGRC----EDGI 772

Human   465 HGFSCLCPSGYTGSLCEIATTLSFEGDGFLWVKSGSVTTKGSVCNIALRFQTVQPMALLLFRSNR 529
            :.|.|.||.||||..||:.....          |.:....|..|...|...:.|.|         
  Fly   773 NEFICHCPPGYTGKRCELDIDEC----------SSNPCQHGGTCYDKLNAFSCQCM--------- 818

Human   530 DVFVKLELLSGYIHLSIQVNNQSKVLLFISHNTSDGEWHFVEVIFAEAVTLTLIDDSCKEKCIAK 594
                     .||.....:.|...    .:::...:|.              |.||.....||:.|
  Fly   819 ---------PGYTGQKCETNIDD----CVTNPCGNGG--------------TCIDKVNGYKCVCK 856

Human   595 AP---TPLESDQSICAFQNSFLGGLPVGMTSNGVALLNFYNMPSTPSFVG--CLQDIKIDWNHIT 654
            .|   ...||....|| .|..........:||   .|:| :......:.|  |.:||        
  Fly   857 VPFTGRDCESKMDPCA-SNRCKNEAKCTPSSN---FLDF-SCTCKLGYTGRYCDEDI-------- 908

Human   655 LENISSGSSLNVKAGCVRKDWCE-SQPCQSRGRCINLWLSYQCDCHRPYEGPNCLREYVAGRFGQ 718
                               |.|. |.||::...|:|:..||:|.|.:.|||.:|       ....
  Fly   909 -------------------DECSLSSPCRNGASCLNVPGSYRCLCTKGYEGRDC-------AINT 947

Human   719 DDSTGYVIFTLDESYGDTISLSMFVRTLQPSGLLLALENSTYQYIRVWLERGRLAMLTPNSPKLV 783
            ||...:                    ..|..|..|                              
  Fly   948 DDCASF--------------------PCQNGGTCL------------------------------ 962

Human   784 VKFVLNDGNVHLISLKIKPYKIELYQSSQNLGFISASTWKIEKGDVIYIGGLPDKQ-ETELNGGF 847
                  ||                      :|..|.          :.:.|...|. ||::|...
  Fly   963 ------DG----------------------IGDYSC----------LCVDGFDGKHCETDINECL 989

Human   848 FKGCIQDVRLNNQNLEFFPNPTNNASLNPVLVNVTQGC----------AGDNSCKRQT--NVGRA 900
            .:.|                 .|.|:.:..:.:.|..|          ..|..|...:  |.|..
  Fly   990 SQPC-----------------QNGATCSQYVNSYTCTCPLGFSGINCQTNDEDCTESSCLNGGSC 1037

Human   901 LTELGSRGPKYQVSLFRFCVGSWATGNTFFLSSIKPGSNPCHNGGVCHSRWDDFSCSCPALTSGK 965
            :.  |..|  |..|    |:..::..|..:..: |..||||.||..||.:.::::|.||:..:||
  Fly  1038 ID--GING--YNCS----CLAGYSGANCQYKLN-KCDSNPCLNGATCHEQNNEYTCHCPSGFTGK 1093

Human   966 ACEE-VQWCGFSPCPHGAQCQPVLQGFECIANAVFNGQSGQILFRSNGNITRELTNITFGFRTRD 1029
            .|.| |.|||.|||.:||.|..:...|.|..:|   |.:|::.                      
  Fly  1094 QCSEYVDWCGQSPCENGATCSQMKHQFSCKCSA---GWTGKLC---------------------- 1133

Human  1030 ANVIILHAEKEPEFLNISIQDSRLFFQLQSGNSFYMLSLTSLQSVNDGTWHEVTLSMTDPLSQTS 1094
                        :...||.||:    ..:.|     |||..|  .|:||                
  Fly  1134 ------------DVQTISCQDA----ADRKG-----LSLRQL--CNNGT---------------- 1159

Human  1095 RWQMEVDNETPFVTSTIATGSLNFLKDNTDIYVGDRAIDNIKGLQGCLSTIEIGGIYLSYFENVH 1159
                                    .||..:.:|             |..:....|.|.       
  Fly  1160 ------------------------CKDYGNSHV-------------CYCSQGYAGSYC------- 1180

Human  1160 GFINKPQEEQFLKISTNSVVTGCLQLNVCNSNPCLHGGNCEDIYSSYHCSCPLGWSGKHCELNID 1224
                  |:|                ::.|.|.||.:||.|.|:..:|.|.|..|:.|::||||||
  Fly  1181 ------QKE----------------IDECQSQPCQNGGTCRDLIGAYECQCRQGFQGQNCELNID 1223

Human  1225 ECFSNPCIH-GNCSDRVAAYHCTCEPGYTGVNCEVDIDNCQSHQCANGATCISHTNGYSCLCFGN 1288
            :|..|||.: |.|.|||..:.|:|.||..|:.||::.|:|:...|.|..:||....|:.|:|...
  Fly  1224 DCAPNPCQNGGTCHDRVMNFSCSCPPGTMGIICEINKDDCKPGACHNNGSCIDRVGGFECVCQPG 1288

Human  1289 FTGKFCRQSRLPSTVCGNEKTNLTCYNGG--NCTEFQTELKCMCRPGFTGEWCEKDIDECASDPC 1351
            |.|..|....       ||..:..|.|.|  :|.:......|.||||..|..||..:|.||..||
  Fly  1289 FVGARCEGDI-------NECLSNPCSNAGTLDCVQLVNNYHCNCRPGHMGRHCEHKVDFCAQSPC 1346

Human  1352 VNGGLCQDLLNKFQCLCDVAFAGERCEVDLAD 1383
            .|||.|....:...|:|:..|.|:.||:...|
  Fly  1347 QNGGNCNIRQSGHHCICNNGFYGKNCELSGQD 1378

Known Domains:


Indicated by green residues in the alignment on the original page.

Gene  Sequence        Domain       Region       External ID  Identity
CRB1  XP_016856341.1  EGF_CA       72..108      CDD:238011   15/35 (43%)
                      EGF_CA       148..183     CDD:238011   14/34 (41%)
                      EGF_CA       187..222     CDD:238011   13/34 (38%)
                      EGF_CA       225..260     CDD:238011   16/34 (47%)
                      EGF_CA       262..299     CDD:238011   15/36 (42%)
                      EGF_CA       <309..337    CDD:238011   10/27 (37%)
                      EGF_CA       339..395     CDD:238011   19/55 (35%)
                      EGF_CA       397..438     CDD:214542   16/40 (40%)
                      Laminin_G_2  514..649     CDD:460494   26/139 (19%)
                      EGF_CA       676..707     CDD:238011   13/31 (42%)
                      Laminin_G_2  743..860     CDD:460494   13/117 (11%)
                      EGF_CA       <939..968    CDD:238011   12/28 (43%)
                      Laminin_G_2  1025..1150   CDD:460494   16/124 (13%)
                      EGF_CA       <1191..1220  CDD:238011   11/28 (39%)
                      EGF_CA       1222..1257   CDD:238011   17/35 (49%)
                      EGF_CA       1259..1294   CDD:238011   11/34 (32%)
                      EGF_CA       1342..1378   CDD:238011   13/35 (37%)
N     NP_476859.2     EGF_CA       179..214     CDD:238011
                      EGF_CA       217..252     CDD:238011
                      EGF_CA       260..291     CDD:238011   2/4 (50%)
                      EGF_CA       295..329     CDD:238011   11/33 (33%)
                      EGF_CA       331..369     CDD:238011   0/37 (0%)
                      EGF_CA       449..486     CDD:238011   13/36 (36%)
                      EGF_CA       488..524     CDD:238011   14/35 (40%)
                      EGF_CA       526..562     CDD:238011   13/35 (37%)
                      EGF_CA       564..600     CDD:238011   16/35 (46%)
                      EGF_CA       602..637     CDD:238011   15/36 (42%)
                      EGF_CA       640..675     CDD:238011   12/34 (35%)
                      EGF_CA       677..713     CDD:238011   19/55 (35%)
                      EGF_CA       715..750     CDD:238011   16/39 (41%)
                      EGF_CA       753..789     CDD:238011   16/40 (40%)
                      EGF_CA       791..827     CDD:238011   8/63 (13%)
                      EGF_CA       829..865     CDD:238011   9/53 (17%)
                      EGF_CA       907..943     CDD:238011   16/62 (26%)
                      EGF_CA       946..982     CDD:238011   11/123 (9%)
                      EGF_CA       984..1020    CDD:238011   6/52 (12%)
                      EGF_CA       1027..1058   CDD:238011   8/38 (21%)
                      EGF_CA       1062..1095   CDD:238011   14/33 (42%)
                      EGF_CA       1184..1219   CDD:238011   13/34 (38%)
                      EGF_CA       1221..1257   CDD:238011   17/35 (49%)
                      EGF_CA       1259..1295   CDD:238011   11/35 (31%)
                      EGF_CA       1297..1335   CDD:238011   12/44 (27%)
                      EGF_CA       1338..1373   CDD:238011   13/34 (38%)
                      EGF_CA       1417..1450   CDD:238011
                      NL           1476..1512   CDD:197463
                      Notch        1519..1553   CDD:459658
                      Notch        1565..1593   CDD:459658
                      NOD          1598..1652   CDD:462014
                      NODP         1679..1731   CDD:462229
                      JMTM_dNotch  1719..1806   CDD:411989
                      ANK repeat   1902..1948   CDD:293786
                      ANKYR        1936..2139   CDD:440430
                      ANK repeat   1951..1981   CDD:293786
                      ANK repeat   1984..2015   CDD:293786
                      ANK repeat   2017..2048   CDD:293786
                      ANK repeat   2050..2081   CDD:293786
                      ANK repeat   2083..2114   CDD:293786
Blue background indicates that the domain is not in the aligned region.
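
The domain rows can be read programmatically to see which domains fall outside the aligned region. The sketch below is a hedged illustration, not DIOPT code: the names `ROW`, `parse_row`, and `overlaps` are hypothetical, the row layout is inferred from the table above, and the leading `<` on some regions is assumed to mark a domain whose start is truncated. The aligned range for the fly sequence (288 to 1378) is taken from the first and last fly coordinates in the alignment.

```python
import re

# One domain row: name, optional "<" before the start, start..end,
# CDD accession, and an optional "identical/aligned (pct%)" column.
ROW = re.compile(
    r"^(?P<domain>.+?)\s+(?P<partial><?)(?P<start>\d+)\.\.(?P<end>\d+)"
    r"\s+(?P<xref>CDD:\d+)(?:\s+(?P<ident>\d+)/(?P<span>\d+)\s+\(\d+%\))?$"
)

def parse_row(line):
    """Parse a single domain row (without the gene and sequence columns)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognized row: {line!r}")
    return {
        "domain": m["domain"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        "partial": m["partial"] == "<",  # assumption: "<" marks a truncated start
        "xref": m["xref"],
        "identity": (int(m["ident"]), int(m["span"])) if m["ident"] else None,
    }

def overlaps(row, aligned_start, aligned_end):
    """True if the domain shares at least one residue with the aligned range."""
    return row["start"] <= aligned_end and row["end"] >= aligned_start

# Assumed aligned range on the fly sequence, read off the alignment above.
FLY_ALIGNED = (288, 1378)

rows = [parse_row(r) for r in [
    "EGF_CA 217..252 CDD:238011",
    "EGF_CA 260..291 CDD:238011 2/4 (50%)",
    "ANK repeat 1902..1948 CDD:293786",
]]
for row in rows:
    print(row["domain"], row["start"], row["end"], overlaps(row, *FLY_ALIGNED))
```

For these three sample rows, only the domain at 260..291 overlaps the aligned range, which matches the table: the rows without an identity value are the ones lying outside the alignment.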
