DRSC/TRiP Functional Genomics Resources



Protein Alignment N and Crb1

DIOPT Version: 9

Sequence 1: NP_001245510.1  Gene: N / 31293  FlyBaseID: FBgn0004647  Length: 2703  Species: Drosophila melanogaster
Sequence 2: NP_573502.2  Gene: Crb1 / 170788  MGIID: 2136343  Length: 1405  Species: Mus musculus


Alignment Length: 1415
Identity: 376/1415 (26%)
Similarity: 518/1415 (36%)
Gaps: 466/1415 (32%)
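
The percentages in the summary can be reproduced from the raw counts. The sketch below assumes the page truncates rather than rounds, which is consistent with the figures shown (376/1415 is about 26.57% and is reported as 26%). The function name `truncated_percent` is illustrative and not part of DIOPT.

```python
def truncated_percent(count, total):
    """Integer percentage, truncated toward zero (assumed display rule)."""
    return count * 100 // total

alignment_length = 1415
counts = {"Identity": 376, "Similarity": 518, "Gaps": 466}

summary = {
    label: truncated_percent(n, alignment_length) for label, n in counts.items()
}
for label, n in counts.items():
    print(f"{label}: {n}/{alignment_length} ({summary[label]}%)")
```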


- Green bases have known domain annotations that are detailed below.


  Fly   327 FCQDDVDECAQRDHPVCQNGATCTNTHGSYSCICVNGWAGLDCSNNTD-DCKQAACFYGATCIDG 390
            ||..:...|....   |||.:||.:.....:| |      ||.:||.| ||:.            
Mouse    26 FCNKNNTRCLSGP---CQNNSTCKHFPQDNNC-C------LDTANNLDKDCED------------ 68

  Fly   391 VGSFYCQCTKGKTGLLCHLDDACTSNPCHADAICDTSPINGSYACSCATGYKGVDCSEDIDECDQ 455
                              |.|.|.|:||...|.|...|..|::.|.|..||.|::|....:.|. 
Mouse    69 ------------------LKDPCFSSPCQGIATCVKIPGEGNFLCQCPPGYSGLNCETATNSCG- 114

  Fly   456 GSPCEHNGICVNTPGSYRCNCSQGFTGPRCETNINECESHPCQNEGSCLDDPGTFRCVCMPGFTG 520
            |:.|:|.|.|...|....|.|..|:.|..|||:.|||.|.||.|...|.|....:.|.|:||:.|
Mouse   115 GNLCQHGGTCRKDPEHPVCICPPGYAGRFCETDHNECASSPCHNGAMCQDGINGYSCFCVPGYQG 179

  Fly   521 TQCEIDIDECQSNPCLNDGTCHDKINGFKCSCALGFTGARCQINIDDCQSQPCRNRGICHDSIAG 585
            ..|::::|||.|:||.|:..|.::|..:.|.|...|:|..|::.||:|:||||.:...|.|:..|
Mouse   180 RHCDLEVDECVSDPCKNEAVCLNEIGRYTCVCPQEFSGVNCELEIDECRSQPCLHGATCQDAPGG 244

  Fly   586 YSCECPPGYTGTSCEININDCDSNPC-HRGKCIDDVNSFKCLC-DPGYTGYICQKQINECESNPC 648
            |||:|.||:.|..||:::|:|:|.|| |.|.|:|..||:.|.| ..|:||..|:..|..|.|.||
Mouse   245 YSCDCAPGFLGEHCELSVNECESQPCLHGGLCVDGRNSYHCDCTGSGFTGMHCESLIPLCWSKPC 309

  Fly   649 QFDGHCQDRVGSYYCQCQAGTSGKNCEVNVNECHSNPCNNGATCID------------------- 694
            ..|..|:|.|.||.|.|:.|.:|..||.::|||.||||.....|::                   
Mouse   310 HNDATCEDTVDSYICHCRPGYTGALCETDINECSSNPCQFWGECVELSSEGLYGNTAGLPSSFSY 374

  Fly   695 -GINSYKCQCVPGFTGQHCEKNVDECISSPCANNGVCIDQVNGYKCECP-----RGFYDA-HCLS 752
             |.:.|.|.|.|||||.|||::||||:..||.|.|.|.:....|.|.||     |.||.. :|..
Mouse   375 VGASGYVCICQPGFTGIHCEEDVDECLLHPCLNGGTCENLPGNYACHCPFDDTSRTFYGGENCSE 439

  Fly   753 DVDECASNPCVNEGRCEDGINEFICHCPPGYTGKRCELDIDECSSNPCQHGGTCYDKLNAFSCQC 817
            .:..|..:.|:|.|:|       |.|...|                  |||         |:|||
Mouse   440 ILLGCTHHQCLNNGKC-------IPHFQNG------------------QHG---------FTCQC 470

  Fly   818 MPGYTGQKCETNIDDCVTNPCGNGGTCIDKVNGYKCVCKVPFTG--------------------- 861
            :.||.|..|||    ..|...|:        ||:..|.....||                     
Mouse   471 LSGYAGPLCET----VTTLSFGS--------NGFLWVTSGSHTGIGPECNISLRFHTVQPNALLL 523

  Fly   862 ----RDCESKMDPCASNRCK-------NEAKCTPSS---------NFLDFSC--TCKLGYTGRYC 904
                :|...|::  ..|.|.       |:.|...|.         :|::.:.  |..|...|..|
Mouse   524 IRGNKDVSMKLE--LLNGCVHLSIEVWNQLKVLLSISHNTSDGEWHFVEVTIAETLTLALVGGSC 586

  Fly   905 DEDIDECSLSS------------------------PCRNGASCLNV------PGSYRCL------ 933
            .|   :|:..|                        ...|..|.||:      |....||      
Mouse   587 KE---KCTTKSSVPVENHQSICALQDSFLGGLPMGTANNSVSVLNIYNVPSTPSFVGCLQDIRFD 648

  Fly   934 --------CTKGYEGRDCA--INTDDCASFPCQNGGTCLDGIGDYSCLCVDGFDGKHC------- 981
                    .:.|......|  :..|.|.|.||||.|.|::....|.|.|...:.|.:|       
Mouse   649 LNHITLENVSSGLSSNVKAGCLGKDWCESQPCQNRGRCINLWQGYQCECDRPYTGSNCLKEYVAG 713

  Fly   982 -----------------------------------------ETDINECLSQPCQNGATCSQYVNS 1005
                                                     |....:.:|...::|:...|...|
Mouse   714 RFGQDDSTGYAAFSVNDNYGQNFSLSMFVRTRQPLGLLLALENSTYQYVSVWLEHGSLALQTPGS 778

  Fly  1006 ----------------------------YTCTCPLGFSGINCQT------------NDEDCTESS 1030
                                        |..:..|||..:...|            .|.:.||  
Mouse   779 PKFMVNFFLSDGNVHLISLRIKPNEIELYQSSQNLGFISVPTWTIRRGDVIFIGGLPDREKTE-- 841

  Fly  1031 CLNGG---SCIDGI---------------NGYNCSCLAGYSGANCQYKLNKCDSNPCLNGATCHE 1077
             :.||   .|:..:               |.|:...|...: ..|... |.|.||||.||..||.
Mouse   842 -VYGGFFKGCVQDVRLNSQTLEFFPNSTNNAYDDPILVNVT-QGCPGD-NTCKSNPCHNGGVCHS 903

  Fly  1078 QNNEYTCHCPSGFTGKQCSEYVDWCGQSPCENGATCSQMKHQFSCKCSAGWTG------------ 1130
            ..::::|.||:...|:.| |.|.||..|||...|.|..:...|.|..:|.::|            
Mouse   904 LWDDFSCSCPTNTAGRAC-EQVQWCQLSPCPPTAECQLLPQGFECIANAVFSGLSREILFRSNGN 967

  Fly  1131 -------------------------KLCDVQTISCQDAADRKGLSLR-------------QLCNN 1157
                                     |..:...||.|||  |....||             ||.|:
Mouse   968 ITRELTNITFAFRTHDTNVMILHAEKEPEFLNISIQDA--RLFFQLRSGNSFYTLHLMGSQLVND 1030

  Fly  1158 GT----------------------------------------CKDYGNSHV-------------C 1169
            ||                                        .||..:.:|             |
Mouse  1031 GTWHQVTFSMIDPVAQTSRWQMEVNDQTPFVISEVATGSLNFLKDNTDIYVGDQSVDNPKGLQGC 1095

  Fly  1170 ---------YCS-----QGYAGSYCQKEI---------------DECQSQPCQNGGTCRDLIGAY 1205
                     |.|     .|:.|...:::.               :.|.|.||.:||.|.|...:|
Mouse  1096 LSTIEIGGIYLSYFENLHGFPGKPQEEQFLKVSTNMVLTGCLPSNACHSSPCLHGGNCEDSYSSY 1160

  Fly  1206 ECQCRQGFQGQNCELNIDDCAPNPCQNGGTCHDRVMNFSCSCPPGTMGIICEINKDDCKPGACHN 1270
            .|.|..|:.|.:||:|||:|..:||.: |.|.|.|..:.|.|.||..|:.||::.|:||...|.|
Mouse  1161 RCACLSGWSGTHCEINIDECFSSPCIH-GNCSDGVAAYHCRCEPGYTGVNCEVDVDNCKSHQCAN 1224

  Fly  1271 NGSCIDRVGGFECVCQPGFVGARCEGDI-------NECLSNPCSNAGTLDCVQLVNNYHCNCRPG 1328
            ..:|:....|:.|:|...|.|..|....       ||..:..|.|.|:  |.....::.|.|.||
Mouse  1225 GATCVPEAHGYSCLCFGNFTGRFCRHSRLPSTVCGNEKRNFTCYNGGS--CSMFQEDWQCMCWPG 1287

  Fly  1329 HMGRHCEHKVDFCAQSPCQNGGNCNIRQSGHHCICNNGFYGKNCELSGQD 1378
            ..|..||..::.||..||.|||.|....:...|||:..|.|:.|||...|
Mouse  1288 FTGEWCEEDINECASDPCINGGLCRDLVNRFLCICDVAFAGERCELDLAD 1337
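
Each block above is a triple of rows: the fly sequence, a match line, and the mouse sequence. As a rough sketch of how per-column statistics could be tallied from such a block, the code below uses the final block (Fly 1329..1378, Mouse 1288..1337). It assumes that `-` denotes a gap and that `:` on the match line marks a similar, non-identical pair; whether `.` also counts toward the page's Similarity figure is not stated here, so it is left out. `column_stats` is a hypothetical helper.

```python
# Final alignment block copied from this page.
fly   = "HMGRHCEHKVDFCAQSPCQNGGNCNIRQSGHHCICNNGFYGKNCELSGQD"
match = "..|..||..::.||..||.|||.|....:...|||:..|.|:.|||...|"
mouse = "FTGEWCEEDINECASDPCINGGLCRDLVNRFLCICDVAFAGERCELDLAD"

def column_stats(seq_a, match_line, seq_b):
    """Tally identical, marked-similar, and gapped columns of one block."""
    if not (len(seq_a) == len(match_line) == len(seq_b)):
        raise ValueError("all three rows must have the same length")
    stats = {"columns": len(seq_a), "identical": 0, "marked_similar": 0, "gaps": 0}
    for a, m, b in zip(seq_a, match_line, seq_b):
        if a == "-" or b == "-":
            stats["gaps"] += 1            # assumed gap symbol
        elif a == b:
            stats["identical"] += 1       # same residue in both sequences
        elif m == ":":
            stats["marked_similar"] += 1  # assumed meaning of ':'
    return stats

stats = column_stats(fly, match, mouse)
print(stats)
```

Summing such tallies over all blocks would give whole-alignment counts, subject to the assumptions above about the match-line symbols.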

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence  Domain  Region  External ID  Identity
N  NP_001245510.1  EGF_CA 179..214 CDD:238011
EGF_CA 217..252 CDD:238011
EGF_CA 260..291 CDD:238011
EGF_CA 295..329 CDD:238011 1/1 (100%)
EGF_CA 331..369 CDD:238011 9/37 (24%)
EGF_CA 449..486 CDD:238011 11/36 (31%)
EGF_CA 488..524 CDD:238011 14/35 (40%)
EGF_CA 526..562 CDD:238011 13/35 (37%)
EGF_CA 564..600 CDD:238011 17/35 (49%)
EGF_CA 602..637 CDD:238011 16/36 (44%)
EGF_CA 640..675 CDD:238011 15/34 (44%)
EGF_CA 677..713 CDD:238011 18/55 (33%)
EGF_CA 715..750 CDD:238011 16/40 (40%)
EGF_CA 753..789 CDD:238011 8/35 (23%)
EGF_CA 791..827 CDD:238011 10/35 (29%)
EGF_CA 829..865 CDD:238011 8/60 (13%)
EGF_CA 907..943 CDD:238011 10/79 (13%)
EGF_CA 946..982 CDD:238011 14/83 (17%)
EGF_CA 984..1020 CDD:238011 8/63 (13%)
EGF_CA 1027..1058 CDD:238011 8/48 (17%)
EGF_CA 1062..1095 CDD:238011 14/32 (44%)
EGF_CA 1184..1219 CDD:238011 13/49 (27%)
EGF_CA 1221..1257 CDD:238011 15/35 (43%)
EGF_CA 1259..1295 CDD:238011 11/35 (31%)
EGF_CA 1297..1335 CDD:238011 11/44 (25%)
EGF_CA 1417..1450 CDD:238011
NL 1476..1512 CDD:197463
Notch 1519..1553 CDD:278494
Notch 1565..1593 CDD:278494
NOD 1599..1648 CDD:284282
NODP 1680..1731 CDD:284987
ANK 1896..2038 CDD:238125
ANK repeat 1902..1948 CDD:293786
ANK repeat 1951..1981 CDD:293786
Ank_5 1970..2025 CDD:290568
ANK 1978..2104 CDD:238125
ANK repeat 1984..2015 CDD:293786
ANK repeat 2017..2048 CDD:293786
Ank_2 2022..2114 CDD:289560
ANK repeat 2050..2081 CDD:293786
ANK repeat 2083..2114 CDD:293786
DUF3454 2627..2682 CDD:288764
Crb1  NP_573502.2  EGF 113..143 CDD:394967 11/30 (37%)
EGF_CA 147..182 CDD:238011 14/34 (41%)
EGF_CA 186..221 CDD:238011 13/34 (38%)
EGF_CA 224..259 CDD:238011 17/34 (50%)
EGF_CA 262..298 CDD:238011 16/35 (46%)
EGF_CA <308..336 CDD:238011 12/27 (44%)
EGF_CA 338..394 CDD:238011 18/55 (33%)
EGF_CA 396..437 CDD:214542 16/40 (40%)
Laminin_G_2 513..648 CDD:396680 22/139 (16%)
EGF_CA 675..706 CDD:238011 12/30 (40%)
Laminin_G_2 742..859 CDD:396680 16/119 (13%)
EGF_CA 888..922 CDD:238011 15/34 (44%)
Laminin_G_2 979..1104 CDD:396680 18/126 (14%)
EGF_CA 1140..1174 CDD:238011 13/33 (39%)
EGF_CA 1176..1211 CDD:238011 15/35 (43%)
EGF_CA 1213..1248 CDD:238011 11/34 (32%)
EGF_CA 1296..1332 CDD:238011 13/35 (37%)
Blue background indicates that the domain is not in the aligned region.
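
The domain rows follow a regular pattern: domain name, residue range, external ID, and an optional identity fraction for domains inside the aligned region. Below is a minimal parsing sketch, assuming any leading gene and accession columns have already been stripped from the row and that a leading `<` on the range marks a partial domain. The pattern and the `parse_domain_row` helper are illustrative, not part of DIOPT.

```python
import re

# name, optional '<', start..end, external ID, optional "n/m (p%)" identity
DOMAIN_ROW = re.compile(
    r"^(?P<name>.+?)\s+(?P<partial><)?(?P<start>\d+)\.\.(?P<end>\d+)"
    r"\s+(?P<xref>\S+)"
    r"(?:\s+(?P<ident>\d+)/(?P<span>\d+)\s+\((?P<pct>\d+)%\))?$"
)

def parse_domain_row(line):
    """Parse one domain row into a dict; identity is None if absent."""
    m = DOMAIN_ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised domain row: {line!r}")
    identity = None
    if m.group("ident") is not None:
        identity = (int(m.group("ident")), int(m.group("span")))
    return {
        "name": m.group("name"),
        "partial": m.group("partial") is not None,
        "start": int(m.group("start")),
        "end": int(m.group("end")),
        "external_id": m.group("xref"),
        "identity": identity,
    }

print(parse_domain_row("EGF_CA 564..600 CDD:238011 17/35 (49%)"))
print(parse_domain_row("ANK repeat 1902..1948 CDD:293786"))
```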


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              BLAST Result Score  Score Type  Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           -                   -           E33208_3BAFT
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        1             0.900           -                   -           OOG6_100271
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           -                   -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           3             2.710
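
The closing figure of the table is consistent with summing the rows of the three tools that report a match: a simple score of 3 and a weighted score of 2.710. A short sketch of that arithmetic follows, with the rows transcribed from the table. Treating the total as a plain sum is an assumption based on these numbers, not a documented DIOPT rule.

```python
# (tool, simple score, weighted score) for the tools that matched this pair.
matched_rows = [
    ("eggNOG", 1, 0.900),
    ("orthoMCL", 1, 0.900),
    ("Phylome", 1, 0.910),
]

simple_total = sum(simple for _, simple, _ in matched_rows)
# Round to three decimals to avoid floating-point noise in the sum.
weighted_total = round(sum(weight for _, _, weight in matched_rows), 3)

print(simple_total, f"{weighted_total:.3f}")
```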
