DRSC/TRiP Functional Genomics Resources

Protein Alignment crb and Crb2

DIOPT Version: 9

Sequence 1: NP_001247284.1  Gene: crb / 42896  FlyBaseID: FBgn0259685  Length: 2253  Species: Drosophila melanogaster
Sequence 2: NP_001157038.1  Gene: Crb2 / 241324  MGIID: 2679260  Length: 1282  Species: Mus musculus


Alignment Length: 1588  Identity: 458/1588 (28%)
Similarity: 635/1588 (39%)  Gaps: 403/1588 (25%)
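
The three percentages above are each a count of alignment columns divided by the alignment length. The reported values are consistent with truncation to a whole percent rather than rounding (635/1588 is about 39.99%, reported as 39%). A minimal Python sketch of that arithmetic, assuming truncation is what the page does; the helper name `truncated_percent` is hypothetical:

```python
def truncated_percent(count: int, total: int) -> int:
    """Whole-number percentage of count/total, truncated rather than rounded."""
    return count * 100 // total


alignment_length = 1588
column_counts = {"Identity": 458, "Similarity": 635, "Gaps": 403}

for name, count in column_counts.items():
    pct = truncated_percent(count, alignment_length)
    print(f"{name}: {count}/{alignment_length} ({pct}%)")
```

Integer division keeps the calculation exact and avoids floating-point edge cases near a whole-percent boundary.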


- Green residues have known domain annotations that are detailed below.


  Fly   718 CATQPCQNGGECVDLPNGDYECKCTRGWTGRTCGNDVDECTLHPKICGNG-IC---KNEKGSYKC 778
            ||:.||..|.:|....:|.|.|:          .:::..|...|  |.:| :|   ..:..|::|
Mouse    46 CASDPCAPGTKCQATESGGYTCE----------PSELGGCATQP--CHHGALCVPQGPDPNSFRC 98

  Fly   779 YCTPGFTGVHCDSDVDECLSFPCLNGATCHNKINAYECVCQPGYEGENCEVDIDECGSNPCSNGS 843
            ||.|||.|.||:.|:|||.|.||.:|.||.|..:.|||.|..||.|..||.::|||.|.||.:|.
Mouse    99 YCVPGFQGPHCELDIDECASRPCQHGGTCQNLADHYECHCPLGYAGVTCEAEVDECSSAPCLHGG 163

  Fly   844 TCIDRINNFTCNCIPGMTGRICDIDIDDCVGDPCLNGGQCIDQLGGFRCDCSGTGYEGENCELNI 908
            :|:|.:.::.|.|.||..|..|.:|:|:|...||.:||.|.|.:.||||||:.|||||..||..:
Mouse   164 SCLDGVGSYRCVCAPGYAGANCQLDVDECQSQPCAHGGVCHDLVNGFRCDCADTGYEGARCEQEV 228

  Fly   909 DECLSNPCTNGAKCLDRVKDYFCDCHNGYKGKNCEQDINECESNPCQYNGNCLERSNITLYQMSR 973
            .||.|.||.:.|.|||..:.:.|.|..|:.|:.||.|.:||.|.|||..|.||:||:.|||    
Mouse   229 LECASAPCAHNASCLDGFRSFRCLCWPGFSGERCEVDEDECASGPCQNGGQCLQRSDPTLY---- 289

  Fly   974 ITDLPKVFSQPFSFENASGYECVCVPGIIGKNCEININECDSNPCSKHGNCNDGIGTYTCECEPG 1038
             ..:..:|...|||.:|:|:.|.|..|..|.:|.::::||.|.||...|:|.|....:.|.|:.|
Mouse   290 -GGVQAIFPGAFSFSHAAGFLCSCPLGFAGNDCSMDVDECASGPCLNGGSCQDLPNGFQCYCQDG 353

  Fly  1039 FEGTHCEINIDECDRYNPC-QRGTCYDQIDDYDCDCDANYGGKNCSVLLKGCDQNPCLNGGACLP 1102
            :.|..|:.::||| :..|| ..|||.|.:..|.|.|...:||.:|||.|.||..:.|.....|:|
Mouse   354 YTGLTCQEDMDEC-QSEPCLHGGTCSDTVAGYICQCPEAWGGHDCSVQLTGCQGHTCPLAATCIP 417

  Fly  1103 YLINEVTHLYNCTCENGFQGDKCEKTTTLSMVATSLISVTTEREEGYDINLQFRTTLPNGVLAFG 1167
             ......|.|.|.|..|..|..|.:.||.|:|:.|.:...........:.|:|||||..|.||  
Mouse   418 -TFKSGLHGYFCRCPPGTYGPFCGQNTTFSVVSGSSVWGLVPAAASLGLALRFRTTLLAGTLA-- 479

  Fly  1168 TTGEKNEPVSYIL--ELINGRLNLHSSLLNKWEGVFI----GSKLNDSNWHKVFVAINTSHLVLS 1226
            |..:..:.:..:|  .::...|:.|.:      .|.|    ...|||.:||:|.|.::...|.|.
Mouse   480 TLKDTRDSLELVLVGAVLQATLSRHGT------AVLILTLPDLALNDGHWHQVEVTLHLGTLELR 538

  Fly  1227 ANDE-------QAIFPVGSYETAN--NSQPSFPRTYLGGTIPNLKSYLRHLTHQPSAFVGCMQDI 1282
            ...|       .|..||.:..||:  :..|.....||||.:                |.||.||:
Mouse   539 LWHEGCPGQLCVASGPVATGPTASVASGPPGSYSIYLGGGV----------------FAGCFQDV 587

  Fly  1283 MVNGKWIFPDEQDANISYTKLENVQSGCPRTEQCKPNPCHSNGECTDLWHTFACHCPRPFFGHTC 1347
            .|.|..:.|:|...        .|..||.|.|.|:|.||...|.|.|||..|.|.||||:.|.||
Mouse   588 RVEGHLLLPEELKG--------TVLLGCERREPCQPLPCAHGGACVDLWTHFRCDCPRPYRGATC 644

  Fly  1348 QHNMTAATFGHENTTHSAVIVETTDVARRAIRSILDISMFIRTREPTGQVFYLGTDPRKAPTKNI 1412
            ...:.|||||....|.||..:      ...:...|.:|.|:|||||.|.:.....|...:.|..:
Mouse   645 TDEVPAATFGLGGATSSASFL------LHQLGPNLTVSFFLRTREPAGLLLQFANDSVASLTVFL 703

  Fly  1413 GDSYVAAKLHGGELLVKMQFSGTPEAYTVGGQKLDNGYNHLIEVVRNQTLVQVKLNGTEYFRKTL 1477
            .:..:.|           :..|.|.....|  :.|:|..||:.:                   :.
Mouse   704 SEGQIRA-----------EGLGHPAVVLPG--RWDDGLPHLVML-------------------SF 736

  Fly  1478 STTGLLD-AQVLYLGGP-APTRESLLGATTEPGIIPVPGAGIPIEDTTVPKEADDSRDYFKGIIQ 1540
            ....|.| .|.||:||. .|....|.|..                              |:|.:|
Mouse   737 GPDQLQDLGQRLYVGGRFYPDDTQLWGGP------------------------------FRGCLQ 771

  Fly  1541 DVKVSNGSLNLIVEMYSLNVTDVQVNAKPLGAVTIDRASVLPGEVSDDLCRKNPCLHNAECRNTW 1605
            |::::  |::|  ..:|   :.::.::.|........:::..|.||:|.|..|||.:...|..||
Mouse   772 DLQLN--SIHL--PFFS---SPMENSSWPSELEAGQSSNLTQGCVSEDTCNPNPCFNGGTCHVTW 829

  Fly  1606 NDYTCKCPNGYKGKNCQEIEFCQHVTCPGQSLCQNLDDGYECVTNTTFTGQERSPLAFFYFQEQQ 1670
            ||:.|.|...:.|..|.:..:|....|...:.|:.:.||:.||...||  :|..|..|      .
Mouse   830 NDFYCTCSENFTGPTCAQQRWCPRQPCLPPATCEEVPDGFVCVAEATF--REGPPAVF------T 886

  Fly  1671 SDDIVSEASPKQTLKPVIDIAFRTRAGGTLLYIDNVDGFFEIGVNGGRVTITWKLSALHFGESAR 1735
            ..::.|..|.       :.:|||||.                                  .|:..
Mouse   887 GHNVSSSLSG-------LTLAFRTRD----------------------------------SEAGL 910

  Fly  1736 FEKENTDGEWSRIYLRAHNSKLEGGWKGWESMVDPTPA------------FSTDIDQAAFQSLIA 1788
            ....:..|..|.|:|...|..|.|...|   .|.|.|.            .:.:..|||     |
Mouse   911 LRAVSAAGAHSNIWLAVRNGSLAGDVAG---SVLPAPGPRVADGAWHRVRLAREFPQAA-----A 967

  Fly  1789 TSTQVYLGG--MPESRQARGSTLSAQQG---------SQFKGCVGEARVGDLLLPYFSMAELYSR 1842
            :...::|.|  .|.:....|..|...||         ..|.||:|...:||..||          
Mouse   968 SRWLLWLDGAATPVALHGLGGDLGFLQGPGAVPLLLAENFTGCLGRVALGDFPLP---------- 1022

  Fly  1843 TNVSVQQKAQFRLNATRPEEGCILCFQSDCKNDGFCQSPSDEYACTCQPGFEGDDCGTDIDECLN 1907
                          ...|..|.:    |..:........|...:..|:.|          ..|..
Mouse  1023 --------------LAPPRSGTV----SGAREHFVAWPGSPAVSLGCRGG----------PVCSP 1059

  Fly  1908 TECLNNGTCINQVAAFFCQCQPGFEGQHCEQNIDECADQPCHNGGNCTDLIASYVCDCPEDYMGP 1972
            :.||:.|.|.:...||.|.|.|.:||..||...|.|...||..|                     
Mouse  1060 SPCLHGGACRDLFDAFACSCGPAWEGPRCEIRADPCRSTPCVRG--------------------- 1103

  Fly  1973 QCDVLKQMTCENEPCRNGSTCQNGFNASTGNNFTCTCVPGFEGPLCDIPFCEITPCDNGGLCLTT 2037
            ||                       :|.....|.|.|.|||.||.|.:|            .|..
Mouse  1104 QC-----------------------HARPDGRFECRCPPGFSGPRCRLP------------VLPQ 1133

  Fly  2038 GAVPMCKCSLGYTGRLCEQDINECESNPCQNGGQCKDLVGRYECDCQGTGFEGIRCENDIDECNM 2102
            |      |:|..|                     |||               |..||        
Mouse  1134 G------CNLNST---------------------CKD---------------GAPCE-------- 1148

  Fly  2103 EGDYCGGLGRCFNKPGSFQCICQKPYCGAYCNFTD-PCNATDLCSNGGRCVESCGAKPDYYCECP 2166
                 ||       |....|.||:...|..|...| ||.|:. |.|||.|..:.|.   :.|.|.
Mouse  1149 -----GG-------PLGTNCSCQEGLAGLRCQSLDKPCEASP-CLNGGTCRVASGI---FECTCS 1197

  Fly  2167 EGFAGKNC----TAPITAKEDGPSTTDIAIIVIPV--VVVLLLIAGALLGTFLVMARNKRATRGT 2225
            .||:|:.|    |.|:      |....:..:.:|.  ..:|||:.|.|.|  ::.||.:|.:.||
Mouse  1198 AGFSGQFCEVVKTLPL------PLPFPLLEVAVPAACACLLLLLLGLLSG--ILAARKRRQSEGT 1254

  Fly  2226 YSPSAQEYCNPRLEMDNVLKPPPEERLI 2253
            ||||.||....|||||:|||.|||||||
Mouse  1255 YSPSQQEVAGARLEMDSVLKVPPEERLI 1282
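
Each block of the alignment above is three lines: the fly sequence with start and end coordinates, a line of match symbols, and the mouse sequence. The sketch below tallies the columns of one such block. It is an illustration only, under these assumptions: `|` marks an identical pair, a `-` in either sequence marks a gap column, and `:` and `.` are counted separately without interpreting what substitution score they stand for. The function name `tally_block` is hypothetical.

```python
import re
from collections import Counter

# "<label> <start> <sequence> <end>", as in the alignment blocks.
SEQ_LINE = re.compile(r"^\s*(\S+)\s+(\d+)\s(\S+)\s+(\d+)\s*$")


def tally_block(top: str, match: str, bottom: str) -> Counter:
    """Count column types in one three-line alignment block."""
    t, b = SEQ_LINE.match(top), SEQ_LINE.match(bottom)
    if t is None or b is None:
        raise ValueError("expected '<label> <start> <sequence> <end>'")
    seq1, seq2 = t.group(3), b.group(3)
    if len(seq1) != len(seq2) or t.start(3) != b.start(3):
        raise ValueError("sequence lines are not column-aligned")

    # The match line has no label, so slice it at the sequence column and
    # pad it in case trailing spaces were stripped.
    offset = t.start(3)
    marks = match[offset:offset + len(seq1)].ljust(len(seq1))

    counts = Counter()
    for a, mark, c in zip(seq1, marks, seq2):
        counts["columns"] += 1
        if a == "-" or c == "-":
            counts["gap"] += 1
        elif mark == "|":
            counts["identical"] += 1
        else:
            counts[mark] += 1  # ':', '.', or ' ' kept as separate tallies
    return counts


# The block covering fly residues 779..843, copied from the alignment.
top = "  Fly   779 YCTPGFTGVHCDSDVDECLSFPCLNGATCHNKINAYECVCQPGYEGENCEVDIDECGSNPCSNGS 843"
mid = "            ||.|||.|.||:.|:|||.|.||.:|.||.|..:.|||.|..||.|..||.::|||.|.||.:|."
bot = "Mouse    99 YCVPGFQGPHCELDIDECASRPCQHGGTCQNLADHYECHCPLGYAGVTCEAEVDECSSAPCLHGG 163"

print(dict(tally_block(top, mid, bot)))
```

Slicing the match line by column offset, rather than stripping whitespace, matters because a leading space in that line is itself a column (a mismatch or a gap).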

Known Domains:


Indicated by green residues in alignment.

Gene Sequence Domain Region External ID Identity
crb NP_001247284.1 LamG 91..228 CDD:238058
EGF_CA 386..423 CDD:238011
EGF_CA 425..460 CDD:238011
EGF 466..495 CDD:278437
EGF 605..633 CDD:278437
EGF_CA 716..750 CDD:238011 9/31 (29%)
EGF_CA 753..789 CDD:238011 13/39 (33%)
EGF_CA 792..828 CDD:238011 18/35 (51%)
EGF_CA 830..865 CDD:238011 14/34 (41%)
EGF_CA 868..905 CDD:238011 20/36 (56%)
EGF_CA 907..943 CDD:238011 13/35 (37%)
EGF_CA 1009..1045 CDD:238011 12/35 (34%)
EGF_CA 1047..1082 CDD:238011 14/35 (40%)
Laminin_G_1 1155..1290 CDD:278483 41/149 (28%)
EGF 1316..1346 CDD:278437 16/29 (55%)
Laminin_G_1 1388..1550 CDD:278483 31/163 (19%)
EGF_CA <1593..1622 CDD:238011 11/28 (39%)
Laminin_G_2 1692..1828 CDD:280389 31/158 (20%)
EGF_CA 1901..1937 CDD:238011 12/35 (34%)
EGF_CA 1939..1974 CDD:238011 5/34 (15%)
EGF_CA 2057..2094 CDD:238011 4/36 (11%)
EGF_CA 2096..2133 CDD:238011 7/36 (19%)
EGF_CA 2137..2175 CDD:238011 16/42 (38%)
Crb2 NP_001157038.1 EGF_CA <79..110 CDD:238011 13/32 (41%)
EGF_CA 112..148 CDD:238011 18/35 (51%)
EGF_CA 151..186 CDD:238011 14/34 (41%)
EGF_CA 188..225 CDD:238011 20/36 (56%)
EGF_CA 230..263 CDD:238011 13/32 (41%)
EGF_CA 324..360 CDD:238011 12/35 (34%)
EGF_CA 362..397 CDD:238011 14/35 (40%)
Laminin_G_2 469..592 CDD:280389 40/146 (27%)
EGF 613..643 CDD:278437 16/29 (55%)
LamG 666..776 CDD:238058 33/171 (19%)
EGF 814..844 CDD:278437 12/29 (41%)
Laminin_G_2 901..1018 CDD:280389 31/158 (20%)
EGF_CA 1058..1089 CDD:238011 11/30 (37%)
EGF_CA 1175..1206 CDD:238011 12/34 (35%)
Interaction with EPB41L5. /evidence=ECO:0000250|UniProtKB:Q5IJ48 1246..1282 23/35 (66%)
Blue background indicates that the domain is not in the aligned region.
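
The domain rows above follow a regular pattern: domain name, residue range, CDD identifier, and, for domains inside the aligned region, an identity fraction with a percentage. A minimal parsing sketch follows, assuming that layout. The `<` prefix on some ranges is kept as a flag without interpretation, and the names `DOMAIN_ROW` and `parse_domain_row` are hypothetical. Rows without a `CDD:` identifier, such as the EPB41L5 interaction region, are skipped.

```python
import re

DOMAIN_ROW = re.compile(
    r"(?P<domain>\S+)\s+(?P<partial><)?(?P<start>\d+)\.\.(?P<end>\d+)"
    r"\s+CDD:(?P<cdd>\d+)"
    r"(?:\s+(?P<ident>\d+)/(?P<cols>\d+)\s+\((?P<pct>\d+)%\))?"
)


def parse_domain_row(line: str):
    """Return a dict for one domain row, or None if the row has no CDD entry."""
    m = DOMAIN_ROW.search(line)
    if m is None:
        return None
    row = {
        "domain": m["domain"],
        "partial": m["partial"] is not None,
        "start": int(m["start"]),
        "end": int(m["end"]),
        "cdd": m["cdd"],
        "identity": None,  # stays None when the row has no identity column
    }
    if m["ident"] is not None:
        row["identity"] = (int(m["ident"]), int(m["cols"]), int(m["pct"]))
    return row


sample_rows = [
    "EGF 466..495 CDD:278437",
    "EGF_CA 868..905 CDD:238011 20/36 (56%)",
    "EGF_CA <1593..1622 CDD:238011 11/28 (39%)",
]
for line in sample_rows:
    print(parse_domain_row(line))
```

In the rows checked here, the listed percentage agrees with the identity fraction rounded to the nearest whole number, for example 20/36 listed as 56%.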


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 0 0.000 Not matched by this tool.
Domainoid 1 1.000 61 1.000 Domainoid score I10449
eggNOG 1 0.900 - - E33208_3BAFT
Hieranoid 0 0.000 Not matched by this tool.
Homologene 0 0.000 Not matched by this tool.
Inparanoid 0 0.000 Not matched by this tool.
Isobase 0 0.000 Not matched by this tool.
OMA 0 0.000 Not matched by this tool.
OrthoDB 1 1.010 - - D111153at2759
OrthoFinder 1 1.000 - - FOG0003041
OrthoInspector 1 1.000 - - otm42733
orthoMCL 1 0.900 - - OOG6_100271
Panther 0 0.000 Not matched by this tool.
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R3765
SonicParanoid 0 0.000 Not matched by this tool.
SwiftOrtho 0 0.000 Not matched by this tool.
TreeFam 0 0.000 Not matched by this tool.
Total 8 7.750
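
The final row of the table is consistent with summing the two score columns: eight tools report a match, and their weighted scores add up to 7.750. A short sketch that reproduces those totals from the rows above; `Decimal` is used so the three-decimal values add exactly, and the variable names are illustrative:

```python
from decimal import Decimal

# Weighted scores of the tools that report a match (simple score 1),
# copied from the table. Unmatched tools contribute 0 to both totals.
weighted_scores = {
    "Domainoid": "1.000",
    "eggNOG": "0.900",
    "OrthoDB": "1.010",
    "OrthoFinder": "1.000",
    "OrthoInspector": "1.000",
    "orthoMCL": "0.900",
    "Phylome": "0.910",
    "RoundUp": "1.030",
}

simple_total = len(weighted_scores)
weighted_total = sum(Decimal(score) for score in weighted_scores.values())

print(simple_total, weighted_total)
```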