DRSC/TRiP Functional Genomics Resources



Protein Alignment crb and Crb1

DIOPT Version: 9

Sequence 1: NP_001247284.1 Gene: crb / 42896 FlyBaseID: FBgn0259685 Length: 2253 Species: Drosophila melanogaster
Sequence 2: NP_001100652.1 Gene: Crb1 / 304825 RGDID: 1309947 Length: 1407 Species: Rattus norvegicus


Alignment Length: 1681 Identity: 505/1681 (30%)
Similarity: 734/1681 (43%) Gaps: 361/1681 (21%)
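
The percentages in this summary can be recomputed from the raw counts. The following is a minimal sketch, not part of DIOPT itself; the names `ALIGNMENT_LENGTH`, `COUNTS`, and `percent` are illustrative, and the numbers are copied from the summary above.

```python
# Counts copied from the alignment summary; all names here are illustrative.
ALIGNMENT_LENGTH = 1681
COUNTS = {"identity": 505, "similarity": 734, "gaps": 361}


def percent(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


for name, count in COUNTS.items():
    value = percent(count, ALIGNMENT_LENGTH)
    # Print one decimal place next to the truncated whole-number value.
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} = {value:.1f}% (truncated: {int(value)}%)")
```

The whole-number percentages shown in the summary match the truncated values, which suggests, though the page does not say so, that they are truncated rather than rounded.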


- Green bases have known domain annotations that are detailed below.


  Fly   644 ASALALTPIN--CNATNGKCL-----NGGTCSMNGTHCYCAVGYSGD---RCEKAEN-CSPLNCQ 697
            :|:|.:...|  ||..|.:||     |..||........|.:..:.|   .||..:: |....||
  Rat    17 SSSLLICVKNSFCNKNNTRCLSNSCQNNSTCKRFPQDSSCCLDTANDLDTDCEDLKDPCLSSPCQ 81

  Fly   698 EPMVCVQNQ------CLCPE-------NKVCNQCATQPCQNGGECVDLPNGDYECKCTRGWTGRT 749
            ....||...      |.||.       :...|.|....||:||.|...|.... |.|..|:.||.
  Rat    82 GTATCVNIPGERSFLCQCPPGYSGLTCDTATNSCGGNLCQHGGTCHKDPEHPV-CICPPGYAGRF 145

  Fly   750 CGNDVDECTLHPKICGNG-ICKNEKGSYKCYCTPGFTGVHCDSDVDECLSFPCLNGATCHNKINA 813
            |..|.:||...|  |.|| :|::....|.|:|.||:.|.|||.:||||:|.||:|.|.|.|:|..
  Rat   146 CETDHNECDSSP--CHNGAVCQDRINGYSCFCVPGYQGRHCDLEVDECVSDPCMNEAVCLNEIGR 208

  Fly   814 YECVCQPGYEGENCEVDIDECGSNPCSNGSTCIDRINNFTCNCIPGMTGRICDIDIDDCVGDPCL 878
            |.|||...|.|.|||::||||||.||.:|:||.|.:..:.|:|.||..|..|:::.::|...|||
  Rat   209 YTCVCPQEYSGVNCELEIDECGSQPCLHGATCRDALGGYFCDCAPGFLGDHCELNFNECESQPCL 273

  Fly   879 NGGQCIDQLGGFRCDCSGTGYEGENCELNIDECLSNPCTNGAKCLDRVKDYFCDCHNGYKGKNCE 943
            :||.|:|....:.|||:|:|:.|.:||..|..|.|.||.|.|.|.|.|..|.|.|..||.|..||
  Rat   274 HGGLCVDGRNSYHCDCTGSGFRGIHCESLIPLCWSKPCHNDATCEDTVDSYICHCWPGYTGALCE 338

  Fly   944 QDINECESNPCQYNGNCLERSNITLYQMSRITDLPKVFSQPFSFENASGYECVCVPGIIGKNCEI 1008
            .|||||.|||||:.|.|.|.|:..||  .....||    ..||:..||||.|:|:||..|::||.
  Rat   339 TDINECSSNPCQFGGECAELSSQDLY--GNTAGLP----SSFSYLGASGYVCICLPGFTGRHCEE 397

  Fly  1009 NINECDSNPCSKHGNCNDGIGTYTCECEPGFEGTHCEINIDECDRYNPCQRGTCYDQIDDYDCDC 1073
            :|:||..|||...|.|.:..|.|||.|.           .|:..|                    
  Rat   398 DIDECLPNPCLNGGTCQNLPGNYTCHCP-----------FDDTSR-------------------- 431

  Fly  1074 DANYGGKNCSVLLKGCDQNPCLNGGACLPYLINEVTHLYNCTCENGFQGDKCEKTTTLSMVATSL 1138
             ..|||::||.:|.||..:.|||.|.|:|:..|. .|.:.|.|.:|:.|..||..||||..:...
  Rat   432 -TFYGGEDCSEILLGCTHHQCLNNGKCIPHFQNG-QHEFTCQCPSGYAGPLCETVTTLSFESNGF 494

  Fly  1139 ISVT----TEREEGYDINLQFRTTLPNGVLAFGTTGEKNEPVSYILELINGRLNLHSSLLNKWEG 1199
            :.||    |......:|:|:|:|..||.:|..    ..|:.:|..|||::|.::|...:.|:.:.
  Rat   495 LWVTSGSHTSMGSECNISLKFQTVQPNALLLV----RGNKDMSVKLELLDGCVHLSIEVWNQSKV 555

  Fly  1200 -VFIGSKLNDSNWHKVFVAI-NTSHLVLSANDEQAIFPVGSYETANNSQP--SFPRTYLG----G 1256
             ::|....:|..||.|.|.. .|..|.|:.:..:......|..:..|.|.  :...::||    |
  Rat   556 LLYISHNTSDGEWHLVEVTFAETITLALNGSSCKEKCTTKSSVSIENHQSICALQNSFLGGLPMG 620

  Fly  1257 TIPNLKSYLRHLTHQPS--AFVGCMQDIMVNGKWIFPDEQDANISYTKLENVQSGCPRTEQCKPN 1319
            |..:..|.| ::.:.||  :||||:|||..:...|..:...:.:|    .||::||.|.:.|:..
  Rat   621 TASDSMSVL-NIYNVPSTPSFVGCLQDIRFDLNHITLESISSGLS----SNVKAGCLRKDWCESQ 680

  Fly  1320 PCHSNGECTDLWHTFACHCPRPFFGHTCQHNMTAATFGHENTTHSAVIVETTDVARRAIRSILDI 1384
            ||.:.|.|.:||..:.|.|.||:.|..|.....|..||.:::|..|......:..:.     .::
  Rat   681 PCQNRGRCINLWQGYRCECDRPYAGSNCLKEYVAGRFGQDDSTGYAAFKVNDNFGQN-----FNL 740

  Fly  1385 SMFIRTREPTGQVFYLGTDPRKAPTKNIGDSYVAAKLHGGELLVKMQFSGTPEAYTVGGQKLDNG 1449
            |||:|||:|.|.:..||         |....||...|..|.|.:|.  .|:|:  .|....|.:|
  Rat   741 SMFVRTRQPLGFLLTLG---------NSTYQYVCVWLEHGSLALKT--PGSPK--LVANFFLSDG 792

  Fly  1450 YNHLIEVVRNQTLVQVKLNGTEYFRKTLSTTGLLD--------AQVLYLGGPAPTRESLLGATTE 1506
            ..|||.       :::|.|..|.::.: ...|.:.        ..|:::|| .|.||        
  Rat   793 NAHLIS-------LRIKPNEIELYQSS-QNLGFMSVPAWTIQRGDVIFIGG-LPDRE-------- 840

  Fly  1507 PGIIPVPGAGIPIEDTTVPKEADDSRDYFKGIIQDVKVSNGSLNLIVEMYSLNVTDVQVNAKPLG 1571
                                :.:....:.||.||||::::.:|.....         ..|:....
  Rat   841 --------------------KTEAYGGFLKGCIQDVRLNSQNLEFFPN---------STNSAQHD 876

  Fly  1572 AVTIDRASVLPGEVSDDLCRKNPCLHNAECRNTWNDYTCKCPNGYKGKNCQEIEFCQHVTCPGQS 1636
            .|.::.....||   |::|:.|||.:...|.:.|:|::|.||....|:.|.|:::||...||..:
  Rat   877 PVLVNVTQGCPG---DNVCKSNPCHNGGVCHSLWDDFSCSCPTNTSGRACDEVQWCQLSPCPPIA 938

  Fly  1637 LCQNLDDGYECVTNTTFTGQERSPLAFFYFQEQQSDDIVSEASPKQTLKPVIDIAFRTR-AGGTL 1700
            .||.:..|:||:.|..|:|.. |.:.|     :.:.:|..|.:.       |...|||: ....:
  Rat   939 ECQLVPQGFECIANAAFSGLS-SEILF-----RSNGNITRELTN-------ITFGFRTQDTNAVI 990

  Fly  1701 LYIDNVDGFFEIGVNGGRVTITWK----LSALHFGESARFEKENTDGEWSRIYLRAHNSKLEGGW 1761
            |:.:....|..|.:...|:....:    ...||...|...    .||.|.|:..           
  Rat   991 LHAEKEPEFLNISIQDSRLLFQLRSGNSFYTLHLTGSQLV----NDGAWHRVTF----------- 1040

  Fly  1762 KGWESMVDP---TPAFSTDI-DQAAFQ-SLIAT--------STQVYLGGMPESRQARGSTLSAQQ 1813
                ||:||   |..:..:: ||..|. |.:||        :|.:|:|         ...:..|:
  Rat  1041 ----SMIDPRAQTSLWQMEVDDQTPFVISAVATGNLNFLKDNTDIYVG---------DQAVDNQK 1092

  Fly  1814 GSQFKGCVGEARVGDLLLPYFSMAELYSRTNVSVQQKAQF-RLNATRPEEGCI---LCFQSDCKN 1874
            |.|  ||:...::|.|.|.||.  .|:..|  |..|:.|| :::......||:   .|..|.|.:
  Rat  1093 GLQ--GCLSTVQIGGLYLSYFE--SLHGFT--SKPQEEQFLKVSTNVALTGCLPLSACHSSPCLH 1151

  Fly  1875 DGFCQSPSDEYACTCQPGFEGDDCGTDIDECLNTECLNNGTCINQVAAFFCQCQPGFEGQHCEQN 1939
            .|.|:.....|.|.|.||:.|..|..::|||.::.|: :|.|.:.|||:.|:|:||:.|.:||..
  Rat  1152 GGICEDSYSSYRCACLPGWSGAHCEINVDECSSSPCV-HGNCSDGVAAYHCRCEPGYTGGNCEAE 1215

  Fly  1940 IDECADQPCHNGGNCTDLIASYVCDCPEDYMGPQC--DVLKQMTCENEPCRNGSTCQNGFNASTG 2002
            :|.|....|.||..|......|.|.|..::.|..|  ..|....|.||.                
  Rat  1216 VDTCKSHQCANGATCVSGAQGYSCLCLGNFTGRFCRHSRLPSTVCGNEK---------------- 1264

  Fly  2003 NNFTCTCVPGFEGPLCDIPFCEITPCDNGGLCLTTGAVPMCKCSLGYTGRLCEQDINECESNPCQ 2067
            .||||.                     |||.|........|.|..|:||..||::||||.|:||.
  Rat  1265 TNFTCY---------------------NGGSCSVFQEDWKCVCRPGFTGEWCEENINECASDPCI 1308

  Fly  2068 NGGQCKDLVGRYECDCQGTGFEGIRCENDIDECNMEGDYCGGLGRCFNKPGSFQCICQKPYCGAY 2132
            |||.|:|||.|::|.|. ..|.|.|||.|:.:..:.|                            
  Rat  1309 NGGLCRDLVNRFQCICD-VAFAGERCELDLADDRLVG---------------------------- 1344

  Fly  2133 CNFTDPCNATDLCSNGGRCVESCGAKPDYYCECPEGFAGKNCTAPITAKEDGPSTTDIAIIVIPV 2197
                                                        .|||...|           .:
  Rat  1345 --------------------------------------------IITAVGSG-----------TL 1354

  Fly  2198 VVVLLLIAGALLGTFLVMARNKRATRGTYSPSAQEYCNPRLEMDNVLKPPPEERLI 2253
            |::.:|:..|::.   ::|.|||||:||||||.||...||:||.:.:.||..||||
  Rat  1355 VLLFILLLAAIVS---LIASNKRATQGTYSPSGQEKAGPRVEMWSRMPPPALERLI 1407

Known Domains:


Indicated by green bases in alignment.

Gene Sequence Domain Region External ID Identity
crb NP_001247284.1 LamG 91..228 CDD:238058
EGF_CA 386..423 CDD:238011
EGF_CA 425..460 CDD:238011
EGF 466..495 CDD:278437
EGF 605..633 CDD:278437
EGF_CA 716..750 CDD:238011 13/33 (39%)
EGF_CA 753..789 CDD:238011 14/36 (39%)
EGF_CA 792..828 CDD:238011 19/35 (54%)
EGF_CA 830..865 CDD:238011 17/34 (50%)
EGF_CA 868..905 CDD:238011 14/36 (39%)
EGF_CA 907..943 CDD:238011 16/35 (46%)
EGF_CA 1009..1045 CDD:238011 13/35 (37%)
EGF_CA 1047..1082 CDD:238011 5/34 (15%)
Laminin_G_1 1155..1290 CDD:278483 40/144 (28%)
EGF 1316..1346 CDD:278437 12/29 (41%)
Laminin_G_1 1388..1550 CDD:278483 38/169 (22%)
EGF_CA <1593..1622 CDD:238011 10/28 (36%)
Laminin_G_2 1692..1828 CDD:280389 33/153 (22%)
EGF_CA 1901..1937 CDD:238011 14/35 (40%)
EGF_CA 1939..1974 CDD:238011 10/34 (29%)
EGF_CA 2057..2094 CDD:238011 20/36 (56%)
EGF_CA 2096..2133 CDD:238011 2/36 (6%)
EGF_CA 2137..2175 CDD:238011 0/37 (0%)
Crb1 NP_001100652.1 EGF_CA 73..108 CDD:238011 8/34 (24%)
EGF 115..145 CDD:278437 11/30 (37%)
EGF_CA 149..184 CDD:238011 14/36 (39%)
EGF_CA 188..223 CDD:238011 19/34 (56%)
EGF_CA 226..261 CDD:238011 17/34 (50%)
EGF_CA 263..300 CDD:238011 14/36 (39%)
EGF_CA <310..338 CDD:238011 13/27 (48%)
EGF_CA 340..396 CDD:238011 29/61 (48%)
EGF_CA 398..>425 CDD:214542 12/26 (46%)
Laminin_G_2 515..650 CDD:280389 40/139 (29%)
EGF_CA 677..708 CDD:238011 12/30 (40%)
Laminin_G_2 744..861 CDD:280389 38/166 (23%)
EGF 892..921 CDD:278437 10/28 (36%)
LamG 955..1103 CDD:238058 40/190 (21%)
EGF_CA <1147..1176 CDD:238011 10/28 (36%)
EGF_CA 1178..1213 CDD:238011 14/35 (40%)
EGF_CA 1298..1334 CDD:238011 20/36 (56%)
Blue background indicates that the domain is not in the aligned region.
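
For downstream use, the domain rows above can be read into structured records. This is a rough sketch under stated assumptions: the row layout (domain name, `start..end` region with optional `<` or `>` markers, a `CDD:` identifier, and an optional `matches/compared (percent)` identity field) is inferred from the rows shown here, and `DomainRow` and `parse_domain_row` are hypothetical names, not part of any DIOPT tooling.

```python
import re
from typing import NamedTuple, Optional


class DomainRow(NamedTuple):
    name: str
    start: int
    end: int
    external_id: str
    matches: Optional[int]   # identical positions, if the domain is in the aligned region
    compared: Optional[int]  # positions compared, if reported


# Layout inferred from the table: NAME START..END CDD:ID [M/N (P%)]
ROW = re.compile(
    r"(?P<name>\S+)\s+<?(?P<start>\d+)\.\.>?(?P<end>\d+)\s+(?P<ext>CDD:\d+)"
    r"(?:\s+(?P<m>\d+)/(?P<n>\d+)\s+\(\d+%\))?\s*$"
)


def parse_domain_row(line: str) -> Optional[DomainRow]:
    """Parse one domain row; return None for lines that are not domain rows."""
    match = ROW.search(line)
    if match is None:
        return None
    m, n = match.group("m"), match.group("n")
    return DomainRow(
        name=match.group("name"),
        start=int(match.group("start")),
        end=int(match.group("end")),
        external_id=match.group("ext"),
        matches=int(m) if m is not None else None,
        compared=int(n) if n is not None else None,
    )


print(parse_domain_row("EGF_CA 716..750 CDD:238011 13/33 (39%)"))
```

Because the pattern is applied with `search`, a gene and sequence prefix on the first row of each group is skipped and only the domain fields are captured.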


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       1             1.000           74 1.000 Domainoid score I8987
eggNOG          1             0.900           - - E33208_3BAFT
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      1             1.050           707 1.000 Inparanoid score I626
OMA             0             0.000           Not matched by this tool.
OrthoDB         1             1.010           - - D111153at2759
OrthoFinder     1             1.000           - - FOG0003041
OrthoInspector  1             1.000           - - otm44798
orthoMCL        1             0.900           - - OOG6_100271
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
SonicParanoid   1             1.000           - - X4671
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           10            9.770
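
The final row of the table appears to hold the column totals: 10 tools reporting a match and a summed weighted score of 9.770. The sketch below checks that arithmetic with the per-tool values copied from the table; `SCORES`, `simple_total`, and `weighted_total` are illustrative names only.

```python
# (tool, simple score, weighted score) copied from the table above.
SCORES = [
    ("Compara", 0, 0.000),
    ("Domainoid", 1, 1.000),
    ("eggNOG", 1, 0.900),
    ("Hieranoid", 0, 0.000),
    ("Homologene", 0, 0.000),
    ("Inparanoid", 1, 1.050),
    ("OMA", 0, 0.000),
    ("OrthoDB", 1, 1.010),
    ("OrthoFinder", 1, 1.000),
    ("OrthoInspector", 1, 1.000),
    ("orthoMCL", 1, 0.900),
    ("Panther", 0, 0.000),
    ("Phylome", 1, 0.910),
    ("SonicParanoid", 1, 1.000),
    ("SwiftOrtho", 1, 1.000),
    ("TreeFam", 0, 0.000),
]

# Number of tools that predicted the pair, and the sum of their weights.
simple_total = sum(simple for _, simple, _ in SCORES)
# Round to three decimals to hide binary floating-point noise.
weighted_total = round(sum(weighted for _, _, weighted in SCORES), 3)

print(f"simple total: {simple_total}, weighted total: {weighted_total:.3f}")
```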
