DRSC/TRiP Functional Genomics Resources

Protein Alignment crb and Notch4

DIOPT Version: 9

Sequence 1: NP_001247284.1  Gene: crb / 42896  FlyBaseID: FBgn0259685  Length: 2253  Species: Drosophila melanogaster
Sequence 2: NP_001002827.1  Gene: Notch4 / 406162  RGDID: 1303282  Length: 1961  Species: Rattus norvegicus


Alignment Length: 2101    Identity: 457/2101 (21%)
Similarity: 633/2101 (30%)    Gaps: 899/2101 (42%)
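
The percentages in this summary can be re-derived from the raw counts. The following is a minimal Python sketch, not part of DIOPT; `percent_floor` is a hypothetical helper name. It assumes the page truncates to a whole number rather than rounding, which is what the figures suggest: rounding would give 22% for identity and 43% for gaps.

```python
def percent_floor(count: int, length: int) -> int:
    """Whole-number percentage, truncated (assumed behaviour, not documented)."""
    return count * 100 // length


ALIGNMENT_LENGTH = 2101
counts = {"identity": 457, "similarity": 633, "gaps": 899}

percentages = {
    name: percent_floor(n, ALIGNMENT_LENGTH) for name, n in counts.items()
}
print(percentages)  # {'identity': 21, 'similarity': 30, 'gaps': 42}
```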


- Green residues have known domain annotations, which are detailed below.


  Fly   272 DPCMGHGTCSSSPEGY-ECRCTARYSGKNCQKDNGSPCAKNP-CENGGSC------LENSRG--- 325
            :||...|||....:|. .|:|...:.|:.||..:  ||.... |||||||      ..:|..   
  Rat    30 EPCANGGTCLRLSQGQGTCQCAPGFLGETCQFPD--PCWDTQLCENGGSCQALLPTAPSSHSPTS 92

  Fly   326 ----DYQCFCDPNHSGQHCETEVNIHPLCQTNPCLNNGACVVIGGSGALTCECPKGYAGARCEVD 386
                .:.|.|....:|..|::.  :..||..:.|.|.|.|.| ..||...|.|..|:.|.:|:: 
  Rat    93 PLTPHFSCTCPSGFTGDRCQSP--LEELCPPSFCSNGGHCSV-QVSGRPQCSCEPGWTGEQCQL- 153

  Fly   387 TDECASQPCQNNGSCIDRINGFSCDCSGTGYTGAFCQTNVDEC--DKNPCLNGGRCFDTYGWYTC 449
            .|.|::.||.|.|.|:.......|.|. ||:.|..|:.:|:||  :..||..|..|.:|.|.:.|
  Rat   154 RDFCSANPCANGGVCLATYPQIQCRCP-TGFEGHICERDVNECFLEPGPCPRGTSCHNTLGSFQC 217

  Fly   450 QCLDGWGGEICD-RPMTCQTQQCLNGGTCLDKPIG----FQCLCPPEYTGELCQIAPSCAQQCPI 509
            .|..|..|..|. |...|....|||||||...|.|    ..|||||.:||..|::.|        
  Rat   218 LCPVGQEGPQCKLRKGACLPGTCLNGGTCQLVPEGDTTFHLCLCPPGFTGLNCEMNP-------- 274

  Fly   510 DSECVGGKCVCKPGSSGPIGHCLPTTTTPTPEQEPTTTPRTTPNPNPAIPNTLTTTTKIPPITTS 574
                                                                             
  Rat   275 ----------------------------------------------------------------- 274

  Fly   575 RTLVGTTTGSRRPPQQPLQSPTQRSASLNACPQENCLNGGTCLGYSGNYSCICASGYTGYNCQTS 639
                                        :.|.:..|.||.||....|.|:|:|...:.|::|...
  Rat   275 ----------------------------DDCVRNQCQNGATCQDGLGTYTCLCPKTWKGWDCSED 311

  Fly   640 TGDGASALALTPINCNATN-GKCLNGGTC--SMNGTHCYCAVGYSGDRC-EKAENCSPLNCQEPM 700
            ..:           |.|.. .:|.|||||  |..|.||.|..|:.|:.| |..::|:...|....
  Rat   312 IDE-----------CEAQGPPRCRNGGTCQNSAGGFHCVCVSGWGGEGCDENLDDCAAATCALGS 365

  Fly   701 VCVQN----QCLCPENK---VC---NQCATQPCQNGGECVDLP-NGDYECKCTRGWTGRTCGNDV 754
            .|:..    .||||..:   :|   :.|..|||....:|...| .|...|.|..|::|.||..|:
  Rat   366 TCIDRVGSFSCLCPPGRTGLLCHLEDMCLRQPCHVNAQCSTNPLTGSTLCICQPGYSGPTCHQDL 430

  Fly   755 DECTL---HPKICGN-GICKNEKGSYKCYCTPGFTGVHCDSDVDECLSFPCLNGATCHNKINAYE 815
            |||.:   .|..|.: |.|.|..||:.|.|.||:||..|::|.:||||.||..|:||.:.:..::
  Rat   431 DECQMAQQGPSPCEHGGSCINTPGSFNCLCLPGYTGSRCEADHNECLSQPCHPGSTCLDLLATFQ 495

  Fly   816 CVCQPGYEGENCEVDIDECGSNPCSNGSTCIDRINNFTCNCIPGMTGRICDIDIDDCVGDPCLNG 880
            |:|.||.||..|||:|:||.||||.|.:.|.|::|.|.|.|:||.||..|:.|:|:|...||.||
  Rat   496 CLCPPGLEGRLCEVEINECASNPCLNQAACHDQLNGFLCLCLPGFTGARCEKDMDECSSAPCANG 560

  Fly   881 GQCIDQLGGFRCDCSGTGYEGENCELNIDECLSNPCTNGAKCLDRVKDYFCDCHNGYKGKNCEQD 945
            |.|.||.|.|.|:|. .|:||..||...|||.|:||..||.|||....:.|.|..|:.|:.||..
  Rat   561 GHCQDQPGAFHCECL-PGFEGPRCETEADECRSDPCPVGASCLDLPGAFLCLCRPGFTGQLCEVP 624

  Fly   946 INECESNPCQYNGNCLERSNITLYQMSRITDLPKVFSQPFSFENASGYECVCVPGIIGKNCEINI 1010
            :  |....||....|.::.:..                          .|:|..|..|  |....
  Rat   625 L--CSPILCQPGQQCQDQEHRA--------------------------PCLCPDGSPG--CVPAE 659

  Fly  1011 NECDSNPCSKHGNCNDGIGTYTCECEPGFEGTHCEINIDECDRYNPCQR-GTCYDQIDDYDCDCD 1074
            ::|   || .||:|...:    |.|..|:.|..||..:..| ...||.. |||:.|...|:|.|.
  Rat   660 DDC---PC-HHGHCQRSL----CVCNEGWTGPECETELGGC-LSTPCAHGGTCHPQPSGYNCSCL 715

  Fly  1075 ANYGGKNCSVLLKGCDQNPCLNGGACLPYLINEVTHLYNCTCENGFQGDKCEKTTTLSMVATSLI 1139
            |.|.|..||..:..|...||||||:|..:     ...|:|||.....|..|              
  Rat   716 AGYTGLTCSEEITACHSGPCLNGGSCSIH-----PEGYSCTCPPSHTGPHC-------------- 761

  Fly  1140 SVTTEREEGYDINLQFRTTLPNGVLAFGTTGEKNEPVSYILELINGRLNLHSSLLNKWEGVFIGS 1204
                                                                             
  Rat   762 ----------------------------------------------------------------- 761

  Fly  1205 KLNDSNWHKVFVAINTSHLVLSANDEQAIFPVGSYETANNSQPSFPRTYLGGTIPNLKSYLRHLT 1269
                                               :||                           
  Rat   762 -----------------------------------QTA--------------------------- 764

  Fly  1270 HQPSAFVGCMQDIMVNGKWIFPDEQDANISYTKLENVQSGCPRTEQCKPNPCHSNGECTDLWHTF 1334
                                                       .:.|....|.:.|.|.....||
  Rat   765 -------------------------------------------VDHCASASCLNGGTCMSKPGTF 786

  Fly  1335 ACHCPRPFFGHTCQHNMTAATFGHENTTHSAVIVETTDVARRAIRSILDISMFIRTREPTGQVFY 1399
            .|||...|.|..|:                                                   
  Rat   787 FCHCATGFQGLHCE--------------------------------------------------- 800

  Fly  1400 LGTDPRKAPTKNIGDSYVAAKLHGGELLVKMQFSGTPEAYTVGGQKLDNGYNHLIEVVRNQTLVQ 1464
                               .|:|                                          
  Rat   801 -------------------KKIH------------------------------------------ 804

  Fly  1465 VKLNGTEYFRKTLSTTGLLDAQVLYLGGPAPTRESLLGATTEPGIIPVPGAGIPIEDTTVPKEAD 1529
                                          |:                                 
  Rat   805 ------------------------------PS--------------------------------- 806

  Fly  1530 DSRDYFKGIIQDVKVSNGSLNLIVEMYSLNVTDVQVNAKPLGAVTIDRASVLPGEVSDDLCRKNP 1594
                                                                        |..||
  Rat   807 ------------------------------------------------------------CADNP 811

  Fly  1595 CLHNAECRNTWNDYTCKCPNGYKGKNCQE-IEFCQHVTCPGQSLCQNLDDGYECVTNTTFTGQER 1658
            |.:.|.|::|.....|.|..||.|.:||. |:.|....||..:.|......:.|:.:..:||...
  Rat   812 CRNKATCQDTPRGARCLCSPGYTGSSCQTLIDLCARKPCPHTARCLQSGPSFHCLCHQGWTGSLC 876

  Fly  1659 S-PLAFFYFQEQQSDDIVSEASPKQTLKPVIDIAFRTRAGGTLLYIDNVDGFFEIGVNGGRVTIT 1722
            . ||:             .:|:   .:...::|:...:.||  |.||....:|            
  Rat   877 DLPLS-------------CQAA---AMSQGVEISNLCQNGG--LCIDTGSSYF------------ 911

  Fly  1723 WKLSALHFGESARFEKENTDGEWSRIYLRAHNSKLEGGWKG--WESMVDPTPAFSTDIDQAAFQS 1785
                                            .:...|::|  .:..|:|               
  Rat   912 --------------------------------CRCPPGFEGKLCQDTVNP--------------- 929

  Fly  1786 LIATSTQVYLGGMPESRQARGSTLSAQQGSQFKGCVGEARVGDLLLPYFSMAELYSRTNVSVQQK 1850
                                                                             
  Rat   930 ----------------------------------------------------------------- 929

  Fly  1851 AQFRLNATRPEEGCILCFQSDCKNDGFCQSPSDEYACTCQPGFEGDDCGTDIDECLNTECLNNGT 1915
                            |....|.:...|....:.|.|.|.||:||.:|....|.|.:..|.|:||
  Rat   930 ----------------CTSKPCLHGATCVPQPNGYVCQCAPGYEGQNCSKVHDACQSGPCHNHGT 978

  Fly  1916 CINQVAAFFCQCQPGFEGQHCEQNIDECADQPCHNGG--NCTDLIASYVCDCPEDYMGPQCDVLK 1978
            |..:...|.|.|.|||.|..||.::|||.|:|||..|  :|..|..::.|.|...:.|.:|:|..
  Rat   979 CTPRPGGFHCACPPGFVGLRCEGDVDECLDRPCHPSGTASCHSLANAFYCQCLPGHTGQRCEVEM 1043

  Fly  1979 QMTCENEPCRNGSTCQNGFNASTG--NNFTCTCVPGFEGPLCD--IPFCEITPCDNGGLCLTT-- 2037
            .: |:::||.||.:|:    .:||  ..|||.|..|||||.|.  .|.|....|.||||||.:  
  Rat  1044 DL-CQSQPCSNGGSCE----VTTGPPPGFTCRCPEGFEGPTCSRKAPACGNHHCHNGGLCLPSPK 1103

  Fly  2038 -GAVPMCKCSLGYTGRLCEQDINEC----------ESNPCQNGGQCKDLVG----RYECDCQGTG 2087
             |:.|:|.|..|:.|       .:|          ..:||.:.|.|.:..|    .::|.|....
  Rat  1104 PGSPPLCACLSGFGG-------PDCLTPPAPPGCGPPSPCMHNGSCTETPGLGNPGFQCTCPPDS 1161

  Fly  2088 FEGIRCENDIDECNMEGDYCGGLGRCFNKPGSFQCI--CQKP---YCGAYCNF--TDP---CNAT 2142
             .|.||:..            |...|..:.|...|.  |..|   :.|..|:.  .||   |...
  Rat  1162 -PGPRCQRP------------GANGCEGRGGDGACDAGCSGPGGDWDGGDCSLGVPDPWKGCPPH 1213

  Fly  2143 DLC---SNGGRCVESCGAK----PDYYCECPE----------------------------GFAGK 2172
            ..|   ...|||...|.::    ..|.||.|.                            |:.|.
  Rat  1214 SQCWLLFRDGRCHPQCDSEECLFDGYDCEIPPTCTPAYDQYCRDHFHNGHCEKGCNNAQCGWDGG 1278

  Fly  2173 NCTAPITAKEDGPSTTDIAIIVIPVVVVLLLIAGALLGTFLVM-------------------ARN 2218
            :|.......|.|||...:.::..|.:...||....:|...|.:                   .|.
  Rat  1279 DCRPEGDDSEGGPSLALLVVLSPPALDQQLLALARVLSLTLRVGLWVRKDSEGRNMVFPYPGTRA 1343

  Fly  2219 KRATRGTYSPSAQEYCNPRLE 2239
            |....||...|:.|...|..:
  Rat  1344 KEELSGTRDSSSWERQAPHTQ 1364
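
Identity and gap counts like those in the summary can be tallied column by column from the two aligned rows of each block. The sketch below is illustrative only: `column_stats` is a hypothetical helper, it treats `-` as the gap character as in the alignment above, and it does not compute similarity, which would need the substitution matrix used by the aligner (not stated on this page). The example input is the first Fly/Rat block.

```python
def column_stats(seq_a: str, seq_b: str) -> dict:
    """Count identical and gapped columns between two aligned rows."""
    identical = 0
    gapped = 0
    for a, b in zip(seq_a, seq_b):
        if a == "-" or b == "-":
            gapped += 1          # a gap in either sequence
        elif a == b:
            identical += 1       # same residue in both sequences
    return {
        "length": min(len(seq_a), len(seq_b)),
        "identical": identical,
        "gapped": gapped,
    }


# First block of the alignment (Fly 272-325, Rat 30-92).
fly = "DPCMGHGTCSSSPEGY-ECRCTARYSGKNCQKDNGSPCAKNP-CENGGSC------LENSRG---"
rat = "EPCANGGTCLRLSQGQGTCQCAPGFLGETCQFPD--PCWDTQLCENGGSCQALLPTAPSSHSPTS"
print(column_stats(fly, rat))
```

Summing these per-block counts over every block would give totals comparable to the Identity and Gaps figures reported for the full alignment.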

Known Domains:


Indicated by green residues in the alignment.

Gene Sequence Domain Region External ID Identity
crb NP_001247284.1 LamG 91..228 CDD:238058
EGF_CA 386..423 CDD:238011 12/36 (33%)
EGF_CA 425..460 CDD:238011 13/36 (36%)
EGF 466..495 CDD:278437 16/32 (50%)
EGF 605..633 CDD:278437 10/27 (37%)
EGF_CA 716..750 CDD:238011 11/34 (32%)
EGF_CA 753..789 CDD:238011 17/39 (44%)
EGF_CA 792..828 CDD:238011 16/35 (46%)
EGF_CA 830..865 CDD:238011 18/34 (53%)
EGF_CA 868..905 CDD:238011 18/36 (50%)
EGF_CA 907..943 CDD:238011 15/35 (43%)
EGF_CA 1009..1045 CDD:238011 10/35 (29%)
EGF_CA 1047..1082 CDD:238011 13/35 (37%)
Laminin_G_1 1155..1290 CDD:278483 2/134 (1%)
EGF 1316..1346 CDD:278437 11/29 (38%)
Laminin_G_1 1388..1550 CDD:278483 3/161 (2%)
EGF_CA <1593..1622 CDD:238011 11/28 (39%)
Laminin_G_2 1692..1828 CDD:280389 10/137 (7%)
EGF_CA 1901..1937 CDD:238011 14/35 (40%)
EGF_CA 1939..1974 CDD:238011 13/36 (36%)
EGF_CA 2057..2094 CDD:238011 10/50 (20%)
EGF_CA 2096..2133 CDD:238011 7/41 (17%)
EGF_CA 2137..2175 CDD:238011 14/75 (19%)
Notch4 NP_001002827.1 EGF_CA 191..>224 CDD:284955 12/32 (38%)
EGF_CA 311..349 CDD:238011 15/48 (31%)
EGF_CA 352..387 CDD:238011 7/34 (21%)
EGF_CA 429..470 CDD:238011 17/40 (43%)
EGF_CA 472..508 CDD:238011 16/35 (46%)
EGF_CA 511..546 CDD:238011 18/34 (53%)
EGF_CA 548..584 CDD:238011 18/36 (50%)
EGF_CA 588..622 CDD:238011 15/33 (45%)
EGF_CA <696..723 CDD:238011 12/26 (46%)
EGF_CA 765..800 CDD:238011 11/34 (32%)
EGF_CA <810..839 CDD:238011 11/28 (39%)
EGF_CA 892..924 CDD:238011 9/77 (12%)
EGF_CA 927..961 CDD:238011 12/129 (9%)
EGF_CA 966..1000 CDD:238011 14/33 (42%)
EGF_CA 1002..1040 CDD:238011 13/37 (35%)
Notch 1207..1242 CDD:278494 7/34 (21%)
Notch 1245..1281 CDD:278494 2/35 (6%)
NOD 1291..1337 CDD:284282 7/45 (16%)
NODP 1376..>1415 CDD:284987
Ank_2 <1571..1656 CDD:289560
ANK 1625..1732 CDD:238125
ANK repeat 1626..1656 CDD:293786
Ank_2 1630..1723 CDD:289560
ANK repeat 1658..1690 CDD:293786
ANK 1687..1811 CDD:238125
ANK repeat 1692..1723 CDD:293786
Ank_2 1697..1789 CDD:289560
ANK repeat 1725..1756 CDD:293786
ANK repeat 1758..1789 CDD:293786
ANK repeat 1791..1827 CDD:293786
Blue background indicates that the domain is not in the aligned region.
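
Each domain row above follows the pattern "domain, region, CDD identifier, optional identity". A hedged Python sketch for parsing one such row is shown here; `parse_domain_row` and the field names are hypothetical, and the `<` / `>` markers on a region are assumed to flag a partial domain hit. Rows that open a gene section also carry the gene symbol and accession in front of the domain name, and this sketch does not split those off.

```python
import re
from typing import Optional

# domain name, start..end (optionally prefixed by < or >), CDD id,
# then an optional "identical/compared (percent%)" identity field.
ROW = re.compile(
    r"^(?P<domain>.+?)\s+"
    r"(?P<open_start><)?(?P<start>\d+)\.\.(?P<open_end>>)?(?P<end>\d+)\s+"
    r"CDD:(?P<cdd>\d+)"
    r"(?:\s+(?P<identical>\d+)/(?P<compared>\d+)\s+\((?P<percent>\d+)%\))?$"
)


def parse_domain_row(line: str) -> Optional[dict]:
    """Parse one domain row; return None if the line does not match."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    identity = None
    if m.group("identical") is not None:
        identity = (int(m.group("identical")), int(m.group("compared")))
    return {
        "domain": m.group("domain"),
        "start": int(m.group("start")),
        "end": int(m.group("end")),
        "truncated": bool(m.group("open_start") or m.group("open_end")),
        "cdd": m.group("cdd"),
        "identity": identity,  # None when the row has no identity field
    }


print(parse_domain_row("EGF_CA 386..423 CDD:238011 12/36 (33%)"))
print(parse_domain_row("NODP 1376..>1415 CDD:284987"))
```

Rows without an identity field parse with `identity` set to `None`, which matches the rows the page describes as lying outside the aligned region.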


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          0             0.000           Not matched by this tool.
Hieranoid       1             1.000           - -
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        1             0.900           - - OOG6_100271
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           4             3.810
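
The last row of the table reads as the column totals: 4 tools matched, with a weighted score of 3.810. A short Python check using the per-tool values listed above follows; it assumes, as the table suggests, that every matching tool contributes a simple score of 1.

```python
# Weighted scores of the tools that report a match, copied from the table.
matched = {
    "Hieranoid": 1.000,
    "orthoMCL": 0.900,
    "Phylome": 0.910,
    "SwiftOrtho": 1.000,
}

simple_total = len(matched)                       # one point per matching tool
weighted_total = round(sum(matched.values()), 3)  # round away float noise

print(simple_total, f"{weighted_total:.3f}")  # 4 3.810
```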
