DRSC/TRiP Functional Genomics Resources

Protein Alignment crb and Notch1

DIOPT Version: 9

Sequence 1: NP_001247284.1  Gene: crb / 42896  FlyBaseID: FBgn0259685  Length: 2253  Species: Drosophila melanogaster
Sequence 2: NP_001099191.1  Gene: Notch1 / 25496  RGDID: 3187  Length: 2531  Species: Rattus norvegicus


Alignment Length: 2070  Identity: 534/2070 (25%)
Similarity: 700/2070 (33%)  Gaps: 790/2070 (38%)
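
The three summary percentages are each a column count divided by the alignment length. The sketch below reproduces them under one assumption that is inferred from the numbers rather than stated on the page: the values look truncated, not rounded (534/2070 is about 25.8%, yet 25% is shown). The function name is hypothetical.

```python
def truncated_percent(count: int, length: int) -> int:
    """Return count/length as a whole-number percentage, truncated (assumed behaviour)."""
    if length <= 0:
        # Guard against an empty alignment instead of dividing by zero.
        raise ValueError("alignment length must be positive")
    return (100 * count) // length


ALIGNMENT_LENGTH = 2070
# Column counts copied from the summary lines above.
summary = {"identity": 534, "similarity": 700, "gaps": 790}

percents = {
    name: truncated_percent(count, ALIGNMENT_LENGTH)
    for name, count in summary.items()
}
print(percents)
```

Integer division keeps the arithmetic exact and avoids floating-point edge cases near whole-number boundaries.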


- Green bases have known domain annotations that are detailed below.


  Fly   223 SGCLLDGPGLQFVNNSTVQNVVFG-----HCPLTPGPC-------------SDH----DLFTRLP 265
            ||..|:| |...|.|.|...|..|     .|. .|.||             .||    |.....|
  Rat    28 SGTCLNG-GRCEVANGTEACVCSGAFVGQRCQ-DPSPCLSTPCKNAGTCYVVDHGGIVDYACSCP 90

  Fly   266 ------------DNFCLNDPCMGHGTCS-SSPEGYECRCTARYSGKNCQKDNGSPCAKNPCENGG 317
                        .|.||.:||...|||. .:...|:|||...:|||:||:  ..|||.|||.|||
  Rat    91 LGFSGPLCLTPLANACLANPCRNGGTCDLLTLTEYKCRCPPGWSGKSCQQ--ADPCASNPCANGG 153

  Fly   318 SCL---------------------------------------ENSRGDYQCFCDPNHSGQHCETE 343
            .||                                       .|..|.|:|.|...|:|.|||..
  Rat   154 QCLPFESSYICGCPPGFHGPTCRQDVNECSQNPGLCRHGGTCHNEIGSYRCACRATHTGPHCELP 218

  Fly   344 VNIHPLCQTNPCLNNGACVVIGGSGALTCECPKGYAGARCEVDTDECASQPCQNNGSCIDRINGF 408
               :..|..:||.|.|.|...|.: ...|.|..|:||..||.:.|:|....|:|.|:|:|.:|.:
  Rat   219 ---YVPCSPSPCQNGGTCRPTGDT-THECACLPGFAGQNCEENVDDCPGNNCKNGGACVDGVNTY 279

  Fly   409 SCDCSGTGYTGAFCQTNVDECD--KNPCLNGGRCFDTYGWYTCQCLDGWGGEICDRPM-TCQTQQ 470
            :|.|. ..:||.:|..:||||.  .|.|.|||.|.:::|.|.|.|::||.||.|...: .|.:..
  Rat   280 NCRCP-PEWTGQYCTEDVDECQLMPNACQNGGTCHNSHGGYNCVCVNGWTGEDCSENIDDCASAA 343

  Fly   471 CLNGGTCLDKPIGFQCLCPPEYTGELCQIAPSC-AQQCPIDSEC----VGGK--CVCKPGSSGPI 528
            |..|.||.|:...|.|.||...||.||.:..:| :..|...|.|    |.||  |.|..|.:||.
  Rat   344 CFQGATCHDRVASFYCECPHGRTGLLCHLNDACISNPCNEGSNCDTNPVNGKAICTCPSGYTGPA 408

  Fly   529 GHCLPTTTTPTPEQEPTTTPRTTPNPNPAIPNTLTTTTKIPPITTSRTLVGTTTGSRRPPQQPLQ 593
              |                                                              
  Rat   409 --C-------------------------------------------------------------- 409

  Fly   594 SPTQRSASLNACPQENCLNGGTCLGYSGNYSCICASGYTGYNCQTSTGDGASALALTPINCNATN 658
            |......:|.|.|   |.:.|.||...|::.|.|..||||..|:....:             ..:
  Rat   410 SQDVDECALGANP---CEHAGKCLNTLGSFECQCLQGYTGPRCEIDVNE-------------CIS 458

  Fly   659 GKCLNGGTC--SMNGTHCYCAVGYSGDRCEKAENCSPLNCQEPMVCVQNQCLCPENKVCNQCATQ 721
            ..|.|..||  .:....|.|..||.|..||       :|..|                   ||:.
  Rat   459 NPCQNDATCLDQIGEFQCICMPGYEGVYCE-------INTDE-------------------CASS 497

  Fly   722 PCQNGGECVDLPNGDYECKCTRGWTGRTCGNDVDECTLHPKICGNGI-CKNEKGSYKCYCTPGFT 785
            ||.:.|.|||..| ::.|:|.:|::|..|..|||||...|  |.||. |.:...:|.|.||.|:|
  Rat   498 PCLHNGRCVDKIN-EFLCQCPKGFSGHLCQYDVDECASTP--CKNGAKCLDGPNTYTCVCTEGYT 559

  Fly   786 GVHCDSDVDECLSFPCLNGATCHNKINAYECVCQPGYEGENCEVDIDECGSNPCSNGSTCIDRIN 850
            |.||:.|:|||...||..| .|.:.:..:.|:|||||.|.:||.:|:||.|.||.:|.||.||.|
  Rat   560 GTHCEVDIDECDPDPCHYG-LCKDGVATFTCLCQPGYTGHHCETNINECHSQPCRHGGTCQDRDN 623

  Fly   851 NFTCNCIPGMTGRICDIDIDDCVGDPCLNGGQCIDQLGGFRCDCSGTGYEGENCELNIDECLSNP 915
            .:.|.|:.|.||..|:|::|||..:|| :.|.|:|::.|:.|.|. .||.|..|.:|||||..:|
  Rat   624 YYLCLCLKGTTGPNCEINLDDCASNPC-DSGTCLDKIDGYECACE-PGYTGSMCNVNIDECAGSP 686

  Fly   916 CTNGAKCLDRVKDYFCDCHNGYKGKNCEQDINECESNPCQYNGNCLERSNITLYQMSRITDLPKV 980
            |.||..|.|.:..:.|.|..||....|..::|||.|||| .:|.|.:..|               
  Rat   687 CHNGGTCEDGIAGFTCRCPEGYHDPTCLSEVNECNSNPC-IHGACRDGLN--------------- 735

  Fly   981 FSQPFSFENASGYECVCVPGIIGKNCEININECDSNPCSKHGNCNDGIGTYTCECEPGFEGTHCE 1045
                       ||:|.|.||..|.||:||.|||:||||...|.|.|....|.|.|..||.|.:|:
  Rat   736 -----------GYKCDCAPGWSGTNCDINNNECESNPCVNGGTCKDMTSGYVCTCREGFSGPNCQ 789

  Fly  1046 INIDECDRYNPC-QRGTCYDQIDDYDCDCDANYGGKNCSVLLKGCDQNPCLNGGACLPYLINEVT 1109
            .||:|| ..||| .:|||.|.:..|.|:|...|.|..|.|:|..|..:||.|.|.|..   :|..
  Rat   790 TNINEC-ASNPCLNQGTCIDDVAGYKCNCPLPYTGATCEVVLAPCATSPCKNSGVCKE---SEDY 850

  Fly  1110 HLYNCTCENGFQGDKCEKTTTLSMVATSLISVTTEREEGYDINLQFRTTLPNGVLAFGTTGEKNE 1174
            ..::|.|..|:||..||                      .|||...::...:|.....|.|    
  Rat   851 ESFSCVCPTGWQGQTCE----------------------IDINECVKSPCRHGASCQNTNG---- 889

  Fly  1175 PVSYILELINGRLNLHSSLLNKWEGVFIGSKLNDSNWHKVFVAINTSHLVLSANDEQAIFPVGSY 1239
              ||                                                    :.:...|  
  Rat   890 --SY----------------------------------------------------RCLCQAG-- 898

  Fly  1240 ETANNSQPSFPRTYLGGTIPNLKSYLRHLTHQPSAFVGCMQDIMVNGKWIFPDEQDANISYTKLE 1304
                         |.|.                    .|..||                      
  Rat   899 -------------YTGR--------------------NCESDI---------------------- 908

  Fly  1305 NVQSGCPRTEQCKPNPCHSNGECTDLWHTFACHCPRPFFGHTCQHNMTAATFGHENTTHSAVIVE 1369
                     :.|:|||||:.|.|||..:...|.|...|.|..|:.::....   .|...:.  ..
  Rat   909 ---------DDCRPNPCHNGGSCTDGVNAAFCDCLPGFQGAFCEEDINECA---SNPCQNG--AN 959

  Fly  1370 TTDVARRAIRSILDISMFIRTREPTGQVFYLGTDPRKAPTKNIGDSYVAAKLHGGELLVKMQFSG 1434
            .||..                                       |||...               
  Rat   960 CTDCV---------------------------------------DSYTCT--------------- 970

  Fly  1435 TPEAYTVGGQKLDNGYNHLIEVVRNQTLVQVKLNGTEYFRKTLSTTGLLDAQVLYLGGPAPTRES 1499
            .|..:        ||.:     ..|.|                               |..|..|
  Rat   971 CPTGF--------NGIH-----CENNT-------------------------------PDCTESS 991

  Fly  1500 LL-GATTEPGI-----IPVPGAGIPIEDTTVPKEADDSRDYFKGIIQDVKVSNGSLNLIVEMYSL 1558
            .. |.|...||     :..||                    |.|                     
  Rat   992 CFNGGTCVDGINSFTCLCPPG--------------------FTG--------------------- 1015

  Fly  1559 NVTDVQVNAKPLGAVTIDRASVLPGEVSDDLCRKNPCLHNAECRNTWNDYTCKCPNGYKGKNCQE 1623
                                |....:|::  |...||||...|::::..|.|.||.||.|.|||.
  Rat  1016 --------------------SYCQYDVNE--CDSRPCLHGGTCQDSYGTYKCTCPQGYTGLNCQN 1058

  Fly  1624 -IEFCQHVTCPGQSLCQNLDDGYECVTNTTFTGQERSPLAFFYFQEQQSDDIVSEASPKQTLKPV 1687
             :.:|....|.....|...:..|.|                                        
  Rat  1059 LVRWCDSAPCKNGGKCWQTNTQYHC---------------------------------------- 1083

  Fly  1688 IDIAFRTRAGGTLLYIDNVDGFFEIGVNGGRVTITWKLSALHFGESARFEKENTDGEWSRIYLRA 1752
                                                                             
  Rat  1084 ----------------------------------------------------------------- 1083

  Fly  1753 HNSKLEGGWKGWESMVDPTPAFSTDIDQAAFQSLIATSTQVYLGGMPESRQARGSTLSAQQGSQF 1817
               :...||.|          |:.|:        ::.|.:|       :.|.||..::       
  Rat  1084 ---ECRSGWTG----------FNCDV--------LSVSCEV-------AAQKRGIDVT------- 1113

  Fly  1818 KGCVGEARVGDLLLPYFSMAELYSRTNVSVQQKAQFRLNATRPEEGCILCFQSDCKNDGFCQSPS 1882
                        ||                                        |::.|.|....
  Rat  1114 ------------LL----------------------------------------CQHGGLCVDEE 1126

  Fly  1883 DEYACTCQPGFEGDDCGTDIDECLNTECLNNGTCINQVAAFFCQCQPGFEGQHCEQNIDECADQP 1947
            |::.|.||.|:.|..|..::|||....|.|..||.:.:..|.|:|..|:.|.:|.:.|:||..||
  Rat  1127 DKHYCHCQAGYTGSYCEDEVDECSPNPCQNGATCTDYLGGFSCKCVAGYHGSNCSEEINECLSQP 1191

  Fly  1948 CHNGGNCTDLIASYVCDCPEDYMGPQCDVLKQMTC--------ENEPCRNGSTC--QNGFNASTG 2002
            |.|||.|.||..:|.|.||....|..|:: ....|        .:..|.|..||  |.|      
  Rat  1192 CQNGGTCIDLTNTYKCSCPRGTQGVHCEI-NVDDCHPPLDPASRSPKCFNNGTCVDQVG------ 1249

  Fly  2003 NNFTCTCVPGFEGPLC--DIPFCEITPCDNGGLCLTTGAVPM-----CKCSLGYTGRLCEQDINE 2060
             .:||||.|||.|..|  |:..|...|||..|   |...|..     |:|..|:|||.||..||.
  Rat  1250 -GYTCTCPPGFVGERCEGDVNECLSNPCDPRG---TQNCVQRVNDFHCECRAGHTGRRCESVING 1310

  Fly  2061 CESNPCQNGGQC---KDLVGRYECDCQGTGFEGIRCENDIDECNMEGDY-CGGLGRCFNKPGSFQ 2121
            |...||:|||.|   .:....:.|.|. .||||..||||...|   |.. |...|.|.:.|.|..
  Rat  1311 CRGKPCRNGGVCAVASNTARGFICRCP-AGFEGATCENDARTC---GSLRCLNGGTCISGPRSPT 1371

  Fly  2122 CICQKPYCGAYCNF--TDPCNATDLCSNGGRCVESCGAKPDYYCECPEGFAGKNC 2174
            |:|...:.|..|.|  :.||..::.|.|.|.| |.....|.|.|.||..|.|..|
  Rat  1372 CLCLGSFTGPECQFPASSPCVGSNPCYNQGTC-EPTSESPFYRCLCPAKFNGLLC 1425
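
The Identity and Gaps totals reported above the alignment can be understood as per-column counts over the two aligned rows. As a rough sketch (not the tool's own code, and with a hypothetical function name), the counts for any slice of the alignment can be recomputed from the Fly and Rat rows alone, without relying on the symbols in the middle line. The example uses the first ten columns of the first block.

```python
def column_stats(row1: str, row2: str) -> dict:
    """Count identical and gapped columns in two equally long aligned rows."""
    if len(row1) != len(row2):
        raise ValueError("aligned rows must have the same length")
    # A column is identical when both rows hold the same residue (not a gap).
    identical = sum(1 for a, b in zip(row1, row2) if a == b and a != "-")
    # A column is gapped when either row holds the gap character '-'.
    gaps = sum(1 for a, b in zip(row1, row2) if a == "-" or b == "-")
    return {"length": len(row1), "identical": identical, "gaps": gaps}


# First ten alignment columns of the Fly (from 223) and Rat (from 28) rows.
fly = "SGCLLDGPGL"
rat = "SGTCLNG-GR"
stats = column_stats(fly, rat)
print(stats)
```

Similarity is not computed here because it depends on a substitution matrix that the page does not name.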

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence        Domain       Region       External ID  Identity
crb     NP_001247284.1  LamG         91..228      CDD:238058   2/4 (50%)
                        EGF_CA       386..423     CDD:238011   12/36 (33%)
                        EGF_CA       425..460     CDD:238011   18/36 (50%)
                        EGF          466..495     CDD:278437   11/28 (39%)
                        EGF          605..633     CDD:278437   10/27 (37%)
                        EGF_CA       716..750     CDD:238011   13/33 (39%)
                        EGF_CA       753..789     CDD:238011   17/36 (47%)
                        EGF_CA       792..828     CDD:238011   15/35 (43%)
                        EGF_CA       830..865     CDD:238011   17/34 (50%)
                        EGF_CA       868..905     CDD:238011   14/36 (39%)
                        EGF_CA       907..943     CDD:238011   15/35 (43%)
                        EGF_CA       1009..1045   CDD:238011   17/35 (49%)
                        EGF_CA       1047..1082   CDD:238011   16/35 (46%)
                        Laminin_G_1  1155..1290   CDD:278483   11/134 (8%)
                        EGF          1316..1346   CDD:278437   14/29 (48%)
                        Laminin_G_1  1388..1550   CDD:278483   19/167 (11%)
                        EGF_CA       <1593..1622  CDD:238011   13/28 (46%)
                        Laminin_G_2  1692..1828   CDD:280389   10/135 (7%)
                        EGF_CA       1901..1937   CDD:238011   12/35 (34%)
                        EGF_CA       1939..1974   CDD:238011   17/34 (50%)
                        EGF_CA       2057..2094   CDD:238011   15/39 (38%)
                        EGF_CA       2096..2133   CDD:238011   11/37 (30%)
                        EGF_CA       2137..2175   CDD:238011   15/38 (39%)
Notch1  NP_001099191.1  EGF_CA       142..175     CDD:238011   11/32 (34%)
                        EGF_CA       178..216     CDD:238011   8/37 (22%)
                        EGF_CA       257..293     CDD:238011   12/36 (33%)
                        EGF_CA       295..332     CDD:238011   18/36 (50%)
                        EGF_CA       335..370     CDD:238011   12/34 (35%)
                        EGF_CA       412..450     CDD:238011   14/40 (35%)
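
Each domain row above carries a domain name, a residue range, a CDD identifier and an identity fraction. A minimal parsing sketch follows, assuming rows keep the `name start..end CDD:id n/m (p%)` shape seen here. The `DomainRow` type and `parse_domain_row` function are hypothetical, and reading a leading `<` as a partial domain match is an assumption, not something the page states.

```python
import re
from typing import NamedTuple, Optional


class DomainRow(NamedTuple):
    domain: str
    start: int
    end: int
    partial: bool      # assumption: '<' or '>' marks a truncated domain hit
    external_id: str
    identical: int
    compared: int


# name, optional '<', start, '..', optional '>', end, CDD id, n/m, (p%)
ROW = re.compile(
    r"(\S+)\s+(<?)(\d+)\.\.(>?)(\d+)\s+(CDD:\d+)\s+(\d+)/(\d+)\s+\(\d+%\)"
)


def parse_domain_row(line: str) -> Optional[DomainRow]:
    """Parse one domain row; return None for lines that are not domain rows."""
    match = ROW.search(line)
    if match is None:
        return None
    domain, lt, start, gt, end, ext_id, identical, compared = match.groups()
    return DomainRow(
        domain, int(start), int(end), bool(lt or gt),
        ext_id, int(identical), int(compared),
    )


row = parse_domain_row("EGF_CA <1593..1622 CDD:238011 13/28 (46%)")
print(row)
```

Using `search` rather than `match` lets the same pattern skip a gene name and accession that precede the domain on the first row of each gene.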