DRSC/TRiP Functional Genomics Resources

Protein Alignment crb and Fbn2

DIOPT Version: 10

Sequence 1: NP_001247284.1  Gene: crb / 42896  FlyBaseID: FBgn0259685  Length: 2253  Species: Drosophila melanogaster
Sequence 2: NP_034311.2  Gene: Fbn2 / 14119  MGIID: 95490  Length: 2907  Species: Mus musculus


Alignment Length: 2552  Identity: 583/2552 (22%)
Similarity: 833/2552 (32%)  Gaps: 962/2552 (37%)
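
The summary percentages can be reproduced from the raw counts. The sketch below is only an illustration, not DIOPT's own code, and the function name `percent_floor` is hypothetical. It assumes the displayed figures are truncated to whole numbers, which is what the values above suggest (583/2552 is about 22.8% but is shown as 22%).

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated toward zero.

    Truncation is an assumption inferred from the displayed values,
    not documented DIOPT behaviour.
    """
    return count * 100 // total


ALIGNMENT_LENGTH = 2552
summary = {"Identity": 583, "Similarity": 833, "Gaps": 962}

for label, count in summary.items():
    pct = percent_floor(count, ALIGNMENT_LENGTH)
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} ({pct}%)")
```

All three ratios use the alignment length (2552 columns) as the denominator, not the length of either protein.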


- Green residues have known domain annotations that are detailed below.


  Fly   208 DIGYKDAILILGNSFSGCLLDGP---GLQFVNNSTVQNVVFGHCPLTPGPCSDHDLFTRLPDNFC 269
            |.|:...:   |.:..|....||   ||..:|.             |...|..|       .|.|
Mouse   457 DNGFSPGV---GGAGVGAGGQGPIITGLTILNQ-------------TIDICKHH-------ANLC 498

  Fly   270 LNDPCMGHGTCSSSPEGYECRCTARYSGKNCQKDNG-----SPCAKNPCENGGSCLENSRGDYQC 329
            ||      |.|..:...|.|.|...|.    |..||     ..|..|||.| |.|: |:.|.|.|
Mouse   499 LN------GRCIPTVSSYRCECNMGYK----QDANGDCIDVDECTSNPCSN-GDCV-NTPGSYYC 551

  Fly   330 FCDPNHSG-QHCETE---VNIHPLCQTNPCLNNGACVVIGGSGALTCECPKGYA----GARCEVD 386
            .|   |:| |...|:   ::|....|......||.||  ...|:..|.|..|:.    |..| ||
Mouse   552 KC---HAGFQRTPTKQACIDIDECIQNGVLCKNGRCV--NTDGSFQCICNAGFELTTDGKNC-VD 610

  Fly   387 TDECASQPCQNNGSCIDRINGFSCDCSGTGY----TGAFCQTNVDECDKNP--CLNGGRCFDTYG 445
            .|||.:.....||.||:....|.|.|. .|:    .|.:| |:|||| :.|  |:| |.|.:..|
Mouse   611 HDECTTTNMCLNGMCINEDGSFKCVCK-PGFILAPNGRYC-TDVDEC-QTPGICMN-GHCINNEG 671

  Fly   446 WYTCQCLDGWG----GEIC-DRPM--TCQTQQCLNGGTCLDKPI-----GFQCLCP-PEY-TGEL 496
            .:.|.|..|..    |.:| |..|  ||..:  :..|.|: :|.     ..:|.|. |:| .||.
Mouse   672 SFRCDCPPGLAVGVDGRVCVDTHMRSTCYGE--IKKGVCV-RPFPGAVTKSECCCANPDYGFGEP 733

  Fly   497 CQIAPS---------CA------------QQCPIDSE-CVGG---------KCVCKPGSSGPIGH 530
            ||..|:         |:            .:|.:|.: |..|         :|.|..|       
Mouse   734 CQPCPAKNSAEFHGLCSSGIGITVDGRDINECALDPDICANGICENLRGSYRCNCNSG------- 791

  Fly   531 CLPTTTTPTPEQEPTTTPRTTPNPNPAIPNTLTTTTKIPPITTSRTLVGTTTGSRR---PPQQPL 592
                       .||..:.|...:.:..:.|.|         .....|...|.||..   ||....
Mouse   792 -----------YEPDASGRNCIDIDECLVNRL---------LCDNGLCRNTPGSYSCTCPPGYVF 836

  Fly   593 QSPTQRSASLNACPQENCLNGGTCLGYSGNYSCICASG----YTGYNCQTSTGDGASALALTPIN 653
            ::.|:....:|.|....|:| |.|....|::.|.|:.|    .||..|..|. .|...|.:....
Mouse   837 RTETETCEDVNECESNPCVN-GACRNNLGSFHCECSPGSKLSSTGLICIDSL-KGTCWLNIQDNR 899

  Fly   654 CNAT-NGKCLNGGTCSMNGTHCYCAVGYSGDRCE------------KAENCSPLN-CQE-PMVCV 703
            |... ||..|....|:..|    .|.|...:|||            |...|..:| |:. |.||.
Mouse   900 CEVNINGATLKSECCATLG----AAWGSPCERCELDAACPRGFARIKGVTCEDVNECEVFPGVCP 960

  Fly   704 QNQCL---------CPE-------NKVC------------------------------------- 715
            ..:|:         |||       .:||                                     
Mouse   961 NGRCVNSKGSFHCECPEGLTLDGTGRVCLDIRMEHCFLKWDEDECIHPVPGKFRMDACCCAVGAA 1025

  Fly   716 --NQCATQP----------CQNG-------------------GECVDLPN-----------GDYE 738
              .:|...|          |..|                   .||...|.           |.::
Mouse  1026 WGTECEECPKPGTKEYETLCPRGPGFANRGDILTGRPFYKDINECKAFPGMCTYGKCRNTIGSFK 1090

  Fly   739 CKCTRGWT----GRTCGNDVDECTLHPKICGNGICKNEKGSYKCYCTPGFTG-----VHCDSDVD 794
            |:|..|:.    .|.| .|:|||.:.|.:||:|||.|..||::|.|..|:..     .:| .|:|
Mouse  1091 CRCNNGFALDMEERNC-TDIDECRISPDLCGSGICVNTPGSFECECFEGYESGFMMMKNC-MDID 1153

  Fly   795 ECLSFPCL-NGATCHNKINAYECVCQPGYE----GENCEVDIDEC--GSNPCSNGSTCIDRINNF 852
            ||...|.| .|.||.|...:::|.|..|:|    .|:| |||:||  ..|.|.||. |::.|..:
Mouse  1154 ECERNPLLCRGGTCVNTEGSFQCDCPLGHELSPSREDC-VDINECSLSDNLCRNGK-CVNMIGTY 1216

  Fly   853 TCNCIPGMTG---RICDIDIDDCVGDPCLNGG---QCIDQLGGFRCDCSGTGY----EGENCELN 907
            .|:|.||...   |....|||:|:   .:|||   ||.:..|.:.|.|| .||    :|.:| .:
Mouse  1217 QCSCNPGYQATPDRQGCTDIDECM---IMNGGCDTQCTNSEGSYECSCS-EGYALMPDGRSC-AD 1276

  Fly   908 IDECLSNP--CTNGAKCLDRVKDYFCDCHNGYKG----KNCEQDINECESNP--CQYNGNCLERS 964
            ||||.:||  | :|.:|.:...:|.|.|::|:..    |.| .|:|||:.||  |.: |.|....
Mouse  1277 IDECENNPDIC-DGGQCTNIPGEYRCLCYDGFMASMDMKTC-IDVNECDLNPNICMF-GECENTK 1338

  Fly   965 NITL------YQMSR----ITDLPKV------FSQPFSFENASG-YECVCVPGIIGKNCE-ININ 1011
            ...:      |.:.:    .||:.:.      .....|..|..| ::|.|..|.:|...: |:::
Mouse  1339 GSFICHCQLGYSVKKGTTGCTDVDECEIGAHNCDMHASCLNVPGSFKCSCREGWVGNGIKCIDLD 1403

  Fly  1012 EC--DSNPCSKHGNCNDGIGTYTCECEPGF--EGTHCEINIDEC-DRYNPCQRGTCYDQIDDYDC 1071
            ||  .::.||.:..|.:..|:|.|.|..||  :|..|. ::||| :..|.|:.|.|.:....|.|
Mouse  1404 ECANGTHQCSINAQCVNTPGSYRCACSEGFTGDGFTCS-DVDECAENTNLCENGQCLNVPGAYRC 1467

  Fly  1072 DCDANY----GGKNCSVLLKGCDQNPCLNGGACLPYLINEVTHLYNCTCENGFQGDKCEKTTTLS 1132
            :|:..:    ..::|..:.:...||.|: .|.|     |.:..:::|.|::|::.|:.....|  
Mouse  1468 ECEMGFTPASDSRSCQDIDECSFQNICV-FGTC-----NNLPGMFHCICDDGYELDRTGGNCT-- 1524

  Fly  1133 MVATSLISVTTEREEGYDINLQFRTTLPNGVLAFGTTGEKNEPVSYILELINGRLNLHSSLLNKW 1197
                             ||:                  |..:|::    .:||            
Mouse  1525 -----------------DID------------------ECADPIN----CVNG------------ 1538

  Fly  1198 EGVFIGSKLNDSNWHKVFVAINTSHLVLSANDEQAIFPVGSYETANNSQPSFPRTYLG-GTIPNL 1261
                              :.:||.               |.||.  |..|.|.....| |.:.|.
Mouse  1539 ------------------LCVNTP---------------GRYEC--NCPPDFQLNPTGVGCVDNR 1568

  Fly  1262 --KSYLRHLTHQPSAFVGCMQDIMVN----------GK-WIFPDEQDANISYTKLENVQSGCPRT 1313
              ..||: ...:....:.|..::.|.          || |..|.|....::.|:...:   ||..
Mouse  1569 VGNCYLK-FGPRGDGSLSCNTEVGVGVSRSSCCCSLGKAWGNPCETCPPVNSTEYYTL---CPGG 1629

  Fly  1314 EQCKPNP----------CH------SNGECTDLWHTFACHCPRPFF----GHTCQHNMTAATFGH 1358
            |..:|||          |.      ..|.|.:.:.:|.|.||:.::    ...|:.  ....|.|
Mouse  1630 EGFRPNPITIILEDIDECQELPGLCQGGNCINTFGSFQCECPQGYYLSEETRICED--IDECFAH 1692

  Fly  1359 ENTTHSAVIVETTDVARRAIRSILDISMFIRTREPTGQVFYLGTDPRKAPTKNIGDSYVAAKLHG 1423
            ...........|                             ||......|.:.:       :::|
Mouse  1693 PGVCGPGTCYNT-----------------------------LGNYTCICPPEYM-------QVNG 1721

  Fly  1424 GELLVKMQFSGTPEAYTVGGQKLDNGYNHLIEVVRNQTLVQVKLNGTEYFRKTLSTTGLLDAQVL 1488
            |...:.|:.|....:|  .|...:|               ::..|.|:  |....|         
Mouse  1722 GHNCMDMRKSFCYRSY--NGTTCEN---------------ELPFNVTK--RMCCCT--------- 1758

  Fly  1489 YLGGPAPTRESLLGATTEPGIIPVPGAGIPIEDTTVPKEADDSRDYFKGIIQDV-----KVSNGS 1548
            |..|.|..:                    |.|....|..||     ||.|..::     .:..|.
Mouse  1759 YNVGKAWNK--------------------PCEPCPTPGTAD-----FKTICGNIPGFTFDIHTGK 1798

  Fly  1549 LNLIVEMYSLNVTDVQVNAKPLGAVTIDRASVLPGEVSDDLC----------------------- 1590
                                   ||.||....:||..::.:|                       
Mouse  1799 -----------------------AVDIDECKEIPGICANGVCINQIGSFRCECPTGFSYNDLLLV 1840

  Fly  1591 ---------RKNPCLHNAECRNTWNDYTCKCPNGYK---------GKNCQEI-EFCQHVTCPGQS 1636
                     ..|.|..||:|.|:...|.|:|..|:|         ...|.|| ..|.|      .
Mouse  1841 CEDIDECSNGDNLCQRNADCINSPGSYRCECAAGFKLSPNGACVDRNECLEIPNVCSH------G 1899

  Fly  1637 LCQNLDDGYECVTNTTFTGQERSPLAFFYFQEQQSDDIVSEASPKQTLKPVIDIAFRTRAG-GT- 1699
            ||.:|...|:|:.|..|                       :||..||:...:|...|...| || 
Mouse  1900 LCVDLQGSYQCICNNGF-----------------------KASQDQTMCMDVDECERHPCGNGTC 1941

  Fly  1700 ---------LLY-------------IDNVDGFF-----------EIG-------------VNGGR 1718
                     |.|             ||....||           |||             .:|..
Mouse  1942 KNTVGSYNCLCYPGFELTHNNDCLDIDECSSFFGQVCRNGRCFNEIGSFKCLCNEGYELTPDGKN 2006

  Fly  1719 VTITWKLSALHFGESARFEKENTDGEWSRI-----YLRAHN-------------------SKLEG 1759
            ...|.:..||. |..:....:|.:|.:..|     .:|:.|                   :...|
Mouse  2007 CIDTNECVALP-GSCSPGTCQNLEGSFRCICPPGYEVRSENCIDINECDEDPNICLFGSCTNTPG 2070

  Fly  1760 GWKGWESMVDPTPAFSTDIDQAAF---QSLIATSTQVYLGGMPESRQARGSTLSAQQGSQFKGCV 1821
            |::    .:.|.....:|..:..|   ||...|:.:          ..:.|...|...::.|.|.
Mouse  2071 GFQ----CICPPGFVLSDNGRRCFDTRQSFCFTNFE----------NGKCSVPKAFNTTKAKCCC 2121

  Fly  1822 ----GEARVGDLLLPYFSMAELYSRTN-VSVQQKAQF------RLNATRPEEGCILCFQSD--CK 1873
                ||. .||       ..||..:.: |:.|....:      .|:.||  |....|.:|.  |.
Mouse  2122 SKMPGEG-WGD-------PCELCPKDDEVAFQDLCPYGHGTVPSLHDTR--EDVNECLESPGICS 2176

  Fly  1874 NDGFCQSPSDEYACTCQPGFEGDDCG---TDIDEC-LNTECLNNGTCINQVAAFFCQCQPGFEG- 1933
            | |.|.:....:.|.|..|:..|..|   .|.||| :...| .||||.|.:.:|.|.|..|||. 
Mouse  2177 N-GQCINTDGSFRCECPMGYNLDYTGVRCVDTDECSIGNPC-GNGTCTNVIGSFECTCNEGFEPG 2239

  Fly  1934 --QHCEQNIDECADQPCHNGGNCTDLIASYVCDCPEDY------------------------MGP 1972
              .:|| :|:|||..|......|.:...||.|.||..|                        .|.
Mouse  2240 PMMNCE-DINECAQNPLLCAFRCMNTFGSYECTCPVGYALREDQKMCKDLDECAEGLHDCESRGM 2303

  Fly  1973 QCDVL-----------------------------KQMTCENEPCRN--GS---TCQNGFN----- 1998
            .|..|                             |...|||..|.|  ||   .|..||.     
Mouse  2304 MCKNLIGTFMCICPPGMARRPDGEGCVDENECRTKPGICENGRCVNIIGSYRCECNEGFQSSSSG 2368

  Fly  1999 ----------------------ASTGNNFT----CTC---------------------------V 2010
                                  ||:..|..    |.|                           .
Mouse  2369 TECLDNRQGLCFAEVLQTMCQMASSSRNLVTKSECCCDGGRGWGHQCELCPLPGTAQYKKICPHG 2433

  Fly  2011 PGFEGPLCDIPFCEITP--CDNGGLCLTTGAVPMCKCSLGYT----GRLCEQDINECESNPCQNG 2069
            ||:.....||..|::.|  |.| |.|:.|.....|.|.:|||    |..| .|::||..:|....
Mouse  2434 PGYATDGRDIDECKVMPSLCTN-GQCVNTMGSFRCFCKVGYTTDISGTAC-VDLDECSQSPKPCN 2496

  Fly  2070 GQCKDLVGRYECDCQGTGF----EGIRCENDIDEC------------NMEGDY------------ 2106
            ..||:..|.|:|.|. .|:    :|..|: |:|||            |..|.:            
Mouse  2497 FICKNTKGSYQCSCP-RGYVLQEDGKTCK-DLDECQTKQHNCQFLCVNTLGGFTCKCPPGFTQHH 2559

  Fly  2107 ---------------CGGLGRCFNKPGSFQCICQKPY----CGAYCNFTDPCNATDLCSNGGRCV 2152
                           ||..|.|.|.||||.|.||:.:    .|..|...|.|:....|.:|  |.
Mouse  2560 TACIDNNECGSQPSLCGAKGICQNTPGSFSCECQRGFSLDASGLNCEDVDECDGNHRCQHG--CQ 2622

  Fly  2153 ESCGAKPDYYCECPEGF 2169
            ...|.   |.|.||:|:
Mouse  2623 NILGG---YRCGCPQGY 2636

Known Domains:


Indicated by green residues in alignment.

Gene  Sequence  Domain  Region  External ID  Identity
crb NP_001247284.1 LamG 91..228 CDD:238058 4/19 (21%)
EGF 269..299 CDD:394967 9/29 (31%)
EGF_CA <311..341 CDD:238011 14/30 (47%)
EGF_CA 386..423 CDD:238011 13/40 (33%)
EGF_CA 425..460 CDD:238011 14/40 (35%)
EGF 466..495 CDD:394967 8/35 (23%)
EGF 605..633 CDD:394967 9/31 (29%)
EGF_CA 716..750 CDD:238011 12/77 (16%)
EGF_CA 753..789 CDD:238011 16/40 (40%)
EGF_CA 792..828 CDD:238011 15/40 (38%)
EGF_CA 830..865 CDD:238011 15/39 (38%)
EGF_CA 868..905 CDD:238011 16/43 (37%)
EGF_CA 907..943 CDD:238011 14/41 (34%)
EGF_CA 1009..1045 CDD:238011 12/39 (31%)
EGF_CA 1047..1082 CDD:238011 10/39 (26%)
Laminin_G_1 1155..1290 CDD:395008 22/148 (15%)
EGF 1316..1346 CDD:394967 10/49 (20%)
Laminin_G_1 1388..1550 CDD:395008 26/166 (16%)
EGF_CA <1593..1622 CDD:238011 11/37 (30%)
Laminin_G_2 1692..1828 CDD:460494 37/214 (17%)
EGF_CA 1901..1937 CDD:238011 16/39 (41%)
EGF_CA 1939..1974 CDD:238011 13/58 (22%)
EGF_CA 2057..2094 CDD:238011 12/40 (30%)
EGF_CA 2096..2133 CDD:238011 19/79 (24%)
EGF_CA 2137..2175 CDD:238011 11/33 (33%)
Fbn2 NP_034311.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 26..58
Fibrillin_U_N 76..112 CDD:436338
Interaction with MFAP4. /evidence=ECO:0000250|UniProtKB:P35556 149..359
TB 224..>257 CDD:459903
EGF_CA 276..>306 CDD:214542
TB 373..417 CDD:459903
EGF_CA 528..558 CDD:214542 14/34 (41%)
EGF_CA 568..609 CDD:214542 12/43 (28%)
EGF_CA 610..650 CDD:214542 14/41 (34%)
EGF_CA 651..691 CDD:214542 14/41 (34%)
TB 706..749 CDD:459903 12/43 (28%)
EGF_CA 761..802 CDD:214542 10/58 (17%)
EGF_CA 803..838 CDD:238011 8/43 (19%)
TB 898..>928 CDD:459903 8/33 (24%)
EGF_CA 948..978 CDD:429571 8/29 (28%)
TB 1003..1045 CDD:459903 2/41 (5%)
EGF_CA 1108..1142 CDD:214542 16/33 (48%)
EGF_CA 1151..1183 CDD:214542 13/31 (42%)
EGF_CA 1193..1225 CDD:214542 14/32 (44%)
FXa_inhibition 1239..1274 CDD:464251 13/38 (34%)
EGF_3 1364..1399 CDD:463759 7/34 (21%)
EGF_3 1405..1440 CDD:463759 11/34 (32%)
EGF_CA 1484..1519 CDD:214542 10/40 (25%)
EGF_CA 1525..1556 CDD:214542 13/99 (13%)
TB 1586..1626 CDD:459903 8/42 (19%)
EGF_CA 1643..1675 CDD:214542 7/31 (23%)
Interaction with MFAP4. /evidence=ECO:0000250|UniProtKB:P35556 1728..2164 102/565 (18%)
TB 1741..1784 CDD:459903 15/93 (16%)
EGF_CA 1801..1833 CDD:214542 5/31 (16%)
EGF_CA 1843..>1875 CDD:214542 9/31 (29%)
vWFA <1880..1922 CDD:469594 14/70 (20%)
EGF_CA 1927..1961 CDD:214542 7/33 (21%)
EGF_CA 1966..2008 CDD:214542 8/41 (20%)
EGF_CA 2009..2041 CDD:214542 7/32 (22%)
EGF_CA 2049..2089 CDD:214542 4/43 (9%)
TB 2104..2148 CDD:459903 12/51 (24%)
EGF_CA 2164..2205 CDD:214542 11/41 (27%)
EGF_CA 2206..2245 CDD:214542 16/39 (41%)
EGF_CA 2287..2319 CDD:429571 3/31 (10%)
TB 2387..2430 CDD:459903 5/42 (12%)
EGF_CA 2442..2475 CDD:214542 13/33 (39%)
FXa_inhibition 2488..2523 CDD:464251 10/35 (29%)
EGF_CA 2525..2554 CDD:429571 6/28 (21%)
EGF_CA 2564..2606 CDD:214542 13/41 (32%)
EGF_CA 2607..>2636 CDD:214542 10/33 (30%)
Cadherin_repeat <2821..>2868 CDD:206637
Blue background indicates that the domain is not in the aligned region; in the table above, these are the rows without an Identity value.
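
The CDD rows of the table follow a regular pattern (domain, region, external ID, optional identity), so they can be read with a short parser. This is a hedged sketch, not part of DIOPT: the names `ROW`, `parse_row` and `percent_half_up` are hypothetical, the `<` and `>` markers are assumed to flag partial domain matches, and rows that carry a gene and accession prefix or an `/evidence=` annotation are deliberately not handled. The identity percentages in this table are consistent with rounding half up (15/40 is shown as 38%), which the helper reproduces.

```python
import re

# Matches rows such as "EGF_CA <311..341 CDD:238011 14/30 (47%)".
ROW = re.compile(
    r"^(?P<domain>\S+)\s+"
    r"(?P<open_start><)?(?P<start>\d+)\.\.(?P<open_end>>)?(?P<end>\d+)\s+"
    r"(?P<xref>CDD:\d+)"
    r"(?:\s+(?P<ident>\d+)/(?P<cols>\d+)\s+\((?P<pct>\d+)%\))?$"
)


def percent_half_up(count: int, total: int) -> int:
    """Whole-number percentage, rounding .5 upward (integer arithmetic)."""
    return (200 * count + total) // (2 * total)


def parse_row(line: str):
    """Return a dict for a CDD domain row, or None if the row does not match."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    row = {
        "domain": m["domain"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        # Assumption: '<' or '>' marks a partial match.
        "partial": bool(m["open_start"] or m["open_end"]),
        "xref": m["xref"],
        "identity": None,  # stays None for rows outside the aligned region
    }
    if m["ident"] is not None:
        row["identity"] = (int(m["ident"]), int(m["cols"]), int(m["pct"]))
    return row


print(parse_row("EGF_CA 386..423 CDD:238011 13/40 (33%)"))
```

Python's built-in `round` is avoided here because it rounds halves to the nearest even number, which would give 32 rather than the 33% shown for 13/40.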
