DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG34370 and Cubn

DIOPT Version :10

Sequence 1:NP_995921.2 Gene:CG34370 / 5740565 FlyBaseID:FBgn0085399 Length:952 Species:Drosophila melanogaster
Sequence 2:NP_001074553.1 Gene:Cubn / 65969 MGIID:1931256 Length:3623 Species:Mus musculus


Alignment Length:1120 Identity:223/1120 - (19%)
Similarity:343/1120 - (30%) Gaps:439/1120 - (39%)


- Green bases have known domain annotations that are detailed below.


  Fly    81 TVDIFE--DISSPEVSNQNVGRPLTCWYRFRTLKGAPRDFVLRLRFKKFKV--GQLLNAT----- 136
            |::|:.  |..||.::......|..     ..::.:..|..|.:|||....  |:..||:     
Mouse  1448 TLEIYTGLDFHSPRIAQLCSRSPSA-----NPMQISSTDNELAIRFKTDSSLNGRGFNASWRAVP 1507

  Fly   137 -HCEGGYLQIVDGNAKTDVSNRREPGMFCGEAEQPQTFISETSYVKVLFHTDNF----TDQTYFT 196
             .| ||..|:..|    ::.:...|..:....| ....|....|.:||.:..:|    ||....|
Mouse  1508 GGC-GGIFQVSRG----EIHSPNYPNNYRANTE-CSWIIQVEKYHRVLLNITDFDLEATDSCLMT 1566

  Fly   197 FDSRAEQQTEVYLRYG-QHPELYPN-------------RRGEVVQGSYCEREYRDCRLQTC---- 243
            :|..:...|.|....| |.|   ||             :.|...|......::|    |.|    
Mouse  1567 YDGSSSANTRVATVCGRQQP---PNSITSSGNSLFVRFQSGSSSQSRGFRAQFR----QECGAHI 1624

  Fly   244 ------YVQSPAYPGLYPRALNCRYKLHTRQPYIKLYLQNEQFAVDGQRCENVMTCPIRPIGSGN 302
                  .:.||.||..||...||.:.:..:.|:..:.|....|.:                 ..:
Mouse  1625 ITDSSDSISSPLYPANYPNNQNCTWIIEAQPPFNHIALSFTHFHL-----------------QSS 1672

  Fly   303 EHCPYDWLAVYDGRDEHSPLIGKFCGLGKFPFSIIGTSQYMYVEFV-----------------TS 350
            ..|..|::.:.||||..:|:.|::||. ..|..||.....:.|.||                 ||
Mouse  1673 TDCTRDFVEILDGRDSDAPVQGRYCGT-SLPHPIISFGNALTVRFVSDSVYGFDGFHAIYSASTS 1736

  Fly   351 PAGPLLNTGFHFNVGNWPGHVETAGIKHGVCDWLLSSD------------SLKDSSASEGIFLSI 403
            ..|....||  ..:.|.||:.|... .:..|.|.::|.            .|::|......|:.|
Mouse  1737 ACGGTFYTG--DGIFNSPGYPEDYH-SNTECVWNIASSPGNHLQLSFLSFQLENSLNCNKDFVEI 1798

  Fly   404 --------------AHWYPPNTSCSYHIKGH---------------------------------V 421
                          .:..|.|.|   .|:||                                 .
Mouse  1799 REGNATGHLMGRYCGNSLPGNYS---SIEGHNLWVRFVSDGSGTGMGFQARFKNIFGNDNIVGTH 1860

  Fly   422 GEIVRLYFP-SFRIN-----------------RIESPILKYEGDC-GESLTIYDSDHADPARIIK 467
            |:|...::| ::.:|                 ||....::...:| .:||.|||..... :|:|.
Mouse  1861 GKIATPFWPGNYPLNSNYRWTVNVDSSHIIHGRILEMDIELTTNCFYDSLKIYDGFDIH-SRLIG 1924

  Fly   468 TFCDTFSRPMEKVDFVSTSPSLYVQFDSKTGSYSGSSLYYWAHYDFFNNTRFGDPVPNT---LCD 529
            |:|.|     ::..|.|:..||..||.|.:.......|..|...|..|.|     :|..   .|.
Mouse  1925 TYCGT-----QRESFSSSRNSLTFQFSSDSSKSGRGFLLEWFAVDVSNVT-----LPTIAPGACG 1979

  Fly   530 EVMY-----------AWKHPGGRLRSPLNSLIFKRTGGSDVRCQYKFVTDRRLYA-RAIIEVNSV 582
            ..|.           .|..|.|              .|:|  |.:      .:|| .:.:|:|.:
Mouse  1980 GYMVTGDTPVFFFSPGWPGPYG--------------NGAD--CIW------IIYAPDSTVELNIL 2022

  Fly   583 SFKELPYNSNACTRCHEERVDKLVIWEERDKYQNNLACFCD-NIPRAVRVISSADQMNLEMIVQG 646
            |.     :..|...|   ..|||:|.:...:....||..|. ::|..:|  |:.:.|.:.....|
Mouse  2023 SM-----DIEAQLSC---SYDKLIIKDGDSRLSQQLAVLCGRSVPGPIR--STGEYMYIRFTSDG 2077

  Fly   647 Q------------------HA----ITS------YFKNPN--------------PLFEATYEF-- 667
            .                  ||    |||      |..|.|              ..||..::.  
Mouse  2078 SVTGAGFNASFQKSCGGYLHADRGIITSPKYPDNYLPNLNCSWHVLVQSGLTIAVHFEQPFQIQN 2142

  Fly   668 ----------------------------AHGPLCG---PITLGPSPDGELVFPY----------- 690
                                        .:|..||   |.||..| |.|:...:           
Mouse  2143 RDSSCSQGDYLVLRNGPDNHSPPLGPSGGNGRFCGIYTPSTLFTS-DNEMFIQFISDNSNGGQGF 2206

  Fly   691 -----KKALA--------------MVSGPMAP---PEHHYRREKCIWELKVAAQRDLWL------ 727
                 .|:||              .|:.|..|   |:|    .:|||.|:..:.|.:.|      
Mouse  2207 KIRYEAKSLACGGTIYIHDANSDGYVTSPNYPANYPQH----AECIWILEAPSGRSIQLQFEDQF 2267

  Fly   728 NLEKARFADRSCDSAKIEVYLAGRLEPRFVVCPENISLARDLPILTTAELGATGADQEPLPVLIQ 792
            |:|:.    .:|.::.:|:                    ||            ||:.. .|||.:
Mouse  2268 NIEET----PNCSASYLEL--------------------RD------------GANSN-APVLSK 2295

  Fly   793 YTGDGQPGRNIFRLVWTE---LFHLPRNPDG-------SLAASLLQDGGCDFRCPGDT----EVC 843
            ..|...| ||     |..   |.:|..:.:|       ....|::..||   ...||:    .|.
Mouse  2296 LCGHTLP-RN-----WVSSRGLMYLKFHTEGGSGYMGFKAKYSIVSCGG---TVSGDSGVIESVG 2351

  Fly   844 IPKHLLCNGIANCPNVTHTSTLSQLTRHIEWNLEQLGLLHHAGDYQQLVLHD---ESPEICLKRD 905
            .|..|..|.:                 ..:|:::.|     .|.|..:...|   :|...|.| |
Mouse  2352 YPTRLYANNV-----------------FCQWHIQGL-----PGHYLTIRFEDFNLQSSPGCAK-D 2393

  Fly   906 LSELPYGKLSLIALG 920
            ..|:.....|.|.||
Mouse  2394 FVEIWENHTSGILLG 2408

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG34370NP_995921.2 CUB 88..195 CDD:238001 25/118 (21%)
CUB 244..363 CDD:238001 32/135 (24%)
CUB 407..509 CDD:238001 31/153 (20%)
CubnNP_001074553.1 cubilin_NTD 38..132 CDD:412063
Interaction with AMN. /evidence=ECO:0000250|UniProtKB:O60494 39..46
EGF_CA 133..164 CDD:238011
EGF_CA 167..207 CDD:238011
EGF_CA 260..301 CDD:214542
EGF_3 306..344 CDD:463759
EGF_3 350..387 CDD:463759
EGF_CA 400..430 CDD:238011
EGF_CA 432..468 CDD:238011
CUB 474..585 CDD:238001
CUB 590..699 CDD:238001
CUB 708..815 CDD:238001
CUB 817..927 CDD:238001
CUB 932..1041 CDD:238001
CUB 1052..1158 CDD:238001
CUB 1165..1275 CDD:238001
CUB 1278..1388 CDD:238001
CUB 1391..1505 CDD:238001 14/61 (23%)
CUB 1510..1599 CDD:238001 23/97 (24%)
CUB 1620..1733 CDD:238001 28/130 (22%)
CUB 1738..1847 CDD:238001 20/114 (18%)
CUB 1859..1962 CDD:238001 26/108 (24%)
CUB 1978..2089 CDD:238001 27/142 (19%)
CUB 2092..2212 CDD:238001 19/120 (16%)
CUB 2217..2333 CDD:238001 30/162 (19%)
CUB 2336..2447 CDD:238001 22/99 (22%)
CUB 2452..2564 CDD:238001
CUB 2570..2686 CDD:238001
CUB 2689..2800 CDD:238001
CUB 2805..2918 CDD:238001
CUB 2920..3034 CDD:238001
CUB 3037..3148 CDD:238001
CUB 3157..3273 CDD:238001
CUB 3278..3392 CDD:238001
CUB 3395..3506 CDD:238001
CUB 3511..3623 CDD:238001
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.