DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG42613 and Cubn

DIOPT Version :10

Sequence 1:NP_001247185.1 Gene:CG42613 / 42269 FlyBaseID:FBgn0261262 Length:1188 Species:Drosophila melanogaster
Sequence 2:NP_001074553.1 Gene:Cubn / 65969 MGIID:1931256 Length:3623 Species:Mus musculus


Alignment Length:972 Identity:233/972 - (23%)
Similarity:364/972 - (37%) Gaps:251/972 - (25%)


- Green bases have known domain annotations that are detailed below.


  Fly   215 QKFARSIFPLVI--IFNLLPLFYAGGAHLKSNGADGEPWMQPVVAYLEVSSRPTKSPKCDQTFVS 277
            :||..:..|..|  ::|:|.:.:...:.:::.|            ::.:.|  ::..:|.:....
Mouse   888 EKFCGTNIPSFITSVYNVLYVTFVKSSSMENRG------------FMAMFS--SEKLECGKVLTE 938

  Fly   278 RIGGPQNGSFSAPLLHNHRNHSRQCLYTFLAGPGQRVEVVFKSFNLRGSPPDGSAVGELPSCVHE 342
                 ..|...:|...|.......|.:..:...||.:.:||.||.|...          .:|.::
Mouse   939 -----STGIIESPGHPNVYPSGVNCTWHIVVQRGQLIRLVFSSFYLEFH----------YNCAND 988

  Fly   343 YMDIYSEVQSSEPAELINSPFGGRYCG-TIPPRRRISMYRAVAISFFSNKNVTTDLFEGTFRFIN 406
            |:::|..:..:..         ||||| :|||....|.: ::.:.|.|:..:..:.|...:..||
Mouse   989 YLEVYDTIAQTSL---------GRYCGKSIPPSLTSSSH-SIKLIFVSDSALAHEGFSINYEAIN 1043

  Fly   407 ASEYEIGIPIAGSPCSYTITPSMSVNKTGALISPTYPGAYPKDMSCTYQFLGESNQRVRLEFRDF 471
            ||          |.|.|..|.:.     |.|.||.:|..||.:.:|.|:.....||::.|.|.||
Mouse  1044 AS----------SVCLYDYTDNF-----GRLSSPNFPNNYPHNWNCVYRITVGLNQQIALHFTDF 1093

  Fly   472 DL--FFGGPHCPFDYVKVYDGPDNSSALIGTYCGQQRNLVLYSSESSLFVHFYTLSRTANTQNRG 534
            .|  :| ||.| .|:|::.||...:|.|||.|||......:.|..:.|::.|  .|.||.|. ||
Mouse  1094 ALEDYF-GPKC-VDFVEIRDGGFETSPLIGIYCGSVFPPRIISHSNKLWLRF--KSDTALTA-RG 1153

  Fly   535 FKGIYEFSESFVKLDFIRENDGIHIRGSECDQKILSKKESTGFVLSPNYPYPYIPKTVCRYFIYG 599
            |...::.|                  .:.|...:.:   .||.:.|||||.||...:.|    |.
Mouse  1154 FSAYWDAS------------------STGCGGNLTT---PTGVLTSPNYPMPYYHSSEC----YW 1193

  Fly   600 MQDAQHLERVRLEFNAFNIPKVEHKDKSESNCTDGYLKIYLKGQETADAYDKFDYELCGNETQRV 664
            ..:|.......|||..|::   ||    ..||:..||.:: .|..|   ..:...:|||:.....
Mouse  1194 RLEASRGSPFLLEFQDFHL---EH----HPNCSLDYLAVF-DGPST---NSRLINKLCGDTPPAP 1247

  Fly   665 ISEGPRLAMV---FSSGELQGRGFKGKYTFETEYKIPGTAAPDGTCSFTYVSSSKKRGELNSPRY 726
            |.....:.::   ..:|: ||||      ||..|:        .||. ..|..:|..|.|.|..|
Mouse  1248 IRSSKDIVLLKLRTDAGQ-QGRG------FEINYR--------QTCD-NVVIVNKTSGILESINY 1296

  Fly   727 PSNYPSDTNCSYLFLAEADEQVTIVFDHFKIKADNNANATAGAYGSGACFEDWLEMYVVYRDNND 791
            |:.|..|..|::...|.....|...|..|.:  :|..|          |..|:||:|    |...
Mouse  1297 PNPYDKDQRCNWTIQATTGNTVNYTFLEFDV--ENYVN----------CSTDYLELY----DGPQ 1345

  Fly   792 RLLGRYCGQTAPGPVESPRGAV---GLRITLHTD-QESVASGFKARYFFESAKSDAGDCGGNFSN 852
            | :|||||:..|     |.||.   .|.:..||| .:|...|||..:|...       |||..|.
Mouse  1346 R-IGRYCGENIP-----PPGATTGSKLIVVFHTDGVDSGEKGFKMHWFIHG-------CGGEMSG 1397

  Fly   853 QDSGLITSPNWPAGYKAPGRGMASNACNWVMKARPGYKLSIHFEQFGLEGDPANRGCPAAVLRLW 917
             ..|..:||.:|..|.      .:..|.|.::..||..:.:....|.:|   .:..|....|.::
Mouse  1398 -TMGSFSSPGYPNSYP------HNKECIWNIRVAPGNSIQLTIHDFDVE---YHASCKYDTLEIY 1452

  Fly   918 MNVDSDQPPL-ELCGEKPPVEQWHYTSTGQTARISFTTSDKTVGAQGFRIVWTEIQDSGPGPPSV 981
            ..:|...|.: :||...|.......:||.....|.|.| |.::..:||...|..:    ||    
Mouse  1453 TGLDFHSPRIAQLCSRSPSANPMQISSTDNELAIRFKT-DSSLNGRGFNASWRAV----PG---- 1508

  Fly   982 GLHCESTYHFQCGAGYCISDKLRCDGVKNCGPGDDTDELH---------CTTEAA-----EEDYD 1032
                                  .|.|:.....|    |:|         ..||.:     |:.:.
Mouse  1509 ----------------------GCGGIFQVSRG----EIHSPNYPNNYRANTECSWIIQVEKYHR 1547

  Fly  1033 VLIIIT---LLVVSSLICLL------------CILCHSRRSRRNHVRQLGLHRNAHYDSQTSATA 1082
            ||:.||   |....|  ||:            ..:| .|:...|.:...|......:.|.:|:.:
Mouse  1548 VLLNITDFDLEATDS--CLMTYDGSSSANTRVATVC-GRQQPPNSITSSGNSLFVRFQSGSSSQS 1609

  Fly  1083 GLSSSRRHLQSGGASIAGSLHGINSGNLIIPPAPLPPTSVP-----------PPP--HICIA 1131
            ....::...:.|...|..|...|:|        ||.|.:.|           .||  ||.::
Mouse  1610 RGFRAQFRQECGAHIITDSSDSISS--------PLYPANYPNNQNCTWIIEAQPPFNHIALS 1663

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG42613NP_001247185.1 CUB 277..404 CDD:238001 26/127 (20%)
CUB 421..541 CDD:238001 44/121 (36%)
CUB 564..691 CDD:238001 34/129 (26%)
CUB 707..836 CDD:238001 41/132 (31%)
CUB 846..969 CDD:238001 30/123 (24%)
LDLa 987..1022 CDD:238060 5/43 (12%)
CubnNP_001074553.1 cubilin_NTD 38..132 CDD:412063
Interaction with AMN. /evidence=ECO:0000250|UniProtKB:O60494 39..46
EGF_CA 133..164 CDD:238011
EGF_CA 167..207 CDD:238011
EGF_CA 260..301 CDD:214542
EGF_3 306..344 CDD:463759
EGF_3 350..387 CDD:463759
EGF_CA 400..430 CDD:238011
EGF_CA 432..468 CDD:238011
CUB 474..585 CDD:238001
CUB 590..699 CDD:238001
CUB 708..815 CDD:238001
CUB 817..927 CDD:238001 8/52 (15%)
CUB 932..1041 CDD:238001 27/133 (20%)
CUB 1052..1158 CDD:238001 42/115 (37%)
CUB 1165..1275 CDD:238001 36/134 (27%)
CUB 1278..1388 CDD:238001 41/132 (31%)
CUB 1391..1505 CDD:238001 31/124 (25%)
CUB 1510..1599 CDD:238001 20/95 (21%)
CUB 1620..1733 CDD:238001 13/52 (25%)
CUB 1738..1847 CDD:238001
CUB 1859..1962 CDD:238001
CUB 1978..2089 CDD:238001
CUB 2092..2212 CDD:238001
CUB 2217..2333 CDD:238001
CUB 2336..2447 CDD:238001
CUB 2452..2564 CDD:238001
CUB 2570..2686 CDD:238001
CUB 2689..2800 CDD:238001
CUB 2805..2918 CDD:238001
CUB 2920..3034 CDD:238001
CUB 3037..3148 CDD:238001
CUB 3157..3273 CDD:238001
CUB 3278..3392 CDD:238001
CUB 3395..3506 CDD:238001
CUB 3511..3623 CDD:238001
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.