DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Cubn2 and nas-33

DIOPT Version :10

Sequence 1:NP_729748.3 Gene:Cubn2 / 39334 FlyBaseID:FBgn0259140 Length:3613 Species:Drosophila melanogaster
Sequence 2:NP_509086.2 Gene:nas-33 / 186987 WormBaseID:WBGene00003551 Length:644 Species:Caenorhabditis elegans


Alignment Length:480 Identity:96/480 - (20%)
Similarity:162/480 - (33%) Gaps:153/480 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly  1130 RNGDSES------SPLIGRYCGN--EIPSRIPSFG---NVLHLKFKSDDSMEEKGFLLSWQQMG- 1182
            |.|:|..      |....|..|:  :|.:.|||.|   |.:....|....|.|....|:..||. 
 Worm   119 RPGESYDKVIQIMSSYFNRKSGSQYDINTVIPSSGIYNNEMAANSKIAAVMFESDMALTVSQMNK 183

  Fly  1183 -AGCGGKLSSSM---GTIHS---PHLLAGNRGILACDWQIIVAEGSRVSLQLRSNDNRIC----- 1235
             |..|.::...|   ||..|   |:......|    :||      |:::..||..:...|     
 Worm   184 VAQNGFRVKRKMNLNGTTWSRNIPYRFLDTDG----NWQ------SQITNGLRHYERNTCIRFSL 238

  Fly  1236 ----SGQLTLYDGPTTASNPIVIRCNG----TIAKPLQSTGNRVLVRYDVGH--------DAPDG 1284
                |..|....|....|:  |.|..|    :|....::.|   ::.::|||        ..|:.
 Worm   239 NGGGSDYLVFSKGEGCYSS--VGRLGGPQEISIGDGCETLG---IITHEVGHALGFWHEQARPER 298

  Fly  1285 TDFMLNYQTNCRVRLEGLQGAIETPNFPENYPPGQDCEWDIRAGGRKNHLQLIFSHLSVEKF--S 1347
            ..::   :.|.:..:.||:|                 ::|.|:....|...|.:.:.||..:  .
 Worm   299 DSYV---RINRQNAINGLEG-----------------QFDKRSWSEVNEYSLPYDYGSVMHYGPK 343

  Fly  1348 SICLNDYVSLVDMLDDQTLSEQHLCTNDGLEPITTVGNRLLLRFKSDSSVELQGFRAEY------ 1406
            |...:..::.|:.:|...              |.|:|||:     ..|.::|:.....:      
 Worm   344 SFSKSSTMNTVEPVDPAF--------------INTIGNRV-----EPSFLDLKLLNTAFCSNICT 389

  Fly  1407 KRIGCG------------------------EHLRESGGRFESP-------NAPFSVDMDCVWIIT 1440
            .||.|.                        |.|:.|....|.|       |..:|...||.|.|.
 Worm   390 NRINCQHGGYADPNNCGQCTCPTGLEGTYCERLQTSNCGVELPRADYSWRNISYSGSSDCYWRIV 454

  Fly  1441 ASEGNQIRLLLHEVYFEAPQIECRDAESSLSVSAPSGYNSSVVLFRSCHEETQTQTFTSPGNELV 1505
            ::.|..:|..|..|.:....: |.:     .|...:.|:.....:|.|.:....:.. |.||.::
 Worm   455 SANGGNVRFELTYVMYRCSPV-CEE-----FVEMKAEYSHEATGYRQCCKAVLGERI-SKGNSVL 512

  Fly  1506 I--------RFV-----SSSAPSRK 1517
            |        :||     ..:||:::
 Worm   513 IISKATQNSQFVLRYREDGTAPTQR 537

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Cubn2NP_729748.3 cubilin_NTD 38..141 CDD:412063
EGF_CA 156..190 CDD:238011
EGF_CA 192..233 CDD:238011
EGF_CA 290..328 CDD:214542
EGF_CA 330..374 CDD:214542
EGF 427..455 CDD:394967
EGF_CA 462..496 CDD:238011
CUB 503..619 CDD:238001
CUB 624..738 CDD:238001
CUB 745..854 CDD:238001
CUB 857..963 CDD:238001
CUB 1066..1179 CDD:238001 16/59 (27%)
CUB 1185..1293 CDD:238001 26/134 (19%)
CUB 1303..1406 CDD:238001 17/104 (16%)
CUB 1411..1523 CDD:238001 29/151 (19%)
CUB 1530..1648 CDD:238001
CUB 1754..1862 CDD:238001
CUB 1979..2091 CDD:238001
CUB 2210..2321 CDD:238001
CUB 2327..2441 CDD:238001
CUB 2688..2782 CDD:412131
CUB 2810..2911 CDD:238001
CUB 3029..3143 CDD:238001
CUB 3169..3249 CDD:412131
CUB 3499..3607 CDD:238001
nas-33NP_509086.2 Astacin 200..386 CDD:426242 43/239 (18%)
CUB 442..527 CDD:412131 20/91 (22%)
TSP1 553..595 CDD:214559
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.