DRSC/TRiP Functional Genomics Resources

Protein Alignment Cubn2 and mep1ba

DIOPT Version: 10

Sequence 1: NP_729748.3  Gene: Cubn2 / 39334  FlyBaseID: FBgn0259140  Length: 3613  Species: Drosophila melanogaster
Sequence 2: NP_001070089.2  Gene: mep1ba / 100151009  ZFINID: ZDB-GENE-041014-209  Length: 677  Species: Danio rerio


Alignment Length: 622
Identity: 117/622 (18%)
Similarity: 187/622 (30%)
Gaps: 222/622 (35%)
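
Each summary percentage is the corresponding count divided by the alignment length. The sketch below reproduces the three values shown above, assuming integer truncation. Truncation is consistent with these three figures, but it is an assumption about how the page formats them, and the helper name `percent` is hypothetical.

```python
def percent(count: int, length: int) -> int:
    """Integer percentage of count over length, truncated toward zero."""
    return count * 100 // length

# Counts copied from the summary above.
alignment_length = 622
summary = {"Identity": 117, "Similarity": 187, "Gaps": 222}

for name, count in summary.items():
    print(f"{name}: {count}/{alignment_length} ({percent(count, alignment_length)}%)")
# Identity: 117/622 (18%)
# Similarity: 187/622 (30%)
# Gaps: 222/622 (35%)
```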


- Residues shown in green on the source page have known domain annotations, which are detailed below.


  Fly  1852 NTAGGFRFRW-TYVHNNEINSGINGTIEPPPPLFVSNEDQPFTWRLFTDFKKVFVLQFEEYISGL 1915
            ||..|.::|| |.|                 |.|:.|.       |..:.|.|.:..||:|    
Zfish    65 NTILGEQYRWPTTV-----------------PYFLDNS-------LEINAKGVILKAFEQY---- 101

  Fly  1916 ILFDGYDDNALAVNIPVSPWRFTSS---------------------------SNVVYLKTVNDAL 1953
                     .|...|...||...|:                           ||...|.||....
Zfish   102 ---------RLKTCIDFKPWNGESNYIFVFKGSGCYSKVGNRQMGKQELSIGSNCDSLGTVEHEF 157

  Fly  1954 THFRLKW-----GVLDSNLVASNLSLTTGGCTKELTLSHHGDIELSSPGYPHGYAP--------- 2004
            .|....|     ...|..::.....:..|   ||...:.:.:.:.||.|.|:.|:.         
Zfish   158 LHALGLWHEQSRSDRDDYVIIVWDQIQDG---KEHNFNLYDETQSSSLGVPYDYSSVMHYSKTSF 219

  Fly  2005 NLNCEWTIRSQFPSHHIYAHSIIVDLEDYPACSADYLSIQSSR------DLIKWKNELHACKASQ 2063
            |...|.||.::.|                     ::|::...|      ||:| .|.|:.|..| 
Zfish   220 NKGSEPTIVTKIP---------------------EFLNVIGQRMEFSDNDLLK-LNRLYNCTTS- 261

  Fly  2064 IAPVHGTPYLRLQFRSDVSINGTGFRAKLRTSCGSNMTGIVGTIPQENLFDECAWHIDVRPGRKI 2128
                                      :....||......|.|.|..:....:.|....|..|.|.
Zfish   262 --------------------------STFLDSCHFEEPNICGMIQGDGGNAKWARVQTVEGGPKT 300

  Fly  2129 DIAINYNNMPPIAVCEAYGLIYDGVDEHASLLEHTRFGNQMGIRRTQFRTSGSHAYIKYHIGRSR 2193
            |    |.|   :..|:..|...     |.|    |..|.|           |..|:::..:....
Zfish   301 D----YTN---LGQCQGVGFFM-----HFS----TATGAQ-----------GDKAHLESRLFYPN 338

  Fly  2194 INGLCL-------------WNLTYREFNECN--GEIQLNQQAPNYTIMSPGYPYLPHPHAECTWL 2243
            ....||             .|:..||:...|  |:::|.||      :|.|        .:.:|.
Zfish   339 RRSQCLQFYHYNSGGTDDQLNIWVREYTAENPKGDLRLIQQ------ISGG--------LKDSWE 389

  Fly  2244 VMAPPGETIAVDFDEQFE-----LSARHCDKENVEFFDGATKLARLLLRTCRKPQNT--VRTTGN 2301
            :.     .:.:|...:|.     :..|...|       |...|..:.|...:.||:|  :|....
Zfish   390 LY-----HVTLDVSSKFRVVFEGVKGRDTSK-------GGLSLDDINLSETQCPQHTWRIRDFTK 442

  Fly  2302 LLLVH------YQSQLNEPTG-GFRLNLSLSTCGGQFSASAGFISSENYPHLGG--YPKPSVCEY 2357
            ||...      |..:|..|.| .|::.|.::.........|.::...:.||...  :|.|.....
Zfish   443 LLATTAPGSKIYSPRLLSPDGYSFQIGLYINGLKDSPDKMAIYLHLTSGPHDDNLQWPCPWRQAS 507

  Fly  2358 SILLPKNAFIRLNITDLH-LPYDANGTSSDRLEIVDY 2393
            ..::.:|..|:..:.::. :..|.:.||:|....|:|
Zfish   508 MEMMDQNPDIQRRMNNIRMITTDPDKTSTDSSGNVEY 544
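
The per-block counts can be tallied from the middle (match) row of each block. The sketch below does this for the last block above. It assumes the common EMBOSS-style convention in which `|` marks identical residues, `:` marks similar residues, and similarity includes identities. The page itself does not define these symbols, so `.` is left uncounted here. The function name `score_block` is hypothetical.

```python
def score_block(seq1: str, match: str, seq2: str) -> dict:
    """Tally identity, similarity, and gap columns for one alignment block."""
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("all three rows must have the same number of columns")
    identity = match.count("|")
    # Assumption: similarity counts identical columns plus ':' columns.
    similarity = identity + match.count(":")
    # A gap column has '-' in either sequence row.
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {"length": len(match), "identity": identity,
            "similarity": similarity, "gaps": gaps}

# Last block of the alignment above (Fly 2358..2393, Zfish 508..544).
fly   = "SILLPKNAFIRLNITDLH-LPYDANGTSSDRLEIVDY"
mid   = "..::.:|..|:..:.::. :..|.:.||:|....|:|"
zfish = "MEMMDQNPDIQRRMNNIRMITTDPDKTSTDSSGNVEY"

print(score_block(fly, mid, zfish))
```

Summing these tallies over all blocks should give totals comparable to the summary figures, provided the symbol convention assumed here matches the one DIOPT uses.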

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence        Domain            Region      External ID  Identity
Cubn2   NP_729748.3     cubilin_NTD       38..141     CDD:412063
                        EGF_CA            156..190    CDD:238011
                        EGF_CA            192..233    CDD:238011
                        EGF_CA            290..328    CDD:214542
                        EGF_CA            330..374    CDD:214542
                        EGF               427..455    CDD:394967
                        EGF_CA            462..496    CDD:238011
                        CUB               503..619    CDD:238001
                        CUB               624..738    CDD:238001
                        CUB               745..854    CDD:238001
                        CUB               857..963    CDD:238001
                        CUB               1066..1179  CDD:238001
                        CUB               1185..1293  CDD:238001
                        CUB               1303..1406  CDD:238001
                        CUB               1411..1523  CDD:238001
                        CUB               1530..1648  CDD:238001
                        CUB               1754..1862  CDD:238001   5/10 (50%)
                        CUB               1979..2091  CDD:238001   21/126 (17%)
                        CUB               2210..2321  CDD:238001   26/126 (21%)
                        CUB               2327..2441  CDD:238001   13/70 (19%)
                        CUB               2688..2782  CDD:412131
                        CUB               2810..2911  CDD:238001
                        CUB               3029..3143  CDD:238001
                        CUB               3169..3249  CDD:412131
                        CUB               3499..3607  CDD:238001
mep1ba  NP_001070089.2  ZnMc_meprin       29..258     CDD:239809   49/254 (19%)
                        MAM               268..431    CDD:459878   40/215 (19%)
                        MATH_Meprin_Beta  430..599    CDD:239751   25/115 (22%)

Blue background indicates that the domain is not in the aligned region.
