DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Cubn2 and thbs3a

DIOPT Version :10

Sequence 1:NP_729748.3 Gene:Cubn2 / 39334 FlyBaseID:FBgn0259140 Length:3613 Species:Drosophila melanogaster
Sequence 2:NP_775332.2 Gene:thbs3a / 252849 ZFINID:ZDB-GENE-020708-3 Length:962 Species:Danio rerio


Alignment Length:805 Identity:169/805 - (20%)
Similarity:251/805 - (31%) Gaps:288/805 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly   132 LRRQLQRVERVKGIL---QTLAGNLARNECLSNPCKNGGTCHDA--YKGFQC-ECPAGWQGDS-- 188
            :|.|::.:..|:..:   |....:..|:.|..|||..|.:|.:.  |.|::| .||.|..|:.  
Zfish   252 IREQVKEMSLVRNAILECQMC
GFHEPRSRCQPNPCFKGVSCMETFEYPGYRCGPCPDGMTGNGTH 316

  Fly   189 CEDDVNECFTLAGTDLDGCLNNGQCINTPGSYRC-VCRNGFTGTHCRLRHNTCLFGGSRELCGEH 252
            |: |::||     ::...|...|.|:||...:.| .|..|..|...                  .
Zfish   317 CQ-DIDEC-----SEAQPCYTPGACVNTARGFTCESCPPGMWGPPL------------------S 357

  Fly   253 GTCIQAANSAGYVCICDQGWTWADANVTSASPSACVRDVDECEPRVNPC--HDECINLPGSFRCG 315
            |..::.|.|....|                      .|:|||....|.|  :..|||:.||||||
Zfish   358 GVGVEYAKSHRQEC----------------------SDIDECVDLANACTPNSVCINIIGSFRCG 400

  Fly   316 ACPTGYTGD---GRFCRDIDECAS------EDNGGCSLQPRVTCTNTEGSHRCGRCPAGWTGDGR 371
            .|.|||.|:   |.|.|  ..|:|      :.|..|.:|       ..|...|. |..||.|:|.
Zfish   401 QCKTGYVGNQTAGCFPR--KSCSSLSFNPCDTNAHCVMQ-------RNGDVSCA-CNVGWAGNGH 455

  Fly   372 TC-------------------------------TASDSNSCNNEGI---CHPLA----------K 392
            ||                               ..|.....:|:||   |...|          .
Zfish   456 TCGKDTDIDGYPDRSLPCMDNHKHCRQDNCVYTPNSGQEDADNDGIGDQCDEDADGDGIKNVEDN 520

  Fly   393 CEYVSDMVVCTCPLGSFGHGYGADGCSADSSRLPCDQHPCQNNGTCVQNGRGTTC---ICQPGYS 454
            |..||:.........|||     |.|.      .|...|..:......||.|..|   |...|..
Zfish   521 CRLVSNKDQQNSDTDSFG-----DACD------NCPTVPNIDQKDTDSNGEGDACDDDIDGDGIQ 574

  Fly   455 GVVCNSSDACHPSPCLN-------GGTCRLLPDAK--YQCVCPRGYTGTTCSHQRFFCGVTIRGP 510
            .|:.|....  |:|...       |..|...|:..  .|........|..|...:          
Zfish   575 NVLDNCPKV--PNPMQTDRDRDGVGDACDSCPEISNPMQTDVDNDLVGDVCDTNQ---------- 627

  Fly   511 SGQLHYPPNTADGDYQADER--CPFIIRTNRNMVLNLTFTQFQLEDSADCTADFLQLHDGNSLSS 573
                     ..|||...|.|  ||.|..::            ||:...|...|.....|.|   .
Zfish   628 ---------DTDGDGHQDTRDNCPDIPNSS------------QLDSDNDGIGDDCDEDDDN---D 668

  Fly   574 RLIGRFCGSRLPMTNGSVITTQEQVFFWFRSDNQTQGKGFHVIWN--------SLPFSCGETINL 630
            .:......:.:...|..:|:...|      .|:.:.|.| .|..|        .|...|.|:..:
Zfish   669 GIPDNHAINGIGPDNCRLISNPNQ------KDSDSNGVG-DVCENDFDNDSVMDLVDVCPESAEV 726

  Fly   631 TST-----QTGVLRSPGYPGQARPELDCRWQLTAPFGYRLLLRFYDISLGSSEASAGNCSQDSLI 690
            |.|     ||.:|...|     ..::|..|                                  :
Zfish   727 TLTDFRAYQTVILDPEG-----DAQIDPNW----------------------------------V 752

  Fly   691 VYDSDRQLLRACQSIQPPPV-YSSSNSLRLD--FHTDAIRSD------------SSF-------- 732
            |.:...::::...|.....| |::.|.:..:  ||.:.:..|            |||        
Zfish   753 VLNQGMEIVQTMNSDPGLAVGYTAFNGVDFEGTFHVNTVTDDDYAGFIFGYQDSSSFYVVMWKQT 817

  Fly   733 -QMHYEVVP----GHPGCGGVYTESR---GRI-------SGYMNFEVCLYLIEQPRGTQVKLVID 782
             |.:::.:|    ..||......:||   |..       :|..:.||.| |.:.||...   .:|
Zfish   818 EQTYWQSIPFRAMAEPGLQLKAVKSRTGPGEFLRNALWHAGDTDGEVKL-LWKDPRNVG---WLD 878

  Fly   783 RVSLVQSLSCH-----YLKIEIFDG 802
            :.|....|| |     |:::::::|
Zfish   879 KTSYRWQLS-HRPQVGYIRVKLYEG 902

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Cubn2NP_729748.3 cubilin_NTD 38..141 CDD:412063 2/8 (25%)
EGF_CA 156..190 CDD:238011 13/38 (34%)
EGF_CA 192..233 CDD:238011 12/41 (29%)
EGF_CA 290..328 CDD:214542 21/42 (50%)
EGF_CA 330..374 CDD:214542 14/80 (18%)
EGF 427..455 CDD:394967 8/30 (27%)
EGF_CA 462..496 CDD:238011 7/42 (17%)
CUB 503..619 CDD:238001 22/125 (18%)
CUB 624..738 CDD:238001 22/142 (15%)
CUB 745..854 CDD:238001 17/73 (23%)
CUB 857..963 CDD:238001
CUB 1066..1179 CDD:238001
CUB 1185..1293 CDD:238001
CUB 1303..1406 CDD:238001
CUB 1411..1523 CDD:238001
CUB 1530..1648 CDD:238001
CUB 1754..1862 CDD:238001
CUB 1979..2091 CDD:238001
CUB 2210..2321 CDD:238001
CUB 2327..2441 CDD:238001
CUB 2688..2782 CDD:412131
CUB 2810..2911 CDD:238001
CUB 3029..3143 CDD:238001
CUB 3169..3249 CDD:412131
CUB 3499..3607 CDD:238001
thbs3aNP_775332.2 LamG <58..196 CDD:473984
coiled coil 230..272 CDD:293925 4/19 (21%)
TSP-3cc 230..272 CDD:293925 4/19 (21%)
EGF_CA 319..354 CDD:238011 11/39 (28%)
EGF_CA 373..405 CDD:429571 16/31 (52%)
EGF_3 426..457 CDD:463759 10/38 (26%)
TSP type-3 1 459..493 1/33 (3%)
MSCRAMM_ClfA <460..698 CDD:468110 50/290 (17%)
TSP type-3 2 494..529 8/34 (24%)
TSP_3 494..529 CDD:367074 8/34 (24%)
TSP3 repeat_long 494..529 CDD:275366 8/34 (24%)
TSP type-3 3 530..552 7/32 (22%)
TSP3 repeat_short 530..552 CDD:275366 7/32 (22%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 548..704 37/198 (19%)
TSP_3 553..588 CDD:367074 10/36 (28%)
TSP3 repeat_long 553..588 CDD:275366 10/36 (28%)
TSP3 repeat_short 589..611 CDD:275365 3/21 (14%)
TSP_3 612..649 CDD:367074 10/67 (15%)
TSP3 repeat_long 612..649 CDD:275365 10/67 (15%)
TSP3 repeat_short 650..692 CDD:275366 7/44 (16%)
TSP type-3 8 693..728 8/35 (23%)
TSP_3 693..727 CDD:367074 8/34 (24%)
TSP_C 746..943 CDD:461725 34/196 (17%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.