DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG31999 and thbs1b

DIOPT Version :10

Sequence 1:NP_726551.2 Gene:CG31999 / 43777 FlyBaseID:FBgn0051999 Length:917 Species:Drosophila melanogaster
Sequence 2:XP_005160819.1 Gene:thbs1b / 561901 ZFINID:ZDB-GENE-020708-1 Length:1171 Species:Danio rerio


Alignment Length:929 Identity:180/929 - (19%)
Similarity:271/929 - (29%) Gaps:379/929 - (40%)


- Green bases have known domain annotations that are detailed below.


  Fly    16 GH---VTTTMASDSISGYIRKCCINGLRHARTTASCKKIDIAP------TIIPQ----------- 60
            ||   :|..:..|.:..|:               .|:::::|.      ||:.|           
Zfish   150 GHWKNITLFVQEDRVQFYV---------------GCEEVNVAELDASIHTILTQEIPGVAKMRIG 199

  Fly    61 ------LWLGLCH-------STLEVCCSRELDHQDCELGRLAALD--------GTRCDGEGNVTS 104
                  .::|:..       :|||... |....|:..:..:..||        ..|.|..|:.|.
Zfish   200 KGAVKDRFMGVLQNVRFVFGTTLEAIL-RNKGCQNSAMTDIITLDNPINGSSPAIRTDYTGHKTK 263

  Fly   105 SSYATCCRSCQIGLAVKASKANCKDPLFSFIFLIESYRACCYGSADFKDQPGIDEI-----DKAN 164
            .....|..||: .||                             |.||:..|:..:     ::..
Zfish   264 DLQMICGFSCE-DLA-----------------------------AMFKELKGLGVVVQELSNELR 298

  Fly   165 SITDEGELPFVSEEDMNVTIVLTGDDDICGKIENLCAHICENTF---DAYQCKCHPGFMLDNNNV 226
            .:||:..:.      ||...:..|   :|  :.|...|..:..:   |..:|.|      .|:..
Zfish   299 KVTDDKNML------MNQMGIRAG---VC--LHNGIVHKNKEEWTVDDCTECTC------QNSAT 346

  Fly   227 TCSPMKTQICPSGYNLDKLDNKCIDIDECREDLHDCKSSQYCHNTNGGYHCLNVKEKECPPGFHY 291
            .|..:...:.|                              |.|.       .|.:.||.|  ..
Zfish   347 VCRKISCPLIP------------------------------CANA-------TVPDGECCP--RC 372

  Fly   292 DHDYDACKDDYK-------CKDRKCVKIQSCDKGFSLHNGTCSDIDECSHKSLNNCHVNSNQECV 349
            ....|:.:|.:.       |.       .||.:|......:|..|:       |||...|    |
Zfish   373 GTPSDSAEDGWSPWSEWTHCS-------VSCGRGIQQRGRSCDRIN-------NNCEGTS----V 419

  Fly   350 NTVGSYSCNCLPGFNLDATLNKCVDINECSINNHNCLPTQRCDNTIGSYICTRLQSCGTGYTLNA 414
            .|...|...|...|..|.:.:.....:.||:             |.|:.:.||::.|      |:
Zfish   420 QTRDCYLQECDKRFKQDGSWSHWSPWSSCSV-------------TCGAGVITRIRLC------NS 465

  Fly   415 ETGNCDDDD---------ECTLSTHNCPSN--------YD-CHNTRGSFRCYRKISTMLTTRTTS 461
            .|...|..|         .|..|.  ||.|        :| |..|.|.                 
Zfish   466 PTPQMDGKDCQGEGRQTERCEKSP--CPINGGWGPWSPWDTCSVTCGG----------------- 511

  Fly   462 TTVPPLSLENARRSFTSRYPYPLAVHPEYSQNNDSISTNRRVD--CSPGFYRNTLGACIDTNECM 524
                  .::|.:|         |..:|...........:.:|.  |:.       .|| ..:.|:
Zfish   512 ------GVQNRKR---------LCNNPVPKHGGKECVGDAKVSQICNK-------QAC-PVDGCL 553

  Fly   525 EQNPCGNHERCIN-TNGHFRCESLLQCSPGYKSTVDGKSCIDIDECDTGEHNCGE---RQICRNR 585
             .:||....:|.: .:|.::|.   :|..||  |.:|.:|.|::||......|.|   ...|.|.
Zfish   554 -SSPCFEGAQCTSFPDGSWKCG---KCPTGY--TGNGINCKDVNECKEVPDACFEFNGVHRCENT 612

  Fly   586 NGGFVC-SC--------PIGHELKRSIGGASTCVDTNECALEQRVCPLNAQCFNTIG-----AYY 636
            ..|:.| .|        |.|..::.:......|...|.|......|..||:| |.:|     .:.
Zfish   613 VPGYNCLPCPTRYTGPQPFGRGVEDAAAKKQVCTPRNPCLDGSHDCNKNARC-NYLGHFSDPMFR 676

  Fly   637 CECKAGFQKKSDGNNSTQCFDIDECQVIPGLCQQKCLNFWGGYRCTC--NSGYQLGPDNRTCNDI 699
            ||||.||.    ||......|.|             |:.|......|  |:.|....||      
Zfish   677 CECKPGFA----GNGHICGEDTD-------------LDGWPNADLVCVENATYHCKKDN------ 718

  Fly   700 NECEVHKDYKLCMGLCINTPGSYQ-----------C--------------SC-----PRGYILAA 734
                           |.|.|.|.|           |              :|     ||.|....
Zfish   719 ---------------CPNLPNSGQEDYDKDGIGDACDDDDDDDGIPDDRDNCPLVFNPRQYDYDR 768

  Fly   735 DMNTCRDV----DEC-------ATDSINQVCTGRNDIC-TNIRGSYKCTTV-NCPLGYSIDPEQK 786
            |     ||    |.|       .||:.|   .|..|.| .:|.|....... |||..|::|  ||
Zfish   769 D-----DVGDRCDNCPYNSNPDQTDTDN---NGEGDACAVDIDGDGILNEKDNCPYLYNVD--QK 823

  Fly   787 NRCRQNLNFCEGEECYTQP 805
            :.....:    |::|...|
Zfish   824 DTDLDGV----GDQCDNCP 838

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG31999NP_726551.2 vWFA <189..227 CDD:469594 7/40 (18%)
EGF_CA 251..293 CDD:238011 6/41 (15%)
EGF_CA 328..>363 CDD:214542 9/34 (26%)
EGF_CA 421..447 CDD:429571 10/43 (23%)
EGF_CA 519..564 CDD:214542 11/45 (24%)
EGF_CA 565..601 CDD:238011 12/47 (26%)
EGF_CA 611..642 CDD:429571 12/35 (34%)
FXa_inhibition 661..696 CDD:464251 7/36 (19%)
EGF_CA 698..>730 CDD:214542 9/61 (15%)
cEGF 723..744 CDD:463661 9/54 (17%)
thbs1bXP_005160819.1 LamG 25..220 CDD:473984 11/84 (13%)
VWC 318..372 CDD:278520 15/100 (15%)
TSP1 383..430 CDD:214559 13/64 (20%)
TSP1 448..491 CDD:214559 13/63 (21%)
TSP1 496..548 CDD:214559 12/91 (13%)
EGF_3 651..690 CDD:463759 15/43 (35%)
TSP3 repeat_1C 692..727 CDD:275367 13/68 (19%)
TSP3 repeat_short 764..786 CDD:275365 6/26 (23%)
TSP_3 787..822 CDD:367074 13/39 (33%)
TSP3 repeat_long 788..822 CDD:275366 12/38 (32%)
TSP3 repeat_short 823..845 CDD:275366 4/20 (20%)
TSP3 repeat_long 846..883 CDD:275365
TSP_3 847..883 CDD:367074
TSP3 repeat_short 884..919 CDD:275366
TSP_3 921..950 CDD:367074
TSP_C 973..1170 CDD:461725
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.