DRSC/TRiP Functional Genomics Resources

Protein Alignment: sona and Thsd7b

DIOPT Version: 10

Sequence 1: NP_001137746.1  Gene: sona / 37762  FlyBaseID: FBgn0034903  Length: 1102  Species: Drosophila melanogaster
Sequence 2: NP_001178598.1  Gene: Thsd7b / 289007  RGDID: 1306938  Length: 1607  Species: Rattus norvegicus


Alignment Length: 559  Identity: 95/559 (16%)
Similarity: 153/559 (27%)  Gaps: 224/559 (40%)
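
Each percentage above is a count divided by the alignment length. The sketch below recomputes the ratios from the reported counts. The helper name `fraction_percent` is hypothetical, and because the page does not state how it rounds to whole numbers, the sketch reports one decimal place instead.

```python
ALIGNMENT_LENGTH = 559  # alignment columns, as reported on the page


def fraction_percent(count: int, total: int = ALIGNMENT_LENGTH) -> float:
    """Return count/total as a percentage, rounded to one decimal place."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100.0 * count / total, 1)


# Counts copied from the summary lines of this report.
reported = {"Identity": 95, "Similarity": 153, "Gaps": 224}
for label, count in reported.items():
    print(f"{label}: {count}/{ALIGNMENT_LENGTH} = {fraction_percent(count)}%")
```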


Green residues have known domain annotations, which are detailed below.


  Fly   295 EPQVAESRTRSRRQAPYIIYPEVLVIVDYDGYRLHGGDNLQVK-----------RYFISFWNGVD 348
            :|...:.|||      :::.|.:       ..|..|.|: ||:           :|.::.|:...
  Rat  1145 DPHATQRRTR------HLLRPSL-------NSRTCGEDS-QVRPCLLNENCFQFQYNLTEWSTCQ 1195

  Fly   349 LRYRLLKGPRIRISIAGIIISRGRDAT-PYLERNRVGRDAIDSAAALTD---------------- 396
            |...:..|..:|..:...:.|.|:..: .:.|...:.:....|...|.:                
  Rat  1196 LSENISCGQGVRTRLLSCVRSDGKPVSMDHCEERNLDKPQRMSIPCLVECVVNCQLSGWTTWTEC 1260

  Fly   397 ---------MGKYLF-------RERRLPVYDIAVAITKLDMCRRTSAY--------------GEC 431
                     |.:..|       ..||.|     ..:|:...|..|..|              |:|
  Rat  1261 SQTCGQGGRMSRTRFIIMPTQGEGRRCP-----TELTQQKPCPVTPCYSWVLGNWSACKLEGGDC 1320

  Fly   432 NRGTAGFAYVGGACVVNKRLEKVNSVAI-IEDTGGFSGIIVAAHEVGHLLGAVHDGSPPPSYLGG 495
            ..|....::   :|||:.  ..::..|: :|||               |.|.|       .:..|
  Rat  1321 GEGVQVRSF---SCVVHN--GSISHTAVQVEDT---------------LCGEV-------PFQEG 1358

  Fly   496 PGAQRCR-----------WEDGYIMSDLRHTERGFRWSACTVQSFHHFLNGDTATCLHNAPHEDS 549
            ...|.|.           |.:               ||.|.:            ||:.....|.:
  Rat  1359 LQKQLCSVPCPGDCHITPWSE---------------WSKCEL------------TCIDGRSFETT 1396

  Fly   550 A-LGRSLPGTLLSLDAQ------------------------------------CRRDRG---TYA 574
            . ..||....:.|.:.|                                    |:|..|   |..
  Rat  1397 GRQSRSRTFIIQSFENQDSCPQQVLETRPCAGGKCYHYTWKASLWNNNERTVWCQRSDGLNVTGG 1461

  Fly   575 CFKDERVCAQLFCFDA---QTGYCVAYRPAAEGSACGNGYHCLDGRCTPLPSNIIPDYGHNYRLV 636
            |....|..|...|..|   ...||      .:|..||    |..|....:.|:...||       
  Rat  1462 CSPQARPAAIRQCIPACKKPFSYC------TQGGVCG----CEQGYTEIMRSSGFLDY------- 1509

  Fly   637 YNKIDNKKDAAEDVESSSEETESQEDETESVEDQDEGTTSSEEQEDN--KIEQSSAAGGAAAQST 699
            ..|:...:|...||::.|.:......:   :.|..:|.:......|.  ||.....:||      
  Rat  1510 CMKVPGSEDKKADVKTFSGKNRPVNSK---IHDIFKGWSLQPLDPDGRVKIWVYGVSGG------ 1565

  Fly   700 TTARSTTTTTVRTTSTTTTTAKPKSKSFGYLHRSLPERQ 738
                |.........::.....|||.      |:|.|..|
  Rat  1566 ----SFLIMIFLVFASYLVCKKPKP------HQSTPRHQ 1594
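
Each alignment block above has three rows: the fly residues, a match line, and the rat residues. The sketch below tallies one block, assuming the common convention that `|` marks identical residues, `:` marks similar ones, and `-` marks a gap. The page itself does not define these symbols, and `summarize_block` is a hypothetical helper rather than part of DIOPT. The sample data is the final block (fly 700 to 738, rat 1566 to 1594) with the label and coordinate columns removed.

```python
def summarize_block(seq1: str, match: str, seq2: str) -> dict:
    """Count identical, similar, and gapped columns in one alignment block."""
    # Trailing spaces on the match line are often trimmed, so pad it back.
    match = match.ljust(len(seq1))
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("the three rows must cover the same columns")
    identical = match.count("|")
    return {
        "columns": len(seq1),
        "identical": identical,
        # Assumption: "similar" includes identical positions plus ':' positions.
        "similar": identical + match.count(":"),
        "gaps": sum(a == "-" or b == "-" for a, b in zip(seq1, seq2)),
    }


fly = "TTARSTTTTTVRTTSTTTTTAKPKSKSFGYLHRSLPERQ"
mid = "    |.........::.....|||.      |:|.|..|"
rat = "----SFLIMIFLVFASYLVCKKPKP------HQSTPRHQ"
print(summarize_block(fly, mid, rat))
```

Summing these tallies over all blocks should approximate the reported totals if the symbol assumptions hold.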

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain                   Region      External ID  Identity
sona    NP_001137746.1  ZnMc_salivary_gland_MPs  313..542    CDD:239800   47/298 (16%)
                        ADAMTS_CR_2              556..619    CDD:465496   18/104 (17%)
Thsd7b  NP_001178598.1  TSP1_spondin             602..660    CDD:465948
                        TSP1_ADAMTS              665..733    CDD:465950
                        TSP1_spondin             738..789    CDD:480609
                        TSP1_ADAMTS              1063..1123  CDD:465950
                        TSP1_spondin             1129..1175  CDD:480609   10/43 (23%)
                        TSP1_spondin             1249..1302  CDD:465948   8/57 (14%)
                        TSP1_spondin             1372..1431  CDD:480609   11/85 (13%)
                        TSP1_ADAMTS              44..95      CDD:465950
                        TSP1_spondin             180..229    CDD:465948
                        TSP1_spondin             337..398    CDD:465948
                        TSP1_spondin             485..542    CDD:480609
Blue background indicates that the domain is not in the aligned region.
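
Whether a domain falls inside the aligned region can be checked by comparing its residue range with the span covered by the alignment. The sketch below makes two assumptions: the aligned spans are the first and last coordinates shown in the alignment (fly 295 to 738, rat 1145 to 1594), and domain regions are inclusive ranges. `overlaps` is a hypothetical helper, not part of DIOPT.

```python
# Aligned spans read off the first and last alignment coordinates (assumption).
ALIGNED = {"sona": (295, 738), "Thsd7b": (1145, 1594)}


def overlaps(region: tuple[int, int], aligned: tuple[int, int]) -> bool:
    """True if two inclusive residue ranges share at least one position."""
    return region[0] <= aligned[1] and aligned[0] <= region[1]


# A sample of rows from the Known Domains table.
domains = [
    ("sona", "ZnMc_salivary_gland_MPs", (313, 542)),
    ("sona", "ADAMTS_CR_2", (556, 619)),
    ("Thsd7b", "TSP1_ADAMTS", (1063, 1123)),
    ("Thsd7b", "TSP1_spondin", (1129, 1175)),
    ("Thsd7b", "TSP1_spondin", (1372, 1431)),
]
for gene, name, region in domains:
    inside = overlaps(region, ALIGNED[gene])
    status = "in aligned region" if inside else "outside aligned region"
    print(f"{gene} {name} {region[0]}..{region[1]}: {status}")
```

For the rows sampled here, the domains that overlap the aligned span are the same ones that carry an Identity value in the table.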
