DRSC/TRiP Functional Genomics Resources



Protein Alignment: Sgs1 and MUC5B

Sequence 1: NP_523475.3  Gene: Sgs1  FlyBaseID: FBgn0003372  Length: 1286  Species: Drosophila melanogaster
Sequence 2: NP_002449.2  Gene: MUC5B  HGNCID: 7516  Length: 5762  Species: Homo sapiens

Alignment Length: 1797  Identity: 537/1798 (30%)
Similarity: 688/1798 (38%)  Gaps: 692/1798 (38%)
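
The percentages in the summary can be reproduced from the raw counts. The following is a minimal sketch, not DIOPT's own code: the `percent` helper is a hypothetical name, and the denominator is assumed to be the 1798 shown in the fractions (the page also reports an alignment length of 1797).

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage."""
    return round(100 * count / total)


# Counts copied from the summary; 1798 is the denominator used in the fractions.
columns = 1798
stats = {"identity": 537, "similarity": 688, "gaps": 692}

summary = {name: percent(n, columns) for name, n in stats.items()}
print(summary)
```

Rounding to the nearest whole number gives the same 30%, 38%, and 38% that the page reports.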


  Fly    49 PDPVQPCDTDSNPTTT--------KPRQKTKRPKSTRRTTKRTKR----------------PRRK 89
            |.|.....|.:.||.|        .||..|..|..|...||.|..                |.:.
Human  3020 PPPKVLTSTATTPTATSSKATSSSSPRTATTLPVLTSTATKSTATSFTPIPSFTLGTTGTLPEQT 3084

  Fly    90 TTKW-------------TTKRATKRTTKRTTRRRPTTPKTPDTTDSPITTTGAECTCSDRTTASS 141
            ||..             ||..:|..|||.||.|..::..||.      :|.|.....::.|||::
Human  3085 TTPMATMSTIHPSSTPETTHTSTVLTTKATTTRATSSMSTPS------STPGTTWILTELTTAAT 3143

  Fly   142 TDSTTDRT-TVTNTDWTTPLCTDTPPCTCSEESSTAIPSSPCIDTSTVIPT-------SPCTQET 198
            |.:.|..| |.::|..||.:.|        |.|:||..:.|...|:|...|       ...|...
Human  3144 TTAATGPTATPSSTPGTTWILT--------EPSTTATVTVPTGSTATASSTRATAGTLKVLTSTA 3200

  Fly   199 TTP-------TPTCSTQGTQTTPCTCAQTTTTPRST-------------------TTTSTSRPTT 237
            |||       ||: |:.||.|.......|.|||.:|                   |||.|:..:|
Human  3201 TTPTVISSRATPS-SSPGTATALPALRSTATTPTATSVTAIPSSSLGTAWTRLSQTTTPTATMST 3264

  Fly   238 TTPRS-----------TTTTTTSRPT--TTTPRST--TTTTTRRPTTTTPRCTTTTSTCAPTTTT 287
            .||.|           ||||||:|.|  ..||.||  |..||:.|||||...|.|.|:...|..|
Human  3265 ATPSSTPETVHTSTVLTTTTTTTRATGSVATPSSTPGTAHTTKVPTTTTTGFTATPSSSPGTALT 3329

  Fly   288 PR---STTTTTTSRPTTTTPR------------CTTTTSTCSPTRTTPRSTTTTSTSRPTTTTPR 337
            |.   |||||.|:|.:|.||.            .||||:..:.:..||.|:|.||.:.|:.||  
Human  3330 PPVWISTTTTPTTRGSTVTPSSIPGTTHTATVLTTTTTTVATGSMATPSSSTQTSGTPPSLTT-- 3392

  Fly   338 CTTTPSTSRPTTTTPRSTTKTSTCAP----TTTTPRPTT---TPSTSRPTTTTPRSTTTTST--- 392
             |.|..|:..:||.|.||..|:...|    |.|||..|:   |||::..||.||....||:|   
Human  3393 -TATTITATGSTTNPSSTPGTTPIPPVLTTTATTPAATSSTVTPSSALGTTHTPPVPNTTATTHG 3456

  Fly   393 -------------------------------SRPTTTTPRSTTTTT---TRRPTTTTPRSTTTTS 423
                                           |..|:.||.:||:||   |...::..|.|.||.|
Human  3457 RSLPPSSPHTVRTAWTSATSGILGTTHITEPSTVTSHTPAATTSTTQHSTPALSSPHPSSRTTES 3521

  Fly   424 TSRPTTTTP-------RSTTTTTTSRPTTTT---------PRSTTTTCTCSP------------- 459
            ...|.||||       |:|.|.|.|:..|:|         |.:|..|..|.|             
Human  3522 PPSPGTTTPGHTRGTSRTTATATPSKTRTSTLLPSSPTSAPITTVVTTGCEPQCAWSEWLDYSYP 3586

  Fly   460 ----------------------------------------------------------------- 459
                                                                             
Human  3587 MPGPSGGDFDTYSNIRAAGGAVCEQPLGLECRAQAQPGVPLRELGQVVECSLDFGLVCRNREQVG 3651

  Fly   460 ---------------------------TTTTPRST----------TTTST---SRPTTTTPRST- 483
                                       :|.||.||          |||:|   |..:|.||.|| 
Human  3652 KFKMCFNYEIRVFCCNYGHCPSTPATSSTATPSSTPGTTWILTKLTTTATTTESTGSTATPSSTP 3716

  Fly   484 ---------TTTST----SGPTTT---------TPRSTTTTTTSGPTTT----TPRSTTTTCTCS 522
                     :||:|    :|.|.|         ||..:||.||  ||.|    ||.|:..|.|..
Human  3717 GTTWILTEPSTTATVTVPTGSTATASSTQATAGTPHVSTTATT--PTVTSSKATPFSSPGTATAL 3779

  Fly   523 P----TTTTPRSTT---TPSTSRPTTTTPRSTTTTCTCSPTTTTPRS-----------TTTTSTS 569
            |    |.|||.:|:   .||:|..||.|..|.|||.|.:.:|.||.|           |||.:|:
Human  3780 PALRSTATTPTATSFTAIPSSSLGTTWTRLSQTTTPTATMSTATPSSTPETAHTSTVLTTTATTT 3844

  Fly   570 RPT--TTTPRST--TTTTTSRPTTTTPRSTTTTSTSGPTTTTPR---STTTTSTSGPTTTTPRS- 626
            |.|  ..||.||  |..||..|||||...|.|.|:|..|..||.   |||||.|:..:|.||.| 
Human  3845 RATGSVATPSSTPGTAHTTKVPTTTTTGFTVTPSSSPGTARTPPVWISTTTTPTTSGSTVTPSSV 3909

  Fly   627 -----------TTTTSTSGPTTTTPRSTTTTSTSGP------TTTTPRSTTTTSTSGP------- 667
                       ||||:.:..:..||.|:|.||.:.|      ||.|...:||..:|.|       
Human  3910 PGTTHTPTVLTTTTTTVATGSMATPSSSTQTSGTPPSLITTATTITATGSTTNPSSTPGTTPIPP 3974

  Fly   668 ----TTTTPRSTTTT----STSGPTTTTPRSTTTTSTSGPTTT-----TPRSTTTTSTSGP---- 715
                |.|||.:|::|    |..|.|.|.|...||.:|.|.:.:     |.|:..|::|||.    
Human  3975 VLTTTATTPAATSSTVTPSSALGTTHTPPVPNTTATTHGRSLSPSSPHTVRTAWTSATSGTLGTT 4039

  Fly   716 --------TTTTPRSTT-TTSTSGPTTTTPR--STTTTSTSGPTTTTPRSTTTTS---------- 759
                    |:.||.:|| ||..|.|..::|.  |.||.|...|.||||..||.||          
Human  4040 HITEPSTGTSHTPAATTGTTQHSTPALSSPHPSSRTTESPPSPGTTTPGHTTATSRTTATATPSK 4104

  Fly   760 -----------TSGPTTTT---------------------------------------------- 767
                       ||.|.||.                                              
Human  4105 TRTSTLLPSSPTSAPITTVVTTGCEPQCAWSEWLDYSYPMPGPSGGDFDTYSNIRAAGGAVCEQP 4169

  Fly   768 -----------------------------------------------------------PRSTTT 773
                                                                       |.:..|
Human  4170 LGLECRAQAQPGVPLGELGQVVECSLDFGLVCRNREQVGKFKMCFNYEIRVFCCNYGHCPSTPAT 4234

  Fly   774 TSTSGPTTT----------TPRSTTTTSTSGPTTTTPRSTTTTS------TSGPTTTTPRSTTTT 822
            :||:.|::|          |..:|||.||.  :|.||.||..|:      ||..||.|..|:..|
Human  4235 SSTAMPSSTPGTTWILTELTTTATTTASTG--STATPSSTPGTAPPPKVLTSPATTPTATSSKAT 4297

  Fly   823 STSGP---------TTTTPRSTTT------TSTSGPTTTTPRSTT----TTSTSGPTTT------ 862
            |:|.|         |:|..:||.|      :||.|.|.|.|..||    |.||..|::|      
Human  4298 SSSSPRTATTLPVLTSTATKSTATSVTPIPSSTLGTTGTLPEQTTTPVATMSTIHPSSTPETTHT 4362

  Fly   863 ----TPRSTTTTSTSGPTTTTPRST--------------TTTSTSGPTTT---TPRST------- 899
                |.::|||.:||  :|:||.||              |||:.:|||.|   ||.:|       
Human  4363 STVLTTKATTTRATS--STSTPSSTPGTTWILTELTTAATTTAATGPTATPSSTPGTTWILTELT 4425

  Fly   900 ---TTTSTSGPTTT---TPRST-------TTTSTSCPTTTTPRSTTTTCTSG----------PTT 941
               |||:::|.|.|   ||.:|       ||.:.:.||.:|..:::|..|:|          ||.
Human  4426 TTATTTASTGSTATPSSTPGTTWILTEPSTTATVTVPTGSTATASSTQATAGTPHVSTTATTPTV 4490

  Fly   942 T----TPRSTTTTCTSCP----TTTTPRST-------------------TTTCTSCPTTTTPRS- 978
            |    ||.|:..|.|:.|    |.|||.:|                   |||.|:..:|.||.| 
Human  4491 TSSKATPSSSPGTATALPALRSTATTPTATSFTAIPSSSLGTTWTRLSQTTTPTATMSTATPSST 4555

  Fly   979 -----TTTTCTSGPTTT-------TPRST--TKTSTCAPTTTTPRSTTTTSTSRPTTTTPRSTTT 1029
                 |:|..|:..|||       ||.||  |..:|..|||||...|.|.|:|..|..||....:
Human  4556 PETVHTSTVLTATATTTGATGSVATPSSTPGTAHTTKVPTTTTTGFTATPSSSPGTALTPPVWIS 4620

  Fly  1030 TTTSRPTTTTPR---STTTPSTSRPTTTTPR--STTTTSTSRPTTTTPRSTTKTSTCAPTTTTPR 1089
            |||: ||||||.   ||.|||:...||.|.|  :||||:.:..:..||.|:|:||...|:.||  
Human  4621 TTTT-PTTTTPTTSGSTVTPSSIPGTTHTARVLTTTTTTVATGSMATPSSSTQTSGTPPSLTT-- 4682

  Fly  1090 STTTTTTSRPTTTTPRSTTTTTTSRP----TTTTPRSTTTPCTS----RPTTTTPRSTTTTTTSR 1146
             |.||.|:..:||.|.||..||...|    |.|||.:|::..||    |..||.|..|:|.|.|.
Human  4683 -TATTITATGSTTNPSSTPGTTPITPVLTSTATTPAATSSKATSSSSPRTATTLPVLTSTATKST 4746

  Fly  1147 PTTTT--PRST-----TTPCPTTTPSASPTRTTTTRRPCPCH 1181
            .|:.|  |.||     |.|..||||.::.:...|:..|...|
Human  4747 ATSFTPIPSSTLWTTWTVPAQTTTPMSTMSTIHTSSTPETTH 4788
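
Identity and gap counts like those in the summary can be tallied directly from the two aligned rows of a block. This is a rough sketch under stated assumptions: `column_stats` is a hypothetical helper, it treats `-` as the gap character, and it ignores the middle markup line because the meaning of its `:` and `.` symbols is not stated on this page. The sample rows are the first 18 columns of the block starting at Fly 199 / Human 3201.

```python
def column_stats(row_a: str, row_b: str) -> dict:
    """Count identical and gapped columns in two equal-length aligned rows."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    gaps = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":
            gaps += 1          # a gap in either sequence
        elif a == b:
            identical += 1     # same residue in both sequences
    return {"columns": len(row_a), "identical": identical, "gaps": gaps}


# First 18 columns of the Fly 199 / Human 3201 block.
fly = "TTP-------TPTCSTQG"
human = "TTPTVISSRATPS-SSPG"

result = column_stats(fly, human)
print(result)
```

Summing these per-block counts over every block would give whole-alignment totals, which can then be compared against the reported figures.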

Known Domains:


Gene   Sequence     Domain                                         Region       External ID  Identity
Sgs1   NP_523475.3  None
MUC5B  NP_002449.2  VWD                                            70..223      CDD:214566
                    C8                                             270..325     CDD:285899
                    TIL                                            329..385     CDD:280072
                    VWC_out                                        387..>432    CDD:214565
                    VWD                                            414..578     CDD:214566
                    C8                                             615..685     CDD:214843
                    TIL                                            695..752     CDD:280072
                    TIL                                            794..855     CDD:280072
                    VWD                                            884..1041    CDD:214566
                    C8                                             1078..1152   CDD:214843
                    7 X Cys-rich subdomain repeats                 1333..4228                335/1228 (27%)
                    Mucin2_WxxW                                    1340..1426   CDD:290069
                    Mucin2_WxxW                                    1508..1597   CDD:290069
                    Mucin2_WxxW                                    1789..1878   CDD:290069
                    11 X approximate tandem repeats, Ser/Thr-rich  1890..2199
                    Mucin2_WxxW                                    2320..2408   CDD:290069
                    11 X approximate tandem repeats, Ser/Thr-rich  2419..2756
                    PARM                                           2675..>2860  CDD:293666
                    Mucin2_WxxW                                    2877..2965   CDD:290069
                    17 X approximate tandem repeats, Ser/Thr-rich  2976..3456                146/454 (32%)
                    Mucin2_WxxW                                    3577..3665   CDD:290069   0/88 (0%)
                    11 X approximate tandem repeats, Ser/Thr-rich  3676..4013                123/339 (36%)
                    Mucin2_WxxW                                    4134..4222   CDD:290069   0/88 (0%)
                    23 X approximate tandem repeats, Ser/Thr-rich  4233..4879                201/565 (36%)
                    VWD                                            5064..5237   CDD:214566
                    C8                                             5290..5351   CDD:285899
                    VWC                                            5523..5586   CDD:214564
                    CT                                             5660..5742   CDD:214482
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool             Simple Score  Weighted Score  Original Tool Information
                                               (BLAST Result, Score, Score Type, Cluster ID)
Compara          0             0.000           Not matched by this tool.
eggNOG           0             0.000           Not matched by this tool.
Hieranoid        0             0.000           Not matched by this tool.
Homologene       0             0.000           Not matched by this tool.
Inparanoid       0             0.000           Not matched by this tool.
Isobase          0             0.000           Not matched by this tool.
OMA              0             0.000           Not matched by this tool.
OrthoDB          0             0.000           Not matched by this tool.
OrthoFinder      0             0.000           Not matched by this tool.
OrthoInspector   0             0.000           Not matched by this tool.
orthoMCL         1             0.900           OOG5_126579
Panther          0             0.000           Not matched by this tool.
Phylome          0             0.000           Not matched by this tool.
RoundUp          0             0.000           Not matched by this tool.
TreeFam          0             0.000           Not matched by this tool.
User_Submission  0             0.000           Not matched by this tool.
Total            1             0.900
