DRSC/TRiP Functional Genomics Resources


Protein Alignment: Sgs1 and MUC19

Sequence 1: NP_523475.3  Gene: Sgs1   FlyBaseID: FBgn0003372  Length: 1286  Species: Drosophila melanogaster
Sequence 2: NP_775871.2  Gene: MUC19  HGNCID: 14362          Length: 8384  Species: Homo sapiens

Alignment Length: 1430  Identity: 419/1431 (29%)
Similarity: 566/1431 (40%)  Gaps: 317/1431 (22%)
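
The percentages in the summary can be reproduced from the raw counts. The following is a minimal sketch, assuming each percentage is the count divided by the printed denominator (1431) and rounded to the nearest integer; the helper name `percent` is hypothetical and not part of DIOPT.

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a percentage rounded to the nearest integer."""
    return round(100 * count / total)

# Counts copied from the alignment summary above.
TOTAL_COLUMNS = 1431
summary = {
    "Identity": 419,
    "Similarity": 566,
    "Gaps": 317,
}

for label, count in summary.items():
    print(f"{label}: {count}/{TOTAL_COLUMNS} ({percent(count, TOTAL_COLUMNS)}%)")
```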


  Fly    40 GCGGDTIYYPDPVQPCDTDSNPTTTKPRQ---------KTKRPKSTRRTTKRTKRPRRKTTKWTT 95
            |.|...:.....|.|..|...|.::.|.:         |:..|..:..:|  |..|..|.|.   
Human  1928 GPGNTAVSGTPVVSPGATPGAPGSSTPGEADIGNTSFGKSGTPTVSAAST--TSSPVSKHTD--- 1987

  Fly    96 KRATKRTTKRTTRRRPTTPKTPDTTDSPITTTGAECTCSDRTTASSTDSTTDRTTVTNTDWTTPL 160
              |...|....:..:|.||.||....|....|....:....|.||:|...|..:| ..||.:.|.
Human  1988 --AASATAVTISGSKPGTPGTPGGATSGGKITSGWSSSGTSTGASNTPGATGSST-GQTDTSGPS 2049

  Fly   161 CTDTPPCTCSEESSTAIPSSPCI-------DTS---TVIPTSPCTQE---TTTPTPTCSTQGTQT 212
            ...|.....|.|....|.||..:       ||:   :|..|....|.   |....|:....|| |
Human  2050 AKVTGNYGQSSEIPGTIKSSSDVSGTMGQSDTTSGPSVAVTRTSEQSSGVTVASEPSVGVSGT-T 2113

  Fly   213 TPCTCAQTTTTPRST---TTTSTSRPTTTTPRSTTTTTTSRP----------TTTTPRSTTTTTT 264
            .|......||.|..:   ||.|::..:.||..|:..:.|:||          :.|..|:|..:.|
Human  2114 GPLAEISGTTRPLVSGLRTTGSSAEGSGTTGPSSRESVTTRPLAEGSGTSGQSVTGSRATGLSAT 2178

  Fly   265 RRPTTT--TPRCTTTTSTCAPTTTTPRSTTTTTTSRPTTTTPRCTTTTSTCSPTRTTPRSTTTTS 327
            ...||.  |....|:.|:...|.||..|...:.|:.|:..  |..||..:...||.|..|...|.
Human  2179 ELGTTVSFTGGLGTSRSSARETRTTGPSADGSGTTGPSVV--RSGTTRLSVGVTRATESSPGVTG 2241

  Fly   328 TSRPTTTTPRCT--------TTPSTSRPTTTTPRSTTKT--STCAPTTTTPRP----TTTPSTSR 378
            |:.|:....|.|        ||..:.:.:.||.:|..::  |.....||.|..    ||.|||.|
Human  2242 TTTPSAEESRTTGPSVLVTGTTGQSGQGSGTTGKSFIESGPSVVGSGTTGPTSAGLGTTAPSTRR 2306

  Fly   379 PTTTTP----RSTTTTSTSRPTTTTPRSTTTTTTRRPTTTTPR---STTTTSTSRP----TTTTP 432
            .:||.|    ..||..|.:...||.|.:.....|......:.|   |.|.:|||||    |.||.
Human  2307 SSTTKPSVGRTGTTGQSGAESGTTEPSARVAGVTGTSAEVSGRIEPSATESSTSRPLGETTGTTI 2371

  Fly   433 RSTTTTTTSRPTTTTPRSTTTTCTCSPTTTTPR--STTTTSTSRPTTTTPRSTTTTSTSGP---- 491
            .|...:..:.|:.....:|..:...|.||.|..  |..|.|:.....||.:||..:.|:||    
Human  2372 PSMEGSEATGPSVIGSETTRLSVIGSGTTGTSSGGSGATRSSGGGMGTTGQSTARSETTGPLFGL 2436

  Fly   492 TTTTPRSTTTTTTSGPT--TTTPRS------TTTTCTCSPTTTTPR---STTTPSTSRPTTTTPR 545
            |.|..:|.|.|.||..:  .|||..      ||........||.||   |.||.|::....|:.:
Human  2437 TGTFGQSATVTGTSSNSAGVTTPEKSPGVAMTTGLLVEGSATTQPRILESETTESSAGVIVTSGQ 2501

  Fly   546 STTTTCTCSP----TTTTPRST----------TTTSTSRP-------TTTTPRSTTTTTTSRPTT 589
            |...|....|    |.||..||          ..:.|:||       |.|....::||.:|..||
Human  2502 SARVTGATGPSAGETGTTEPSTEGSVAAVLFVIGSETTRPLDIGSGTTGTLSGGSSTTRSSDGTT 2566

  Fly   590 TTPRSTT----TTSTSGPTTTTPRSTTTTSTS----GPTTTTPRSTTTTSTSG-----PTTTTP- 640
            .|.|.:|    ||..||.|.|:.:....|.||    |.|.|:.:|......:|     |.||.| 
Human  2567 GTTRKSTARSETTGLSGLTGTSGQLAGVTGTSSKSAGVTVTSEKSAGVAVITGSFVERPVTTGPP 2631

  Fly   641 --RSTTTTSTSGPTTTTPRSTTTTST----SGPTTTTPRSTTTTSTSGPT-----TTTP---RST 691
              .|.||..:.|.|.|:.:|...|.|    :|.|.||..||..:..:||:     ||.|   .|.
Human  2632 LLESETTRPSGGVTVTSGQSARVTETVGASAGVTGTTGPSTEGSGATGPSVVGSGTTRPLAGESG 2696

  Fly   692 TTTSTSGPTTTTPRSTTTTSTSGPTT----TTPRSTTTTSTSGPTTTTPRSTTTTSTSGPTTTTP 752
            ||.|::|.|.|.|.|:..::|:||:.    ||..|...|.|||.:..  ::.||.:.:|.|.||.
Human  2697 TTESSAGVTGTRPSSSRESATTGPSDEGSGTTGLSAGVTVTSGQSVR--KTGTTGAPAGVTETTR 2759

  Fly   753 RSTTTTSTSGPTTTTPRST--------TTTSTSGPTTTTPRST----TTTSTSGPTTTTPRSTTT 805
            .|...:.|:||:....|:|        .|.|:.|.|.||.:|.    ||.|.:..|.|:.:|...
Human  2760 PSVVKSGTTGPSVIGTRTTGTSSGGSGATRSSGGETETTGQSAVKSGTTESFTRLTRTSGQSAGM 2824

  Fly   806 TSTS----GPTTTTP----RSTTTTSTSGPTTTTP-------------RSTTTTSTSGPTTTTPR 849
            |.||    |...|:|    ..||.:||.|..||.|             ::.||..::|.|.|:.:
Human  2825 TGTSAQSAGVALTSPFVEGLVTTGSSTVGLETTRPSAVGSGKTGPPVVKAQTTGPSAGVTVTSGQ 2889

  Fly   850 STTTTSTSGPT-----TTTPRS--------------TTTTSTSGPTTTTPR--STTTTSTSGPTT 893
            |...|..|||:     ||.|.|              ||..|..|..||.|.  ..||..::|.|.
Human  2890 SARMTGASGPSVGVTGTTGPASKGLGTIRPSVVGLETTELSAEGSGTTGPPIVGETTVPSAGVTV 2954

  Fly   894 TTPRSTTTT---------------STSGPTTTTPRSTTTTSTSCPTTTTPRSTTTTCTSGPTTT- 942
            |:..|...|               |.:|..||.| |.|...|:..||:...|||.:...|..|| 
Human  2955 TSGYSDRVTGATEPLAGVTGTIKPSVAGSVTTGP-SVTGVETTAKTTSGGLSTTISSVGGTGTTG 3018

  Fly   943 -TP-RSTTTTCTSCPTTTTPRSTTTTCTSCPTT------------------------TTPRSTTT 981
             :| ||.||...:..|.|:.:|...|.||..:.                        ||..|...
Human  3019 QSPERSGTTGPFTGLTGTSAQSAGVTMTSIQSAGVLVTTGLNVDGLGTTGKALIGSGTTGLSAEA 3083

  Fly   982 TCTSGPTT---------TTPRSTTK----------TSTCAPTTTTP--RSTTTT--STSRPTTTT 1023
            |.|.||:|         .|...||:          ||:...:||:|  |.|.||  |.:...||.
Human  3084 TGTIGPSTEGLEKTGPSITGSGTTRPLVTESWTAGTSSGGHSTTSPSVRGTETTGQSAAESVTTG 3148

  Fly  1024 PRSTTTTTTSRPT---TTTPRST-----TTPSTSRPTTTTPRSTTTTSTSRPTT----------- 1069
            | .|..|.||.|:   |.|||.:     ||.|::..:.||.:|.|.:.|:||::           
Human  3149 P-VTGYTETSGPSAGVTVTPRQSPTVTQTTGSSAAVSGTTVQSLTVSGTTRPSSGQTEITGSSVK 3212

  Fly  1070 ---TTPRSTTKTSTCAPT-----TTTPRSTTTTTTSRPTTTTPRSTTTTTTSRPTTTTPRSTTTP 1126
               ||..|..::.|..||     |..|.|...|..   |.::|..|.||.:|...|.|..|:...
Human  3213 ESGTTESSAVRSGTTGPTAGVTGTNGPSSAGVTGI---TGSSPGVTGTTGSSPGVTGTTGSSARS 3274

  Fly  1127 CTSRP----TTTTPRSTTTTTTSRPT---------------TTTPRSTTTPCPTTTPSASPTRTT 1172
            .||.|    |.||..|...:.|:||:               ||.|.:..|  .||.|||..||||
Human  3275 GTSIPSVGKTGTTRTSVEESRTTRPSAGITGTNGLSAEVTGTTGPLAGVT--GTTGPSAGVTRTT 3337
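
Each alignment block can be checked column by column. The sketch below uses the first block (Fly 40..95, Human 1928..1987) and assumes the usual convention that `|` in the middle line marks an identical residue pair, with `:` and `.` marking progressively weaker similarity; that reading of `:` and `.` is an assumption, and the function name `block_stats` is hypothetical.

```python
# First alignment block, copied from the page (65 columns each).
fly = "GCGGDTIYYPDPVQPCDTDSNPTTTKPRQ" + "-" * 9 + "KTKRPKSTRRTTKRTKRPRRKTTKWTT"
match = (
    "|.|...:.....|.|..|...|.::.|.:" + " " * 9
    + "|:..|..:..:|" + " " * 2 + "|..|..|.|." + " " * 3
)
human = "GPGNTAVSGTPVVSPGATPGAPGSSTPGEADIGNTSFGKSGTPTVSAAST--TSSPVSKHTD---"

def block_stats(a: str, b: str) -> tuple[int, int]:
    """Count identical residue columns and gapped columns in one block."""
    identical = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    gapped = sum(1 for x, y in zip(a, b) if x == "-" or y == "-")
    return identical, gapped

identical, gapped = block_stats(fly, human)
print(f"columns={len(fly)} identical={identical} gapped={gapped}")
# The '|' count in the middle line should agree with the recomputed identities.
print("pipes in match line:", match.count("|"))
```

Summing these per-block counts over all blocks should give the identity and gap totals reported in the summary, provided the same counting rule was used.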


Known Domains:


Gene   Sequence     Domain                                                     Region       External ID  Identity
Sgs1   NP_523475.3  None
MUC19  NP_775871.2  VWD                                                        481..616     CDD:214566
                    TIL                                                        721..776     CDD:280072
                    VWC                                                        778..>814    CDD:214564
                    VWD                                                        806..968     CDD:214566
                    C8                                                         1003..1073   CDD:285899
                    TIL                                                        1077..1134   CDD:280072
                    TIL                                                        1176..1236   CDD:280072
                    VWC                                                        1238..>1273  CDD:302663
                    VWD                                                        1265..1425   CDD:214566
                    C8                                                         1466..1536   CDD:285899
                    Approximate repeats of G-V-T-G-T-T-G-P-S-A. {ECO:0000305}  2238..6086                336/1110 (30%)
                    FhaB                                                       3428..4424   CDD:225751
                    FhaB                                                       4620..5458   CDD:225751
                    FhaB                                                       5235..6216   CDD:225751
                    VWC                                                        8161..8221   CDD:214564
                    CT                                                         8295..8376   CDD:214482
Blue background indicates that the domain is not in the aligned region.
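
Whether a domain falls inside the aligned region can be estimated from the coordinates alone. The sketch below assumes the aligned human span is 1928..3337 (the first and last human positions shown in the alignment) and treats a domain as "in the aligned region" if its range overlaps that span at all; the overlap rule and the function name `overlaps` are assumptions, not DIOPT's documented logic. Partial-boundary markers (`>`) are dropped from the coordinates.

```python
# Human (MUC19) aligned span, taken from the alignment numbering above.
ALIGNED_START, ALIGNED_END = 1928, 3337

# (domain, start, end) for MUC19, copied from the Known Domains table.
domains = [
    ("VWD", 481, 616), ("TIL", 721, 776), ("VWC", 778, 814),
    ("VWD", 806, 968), ("C8", 1003, 1073), ("TIL", 1077, 1134),
    ("TIL", 1176, 1236), ("VWC", 1238, 1273), ("VWD", 1265, 1425),
    ("C8", 1466, 1536), ("G-V-T-G-T-T-G-P-S-A repeats", 2238, 6086),
    ("FhaB", 3428, 4424), ("FhaB", 4620, 5458), ("FhaB", 5235, 6216),
    ("VWC", 8161, 8221), ("CT", 8295, 8376),
]

def overlaps(start: int, end: int) -> bool:
    """True if [start, end] shares at least one position with the aligned span."""
    return start <= ALIGNED_END and end >= ALIGNED_START

in_region = [name for name, start, end in domains if overlaps(start, end)]
print(in_region)
```

Under this rule only the repeat region overlaps the aligned span, which is consistent with it being the only row that carries an Identity value.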


Information from Original Tools:


Tool             Simple Score  Weighted Score  Original Tool Information
                                               (BLAST Result, Score, Score Type, Cluster ID)
Compara          0             0.000           Not matched by this tool.
eggNOG           0             0.000           Not matched by this tool.
Hieranoid        0             0.000           Not matched by this tool.
Homologene       0             0.000           Not matched by this tool.
Inparanoid       0             0.000           Not matched by this tool.
Isobase          0             0.000           Not matched by this tool.
OMA              0             0.000           Not matched by this tool.
OrthoDB          0             0.000           Not matched by this tool.
OrthoFinder      0             0.000           Not matched by this tool.
OrthoInspector   0             0.000           Not matched by this tool.
orthoMCL         1             0.900           OOG5_126579
Panther          0             0.000           Not matched by this tool.
Phylome          0             0.000           Not matched by this tool.
RoundUp          0             0.000           Not matched by this tool.
TreeFam          0             0.000           Not matched by this tool.
User_Submission  0             0.000           Not matched by this tool.
Total            1             0.900
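
The last row of the table appears to be the column totals. A minimal sketch, assuming the overall scores are plain sums of the per-tool Simple and Weighted scores (how each tool's weight is derived is not shown on this page and is not modeled here):

```python
from decimal import Decimal

tools = [
    "Compara", "eggNOG", "Hieranoid", "Homologene", "Inparanoid", "Isobase",
    "OMA", "OrthoDB", "OrthoFinder", "OrthoInspector", "orthoMCL", "Panther",
    "Phylome", "RoundUp", "TreeFam", "User_Submission",
]

# Every tool scores zero except orthoMCL, as listed in the table.
rows = {name: (0, Decimal("0.000")) for name in tools}
rows["orthoMCL"] = (1, Decimal("0.900"))

simple_total = sum(simple for simple, _ in rows.values())
weighted_total = sum((weighted for _, weighted in rows.values()), Decimal("0.000"))

print(simple_total, weighted_total)
```

`Decimal` is used instead of `float` so that the three-decimal formatting of the weighted scores is preserved in the sum.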
