DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Usp47 and T24B8.7

DIOPT Version :10

Sequence 1:NP_523937.2 Gene:Usp47 / 38644 FlyBaseID:FBgn0016756 Length:1556 Species:Drosophila melanogaster
Sequence 2:NP_495931.4 Gene:T24B8.7 / 174440 WormBaseID:WBGene00011980 Length:2938 Species:Caenorhabditis elegans


Alignment Length:1474 Identity:301/1474 - (20%)
Similarity:519/1474 - (35%) Gaps:466/1474 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly   295 SDDDLALG----ASASPTMLGPGYDYGAPTGDSDVEGVTGVTDPSTIGTDDGTYPALSNFYRRKY 355
            :::||||.    |:|...|:.....|   ...||:|..:.....:.|  :|..:|.|.:.....:
 Worm  1697 TNNDLALTENIYAAAKMRMISCLLPY---CSQSDIEDCSKAFIDAII--NDFLFPELPDIEDSMF 1756

  Fly   356 GGDELRAWQRVNTTGADFVSSATTETEAEARQASLG------PRGY------------------- 395
            ..:.:| |                  |.:||:|::.      .|.|                   
 Worm  1757 EEESIR-W------------------EYQAREAAIQTLNAFCERSYKNSHQLLNMWSKFMIAQKT 1802

  Fly   396 ----------------VGLVNQAMTCYLNSLLQALFMTPEFRNAL---------YRWEFDNDNEA 435
                            ||:.|...|||:|:::|.|...|.....|         .||   .||.|
 Worm  1803 YDPSYRPIIRGRQFDKVGMKNDGGTCYMNAMIQQLVHVPGLSRELIALQNIDPQLRW---GDNTA 1864

  Fly   436 KNIPYQLQKLFLNLQTSPKAAVETTDLTRSFGWD-----STEAWQQHDIQELCRVMFDALEHKFK 495
            . :..:||::|..|..:...|:....|.:.|.::     :|:  |.||..:...::.|..::..|
 Worm  1865 A-LLCELQRVFAQLNFAQCQAIVPEGLWKEFRFEPDMPLNTK--QHHDAIDFYSILLDKCDNVLK 1926

  Fly   496 NTKQANLISNLYEGKMNDYVKCLEC-NTEKTREDTFLDIPLPVRPFGSSSAYGSIEEALRAFVQP 559
            ..:...|..|.:.||.:....|..| :..|:.::.|..|.|.:       :..::||||..|:..
 Worm  1927 KLELPPLFQNRFFGKYSYEKICYGCWHRYKSPDEEFNCISLAL-------SGDNLEEALENFLAA 1984

  Fly   560 ETLDGNNQYLCEKCKKKCDAHKGLHFKSFPYILTLHLKRFDFDYQTMHRIKLNDRVTFPQTLNLN 624
            ..::|.|.|.||||.:|........|...|..:|:.||||.:|.......|.|....||..:::.
 Worm  1985 HVMEGENAYHCEKCDEKKTTLNRTSFLELPSTMTIQLKRFTYDLVNNMIRKDNQLFRFPFEIDMT 2049

  Fly   625 TFINRSGNSGEQNSQ--------LNGTVDDC-STADSGSAMEDDNLSSGVVTTASSSQHENDLND 680
            .::..|.:..:::.|        .||..|:. |........|..||.||..:|.|....:..:..
 Worm  2050 PYMTTSRHVPDEHVQDLFDEMLYGNGEADEAPSPPHKNGVAEKPNLGSGSASTPSLESAQKKMFR 2114

  Fly   681 EDEGIDMSSSTSKSAKQG------SGPYLYELFAIMIHSGSASGGHYYAYIK----DFDN----N 731
            ......|..|.|.:...|      ..|.:|||..::.|||.|:.||||::||    :|.:    |
 Worm  2115 RHRSSTMRLSQSFANTSGFDTPSQQKPLIYELVGVLAHSGIATAGHYYSFIKERREEFRDSPHYN 2179

  Fly   732 EWFCFNDQNVTSITQEDIQRSFGGPNGSYYSSAYTSS------------TNAYMLMYRQVDAKRN 784
            :|...||..|:.:       ||......:|...:|..            .|||:|.|   :.||:
 Worm  2180 KWHHINDMIVSPM-------SFNNIEDLWYGGTFTQEGVFIGLDERVRHWNAYVLFY---EKKRD 2234

  Fly   785 ELVAKVADFPEHI----KTLLPKLHSEEETRVSRLG--RHITVTDLALPDLYKPRVYFYNPSLKK 843
            |..|.:   |.||    ..:.||:..:....:...|  :...|.|....|:.:.|         |
 Worm  2235 EPTALI---PRHIIDRLNDVKPKVTFDVSDEIMDDGEDKMKAVQDALKKDMSEAR---------K 2287

  Fly   844 MKITRVYVSQSFNINLVLMSAY-EMLNVEQFAPLSRCRLVAYNSSMDTIIQSLESCTDPALTELR 907
            ::| |::.|....:...|...| :.|:...|......::  |.:|:..:::. |...:..:|:|.
 Worm  2288 LRI-RMFNSMDLKMKKFLNDDYCKFLDDRDFFSFDLYQI--YINSLLPLLKR-EETVEYTVTDLD 2348

  Fly   908 AAQNYSLDFLLEYRAEDQEFEVYPPNGITWYVFKVDLSTMAMDGPFLVYSAAR----EREASDVL 968
            .|..:.|.|  ||              |..|:.:|         .::::...|    .|.|:|::
 Worm  2349 KADFFKLAF--EY--------------IASYIIRV---------AWMMFDEHRPKNFPRAATDLI 2388

  Fly   969 RRSIALRLHISEQQFLLATVRATVPKAFVSYDPHPTPEALQHLQNMANTQFKSITYFYLNVPNTD 1033
            |  :.|..|...:.|.                                  |||:          :
 Worm  2389 R--LLLLRHPDNKMFF----------------------------------FKSL----------E 2407

  Fly  1034 AATLEMLG--VPTVESVECASGGDVVDAAMMNGVAPGHMSSSNDYDWRRYKRDLVEPMSQPSP-- 1094
            |...|||.  :.|.|....||....:.:|:...|    :.:.|        :|  |.|.:|||  
 Worm  2408 ANNSEMLTRMLETTEHDIRASFWHCMRSALRLWV----LENGN--------KD--ENMMEPSPDS 2458

  Fly  1095 ------SHGHESNSEDSSLSDGDRTLVETDNMAHRGGGD----------SQ-VSSTSHSPQLSS- 1141
                  ....:.:.||..|.|.|.   ||:.|..:   |          || |.:.|..|||.. 
 Worm  2459 TDLDDDDEEDDEDDEDDDLEDEDS---ETEEMFRQ---DMMRPSPMALVSQLVMNQSRPPQLKPM 2517

  Fly  1142 -PEDEAASHDAMM------------RVHAYCNGNGSYAAADVVDPLLLPTSTNHFFYATKVECVD 1193
             |.|.:.:....:            |:|. ..|:|.:.:..:|:.|.:.:..|.|          
 Worm  2518 LPVDLSKARSQRLNIVRRIVQVLPFRIHR-MEGSGRHYSRSLVEMLYMISRLNEF---------- 2571

  Fly  1194 VVGTGSSSGHQSDEEAQLRKPTRAYKLLVGTHM---------RMGAFKKHIEQL---IQVPAAHF 1246
                |.|..|..:.           ..|||..:         |:...:..|:.|   ..:|..:|
 Worm  2572 ----GKSVLHVCNA-----------LQLVGEFLWEDYSTVFCRLRFSEDRIKGLGLWPYLPGLYF 2621

  Fly  1247 KLQRKHDNNLSNNQNNSLVHLIE-----------------GETL-----------TVELGKTLEP 1283
            :|       |.:..|.||:|:|:                 .|:|           |.| ||::..
 Worm  2622 EL-------LLDTMNRSLLHIIDPHLHYRHLLMTTQGKFMNESLALYCASREEHETTE-GKSVND 2678

  Fly  1284 DEFKAKIH------FLRLADIDNETSKLPCVCEWVYNANTTAEQAKKELVAKLHRIDAKYATLSV 1342
            ::   .:|      ::|...:...:.......|:::||        .:::.|....|.:|...|:
 Worm  2679 EK---SLHSRIVLCYVRQIQLIVRSEVPDAAQEYIFNA--------CQILLKAFLKDVQYMFTSI 2732

  Fly  1343 QNCRIWLKGGRIPIKILSDDETLYCDMR--------SSIAAEFI-------VQECEEEVDPQPKD 1392
            ::   |           |.....:.|:.        ||:.|..:       :.|.::.|:|    
 Worm  2733 EH---W-----------SHVIDFFIDLAQRLVRVDLSSLGACLLKNLMWVGIAEDDDTVEP---- 2779

  Fly  1393 DSLTLFVRRWCPAKLEFGKFQE-------ITLDQDSEIRLSLSQISD--IPIDKLSYMK--LNSN 1446
             .:...:..|  .:.::.|:::       |.:.:..|:|..|.:.|.  ..|.|:||.:  ::|:
 Worm  2780 -GVLPVMLEW--KETDYTKYRKMNEALFNIRIIKHPELRAVLLRCSKRFRSIFKMSYAENDIDSD 2841

  Fly  1447 FPCTSISALSVNESSSWYSVPTTLDKYPLNSTQTGNIYLYKD---------RTVPARELTLEERR 1502
            .|        ::||......|..| |...|||.     |.:.         |.:..:|:.|||..
 Worm  2842 EP--------MDESDEVMIGPQLL-KRATNSTD-----LTRSAVPSACATARVIETKEINLEEDL 2892

  Fly  1503 LMNAREKARLDRVGCVSTTRYAQRRERALKIYLDSPEKSSNVTA 1546
                  ||.|.......|.:  ...|..|.:  ||.|:..::.|
 Worm  2893 ------KANLSAASLEMTEQ--PSLEEVLSV--DSSEEDISMIA 2926

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Usp47NP_523937.2 DUF4045 <122..249 CDD:433066
peptidase_C19C 394..780 CDD:239124 111/470 (24%)
USP47_C 1300..1534 CDD:466158 49/268 (18%)
T24B8.7NP_495931.4 UBA 133..168 CDD:197551
peptidase_C19C 1819..2234 CDD:239124 111/437 (25%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.