DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Usp47 and Usp40

DIOPT Version :10

Sequence 1:NP_523937.2 Gene:Usp47 / 38644 FlyBaseID:FBgn0016756 Length:1556 Species:Drosophila melanogaster
Sequence 2:NP_001128357.1 Gene:Usp40 / 316599 RGDID:1309613 Length:1235 Species:Rattus norvegicus


Alignment Length:1436 Identity:294/1436 - (20%)
Similarity:472/1436 - (32%) Gaps:511/1436 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly   392 PRGYV---GLVNQAMTCYLNSLLQALFMTPEFRNALY--------RWEFDNDNEAK--NIPYQLQ 443
            ||.:.   |:.||..||||:||||.|..|||||.||:        ..|..:..|||  .||.|||
  Rat    34 PREFTKLSGIRNQGGTCYLSSLLQTLHFTPEFREALFSLGPEELGSLEDKDKPEAKVRIIPLQLQ 98

  Fly   444 KLFLNLQTSPKAAVETTDLTRSFGWDSTEAWQQHDIQELCRVMFDALEHKFKNTKQANLISNLYE 508
            :||..|....:.|..|||||.||||.|.|..:|||:|||.|::|.|||.....|...:||..||.
  Rat    99 RLFAQLLLVDQEAASTTDLTDSFGWTSDEEMRQHDVQELNRILFSALETSLVGTSGHDLIHRLYH 163

  Fly   509 GKMNDYVKCLECNTEKTREDTFLDIPLPVRPFGSSSAYGSIEEAL-RAFVQPETLDGNNQYLCEK 572
            |.:.:.:.|.||.....|::.|||:.:.|:      ....:|:|| ..:|:.|..|.:|.|.|..
  Rat   164 GTIVNQIVCKECKNVSERQEDFLDLTVAVK------NVSGLEDALWNMYVEEEIFDYDNLYHCGT 222

  Fly   573 CKKKCDAHKGLHFKSFPYILTLHLKRFDFDYQTMHRIKLNDRVTFPQTLNLNTFINRSGNSGEQN 637
            |.:...|.|....:..|..||:.|.||:||:....|.|.....|||..:||..|..:|       
  Rat   223 CDRLVKAAKSAKLRKLPPFLTISLLRFNFDFVKCERYKDTSCYTFPLRINLKPFCEQS------- 280

  Fly   638 SQLNGTVDDCSTADSGSAMEDDNLSSGVVTTASSSQHENDLNDEDEGIDMSSSTSKSAKQGSGPY 702
                                                   |::|.:                   |
  Rat   281 ---------------------------------------DMDDME-------------------Y 287

  Fly   703 LYELFAIMIHSGSASGGHYYAYIKDFDN---------------------NE-------------- 732
            :|:||:::||.|...||||:.||||.|:                     ||              
  Rat   288 MYDLFSVIIHKGGCYGGHYHVYIKDVDHLGNWQCQEEISDSAVNLKAPQNEEETDDPLVVLKAIL 352

  Fly   733 ----------------------------------------------------------------- 732
                                                                             
  Rat   353 LQEEANQIPVDQLGQKLLIKTGISWNKKYRKQHGPLRKFLRLHPQVFLLSTDESTVSLQRSHFPQ 417

  Fly   733 ------------------------------WFCFNDQNVTSITQEDIQRSFGGPNGSYYSSAYTS 767
                                          ||..||..|..|.::||.:.|.|            
  Rat   418 VPSDPQSHEQIAHTLASEPPGLRDNISCPHWFDINDSKVQPIKEKDIMQQFQG------------ 470

  Fly   768 STNAYMLMYRQVDAKR-NELVAKVADFPEHIKTLLPKLHSEEETRVSRLGRHITVTDLALPDLYK 831
            ..:||||.||:...:| :|..|.    |.:                 |:..|:            
  Rat   471 KESAYMLFYRKATLQRPSEAQAN----PRY-----------------RVPCHL------------ 502

  Fly   832 PRVYFYNPSLKKMKITRVYV----------SQSFNINLVLMSAYEMLNVEQFAPLSRCRLVAYNS 886
                     ||:|....|.:          :.:|.::|.|...|..                :|.
  Rat   503 ---------LKEMDAANVLLQTRRAECDSANSTFELHLHLGPHYRF----------------FNG 542

  Fly   887 SMDTIIQSLESCTDPALTELRAAQN--YSLDFLLEYRAEDQEFEVYP--PNGITWYVFKVDLSTM 947
            ::...:...||..|....:.:...:  .|:..|||:...|....|..  |.|:..|        .
  Rat   543 ALHPAVSQTESVWDLTFDKRKTVGDLRQSIFQLLEFWEGDMVLSVAKRVPAGLHVY--------H 599

  Fly   948 AMDGPFLVYSAAREREASDVLRRSIALRLHISEQQFLLATVRATVPKAFVSYDPHPTPEALQHLQ 1012
            .:||..|....|...:..|:               |:...|.....:....:|..|   .|.|:.
  Rat   600 TLDGDELTLGEAEIADEEDI---------------FVWNGVEVGGVQIQTGFDCEP---LLLHIL 646

  Fly  1013 NMANTQFKSITYFYLNVPNTDAATLEM------LGVPTVESVECASGGDVVDAAMMNGVAPGHMS 1071
            ::..:...|.....:..|:...|..|:      ||.|  |.|           .:||.|     .
  Rat   647 HLELSGEGSECEQLVESPHVFPANAEVGTVFTALGTP--EGV-----------LLMNSV-----E 693

  Fly  1072 SSNDYDWRRY-KRDLVEPMSQPSPSHG-----HESNSEDSSL---------SDGDRTLVETDNMA 1121
            |:::..|... :.|:.:...:....:|     .:|:|:|:||         |..:...::..|..
  Rat   694 STDEECWTAIPQEDMKKTFREQGLRNGSLILVQDSDSDDNSLLSKQGRWTSSMNELNWLQVKNFC 758

  Fly  1122 HRGGGDSQV--SSTSHSP----QLSSPED-----EAASHDAMMRVHAYCNGNGSYAAADVVDPLL 1175
            ..|..:.||  :.|.|:.    ::.:.::     |.|.:..:..:    :.||.         ||
  Rat   759 QSGSEEKQVQIAVTMHTVVFDIRIKAIKELKLMKELAENSCLRPI----DRNGR---------LL 810

  Fly  1176 LP---TSTNHFFYATKVECVDVVGTGSSSGHQSDEEAQL---RKPTRAYKLL---VGTHMRMGA- 1230
            .|   |||        :|..: |..|||.|        |   :.||.:...|   :||.:..|| 
  Rat   811 CPVPDTST--------LEEAE-VKMGSSVG--------LCLGKAPTSSQLFLFFALGTDIHPGAE 858

  Fly  1231 FKKHIEQLIQV-PAAHFKLQRKHDNNLSNNQNNSLVHL-------------------------IE 1269
            .:..:|:.:.| ......|::       :.|...:.||                         ..
  Rat   859 LEVIVEETLSVRDCLKIMLEK-------SGQQGEMWHLRKMDWCYEAGEPLCEEDATLKELMICS 916

  Fly  1270 GETLTVELGKTLEPDEFKAKIHFL---RLADIDNETSKLPCV----CEW---------------- 1311
            |:||.:..||...|...|..|.:.   ||:........|.|.    ..|                
  Rat   917 GDTLLLTEGKLPSPGHLKMPIWWYQPERLSGHRESWDHLNCAFSQGSSWGAAPTQGAPGPEPAEV 981

  Fly  1312 --VY--NANTTAEQAKKELVAKLHRIDA--KYATLSVQNCRIWLKGGRIPIKILSDDETLYCDMR 1370
              :|  :...:.|....||.:|...:.:  |.|..|....|:|....:.|.::|........:.|
  Rat   982 SLLYLGDMEISEEATLAELKSKALALPSVLKLAVQSTSLLRVWTVESKRPSRLLRTGWRQLKEYR 1046

  Fly  1371 SSIAAEFIVQECEEEVDPQPKDDSLTLFVRRWCPAKLEFGKFQEITLDQD-----SEIRLSLSQI 1430
            .....|..::..::|.|..|:|..|...:|  .|.:..:...:::..|..     ..:|..::..
  Rat  1047 LGRRTELCLELLQKEEDLGPRDVLLRTQLR--IPGERAYSLAKDLVWDTTRGWTAGSLRQRVADF 1109

  Fly  1431 SDIPIDKLSYMKLNSNFPCTSISALSVNESSSW-YSVPTTLDKYPLNSTQTGNIYLYKDRTVPAR 1494
            ..:|::|:...|.   || .....|.:   ||| ..:.....|...::.|.|..||....|:..:
  Rat  1110 YSLPVEKIEIAKY---FP-EKFEWLPI---SSWNQQIAKRKKKKNQDTLQGGPYYLKDGDTIGIK 1167

  Fly  1495 ELTLEERRLMNAREKARLDRVGCVSTTRYAQRRERALKIYLDSPEKSSNVTAS-----------A 1548
            .|..::    |.......|.:|..:..|.|..::::.::   ...:||:|.::           |
  Rat  1168 NLLFDD----NDDFSTIRDDIGKENQKRTALEKKKSREV---QRTQSSDVFSNSGMPTRPRGPEA 1225

  Fly  1549 PMDVHV 1554
            .:.:||
  Rat  1226 SLSIHV 1231

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Usp47NP_523937.2 DUF4045 <122..249 CDD:433066
peptidase_C19C 394..780 CDD:239124 133/529 (25%)
USP47_C 1300..1534 CDD:466158 47/265 (18%)
Usp40NP_001128357.1 peptidase_C19C 40..381 CDD:239124 117/411 (28%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.