DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG40470 and Enpep

DIOPT Version :10

Sequence 1:NP_001036635.1 Gene:CG40470 / 3355093 FlyBaseID:FBgn0058470 Length:941 Species:Drosophila melanogaster
Sequence 2:NP_071587.2 Gene:Enpep / 64017 RGDID:621228 Length:945 Species:Rattus norvegicus


Alignment Length:933 Identity:219/933 - (23%)
Similarity:395/933 - (42%) Gaps:134/933 - (14%)


- Green bases have known domain annotations that are detailed below.


  Fly    52 RLPKEVLPLSYEVLIEPHMDNQNFEGSIRMHLRWIGDSKKVYFHAHDTLLIDVSQINLTTLNMGD 116
            |||..:.|:.|::.::..|:...:.|.:.:.:....|::.::.|..:|.:..:.::...:     
  Rat    84 RLPDFIQPVHYDLEVKVLMEEDRYTGIVSISVNLSKDTRDLWLHIRETRITKLPELRRPS----- 143

  Fly   117 GTLDKNVIILRGVRLPRKPVFVLYLKDKI---KKGSECLLDIYFQGNISETEEGLFRSYYTNSGN 178
               .:.|.|.|.....::...|:..::.:   ...|...|.|.|:|.::.:..|.:|:.||..|.
  Rat   144 ---GEQVPIRRCFEYKKQEYVVIQAEEDLAATSGDSVYRLTIEFEGWLNGSLVGFYRTTYTEDGQ 205

  Fly   179 DGEEIYLATNLKPNNARRLFPCFDEPGIKVPFNVSIARPKGYITLFNTPL--HNTINHPKLRSYS 241
              .:...||:.:|.:||:.|||||||..|..:|:|:..||.|..|.|.|:  ..|:::    .:.
  Rat   206 --TKSIAATDHEPTDARKSFPCFDEPNKKATYNISLIHPKEYSALSNMPVEKEETLDN----DWK 264

  Fly   242 LDFFHTTAPMSTHAFGFVILKLHMWNEHKIVKSSDIPAINIWSNNLSSTNLLDIQNKLNVAHTTI 306
            ...|..:.||||:...|.:   |.:...:....|..|.......|...|    .:...|:.....
  Rat   265 KTTFMKSVPMSTYLVCFAV---HQFTSIQRTSRSGKPLTVYVQPNQKQT----AEYAANITKAVF 322

  Fly   307 QHF---FNIPLPLTKLDVIAIPSLATLPFISASGILIARESEILKKDVFE-----------ISRE 357
            ..|   |.:...|.|||.||||...| ..:...|::..||:.:|...:..           ::.|
  Rat   323 DFFEDYFAMEYSLPKLDKIAIPDFGT-GAMENWGLVTYRETNLLYDPLLSASSNQQRVASVVAHE 386

  Fly   358 LIYQWIGIWITPEWWTDANVNKALISF--------------IASEIVFE----INGGIEFNGKYP 404
            |::||.|..:|.:||.|..:|:...||              :.|:::.|    :.........:|
  Rat   387 LVHQWFGNIVTMDWWDDLWLNEGFASFFEFLGVNHAEADWQMLSQVLLEDVLPVQEDDSLMSSHP 451

  Fly   405 MTILYSLYYELSKRYPNSHITGIKHEFASIKVQLIIRMLSLTVGKYTFRLGIQSFICDYKFKTYK 469
            :.:..|...|::     |...||.:.    |...|:|||...:....|:.|.|.::.::|||..|
  Rat   452 VVVTVSTPAEIT-----SVFDGISYS----KGASILRMLQDWITPEKFQKGCQIYLENFKFKNAK 507

  Fly   470 SSDFWNAITTQAKADNSLDSDLSILSIAESWLEHSRLPLVTIIRDYDSETAIVQQKVYLRERLHD 534
            :||||:::   .||.|.     .:..:.::|......|:||:     |....|.||.:|.:...|
  Rat   508 TSDFWDSL---EKASNQ-----PVKEVMDTWTSQMGYPVVTV-----SGKQNVTQKRFLLDYKAD 559

  Fly   535 VPDQDNML---WWIPIALKRQDSLSFVNTGSFKW-------------MNKTRQMLISNLPSKNMF 583
            .....:.|   |.|||                ||             .|:....|.:|| |.:.|
  Rat   560 PSQPPSALGYTWNIPI----------------KWTENGNSNITVYYRSNREGITLNANL-SGDGF 607

  Fly   584 IIVNEEEIGPFPVNYDDNNWNMLSKYLRTEEKRESIPVYTRAKLLHDAWNLAYAGELNFSTALNV 648
            :.:|.:.||.:.|||:...|:.:::.|.:.....|..  .|:..:.||:.||.|..|::..|||:
  Rat   608 LKINPDHIGFYRVNYEAETWDWIAETLSSNHMNFSSA--DRSSFIDDAFALARAQLLDYEKALNL 670

  Fly   649 TLFLKYERNHIVWSPVFTFLDQVGKRLEKS-SINKKFELYIIELLAPLYEYLGTAHFNEDINITE 712
            |.:|..|::.:.|..|.:.:..:....|.. .:....|.|....:.|:.:.||..  :...:||:
  Rat   671 TRYLTSEKDFLPWERVISAVSYIISMFEDDRELYPLIETYFRSQVKPIADSLGWQ--DTGSHITK 733

  Fly   713 -LRKLTTSFLCKAGYFPCFKEARRAFNIWI--NSSFP-NFETPVPNEYICSIFKWGSMKEWMFGL 773
             ||.....|.||.|.......|.:.|..|:  |.|.| |....|   |...:...|:...|.:.|
  Rat   734 LLRASVLGFACKMGAGEALGNASQLFEAWLKGNESIPVNLRLLV---YRYGMQNSGNEAAWNYTL 795

  Fly   774 DRLCEFPKSRIQSDRTHLLKMLAGCPAQRDKIFILLELAILKNISIFSDTDKMLIISTVTSRSIG 838
            :   ::.|:.:..::.   |:|.|..:.:|...:...|.:||:.:|....|...:|..::..|.|
  Rat   796 E---QYQKTSLAQEKE---KLLYGLASVKDVTLLARYLEMLKDPNIIKTQDVFTVIRYISYNSYG 854

  Fly   839 YTTLLDFLSNNWDDIHHKFYNNTNIWTKLISSATGMFSTQEGYDLVKKFYDEHYGHFGRAQHIIE 903
            .:...:::..|||.:.::|..|.....::::.|. .|:|:.....::.|:.: |.:.|......|
  Rat   855 KSMAWNWIQLNWDYLVNRFTINDRYLGRIVTIAE-PFNTELQLWQMQSFFAK-YPNAGAGAKPRE 917

  Fly   904 KSLRNIKEGIQWSKQNIPVIEEW 926
            :.|..:|..|:|.|.|...|.||
  Rat   918 QVLETVKNNIEWLKLNRKSISEW 940

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG40470NP_001036635.1 GluZincin 60..502 CDD:472708 109/478 (23%)
ERAP1_C 583..910 CDD:463368 77/331 (23%)
EnpepNP_071587.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 45..77
M1_APN-Q_like 92..531 CDD:341064 108/477 (23%)
ERAP1_C 607..925 CDD:463368 77/332 (23%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.