DRSC/TRiP Functional Genomics Resources

Protein Alignment CG5087 and Hecw2

DIOPT Version: 10

Sequence 1: NP_648279.1  Gene: CG5087 / 39035  FlyBaseID: FBgn0035953  Length: 1078  Species: Drosophila melanogaster
Sequence 2: NP_001101688.1  Gene: Hecw2 / 316395  RGDID: 1593244  Length: 1578  Species: Rattus norvegicus


Alignment Length: 945
Identity: 220/945 (23%)
Similarity: 355/945 (37%)
Gaps: 253/945 (26%)
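
The summary percentages are the listed counts divided by the alignment length of 945. The sketch below reproduces them under the assumption that the page truncates to a whole percent rather than rounding. That convention is inferred from these three values only, not from DIOPT documentation, and the helper name `percent_floor` is hypothetical. The domain table's 8/84 (10%) entry suggests a different rounding may apply there.

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed convention, see lead-in)."""
    return count * 100 // total


# Counts taken from the alignment summary of CG5087 vs. Hecw2.
alignment_length = 945
stats = {"identity": 220, "similarity": 355, "gaps": 253}

summary = {name: percent_floor(n, alignment_length) for name, n in stats.items()}
print(summary)  # {'identity': 23, 'similarity': 37, 'gaps': 26}
```

With ordinary rounding, similarity would come out as 38% and gaps as 27%, which is why truncation is assumed here.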


- Green bases have known domain annotations that are detailed below.


  Fly   174 TAPKSWAILRNA----QFEKLQPAMQKI------------CCNIQGHLVQHDFYKI-----MRLV 217
            |||.:..:|:.:    |.|:|....|.|            ...|.|...:.||::.     ...|
  Rat   845 TAPPAPQVLQRSNSIQQMEQLNRRYQSIRRTMTNERPEENTSAIDGAGEEADFHQASADFRRENV 909

  Fly   218 LIRGTVREELSVKPVTLVAIITLCLR-PLIDGNFSRNLLAKFLSEILSVPALIYHLQQSVPQCLE 281
            |...|.|..|           ||.|: |.:          |||     :....:.:..|.|....
  Rat   910 LPHSTSRSRL-----------TLLLQSPPV----------KFL-----ISPEFFTVLHSNPSAYR 948

  Fly   282 QFSSMGLLKKALS-ISGDVQWFEEFGTSMPGTKSLAFLGNIVNLFNIDGQGESKELAYPLLTETT 345
            .|::...||..:: :..|...||.:..:..       |...:|:|      .:|:|..|      
  Rat   949 MFTNNTCLKHMITKVRRDTHHFERYQHNRD-------LVGFLNMF------ANKQLELP------ 994

  Fly   346 TSLLELIPNTVTTKGVFTQWHELLGWHTPGPEPAQNQNVALIKKQFHMLWDHRCIKLLLGDLLKQ 410
                                   .||                    .|..||:.....       
  Rat   995 -----------------------RGW--------------------EMKHDHQGKAFF------- 1009

  Fly   411 INLNYERIEFQSPQQPSISNLLRRALERSSTRGVNLMGVASSKQSKQQWRKLDSAEVVQVSRICG 475
            ::.|.....|..|:.|..|:....||..              :|...:.|...:.||.:.||..|
  Rat  1010 VDHNSRTTTFIDPRLPLQSSRPTSALVH--------------RQHLTRQRSHSAGEVGEDSRHAG 1060

  Fly   476 -----MYYAALNTLSQMKLDILTGICYSD---------NVLYDIWLLITSLGPNCGMKEYLELLK 526
                 ...:..||:|:.:...:..:.|:|         |:|..:......|..|..::|.::.::
  Rat  1061 PPVLPRPASTFNTVSRPQYQDMVPVAYNDKIVAFLRQPNILEILQERQPDLARNHSLREKIQFIR 1125

  Fly   527 SETNLQKPQTAMLMLFCDCMTHYVTILDEHEMYTEQNPFKLNDYVLLTYFLNNILYKLINDNLLA 591
            :|..   |....|....|.:                        :||:.|...|:..:....|| 
  Rat  1126 TEGT---PGLVRLSSDADLV------------------------MLLSLFEEEIMSYVPPHALL- 1162

  Fly   592 GAKNIVQHPVFLSLHTLMLCLYRRDCRRPFTPPKHWLIPEVKPSTFINDLEKAKRNAMLLLAKMP 656
                   ||.:              |:.|...|..  .|:..|.|       .:.||.   |..|
  Rat  1163 -------HPSY--------------CQSPRGSPVS--SPQNSPGT-------QRANAR---APAP 1194

  Fly   657 HIIPHEDRVKLFRKFVQNEKAVMGLTESACASPRSALIVIHRDRIVEDGYRQLAAQPTQAL-KGV 720
            :....|.:::.|.:.::        |:.....|....::|.||.::||.:.|:.....:.| :..
  Rat  1195 YKRDFEAKLRNFYRKLE--------TKGYGQGPGKLKLIIRRDHLLEDAFNQIMGYSRKDLQRNK 1251

  Fly   721 IRVRFINQQGLHEAGIDQDGVFKEFLEETIKKVFDPSLNLFKTTSDQ--RLYPSPISYVQDNHLE 783
            :.|.|:.::||     |..|..:||.....:::|:|...||:.:::.  .:..||:|...|||.|
  Rat  1252 LYVTFVGEEGL-----DYSGPSREFFFLVSRELFNPYYGLFEYSANDTYTVQISPMSAFVDNHHE 1311

  Fly   784 LFEFVGRMLGKAVYEGIVVDVPFASFFLSQLLGQTQQALYSCMDELPSLDNELYRSLTFIKHYKQ 848
            .|.|.||:||.|:....::|..|...|...||     .:...:.:|..||.|.::||.::|  ..
  Rat  1312 WFRFSGRILGLALIHQYLLDAFFTRPFYKALL-----RILCDLSDLEYLDEEFHQSLQWMK--DN 1369

  Fly   849 DVSD-LNLTFSVDQDVMGKIVTLALHPGGKARVVNDHNKLVYIHYMAFFHMNTQIREQTIAFNRG 912
            |:.| |:|||:|:::|.|:|....|.|||....|.:.||..||..|..:.:...:.:||.:..||
  Rat  1370 DIHDILDLTFTVNEEVFGQITERELKPGGANIPVTEKNKKEYIERMVKWRIERGVVQQTESLVRG 1434

  Fly   913 FRSIVNPEWLSLFSPPELQRLISGDTSPLDLKDLQKHTHYYGGFHDTHQVVCWLWDILAKDFTEE 977
            |..:|:...:|:|...||:.:|:| |:.:||.|.:.:|.|.||:||.|.|:.|.|..:.: |..|
  Rat  1435 FYEVVDARLVSVFDARELELVIAG-TAEIDLNDWRNNTEYRGGYHDNHIVIRWFWAAVER-FNNE 1497

  Fly   978 ERKLFLKFVTSCSKPPLLGFAHLEPPFSIRCVEVSDDEDTGDTIGSVIRGFFAIRKKDPLNRLPT 1042
            :|...|:|||..|..|..|||.|.                    ||.....|.:.|...:..||.
  Rat  1498 QRLRLLQFVTGTSSIPYEGFASLR--------------------GSNGPRRFCVEKWGKITALPR 1542

  Fly  1043 SSTCFNLLKLPNYQKKSTLRDKLRYAVSSNTGFEL 1077
            :.||||.|.||.|...|.|.:||..||...:.|.|
  Rat  1543 AHTCFNRLDLPPYPSFSMLYEKLLTAVEETSTFGL 1577

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence        Domain         Region       External ID  Identity
CG5087  NP_648279.1     IQCD           20..50       CDD:467745
                        HECTc          694..1076    CDD:238033   124/385 (32%)
Hecw2   NP_001101688.1  HECW_N         45..162      CDD:465177
                        C2_NEDL1-like  185..321     CDD:176073
                        DMP1           <345..731    CDD:462128
                        NESP55         <464..632    CDD:115071
                        WW             815..844     CDD:459800
                        HECW1_helix    921..987     CDD:465766   17/93 (18%)
                        WW             993..1022    CDD:459800   8/84 (10%)
                        HECTc          1222..1576   CDD:238033   124/387 (32%)
Blue background indicates that the domain is not in the aligned region.