DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Tg and Epb42

DIOPT Version :10

Sequence 1:NP_609174.1 Gene:Tg / 34093 FlyBaseID:FBgn0031975 Length:776 Species:Drosophila melanogaster
Sequence 2:NP_001102060.1 Gene:Epb42 / 362202 RGDID:1305306 Length:689 Species:Rattus norvegicus


Alignment Length:711 Identity:183/711 - (25%)
Similarity:316/711 - (44%) Gaps:57/711 - (8%)


- Green bases have known domain annotations that are detailed below.


  Fly    54 LGVLKVDLCLEDNHEEHHTSHFYAAAAKEALVVRRGEPFRLKIHFNRDYSPSKDAISFIFTVA-- 116
            |.:...|....:|::||||.    |.:.:.|::|||:.|.:.::|.   :|:...:|.:..||  
  Rat     5 LSIKSCDFHAAENNKEHHTD----AISSQHLILRRGQSFTITLNFR---APAHIFLSALKKVALI 62

  Fly   117 --DDTKPSPGHGTLNALVPHDGIDYLGDTLEWGAGIESHEGQTLTVLIKPPSTCPVTEWKLDIDT 179
              ...:||....| .|:.|   |..|||...|.|.:...:.|..|:.:..|....:..:.|.:..
  Rat    63 AQTGEQPSKTSKT-QAIFP---ISSLGDKKGWSAAVVERDAQHWTISVTTPVDAVIGHYSLLLQV 123

  Fly   180 KLLGDGSRSYPLPLPIYVLFNPWCPDDQVYLEDRDQRKEYVMHDTTLIWRGSYNRLRPSVWKIGQ 244
                .|.:.|||. ...:|||||..||.|:|::..||.|||::....|:.|:.:.::...|..||
  Rat   124 ----SGKKQYPLG-QFTLLFNPWNRDDAVFLQNEVQRTEYVLNQDGFIYLGTADCIQEEPWDFGQ 183

  Fly   245 FERHVLECSLKVLGTVGRIPPAYRGDPVRVARALSALVNSVDDDGVLLGNWSEDFSGGVAPTKWT 309
            |||.|::.||.:|....::..  ...|..||..:.||::::....||..:.::....|....|..
  Rat   184 FERDVMDLSLNLLSVDKQVKD--WSQPAHVACIVGALLHALKKKSVLPISQTQAAQQGALLYKRR 246

  Fly   310 GSVEILQQFYKTQ-KSVKFAQCWNFSGVLTTIARSLGIPSRIITCYSSAHDTQASLTVDVFIDAN 373
            |||.||:|:...: :.|..:|.|.|:.|..|:.|.||||:|::|.:.||..|..||.||.:.:..
  Rat   247 GSVPILRQWLTGRGRPVYESQAWVFAAVACTVLRCLGIPARVVTTFDSAQGTNGSLLVDEYYNEE 311

  Fly   374 NKKLDAETTDSIWNYHVWNELWMQRPDLGVGEHGTFDGWQVVDATPQEASDNMYRVGPASVAAVK 438
            ..:........||.:....|.||.||||..|    :||||::..........:.......|.|||
  Rat   312 GLQNGEGQRGHIWVFQTSVECWMNRPDLSPG----YDGWQILHPRAPNGVGVLGSCNLVPVKAVK 372

  Fly   439 NGDILRPFDGGFVFAEVNADKLYWRYNGPSQPLKLLRKDTLAIGHLISTKAVLKWEREDITDTYK 503
            .||:........:||.|||..:.|:.....: |:|...:...:|:.||||.|.....||||..||
  Rat   373 EGDLQLDPAVPELFAAVNASCVVWKCCEDGK-LELTNSNRKNVGNSISTKVVGSDRCEDITQNYK 436

  Fly   504 HAERSEEERSTMLKALKQSRHAFSRYYLNDN--------------FNDIEFDMELKDDIKIGQSF 554
            :.|.|.:|:..:.:..|:      |..|.::              |.::.....|:.:.::    
  Rat   437 YPEGSLQEKEVLERVQKE------RMKLGEDTCPPSCEPGDPLHLFLEVPSSQPLRGNGRL---- 491

  Fly   555 SVVLKVSNKSESRTHMATGQISCDAVLYTGVGAVEVKTLGFELELEPKSSDYVRMEVIFEEYYDK 619
            ||.|......|....:.   |:..|:.|.||.|..:......|.|.|.....:...:.|..:...
  Rat   492 SVALINPTDKEKEVELV---IAAQALYYNGVLATGLWRQKQFLMLGPNQVLRLSTSLSFSCFEQN 553

  Fly   620 LSSQAAFQISAAAKVKDTDYDYYAQDDFRVRKPDIKFQLGEAAIVAQKELDVILRLENPLPIPLH 684
            .......:::|.|:.....:..:||:|..:.:|::..::.:.|...| .|...:|:.|.|..|:.
  Rat   554 PPENTFLRVTAMARHSHAGFSCFAQEDVVISRPNLVIEMPKRATQYQ-PLTASVRMHNSLDAPMQ 617

  Fly   685 KGVFTVEGPG-IEQPLKFKIAEIPVGGTAAATFKYTPPYAGRGTMLAKFTSKELDDVDGYR 744
            ..:.::.|.| |.:..::.:..:..|.:....|::||.:.|...:..:.......::.|:|
  Rat   618 NCIISIFGRGLIHREKRYGLGSVWPGSSLHTQFQFTPTHLGLQRLTVEVDCDMFQNLTGHR 678

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
TgNP_609174.1 Transglut_N 58..180 CDD:459971 32/125 (26%)
TGc 324..419 CDD:214673 34/94 (36%)
Transglut_C 538..644 CDD:460002 17/105 (16%)
Transglut_C 652..750 CDD:460002 18/94 (19%)
Epb42NP_001102060.1 Transglut_N 8..124 CDD:459971 32/130 (25%)
TGc 261..351 CDD:214673 34/93 (37%)
Transglut_C 473..565 CDD:460002 16/98 (16%)
Transglut_C 586..684 CDD:460002 18/94 (19%)

Return to query results.
Submit another query.