DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment tgo and sim1a

DIOPT Version :10

Sequence 1:NP_731308.1 Gene:tgo / 41084 FlyBaseID:FBgn0264075 Length:642 Species:Drosophila melanogaster
Sequence 2:NP_835740.2 Gene:sim1a / 260351 ZFINID:ZDB-GENE-020829-1 Length:745 Species:Danio rerio


Alignment Length:677 Identity:160/677 - (23%)
Similarity:257/677 - (37%) Gaps:171/677 - (25%)


- Green bases have known domain annotations that are detailed below.


  Fly    15 RENHCEIERRRRNKMTAYITELSDMVPTCSALARKPDKLTILRMAVAHMKAL----RGTG----- 70
            :|......|.||.|..:...||:.::|..||:..:.||.:|:|:..:::|..    .|.|     
Zfish     2 KEKSKNAARTRREKENSEFYELAKLLPLPSAITSQLDKASIIRLTTSYLKMRIVFPEGLGESWGH 66

  Fly    71 ---NTSSDGTYKPSFLTDQELKHLILEAADGFLFVVSCDSGRVIYVSDSVTPVLNYTQSDWYGTS 132
               .||.:.       ..:||...:|:..|||:|||:.| |:::|:|::.:..|..:|.:..|.|
Zfish    67 VSRTT
SLEN-------VGRELGSHLLQTLDGFIFVVAPD-GKILYISETASVHLGLSQVELTGNS 123

  Fly   133 LYEHIHPDDREKIREQLSTQESQNAGRILDLKSGTVKKEGHQSSMRLSMGARRGFICRMRVGNVN 197
            :||:|||.|.:::...|:..:..::..:            |:..|      .|.|..||:.    
Zfish   124 IYEYIHPADHDEMTAVLTAHQPYHSHFV------------HEYEM------ERSFFLRMKC---- 166

  Fly   198 PESMVSGHLNRLKQRNSLGPSRDGTNYAVVHCTGYIK-NWPPTDMFPNMHMERDVDDMSSHCCLV 261
                      .|.:||: |.:..|  |.|:||:||:| .....||.|       .|....:..||
Zfish   167 ----------VLAKRNA-GLTCGG--YKVIHCSGYLKIRQYSLDMSP-------FDGCYQNVGLV 211

  Fly   262 AIGRLQVTSTAANDMSGSNNQSEFITRHAMDGKFTFVDQRVLNILGYTPTELLGKICYDFFHPED 326
            |:|. .:..:|..::...:|.  |:.|.::|.|..|:|.||..:.||.|.:|:.|..|...|..|
Zfish   212 AVGH-SLPPSAVTEIKLHSNM--FMFRASLDMKLIFLDSRVAELTGYEPQDLIEKTLYHHVHSCD 273

  Fly   327 QSHMKESFDQVLKQKGQMFSLLYRARAKNSEYVWLRTQAYAFLNPYTD------EVEYIVCTNSS 385
            ..|:: ....:|..|||:.:..||..||...:||:::.|....|..:.      .|.|:: |::.
Zfish   274 TFHLR-CAHHLLLVKGQVTTKYYRFLAKQGGWVWVQSYATIVHNSRSSRPHCIVSVNYVL-TDTE 336

  Fly   386 GKTMHGAPLDAAAAHTPEQVQQQQQQEQHVYVQAAPGVDYARRELTPVGSATNDGMYQTHMLAMQ 450
            .|.:. ..||.||:..|          ...|...:..|...||    ||.:.          ..:
Zfish   337 YKGLQ-LSLDQAASTKP----------SFTYNSPSNPVTENRR----VGKSR----------VSR 376

  Fly   451 APTPQQQQQQQQRPGSAQTTPVGYTYDTTHSPYSAGGPSPLAKIPKSGTSPTPVAPNSWAALRPQ 515
            ..|..:.....|.||    .|...:.....||:   |.|||.    ...||              
Zfish   377 TKTKTRLSPYSQYPG----FPTDRSESDQDSPW---GGSPLT----DSASP-------------- 416

  Fly   516 QQQQQQQPVTEGYQYQQTSPARS------------PSGPTYTQLSAGNGNRQQAQPGAYQAGPPP 568
            |..:|.:.:.....|:|.|..||            .|...|:...:.:..|...:.|.|..|.| 
Zfish   417 QLLEQCEGIESSCVYRQFSDPRSLCYGLPLTEDHHTSNELYSHPHSESCERGCCKAGRYFLGTP- 480

  Fly   569 PPNAPGMWDWQQAGGHPHPPHPTAHPHH--------PH-------------------AHPGGPAG 606
               .||...|..| .....|.|.:.|.:        ||                   :.|.|  |
Zfish   481 ---QPGREAWWGA-ARSVLPLPKSSPENGDSFEGVMPHIASIHSLQVRGHWDEDSAVSSPDG--G 539

  Fly   607 AGQPQGQEF-SDMLQMLDHTPTTFEDL 632
            :....|..| :|..:.....|:..|.|
Zfish   540 SASDSGDRFRADQCRSSPQEPSKIETL 566

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
tgoNP_731308.1 bHLH-PAS_ARNT 9..74 CDD:381517 18/70 (26%)
PAS 90..154 CDD:214512 22/63 (35%)
PAS_11 282..384 CDD:464214 32/107 (30%)
sim1aNP_835740.2 bHLH-PAS_SIM1 1..71 CDD:381581 17/68 (25%)
PAS 88..>158 CDD:238075 23/88 (26%)
PAS_3 243..329 CDD:430001 25/86 (29%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 350..413 18/97 (19%)
SIM_C 358..647 CDD:461963 50/255 (20%)
Nuclear localization signal. /evidence=ECO:0000250 368..387 4/32 (13%)

Return to query results.
Submit another query.