DRSC/TRiP Functional Genomics Resources



Protein Alignment Hel89B and CHR4

DIOPT Version: 10

Sequence 1: NP_732097.2  Gene: Hel89B / 41943  FlyBaseID: FBgn0022787  Length: 1923  Species: Drosophila melanogaster
Sequence 2: NP_199293.3  Gene: CHR4 / 834510  AraportID: AT5G44800  Length: 2223  Species: Arabidopsis thaliana


Alignment Length: 650
Identity: 191/650 (29%)
Similarity: 302/650 (46%)
Gaps: 125/650 (19%)
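
The percentages above follow from dividing each count by the alignment length. A minimal sketch of that arithmetic, assuming values are rounded to the nearest whole percent (the page does not state its rounding rule, and the helper name `percent` is hypothetical):

```python
def percent(count: int, length: int) -> int:
    """Return count/length as a whole-number percentage, rounded to nearest."""
    if length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100 * count / length)


# Counts copied from the summary lines of this alignment.
alignment_length = 650
stats = {"Identity": 191, "Similarity": 302, "Gaps": 125}

# Assumption: nearest-integer rounding, not truncation.
summary = {name: percent(count, alignment_length) for name, count in stats.items()}

for name, pct in summary.items():
    print(f"{name}: {stats[name]}/{alignment_length} ({pct}%)")
```

Under the same assumption, the per-domain figure 169/536 in the Known Domains table comes out as 32%, which matches the value shown there; truncation would give 31%.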


- Green bases have known domain annotations that are detailed below.


  Fly  1285 DESVRLLSTHCFANLVQLM-PLDGKTEQLKSDPLQARKTRDR-EFLDYLFNPKSIPNYKVPVPIS 1347
            :|.:...|:|    |:.|. ..:.||.:..|   :...||:| |.:.....|:            
plant   637 EEPILKHSSH----LIDLFHQYEQKTLERNS---KGNPTRERGEVVTLTEQPQ------------ 682

  Fly  1348 VELR-----CYQQAGINWL---WFLNKYNLHGILCDDMGLGKTLQTICILAGDHMHRQTANLANL 1404
             |||     .:|...:|||   |..:|   :.||.|:||||||:.....|:..:.....|.    
plant   683 -ELRGGALFAHQLEALNWLRRCWHKSK---NVILADEMGLGKTVSASAFLSSLYFEFGVAR---- 739

  Fly  1405 PSLVICPPTLTGHWVYEVEKFLDQGSVLRPLHYYGFPVGREKLR-------SDIGT-------KC 1455
            |.||:.|.:...:|:.|   |.....:|..:.|:|...||..:|       :..||       |.
plant   740 PCLVLVPLSTMPNWLSE---FSLWAPLLNVVEYHGSAKGRAIIRDYEWHAKNSTGTTKKPTSYKF 801

  Fly  1456 NLVVASYDTVRKDIDFFSGIHFNYCVLDEGHIIKNGKTKSSKAIKRLKANHRLILSGTPIQNNVL 1520
            |:::.:|:.|..|.....|:.:...|:||||.:||.::|....:......||::|:|||:|||:.
plant   802 NVLLTTYEMVLADSSHLRGVPWEVLVVDEGHRLKNSESKLFSLLNTFSFQHRVLLTGTPLQNNIG 866

  Fly  1521 ELWSLFDFLMPGFLGTEKQFVQRFSRPILSSRDAKSSAKEQEAGVLAMEALHRQVLPFLLRRVKE 1585
            |:::|.:||.|....:...|.:||       .|..|:.|        :|.|.:.|.|.:|||:|:
plant   867 EMYNLLNFLQPSSFPSLSSFEERF-------HDLTSAEK--------VEELKKLVAPHMLRRLKK 916

  Fly  1586 DVLKDLPPKITQDLLCELSPLQLRLYEDFSNKHLKDCLDKLGDSSSASMVTENLS---AKTHIFQ 1647
            |.::::|||..:.:..||:.:|...|.....|:.:              :..|:.   |:..:..
plant   917 DAMQNIPPKTERMVPVELTSIQAEYYRAMLTKNYQ--------------ILRNIGKGVAQQSMLN 967

  Fly  1648 ALRYLQNVCNHPKLVLRQSEELTKVTSQLALSNSSLD-----DIEHSAKLPALKQLLLDCGIGVQ 1707
            .:..|:.|||||.|:.....|           :.||:     .|:.||||..|..:|       :
plant   968 IVMQLRKVCNHPYLIPGTEPE-----------SGSLEFLHDMRIKASAKLTLLHSML-------K 1014

  Fly  1708 TESVSQHRALIFCQLKAMLDIVEQDLLRRHLPSVTYLRLDGSVPASQRQDIVNNFNSDPSIDVLL 1772
            ......||.|||.|:..:|||:| |.|.......|:.|:||||..:.||..:..||.|.:..|.|
plant  1015 VLHKEGHRVLIFSQMTKLLDILE-DYLNIEFGPKTFERVDGSVAVADRQAAIARFNQDKNRFVFL 1078

  Fly  1773 LTTMVGGLGLNLTGADTVIFVEHDWNPMKDLQAMDRAHRIGQKKVVNVYRLITRNSLEEKIMGLQ 1837
            |:|...|||:||..|||||..:.|:||..|:|||:|||||||.|.:.||||:.|.|:||:|:.|.
plant  1079 LSTRACGLGINLATADTVIIYDSDFNPHADIQAMNRAHRIGQSKRLLVYRLVVRASVEERILQLA 1143

  Fly  1838 KFKILTANTVVS-----AENASLQTMGTSQIFDLFNGGKDKGAESGSSAVQGTASGGMSMNTIIE 1897
            |.|::.....|:     .|...:...||.::|:          :|.....:.||....:::.|::
plant  1144 KKKLMLDQLFVNKSGSQKEFEDILRWGTEELFN----------DSAGENKKDTAESNGNLDVIMD 1198

Known Domains:


Indicated by green bases in alignment.

Gene    Sequence     Domain       Region       External ID  Identity
Hel89B  NP_732097.2  HEAT repeat  5..30        CDD:293787
                     HEAT repeat  42..70       CDD:293787
                     HEAT repeat  395..420     CDD:293787
                     HEAT repeat  433..465     CDD:293787
                     HEAT repeat  475..504     CDD:293787
                     HEAT repeat  516..540     CDD:293787
                     HEAT repeat  560..589     CDD:293787
                     DUF3535      654..1138    CDD:463447
                     HepA         <1342..1849  CDD:440319   169/536 (32%)
CHR4    NP_199293.3  PHD2_CHD_II  78..119      CDD:277007
                     CD1_tandem   507..585     CDD:349307
                     CD2_tandem   600..653     CDD:349306   5/19 (26%)
                     PLN03142     675..>1197   CDD:215601   178/602 (30%)
                     DUF1087      <1306..1341  CDD:461922
                     SANT_TRF     1705..1749   CDD:212558

Blue background indicates that the domain is not in the aligned region.
