DRSC/TRiP Functional Genomics Resources


Protein Alignment CG10445 and CHR4

DIOPT Version: 10

Sequence 1: NP_649751.2  Gene: CG10445 / 40938  FlyBaseID: FBgn0037531  Length: 945  Species: Drosophila melanogaster
Sequence 2: NP_199293.3  Gene: CHR4 / 834510  AraportID: AT5G44800  Length: 2223  Species: Arabidopsis thaliana


Alignment Length: 952  Identity: 183/952 (19%)
Similarity: 326/952 (34%)  Gaps: 345/952 (36%)
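
The percentages above can be recomputed from the raw counts. The following is a minimal sketch, not the DIOPT implementation: the names `ALIGNMENT_LENGTH`, `COUNTS`, and `percent` are hypothetical, and rounding to the nearest integer is an assumption (the page does not state its rounding rule, and truncation would give the same values here).

```python
# Raw counts copied from the alignment summary on this page.
ALIGNMENT_LENGTH = 952
COUNTS = {"identity": 183, "similarity": 326, "gaps": 345}


def percent(count: int, total: int) -> int:
    """Return count/total as an integer percentage (assumed rounding rule)."""
    return round(100 * count / total)


# Percentage of alignment columns in each category.
percents = {name: percent(n, ALIGNMENT_LENGTH) for name, n in COUNTS.items()}

for name, value in percents.items():
    print(f"{name}: {COUNTS[name]}/{ALIGNMENT_LENGTH} ({value}%)")
```

All three figures are expressed over the full alignment length (952 columns, gaps included), not over the length of either input sequence.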


- Green residues have known domain annotations, which are detailed below.


  Fly    11 EIHKQSCSIQDQSSVPVPDSSGEINNWMEPEPVDSLELSGEYADNESAI-FVSASEYNEKKAQID 74
            |.|:::   .::|:|    :..||.   ||....:.:|.||....|..: :|..|  |.....|.
plant   517 EAHQET---GEKSTV----ADEEIE---EPVAAKTSDLIGETVSYEFLVKWVDKS--NIHNTWIS 569

  Fly    75 QLVQKILTMEQQQRQLEENASETKELCDLKLHLEKLNARLEALGEFLDTLRL-REDQQEIKIEDN 138
            :...|.|...:.:....:..:....:|:.|.   |...|:.|       ||: :|..||..::..
plant   570 EAELKGLAKRKLENYKAKYGTAVINICEDKW---KQPQRIVA-------LRVSKEGNQEAYVKWT 624

  Fly   139 SELVDHTEQPKWSNLYSGLNRYRQKANHTAAQFYQHKSNIIDSLKTLYEPIQPRPAIDDLEKQPA 203
            ....|   :..|.:|...:.::   ::|....|:|::...::. .:...|.:.|..:..|.:||.
plant   625 GLAYD---ECTWESLEEPILKH---SSHLIDLFHQYEQKTLER-NSKGNPTRERGEVVTLTEQPQ 682

  Fly   204 LLL-VRLLKHQQSCLKWMQFRERQKISGGILADDMGLGKTLSMIALILASEETKNRKREEKKKAL 267
            .|. ..|..||...|.|:: |...|....||||:||||||:|..|.:                  
plant   683 ELRGGALFAHQLEALNWLR-RCWHKSKNVILADEMGLGKTVSASAFL------------------ 728

  Fly   268 TLKWTQEFNRVYCKEIRKISMFDDEEESGKEEEQYEPPEKRTCHVKTKKINQFRILDDDDNDAGD 332
                    :.:|                      :|....|.|                      
plant   729 --------SSLY----------------------FEFGVARPC---------------------- 741

  Fly   333 KAVVEDEQKDLLAKTPEPEVFSSDEEEEHLSNGRYPSANTLVVCPMSVMCQWAHEVASKVAQNAI 397
                                                    ||:.|:|.|..|..|.:  :....:
plant   742 ----------------------------------------LVLVPLSTMPNWLSEFS--LWAPLL 764

  Fly   398 RVLTFHGPNRHEIGIEAFRSYD-------------------LVITSYNLVVNELKRYGNTSPLFA 443
            .|:.:||..:   |....|.|:                   :::|:|.:|:      .::|.|..
plant   765 NVVEYHGSAK---GRAIIRDYEWHAKNSTGTTKKPTSYKFNVLLTTYEMVL------ADSSHLRG 820

  Fly   444 VYWNRVILDEAHIIRNSKTNCCNSVCQLRAHCHWALTGTPVQNRGVDVFALLRFVNVPNFQDLQQ 508
            |.|..:::||.|.::||::...:.:..........|||||:||...:::.||.|:...:|..|..
plant   821 VPWEVLVVDEGHRLKNSESKLFSLLNTFSFQHRVLLTGTPLQNNIGEMYNLLNFLQPSSFPSLSS 885

  Fly   509 WKKNLNESMLGHR--RLNFIIKPLMLRRTKQKLQASGDMP---------ALPSLKIELICVQLSK 562
            :::..::.....:  .|..::.|.||||.|:  .|..::|         .|.|::.|.....|:|
plant   886 FEERFHDLTSAEKVEELKKLVAPHMLRRLKK--DAMQNIPPKTERMVPVELTSIQAEYYRAMLTK 948

  Fly   563 TEMAVYQILSAISKKIFTQFLLQREKGNSDLNYYSLERTPQFIAGHMSDERYNEIYERFLKSLGY 627
            .    ||||..|.|.:..|.:|.                                          
plant   949 N----YQILRNIGKGVAQQSMLN------------------------------------------ 967

  Fly   628 NPGEKILGIYILVLLLRLRQFCCHPGLMIGMLRGALTAEDVQNVKVDASDVEGQLKMDVLAELDK 692
                         ::::||:.|.||.|:.|                                   
plant   968 -------------IVMQLRKVCNHPYLIPG----------------------------------- 984

  Fly   693 FDETDSEDDCCDEEDSTRRDGNFKLEVIKDEIKEENVPWDSGDDLPTASSFEDQLDSARALKLLN 757
               |:.|..              .||.:.|                                   
plant   985 ---TEPESG--------------SLEFLHD----------------------------------- 997

  Fly   758 PQNPIFQFIRPSAKLKMVIDKLEELLTGTNDKIIVTSQWVSYLAIVRKRLQ----DLSWETLDFN 818
                  ..|:.||||.::...| ::|.....::::.||....|.|:...|.    ..::|.:|  
plant   998 ------MRIKASAKLTLLHSML-KVLHKEGHRVLIFSQMTKLLDILEDYLNIEFGPKTFERVD-- 1053

  Fly   819 GQLTAKEREIVLRDFNANNEKRVLLLSLTAGGVGLNLNVANHMLIVDLHWNPQLERQAQDRIYRY 883
            |.:...:|:..:..||.:..:.|.|||..|.|:|:||..|:.::|.|..:||..:.||.:|.:|.
plant  1054 GSVAVADRQAAIARFNQDKNRFVFLLSTRACGLGINLATADTVIIYDSDFNPHADIQAMNRAHRI 1118

  Fly   884 GQTKPTFIYRYMCQDTVEQRIKSLQDCKLEIAKVVLPEEGGE 925
            ||:|...:||.:.:.:||:||..|...||.:.::.:.:.|.:
plant  1119 GQSKRLLVYRLVVRASVEERILQLAKKKLMLDQLFVNKSGSQ 1160

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence     Domain                Region      External ID  Identity
CG10445  NP_649751.2  HepA                  <113..919   CDD:440319   161/841 (19%)
                      DEAD-like_helicase_N  209..534    CDD:475120   64/345 (19%)
CHR4     NP_199293.3  PHD2_CHD_II           78..119     CDD:277007
                      CD1_tandem            507..585    CDD:349307   18/79 (23%)
                      CD2_tandem            600..653    CDD:349306   13/68 (19%)
                      PLN03142              675..>1197  CDD:215601   147/765 (19%)
                      DUF1087               <1306..1341 CDD:461922
                      SANT_TRF              1705..1749  CDD:212558
Blue background indicates that the domain is not in the aligned region.
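
Whether a domain falls in the aligned region can be checked from the Region column and the residue numbers at the start and end of the alignment (Fly 11 to 925, plant 517 to 1160). The sketch below is illustrative only: `ALIGNED`, `parse_region`, and `overlaps_alignment` are hypothetical names, and reading `<` and `>` as partial-boundary markers is an assumption based on the GenBank feature-location convention, not something this page states.

```python
import re

# Aligned residue spans, read off the first and last alignment blocks.
ALIGNED = {"CG10445": (11, 925), "CHR4": (517, 1160)}

# Region strings look like "209..534", "<113..919", or "675..>1197".
REGION = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")


def parse_region(text: str) -> tuple[int, int, bool, bool]:
    """Return (start, end, start_is_partial, end_is_partial)."""
    match = REGION.match(text)
    if match is None:
        raise ValueError(f"unrecognized region: {text!r}")
    lt, start, gt, end = match.groups()
    return int(start), int(end), bool(lt), bool(gt)


def overlaps_alignment(gene: str, region: str) -> bool:
    """True if the domain shares at least one residue with the aligned span."""
    start, end, _, _ = parse_region(region)
    aligned_start, aligned_end = ALIGNED[gene]
    return start <= aligned_end and end >= aligned_start


# (gene, domain, region) rows copied from the Known Domains table.
DOMAINS = [
    ("CG10445", "HepA", "<113..919"),
    ("CG10445", "DEAD-like_helicase_N", "209..534"),
    ("CHR4", "PHD2_CHD_II", "78..119"),
    ("CHR4", "CD1_tandem", "507..585"),
    ("CHR4", "CD2_tandem", "600..653"),
    ("CHR4", "PLN03142", "675..>1197"),
    ("CHR4", "DUF1087", "<1306..1341"),
    ("CHR4", "SANT_TRF", "1705..1749"),
]

for gene, name, region in DOMAINS:
    status = "aligned" if overlaps_alignment(gene, region) else "outside"
    print(f"{gene:8} {name:22} {region:12} {status}")
```

Under these assumptions, the three CHR4 domains that list no identity value (PHD2_CHD_II, DUF1087, SANT_TRF) are the ones that fall outside the aligned span.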
