DRSC/TRiP Functional Genomics Resources




Protein Alignment htl and AT4G23050

DIOPT Version: 10

Sequence 1:NP_524394.2 Gene:htl / 42160 FlyBaseID:FBgn0010389 Length:729 Species:Drosophila melanogaster
Sequence 2:NP_849424.1 Gene:AT4G23050 / 828404 AraportID:AT4G23050 Length:736 Species:Arabidopsis thaliana


Alignment Length:670 Identity:155/670 - (23%)
Similarity:271/670 - (40%) Gaps:161/670 - (24%)
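The percentages in the summary follow directly from the counts over the alignment length. A minimal sketch of that arithmetic is shown here; the helper name `percent` is hypothetical, and rounding to the nearest whole number is an assumption (the page does not say how it rounds, although truncation gives the same values for these counts).

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage (nearest integer)."""
    return round(100 * count / total)


# Counts copied from the alignment summary above.
alignment_length = 670
summary = {"identity": 155, "similarity": 271, "gaps": 161}

percentages = {name: percent(n, alignment_length) for name, n in summary.items()}
print(percentages)
```

The same helper reproduces the per-domain figures in the Known Domains table, for example `percent(86, 300)` for the kinase catalytic domain.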


- Green bases have known domain annotations that are detailed below.


  Fly    97 VSSKPYWRNPKKMSFLQTRPSGSLLTLNCHALGNPEPNITWYRNGTVDWTRGYGSLKRNRWTL-- 159
            |:..|.:.|.:.:..: |..|.:.|....|.|.|.........|      |...:|::::|.|  
plant   180 VTKSPVYENGELVGVV-TVSSDATLFNRMHPLSNEHQQQARSNN------RHESNLRKHQWHLPR 237

  Fly   160 ------TMEDLVPGDCGNYTCKVCNSLGCIRHDTQVIVSDRVNHKPILMTGPLNLTLVVNST--- 215
                  :...:||    .|:..|.::|...:...|....|..|......:...|:.:|.::|   
plant   238 PQIAAASQVPVVP----QYSSAVASNLKASKLLPQRNGDDSFNGNHNSRSRDENVPVVASTTFEK 298

  Fly   216 -GSMHCKYLSDLTSKKAWIFVPCHGMTNCSNNRSIIAEDKDQLDFVNVRMEQEGWYTCVESNSLG 279
             ||:..|:|..|..|       ..|.....:|..|:....::             ..|....|..
plant   299 YGSLADKFLGKLQRK-------ITGSQGTEDNEPILRNGINK-------------SACGSGGSSK 343

  Fly   280 QSNS---TAY---------LRVVRSLHVLEAGVASGSLHS-TSFVYIFVFG----------GLIF 321
            .||:   ||:         ...||...|...| |.|.:|: ..|.||...|          ||  
plant   344 ASNAVTCTAFRDNGNGKPKRAEVRISDVYGNG-AEGLIHNGDRFQYIGNLGQSKPPRGLESGL-- 405

  Fly   322 IFMTTLFVFYAIRKMKHEKVLKQRIETVHQWTKKVIIFKPEGGGDSSGSMDTMIMPVVRIQKQRT 386
                       :..|:..|:.....|....|..::             |:|.  :|::.:     
plant   406 -----------VSGMRGTKMSDLNGEIEDAWNTRL-------------SVDP--LPILGV----- 439

  Fly   387 TVLQNGNEPAPFNEYEFPL--DSNWELPRSHLVLGATLGEGAFGRVVMAEVNNAIVAVKMVKEGH 449
               .:|.:.:|.|:....|  ||:.|:....|.||..:|.|:|..|.....|.:.||:|:..:| 
plant   440 ---NSGRQQSPVNQRNNRLVTDSSCEIRWEDLQLGEEVGRGSFAAVHRGVWNGSDVAIKVYFDG- 500

  Fly   450 TDDDIASLV---REMEVMKIIGRHINIINLLGCCSQNGPLYVIVEYAPHGNLKDFLYKNRPFGRD 511
             |.:..:|.   :|:.:||.: ||.|::..:|.........:|:||.|.|:|...|:        
plant   501 -DYNAMTLTECKKEINIMKKL-RHPNVLLFMGAVCTEEKSAIIMEYMPRGSLFKILH-------- 555

  Fly   512 QDRDSSQPPPSPPAHVITEKDLIKFAHQIARGMDYLASRR--CIHRDLAARNVLVSDDYVLKIAD 574
               :::||        :.:|..::.|..:||||:||..|.  .:||||.:.|:||..::.:|:.|
plant   556 ---NTNQP--------LDKKRRLRMALDVARGMNYLHRRNPPIVHRDLKSSNLLVDKNWNVKVGD 609

  Fly   575 FGLARDIQSTDYYRKNTNGRLPIKWMAPESLQEKFYDSKSDVWSYGILLWEIMTYGQQPYPTIMS 639
            |||::...:|  :....:|:...:|||||.|:.:..:.|.||:|:|::|||:||       |::.
plant   610 FGLSKWKNAT--FLSTKSGKGTPQWMAPEVLRSEPSNEKCDVFSFGVILWELMT-------TLVP 665

  Fly   640 AEELYTYLMSG------QRMEKPAKCSMNIYILMRQCWHFNADDRPPFTEIVEYMDKLLQTKEDY 698
            .:.|.:..:.|      :|::.|...:..|..:::.||..:...||.|.|::..|..|.:     
plant   666 WDRLNSIQVVGVVGFMDRRLDLPEGLNPRIASIIQDCWQTDPAKRPSFEELISQMMSLFR----- 725

  Fly   699 LDVDIANLDTPPSTSDEEED 718
                     .|.|.:.||:|
plant   726 ---------KPGSGAQEEDD 736
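Each alignment block above has three rows: the fly sequence, a markup row, and the plant sequence. The sketch below tallies one block, assuming the common pairwise-alignment convention in which `|` marks identical residues and `:` and `.` mark decreasing degrees of similarity; the page itself does not define the symbols, so that reading is an assumption. The function name `tally_block` is hypothetical, and the input is the final block (fly 699..718, plant 726..736) with the row labels and coordinates stripped.

```python
from collections import Counter


def tally_block(seq1: str, markup: str, seq2: str) -> dict:
    """Count columns, markup symbols, and gap columns in one alignment block."""
    if not (len(seq1) == len(markup) == len(seq2)):
        raise ValueError("all three rows must have the same length")
    symbols = Counter(markup)
    # A gap column is one where either sequence has '-'.
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {
        "columns": len(seq1),
        "identical": symbols["|"],  # assumed: '|' = identical residues
        "colon": symbols[":"],      # assumed: ':' = similar residues
        "dot": symbols["."],        # assumed: '.' = weakly similar residues
        "gaps": gaps,
    }


# Final block of the alignment, labels and coordinates removed.
fly = "LDVDIANLDTPPSTSDEEED"
markup = "         .|.|.:.||:|"
plant = "---------KPGSGAQEEDD"

block = tally_block(fly, markup, plant)
print(block)
```

Summing such tallies over every block would give whole-alignment counts to compare against the summary line. Which markup symbols the tool counts toward "Similarity" is not stated on the page, so the sketch reports `:` and `.` separately rather than combining them.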

Known Domains:


Indicated by green bases in alignment.

Gene       Sequence     Domain                             Region    External ID  Identity
htl        NP_524394.2  Ig                                 101..191  CDD:472250   19/97 (20%)
                        Ig strand B                        121..125  CDD:409353   1/3 (33%)
                        Ig strand C                        134..138  CDD:409353   0/3 (0%)
                        Ig strand E                        157..161  CDD:409353   2/11 (18%)
                        Ig strand F                        171..176  CDD:409353   1/4 (25%)
                        Ig strand G                        184..187  CDD:409353   0/2 (0%)
                        Ig                                 200..289  CDD:472250   18/104 (17%)
                        Ig strand B                        216..220  CDD:409353   2/3 (67%)
                        Ig strand C                        229..233  CDD:409353   1/3 (33%)
                        Ig strand E                        255..259  CDD:409353   0/3 (0%)
                        Ig strand F                        269..274  CDD:409353   1/4 (25%)
                        Ig strand G                        282..285  CDD:409353   1/5 (20%)
                        Protein Kinases, catalytic domain  404..692  CDD:473864   86/300 (29%)
AT4G23050  NP_849424.1  PAS                                92..>202  CDD:441804   5/22 (23%)
                        STKc_MAP3K-like                    474..720  CDD:270901   78/276 (28%)
