DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment trx and SDG16

DIOPT Version :10

Sequence 1:NP_476769.1 Gene:trx / 41737 FlyBaseID:FBgn0003862 Length:3726 Species:Drosophila melanogaster
Sequence 2:NP_194520.3 Gene:SDG16 / 828904 AraportID:AT4G27910 Length:1027 Species:Arabidopsis thaliana


Alignment Length:1149 Identity:230/1149 - (20%)
Similarity:389/1149 - (33%) Gaps:367/1149 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly  2814 KIPPVTQIKRTNAQAK---AAGISGVGKVPP---QPQVVNKVLP--------TSIVTQQSQ-VQV 2863
            :||.:.:.|..|...|   ...:.|.|...|   ..::...::|        .|..|:.:: |:|
plant    10 QIPSLERCKLGNESRKKKRKLNLGGGGYYYPLNLLGEIAAGIVPGNGRNGFSASWCTEVTKPVEV 74

  Fly  2864 KNSNLKQSQVKGKAASGTGTTCGAPPSIASKPLQKKTNMIRPIHKLEVKPKVMKPTPKVQNQNHS 2928
            :.|       ..|..|.:||...:||:..|:|...:|:..|    ::|.|         ...|.|
plant    75 EES-------LSKRRSDSGTVRDSPPAEVSRPPLVRTSRGR----IQVLP---------SRFNDS 119

  Fly  2929 LLQQQQQ--------QQPQLQQQIPAVVVNQVPKVT-ISQQRIPAQTQQQQLQQAQMIHIPQQQQ 2984
            :|...::        ::.:::.:...||..:|||.| :..:.:..:::...|.:.:..|    :|
plant   120 VLDNWRKDSKSDCDLEEEEIECRNEKVVSFRVPKATNLKSKELDRKSKYSALCKEERFH----EQ 180

  Fly  2985 PLQQQQVQVQPSMPIITLAEAPVVQSQFVMEPQALEQQELANRVQHFSTSSSSSSSNCSLPTNVV 3049
            ...:.:.:|...:|          ..:....|:.....:|        ..:.|..:....|..|:
plant   181 HNDEARARVDEKLP----------NKKGTFGPENFYSGDL--------VWAKSGRNEPFWPAIVI 227

  Fly  3050 NPMQQQAPSTTSSS-------------TTRPTNRVLPMQQRQEPAPLSNECPVVSSPTPPKPVEQ 3101
            :|| .|||.....|             :.....|.....:|....|..:..        .:..||
plant   228 DPM-TQAPELVLRSCIPDAACVVFFGHSGNENERDYAWVRRGMIFPFVDYV--------ARFQEQ 283

  Fly  3102 PIIHQMTSASVSKCYAQKSTLPSPVYEAELKVSSVLESIVPDVTM----------------DAIL 3150
            |        .:..|  :.......:.||.|......|.::.|:.:                :..:
plant   284 P--------ELQGC--KPGNFQMALEEAFLADQGFTEKLMHDIHLAAGNSTFDDSFYRWIQETAV 338

  Fly  3151 EEQPVTESIYTEGLYEKNS------------------------PGESKTEQLLLQQQQR------ 3185
            ..|.:..:...:||.:|:.                        ||    :|||.:...|      
plant   339 SNQELNNNAPRQGLLKKHRNPLACAGCETVISFEMAKKMKDLIPG----DQLLCKPCSRLTKSKH 399

  Fly  3186 -----EQLNQQLVNNGY----------------LLDKHTFQVEPMDTDVY----REE---DLEEE 3222
                 :::...|.|..:                :.|:|...:.  :||.|    |.:   ||.:.
plant   400 ICGICKKIRNHLDNKSWVRCDGCKVRIHAECDQISDRHLKDLR--ETDYYCPTCRAKFNFDLSDS 462

  Fly  3223 EDEDDDFSL-----------------------------------------KMATSACNDHEMSDS 3246
            |.::....:                                         |.|.|....|..|.|
plant   463 EKQNSKSKVAKGDGQMVLPDKVIVVCAGVEGVYFPRLHLVVCKCGSCGPKKKALSEWERHTGSKS 527

  Fly  3247 E--EPAVKDKISKIL--DNLTNDDCADSIATATTMEVDASAGYQQMVEDVLATTAAQSAPTEEFE 3307
            :  :.:||.|.||:.  |.:.|  .|:..|.||..:|......:|..:.:||..:....|.    
plant   528 KNWKTSVKVKSSKLALEDWMMN--LAELHANATAAKVPKRPSIKQRKQRLLAFLSETYEPV---- 586

  Fly  3308 GALETAAVEAAATYINEMADAHVLDLKQLQNGVELELRRRKEEQRTVSQEQEQSKAAIVPTAAAP 3372
            .|..|....|...::.:.....::...:.|..|..|....:..:...|.           ...|.
plant   587 NAKWTTERCAVCRWVEDWDYNKIIICNRCQIAVHQECYGARHVRDFTSW-----------VCKAC 640

  Fly  3373 EPPQPIQE----PKKMTGPHLLYEIQSEDGFTYKSSSITEIW----------EKVFEAVQVARRA 3423
            |.|...:|    |.|              |...|.:.:..:|          |..|.:.:....|
plant   641 ERPDIKRECCLCPVK--------------GGALKPTDVETLWVHVTCAWFQPEVCFASEEKMEPA 691

  Fly  3424 HGLTPLPEGPLADMGGIQMIGLKTNALKYLI--EQLPG-VEKCSKYTPKYH-------------- 3471
            .|:..:|               .||.:|..:  :|:.| ..:|.|.:..||              
plant   692 VGILSIP---------------STNFVKICVICKQIHGSCTQCCKCSTYYHAMCASRAGYRMELH 741

  Fly  3472 --KRNGN-----VSTAA--NGAHGGNL-------GGSSASAALSVSGGDSHGLLDYGSDQDEL-E 3519
              ::||.     ||..|  ...:..|:       |..||.:.:.........|:....:.||. .
plant   742 CLEKNGQQITKMVSYCAYHRAPNPDNVLIIQTPSGAFSAKSLVQNKKKGGSRLISLIREDDEAPA 806

  Fly  3520 ENAYDC-----ARCEPYSNRSEYDMFSWLASRHRKQPIQVFVQPSDNELVPRRGTGSNLPMAMKY 3579
            ||...|     |||..:..:        :.|:.|          .:.|.:|....|.....:...
plant   807 ENTITCDPFSAARCRVFKRK--------INSKKR----------IEEEAIPHHTRGPRHHASAAI 853

  Fly  3580 RT-----------------------LKETYKDYVGVFRSHIHGRGLYCTKDIEAGEMVIEYAGEL 3621
            :|                       |:.|..|.|...||.|||.||:..::|:.||||:||.||.
plant   854 QTLNTFRHVPEEPKSFSSFRERLHHLQRTEMDRVCFGRSGIHGWGLFARRNIQEGEMVLEYRGEQ 918

  Fly  3622 IRSTLTDKRERYYDSRGIGCYMFKIDDNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHK-HI 3685
            :|.::.|.||..|...|..||:|||.:.:|||||.:||.||.|||.|.||||::::.:...: .|
plant   919 VRGSIADLREARYRRVGKDCYLFKISEEVVVDATDKGNIARLINHSCTPNCYARIMSVGDEESRI 983

  Fly  3686 IIFALRRIVQGEELTYDYKF-PFEDE--KIPCSCGSKRCRKYLN 3726
            ::.|...:..||||||||.| |.|.|  |:||.|.:..|||::|
plant   984 VLIAKANVAVGEELTYDYLFDPDEAEELKVPCLCKAPNCRKFMN 1027

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
trxNP_476769.1 NR_DBD_like 762..>856 CDD:413390
PRK13914 <1006..>1224 CDD:237555
PHD1_KMT2A_like 1268..1344 CDD:276981
PHD 1346..1390 CDD:214584
PHD3_KMT2A_like 1423..1479 CDD:276983
ePHD_KMT2A_like 1737..1841 CDD:277134
FYRN 1890..1937 CDD:461787
FYRC 3388..3476 CDD:197781 18/116 (16%)
SET_KMT2A_2B 3575..3726 CDD:380947 70/177 (40%)
SDG16NP_194520.3 PWWP_AtATX3-like 206..310 CDD:438971 20/130 (15%)
PHD_SF 401..451 CDD:473978 7/51 (14%)
PHD_ATX3_4_5_like 594..640 CDD:276970 5/56 (9%)
ePHD_ATX3_4_5_like 649..760 CDD:277133 24/139 (17%)
SET_SETD1-like 872..1023 CDD:380916 66/150 (44%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.