DRSC/TRiP Functional Genomics Resources


Protein Alignment tos and GEN1

DIOPT Version: 10

Sequence 1: NP_477145.1  Gene: tos / 35119  FlyBaseID: FBgn0015553  Length: 732  Species: Drosophila melanogaster
Sequence 2: NP_872431.5  Gene: GEN1 / 348654  HGNCID: 26881  Length: 908  Species: Homo sapiens


Alignment Length: 910
Identity: 167/910 (18%)
Similarity: 296/910 (32%)
Gaps: 269/910 (29%)
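
The percentages in the summary follow directly from the counts. Below is a minimal Python sketch, not part of DIOPT, that recomputes them and cross-checks the counts against the last coordinates shown in the alignment (Fly 701, Human 850). The variable names are illustrative. The observation that the summary percentages look truncated rather than rounded is inferred from these numbers, not from documented behaviour.

```python
from math import floor

# Counts copied from the alignment summary.
ALIGNMENT_LENGTH = 910
COUNTS = {"identity": 167, "similarity": 296, "gaps": 269}

def percent(count, total):
    """Return the exact percentage as a float."""
    return 100.0 * count / total

for name, count in COUNTS.items():
    exact = percent(count, ALIGNMENT_LENGTH)
    print(f"{name}: {count}/{ALIGNMENT_LENGTH} = {exact:.2f}% "
          f"(truncated: {floor(exact)}%)")

# Consistency check: in a pairwise alignment every column is either a
# residue pair (two residues) or a gap column (one residue).
aligned_pairs = ALIGNMENT_LENGTH - COUNTS["gaps"]
residues_covered = 2 * aligned_pairs + COUNTS["gaps"]

# Last coordinates reported in the alignment (Fly 701, Human 850).
FLY_LAST, HUMAN_LAST = 701, 850
print(residues_covered == FLY_LAST + HUMAN_LAST)
```

Truncating the exact values gives 18%, 32% and 29%, matching the summary. The residue check also balances: 2 × (910 − 269) + 269 = 1551 = 701 + 850.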


- Green residues have known domain annotations, which are detailed under Known Domains below.


  Fly     1 MGITGLIPFVGKASSQLHLKDIRGSTVAVDTYCWLHKGVFGCAEKLAR---GEDTDVYIQYCLKY 62
            ||:..|...:......:.|:::.|.|:|||...|:      |..:..:   |.....:::.....
Human     1 MGVNDLWQILEPVKQHIPLRNLGGKTIAVDLSLWV------CEAQTVKKMMGSVMKPHLRNLFFR 59

  Fly    63 VNMLLSYDIKPILVFDGQHLPAKALTEKRRRDSRKQSKERAAELLRLGRIEEARSHMRRCVDVTH 127
            ::.|...|:|.:.|.:|:....||....:|..||..|..::...      :..|||.:       
Human    60 ISYLTQMDVKLVFVMEGEPPKLKADVISKRNQSRYGSSGKSWSQ------KTGRSHFK------- 111

  Fly   128 DMALRLIRECRSRNVDCIVAPY-----EADAQMAWLNRADVAQYIITEDSDLTLFGAKNIIFKLD 187
                .::||| ...::|:..|:     ||:|..|:||........:|.|.|..|:||:.:.....
Human   112 ----SVLREC-LHMLECLGIPWVQAAGEAEAMCAYLNAGGHVDGCLTNDGDTFLYGAQTVYRNFT 171

  Fly   188 LNGSGLLVEAEKLHLAMGCTEEKYHFDK--FRRMCILSGCDYL-DSLPGIGLAKACKF--ILKTE 247
            :|.....|:.    ..|...:.|...|:  ...:.||.||||| ..:||:|..:|.|.  |||.:
Human   172 MNTKDPHVDC----YTMSSIKSKLGLDRDALVGLAILLGCDYLPKGVPGVGKEQALKLIQILKGQ 232

  Fly   248 QEDMRI------ALKKIPSYLNMRNLEVDDDYIENFMKAEATF-RHMFIYNPLERRMQRLCALED 305
            ....|.      :....|..|..:.|            |..:. .|.......||...|||..:.
Human   233 SLLQRFNRWNETSCNSSPQLLVTKKL------------AHCSVCSHPGSPKDHERNGCRLCKSDK 285

  Fly   306 Y--ETDERYCSNAGTLLEDSEQALHLALGNLN---------PF---------------------- 337
            |  ..|..||........:.::.|.....|:.         ||                      
Human   286 YCEPHDYEYCCPCEWHRTEHDRQLSEVENNIKKKACCCEGFPFHEVIQEFLLNKDKLVKVIRYQR 350

  Fly   338 -SMKRLDSWTPEK-AWPT----PKNVKRSKHKSIWQTNFQSENTHTPKKENPCALFFKKV----- 391
             .:.....:|.|| .||.    .|.:....|..:.:....|.|::   :..|..:...::     
Human   351 PDLLLFQRFTLEKMEWPNHYACEKLLVLLTHYDMIERKLGSRNSN---QLQPIRIVKTRIRNGVH 412

  Fly   392 --------------------DFVGKTLNEE------------IEANQRLEQAKQTEAELFNMYSF 424
                                :|...|:.||            :...|:||...:.:..:      
Human   413 CFEIEWEKPEHYAMEDKQHGEFALLTIEEESLFEAAYPEIVAVYQKQKLEIKGKKQKRI------ 471

  Fly   425 KAKRRRSPSREDSVDQERTPPPSPVHKSRHNPFAKERTGEEANQRSPVVCENASLLRLLSPKKAS 489
            |.|....|..::.:..:......|..:..|...:|..:|...:...|....:|||..||.||...
Human   472 KPKENNLPEPDEVMSFQSHMTLKPTCEIFHKQNSKLNSGISPDPTLPQESISASLNSLLLPKNTP 536

  Fly   490 PLDGE---------AGVKKVDSLKRSIFAKE---------------------------------- 511
            .|:.:         ..::::.::.:|:.::.                                  
Human   537 CLNAQEQFMSSLRPLAIQQIKAVSKSLISESSQPNTSSHNISVIADLHLSTIDWEGTSFSNSPAI 601

  Fly   512 ---------QVQIRSRFFATQD------EQTRLQRE----HLRDTENDDMDEQKLSSH--SGHKK 555
                     :.::.|...|..|      ||...:.|    :::...::|.|......|  ||...
Human   602 QRNTFSHDLKSEVESELSAIPDGFENIPEQLSCESERYTANIKKVLDEDSDGISPEEHLLSGITD 666

  Fly   556 LRLVCKDIPGKNPIRQRCSSQISDGETDTDTTASSLLESQDK------------------GVPSP 602
            |.|  :|:|.|..|..:.|....:.:.|.:....|:|..::.                  |:|..
Human   667 LCL--QDLPLKERIFTKLSYPQDNLQPDVNLKTLSILSVKESCIANSGSDCTSHLSKDLPGIPLQ 729

  Fly   603 LES------------QED--LNNSQPQIPTEGNTNSTTIRIKSLDLLLENSPEPTQESDRNNNDA 653
            .||            |||  :|.|.|.  :..||...|..::..:..|::|.:...::.|.    
Human   730 NESRDSKILKGDQLLQEDYKVNTSVPY--SVSNTVVKTCNVRPPNTALDHSRKVDMQTTRK---- 788

  Fly   654 IILLSDDSC----SSDQRASST-------------SSSSQQRQNFLPTSKRRVGLSKPSTAKKGT 701
             ||:....|    |||::::..             ||......:|..:...:  ||.|....|.|
Human   789 -ILMKKSVCLDRHSSDEQSAPVFGKAKYTTQRMKHSSQKHNSSHFKESGHNK--LSSPKIHIKET 850

  Fly   702  701
Human   851  850

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence     Domain                                                           Region    External ID  Identity
tos   NP_477145.1  PIN_EXO1                                                         1..202    CDD:350207   45/208 (22%)
                   H3TH_EXO1                                                        214..291  CDD:188628   22/88 (25%)
GEN1  NP_872431.5  PIN_GEN1                                                         2..206    CDD:350217   49/231 (21%)
                   XPG-N domain. /evidence=ECO:0000269|PubMed:26682650              2..96                  21/99 (21%)
                   XPG-I domain. /evidence=ECO:0000269|PubMed:26682650              122..208               22/89 (25%)
                   H3TH_GEN1                                                        195..338  CDD:188625   34/154 (22%)
                   5'-3' exonuclease domain. /evidence=ECO:0000269|PubMed:26682650  208..384               37/187 (20%)
                   Chromodomain. /evidence=ECO:0000269|PubMed:26682650              390..464               10/76 (13%)
                   Chromo_2                                                         398..458  CDD:465841   5/59 (8%)
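
The Region and Identity columns can be read programmatically. The sketch below is an illustration, not DIOPT code. It hard-codes the rows from the table, assumes Region is an inclusive start..end residue range, and takes the identity denominator as given without interpreting it. The function names are hypothetical.

```python
# Rows copied from the Known Domains table:
# (gene, domain, region, identity, percentage shown in the table).
ROWS = [
    ("tos", "PIN_EXO1", "1..202", "45/208", 22),
    ("tos", "H3TH_EXO1", "214..291", "22/88", 25),
    ("GEN1", "PIN_GEN1", "2..206", "49/231", 21),
    ("GEN1", "XPG-N domain", "2..96", "21/99", 21),
    ("GEN1", "XPG-I domain", "122..208", "22/89", 25),
    ("GEN1", "H3TH_GEN1", "195..338", "34/154", 22),
    ("GEN1", "5'-3' exonuclease domain", "208..384", "37/187", 20),
    ("GEN1", "Chromodomain", "390..464", "10/76", 13),
    ("GEN1", "Chromo_2", "398..458", "5/59", 8),
]

def region_length(region):
    """Length of an inclusive 'start..end' range (assumed convention)."""
    start, end = (int(part) for part in region.split(".."))
    return end - start + 1

def identity_percent(identity):
    """Percentage for an 'identical/total' string, rounded to nearest integer."""
    identical, total = (int(part) for part in identity.split("/"))
    return round(100 * identical / total)

for gene, domain, region, identity, shown in ROWS:
    computed = identity_percent(identity)
    print(f"{gene:5} {domain:25} {region_length(region):4d} residues  "
          f"{identity:>7} = {computed}% (table shows {shown}%)")
```

With the values in this table, rounding to the nearest integer reproduces every displayed percentage.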
