DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Xpc and Xpc

DIOPT Version :10

Sequence 1:NP_725451.1 Gene:Xpc / 36697 FlyBaseID:FBgn0004698 Length:1294 Species:Drosophila melanogaster
Sequence 2:NP_033557.2 Gene:Xpc / 22591 MGIID:103557 Length:930 Species:Mus musculus


Alignment Length:1120 Identity:305/1120 - (27%)
Similarity:456/1120 - (40%) Gaps:295/1120 - (26%)


- Green bases have known domain annotations that are detailed below.


  Fly   189 SEDNNESSFEDKAGNAF-DFRGLLENANSLERTRDALSKRNVT-ATPPRSQAATMDVNALLALGE 251
            :|||..:..|:...:.| |.:......:|..:......||..: ...|.:.||...|....|..:
Mouse    18 TEDNKVARHEESVADDFEDEKQKPRRKSSFPKVSQGKRKRGCSDPGDPTNGAAKKKVAKATAKSK 82

  Fly   252 NQNYQSVEVEEREGNQRKKAGRGAPAAPPTLDEPSRLSKTKSTRIKRHTKTRPVSTVVANAGDTD 316
            |     ::|.:.|........|.:||              ...:.|:|    |.|.||....|.|
Mouse    83 N-----LKVLKEEALSDGDDFRDSPA--------------DCKKAKKH----PKSKVVDQGTDED 124

  Fly   317 DS--DFEE-------VADADLSSDQDDGETPNISGDLEIRVGLEGLRPTKEQKTQHELEMALKRR 372
            ||  |:||       |.|...:|.....:.|..:.::||....:.....:.:|.:.|.|..|:|.
Mouse   125 DSEDDWEEVEELTEPVLDMGENSATSPSDMPVKAVEIEIETPQQAKERERSEKIKMEFETYLRRM 189

  Fly   373 LNRDIKDRQILLHKVSLMCQIARSLKYNRLLSESDSLMQATLKLLPSRNAYPTERGTELKYLQSF 437
            :.|..|:.|..:|||.|:|.:|.....|.:..:.| |:...|.::|.|......:..:..||.:.
Mouse   190 MKRFNKEVQENMHKVHLLCLLASGFYRNSICRQPD-LLAIGLSIIPIRFTKVPLQDRDAYYLSNL 253

  Fly   438 VTWFKTSIKLLSPNLYSAQSPATKEAILEALLE-QVKRKEARCKQDMIFIFIALARGMGMHCRLI 501
            |.||      :.....:|...|:::..|:..|| ::....||..::::.||:.:.|.:.:..||:
Mouse   254 VKWF------IGTFTVNADLSASEQDDLQTTLERRIAIYSARDNEELVHIFLLILRALQLLTRLV 312

  Fly   502 VNLQPMPLRPAASDLIPIKLRPDDKNKSQTVESERESEDEKPKKDKKAGKPAEKESSKSTISKEA 566
            ::|||:||:.|.:                                  .|:.:.||:|.......:
Mouse   313 LSLQPIPLKSAVT----------------------------------KGRKSSKETSVEGPGGSS 343

  Fly   567 EKKNNAKKAEAKPLSKSTTKGSETTKSGTVPKVKKELSLSSKLVEKSKHQKAYTSSKSDTSFDEK 631
            |..:|:.::..||.:....|..||...|           ..|...:.|.......|:.    ..|
Mouse   344 ELSSNSPESHNKPTTSRRIKEEETLSEG-----------RGKATARGKRGTGTAGSRQ----RRK 393

  Fly   632 PSTSSSSKCLKEEYSELGLSKKLLKPTLSSKLVLKSKNQSSFSSNKSDTSFEENPSTSSSSKSLK 696
            ||.|...:                                                         
Mouse   394 PSCSEGEE--------------------------------------------------------- 401

  Fly   697 EETAKLSSSKLEDKKVASPAETKTKVQSSLLKRVTTQNISESGDSKKPKVAPVDTFSPVAGRTRR 761
                               ||.|.:.:....||.....:|...:|:.........|.|.:|    
Mouse   402 -------------------AEQKVQGRPHARKRRVAAKVSYKEESESDGAGSGSDFEPSSG---- 443

  Fly   762 ATVKPKTEEKPQVVGSPVIPKLMLSKVKQLNAKHSDTENASPANKHLQEQRNTRETRSRSKSPKV 826
                                          ..:||..|:..|..:          .:.|:.:|  
Mouse   444 ------------------------------EGQHSSDEDCEPGPR----------KQKRASAP-- 466

  Fly   827 LISPSFLKKKSDGADSTSDPQKHQMAPETKARISPNFLSEALPARQLRSRGQKASSLAIPQLDGG 891
                   ::...|:.|.|..|:       .::..|:...||..:.....||:|.||       |.
Mouse   467 -------QRTKAGSKSASKTQR-------GSQCEPSSFPEASSSSSGCKRGKKVSS-------GA 510

  Fly   892 DDVPLPKKRPKLEKLKNSQDSDEVFEPAKPVKKAPVLPKSVQNLRKDRRVMSTDDEGGSRLNRKT 956
            :::                                                         .:||.
Mouse   511 EEM---------------------------------------------------------ADRKP 518

  Fly   957 DASDMWVEVWSDVEEQWICIDLFKGKLHCVDTIRKNATPGLAYVFAFQDDQSLKDVTARYCASWS 1021
            ...|.|:||:.:.:.:|:|:|...|.:.......|.||..:.||.....|..::|||.||..:|.
Mouse   519 AGVDQWLEVYCEPQAKWVCVDCVHGVVGQPVACYKYATKPMTYVVGIDSDGWVRDVTQRYDPAWM 583

  Fly  1022 TTVRKARVEKAWLDETIAPYLGRRTKRDITEDDQLRRIHSDKPLPKSISEFKDHPLYVLERHLLK 1086
            |..||.||:..|..||:.||....|:|:..||.:.:..|.|:|||.|||.:|:||||.|:|||||
Mouse   584 TATRKCRVDAEWWAETLRPYRSLLTEREKKEDQEFQAKHLDQPLPTSISTYKNHPLYALKRHLLK 648

  Fly  1087 FQGLYPPDAPTLGFIRGEAVYSRDCVHLLHSREIWLKSARVVKLGEQPYKVVKA---RPKWDRLT 1148
            ||.:||..|..||:.||||||||||||.||||:.|||.||||:|||.|||:||.   |.:..||:
Mouse   649 FQAIYPETAAVLGYCRGEAVYSRDCVHTLHSRDTWLKQARVVRLGEVPYKMVKGFSNRARKARLS 713

  Fly  1149 RTVIKD-QPLEIFGYWQTQEYEPPTAENGIVPRNAYGNVELFKDCMLPKKTVHLRLPGLMRICKK 1212
            ...:.| ..|.::|:|||:||:||.|.:|.||||.:|||.||...|:|...|.:.||.|.|:.:|
Mouse   714 EPQLHDHNDLGLYGHWQTEEYQPPIAVDGKVPRNEFGNVYLFLPSMMPVGCVQMTLPNLNRVARK 778

  Fly  1213 LNIDCANAVVGFDFHQGACHPMYDGFIVCEEFREVVTAAWEEDQQVQVLKEQEKYETRVYGNWKK 1277
            |.|||..|:.|||||.|.|||:.||:|||||||:|:.||||.:|.:...||:||.|.|..||||.
Mouse   779 LGIDCVQAITGFDFHGGYCHPVTDGYIVCEEFRDVLLAAWENEQAIIEKKEKEKKEKRALGNWKL 843

  Fly  1278 LIKGLLIRERLKKKY 1292
            |::|||||||||.:|
Mouse   844 LVRGLLIRERLKLRY 858

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
XpcNP_725451.1 rad4 345..1294 CDD:273170 268/953 (28%)
PTZ00108 <696..924 CDD:240271 29/227 (13%)
XpcNP_033557.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..134 22/117 (19%)
rad4 143..859 CDD:273170 130/725 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 323..517 27/193 (14%)
Nuclear localization signal. /evidence=ECO:0000255 388..393 0/4 (0%)
Interaction with RAD23B. /evidence=ECO:0000250 489..727 29/237 (12%)
Minimal sensor domain involved in damage recognition. /evidence=ECO:0000250 600..759 19/158 (12%)
DNA-binding, preference for heteroduplex DNA. /evidence=ECO:0000250 600..734 19/133 (14%)
DNA-binding, preference for single stranded DNA, required for formation of stable nucleoprotein complex. /evidence=ECO:0000250 760..824 12/63 (19%)
Interaction with ERCC2 and GTF2H1. /evidence=ECO:0000250 809..930 53/120 (44%)
Interaction with CETN2. /evidence=ECO:0000250 840..859 10/18 (56%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 867..930 32/62 (52%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.