DRSC/TRiP Functional Genomics Resources



Protein Alignment sxc and Gtf3c3

DIOPT Version: 10

Sequence 1: NP_523620.1  Gene: sxc / 35486  FlyBaseID: FBgn0261403  Length: 1059  Species: Drosophila melanogaster
Sequence 2: NP_001028366.1  Gene: Gtf3c3 / 98488  MGIID: 2138383  Length: 882  Species: Mus musculus


Alignment Length: 881
Identity: 165/881 (18%)
Similarity: 304/881 (34%)
Gaps: 260/881 (29%)
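
The three percentages above are consistent with dividing each count by the alignment length and truncating to a whole number. This is an inference from the figures shown, not a documented DIOPT rule. A minimal Python sketch under that assumption (the helper name `percent` is hypothetical):

```python
def percent(count: int, length: int) -> int:
    """Whole-number percentage, truncated (assumed, not a documented DIOPT rule)."""
    return count * 100 // length


alignment_length = 881
# Counts copied from the summary lines of this report.
stats = {"Identity": 165, "Similarity": 304, "Gaps": 260}

for name, count in stats.items():
    print(f"{name}: {count}/{alignment_length} ({percent(count, alignment_length)}%)")
```

Rounding to the nearest integer instead would give 19%, 35% and 30%, which does not match the report.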


- Green residues have known domain annotations, which are detailed below.


  Fly    11 QSQGQSHQLPSAAHI-LLDQNPNSTGSNLVVKQNDIQSLSSV--GLLELAHREYQAVDYESAEKH 72
            :.:.::.:.|:|..: :|:...|.....::.::.....|...  ||:..|:..:...::|.|...
Mouse   103 EEEEETAEQPTAGDVFVLEMVLNRETKKMMKEKRPRSKLPRALRGLMGEANIRFARGEHEEAILM 167

  Fly    73 CMQLWRQDSTNTGVLLLLSSIHFQCRRLDKSAQFSTLAIKQNPVLAEAYSNLGNVFKERGQLQEA 137
            ||::.||..........|:.|:.....::||.||..:|...||...|.:..|..:..|:..:::|
Mouse   168 CMEIIRQAPLAYEPFSTLAMIYEDQGDMEKSLQFELIAAHLNPSDTEEWVRLAEMSLEQDNIKQA 232

  Fly   138 LDNYRRAVRLKPDFI--------------------DGY---INLAA------------------- 160
            :..|.:|::.:|..:                    |||   :||.:                   
Mouse   233 IFCYTKALKYEPTNVRYLWERSSLYEQMGDHKMAMDGYRRILNLLSPSDGERFMQLARDMAKSYY 297

  Fly   161 --------------------ALVAARDMESAVQAYITALQYNPDLYCVRSDLGNLLKALGRLEEA 205
                                .||:..|:..|.:.||:..||:..|..: :|...::.....|||.
Mouse   298 EANDSASAINIIEEAFSKHQGLVSMEDVNIAAELYISNKQYDKALEVI-TDFSGIILEKETLEEG 361

  Fly   206 KACYLKAIETCPGFAVAWSNLGCV-FNAQGEIWLAIHHFEKAVTLDP-----------NFLDAYI 258
            .:...||.||     |..|....| .:...::.:.:.|......|:|           :..|.|:
Mouse   362 TSEENKAAET-----VTCSIPDSVPIDITVKLMVCLVHLNILEPLNPLLTTLVEQNPEDMGDLYL 421

  Fly   259 NLGNVLKEARIFDRAVAAYLRALNLSPNNAVVHGNLACVYYEQGLIDLAIDTYRRAIELQPNFPD 323
            ::.....:...::.|: ..|.||..|...     |||.|:...                      
Mouse   422 DVAEAFLDVGEYNSAL-PLLSALVCSERY-----NLAVVWLRH---------------------- 458

  Fly   324 AYCNLANALKEKGQVKEAEDCYNTALRLCSNHADSLNNLANIKREQGYIEEATRLYLKALEVF-- 386
                 |..||..|.::.|.:.|:..:.|...|.|:..:|:.::::.|..|:|    |:|||..  
Mouse   459 -----AECLKALGYMERAAESYSKVVDLAPLHLDARISLSILQQQLGRPEKA----LEALEPMYD 514

  Fly   387 PDFAAAHSNLASVLQQQGKLKEALMHYKEAIRIQPTFADAYSNMGNTLKELQDVSGALQCYTRAI 451
            ||..|..:|.|   ||:.||   |:|....:..|   ...|..:...|..|   :..|:......
Mouse   515 PDTLAQDANAA---QQELKL---LLHRSTLLFSQ---GKMYGYLDTLLTML---AMLLKVAMNRA 567

  Fly   452 QI----NPAFADAHSNLASIHKD----------SGNIPEAIQSYRTALKLKPDFPDAYCNLAHCL 502
            |:    :....:.|..|..:.:|          |....:||.:..|::..|.|:.:......:.|
Mouse   568 QVCLISSSKSGERHLYLIKVSRDKISDNNEQETSNYDAKAIFAVLTSVLPKEDWWNLLLKAIYTL 632

  Fly   503 ---------QIVCD-----WTDYDIRMKK-----------------------LVSIVTEQLEKNR 530
                     :::.|     ::.||.|.|:                       :..:|.|.:.|.:
Mouse   633 SDLARFQEAELLVDSSLEYYSFYDDRQKRKELEYFGLSAAILDKNFRKAYDYIRVMVMENVNKPQ 697

  Fly   531 LPSVHPHHSMLYPLTHDCRKAIAARHANLCLEKVHVLHKKPYN-----------FLKKLPTKGRL 584
            |.::....:|         .:...||...||   .::.|.|.|           |:     .|..
Mouse   698 LWNIFNQVTM---------HSQDVRHHRFCL---RLMLKNPDNHALCVLNGHNAFV-----SGSF 745

  Fly   585 R--IGYLSSDFGNHPTSHLMQSVPGL---HDRSKVEIFCYALSPDDGTTFRHKISRESENFVDLS 644
            :  :|.....|..:|:..|.....||   |..|:.    |.|.       ||.::.:..:|:: .
Mouse   746 KHALGQYVQAFRAYPSEPLYNLCIGLTFIHMASQK----YVLK-------RHALTVQGFSFLN-R 798

  Fly   645 QIPCNGKAADKIFN--DGIHIL----VNMNGYTKGARNEIFALRPAPIQVMWLGYPGTSGASFMD 703
            .:...|...:..:|  .|:|.|    :.::.|.|.     .||.|..::          |.....
Mouse   799 YLSIRGPCQESFYNLGRGLHQLGLTHLAIHYYQKA-----LALPPLVVE----------GIEVDQ 848

  Fly   704 YIITDSVTSPLELAYQYS------EKLSYMPHTYFI 733
            ..:...:...:.|.||.|      :||.|   ||.:
Mouse   849 LDLRRDIAYNMSLIYQSSGNTAMAQKLLY---TYCV 881

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence        Domain          Region      External ID  Identity
sxc     NP_523620.1     TPR repeat      53..78      CDD:276809   6/24 (25%)
                        TPR             80..318     CDD:440225   55/311 (18%)
                        TPR repeat      86..112     CDD:276809   7/25 (28%)
                        TPR repeat      118..146    CDD:276809   6/27 (22%)
                        TPR repeat      151..181    CDD:276809   11/91 (12%)
                        LapB            219..486    CDD:442196   58/294 (20%)
                        TPR repeat      220..248    CDD:276809   4/28 (14%)
                        TPR repeat      255..282    CDD:276809   5/26 (19%)
                        TPR repeat      287..317    CDD:276809   4/29 (14%)
                        TPR repeat      322..350    CDD:276809   6/27 (22%)
                        TPR repeat      355..385    CDD:276809   9/29 (31%)
                        TPR repeat      390..418    CDD:276809   9/27 (33%)
                        TPR repeat      423..453    CDD:276809   4/29 (14%)
                        TPR repeat      458..486    CDD:276809   7/37 (19%)
                        Glyco_transf_41 505..1036   CDD:404688   52/285 (18%)
Gtf3c3  NP_001028366.1  LapB            <142..347   CDD:442196   39/205 (19%)
                        TPR repeat      148..173    CDD:276809   6/24 (25%)
                        TPR repeat      179..207    CDD:276809   7/27 (26%)
                        TPR repeat      212..242    CDD:276809   6/29 (21%)
                        TPR repeat      247..274    CDD:276809   3/26 (12%)
                        TPR repeat      282..312    CDD:276809   0/29 (0%)
                        Spy             404..>510   CDD:443119   26/142 (18%)
                        TPR repeat      418..445    CDD:276809   6/27 (22%)
                        TPR repeat      450..481    CDD:276809   10/57 (18%)
                        TPR repeat      486..510    CDD:276809   7/27 (26%)
                        Spy             743..>882   CDD:443119   34/169 (20%)
                        TPR repeat      806..836    CDD:276809   6/34 (18%)
                        TPR repeat      841..876    CDD:276809   6/44 (14%)
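
The Region values use a `start..end` notation, and some carry a `<` or `>` marker (for example `<142..347` and `404..>510`). The sketch below parses that notation in Python. It assumes the markers flag a domain hit that is truncated at that end, which is the usual convention for conserved-domain hits but is not stated on this page. The names `Region` and `parse_region` are hypothetical.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    start: int
    end: int
    start_partial: bool  # True when the start carries a '<' marker
    end_partial: bool    # True when the end carries a '>' marker


# Matches strings such as "53..78", "<142..347" and "404..>510".
_REGION = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")


def parse_region(text: str) -> Region:
    """Parse a Region cell into integer coordinates and partial flags."""
    m = _REGION.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised region: {text!r}")
    lt, start, gt, end = m.groups()
    return Region(int(start), int(end), bool(lt), bool(gt))


for cell in ("53..78", "<142..347", "404..>510"):
    print(cell, parse_region(cell))
```

Keeping the partial flags separate from the coordinates lets the start and end be compared or subtracted as plain integers.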
