DRSC/TRiP Functional Genomics Resources


Protein Alignment trx and Tcf20

DIOPT Version: 10

Sequence 1: NP_476769.1  Gene: trx / 41737  FlyBaseID: FBgn0003862  Length: 3726  Species: Drosophila melanogaster
Sequence 2: NP_001426756.1  Gene: Tcf20 / 366964  RGDID: 1594486  Length: 1992  Species: Rattus norvegicus


Alignment Length: 2025  Identity: 371/2025 (18%)
Similarity: 615/2025 (30%)  Gaps: 733/2025 (36%)
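
The percentages in the summary follow from the raw counts divided by the alignment length. A minimal Python sketch of that arithmetic is shown here; rounding to the nearest whole number is an assumption about how the page derives its figures, and the `percent` helper is a hypothetical name, not part of DIOPT.

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage (assumed rounding rule)."""
    return round(100 * count / total)

# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 2025
summary = {"identity": 371, "similarity": 615, "gaps": 733}

percentages = {name: percent(n, ALIGNMENT_LENGTH) for name, n in summary.items()}
print(percentages)  # {'identity': 18, 'similarity': 30, 'gaps': 36}
```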


- Green bases have known domain annotations that are detailed below.


  Fly     7 PGKPSKSINRKRISVL------QLEDDAANPAEPQQPAPESQQPSGSGSGSSAAREKGNNCDNDE 65
            ||..|.|....:::.|      .|..||..|.:     ..|::||.|.       :|.::|.|.|
  Rat   483 PGLSSLSALSTQVANLPNTVQHMLLSDALTPQK-----KTSKRPSSSS-------KKADSCTNSE 535

  Fly    66 DDN-------APGGASISGNTASSSAGSGN-----SGNGSSSGSSTGSGSS---GSGSTNGG--- 112
            ..:       :|...|:.|..:|||...|.     ||..:||.::...|:|   ||..|.|.   
  Rat   536 GSSQPEEQLKSPMAESLDGGCSSSSEDQGERVRQLSGQSTSSDTTYKCGASEKAGSSPTQGAQNE 600

  Fly   113 ----SVNGGTHHKSAA------NLDKEAVTK-------------------DQNGDGDKTRGNVSS 148
                |.:..|..::|:      :|..|..||                   :::|..||:......
  Rat   601 APRLSTSPATREEAASPGAKDTSLSSEGNTKVNEKTVGVIVSREAMTGRVEKSGGQDKSSQEDDP 665

  Fly   149 APSGKLSAAASGKALSKSSRTFSASTSVTSSGRSSGSSPDGNSGASSDGASSGISCGKSTAKSTE 213
            |.|.:..:.:..|.:|.:|...|......|.|..:|.:...|.....:|.||..:.|.|....||
  Rat   666 AASQRPPSNSGVKEMSHTSLPQSDPLGGGSKGNKNGDNNSSNHNGEGNGQSSHSAVGPSFTGRTE 730

  Fly   214 ASSGKLAKTTGAGTCSSAKSSKASSGTTSEATTSGLSGACLKALFVATPATSTGLACALVSPGGS 278
            .|     |:.|    |...|.|.|.|:......||..                      ..|.|.
  Rat   731 PS-----KSPG----SLRYSYKESFGSAVPRNVSGFP----------------------QYPSGQ 764

  Fly   279 SQG--GTFPISAALLRARKNSNKKFKNLNLARGEVMLPSTSKLKQLNSPVVDNPSPSPPIASGST 341
            .:|  |:.       ..||..|:||.:|              |:::......:|....|.::...
  Rat   765 EKGDFGSH-------GERKGRNEKFPSL--------------LQEVLQGYHHHPDRRYPRSAQEH 808

  Fly   342 PSVEGGIGVGGVVSPGEDAALKRVLTEMPNEVARDPSPSSCTAAANGAASGK-GSASNGPPAMAS 405
            ..:..|:         |..|...:|....||:           .:.|..:.. ||....|     
  Rat   809 QGMASGL---------EGTARPNILVSQTNEL-----------TSRGLLNKSIGSLLENP----- 848

  Fly   406 SGDGSSPKSGADTGP--STSSTTAKQKKTVTFRNVLETSDDKSVVKRFYNPDIRIPIVSIMKKDS 468
                       ..||  ..||:||.:.|.:..       .|..:.::|          .|....|
  Rat   849 -----------HWGPWERKSSSTAPEMKQINL-------SDYPIPRKF----------EIEPSSS 885

  Fly   469 LNRP---LNYSRGGECIVRPSILSKILNKNSNIDKLNSLKFRSAGASSSSSNQESGSSSNVFGLS 530
            .:.|   |:..|...|.:.|  |.:|: ::.....|..:     ||.:.....|..:.|    ||
  Rat   886 SHEPGASLSERRSVICDISP--LRQIV-RDPGAHSLGHM-----GADARIGRNERLNPS----LS 938

  Fly   531 RAFGAPMDEDDEGGVTFRRNDSPEDQNNAEDDEMDDDDDDEEAEEDDENEDDNDE----AVSEKS 591
            ::...|      ||:.     |.|.:..::..::.::|.::...:...|:...|.    ::..::
  Rat   939 QSVILP------GGLV-----SMETKLKSQSGQIKEEDFEQSKSQASFNKKSGDHCHPTSIKHET 992

  Fly   592 AETEKSAGADERDPDEKQLVMDSHFVLPKRSTRSSRIIKPNKRLLEEGAISTKKPLSLGDSKGKN 656
            .....|.||...|........||      |||...|:         .|.:.:::.:.        
  Rat   993 YRGNASPGAAAHDSISDYGPQDS------RSTPMRRV---------PGRVGSRETMR-------- 1034

  Fly   657 VFGTSSSSAGSTASTFSASTNLKLGKETFFNFGTLKPNSSAAGNFVLRQPRLQFQADNQQATFTA 721
              |.|||.....|.....|.....|                                        
  Rat  1035 --GRSSSQYHDFAEKLKMSPGRSRG---------------------------------------- 1057

  Fly   722 PKACPTSPSAIPKPANSLATSSFGSLASTNSSTVTPTPSACSICSAVVSSKEVTQARKYGVVACD 786
                   |...|...|...|  |...|:.:|.....:|::.|:.||..::   |:|..||.....
  Rat  1058 -------PGGDPHHMNPHMT--FSERANRSSLHAPFSPNSESLASAYHTN---TRAHAYGDPNTG 1110

  Fly   787 V------CRKFFSKMTKKSISANSSTANTSSGSQQYLQCKGNEGSPCS---IHSAKSQLKNFKK- 841
            :      .|:.:.:..::.....||:|.....:.|:.| :|...||..   :...:|.|||.|. 
  Rat  1111 LNSQLHYKRQMYQQQQEEYKDWASSSAQGVIAAAQHRQ-EGPRKSPRQQQFLDRVRSPLKNDKDG 1174

  Fly   842 --------FYKDRCTACWLKKCMISFQLPAAHRSRLSAILPPGMRGEAAAREEKSAELLSPTGSL 898
                    .|.|..|. ...:|::|               ..||..::...:..|.:|......|
  Rat  1175 MMYGPPVGTYHDPSTQ-EAGRCLMS---------------SDGMPAKSMELKHSSQKLQESCWDL 1223

  Fly   899 -RFTSTASSSSPSVVASTSVKWKSSG-----------DSTSALTSIKPN------PLAENNVTFG 945
             |.||.|.||.|..:::.    |..|           :||.   |.||:      |..|::.:  
  Rat  1224 SRQTSPAKSSGPPGMSNQ----KRYGPPHEPDGHGLAESTQ---SSKPSNVMLRLPGQEDHSS-- 1279

  Fly   946 STPLLRPAILENPLFLKISNAADQKLAAAEAISPSLTKKNSKQEKEKVKESEQSEK--LLSPTQA 1008
                      :|||.:        :......|||.    .||::.:.||.|...:|  ||.|::.
  Rat  1280 ----------QNPLIM--------RRRVRSFISPI----PSKRQSQDVKNSNTDDKGRLLHPSKE 1322

  Fly  1009 GTKKSGAAEAQVEEVQPQKEEAPQTSTTTQPSASNGASHGVPQAELAGETNA------TGDTLKR 1067
            |..|...:.:.:...|..|....:.::...|:..|   ...|...|......      .|..||.
  Rat  1323 GADKVYNSYSHLSHSQDIKSIPKRDASKDLPNPDN---RNCPAVTLTSPAKTKILPPRKGRGLKL 1384

  Fly  1068 QRI--DLKGPRVKHVCRSASIVLGQPLATFGE-----DQQPE------DAADMQQ---EIAAPVP 1116
            :.|  .:..|.::....:.|...|....|..:     ...||      ..|:|::   |:.:.:.
  Rat  1385 EAIVQKITSPNIRRSASANSAETGGDTVTLDDILSLKSGPPEGGTVANQEAEMEKRKGEVVSDLV 1449

  Fly  1117 SAI-MEPSPEKPTHIVTDE-----NDNC---ASCKTSPVGDESKPSKSSGSAQAEVKKATALGKE 1172
            ||. .|.:.|||....::|     :|..   |..:|:..|.|...:.:|.::|...      |.:
  Rat  1450 SATNQESNVEKPLPGPSEEWRGSGDDKVKTEAHVETASTGKEPSGTMTSTASQKPG------GNQ 1508

  Fly  1173 GTASAAGGSSAKV---TTRNAAVASNLIVAASKK--QRNGDIATSSSVTQS--------SNQTQG 1224
            |....:.|.:|.:   .::|.|....|...|:.|  ::..|....|...:|        |.:.:|
  Rat  1509 GRPDGSLGGAAPLIFPDSKNVAPVGILTPEANPKTEEKENDTVMISPKQESFPPKGYFPSGKKKG 1573

  Fly  1225 R---------------------------------KTKEHRQQRTLISIDFWENYDPAEVCQTGFG 1256
            |                                 |.|:.||:|        |...|        |
  Rat  1574 RPIGSVNKQKKQQQQPPPPPQPPQMPEGSADGEPKPKKQRQRR--------ERRKP--------G 1622

  Fly  1257 LIVTETVAQRALCFLCGSTGLDPLIFCACCCEPYHQYCVQDEYNLKHGSFEDTTLMGSLLETTVN 1321
            ....:...::|:          |::      ||.     :.|..||:.:            ..::
  Rat  1623 AQPRKRKTKQAV----------PIV------EPQ-----EPEIKLKYAT------------QPLD 1654

  Fly  1322 ASTGPSSSLNQLTQRLNWLCPRCTVCYTCNMSSGSKVKCQKCQKNYHSTC---LGTSKRLLGADR 1383
            .:...:.|.......:| .|....||...|.....:.|..:.:|...|..   ..|..::|.|..
  Rat  1655 KTDAKNKSFFPYIHVVN-KCELGAVCTIINAEEEEQTKLVRSRKGQRSLTPPPSSTESKVLPASS 1718

  Fly  1384 -----PLICVNCLKCKSCSTTKVSKFVGNLPMCTGCFKLRKKGNFCPICQRCYDDNDFDLKMMEC 1443
                 |::             ..|..:|:|..|.                               
  Rat  1719 FMLQGPVV-------------TESSVMGHLVCCL------------------------------- 1739

  Fly  1444 GDCGQWVHSKCEGLSDEQYNLLSTLPESIEFICKKCARRNESSKIKAEEWRQAVMEEFKASLYSV 1508
              ||:|                             .:.||              |.:.....|  
  Rat  1740 --CGKW-----------------------------ASYRN--------------MGDLFGPFY-- 1757

  Fly  1509 LKLLSKSRQACALLKLSPRKKLRCTCGASSNQGKLQPKALQFSSGSDNGLGSDGESQNSDDVYEF 1573
                 ....|..|.|..|.|:      ::..|.|::.:....|:||            ..|..|.
  Rat  1758 -----PQDYAATLPKNPPPKR------STEMQNKVKVRHKSASNGS------------KTDTEEE 1799

  Fly  1574 KDQQQQQQQRNANMNKPRVKSLPCSCQQHISHSQSFSLVDIKQKIAGNSYVSLAEFNYDMSQVIQ 1638
            ::|||||:::.:....||.|      ::|.|                                  
  Rat  1800 EEQQQQQKEQRSLAAHPRFK------RRHRS---------------------------------- 1824

  Fly  1639 QSNCDELDIAYKELLSEQFPWFQNETKACTDALEEDMFESCSGGNYEDLQDTGGVSASVYNEHST 1703
                                                  |.|.||                     
  Rat  1825 --------------------------------------EDCGGG--------------------- 1830

  Fly  1704 SQAESRSGVLDIPLEEVDDFGSCGIKMRLDTRMCL-FCRKSGEGLSGEEARLLYCGHDCWVHTNC 1767
                .||....:|.::....|| ..|...||:..: ...:.|..|..:...|....::.|||..|
  Rat  1831 ----PRSLSRGLPCKKAAAEGS-SEKTASDTKPSVPTTSEGGPELELQIPELPLDSNEFWVHEGC 1890

  Fly  1768 AMWSAEVFEEIDGSLQNVHSAVARGRMIKCTVCGNRGATVGCNVRSCGEHYHYPCARSIDCAFLT 1832
            .:|:..:: .:.|.|..:..|:...|.:||:.|...|||:||..:.|...||||||...||....
  Rat  1891 ILWANGIY-LVCGRLYGLQEALEIAREMKCSHCQEAGATLGCYNKGCSFRYHYPCAIDADCLLHE 1954

  Fly  1833 DK-SMYCPAH 1841
            :. |:.||.|
  Rat  1955 ENFSVRCPKH 1964
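
The identity and gap counts can be recomputed from the aligned rows themselves. The sketch below counts identical and gapped columns for two equal-length aligned strings, using the first 24 columns of the first block as input. The function name `column_stats` is hypothetical. Similarity is left out because it depends on a substitution matrix that this page does not name.

```python
def column_stats(row_a: str, row_b: str, gap: str = "-") -> dict:
    """Count identical and gapped columns in two aligned rows of equal length."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    gaps = 0
    for a, b in zip(row_a, row_b):
        if a == gap or b == gap:
            gaps += 1          # a gap in either row makes this a gap column
        elif a == b:
            identical += 1     # same residue in both rows
    return {"length": len(row_a), "identical": identical, "gaps": gaps}

# First 24 columns of the first alignment block (Fly 7.., Rat 483..).
fly = "PGKPSKSINRKRISVL------QL"
rat = "PGLSSLSALSTQVANLPNTVQHML"

stats = column_stats(fly, rat)
print(stats)  # {'length': 24, 'identical': 6, 'gaps': 6}
```

The six identical columns agree with the six `|` symbols in the corresponding stretch of the match line.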

Known Domains:


Indicated by green bases in alignment.

Gene   Sequence        Domain           Region         External ID  Identity
trx    NP_476769.1     NR_DBD_like      762..>856      CDD:413390   24/111 (22%)
                       PRK13914         <1006..>1224   CDD:237555   48/261 (18%)
                       PHD1_KMT2A_like  1268..1344     CDD:276981   9/75 (12%)
                       PHD              1346..1390     CDD:214584   10/51 (20%)
                       PHD3_KMT2A_like  1423..1479     CDD:276983   3/55 (5%)
                       ePHD_KMT2A_like  1737..1841     CDD:277134   32/105 (30%)
                       FYRN             1890..1937     CDD:461787
                       FYRC             3388..3476     CDD:197781
                       SET_KMT2A_2B     3575..3726     CDD:380947
Tcf20  NP_001426756.1  None
Blue background indicates that the domain is not in the aligned region.
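
The Region column can be parsed to check which domains fall inside the aligned part of the fly sequence, which runs from residue 7 to residue 1841 in the alignment above. The sketch below is illustrative only: `Region`, `parse_region`, and `in_aligned_region` are hypothetical names, and reading `<` and `>` as markers of a partial domain boundary is an assumption based on common feature-location notation, not something this page states.

```python
import re
from typing import NamedTuple

class Region(NamedTuple):
    start: int
    end: int
    start_partial: bool  # '<' before the start coordinate (assumed: truncated)
    end_partial: bool    # '>' before the end coordinate (assumed: truncated)

_REGION = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")

def parse_region(text: str) -> Region:
    """Parse strings such as '762..>856' or '<1006..>1224'."""
    m = _REGION.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognized region: {text!r}")
    return Region(int(m.group(2)), int(m.group(4)),
                  m.group(1) == "<", m.group(3) == ">")

def in_aligned_region(region: Region, aligned_start: int = 7,
                      aligned_end: int = 1841) -> bool:
    """True if the domain overlaps the aligned span of the fly sequence."""
    return region.start <= aligned_end and region.end >= aligned_start

ephd = parse_region("1737..1841")   # ePHD_KMT2A_like
fyrn = parse_region("1890..1937")   # FYRN
print(in_aligned_region(ephd), in_aligned_region(fyrn))  # True False
```

Under these assumptions, the three domains listed without an identity value (FYRN, FYRC, SET_KMT2A_2B) are exactly the ones that start after residue 1841.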
