DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment: CG5098 and Tcf20

Sequence 1:NP_001261066.1 Gene:CG5098 FlyBaseID:FBgn0034300 Length:1339 Species:Drosophila melanogaster
Sequence 2:NP_001107612.1 Gene:Tcf20 MGIID:108399 Length:1987 Species:Mus musculus

Alignment Length:2018 Identity:378/2019 (19%)
Similarity:599/2019 (30%) Gaps:812/2019 (40%)


  Fly     1 MANNNHPHPGHLSQAAQNASW-----NPLQIPSYIPRQPQLAHMSNERPGMVRSPLAWHVSGTVS 60
            ||:....|.|:.....:...:     |...:.:..|:.||      .||           ||.|.
Mouse    68 MASETSGHQGYQGFRKEAGDFYYMAGNKDTVAAGTPQPPQ------RRP-----------SGPVQ 115

  Fly    61 AQPPPPPDAIYN-FMSANHKN-----------LDHHHQMNPLLAPPYSGTLPFNSMDLSLQSARS 113
            :..||...:..| :.|..|.:           :.|:.|       .|:|  ||:......|...|
Mouse   116 SYGPPQGSSFGNQYASEGHVSQFQAQHSALGGVSHYQQ-------DYTG--PFSPGSAQYQQQAS 171

  Fly   114 AAQPLAKQPPNQQPHQTQQQQQSLIHAPNYPSIQNL-TTNATPT--STQLQQQQQQEHLAAMAAA 175
            :.|...:|...||..|.||||...:....|.|.|.| .|...|.  |:.||..|:...|.:.|..
Mouse   172 SQQQQQQQQQQQQQQQQQQQQVQQLRQQLYQSHQPLPQTTGQPASGSSHLQPMQRPSTLPSSAGY 236

  Fly   176 HVSLLQSSRQNQGAPSGNLSNGGDCESLLPPPPPTSVSGNTNHTGSNSSSNSGSNNHIASPHYMQ 240
            .:.:.|..:..|.:.|.:.|      |..|.|...|.||. ::.|| .|.|:||.   ...|.:.
Mouse   237 QLRVGQFGQHYQSSASSSSS------SSFPSPQRFSQSGQ-SYDGS-YSVNAGSQ---YEGHNVG 290

  Fly   241 SRDENFKLTQLKRSFEPDLSGKNPQKEKDFGYPSASSASKLPTHNVQ-QQHANKKPSPLRNYHQQ 304
            |..:.:. ||...|::       ||..|:|      ..:|:|..|.| ||...::|.|.:...||
Mouse   291 SNAQAYG-TQSNYSYQ-------PQSMKNF------EQAKIPPGNQQGQQQQQQQPQPQQQQPQQ 341

  Fly   305 QQ--------PPYNL---------------TPKYNGPQTP----------------PTPQSPLAA 330
            ||        ||.::               ..:||.|:.|                |:|.:.:..
Mouse   342 QQQQQQQQQHPPQHVMQYTNAATKMPLQSQVGQYNQPEVPVRSPMQFHQNFSPISNPSPAASVVQ 406

  Fly   331 NP--HQMLSPTMDYNQLHLHHQLNSSSG----GSYQHMQQDQTQSQSHPQHLHYHNQHA------ 383
            :|  ....||.|...:     .|....|    .|...:.|...|....|..:...|.||      
Mouse   407 SPSCSSTPSPLMQSGE-----NLQCGQGNVPMSSRNRILQLMPQLSPTPSMMPSPNSHAAGFKGF 466

  Fly   384 ---------------------TSQTAPPP------LLPPLLTSGQFHAQPQDASQQQTASSSQHQ 421
                                 :||.|..|      ||...||       ||..:.::.:|||:..
Mouse   467 GLEGVPEKRLTDPGLSSLSALSSQVANLPNTVQHMLLSDALT-------PQKKTSKRPSSSSKKA 524

  Fly   422 ---THHSRTAQ---------LTNLDQAVKHKPESE--------EQPVITDLSYRNSETDKTAANP 466
               |:...::|         ..:||.......|.:        .|...:|.:|:...::|..::|
Mouse   525 DSCTNSEGSSQPEEQLKSPMAESLDGGCSSSSEDQGERVRQLSGQSTSSDTTYKCGASEKAGSSP 589

  Fly   467 VPEAP-ESPYLTTS------------NEESLESNSNSSNSRK----------------------- 495
            ...|. |:|.|:||            .:.||.|..|:..:.|                       
Mouse   590 TQGAQNEAPRLSTSPATRDEAASPGAKDTSLSSEGNTKVNEKTVGVIVSREAMTGRVEKSGGQDK 654

  Fly   496 ----------RRKRKASMVMRVT----PNENAP--------EGENSKPQH--------------P 524
                      :|....|.|..::    |..:.|        .|:|:...|              |
Mouse   655 GSQEDDPAASQRPPSNSGVKEISHTSLPQPDPPGGGSKGNKNGDNNSSNHNGEGNGPSSHSAVGP 719

  Fly   525 QQAANLNNSCSP-------KKS-----PKN--------GGGEFQPFSTQKQSQTENEK------- 562
            ........|.||       |:|     |:|        .|.|...|.:..:.:..|||       
Mouse   720 SFTGRTEPSKSPGSLRYSYKESFGSAVPRNVSGYPQYPSGQEKGDFGSHGERKGRNEKFPSLLQE 784

  Fly   563 ---------------TTQE-----NGRGGSPAPAENNSNSN---SSTLYNDN-----ENP----- 594
                           :.||     :|..|:..|....|.:|   |..|.|.:     |||     
Mouse   785 VLQGYHHHPDRRYPRSAQEHQGMASGLEGTARPNILVSQTNELASRGLLNKSIGSLLENPHWGPW 849

  Fly   595 --KTKKQRQALLQRNLTEQHRMQQ-DDEPPKNHTSPA------------MPP------------- 631
              |:......:.|.||::....:: :.|||.:...|.            :.|             
Mouse   850 ERKSSSTAPEMKQINLSDYPIPRKFEIEPPSSAHEPGGSLSERRSVICDISPLRQIVRDPGAHSL 914

  Fly   632 ----------------PS------------------------------PQSNSSSSSSSSSSANT 650
                            ||                              .||.|.:|.:..|..:.
Mouse   915 GHMGTDARIGRNERLNPSLSQSVILPGGLVSMETKLKSQSGQIKEEDFEQSKSQASFNKKSGDHC 979

  Fly   651 HSSQSSHAV---NNIP----KPEINNKATTDTPASPALVEQGDIDAKPAVSVHECDEEEEPAVN- 707
            |.:...|..   |..|    ...|::....|:.::|.....|.:.::..:......:..:.|.. 
Mouse   980 HPTSIKHETYRGNASPGAAAHDSISDYGPQDSRSTPMRRVPGRVGSRETMRGRSSSQYHDFAEKL 1044

  Fly   708 KVSPA----------HPDPPTT-------AAVAAP--PATESPKKSSPAANSESCPFGEVEDKL- 752
            |:||.          |.:|..|       :::.||  |.:|| ..|:...|:.:..:|:....| 
Mouse  1045 KMSPGRSRGPGGDPHHMNPHMTFSERANRSSLHAPFSPNSES-LASAYHTNTRAHAYGDPNTGLN 1108

  Fly   753 ------EQMFAGIEEE--------------------------------TERISSPEK-------- 771
                  .||:...:||                                .:|:.||.|        
Mouse  1109 SQLHYKRQMYQQQQEEYKDWASSSAQGVIAAAQHRQEGPRKSPRQQQFLDRVRSPLKNDKDGMMY 1173

  Fly   772 ---------PAEESAA---MVAHNLTAQ-------------------LALDPSKTLDTPA----- 800
                     |:.:.|.   |.:..|.|:                   ....|:|:...|.     
Mouse  1174 GPPVGTYHDPSTQEAGRCLMSSDGLPAKSMELKHSSQKLQESCWDLSRQTSPAKSSGPPGMSNQK 1238

  Fly   801 ------------------ENQTSVLAVLAPNQTPTPEIRPVATKAAMKSTMPSPVHSPIPQSRST 847
                              .::.|.:.:..|.|.......|:..:..::|.:     ||||..|.:
Mouse  1239 RYGPPHEPDGHGLAESAQSSKPSNVMLRLPGQEDHSSQNPLIMRRRVRSFI-----SPIPSKRQS 1298

  Fly   848 STPLVAGDDSKSNTPVPAKAPA-------------------PRR--------------------- 872
            .....:..|.|.....|:|..|                   |:|                     
Mouse  1299 QDVKNSNADDKGRLLHPSKEGADKAYNSYSHLSHSQDIKSIPKRDSSKDLPNPDNRNCPAVTLTS 1363

  Fly   873 -------PPPRRLSMGMDASLLRF---------------------MIDD-------PP------- 895
                   ||.:...:.::|.:.:.                     .:||       ||       
Mouse  1364 PAKTKILPPRKGRGLKLEAIVQKITSPNIRRSASANSAEAGGDTVTLDDILSLKSGPPEGGTVAT 1428

  Fly   896 --AKKPGRKKKVTKE-----------------PDFE----DDDKPSTSAAAAAALAARQLSEAAS 937
              |:...||.:|..:                 |..|    .|||..|.|....|...::.|...:
Mouse  1429 QEAEMEKRKCEVVSDLVSVTNQESNVEKPLPGPSEEWRGSGDDKVKTEAHVETASTGKEPSGTMT 1493

  Fly   938 ATKSK----------------------------------PAAGAK---KKNAGV----------- 954
            :|.|:                                  |.|..|   |:|..|           
Mouse  1494 STASQKPGGNQGRPDGSLGGAAPLIFPDSKNVAPVGILAPEANPKAEEKENDTVMISPKQESFPP 1558

  Fly   955 -----KGKK-----GSAGK-----------------------GNAKNAKQNGKKSARKPAFTTDE 986
                 .|||     ||..|                       |..|..||..::..|||.     
Mouse  1559 KGYFPSGKKKGRPIGSVNKQKKQQQQPPPPPQPPQMPEGSADGEPKPKKQRQRRERRKPG----- 1618

  Fly   987 DSTPAPTNGGGSVPELRFKSPFILIK----PDGSVSIKNTH--------------------SAED 1027
             :.|.......:||.:..:.|.|.:|    |......||..                    :||:
Mouse  1619 -AQPRKRKTKQAVPIVEPQEPEIKLKYATQPLDKTDAKNKSFFPYIHVVNKCELGAVCTIINAEE 1682

  Fly  1028 VNEKQTKVKKAPHERKNLRGMHSSTLSNRYDADT--------TDST----WICVFCKRGPHKLGL 1080
              |:|||:.::...:::|....|||.|....|.:        |:|:    .:|..|.:......:
Mouse  1683 --EEQTKLVRSRKGQRSLTPPPSSTESKVLPASSFMLQGPVVTESSVMGHLVCCLCGKWASYRNM 1745

  Fly  1081 GDLFGPYLVTSDCDEYRAAV-QTPGAQDIDGMFVNKRRREDMVKGQERNLPAVPATLANIMQAPK 1144
            ||||||:..    .:|.|.: :.|..         ||..|...|.:.|:..|...:..:..:..:
Mouse  1746 GDLFGPFYP----QDYAATLPKNPPP---------KRSSEMQSKVKVRHKSASNGSKTDTEEEEE 1797

  Fly  1145 ISMHKRKR--------KQTHDSSISYSDDPNESRSQCSSVDLLDCSTESKFVETFRGMGKTSENG 1201
            ....|.:|        |:.|.|........:.||.........:.|:|....:|...:..|||.|
Mouse  1798 QQQQKEQRSLAAHPRFKRRHRSEDCGGGPRSLSRGLPCKKAATEGSSEKTVSDTKPSVPTTSEGG 1862

  Fly  1202 FEV--------------WLHEDCAVWSNDIHLIGAHVNGLDAAVWDSTRYQCVLCQQTGASICCF 1252
            .|:              |:||.|.:|:|.|:|:...:.||..|:..:...:|..||:.||::.|:
Mouse  1863 PELELQIPELPLDSNEFWVHEGCILWANGIYLVCGRLYGLQEALEIAREMKCSHCQEAGATLGCY 1927

  Fly  1253 QRCCKAAAHVPCGRSANWSLSEEDRKVYCHLHRHEPGVVEPIKTESIAPVEVSVATPPAPAP-PP 1316
            .:.|....|.||...|:..|.||:..|.|..|:                       ||.|.| ||
Mouse  1928 NKGCSFRYHYPCAIDADCLLHEENFSVRCPKHK-----------------------PPLPCPLPP 1969

  Fly  1317 AFN 1319
            ..|
Mouse  1970 LQN 1972

Known Domains:


GeneSequenceDomainRegion External IDIdentity
CG5098NP_001261066.1 PHD_SF 1068..1284 CDD:304600 59/239 (25%)
Tcf20NP_001107612.1 Leucine-zipper 1198..1219 2/21 (10%)
Nuclear localization signal 1282..1295 5/18 (28%)
Nuclear localization signal 1604..1628 7/30 (23%)
ePHD_TCF20 1733..1959 CDD:277169 59/239 (25%)
Nuclear localization signal 1812..1819 2/7 (29%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 C112723719
eggNOG 1 0.900 E2759_KOG1084
Hieranoid 00.000 Not matched by this tool.
Homologene 00.000 Not matched by this tool.
Inparanoid 1 1.050 114 1.000 Inparanoid score I4790
Isobase 1 0.950 0 Normalized mean entropy S8124
OMA 00.000 Not matched by this tool.
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 1 1.000 FOG0003562
OrthoInspector 00.000 Not matched by this tool.
orthoMCL 00.000 Not matched by this tool.
Panther 1 1.100 O PTHR14955
Phylome 00.000 Not matched by this tool.
RoundUp 1 1.030 avgDist Average_Evolutionary_Distance R9940
TreeFam 00.000 Not matched by this tool.
76.960

Return to query results.
Submit another query.