DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment tho2 and Thoc2l

DIOPT Version :9

Sequence 1:NP_608646.3 Gene:tho2 / 326153 FlyBaseID:FBgn0031390 Length:1642 Species:Drosophila melanogaster
Sequence 2:NP_001160053.1 Gene:Thoc2l / 100042165 MGIID:3040669 Length:1589 Species:Mus musculus


Alignment Length:1649 Identity:666/1649 - (40%)
Similarity:967/1649 - (58%) Gaps:182/1649 - (11%)


- Green bases have known domain annotations that are detailed below.


  Fly    63 EVFKHLDKSGRSEFLKYLKHQLQD-AADTPVYDKNGVFKQGISNAIYQLLWQTLRFTHKKDIVLH 126
            |..|..:||||.|||:..:...:. :.||..:       :....|:|:|.:..::.|.|.:...:
Mouse    13 EWIKTWEKSGRGEFLRLCRILSESKSRDTFAW-------RDFQQALYELSYHVIKGTLKPEQASN 70

  Fly   127 LLLEVVALHADFPSLIVDVVNILDSETSLITDGLQEERHAFVQLVKDLDRVIPESLLKERLEIDT 191
            :|.::.....|.|.::.|:..|||.||:.:.:  :.:|..|.||:.....::.:::|||||:.:|
Mouse    71 VLNDISEFREDIPFILADIFCILDIETNCLEE--KSKRDHFTQLILACLYLVSDTVLKERLDPET 133

  Fly   192 LQEAGIVK-NKSFYSKFIKVKTKLYYKQRRFNLFREESEGFAKLITELNQEFDENTTPESIMDII 255
            |...|::| ::.|..|.:|:||||:|||::|||.|||:||:||||.||.|:...|.|.:.|::|:
Mouse   134 LGSLGLIKQSQQFNQKSVKIKTKLFYKQQKFNLLREENEGYAKLIVELGQDLSGNITSDLILEIL 198

  Fly   256 KSLIGCFNLDPNRVLDIIIESFETRPDRWNLFIPLLRSYMP--TGAIICEVLGYKFCHFKD--SR 316
            ||||||||||||||||||:|.||.||:..:.||.||.:||.  ....:|.:||:||..:::  ..
Mouse   199 KSLIGCFNLDPNRVLDIILEVFECRPEHHDFFISLLEAYMSMCEPHTLCHILGFKFKFYQEPNGE 263

  Fly   317 TPRSLYHVCALLLKHGVIALNDVYVWLTPNDGSIKADWEEDLADAREMVRKLNL-IQTNKKEDEK 380
            ||.|||...|:||:..:|.|:|:||.|.|.|..|.::::.::.:|:::|:||.: :..::|.||:
Mouse   264 TPSSLYRAAAVLLQFNLIDLDDLYVHLLPADSCIVSEYKREIVEAKQIVKKLTMVVLPSEKSDER 328

  Fly   381 DPPPPPSVKKFNE--EKYNANQKFGLCEALLKVGDWENAYKIIQKLPEQAVVLQEPIARAIADLI 443
            :     ..||.::  ||...|||.||.||||.:|||::|..|:..:|.......:.||.||.:||
Mouse   329 E-----KEKKKDDKVEKAPDNQKLGLLEALLIIGDWKHAQSIMDHMPPYYATSHKVIALAICNLI 388

  Fly   444 HLSVENIYYKKCFKAPAGRRPSRNRLYEDSKLVAKMQAKEFGDLRKYTWPMANVLGPAMHYDTVL 508
            |:::|.||.:  ...|.|.:.|.....::.|  |..||:.|.|||:..:.|...|||.:.:|.:|
Mouse   389 HITIEPIYRR--VGVPKGAKGSPVNALQNKK--APKQAESFEDLRRDVFSMFYYLGPHLSHDPIL 449

  Fly   509 MYKLIRIMRKLVVDMGVDSLNGPPPNSEAEQHYYD-IMSSLDACILPSLLYLDNNCSMSEEIWAV 572
            ..|::||.:..:.:...|   |...|.|..:.... ::|..|..:||||..:|.|..||||:|.:
Mouse   450 FAKVVRIGKSFMKEFQSD---GKQENKEKMEAILSCLLSVTDQVLLPSLSLMDCNACMSEELWGM 511

  Fly   573 LKYFPYHFRYSLYARWKNDSYQLHPNLIRRCGLAQRDIKALMKRVSKENVKQLGRLVGKYSHCAP 637
            .|.|||..||.||.:|||::|..||.|::.........|.:|||::|||||..||.:||.||..|
Mouse   512 FKTFPYQHRYRLYGQWKNETYNSHPLLVKVKAYTIDRAKYIMKRLTKENVKPSGRQIGKLSHSNP 576

  Fly   638 GLLFDYILLQIQIYDNLIGPVCDLLKYLTALSFDCLGYCIIESLTLTGRLRFKDDGTSLSLWLQS 702
            .:||||||.|||.|||||.||.|.|||||||::|.|.|||||:|....:.|.|.|.|::|.||||
Mouse   577 TILFDYILSQIQKYDNLITPVVDSLKYLTALNYDVLAYCIIEALANPEKERMKHDDTTISNWLQS 641

  Fly   703 LASFCGTIYKKYSIELSGLLQYVANQLKSQKSLDLLILREIVHKMAGVESCEEMTNDQLQAMCGG 767
            ||||||.:::||.|:|:|||||||||||:.||.|||||:|:|.||||:|..||||.:||:||.||
Mouse   642 LASFCGAVFRKYPIDLAGLLQYVANQLKAGKSFDLLILKEVVQKMAGIEVTEEMTMEQLEAMTGG 706

  Fly   768 EQLRGEAGYFSQVRNTKKSSNRLKEALANNDLAVAICLLMAQQKHCVIYRETAAHSHLKLVGNLY 832
            |||:.|.|||.|:|||||||.|||:||.::|||:.:|||||||::.||::| ....||||||.||
Mouse   707 EQLKAEGGYFGQIRNTKKSSQRLKDALLDHDLALPLCLLMAQQRNGVIFQE-GGEKHLKLVGKLY 770

  Fly   833 DQCQDTLVQFGTFLGSTYSVDEYVERLPSIITMLREYHINTDVAFFLARPMFTHQINQKYDQLRK 897
            |||.|||||||.||.|..|.|:|::|:|||..:..|:|...|.||||:|||:.|.|:.|||:|:|
Mouse   771 DQCHDTLVQFGGFLASNLSTDDYIKRVPSIDVLCNEFHTPHDAAFFLSRPMYAHHISSKYDELKK 835

  Fly   898 DDPNAKKLTTTQKLQKYLEATQLIMNPIVESVRPLHSSKVWEDISPQFLVTFWSLSMYDLHVPNE 962
            .:..:|:   ..|:.||:.:.:::|.|:.|:|..||.||||:||||||..|||||:||||.||:.
Mouse   836 SEKGSKQ---QHKVHKYIMSCEMVMAPVHEAVVSLHISKVWDDISPQFYTTFWSLTMYDLAVPHT 897

  Fly   963 SYQREVAKLKQLAQQAAEGKDSNQSKNKKEQERYIALMEKLNDERKKQHEHVDKISQRLQEQKDS 1027
            ||:|||.|||...:...:.::...:|.|||:||..||.:||.:|.|||.|||.::..||:.:||:
Mouse   898 SYEREVNKLKIQMKAVDDSQEMPLNKKKKEKERCTALQDKLLEEEKKQTEHVQRVLHRLKLEKDN 962

  Fly  1028 WFLLRSGKSAKNDTITQFLQLCLFPRCTFTALDALYCAKFVHTIHNLKTSNFSTLLCYDRIFCDI 1092
            |.|   .||.||:|||:|||||:||||.|:|:|::|||.||..:|..||.||||||||||:|.||
Mouse   963 WLL---AKSTKNETITKFLQLCIFPRCIFSAIDSVYCAHFVELVHQQKTPNFSTLLCYDRVFSDI 1024

  Fly  1093 TYSVTSCTEGEATRYGRFLCAMLETVMRWHADQAVFNKECANYPGFVTKFRVSNQFSEAN--DHV 1155
            .|.|.|.||.||:|||||||.|||||.|||:|::.:.|||.|||||:|..|.:. |...|  |.:
Mouse  1025 IYMVASFTENEASRYGRFLCCMLETVTRWHSDRSTYEKECGNYPGFLTILRATG-FDGGNKADQL 1088

  Fly  1156 GYENYRHVCHKWHYKITKAIVFCLDSKDFMQIRNALIILMRILPHYPVLSKLAQIIERKVDKVRE 1220
            .|||:|||.||||||:|||.|.||::.::..|||.||:|.:|||.||.:..|.|.:||:|.|:.:
Mouse  1089 DYENFRHVVHKWHYKLTKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICQ 1153

  Fly  1221 EEKTKRPDLYAIASSYIGQLKLKTPHMLKESVFHQIAERPNKES------------PTSVG---- 1269
            |||.|:|||||:|..|.||||.:..:|:.|:.||.....|...:            |:|:|    
Mouse  1154 EEKEKKPDLYALAMVYSGQLKSRKSYMIPENEFHHKDPPPRNTATNLQPSGPCSGLPSSIGSMCK 1218

  Fly  1270 ---------------APAAATRSDKLSPTSPSGN-TQGTRAPGGAAPFYNSEQKSVIKEPDAKAA 1318
                           |..|...::|.|..:|.|| :.|.......|...|.::|...||.:.|..
Mouse  1219 LDESSAEEADKSRERAQCAVKAANKASSVTPKGNLSNGNSGSNSKAVKENDKEKGKEKEKEKKEK 1283

  Fly  1319 STT---------------------------RES---------------KSQRGEGNNVTLVSTAT 1341
            :..                           ||:               |.::.:.....::....
Mouse  1284 TPAVTPEARVLGKDSKEKPKEEQPNKDEKIREAKERMPKSDKDKEKLKKEEKAKDEKFRIIVANV 1348

  Fly  1342 STASNNERE-----SKQRDLPAPRESQSRSKDDQQEQANNGSNGSRQSESRNRDV---ERDRHDQ 1398
            .:.|..|||     ||:|||....:|:...|..::...    :||.:|.....|:   ||::..:
Mouse  1349 ESKSTQEREKEKEPSKERDLAKEMKSKENVKGGEKAPV----SGSLKSPISRTDITEPEREKRRK 1409

  Fly  1399 QDQRSISSHRSS-RD-IVRVKERTEAELH--------QRSRER---SQRLEELEAQQRKREKTSR 1450
            .|.....||.|: :| :|::|| :.|:|:        .:|:||   .:.|::...:.|:|||...
Mouse  1410 VDSHPSPSHSSTIKDSLVKLKE-SSAKLYINHIPPLLCKSKEREADKKDLDKSRERSREREKKEE 1473

  Fly  1451 RGGEERIR---HGDGVETVDLVGSTDNRHYEEFEGRMRDLSSVSNESNGSLHHRQRSHETIEFEK 1512
            :..:||.|   :.|....:||:            .|.:|.:.:...|.   |..:...|::...:
Mouse  1474 KDRKERKRDYSNNDREAPLDLI------------KRRKDENGILGVSK---HKSESPCESLYPNE 1523

  Fly  1513 VDSKRRKLESSTSSSKKVEELVDSVKKARALKTKERNKDKLSDEERDARKDRKLGRKRDRVDESN 1577
            .|.::.|   |.||.|   |..||.|..:.        ||:|..::::..|::...|:::.|.|.
Mouse  1524 KDKEKMK---SKSSGK---EKGDSFKPEKI--------DKISSGKKESGHDKEKIEKKEKWDGSG 1574

  Fly  1578 SNEHKRRREGQNGEEELRDRERHR 1601
            ..|.|:..         :..::||
Mouse  1575 DKEEKKHH---------KTSDKHR 1589

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
tho2NP_608646.3 THOC2_N 133..627 CDD:292752 198/502 (39%)
Thoc2 629..703 CDD:288568 45/73 (62%)
Tho2 939..1240 CDD:288156 171/302 (57%)
PRP38_assoc 1570..1639 CDD:289628 6/32 (19%)
Thoc2lNP_001160053.1 THOC2_N 13..566 CDD:406523 215/573 (38%)
Thoc2 568..642 CDD:403051 45/73 (62%)
Tho2 874..1173 CDD:402722 171/302 (57%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C167850312
Domainoid 1 1.000 378 1.000 Domainoid score I853
eggNOG 1 0.900 - - E1_KOG1874
Hieranoid 1 1.000 - -
Homologene 00.000 Not matched by this tool.
Inparanoid 1 1.050 1163 1.000 Inparanoid score I204
Isobase 00.000 Not matched by this tool.
OMA 1 1.010 - - QHG55273
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 1 1.000 - - FOG0003449
OrthoInspector 1 1.000 - - otm43092
orthoMCL 1 0.900 - - OOG6_102850
Panther 1 1.100 - - O PTHR21597
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R3684
SonicParanoid 1 1.000 - - X2346
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 1 0.960 - -
1413.790

Return to query results.
Submit another query.