DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment htk and Arid5b

DIOPT Version :10

Sequence 1:NP_573330.3 Gene:htk / 32877 FlyBaseID:FBgn0085451 Length:2486 Species:Drosophila melanogaster
Sequence 2:XP_006514192.1 Gene:Arid5b / 71371 MGIID:2175912 Length:1214 Species:Mus musculus


Alignment Length:1186 Identity:228/1186 - (19%)
Similarity:393/1186 - (33%) Gaps:353/1186 - (29%)


- Green bases have known domain annotations that are detailed below.


  Fly    23 AFCEAKVSKVVRNIKVKVAYKQGLGSGIVSDDAIKAPTGQLRVGAVVEVRHPDRKELVEATITKI 87
            |.|..:.|:|:| :::..|.....||....|:.|                     .:.|..|.|:
Mouse    90 ALCLPRQSRVIR-LRLLRAASSAPGSRAFEDEVI---------------------AVSEKVIVKL 132

  Fly    88 QDCSQYTVVFDDGDITTLRRTALCLKSGRHFNESETLDQLPLTHPEHFGNPVVGGRRGRRRGHLN 152
            :|    .|.:...|.:..|    |   |......:|         |.|      ||.|::...|.
Mouse   133 ED----LVKWAHSDFSKWR----C---GLRATPVKT---------EAF------GRNGQKEALLR 171

  Fly   153 EDSSEDDDESDAKEVVNEK------EENIGKVV------CVETESKKKDKEKWFPALVVAPTAQA 205
            ...|..:...:.|:|:.||      ||....:|      |......|:.::|  |:.::  |.|.
Mouse   172 YRQSTLNSGLNFKDVLKEKADLGEDEEETNVIVLSYPQYCRYRSMLKRIQDK--PSSIL--TDQF 232

  Fly   206 TVRI-----------------------RVKDEYLVRSFK---DGRYYTVPKKEATEFTREVASKQ 244
            .:.:                       .:::|.:...|.   .||    |:|:.|...|      
Mouse   233 ALALGGIAVVSRNPQILYCRDTFDHPTLIENESVCDEFAPNLKGR----PRKKKTCPQR------ 287

  Fly   245 DVPAVQAALEFLDSSVLPAHWDRDSLFGLTNISSDDEGEIDS----------------------D 287
                                  |||..|..:.:::.:|::.|                      .
Mouse   288 ----------------------RDSFSGSKDPNNNCDGKVISKVKGEARSALTKPKNNHNNCKKT 330

  Fly   288 SSDDEP--------HEEKDRFVAQLYKYMDDRGTPLNKVPSILSRDVDLYRLFRAVQKRGGYNRV 344
            |::::|        ..::..|:..|||||.:|.||:.::|.:..:.::|:.:|:|.||.|||..:
Mouse   331 SNEEKPKLSIGEECRADEQAFLVALYKYMKERKTPIERIPYLGFKQINLWTMFQAAQKLGGYETI 395

  Fly   345 TSQNQWKLIAMRLGFTPCTVSVMNLVKQAYKKFLQPYGDF-------------HRKLGCSMLMTS 396
            |::.|||.|...||..|.:.|.....::.|::.:.||..|             .||...:   |.
Mouse   396 TARRQWKHIYDELGGNPGSTSAATCTRRHYERLILPYERFIKGEEDKPLPPIKPRKQENN---TQ 457

  Fly   397 RNSNRSK--GRSLVRANSVA-------SPKPMETMKT--------ETISKLAQPN-----QTNVV 439
            .|.|::|  |...::.....       :|||.:|.:.        ||::..:.|.     :....
Mouse   458 ENENKTKVSGNKRIKQEMAKNKKEKENTPKPQDTSEVSSEQRKEEETLNHKSAPEPLPAPEVKGK 522

  Fly   440 ASTSSSAAAASVAASSTPARAVST----ASQSAAEESGNTSESSVVVEPPKKQRKGSAASSQQGK 500
            ........|.:..:.:.|.:|..|    .|:..|||.|:...:.::..||....|.||.:...||
Mouse   523 PEGHKDLGARAPVSRADPEKANETDQGSNSEKEAEEMGDKGLAPLLPSPPLPPEKDSAPTPGAGK 587

  Fly   501 ------VKSLVEKYEEKSTAVQATSSGTVAGSGASASAAAMPTTSAAASTATNLSSATSG--GSA 557
                  ...:..|.|.|......:....:.|:..|:.:|..|..::..........||:.  .:.
Mouse   588 QPLASPSTQMDSKQEAKPCCFTESPEKDLQGAPFSSFSATKPPLTSQNEAEEEQLPATANYIANC 652

  Fly   558 TIATTAMNKDAESDLPLAKIKAAAVAAASTRHSMEKETNI-----------------SSGSSASA 605
            |:....:..|   |:..| :|............|.|:.::                 |.|:....
Mouse   653 TVKVDQLGSD---DIHTA-LKQTPKVLVVQSFDMFKDKDLTGPMNENHGLNYTPLLYSRGNPGIM 713

  Fly   606 SSKANSAEMQRSRDASPSVAAP---PSAGASTSGAAATAQTASNKKEKH--QRSKQADKEKDKDK 665
            |..|....:.:...||.|.:.|   |....|.....|.....|...:.|  |.|......:....
Mouse   714 SPLAKKKLLSQVSGASLSSSYPYGSPPPLISKKKLIAREDLCSGLSQGHHSQSSDHTAVSRPSVI 778

  Fly   666 EEKQASSGKRKKEKISVEKIDTGDFVVGIGDKLKVNYHEKKSPSSH--GSTYEAKVIEISVQRG- 727
            :..|:...|..:::.|:..|..       .|||..:...:...|.|  ||..::.:::...|.| 
Mouse   779 QHVQSFKNKASEDRKSINDIFK-------HDKLSRSDAHRCGFSKHQLGSLADSYILKQETQEGK 836

  Fly   728 -------------VPMYLVHYTGWNNRYDEWVPRERIAENLTKGSKQKTRTISTSSANSGSGGGG 779
                         ||.:|..:....:.:..:...|....| .:.||...|.....|.|       
Mouse   837 DKLLEKRAVSHAHVPSFLADFYSSPHLHSLYRHTEHHLHN-EQSSKYAARDAYQESEN------- 893

  Fly   780 GGGGGGGGGGGGGGGSLLVQGSQPPGVSDKQPGKDGCS---------KMSPSSGNSTGPGAPSLS 835
                          |:.|         |.|.|.|...:         |...::..||......||
Mouse   894 --------------GAFL---------SHKHPEKIHVNYLASLHLQDKKVAAAEASTDDQPTDLS 935

  Fly   836 GSLGGASSTPSLLSTVV--KTPPTGGAKRGRGRSD----SMPPRSTTPS----SVVAHSGRTKSP 890
                 ....|..|::.|  ....|.|::..:|.|.    |...|...|.    |.:..||..|.|
Mouse   936 -----LPKNPHKLTSKVLGLAHSTSGSQEIKGASQFQVVSNQSRDCHPKACRVSPMTMSGPKKYP 995

  Fly   891 -AASQPQLQQQMKKRPTRVVPGTTTP---RRVSDASMASESDSDSDEPVRRPKRQSAKDKPQ--A 949
             :.::.....|::....|.:.|...|   |::|..::.:          .||.::|.:|...  |
Mouse   996 ESLARSGKPHQVRLENFRKMEGMVHPILHRKMSPQNIGA----------ARPIKRSLEDLDLVIA 1050

  Fly   950 GKAQPPGKGRLASSASSTAPAAHPSDDSEEDEEEEEPSAARAASSKQQQQQASSLRGSRAGGNRA 1014
            ||               .|.|..|.|.::|...:|          |..:|::...:|:..|    
Mouse  1051 GK---------------KARAVSPLDPAKEASGKE----------KASEQESEGNKGAYGG---- 1086

  Fly  1015 MSSGAASAKGRDYDLS 1030
             .||||| :|....||
Mouse  1087 -HSGAAS-EGHKLPLS 1100

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
htkNP_573330.3 Tudor_ARID4_rpt1 10..60 CDD:410460 10/36 (28%)
Tudor_ARID4_rpt2 65..118 CDD:410461 9/52 (17%)
RBB1NT 170..263 CDD:462390 19/130 (15%)
ARID 296..381 CDD:460187 28/84 (33%)
CBD_RBP1_like 698..758 CDD:350843 12/75 (16%)
Arid5bXP_006514192.1 ARID_ARID5B 347..441 CDD:350649 31/93 (33%)
PTZ00121 <437..>562 CDD:173412 23/127 (18%)

Return to query results.
Submit another query.