DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment htk and Arid4a

DIOPT Version :10

Sequence 1:NP_573330.3 Gene:htk / 32877 FlyBaseID:FBgn0085451 Length:2486 Species:Drosophila melanogaster
Sequence 2:XP_036013289.1 Gene:Arid4a / 238247 MGIID:2444354 Length:1283 Species:Mus musculus


Alignment Length:1681 Identity:413/1681 - (24%)
Similarity:608/1681 - (36%) Gaps:527/1681 - (31%)


- Green bases have known domain annotations that are detailed below.


  Fly     1 MQQMDDPPSLPVGTEVSAKYKGAFCEAKVSKVVRNIKVKVAYKQGLGSGIVSDDAIKAPTGQLRV 65
            |:..|:|..|.|||:|||||:|||||||:..|.|.:||||..||...:.:|.||.:|.|   |||
Mouse     1 MKAADEPAYLTVGTDVSAKYRGAFCEAKIKTVKRLVKVKVLLKQDNTTQLVQDDQVKGP---LRV 62

  Fly    66 GAVVEVRHPDRKELVEATITKIQDCSQYTVVFDDGDITTLRRTALCLKSGRHFNESETLDQLPLT 130
            ||:||.|..| ..:.||.|:|:.|.|.|||||||||..|||||:||||..|||.|||||||||||
Mouse    63 GAIVETRTSD-GSIQEAIISKLTDASWYTVVFDDGDERTLRRTSLCLKGERHFAESETLDQLPLT 126

  Fly   131 HPEHFGNPVVGGR--RGRR------RGHLNEDSSEDDDESDAKEVVNEKEENIGKVVCVETESKK 187
            :|||||.||:..:  ||||      .....|:|||::||.  |..:|  :|.:||||.|.:.:  
Mouse   127 NPEHFGTPVIAKKTNRGRRSSLPITEDEKEEESSEEEDED--KRRLN--DELLGKVVSVASTA-- 185

  Fly   188 KDKEKWFPALVVAPTAQATVRIRVKDEYLVRSFKDGRYYTVPKKEATEFTREVASKQDV---PAV 249
             :...|:|||||:|:....|.:: ||:.|||||.|.::|::.:|:..|.......:.::   |.:
Mouse   186 -ESTGWYPALVVSPSCNDDVTVK-KDQCLVRSFIDSKFYSIARKDIKELDILTLPESELCARPGL 248

  Fly   250 QAALEFLDSSVLPAHWDRDSLFGLTNISSDDEGEIDSDSSDDEPH-----------------EEK 297
            :.|..||...::|.:|..|....|.:.||||| |..::..::|..                 ||:
Mouse   249 RRASVFLKGRIVPDNWKMDISEILESSSSDDE-ECPAEEHEEEKEKEAKKEEEELPEEELDPEER 312

  Fly   298 DRFVAQLYKYMDDRGTPLNKVPSILSRDVDLYRLFRAVQKRGGYNRVTSQNQWKLIAMRLGFTPC 362
            |.|:.||||:|:|||||:||.|.:..:|::|::|||.|..:||...:.|...||.|.|.||....
Mouse   313 DNFLQQLYKFMEDRGTPINKPPVLGYKDLNLFKLFRLVYHQGGCGNIDSGAVWKQIYMDLGIPIL 377

  Fly   363 TVSVMNLVKQAYKKFLQPYGDFHRKLGCSMLMTSRNSNRSKGRSLVRANSVASPKPMETMKTETI 427
            ..:....||.||:|:|..:.::.|......                |......||..|..|    
Mouse   378 NSAASYNVKTAYRKYLYGFEEYCRSANIQF----------------RTIHHHEPKVKEEKK---- 422

  Fly   428 SKLAQPNQTNVVASTSSSAAAASVAASSTPARAVSTASQSAAEESGNTSESSVVVEPPKKQRK-G 491
                         ....|...|...|...|...|.:..:...:.:..:......::.|:.:|| .
Mouse   423 -------------DFEDSMDEALKEAPEMPLLDVKSEPEENTDSNSESDREDTELKSPRGRRKIV 474

  Fly   492 SAASSQQGKVKSLVEKYEEKSTAVQATSSGTVAGSGASASAAAMPTTSAAASTATNLSSATSGGS 556
            ..|:..:.:::.  ||.|:|                                             
Mouse   475 RDANCIKKEIEE--EKIEDK--------------------------------------------- 492

  Fly   557 ATIATTAMNKDAESDLPLAKIKAAAVAAASTRHSMEKETNISSGSSASASSKANSAEMQRSRDAS 621
             .:.....||||..|      ......||...|                       |:...|.::
Mouse   493 -FLRDDLENKDAGDD------DDDGDPAAKREH-----------------------ELLFGRKST 527

  Fly   622 PSVAAPPSAGASTSGAAATAQTASNKKEKHQRSKQADKEKDKDKEEKQASSGKRKKEKISVEKID 686
            |                      .||::|.:  |..|.|:|.|:||:::...:..:.:...|..|
Mouse   528 P----------------------KNKEKKIK--KPEDSERDSDEEEEKSQEREETESRCDSEGED 568

  Fly   687 TGDFVVG--IGDKLKVNYHEKKSPSSHGSTYEAKVIEISVQRGVPMYLVHYTGWNNRYDEWVPRE 749
            ..|....  .|.|:||.|...|:.    ..|||.:....:..|..:|||||.|||.||||||..:
Mouse   569 EEDDTEPCLTGTKVKVKYGRGKTQ----KIYEASIKSTEMDDGEILYLVHYYGWNVRYDEWVKAD 629

  Fly   750 RIAENLTKGSKQKTRTISTSSANSGSGGGGGGGGGGGGGGGGGGGSLLVQGSQPPGVSDKQPGKD 814
            ||...|.||..:|                                              ||..|.
Mouse   630 RIIWPLDKGGPKK----------------------------------------------KQKKKV 648

  Fly   815 GCSKMSPSSGNSTGPGAPSLSGSLGGASSTPSLLSTVVKTPPTGGAKRGRGRSDSMPPRSTTPSS 879
            .|.                           |.||                               
Mouse   649 KCQ---------------------------PCLL------------------------------- 655

  Fly   880 VVAHS--GRTKSPAASQPQLQQQMKKRPTRVVPGTTTPRRVSDASMASESDSDSDEPVRRPKRQS 942
              ||.  |....|                  ||           ::.::.||:.||     ||..
Mouse   656 --AHCTLGHAFCP------------------VP-----------AVENKEDSEKDE-----KRDE 684

  Fly   943 AKDKPQAGKAQPPGKG--------RLASSASSTAPAAHPSDDSEEDEEEEEPSAARAASSKQQQQ 999
            .:.|.:.|:  ||.|.        .|:.:::|...:...|.|||.|::.|:.|            
Mouse   685 ERQKSKRGR--PPLKSTFSPNMPYSLSKTSNSEGKSDSCSSDSEADDQLEKSS------------ 735

  Fly  1000 QASSLRGSRAGGNRAMSSGAASAKGRDYDLS-EIRSELKGFQPKLLTNAASNEERKDLAKKEPSD 1063
                      ||.               ||| :::.||:           .||...|  .|...:
Mouse   736 ----------GGE---------------DLSPDVKEELE-----------KNENAHD--DKLDEE 762

  Fly  1064 EPALQDIKKE-PKLESSAKSSSTELSSETES----YADEDSQSSDYRKQL----KGSGAGKKEPT 1119
            .|.:..|.|| .:.::....:.|..:.:::.    :.|:..|..:::||:    ||.|...|...
Mouse   763 NPKIVHISKENDRTQAQPSDTLTVEAGDSDQIVHIFGDKVDQVEEFKKQVEKSPKGKGRRSKTKD 827

  Fly  1120 ASPSKLHHEPVTKRELAVKE-------EPLKIEPKT--------EPKEEETKSKPFLSGADIKPT 1169
            .|...:...|..:.|...:.       |...:|.|.        :|.|:|.|.|..:.|......
Mouse   828 LSLELIKISPFGQEEAGSEAHGDVHSLEFSSLECKNFSSTEDDIDPYEKEKKLKRKILGQQSPEK 892

  Fly  1170 ALIAPARFGN-----TSAAQAASSSGSSSTTAKYTSVIVEKPLTIGGKKS--VEQHVPKKAELLK 1227
            .|    |..|     |..:|..|..|:.:.                |.|.  ||||...:.|   
Mouse   893 KL----RLDNGMEMTTGVSQERSDDGAGAE----------------GMKGAHVEQHFETEGE--- 934

  Fly  1228 KQSGGAGGTAASSSAASQESKKFAEPVASLKVEMPA----ACSPSSSSSSSSSFCSTGSAVSSSS 1288
                |.....|......||       :.|.|.:.||    ..:|......:.......:.|....
Mouse   935 ----GMPSLTAEPDQGLQE-------LTSEKSDSPAEEEPVHTPLKEEEDAMPLIGPETLVCHEV 988

  Fly  1289 ATRSLPDMSKLEISSGTVPGATPGAAALQPSNVAQVSSSGAAKESKYSSSGGAAAGSGISMRKLL 1353
            ....|.:..|..|....|.|:...:.|..|..:..|:....:..|..:.|...:       |.:.
Mouse   989 DLDDLDEKDKTSIEDVVVEGSESNSLASVPPALPPVAQHNFSVASPLTLSQDES-------RSIK 1046

  Fly  1354 S-SDVYEFKDTEPFEFEKRISPMASVGGTVAAAATAAGAMAAAGAASVITTMVPT-------PGA 1410
            | ||:....|:...|.::.:....|..|..|:        .|:||.|:|.....:       |..
Mouse  1047 SESDITIEVDSIAEESQEGLCERESANGFEAS--------VASGACSIIAHERESREKGQKRPSD 1103

  Fly  1411 SGSGVAASASTSAPVVVGSAARKQALKASAIQHILEHQSPAAGRERGGYGGMTSS-----ISLLT 1470
            ..||:.|......|....:||:.:  |..|.|.......||          |.||     :..||
Mouse  1104 GNSGLIAKKQKRTPKRTSAAAKTE--KNGAGQSSDSEDLPA----------MDSSSNCTPVKRLT 1156

  Fly  1471 APKLKK------RGSP-LKEAALCMEKAK-----------TYK-------LDKDQVQQPVE--QK 1508
            .||.:|      |.|| :|:|    ||.|           |||       ||.....:.:.  |:
Mouse  1157 LPKSQKLPRSPARTSPHIKDA----EKEKHREKHPNSSPRTYKWSFQLNELDNMNSTERISFLQE 1217

  Fly  1509 TQQLIQPTLGPVESYGAGSPEAMSNLSTPTTSSGHAQLSASSTYSQ--LTPHHATP 1562
            ..|.|:.....::|..|........|........||..|.||..|.  ::|..::|
Mouse  1218 KLQEIRKYYMSLKSEVATIDRRRKRLKKKDREVSHAGASMSSASSDTGMSPSSSSP 1273

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
htkNP_573330.3 Tudor_ARID4_rpt1 10..60 CDD:410460 28/49 (57%)
Tudor_ARID4_rpt2 65..118 CDD:410461 32/52 (62%)
RBB1NT 170..263 CDD:462390 29/95 (31%)
ARID 296..381 CDD:460187 37/84 (44%)
CBD_RBP1_like 698..758 CDD:350843 25/59 (42%)
Arid4aXP_036013289.1 Tudor_ARID4A_rpt1 4..61 CDD:410530 31/59 (53%)
Tudor_SF 62..118 CDD:470623 35/56 (63%)
RBB1NT 170..262 CDD:462390 30/97 (31%)
ARID_ARID4A 312..397 CDD:350646 36/84 (43%)
CBD_RBP1_like 580..638 CDD:350843 26/61 (43%)

Return to query results.
Submit another query.