DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment htk and osa

DIOPT Version :10

Sequence 1:NP_573330.3 Gene:htk / 32877 FlyBaseID:FBgn0085451 Length:2486 Species:Drosophila melanogaster
Sequence 2:NP_524392.2 Gene:osa / 42130 FlyBaseID:FBgn0261885 Length:2716 Species:Drosophila melanogaster


Alignment Length:2387 Identity:447/2387 - (18%)
Similarity:662/2387 - (27%) Gaps:849/2387 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly   424 TETISKLAQ------PNQTNVVASTSSSAAAASVAASST-------------------------- 456
            |.|::.|.|      |.|.....:.....||||.||::.                          
  Fly   212 TPTLNSLLQSSNPPPPPQHRYANTYDPQQAAASAAAAAAAQQQQAGGPPPPGHGPPPPQHQPSPY 276

  Fly   457 ---------PARAVS----TASQSAAEESGNTSESSVVVEPPKKQRKGSAASSQQGKVKSLVEKY 508
                     |.|..|    .:.|.......|||.......|...|..||..||.|.:.:...::.
  Fly   277 GGQQGGWAPPPRPYSPQLGPSQQYRTPPPTNTSRGQSPYPPAHGQNSGSYPSSPQQQQQQQQQQQ 341

  Fly   509 EE--------------KSTAVQATSSGTVAGSGASASAAAMPTTSAAASTATNLSSATS------ 553
            ::              ..|..|.....|...|..|......||.....:..:|..:|.|      
  Fly   342 QQAGQQPGGPVPGGPPPGTGQQPPQQNTPPTSQYSPYPQRYPTPPGLPAGGSNHRTAYSTHQYPE 406

  Fly   554 ------GGSATIATTAMNKDAESDLPLAKIKAAAVAAASTRH--SMEKE----TNISSGSSASAS 606
                  |||        :....|..||        ..||..|  .::::    .::|:|....:|
  Fly   407 PNRPWPGGS--------SPSPGSGHPL--------PPASPHHVPPLQQQPPPPPHVSAGGPPPSS 455

  Fly   607 SKANSAEMQRSRDASPSVAAP-------------PSAGASTSGAAATAQTASNKKEKHQRSKQAD 658
            |..::.    |....||.|:|             .|:|.:.||..:......|.::..       
  Fly   456 SPGHAP----SPSPQPSQASPSPHQELIGQNSNDSSSGGAHSGMGSGPPGTPNPQQVM------- 509

  Fly   659 KEKDKDKEEKQASSGKRKKEKISVEKIDTGDFVVGIGDKLKVNYHEKKSPSSHGSTYEAKVIEIS 723
                :.......|||.|.......:.                  |....|:|:.|:....:.:..
  Fly   510 ----RPTPSPTGSSGSRSMSPAVAQN------------------HPISRPASNQSSSGGPMQQPP 552

  Fly   724 VQRGVPMYLVHYTGWNNRYDEWVPRERIAENLTKGSKQKTRTISTSSANSGSGGG---------- 778
            |..|.|..:..:.|                 :..|..|:.::....::||.|...          
  Fly   553 VGAGGPPPMPPHPG-----------------MPGGPPQQQQSQQQQASNSASSASNSPQQTPPPA 600

  Fly   779 ------------------GGGGGG--------GGGGGGGGGGSLLVQGSQPPGVSDKQPGKDGCS 817
                              |..|||        ||...||.|.|...||..|.......||.....
  Fly   601 PPPNQGMNNMATPPPPPQGAAGGGYPMPPHMHGGYKMGGPGQSPGAQGYPPQQPQQYPPGNYPPR 665

  Fly   818 KMSPSSGNSTGPGAPSLS-GSLGGASSTPSLLSTVVKTPPTGGAKRGRGRSDSMP---------- 871
            ...|....:|||..|..| ...|||:|.||            ||:.|......||          
  Fly   666 PQYPPGAYATGPPPPPTSQAGAGGANSMPS------------GAQAGGYPGRGMPNHTGQYPPYQ 718

  Fly   872 -----PRSTTPSSVVAHSGRTKSPAASQPQLQQQMKKRPTRVVPGTTTPRRVSDASMASESDSDS 931
                 |:.|.|..         :|..:......|.|..|...|.|...|          ...|.|
  Fly   719 WVPPSPQQTVPGG---------APGGAMVGNHVQGKGTPPPPVVGGPPP----------PQGSGS 764

  Fly   932 DEPVRRPKRQSAKDKPQAGKAQPP------GKGRLASSASSTAPAAH----PSDDSEEDEEEEEP 986
            ..|:...|:.........|...||      |.|.............|    |...:........|
  Fly   765 PRPLNYLKQHLQHKGGYGGSPTPPQGPQGYGNGPTGMHPGMPMGPPHHMGPPHGPTNMGPPTSTP 829

  Fly   987 SAARAASSKQQQQQASSLRGSRAGG-------NRAMSSGAASAKGRDYDLSEIRSELKG------ 1038
            ..::.....|.|.|.:| .|..:||       |...|||...|.|.....|.:.:...|      
  Fly   830 PQSQMLQGGQPQGQGAS-GGPESGGPEHISQDNGISSSGPTGAAGMHAVTSVVTTGPDGTSMDEV 893

  Fly  1039 FQPKLLTN--AASNEERKDLAKKEPSDEPALQDIKKEPKLESSAKSSSTELSSETESYADEDSQS 1101
            .|...|:|  |||.|:.:....|...::|                        .::|:....|.|
  Fly   894 SQQSTLSNASAASGEDPQCTTPKSRKNDP------------------------YSQSHLAPPSTS 934

  Fly  1102 SDYRKQLKGSGAGKK----------EPTASPSKLHHEPVTKRELAVKEEPLKIEPKTEPKEEETK 1156
            ........|.|.|::          .|..||...:|.|       |.:||.: ...|..|:.::.
  Fly   935 PHPVVMHPGGGPGEEYDMSSPPNWPRPAGSPQVFNHVP-------VPQEPFR-STITTTKKSDSL 991

  Fly  1157 SKPFLSGADIKPTALIAPARFGNTSAAQAASSSGSSSTTAKYTSVIVEKPLTI----------GG 1211
            .|.:....:        |.|.|.....:|......:..||..|  |.::||.:          ||
  Fly   992 CKLYEMDDN--------PDRRGWLDKLRAFMEERRTPITACPT--ISKQPLDLYRLYIYVKERGG 1046

  Fly  1212 KKSVEQHVPKKAELLK----KQSGGAGGTAASSSAASQESKKFAE-----------------PV- 1254
                      ..|:.|    |...|..|..||||||....|.:.:                 |: 
  Fly  1047 ----------FVEVTKSKTWKDIAGLLGIGASSSAAYTLRKHYTKNLLTFECHFDRGDIDPLPII 1101

  Fly  1255 -----ASLKVEMPAACSPS--------------SSSSSSSSF-CSTGSAVSSS-SATRSLPDMSK 1298
                 .|.|....||..||              .||:|..|| ...|||.::: ......|..|.
  Fly  1102 QQVEAGSKKKTAKAASVPSPGGGHLDAGTTNSTGSSNSQDSFPAPPGSAPNAAIDGYPGYPGGSP 1166

  Fly  1299 LEISSGTVPG-ATPGAAALQPS-NVAQVSSSGAAKESKYSSSGGAAAGSGISMRKLLSSDVYEFK 1361
            ..::||..|. ||.|.....|| |..|....|||        ...|||..||:            
  Fly  1167 YPVASGPQPDYATAGQMQRPPSQNNPQTPHPGAA--------AAVAAGDNISV------------ 1211

  Fly  1362 DTEPFEFEKRISPMASVGGTVAAAATAAGAMAAAGAASVITTMVPTPGASGSGVAASASTSAPVV 1426
             :.|||     .|:|:.||..:......|.....||||        .||...|.........|..
  Fly  1212 -SNPFE-----DPIAAGGGPGSGTGPGPGQGPGPGAAS--------GGAGAVGAVGGGPQPHPPP 1262

  Fly  1427 VGS--AARKQALKASAIQHILEHQSPAAGRERGGYGGMTSSISLLTAPKLKKRGSPLKEAALCME 1489
            ..|  .|.:||......|| .:||.|       |..|        ..|..:::|           
  Fly  1263 PHSPHTAAQQAAGQHQQQH-PQHQHP-------GLPG--------PPPPQQQQG----------- 1300

  Fly  1490 KAKTYKLDKDQVQQP-------------------VEQKTQQLIQPTLG-PVESYGAGSPEAMSNL 1534
                     .|.|||                   |....||.::|..| |....|:|.|..:|..
  Fly  1301 ---------QQGQQPPPSVGGGPPPAPQQHGPGQVPPSPQQHVRPAAGAPYPPGGSGYPTPVSRT 1356

  Fly  1535 ---STPTTSSGHAQLSASSTYSQLTPHHATPFDALRKSPSFNLNITALNEELAQTVQETTRALTD 1596
               ..|:....:.|..:|..|                                           :
  Fly  1357 PGSPYPSQPGAYGQYGSSDQY-------------------------------------------N 1378

  Fly  1597 ALQPPSTPVPPTVSASGSTPTAVTPVITAM-PTAGTAGLSCPSTPPTGAN---PVLASPKLSTPP 1657
            |..||..|.       |..|....|....| |..|..|    ..||||||   |..:.|....||
  Fly  1379 ATGPPGQPF-------GQGPGQYPPQNRNMYPPYGPEG----EAPPTGANQYGPYGSRPYSQPPP 1432

  Fly  1658 QAINASKQPPASHIVGSPFIETRNVFELSTSNEGSGYSSGESKDNKFEKLENVKILLAGAVGPFD 1722
                ...|||...:.|.|                                               
  Fly  1433 ----GGPQPPTQTVAGGP----------------------------------------------- 1446

  Fly  1723 LDAGSYEQKTSSIADKVLKAISQKKEEVENNKGKPPVEPAAAAKTPGREDVVPSPTAKLDLCSSA 1787
             .||                            |.|...|::|..| ||                 
  Fly  1447 -PAG----------------------------GAPGAPPSSAYPT-GR----------------- 1464

  Fly  1788 IKLDTLKLLSEPLKIQTGPLLGELYRPGPATSSPETK---SILESSLPAKNSELSETIQKLECAI 1849
                              |...:.|:| |...||:.:   ..::.|.|.........|.....:.
  Fly  1465 ------------------PSQQDYYQP-PPDQSPQPRRHPDFIKDSQPYPGYNARPQIYGAWQSG 1510

  Fly  1850 QQRKTPVGGALSLASSTAGTPPTAHTPNSTATAG------AGFSDESMDSTDSEQRLVIEDVIAE 1908
            .|:..|...:.....:..|.||....|...|..|      ||.:.                  .:
  Fly  1511 TQQYRPQYPSSPAPQNWGGAPPRGAAPPPGAPHGPPIQQPAGVAQ------------------WD 1557

  Fly  1909 EHTTTTTTGEQKSPGSQQEEGQTNTATMATAVTAETEGTSSSTPTPPVPAPVKLELGAKAMPGIQ 1973
            :|......|....|..||:..|.........|.    |.....|....|...::..|..|..||.
  Fly  1558 QHRYPPQQGPPPPPQQQQQPQQQQQQPPYQQVA----GPPGQQPPQAPPQWAQMNPGQTAQSGIA 1618

  Fly  1974 AP-IPLKPAEGSSFAGKLTGVVQPPPLAKLQQQEELGQAQGQSKSISSFGIPIVLPEIPASVVVA 2037
            .| .||:|..|.....::.|:    |..:.|.|::.|..|...:..|..|:|             
  Fly  1619 PPGSPLRPPSGPGQQNRMPGM----PAQQQQSQQQGGVPQPPPQQASHGGVP------------- 1666

  Fly  2038 TPALMVASAATVMRHSPGSTPNSQAPVLISPKTFLTAPKVGEVGGLLLKAYT-------GVSGAV 2095
                           |||                  .|:||. ||::...|.       ||...|
  Fly  1667 ---------------SPG------------------LPQVGP-GGMVKPPYAMPPPPSQGVGQQV 1697

  Fly  2096 GAAPAGSVIASAGGGGTPTVISNFPQQQQQASVLQASSEMPANYVLDAEMAEPSNSSAPVVARPF 2160
            |..|.|.:::..........:...|.|||..|...........:....:|  |.|.:||....|.
  Fly  1698 GQGPPGGMMSQKPPPMPGQAMQQQPLQQQPPSHQHPHPHQHPQHQHPHQM--PPNQTAPGGYGPP 1760

  Fly  2161 VMHAVGKELFNVVAPAASSPLISDPHN--ESI--------NLLCEETIPGSP-----APNYGGTD 2210
            .|...|.:|       ....||. ||:  ||.        .|:..:..|..|     |...|...
  Fly  1761 GMPGGGAQL-------VKKELIF-PHDSVESTTPVLYRRKRLMKADVCPVDPWRIFMAMRSGLLT 1817

  Fly  2211 QLPTAV---------NSALAIVGITPLPPDTPPPVPVAAIQQQQQHQLQSGQQQQAAKTGEVNPS 2266
            :...|:         :|.:...||:.|     |.:....::..|::..:...:::          
  Fly  1818 ECTWALDVLNVLLFDDSTVQFFGISNL-----PGLLTLLLEHFQKNLAEMFDERE---------- 1867

  Fly  2267 GPNSSPDSASQDESGEEAKKNAGLEHEDSTGLGALNKRKRNRKPLPMSAQTAAAVAAQLQSLGKR 2331
              |....:...:::.::|         ||..:.....|...|:|     :...::::.     .|
  Fly  1868 --NEEQSALLAEDADDDA---------DSGTVMCEKLRTSGRQP-----RCVRSISSY-----NR 1911

  Fly  2332 RRQVSGMRNQKTTTTAAGSDTDDNSDNIAPNAGQQQQQPRQSARQHMM----------------- 2379
            ||....|..........|||::|..:.|  :.||.:.||....|..::                 
  Fly  1912 RRHYENMDRSGKDGAGNGSDSEDADEGI--DLGQVRVQPNPEERSLLLSFTPNYTMVTRKGVPVR 1974

  Fly  2380 ---PQSNAQQQQQQQLLQQQQTQLPQGIQSTNSSGVRPCPYNFLVELDPALSSDECITILRKQIQ 2441
               .:::....::|:.......:|.:.::...|..   ..|.| .|.||.   |..|.:.:.:|.
  Fly  1975 IQPAENDIFVDERQKAWDIDTNRLYEQLEPVGSDA---WTYGF-TEPDPL---DGIIDVFKSEIV 2032

  Fly  2442 DL---RKAYNTIKG----ELAVIDRRRKKLRRREREKKQQQLQSQQQ 2481
            ::   |...:..||    ||| ...|:.:::..|...::|....:::
  Fly  2033 NIPFARYIRSDKKGRKRTELA-SSSRKPEIKTEENSTEEQTFNKKRR 2078

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
htkNP_573330.3 Tudor_ARID4_rpt1 10..60 CDD:410460
Tudor_ARID4_rpt2 65..118 CDD:410461
RBB1NT 170..263 CDD:462390
ARID 296..381 CDD:460187
CBD_RBP1_like 698..758 CDD:350843 8/59 (14%)
osaNP_524392.2 PHA03247 <640..977 CDD:223021 82/399 (21%)
ARID_ARID1A-like 1001..1093 CDD:350629 24/103 (23%)
BAF250_C 2191..2451 CDD:463439
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.