DRSC/TRiP Functional Genomics Resources

Protein Alignment osa and Arid1a

DIOPT Version: 9

Sequence 1: NP_001163639.1  Gene: osa / 42130  FlyBaseID: FBgn0261885  Length: 2716  Species: Drosophila melanogaster
Sequence 2: XP_006239073.1  Gene: Arid1a / 297867  RGDID: 1310500  Length: 2291  Species: Rattus norvegicus

Alignment Length: 2789  Identity: 775/2789 (27%)
Similarity: 1071/2789 (38%)  Gaps: 805/2789 (28%)
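
The percentages in this summary can be reproduced from the raw counts. The Python sketch below is illustrative only: the helper name `percent_floor` is hypothetical, and truncation (rather than rounding) is an assumption inferred from the displayed values, since 775/2789 is about 27.8% yet the page reports 27%.

```python
# Counts copied from the alignment summary above.
ALIGNMENT_LENGTH = 2789
COUNTS = {"identity": 775, "similarity": 1071, "gaps": 805}


def percent_floor(count, total):
    """Return a whole-number percentage, truncated toward zero (assumed behaviour)."""
    return count * 100 // total


summary = {name: percent_floor(n, ALIGNMENT_LENGTH) for name, n in COUNTS.items()}
print(summary)
```

Under that assumption the three values agree with the 27%, 38% and 28% shown in the summary.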


Residues shown in green on the original page have known domain annotations; the annotated regions are listed under Known Domains below.


  Fly     2 NEKIKSPQTQQQQQGG-APAPAATPPSAGAAPGAATP-PTSGPPTP------------------- 45
            :|..|:.|.|:::.|| |.|.||......||.|..:. |..|||.|                   
  Rat    21 SELKKAEQQQREEAGGEAAAAAAERGEMKAAAGQESEGPAVGPPQPLGKELQDGAESNGGGGGGG 85

  Fly    46 -NNNSNNGSDPSIQQQQQNVAPHP-------------------YGAP--------PPPGSGPGGP 82
             .:....|::|.::....|..|.|                   .|||        |||..|.|.|
  Rat    86 AGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEPPGGGGGSSDGVGAPPHSAAAALPPPAYGFGQP 150

  Fly    83 PGPDPAAVMH-----YHHLHQQQQQHPPPPHMQQQQHHGGPAPPPPGGAPEHAPGVKEEYTHLPP 142
            .|..|:||..     :|..|..||.    |.:...|..||      ||...:|...:..:.|..|
  Rat   151 YGRSPSAVAAAAAAVFHQQHGGQQS----PGLAALQSGGG------GGLEPYAGPQQNSHDHGFP 205

  Fly   143 PHPHPAYGRYHADPNMDPY------------RYGQP-----LPGGKPPQQQQPHPQQQ----PPQ 186
            .|   .|..|:  ||...|            |.|.|     ..|.|||..........    .||
  Rat   206 NH---QYNSYY--PNRSAYPPPPQAYALSSPRGGTPGSGAAAAGSKPPPSSSASASSSSSSFAPQ 265

  Fly   187 QPGP--GGSPNRPPQQRYIPGQPPQGPTPTLNSLLQSSNPPPPPQHRYANTY--DPQQ------- 240
            :.|.  ||.|:...     .|.|....|||||.||.|.:.....|......|  .||.       
  Rat   266 RFGAMGGGGPSAAG-----GGTPQPTATPTLNQLLTSPSSARGYQGYPGGDYGGGPQDGGAGKGP 325

  Fly   241 ----------AAASAAAAAA---AQQQQAGGPPPPGH----GPP---PPQH----------QPSP 275
                      |||:||||||   |||:....|..||.    |.|   .||.          :|.|
  Rat   326 ADMASQCWGAAAAAAAAAAASGGAQQRSHHAPMSPGSSGGGGQPLARTPQSSSPMDQMGKMRPQP 390

  Fly   276 YGG------QQGGWAPPPRPYSPQLGPSQQY--RTP---PPTNTSRGQS------------PY-- 315
            |||      |||   ||..|......|.|.|  :||   |.|...|.||            ||  
  Rat   391 YGGTNPYSQQQG---PPSGPQQGHGYPGQPYGSQTPQRYPMTMQGRAQSAMGSLSYAQQIPPYGQ 452

  Fly   316 --PPAHGQNSGSYPSSPQQQQQQQQQQQQQAGQQPGGPVPGGPP-----PGTGQQPPQQNTPPTS 373
              |.|:|| .|..|...||....|||||....|||....|...|     |.|.|...|.:.||.|
  Rat   453 QGPSAYGQ-QGQTPYYNQQSPHPQQQQQPPYSQQPPSQTPHAQPSYQQQPQTQQPQLQSSQPPYS 516

  Fly   374 QYSPYP--QRYPTP-PGLPAGGSNHRTAYSTHQYPEPNRPW-------PGGSSPSPGSGHPLP-P 427
            |....|  |:..|| |...:....|..:...:..|:...|:       |..|:.|..:.:|.| |
  Rat   517 QQPSQPPHQQSQTPYPSQQSTTQQHPQSQPPYSQPQAQAPYQQQQPQQPASSTLSQQAAYPQPQP 581

  Fly   428 ASPHHVPPLQQQPPPPPHVS--AGGPPPSSSPGHAPSPSPQPS-----QASPSPHQELIGQ---- 481
            .........||:.|||..:|  :.|...||:|....|...|..     |:.||...:|.|.    
  Rat   582 QQSQQTAYSQQRFPPPQELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDL 646

  Fly   482 ---NSNDSSSGGAHSGMGSGPPGTPNP-QQVMRPTPSPTGSSGSRSMSPAVAQNHPISRPASNQS 542
               .....|.|.:.||:.|......|| |....|..|| ...|.|..||:     |:..|||...
  Rat   647 PMGTEGALSPGVSTSGISSSQGEQSNPAQSPFSPHTSP-HLPGIRGPSPS-----PVGSPASVAQ 705

  Fly   543 SSGGPMQQPPVGAGGPPPMPP--------HPGMPGGPPQQQQSQQQQASNSASSASNSPQQTPPP 599
            |..||:....|.....||.||        ||.|......|.:...|:          :||.....
  Rat   706 SRSGPLSPAAVPGNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQR----------NPQMPQYT 760

  Fly   600 APPPNQGMNNMATPPPPPQGAAGGGYPMPPHMHGGYKMGGPGQSPGAQGYPPQQPQQYPPGNYPP 664
            :|.|...::        |:..:||      .||.|.   |..|......|.||..|..|.|.||.
  Rat   761 SPQPGSALS--------PRQPSGG------QMHSGM---GSYQQNSMGSYGPQGSQYGPQGGYPR 808

  Fly   665 RPQYPPGAYATGPPPPPTSQAGAGGANSMPSGAQAGGYPGRGMPNHTGQYPPYQWVPPSPQQTVP 729
            :|.|.....|..|     |...||..|.:.:|.|..|.||         .|||..:||       
  Rat   809 QPNYNALPNANYP-----SAGMAGSMNPLGAGGQMHGQPG---------IPPYGTLPP------- 852

  Fly   730 GGAPGGAMVGNHVQG--KGTPPPPVVGGP-PPPQGSGSPRPLNYLKQH-----LQHKGGYGGSPT 786
             |....|.:||...|  ....||.|..|. |||.|.......:.:..|     :|::        
  Rat   853 -GRMTHASMGNRPYGPNMANMPPQVGSGMCPPPGGMNRKTQESAVAMHVAANSIQNR-------- 908

  Fly   787 PPQGPQGYGNGPTGMHPGMPMGPPHHMGPPHGPTNMGPPTSTPPQSQMLQGGQPQGQGASGGPES 851
                |.||.|    |:.|..||    .|||:|                      ||..:..|.  
  Rat   909 ----PPGYPN----MNQGGIMG----TGPPYG----------------------QGINSMAGM-- 937

  Fly   852 GGPEHISQDNGISSSGPTGAAGMHAVTSVVTTGPDGTSMDEVSQQSTLSNASAASGEDPQC---- 912
                       |:..||....|                       .|::|.||.....|:.    
  Rat   938 -----------INPQGPPYPMG-----------------------GTMANNSAGMAASPEMMGLG 968

  Fly   913 ---TTPKSRKNDPYSQSHLAPPSTSPHPVVMHPGGGPGEEYDMSSPPNWPRPAGSPQVFNHVPVP 974
               .||.::.|:                                      :..|:|:.       
  Rat   969 DVKLTPATKMNN--------------------------------------KADGTPKT------- 988

  Fly   975 QEPFRSTITTTKKSDSLCKLYEMDDNPDRRGWLDKLRAFMEERRTPITACPTISKQPLDLYRLYI 1039
            :...:.:.::|..::.:.||||:...|:|:.|:|:..||.||:...:|..|.:.::||||||||:
  Rat   989 ESKSKKSSSSTTTNEKITKLYELGGEPERKMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYV 1053

  Fly  1040 YVKERGGFVEVTKSKTWKDIAGLLGIGASSSAAYTLRKHYTKNLLTFECHFDRGDIDPLPIIQQV 1104
            .|||.||..:|.|:|.|:::|..|.:|.|||||.:|:|.|.:.|..|||..:||: ||.|.|  .
  Rat  1054 SVKEIGGLTQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQCLYAFECKIERGE-DPPPDI--F 1115

  Fly  1105 EAGSKKKTAKAASVPSPGGGHLDAG--TTNSTGSSNSQ--DSFPAPPGSAPNAAIDGYPG----- 1160
            .|...||:......|||.|.....|  |..||.||.::  |..|..|.|.|::.|...||     
  Rat  1116 AAADSKKSQPKIQPPSPAGSGSMQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSRSN 1180

  Fly  1161 -------YPGGSPYPVASGPQPDYATAGQMQRPPSQNNPQTPHPGAAAAVAAGDNISVSN--PFE 1216
                   :|.||        :|.:          .:.|..||:||...::...|.:...:  |.:
  Rat  1181 SVGIQDAFPDGS--------EPTF----------QKRNSMTPNPGYQPSMNTSDMMGRMSYEPNK 1227

  Fly  1217 DPIAAGG---GPGSGTGPGPGQGPGPGAASGGAGAVGAVGGGPQPHPPPPHSPHTAAQQAAGQHQ 1278
            ||.  ||   .|||......||||     :||.|              .|:|      :|||   
  Rat  1228 DPY--GGMRKAPGSDPFMSSGQGP-----NGGMG--------------DPYS------RAAG--- 1262

  Fly  1279 QQHPQHQHPGLPGPPPPQQQQGQQGQQPPPSVGGGPPPAPQQHGP-GQVPPSPQQHVR--PAAGA 1340
                    |||                      |.....|:||.| |    .|...||  |..| 
  Rat  1263 --------PGL----------------------GNVAMGPRQHYPYG----GPYDRVRTEPGIG- 1292

  Fly  1341 PYPPG--GSGYPTPVSRTPGSPYPSQPGAYGQYGSSDQYNATGPPGQPFGQGPGQYPPQNRNMYP 1403
              |.|  |:|.|.|      :..||.|.: |.|                  .|.:||||.:....
  Rat  1293 --PEGNMGTGAPQP------NLMPSTPDS-GMY------------------SPSRYPPQQQQQQQ 1330

  Fly  1404 PYGPEGEAPPTGANQYGPYGSRPYSQPPPGGPQPPTQTVAGGPPAGGAPGAPPSSAYPTGRPS-- 1466
            ....:.:......:.|              |.|..||            |.|.||.:|:.:.:  
  Rat  1331 QQQQQQQQQQQRHDSY--------------GNQFSTQ------------GTPSSSPFPSQQTTMY 1369

  Fly  1467 ---QQDYYQPPPD--QSPQPRRHPDFIKDSQPYPGYNARPQIYGAWQSGTQQYRPQYPSSPAPQN 1526
               ||..|:.|.|  ..|..:||...:. |.||               .|.|.:||....||.|:
  Rat  1370 QQQQQQNYKRPMDGTYGPPAKRHEGELY-SVPY---------------STGQGQPQQQQLPAAQS 1418

  Fly  1527 WGGAPPRGAAPPPGAPHGPPIQQ----------PAGVAQWDQHRYPPQQGPPPPPQQQQQPQQQQ 1581
            ...:.|:.|.|.|        ||          ||......:.|  |..||             |
  Rat  1419 QSASQPQAAQPSP--------QQDVYNQYSNAYPATATAATERR--PAGGP-------------Q 1460

  Fly  1582 QQPPYQ----QVAGPPGQQPPQ-APPQWAQMNPGQTAQSGIAPPGSPLRPPSGPGQQNRM----P 1637
            .|.|:|    :|:.|||....| .|||..               |.|::..:...||..|    .
  Rat  1461 NQFPFQFGRDRVSAPPGSNAQQNMPPQMM---------------GGPIQASAEVAQQGTMWQGRN 1510

  Fly  1638 GMPAQQQQSQQQGGVPQPP--------------PQQASHGGV-PSPGL--PQVGPGGMVKPPYAM 1685
            .|.......|..|..||.|              .|:|:|.|. ||.|.  |..||...| ||...
  Rat  1511 DMTYNYANRQNTGSAPQGPAYHGVNRTDEMLHTDQRANHEGPWPSHGTRQPPYGPSAPV-PPMTR 1574

  Fly  1686 PPPPSQGVGQQVGQGPPGGMMS-----QKPPPMPGQAMQQQPLQQQPPSHQHPHPHQHPQHQHP- 1744
            |||.:        ..||..|.:     ..|.|:|      :|::.:....:.|..|...:.|.. 
  Rat  1575 PPPSN--------YQPPPSMQNHIPQVSSPAPLP------RPMENRTSPSKSPFLHSGMKMQKAG 1625

  Fly  1745 HQMPPNQTAPGGYGPPGMPGGGAQLVKKELIFPHDSVESTTPVLYRRKRLMKADVCPVDPWRIFM 1809
            ..:|.:..||....||        ::::::.||..|||:|.|||.:|:||...|:...:.||:.|
  Rat  1626 PPVPASHIAPTPVQPP--------MIRRDITFPPGSVEATQPVLKQRRRLTMKDIGTPEAWRVMM 1682

  Fly  1810 AMRSGLLTECTWALDVLNVLLFDDSTVQFFGISNLPGLLTLLLEHFQKNLAEMFDERENEEQSAL 1874
            :::||||.|.|||||.:|:||:||:::..|.:|.|||||.||:|:|::.|.|:|         .:
  Rat  1683 SLKSGLLAESTWALDTINILLYDDNSIMTFNLSQLPGLLELLVEYFRRCLIEIF---------GI 1738

  Fly  1875 LAEDADDDADSGTVMCEKLRTSGRQPRCVRSISSYNRRRHYENMDRSGKDGAGNGSDSEDADEGI 1939
            |.|....|....|::     ..||..: |.|.:........|::|...:         |:.:||:
  Rat  1739 LKEYEVGDPGQRTLL-----DPGRLTK-VSSPAPTEEEEEEEHVDPKLE---------EEEEEGV 1788

  Fly  1940 D-------LGQVRVQPNPEERSLLLSFTPNYTMVTRKGVPVRIQPAENDIFV----DERQKAWDI 1993
            :       ||:.:......|..|:..|..         :||:| ..:||.||    |:..:..:.
  Rat  1789 ENDEEMAFLGKDKPPSEKSEEKLVSKFDK---------LPVKI-VQKNDPFVVDCSDKLGRVQEF 1843

  Fly  1994 DTNRLYEQLEPVGSDAWTYGFTEPDPLDGIIDVFKSEIVNIPFARYIRSDKKGRKRTELASSSRK 2058
            |:..|:          |..|  ..|..:.|...|:|:|..:|....:.|....||..        
  Rat  1844 DSGLLH----------WRIG--GGDTTEHIQTHFESKIELLPSRPCVPSPVPPRKHV-------- 1888

  Fly  2059 PEIKTEENSTEEQTFNKKRRLVSGGSSSSGAHAEG---KKSKLTSEEFAQPNAEVKKEPGTADSD 2120
                    :|.|.|        .|.:...|...:|   |:...|.::.....:....:.||..::
  Rat  1889 --------TTVEGT--------PGTTEQEGPPPDGLPEKRITATMDDMLSTRSSTLTDEGTKSTE 1937

  Fly  2121 CRPVDMDIEAPQQRLTNGVAPCSSTPAIFDPRTTAKDEARVLQRRRDSSFEDECYTRDEASLHLV 2185
            .       .....:...|::|..|                   .|.....|||.:::||..|..:
  Rat  1938 A-------NKESSKFPFGISPAQS-------------------HRNIKILEDEPHSKDETPLCTL 1976

  Fly  2186 SESQDSLARRCIALSNIFRNLTFVPGNETVLAKSTRFLAVLGRLLLLNHEHLRRTPKTRNYDREE 2250
            .:.|||||:||:.:||..|:|:|||||:..::|....|.:||:|:||:|:|..|......|::||
  Rat  1977 LDWQDSLAKRCVCVSNAIRSLSFVPGNDFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEE 2041

  Fly  2251 DTDFSDSCSSLQGEREWWWDYLITIRENMLVAMANIAGHLELSRYDELIARPLIDGLLHWAVCPS 2315
            :.|...||..:    |||||.|..:|||.||.:|||:|.|:||.|.|.|..|::|||||||||||
  Rat  2042 EQDQGVSCDKV----EWWWDCLEMLRENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPS 2102

  Fly  2316 AHGQDPFPSCGPNSVLSPQRLALEALCKLCVTDANVDLVIATPPFSRLEKLCAVLTRHLCRNEDQ 2380
            |..||||.:.|||:|||||||.||.|.||.:.|.||||::|||||||||||.:.:.|.|...::.
  Rat  2103 AEAQDPFSTLGPNAVLSPQRLVLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKNP 2167

  Fly  2381 VLREFSVNLLHYLAAADSAMARTVALQSPCISYLVAFIEQAEQTALGVANQHGINYLRENPDSMG 2445
            |.||.:|.||..||..||..||.:|:|...|..|:.|:|.:........:|..:.:: ::|....
  Rat  2168 VCREMAVVLLANLAQGDSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHM-QSPPFEP 2231

  Fly  2446 TSLDMLRRAAGTLLHLAKHPDNRSLFMQQEQRLLGLVMSHILDQQVALIISRVLYQVSR 2504
            ||:||:||||..||.|||..:|.|.|...|.|||.:.:|.:::..|:.:|..||:.:.:
  Rat  2232 TSVDMMRRAARALLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLFLIGQ 2290
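
The identity and gap counts reported for the alignment are column-wise tallies over the aligned rows. As a rough sketch of that tally, the Python below counts identical and gapped columns for the first 17 columns of the first alignment block. The function name `column_stats` is hypothetical, and the sketch does not attempt to reproduce the similarity count, which depends on a substitution matrix that this page does not state.

```python
def column_stats(seq_a, seq_b, gap="-"):
    """Count identical columns, gapped columns and total columns for two aligned rows."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(seq_a, seq_b))
    # A column is identical when both rows carry the same residue (not a gap).
    identical = sum(1 for a, b in pairs if a == b and a != gap)
    # A column is gapped when either row carries the gap symbol.
    gaps = sum(1 for a, b in pairs if a == gap or b == gap)
    return identical, gaps, len(pairs)


# First 17 columns of the first block (Fly residues 2 onward, Rat residues 21 onward).
fly = "NEKIKSPQTQQQQQGG-"
rat = "SELKKAEQQQREEAGGE"
stats = column_stats(fly, rat)
print(stats)
```

Applying the same tally to every block and summing the results should give the alignment-wide identity and gap counts, provided the rows are copied without losing columns.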

Known Domains:


The regions listed here are the residues highlighted in green in the alignment on the original page.

Gene    Sequence        Domain   Region      External ID  Identity
osa     NP_001163639.1  BRIGHT   1001..1092  CDD:128777   42/90 (47%)
                        DUF3518  2191..2451  CDD:288854   124/259 (48%)
Arid1a  XP_006239073.1  DUF3498  592..>858   CDD:288827   87/320 (27%)
                        BRIGHT   1015..1106  CDD:128777   42/90 (47%)
                        DUF3518  1982..2237  CDD:288854   124/259 (48%)


Information from Original Tools:


                                                Original Tool Information
Tool            Simple Score  Weighted Score  BLAST Result  Score  Score Type        Cluster ID
Compara         1             0.930           -             -                        C166349542
Domainoid       1             1.000           252           1.000  Domainoid score   I2025
eggNOG          1             0.900           -             -                        E1_KOG2510
Hieranoid       1             1.000           -             -
Homologene      0             0.000           Not matched by this tool.
Inparanoid      1             1.050           604           1.000  Inparanoid score  I902
OMA             1             1.010           -             -                        QHG46609
OrthoDB         1             1.010           -             -                        D135670at33208
OrthoFinder     1             1.000           -             -                        FOG0002548
OrthoInspector  1             1.000           -             -                        otm45740
orthoMCL        1             0.900           -             -                        OOG6_104640
Panther         1             1.100           -             -      LDO               PTHR12656
Phylome         0             0.000           Not matched by this tool.
SonicParanoid   1             1.000           -             -                        X2249
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
Total           12            11.900
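
The final row of the tool table (12 and 11.900) is consistent with the column sums of the simple and weighted scores. The Python sketch below checks this from the per-tool values listed above. The variable names are hypothetical, and `Decimal` is used only so that the three-decimal weights add up exactly.

```python
from decimal import Decimal

# (simple score, weighted score) per tool, copied from the table above.
SCORES = {
    "Compara": (1, "0.930"),
    "Domainoid": (1, "1.000"),
    "eggNOG": (1, "0.900"),
    "Hieranoid": (1, "1.000"),
    "Homologene": (0, "0.000"),
    "Inparanoid": (1, "1.050"),
    "OMA": (1, "1.010"),
    "OrthoDB": (1, "1.010"),
    "OrthoFinder": (1, "1.000"),
    "OrthoInspector": (1, "1.000"),
    "orthoMCL": (1, "0.900"),
    "Panther": (1, "1.100"),
    "Phylome": (0, "0.000"),
    "SonicParanoid": (1, "1.000"),
    "SwiftOrtho": (0, "0.000"),
    "TreeFam": (0, "0.000"),
}

simple_total = sum(simple for simple, _ in SCORES.values())
weighted_total = sum(Decimal(weighted) for _, weighted in SCORES.values())
print(simple_total, weighted_total)
```

Twelve of the sixteen tools report a match, which accounts for the simple total of 12.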
