DRSC/TRiP Functional Genomics Resources



Protein Alignment sim and Arnt2

DIOPT Version: 10

Sequence 1: NP_524340.2   Gene: sim / 41612     FlyBaseID: FBgn0004666   Length: 688   Species: Drosophila melanogaster
Sequence 2: NP_031514.3   Gene: Arnt2 / 11864   MGIID: 107188            Length: 712   Species: Mus musculus


Alignment Length: 713   Identity: 160/713 (22%)
Similarity: 270/713 (37%)   Gaps: 167/713 (23%)
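
The summary figures above are counts over alignment columns divided by the alignment length. As a minimal sketch of that arithmetic (not DIOPT's own code), the Python below recomputes the percentages from the reported counts and shows how identical and gapped columns could be counted from two aligned rows. The names `alignment_stats` and `percent` are hypothetical helpers introduced here for illustration. Similarity is not recomputed from sequence, because it depends on a substitution matrix that this page does not name.

```python
def alignment_stats(row_a: str, row_b: str, gap: str = "-") -> dict:
    """Count identical and gapped columns in two equal-length aligned rows.

    Hypothetical helper for illustration; not part of DIOPT.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    # A column is identical when both rows hold the same non-gap residue.
    identical = sum(1 for a, b in zip(row_a, row_b) if a == b and a != gap)
    # A column is gapped when either row holds the gap character.
    gaps = sum(1 for a, b in zip(row_a, row_b) if a == gap or b == gap)
    return {"length": len(row_a), "identical": identical, "gaps": gaps}


def percent(count: int, total: int) -> float:
    """Return count/total as a percentage (unrounded)."""
    return 100.0 * count / total


# Counts as reported on this page, over an alignment length of 713.
for label, count in (("Identity", 160), ("Similarity", 270), ("Gaps", 167)):
    print(f"{label}: {count}/713 = {percent(count, 713):.1f}%")

# Toy aligned pair (made up, not taken from the alignment below).
toy = alignment_stats("MK-LV", "MKALI")
print(toy)
```

Applied to the full concatenated Fly and Mouse rows, `alignment_stats` should give the identity and gap counts for the whole alignment. Note that 270/713 is about 37.9%, so the 37% shown for similarity looks truncated rather than rounded; that is an inference from the arithmetic, not something the page states.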


- Residues shown in green have known domain annotations, which are detailed below.


  Fly    26 KEKSKNAARTRREKENTEFCELAKLLPLPAAITSQLDKASVIRLTTSYLK-MRQVFPDGLGEAWG 89
            :|......|.||.|......||:.::|..:|:..:.||.:::|:..|::| ||     |.|....
Mouse    65 RENHSEIERRRRNKMTQYITELSDMVPTCSALARKPDKLTILRMAVSHMKSMR-----GTGNKST 124

  Fly    90 SSPAMQRGATIKELGSHLLQTLDGFIFVVAPD-GKIMYISETASVHLGLSQVELTGNSIFEYIHN 153
            .........|.:||...:|:..|||:||||.: |:::|:|::.:..|...|.|..|::::|.:|.
Mouse   125 DGAYKPSFLTEQELKHLILEAADGFLFVVAAETGRVIYVSDSVTPVLNQPQSEWFGSTLYEQVHP 189

  Fly   154 YDQDEMNAILSLHPHINQHPLAQTHTPIGSPNGVQHPSAYDHDRGSHTIEIEKTFFLRMKC---- 214
            .|.:::...|....:.....:....|......|.|  |:.....||     .::|..||:|    
Mouse   190 DDVEKLREQLCTSENSMTGRILDLKTGTVKKEGQQ--SSMRMCMGS-----RRSFICRMRCGNAP 247

  Fly   215 ----------VLAKR---NAGLTTSG---FKVIHCSGYLKARIYP---------DRGDGQGSLIQ 254
                      .:.||   ..|....|   :.|:||:||:||  :|         |...||||   
Mouse   248 LDHLPLNRITTMRKRFRNGLGPVKEGEAQYAVVHCTGYIKA--WPPAGMTIPEEDADVGQGS--- 307

  Fly   255 NLGLVAVGH-SLPSSAI----------TEIKLHQNMFMFRAKLDMKLIFFDARVSQLTGYEPQDL 308
            ...|||:|. .:.||.:          ||       |:.|...|..:.|.|.|...:.||:||||
Mouse   308 KYCLVAIGRLQVTSSPVCMDMSGMSVPTE-------FLSRHNSDGIITFVDPRCISVIGYQPQDL 365

  Fly   309 IEKTLYQYIHAADIMAMRCS-HQILLYKGQVTTKYYRFLTKGGGWVWVQSYATLVHNSRSSREVF 372
            :.|.:.::.|..|...:|.| .|::..||||.:..|||.||...|:.:::.:....|..|....:
Mouse   366 LGKDILEFCHPEDQSHLRESFQQVVKLKGQVLSVMYRFRTKNREWLLIRTSSFTFQNPYSDEIEY 430

  Fly   373 IVSVNYVLSEREVKDLVLNEIQTGVVKREPISPAAQAAQAAQAAQAAQAAQAAQAAHVAQAVQAQ 437
            ::..|     ..||.|...:.:..|.:|:.:| :...:|.......|...:|.::...|.|:.:|
Mouse   431 VICTN-----TNVKQLQQQQAELEVHQRDGLS-SYDLSQVPVPNLPAGVHEAGKSVEKADAIFSQ 489

  Fly   438 ----------VVVVPQQSVVVQPQCAGATGQPVGPGTPVSLALSASPKLDPYFEPELPLQPAVTP 492
                      ..:...:..::....|..:.|....|:|.....|..           ....:|..
Mouse   490 ERDPRFAEMFAGISASEKKMMSSASASGSQQIYSQGSPFPAGHSGK-----------AFSSSVVH 543

  Fly   493 VPPTNNSSSSSNNNNGVWHHHHVQQQQQSGSMDHDSLSYTQLYPPLNDLVVSSSSSVGGGTASSA 557
            ||..|:..|||:....:        .|.|..::...:::|...||.                  .
Mouse   544 VPGVNDIQSSSSTGQNI--------SQISRQLNQGQVAWTGSRPPF------------------P 582

  Fly   558 GGGSSASASSSGVYSTEMQYPDTTTGNLYYNNNNHYYYDYDATVDVATSMIRPFSANSNSCSSSS 622
            |..|...:|:.|:.|:                                   .|:.|:.:|.|..|
Mouse   583 GQPSKTQSSAFGIGSS-----------------------------------HPYPADPSSYSPLS 612

  Fly   623 ESERQLSTGNA--SIVNET----------SPSQTTYSDLSHNFELSYFSDNSSQQHQHQQQQQ 673
            .......:|||  |:.|.|          ...|...|::...::..:....|.:||.|||..|
Mouse   613 SPAASSPSGNAYPSLANRTPGFAESGQSGGQFQGRPSEVWSQWQSQHHGQQSGEQHSHQQPGQ 675

Known Domains:


Indicated by green residues in the alignment.

Gene   Sequence     Domain                                             Region    External ID  Identity
sim    NP_524340.2  bHLH-PAS_dSIM_like                                 25..86    CDD:381583   17/60 (28%)
                    PAS                                                102..215  CDD:395786   31/127 (24%)
                    PAS_3                                              291..377  CDD:430001   27/86 (31%)
Arnt2  NP_031514.3  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  36..73    -            1/7 (14%)
                    bHLH-PAS_ARNT                                      62..124   CDD:381517   18/63 (29%)
                    PAS                                                137..243  CDD:395786   30/112 (27%)
                    PAS_11                                             336..436  CDD:464214   32/111 (29%)
                    Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  573..712  -            28/156 (18%)
