DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG1571 and Dnai2

DIOPT Version :10

Sequence 1:NP_572435.1 Gene:CG1571 / 31725 FlyBaseID:FBgn0029993 Length:651 Species:Drosophila melanogaster
Sequence 2:NP_001030050.2 Gene:Dnai2 / 432611 MGIID:2685574 Length:623 Species:Mus musculus


Alignment Length:646 Identity:198/646 - (30%)
Similarity:325/646 - (50%) Gaps:60/646 - (9%)


- Green bases have known domain annotations that are detailed below.


  Fly     6 FIYSRERRRFGRQCRFQDRN-ELMVSVHPSGRQRLKYIMANPSEKSTQLSRQMALTVMETENVTL 69
            ::|.::|..||:||.|.||. ||.:.:.|:......|:..||.:...|.|..|:.....||...:
Mouse     5 YVYLKKRSEFGKQCNFSDRQAELNIDILPNPELAALYVERNPVDTGIQCSASMSEHEANTERFEM 69

  Fly    70 DQHGMYHYEGGWPKEVNFNDEEQTQRHRKKVEREDSWGEQVLSMIRTTMSVAEQNNTLNIYQNFF 134
            :..|:.|.||||||:||..:.|||.|.|||||:::::...|:.:........:|||.::||:.:|
Mouse    70 ESCGVNHVEGGWPKDVNPQELEQTIRFRKKVEKDENYINAVMQLGSIMEHCIKQNNAIDIYEEYF 134

  Fly   135 ADLPPELGHDIKMRFRARVANVFHDLWLPARQLRSIEWMPNNSRQFMTQYTNHFAKGERLRPVTD 199
            .|  .|.....:....|:..|||.|.....|....:.|.|:.:|:....|:  ..|.:|      
Mouse   135 DD--EEAVEVTEEAPSAKTINVFRDPQEIKRTATHLSWHPDGNRKLAVAYS--CLKFQR------ 189

  Fly   200 EPFGGTNGFYVWDVKNPLKPRITYDSKQQVSLAKICPKDENNMVGGTGLGQVCLWGTFKGGLPIR 264
            .|.......|:||::||.:|.|.......:...:..|||.:.::||...||:..|.|.||.|...
Mouse   190 APMSMNYDSYIWDLENPNRPEIALKPLSPLVTLEYNPKDSHVLLGGCYNGQIACWDTRKGSLVAE 254

  Fly   265 NCPLEVSHRETTSALCWVHSKSNTEFYSGSLDGSIKYWDTRDLKMPMQELLLEPEPQERQSRMDS 329
            ...:|.|||:......|:.||:.||.:|.|.||.:.:||.|.:..|::.::::...:|:..  ::
Mouse   255 LSTIEFSHRDPVYGTIWLQSKTGTECFSASTDGQVMWWDIRKISEPIEVVIMDISRKEQLE--NA 317

  Fly   330 HGVTVLEFEYTIPVRFIIGSDMGHVFVGNRKGMTPMETLLAHYQLFVGPVRSINRNPFFVKNFLV 394
            .|...||||.|:|.:|::|::.|.|...|||..|..|.::..:....||:.::.||||:.||||.
Mouse   318 LGAISLEFESTLPTKFMVGTEQGIVISCNRKAKTQAEKIVCTFYGHHGPIYALQRNPFYPKNFLT 382

  Fly   395 TGDWRARIWSEEVKDSPSTMYFRKN-AQILCGAWSTGRCSLFVTGDINGVVDFWDLLLHHRKPIR 458
            .|||.||||||:.::| |.|:.:.: |.:..||||..|.::|.|..::|.:|.|||:.....|..
Mouse   383 VGDWTARIWSEDSRES-SIMWTKYHMAYLSDGAWSPVRPAVFFTTKMDGTLDIWDLVFKQCDPAL 446

  Fly   459 SVD------FKVAIADLVFRPEGDLLAIGLKNGDTHIMTLDESMRQATGKEKALMAAMFEREIVR 517
            |:.      |.:.:.|     .|.|:|.|.:.|.|.::.:..|:......||.:.:::||||..|
Mouse   447 SLKVCDDPLFCLRVQD-----NGCLIACGSELGTTTLLEVSSSLSTLQRNEKNIASSIFERETRR 506

  Fly   518 CKLLEARYDEVRLKRKTLQMAEEERVRKQQQLAPTLELDPDNPDQFVMMIEGDEEFRTAITEFQD 582
            .|:||||:.|:|||.|. ::..:|..:|:::.|  |:||.       ::.:.:|||      |:.
Mouse   507 EKILEARHREMRLKEKG-KVEGKEDDQKEEEAA--LDLDE-------LVGKAEEEF------FEV 555

  Fly   583 IILSVERKRSKRQVIMERTVFEEWNPADEKLQGEPVVYTKPERKVQTSGNYDKRTSGEPRA 643
            |...::||.:                  |.|:.:|....|...||:.....::....|..|
Mouse   556 IFSELKRKEA------------------EALKKKPKPRKKSSVKVEAEEEVEENVGEEEEA 598

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG1571NP_572435.1 WD40 210..462 CDD:475233 89/258 (34%)
WD40 repeat 229..269 CDD:293791 12/39 (31%)
WD40 repeat 277..325 CDD:293791 14/47 (30%)
WD40 repeat 332..369 CDD:293791 15/36 (42%)
WD40 repeat 379..415 CDD:293791 19/35 (54%)
WD40 repeat 422..447 CDD:293791 9/24 (38%)
Dnai2NP_001030050.2 WD40 repeat 165..212 CDD:293791 13/54 (24%)
WD40 166..472 CDD:475233 103/321 (32%)
WD 1 214..254 12/39 (31%)
WD40 repeat 220..259 CDD:293791 12/38 (32%)
WD 2 261..302 16/40 (40%)
WD40 repeat 266..361 CDD:293791 30/96 (31%)
WD40 repeat 318..351 CDD:293791 14/32 (44%)
WD 3 362..401 21/39 (54%)
WD40 repeat 367..403 CDD:293791 20/36 (56%)
WD 4 405..445 13/39 (33%)
WD40 repeat 410..435 CDD:293791 9/24 (38%)
WD 5 450..489 9/43 (21%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 565..602 8/52 (15%)

Return to query results.
Submit another query.