DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG32354 and agr-1

DIOPT Version :10

Sequence 1:NP_729380.1 Gene:CG32354 / 50302 FlyBaseID:FBgn0052354 Length:662 Species:Drosophila melanogaster
Sequence 2:NP_001022152.3 Gene:agr-1 / 3565243 WormBaseID:WBGene00018304 Length:1473 Species:Caenorhabditis elegans


Alignment Length:642 Identity:156/642 - (24%)
Similarity:236/642 - (36%) Gaps:196/642 - (30%)


- Green bases have known domain annotations that are detailed below.


  Fly   164 CPRSCPP-SITVGAEPVCGSDGLIYANICELRKKTC-SRSGVSLI----------------KDVR 210
            ||..||. ..:|.:.|||.|.|:.|.:.|.||...| |::.:::.                :..:
 Worm   252 CPTQCPNYGDSVESSPVCSSHGVDYQSSCHLRHHACESKTNITVKFFGRCDPCHGHKCPNGQTCQ 316

  Fly   211 DGCERSKGSDCKHRCSTEKDPVCGTDGRTYLNRCMLRVQSCRV---------------------- 253
            .|.:|.....|..:|:.....||||||:||||.|.|::.:|:.                      
 Worm   317 LGVDRRPECKCSEQCTMNSAHVCGTDGKTYLNECFLKLAACKEQKDILVWKRGNCDEAGSPCEKM 381

  Fly   254 -----GLAAVKLSHVGPCSNTSAVRESCPVDCNSAPKDGPVCSSDGNVYNSTCEMKLK------- 306
                 |...||......|        .||..|....:  |||:::|..:::.||||.|       
 Worm   382 ECGFWGSCVVKPDRTAEC--------ECPNRCEDVMR--PVCATNGETFDNECEMKKKSCETKSM 436

  Fly   307 -------TCGQGVVKTSRKHCQSTRMC----------RESCWRVARPTCGSDGRLYASPCKMRSS 354
                   |||.||..|. ..|:..::|          ..||....:..|||||:.|::.|:::::
 Worm   437 IKVKHQGTCGIGVCATF-DSCKKPQVCVVVDGKPKCVCPSCTDEFKEVCGSDGKTYSNECRLQNA 500

  Fly   355 NC--GKHVFEVPLSY-------------------CMSQERHGASDACPTECPKSDTDSSSQYVCG 398
            .|  .|::|   :.|                   |:..|...|...||.:||..:.:...: |||
 Worm   501 ACMAQKNIF---VKYNSACEACKLKKEKCDFYSACVVGENEKAECKCPDDCPSYEMEEGKE-VCG 561

  Fly   399 SDGNIYSSLCELKMLNCGPQRKSIQKVSMDKCKNRL-TRCKQLPPCK------DFNSLFGSIFSS 456
            :||..|||.|.:|...| .|.|.:......||...| .:|:....|:      .:|.......|:
 Worm   562 TDGVTYSSECHMKKSAC-HQSKFVMTAFEGKCDECLHVQCRYGEECRSGVCVCSYNCPANPPLSA 625

  Fly   457 KRNDKLCGTDAKTYNNECELAHATCLRGVNLAHIGPCTDLNSPTK---DCGDACTRADLEQQPVC 518
                ::||.:...|.:.|.|..|:|.:|..::.:.|....:|.|.   .|  .|.|.        
 Worm   626 ----RICGENGVLYPSLCHLQLASCQKGAPISEMPPSHCHSSKTSFPDSC--QCNRV-------- 676

  Fly   519 GSDGNTFASMCEFKRR------TCDLRVVP------------VSLKNCALTA------DCE---- 555
            ||.|:|.....:.|.|      .|| ..:|            :|.:.|..:|      |||    
 Worm   677 GSFGHTCDETGQCKCRPGVAGIKCD-HCLPSFWGIHLIAQGALSCRPCGCSAFGSSRSDCEQTTG 740

  Fly   556 ----------SDCDAQPPSFV-----CGSDNNLYKS--ECHMRKENCGKHVFVVPLKRCLAAFQL 603
                      ..||..|...:     |.|. .:||:  :||..:  |......||     :....
 Worm   741 KCECKNGALGDKCDLCPNGSMMTAGGCVSP-AVYKTPRDCHSLR--CFHGAKCVP-----SPSSF 797

  Fly   604 KGCARICPREFE----------PVCGSDNKTYLNDCFLEIENCRANQTVNVNYYGAC 650
            ..|  |||:...          .|||||..||.|.|.|::..|:....|.....|.|
 Worm   798 PDC--ICPQSCNMNHLGIVANMTVCGSDGTTYSNLCELKMFACKHQIDVVPVSMGIC 852

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG32354NP_729380.1 KAZAL_FS 163..>200 CDD:412159 15/37 (41%)
KAZAL 220..266 CDD:197624 18/72 (25%)
KAZAL_FS 326..>359 CDD:412159 11/44 (25%)
KAZAL 380..430 CDD:197624 17/49 (35%)
KAZAL 452..493 CDD:197624 9/40 (23%)
KAZAL 502..>536 CDD:197624 9/39 (23%)
KAZAL_FS 553..597 CDD:412159 15/64 (23%)
KAZAL 606..650 CDD:197624 17/53 (32%)
agr-1NP_001022152.3 KAZAL 172..220 CDD:197624
KAZAL 251..301 CDD:197624 16/48 (33%)
KAZAL 327..371 CDD:197624 15/43 (35%)
KAZAL 400..445 CDD:197624 12/46 (26%)
KAZAL 472..516 CDD:197624 13/46 (28%)
KAZAL 547..592 CDD:197624 15/46 (33%)
KAZAL 612..660 CDD:197624 11/51 (22%)
EGF_Lam 670..716 CDD:238012 12/56 (21%)
EGF_Lam 722..>759 CDD:238012 8/36 (22%)
KAZAL <819..852 CDD:197624 13/32 (41%)
LamG 1081..1236 CDD:238058
LamG 1289..1450 CDD:238058
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.