DRSC/TRiP Functional Genomics Resources


Protein Alignment CG32085 and Kdm2a

DIOPT Version: 9

Sequence 1: NP_729732.1  Gene: CG32085 / 39311  FlyBaseID: FBgn0052085  Length: 666  Species: Drosophila melanogaster
Sequence 2: NP_001001984.2  Gene: Kdm2a / 225876  MGIID: 1354736  Length: 1161  Species: Mus musculus


Alignment Length: 582  Identity: 120/582 (20%)
Similarity: 196/582 (33%)  Gaps: 188/582 (32%)
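
The percentages above are each count divided by the alignment length. A minimal sketch of that arithmetic follows; the helper name `percent` is illustrative and not part of DIOPT. The page prints whole-number percentages, and the same 120/582 ratio appears as 20% here and as 21% in the domain table, so the rounding rule is not certain and the sketch prints one decimal place instead.

```python
def percent(part, total):
    """Return part/total as a percentage (float)."""
    return 100.0 * part / total

# Counts copied from the alignment summary on this page.
alignment_length = 582
counts = {"identity": 120, "similarity": 196, "gaps": 188}

percents = {name: percent(count, alignment_length) for name, count in counts.items()}

for name, value in percents.items():
    print(f"{name}: {counts[name]}/{alignment_length} = {value:.1f}%")
```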


- Green residues have known domain annotations, which are detailed below.


  Fly    73 CGGGNSNSNSGSNSSNSNTSSASATAATSPASNANPPQTPDKPSRGS------SP-SPGGI---T 127
            |...:|:..:.......:...|.......|..:...|.||...|..|      .| ||.|:   :
Mouse   675 CYQEDSSDKAQKRKIEESDEEAVQAKVLRPLRSCEEPLTPPPHSPTSMLQLIHDPVSPRGMVTRS 739

  Fly   128 MPGGQSQVQNSTHHLLQQQQQQQQHMQLQQSQQQHLQLQAS--TLINSNHHVMVGPAPPTGMPLG 190
            .||...    |.||...:.::.:        ::|.|:|||:  |::....:      .|:|.   
Mouse   740 SPGAGP----SDHHSASRDERFK--------RRQLLRLQATERTMVREKEN------NPSGK--- 783

  Fly   191 APPTPTVKSIAKQMNITIPGG--GVNPGSPTFSTMGMVAAQKAAASAGGTPLQLRKQLPNPHLHH 253
                   |.:::.....|.|.  .|....||....|.....|..|....: ..||   |||.   
Mouse   784 -------KELSEVEKAKIRGSYLTVTLQRPTKELHGTSIVPKLQAITASS-ANLR---PNPR--- 834

  Fly   254 PYGSMGINAASPLIMQHQLHPPPIHSIEQLM---------------------------------L 285
                        ::|||.....|.|..|:.:                                 .
Mouse   835 ------------VLMQHCPARNPQHGDEEGLGGEEEEEEEEEEDDSAEEGGAARLNGRGSWAQDG 887

  Fly   286 DDRFLSR-----FFQYFSPYERRILAQVCIKWRDTLYRSPRYWSGLLPTLQCRELRQMPGCDRGK 345
            |:.::.|     .|:|.|..|.....:||..|    |:    |        |        ||: :
Mouse   888 DESWMQREVWMSVFRYLSRKELCECMRVCKTW----YK----W--------C--------CDK-R 927

  Fly   346 LYNSLIRRGFHALGLVGASDEDALD-VVHSFPLASKHVHSLSLRCSSISDRGLETLLDHLQSLFE 409
            |:..:      .|....|....||. ::...|:      ||.|..::||.:.|..|::.|..|.:
Mouse   928 LWTKI------DLSRCKAIVPQALSGIIKRQPV------SLDLSWTNISKKQLTWLVNRLPGLKD 980

  Fly   410 LELAGCNEVTEAGLWACLTPRIVSLSLADCINIADEAVGAV------------AQLLPSLYEFSL 462
            |.||||:....:.|.....|.:.:|.|...:.|.|..:..:            ...|.::.:|.|
Mouse   981 LLLAGCSWSAVSALSTSSCPLLRTLDLRWAVGIKDPQIRDLLTPPTDKPGQDNRSKLRNMTDFRL 1045

  Fly   463 QAYHVTDAALGYFSPKQSHSLSILRLQSCWELTNHGIVNIVHSLPHLTVLSLSGCSKLTDDGVEL 527
            ....:|||.              |||             |:..:|.|:.|.||.||.|||....|
Mouse  1046 AGLDITDAT--------------LRL-------------IIRHMPLLSRLDLSHCSHLTDQSSNL 1083

  Fly   528 I----AENLQKLRALDLSWCPRITDASLEYI-------ACDLNQLEELTLDRCVH-ITDIGV 577
            :    :.....|..|:::.|.::||.:|.::       ..||...:::|...|.| |:|:.:
Mouse  1084 LTAVGSSTRYSLTELNMAGCNKLTDQTLFFLRRIANVTLIDLRGCKQITRKACEHFISDLSI 1145
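
In each block above, the middle line marks identical positions with `|`. As a rough illustration, the sketch below recounts identities for the first 14 columns of the block starting at fly 410 / mouse 981 and compares the result with the number of `|` symbols. The function name `count_identities` is hypothetical, and the meaning of the `:` and `.` symbols (presumably two grades of similarity) is an assumption, so they are not interpreted here.

```python
def count_identities(seq_a, seq_b, gap="-"):
    """Count aligned columns where both sequences carry the same residue."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != gap)

# First 14 columns of the fly 410 / mouse 981 block, copied from the page.
fly = "LELAGCNEVTEAGL"
mouse = "LLLAGCSWSAVSAL"
match_line = "|.||||:....:.|"

identities = count_identities(fly, mouse)
bars = match_line.count("|")
print(identities, bars)
```

Applying the same count to every block and summing would be one way to check the reported 120 identities, assuming the blocks are copied without loss.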

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence  Domain  Region  External ID  Identity
CG32085  NP_729732.1  leucine-rich repeat  382..406  CDD:275381  8/23 (35%)
AMN1 383..600 CDD:187754 54/219 (25%)
leucine-rich repeat 431..456 CDD:275381 5/36 (14%)
leucine-rich repeat 457..482 CDD:275381 5/24 (21%)
leucine-rich repeat 483..508 CDD:275381 4/24 (17%)
leucine-rich repeat 509..534 CDD:275381 10/28 (36%)
leucine-rich repeat 535..560 CDD:275381 8/31 (26%)
leucine-rich repeat 561..585 CDD:275381 5/18 (28%)
leucine-rich repeat 586..610 CDD:275381
leucine-rich repeat 636..661 CDD:275381
Kdm2a  NP_001001984.2  cupin_like  199..299  CDD:389752
JHD 304..>339 CDD:375347
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 419..445
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 532..557
zf-CXXC <576..609 CDD:366873
PHD_KDM2A 619..675 CDD:277113 120/582 (21%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 705..789 23/111 (21%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 840..886 3/45 (7%)
F-box-like 895..933 CDD:372399 13/68 (19%)
leucine-rich repeat 929..953 CDD:275381 4/29 (14%)
AMN1 931..1137 CDD:187754 55/244 (23%)
leucine-rich repeat 954..977 CDD:275381 8/28 (29%)
LRR 1 960..981 6/20 (30%)
leucine-rich repeat 978..1001 CDD:275381 7/22 (32%)
LRR 2 983..1009 8/25 (32%)
leucine-rich repeat 1002..1039 CDD:275381 5/36 (14%)
leucine-rich repeat 1040..1064 CDD:275381 9/50 (18%)
LRR 3 1047..1072 11/51 (22%)
leucine-rich repeat 1065..1094 CDD:275381 10/28 (36%)
LRR 4 1073..1102 8/28 (29%)
leucine-rich repeat 1095..1119 CDD:275381 6/23 (26%)
LRR 5 1103..1127 6/23 (26%)
leucine-rich repeat 1120..1139 CDD:275381 4/18 (22%)
LRR 6 1128..1155 5/18 (28%)
Blue background indicates that the domain is not in the aligned region.
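
Whether a domain falls in the aligned region can be approximated with an interval-overlap test. The sketch below is an illustration only: the aligned ranges (fly 73..577, mouse 675..1145) are read from the first and last coordinates of the alignment blocks on this page, and it is an assumption that DIOPT applies a comparable rule. The function name `overlaps` is hypothetical.

```python
def overlaps(region, aligned):
    """Return True if two inclusive (start, end) ranges share a position."""
    return region[0] <= aligned[1] and aligned[0] <= region[1]

# Aligned ranges read off the alignment blocks (assumption, see lead-in).
fly_aligned = (73, 577)
mouse_aligned = (675, 1145)

# A few domain regions copied from the Known Domains table.
fly_domains = {"LRR 561..585": (561, 585), "LRR 586..610": (586, 610)}
mouse_domains = {"cupin_like": (199, 299), "PHD_KDM2A": (619, 675)}

fly_hits = {name: overlaps(r, fly_aligned) for name, r in fly_domains.items()}
mouse_hits = {name: overlaps(r, mouse_aligned) for name, r in mouse_domains.items()}
print(fly_hits)
print(mouse_hits)
```

In this sample, the domains that the test places outside the aligned range are the ones listed without an identity value in the table.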


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              BLAST Result  Score  Score Type  Cluster ID
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E1_KOG1947
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         1             0.910           - -
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      0             0.000           Not matched by this tool.
TreeFam         0             0.000           Not matched by this tool.
                2             1.810
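
The final row of the table appears to combine the two score columns: a simple score of 2 and a weighted score of 1.810. A small sketch of how those totals could be reproduced from the two matching tools is shown below; treating the last row as a column sum is an inference from the numbers, not something the page states.

```python
from decimal import Decimal

# Weighted scores of the tools that report a match, copied from the table.
weighted_scores = {
    "eggNOG": Decimal("0.900"),
    "Phylome": Decimal("0.910"),
}

# Each matching tool contributes a simple score of 1.
simple_total = len(weighted_scores)
# Decimal keeps the three-decimal formatting used on the page.
weighted_total = sum(weighted_scores.values(), Decimal("0"))

print(simple_total, weighted_total)
```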
