DRSC/TRiP Functional Genomics Resources



Protein Alignment CTCF and LOC4576068

DIOPT Version: 10

Sequence 1: NP_648109.1  Gene: CTCF / 38817  FlyBaseID: FBgn0035769  Length: 818  Species: Drosophila melanogaster
Sequence 2: XP_001237313.4  Gene: LOC4576068 / 4576068  VectorBaseID: AGAMI1_004786  Length: 533  Species: Anopheles gambiae


Alignment Length: 597  Identity: 121/597 (20%)
Similarity: 193/597 (32%)  Gaps: 205/597 (34%)
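
Each percentage in the summary is the corresponding count divided by the alignment length. The short sketch below reproduces that arithmetic. It assumes rounding to the nearest whole percent (the page does not say whether values are rounded or truncated), and the helper name `percent` is illustrative rather than part of DIOPT.

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage (rounding is an assumption)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * count / total)


# Counts taken from the alignment summary on this page.
alignment_length = 597
stats = {"identity": 121, "similarity": 193, "gaps": 205}

percentages = {name: percent(n, alignment_length) for name, n in stats.items()}
print(percentages)
```

For these three counts, rounding and truncation give the same whole-number values, so the sketch cannot tell which convention the page uses.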


- Green bases have known domain annotations that are detailed below.


  Fly    20 NF---HKEIEGNSDEKVVNTILEAISAEAIDLDENGAEA-----GGSKPMEEAEADLDHAEEAEE 76
            ||   ||:        .|:.:..:|.:|.....|...|.     ||....||      |.||.::
Mosquito   111 NFIVRHKQ--------QVDYLYASIVSEGEQYLEGSLEIERELFGGEGQQEE------HLEEHQQ 161

  Fly    77 EEEDDEDKYFIDDEGNCYIKTTPKKQKELQKKLKQAAAKPGKATRSVVSTATNKSINLRPAKSTP 141
            |::.||    :|       ||:                  |..||             ||...:.
Mosquito   162 EQQQDE----VD-------KTS------------------GSETR-------------RPGPQSG 184

  Fly   142 KATTSKPPPEPKAISVRPARAAAAKAKQSAMPPPPALVVKVPAPRGRPRKNPVIPKPEPMDLERE 206
            :....:|.||                            :.:...:....:.|.:|....:|   .
Mosquito   185 RLEQCEPDPE----------------------------ILLEITKFLEEQTPRLPDGGRLD---A 218

  Fly   207 LEELVDEPDISSMVTELSDYTVDEA-----AVEAATATLTPNEAE--------VYEFEDNATTED 258
            .|.:|::.....:...|||.|::..     |:.|.....||...|        |..|....|...
Mosquito   219 TEHVVEQRRRCGLCCVLSDVTLELTSDQLKALGALNVDKTPQVCEECCILLDIVDSFRKTTTIAR 283

  Fly   259 ENADKKDVDFVLSNKEVK----LKTASSTSQNSNASGHKYSCPHCPYTASKKFLITRHSRSHDVE 319
            |          ||.:|.:    |:..|:..|                   .|.|.::....||..
Mosquito   284 E----------LSGQEQRALDVLQLDSALEQ-------------------AKCLASKLRHKHDAF 319

  Fly   320 PSFKCSICERSFRSNVGLQNHINTHMGNKPHKCKLCESAFTTSGELVRHTRYKHTKEKPHKCTEC 384
            ...:||                    |..      .:::.|||....|.:    ...|.:|    
Mosquito   320 AGLQCS--------------------GGS------MDASSTTSSPAERQS----VPRKAYK---- 350

  Fly   385 TYASVELTKLRRHMTCHTGERPYQCPHCTYASQDMFKLKRHMVIHTGEKKYQCDI--CKSRFTQS 447
                      |..:|||...:.|          |.:||:.|:..|...:.|.||.  |.|.|...
Mosquito   351 ----------RAKVTCHICGKQY----------DSWKLQTHLNEHENIRPYVCDQEGCSSTFAGL 395

  Fly   448 NSLKAH-KLIHSVVDKPVFQCNYCPTTCGRKADLRVHIKHMHTSDVPMTCRRCGQQLPDRYQYKL 511
            ..|..| ||.|:  |.....||.|...|..:...:.|:.:.....:|  |..||:.:.::.....
Mosquito   396 VLLNRHKKLWHT--DYFYAVCNVCGKKCKTQGIYKTHLSYHEEPKLP--CTVCGKLMRNKRAIWK 456

  Fly   512 HVKSHEGEKCYSCKLCSYASVTQRHLASHMLIHLDEKPFHCDQCPQAFRQRQLLRRHMNLVH--- 573
            |:|:|..::.:.|.:|:........|..||.||.::||:.|..|.:.|:.:.||:.|....|   
Mosquito   457 HMKTHSNDRKHVCGVCNKRFTIAYTLRVHMRIHTNDKPYPCADCDKRFQYKCLLKNHCRRYHGGG 521

  Fly   574 NEEYQPPEPREK 585
            ||...|.:.|::
Mosquito   522 NEPAPPCKERQR 533
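
In each block, the middle line marks identical residues with `|`. The `:` and `.` symbols appear to mark stronger and weaker substitutions, as in EMBOSS-style pairwise output, although this page does not define them. As a rough sketch, identities and gap columns can be recounted directly from the two sequence rows without relying on the marker line. The function name `count_columns` is hypothetical, and the input is the final block of the alignment (Fly 574-585, Mosquito 522-533).

```python
def count_columns(row_a: str, row_b: str) -> dict:
    """Count identical and gapped columns in two equal-length aligned rows."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    # A column is identical when both rows hold the same residue (not a gap).
    identical = sum(1 for a, b in zip(row_a, row_b) if a == b and a != "-")
    # A column is gapped when either row holds '-'.
    gaps = sum(1 for a, b in zip(row_a, row_b) if a == "-" or b == "-")
    return {"length": len(row_a), "identical": identical, "gaps": gaps}


# Last block of the alignment shown above.
fly = "NEEYQPPEPREK"
mosquito = "NEPAPPCKERQR"

result = count_columns(fly, mosquito)
print(result)
```

Summing these per-block counts over every block should give the totals reported in the summary, provided the rows are copied without the leading labels and coordinates.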

Known Domains:


Indicated by green bases in alignment.

Gene        Sequence        Domain          Region     External ID  Identity
CTCF        NP_648109.1     FAP             116..>202  CDD:429334   10/85 (12%)
                            C2H2 Zn finger  296..316   CDD:275368   2/19 (11%)
                            COG5048         321..>621  CDD:227381   63/271 (23%)
                            C2H2 Zn finger  324..344   CDD:275368   2/19 (11%)
                            C2H2 Zn finger  352..373   CDD:275368   4/20 (20%)
                            C2H2 Zn finger  381..401   CDD:275368   2/19 (11%)
                            zf-H2C2_2       394..415   CDD:463886   5/20 (25%)
                            C2H2 Zn finger  409..429   CDD:275368   4/19 (21%)
                            zf-H2C2_2       422..446   CDD:463886   9/25 (36%)
                            C2H2 Zn finger  437..457   CDD:275368   9/22 (41%)
                            C2H2 Zn finger  467..485   CDD:275368   5/17 (29%)
                            C2H2 Zn finger  496..516   CDD:275370   5/19 (26%)
                            C2H2 Zn finger  524..544   CDD:275368   5/19 (26%)
                            zf-H2C2_2       536..561   CDD:463886   10/24 (42%)
                            C2H2 Zn finger  552..573   CDD:275368   6/20 (30%)
                            zf-C2H2         587..609   CDD:395048
                            C2H2 Zn finger  589..609   CDD:275368
LOC4576068  XP_001237313.4  None
Blue background indicates that the domain is not in the aligned region.
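
For downstream use, a domain entry of the form "Domain Region External-ID Identity" can be split into fields with a regular expression. The sketch below is one possible parser, not a DIOPT utility: the pattern `ROW` and the function `parse_domain_row` are hypothetical names. It treats a `>` before the end coordinate as an open-ended boundary, which is an assumption about the notation. The identity field is optional because the last two domains in the table have none.

```python
import re

# Domain name, "start..end" (optionally "start..>end"), external ID,
# and an optional "n/m (p%)" identity field.
ROW = re.compile(
    r"^(?P<domain>.+?)\s+"
    r"(?P<start>\d+)\.\.(?P<open>>?)(?P<end>\d+)\s+"
    r"(?P<ext_id>\S+:\d+)"
    r"(?:\s+(?P<ident>\d+)/(?P<total>\d+)\s+\((?P<pct>\d+)%\))?$"
)


def parse_domain_row(row: str) -> dict:
    """Parse one domain entry (without the gene and sequence columns)."""
    m = ROW.match(row.strip())
    if m is None:
        raise ValueError(f"unrecognised domain row: {row!r}")
    return {
        "domain": m["domain"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        "open_ended": m["open"] == ">",  # assumed meaning of '>'
        "external_id": m["ext_id"],
        "identity": (int(m["ident"]), int(m["total"])) if m["ident"] else None,
    }


with_identity = parse_domain_row("COG5048 321..>621 CDD:227381 63/271 (23%)")
without_identity = parse_domain_row("zf-C2H2 587..609 CDD:395048")
print(with_identity)
print(without_identity)
```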
