DRSC/TRiP Functional Genomics Resources

Protein Alignment su(Hw) and CTCF

DIOPT Version: 10

Sequence 1: NP_524349.1  Gene: su(Hw) / 41740  FlyBaseID: FBgn0003567  Length: 941  Species: Drosophila melanogaster
Sequence 2: NP_648109.1  Gene: CTCF / 38817  FlyBaseID: FBgn0035769  Length: 818  Species: Drosophila melanogaster


Alignment Length: 944  Identity: 209/944 (22%)
Similarity: 327/944 (34%)  Gaps: 257/944 (27%)
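
The percentages in this summary can be reproduced from the raw counts. The sketch below is illustrative only: the helper name `percent_truncated` is hypothetical, and the use of truncation is an assumption inferred from the numbers themselves (327/944 is about 34.6%, which is shown as 34% rather than 35%).

```python
def percent_truncated(count: int, total: int) -> int:
    """Return count/total as a whole-number percentage, dropping the fraction.

    Truncation is an assumption: it is the only simple convention that
    reproduces all three percentages shown in the summary above.
    """
    return count * 100 // total


alignment_length = 944
counts = {"identity": 209, "similarity": 327, "gaps": 257}

# Express each count as a percentage of the full alignment length.
percentages = {
    name: percent_truncated(count, alignment_length)
    for name, count in counts.items()
}
print(percentages)
```

All three values are computed over the alignment length (944 columns, including gap columns), not over the length of either protein (941 and 818 residues).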


- Green residues have known domain annotations that are detailed below.


  Fly    24 KDKRPATRMKLLNDV-----GAGEDSEASTTTTTSRTPSNKQEKRGSVAGSRIKILNEE------ 77
            ||:.|......||:.     |..::...:|........:...::.|:.||....:...|      
  Fly     7 KDEDPEDLQTFLNNFHKEIEGNSDEKVVNTILEAISAEAIDLDENGAEAGGSKPMEEAEADLDHA 71

  Fly    78 -----------------------ILGTPKTE-------KRGATK---------STA--------P 95
                                   |..|||.:       |:.|.|         |||        |
  Fly    72 EEAEEEEEDDEDKYFIDDEGNCYIKTTPKKQKELQKKLKQAAAKPGKATRSVVSTATNKSINLRP 136

  Fly    96 AASTVKILNEKKTPSATVTAVETTKIKTSPSKRKKMEHYVLQAV----------------KSENT 144
            |.||.|....|..|.....:|...:...:.:|:..|.......|                |.|..
  Fly   137 AKSTPKATTSKPPPEPKAISVRPARAAAAKAKQSAMPPPPALVVKVPAPRGRPRKNPVIPKPEPM 201

  Fly   145 KADTTVTVVTEEDDTIDFI--LAD---DEEVVPGRIENNNGQEIVVTE-DDEDLGEDGDEDGEDS 203
            ..:..:..:.:|.|....:  |:|   ||..|..........|..|.| :|....||.:.|    
  Fly   202 DLERELEELVDEPDISSMVTELSDYTVDEAAVEAATATLTPNEAEVYEFEDNATTEDENAD---- 262

  Fly   204 SGKGNSSQTKIKEIVEHVCGKCYKTFRRVQSLKKHLEFCRYDSGYHLRKADMLKNLEKIEKDAVV 268
                                            ||.::|             :|.|.|...|.|  
  Fly   263 --------------------------------KKDVDF-------------VLSNKEVKLKTA-- 280

  Fly   269 MEKKDICFCCSESYDTFHLGH-INCPDCPKSFKTQTSYERHIFITHSEFSD----FPCSICNANL 328
                     .|.|.::...|| .:||.||     .|:.::.:...||...|    |.||||..:.
  Fly   281 ---------SSTSQNSNASGHKYSCPHCP-----YTASKKFLITRHSRSHDVEPSFKCSICERSF 331

  Fly   329 RSEALLALHEEQHKSRGKPYACKICGKDFTRSYHLKRHQKYSSCSSNETDTMSCKVCDRVFYRLD 393
            ||...|..|...|.. .||:.||:|...||.|..|.||.:|   ...:.....|..|......|.
  Fly   332 RSNVGLQNHINTHMG-NKPHKCKLCESAFTTSGELVRHTRY---KHTKEKPHKCTECTYASVELT 392

  Fly   394 NLRSHLKQHLGTQVVKKPEYMCHTCKNCFYSLSTLNIHIRTHTGEKPFDCDLCDKKFSALVALKK 458
            .||.|:..|.|    ::| |.|..|......:..|..|:..|||||.:.||:|..:|:...:||.
  Fly   393 KLRRHMTCHTG----ERP-YQCPHCTYASQDMFKLKRHMVIHTGEKKYQCDICKSRFTQSNSLKA 452

  Fly   459 HRRYHT-GEKP-YSCTVCNQAFAVKEVLNRHMKR-HTGERPHKCDECGKSFIQATQLRTHSKTH- 519
            |:..|: .:|| :.|..|......|..|..|:|. ||.:.|..|..||:......|.:.|.|:| 
  Fly   453 HKLIHSVVDKPVFQCNYCPTTCGRKADLRVHIKHMHTSDVPMTCRRCGQQLPDRYQYKLHVKSHE 517

  Fly   520 -IRPFPCEQCDEKFKTEKQLERHVKTHSRTKRPVFSCAECKRNFRTPALLKEHMD---EGKHSPK 580
             .:.:.|:.|.....|::.|..|:..|...|  .|.|.:|.:.||...||:.||:   ..::.|.
  Fly   518 GEKCYSCKLCSYASVTQRHLASHMLIHLDEK--PFHCDQCPQAFRQRQLLRRHMNLVHNEEYQPP 580

  Fly   581 QQRSSMRSAVKIMERTDCAICDKNFDSSDTLRRHIRTVHECDPDDIFGVEPHPSKRAKKDIESEE 645
            :.|..:..         |..|.:.|.....|.||:.| |    ||  .......:|..|      
  Fly   581 EPREKLHK---------CPSCPREFTHKGNLMRHMET-H----DD--SANAREKRRRLK------ 623

  Fly   646 VVPVALNTSAGSLISSQTDGNGV-VVREFLVDEGDGAAQTITLEN-ETYTILPLDGA-IEGEQLT 707
                     .|..:..|.||..: ::::..||......:....:| |:|.:..::.. .|.|...
  Fly   624 ---------LGRNVRLQKDGTVITLIKDQYVDMDRDQEENEEDDNPESYDLAEIEPENSEAEDAD 679

  Fly   708 DEA-GVKPEAKKEEAQVSPVVKKEQRKSLAASLAAAIADN-------------LEESCSEDDFS- 757
            |:. .:..:..::..:.:|::..:|.: ||||....:..|             ::|.....||: 
  Fly   680 DDVETIVSDPIRQRIKPAPIIINKQAR-LAASEKQPMIINQRLRSQRGTKTFHIKEEPDNSDFTV 743

  Fly   758 ------GEILTEEDIKLKENVGKLIDMLV--DPPILKKYGWPNAPEETVLCKV-IENC-----GH 808
                  ||::..|              ||  |..:|.|:      |.:...|: .:||     ..
  Fly   744 EWQGDDGEVMVVE--------------LVNGDEEVLVKH------EPSANSKISAKNCFGFEDDD 788

  Fly   809 DLTKGGENYAELDYGSRMREYCKLLFTVVIHNDS 842
            |..:.|:...|:|..|  :|:.:|:  .:|..||
  Fly   789 DYEEYGDGENEVDGAS--QEFLQLM--DMIEQDS 818

Known Domains:


Indicated by green residues in the alignment.

Gene    Sequence     Domain          Region     External ID  Identity
su(Hw)  NP_524349.1  C2H2 Zn finger  292..313   CDD:275368   5/20 (25%)
                     COG5048         <312..517  CDD:227381   68/211 (32%)
                     C2H2 Zn finger  321..341   CDD:275368   8/19 (42%)
                     C2H2 Zn finger  350..368   CDD:275368   9/17 (53%)
                     C2H2 Zn finger  382..402   CDD:275368   6/19 (32%)
                     C2H2 Zn finger  415..435   CDD:275368   4/19 (21%)
                     C2H2 Zn finger  443..463   CDD:275368   7/19 (37%)
                     COG5048         <467..620  CDD:227381   43/159 (27%)
                     C2H2 Zn finger  471..491   CDD:275368   6/20 (30%)
                     C2H2 Zn finger  499..519   CDD:275368   6/19 (32%)
                     C2H2 Zn finger  525..545   CDD:275368   5/19 (26%)
                     C2H2 Zn finger  555..572   CDD:275368   6/16 (38%)
                     C2H2 Zn finger  598..615   CDD:275371   5/16 (31%)
CTCF    NP_648109.1  FAP             116..>202  CDD:429334   16/85 (19%)
                     C2H2 Zn finger  296..316   CDD:275368   7/24 (29%)
                     COG5048         321..>621  CDD:227381   95/326 (29%)
                     C2H2 Zn finger  324..344   CDD:275368   8/19 (42%)
                     C2H2 Zn finger  352..373   CDD:275368   10/23 (43%)
                     C2H2 Zn finger  381..401   CDD:275368   6/19 (32%)
                     zf-H2C2_2       394..415   CDD:463886   9/25 (36%)
                     C2H2 Zn finger  409..429   CDD:275368   4/19 (21%)
                     zf-H2C2_2       422..446   CDD:463886   11/23 (48%)
                     C2H2 Zn finger  437..457   CDD:275368   7/19 (37%)
                     C2H2 Zn finger  467..485   CDD:275368   5/17 (29%)
                     C2H2 Zn finger  496..516   CDD:275370   6/19 (32%)
                     C2H2 Zn finger  524..544   CDD:275368   5/19 (26%)
                     zf-H2C2_2       536..561   CDD:463886   8/26 (31%)
                     C2H2 Zn finger  552..573   CDD:275368   8/20 (40%)
                     zf-C2H2         587..609   CDD:395048   7/31 (23%)
                     C2H2 Zn finger  589..609   CDD:275368   7/20 (35%)
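
For downstream analysis, the domain, region, external ID and identity fields of each row above can be pulled apart with a regular expression. This is a minimal sketch, not part of DIOPT: the names `ROW_PATTERN` and `parse_domain_row` are hypothetical, the input is assumed to be the part of a row starting at the domain name, and the reading of `<` and `>` as markers of a domain match that is incomplete at that end is an assumption.

```python
import re

# Matches the Domain, Region, External ID and Identity fields of one row,
# for example: "COG5048 321..>621 CDD:227381 95/326 (29%)".
ROW_PATTERN = re.compile(
    r"^(?P<domain>.+?)\s+"
    r"(?P<start_open><)?(?P<start>\d+)\.\.(?P<end_open>>)?(?P<end>\d+)\s+"
    r"(?P<external_id>CDD:\d+)\s+"
    r"(?P<matches>\d+)/(?P<length>\d+)\s+\((?P<percent>\d+)%\)$"
)


def parse_domain_row(row: str) -> dict:
    """Split one domain row into typed fields; raise ValueError if it does not fit."""
    match = ROW_PATTERN.match(row.strip())
    if match is None:
        raise ValueError(f"unrecognized domain row: {row!r}")
    return {
        "domain": match["domain"],
        "start": int(match["start"]),
        "end": int(match["end"]),
        # Assumption: "<" / ">" flag a boundary that is open at that end.
        "start_open": match["start_open"] is not None,
        "end_open": match["end_open"] is not None,
        "external_id": match["external_id"],
        "matches": int(match["matches"]),
        "length": int(match["length"]),
        "percent": int(match["percent"]),
    }


row = parse_domain_row("COG5048 321..>621 CDD:227381 95/326 (29%)")
print(row["domain"], row["start"], row["end"], row["end_open"])
```

Rows that also carry the gene and sequence accession would need those two leading fields removed before parsing, since the pattern treats everything before the region as the domain name.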
