DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG5937 and AT4G38200

DIOPT Version :10

Sequence 1:NP_572289.2 Gene:CG5937 / 31537 FlyBaseID:FBgn0029834 Length:2051 Species:Drosophila melanogaster
Sequence 2:NP_195533.2 Gene:AT4G38200 / 829976 AraportID:AT4G38200 Length:1687 Species:Arabidopsis thaliana


Alignment Length:2096 Identity:370/2096 - (17%)
Similarity:692/2096 - (33%) Gaps:649/2096 - (30%)


- Green bases have known domain annotations that are detailed below.


  Fly     8 IIKESTGTKHNALRQTAQIAYDKLYRQHGIHRDPSHEL-------RSVCFTALQMALDTKRPKFI 65
            |||.:...||..|....:...||| .......|||..|       .......|.::|||...|.|
plant    24 IIKNAAWRKHTFLVSACKSVLDKL-EALSDSPDPSSPLFGLTTSDADAVLQPLLLSLDTGYAKVI 87

  Fly    66 TMGLNGLHRVIKDERFYIGLEPEDDSVWLPSQLL-RATNGILPT--TSSEDTVVNVLRLFLAMAC 127
            ...|:...::     |.:.|...:.....|..|| :..:.|...  ...|...:.|||:.||...
plant    88 EPALDCSFKL-----FSLSLLRGEVCSSSPDSLLYKLIHAICKVCGIGEESIELAVLRVLLAAVR 147

  Fly   128 SPACTLNGRLLIEILSRCGECWEMGSRATKAASLAAASQCLRTFCAFLIEEAEEVKKTAPAGLM- 191
            ||...:.|..|:.::..|...:..|...|            ...||          |:..|.:| 
plant   148 SPRILIRGDCLLHLVRTCYNVYLGGFNGT------------NQICA----------KSVLAQIML 190

  Fly   192 ---TQTQASAVYNEVIPVMQWLCSRLVEPNVN---ASPNKKCENHSSLYLTE----CILTLSSAL 246
               |:::|::           :.:.|...|||   |..:|.....:|:::.:    .::|...|.
plant   191 IVFTRSEANS-----------MDASLKTVNVNDLLAITDKNVNEGNSVHICQGFINDVITAGEAA 244

  Fly   247 PRNVHANPHFT-------------------------SFLWQKFC--------------------P 266
            |     .|.|.                         ..|::..|                    .
plant   245 P-----PPDFALVQPPEEGASSTEDEGTGSKIREDGFLLFKNLCKLSMKFSSQENTDDQILVRGK 304

  Fly   267 TLAAAL-------GSPGRINLDKKF--TYKD--ALHMIENEA---RGFFTGPGLDGPQARCVYLT 317
            ||:..|       |.|..:: |::|  ..|.  .|.:::|.|   ...|        |.:|...|
plant   305 TLSLELLKVIIDNGGPIWLS-DERFLNAIKQLLCLSLLKNSALSVMSIF--------QLQCAIFT 360

  Fly   318 AIQLLRIAGAHGSLRPMLEALFHRMLLLPAPQNRTEP--------LRCVREIFKSPERLIDLAVI 374
            .:.....:|....:     .:|..||:|...:|..:|        |..:..|...|..:||:.|.
plant   361 TLLRKYRSGMKSEV-----GIFFPMLVLRVLENVLQPSFVQKMTVLSLLENICHDPNLIIDIFVN 420

  Fly   375 LYVDKNTAQGCSDEMALFRLLVDAMEECAYGASGGGSATEASLQ------ASVECMVALLDSLQV 433
            ...|..:..       :|..:|:.:.:.|.|...|.|...:.:|      .||:|:|:::.::..
plant   421 FDCDVESPN-------IFERIVNGLLKTALGPPPGSSTILSPVQDITFRHESVKCLVSIIKAMGT 478

  Fly   434 LCSGELT--ESMISDQIVQVV--------NGRHDLLKDADYSGPLTYQSMARLPAPYRDAIVEFR 488
            ....:|:  :|::...:....        |.......|.|:...|..:|........|.|....|
plant   479 WMDQQLSVGDSLLPKSLENEAPANNHSNSNEEDGTTIDHDFHPDLNPESSDAATLEQRRAYKIER 543

  Fly   489 Q---TVF--ETSSG------SESEGEPQHE------QDAASNGS--GDTEGPEDDDPSSSSDETS 534
            |   |:|  :.|.|      |:..|....|      .....|.:  ||..|..:|.|..      
plant   544 QKGVTLFNRKPSKGIEFLISSKKVGNSPDEVVSFLRNTTGLNATMIGDYLGEREDFPMK------ 602

  Fly   535 RVENRWPYSHLEAPVMPIRTDSDNDRQHARDFARALRQDLVPKLLRL-RSCVELDEAMQEFASAV 598
                          ||....||.:.::  .:|..|:|..|  :..|| ....::|..|::||...
plant   603 --------------VMHAYVDSFDFKE--MNFGEAIRFFL--RGFRLPGEAQKIDRIMEKFAERF 649

  Fly   599 CQENSMNFSDFDYNLTAINADGIYLAIYSSLLLSLQLMRAGFYEQVAQGMSHKDILV--PMSEQQ 661
            |:.|..:||         :||..|:..||.::|:..              :| :|:|  .|::..
plant   650 CKCNPNSFS---------SADTAYVLAYSVIMLNTD--------------AH-NIMVKEKMTKAD 690

  Fly   662 FVTSVQNTGVLVYLSSPWLCELYQSVTVCNV-----LEAMPRQQLEGMGPRCALVDML------- 714
            |:.:.:.......|...:|..||..|.:..:     ..|...:|..|:.....|..:|       
plant   691 FIRNNRGIDDGKDLPEEYLGALYDQVVINEIKMSSDSSAPESRQSNGLNKLLGLDGILNLVYWTQ 755

  Fly   715 CDAGGLGATQMLSEWQRLQTANVKHKEEEQLHDKRREAAKKLCRRLLTCCWDSMVIVLSSGLGDL 779
            .:...:||..:|.:..:.:..:...|.|...|.....|   :.|.::...|..|:...|..|   
plant   756 TEEKAVGANGLLIKDIQEKFRSKSGKSESAYHVVTDVA---ILRFMVEVSWGPMLAAFSVTL--- 814

  Fly   780 QTSSASNKLVALSKRTLRVKAKANKSNGEALYAMCLDGLHSAATLSNSLNLQ-------HLAGKI 837
              ..:.::|.|:.                     ||.|...|..::..:.:|       ....|.
plant   815 --DQSDDRLAAVE---------------------CLRGFRYAVHVTAVMGMQTQRDAFVTSMAKF 856

  Fly   838 LNL-LASNVCQTSGPRISASQAMSMDVVLTGGLNLGSYSADCWPSIFAVCRHVSQLEH-EIFSMQ 900
            .|| .|.::.|.:...:.|        :::..:..|::..|.|..|. .|  :|::|| ::....
plant   857 TNLHCAGDMKQKNVDAVKA--------IISIAIEDGNHLQDAWEHIL-TC--LSRIEHLQLLGEG 910

  Fly   901 NPAISASPGSSRRDLETGEKLSNG--NAQDKLNLSSIPIDDDETCVDVYSFLQAPMQSPNTNITS 963
            .|    |..|.....||.||.:.|  |.:.|..|.: |:            :.|.::..:.:.::
plant   911 AP----SDASYFASTETEEKKALGFPNLKKKGALQN-PV------------MMAVVRGGSYDSST 958

  Fly   964 ILKVYSGTNETVLLSQSDTSKVLCAL-------SHQAENLFGDAAERLSLPSLCQFLKHLCRASR 1021
            |     |.|...|:.|...:..:..|       |.|..|::.. ::||...::..|:|.||:.|.
plant   959 I-----GPNMPGLVKQDQINNFIANLNLLDQIGSFQLNNVYAH-SQRLKTEAIVAFVKALCKVSM 1017

  Fly  1022 DQLYKSQVARKGSRIWWPSKGWKKLDSLPMSLLLHRIGDVTLKVFRSSRPLLHVLKVWAITGPHL 1086
            .:|..            |:.        |....|.::  |.:..:..:|..|...::|:|.....
plant  1018 SELQS------------PTD--------PRVFSLTKL--VEIAHYNMNRIRLVWSRIWSILSDFF 1060

  Fly  1087 MDAACHRDRMISKRAIEYIHDIITALLVEQSELPYFHFNEALLKPFENLLSMDTDVDVQDQIVAC 1151
            :......:..::...::.:..:....| |:.||..::|....|:||..::...:..::::.||.|
plant  1061 VSVGLSENLSVAIFVMDSLRQLSMKFL-EREELANYNFQNEFLRPFVIVMQKSSSAEIRELIVRC 1124

  Fly  1152 LYEVVEAHRTEIRSGWRPLFGTLRNA----RSRML-----NMSNIIDIFRVFLDSDNTLVFANAG 1207
            :.::|.:..:.::|||:.:|.....|    |..::     .|..|:..:..::.......|.   
plant  1125 ISQMVLSRVSNVKSGWKSVFKVFTTAAADERKNIVLLAFETMEKIVREYFSYITETEATTFT--- 1186

  Fly  1208 LDCILCLLSYLEISGGGNNNACSGGSGSAAGGTASNGGQQEEDNTFRPTDFLHETLRFLERCSSI 1272
             ||:.||:::                               .::|| .:|.....:.||..|:. 
plant  1187 -DCVRCLITF-------------------------------TNSTF-TSDVSLNAIAFLRFCAL- 1217

  Fly  1273 LGFMHSMPKCPNFHSTYKIKGIS---YTHIIDANIPSSMENFTYFGNDYLQTRNEQYMISYRSLH 1334
                    |..:....:..||.|   .|.:.|.:.||: :||                       
plant  1218 --------KLADGGLVWNEKGRSSSPSTPVTDDHSPST-QNF----------------------- 1250

  Fly  1335 IDKDTIVKIDEMDKPSGVLKVWFLLLDGLTNSLIVCPYSHQAPILQTIFKLFKNLLASPGIDFGF 1399
                       ||....: ..|..||.||:........:.:...|:.:|.:.|        |.| 
plant  1251 -----------MDADENI-SYWVPLLTGLSKLTSDSRSAIRKSSLEVLFNILK--------DHG- 1294

  Fly  1400 YCINHLLVPMIQDWLRYINKTGSSWQLVEKNFKHCCCMTTDLVVEFIEKSVPEQRRLGAGTKTRL 1464
                |:.             :.:.|                               :|.     .
plant  1295 ----HIF-------------SRTFW-------------------------------IGV-----F 1306

  Fly  1465 AQIVHPADNLLYSKLKFVTERIEREQQQQQSPSVQDHGCGYTQQYDSSSSSASEEQQKNGGGVGG 1529
            :.:::|..|.::.:    .:.:.:::......:...|....:  :|:.:|:.:.:..        
plant  1307 SSVIYPIFNSVWGE----NDLLSKDEHSSFPSTFSSHPSEVS--WDAETSAMAAQYL-------- 1357

  Fly  1530 GGSNLIESPTKISGSATLALKQLLLVLIECAAQSQEAIARISVSCLKHVILSTGMLFNESQWMIA 1594
              .:|..|...:..|...::..||..||...||....   ..|..|..:....|..|:|::|...
plant  1358 --VDLFVSFFTVIRSQLSSVVSLLAGLIRSPAQGPTV---AGVGALLRLADELGDRFSENEWKEI 1417

  Fly  1595 CSAIHRACTVTIAPLRQLSFAFHEKSNSFYGDCANVKVAARRDSSLEELARIYALAQQVFLSDNQ 1659
            ..|::.|.::|:              :||      :|.....|...:|    ..|:.|.|  .|:
plant  1418 FLAVNEAASLTL--------------SSF------MKTLRTMDDIPDE----DTLSDQDF--SNE 1456

  Fly  1660 REPGQNQAPTPSASQCKLSDDRSYSFLLYPLNNGFNSNLDNFVIRIPFKNLVVGLLANQMLLQLV 1724
            .:..::...|.|                             :|:.....::.|.|...|::..|.
plant  1457 DDIDEDSLQTMS-----------------------------YVVARTKSHITVQLQVVQVVTDLY 1492

  Fly  1725 ----AKLLLSRLKCVPQAVSTCIFDNYAASAASPSHDYDLDFRSKEILLRCVKQYLMSALEFDSR 1785
                ..||.|.:..:.:.:|         |.:|.:|..:.|.    ||.:.|:: ..|.||....
plant  1493 RIHQQSLLASHVTVILEILS---------SISSHAHQLNSDL----ILQKKVRR-ACSILELSEP 1543

  Fly  1786 PGLKF----------LMQKVSNIEYAANLYKQMTSSWMIYYIALVDSHL------NDIVVYNLGP 1834
            |.|.|          ::|.:.......:|...:.|..|...:.::..:|      .|.:.....|
plant  1544 PMLHFENDTFQNYLDILQAIVTNNPGVSLELNVESQLMTVCMQILKMYLKCTLFQGDELEETRQP 1608

  Fly  1835 EDLNFIL--------ESCSR-------LNTTTVKKKENFVRYLFCLQDAWNLVCELYLSNSALHE 1884
            :  |:||        |:.:|       |......|:::|.||   ..:.:.|:.||..|..:..:
plant  1609 K--NWILPMGAASKEEAAARSPLVVAVLKALRELKRDSFKRY---APNFFPLLVELVRSEHSSSQ 1668

  Fly  1885 IESGSGKLRQKPPQLL 1900
            :           ||:|
plant  1669 V-----------PQVL 1673

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG5937NP_572289.2 Sec7 451..692 CDD:469626 56/270 (21%)
DUF1981 1115..>1177 CDD:462756 15/61 (25%)
AT4G38200NP_195533.2 PLN03076 11..1685 CDD:215560 370/2096 (18%)

Return to query results.
Submit another query.