DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment zip and XID

DIOPT Version :10

Sequence 1:NP_523860.2 Gene:zip / 38001 FlyBaseID:FBgn0287873 Length:2056 Species:Drosophila melanogaster
Sequence 2:NP_180882.2 Gene:XID / 817886 AraportID:AT2G33240 Length:1770 Species:Arabidopsis thaliana


Alignment Length:2008 Identity:518/2008 - (25%)
Similarity:879/2008 - (43%) Gaps:434/2008 - (21%)


- Green bases have known domain annotations that are detailed below.


  Fly    82 VWVPHENQGFVAASIKREHGDEVEVELAETGKRVMILRDDIQKMNPPKFDK--VEDMAELTCLNE 144
            |||...::.::...:...:|.|::|. .:|  :.::.:.:......|:|.:  |:||.:|..|:|
plant    30 VWVEDPDEAWLDGEVVEANGQEIKVN-CQT--KTVVAKVNAVHPKDPEFPELGVDDMTKLAYLHE 91

  Fly   145 ASVLHNIKDRYYSGLIYTYSGLFCVVVNPYKKLP-IYTEKIMERYKGIKRHEVPPHVFAITDSAY 208
            ..||.|:|.||.:..||||:|...:.|||:|:|| :|..:|||:|||....|:.||.||:.||||
plant    92 PGVLLNLKARYNANEIYTYTGNILIAVNPFKRLPHLYGNEIMEQYKGTDFGELSPHPFAVADSAY 156

  Fly   209 RNMLGDREDQSILCTGESGAGKTENTKKVIQFLAYVAASKPKGSGAVPHPAVLINFSVNTNKYIK 273
            |.|:.:...|:||.:|||||||||:||.::|:|||:...                          
plant   157 RKMINEGVSQAILVSGESGAGKTESTKMLMQYLAYMGGK-------------------------- 195

  Fly   274 VKIMAQNQNQTIEVVNGLKMVEVNSNCQEGELEQQLLQANPILEAFGNAKTVKNDNSSRFGKFIR 338
                |:::.:::                    |||:|::||:||||||||||:|:||||||||:.
plant   196 ----AESEGRSV--------------------EQQVLESNPVLEAFGNAKTVRNNNSSRFGKFVE 236

  Fly   339 INFDASGFISGANIETYLLEKSRAIRQAKDERTFHIFYQLLAGATPEQR-EKFILDDVKSYAFLS 402
            |.|:..|.||||.|.|||||:||..:.:..||.:|.||.|.  |.|||. |::.|....::.:|:
plant   237 IQFNHMGRISGAAIRTYLLERSRVCQVSDPERNYHCFYMLC--AAPEQETERYQLGKPSTFHYLN 299

  Fly   403 NGSL-PVPGVDDYAEFQATVKSMNIMGMTSEDFNSIFRIVSAVLLFGSMKFRQERNNDQATLPD- 465
            ..:. .:..:||..|:.||.|:|:::|::.|:.::|||:|:|:|..|:::|.:...:|.|...| 
plant   300 QSNCHALDAIDDSKEYLATRKAMDVVGISPEEQDAIFRVVAAILHLGNIEFAKSEESDGAEPKDD 364

  Fly   466 -NTVAQKIAHLLGLSVTDMTRAFLTPRIKVGR-DFVTKAQTKEQVEFAVEAIAKACYERMFKWLV 528
             :....|:|..|.:.........|..|:.|.| :.:||.........:.:|:||..|.::|.|||
plant   365 KSRFHLKVAAKLFMCDEKALENSLCNRVMVTRGESITKPLDPGSAALSRDALAKIVYSKLFDWLV 429

  Fly   529 NRINRSLDRTKRQGASFIGILDMAGFEIFELNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQRE 593
            .:||.|:.:.. .....||:||:.|||.|:.|||||.|||.|||||||.||..:|.:|||||.:|
plant   430 TKINNSIGQDS-SSKYIIGVLDIYGFESFKTNSFEQFCINLTNEKLQQHFNQHVFKMEQEEYTKE 493

  Fly   594 GIEWKFIDFGLDLQPTIDLID-KPGGIMALLDEECWFPKATDKTFVDKLVSAHSMHPKFMKTDFR 657
            .|:|.:|:| :|.|..:|||: |||||:|||||.|.||::|..|..:||......|.:|.|... 
plant   494 EIDWSYIEF-IDNQDVLDLIEKKPGGIIALLDEACMFPRSTHDTLAEKLYQTFGSHKRFTKPKL- 556

  Fly   658 GVADFAIVHYAGRVDYSAAKWLMKNMDPLNENIVSLLQGSQDPFVVNIW-KDAEIVGMAQQALTD 721
            ...||.|.||||.|.|....:|.||.|.:.....||:..|...||.::: |..|           
plant   557 ARTDFTICHYAGDVTYQTELFLDKNKDYVVGEHQSLMNSSDCSFVSSLFPKSRE----------- 610

  Fly   722 TQFGARTRKGMFRTVSHLYKEQLAKLMDTLRNTNPNFVRCIIPNHEKRAGKIDAPLVLDQLRCNG 786
                ..::...|.::...:|:||..|::||..|.|:::||:.||:..:....:...||.||||.|
plant   611 ----ESSKSSKFSSIGSQFKQQLQSLLETLNTTEPHYIRCVKPNNVLKPEIFENVNVLHQLRCGG 671

  Fly   787 VLEGIRICRQGFPNRIPFQEFRQRYELLTPNVIPKGFMDGKKACEKMIQALELDSNLYRVGQSKI 851
            |:|.|||...|:|.|.||.||..|:.:|.|....:.| |...||:|::..::|..  :::|::|:
plant   672 VMEAIRISCAGYPTRKPFNEFLTRFRILAPEATERSF-DEVDACKKLLARVDLKG--FQIGKTKV 733

  Fly   852 FFRAG-----------------------VLAHLEEERDFKISDLIVNFQAFCRGFLARRNYQKRL 893
            |.|||                       |:.:|..::...:.......||||||.:||..::...
plant   734 FLRAGQMAELDAHRAEVLGHSARIIQRKVITYLSRKKYLLLQSASTEIQAFCRGHIARVQFKATR 798

  Fly   894 QQLNAIRIIQRNCAAYLKLRNWQWWRLYTKVKPLLEVTKQEEKLVQKEDELKQVREKLDTLAKNT 958
            ::..::| ||:....|:         ..|..|.|..             ....::..|..:|...
plant   799 REAASVR-IQKQARTYI---------CQTAFKKLCA-------------SAISIQSGLRAMAARV 840

  Fly   959 Q-EYERKYQQALVEKTTLAEQLQAEIELCAEAEESRSRLMARKQELEDMMQELETRIEEEEERVL 1022
            : :|..|.:.|::        :|::|..|          :.|::.|......:.|:.        
plant   841 EFQYRTKRKAAII--------IQSQIRRC----------LCRRRYLRTKKAAITTQC-------- 879

  Fly  1023 ALGGEKKKLELNIQDLEEQLEEEEAARQKLQLEKVQLDAKIKKYEEDLALTDDQNQKLLKEKKLL 1087
               |.:.|:               |.|   :|.|:::.||     |..||.|             
plant   880 ---GWRVKV---------------AHR---ELRKLKMAAK-----ETGALQD------------- 905

  Fly  1088 EERANDLSQTLAEEEEKAKHLAKLKAKHEATISELEERLHKDQQQRQESDRSKRKIETEVADLKE 1152
                                   .|.|.|..:.||...|..::|.|.|.::.|.:   ||.||:.
plant   906 -----------------------AKTKLEKEVEELTSCLELEKQMRMELEQVKTQ---EVEDLRS 944

  Fly  1153 QLNERRVQVDEMQAQLAKREEELTQTLLRIDEESATKATAQKAQRELESQLAEIQEDLEAEKAAR 1217
            .||:.::|:.|  .|:.|.||     :|::          |.|.::::.:..|:.::||......
plant   945 ALNDMKLQLGE--TQVTKSEE-----ILKL----------QSALQDMQLEFEELAKELEMTNDLA 992

  Fly  1218 AKAEKVRRDLSEELEALKNELLDSLDTTAAQQELRSKRE----------------QELATLKKSL 1266
            |:.|:: :||...|:...:|.....:.|:...|.|.|:|                |:|..|..:|
plant   993 AENEQL-KDLVSSLQRKIDESDSKYEETSKLSEERVKQEVPVIDQGVIIKLEAENQKLKALVSTL 1056

  Fly  1267 EEETVNHEGVLADMRHKHSQELNSINDQLENLRKAKTVLEKAKGTLEAENADLATELRSVNSSRQ 1331
            |::       :..:..||....::|:|||:  ..|.:..|.. ..|.|||..|...:.|:.:...
plant  1057 EKK-------IDSLDRKHDVTSSNISDQLK--ESASSDYEML-SNLAAENERLKALVSSLENENY 1111

  Fly  1332 ENDRRRKQAESQIAELQVK---LAEIERARSELQEKCTKLQQEAENITNQLEEAELKASAAVKSA 1393
            |||......|.:.....:|   |||......|:..|.....::..::.:.||          :..
plant  1112 ENDGNDSPNEQKEGPQMLKEEILAEDFSIDDEMTNKLAAENKDLYDLVDLLE----------RKI 1166

  Fly  1394 SNMESQLTEAQQLLEEETRQKLGLSSKLRQIESEKEALQEQLEEDDEAKRNYERKLAEVTTQMQE 1458
            ...|.:..||.:|.||..:|.:....|   .|......:|:|::..:.    |.||.|:.|.||.
plant  1167 DETEKKYEEASKLCEERLKQVVDTEKK---YEEASRLCEERLKQVVDT----ETKLIELKTSMQR 1224

  Fly  1459 IKKKAEEDADLAKELEEGKKRLNKDIEALERQVKELIAQNDRLDKSK-----KKIQSELEDATIE 1518
            :::|       ..::|...|.|.:  :||.......::....||...     :.:::...::...
plant  1225 LEEK-------VSDMEAEDKILRQ--QALRNSASRKMSPQKSLDLFVFMYLFQPVENGHHESFAP 1280

  Fly  1519 LEAQRTKVL-----ELEKKQKNFDKILAEEKAISEQIAQERDTAEREAREKETKVLSVSR----- 1573
            :.::|...:     ::|::...|..:|.  |.:|:.:.        .:..|.....::.:     
plant  1281 IPSRRFGAMSFRRSQIEQQPHEFVDVLL--KCVSKNVG--------FSHGKPVAAFTIYKCLIHW 1335

  Fly  1574 -----ELDEAFDKIEDLENKRKTLQNELDD------LANTQGTADKNVHELEKAKRALESQLAEL 1627
                 |....||:|..:..  ..::|..||      |.||...    :..|:::.::..:..|..
plant  1336 KLFEAEKTSVFDRIVPIFG--SAIENPEDDSNLAYWLTNTSTL----LFLLQRSLKSHSTTGASP 1394

  Fly  1628 K------------AQNEELEDDLQLTEDAKLRLEVNMQALRSQFERDLLAKEEG-----AEEKRR 1675
            |            .|.........|:.|...:::....||  .|::.|.|..|.     .|..:|
plant  1395 KKPPQPTSFFGRMTQGFRSPSSASLSGDVVQQVDARYPAL--LFKQQLTAYIETIYGIFQENVKR 1457

  Fly  1676 GLVKQLRDLETELDEERKQRTAAVASKKKLEGDLKEIETTMEMHNKVKEDALKHAKKLQAQVKDA 1740
            .|...|......|.:...:.:|...|.:..|.:..|..:......|:.||  ..:.||       
plant  1458 KLAPVLSSCIQGLKDSSHEFSAETLSAESSEQNSPEKPSEENPPEKLSED--NSSGKL------- 1513

  Fly  1741 LRDAEEAKAAK-EELQALSKEAERKVKALEAEVLQLTEDLASSERARRAAETERDELAEEIANNA 1804
               :|:..||| .|..:.:|.:|...:|..:||....:..|.:..|:.:.|....|..:::....
plant  1514 ---SEDYLAAKPSEDNSPAKPSEENSQAKLSEVNPQAKPSAENSLAKPSEENSPTETWQDVIGLL 1575

  Fly  1805 NKGSLMIDEKRRLEARIATLEEELEEEQSNSEVLLDRSRKAQLQ-IEQLTTELANEKSNSQKNEN 1868
            |:             .:.||::        :.|.|..::|...| .:.:..:|.|          
plant  1576 NQ-------------LLGTLKK--------NYVPLFLAQKIFCQTFQDINVQLFN---------- 1609

  Fly  1869 GRALLERQ------NKELKAKLAEIET--AQRTKVKATIATLEAKIANLEEQLENEGKERLL--- 1922
              :||:|:      .|::...|.|:|:  :|.|         |..:.:..::|:|..:..:|   
plant  1610 --SLLQRECCTFIMGKKVNVWLNELESWCSQAT---------EDFVGSSWDELKNTRQALVLLVT 1663

  Fly  1923 QQKANRKMDKKIKELTMNIEDERRH-------VDQHKEQ--MDKLNSRIKLLKRNLDE 1971
            :||:....|.....|...:..::.:       :|.|::|  ...:.|.:|||..:.||
plant  1664 EQKSTITYDDLTTNLCPALSTQQLYRICTLCKIDDHEDQNVSPDVISNLKLLVTDEDE 1721

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
zipNP_523860.2 Myosin_N 79..122 CDD:460670 7/39 (18%)
MYSc_Myh2_insects_mollusks 145..854 CDD:276876 272/716 (38%)
Myosin_tail_1 931..2011 CDD:460256 211/1126 (19%)
XIDNP_180882.2 Myosin_N 25..68 CDD:460670 7/40 (18%)
MYSc_Myo11 92..736 CDD:276835 272/716 (38%)
Mplasa_alph_rch 889..>1259 CDD:275316 102/467 (22%)
MyosinXI_CBD 1298..1739 CDD:271259 90/496 (18%)

Return to query results.
Submit another query.