DRSC/TRiP Functional Genomics Resources

Protein Alignment GCS2alpha and Mgam

DIOPT Version: 10

Sequence 1: NP_652145.1  Gene: GCS2alpha / 49953  FlyBaseID: FBgn0027588  Length: 924  Species: Drosophila melanogaster
Sequence 2: NP_001355804.1  Gene: Mgam / 232714  MGIID: 1203495  Length: 3616  Species: Mus musculus


Alignment Length: 814  Identity: 229/814 (28%)
Similarity: 348/814 (42%)  Gaps: 134/814 (16%)
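
The percentages in the summary follow directly from the raw counts over the alignment length. A minimal sketch of that arithmetic, assuming nothing beyond the numbers printed above; the helper name `percent` is illustrative and not part of DIOPT. Note that 348/814 is about 42.75% while the summary shows 42%, so the summary appears to drop the fractional part rather than round.

```python
def percent(count: int, length: int) -> float:
    """Return count as a percentage of length."""
    return 100.0 * count / length


# Counts copied from the alignment summary.
alignment_length = 814
stats = {"Identity": 229, "Similarity": 348, "Gaps": 134}

for name, count in stats.items():
    value = percent(count, alignment_length)
    # int() drops the fractional part, which matches the whole-number display.
    print(f"{name}: {count}/{alignment_length} = {value:.2f}% (shown as {int(value)}%)")
```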


- Green residues have known domain annotations, which are detailed below.


  Fly   123 DGEIVITSEKNKAVIHGDPFRIDF---FENDVLVVSVNAKNWLYF---EHLRQKAQEPTSH--PA 179
            :.::|.|:....|.:...|....|   .||.:|.......|..:|   :..:::.:.|..|  |.
Mouse   115 ESDVVNTNAGFTATLKNLPSAPVFGNSIENILLTAEYQTSNRFHFKLTDQTKKRYEVPHEHVQPF 179

  Fly   180 ENENQQEQDAAVETPKAADTIDDPGAWEENFKSHH----DSKPYGPEAVA---LDFS--FPAAKV 235
            ........:..||..|      :|.:.:...||::    ||. .||...:   |.||  .|:|.|
Mouse   180 SGNAPSSLNYKVEVSK------EPFSIKVTRKSNNRVLFDSS-IGPLLFSDQFLQFSTHLPSANV 237

  Fly   236 LFGIPEHADS-------------FILKSTSGTDPYRLYNLDVFEYVVDSKMALYGSVPVIYGHGA 287
             :|:.||...             |...:|...|...||.:..|...::....|            
Mouse   238 -YGLGEHVHQQYRHNMNWKTWPMFSRDTTPNEDGTNLYGVQTFFLCLEDNSGL------------ 289

  Fly   288 QRTAGVYWQNAAETWVDIQTSETNVVSSLVNFVSGSQKTPPPAAHFMSESGIVDAFIMLGPKPMD 352
              :.||:..|:....|.:|                    |.||..:.:..||:|.::.||..|..
Mouse   290 --SFGVFLMNSNAMEVTLQ--------------------PTPAITYRTTGGILDFYVFLGNTPEQ 332

  Fly   353 TFKQYAALTGTHELPQLFALAYHQSRWNYNDERDVTSVSAKFDEYNIPMDTMWLDIEHTDGKRYF 417
            ..::|..|.|...||..:.|.:..||::|....::.:|..:.....:|.|....||::.|.|:.|
Mouse   333 VVQEYLELIGRPALPSYWTLGFQLSRYDYKSLDNMKAVVERNRAAQLPYDVQHADIDYMDQKKDF 397

  Fly   418 TWDKFKFPHPLAMIKNLTELGRHLVVIVDPHIKRDNNYFFHRDC--TDRG----YYVKTREG-ND 475
            |:|...|......:|.|...|:.||:|:||.|  .||.|.....  .|||    .:|.:.:| :.
Mouse   398 TYDPVNFKGFPEFVKELHNNGQKLVIILDPAI--SNNSFSSNPYGPYDRGSAMKIWVNSSDGISP 460

  Fly   476 YEGWCWPGAASYPDFFNPVVREYYASQYALDKFQTVTADVMLWNDMNEPSVF------------- 527
            ..|..|||...:||:.:|....::..::.|  |........:|.||||.|.|             
Mouse   461 VIGKVWPGTTVFPDYTSPNCAVWWTKEFEL--FHKEVEFDGIWIDMNEVSNFIDGSFSGCSQNNL 523

  Fly   528 NGPEITAPKDLIHY-----------GNW-EHRDVHNLYGHMHLMGSFAGLQQRDPNQRPFILTRA 580
            |.|..| ||.|..|           .:| :..|||||||:...:.:...::...|::|.||:||:
Mouse   524 NYPPFT-PKVLDGYLFSKTLCMDAVQHWGKQYDVHNLYGYSMAIATAKAVKDVFPDKRSFIITRS 587

  Fly   581 HFAGSQRYAAIWTGDNFADWSHLQHSVKMCLTEAVAGFSFCGADVGAFFGNPDTELLERWYQTGA 645
            .||||.::||.|.|||.|.|..||.|:...|...:.|....|||:..|..:...||..||.|.||
Mouse   588 TFAGSGKFAAHWLGDNTATWKDLQWSIPGMLEFNLFGIPMVGADICGFAQDTYEELCRRWMQLGA 652

  Fly   646 FLPFFRAHAHIDTKRREPWLFPERTRQVIQNA----VIKRYSYLPLWYTAFYELELTGEPVIRPL 706
            |.||.|.|.....|.::|..|...:  ::.|:    :..||:.||..||.||.....|:.|.|||
Mouse   653 FYPFSRNHNGQGYKDQDPASFGNNS--LLLNSSRHYLNIRYTLLPYLYTLFYRAHSRGDTVARPL 715

  Fly   707 LAQYPLDKEAFGVDNQLLVQDRLLVRPVMQQGVSKVDVYFPAIDDKKNGDWWYDVDTYQR---QE 768
            |.::..|...:|:|.|.|....||:.||:.||..||..|.|       ...|||.:|.:.   ::
Mouse   716 LHEFYDDNNTWGIDRQFLWGPGLLITPVLDQGAEKVKAYVP-------NATWYDYETGEELGWRK 773

  Fly   769 RSGYVSVPVDDFKIPVWQRGGSIVPKKERQRRAST--LMLHDPYTLIICLDRQGKASGSLYLDDE 831
            :|..:.:|.|  ||.:..|||.|.|   .|:.|:|  ....:|..||:.||...:|.|.|:.||.
Mouse   774 QSIEMQLPGD--KIGLHLRGGYIFP---TQQPATTTEASRKNPLGLIVALDENKEARGELFWDDG 833

  Fly   832 KSYAYRQGQRIHVNYEFAHDQ-LVNSFVGKPKYK 864
            :| .....|.|::..||:..| .::..:..|.||
Mouse   834 ES-KDTVAQNIYLFSEFSVTQNHLDVTISSPNYK 866
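
The middle line of each block annotates the aligned columns. A small sketch that tallies those symbols for the final block (Fly 832 to 864), assuming the common pairwise-alignment convention in which `|` marks an identical pair, `:` and `.` mark non-identical pairs, and a space marks a gap column. The page itself does not define these symbols, so treat that reading as an assumption.

```python
from collections import Counter

# Final alignment block, copied verbatim (sequence text only, no coordinates).
fly   = "KSYAYRQGQRIHVNYEFAHDQ-LVNSFVGKPKYK"
match = ":| .....|.|::..||:..| .::..:..|.||"
mouse = "ES-KDTVAQNIYLFSEFSVTQNHLDVTISSPNYK"

# All three rows must cover the same number of columns.
assert len(fly) == len(match) == len(mouse)

# Tally the annotation symbols on the middle line.
symbols = Counter(match)

# Recount independently from the sequences themselves.
identical = sum(a == b and a != "-" for a, b in zip(fly, mouse))
gaps = sum("-" in (a, b) for a, b in zip(fly, mouse))

print("columns:", len(match))
print("symbol counts:", dict(symbols))
print("identical pairs:", identical, "gap columns:", gaps)
```

Under the assumed convention, the number of `|` symbols should equal the number of identical pairs, and the number of spaces should equal the number of gap columns.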

Known Domains:


Indicated by green residues in the alignment.

Gene       Sequence        Domain                 Region      External ID  Identity
GCS2alpha  NP_652145.1     GH31_N                 224..362    CDD:270212   32/155 (21%)
                           GH31_GANC_GANAB_alpha  362..833    CDD:269889   165/511 (32%)
                           DUF5110                809..877    CDD:465360   19/57 (33%)
Mgam       NP_001355804.1  GH31_N                 226..342    CDD:270212   32/150 (21%)
                           GH31_MGAM_SI_GAA       342..705    CDD:269888   117/369 (32%)
                           PD                     934..972    CDD:197472
                           NtCtMGAM_N             988..1102   CDD:465286
                           GH31_N                 1087..1208  CDD:270212
                           Glyco_hydro_31         1189..1690  CDD:460044
                           Trefoil                1828..1867  CDD:238059
                           NtCtMGAM_N             1883..1997  CDD:465286
                           GH31_N                 1982..2103  CDD:270212
                           Glyco_hydro_31         2084..2586  CDD:460044
                           PD                     2720..2764  CDD:197472
                           NtCtMGAM_N             2777..2891  CDD:465286
                           GH31_N                 2876..2997  CDD:270212
                           Glyco_hydro_31         2978..3480  CDD:460044
                           PD                     65..110     CDD:197472
                           NtCtMGAM_N             124..234    CDD:465286   24/116 (21%)
Blue background indicates that the domain is not in the aligned region.