DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set1 and EHMT2

DIOPT Version :10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:XP_006715037.2 Gene:EHMT2 / 10919 HGNCID:14129 Length:1395 Species:Homo sapiens


Alignment Length:1035 Identity:192/1035 - (18%)
Similarity:313/1035 - (30%) Gaps:388/1035 - (37%)


- Green bases have known domain annotations that are detailed below.


  Fly   846 LERDLSDQEEMVQRSDSDKEDSNVEISDTARSKIKGPVPIQESDSKSHTSGLNSKRKGSASSFFS 910
            |...||::||..:..:.::|:...|                |.:.:...||..|.|.||:..   
Human   472 LTEQLSEEEEEEEEEEEEEEEEEEE----------------EEEEEDEESGNQSDRSGSSGR--- 517

  Fly   911 SSSSSTSSEAEYEAIDCVEKARTS-EEDSPRGYGQRNLNQ----RTTTIRNRNLVGTMDVINVRN 970
                              .||:.. .:|||.....|...:    |....|..|.||:........
Human   518 ------------------RKAKKKWRKDSPWVKPSRKRRKREPPRAKEPRGVNGVGSSGPSEYME 564

  Fly   971 LCSGSNEFKKENVTKRTKKNIYSDTDEDNDRTLFPALKEKNISTILSDLEEISK---DSCIGLDE 1032
            :..||.|...|.........:.:||........|..|...:.......::.||:   ..|:..:.
Human   565 VPLGSLELPSEGTLSPNHAGVSNDTSSLETERGFEELPLCSCRMEAPKIDRISERAGHKCMATES 629

  Fly  1033 -----NGIEPTILRKIPNTP--------------------------------------------- 1047
                 :|....||::....|                                             
Human   630 VDGELSGCNAAILKRETMRPSSRVALMVLCETHRARMVKHHCCPGCGYFCTAGTFLECHPDFRVA 694

  Fly  1048 -KLNEECRRSL----------------------------------TPVPPPGYNEEEIKKKVDCK 1077
             :.::.|...|                                  .|.|||  ..:::..:.|..
Human   695 HRFHKACVSQLNGMVFCPHCGEDASEAQEVTIPRGDGVTPPAGTAAPAPPP--LSQDVPGRADTS 757

  Fly  1078 QKPSFEY---------------DRIYSDSEE-----------------------EK----EYQER 1100
            | ||...               |.|.|....                       ||    :..||
Human   758 Q-PSARMRGHGEPRRPPCDPLADTIDSSGPSLTLPNGGCLSAVGLPLGPGREALEKALVIQESER 821

  Fly  1101 RKRNTEYMAQMEREFLEEQEKRIEK-------SLDKNLQSPNNIVKNNNSPRNKNDETRKTAISQ 1158
            ||:...:..|:   :|..::..::|       :||.|.||              :.::::|.:  
Human   822 RKKLRFHPRQL---YLSVKQGELQKVILMLLDNLDPNFQS--------------DQQSKRTPL-- 867

  Fly  1159 TRSCFESASKVDTTLVNIISVENDINEFGPHEEGDVLTNGCNKMYTNSKGKTKRTQ--------- 1214
                ..:|.|....:.:::                 |..|.|   .|:..|.:||.         
Human   868 ----HAAAQKGSVEICHVL-----------------LQAGAN---INAVDKQQRTPLMEAVVNNH 908

  Fly  1215 -----------SPVYS--EGGSSQASQASQVALEHCYS--LPPHSVSLGDYPSGKVNETKNILKR 1264
                       ..|||  |.||:....|:::......|  |....|.:....||  ..|..|...
Human   909 LEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGQVDVNAQDSG--GWTPIIWAA 971

  Fly  1265 EAENIAIVSQMTRTGPGRPRKDPICIQKKKRDLAPRMSNVKSKMTPNGD---EWPDLAHK----- 1321
            |.::|.::..:...|......|                ||..::....:   .|......     
Human   972 EHKHIEVIRMLLTRGADVTLTD----------------NVSERLVEEENICLHWASFTGSAAIAE 1020

  Fly  1322 ---NVHFVPCDMY------------KTRDQNEEMVILYTFLTKGIDAEDINFIKMSYLDHLHKEP 1371
               |..   ||::            ..|:...:.|:|  ||::|.:.|..|           || 
Human  1021 VLLNAR---CDLHAVNYHGDTPLHIAARESYHDCVLL--FLSRGANPELRN-----------KE- 1068

  Fly  1372 YAMFLNNTHWVDHCTTDRA-FWPPPSKKRRKDDELIRHKTG--CARTEGFYKLDVR--------- 1424
                 .:|.|  ..|.:|: .|......|:     :|...|  ..|||.....||.         
Human  1069 -----GDTAW--DLTPERSDVWFALQLNRK-----LRLGVGNRAIRTEKIICRDVARGYENVPIP 1121

  Fly  1425 -------EKAKHKYHYAKANTEDS-FNEDRSDEPTALTNHHH----------NKLISKMQGISRE 1471
                   |.....|.|...|.|.| .|.||:     :|:..|          |.|..::.  .|.
Human  1122 CVNGVDGEPCPEDYKYISENCETSTMNIDRN-----ITHLQHCTCVDDCSSSNCLCGQLS--IRC 1179

  Fly  1472 ARSNQRRLLTAFGSMGESELLKFNQ-------LKFR------KKQLKFAKSAIHDWGLFAMEPIA 1523
            ......|||..|..:....:.:.||       .|.|      |.:|:..::|...||:.|::.|.
Human  1180 WYDKDGRLLQEFNKIEPPLIFECNQACSCWRNCKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIP 1244

  Fly  1524 ADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRIDMET----IIDATKCGNLARFINHSCN 1584
            ....:.||||::|....||:||        ..||||.:|.:.    .|||...||::|||||.|:
Human  1245 QGTFICEYVGELISDAEADVRE--------DDSYLFDLDNKDGEVYCIDARYYGNISRFINHLCD 1301

  Fly  1585 PNCY-AKVITIESE---KKIVIYSKQPIGINEEITYDYK---FPLEDEKIPCLCGAQGCR 1637
            ||.. .:|..:..:   .:|..:|.:.|...||:.:||.   :.::.:...|.||::.|:
Human  1302 PNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCK 1361

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set1NP_001015221.1 RRM_Set1 99..191 CDD:409745
U2AF_lg 255..>386 CDD:273727
N-SET 1352..1496 CDD:463344 35/173 (20%)
SET_SETD1 1490..1637 CDD:380946 49/170 (29%)
EHMT2XP_006715037.2 PHA03247 <126..382 CDD:223021
2A1904 <396..504 CDD:273344 8/47 (17%)
EHMT_ZBD 598..727 CDD:411018 11/128 (9%)
ANKYR 806..1072 CDD:440430 58/348 (17%)
ANK repeat 864..893 CDD:293786 7/54 (13%)
ANK repeat 928..960 CDD:293786 6/31 (19%)
ANK repeat 962..993 CDD:293786 6/32 (19%)
ANK repeat 1002..1033 CDD:293786 4/33 (12%)
ANK repeat 1035..1066 CDD:293786 7/32 (22%)
SET_EHMT2 1133..1371 CDD:380931 67/244 (27%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.