DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set1 and ASH1L

DIOPT Version :10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:NP_001353106.1 Gene:ASH1L / 55870 HGNCID:19088 Length:2969 Species:Homo sapiens


Alignment Length:2010 Identity:358/2010 - (17%)
Similarity:633/2010 - (31%) Gaps:699/2010 - (34%)


- Green bases have known domain annotations that are detailed below.


  Fly    54 GDPSYPTITPRDPRNPLIRIRAR-----AVEPLMLLIPRFVID--SDYVGQPPAVEVTIVNLNDN 111
            |:.:.|:.:|....|||.|....     |..||:|.....:|:  |:.||:.   :.|..:.:.|
Human   551 GESNLPSPSPTVSVNPLTRSPPETSSQLAPNPLLLSSTTELIEEISESVGKN---QFTSESTHLN 612

  Fly   112 IDKQFLA-SMLDKC-GTSDEINIYHHPITNKHLGIARI---------------VFDSTKGARQFV 159
            :..:.:. |:..:| |...|:|    .....|:.|.||               :...|.....|.
Human   613 VGHRSVGHSISIECKGIDKEVN----DSKTTHIDIPRISSSLGKKPSLTSESSIHTITPSVVNFT 673

  Fly   160 EKYNQKSV--MGKI--LDVFCDPFGATLKKSLESLTNSVAGKQL---IGPKVTPQWTFQQAALED 217
            ..::.|..  :|.:  .|..|        :..|||:.|:..|.|   .|.|  |:||        
Human   674 SLFSNKPFLKLGAVSASDKHC--------QVAESLSTSLQSKPLKKRKGRK--PRWT-------- 720

  Fly   218 TEFIHGYPEKNGEHIKDIYTTQTNHEIPNRSRDRNWNRDKERERDRHFKERSRHSSERSYDRDRG 282
                                     ::..||..|: .:..|.||...||..|..|...|......
Human   721 -------------------------KVVARSTCRS-PKGLELERSELFKNVSCSSLSNSNSEPAK 759

  Fly   283 MRENVGTSIRRRRTFYRRRSSDISPEDSRDILIMTRER------------------SRD--SD-- 325
            ..:|:|........|.:||...:|...:..:.::....                  |.|  ||  
Human   760 FMKNIGPPSFVDHDFLKRRLPKLSKSTAPSLALLADSEKPSHKSFATHKLSSSMCVSSDLLSDIY 824

  Fly   326 ----SRPRD----------------------YCRSRERE------------SFRD--------RK 344
                .||:.                      ..:|:|.:            ||:.        :|
Human   825 KPKRGRPKSKEMPQLEGPPKRTLKIPASKVFSLQSKEEQEPPILQPEIEIPSFKQGLSVSPFPKK 889

  Fly   345 RSHEK--------------------GRDQP----REKREHYYNS----SKDREYRGRDRDRS--- 378
            |...|                    ..:.|    .|...|..:|    |:|:.....|.|.|   
Human   890 RGRPKRQMRSPVKMKPPVLSVAPFVATESPSKLESESDNHRSSSDFFESEDQLQDPDDLDDSHRP 954

  Fly   379 -------------AEIDQRDRGSL-----------KYCSRYS-LHEYIETDVRRSS--------- 409
                         .:|.:|:.|.|           |...|.. |::.:.:.|..|:         
Human   955 SVCSMSDLEMEPDKKITKRNNGQLMKTIIRKINKMKTLKRKKLLNQILSSSVESSNKGKVQSKLH 1019

  Fly   410 NTISSYYSASSLPIAS------------------------HGFNSCSFPSIENIKTWSDRRAWTA 450
            ||:||..:.....:..                        :|..|.|..|:..::..:.:.|.:|
Human  1020 NTVSSLAATFGSKLGQQINVSKKGTIYIGKRRGRKPKTVLNGILSGSPTSLAVLEQTAQQAAGSA 1084

  Fly   451 FQPDFHPVQPPPPPPEEIDNWDEEEHDKNSIVPTHYGCMAKLQPPVPSNVNFATKLQSVTQPNSD 515
            ......|:.|......||         ..|.:.:.....:..|.||.|:..|       .:|:|.
Human  1085 LGQILPPLLPSSASSSEI---------LPSPICSQSSGTSGGQSPVSSDAGF-------VEPSSV 1133

  Fly   516 PGTVDLDTR-----IALIFKGKTFGN---APPFLQMDS------------------SDSETDQGK 554
            | .:.|.:|     ..|..|..:.|.   :||.|..:|                  |:|.:|:..
Human  1134 P-YLHLHSRQGSMIQTLAMKKASKGRRRLSPPTLLPNSPSHLSELTSLKEATPSPISESHSDETI 1197

  Fly   555 PE---VFSDVNSDSNNSE----NKKRSCEKNNKVLHQPNEASDISSDEELIGKKDKSKLSLICEK 612
            |.   :.:|.||.|:.:|    .|||.....:..|..|..::.:||.:|    |.|.|    |::
Human  1198 PSDSGIGTDNNSTSDRAEKFCGQKKRRHSFEHVSLIPPETSTVLSSLKE----KHKHK----CKR 1254

  Fly   613 EVNDDNMSLSSLSSQ--------------EDPIQTKEGAEYKSIMSSYMYSHS------------ 651
            . |.|.:|...:..|              :||....|..|..|.:|....:|.            
Human  1255 R-NHDYLSYDKMKRQKRKRKKKYPQLRNRQDPDFIAELEELISRLSEIRITHRSHHFIPRDLLPT 1318

  Fly   652 ----NQNPFYYHAS--------------------------------------------------- 661
                |.|.||.|.|                                                   
Human  1319 IFRINFNSFYTHPSFPLDPLHYIRKPDLKKKRGRPPKMREAMAEMPFMHSLSFPLSSTGFYPSYG 1383

  Fly   662 -------------GYGHY-------LSGIPSESASRLFSNGAYVHSEYLKAVASFNFDSFSKPYD 706
                         |.|:|       ....||.|.:......:|:|:.:|          ...|..
Human  1384 MPYSPSPLTAAPIGLGYYGRYPPTLYPPPPSPSFTTPLPPPSYMHAGHL----------LLNPAK 1438

  Fly   707 YNKGALSDQNDGIRQKVKQVIGYIVEELKQILKRDVNKRMIEITAFKHFETWWDEHTSKARSKPL 771
            |:|.         :.|:.:...::......:|.......:....|:    .|..||..:.|.|..
Human  1439 YHKK---------KHKLLRQEAFLTTSRTPLLSMSTYPSVPPEMAY----GWMVEHKHRHRHKHR 1490

  Fly   772 FEKADSTVNTPLNCIKDTSYNEKNPDINLLINAHREVADFQSYSSIGLRAAMPKLPSFRRIRKH- 835
            ..:                 :.:.|.:::...:.|.|.:.......|..|...:.....:.|.| 
Human  1491 EHR-----------------SSEQPQVSMDTGSSRSVLESLKRYRFGKDAVGERYKHKEKHRCHM 1538

  Fly   836 --PSPIPTKRNFLERDLSDQEEMVQRSDSD--------KEDSNVEISDTARS-KIKGPVPIQE-S 888
              |...|:| :.:.|    :|:.|.|..|:        :....::.|:::.| .:.|..|..| :
Human  1539 SCPHLSPSK-SLINR----EEQWVHREPSESSPLALGLQTPLQIDCSESSPSLSLGGFTPNSEPA 1598

  Fly   889 DSKSH----TSGLNSKRKGSASS-----------FFSSSSSSTSSEAEYEAIDCVEKA------- 931
            .|..|    ||.:.|.|..:.:|           .||:..:|.:.....|::...|:|       
Human  1599 SSDEHTNLFTSAIGSCRVSNPNSSGRKKLTDSPGLFSAQDTSLNRLHRKESLPSNERAVQTLAGS 1663

  Fly   932 -RTSEEDSPRGYGQRNLN---QRTTTIRNRNLVGTMDVINVRNLCSGSNEFKKENVTKRTKKNIY 992
             .||::.|.|.....|.:   :|:::....:.|..:...:.|.:.||.     ::|....::.:.
Human  1664 QPTSDKPSQRPSESTNCSPTRKRSSSESTSSTVNGVPSRSPRLVASGD-----DSVDSLLQRMVQ 1723

  Fly   993 SDTDEDNDRTLFPALKEKNISTILSDLEEISKDSCIGLDENGIEPTILRKIPNTPKLNEECRRS- 1056
            ::..|..::::...:...:.....|.....|||..:|..::.:.|.:         .::.|..| 
Human  1724 NEDQEPMEKSIDAVIATASAPPSSSPGRSHSKDRTLGKPDSLLVPAV---------TSDSCNNSI 1779

  Fly  1057 ------LTPVPPPGYNEEEIKKKVDCKQKPSFEYDRIYSDSEEEKEYQERRKRNTEYMAQMEREF 1115
                  ||....|.:.:..:.:.:..:.:....||:|.:           .|:|.:::.::.:  
Human  1780 SLLSEKLTSSCSPHHIKRSVVEAMQRQARKMCNYDKILA-----------TKKNLDHVNKILK-- 1831

  Fly  1116 LEEQEKRIEKSLDKNLQSPNNIVKNNNSPRNKNDETRKTAISQTRS--CFESASKV--------- 1169
                    .|.|.:..::.||.||      .:....||..:....|  .|::|..|         
Human  1832 --------AKKLQRQARTGNNFVK------RRPGRPRKCPLQAVVSMQAFQAAQFVNPELNRDEE 1882

  Fly  1170 --------DTTLVNIISVENDINEFGPHEEGDVLTNGCNKMYTNSKG-----KTKRTQSPVYSEG 1221
                    ||....|.:|...:|....|::|           ...||     :|::.|.|:..|.
Human  1883 GAALHLSPDTVTDVIEAVVQSVNLNPEHKKG-----------LKRKGWLLEEQTRKKQKPLPEEE 1936

  Fly  1222 GSSQASQASQVALEHCYSLPPHSVSLGDYPSGKVNETKNILKREAENIAIVSQMTRTGPGRPRKD 1286
            ........::..:|  ...|..:.:....|...:....:::.||.:            |.||   
Human  1937 EQENNKSFNEAPVE--IPSPSETPAKPSEPESTLQPVLSLIPREKK------------PPRP--- 1984

  Fly  1287 PICIQKKKRDLAPRMSNVKSKMTPNGDEWPDLAHKNVHFVPCDMYKTRDQNEEMVILYTFLTKGI 1351
                .|||...|...|                          |:|||.|....::          
Human  1985 ----PKKKYQKAGLYS--------------------------DVYKTTDPKSRLI---------- 2009

  Fly  1352 DAEDINFIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAFWPPPSKKRRKDDELIRHKTGCARTE 1416
               .:...|:.|....|:  |.:|....|.|        |:......|:|            |.:
Human  2010 ---QLKKEKLEYTPGEHE--YGLFPAPIHVV--------FFVSGKYLRQK------------RID 2049

  Fly  1417 GFYKLDVREKAKHKYHYAKANT---------------------EDSFNEDRSDEPT--ALTNHHH 1458
            .....|:..:.||...|.|.:.                     ..:.|..:.|:.|  ...:...
Human  2050 FQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVDVKPLSGYEATTCNCKKPDDDTRKGCVDDCL 2114

  Fly  1459 NKLI----SKMQGISREARSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAM 1519
            |::|    |.......|...|||        :...|.::..: :||.::        ..||:...
Human  2115 NRMIFAECSPNTCPCGEQCCNQR--------IQRHEWVQCLE-RFRAEE--------KGWGIRTK 2162

  Fly  1520 EPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSS-YLFRIDMETIIDATKCGNLARFINHSC 1583
            ||:.|.:.:|||:|:::..  .:.|....|.....|. |...:|...:||:.:.||.||||||||
Human  2163 EPLKAGQFIIEYLGEVVSE--QEFRNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSC 2225

  Fly  1584 NPNCYAKVITIESEKKIVIYSKQPIGINEEITYDY---KFPLEDEKIPCLCGAQGCRGTL 1640
            :|||..:..::....:|.:|:.:.:....|:||||   .|.:|.::: |.||.:.|||.:
Human  2226 DPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNFHSFNVEKQQL-CKCGFEKCRGII 2284

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set1NP_001015221.1 RRM_Set1 99..191 CDD:409745 19/112 (17%)
U2AF_lg 255..>386 CDD:273727 38/242 (16%)
N-SET 1352..1496 CDD:463344 27/170 (16%)
SET_SETD1 1490..1637 CDD:380946 44/150 (29%)
ASH1LNP_001353106.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..70
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 118..143
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 501..525
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 537..583 8/31 (26%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 824..845 2/20 (10%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 878..966 12/87 (14%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1100..1128 8/43 (19%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1151..1231 18/79 (23%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1243..1281 10/46 (22%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1489..1508 1/35 (3%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1580..1711 27/130 (21%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1741..1761 4/19 (21%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1911..1991 18/111 (16%)
Catalytic domain 2069..2288 57/236 (24%)
AWS 2092..2143 CDD:197795 10/58 (17%)
SET_ASH1L 2146..2286 CDD:380951 46/151 (30%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2288..2346
Bromo_ASH1 2443..2548 CDD:99955
PHD_ASH1L 2586..2628 CDD:277023
BAH_polybromo 2665..2799 CDD:240068
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2825..2856
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2876..2919
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.