DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Set1 and setd1ba

DIOPT Version :10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:NP_001038599.2 Gene:setd1ba / 567970 ZFINID:ZDB-GENE-050309-289 Length:1844 Species:Danio rerio


Alignment Length:1989 Identity:535/1989 - (26%)
Similarity:804/1989 - (40%) Gaps:541/1989 - (27%)


- Green bases have known domain annotations that are detailed below.


  Fly     8 NLVNNSSNSHDSSLANSKMP---RNFKLLSDPQLVKCGTRLYRYDGLMPGDPSY---PTITPRDP 66
            :|.|...|||....:..|..   |::||:.||.|.|...::|||||.....||:   |....|||
Zfish    42 SLANGMDNSHPICSSGEKRSHHWRSYKLIIDPALKKGSHKVYRYDGHQFSTPSFGMSPVDIVRDP 106

  Fly    67 RNPLIRIRARAVEPLMLLIPRFVIDSDYVGQPPAVEVTIVNLNDNIDKQFLASMLDKCGTSDEIN 131
            |...:..:.:..:   |.:|:|.||..|||:.|..|||...||||:.:.||..|..|.|..:|:.
Zfish   107 RIGRLWTKYKETD---LPVPKFKIDECYVGRVPPKEVTFAKLNDNVREGFLTDMCKKFGDIEEVE 168

  Fly   132 IYHHPITNKHLGIARIVFDSTKGARQFVEKYNQKSVMGKILDVFCDPFGATLKKSLESLTNS--- 193
            |.::|...||||||::||::.|.|:..|:..:..||||.|:.|..||.|...::..:.|.|.   
Zfish   169 ILYNPKNKKHLGIAKVVFETVKAAKDAVQNLHNTSVMGNIIHVELDPKGENRQRYFQRLINGSYT 233

  Fly   194 -----VAGKQLIGPKVTPQWTFQQA--ALEDTE--FIHGYPEKNGEHIKDIYTTQTNHEIPNRSR 249
                 |.|::..  .|:|: :..:|  |.|.:.  |..|.....|       ||.:....| .|.
Zfish   234 PLTLPVGGEEAC--DVSPR-SLAEALMACEPSRRLFEGGSSVVAG-------TTPSGTNTP-MSL 287

  Fly   250 DRNWNRDKERERDRHFKERSRHSSERSYDRDRGMRENVGTSIRRRRTFYRRRSSDISPEDSRDIL 314
            |..::   ...:|....:.:.|:...|           ||...:..::..|:.:..         
Zfish   288 DTAYS---SLRQDTPQSQGTPHTPRPS-----------GTPFSQDSSYSSRQGTPA--------- 329

  Fly   315 IMTRERSRDSDSRPRDYCRSRERESFRDRKRSHEKGRDQPREKREHYYNSSKDREY-RGRDRDRS 378
                            :..:|...|...:.|.||       .|.:..||...:|.| .|      
Zfish   330 ----------------FQANRAESSGGYKSRRHE-------TKFQDAYNRRPERRYVHG------ 365

  Fly   379 AEIDQRDRGSLKYCSRYSLHEYIETDVRRSSNTISSYYSASSLPIASHGFNSCSFPSIENIKTWS 443
                ...||:.:....:..|:..|..        |..::.:..|           |:..|.||  
Zfish   366 ----PTQRGNTEQPPSFKQHQPPEPP--------SPAFTHTPPP-----------PTSANFKT-- 405

  Fly   444 DRRAWTAFQP---------DFH-PVQ--------PPPPPPEEIDNWDEEEHDKNSIVPTHYGCMA 490
               |::.:||         .:| |||        |..|||...|.....:......:|.      
Zfish   406 ---AYSQYQPPIPQEYTVASYHQPVQRELDYRRPPQAPPPPSTDFLPVRDRPTTPPIPE------ 461

  Fly   491 KLQPPVPSNVNFATKLQSVTQPNSDPGTV----------DLDTRIALIFKGKTFGNAPPFL---- 541
              .||.|       :.|..|.|:|.|...          .||:||.::.|        |||    
Zfish   462 --PPPAP-------ETQPTTPPSSTPEPCPSPTQESERNSLDSRIEMLLK--------PFLNERG 509

  Fly   542 ------QMD-----SSDSETDQGKPEVFSDVNS------------DSNNSE--NKKRSCEKNNKV 581
                  :||     ||.|:.....|:..|..:|            ||.:.|  ....|...|::.
Zfish   510 DSDAEVRMDGSPISSSSSQLSPIPPQRPSRPSSTGLEDISPTPLPDSEDDEPIRGTASLLANSRG 574

  Fly   582 LHQPN--EASDISSDEELIGKKDKSKLSLICEKEVNDDNMSLSSLSSQE---------------- 628
            :...|  ..|.:......|.|.|....|...:.|::||.|..:.::|.:                
Zfish   575 MSPTNMHSKSCVGEPRTAIDKMDTGHQSSGEDMEISDDEMPGTPIASGDCDKNIVVNSALSLIQT 639

  Fly   629 --------DPIQTKEG-------AEYKSIMSSYMYSHSNQNPFYYHASGYGHY-----LSGIP-- 671
                    .|:....|       ..:.|.:|.....|...:|.  |:.|..|:     ||.:|  
Zfish   640 IPMPPPGFPPLPHAAGFPLPPHHLPHHSTVSHLPSHHPMLHPL--HSYGMMHFLPVDLLSSLPQL 702

  Fly   672 --------SESASRLFSN---------------------GAY------------VHSE--YLKAV 693
                    ::..||:..:                     |.|            :|.:  .|.::
Zfish   703 LQMPFQMQTQMLSRMAQSQHPYAYPYPAPSANPAAMPFGGPYPPLSVVSAPADTLHGQPWPLPSM 767

  Fly   694 ASFNFDSFSKPYDYNKGALSDQNDGIRQKVKQVIGYIVEELKQILKRDVNKRMIEITAFKHFETW 758
            ..||      |.....|....:.|..:..:..|:..||:|||.|:|:|:|::|:|:.||:.|:.|
Zfish   768 PQFN------PAVPPPGYEPQKEDPHKATIDGVLMAIVKELKAIMKKDLNRKMVEVVAFRKFDEW 826

  Fly   759 WD--EHTSKARSKPLFEKADSTVNTPLNCIKDTSYNEKNPDINLLINAH-----REVADFQSYS- 815
            ||  |.::||...|        |.|...  ||.......|...:  ::|     .|...|:... 
Zfish   827 WDKQELSAKATLTP--------VKTGEG--KDEEKERAKPKETM--SSHLPWNKGEGLGFEGMGL 879

  Fly   816 SIGLRAAMPKLPSFRRIRKHPSPIPTKRNFLER---------DLSDQE-EMVQRSD--------- 861
            .||||..  :||||:..||.| |.||..:..:|         :|.|:| |.:.|:|         
Zfish   880 GIGLRGI--RLPSFKVKRKQP-PEPTSTSDNKRVRPSTPVDDELEDEESERMGRTDGSRVDPAGS 941

  Fly   862 ----------------------SDKEDSNV-----EISDTARSKIKGPVPIQESD---SKSHTSG 896
                                  |.||:|::     |..|.|..::.....::|.|   |:||:|.
Zfish   942 SSKRRPARPLELDSEGEEEEETSGKEESSLSDHEEEPVDDASERLSSGKDLEEEDEKKSESHSSE 1006

  Fly   897 LNS------KRKGSASSFFSSSSSSTSSEAEYEAIDCVEKARTSEEDSPRGYG---QRNLNQRTT 952
            ..|      :...|:||...|.||.:.|.::||:..  |:....||:..|..|   :.:::.||:
Zfish  1007 SESSDSSDDEASSSSSSKSGSDSSGSESSSDYESSS--EEEEEEEEEEERIVGMDDEEDVDARTS 1069

  Fly   953 TIRNRNLVGTMD---VINVRNLCSGSNEFKKENVTKRTKKNIYSDTDEDNDRTLFPALKEKNIST 1014
            |..:.....:.|   |:.|:...:.:....:|...:..:.....:.:.|:        |...:|.
Zfish  1070 TSSSTTSTSSSDEEEVVEVKAPSTPTGPPPEEEPNELGRLEAVDEAEIDH--------KPSMVSL 1126

  Fly  1015 ILSDLEEISKDSCIGL--DENGIEPTILRKIPNTPKLNEECRRSLTPVPPPGY--NEEEIKKKVD 1075
            |.:.:||:...|..||  ||..::..:...:|.|....||......|.|...:  ::::.:.|: 
Zfish  1127 IKTKVEEVRPPSPKGLPADELDVDLEVKIPVPKTEASLEEVGNLRPPTPTGSFADSDQDTRPKI- 1190

  Fly  1076 CKQKPSFEYDRIYSDSEE-----EKEYQERRKRNTEYM--------AQMEREFLEEQEKRIEKSL 1127
                |:.::.|  :...|     |.|....|...|..|        ....:..|...|...:..:
Zfish  1191 ----PTEDFPR--TPGHEGPVPLESETTVPRSLPTPSMHLPLPPSHVPDPQSLLPPPETLPDMPV 1249

  Fly  1128 DKNLQSPNNIVKNNNSPRNKNDETRKTAISQTRSCFESASKVDTTLV-NIISVENDI-NEFGPHE 1190
            ...|.:..:|.:...  |:..|..|.....|:..........||.|. |.:|..:.: :.|....
Zfish  1250 RGRLPTEEDIPRTPG--RDLMDRARGLGKLQSTDTVPVTPGSDTPLTGNSLSSPHILGSPFSYPA 1312

  Fly  1191 EGDVLTNGCNK----------MYTNSKGKT------KRTQS------PVYSE---GGSSQASQAS 1230
            :..||:.|..:          .:.:|.|.:      ::..|      |::.|   ..|.|||..:
Zfish  1313 QSPVLSAGIPRTPGRDLTFAPAFPDSAGLSAGLPIHRKASSEILEEKPLFKEPLLSASPQASLPN 1377

  Fly  1231 QVA----------------------------LEHCYSLPPHS---------VSLGDYPSGK---- 1254
            ..|                            :|:.:...|..         |.|.|.||.|    
Zfish  1378 NAASSPFPGPPLPTASLPEPALPPQGSPPASIENSFPASPKELPVPMIDVPVPLDDTPSKKKLVR 1442

  Fly  1255 VNETKNILKREAENIAIVS-------QMTRTGPGRPRK-------DPICIQKKKRD---LAPRMS 1302
            ....|.|...|...:.::.       .:....|..|.:       :|...:|::..   :.|::.
Zfish  1443 SKNKKGIQDSEEPQVTLIEASSLPELPVNNQYPDLPSESIKEEDGEPAFSEKEESQVPTIIPKVE 1507

  Fly  1303 NV---------KSKMTPNGDEWPDL---AHKNVHFVPCDMYKTRDQNEEMVILYTFLTKGIDAED 1355
            ..         |::....|  |.:|   .|..|.......:..|...|||.|||.....|||.||
Zfish  1508 ETSFYVEEPIQKTRRQRRG--WQELLLSMHSPVASPRRPSFMPRSDFEEMTILYDIWNDGIDEED 1570

  Fly  1356 INFIKMSYLDHLHKEPYAMFLNNTHWVDHCTTDRAFWPPPS-------KKRRKDDELIRHKTGCA 1413
            |.::|::|...|.::....:||:|.||.|        ||.:       ||:||:|.:..|.||||
Zfish  1571 IRYLKITYDKMLQQDNAHDWLNDTLWVHH--------PPTNMGSATGVKKKRKEDGIRDHVTGCA 1627

  Fly  1414 RTEGFYKLDVREKAKHKYHYAKANTEDSFNEDR--SDEPTALTNHHHNKLISKMQGIS----REA 1472
            |:||:||:|.::|.|:            .|..|  |:||...|   ..|.|.....:|    .|.
Zfish  1628 RSEGYYKIDKKDKMKY------------LNSSRLQSEEPDVDT---QGKSIPAQPQVSTRAGSER 1677

  Fly  1473 RSNQRRLLTAFGSMGESELLKFNQLKFRKKQLKFAKSAIHDWGLFAMEPIAADEMVIEYVGQMIR 1537
            ||.|||||::|..  :|:||||||||||||:::|.:|.||||||||||||||||||||||||.||
Zfish  1678 RSEQRRLLSSFSC--DSDLLKFNQLKFRKKKIRFCRSHIHDWGLFAMEPIAADEMVIEYVGQNIR 1740

  Fly  1538 PVVADLRETKYEAIGIGSSYLFRIDMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVI 1602
            .|:||:||.:||..||||||:||:|.:||||||||||.||||||||||||||||||:||:|||||
Zfish  1741 QVIADMREKRYEDEGIGSSYMFRVDHDTIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVI 1805

  Fly  1603 YSKQPIGINEEITYDYKFPLEDEKIPCLCGAQGCRGTLN 1641
            ||:|||.:|||||||||||:|||||||||||:.||||||
Zfish  1806 YSRQPINVNEEITYDYKFPIEDEKIPCLCGAENCRGTLN 1844

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Set1NP_001015221.1 RRM_Set1 99..191 CDD:409745 36/91 (40%)
U2AF_lg 255..>386 CDD:273727 17/131 (13%)
N-SET 1352..1496 CDD:463344 56/156 (36%)
SET_SETD1 1490..1637 CDD:380946 118/146 (81%)
setd1baNP_001038599.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 18..58 5/15 (33%)
RRM_Set1B 136..228 CDD:409965 36/91 (40%)
PABP-1234 <145..333 CDD:130689 55/237 (23%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 269..288 6/26 (23%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 295..562 67/366 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 838..871 7/44 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 890..1108 50/220 (23%)
PHA03247 <1136..1500 CDD:223021 62/372 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1160..1341 32/189 (17%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1367..1499 20/131 (15%)
N-SET 1556..1699 CDD:463344 61/167 (37%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1647..1678 9/33 (27%)
RxxxRR motif. /evidence=ECO:0000250|UniProtKB:P38827 1678..1683 3/4 (75%)
SET_SETD1 1693..1840 CDD:380946 118/146 (81%)

Return to query results.
Submit another query.