Sequence 1: | NP_726773.2 | Gene: | trr / 31149 | FlyBaseID: | FBgn0023518 | Length: | 2431 | Species: | Drosophila melanogaster |
---|---|---|---|---|---|---|---|---|---|
Sequence 2: | NP_071900.2 | Gene: | NSD1 / 64324 | HGNCID: | 14234 | Length: | 2696 | Species: | Homo sapiens |
Alignment Length: | 2567 | Identity: | 491/2567 - (19%) |
---|---|---|---|
Similarity: | 803/2567 - (31%) | Gaps: | 828/2567 - (32%) |
- Green bases have known domain annotations that are detailed below.
Fly 101 QDVETADGVWDARDQQIIVCNF---GSGTEMGAIKAEDADKQSEYRISTPRNSQSNPLLHRNTAF 162
Fly 163 TSFTKKEGASSSASSS-----SSTASVISIEPSGSGQDHAENSGKSEDLDYVLMPASGADSSTSV 222
Fly 223 GNSTGTGTPAGTPIGATTSTIILNANNGTAGVSGAGTTTILTQ---KSGHTNYNIFNTTATGSQT 284
Fly 285 PTTTLLNRVNL----HPKMKTQLMVNAKKLSEVTQTTAKVSIGNKTISVPLLKPLMSASGAATAG 345
Fly 346 GATIVESKQLLQPGGQVTTVMSAAQQSGGQQVHPHVHSHAHHNFTKLIKRGPKNSGTIVSFSGLQ 410
Fly 411 IKPANTKIVATKVV----------SKKMLQLQQHQQQIQQQQQLQQLQVT---SGGGLAP----P 458
Fly 459 TGS--------IVTI----------TTTNPSQTYAMVQDSATVGPAAHSEDDAPAPRKITAYSEN 505
Fly 506 LQKILNKSKSQESTGGPEEFTNINSVVIKPLDKNTLNCPPSFNIFKQQQHSQAAQSQSISAVGSG 570
Fly 571 AGTPVTFTMASGNASDLATTSTVSVSA-GTICINSPMMGTRPIISIQNKNISLVLSKTTMAQQKP 634
Fly 635 KMITTTTLSSQAALQMHHALIQDSSADKAGSSANSGSATSGASMQLKLTTANTPTKLSVSLAPDV 699
Fly 700 VKLEEVGSESKAKLLVKQEAVVKDSTG-TPTSEERAEEIGTPEKRLNANATMTAINQVQNQSANQ 763
Fly 764 IQMATSTSTASNPSTPNPTVNATPMNNQRSAAEDNALLKQLLQNNSSSHSLNQISITSAHVGSAS 828
Fly 829 ASAPLSARKVINVRAPSMGKVRSLEDQLARPVIPPVPTATQAAGSSSSSGSVATSTTTTTVASGG 893
Fly 894 SSQQVATASATALPVSAVAITTPGVGGEAKLEQKSDQP--AAIMQNQSQNQAPPPPPPPQQQQQQ 956
Fly 957 QLHQPQQLQPSPHQVK-QTVQIVSKETSFISGPVAAKTLVTEATSKPAELLPPPPYEMATAPISN 1020
Fly 1021 VTISISTKQAAPKELQMKPKAVAMSLPMEQGDESLPEQAEPPLHSEQGATAAGVAPHSGGPLVSA 1085
Fly 1086 QWTNNHLEGGVATTKIPFKPGEPQKRKLPMHPQLDEKQIQQQAEIPISTSLPTTPTGQG-----T 1145
Fly 1146 PDKVQ--LISAIATY---VKKSGVPNEAQPIQNQSQGQVQMQAQMQATMQGHLSGQMSGQISGHA 1205
Fly 1206 AGQ-------------------IPAQMHLQVQHQLHMAVHPQQQQQQLHQNQPQNATIPLPVTGQ 1251
Fly 1252 GAVPIPVPTMESKAGDQRKRRKREVQKPRRTNLNAGQAGGALKDLTGPLPAGAMVQLAG------ 1310
Fly 1311 -----------MPPGTQYIQGAA--SGTGHVITS--TGQGVTLGGVGASTGASSSPML------- 1353
Fly 1354 KKRVRKFSK----------------VEEDHDAFTEKLLTHIRQMQPLQVLEPHLNRNFHFLIGSN 1402
Fly 1403 ETSGGGSPASMSSAASAGSSSAGGGKLKGGSRGWPLSRHLEGLEDCDGTVLGRYGRVNLPGIPSL 1467
Fly 1468 YDSERFGGSRGLVGGSARTRSPSPAESPGAEKMLPMSSIQNDF---YDQEFSTHMERNPRERLVR 1529
Fly 1530 HIGAVKDCNLETVDLVESEGVAAWATLPRLTRYPGLILLNGNSRCHGRM----SPVALPEDPLTM 1590
Fly 1591 RFPVSPLLRSCGEELRKTQQMELGMGPLGNNNNNNYQQKNQNVILALPASASENIAGVLRDLANL 1655
Fly 1656 LHLAPA---------LTCKIIEDKIGNKLEDQFMNQDDEKHVDFKRPLSQVSHGHLRKILNGRRK 1711
Fly 1712 LCRSCGNVVHATGLRVPRHSVPALEEQLPRLAQLMDMLPRKSVPPPFVYFCDRACFARFKWNGKD 1776
Fly 1777 GQAEAASLLLQPAGGSAVKSSNGDS----PGSF-----CASSTAPAEMVVKQEPEDEDEKTPSVP 1832
Fly 1833 GNPTNIPAQRKCIVKCFSADCFTTDSAPSGLELDGTAGAGTGAGPVNNTVWETETSGLQLEDTRQ 1897
Fly 1898 CVFCNQRGDGQADGPSRLLNFDVDKWVHLNCALWSNGVYETVSGALMNFQTALQAGLSQACSACH 1962
Fly 1963 QPGATIK-CFKSRCNSLYHLPCAIREECVFYKNKSVHCSVHGHAHAGITMGAGAGATTGAGLG-- 2024
Fly 2025 -------------------GS--VADNELSSLVVHRRVFVDR-----DENRQVATVMHYSELSNL 2063
Fly 2064 L------RVGNMTFLNV----GQLLPHQLEAFHTPHYIYPIGYKVSRYYW-----CVRR--PNR- 2110
Fly 2111 -RCRYICSIAEAGCKPEFRIQVQDAGDKEPEREFRGSSPSAVWQQILQPITRLRKVHKWLQLFPQ 2174
Fly 2175 HISGEDLFG---------LTEPAIVRI--LESLPGIETLTDYRFKYGRNP--------------- 2213
Fly 2214 ------LLEFPLA------INPSGAARTEPKQRQLLVWRKPHTQRTAGSCSTQRMANSAAIAGEV 2266
Fly 2267 ACPYSKQFVHSKSSQYKKMKQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEV-- 2329
Fly 2330 SEIREKQYESKNRGIYMFRLDEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKR 2394
Fly 2395 KIYRGEELSYDYKFDIEDESHKIPCACGAPNC 2426 |
Gene | Sequence | Domain | Region | External ID | Identity |
---|---|---|---|---|---|
trr | NP_726773.2 | PHA03255 | 184..>320 | CDD:165513 | 27/142 (19%) |
ePHD2_KMT2C_like | 1898..2002 | CDD:277136 | 23/104 (22%) | ||
FYRN | 2068..2118 | CDD:283589 | 16/62 (26%) | ||
FYRC | 2126..2215 | CDD:197781 | 18/120 (15%) | ||
SET | <2266..2431 | CDD:225491 | 59/163 (36%) | ||
SET | 2291..2413 | CDD:214614 | 46/123 (37%) | ||
PostSET | 2415..2431 | CDD:214703 | 8/12 (67%) | ||
NSD1 | NP_071900.2 | Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 207..252 | 9/50 (18%) | |
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 281..311 | 5/36 (14%) | |||
MSH6_like | 319..429 | CDD:99898 | 18/128 (14%) | ||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 487..514 | 8/37 (22%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 872..891 | 4/18 (22%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 936..1035 | 20/100 (20%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1067..1093 | 6/39 (15%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1112..1134 | 4/21 (19%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1243..1272 | 9/43 (21%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1294..1344 | 9/83 (11%) | |||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1382..1428 | 12/53 (23%) | |||
ING | <1431..1587 | CDD:331088 | 49/267 (18%) | ||
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite | 1480..1534 | 18/92 (20%) | |||
PHD1_NSD1_2 | 1545..1587 | CDD:277118 | 12/69 (17%) | ||
PHD2_NSD1 | 1592..1638 | CDD:277120 | 14/49 (29%) | ||
PHD3_NSD1 | 1639..1692 | CDD:277123 | 10/56 (18%) | ||
PHD4_NSD1 | 1709..1748 | CDD:277126 | 7/38 (18%) | ||
WHSC1_related | 1754..1848 | CDD:99899 | 26/124 (21%) | ||
AWS | 1891..1941 | CDD:197795 | 16/71 (23%) | ||
SET | 1942..2065 | CDD:214614 | 46/131 (35%) | ||
S-adenosyl-L-methionine binding | 1952..1954 | 0/1 (0%) | |||
S-adenosyl-L-methionine binding | 1994..1997 | 0/2 (0%) |