DRSC/TRiP Functional Genomics Resources


Protein Alignment trr and Nsd2

DIOPT Version: 9

Sequence 1: NP_726773.2  Gene: trr / 31149  FlyBaseID: FBgn0023518  Length: 2431  Species: Drosophila melanogaster
Sequence 2: NP_001074571.2  Gene: Nsd2 / 107823  MGIID: 1276574  Length: 1366  Species: Mus musculus


Alignment Length: 1226  Identity: 231/1226 (18%)
Similarity: 375/1226 (30%)  Gaps: 468/1226 (38%)
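
The percentages above follow from the counts and the alignment length. The sketch below recomputes them; the helper name `percent` is hypothetical, and the page does not state its rounding rule. The assumption here is that the summary truncates to a whole percent, which is what reproduces the 18%, 30% and 38% shown.

```python
ALIGNMENT_LENGTH = 1226
COUNTS = {"identity": 231, "similarity": 375, "gaps": 468}


def percent(count: int, length: int, truncate: bool = True) -> int:
    """Return count/length as a whole percent.

    truncate=True drops the fractional part (assumed to be what the
    summary line does); truncate=False rounds to the nearest integer.
    """
    if length <= 0:
        raise ValueError("alignment length must be positive")
    if truncate:
        return (100 * count) // length
    return round(100 * count / length)


truncated = {name: percent(n, ALIGNMENT_LENGTH) for name, n in COUNTS.items()}
rounded = {name: percent(n, ALIGNMENT_LENGTH, truncate=False) for name, n in COUNTS.items()}

print(truncated)  # {'identity': 18, 'similarity': 30, 'gaps': 38}
print(rounded)    # {'identity': 19, 'similarity': 31, 'gaps': 38}
```

Rounding to the nearest integer would give 19% and 31% for identity and similarity, so the two conventions are worth keeping apart when comparing these figures with other tools.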


- Green residues have known domain annotations, which are detailed below.


  Fly  1468 YDSERFGGSRGLVGGSARTRSPSPAESPGAEKMLPMSSIQNDFYDQEFSTHMERNPRERLVRH-I 1531
            |||..   .:|||..:..::..||     |:|.:|:.        :|...:..|: |:.|::: :
Mouse   175 YDSLL---EQGLVEAALVSKISSP-----ADKKIPVK--------KESCPNTGRD-RDLLLKYNV 222

  Fly  1532 GAVKDCNLETVDLVESE--GVAAWATL----PRL---TRYPG----------------------- 1564
            |          |||.|:  |...|..:    |.|   |:..|                       
Mouse   223 G----------DLVWSKVSGYPWWPCMVSADPLLHNHTKLKGQKKSARQYHVQFFGDAPERAWIF 277

  Fly  1565 ---LILLNGNSR----CHGRMSPVALPEDPLTMRFPVSPLLRSCGEELRKTQQMELGMGPLGNNN 1622
               |:...|..:    |...........:.:.:..|:|..||:         |.|:|        
Mouse   278 EKSLVAFEGEEQFEKLCQESAKQAPTKAEKIKLLKPISGRLRA---------QWEMG-------- 325

  Fly  1623 NNNYQQKNQNVILALPASASENIAG-----VLRDLANLLHLAP--ALTCKIIEDKIGNKLEDQFM 1680
                       |:....:||.:|..     ....:.:.|||.|  |....|:.:.:|..::....
Mouse   326 -----------IVQAEEAASMSIEERKAKFTFLYVGDQLHLNPQVAKEAGIVTEPLGEMVDSSGA 379

  Fly  1681 NQDDEKHVDFKRPLSQVSHGHLRK--ILNGRRKLCRSCGNVVHATGLRVPRHSVPALEEQLPRLA 1743
            :::           :.|..|.:|:  |...||:              |..|.|....:|..|.  
Mouse   380 SEE-----------AAVDPGSVREEDIPTKRRR--------------RTKRSSSAENQEGDPG-- 417

  Fly  1744 QLMDMLPRKSVPPPFVYFCDRACFARFKWNGKDGQAEAASLLLQPAG-----GSAVKSSNGDSPG 1803
                  ..||.||                  |..:||....:..|||     |||.:|..|||..
Mouse   418 ------TDKSTPP------------------KMAEAEPKRGVGSPAGRKKSTGSAPRSRKGDSAA 458

  Fly  1804 SFCASSTAPAEMVVKQEPEDEDEKTPSVPGNPTNI------------------------------ 1838
            .|........:.||.:.|:...|:...:.|:..::                              
Mouse   459 QFLVFCQKHRDEVVAEHPDASGEEIEELLGSQWSMLNEKQKARYNTKFSLMISAQSEEDSGNGNG 523

  Fly  1839 -----------PA----------------------QRKCI------------------------V 1846
                       ||                      ||:.|                        .
Mouse   524 KKRSHTKRADDPAEDVDVEDAPRKRLRADKHSLRKQRETITDKTARTSSYKAIEAASSLKSQAAT 588

  Fly  1847 KCFSADC------------------FTTDSAPSGL----ELDGTAGAGTGAGPVNNTVWETETS- 1888
            |..|..|                  |...|:||..    |:..:.|......|..:.. ||:|. 
Mouse   589 KNLSDACKPLKKRNRASATASSALGFNKSSSPSASLTEHEVSDSPGDEPSESPYESAD-ETQTEA 652

  Fly  1889 ---------GLQLEDTRQCVFCNQRG-----DGQADGPSRLLNFDVDKWVHLNCALWSNGVYETV 1939
                     |:..:....|..|.:.|     :|...|.           .||.|.    |:....
Mouse   653 SVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGA-----------FHLACL----GLSRRP 702

  Fly  1940 SGALMNFQTALQAGLSQACSACHQPGATIK-CFKSRCNSLYHLPCAIREECVFYKNKSVHCSVHG 2003
            .|.....:.|  :|: .:|..|.:....:| |..::|...||..|..:.....::::...|.:|.
Mouse   703 EGRFTCTECA--SGI-HSCFVCKESKMEVKRCVVNQCGKFYHEACVKKYPLTVFESRGFRCPLHS 764

  Fly  2004 ----HA-----------------------HAG-ITMGAGAGATTGAGL----------------- 2023
                ||                       |.| ..:.||........:                 
Mouse   765 CMSCHASNPSNPRPSKGKMMRCVRCPVAYHGGDACLAAGCSVIASNSIICTGHFTARKGKRHHTH 829

  Fly  2024 -----------GGSVADNELSSLVVHRRVF-VDRDENR------QVATVMHYSELSNLLRVGNMT 2070
                       |||:...|......|.... ::..:..      :....:|:.::. .:::||..
Mouse   830 VNVSWCFVCSKGGSLLCCEACPAAFHPDCLNIEMPDGSWFCNDCRAGKKLHFQDII-WVKLGNYR 893

  Fly  2071 FL------------NVGQLLPHQLEAFHTPHYIYPIGYKVSR-YYWC-------VRRPNRRCRYI 2115
            :.            |: |.:.|::..|       |:.:..|: |||.       ....:|..|| 
Mouse   894 WWPAEVCHPKNVPPNI-QKMKHEIGEF-------PVFFFGSKDYYWTHQARVFPYMEGDRGSRY- 949

  Fly  2116 CSIAEAGCKPEFRIQVQDA----GDKEPEREFRGSSPSAVWQQILQPITRLRKVHKWLQLFPQHI 2176
              ....|....|:..:|:|    .:.:.:||.|.:..|.            ||...:     :||
Mouse   950 --QGVRGIGRVFKNALQEAEARFNEVKLQREARETQESE------------RKPPPY-----KHI 995

  Fly  2177 SGEDLFGLTEPAIVRILESLPGIETLTDYRFKYGRNPLLEFPLAINPSGAARTEPKQRQLLVWRK 2241
            .....:|..:.....|.| :|          |....|..|     ||.| :.:|...|.|:....
Mouse   996 KVNKPYGKVQIYTADISE-IP----------KCNCKPTDE-----NPCG-SDSECLNRMLMFECH 1043

  Fly  2242 PHTQRTAGSCSTQRMANSAAIAGEVACPYSKQFVHSKSSQYKKMKQEWRNNVYLARSKIQGLGLY 2306
            |........|..|             |...:|:..:|                :.::..:|.||.
Mouse  1044 PQVCPAGEYCQNQ-------------CFTKRQYPETK----------------IIKTDGKGWGLV 1079

  Fly  2307 AARDIEKHTMIIEYIGEVIRTEVSEIREK-QYESKNRGIYMFRLDEDRVVDATLSGGLARYINHS 2370
            |.|||.|...:.||:||:|..|....|.| .:|:.....||..:|:||::||...|..:|::|||
Mouse  1080 AKRDIRKGEFVNEYVGELIDEEECMARIKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHS 1144

  Fly  2371 CNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNC 2426
            |.|||.|....|:.|.|:.:||...|..|.||:::|..|... :.|..|.|||.||
Mouse  1145 CQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLG-NEKTVCRCGASNC 1199
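
The match line between each Fly and Mouse row can be tallied directly. The sketch below does so for the final block (Fly 2371..2426, Mouse 1145..1199), copied from the alignment above. The function name `tally_match_line` is hypothetical. The reading of `|` as an identical residue pair is checked in the code rather than assumed. The page does not define `:` and `.`, so they are only counted, not interpreted.

```python
from collections import Counter

# Final alignment block, copied verbatim (residues only, no coordinates).
FLY = "CNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCACGAPNC"
MATCH = "|.|||.|....|:.|.|:.:||...|..|.||:::|..|... :.|..|.|||.||"
MOUSE = "CQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLG-NEKTVCRCGASNC"


def tally_match_line(top: str, match: str, bottom: str) -> Counter:
    """Count each symbol on the match line of one alignment block.

    Also verifies that '|' appears exactly where the two rows carry the
    same residue, so the identity count can be trusted.
    """
    if not (len(top) == len(match) == len(bottom)):
        raise ValueError("alignment rows must have equal length")
    for column, (a, m, b) in enumerate(zip(top, match, bottom), start=1):
        identical = a == b and a != "-"
        if (m == "|") != identical:
            raise ValueError(f"match symbol disagrees with residues at column {column}")
    return Counter(match)


counts = tally_match_line(FLY, MATCH, MOUSE)
print(dict(counts))
```

For this block the tally is 23 `|`, 7 `:`, 25 `.` and one blank column, the blank being the single gap in the Mouse row.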

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence        Domain                                                                   Region       External ID Identity
trr   NP_726773.2     PHA03255                                                                 184..>320    CDD:165513
                      ePHD2_KMT2C_like                                                         1898..2002   CDD:277136  21/109 (19%)
                      FYRN                                                                     2068..2118   CDD:283589  13/69 (19%)
                      FYRC                                                                     2126..2215   CDD:197781  17/92 (18%)
                      SET                                                                      <2266..2431  CDD:225491  55/162 (34%)
                      SET                                                                      2291..2413   CDD:214614  45/122 (37%)
                      PostSET                                                                  2415..2431   CDD:214703  7/12 (58%)
Nsd2  NP_001074571.2  Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                        149..169
                      MSH6_like                                                                218..337     CDD:99898   24/156 (15%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                        373..455                 25/132 (19%)
                      HMG-box                                                                  460..>505    CDD:238037  6/44 (14%)
                      TNG2                                                                     <546..711    CDD:227367  27/180 (15%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite                        595..659                 11/64 (17%)
                      PHD1_NSD1_2                                                              670..712     CDD:277118  10/56 (18%)
                      PHD2_NSD2                                                                717..763     CDD:277121  9/45 (20%)
                      PHD3_NSD2                                                                764..817     CDD:277124  6/52 (12%)
                      PHD_SF                                                                   834..874     CDD:389947  5/39 (13%)
                      WHSC1_related                                                            879..972     CDD:99899   19/104 (18%)
                      AWS                                                                      1013..1063   CDD:197795  15/79 (19%)
                      SET_NSD2                                                                 1063..1204   CDD:380988  53/154 (34%)
                      S-adenosyl-L-methionine binding. /evidence=ECO:0000250|UniProtKB:O96028  1116..1119               0/2 (0%)