DRSC/TRiP Functional Genomics Resources


Protein Alignment Set1 and trr

DIOPT Version: 10

Sequence 1:NP_001015221.1 Gene:Set1 / 3354971 FlyBaseID:FBgn0040022 Length:1641 Species:Drosophila melanogaster
Sequence 2:NP_726773.2 Gene:trr / 31149 FlyBaseID:FBgn0023518 Length:2431 Species:Drosophila melanogaster


Alignment Length:1707 Identity:310/1707 - (18%)
Similarity:522/1707 - (30%) Gaps:659/1707 - (38%)
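
The percentages in the summary above can be checked from the raw counts. The sketch below is only an illustration: the helper name percent_truncated is hypothetical, and the use of integer truncation is an assumption inferred from the reported values (for example, 522/1707 is about 30.6% but is shown as 30%), not documented DIOPT behavior.

```python
def percent_truncated(count, total):
    """Integer percentage with the fractional part discarded (assumed convention)."""
    return count * 100 // total


# Counts copied from the alignment summary.
alignment_length = 1707
counts = {"Identity": 310, "Similarity": 522, "Gaps": 659}

for label, count in counts.items():
    pct = percent_truncated(count, alignment_length)
    print(f"{label}: {count}/{alignment_length} ({pct}%)")
```

Under that assumption the three values come out as 18%, 30% and 38%, matching the summary. The per-domain percentages in the Known Domains table look rounded to the nearest integer instead (35/238 is listed as 15%), so the same helper should not be expected to reproduce them.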


- Green bases have known domain annotations that are detailed below.


  Fly   417 SASSLPIASHGFNSCSFPSIENIKTWSDRRAWTAFQPDFHPVQPPPPPPEEIDNWDEEEHDKNSI 481
            ||::||:::....:........::..||:.|........:...||||||::  ...::.|....:
  Fly   902 SATALPVSAVAITTPGVGGEAKLEQKSDQPAAIMQNQSQNQAPPPPPPPQQ--QQQQQLHQPQQL 964

  Fly   482 VPTHYGCMAKLQPPVPSNVNFATKLQS-----------VTQPNSDPG------------------ 517
            .|:.:        .|...|...:|..|           ||:..|.|.                  
  Fly   965 QPSPH--------QVKQTVQIVSKETSFISGPVAAKTLVTEATSKPAELLPPPPYEMATAPISNV 1021

  Fly   518 TVDLDTRIA----LIFKGKTFGNAPPFLQMDSSDSETDQGKPEVFSDVNSDSNN----------- 567
            |:.:.|:.|    |..|.|....:.|..|.|  :|..:|.:|.:.|:..:.:..           
  Fly  1022 TISISTKQAAPKELQMKPKAVAMSLPMEQGD--ESLPEQAEPPLHSEQGATAAGVAPHSGGPLVS 1084

  Fly   568 ---SENKKRSCEKNNKVLHQPNEASDISSDEELIGKKDKSKLSL---ICEKEVNDDNMSLSSLSS 626
               :.|.........|:..:|             |:..|.||.:   :.||::........|.|.
  Fly  1085 AQWTNNHLEGGVATTKIPFKP-------------GEPQKRKLPMHPQLDEKQIQQQAEIPISTSL 1136

  Fly   627 QEDPIQTKEGAEYK----SIMSSYMYSHSNQNPFYYHASGYGHYLSGIPSESAS-RLFSNGAYVH 686
            ...|  |.:|...|    |.:::|:..                  ||:|:|:.. :..|.|....
  Fly  1137 PTTP--TGQGTPDKVQLISAIATYVKK------------------SGVPNEAQPIQNQSQGQVQM 1181

  Fly   687 SEYLKAVASFNFDSFSKPYDYNKGALSDQNDGIRQKVKQVIGYIVEELKQILKRDVNKRMIEITA 751
            ...::|..              :|.||.|..|      |:.|:...               :|.|
  Fly  1182 QAQMQATM--------------QGHLSGQMSG------QISGHAAG---------------QIPA 1211

  Fly   752 FKHFETWWDEHTSKARSKPLFEKADSTVNTPLNCIKDTSYNEKNPDINLLINAHREVADFQSYSS 816
            ..|.:.   :|.......|..::.....|.|           :|..|.|.:.....|        
  Fly  1212 QMHLQV---QHQLHMAVHPQQQQQQLHQNQP-----------QNATIPLPVTGQGAV-------- 1254

  Fly   817 IGLRAAMPKLPSFRRIRKHPSPIPTKRNFLERDLSDQEEMVQRSDSDKEDSNVEISDT--ARSKI 879
                               |.|:||    :|....||.:..:|.......:|:.....  |...:
  Fly  1255 -------------------PIPVPT----MESKAGDQRKRRKREVQKPRRTNLNAGQAGGALKDL 1296

  Fly   880 KGPVP----IQESDSKSHTSGLNSKRKGSASSFFSSS--------SSSTSSEAEYEAIDCVEKAR 932
            .||:|    :|.:.....|..:.....|:.....|:.        .:||.:.:.......|.|..
  Fly  1297 TGPLPAGAMVQLAGMPPGTQYIQGAASGTGHVITSTGQGVTLGGVGASTGASSSPMLKKRVRKFS 1361

  Fly   933 TSEEDSPRGYGQRNLNQRTTTIRNRNLVGTMDVINVRN--LCSGSNEFKKENVTKRTKKNIYSDT 995
            ..|||.. .:.::.|    |.||....:..::....||  ...||||...............:.:
  Fly  1362 KVEEDHD-AFTEKLL----THIRQMQPLQVLEPHLNRNFHFLIGSNETSGGGSPASMSSAASAGS 1421

  Fly   996 DEDNDRTLFPALKEKNISTILSDLEEISKDSCIGLDENGIEPTILRK-----IPNTPKLNEECR- 1054
            .......|....:...:|..|..||:     |.|        |:|.:     :|..|.|.:..| 
  Fly  1422 SSAGGGKLKGGSRGWPLSRHLEGLED-----CDG--------TVLGRYGRVNLPGIPSLYDSERF 1473

  Fly  1055 -------------RSLTPVPPPGYNEEEIKKKVDCKQKPSFEYDRIYSDSEEEKEYQERRKRNTE 1106
                         ||.:|...||     .:|.:......:..||:.:| :..|:..:||..|:..
  Fly  1474 GGSRGLVGGSARTRSPSPAESPG-----AEKMLPMSSIQNDFYDQEFS-THMERNPRERLVRHIG 1532

  Fly  1107 YMAQMERE---FLEEQEKRIEKSLDKNLQSPNNIVKNNNSP------------------------ 1144
            .:.....|   .:|.:......:|.:..:.|..|:.|.||.                        
  Fly  1533 AVKDCNLETVDLVESEGVAAWATLPRLTRYPGLILLNGNSRCHGRMSPVALPEDPLTMRFPVSPL 1597

  Fly  1145 -RNKNDETRKT-------------------------AISQTRSCFESASKVDTTLVNIIS----- 1178
             |:..:|.|||                         .::...|..|:.:.|...|.|::.     
  Fly  1598 LRSCGEELRKTQQMELGMGPLGNNNNNNYQQKNQNVILALPASASENIAGVLRDLANLLHLAPAL 1662

  Fly  1179 ----VENDINE------FGPHEEGDV--------LTNGCNKMYTNSKGKTKRTQSPVYSEGGSSQ 1225
                :|:.|..      ....:|..|        :::|..:...|.:.|..|:...|....| .:
  Fly  1663 TCKIIEDKIGNKLEDQFMNQDDEKHVDFKRPLSQVSHGHLRKILNGRRKLCRSCGNVVHATG-LR 1726

  Fly  1226 ASQASQVALEHCY-------------SLPPHSV-------------------------------- 1245
            ..:.|..|||...             |:||..|                                
  Fly  1727 VPRHSVPALEEQLPRLAQLMDMLPRKSVPPPFVYFCDRACFARFKWNGKDGQAEAASLLLQPAGG 1791

  Fly  1246 -----SLGDYP----SGKVNETKNILKREAENIAIVSQMTRTGPGRPRKDPI---CIQK------ 1292
                 |.||.|    :......:.::|:|.|:   ..:.|.:.||.|...|.   ||.|      
  Fly  1792 SAVKSSNGDSPGSFCASSTAPAEMVVKQEPED---EDEKTPSVPGNPTNIPAQRKCIVKCFSADC 1853

  Fly  1293 KKRDLAPR----------------MSNVKSKMTPNG----------------------------- 1312
            ...|.||.                ::|...:...:|                             
  Fly  1854 FTTDSAPSGLELDGTAGAGTGAGPVNNTVWETETSGLQLEDTRQCVFCNQRGDGQADGPSRLLNF 1918

  Fly  1313 --DEWPDL----------------------------------AHK-------------NVHFVPC 1328
              |:|..|                                  .|:             :::.:||
  Fly  1919 DVDKWVHLNCALWSNGVYETVSGALMNFQTALQAGLSQACSACHQPGATIKCFKSRCNSLYHLPC 1983

  Fly  1329 DM------YKT--------------------------------------------------RDQN 1337
            .:      ||.                                                  ||:|
  Fly  1984 AIREECVFYKNKSVHCSVHGHAHAGITMGAGAGATTGAGLGGSVADNELSSLVVHRRVFVDRDEN 2048

  Fly  1338 EEM--VILYTFLTKGIDAEDINFIKMSYLDHLHKEPYAMFLNNTHWVDHC--TTDRAFWPPPSKK 1398
            .::  |:.|:.|:..:...::.|:.:..|.....|.:    :..|::...  ...|.:|......
  Fly  2049 RQVATVMHYSELSNLLRVGNMTFLNVGQLLPHQLEAF----HTPHYIYPIGYKVSRYYWCVRRPN 2109

  Fly  1399 RRKDDELIRH-----KTGCARTEGFYKLDVREKAKHKYHYAKANTEDSFNEDRSDEPTA------ 1452
            ||     .|:     :.||             |.:.:.....|..::...|.|...|:|      
  Fly  2110 RR-----CRYICSIAEAGC-------------KPEFRIQVQDAGDKEPEREFRGSSPSAVWQQIL 2156

  Fly  1453 ---------------------------LTNHHHNKLISKMQGI---------------------- 1468
                                       ||.....:::..:.||                      
  Fly  2157 QPITRLRKVHKWLQLFPQHISGEDLFGLTEPAIVRILESLPGIETLTDYRFKYGRNPLLEFPLAI 2221

  Fly  1469 --SREARS--NQRRLL---------TAFGS-----MGESELL---------------KFNQLKFR 1500
              |..||:  .||:||         || ||     |..|..:               |.:|.|..
  Fly  2222 NPSGAARTEPKQRQLLVWRKPHTQRTA-GSCSTQRMANSAAIAGEVACPYSKQFVHSKSSQYKKM 2285

  Fly  1501 KKQLK----FAKSAIHDWGLFAMEPIAADEMVIEYVGQMIRPVVADLRETKYEAIGIGSSYLFRI 1561
            |::.:    .|:|.|...||:|...|....|:|||:|::||..|:::||.:||:...| .|:||:
  Fly  2286 KQEWRNNVYLARSKIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRG-IYMFRL 2349

  Fly  1562 DMETIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEDE- 1625
            |.:.::|||..|.|||:|||||||||..:::.::.:.:|:|::|:.|...||::|||||.:||| 
  Fly  2350 DEDRVVDATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDES 2414

  Fly  1626 -KIPCLCGAQGCRGTLN 1641
             ||||.|||..||..:|
  Fly  2415 HKIPCACGAPNCRKWMN 2431

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence        Domain            Region       External ID  Identity
Set1  NP_001015221.1  RRM_Set1          99..191      CDD:409745
                      U2AF_lg           255..>386    CDD:273727
                      N-SET             1352..1496   CDD:463344   35/238 (15%)
                      SET_SETD1         1490..1637   CDD:380946   65/167 (39%)
trr   NP_726773.2     PHA03255          184..>320    CDD:165513
                      PRK13914          <623..>851   CDD:237555
                      ePHD2_KMT2C_like  1898..2002   CDD:277136   8/103 (8%)
                      FYRN              2067..2118   CDD:461787   9/59 (15%)
                      FYRC              2126..2215   CDD:197781   9/88 (10%)
                      SET_KMT2C_2D      2278..2430   CDD:380948   67/152 (44%)
Blue background indicates that the domain is not in the aligned region.
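
The Region values in the table above follow a start..end pattern, sometimes with a leading "<" or ">". A minimal sketch for reading them into numbers follows. The function name parse_region and the returned field names are hypothetical, and treating "<" and ">" as markers of a partial (truncated) domain match is an assumption about the notation, not something this page states.

```python
import re

# Optional "<" before the start, optional ">" before the end, e.g. "<623..>851".
REGION_PATTERN = re.compile(r"^(<?)(\d+)\.\.(>?)(\d+)$")


def parse_region(text):
    """Split a region string such as '255..>386' into coordinates and flags."""
    match = REGION_PATTERN.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized region: {text!r}")
    start_flag, start, end_flag, end = match.groups()
    return {
        "start": int(start),
        "end": int(end),
        # Assumed meaning: the domain hit is incomplete at that end.
        "start_partial": start_flag == "<",
        "end_partial": end_flag == ">",
    }


for region in ["1490..1637", "255..>386", "<623..>851"]:
    print(region, parse_region(region))
```

With coordinates parsed this way, a domain span can be compared against the residue numbers printed at the ends of each alignment row, for instance to check that SET_SETD1 (1490..1637) falls within the final aligned blocks of Set1.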
