DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Cpsf160 and Cpsf1

DIOPT Version :9

Sequence 1:NP_995833.1 Gene:Cpsf160 / 44250 FlyBaseID:FBgn0024698 Length:1455 Species:Drosophila melanogaster
Sequence 2:NP_001157645.1 Gene:Cpsf1 / 94230 MGIID:2679722 Length:1450 Species:Mus musculus


Alignment Length:1509 Identity:674/1509 - (44%)
Similarity:948/1509 - (62%) Gaps:141/1509 - (9%)


- Green bases have known domain annotations that are detailed below.


  Fly     1 MFSMCKQTHSATAVEFSIACRFFNNLDENLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPK 65
            |:::.||.|..|.:||::.|.||||.:.||||||.:.|.|||:  |.:|....|.:.|....|.:
Mouse     1 MYAVYKQAHPPTGLEFTMYCNFFNNSERNLVVAGTSQLYVYRL--NRDAEALTKNDGSTEGKAHR 63

  Fly    66 MRLECLATYTLYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDI 130
            .:||.:|:::.:|||||:..|.||||.|||||:|||||||||:::||.|..|||||||||||.::
Mouse    64 EKLELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPEL 128

  Fly   131 RGGWTGRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
            |.|:......|.||||||.|||.||:||.||||||||:::..:|.|         .......|:.
Mouse   129 RDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHE---------GLMGEGQRSS 184

  Fly   196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAISLNIQQ 260
            .:.||:|.:|.||||:.|::|:||||||||||||||:||.:|.|||:.||.|||.:|||||||.|
Mouse   185 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQ 249

  Fly   261 RVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTAFPLKP 325
            :|||:||::.||||||.|...:.|||||.::..||:::|||||||||||:|||....:|||||:.
Mouse   250 KVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRT 314

  Fly   326 QDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLTSCICVLHS 390
            |:||||:||||..|||..||:||||:.|::|||||..|.||:||.|||.||||||||:.:..:..
Mouse   315 QEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEP 379

  Fly   391 EYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLEEIFDVDQLEMAPTQA 455
            .|:||||||||||||.:||          ::::......|...|:::               ..:
Mouse   380 GYLFLGSRLGNSLLLKYTE----------KLQEPPASSVREAADKEE---------------PPS 419

  Fly   456 KSRRIE-------------DE--ELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERV 505
            |.:|:|             ||  |:|||||.|::.. ||..:.||||||::|:.|......||..
Mouse   420 KKKRVEPAVGWTGGKTVPQDEVDEIEVYGSEAQSGT-QLATYSFEVCDSMLNIGPCANAAVGEPA 483

  Fly   506 EFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVF--- 567
            ...|:       .::..:..:|:|..:|:.|||||||....|.||::|:|||.||.|:|||.   
Mouse   484 FLSEE-------FQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 541

  Fly   568 ---DDATKKSSRNDQ-------------HDFMLLSQRNSTLVLQTGQEINEIENTGFTVNQPTIF 616
               ::.|.|:...:|             |.|::||:.:||::|||||||.|::.:||....||:|
Mouse   542 RKEEEETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTSGFATQGPTVF 606

  Fly   617 VGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPIDVGSPVVQVSIADPYVCLRVLNGQVITLALR-E 680
            .||:|..|:||||:...:|||:|...:..:|:|:|:|:||.::|||||.:....|.|....|: :
Mouse   607 AGNIGDNRYIVQVSPLGIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMSAEGHVTMFLLKSD 671

  Fly   681 TRG--TPRLAINKHTISSSPAVVAISAYKDLSGLFTVK----GDDINLTGSSNS-AFGHSFGGYM 738
            :.|  ..|||::|..:.....|:|:..|:|:||:||.:    |....|.|.|.| |.|      :
Mouse   672 SYGGRHHRLALHKPPLHHQSKVIALCLYRDVSGMFTTESRLGGARDELGGRSGSEAEG------L 730

  Fly   739 KAEPNMKVEDEEDLLYGDAGSAFKMNSMADLAKQSKQKNSDWWRR--LLVQAKPSYWLVVARQSG 801
            .:|.:..|:|||::||||:.:.|..:.  :.|::|.|..:|   |  ...:|.|::|.::.|::|
Mouse   731 GSETSPTVDDEEEMLYGDSSALFSPSK--EEARRSSQPPAD---RDPAPFKADPTHWCLLVRENG 790

  Fly   802 TLEIYSMPDMKLVYLVNDVGNGSMVLTDAMEFVPISLTTQ------ENSKAGIVQACMPQHANSP 860
            |:|||.:||.:||:||.:...|..||.|:....|   |||      |.::.|          ..|
Mouse   791 TMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQP---TTQGEVRKEEATRQG----------ELP 842

  Fly   861 LPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYP----KGHLKIRFRKM-DQLNLLDQQPTHI 920
            |..|:.::.||....||.|||....|||||:.|.:.    :|:||:||:|: ..:|..:::|   
Mouse   843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREKKP--- 904

  Fly   921 DLDENDEQEEIESYQMQP-----KYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLL 980
                ...:::.|....:.     ..|.:.|.|.::.|.|||.:||.:|.::.:|.||.||:|.:.
Mouse   905 ----KPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMG 965

  Fly   981 GNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENR 1045
            .:|.:.|||.|:|||.|.|||||:...||:|||||:|||||:.|||||:|||||...:.||.|::
Mouse   966 IDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESK 1030

  Fly  1046 VYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGSQFEMVLISPETWEIVPDASITFEPW 1110
            ||.:.|.|..|.|:..|..||:||.....|.:|:|:|....|.:.||||.:||.:|:|.|..|.|
Mouse  1031 VYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNARIELEEW 1095

  Fly  1111 EHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKIKE 1175
            ||||..|.|.|..|.|.||||.|:..||.....|::|.||.|.|.|:||||||||:|:||.|.|.
Mouse  1096 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKV 1160

  Fly  1176 IFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADV 1240
            :::|||||||:|:....|.||:.:||||::|.||..:|.|:|||||.:|:||:|:||:.|..|||
Mouse  1161 LYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADV 1225

  Fly  1241 YKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLG 1305
            .|||||||:|||.:||||.|||..|||||.::|||||:.|||||:|.:||::||||.|||:||.|
Mouse  1226 MKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFG 1290

  Fly  1306 GQKLLRKADYHLGQVVNTMFRVQCHQKGLHQ---RQPFLYENKHFVVYGTLDGALGYCLPLPEKV 1367
            |.:|||:||:|:|..|||.:|..|  :|..:   ::..::||||...:.||||.:|..||:.||.
Mouse  1291 GMRLLRRADFHVGAHVNTFWRTPC--RGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1353

  Fly  1368 YRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAK 1432
            |||.|||||.|.:...|..||||:.:|.|...::...|..|.::||:|:..|..::..||:|:||
Mouse  1354 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAK 1418

  Fly  1433 KIGTRTEEILGDLL 1446
            |||| |.:|.|..|
Mouse  1419 KIGT-TPDITGSRL 1431

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Cpsf160NP_995833.1 SFT1 1..1453 CDD:227490 674/1509 (45%)
MMS1_N 94..674 CDD:287414 286/613 (47%)
CPSF_A 1086..1419 CDD:281209 176/335 (53%)
Cpsf1NP_001157645.1 SFT1 1..1421 CDD:227490 667/1496 (45%)
MMS1_N 92..672 CDD:287414 288/621 (46%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 404..435 5/45 (11%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 545..569 3/23 (13%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 713..775 22/72 (31%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 899..921 2/28 (7%)
CPSF_A 1071..1406 CDD:281209 176/336 (52%)


Information from Original Tools:


Tool Simple Score Weighted Score Original Tool Information
BLAST Result Score Score Type Cluster ID
Compara 1 0.930 - - C167850531
Domainoid 1 1.000 546 1.000 Domainoid score I259
eggNOG 1 0.900 - - E1_COG5161
Hieranoid 1 1.000 - -
Homologene 1 1.000 - - H40865
Inparanoid 1 1.050 1226 1.000 Inparanoid score I187
Isobase 1 0.950 - 0 Normalized mean entropy S5756
OMA 1 1.010 - - QHG55418
OrthoDB 00.000 Not matched by this tool.
OrthoFinder 1 1.000 - - FOG0004621
OrthoInspector 1 1.000 - - oto93811
orthoMCL 1 0.900 - - OOG6_103351
Panther 1 1.100 - - LDO PTHR10644
Phylome 1 0.910 - -
RoundUp 1 1.030 - avgDist Average_Evolutionary_Distance R190
SonicParanoid 1 1.000 - - X3233
SwiftOrtho 00.000 Not matched by this tool.
TreeFam 1 0.960 - -
1615.740

Return to query results.
Submit another query.