DRSC/TRiP Functional Genomics Resources


Protein Alignment Cpsf160 and CPSF160

DIOPT Version: 9

Sequence 1:NP_995833.1 Gene:Cpsf160 / 44250 FlyBaseID:FBgn0024698 Length:1455 Species:Drosophila melanogaster
Sequence 2:NP_199979.2 Gene:CPSF160 / 835240 AraportID:AT5G51660 Length:1442 Species:Arabidopsis thaliana


Alignment Length:1509 Identity:454/1509 - (30%)
Similarity:709/1509 - (46%) Gaps:221/1509 - (14%)
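
The percentages in this summary can be reproduced from the raw counts. The following is a minimal Python sketch, assuming the displayed values are truncated to whole percents. That assumption is inferred from the numbers above, not documented on this page, and the helper name percent_of_alignment is hypothetical.

```python
def percent_of_alignment(count: int, alignment_length: int) -> int:
    """Return count as a whole-number percentage of the alignment length.

    Truncation (floor) is an assumption inferred from the reported values.
    """
    return count * 100 // alignment_length


ALIGNMENT_LENGTH = 1509  # total alignment columns, including gap columns

summary = {
    "Identity": percent_of_alignment(454, ALIGNMENT_LENGTH),
    "Similarity": percent_of_alignment(709, ALIGNMENT_LENGTH),
    "Gaps": percent_of_alignment(221, ALIGNMENT_LENGTH),
}
print(summary)
```

Rounding to the nearest percent instead would give 47% similarity and 15% gaps, which does not match the reported figures.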


- Green bases have known domain annotations that are detailed below.


  Fly    29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPK---------MRLECLATYTLYGNVMSLQ 84
            |:|:..||:|:||.:....|.:.::..||   :||.:         :.||.:..|.|:|||.|:.
plant    59 NVVITAANILEVYIVRAQEEGNTQELRNP---KLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

  Fly    85 CVSLAGAM----RDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGW----TGRYFV- 140
            .:.:.|..    ||:::::|:|||:|||:.|....:|:..|:|.||..|    |    .||... 
plant   121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPD----WLHLKRGRESFP 181

  Fly   141 --PTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTPIMASYLIA 203
              |.|:|||..||..:||||.::::|   |.:.:....:.|........|....   :.:||:|.
plant   182 RGPLVKVDPQGRCGGVLVYGLQMIIL---KTSQVGSGLVGDDDAFSSGGTVSAR---VESSYIIN 240

  Fly   204 LRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAISLNIQQRVHPIIWT 268
            ||||:.|  :|.|..|||||.||.::||.|...|..||:..:..||||.|:|:|...:.||:||:
plant   241 LRDLEMK--HVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPVIWS 303

  Fly   269 VNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLN---SSADNSTAFPLKPQDGVR 330
            ..:||.|..::..:..||||.||:..|.:.|.:||. ...::||   ||||:|...   |.....
plant   304 AINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSA-SCALALNNYASSADSSQEL---PASNFS 364

  Fly   331 ISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLTSCICVLHSEYIFL 395
            :.||.|:..:|..|..::|.::|:|.:|||..|. |.|:.....|:.||||.|.|..:.:...||
plant   365 VELDAAHGTWISNDVALLSTKSGELLLLTLIYDG-RAVQRLDLSKSKASVLASDITSVGNSLFFL 428

  Fly   396 GSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLE-EIFDVDQLEMAPTQAKSRR 459
            |||||:|||:.|:            ...........|:|||:::| |.....:|.|. :......
plant   429 GSRLGDSLLVQFS------------CRSGPAASLPGLRDEDEDIEGEGHQAKRLRMT-SDTFQDT 480

  Fly   460 IEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERVEFEED--GVTLRPHAESLQ 522
            |.:|||.::||....|....:.|.|.|.|||:||.|:.....|.|:..:.:  ||:        :
plant   481 IGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVS--------K 537

  Fly   523 DLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ 579
            ....|||..:||.|||||.|....|.|::||..||.||..:|||:.        |::|.::..|:
plant   538 QSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAADEDE 602

  Fly   580 -HDFMLLSQRNSTLVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRL 642
             |.::::|....|:||:|...:.|: |:..:.|...||..|||..:|.::||.....|:|.|:.:
plant   603 YHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGARILDGSFM 667

  Fly   643 IQNVPIDV----------GSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTIS-S 696
            .|.:....          .|.|..||||||||.||:.:..:..|.     |.|...    |:| |
plant   668 NQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLV-----GDPSTC----TVSIS 723

  Fly   697 SPAVVAISAYK-DLSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSA 760
            ||:|:..|..| ....|:..||                      .||.::....:..|....|.|
plant   724 SPSVLEGSKRKISACTLYHDKG----------------------PEPWLRKASTDAWLSSGVGEA 766

  Fly   761 FKMNSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSM 825
                  .|......|...|            .:.||..:||.|||:.:|....|:.|:...:|..
plant   767 ------VDSVDGGPQDQGD------------IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRR 813

  Fly   826 VLTDAMEFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVELLI 889
            .|:|    :||.....|.:|.........:..|:.: :||::.....:..||.|. |.....:|.
plant   814 HLSD----MPIHELEYELNKNSEDNTSSKEIKNTRV-VELAMQRWSGHHTRPFLFAVLADGTILC 873

  Fly   890 YQVFRY---------------------PKGHLKIRFRKMDQLNLLDQQPTHIDLDENDEQEEIES 933
            |..:.:                     ..|..|:|..|.          ..|.||.:..:...:.
plant   874 YHAYLFDGVDSTKAENSLSSENPAALNSSGSSKLRNLKF----------LRIPLDTSTREGTSDG 928

  Fly   934 YQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPN 998
            ...     |::..|.|:.|..|..:.|..|.:..| ||..||.|..|.:|.:.:|...:|||..:
plant   929 VAS-----QRITMFKNISGHQGFFLSGSRPGWCML-FRERLRFHSQLCDGSIAAFTVLHNVNCNH 987

  Fly   999 GFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRF 1063
            ||:|......|||..|||...||:.|||:|:||:.||.|:.|:.|..:|.||  ...|::|  ..
plant   988 GFIYVTAQGVLKICQLPSASIYDNYWPVQKIPLKATPHQVTYYAEKNLYPLI--VSYPVSK--PL 1048

  Fly  1064 NGEDKELSEESRGERF------------IYPIGSQFEMVLISPE----TWEIVPDASITFEPWEH 1112
            |.....|.::..|::.            .|.: .:||:.::.||    .||  ..|.|..:..||
plant  1049 NQVLSSLVDQEAGQQLDNHNMSSDDLQRTYTV-EEFEIQILEPERSGGPWE--TKAKIPMQTSEH 1110

  Fly  1113 VTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEPGK--PMTKFKIKE 1175
            ....::|.|....|... :..|.:||.:...||:.:||.:.::..       ||  ..::..:.|
plant  1111 ALTVRVVTLLNASTGEN-ETLLAVGTAYVQGEDVAARGRVLLFSF-------GKNGDNSQNVVTE 1167

  Fly  1176 IFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-NIYVHQIITVKSLIFIAD 1239
            ::.:|.||.:||::.:.|.|:...|.||.:.:....:|.||||.|. .:||..:..|||.|.:.|
plant  1168 VYSRELKGAISAVASIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGD 1232

  Fly  1240 VYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESL 1304
            |:|||..|.::|:...|||.::||..|:.:..||::|.|.|...|:|.::||.|:.|.|:..||.
plant  1233 VHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESW 1297

  Fly  1305 GGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYENKHFVVYGTLDGALGYCLPLPEKVYR 1369
            .|.|||.:|::|:|..|:...|:|....|..:      .|:..:::|||||:.|...||.|..:|
plant  1298 KGLKLLSRAEFHVGAHVSKFLRLQMVSSGADK------INRFALLFGTLDGSFGCIAPLDEVTFR 1356

  Fly  1370 RFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKI 1434
            |...||..|:....|:.||||..:|..:||.|...:....|:|.:|:..|.::...|:.|:|.:|
plant  1357 RLQSLQKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQI 1421

  Fly  1435 GTRTEEILGDLLEI 1448
            ||....||.||:::
plant  1422 GTTRYSILKDLVDL 1435

Known Domains:


Indicated by green bases in alignment.

Gene      Sequence      Domain    Region       External ID   Identity
Cpsf160   NP_995833.1   SFT1      1..1453      CDD:227490    454/1509 (30%)
                        MMS1_N    94..674      CDD:287414    206/612 (34%)
                        CPSF_A    1086..1419   CDD:281209    110/339 (32%)
CPSF160   NP_199979.2   SFT1      79..1442     CDD:227490    446/1489 (30%)
                        CPSF_A    1083..1408   CDD:397339    111/340 (33%)


Information from Original Tools:


                                               Original Tool Information
Tool            Simple Score  Weighted Score   BLAST Result  Score   Score Type         Cluster ID
Domainoid       1             1.000            263           1.000   Domainoid score    I468
eggNOG          1             0.900            -             -                          E1_COG5161
Hieranoid       1             1.000            -             -
Homologene      1             1.000            -             -                          H40865
Inparanoid      1             1.050            561           1.000   Inparanoid score   I248
OMA             1             1.010            -             -                          QHG55418
OrthoDB         1             1.010            -             -                          D360328at2759
OrthoFinder     1             1.000            -             -                          FOG0004621
OrthoInspector  1             1.000            -             -                          oto3653
orthoMCL        1             0.900            -             -                          OOG6_103351
Panther         1             1.100            -             -       LDO                PTHR10644
Phylome         1             0.910            -             -
SonicParanoid   1             1.000            -             -                          X3233
SwiftOrtho      1             1.000            -             -
TreeFam         1             0.960            -             -
Total           15            14.840
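
The totals at the end of the table (15 and 14.840) are the column sums of the simple and weighted scores. The following short Python sketch checks that arithmetic using the per-tool values listed above. The dictionary layout and variable names are illustrative assumptions, and Decimal is used only to avoid floating-point rounding noise.

```python
from decimal import Decimal

# Weighted score per tool, copied from the table above.
weighted_scores = {
    "Domainoid": "1.000",
    "eggNOG": "0.900",
    "Hieranoid": "1.000",
    "Homologene": "1.000",
    "Inparanoid": "1.050",
    "OMA": "1.010",
    "OrthoDB": "1.010",
    "OrthoFinder": "1.000",
    "OrthoInspector": "1.000",
    "orthoMCL": "0.900",
    "Panther": "1.100",
    "Phylome": "0.910",
    "SonicParanoid": "1.000",
    "SwiftOrtho": "1.000",
    "TreeFam": "0.960",
}

# Every listed tool has a simple score of 1, so the simple total is the tool count.
simple_total = len(weighted_scores)
weighted_total = sum(Decimal(score) for score in weighted_scores.values())

print(simple_total, weighted_total)
```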
