DRSC/TRiP Functional Genomics Resources



Protein Alignment Cpsf160 and CPSF1

DIOPT Version: 10

Sequence 1: NP_995833.1  Gene: Cpsf160 / 44250  FlyBaseID: FBgn0024698  Length: 1455  Species: Drosophila melanogaster
Sequence 2: NP_037423.2  Gene: CPSF1 / 29894  HGNCID: 2324  Length: 1443  Species: Homo sapiens


Alignment Length: 1502
Identity: 685/1502 (45%)
Similarity: 952/1502 (63%)
Gaps: 106/1502 (7%)
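
The summary figures above are ratios of column counts to the alignment length. As a minimal sketch of how such counts could be tallied from one block of the alignment, the Python below assumes that `|` in the markup row marks an identical column, that `:` and `.` mark similar columns, and that `-` in either sequence row marks a gap. Which symbols DIOPT counts toward Similarity is an assumption here, and the function names `percent` and `block_stats` are hypothetical, not part of any DIOPT interface.

```python
def percent(count: int, length: int) -> float:
    """Return count as a percentage of the alignment length."""
    return 100.0 * count / length


def block_stats(seq1: str, match: str, seq2: str) -> dict:
    """Tally identity, similarity and gap columns for one aligned block.

    seq1 and seq2 are the two sequence rows; match is the markup row
    printed between them. All three must have the same length.
    """
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("aligned rows must have equal length")
    identity = match.count("|")
    # Assumption: similar columns are identical ones plus ':' and '.'.
    similarity = identity + match.count(":") + match.count(".")
    # A gap column has '-' in either sequence row.
    gaps = sum(1 for a, b in zip(seq1, seq2) if a == "-" or b == "-")
    return {
        "length": len(seq1),
        "identity": identity,
        "similarity": similarity,
        "gaps": gaps,
    }


# Final block of this alignment (Fly 1449-1455, Human 1437-1443).
stats = block_stats("ERLASVF", ":|:.:.|", "DRVTAHF")
print(stats)

# Ratios from the summary, reported to one decimal place.
print(round(percent(685, 1502), 1))  # identity
print(round(percent(952, 1502), 1))  # similarity
print(round(percent(106, 1502), 1))  # gaps
```

Summing `block_stats` results over every block would give whole-alignment counts under the same assumptions. The page itself reports whole-number percentages.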


- Green bases have known domain annotations that are detailed below.


  Fly     1 MFSMCKQTHSATAVEFSIACRFFNNLDENLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPK 65
            |:::.||.|..|.:|||:.|.||||.:.||||||.:.|.|||:  |.:|....|.:.|....|.:
Human     1 MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRL--NRDAEALTKNDRSTEGKAHR 63

  Fly    66 MRLECLATYTLYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDI 130
            .:||..|:::.:|||||:..|.||||.|||||:|||||||||:::||.|..|||||||||||.::
Human    64 EKLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPEL 128

  Fly   131 RGGWTGRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMV---S 192
            |.|:......|.||||||.|||.|||||.||||||||:::..:|.|            .:|   .
Human   129 RDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHE------------GLVGEGQ 181

  Fly   193 RTPIMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAISLN 257
            |:..:.||:|.:|.||||:.|::|:||||||||||||||:||.:|.|||:.||.|||.:||||||
Human   182 RSSFLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

  Fly   258 IQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTAFP 322
            |.|:|||:||::.||||||.|...:.|||||.:|..||:::|||||||||||:|||....:||||
Human   247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

  Fly   323 LKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLTSCICV 387
            |:.|:||||:||||...||..||:||||:.|::|||||..|.||:||.|||.||||||||:.:..
Human   312 LRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVT 376

  Fly   388 LHSEYIFLGSRLGNSLLLHFTEEDQ----STVITLDEVEQQSEQQQRNLQDEDQNLEEIFDVDQL 448
            :...|:||||||||||||.:||:.|    |.|....:.|:...:::|              ||..
Human   377 MEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKR--------------VDAT 427

  Fly   449 EMAPTQAKS-RRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERVEFEEDGV 512
            .......|| .:.|.:|:|||||.|::.. ||..:.||||||::|:.|......||.....|:  
Human   428 AGWSAAGKSVPQDEVDEIEVYGSEAQSGT-QLATYSFEVCDSILNIGPCANAAVGEPAFLSEE-- 489

  Fly   513 TLRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFDDATKKSSRN 577
                 .::..:..:|:|..:||.|||||||....|.||::|:|||.||.|:|||.....|:...|
Human   490 -----FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549

  Fly   578 D--------------------QHDFMLLSQRNSTLVLQTGQEINEIENTGFTVNQPTIFVGNLGQ 622
            .                    :|.|::||:.:||::|||||||.|::.:||....||:|.||:|.
Human   550 PKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGD 614

  Fly   623 QRFIVQVTTRHVRLLQGTRLIQNVPIDVGSPVVQVSIADPYVCLRVLNGQVITLALR-ETRG--T 684
            .|:||||:...:|||:|...:..:|:|:|:|:||.::|||||.:....|.|....|: ::.|  .
Human   615 NRYIVQVSPLGIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRH 679

  Fly   685 PRLAINKHTISSSPAVVAISAYKDLSGLFTVKGDDINLTGSSNSAFGHS------FGGYMKAEPN 743
            .|||::|..:.....|:.:..|:||||:||.:.   .|.|:.:...|.|      .|    :|.:
Human   680 HRLALHKPPLHHQSKVITLCLYRDLSGMFTTES---RLGGARDELGGRSGPEAEGLG----SETS 737

  Fly   744 MKVEDEEDLLYGDAGSAFKMNSMADLAKQSKQKNSDWWRR--LLVQAKPSYWLVVARQSGTLEIY 806
            ..|:|||::||||:||.|..:.  :.|::|.|..:|   |  ...:|:|::|.::.|::||:|||
Human   738 PTVDDEEEMLYGDSGSLFSPSK--EEARRSSQPPAD---RDPAPFRAEPTHWCLLVRENGTMEIY 797

  Fly   807 SMPDMKLVYLVNDVGNGSMVLTDAMEFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLG 871
            .:||.:||:||.:...|..||.|:....|   |||..::    :....:....||..|:.::.||
Human   798 QLPDWRLVFLVKNFPVGQRVLVDSSFGQP---TTQGEAR----REEATRQGELPLVKEVLLVALG 855

  Fly   872 LNGERPLLLVRTRVELLIYQVFRYP----KGHLKIRFRKM-DQLNLLDQQPTHIDLD-ENDEQEE 930
            ....||.|||....|||||:.|.:.    :|:||:||:|: ..:|..:::|...... |....||
Human   856 SRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREKKPKPSKKKAEGGGAEE 920

  Fly   931 IESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVN 995
            ....:.:   |.:.|.|.::.|.|||.:||.:|.::.:|.||.||:|.:..:|.|.|||.|:|||
Human   921 GAGARGR---VARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVN 982

  Fly   996 IPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKY 1060
            .|.|||||:...||:|||||:|||||:.|||||:|||||...:.||.|::||.:.|.|..|..:.
Human   983 CPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCARI 1047

  Fly  1061 YRFNGEDKELSEESRGERFIYPIGSQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEG 1125
            .|..||:||.....|.||:|:|....|.:.||||.:||.:|:|.|..:.|||||..|.|.|..|.
Human  1048 PRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEE 1112

  Fly  1126 TRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISD 1190
            |.||||.|:..||.....|::|.||.|.|.|:||||||||:|:||.|.|.:::|||||||:|:..
Human  1113 TVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1177

  Fly  1191 VLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRT 1255
            ..|.||:.:||||::|.||..:|.|:|||||.:|:||:|:||:.|..|||.|||||||:|||.:|
Human  1178 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKT 1242

  Fly  1256 LSLASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQV 1320
            |||.|||..|||||.::|||||:.|||||:|.:||::||||.|||:||.||.:|||:||:|:|..
Human  1243 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAH 1307

  Fly  1321 VNTMFRVQCH--QKGLHQRQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQE 1383
            |||.:|..|.  .:|| .::..::||||...:.||||.:|..||:.||.|||.|||||.|.:...
Human  1308 VNTFWRTPCRGATEGL-SKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1371

  Fly  1384 HLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEI 1448
            |..||||:.:|.|...::...|..|.::||:|:..|..::..||:|:||||||..:.||.||||.
Human  1372 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLET 1436

  Fly  1449 ERLASVF 1455
            :|:.:.|
Human  1437 DRVTAHF 1443

Known Domains:


Indicated by green bases in alignment.

Gene     Sequence     Domain                                                    Region      External ID  Identity
Cpsf160  NP_995833.1  SFT1                                                      1..1453     CDD:227490   684/1498 (46%)
CPSF1    NP_037423.2  SFT1                                                      1..1437     CDD:227490   683/1494 (46%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite         404..435                 6/44 (14%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite         546..570                 1/23 (4%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite         715..777                 21/70 (30%)
                      Nuclear localization signal. /evidence=ECO:0000255        893..908                 3/14 (21%)
                      Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite         901..921                 3/19 (16%)
