DRSC/TRiP Functional Genomics Resources


Protein Alignment Cpsf100 and dclre1a

DIOPT Version: 10

Sequence 1: NP_651658.1  Gene: Cpsf100 / 43426  FlyBaseID: FBgn0027873  Length: 756  Species: Drosophila melanogaster
Sequence 2: NP_001018385.5  Gene: dclre1a / 324964  ZFINID: ZDB-GENE-050522-124  Length: 926  Species: Danio rerio


Alignment Length: 828
Identity: 151/828 (18%)
Similarity: 254/828 (30%)
Gaps: 281/828 (33%)
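
The percentages in the summary can be reproduced from the raw counts. The sketch below is only an illustration: it assumes the page truncates to a whole percent (254/828 is about 30.7% but is reported as 30%), which is an inference from the numbers and not something the page states. The function name `percent_floor` is hypothetical.

```python
def percent_floor(count: int, total: int) -> int:
    """Whole-number percentage, truncated (assumed rounding rule)."""
    return (100 * count) // total


# Counts copied from the alignment summary above.
alignment_length = 828
counts = {"Identity": 151, "Similarity": 254, "Gaps": 281}

summary = {name: percent_floor(n, alignment_length) for name, n in counts.items()}

for name, pct in summary.items():
    print(f"{name}: {counts[name]}/{alignment_length} ({pct}%)")
```

All three statistics share the same denominator, the alignment length of 828 columns, and not the length of either input sequence (756 or 926).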


- Green residues have known domain annotations that are detailed below.


  Fly   129 LKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKERHL------SGC 187
            ||.|:..|.|.:|....:|....    ..|:.|.||..|:.    ||.:|:...:.      ||.
Zfish    16 LKKNKKRSCKSQGSREHVTKRKK----ESTVQKTVKSAEKH----TDLDHQIGSNAPNNTVNSGS 72

  Fly   188 E------LDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAGRVLEL 246
            |      :.|         |..||..|..|......::..:||.......:|...:|||.     
Zfish    73 EDVNHGTVSR---------DVPNADSQSGRGFCPVCQMPFSILVVQSQQWHVAECLDTAA----- 123

  Fly   247 AHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQ-----IEWMSDKLTKAFEGARNNPFQFK 306
                |.. |....||...|....:...||....|:|:     ::.:||....:.....:...:..
Zfish   124 ----DNC-KECPDGLQCMSTIPNHYKRYNHSLLAQSRAQNDSVQAISDFSLNSVNNGASTSVKES 183

  Fly   307 HIQLCHSLADVYKLPAGPKVVLASTPD-------LESGFTRDLFVQ--WASNANNS--------- 353
            .:....:::......:.|...|..||.       |.|..|.|:..:  |:.:...|         
Zfish   184 SVDSAVNVSASSSQSSSPGDRLKGTPTKSNALLLLRSPNTEDIKKKKGWSPSVKRSQSQISSQEA 248

  Fly   354 ------------IILTTRTSPGTLAMELVE--------NCAPGKQI-ELDVRRRVDLEGAELEEY 397
                        :....:|.|..:..||.|        :|:|..:: |:|...||:....:...|
Zfish   249 KAKISAPGEVRCVDSAVQTQPNEVKKELFECKDDDDYISCSPLSELPEVDEEHRVEKSNNDRHFY 313

  Fly   398 ---LRTQGEKLNPLIVKPDVEEESSSESEDDIEMSVI---------------TGKHDIVVRPEGR 444
               |:...:.||..           |:.:||:...||               ..|....:.||..
Zfish   314 STALQEDDDSLNLF-----------SDEDDDLFFDVIDQYEEHGPDKSSLDTCEKQQTSLAPESH 367

  Fly   445 -----------HHSGFFKSNKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADATGYEFVPMEEQN 498
                       ||.....::....:..|   :.:..:...:.|:...:..:..:.:|.|...:.|
Zfish   368 LTASSSTGVSFHHCSTNNNSSNSQLQSP---QSLVLEHLRDRISTPTHVKSLTSSFEEVNFTQTN 429

  Fly   499 KENVKKE----------------EPGIGAEQQANGGI-----------VDNDVQLLEKPTKLISQ 536
            :|:...:                :.|....:|.:.|:           ...:|:.|.:...|...
Zfish   430 QESFSNQADLSATQSMAPRRTQIKAGPSGLKQTDIGVFFGLKPLHEKKAGKEVKPLVREADLQKS 494

  Fly   537 RKTIEVNAQVQRIDFEGR--------SDG-------ESMLKILSQLRPRRVIVIHGTAEGTQ--- 583
            |:.....||.:....:||        |:|       |..:.:.:|...:|    .|.|||.:   
Zfish   495 RRVRVSEAQGEDGKRQGRWRGKSKAPSEGSGVSPAVEDSVALPTQTEGKR----GGRAEGRKRWN 555

  Fly   584 ---------VVARHCE-----QNVGARVFTPQKGEIIDVT------------------SEIHIYQ 616
                     ...:.|.     ...|..|...|.|.:..||                  |.:.||.
Zfish   556 RGKATDGDPKEPKRCPFYKKIPGTGFAVDAFQYGVVEGVTAYFLTHFHSDHYGGLKKDSAVPIYC 620

  Fly   617 VRLTEGLV-SQLQFQKGKDAEVAWVDGRLGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDE 680
            .::|..|| |:|:.    |.:...|   |.|..:.|...:.||: .||:...|..:.|..|.|.:
Zfish   621 NKVTSNLVKSKLKV----DEQYIHV---LPMNTECIVQGVKVTL-LDANHCPGAAMLLFVLPDGQ 677

  Fly   681 IPIHNSVLINELKLSDFK----------------QTLMRNN-------------------INSEF 710
            ..:|.         .||:                |||..:.                   :|:.|
Zfish   678 TVLHT---------GDFRADPSMERYPELQGLRIQTLYLDTTYCSPEYTFPTQQEVVTFAVNTAF 733

  Fly   711 SGGVL------WCSNGT---------LALRRVDAGKVAMEGCLSEEYY 743
            ....|      .|  ||         ||:..|.:.||    |||::.|
Zfish   734 ERVTLNPRTLVVC--GTYSVGKEKVFLAVSEVLSSKV----CLSKDKY 775
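
Each alignment block above consists of three rows: the Fly sequence, a match line, and the Zfish sequence. The following minimal sketch tallies the column types in one block. It assumes the common display convention that `|` marks identical residues, `:` and `.` mark similar residues, and a blank marks a mismatch or a gap; the page itself does not define these symbols, and `classify_columns` is a hypothetical helper, not part of DIOPT.

```python
from collections import Counter


def classify_columns(seq1: str, match: str, seq2: str) -> Counter:
    """Count identical, similar, gap, and mismatch columns in one block."""
    if not (len(seq1) == len(match) == len(seq2)):
        raise ValueError("all three rows must have the same length")
    counts = Counter()
    for a, m, b in zip(seq1, match, seq2):
        if a == "-" or b == "-":
            counts["gap"] += 1          # a gap character in either sequence
        elif m == "|":
            counts["identical"] += 1    # assumed meaning of '|'
        elif m in ":.":
            counts["similar"] += 1      # assumed meaning of ':' and '.'
        else:
            counts["mismatch"] += 1
    return counts


# First eight columns of the first block (Fly 129.., Zfish 16..).
block = classify_columns("LKYNQTVS", "||.|:..|", "LKKNKKRS")
print(dict(block))
```

The coordinate labels and the leading whitespace must be stripped from each row before calling the function, so that the three strings line up column by column.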

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence        Domain               Region    External ID  Identity
Cpsf100  NP_651658.1     CPSF2-like_MBL-fold  7..204    CDD:293851   20/86 (23%)
                         Beta-Casp            243..369  CDD:214983   23/160 (14%)
                         RMMBL                539..600  CDD:462191   16/92 (17%)
                         CPSF100_C            617..753  CDD:463836   39/178 (22%)
dclre1a  NP_001018385.5  None
