DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Sema5c and Sema4c

DIOPT Version :10

Sequence 1:NP_611293.3 Gene:Sema5c / 37066 FlyBaseID:FBgn0284221 Length:1093 Species:Drosophila melanogaster
Sequence 2:XP_006244820.1 Gene:Sema4c / 301346 RGDID:1562837 Length:856 Species:Rattus norvegicus


Alignment Length:782 Identity:209/782 - (26%)
Similarity:312/782 - (39%) Gaps:168/782 - (21%)


- Green bases have known domain annotations that are detailed below.


  Fly    53 ISYQDLMSTAKRFYDPETTWYSEMLFDVARNQVIVGARDTLYRMSFDLEPLER---ASWGATPSE 114
            :|..:|::..:||.......:..:........:.||||:.|:  :|.:|.||.   .||.|...:
  Rat    55 VSSGELVTVVRRFSQTGIQDFLTLTLTDRSGLLYVGAREALF--AFSVEALELQGVISWEAPAEK 117

  Fly   115 IAMCQAKGQSERW-CRNYVRVLHSYGENQLYACGTNAFQPSCSWRQMENLTV--TGVDSGVVKCP 176
            ...|..||:|.:. |.|::|.|..|..:.||.|||.||||.|::..|...|:  ...:.|..|||
  Rat   118 KIECTQKGKSNQTECFNFIRFLQPYNASHLYVCGTYAFQPKCTYINMLTFTLDRAEFEDGKGKCP 182

  Fly   177 FHPQANSTSLLQSNGQLFVGTATDFSGSDVAILRTGVESNKRFLRTKQYNNNWLSGAQFVGSF-- 239
            :.|....|.|| .:|:|:..|..:|.|::..|||.....:.  ::| :|...||:...||||.  
  Rat   183 YDPAKGHTGLL-VDGELYSATLNNFLGTEPVILRNMGPHHP--IKT-EYLAFWLNEPHFVGSAFV 243

  Fly   240 --EAGHF------VYFLLRESAAEHMSCGKVIYSRVARVCKNDVGGGGQLLRDNWTSFLKARLNC 296
              ..|.|      :||...|.|.|:....:.:.:|||||||.|: ||.:.|:..||:||||||.|
  Rat   244 PESVGSFTGDDDKIYFFFSERAVEYDCYSEQVVARVARVCKGDM-GGARTLQKKWTTFLKARLVC 307

  Fly   297 SLPGEYPYYFDEIQ------GMTYAESESILYATFRTSGSSIFGSAVCAYNLSSINAAFDGPFKQ 355
            |.| ::..||::::      |.::  ..:..:|.|:.....:..||||.|.|..|...|:|||| 
  Rat   308 SAP-DWKVYFNQLRAVHTLLGASW--HNTTFFAVFQARWGDMDLSAVCEYQLEHIQQVFEGPFK- 368

  Fly   356 QEHSDAAWKTVNTNQRSQFQCGTSSIGHWLESSRY-----------------QLMDEAVQPIGAE 403
             |:|:.|.|..............|.|.:|...:.|                 .||:|.|:|....
  Rat   369 -EYSEQAQKWARYTDPVPTPRPGSCINNWHRDNGYTSSLELPDNTLNFIKKHPLMEEQVKPRLGR 432

  Fly   404 PLYHSKLEQFGRLALD-IINTKTEQVHVLFVAS-SGNHIKKLSVKYDGDGVQTCLVELWQA-DDT 465
            ||...|...|..:..| |:........|||:.: .|..:|.:|:     |....:||..|. |..
  Rat   433 PLLVKKNTNFTHVVADRILGLDGATYTVLFIGTGDGWLLKAVSL-----GPWIHMVEELQVFDQE 492

  Fly   466 GTSSLLNMAYLKVSDSLYLGTDLALTRIPAQHCSRHVSQSSCLNSMDPYCGWNELVERCMPQPQD 530
            ...||:.....||   |:.|:...|.::....|:::.....|:.:.||||.||....||:.....
  Rat   493 PVESLVLSQSKKV---LFAGSRSQLVQLSLADCTKYRFCVDCVLARDPYCAWNVNTSRCVATGGH 554

  Fly   531 SSVLQHWHQA---PQITC---------PVL--NAPIDGGWSTWSP-----------WAVCQQH-- 568
            |..|...|.|   |...|         ||:  |..:..|.....|           |....:.  
  Rat   555 SGSLLVQHVANLDPSKMCIQYAIKKARPVVPKNITVVAGTDLVLPCHLSSNLAHALWTFRGRDLP 619

  Fly   569 -EQP-----DSNCQCRQRSCNNPQPQHGGATCEGISTQVTNCTQHGGWTEWSAWS-PCSQTCGIA 626
             |||     |:..|......  .|.:|.|.         .:|......|:.:|.| ..|...|.:
  Rat   620 AEQPGSFLYDTGLQALVVMA--AQSRHSGP---------YHCYSEEQGTKLAAESYLVSVVAGSS 673

  Fly   627 VKIRRRTCGNPRPAFG---------GRTCV-------------------GSEQSE---MYCRHLP 660
            |.:..|.   |....|         |..|:                   ||:.:|   :|...||
  Rat   674 VTLEARA---PLENLGLVWLAVVALGAVCLVLLLLVLSLRRRLREELEKGSKAAERTLVYPLELP 735

  Fly   661 PCPVAK-----PQSVDGGWGPWG-EWSECS-------AQC--GGGFRMRRRECNDPAPLNGGMEC 710
            ..|.:.     |::.:..|.|.| .:|:.|       |:|  |||         .|:|..|   .
  Rat   736 KEPASPPFRPGPETDEKLWDPVGYYYSDGSLKIVPGHARCQPGGG---------PPSPPPG---I 788

  Fly   711 PG 712
            ||
  Rat   789 PG 790

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Sema5cNP_611293.3 Sema_5C 65..496 CDD:200526 141/472 (30%)
PSI 497..546 CDD:396154 16/60 (27%)
TSP1 556..607 CDD:214559 11/69 (16%)
TSP1 610..663 CDD:214559 17/84 (20%)
TSP1 674..725 CDD:214559 15/49 (31%)
TSP1 802..834 CDD:214559
TSP1 853..901 CDD:214559
TSP1 906..953 CDD:214559
Sema4cXP_006244820.1 Sema_4C 64..520 CDD:200519 142/475 (30%)
PSI 521..>550 CDD:214655 10/28 (36%)
Ig 586..670 CDD:472250 17/94 (18%)
Ig strand B 596..600 CDD:409456 0/3 (0%)
Ig strand C 608..612 CDD:409456 0/3 (0%)
Ig strand E 633..637 CDD:409456 1/3 (33%)
Ig strand F 647..652 CDD:409456 1/13 (8%)
Ig strand G 664..667 CDD:409456 0/2 (0%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.