DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment cv-2 and Muc5b

DIOPT Version :10

Sequence 1:NP_524809.2 Gene:cv-2 / 45280 FlyBaseID:FBgn0000395 Length:751 Species:Drosophila melanogaster
Sequence 2:NP_001414085.1 Gene:Muc5b / 309114 RGDID:1561983 Length:3925 Species:Rattus norvegicus


Alignment Length:839 Identity:203/839 - (24%)
Similarity:286/839 - (34%) Gaps:281/839 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly    27 TGFRPSTQLLILIAVLLAL----------LQGRTVDAGAGD--SLSGV--------------RQS 65
            ||.:...||:..:.|.:.|          |.|......|.|  :||||              :.|
  Rat   536 TGLQLQVQLVPFMQVFVRLDRSYQGQMCGLCGNFNQNQADDFTALSGVVEGTGAAFSNTWKTQAS 600

  Fly    66 CSNE--------GEEVQLKNQPQIFTCFKCECQNGFVNCRDTCPPV---NDCYILDKSNGTCCRR 119
            |.|.        ...|:.:|..:.:.....|....|..|..|..||   ::| :.|..|......
  Rat   601 CPNSKNTYEDPCSYSVENENFAREWCSMLTESSGVFSACHATVSPVPFYSNC-LFDTCNCENSED 664

  Fly   120 C---------KGCSFRGMSYESGSEWNDPEDPCKTYKCVATVVTETIQKCYSQCDNNQLQPPRPG 175
            |         :.|:.:|:..   |.|..                   :.||...:|    .|:..
  Rat   665 CMCAALSSYVRACAAKGVLL---SGWRG-------------------KACYKYMNN----CPQTK 703

  Fly   176 E-------CCPTCQGCKINGQTVAEGHEVDASIDDRCLVCQCRGTQLTCSKKTCPV--LPCPMSK 231
            |       |.|||:...          |||                :|||....||  ..||...
  Rat   704 EYSYSVSTCQPTCRSLS----------EVD----------------VTCSIPFVPVDGCTCPEGT 742

  Fly   232 QIKRPDECCP--RCPQNHSFLPVPGKCLFNKSVYPEKTQFMPDRCTNCTCLNGTSVC-----QRP 289
            .:...|.|.|  .||           |.|:.:|.......| |....|:|.||...|     ||.
  Rat   743 FLNDKDHCVPVEECP-----------CYFHGTVVASGEVVM-DNGVVCSCTNGKLTCLGALMQRN 795

  Fly   290 TCPILECAPEFQEPDGCCPRCAVAEVRSECSLDGIVYQNNETWDMGPCRSCRC------------ 342
            .    ||.......|  |...:|.:..:||      .::..|.|: .|.|.:|            
  Rat   796 K----ECQAPMVYLD--CNNASVGDHGAEC------LRSCHTLDV-DCFSTQCVSGCVCPSGLVA 847

  Fly   343 --NGGTIRCAQMRCPAVKCRANEELKQPPGEC----CQRCV-----------ETAGTCTVFGDPH 390
              |||.|  |:..||.|    :.|....|||.    |..|.           ...|.|..:||.|
  Rat   848 DGNGGCI--AEEDCPCV----HNEATYRPGEIIRVDCNNCTCRNRRWECTNQPCMGACVAYGDGH 906

  Fly   391 FRTFDGKFFSFQGSCKYLLASD-CMGK-----TFHIRLTNEGRGTRRASWAKTVTLSLRNLKVNL 449
            |.||||:.:.|:|:|:|.||.| |.|.     ||.|...|...||...:.:|.:.:.:.:.::.|
  Rat   907 FVTFDGERYIFEGNCEYTLAQDYCRGNTSTDGTFRIVTENVPCGTTGTTCSKAIKIFVESYELIL 971

  Fly   450 GQ-RMRVKVNGTRVTLPYFVVAGGQNVTIERLANGGAVMLRSEMGLTLEWNGAGFLQVSVPAKFK 513
            .: ..:|...|.....||.:...|..:.||         :||  |:.:.|:....:.|.:...:|
  Rat   972 HEGNFKVVARGPSGDPPYKIRYMGIFLVIE---------IRS--GIVVSWDRKTSVFVRLQQHYK 1025

  Fly   514 KRLCGLCGNFNGSSRDDLTGKDGRSHGDDEVWHFANSWKVGGPKSCSRKREFLAATPTC------ 572
            .|:|||||||:.::.:|.|.:.....||  |..|.||||.               :|:|      
  Rat  1026 GRVCGLCGNFDDNAINDFTTRSQSVVGD--VLEFGNSWKF---------------SPSCPDAPVP 1073

  Fly   573 -DKRKSNFY--------CHPLSVPALFGECNERLNPENYKAACRMDVCECPSG---DCHCDSFAA 625
             |...:|.|        |..:: .|.|..|:.:::...|..||..|||.|.||   :|.|.:.||
  Rat  1074 KDPCIANPYRKSWAQKKCSIIN-SATFAACHSQVDSTKYYEACVHDVCACDSGGDCECFCTAVAA 1137

  Fly   626 YAHECRRLGVQLPDWRSATNCPAGWRRNATLSSFKGNQF--YGDPSFSRMKGRRQKNHQLRLQLQ 688
            ||..||.:||.| .||:...||.               |  |.:|            |.   |.:
  Rat  1138 YAQACRDVGVCL-SWRTPDICPL---------------FCDYYNP------------HG---QCE 1171

  Fly   689 QEQQQRSKQGQKGRHKPGGHNQLDRQGHNGLDKDQLQKEFILKHVPSSFLYPRAPDRTP 747
            ...|.......|....|.||..:|..|..|                   .||:.|...|
  Rat  1172 WHYQPCGAPCLKTCRNPSGHCLVDLPGLEG-------------------CYPQCPPSQP 1211

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
cv-2NP_524809.2 VWC 184..243 CDD:214564 13/62 (21%)
VWC 256..310 CDD:278520 16/58 (28%)
VWC 319..376 CDD:214564 19/74 (26%)
VWD 371..536 CDD:214566 53/186 (28%)
C8 580..646 CDD:462584 27/76 (36%)
Muc5bNP_001414085.1 None

Return to query results.
Submit another query.