DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CadN and Dsg3

DIOPT Version :10

Sequence 1:NP_001027277.1 Gene:CadN / 35070 FlyBaseID:FBgn0015609 Length:3101 Species:Drosophila melanogaster
Sequence 2:NP_001178008.2 Gene:Dsg3 / 291752 RGDID:1592103 Length:1043 Species:Rattus norvegicus


Alignment Length:598 Identity:159/598 - (26%)
Similarity:273/598 - (45%) Gaps:82/598 - (13%)


- Green bases have known domain annotations that are detailed below.


  Fly  1529 EEDDRNLPKRVLQVTATDGDKDRPQNIVYFLTGQGIDPDNPANSKFDINRTTGEIFVLKPLDRDQ 1593
            |.:|.:....:.::|:   |..:.|.|.|.::|.||  |.|....|.::...|:|.:...:||::
  Rat    60 EREDNSKRNPIAKITS---DFQKNQKITYRISGVGI--DQPPFGIFVVDPNNGDINITAIVDREE 119

  Fly  1594 PNGRPQWRFTVFAQDEGGEGLVGYADVQVNLKDINDNAPIFPQGVYFGNVTENGTAGMVVMTMTA 1658
               .|.:..|..|.:..|:.:.....:.|.:.|:|||||||.|.::.|.:.||..:..:||.:.|
  Rat   120 ---TPSFLITCRALNALGQDVERPLILTVKILDVNDNAPIFSQTIFKGEIEENSASNSLVMILNA 181

  Fly  1659 VDYDDPNEGSNARLVYSIEKNVIEEETGSPIFEIEPDTGVIKTAVCCLDRERTPDYSIQVVAM-- 1721
            .|.|:||. .|:::.:.|   |.:|..|..:|.|..:||.::|....||||:...|.:.|...  
  Rat   182 TDADEPNH-MNSKIAFKI---VSQEPAGMSMFLISRNTGEVRTLTSSLDREQVGSYHLIVSGADN 242

  Fly  1722 DGGGLKGTGTASIRVKDINDMPPQFTKDEWFTEVDETDGTALPEMPILTVTVHDEDETNKFQYKV 1786
            ||.||......||::||:||..|..|..::...::|  .|...|:....||..||:.|:.:....
  Rat   243 DGTGLSTQCECSIKIKDVNDNFPVLTDSQYSARIEE--NTLNSELLRFQVTDWDEEYTDNWLAVY 305

  Fly  1787 IDNSGYGADKF---TMVRNNDGTGSLKIVQPLDYEDQLQSNGFRFRIQVNDKGEDNDN--DKYHV 1846
            ...||...:.|   |..|.|:|.  ||:|:.|||| |:||  .:|.|.|.:|.|.:.:  .:|.|
  Rat   306 FFTSGNEGNWFEIETDPRTNEGI--LKLVKVLDYE-QMQS--MQFSIAVRNKAEFHQSVISQYQV 365

  Fly  1847 AYSWVVVKLRDINDN---KPHFERANVEVSVFEDTKVGTELEKFKATDPDQG-GKSKVSYSIDR- 1906
            ..:.|.:::.::.:.   :|......|:..|..:...|..|..::|||.|.| ..|.|.|.:.| 
  Rat   366 QSTPVTIQVINVQEGISFRPPSRTFTVQRGVSINKLAGYILGTYQATDEDTGKAASSVRYVLGRN 430

  Fly  1907 --------SSDRQRQFA--INQNGSVTIQRSLDREVVPRHQVKILAIDDGSPPKTATATLTVIVQ 1961
                    |...:.:|.  ||::.:..:.:::..||        ||||:.: .||:|.|:.|.|.
  Rat   431 DGGFLVIDSKTAEIKFVKNINRDSTFIVNKTISAEV--------LAIDENT-GKTSTGTVYVEVP 486

  Fly  1962 DINDNAPKFLKDYRPVLPEHVPPRKVVEILATDDDDRSKSNGPPFQFRLDPSADDIIRASFKVEQ 2026
            ..|:|.|..:.:.|.:..........|..|     ||:|..||                 :.|..
  Rat   487 SFNENCPSVVLEKREICTSSPSVTLSVRTL-----DRAKYTGP-----------------YTVSL 529

  Fly  2027 DQKGANGDGMAVISSLRSFDREQQKE-------YMIPIVIKDHGSPAMTGTSTLTVIIGDVNDNK 2084
            :::......:..|::|.:.....|.:       |.:|:::||:.........:||:.:...:|..
  Rat   530 EEQPLKLPVVWTITTLNATSALLQAQQHISPGVYNVPVIVKDNQDGQCDTLESLTLTVCQCDDRS 594

  Fly  2085 M--QPG-SKDIFV 2094
            |  .|| |::.|:
  Rat   595 MCRAPGPSREPFI 607

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CadNNP_001027277.1 Cadherin_repeat 178..301 CDD:206637
CA_like 449..541 CDD:481204
Cadherin_repeat 549..>625 CDD:206637
Cadherin_repeat 655..752 CDD:206637
Cadherin_repeat 765..853 CDD:206637
Cadherin_repeat 862..964 CDD:206637
Cadherin_repeat 973..1074 CDD:206637
Cadherin_repeat 1083..1166 CDD:206637
Cadherin_repeat <1219..1298 CDD:206637
Cadherin_repeat 1310..1414 CDD:206637
Cadherin_repeat 1423..1514 CDD:206637
Cadherin_repeat 1522..1630 CDD:206637 24/100 (24%)
Cadherin_repeat 1638..1741 CDD:206637 34/104 (33%)
CA 1770..1863 CDD:214520 30/100 (30%)
Cadherin_repeat 1871..1966 CDD:206637 29/106 (27%)
Cadherin_repeat 1974..2083 CDD:206637 18/115 (16%)
EGF 2350..2380 CDD:394967
LamG 2385..2570 CDD:238058
EGF_2 2605..2631 CDD:400365
LamG 2637..2797 CDD:238058
EGF_CA 2869..2907 CDD:238011
CADH_Y-type_LIR 3029..3088 CDD:460041
Dsg3NP_001178008.2 CA 75..155 CDD:214520 23/87 (26%)
Cadherin_repeat 161..263 CDD:206637 35/105 (33%)
Cadherin 272..375 CDD:394985 33/109 (30%)
Cadherin_repeat 396..491 CDD:206637 29/103 (28%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.