DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment dpy and Nid2

DIOPT Version :10

Sequence 1:NP_001260032.1 Gene:dpy / 318824 FlyBaseID:FBgn0053196 Length:22949 Species:Drosophila melanogaster
Sequence 2:NP_001012005.2 Gene:Nid2 / 302248 RGDID:1311685 Length:1396 Species:Rattus norvegicus


Alignment Length:722 Identity:169/722 - (23%)
Similarity:225/722 - (31%) Gaps:254/722 - (35%)


- Green bases have known domain annotations that are detailed below.


  Fly   120 YGGCNCVDKDENECKQRPCDVFAHCTNTLGSFTCTCFPGYRGNGFHCEDIDECQDPAIAARCVEN 184
            |.|.|   ::..|.....|...|.||:....|.|.|...:.|||.||  :.|.....:..:...:
  Rat   496 YNGAN---RETCEHSHGQCSRHAFCTDYTTGFCCHCQSRFYGNGRHC--VPEGAPHRVNGKVSGH 555

  Fly   185 AECCNLPAHF-------LCKCKDG----------------------------------------- 201
            .:..::|.||       .....||                                         
  Rat   556 LQVGHMPVHFTDVDLHAYIVSNDGRAYTAISQIPQPAAQALLPVLPIGGLFGWLFALEKPGSENG 620

  Fly   202 ----------------YEGDGEVLCTDVDECRNPENCGPNALCTNT---------PGNYTCSCPD 241
                            :.|:..|..|...|..:||    |.|..||         |.|:|.... 
  Rat   621 FSLTGATFVHDVEVTFHPGEERVRITQTAEGLDPE----NYLSLNTNIEGQVPFIPANFTAHIA- 680

  Fly   242 GYVGNNPYREGCQDVDEC-------SYPNVCGPGAICTNLEGSYRCDCPPGYDGDGRS------- 292
                  ||:|.....|..       |:..:.|    ..|...|||.|....|.....:       
  Rat   681 ------PYQEFYHHRDSVVTSSSSRSFSLISG----SINQTRSYRIDQNITYKACSHAPRHLAVP 735

  Fly   293 ----------------ESGC-------------VDQDECARTPCGRNADCLNTDG--------SF 320
                            |.|.             ||....|..||...:...:|..        .:
  Rat   736 ATQQLTVDRAFALYSEEEGVLRFAVTNQIGPVEVDSAPTAVNPCYDGSHTCDTTARCHPGTGVDY 800

  Fly   321 RCLCPDGYSGDPMNGCEDVDECATN-NPCGLGAECVNLGGSFQCRCPSGFVLEHDPHADQLPQPL 384
            .|.|..|:.||. ..|.||:||||. :.||..:.||||.||::|.|.||:....|.|...|    
  Rat   801 TCECTPGFQGDG-RSCVDVNECATGFHRCGPNSVCVNLVGSYRCECRSGYEFADDRHTCVL---- 860

  Fly   385 NTQQLGYGPGATDIAPYQRTSGAGLACLDIDECNQPDGVAKCGTNAKCINFPG-SYRCLCPSGFQ 448
                         |||...      .|||......|:|      .|:||:..| |:.|.|..||.
  Rat   861 -------------IAPPPN------PCLDGSHTCAPEG------QARCIHHGGSSFSCACLPGFV 900

  Fly   449 GQGYLHCENINECQDNPCGENAICTDTVGSFVCTCKPDYTGDPFRGCVDI---DECTALDKPC-- 508
            |.|: .|.:::||.:|.|...|||.:|.|||.|.|:|.|.||.|....|.   |..:.| |||  
  Rat   901 GTGH-QCSDVDECAENRCHGAAICYNTPGSFSCRCQPGYHGDGFHCASDTVPEDSISGL-KPCEY 963

  Fly   509 -GQHAVCENTVPGYNCKCPQGYDGKPDPKVACEQVDVNILCSSNFDCTNNAECIENQCFCLDGFE 572
             .::|..::...|.....||                          |.:....:..||....|| 
  Rat   964 QQRYAQAQHAHAGSRIHIPQ--------------------------CDDQGNFVPLQCHGSTGF- 1001

  Fly   573 PIGSSCVDIDECRTHAEVCGPHAQCLNTPGSYGCECEAGYVGSP------PRMACKQPCEDVRCG 631
               ..|||    :...||.|..    ..|||....|     |.|      ||..|::..|.:.  
  Rat  1002 ---CWCVD----QNGHEVPGTQ----TPPGSTPPHC-----GPPPEPTQRPRTVCERWRESLL-- 1048

  Fly   632 AHAYCKPDQNEAYCVCEDGWTYNP------SDVAAGCVDIDECDVMHGPFGSCGQNATCTNSAGG 690
            .|....|..::....|:|...:.|      ||. ..|||.|            |:....|.|..|
  Rat  1049 EHYGGTPRDDQYVPQCDDLGHFIPLQCHGKSDF-CWCVDKD------------GRELPGTRSQPG 1100

  Fly   691 FTCACPP 697
            ...||.|
  Rat  1101 TMPACIP 1107

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
dpyNP_001260032.1 EGF_3 137..166 CDD:463759 10/28 (36%)
EGF_CA 212..247 CDD:238011 10/43 (23%)
EGF_CA 255..>286 CDD:214542 8/37 (22%)
EGF_CA 298..331 CDD:238011 8/40 (20%)
EGF_CA 338..373 CDD:238011 18/35 (51%)
EGF_CA 413..456 CDD:238011 14/43 (33%)
EGF_CA 457..490 CDD:238011 15/32 (47%)
EGF_CA 497..>529 CDD:214542 10/37 (27%)
EGF_CA 580..>612 CDD:214542 8/31 (26%)
EGF_3 676..702 CDD:463759 7/22 (32%)
EGF_CA 1022..1056 CDD:214542
EGF_CA 2227..2260 CDD:238011
EGF_CA 2393..>2422 CDD:214542
Herpes_BLLF1 <4032..4489 CDD:282904
Herpes_BLLF1 <4334..4797 CDD:282904
PHA03255 4845..>5021 CDD:165513
DUF5585 4959..5356 CDD:465521
PHA03247 <5170..5799 CDD:223021
PRK12495 <5749..5877 CDD:183558
PHA03247 <5865..6485 CDD:223021
Chi1 6385..>6609 CDD:442692
Herpes_BLLF1 <6607..7102 CDD:282904
Herpes_BLLF1 <7027..7490 CDD:282904
DUF5585 7550..7954 CDD:465521
DUF5585 7881..>8130 CDD:465521
Atrophin-1 18652..>19081 CDD:460830
PHA03247 <18873..19514 CDD:223021
ZP 22576..22811 CDD:214579
Nid2NP_001012005.2 NIDO 108..274 CDD:214712
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 346..393
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 408..452
EGF_3 504..539 CDD:463759 11/34 (32%)
nidG2 543..767 CDD:238158 32/238 (13%)
EGF_3 779..815 CDD:463759 7/36 (19%)
EGF_CA 817..859 CDD:214542 20/41 (49%)
EGF_3 912..945 CDD:463759 17/32 (53%)
TY 960..1021 CDD:238114 18/98 (18%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1013..1036 7/31 (23%)
Thyroglobulin_1 1040..1105 CDD:459665 17/79 (22%)
LY 1155..1193 CDD:214531
LDL-receptor class B 1 1175..1218
LY 1199..1241 CDD:214531
LDL-receptor class B 2 1219..1261
LY 1242..1286 CDD:214531
LDL-receptor class B 3 1262..1306
LY 1287..1329 CDD:214531
LDL-receptor class B 4 1307..1348
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.