DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Cubn and Nrp1

DIOPT Version :10

Sequence 1:NP_727348.2 Gene:Cubn / 326235 FlyBaseID:FBgn0052702 Length:3750 Species:Drosophila melanogaster
Sequence 2:NP_659566.1 Gene:Nrp1 / 246331 RGDID:621588 Length:922 Species:Rattus norvegicus


Alignment Length:970 Identity:207/970 - (21%)
Similarity:330/970 - (34%) Gaps:322/970 - (33%)


- Green bases have known domain annotations that are detailed below.


  Fly   960 MGRGFKFEYRALATG--------NDKCGG-VHTRSGDHIRLPVHDDSYAGEATCYWVIMAP-ANK 1014
            |.||.......||..        :||||| :...:..::..|.:..||.....|.|:|.|| ..:
  Rat     1 MERGLPLLCATLALALALAGAFRSDKCGGTIKIENPGYLTSPGYPHSYHPSEKCEWLIQAPEPYQ 65

  Fly  1015 AIRLHWN-SFSLENAVDCIYDYLEIYDSLGAQVNDERSKPLAKYCGNSVPEDLLSHSRQLVLKFV 1078
            .|.:::| .|.||:. ||.|||:|:.|.     .:|..:...|:||...|..::|....|.:|||
  Rat    66 RIMINFNPHFDLEDR-DCKYDYVEVIDG-----ENEGGRLWGKFCGKIAPSPVVSSGPFLFIKFV 124

  Fly  1079 SDYSESDGGFDLTY-TFEDRAKCGGHIHASSGELTSPEYPANYSAGLDCDWHLTGTIDHLLEIQV 1142
            |||.....||.:.| .|:...:|..:..|.:|.:.||.:|..|...|:|.:.:.......:.::.
  Rat   125 SDYETHGAGFSIRYEIFKRGPECSQNYTAPTGVIKSPGFPEKYPNSLECTYIIFAPKMSEIILEF 189

  Fly  1143 ENFELEQSPN------CSADYLEVRNGGGTDSPLIGRFCGRDIPARIPGFSHEMRLILHTDSAIN 1201
            |:|:|||..|      |..|.||:.:|.....|.|||:||:..|.||...|..:.::.:|||||.
  Rat   190 ESFDLEQDSNPPGGVFCRYDRLEIWDGFPEVGPHIGRYCGQKTPGRIRSSSGILSMVFYTDSAIA 254

  Fly  1202 GRGFRLRWRIFA------FGCGGSLRSNMGAISSPRYPNS---------------YPNM------ 1239
            ..||...:.:..      |.|..:|....|.|.|.:...|               ||..      
  Rat   255 KEGFSANYSVLQSSISEDFKCMEALGMESGEIHSDQITASSQYGTNWSVERSRLNYPENGWTPGE 319

  Fly  1240 -AHCEW-RISLHPGSGISLLIEDLELEG-----LSNCYY-------------DSVKIYTGIKL-- 1282
             ::.|| ::.|    |:...:..:..:|     ....||             |.:.:..|.|.  
  Rat   320 DSYREWIQVDL----GLLRFVTAVGTQGAISKETKKKYYVKTYRVDISSNGEDWITLKEGNKAII 380

  Fly  1283 --PNQSPCKVL--------------------------------CKDDDLHNPLIQLENNKGTIVF 1313
              .|.:|..|:                                ||..|.  |...:......::.
  Rat   381 FQGNTNPTDVVFGVFPKPLITRFVRIKPASWETGISMRFEVYGCKITDY--PCSGMLGMVSGLIS 443

  Fly  1314 DSDASNTFRGFRISYKANCIRNLTATTGTIESLNYMEPFWETIP-----INCSWTIRAPKGNRVL 1373
            ||..:.:.:|.| ::....||.:|:.||           |...|     || .| ::...|:..:
  Rat   444 DSQITASNQGDR-NWMPENIRLVTSRTG-----------WALPPSPHPYIN-EW-LQVDLGDEKI 494

  Fly  1374 VEVSHLARHEQHVPTATMPGGLYIVDGRNVQEIVTPQAMNISGEVLTVVHNASNVNFQLDYRIDG 1438
            |.                  |:.|..|::.:..|..:...|:     ..:|.|:....:|     
  Rat   495 VR------------------GVIIQGGKHRENKVFMRKFKIA-----YSNNGSDWKMIMD----- 531

  Fly  1439 CMEELRGTFGFFQSPNYP---------------KMYPNN------------LECYWLITVEQDSA 1476
              :..|....|..:.||.               ::||..            |.|    .||..:|
  Rat   532 --DSKRKAKSFEGNNNYDTPELRAFTPLSTRFIRIYPERATHSGLGLRMELLGC----EVEVPTA 590

  Fly  1477 IELTINNIDLEDSPNCTKDALTVSNHKNSVEVHERHCGSTTKLVITSSGHRLHVRFISDNSHNGL 1541
            ...|.|...:::   |..|                                      ..|.|:|.
  Rat   591 GPTTPNGNPVDE---CDDD--------------------------------------QANCHSGT 614

  Fly  1542 G--FEAT-YRTVKATCGGKLTARNGVIES--PNYPLN----YPAHSR-CEWQVEVSQHHQIVFEM 1596
            |  |:.| ..||.||  .|.|..:..|:|  |.|..|    :.:|.. |.|  |...|.|:.:.:
  Rat   615 GDDFQLTGGTTVLAT--EKPTIIDSTIQSEFPTYGFNCEFGWGSHKTFCHW--EHDSHAQLRWRV 675

  Fly  1597 ADLNLESG------YDCNWDYLEAYDLTEDDTEGERLFKVCGDETEDDKLL----------SSSS 1645
              |..::|      .|.|:.|.:|                  ||.:..|:.          ||:.
  Rat   676 --LTSKTGPIQDHTGDGNFIYSQA------------------DENQKGKVARLVSPVVYSQSSAH 720

  Fly  1646 NMAVVRFISDDSVSKKGFRLHFHESCGQTIIVDETMFDYIQMSRQAARNESCLWVFQAVEPNK-- 1708
            .|.....:|...|.....:||:.:.           .:|.|:......::...|....|..:|  
  Rat   721 CMTFWYHMSGSHVGTLRVKLHYQKP-----------EEYDQLVWMVVGHQGDHWKEGRVLLHKSL 774

  Fly  1709 ---RIIF------------------TPTHVKLREDANQQYPTEGDCLNVGVKIYE-GTEP 1746
               ::||                  ...|:. :||..:  ||:.|..|..:||.| |:.|
  Rat   775 KLYQVIFEGEIGKGNLGGIAVDDISINNHIP-QEDCAK--PTDLDKKNTEIKIDETGSTP 831

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CubnNP_727348.2 cubilin_NTD 21..149 CDD:412063
EGF_CA 156..190 CDD:238011
EGF_CA 192..233 CDD:238011
EGF_CA 282..322 CDD:214542
EGF_CA 324..367 CDD:214542
EGF 430..457 CDD:394967
EGF_CA 469..503 CDD:238011
CUB 509..622 CDD:238001
CUB 627..737 CDD:238001
CUB 744..849 CDD:412131
CUB 853..970 CDD:238001 3/9 (33%)
CUB 978..1094 CDD:238001 39/119 (33%)
CUB 1100..1211 CDD:238001 37/116 (32%)
CUB 1216..1330 CDD:238001 29/190 (15%)
CUB 1446..1549 CDD:238001 20/132 (15%)
CUB 1554..1667 CDD:238001 28/135 (21%)
CUB 1792..1899 CDD:238001
CUB 1910..1998 CDD:412131
CUB 2019..2133 CDD:238001
CUB 2140..2242 CDD:238001
CUB 2263..2379 CDD:238001
CUB 2385..2511 CDD:238001
CUB 2516..2630 CDD:238001
CUB <2833..2892 CDD:412131
CUB 2898..3008 CDD:238001
CUB 3011..3127 CDD:238001
CUB 3130..3241 CDD:238001
CUB 3254..3363 CDD:238001
CUB 3379..3508 CDD:238001
CUB 3531..3601 CDD:412131
CUB 3623..3733 CDD:238001
Nrp1NP_659566.1 CUB 27..140 CDD:238001 39/118 (33%)
CUB 147..264 CDD:238001 37/116 (32%)
FA58C 277..423 CDD:238014 20/149 (13%)
FA58C 435..582 CDD:238014 30/190 (16%)
MAM 650..810 CDD:459878 32/193 (17%)
DUF3481 844..922 CDD:403259
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.