DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Leo1 and zormin

DIOPT Version :10

Sequence 1:NP_001005548.1 Gene:Leo1 / 300837 RGDID:1549772 Length:678 Species:Rattus norvegicus
Sequence 2:NP_001261317.1 Gene:zormin / 2769001 FlyBaseID:FBgn0052311 Length:3664 Species:Drosophila melanogaster


Alignment Length:674 Identity:130/674 - (19%)
Similarity:218/674 - (32%) Gaps:208/674 - (30%)


- Green bases have known domain annotations that are detailed below.


  Rat   146 EKAQSDDEKWDGEDKSDQSDDD--------------------EKLQN-SDDEEREQGSDDEKLQN 189
            ||..||:|  :...||.|.|..                    |||.| ..|.||.|....:::|.
  Fly   757 EKVMSDNE--ETVAKSTQVDTQLYPVFTSQSVDSKQLLISTREKLTNVIQDIERAQDEIQQRIQT 819

  Rat   190 SADEEEKMQNTDDEDRAQLSD--DDRQQLSEEEKGNSDDERPAASDNDEEKQNSDDEDRPQVSDE 252
            :..    :|..|....|::..  ::.:.|..:..|...|.|.......:..:|. .:.|.::.|.
  Fly   820 TLG----IQTKDQPSLAKIEQVINNLRMLKAKLDGIKYDYRTLVESVIQFLENI-VQLRREIDDY 879

  Rat   253 EKMQNSDDERPQVSDEDRRHSDDEEEQDQKSESARGSDSEDEVLRMKRKNAI--PSDSEVDSD-- 313
            ...|    ::...|..||..::.|:.:||..:..|...::.|:| :.|...:  |...|:|:|  
  Fly   880 FARQ----QKEPASGADRSIAEHEKFRDQCMDKFRSLITQSELL-IDRVRVLEPPGAREIDTDRI 939

  Rat   314 ----------------------TEVPKDNNGTMDLFGGADDISSGSD--GEDKPPTPGQPVD--- 351
                                  ..:.|......||    :||....|  .:.......|.||   
  Fly   940 LKLLENLRLHFESNSSARMSTLERLEKIEQFRSDL----EDIDRSLDSVSQQLHEINNQSVDSLA 1000

  Rat   352 ----------------ENGLPQDQQEEEPIPETR--------------IEVEIPKVNTDLGNDLY 386
                            |..:|:.      :..||              :|..|.|........|:
  Fly  1001 AAKTTSLAFEYFERTIECSMPRQ------LKSTRHQRRWGQTAHTIELLEKRIEKFTESTSQQLF 1059

  Rat   387 FVKLPNFLSVEPRPFDPQYYEDE----------------------------FEDEEMLDEEGR-- 421
            ..          .|...:|.:||                            ||..|.:|.|.|  
  Fly  1060 IT----------NPESERYVKDELRKLNEKWQSFKDQVKQKRKSLNQATDFFEVVEKIDAEYREI 1114

  Rat   422 TRLKLKVENTIRWRMRRDEEGNEIKESNARIV--------KWSDGSMSLHLGNEVFDVYKAPLQG 478
            :.....|.|.:.:.....|.||.:.:....:.        |....|...|..|:|..:|     .
  Fly  1115 SYFYTSVSNKVPYLRDSVEAGNLVNDIENYVTSREAALRSKLDSASQCAHDMNKVSSLY-----N 1174

  Rat   479 DHNHLFIRQGTGLQGQAVFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRDPECQ 543
            |..::|         |:..|.|:..      :....::.   .::..|.|:.|    ..|| :.:
  Fly  1175 DVMNIF---------QSFIKLKMDI------NVVQERLK---QEQRQKEQRER----DARD-QAE 1216

  Rat   544 RTEMIKKEEERLRASIRRESQQRRMREKQH------QRGLSASYLEPDRYDEEEEGEESVSLAAI 602
            |.:.||:.|.:.|  :.||.|.|...::|.      ||.|:|..|   ...|:...||...|.||
  Fly  1217 REKAIKEAEAKER--LHREEQSRLENQRQQAAIEQAQRELAAREL---ALREQAVREEEARLQAI 1276

  Rat   603 KNR-------YKGGIREERARIYS-SDSDEGSEE-------DKAQRLLKAKKLNSDEEGESSGKR 652
            :.:       .:...|||..||.| .|.....||       |:..|:.:.::.....|.||..||
  Fly  1277 REQATREQLAREQAAREEELRIQSLRDIARREEEVRLQNIRDEETRIRREEEERIRRENESRSKR 1341

  Rat   653 KAEDDDKANKKHKKYVISDEEEEE 676
            :.|...:..:..:...:.|:.:::
  Fly  1342 EEEARIQREEITRLQTLRDQVDQQ 1365

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Leo1NP_001005548.1 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1..373 57/310 (18%)
MSCRAMM_ClfA <11..373 CDD:468110 57/310 (18%)
Leo1 386..548 CDD:461126 32/199 (16%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 559..596 12/42 (29%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 613..678 16/72 (22%)
zorminNP_001261317.1 SPEC 126..332 CDD:238103
SPEC <504..641 CDD:413338
SMC_prok_B 654..>1389 CDD:274008 130/674 (19%)
Smc <1192..>1377 CDD:440809 44/187 (24%)
I-set 1390..1479 CDD:400151
Ig strand B 1407..1411 CDD:409564
Ig strand C 1420..1424 CDD:409564
Ig strand E 1445..1449 CDD:409564
Ig strand F 1459..1464 CDD:409564
Ig strand G 1472..1475 CDD:409564
I-set 1491..1580 CDD:400151
Ig strand B 1508..1512 CDD:409353
Ig strand C 1521..1525 CDD:409353
Ig strand E 1546..1550 CDD:409353
Ig strand F 1560..1565 CDD:409353
Ig strand G 1573..1576 CDD:409353
I-set 1589..1675 CDD:400151
Ig strand B 1606..1610 CDD:409353
Ig strand C 1619..1623 CDD:409353
Ig strand E 1641..1645 CDD:409353
Ig strand F 1655..1660 CDD:409353
Ig strand G 1668..1671 CDD:409353
I-set 1702..1786 CDD:400151
Ig strand B 1714..1718 CDD:409353
Ig strand C 1727..1731 CDD:409353
Ig strand E 1752..1756 CDD:409353
Ig strand F 1766..1771 CDD:409353
Ig strand G 1779..1782 CDD:409353
I-set 1811..1902 CDD:400151
Ig strand B 1828..1832 CDD:409353
Ig strand C 1841..1845 CDD:409353
Ig strand E 1868..1872 CDD:409353
Ig strand F 1882..1887 CDD:409353
I-set 1927..2015 CDD:400151
Ig strand B 1944..1948 CDD:409353
Ig strand C 1957..1961 CDD:409353
Ig strand E 1981..1985 CDD:409353
Ig strand F 1995..2000 CDD:409353
Ig strand G 2008..2011 CDD:409353
Ig 2142..2231 CDD:472250
Ig strand B 2159..2163 CDD:409353
Ig strand C 2172..2176 CDD:409353
Ig strand E 2197..2201 CDD:409353
Ig strand F 2211..2216 CDD:409353
Ig strand G 2224..2227 CDD:409353
I-set 2238..2325 CDD:400151
Ig strand B 2255..2259 CDD:409353
Ig strand C 2268..2272 CDD:409353
Ig strand F 2309..2314 CDD:409353
PHA03247 <2593..2893 CDD:223021
I-set 3566..3655 CDD:400151
Ig strand B 3583..3587 CDD:409353
Ig strand C 3596..3600 CDD:409353
Ig strand E 3621..3625 CDD:409353
Ig strand F 3635..3640 CDD:409353
Ig strand G 3648..3651 CDD:409353
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.