DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment tobi and Si

DIOPT Version :10

Sequence 1:NP_651391.1 Gene:tobi / 43072 FlyBaseID:FBgn0261575 Length:657 Species:Drosophila melanogaster
Sequence 2:NP_001376155.1 Gene:Si / 497756 RGDID:3675 Length:1836 Species:Rattus norvegicus


Alignment Length:830 Identity:163/830 - (19%)
Similarity:262/830 - (31%) Gaps:286/830 - (34%)


- Green bases have known domain annotations that are detailed below.


  Fly    14 CLWEAV-----------------HTCASIE----------------QRYKFPNS-----NVYVRL 40
            |||:.|                 :|..||:                .|.|.|::     .|.|:.
  Rat   970 CLWQPVSGLSNVPPCYFPPEHNPYTLTSIQPLPTGITAELQLNPANARIKLPSNPISTLRVEVKY 1034

  Fly    41 RRGDKLQYELMKGEKSL----------QTVTFDN---FD---------------SSANLV-ETSS 76
            .:.|.||:::...:...          .|.|..|   :|               ||..|: ::|.
  Rat  1035 HKNDMLQFKIYDAQHKRYEVPVPLNIPDTPTSSNERLYDVEIKENPFGIQVRRRSSGKLIWDSSL 1099

  Fly    77 GSYQLTDGQSTVEFTLVKSGVVAGTGSEITHVKVSRDQVLQGTQPRDCVPLKTGSTHWFSGPEQK 141
            ..:...| |.....|.:.|..:.|.| |:.|....||                  .:|       
  Rat  1100 PGFAFND-QFIQISTRLPSNYLYGFG-EVEHTAFKRD------------------LNW------- 1137

  Fly   142 QHYWPV-ERQRHSNYSYLPKELDNFGVGERYWL-----NGDGVFLYVESNTPLFIDQNSAAYPDQ 200
             |.|.: .|.:...|     :|:::|....|..     |..||.| :.||.   :|......|  
  Rat  1138 -HTWGMFTRDQPPGY-----KLNSYGFHPYYMALENEGNAHGVLL-LNSNG---MDVTFQPTP-- 1190

  Fly   201 LCLSAISALPYDPRVTSAKFVYHVAVAKDAKAAHRYAVKTFLGQPTGYPDERMVMHPVWSTWALY 265
                   ||.|  |.......:::.:....:.|.|.     ..:..|:|    ||.|.|:     
  Rat  1191 -------ALTY--RTIGGILDFYMFLGPTPEIATRQ-----YHEVIGFP----VMPPYWA----- 1232

  Fly   266 KAEVSDDVVRDFAKQILDNGFNNSQLEIDDDWEDCYGAMTFRKNKFPD---------------SK 315
                       ...|:...|:.|:. ||:..:.|...|......::.|               .|
  Rat  1233 -----------LGFQLCRYGYRNTS-EIEQLYNDMVAANIPYDVQYTDINYMERQLDFTIGERFK 1285

  Fly   316 TL---TDDLKALGFRVTLWIHPFINNNCTAIYDEAKSKGY---LVLDHEGSSDTQW---W----- 366
            ||   .|.::..|.:..:.:.|.|:.|.|..| .|..:|.   :.:....::|..|   |     
  Rat  1286 TLPEFVDRIRKDGMKYIVILDPAISGNETQPY-PAFERGIQKDVFVKWPNTNDICWAKVWPDLPN 1349

  Fly   367 -------------NSKPKDAAILDFTKTEVQEWFSKRLKRLQDEDGIDSFKFDA----------- 407
                         |:.....|..||.:....||:::.:....:|    ..|||.           
  Rat  1350 VTIDETITEDEAVNASRAHVAFPDFFRNSTLEWWAREIYDFYNE----KMKFDGLWIDMNEPSSF 1410

  Fly   408 --------------------------GE-----TSWLPSDPVLQASSSMV----------TPLQA 431
                                      ||     |..:.::.:|...||::          :.::.
  Rat  1411 VNGTVTNKCRNDTLNYPPYFPELTKRGEGLHFRTMCMETEHILSDGSSVLHYDVHNLYGWSQVKP 1475

  Fly   432 VGDYVRTVAAFGDMVEVRSAQNTQDQPIFVRIVDKDSEWG----------WNNGLLTLISSMLQM 486
            ..|.::.......:|..||...|.            ..||          |:|...:|| .||:.
  Rat  1476 TLDALQNTTGLRGIVISRSTYPTA------------GRWGGHWLGDNYANWDNLEKSLI-GMLEF 1527

  Fly   487 NLNGYPFVLPDMIGGNGYYEKPPTKELFLRWLQANVFMP-----ALQFSF----VPWNFDDEAIA 542
            ||.|.|::..|:.|    :.......|..||:|...|.|     .:||:.    |.||   |..|
  Rat  1528 NLFGIPYIGADICG----FFNDSEYHLCTRWMQVGAFYPYSRNHNIQFTRRQDPVSWN---ETFA 1585

  Fly   543 -ISKNFTALHATYTPYIMKLFKRAVDSGEPVNVPLWWIAPTDEVAQSIYDEFLLGEDIIAAPVVV 606
             :||....:..|..||.......|...|..|..||......|:....||.:||.|...:..|||.
  Rat  1586 QMSKKVLEIRYTLLPYFYTQMHEAHAQGGTVIRPLMHEFFDDKETWEIYKQFLWGPAFMVTPVVE 1650

  Fly   607 EGATKRDIYLPEGEWQDGNSDQVYTGPTWVMDYAAPLDTLPYFLRVGFTL 656
            ...|....|:|:..|.|.::.:.......:..::||.||:...:|.|:.|
  Rat  1651 PFRTSVTGYVPKARWFDYHTGEDIKLKGILHTFSAPFDTINLHVRGGYIL 1700

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
tobiNP_651391.1 GH31_NET37 255..621 CDD:269878 96/479 (20%)
SiNP_001376155.1 Trefoil 80..118 CDD:459666
NtCtMGAM_N 135..245 CDD:465286
GH31_N 237..353 CDD:270212
GH31_MGAM_SI_GAA 353..717 CDD:269888
PD 944..990 CDD:197472 4/19 (21%)
NtCtMGAM_N 1004..1117 CDD:465286 19/113 (17%)
GH31_N 1109..1223 CDD:270212 30/165 (18%)
Glyco_hydro_31 1204..1700 CDD:460044 109/546 (20%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.