DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment shn and Hivep1

DIOPT Version :10

Sequence 1:NP_001260883.1 Gene:shn / 36171 FlyBaseID:FBgn0003396 Length:2587 Species:Drosophila melanogaster
Sequence 2:XP_038951236.1 Gene:Hivep1 / 117140 RGDID:727847 Length:2691 Species:Rattus norvegicus


Alignment Length:2880 Identity:614/2880 - (21%)
Similarity:914/2880 - (31%) Gaps:1117/2880 - (38%)


- Green bases have known domain annotations that are detailed below.


  Fly   186 GEVPLPTVDSNHIISNNNNNNNNNNTSNNNNNNHHSDNSISENNYEFKSRDKASLSDSKMTLQQM 250
            ||.|..||:::                       .|.:|:|..        |||....:..|::.
  Rat   113 GETPGMTVEAS-----------------------ESGDSVSPK--------KASSPHHRSELRRW 146

  Fly   251 AAVSSNQEPVAPNVANTMSNSSIINSQQTTVN---ATPATDEAPLGDRSNISRYLHKKFKRLAST 312
            .  |...:|...:..:...:||..:|:..|.|   ::|.....|       ..|....|..|...
  Rat   147 R--SEGSDPARLSGLDGQRDSSSSSSKTRTDNSECSSPCCSTTP-------PSYTSTAFDVLLKA 202

  Fly   313 TEVDSWSATNGGGALQASGD------AVRTTSLSSNSSLSPP--------------PTT----PL 353
            .|.:..:.:..|.:.....:      .||:.|...||||..|              |.|    |:
  Rat   203 MEPELSTLSQKGSSCAVKTEKPRPNRTVRSPSKLKNSSLDAPNPASPELVAESQCSPCTSYPVPV 267

  Fly   354 AN---GHHLMTQQQSHQQVQQQ-------QQQQQTPPPSVAIHEFSANFVNSNHVNAIGHEESII 408
            |:   ...:..|:.||...|..       ||..|.|                .|:...|      
  Rat   268 ASTQKSEQVAAQRVSHLHSQYDHLVPKPGQQNPQLP----------------GHLGLAG------ 310

  Fly   409 SNSGYKAAGRTRTPLANSNSNTNSTSNSNSNHAANATL---SPASFAQ----HQQL--TQPTTVS 464
                         .|.|.:::.|:......|.|..:|:   ||:|..|    |||:  ..|::||
  Rat   311 -------------SLTNPHTHENTKLEPIYNIAMTSTVGLASPSSRTQVTPPHQQMDSVSPSSVS 362

  Fly   465 PGIPGGAAAPNP-----------------AC---------EKSGRYVCQYCNLICAKPSVLEKHI 503
            |. ....:.|.|                 .|         :|.|:|:|:|||..|||||||.|||
  Rat   363 PA-TSTQSPPGPIYSSTHVASVVSQSVEQMCNLLLRDQKPKKQGKYICEYCNRACAKPSVLLKHI 426

  Fly   504 RAHTNERPYPCDTCGIAFKTKSNLYKHCRSRSHAAR--------ARGLEVPADADDGLS-DQDAE 559
            |:||.||||||.|||.:||||||||||.:|.:|..:        |.||.:..:....|| ..|.|
  Rat   427 RSHTGERPYPCVTCGFSFKTKSNLYKHKKSHAHTIKLGLVLQPEAGGLFLSQECPKALSVHSDIE 491

  Fly   560 LSNSSSELPSRAGSPYEEPINSPTPSPSTLSAAKSAYIQQPPLPTYMQQLPLGSPAAGTLPPTTA 624
            .|..|.|         |.|.:.....||.....          |..:.::| ..|.|  ||.:.|
  Rat   492 DSGESDE---------EGPGDGRQNDPSVRDLQ----------PVQIMKMP-SDPEA--LPKSIA 534

  Fly   625 D--NH--HSATAQHR----QSIDYKPYKPKFHNASLYSCSSKELQQQQQQLQI---QQQQQQHQL 678
            .  :|  ...::|.|    |::...| |...|..|:....:..:|....:.::   |.|:..|..
  Rat   535 SRADHVVSGFSSQDRPSESQALTELP-KVVVHPVSMSPLKTDSVQVASPKPELPSTQSQKDLHAA 598

  Fly   679 AQQKLSIQLPLVQQPSLAHP----TLSPSTQMKMKHHINSHQIQLQLQ--------QQQSLLAQQ 731
            ::...|..:..::.....|.    |||   :.|.:.|..:...|||.|        ||..||...
  Rat   599 SRLSHSAGVSSLETDETCHQKGDVTLS---EGKPEPHSGAAHAQLQRQQATDDPQEQQGKLLLSP 660

  Fly   732 SLLAAMPPGGVYYLGQPSYYNQDTAAAIHQHALAIQQFQIHLAQQQQQQQQQHPPLVKA-TPPMQ 795
            ..|.:...|         |:::..:|                      .|...||...| |.|..
  Rat   661 RSLGSTDSG---------YFSRSESA----------------------DQAMSPPTPFARTFPTT 694

  Fly   796 QPPSLPPQQLVRANSQISSVATAPATPTPAANLSTFASSSGGKQVNVA--KVQEHISKLISQNEA 858
            .|.  |.:....|..:||  |.|||...|....|..|   |..:..:|  .::|.||||||.|||
  Rat   695 DPD--PAKNSGPAGPRIS--APAPAALAPGEKSSVVA---GQMRPPLATKTLEERISKLISDNEA 752

  Fly   859 IVENKEILLQKKYPKQLSRSRSFNNANSNNASQHGSNASALHANNSQNNTNAQMPERETKVNLAQ 923
            :|::|::...|.....|||..|.::..|                                     
  Rat   753 LVDDKQLDSVKPRRTSLSRRGSIDSPKS------------------------------------- 780

  Fly   924 AIFQKQQHQLQQQQQQQITQQQQQQLEQQNYYTYIQQQQECQQQQLEPPNGVVKRNAYKPGVVMT 988
            .|| |...|...:...:.|.....                            :.::.:.|     
  Rat   781 YIF-KDSFQFDLKPMGRRTSSSSD----------------------------IPKSPFTP----- 811

  Fly   989 TPVKQQQQLPPPPSPLPMQTMQYRQDPATPVTKI-EQPTTAVPVMPLNLSAKPKP---------- 1042
            |...:|..|...||.           ...|:|:. ..|||....:|.|:...|.|          
  Rat   812 TEKSKQVFLLSVPSL-----------DCLPITRSNSMPTTGYSAVPANIIPPPPPLRGSQSFDDK 865

  Fly  1043 -------TLVTVPVSSLSTS---------SLAPTPTTSTN----------------PTSSSKTAP 1075
                   ..|:.|..|:..|         :.....|.|.|                |.||.   |
  Rat   866 IGTFYDDVFVSGPNPSMPPSGHHRPLVRQAAVEDSTASENHVLGSGQSVDESCQGCPASSE---P 927

  Fly  1076 PPAQVNNSIIKNLLLNARGLAVPIGEGDDAVYSCPICASEFRSADDLKLHNSTYCQDASSSAPMS 1140
            .|.|...:...:|.....      .:|...::.|..|.:.:|..::.:.|...||.:........
  Rat   928 GPVQSKAANTPHLEKKKS------HQGRGTMFECETCRNRYRKLENFENHKKFYCSELHGPKTKV 986

  Fly  1141 PASSPFRSNSISLSLPELKSHMANSKNPLSLAKLAWSQL----KTKRSSLV-----LSRLSAAQT 1196
            .|..|....:.....|::..:.      ::.....|.|.    |.::...|     |....:.::
  Rat   987 AAREPEHGLAPGGVQPQVLHYR------VAAPTAVWEQTPQIRKRRKMKSVGDDEDLQPRESGRS 1045

  Fly  1197 PARTSTVTAPTVTASAPAPAVATVTAPSSAPAPQIEALRFVDAPLPSPGPLLGKTPLVDYAQQST 1261
            |.....:....|..|||:|   :..||::|..   :|.|.|... .||..|:.:.|     :|:.
  Rat  1046 PESAEALQLQPVPGSAPSP---SKQAPATAGD---QAHRGVQLQ-SSPVQLVARVP-----EQAL 1098

  Fly  1262 PRKAQDSVVITKMHEDRQFVIEAQ-PAKRIKTSDLVVASSSQQPTSFN----------FSFN--- 1312
            |.| |..||..:::...|..||.| .|..|  |.:...:|..:|:||:          ||..   
  Rat  1099 PLK-QCPVVEQQLNSATQDRIEVQRQAGGI--SVIQHTNSLSRPSSFDKLEPLEGGTAFSLQELG 1160

  Fly  1313 ------------------------------NQNSSNSVP-------------------------- 1321
                                          ::.|:..:|                          
  Rat  1161 RAGMPGALKVIGMPPEEGHPPRDATHQIALSRESARKIPSERFVLGQPLRLVRQHNIQVPEILVT 1225

  Fly  1322 -------ELQSSKEERLRRFT----SSGGSMIPISECP---------DLDNSP-----------K 1355
                   |.||..||:..:||    |...|.:|..:.|         ::::|.           .
  Rat  1226 EEPDRDLEAQSHDEEKSEKFTWPQRSETLSKLPTEKLPPKKKRLRLAEIEHSSTESSFESTLSRS 1290

  Fly  1356 MIRTPLLSGGS-------FQDVS-VKVNNETGSSSKERKLMALVSGSGLLGVSSGPQHFQ----- 1407
            :.|...||..|       .:|:| |:::.:|...|| .:.:.:..||..|.|....:..:     
  Rat  1291 LSRESSLSHASSFSASLDVEDISKVELSPKTDFPSK-AEFLFIPLGSNTLSVPGSHREMRRAASE 1354

  Fly  1408 ----FPPINSITAF----------------------NPLTLPPMSGGDKTTPVT-----PI---- 1437
                .|.:..::.|                      .|...|...||....|:.     |:    
  Rat  1355 QISCVPTLMEVSDFRSKSFDCGSIAPSHVVPALVEPQPSNSPSGVGGTGHVPLLERRRGPLTRQI 1419

  Fly  1438 --------PHVPG---------MPGPGSLTPQMPLLPP--------------------------- 1458
                    |..||         :||..:: |...|.||                           
  Rat  1420 SLNIASDSPLSPGSVSALQTIVLPGVNAI-PLQALRPPDVASADLPAHTVPSQALAKDLQAEMSS 1483

  Fly  1459 -------PPQQL------------------QLPIP-SSRGRSPNRKQPSPLLLGGG--------- 1488
                   |||||                  .||:| |.:|..|....|:.....|.         
  Rat  1484 CSSTDTFPPQQLFGAHLLNKTNMPLSHQNTPLPLPVSVQGGKPGAPPPAGTSSTGDGSFAPKYQL 1548

  Fly  1489 ------SGELKALSPFGGVQN--VPSEFSRQP-------------------PTPAQRQALQWNSK 1526
                  ||...:.||...:.|  :|.:.:..|                   |.|:....|.  |.
  Rat  1549 QCQAFTSGRGCSSSPLHSLPNPVLPDQTAADPCTASVALSAKAVDPVSKSYPLPSLELGLP--SD 1611

  Fly  1527 EAPKKAPFNFLRMAD---------NVKTTEPE--VRHFNLENV-----------ISGKQQELPLT 1569
            :..|:.|...|.:..         :..|:.|:  |.| :|.|.           :|.:|..:|.:
  Rat  1612 QVQKRLPSFVLPVLQPRDVPVYCLSTVTSLPQILVTH-DLSNTPICQTNQSIVPVSEEQNPMPKS 1675

  Fly  1570 PLHVD----TP-------------NGNAP-------------EETSPVASASASAK--------- 1595
            ..::.    ||             ..|||             ...||...:|||:|         
  Rat  1676 QNYLQNALPTPEKDLACKTVLSEMGQNAPVSESSATVQKVSAGRLSPQQESSASSKRMLSPANSL 1740

  Fly  1596 -------------------SKFLRPTSLPLK-------------------------------PGT 1610
                               |..:||..||..                               ||:
  Rat  1741 DIAMEKHQKRAKDENGAVCSTNIRPLELPSSRANEGHKQKKPVLVRQLCTVEPLEGTALEQDPGS 1805

  Fly  1611 FTPK------------------RHH---------------------GITPT----------ANTL 1626
            .:.|                  ||.                     .:|||          |..|
  Rat  1806 ASGKSSRNADSTQVLSTDSLSSRHSMFAVPDHVSEFQEFKNTKLSTSLTPTVGSSHIPLESACVL 1870

  Fly  1627 P------------------------------------LISPETPRPSKSCVQLYLNGHAYTYLGL 1655
            |                                    |.|.:|.:||              :..|
  Rat  1871 PLKSRDDNQEKGSSGVQNEENKVIQGQRQPPIPGLSVLSSSDTQQPS--------------FPSL 1921

  Fly  1656 KCSTKMFYCTVNCPQPSYVAGM--HKLSMYSVWQVCEENQPHPLGFKLKQVMALYDSRQRMLGNG 1718
            |.:|...:|.: ..|.|.....  .|.|.|:.|.|...| |:|||...|..::|.:|:|:     
  Rat  1922 KTATSFTWCYL-LRQKSLPLAQSDQKTSAYAGWTVSPSN-PNPLGLPTKVALSLLNSKQK----- 1979

  Fly  1719 SSTAMAGSGKLSY-----NLVASQQIVSSPSTSSTSSAFYQGPLKTPPTVTIAALSEANVAAKAN 1778
                   :||..|     ....|..:|.|....:..|....|..|    .||...|..:.:...:
  Rat  1980 -------TGKSLYCQAITTHSKSDLVVYSSKWKNNLSKRALGNQK----ATIVEFSNKDDSEINS 2033

  Fly  1779 EEAQAKKLETSPSGQPLV--GGYESHEDYTYIRGRGRGRYVCSECGIRCKKPSMLKKHIRTHTDV 1841
            |:.:...|..|...:..:  |||:|:|||.|:||||||:|:|.||||||||||||||||||||||
  Rat  2034 EQDKENSLIKSEPRRIKIFDGGYKSNEDYVYVRGRGRGKYICEECGIRCKKPSMLKKHIRTHTDV 2098

  Fly  1842 RPFTCSHCNFSFKTKGNLTKHMQSKTHFKKCIELGINPGPMPPDSEFLDVDMDFDQQSSTSAGG- 1905
            ||:.|::|||||||||||||||:||.|.|||::|||:.|       .:| :.|.::.......| 
  Rat  2099 RPYHCTYCNFSFKTKGNLTKHMKSKAHSKKCVDLGISVG-------LID-EQDTEESDEKQRFGC 2155

  Fly  1906 -RTSSMAGESDSDDYSDNESESSDTDESKSRQKEHEAARGLLSLSMTPPIPQSVSPYPQLQDTPL 1969
             |:.....|||..|..||::|..|.|.        :|..|   ||..|    ||:..||    .|
  Rat  2156 ERSGFDLEESDGPDEDDNDNEEDDDDS--------QAESG---LSAAP----SVTASPQ----HL 2201

  Fly  1970 PAASPANSIGSSGSQPKRLVCSFTSPKPPFDYQKQEQYYSNPEESKPKR--------SVANEESA 2026
            |:.|.....||. .:..|:...|:.            .:::|.:..|:.        |....:..
  Rat  2202 PSRSGLQDPGSV-EEDLRVSSCFSG------------VHTDPMDILPRALLTKMTVLSTVQIDPN 2253

  Fly  2027 PMDLT-KPRGSI----LLISPSPVSVPAHD--LPKSQAQQMHDVIFGTS--------GNESGFMK 2076
            ..||| |.|.:.    |.::| ||..|...  .|:|...|| .|.:..|        ||      
  Rat  2254 RTDLTAKARQNTGKDELELAP-PVDTPISPEVTPRSPGHQM-SVHYSESDALRSPAAGN------ 2310

  Fly  2077 TLISVSDK------------------VRISAEMEEQAKHEAEGEDVQL---------QTYIKEH- 2113
            |:.|:.|.                  .|||:.:....:.:...:.:.|         ||::..| 
  Rat  2311 TVASIQDSPSVGLPPATIAQLNPQPAARISSSVSPHPESQEPKQQITLQPSPGLPSPQTHLFSHL 2375

  Fly  2114 ALHQAKIKQSQFSRSYLI-----------NTLYTAASPVMS--------SSSTLFTANSRPVMSV 2159
            .||    .|.|....|.:           ...|:...|:.:        :.|.:......||.::
  Rat  2376 PLH----SQQQSRTPYNMVPVGGIHVVPAGLTYSTFVPIQAGPMQLTIPAVSVIHRTVGTPVDTI 2436

  Fly  2160 NE-----------------VPSIEVHEVK-------TPEAIESPRS----------------APE 2184
            .|                 ||.|.:.::.       :|.|::|..|                .|:
  Rat  2437 TEVSGATNRPTGVAELSSVVPCIPIGQIHVPGLQNLSPPALQSLSSLGMETVNIVGLANATVGPQ 2501

  Fly  2185 QAP-------VILAQIAQEENEEPNIAEPH-------NANLPAAVQQPDVNEF-TGVLGNPTAPP 2234
            ..|       |.|..:|....:.....:.|       |..||..:  |.|... .|..|.|.   
  Rat  2502 VHPPGLALNAVGLQVLANAPAQSSPATQTHIPGLQILNIALPTLI--PSVGPVAVGTAGTPE--- 2561

  Fly  2235 TSSVTATSVSTTTAAPVAPSSTANSTQPAQR---------TVIVG-------EDGFKSSTPTSKS 2283
                |||| ::.|..|..|:...:|.:|.||         ..:.|       .|| .|...|.|.
  Rat  2562 ----TATS-NSKTVEPQTPAGQGHSAEPPQRLPEGPQETPQTVPGPSVDHARPDG-SSKMDTKKV 2620

  Fly  2284 GDLQHVSYGRG---VPPAPI 2300
            ....||..||.   ..||||
  Rat  2621 ASASHVLPGRSSAQAQPAPI 2640

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
shnNP_001260883.1 C2H2 Zn finger 486..506 CDD:275368 15/19 (79%)
zf-H2C2_2 499..523 CDD:463886 17/23 (74%)
C2H2 Zn finger 514..536 CDD:275368 15/21 (71%)
PRK12323 <1185..>1269 CDD:481241 22/88 (25%)
zf-C2H2 1816..1838 CDD:395048 19/21 (90%)
C2H2 Zn finger 1818..1838 CDD:275370 18/19 (95%)
zf-H2C2_2 1830..1855 CDD:463886 20/24 (83%)
zf-C2H2 1844..1868 CDD:395048 18/23 (78%)
C2H2 Zn finger 1846..1868 CDD:275370 18/21 (86%)
zf-C2H2_8 <2305..2352 CDD:464935
C2H2 Zn finger 2308..2328 CDD:275368
zf-C2H2_jaz 2333..2359 CDD:432381
C2H2 Zn finger 2336..2358 CDD:275368
C2H2 Zn finger 2381..2403 CDD:275368
Hivep1XP_038951236.1 C2H2 Zn finger 409..429 CDD:275368 15/19 (79%)
zf-H2C2_2 422..446 CDD:463886 17/23 (74%)
C2H2 Zn finger 437..457 CDD:275368 14/19 (74%)
zf-C2H2 2073..2095 CDD:395048 19/21 (90%)
C2H2 Zn finger 2075..2095 CDD:275370 18/19 (95%)
zf-H2C2_2 2087..2112 CDD:463886 20/24 (83%)
ZnF_U1 2098..2129 CDD:197732 23/30 (77%)
C2H2 Zn finger 2103..2121 CDD:275370 15/17 (88%)
PHA03247 <2273..2663 CDD:223021 79/391 (20%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.