DRSC/TRiP Functional Genomics Resources

Protein Alignment Unc-89 and Siglec1

DIOPT Version: 9

Sequence 1: NP_001097440.1  Gene: Unc-89 / 3346201  FlyBaseID: FBgn0053519  Length: 4218  Species: Drosophila melanogaster
Sequence 2: XP_036015809.1  Gene: Siglec1 / 20612  MGIID: 99668  Length: 1719  Species: Mus musculus


Alignment Length: 1889  Identity: 397/1889 (21%)
Similarity: 650/1889 (34%)  Gaps: 521/1889 (27%)
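
The summary percentages above are plain ratios of column counts to the alignment length. The following minimal sketch recomputes them from the reported counts; the names `ALIGNMENT_LENGTH`, `COUNTS`, and `percent` are illustrative and not part of DIOPT. How the page converts the ratios to whole percentages is not documented here, so the sketch prints two decimals instead of guessing a rounding rule.

```python
# Counts copied from the alignment summary; all names are hypothetical.
ALIGNMENT_LENGTH = 1889
COUNTS = {"identity": 397, "similarity": 650, "gaps": 521}


def percent(count, length):
    """Return count/length expressed as a percentage."""
    return 100.0 * count / length


# Map each statistic to its exact percentage of alignment columns.
percentages = {name: percent(n, ALIGNMENT_LENGTH) for name, n in COUNTS.items()}

for name, value in percentages.items():
    print(f"{name}: {COUNTS[name]}/{ALIGNMENT_LENGTH} = {value:.2f}%")
```

The gaps ratio comes out at about 27.58%, which the page reports as 27%, so the displayed summary values may be truncated rather than rounded.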


- Green bases have known domain annotations that are detailed below.


  Fly  1017 PVVVKMLKSVQVEPGETAHFEIQFKDQPGLVTWLKDNKPLEDRLADRITQTAAPMNSY-----RL 1076
            |.:......|.|..|.||.:...:..:..:|....|.|.::.|...|    |..|.:.     .|
Mouse    46 PCIFSYPADVPVSNGITAIWYYDYSGKRQVVIHSGDPKLVDKRFRGR----AELMGNMDHKVCNL 106

  Fly  1077 DIKNCSETDAGTY--------------------TIRAQSASETTTVSAQLAV------------- 1108
            .:|:....|:|||                    |:....:..|.|:..:|..             
Mouse   107 LLKDLKPEDSGTYNFRFEISDSNRWLDVKGTTVTVTTDPSPPTITIPEELREGMERNFNCSTPYL 171

  Fly  1109 ------------GQAPGHDETKT--NTEPAFL-------VSLKDAEMIENTLFRFMVKIIGDPKP 1152
                        ||.|.|..|.:  :.||..:       ::|...:.....|.:|.   :|....
Mouse   172 CLQEKQVSLQWRGQDPTHSVTSSFQSLEPTGVYHQTTLHMALSWQDHGRTLLCQFS---LGAHSS 233

  Fly  1153 RVKFY-------KDEKEILETNDRIQIIRDKDYLGFYELVIADVQKTDAGTYSCKATNKHGEANC 1210
            |.:.|       |..:.:|.::.|                  ::...|..|.:|:..:.:     
Mouse   234 RKEVYLQVPHAPKGVEILLSSSGR------------------NILPGDPVTLTCRVNSSY----- 275

  Fly  1211 EAIATTVEDKNPFGALSGQILPAGEKPVFQWKRNGEEFDPEERFKVLFGEDEDSLALVFQHVKPE 1275
                                 ||  ....||.|:|...........||     |.|.       .
Mouse   276 ---------------------PA--VSAVQWARDGVNLGVTGHVLRLF-----SAAW-------N 305

  Fly  1276 DAGIYTCVAQTSTGNISCSAELSVQGAIQTLNREPEKPTLVIEHREANASIGGSAILELQCKGFP 1340
            |:|.|||.|....|:: .|:.||:...:..:...|..|.|..|            .:.|.| ..|
Mouse   306 DSGAYTCQATNDMGSL-VSSPLSLHVFMAEVKMNPAGPVLENE------------TVTLLC-STP 356

  Fly  1341 KPAVQ-----WKHDGEVIQVDDRHKFMYEDEESMSLVIKNVDTVDAGVYTIEAINELGQDESS-I 1399
            |.|.|     |.          ::..:.||..:.:|.:..|...|.|.|..|..|..|.:.|| :
Mouse   357 KEAPQELRYSWY----------KNHILLEDAHASTLHLPAVTRADTGFYFCEVQNAQGSERSSPL 411

  Fly  1400 NLVVKAPPKIKKITD-ITCSAGETIKMEIEVEGFPQPTVQVTNNGKDVTAES-----NVKISSSS 1458
            ::||:.||....:|. :...||....:...|...|..||.:::.|..:.:.|     |.:...||
Mouse   412 SVVVRYPPLTPDLTTFLETQAGLVGILHCSVVSEPLATVVLSHGGLTLASNSGENDFNPRFRISS 476

  Fly  1459 IGKSLEKVVVEVKEIKLSQAGNYSIKATNDLSQTS---EYWSCTVKSKPVIVKNFESEYIHGEKE 1520
            ...||.   :|:::::.:.:|.|:..|.|.|..::   ::::...:    ::.|..:|.:.|:..
Mouse   477 APNSLR---LEIRDLQPADSGEYTCLAVNSLGNSTSSLDFYANVAR----LLINPSAEVVEGQAV 534

  Fly  1521 NVQMTVRIDAYPEAKLTWYHDETEIKITDSKYTVSSDGNAYTLKITGATRVDAGKY---TVKATN 1582
            .:.....:...|:.:.:||.:.          .:..:|::.:|.:..|:..|||.|   |....|
Mouse   535 TLSCRSGLSPAPDTRFSWYLNG----------ALLLEGSSSSLLLPAASSTDAGSYYCRTQAGPN 589

  Fly  1583 EHGSA--TSSTQLLIKCAPEFTHKLKNITVAEGDSNVE-LVVGVDAYPRPHAKWYIDG------- 1637
            ..|.:  |..|.......|.||.:|...|...||.... |:..||:.|....:....|       
Mouse   590 TSGPSLPTVLTVFYPPRKPTFTARLDLDTSGVGDGRRGILLCHVDSDPPAQLRLLHKGHVVATSL 654

  Fly  1638 ----------IEIDEKRNDFRHVEEGNDFKLIMNQVATNMQGNYTCKIMNDYGKLEDNCVVTVNC 1692
                      .::....|.. |||       |...|..: :|.|.|:..|..|  ..:...:.|.
Mouse   655 PSRCGSCSQRTKVSRTSNSL-HVE-------IQKPVLED-EGVYLCEASNTLG--NSSAAASFNA 708

  Fly  1693 KPKVKRGLKNVEVQEGKSFTLEVEVYSE---PEAKIKWFKDGHEIYEDARIKISRDTQRIENYYL 1754
            |..|.....:..::||....|...|..|   ..|...||::| .::...    |.:|.|::    
Mouse   709 KATVLVITPSNTLREGTEANLTCNVNQEVAVSPANFSWFRNG-VLWTQG----SLETVRLQ---- 764

  Fly  1755 TLNLARTEDAGTYEMKATNFIGETTSTCKVAVLTSEALSLEQTVTKTLIATTEEPEEGAVPEIVH 1819
              .:||| ||..|             .|:  :||.:...|...|..:::...:.|:..|:     
Mouse   765 --PVART-DAAVY-------------ACR--LLTEDGAQLSAPVVLSVLYAPDPPKLSAL----- 806

  Fly  1820 VDVFQQH------SYESVPLKY-------EVIATGIPKPEAIWYHDGKPITPD------KHTAIT 1865
            :||.|.|      :.:|.||.:       .::||.:           :|..|.      |.||  
Mouse   807 LDVGQGHMAVFICTVDSYPLAHLSLFRGDHLLATNL-----------EPQRPSHGRIQAKATA-- 858

  Fly  1866 VDGDHYKLEVQSLDLVDAGEYKVVVQNKVGEKSHQGELSLSGIAEYRKPILTQGPGLKDIKVNKG 1930
               :..:|||:.|.|||:|.|.....|.:|..:......:.|..               ::|:  
Mouse   859 ---NSLQLEVRELGLVDSGNYHCEATNILGSANSSLFFQVRGAW---------------VQVS-- 903

  Fly  1931 DKVCEPVVFTADPAPEIVLLKDGQPVVETNNVKLKVDK-------KDAENGLVQYTCTLNILEAE 1988
                        |:||   |::||.||.:..|...|.:       :|........:.||.|....
Mouse   904 ------------PSPE---LREGQAVVLSCQVPTGVSEGTSYSWYQDGRPLQESTSSTLRIAAIS 953

  Fly  1989 IKDSGRYELKVK---NKYGELVTSGWIDVLAKPE---ISGLNDTKCLP---GDTICFEALVQANP 2044
            ::.:|.|..:.:   .....|.....:.|...|.   :|.|..|.  |   |..:|   .||::|
Mouse   954 LRQAGAYHCQAQAPDTAIASLAAPVSLHVSYTPRHVTLSALLSTD--PERLGHLVC---SVQSDP 1013

  Fly  2045 KPKV----------SWTRGNENLCNHENCEVIADVDADKYRLVFQSVSPCEDGKYTITATNSEGR 2099
            ..::          |..:|.:.|.. .|..:...|..::.||........:||.||..|:|:.|:
Mouse  1014 PAQLQLFHRNRLVASTLQGADELAG-SNPRLHVTVLPNELRLQIHFPELEDDGTYTCEASNTLGQ 1077

  Fly  2100 --AAVDFNL-AVLVEK-PTFIVQPESQSIHDYRPVSTKVLVHGVPLPTIEWFKDDKPINYEAINK 2160
              ||.||:. ||.|.. |...|| |.|.::               |..:.|......::| ...|
Mouse  1078 ASAAADFDAQAVRVTVWPNATVQ-EGQQVN---------------LTCLVWSTHQDSLSY-TWYK 1125

  Fly  2161 PGKDKLYAKEDTKKGTDQIESVLDIKSFRENDVGAYTCVATNEIGVTKAPFKLAMLSLAPSFVKK 2225
            .|:..|.|:..|            :.|.:..|..:|.|      ||       .:...||...:.
Mouse  1126 GGQQLLGARSIT------------LPSVKVLDATSYRC------GV-------GLPGHAPHLSRP 1165

  Fly  2226 LDNALDVL--------------QGEPLVLE-CCVDGSPLPTVQWLKDGDEVKPSESIKISTNPDG 2275
            :  .||||              ||..|.|. |.||..| |....|..||::..|.:   ..:...
Mouse  1166 V--TLDVLHAPRNLRLTYLLETQGRQLALVLCTVDSRP-PAQLTLSHGDQLVASST---EASVPN 1224

  Fly  2276 LVKLEINSCQPNDSGAYKLIISNPHGE-----KVALCAVAVKPEEMQPKFLKPITSQTVVVGEPL 2335
            .::||:...:|::.|.|.....:|.|:     ::.|..|.||   |.|       |.:|..|||:
Mouse  1225 TLRLELQDPRPSNEGLYSCSAHSPLGKANTSLELLLEGVRVK---MNP-------SGSVPEGEPV 1279

  Fly  2336 KLEAQ-VTGFPAPEVKWYKDGMLLR--PSPEINFINSPNGQIGLIIDAAQPLDAGVYKCLIANKG 2397
            .:..: .....:....|:.:|..|:  |:..:.|:.:....            ||.|.|.:.:..
Mouse  1280 TVTCEDPAALSSALYAWFHNGHWLQEGPASSLQFLVTTRAH------------AGAYFCQVHDTQ 1332

  Fly  2398 GEIEGVSKVEIVPKESKPVFVAELQ------DA------SSIEGFPVKMDIKVVGNPKPKLQWFH 2450
            |           .:.|:|   |.||      ||      .|.....|.:...|...|..::...|
Mouse  1333 G-----------TRSSRP---ASLQILYAPRDAVLSSFRDSRTRLMVVIQCTVDSEPPAEMVLSH 1383

  Fly  2451 NG------HEIKPDASHIAIVENPDNSSSLIIEKTAPGDSGLYEVIAQNPEGSTASKAKLYVAPK 2509
            ||      ||....||.|..::...|:..|.::....||...|...|||..||.::..:|..   
Mouse  1384 NGKVLAASHERHSSASGIGHIQVARNALRLQVQDVTLGDGNTYVCTAQNTLGSISTTQRLLT--- 1445

  Fly  2510 ADETATEEAPQFVSALRDVNADEGQELVLSA--PFISNPMPEV----IWSKDGVTLTPNERLLMT 2568
                   |....|:|...::..||..|.||.  |..|.|....    .|::..:...|...|..|
Mouse  1446 -------ETDIRVTAEPGLDVPEGTALNLSCLLPGGSGPTGNSSFTWFWNRHRLHSAPVPTLSFT 1503

  Fly  2569 CDGKHIGLTIKPAEAADSGNYTCLLANPLGEDSSACNANVRKVYKPPVFTQKISDQQQVFGNNAK 2633
                       |...|.:|.|.|....|.|..:|| ...:|.:|.|...|..:..:.| .|:...
Mouse  1504 -----------PVVRAQAGLYHCRADLPTGATTSA-PVMLRVLYPPKTPTLIVFVEPQ-GGHQGI 1555

  Fly  2634 IPVTVSGVPYPDLEWYFQDKPIPKSE------KYSIKNDGDHHMLIVNNCEKG--DQGVYKCIAS 2690
            :...|...|...|..:...:.:..::      |..|:.....:.|.|:..|.|  :||.|.|.||
Mouse  1556 LDCRVDSEPLAILTLHRGSQLVASNQLHDAPTKPHIRVTAPPNALRVDIEELGPSNQGEYVCTAS 1620

  Fly  2691 NREG 2694
            |..|
Mouse  1621 NTLG 1624

Known Domains:


Indicated by green bases in alignment.

Gene     Sequence        Domain            Region       External ID  Identity
Unc-89   NP_001097440.1  RhoGEF            90..260      CDD:238091
                         PH_unc89          275..388     CDD:270134
                         Atrophin-1        <493..690    CDD:331285
                         TonB_N            502..>596    CDD:318287
                         I-set             1017..1108   CDD:333254   23/115 (20%)
                         I-set             1123..1212   CDD:254352   13/102 (13%)
                         I-set             <1236..1299  CDD:333254   17/62 (27%)
                         I-set             1313..1403   CDD:254352   22/95 (23%)
                         Ig_3              1406..1487   CDD:316449   20/86 (23%)
                         I-set             1499..1595   CDD:254352   18/100 (18%)
                         I-set             1599..1690   CDD:333254   23/108 (21%)
                         I-set             1694..1786   CDD:254352   20/94 (21%)
                         I-set             <1836..1903  CDD:333254   18/72 (25%)
                         I-set             1922..2005   CDD:333254   17/92 (18%)
                         I-set             2018..2108   CDD:333254   28/108 (26%)
                         I-set             2113..2214   CDD:333254   19/100 (19%)
                         I-set             2220..2302   CDD:254352   24/96 (25%)
                         I-set             2318..2408   CDD:254352   16/92 (17%)
                         I-set             2415..2506   CDD:254352   29/108 (27%)
                         I-set             2519..2608   CDD:254352   23/94 (24%)
                         I-set             2615..2696   CDD:254352   20/88 (23%)
                         I-set             2717..2805   CDD:254352
                         FN3               2834..2925   CDD:238020
                         I-set             <2996..3063  CDD:333254
                         I-set             3067..3157   CDD:333254
                         PK_Unc-89_rpt1    3182..3440   CDD:271011
                         I-set             3654..3744   CDD:333254
                         FN3               3748..3840   CDD:238020
                         STKc_Unc-89_rpt2  3893..4151   CDD:271014
Siglec1  XP_036015809.1  IgV_CD33          27..134      CDD:409377   19/91 (21%)
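
Each domain row follows a regular pattern: domain name, residue range, CDD identifier, and, where the domain falls inside the aligned region, an identity fraction. The sketch below shows one way such rows could be parsed. The regular expression and the `parse_domain_row` helper are hypothetical, not DIOPT code, and reading a `<` or `>` next to a coordinate as a partial domain hit is an assumption.

```python
import re

# Assumed row layout: <domain> <start>..<end> CDD:<id> [<ident>/<cols> (<pct>%)]
# A '<' or '>' next to a coordinate is kept as a "partial" flag (assumption).
DOMAIN_ROW = re.compile(
    r"(?<!\S)(?P<domain>\S+)\s+"
    r"(?P<lt><)?(?P<start>\d+)\.\.(?P<gt>>)?(?P<end>\d+)\s+"
    r"CDD:(?P<cdd>\d+)"
    r"(?:\s+(?P<ident>\d+)/(?P<cols>\d+)\s+\((?P<pct>\d+)%\))?"
)


def parse_domain_row(line):
    """Parse one domain row into a dict; raise ValueError if it does not fit."""
    m = DOMAIN_ROW.search(line)
    if m is None:
        raise ValueError(f"unrecognized domain row: {line!r}")
    return {
        "domain": m["domain"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        "partial": bool(m["lt"] or m["gt"]),
        "cdd": m["cdd"],
        # None when the row carries no identity figure.
        "identity": (int(m["ident"]), int(m["cols"])) if m["ident"] else None,
    }


# Sample rows taken from the table; a leading gene/sequence prefix is skipped.
rows = [
    "I-set 1017..1108 CDD:333254 23/115 (20%)",
    "TonB_N 502..>596 CDD:318287",
    "Siglec1XP_036015809.1 IgV_CD33 27..134 CDD:409377 19/91 (21%)",
]
parsed = [parse_domain_row(r) for r in rows]
for entry in parsed:
    print(entry)
```

The pattern is searched rather than anchored at the start of the line, so rows that begin with a gene name and sequence accession still yield the domain fields.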