DRSC/TRiP Functional Genomics Resources



Protein Alignment CG6739 and Sorl1

DIOPT Version: 9

Sequence 1: NP_609130.1 Gene: CG6739 / 34037 FlyBaseID: FBgn0031926 Length: 787 Species: Drosophila melanogaster
Sequence 2: NP_035566.2 Gene: Sorl1 / 20660 MGIID: 1202296 Length: 2215 Species: Mus musculus


Alignment Length: 611 Identity: 131/611 (21%)
Similarity: 179/611 (29%) Gaps: 250/611 (40%)
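
The summary percentages follow from the counts over the 611 alignment columns. Below is a minimal Python sketch of that arithmetic. It assumes the page truncates rather than rounds, since 250/611 is about 40.9% but is shown as 40%. The function name `percent_truncated` is hypothetical and not part of DIOPT.

```python
def percent_truncated(count: int, total: int) -> int:
    """Integer percentage, truncated toward zero (assumed display rule)."""
    return count * 100 // total


ALIGNMENT_LENGTH = 611  # alignment columns, from the summary above

summary = {
    "Identity": percent_truncated(131, ALIGNMENT_LENGTH),
    "Similarity": percent_truncated(179, ALIGNMENT_LENGTH),
    "Gaps": percent_truncated(250, ALIGNMENT_LENGTH),
}
print(summary)
```

Under that truncation assumption the sketch reproduces the displayed 21%, 29% and 40%.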


- Green bases have known domain annotations that are detailed below.


  Fly   298 PTEAMALTTPSTTNS-------GCQSTMLP----MCQGVLDYDLTFN----REGAAPRDAVSMAA 347
            |.....|..|...||       |..|::||    ||.....|....|    .|....|:....: 
Mouse  1023 PQPCSLLCLPKANNSKSCRCPEGVASSVLPSGDLMCDCPQGYQRKNNTCVKEENTCLRNQYRCS- 1086

  Fly   348 YDSLIRANC--SVRAAEF--ICGALEPE---------------------CRPLHIGQLPPCRRIC 387
                 ..||  |:...:|  .||.:..|                     |.||..          
Mouse  1087 -----NGNCINSIWWCDFDNDCGDMSDERNCPTTVCDADTQFRCQESGTCIPLSY---------- 1136

  Fly   388 KAILE----------ACSIPIYNSDVLGELFDCNLYPDAHESHKCEDPTRRRDY----------- 431
            |..||          .|.:....||.    |:|:.......|..|:.....||:           
Mouse  1137 KCDLEDDCGDNSDESHCEMHQCRSDE----FNCSSGMCIRSSWVCDGDNDCRDWSDEANCTAIYH 1197

  Fly   432 -CYGNEFQCHDGSCIPQNWQCDKIKDCQGGEDEDE---------------QCLV----------- 469
             |..:.||||:|.||||.|.||...|||.|.|||.               .|:.           
Mouse  1198 TCEASNFQCHNGHCIPQRWACDGDADCQDGSDEDPVSCEKKCNGFHCPNGTCIPSSKHCDGLRDC 1262

  Fly   470 --------CEP-----DEFRCRSNEKCLVEKYRCDQNIDCMDGSDE--------------QDCDE 507
                    |||     .:|.|::.::||.....||..:.|.|||||              ::|||
Mouse  1263 PDGSDEQHCEPFCTRFMDFVCKNRQQCLFHSMVCDGIVQCRDGSDEDAAFAGCSQDPEFHKECDE 1327

  Fly   508 YG--------------------SGDLAPFDESEL---NAFPRVFTYASF------LSPNETN-DK 542
            :|                    .||.:  ||:..   ...|....|..|      ..||... |:
Mouse  1328 FGFQCQNGVCISLIWKCDGMDDCGDYS--DEANCENPTEAPNCSRYFQFHCENGHCIPNRWKCDR 1390

  Fly   543 VYTYITATTDEDAGTETKFQIHQVAAPAP------PVNSSAEEGAGGPKGFV--NFRDSKEIMMT 599
            .......:.::|.|..     |.:.:|.|      |.......||.....:|  .:||..:   .
Mouse  1391 ENDCGDWSDEKDCGDS-----HVLPSPTPGPSTCLPNYFHCSSGACVMGTWVCDGYRDCAD---G 1447

  Fly   600 SDSETKFKYSQRANRTSVKFSVSAPTT---------PAARTSSAIPS------SALVQQRERTTS 649
            ||.|.   ....||.|    :.|.||.         ...:....||:      ....|..:...:
Mouse  1448 SDEEA---CPSLANST----AASTPTQFGQCDRFEFECHQPKKCIPNWKRCDGHQDCQDGQDEAN 1505

  Fly   650 TTTTSTTTSTTTSSITSTPSITSTTLIPINAATSEPNPYEVVTSLGGCPPQELRCVSGK-CITVS 713
            ..|.||.|                                       |..:|.:|..|: ||.:|
Mouse  1506 CPTHSTLT---------------------------------------CTSREFKCEDGEACIVLS 1531

  Fly   714 QLCDKQIDCPDAADELMC-----VYR 734
            :.||..:||.|.:||..|     ||:
Mouse  1532 ERCDGFLDCSDESDEKACSDELTVYK 1557
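
As an illustration of how the identity and gap counts can be derived from the alignment rows, the sketch below walks the final block (Fly 714..734, Mouse 1532..1557) column by column. It is not DIOPT's own code. The function name `column_stats` is hypothetical, and similarity is left out because the substitution scoring behind the `.` and `:` marks is not stated on this page.

```python
# Final alignment block, copied from the rows above.
FLY = "QLCDKQIDCPDAADELMC-----VYR"    # Fly   714..734
MOUSE = "ERCDGFLDCSDESDEKACSDELTVYK"  # Mouse 1532..1557


def column_stats(seq_a: str, seq_b: str) -> dict:
    """Count alignment columns, identical residues, and gap columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    gaps = 0
    for a, b in zip(seq_a, seq_b):
        if a == "-" or b == "-":
            gaps += 1          # a gap in either row counts as a gap column
        elif a == b:
            identical += 1     # same residue in both rows
    return {"columns": len(seq_a), "identical": identical, "gaps": gaps}


stats = column_stats(FLY, MOUSE)
print(stats)
```

For this block the sketch reports 26 columns, 10 identical residues and 5 gap columns. Applying the same count to every block would give the totals in the summary.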

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence  Domain  Region  External ID  Identity
CG6739 NP_609130.1 CRD_FZ 314..427 CDD:143549 29/155 (19%)
LDLa 432..464 CDD:238060 18/31 (58%)
LDLa 470..505 CDD:238060 15/53 (28%)
LDLa 697..731 CDD:238060 14/34 (41%)
Sorl1 NP_035566.2 Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 59..84
VPS10 124..753 CDD:214740
BNR 1 136..147
BNR 2 232..243
BNR 3 441..452
BNR 4 521..532
BNR 5 562..573
Sortilin_C 588..753 CDD:292523
LY 780..821 CDD:214531
LDL-receptor class B 1 800..843
LY 824..866 CDD:214531
LDL-receptor class B 2 844..887
LY 868..910 CDD:214531
LDL-receptor class B 3 888..932
Ldl_recept_b 890..929 CDD:278487
LY 913..953 CDD:214531
LDL-receptor class B 4 933..972
LDL-receptor class B 5 973..1013
LDLa 1078..1112 CDD:238060 8/39 (21%)
LDLa 1117..1153 CDD:238060 6/45 (13%)
Ldl_recept_a 1157..1192 CDD:278486 8/38 (21%)
Ldl_recept_a 1198..1230 CDD:278486 17/31 (55%)
LDLa 1240..1271 CDD:238060 1/30 (3%)
LDLa 1281..1308 CDD:197566 10/26 (38%)
LDLa 1325..1359 CDD:238060 8/35 (23%)
LDLa 1373..1403 CDD:238060 4/29 (14%)
LDLa 1419..1453 CDD:238060 9/39 (23%)
LDLa 1471..1506 CDD:238060 3/34 (9%)
LDLa 1514..1549 CDD:238060 14/34 (41%)
FN3 1557..1630 CDD:238020 0/1 (0%)
FN3 1651..1742 CDD:238020
FN3 1752..1831 CDD:238020
FN3 1935..2022 CDD:238020
FN3 2026..2113 CDD:238020
Potential nuclear localization signal for the C-terminal fragment generated by PSEN1. /evidence=ECO:0000250|UniProtKB:Q92673 2162..2165
Endocytosis signal. /evidence=ECO:0000255 2173..2178
Required for efficient Golgi apparatus-endosome sorting. /evidence=ECO:0000250|UniProtKB:Q92673 2191..2215
Required for interaction with GGA1 and GGA2. /evidence=ECO:0000250|UniProtKB:Q92673 2202..2215
DXXLL motif involved in the interaction with GGA1. /evidence=ECO:0000250|UniProtKB:Q92673 2209..2213
Blue background indicates that the domain is not in the aligned region.


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information
                                              (BLAST Result Score / Score Type / Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E1_KOG1215
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
Isobase         0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
RoundUp         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.900
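
The last row of the table appears to hold the column totals: a simple score of 2 and a weighted score of 1.900. A minimal Python sketch of that sum follows. It assumes the totals are plain sums over the tools that report a match; the variable names are illustrative only.

```python
# (tool, simple score, weighted score) for the two tools that report a match;
# every other tool in the table contributes 0 and 0.000.
matched = [
    ("eggNOG", 1, 0.900),
    ("SwiftOrtho", 1, 1.000),
]

simple_total = sum(simple for _, simple, _ in matched)
weighted_total = round(sum(weighted for _, _, weighted in matched), 3)

print(f"{simple_total} {weighted_total:.3f}")
```
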
