DRSC/TRiP Functional Genomics Resources



Protein Alignment arr and Sspo

DIOPT Version: 9

Sequence 1: NP_524737.2  Gene: arr / 44279  FlyBaseID: FBgn0000119  Length: 1678  Species: Drosophila melanogaster
Sequence 2: NP_001007017.1  Gene: Sspo / 474348  RGDID: 1549716  Length: 5141  Species: Rattus norvegicus


Alignment Length: 1328  Identity: 252/1328 (18%)
Similarity: 374/1328 (28%)  Gaps: 573/1328 (43%)
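
The summary percentages can be reproduced from the raw counts. The following is a minimal Python sketch, not the tool's own code. It assumes the page truncates to a whole percent rather than rounding, since 252/1328 is about 18.98% yet is reported as 18%. The helper name `percent_floor` is hypothetical.

```python
def percent_floor(count: int, length: int) -> int:
    """Whole-number percentage, truncated (assumed display rule, not confirmed)."""
    return count * 100 // length


ALIGNMENT_LENGTH = 1328  # alignment columns, including gap columns

summary = {
    "identity": percent_floor(252, ALIGNMENT_LENGTH),
    "similarity": percent_floor(374, ALIGNMENT_LENGTH),
    "gaps": percent_floor(573, ALIGNMENT_LENGTH),
}
print(summary)
```

Integer arithmetic is used so that the truncation does not depend on floating-point rounding.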


- Green bases have known domain annotations that are detailed below.


  Fly   668 CAVRNGGCSHLCLNRPRDYVCR----CAIDYELANDKRTCVVPAAFLLFSRQEHIGRISIEYNEG 728
            |||  ||..|.|....|.:..|    |  .|.|..|...                |::.:....|
  Rat   567 CAV--GGDGHYCTFDGRSFSFRGNPGC--QYSLVQDSVK----------------GQLLVVLEHG 611

  Fly   729 NHNDERIPFKDVRDAHALDVSVAERRIYWTDQKSKCI--FRAFLNGSYVQRIVDSGLIGPDGIAV 791
                                         ..:...|:  ..|||..:::| :..||.:..||..|
  Rat   612 -----------------------------ACETGSCLHALSAFLGKTHIQ-LRYSGAVLVDGQDV 646

  Fly   792 DWLANNIYWSDAEARRIEVARLDGSSRRVLLWK------GVEEPR-SLVLEPRRGYMY------- 842
            |     :.|..||...:..|   .|:..:|.|.      ||.:|. .:.|:||..|..       
  Rat   647 D-----LPWIGAEGFNVSHA---SSTFLLLRWPGAWVLWGVADPAVYITLDPRHAYQVQGLCGTF 703

  Fly   843 -WTE-----SPTDSIRRAA--------MDGSDLQTIVAGANHAAGLTFDQETRRLYWATQSRPAK 893
             |.:     :|...|..:.        :.|.....:|.....::..|:.|   ||.:      |:
  Rat   704 TWKQQDDFLTPAGDIETSVTAFASKFQVSGDGRCPLVDNTPLSSCSTYSQ---RLAF------AE 759

  Fly   894 IESADWDGKKRQILVGSDMDEPYAVSLYQDYVYWSDWNTGDIERVHKTTGQNRSLVHSGMTYITS 958
            ...|...|...|...|....||:.:...:...               :....|..:.|.::....
  Rat   760 AACAALHGHAFQECHGLVEREPFRLRCLESMC---------------SCAPGRDCLCSVLSAYAH 809

  Fly   959 -------LLVFNDKRQTGVNPCKVNNGG---------CSHLCLAQP------GRRGMTCACPTHY 1001
                   ||.:.::....| ||.   ||         |.|.| .:|      |.....|.||...
  Rat   810 HCAQEGVLLQWRNETLCSV-PCP---GGQVYQECAPACGHYC-GEPEDCKELGSCVAGCNCPPGL 869

  Fly  1002 QLAKDGVSCIPPR----------------------NYIIFSQR---NCFGRLLP----------- 1030
            ....:| .|:||.                      ::.|..:|   ||.....|           
  Rat   870 LWDLEG-QCVPPSMCPCQLGGHRYAFNTTTTLKDCSHCICQERGLWNCIAHHCPRQWALCPQELI 933

  Fly  1031 ---------------------NTTD---CPNIPLPVSG-----KNIRAVDYDPITHHIYWIEGRS 1066
                                 .:||   ||      ||     |:..:.|..|..|:..|     
  Rat   934 YAPGACLLTCDSLGANHSCLAGSTDGCVCP------SGTVLLDKHCVSPDLCPCRHNGQW----- 987

  Fly  1067 HSIKRSLANGTKVSLLAN-----SGQPFDLAIDIIGRLLFWTCSQSNSINVTSFLGESVGVIDTG 1126
            :....::.....:.:..|     :||          |...| |..|.:.:..:|.|         
  Rat   988 YPPNATIQEDCNICVCQNQRWHCTGQ----------RCSGW-CQASGAPHYVTFDG--------- 1032

  Fly  1127 DSEKPRNIAVHAMKRLLFWTDVGSHQAIIRARVDGNERVEL------AYKLEGVTALALDQQSDM 1185
                            |.:|..|:.:.::.....|...|.:      |..|....|||:...|.:
  Rat  1033 ----------------LVFTFPGACEYLLVREAGGRFSVSIQNLPCGASGLTCTKALAVRLDSTV 1081

  Fly  1186 IYYAHGKRIDAIDINGKNKKTL-------VSMHISQVINI--AALGGFVYWLDDKTGV------- 1234
            ::...|:   |:.:||.:.|..       :|:|.:.:..:  ..||..:.| |..|.|       
  Rat  1082 VHMLRGQ---AVTVNGVSIKLPKVYTGPGLSLHHAGLFLLLTTRLGLTLLW-DGGTRVLVQLSPH 1142

  Fly  1235 --ERIT---------VNGERRS--------AELQ----RL----PQITDIRAVWTPDPKVLRNHT 1272
              .|:|         |:.:.||        |||.    ||    |:..|:     |.|..:..|.
  Rat  1143 FHGRVTGLCGNFDGDVSNDLRSRQGVLEPTAELTAHSWRLNPLCPEPGDL-----PHPCSVNAHR 1202

  Fly  1273 CMHSRTKCSHI---------------------------CIASGE------GIARTRDVCSCPKHL 1304
            ...:|..|..|                           |...|:      .||...|.|:..:|.
  Rat  1203 VNWARAHCEVILQPIFAPCHTEVPPQQYYEWCVYDACGCDTGGDCECLCSAIATYADECARHRHH 1267

  Fly  1305 MLLEDKENC-------GAFPACGP-------DH----------FTCAAPVSG------------- 1332
            :....:|.|       ..:..||.       ||          .||   |.|             
  Rat  1268 VRWRSQELCPLQCEGGQVYEPCGSTCPPTCHDHHPELRWHCQAITC---VEGCFCPEGTLLHGGT 1329

  Fly  1333 ---ISD---------------VNKD---------------------------------------C 1340
               ::|               :.||                                       |
  Rat  1330 CVELTDCPCEWQGSFFPPGAVLQKDCGNCTCQESQWHCNPSGAPCEEMEPGCAEGEALCRESGHC 1394

  Fly  1341 IPASWRCDGQKDCPDKSDEVGCPT--CRADQFSCQSGECIDKSLVCDGTTNCANGHDEADCCKRP 1403
            :|..|.||.|.||.|.|||.||.|  |...|.|||||.|:..||:|||..:|.:|.||..|....
  Rat  1395 VPLEWLCDNQDDCGDGSDEEGCDTSVCGEGQMSCQSGRCLPLSLICDGQDDCGDGTDEQGCLCPQ 1459

  Fly  1404 GEFQCPINKLCISAALLCDGWENCADGADESSDICLQRRMAPATDKRAFMILIGATMITIFSI-- 1466
            |...|...: |:..||||||..:|.|.|||.|  ||.              .:..|...:..:  
  Rat  1460 GFLACADGR-CLPPALLCDGHPDCLDAADEES--CLG--------------WVSCTSGEVSCVDG 1507

  Fly  1467 --VYLLQFC-------------------------RTRIGKS------RTEPKDDQATDPLSPSTL 1498
              :..:|.|                         ...||::      .|.|....:..|.||.:|
  Rat  1508 PCIRTIQLCDGVWDCPDGADEGPVHCSSPSLPTPPAGIGQNPSTSSPDTSPSPVGSASPASPCSL 1572

  Fly  1499 SKSQRVSKIASVADAVRMSTLNSRNSMNSYDRNHITGASSSTTNGSSMV-------AYPINPPPS 1556
            |:.|                   .||.....|..........|:||..:       .|.:  |.:
  Rat  1573 SEFQ-------------------CNSGECTPRGWRCDREEDCTDGSDELDCGGPCKLYQM--PCA 1616

  Fly  1557 PATRSRRPYRHYKIINQPPPPTPCSTDICDESDSNYTSKSNSNNSNGGATKHSSSSAAACLQYGY 1621
            .......|.:....:.|.|..:....|:|:|       :|.|...||        :|..|     
  Rat  1617 HGPHCLSPGQLCDGVAQCPDGSDEDPDVCEE-------RSASGGPNG--------TAVPC----- 1661

  Fly  1622 DSEPYPPPPTPRSHYHSDVRIVPESSCP 1649
                                  ||.|||
  Rat  1662 ----------------------PEFSCP 1667
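
Reading the first block of the alignment, the middle line appears to mark identical pairs with `|`, conservative substitutions with `:` (for example Y/F), other aligned pairs with `.`, and gap columns with a space. Under that assumption, which the page itself does not state, a short Python sketch can tally one match line. The function name `tally_match_line` is hypothetical, and the example uses only the first 12 columns of the first block.

```python
from collections import Counter


def tally_match_line(match: str) -> dict:
    """Count column types in an alignment match line (symbol meanings assumed)."""
    counts = Counter(match)
    identical = counts["|"]
    return {
        "identical": identical,
        "similar": identical + counts[":"],  # assumed: similarity includes identities
        "mismatch": counts["."],
        "unmarked": counts[" "],  # gap columns carry no symbol
        "columns": len(match),
    }


# First 12 columns of the first Fly/Rat block.
fly = "CAVRNGGCSHLC"
match = "|||  ||..|.|"
rat = "CAV--GGDGHYC"

tally = tally_match_line(match)
gap_columns = sum(a == "-" or b == "-" for a, b in zip(fly, rat))
print(tally, gap_columns)
```

Summing such tallies over every block would give totals comparable to the Identity, Similarity, and Gaps figures in the summary, provided the symbol interpretation holds and trailing spaces of each match line are preserved.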

Known Domains:


Indicated by green bases in alignment.

Gene  Sequence  Domain  Region  External ID  Identity
arr  NP_524737.2  LY 162..201 CDD:214531
NHL 168..503 CDD:302697
NHL repeat 172..237 CDD:271320
LY 252..293 CDD:214531
NHL repeat 261..298 CDD:271320
NHL repeat 303..339 CDD:271320
NHL repeat 341..419 CDD:271320
NHL repeat 428..476 CDD:271320
LY 472..511 CDD:214531
NHL repeat 477..503 CDD:271320
Ldl_recept_b 534..573 CDD:278487
LY 558..599 CDD:214531
LY 600..641 CDD:214531
FXa_inhibition 668..703 CDD:291342 13/38 (34%)
LY 739..773 CDD:214531 4/35 (11%)
LY 776..818 CDD:214531 12/41 (29%)
LY 819..860 CDD:214531 13/68 (19%)
FXa_inhibition 973..1010 CDD:291342 12/51 (24%)
LY 1122..1164 CDD:214531 4/41 (10%)
FXa_inhibition 1273..1313 CDD:291342 11/72 (15%)
LDLa 1319..1362 CDD:238060 23/129 (18%)
LDLa 1365..1399 CDD:238060 17/33 (52%)
LDLa 1399..1433 CDD:197566 13/33 (39%)
Sspo  NP_001007017.1  VWD 192..341 CDD:214566
C8 393..468 CDD:285899
TIL 472..527 CDD:280072
VWD 556..716 CDD:214566 42/206 (20%)
C8 769..826 CDD:285899 8/71 (11%)
TIL 830..883 CDD:280072 15/57 (26%)
VWD 1019..1168 CDD:278521 33/177 (19%)
C8 1206..1276 CDD:285899 11/69 (16%)
TIL 1280..1336 CDD:280072 8/58 (14%)
LDLa 1381..1416 CDD:238060 12/34 (35%)
LDLa 1421..1455 CDD:238060 17/33 (52%)
LDLa 1457..1491 CDD:238060 15/36 (42%)
LDLa 1497..1528 CDD:197566 3/30 (10%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1533..1567 4/33 (12%)
Ldl_recept_a 1568..1604 CDD:278486 11/54 (20%)
TSP1 1702..1753 CDD:214559
TSP1 1758..1812 CDD:214559
TIL 1822..1873 CDD:280072
TSP1 1917..1970 CDD:214559
FA58C 2070..2226 CDD:214572
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2087..2109
FA58C <2111..2225 CDD:238014
LDLa 2236..2270 CDD:238060
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2262..2335
LDLa 2392..2426 CDD:238060
LDLa 2449..2483 CDD:238060
TSP1 2490..2538 CDD:214559
TSP1 2543..2595 CDD:214559
TIL 2618..2660 CDD:280072
TSP1 2703..2752 CDD:214559
TSP1 2760..2813 CDD:214559
TSP_1 2819..2867 CDD:278517
TIL 2871..2930 CDD:280072
VWC_out 2932..2979 CDD:214565
TSP1 2972..3023 CDD:214559
TIL 3075..3127 CDD:280072
TSP1 3240..3292 CDD:214559
TIL 3300..3350 CDD:280072
TSP1 3396..3439 CDD:214559
TSP1 3460..3504 CDD:214559
TIL 3514..3570 CDD:280072
TSP1 3633..3677 CDD:214559
TSP1 3810..3862 CDD:214559
TSP1 3879..3927 CDD:214559
TSP1 3945..3998 CDD:214559
TSP1 4003..4055 CDD:214559
TIL 4058..4113 CDD:280072
TSP1 4158..4208 CDD:214559
TSP1 4252..4303 CDD:214559
TSP1 4367..4418 CDD:214559
TIL 4422..4477 CDD:280072
TSP1 4611..4659 CDD:214559
TIL 4673..4719 CDD:280072
TSP1 4763..4812 CDD:214559
TIL 4814..4868 CDD:280072
TIL 4920..4978 CDD:280072
VWC 4980..5035 CDD:302663
Blue background indicates that the domain is not in the aligned region.
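
Each domain row carries a name, a residue range, an optional CDD identifier, and, for domains inside the aligned region, an identity fraction with a percentage. The Python sketch below shows one way such a row might be parsed. The regular expression and the name `parse_domain_row` are illustrative assumptions, not part of DIOPT, and rows that begin with the gene and sequence columns would need those fields removed first.

```python
import re

# Domain name, optional "<" (partial start), start..end, optional CDD id,
# optional "matches/columns (pct%)" identity field.
ROW = re.compile(
    r"^(?P<domain>.+?)\s+(?P<partial><)?(?P<start>\d+)\.\.(?P<end>\d+)"
    r"(?:\s+(?P<xref>CDD:\d+))?"
    r"(?:\s+(?P<ident>\d+)/(?P<span>\d+)\s+\((?P<pct>\d+)%\))?$"
)


def parse_domain_row(row: str) -> dict:
    """Split one domain row into fields; identity is None outside the alignment."""
    m = ROW.match(row.strip())
    if m is None:
        raise ValueError(f"unrecognised domain row: {row!r}")
    record = {
        "domain": m["domain"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        "xref": m["xref"],
        "identity": None,
    }
    if m["ident"] is not None:
        record["identity"] = (int(m["ident"]), int(m["span"]), int(m["pct"]))
    return record


aligned = parse_domain_row("FXa_inhibition 973..1010 CDD:291342 12/51 (24%)")
unaligned = parse_domain_row("NHL repeat 172..237 CDD:271320")
disordered = parse_domain_row(
    "Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1533..1567 4/33 (12%)"
)
print(aligned, unaligned, disordered, sep="\n")
```

In this table the percentages look rounded to the nearest whole number: 12/51 is about 23.5% and is listed as 24%.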


Information from Original Tools:


Tool            Simple Score  Weighted Score  Original Tool Information (BLAST Result, Score, Score Type, Cluster ID)
Compara         0             0.000           Not matched by this tool.
Domainoid       0             0.000           Not matched by this tool.
eggNOG          1             0.900           - - E1_KOG1215
Hieranoid       0             0.000           Not matched by this tool.
Homologene      0             0.000           Not matched by this tool.
Inparanoid      0             0.000           Not matched by this tool.
OMA             0             0.000           Not matched by this tool.
OrthoDB         0             0.000           Not matched by this tool.
OrthoFinder     0             0.000           Not matched by this tool.
OrthoInspector  0             0.000           Not matched by this tool.
orthoMCL        0             0.000           Not matched by this tool.
Panther         0             0.000           Not matched by this tool.
Phylome         0             0.000           Not matched by this tool.
SonicParanoid   0             0.000           Not matched by this tool.
SwiftOrtho      1             1.000           - -
TreeFam         0             0.000           Not matched by this tool.
Total           2             1.900
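
The final row of the table is the column sum of the per-tool scores: a simple score of 2 and a weighted score of 1.900. The short Python sketch below reproduces that arithmetic from the rows shown. The data structure is illustrative only, and the per-tool weights are taken from the table rather than derived.

```python
# (simple score, weighted score) per tool, copied from the table above.
tool_scores = {
    "Compara": (0, 0.000), "Domainoid": (0, 0.000), "eggNOG": (1, 0.900),
    "Hieranoid": (0, 0.000), "Homologene": (0, 0.000), "Inparanoid": (0, 0.000),
    "OMA": (0, 0.000), "OrthoDB": (0, 0.000), "OrthoFinder": (0, 0.000),
    "OrthoInspector": (0, 0.000), "orthoMCL": (0, 0.000), "Panther": (0, 0.000),
    "Phylome": (0, 0.000), "SonicParanoid": (0, 0.000), "SwiftOrtho": (1, 1.000),
    "TreeFam": (0, 0.000),
}

simple_total = sum(simple for simple, _ in tool_scores.values())
weighted_total = round(sum(weighted for _, weighted in tool_scores.values()), 3)
supporting_tools = sorted(name for name, (simple, _) in tool_scores.items() if simple)

print(f"simple={simple_total} weighted={weighted_total:.3f} tools={supporting_tools}")
```

Only eggNOG and SwiftOrtho report this pair, so the prediction is supported by 2 of the 16 tools listed.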
