DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment nolo and Thsd7a

DIOPT Version :10

Sequence 1:NP_001036374.1 Gene:nolo / 35424 FlyBaseID:FBgn0051619 Length:1394 Species:Drosophila melanogaster
Sequence 2:NP_001158277.1 Gene:Thsd7a / 330267 MGIID:2685683 Length:1646 Species:Mus musculus


Alignment Length:1622 Identity:308/1622 - (18%)
Similarity:476/1622 - (29%) Gaps:618/1622 - (38%)


- Green bases have known domain annotations that are detailed below.


  Fly    79 SSWSDWSTCSRTCDGGIMHQMRRCGSP----GSCRGESTRYRICNMQPCPEQQDFRSSQCSAYN- 138
            |.:|.||.|||||..|:.|:.|...:|    ||.....|.:::|...||.|.:...|.|...:: 
Mouse   187 SEFSPWSECSRTCGSGLQHRTRHVVAPPQYGGSGCPNLTEFQVCQSNPCEEDESLYSLQVGPWSA 251

  Fly   139 -DVPYD-------------------GTLYKWTPHYDYVEPCALTCRGHPAHLVEDISRETGDGNA 183
             .||:.                   |...|        :|.|       ..|::. .|.....|.
Mouse   252 CSVPHTRQARQARRRGKNKEREKERGKAVK--------DPEA-------RELIKK-KRNRNRQNR 300

  Fly   184 EEAEHYDEQSVIVQLSARVQDGTRC--RSG-SLDMCIQGKCQRVGCDLKIGSTKKIDGCGVCGGD 245
            :|..::|     :|:..:.:|.| |  |:| |.|:..   ||:....:...|......|.|.   
Mouse   301 QENRYWD-----IQIGYQTRDVT-CLNRTGKSADLSF---CQQERLPMTFQSCVITKECQVS--- 353

  Fly   246 GNSCSQPLFNWEMAPMSQCSVTCGSGYKMSRPICRNR-LTNADV-DDTLCSVTNRPEASVEQ--- 305
                       |....|.||.||......:....|.| :|...: .:..|......|..|.|   
Mouse   354 -----------EWLEWSPCSKTCHDVTSPTGTRVRTRTITQFPIGSEKECPALEEKEPCVSQGDG 407

  Fly   306 ---CNTHSCPPRWIADDWSTC----------------SRLCGHGYRERMVVCAEESNGIKTR--- 348
               |.|:.    |...:|:.|                :.|||.|.:.|.:.|.:.::.:.:.   
Mouse   408 AVLCATYG----WRTTEWTECHVDPLLSQQDKRRANQTALCGGGVQTREIYCIQTNDNMLSHGNT 468

  Fly   349 ---------VADIMCRTPKPPTQETCIIEECPHWEVEDWTGCSVSCGQGIQMRGVECKSTDGSLS 404
                     |...:|..|.|.|.:.|.: .||                      :||:.:..|..
Mouse   469 QKDKEASKPVDSKLCTGPVPNTTQLCHV-PCP----------------------IECEVSPWSAW 510

  Fly   405 AKCDPLTKPGSMQQCSTGIHCGGSLNKVGGTIIVGSSRSLNERSERQLDSSDADEDNEDENDEGD 469
            ..|             |..:|.....|.|..:           .:|::       .||.....| 
Mouse   511 GPC-------------TYENCNDQQGKKGFKL-----------RKRRI-------TNEPTGGSG- 543

  Fly   470 DVDDLESGQDTDDGEGLSYADQPLLYAHRTQSRLNQEAPDEPRTMHLMNGNSNNNFNRGED---- 530
                 .:|......|.:. .::|..|..:: .||....||        ||.|.....:.::    
Mouse   544 -----ATGNCPHLLEAIP-CEEPSCYDWKS-VRLGDCEPD--------NGKSCGPGTQVQEVVCI 593

  Fly   531 ESEGPSLDPTYIKD----------------------NEWSPCSVTC-GEGIRRRTYNCKIFLEYS 572
            .|:|..:|....:|                      :.||.||.|| |:....:....:..|.|:
Mouse   594 NSDGEEVDRQLCRDAIFPIPVACDAPCPKDCVLSAWSSWSSCSHTCSGKTTEGKQTRARSILAYA 658

  Fly   573 RTVATVNDSLCEGKKPHDEVERCVEDPCMLPSHGFDDQFPRDSIKVGVSEPGKTYVWREQGYTSC 637
            .....:.   |.......||..|.|.||.: .|.....:.:......||....|..|..:     
Mouse   659 GEEGGIR---CPNISALQEVRSCNEHPCTV-YHWQTGPWGQCIEDTSVSSFNTTTTWNGE----- 714

  Fly   638 SASCLGGVEELIINCVREDNGRVVSPFLCSPETKPEARVRTCNDRPC------PPRWNYSDYTPC 696
             |||..|::...:.|||.:.|: |.|..|....:||. ||.|. .||      .|   |||:|||
Mouse   715 -ASCSVGMQTRKVICVRVNVGQ-VGPKKCPESLRPET-VRPCL-LPCRKDCVVTP---YSDWTPC 772

  Fly   697 SKSC-----GIGIKTREVQCIHEVTRGGDNTMVVPNSMCPQP---------PPADRQYCNVLDCP 747
            ..||     |...::|:...|.....||..        |..|         ||....|       
Mouse   773 PSSCREGDSGARKQSRQRVIIQLPANGGKE--------CSDPLYEEKACEAPPTCHSY------- 822

  Fly   748 VRWEVGEWSKC--------------SHTCGYGFKDRKVECKQIMAQEHKIERPESMC---PSAKP 795
             ||:..:|.:|              ...||.|.:.|.:.|::....:..|:.    |   ....|
Mouse   823 -RWKTHKWRRCQLVPWSIQQDVPGAQEGCGPGRQARAITCRKQDGGQASIQE----CLQYAGPVP 882

  Fly   796 ADKKPCNVKPCPPEDPKPVIQINNSTHIQHDPKKT---KITLKVGGAGLV------FFGTQVKIK 851
            |..:.|.: ||                 |.|.:.|   |.:...|..|.|      ..|...|.:
Mouse   883 ALTQACQI-PC-----------------QDDCQFTSWSKFSSCNGDCGAVRTRKRAIVGKSKKKE 929

  Fly   852 ---------------CPVKRYNRTKI-KWSK------------------DHKPLQRSRKYKV--- 879
                           ||..:||...: .||.                  |.|...:..:|:.   
Mouse   930 KCKNSHLYPLIETQYCPCDKYNAQPVGNWSDCILPEGKAEVLLGMKVQGDSKECGQGYRYQAMAC 994

  Fly   880 -SKKGALRILDIT-------FRDAGVYSCHAGLSSAEIS------------IEVKAK-------- 916
             .:.|  |:::.:       ..:|.:..|.:....:|.|            ::|::|        
Mouse   995 YDQNG--RLVETSRCNSHGYIEEACIIPCPSDCKLSEWSNWSRCSKSCGSGVKVRSKWLREKPYN 1057

  Fly   917 ---PGQQIEELEQQETERLVREKSG-------TE-----ALTSADMKSADGTGT---------ST 957
               |..:::.:.|.:...:|...|.       ||     .:|..||:...|.|.         :|
Mouse  1058 GGRPCPKLDHVNQAQVYEVVPCHSDCNQYIWVTEPWSVCKVTFVDMRDNCGEGVQTRKVRCMQNT 1122

  Fly   958 AVGPNSHVN-----------GSAQ-------------------------QHGSRRRQQQSER--L 984
            |.||:.||.           ||.:                         ..|||:|.....|  .
Mouse  1123 ADGPSEHVEDYLCDPEDMPLGSRECKLPCPEDCVISEWGPWTQCALPCNPSGSRQRSADPIRQPA 1187

  Fly   985 QNGR------ERSRRPKSDGVQHADSSIMEDDLSQLVQSEDRIPQPASASSGGSRTRTLLAMPYF 1043
            ..||      |:.....:....|.|.::.:....||.:.        :....|.:||.|..    
Mouse  1188 DEGRACPDAVEKEPCSLNKNCYHYDYNVTDWSTCQLSEK--------AVCGNGIKTRMLDC---- 1240

  Fly  1044 QALLSNLQLLWPLQRFKDSRGQHLLQGEALKFGIDLSGKDFDQDESERVPAEQLPPEVLHSPSDP 1108
                                             :...||..|....|.:..|:            
Mouse  1241 ---------------------------------VRSDGKSVDLKYCEELGLEK------------ 1260

  Fly  1109 EYHEPSKSGPEADVPHVPPHMPHARWETGAAASTTAVPSLPNDGFVYRWMLGEWSQCSQECGAAG 1173
                                    .|    ..:|:.....|.:..:..|  ..||||||.||..|
Mouse  1261 ------------------------NW----PMNTSCTVECPVNCQLSDW--SSWSQCSQTCGLTG 1295

  Fly  1174 SGLQRRTVSCQRAAKATSSGSTSDTENNVVDNSECFAHGLELPEFFQSCGNEACPQWSKGDWTPC 1238
            ..:::|||                |:....|...|    ..|.|..:.|..:.|.:|..|.|:||
Mouse  1296 KMIRKRTV----------------TQPFQGDGRPC----PSLMEQSKPCPVKPCYRWQYGQWSPC 1340

  Fly  1239 --QRSHCHGRNTAMQRREVTCRYPNGTE---GSICDEY-----------ERPAMRQECYNERCEG 1287
              |.:.| |..|  :.|.::|...:|:.   ..:.||.           .:..:.:|...:.|.|
Mouse  1341 QVQEAQC-GEGT--RTRNISCVVSDGSAEDFSKVVDEEFCANTELIIDGNKQIVLEETCTQPCPG 1402

  Fly  1288 VWRVEPWSE---CNAPC------GRKGI----------------------------------KYR 1309
            ...:..||.   |...|      |..||                                  :|:
Mouse  1403 DCYLNDWSSWSLCQLTCVNGEDLGFGGIQVRSRAVIIQELENQHLCPEQMLETKSCDDGQCYEYK 1467

  Fly  1310 ILQCVWYGSRRPAG-------NV---CKHQPRPAVMKVCKSPACQAQLSSSPAQLCRDSSRY 1361
            .:...|.||.|...       ||   |....:|...:.|..|..|.....|..:.||....|
Mouse  1468 WVASAWKGSSRTVWCQRSDGINVTGGCLVVSQPDTDRSCNPPCSQPHSYCSEMKTCRCEEGY 1529

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
noloNP_001036374.1 TSP1 78..124 CDD:214559 18/48 (38%)
TSP1_ADAMTS 256..311 CDD:465950 15/62 (24%)
TSP1_ADAMTS 315..370 CDD:465950 15/82 (18%)
TSP1_ADAMTS 373..420 CDD:465950 4/46 (9%)
TSP1 546..600 CDD:214559 14/54 (26%)
TSP1_ADAMTS 629..684 CDD:465950 17/54 (31%)
TSP1_ADAMTS 688..746 CDD:465950 18/71 (25%)
TSP1_ADAMTS 750..806 CDD:465950 13/72 (18%)
Ig 845..914 CDD:472250 17/125 (14%)
Ig strand B 848..852 CDD:409353 1/18 (6%)
Ig strand C 861..865 CDD:409353 0/4 (0%)
Ig strand E 883..887 CDD:409353 1/3 (33%)
Ig strand F 897..902 CDD:409353 1/4 (25%)
TSP1_ADAMTS 1157..1227 CDD:465950 19/69 (28%)
TSP1_ADAMTS 1289..1343 CDD:465950 18/106 (17%)
PLAC 1355..1384 CDD:462560 3/7 (43%)
Thsd7aNP_001158277.1 TSP1_spondin 184..235 CDD:465948 17/47 (36%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 255..300 7/60 (12%)
TSP1_spondin 350..404 CDD:465948 14/67 (21%)
TSP1_spondin 502..562 CDD:480609 14/97 (14%)
TSP1_ADAMTS 688..756 CDD:465950 20/76 (26%)
TSP1_spondin 761..817 CDD:465948 16/66 (24%)
TSP1_spondin 1025..>1065 CDD:465948 5/39 (13%)
TSP1_ADAMTS 1088..1150 CDD:465950 15/61 (25%)
TSP1_spondin 1155..1206 CDD:480609 8/50 (16%)
TSP1_spondin 1276..1329 CDD:465948 19/74 (26%)
TSP1_spondin 1404..1463 CDD:480609 7/58 (12%)

Return to query results.
Submit another query.