DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment N and Fat4

DIOPT Version :10

Sequence 1:NP_476859.2 Gene:N / 31293 FlyBaseID:FBgn0004647 Length:2703 Species:Drosophila melanogaster
Sequence 2:NP_899044.3 Gene:Fat4 / 329628 MGIID:3045256 Length:4981 Species:Mus musculus


Alignment Length:960 Identity:206/960 - (21%)
Similarity:277/960 - (28%) Gaps:429/960 - (44%)


- Green bases have known domain annotations that are detailed below.


  Fly   389 DGVGSFYCQCTKGKTGLLCHLDDAC--TSNPCHADAICDTSPINGSYACSCAT--------GYKG 443
            |.:.||:|..|.|.|.|......:|  :|.|...|...|.:.::.....|..|        |:..
Mouse  3644 DVLDSFHCSLTSGVTSLFSIPAGSCDLSSQPRSTDGTFDLTVVSSDGVHSTVTNNIRVFFAGFSN 3708

  Fly   444 VDCSEDIDECDQGSPCEHN--------------------GICVNTPGSYRCNCSQGFTGPRCETN 488
            ......| ....|.|...:                    |..|....:|..| ::.|.....:.|
Mouse  3709 ATIDNSI-LLRVGVPTVKDFLTNHYLHFLRIASSQLTGLGTAVQLYAAYEEN-NRTFLLAAVKRN 3771

  Fly   489 INE---------------------------------CESHPCQNEGSCL---------------- 504
            .|:                                 |...||||.||||                
Mouse  3772 NNQYVNPSGVATFFESIKEILLRQSGVKVESVDHDPCIHGPCQNGGSCLRRLAVGSALKIQESLP 3836

  Fly   505 -----DDP-GTFRCVCMPGFTGTQCEIDIDECQSNPCLNDGTCHDKINGFKCSCALGFTGARCQI 563
                 ::| ...:|.|:||:.|:.||:|||||...||.|.||||:.:.||.|||..||||..|:.
Mouse  3837 VIIVANEPLQPSQCKCVPGYAGSWCEVDIDECLPAPCHNGGTCHNLVGGFSCSCPEGFTGRACER 3901

  Fly   564 NIDDCQSQPCRNRGICHDSIAGYSCECPPGYTGTSCEININDCDSNPCHRGKCIDDVNSFKCLCD 628
            :|::|...||::..:|.:...|::|.|..||||..||.::|.|:.|||..|              
Mouse  3902 DINECLPSPCKHGAVCQNFPGGFNCVCKTGYTGKMCESSVNYCECNPCFNG-------------- 3952

  Fly   629 PGYTGYICQKQINECESNPCQFDGHCQDRVGSYYCQCQAGTSGKNCEVN--------VNECHSNP 685
                                   |.||..|.||||.|..|..||:||:|        ..|..|..
Mouse  3953 -----------------------GSCQSGVESYYCHCPFGVFGKHCELNSYGFEELSYMEFPSLD 3994

  Fly   686 CNNGATCID----------------------------------------GINSYKCQCVPGFT-- 708
            .||....:.                                        |..:||...:...:  
Mouse  3995 PNNNYIYVKFATIKSHALLLYNYDNQTGERAEFLALEIAEERLRFSYNLGSGTYKLTTMKKVSDG 4059

  Fly   709 ----------GQHCEKNVDECISSP----CANNGVCIDQ------------VNGYKCECP----R 743
                      |......||.|..:.    |..:.|.:..            |.|.:...|    |
Mouse  4060 QFHTVIARRAGMAASLTVDSCSENQEPGYCTVSNVAVSDDWTLDVQPNRVTVGGIRSLEPILQRR 4124

  Fly   744 GFYDAHCLSDVDECASNPCVNEGR--------CEDGINEFICHCPPGYTGKRCELDIDECSSNPC 800
            |..::|   |...|.....|| ||        ...||   :..||      |.|   ..|:.|||
Mouse  4125 GHVESH---DFVGCVMEFAVN-GRPLEPSQALAAQGI---LDQCP------RLE---GTCARNPC 4173

  Fly   801 QHGGTCYDKLNAFSCQCMPGYTGQKCE--------------------------------TNIDDC 833
            ||||||.|..:...||||.|.||:.||                                .:|.|.
Mouse  4174 QHGGTCVDFWSWQQCQCMEGLTGKYCEKSVTPDTALSLEGKGRLDYHMSQSEKREYLLTQSIRDT 4238

  Fly   834 VTNPCG----------------------------------------------------------- 839
            ...|.|                                                           
Mouse  4239 TLEPFGVNSLEVKFRTRSENGILIHIQESSNYTTVKIKNGKVHFTSDAGVAGKVERIIPEAYIAD 4303

  Fly   840 ----------NGGTCI--------------------------------------DKVNGYK-CVC 855
                      ||...:                                      |...|:. |:.
Mouse  4304 GHWHTFRISKNGSITVLSVDRIHNRDIVHPTQDFGGIEVLSMSLGGIPPNQAHRDTQTGFNGCIA 4368

  Fly   856 KV-------PFTGRD---CESKMDPCASNRCKNEAKCTPSSNFLDFSCTCKLGYTGRYCDEDIDE 910
            .|       ||:|:.   ..||.||.....|:....|.                           
Mouse  4369 SVLYGGESLPFSGKHSLASISKTDPSVKIGCRGPNICA--------------------------- 4406

  Fly   911 CSLSSPCRNGASCLNVPGSYRCLCTKGYEGRDCAINTDDCASFPCQNGGTCLDG-IGDYSCLCVD 974
               |:||.....|:|...:|:|            :...||||.||||||:|..| :..|:|.|.:
Mouse  4407 ---SNPCWGDLLCINQWYAYKC------------VPPGDCASHPCQNGGSCEPGLLSGYTCSCPE 4456

  Fly   975 GFDGKHCETDINECLSQPCQNGATCSQYVNSYTCTCPLGFSGINCQTNDE 1024
            ...|:.||| :..||...|..|..|.       ...|.|...:..|..||
Mouse  4457 SHTGRTCET-VVACLGVLCPQGKVCK-------AGSPGGHVCVQSQGPDE 4498

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
NNP_476859.2 EGF_CA 179..214 CDD:238011
EGF_CA 217..252 CDD:238011
EGF_CA 260..291 CDD:238011
EGF_CA 295..329 CDD:238011
EGF_CA 331..369 CDD:238011
EGF_CA 449..486 CDD:238011 8/56 (14%)
EGF_CA 488..524 CDD:238011 17/90 (19%)
EGF_CA 526..562 CDD:238011 21/35 (60%)
EGF_CA 564..600 CDD:238011 12/35 (34%)
EGF_CA 602..637 CDD:238011 6/34 (18%)
EGF_CA 640..675 CDD:238011 12/34 (35%)
EGF_CA 677..713 CDD:238011 9/95 (9%)
EGF_CA 715..750 CDD:238011 10/54 (19%)
EGF_CA 753..789 CDD:238011 11/43 (26%)
EGF_CA 791..827 CDD:238011 18/35 (51%)
EGF_CA 829..865 CDD:238011 13/153 (8%)
EGF_CA 907..943 CDD:238011 7/35 (20%)
EGF_CA 946..982 CDD:238011 16/36 (44%)
EGF_CA 984..1020 CDD:238011 7/35 (20%)
EGF_CA 1027..1058 CDD:238011
EGF_CA 1062..1095 CDD:238011
EGF_CA 1184..1219 CDD:238011
EGF_CA 1221..1257 CDD:238011
EGF_CA 1259..1295 CDD:238011
EGF_CA 1297..1335 CDD:238011
EGF_CA 1338..1373 CDD:238011
EGF_CA 1417..1450 CDD:238011
NL 1476..1512 CDD:197463
Notch 1519..1553 CDD:459658
Notch 1565..1593 CDD:459658
NOD 1598..1652 CDD:462014
NODP 1679..1731 CDD:462229
JMTM_dNotch 1719..1806 CDD:411989
ANK repeat 1902..1948 CDD:293786
ANKYR 1936..2139 CDD:440430
ANK repeat 1951..1981 CDD:293786
ANK repeat 1984..2015 CDD:293786
ANK repeat 2017..2048 CDD:293786
ANK repeat 2050..2081 CDD:293786
ANK repeat 2083..2114 CDD:293786
Fat4NP_899044.3 EGF_CA 4430..4464 CDD:238011 16/33 (48%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4535..4585
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4677..4713
Necessary and sufficient for interaction with MPDZ. /evidence=ECO:0000269|PubMed:19506035 4708..4797
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4753..4773
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4796..4911
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 4957..4981
Cadherin_repeat 48..131 CDD:206637
Cadherin_repeat 139..246 CDD:206637
Cadherin_repeat 254..349 CDD:206637
Cadherin_repeat 364..471 CDD:206637
Cadherin_repeat 479..577 CDD:206637
Cadherin_repeat 588..685 CDD:206637
Cadherin_repeat 693..789 CDD:206637
Cadherin_repeat 798..889 CDD:206637
Cadherin_repeat 904..992 CDD:206637
Cadherin_repeat 1000..1096 CDD:206637
Cadherin_repeat 1105..1206 CDD:206637
Cadherin_repeat 1215..1311 CDD:206637
Cadherin_repeat 1321..1416 CDD:206637
Cadherin_repeat 1428..1525 CDD:206637
Cadherin_repeat 1540..1622 CDD:206637
Cadherin_repeat 1634..1736 CDD:206637
Cadherin_repeat 1747..1837 CDD:206637
Cadherin_repeat 1845..1940 CDD:206637
Cadherin_repeat 1948..2047 CDD:206637
Cadherin_repeat 2057..2150 CDD:206637
Cadherin_repeat 2159..2254 CDD:206637
Cadherin_repeat 2263..2360 CDD:206637
Cadherin_repeat 2369..2464 CDD:206637
Cadherin_repeat 2472..2559 CDD:206637
Cadherin_repeat 2573..2667 CDD:206637
Cadherin_repeat 2675..2771 CDD:206637
Cadherin_repeat 2778..2870 CDD:206637
Cadherin_repeat 2878..2981 CDD:206637
Cadherin_repeat 2990..3085 CDD:206637
Cadherin_repeat 3095..3192 CDD:206637
Cadherin_repeat 3200..3295 CDD:206637
Cadherin_repeat 3305..3402 CDD:206637
Cadherin_repeat 3410..3508 CDD:206637
Cadherin_repeat 3516..3612 CDD:206637
EGF_CA 3804..3862 CDD:238011 15/57 (26%)
EGF_CA 3864..3900 CDD:238011 21/35 (60%)
EGF_CA 3902..3938 CDD:238011 12/35 (34%)
EGF_CA 3941..3976 CDD:238011 18/71 (25%)
LamG 3979..4142 CDD:238058 21/165 (13%)
EGF_CA 4168..4200 CDD:238011 18/31 (58%)
Laminin_G_2 4252..4375 CDD:460494 6/122 (5%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.