DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment trol and Egflam

DIOPT Version :10

Sequence 1:NP_001401049.1 Gene:trol / 45320 FlyBaseID:FBgn0284408 Length:4489 Species:Drosophila melanogaster
Sequence 2:XP_006232075.1 Gene:Egflam / 365691 RGDID:1306592 Length:1013 Species:Rattus norvegicus


Alignment Length:1198 Identity:277/1198 - (23%)
Similarity:454/1198 - (37%) Gaps:268/1198 - (22%)


- Green bases have known domain annotations that are detailed below.


  Fly  3358 VKTYFVKELVL---MPLPRIRFYPNIPLTVEAGQNLDVHCQVENVRPEDVHWSTDNNRPLPSSVR 3419
            :.|:.:..|:|   :|...:.....:..:.:.|..||:.....|.....:.|.|    |..|...
  Rat     4 ISTFLLHFLLLACSLPPGAVSLRTALRKSGKVGPPLDIKLGALNCTAFSIQWKT----PKRSGSS 64

  Fly  3420 IVGSVLRFVS----------------------ITQAAAG------EYRCS--AFNQYG-NRSQIA 3453
            |||..: |.|                      ||:...|      |||.|  |::|.| .|....
  Rat    65 IVGYTV-FYSELGSDKSLREQSHNVPVGQDTLITEEVIGDLKPGTEYRVSIAAYSQTGKGRLSFP 128

  Fly  3454 RVAVKKPADFHQVPQSQLQRHREGENIQLQCTVTDQYGVRAQDNVEFNWFRDDRRPLPNNARTDS 3518
            |.......|....|::..|.|        ...|:|       ..|..:|     ||..|......
  Rat   129 RHVTTLSQDSCLPPEAPHQPH--------VLVVSD-------SEVALSW-----RPGENEGSAPI 173

  Fly  3519 QILVLTNLRPEDAGRYICNSYDVDRGQQL--PEVSIDLQVLRAPQYPYNRFKGGVSLKDTPCMVL 3581
            |...:..:||           |.|:...:  ..:.:|..|::... |...::..|.     .|..
  Rat   174 QSYSVEFIRP-----------DFDKSWTIIQERLQMDSMVIKGLD-PDTNYQFAVR-----AMNA 221

  Fly  3582 YICAAATPPPNSPIYLPPQLPAKSR-------DYSLKLDDQSSNLRAGESTDVECYSSDDTYTDV 3639
            |..:..:.|.|:...|.|......|       |..:..||.:              |.|:...||
  Rat   222 YGFSLRSQPSNTIRTLGPGEAGSGRYGPGYITDTGVSEDDDA--------------SEDELDLDV 272

  Fly  3640 VWERSDGAPLSNNVRQVGN-RLVISNVSPSDAGNYVCKCKTDEGDLYTTSYKLEVEDQPHELKSS 3703
            .:|.....|.:    :||| :...::||.|:..:.:  .:.....|..|:..:.......:.|:|
  Rat   273 SFEEVKPLPAT----KVGNKKSKKTSVSNSEMDSRL--AQPTSASLPETTVAVPPTPAQRKGKNS 331

  Fly  3704 KIVYAKVGANADLQCGADESRQPTYRWSRQYGQLQAGRSLMNEKLSLDSVQANDA--GTYICTAQ 3766
            ..|.:::   .|:.|  ||:                       ..|.||...||.  |...|...
  Rat   332 VAVMSRL---FDMSC--DET-----------------------LCSADSFCVNDYAWGGSRCHCN 368

  Fly  3767 YADGETADFPNILVVTGAIPQFRQEPRSYMSFPTLPNSSFKFNFELTFRPENGDGLLLFNGQT-R 3830
            ...|..|...:|.:   ..|||  ...||::|..|.||...|...|.||.|..|||||:.|:: .
  Rat   369 LGKGGEACSEDIFI---QYPQF--FGHSYVTFEPLKNSYQAFQITLEFRAEAEDGLLLYCGESEH 428

  Fly  3831 GSGDYIALSLKDRYAEFRFDFGGKPMLVRAEEPLALNEWHTVRVSRFKRDGYIQVDEQHPVAFPT 3895
            |.||:::|:|..|...|||:.|....::.:|..:.|..||:|.:.|...:|.:|::...||...:
  Rat   429 GRGDFMSLALIRRSLHFRFNCGTGMAIIISETKIKLGAWHSVTLYRDGLNGLLQLNNGTPVTGQS 493

  Fly  3896 LQQIPQLDLIEDLYIGGVPN--WELLPADAVSQQVGFVGCISRLTLQGRTVEL----IREAKYKE 3954
            ..|..::.....||:||.|:  | |:.|...::  ||.||:..|.:.|:.:::    :.:|....
  Rat   494 QGQYSKITFRTPLYLGGAPSAYW-LVRATGTNR--GFQGCVQSLAVNGKKIDMRPWPLGKALNGA 555

  Fly  3955 GITDCRP--CAQGPCQNKGVCLESQTEQAYTCICQPGWTGRDCAIEGTQCTPGVCGAGRCENTEN 4017
            .:.:|..  |.:..|.|.|.|...:.: :|.|:|..|:.||.|....|...|             
  Rat   556 DVGECSSGICDEASCINGGTCAAIKAD-SYICLCPLGFRGRHCEDAFTLTIP------------- 606

  Fly  4018 DMECLCPLNRSGDRCQYNEILNEHSLNFKGNSFAAYGTP-------KVTKVNITLSVRPASLEDS 4075
                           |:.|.|         .|:||...|       ..|:..||.  ||.| .|.
  Rat   607 ---------------QFRESL---------RSYAATPWPLEPQHYLSFTEFEITF--RPDS-GDG 644

  Fly  4076 VILYTAESTLPSGDYLALVLRGGHAELLINTAARLDPVVVRSAEPLPLNRWTRIEIRRRLGEGIL 4140
            |:||:.:::  |.|:|::::..||.|...:..:...  |:||.:.|.|.:|..:.:.|....|||
  Rat   645 VLLYSYDTS--SKDFLSIIMAAGHVEFRFDCGSGTG--VLRSEDTLTLGQWHDLRVSRTAKNGIL 705

  Fly  4141 RVGDGPERKAKAPGSDRILSLKTHLYVGG---YDRSTVKVNRDVNITKGFDGCISRLYNFQKPVN 4202
            :|......:..|.|....:...|.:::||   ||    .|.::..|...|.|.|.::....:.::
  Rat   706 QVDKQKVVEGMAEGGFTQIKCNTDIFIGGVPNYD----DVKKNSGILHPFSGSIQKIILNDRTIH 766

  Fly  4203 LLADIKDAANIQSCGETNMIGGDEDSDNEPPVPPPTPDVHENELQPYAMAPCASDPCENGGSCSE 4267
            :..|.....|:::                                  |..||...||.:||||..
  Rat   767 VRHDFTSGVNVEN----------------------------------AAHPCVGAPCAHGGSCRP 797

  Fly  4268 QEDVAVCSCPFGFSGKHCQ------------EHLQLGFNASFRGDGYVELNRSHFQPALEQSYTS 4320
            :::...|.||.||.|.:||            |.:::   ..|.|..|:..:..:....:..|.::
  Rat   798 RKEGYECDCPLGFEGLNCQKECGNYCLNTITEAIEI---PQFIGRSYLTYDNPNILKRVSGSRSN 859

  Fly  4321 MGIVFTTNKPNGLLFWWGQEAGEEYTGQDFIAAAVVDGYVEYSMRL-DGEEAVIRNSDIRVDNGE 4384
            ..:.|.|...:|||.|.|.....  ...|||:..:.||.:.:|..| .|..:::.|...  .:|.
  Rat   860 AFMRFKTTAKDGLLLWRGDSPMR--PNSDFISLGLRDGALVFSYNLGSGVASIMVNGSF--SDGR 920

  Fly  4385 RHIVIAKRDENTAILEVDRM-LHSGETRPTSKKSMKLPGNVFVGGAPDLEVFTGFRYKHNLNGCI 4448
            .|.|.|.||..:..:.||.. ..:|:: |...:.:.:.|.::|||..::.:.|..:|...|.|||
  Rat   921 WHRVKAVRDGQSGKITVDDYGARTGKS-PGMMRQLNINGALYVGGMKEIALHTNRQYMRGLVGCI 984

  Fly  4449 VVVEGETVGQINLSSAAVNGVNANVCPA 4476
            ......|...|:|...||:|.|.|.|.|
  Rat   985 SHFTLSTDYHISLVEDAVDGKNINTCGA 1012

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
trolNP_001401049.1 LDLa 447..480 CDD:238060
LDLa 532..564 CDD:238060
LDLa 571..606 CDD:238060
LDLa 616..650 CDD:238060
LDLa 704..739 CDD:238060
LDLa 813..847 CDD:238060
LDLa 852..887 CDD:238060
LDLa 902..936 CDD:238060
LDLa 956..990 CDD:238060
LDLa 997..1030 CDD:238060
LDLa 1033..1067 CDD:238060
LDLa 1072..1105 CDD:238060
LDLa 1133..1165 CDD:238060
LDLa 1182..1215 CDD:238060
LDLa 1221..1251 CDD:238060
LDLa 1252..1286 CDD:238060
LDLa 1292..1326 CDD:238060
Ig 1344..1405 CDD:472250
Ig strand B 1355..1359 CDD:409353
Ig strand C 1369..1372 CDD:409353
Ig strand E 1388..1392 CDD:409353
LDLa 1439..1473 CDD:238060
LDLa 1479..1513 CDD:238060
LDLa 1522..1556 CDD:238060
Ig_Perlecan_like 1574..1649 CDD:143220
Ig strand B 1577..1583 CDD:143220
Ig strand C 1590..1595 CDD:143220
Ig strand E 1613..1617 CDD:143220
Ig strand F 1626..1632 CDD:143220
Ig strand G 1641..1647 CDD:143220
Laminin_B 1748..1878 CDD:459652
EGF_Lam 1935..1989 CDD:238012
Laminin_B 2113..2249 CDD:459652
EGF_Lam <2250..2276 CDD:238012
EGF_Lam 2284..2333 CDD:238012
Laminin_EGF <2369..2399 CDD:395007
Laminin_B 2465..2599 CDD:459652
Ig_3 2688..2756 CDD:464046
Ig_3 2791..2861 CDD:464046
Ig_3 2890..2958 CDD:464046
Ig 2987..3071 CDD:472250
Ig strand B 3004..3008 CDD:409353
Ig strand C 3017..3021 CDD:409353
Ig strand E 3036..3040 CDD:409353
Ig strand F 3050..3055 CDD:409353
Ig strand G 3063..3066 CDD:409353
I-set 3089..3162 CDD:400151
Ig strand B 3094..3098 CDD:409353
Ig strand C 3107..3111 CDD:409353
Ig strand E 3128..3132 CDD:409353
Ig strand F 3142..3147 CDD:409353
Ig strand G 3155..3158 CDD:409353
Ig 3165..3253 CDD:472250
Ig strand B 3185..3189 CDD:409353
Ig strand C 3198..3202 CDD:409353
Ig strand E 3219..3223 CDD:409353
Ig strand F 3233..3238 CDD:409353
Ig strand G 3246..3249 CDD:409353
Ig 3298..3355 CDD:472250
Ig strand B 3301..3305 CDD:409353
Ig strand C 3313..3316 CDD:409353
Ig strand E 3333..3336 CDD:409353
IG_like 3381..3457 CDD:214653 25/106 (24%)
Ig strand B 3390..3394 CDD:409353 2/3 (67%)
Ig strand C 3403..3407 CDD:409353 0/3 (0%)
Ig strand F 3437..3442 CDD:409353 4/6 (67%)
IG_like 3475..>3536 CDD:214653 10/60 (17%)
Ig strand B 3480..3484 CDD:409353 0/3 (0%)
Ig strand C 3491..3503 CDD:409353 1/11 (9%)
Ig strand E 3519..3523 CDD:409353 1/3 (33%)
Ig 3613..3675 CDD:472250 14/62 (23%)
Ig strand B 3625..3629 CDD:409353 0/3 (0%)
Ig strand C 3637..3642 CDD:409353 2/4 (50%)
Ig strand E 3658..3662 CDD:409353 1/4 (25%)
IG_like 3706..3772 CDD:214653 13/67 (19%)
Ig strand B 3714..3718 CDD:409353 1/3 (33%)
Ig strand C 3727..3731 CDD:409353 0/3 (0%)
Ig strand E 3746..3750 CDD:409353 0/3 (0%)
Ig strand F 3760..3765 CDD:409353 1/4 (25%)
Ig strand G 3774..3777 CDD:409353 0/2 (0%)
LamG 3794..3939 CDD:238058 51/147 (35%)
EGF 3962..3994 CDD:394967 10/31 (32%)
LamG 4042..4195 CDD:238058 43/162 (27%)
EGF_CA 4254..4286 CDD:238011 13/31 (42%)
LamG 4295..4448 CDD:238058 38/154 (25%)
EgflamXP_006232075.1 FN3 36..133 CDD:238020 25/101 (25%)
FN3 142..236 CDD:238020 23/130 (18%)
LamG 387..537 CDD:238058 52/154 (34%)
EGF_CA 556..598 CDD:238011 12/42 (29%)
LamG 614..761 CDD:238058 43/157 (27%)
EGF_CA 786..816 CDD:238011 12/29 (41%)
Laminin_G_2 864..989 CDD:460494 36/129 (28%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.