DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment sca and tnca

DIOPT Version :10

Sequence 1:NP_476710.2 Gene:sca / 36411 FlyBaseID:FBgn0003326 Length:799 Species:Drosophila melanogaster
Sequence 2:XP_068077498.1 Gene:tnca / 100149028 ZFINID:ZDB-GENE-130530-849 Length:2336 Species:Danio rerio


Alignment Length:630 Identity:154/630 - (24%)
Similarity:242/630 - (38%) Gaps:160/630 - (25%)


- Green bases have known domain annotations that are detailed below.


  Fly   154 KSLRL--TTNANSVD------ADMRSELNQLREELAALRSSQSGN-KERLTVEWLQQTISEIRKQ 209
            :|.||  ||..:..|      .|.| :|:..||      .|.||: :.::..:.|..|..||...
Zfish  1769 ESFRLSWTTEDDVFDRFVLKVRDSR-KLSHPRE------FSMSGDERTKVLTQLLGGTEYEIELY 1826

  Fly   210 LVDLQRTASNVAQDVQQRSSTFEDLATIRSDYQQLKLDLAAQRERQQQTEVYVQELREEMLQQEQ 274
            .|.|:|.:..|...|:.......||..  ||.    .|.:|        .||....|    .|..
Zfish  1827 GVTLERRSQPVTAVVKTGLGALRDLHF--SDI----TDTSA--------VVYWTLPR----AQPD 1873

  Fly   275 DFQHALVKLQQRTRKDGSSASVEEESGSQEANQEQTGLETTADHKRRHCRFQSEQIHQLQLAQRN 339
            .::..|:.:|      |.|..:.:..|||.                             |::.||
Zfish  1874 SYRITLIPIQ------GGSPMIVQVDGSQS-----------------------------QVSLRN 1903

  Fly   340 L----RRQVNGLRFHHIDE-------------RVRSIEVEQHRIANANFNLSSQIASLDKLHTSM 387
            |    ..||:.:....::|             ..||:.|.....::|.......:|::|..   |
Zfish  1904 LIPGETYQVSVIAVKGLEESEPVSDTFTTALDTPRSLTVVNVTDSSALLFWQPAVATVDGY---M 1965

  Fly   388 LELLEDVEGLQTKMDKSIPELRHEIS--KLEFANAQITSEQSLIREEGTNAARSLQAMAVSVSVL 450
            :....|          ::|.:..::|  .:||              |..:.|.:.| ..|||..:
Zfish  1966 ITYSAD----------TVPPISEQVSGNTVEF--------------EMNSLAPATQ-YTVSVYAI 2005

  Fly   451 QEEREGMRKLSANVDQLRTNVDRLQSLVNDEMKNKLTHLNKPHKRPHHQNVQAQMPQDDSPIDSV 515
            : :||  :.|.|..| ..|:||..:.||...::.....|:....|.............|:.:..|
Zfish  2006 R-DRE--KSLPATAD-FTTDVDAPRDLVASNIQTDSAVLSWTPPRAAITGYTLTFQNSDADVREV 2066

  Fly   516 LAETLVSE--LENVE--TQYEAIINKL----------------------PHDCSEV----HTQTD 550
            :.:..|:.  |.|:.  |:|..|:..:                      |.||||.    .|.:.
Zfish  2067 IVDPSVASHTLSNLRSTTKYTVILQAVSEDKRSRSISTEFTTVGMLYGHPRDCSEALLNGETSSG 2131

  Fly   551 GLHLIAPAGQRHPLMTHC--TAD--GWTTVQRRFDGSADFNRSWADYAQGFGAPGGEFWIGNEQL 611
            ...:.....::.||..:|  |.|  ||....||..|..:|.|:|.:|:.|||....|||:|...|
Zfish  2132 PYTIYINGDEKQPLRVYCDMTTDGGGWMLFLRRQSGKLNFYRNWRNYSAGFGDTSDEFWLGLSNL 2196

  Fly   612 HHLTLDNCSRLQVQMQDIYDNVWVAEYKRFYISSRADGYRLHIAEYSGNASDALNYQQGMQFSAI 676
            |.:|......::|.::|..:.|: |.|.||||......|::.|..|||.|.|:|.|.|...||..
Zfish  2197 HKITAAKQYEIRVDLRDGSETVF-AVYDRFYIGDPRSRYKIQIGAYSGTAGDSLTYHQNRPFSTY 2260

  Fly   677 DDDRDISQTHCAANYEGGWWFSHCQHANLNGRY-----NLGLTWF 716
            |.|.||:.|:||.:|:|.:|:.:|...||.|:|     :.|:.||
Zfish  2261 DSDNDIAITNCALSYKGAFWYKNCHRVNLMGKYGDSSHSKGINWF 2305

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
scaNP_476710.2 PRK03918 134..>485 CDD:235175 75/358 (21%)
FBG 538..736 CDD:214548 69/214 (32%)
tncaXP_068077498.1 EGF_Tenascin 191..218 CDD:480934
EGF_Tenascin 253..280 CDD:480934
EGF_Tenascin 315..342 CDD:480934
EGF_Tenascin 377..404 CDD:480934
EGF_Tenascin 408..434 CDD:480934
EGF_Tenascin 439..466 CDD:480934
C_rich_MXAN6577 449..578 CDD:469225
EGF_Tenascin 470..497 CDD:480934
FN3 593..679 CDD:238020
fn3 682..763 CDD:394996
FN3 742..1122 CDD:442628
FN3 1087..1171 CDD:238020
fn3 1210..1289 CDD:394996
fn3 1301..1376 CDD:394996
fn3 1392..1464 CDD:394996
FN3 1486..1570 CDD:238020
fn3 1574..1649 CDD:394996
fn3 1665..1740 CDD:394996
fn3 1756..1829 CDD:394996 18/66 (27%)
fn3 1846..1925 CDD:394996 24/131 (18%)
fn3 1935..2013 CDD:394996 20/108 (19%)
fn3 2023..2096 CDD:394996 13/72 (18%)
FReD 2114..2323 CDD:238040 69/193 (36%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.