DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment frac and Fbn2

DIOPT Version :10

Sequence 1:NP_648137.3 Gene:frac / 38850 FlyBaseID:FBgn0035798 Length:1618 Species:Drosophila melanogaster
Sequence 2:NP_114014.2 Gene:Fbn2 / 689008 RGDID:620910 Length:2907 Species:Rattus norvegicus


Alignment Length:1429 Identity:381/1429 - (26%)
Similarity:520/1429 - (36%) Gaps:527/1429 - (36%)


- Green bases have known domain annotations that are detailed below.


  Fly   241 PTFLD-NNCGEDNGGCAH-ICKRLLYPDENQPINKCDCREGYTLDPNDYASCLDIDECLESNGGC 303
            |.:.| |.|....|.|.: .|:..:      ...||.|..|:.||..: .:|.|||||..|...|
  Rat  1062 PFYKDINECKAFPGMCTYGKCRNTI------GSFKCRCNSGFALDMEE-RNCTDIDECRISPDLC 1119

  Fly   304 -SEICENLPGEYKCSCQEGYYLDESG----KSCVDINEC-ANPELSSNCQ-GACENLPGSYRCVE 361
             |.||.|.||.:.|.|.|||   |||    |:|:||:|| .||.|   |: |.|.|..||::|..
  Rat  1120 GSGICVNTPGSFDCECFEGY---ESGFMMMKNCMDIDECERNPLL---CRGGTCVNTEGSFQCDC 1178

  Fly   362 PL--EENPEITEVVENPIEKTNEVPV--NVSESQPAGK----------TCNSGFQLSADGTDCQD 412
            ||  |.:|...:.::     .||..:  |:..:   ||          :||.|:|.:.|...|.|
  Rat  1179 PLGHELSPSREDCID-----INECSLSDNLCRN---GKCVNMIGTYQCSCNPGYQATPDRQGCSD 1235

  Fly   413 INECEVDGPEDLDNNAVCQQKCENTIGSFRCTCVEGYHLLEDQRSCA-LDSCTDLENPQLNRTRC 476
            |:||.:       .|..|..:|.|:.||:.|:|.|||.|:.|.|||| :|.|.:  ||.:    |
  Rat  1236 IDECMI-------MNGGCDTQCTNSEGSYECSCSEGYALMPDGRSCADIDECEN--NPDI----C 1287

  Fly   477 -AHECQDLPEGSYRCVCPKGYELSEDQHSCL-VQE----------SPCSTEKG------------ 517
             ..:|.::| |.|||:|..|:..|.|..:|: |.|          ..|...||            
  Rat  1288 DGGQCTNIP-GEYRCLCYDGFMASMDMRTCIDVNECDLNPNICMFGECENTKGSFICHCQLGYSV 1351

  Fly   518 ---------VEKCSPG--------TCLASEDNT--SFSCICPTGYRSEAFSCQDIDECAEDTHLC 563
                     |::|..|        :||    |.  ||.|.|..|:......|.|:|||:..||.|
  Rat  1352 KKGATGCTDVDECEIGAHNCDMHASCL----NVPGSFKCSCREGWVGNGIKCIDLDECSNGTHQC 1412

  Fly   564 S--HTCQNTPGGYQCQCPEGLNLVEEYTCLAENLCEVNNNGCEQ-ICLTARGGV-CACREGFRLS 624
            |  ..|.||||.|:|.|.||.. .:.:||...:.|..|.|.||. .||...|.. |.|..||..:
  Rat  1413 SINAQCVNTPGSYRCACSEGFT-GDGFTCSDVDECAENINLCENGQCLNVPGAYRCECEMGFTPA 1476

  Fly   625 ADGKSCEDVDECLVNNGGCQQVCRNLPGSYGCICAAGYELLKLDGIRGYCFDIDECSQRTH---- 685
            :|.:||:|:|||...|......|.||||.:.|||..|||   ||...|.|.|||||:...:    
  Rat  1477 SDSRSCQDIDECSFQNICVFGTCNNLPGMFHCICDDGYE---LDRTGGNCTDIDECADPINCVNG 1538

  Fly   686 ------------------------GCSDQ------------------------------------ 690
                                    ||.|.                                    
  Rat  1539 LCVNTPGRYECNCPPDFQLNPTGVGCVDNRVGNCYLKFGPRGDGSLSCNTEVGVGVSRSSCCCSL 1603

  Fly   691 ----------------------------------------------------------------- 690
                                                                             
  Rat  1604 GKAWGNPCETCPPVNSTEYYTLCPGGEGFRPNPITIILEDIDECQELPGLCQGGNCINTFGSFQC 1668

  Fly   691 ------------MLCENLN------------------GSYTCLCPPGYALGLDNHIVTSLNSSFI 725
                        .:||:::                  |:|||:|||.|......|....:..||.
  Rat  1669 ECPQGYYLSEETRICEDIDECFAHPGVCGPGTCYNTLGNYTCICPPEYMQVNGGHNCMDMRKSFC 1733

  Fly   726 TDS---TSSE------------------------------TPSA-----------------HT-- 738
            ..|   |:.|                              ||..                 ||  
  Rat  1734 YRSYNGTTCENELPFNVTKRMCCCTYNVGKAWNKPCEPCPTPGTADFKTICGNIPGFTFDIHTGK 1798

  Fly   739 CLDIDECSLANGNCSH-FCQNEPGGFQCACPLGYALSEDMRTCQDIDECLDSNGQCSQL--CLNQ 800
            .:|||||....|.|:: .|.|:.|.|:|.||.|::.::.:..|:|||||.:.:..|.:.  |:|.
  Rat  1799 AVDIDECKEIPGICANGVCINQIGSFRCECPTGFSYNDLLLVCEDIDECSNGDNLCQRNADCINS 1863

  Fly   801 PGGFACACETGFELTPDGFGCADIDECSQDYGNCS-DICINLLGTHACACERGYELAKDKLSCLD 864
            ||.:.|.|..||:|:|:| .|.|.:||.:....|| .:|::|.|::.|.|..|::.::|:..|:|
  Rat  1864 PGSYRCECAAGFKLSPNG-ACVDRNECLEIPNVCSHGLCVDLQGSYQCICNNGFKASQDQTMCMD 1927

  Fly   865 VDECAGLLSGGCSHECINKAGTFECGCPLGYIL---ND--DGRSCSPALVG--CPPGTQRSADGC 922
            ||||.....|  :..|.|..|::.|.|..|:.|   ||  |...|| :..|  |..|...:..|.
  Rat  1928 VDECERHPCG--NGTCKNTVGSYNCLCYPGFELTHNNDCLDIDECS-SFFGQVCRNGRCFNEIGS 1989

  Fly   923 APIECNPGYTLGSDDK-CVDIDECQKQNGGCS-HRCSNTEGSFKCSCPPGYELDSDQKTCQDIDE 985
            ....||.||.|..|.| |:|.:||....|.|| ..|.|.||||:|.||||||:.|:  .|.||:|
  Rat  1990 FKCLCNEGYELTPDGKNCIDTNECVALPGSCSPGTCQNLEGSFRCICPPGYEVKSE--NCIDINE 2052

  Fly   986 CDQDKTSCITGTCINEIGGFRCEFPKFPVLPE-----IPTASSLPESPKIELKTPKYPDFTELSN 1045
            ||:|...|:.|:|.|..|||:|..|...||.:     ..|..|.              .||..  
  Rat  2053 CDEDPNICLFGSCTNTPGGFQCICPPGFVLSDNGRRCFDTRQSF--------------CFTNF-- 2101

  Fly  1046 EIPENPKKPAEFDYPEPKFPSLPKWEGLPKLPPLADIPTSKAPVPLRPEVPKSLWVNQLQPRDLC 1110
               ||.|            .|:||           ...|:||.. ...::|...|.:   |.:||
  Rat  2102 ---ENGK------------CSVPK-----------AFNTTKAKC-CCSKMPGEGWGD---PCELC 2136

  Fly  1111 PR-----FQ-----------APRNGKSHCNRYRHKQKLFYYSRC-------RISCNSGYILQGPE 1152
            |:     ||           :..:.:...|.......:....:|       |..|..||.|....
  Rat  2137 PKDDEVAFQDLCPYGHGTVPSLHDTREDVNECLESPGICSNGQCINTDGSFRCECPMGYNLDYSG 2201

  Fly  1153 IK-------------------------SCDAN-GIWEGPETKCVAINQPRAQSP----------- 1180
            ::                         .|:.| |...||...|..||: .||:|           
  Rat  2202 VRCVDTDECSIGNPCGNGTCANVIGSFECNCNEGFEPGPMMNCEDINE-CAQNPLLCAFRCMNTF 2265

  Fly  1181 ------------------------------------------------GICP---ALKPAQNGVI 1194
                                                            .|||   |.:|...|.:
  Rat  2266 GSYECTCPVGYALREDQKMCKDLDECAEGLHDCESRGMMCKNLIGTFMCICPPGMARRPDGEGCV 2330

  Fly  1195 LPASCTQGPSRFGDVC------------RLQCNAGFVSTGSLLTACM-MLQGWSFGADLN--CQ 1243
            ....|...|.    :|            |.:||.||.|:.| .|.|: ..||..|...|.  ||
  Rat  2331 DENECRTKPG----ICENGRCVNIIGSYRCECNEGFQSSSS-GTECLDNRQGLCFAEVLQTMCQ 2389

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
fracNP_648137.3 FXa_inhibition 296..331 CDD:464251 18/39 (46%)
cEGF 396..415 CDD:463661 8/18 (44%)
FXa_inhibition 425..458 CDD:464251 14/32 (44%)
FXa_inhibition 476..505 CDD:464251 11/29 (38%)
EGF_CA 552..>581 CDD:214542 16/30 (53%)
FXa_inhibition 596..630 CDD:464251 13/35 (37%)
FXa_inhibition 636..>664 CDD:464251 12/27 (44%)
EGF_CA 676..711 CDD:214542 18/193 (9%)
FXa_inhibition 745..780 CDD:464251 11/35 (31%)
FXa_inhibition 786..821 CDD:464251 13/36 (36%)
vWFA <821..861 CDD:469594 13/40 (33%)
FXa_inhibition 874..904 CDD:464251 11/34 (32%)
FXa_inhibition 945..980 CDD:464251 18/35 (51%)
PTZ00449 <1017..>1098 CDD:185628 14/85 (16%)
CCP 1110..1170 CDD:153056 16/108 (15%)
CCP 1183..1235 CDD:153056 19/67 (28%)
HYR 1468..1536 CDD:460572
Fbn2NP_114014.2 Fibrillin_U_N 76..112 CDD:436338
TB 224..>257 CDD:459903
EGF_CA 276..>306 CDD:214542
TB 373..417 CDD:459903
EGF_CA 528..558 CDD:214542
EGF_CA 568..609 CDD:214542
EGF_CA 610..650 CDD:214542
EGF_CA 651..691 CDD:214542
TB 706..749 CDD:459903
EGF_CA 761..802 CDD:214542
EGF_CA 803..838 CDD:238011
TB 898..>928 CDD:459903
EGF_CA 948..978 CDD:429571
TB 1003..1045 CDD:459903
EGF_CA 1108..1142 CDD:214542 19/36 (53%)
EGF_CA 1151..1183 CDD:214542 16/34 (47%)
EGF_CA 1193..1225 CDD:214542 8/39 (21%)
FXa_inhibition 1239..1274 CDD:464251 15/41 (37%)
EGF_3 1364..1399 CDD:463759 10/38 (26%)
EGF_3 1405..1440 CDD:463759 15/35 (43%)
EGF_CA 1484..1519 CDD:214542 18/37 (49%)
EGF_CA 1525..1556 CDD:214542 5/30 (17%)
TB 1586..1626 CDD:459903 0/39 (0%)
EGF_CA 1643..1675 CDD:214542 0/31 (0%)
TB 1741..1784 CDD:459903 3/42 (7%)
EGF_CA 1801..1833 CDD:214542 15/31 (48%)
EGF_CA 1843..>1875 CDD:214542 12/31 (39%)
vWFA <1880..1922 CDD:469594 13/42 (31%)
EGF_CA 1927..1961 CDD:214542 13/35 (37%)
EGF_CA 1966..2008 CDD:214542 14/42 (33%)
EGF_CA 2009..2041 CDD:214542 17/31 (55%)
EGF_CA 2049..2089 CDD:214542 17/39 (44%)
TB 2104..2148 CDD:459903 15/70 (21%)
EGF_CA 2164..2205 CDD:214542 7/40 (18%)
EGF_CA 2206..2245 CDD:214542 5/38 (13%)
EGF_CA 2287..2319 CDD:429571 3/31 (10%)
TB 2387..2430 CDD:459903 2/3 (67%)
EGF_CA 2442..2475 CDD:214542
FXa_inhibition 2488..2523 CDD:464251
EGF_CA 2525..2554 CDD:429571
EGF_CA 2564..2606 CDD:214542
EGF_CA 2607..>2636 CDD:214542
Cadherin_repeat <2821..>2868 CDD:206637
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.