DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG43896 and Fbn2

DIOPT Version :10

Sequence 1:NP_001261732.1 Gene:CG43896 / 39360 FlyBaseID:FBgn0264488 Length:2113 Species:Drosophila melanogaster
Sequence 2:NP_114014.2 Gene:Fbn2 / 689008 RGDID:620910 Length:2907 Species:Rattus norvegicus


Alignment Length:2848 Identity:585/2848 - (20%)
Similarity:842/2848 - (29%) Gaps:1105/2848 - (38%)


- Green bases have known domain annotations that are detailed below.


  Fly    38 PKPLNCSSYY---RCTAKN----AVRTV---TCA-----------------PGKEYNPKN-GKCT 74
            |....|||..   .|.||:    :||.:   |||                 |..|...:| |:|.
  Rat   127 PNMCTCSSGQISPTCGAKSIQQCSVRCMNGGTCADDHCQCQKGYIGTYCGQPVCENGCQNGGRCI 191

  Fly    75 MAGRSLCKLSLLAPLAE---ATNVCSTEVN---------GAYIANSGSCGEF-----YICDEQIA 122
            ...|..|......|..|   .|..|.|:||         |.....:..|...     :.|:...|
  Rat   192 GPNRCACVYGFTGPQCERDYRTGPCFTQVNNQMCQGQLTGIVCTKTLCCATIGRAWGHPCEMCPA 256

  Fly   123 YPQKCDLGSFFNETLAAC--------IPDANSTCWQNLCINKTNGVFVENEANCGSYY------- 172
            .||.|..|...|....||        ||   ..|....||| |.|.|   |..|.:.:       
  Rat   257 QPQPCRRGFIPNIRTGACQDVDECQAIP---GLCQGGNCIN-TVGSF---ECRCPAGHKQSETTQ 314

  Fly   173 -------------VCSNGEAT------LQTCPQGSFFNTSVAACTVDQGNSQCW---VNYCIGQD 215
                         :|..|:.:      ...||:|...:|..:.| :||....|:   ||....|:
  Rat   315 KCEDIDECSVVPGICETGDCSNTVGSYFCLCPRGFVTSTDGSRC-IDQRTGTCFSGLVNGRCAQE 378

  Fly   216 DGSAVADKSNCSLFYVCSNNTATAQEC----------------------------PEGSYFESNN 252
            ....:|....|.....|.......:.|                            |.||...||.
  Rat   379 LPGRMAKAQCCCEPGRCWGIGTIPEACPVRGSEEYRRLCLDGLPMGGIPGSSVSRPGGSGGNSNG 443

  Fly   253 WG----------------------------------------------------CVPGTC--TTE 263
            :|                                                    |:.|.|  |..
  Rat   444 YGPGGTGFLPIPGGNGFSPGVGGAGVGAGGQGPIITGLTILNQTIDICKHHANLCLNGRCIPTVS 508

  Fly   264 S---------------PCDDSTTTTTESCAE-ETTEPPAS--CDCGDIKNADF--IPDEENCRKY 308
            |               .|.|....|:..|:. :....|.|  |.|    :|.|  .|.::.|...
  Rat   509 SYRCECNMGYKQDANGDCIDVDECTSNPCSHGDCVNTPGSYYCKC----HAGFQRTPTKQACIDI 569

  Fly   309 FICI-DGVLVAADCGKGNVFNAN---LSVCEVDADNTCCVADCTDGEAKVDPQDCTKYFKCQSG- 368
            ..|| :|||    |..|...|.:   ..:|....:.|      ||||..||..:||....|.:| 
  Rat   570 DECIQNGVL----CKNGRCVNTDGRFQCICNAGFELT------TDGENCVDHDECTTTNMCLNGM 624

  Fly   369 ---DWTSVSC---------DSGSYFNETLNCCQVD-------------VNN-------------- 394
               :..|..|         .:|.|      |..||             :||              
  Rat   625 CINEDGSFKCVCKPGFVLAPNGRY------CTDVDECQTPGICMNGHCINNEGSFRCDCPPGLAV 683

  Fly   395 -----VCIDAKSNST---QI-------PTTSTVETSSVDKCNAKDP---------PASGKNCWTY 435
                 ||:|....||   :|       |....|..|   :|...:|         |...||...:
  Rat   684 GVDGRVCVDTHMRSTCYGEIKKGVCVRPFPGAVTKS---ECCCANPDYGFGEPCQPCPAKNSAEF 745

  Fly   436 QHCISGQ---WEDGTCPNNTYFDASV---GICREDTENV-----CPENRSSGSRQKRSVEDC--- 486
            ....||.   ..||...|....|..:   |||    ||:     |  |.:||.....|..:|   
  Rat   746 HGLCSGGVGITVDGRDINECALDPDICANGIC----ENLRGSYRC--NCNSGYEPDASGRNCIDI 804

  Fly   487 --------TCEGGIAQGTIIGHSTDCDKYLI-------------CENGQLVEGV----------- 519
                    .|:.|:.:.|...:|..|....:             ||:...|.|.           
  Rat   805 DECLVNRLLCDNGLCRNTPGSYSCTCPPGYVFRTETETCEDVNECESNPCVNGACRNNLGSFHCE 869

  Fly   520 CGVGNVFQKSSGICVPDTKATCWV------CSNKPNGYQMADPTDCTSYL-TCWNGLATK----H 573
            |..|:....:..||:...|.|||:      |....||..:  .::|.:.| |.|.....:    .
  Rat   870 CSPGSKLSSTGLICIDSLKGTCWLNIQDNRCEVNINGATL--KSECCATLGTAWGSPCERCELDA 932

  Fly   574 TCGSGEWYNGDGNCVIDVNAKCINPCSCGNGNVAHPICTN---YFQCTDGVPQVKQCVVGEAFDS 635
            .|..| :....|....|||...:.|..|.||.     |.|   .|.|        :|..|...|.
  Rat   933 ACPRG-FARIKGVTCEDVNECEVFPGVCPNGR-----CVNNKGSFHC--------ECPEGLTLDG 983

  Fly   636 ATGQCSTTVECSAKNCATA--SDGTTYPVAGETGQFYVCLSNEATIQPCPVNTGYSAALGICLDQ 698
            ....|   ::...::|...  .|...:||.|:......|         |.|...:...   |.:.
  Rat   984 TGRVC---LDVRMEHCFLKWDEDECVHPVPGKFRMDACC---------CAVGAAWGTE---CEEC 1033

  Fly   699 PSPDCD--QTIC-------NSAAVDSTYP--SNDNDSSTF---CL---CRDS-GAYIQSCPTGLF 745
            |.|...  :|:|       |...:.:..|  .:.|:...|   |.   ||:: |::...|.:|..
  Rat  1034 PKPGTKEYETLCPRGPGFANRGDILTGRPFYKDINECKAFPGMCTYGKCRNTIGSFKCRCNSGFA 1098

  Fly   746 YDATDQVCTFSGNCDPQVCNDQSEYFVSPD-------YEDPNSF--------------------- 782
            .|..::.||           |..|..:|||       ...|.||                     
  Rat  1099 LDMEERNCT-----------DIDECRISPDLCGSGICVNTPGSFDCECFEGYESGFMMMKNCMDI 1152

  Fly   783 --C-----LCRAGEPITV------SCPIGYTFNTEELECVLI---PLPDPRC----CAN-----G 822
              |     |||.|..:..      .||:|:..:....:|:.|   .|.|..|    |.|     .
  Rat  1153 DECERNPLLCRGGTCVNTEGSFQCDCPLGHELSPSREDCIDINECSLSDNLCRNGKCVNMIGTYQ 1217

  Fly   823 CSGKTDFSTFPTIEGTDGFCLCVDDVPKYISCP-ENSQYDLELGACTATEEETTCPVNACTVGI- 885
            ||....:...|..:|    |..:|:      |. .|...|.:   ||.:|....|   :|:.|. 
  Rat  1218 CSCNPGYQATPDRQG----CSDIDE------CMIMNGGCDTQ---CTNSEGSYEC---SCSEGYA 1266

  Fly   886 -------------CESALDFNTFPVSSDPSGFCYCLDSCPIYETCLDDLVYNEDLKRCTEPEDPS 937
                         ||:..|.......::..|...||        |.|..:.:.|::.|       
  Rat  1267 LMPDGRSCADIDECENNPDICDGGQCTNIPGEYRCL--------CYDGFMASMDMRTC------- 1316

  Fly   938 VVSRCDAEKCDSIPDFTPYPSINGTDG--FCYCNGGLPVFRECEENQEFDPAVGFCRTVVDSCVT 1000
                .|..:||..|:...:.....|.|  .|:|..|..|.:         .|.| | |.||.|..
  Rat  1317 ----IDVNECDLNPNICMFGECENTKGSFICHCQLGYSVKK---------GATG-C-TDVDECEI 1366

  Fly  1001 ALCDANQCATLDNYVPFPTNTASDDDGAF-CYCKDGEV-------ILEQCSPEMVFSADLGRCTV 1057
            ...:.:..|:..| ||          |:| |.|::|.|       .|::|      |....:|::
  Rat  1367 GAHNCDMHASCLN-VP----------GSFKCSCREGWVGNGIKCIDLDEC------SNGTHQCSI 1414

  Fly  1058 QAAPECVCAPG--KCGSSDGYTTYEALNTNEGF-CLCVSGTPVFTECLNGETYNSILGACFSSTA 1119
            .|  :||..||  :|..|:|:       |.:|| |..|.      ||  .|..|           
  Rat  1415 NA--QCVNTPGSYRCACSEGF-------TGDGFTCSDVD------EC--AENIN----------- 1451

  Fly  1120 KIIASGLCDDRRCHS-----RAVVDATFAAVNTTSGYCSCQDVGVSTFIN--------------- 1164
                  ||::.:|.:     |...:..|...:.:.   ||||:...:|.|               
  Rat  1452 ------LCENGQCLNVPGAYRCECEMGFTPASDSR---SCQDIDECSFQNICVFGTCNNLPGMFH 1507

  Fly  1165 --CADEHVYEESAGTCV-VAAC-DYTSCLNRVAFEPFEAKNTTEGFCSCDGVPYFHQCQTG-HVF 1224
              |.|.:..:.:.|.|. :..| |..:|:|.:..       .|.|...|:..|.|....|| ...
  Rat  1508 CICDDGYELDRTGGNCTDIDECADPINCVNGLCV-------NTPGRYECNCPPDFQLNPTGVGCV 1565

  Fly  1225 DKAQGICLLK-----DTMAMISCNLRECLKRSQFEPFAAENTNSGFCS--------CDDLQRVSV 1276
            |...|.|.||     |  ..:|||.         |.....:.:|..||        |:....|:.
  Rat  1566 DNRVGNCYLKFGPRGD--GSLSCNT---------EVGVGVSRSSCCCSLGKAWGNPCETCPPVNS 1619

  Fly  1277 T--YHPCGEGEIFAPE---------------FGMC-------TSHSIQKRSVEAEENICISNEMR 1317
            |  |..|..||.|.|.               .|:|       |..|.|   .|..:...:|.|.|
  Rat  1620 TEYYTLCPGGEGFRPNPITIILEDIDECQELPGLCQGGNCINTFGSFQ---CECPQGYYLSEETR 1681

  Fly  1318 SV--------------PANC------------SQY------EMCIDGR-------WRRRTCSDQR 1343
            ..              |..|            .:|      ..|:|.|       :...||.::.
  Rat  1682 ICEDIDECFAHPGVCGPGTCYNTLGNYTCICPPEYMQVNGGHNCMDMRKSFCYRSYNGTTCENEL 1746

  Fly  1344 YYNPEQQRCLEPRDDLVCAY----------------------ARVSNIPTCT---------AISE 1377
            .:|..::.|       .|.|                      ....|||..|         .|.|
  Rat  1747 PFNVTKRMC-------CCTYNVGKAWNKPCEPCPTPGTADFKTICGNIPGFTFDIHTGKAVDIDE 1804

  Fly  1378 SMTIP--AKSGNCMQY---FRCSGGKWRLRNCPKQHYYSPLIGSCLPF----------------- 1420
            ...||  ..:|.|:..   |||        .||....|:.|:..|...                 
  Rat  1805 CKEIPGICANGVCINQIGSFRC--------ECPTGFSYNDLLLVCEDIDECSNGDNLCQRNADCI 1861

  Fly  1421 -----------------PRGHEQDRNQ-----TICSWAMS-KLAGN------------------- 1443
                             |.|...|||:     .:||..:. .|.|:                   
  Rat  1862 NSPGSYRCECAAGFKLSPNGACVDRNECLEIPNVCSHGLCVDLQGSYQCICNNGFKASQDQTMCM 1926

  Fly  1444 SSDDCQHLAVRPSLAGGCQ----SYLM-------------CLD----NSWWLHQCPLGMYFSREH 1487
            ..|:|:.   .|...|.|:    ||..             |||    :|::...|..|..|:...
  Rat  1927 DVDECER---HPCGNGTCKNTVGSYNCLCYPGFELTHNNDCLDIDECSSFFGQVCRNGRCFNEIG 1988

  Fly  1488 NY-CLPN-------------DAGQCI-LPAERSGNCSNGESRSVVGSCH-----SYEVCSDNGQW 1532
            :: ||.|             |..:|: ||    |:||.|..:::.||..     .|||.|:|...
  Rat  1989 SFKCLCNEGYELTPDGKNCIDTNECVALP----GSCSPGTCQNLEGSFRCICPPGYEVKSENCID 2049

  Fly  1533 IRRRCQEREQFEPMLGCVPSDGSCQD--NGMRRICREGELQAQFGNNC---SQGFLYCEAEEWHL 1592
            | ..|.|    :|.: |:  .|||.:  .|.:.||..|.:.:..|..|   .|.|.:...|.   
  Rat  2050 I-NECDE----DPNI-CL--FGSCTNTPGGFQCICPPGFVLSDNGRRCFDTRQSFCFTNFEN--- 2103

  Fly  1593 GSCLKGHSFARNRNKCQLQSQ--------CQLQIRDLPAENQCLGQIDGLSVPDPTDCTRFYLCL 1649
            |.|....:|...:.||.....        |:|..:|.....|.|......:||...| ||     
  Rat  2104 GKCSVPKAFNTTKAKCCCSKMPGEGWGDPCELCPKDDEVAFQDLCPYGHGTVPSLHD-TR----- 2162

  Fly  1650 QQVPTILQS---CSSGSFFDSNQGYCRPNDGS--CQLAICNGLE-DGKLVAHPEDCRSYYSCSSQ 1708
            :.|...|:|   ||:|.        |...|||  |:..:...|: .|......::|.....|.  
  Rat  2163 EDVNECLESPGICSNGQ--------CINTDGSFRCECPMGYNLDYSGVRCVDTDECSIGNPCG-- 2217

  Fly  1709 NGTSL-------VQCDEGQYFHSLLSICRVDHGQCRKVSNQDETETAPRLCYGLHGVKLPHELYC 1766
            |||..       ..|:||           .:.|......:.:|....|.||          ...|
  Rat  2218 NGTCANVIGSFECNCNEG-----------FEPGPMMNCEDINECAQNPLLC----------AFRC 2261

  Fly  1767 -NLY--YACVKGLAIPVECPVQHQFNPVLSIC---EPESQAVQPC-SNGQLDGNVSYVYRC---- 1820
             |.:  |.|.        |||.:.......:|   :..::.:..| |.|.:..|:...:.|    
  Rat  2262 MNTFGSYECT--------CPVGYALREDQKMCKDLDECAEGLHDCESRGMMCKNLIGTFMCICPP 2318

  Fly  1821 --GNLQDGTFLANRTDC-TRYFICAGG----VATAQRC---------AAGT-FFDSEQLLCLAD- 1867
              ....||....:..:| |:..||..|    :..:.||         ::|| ..|:.|.||.|: 
  Rat  2319 GMARRPDGEGCVDENECRTKPGICENGRCVNIIGSYRCECNEGFQSSSSGTECLDNRQGLCFAEV 2383

  Fly  1868 -DGSCPLVESVPDDDDNPNNQHVPPDPVVCEGKHGY-----LMPDPANCNNFYLCVSG------- 1919
             ...|.:..|        :...|......|:|..|:     |.|.|.......:|..|       
  Rat  2384 LQTMCQMASS--------SRNLVTKSECCCDGGRGWGHQCELCPLPGTAQYKKICPHGPGYATDG 2440

  Fly  1920 ------KLRHELC-------------------YTDNFFNAT---LQQCQPYEIASDGNNQTESPL 1956
                  |:...||                   ||.:.....   |.:|          :|:..| 
  Rat  2441 RDIDECKVMPSLCTNGQCVNTMGSFRCFCKVGYTTDISGTACVDLDEC----------SQSPKP- 2494

  Fly  1957 QSRQIDLGGHEKAVTNGICKDAPTSFAGIC-------------------------------GVIG 1990
                          .|.|||:...|:...|                               ..:|
  Rat  2495 --------------CNFICKNTEGSYQCSCPRGYVLQEDGKTCKDLDECQTKQHNCQFLCVNTLG 2545

  Fly  1991 NGASVAEQGDCRRYTSCEDDEPVSQRCRNGESFDSLLGICRQSDG--TCLLENGERVGVCNGKHG 2053
            ........|..:.:|:|.|    :..|.:..|.....|||:.:.|  :|..:.|..:..    .|
  Rat  2546 GFTCKCPPGFTQHHTACID----NNECGSQPSLCGAKGICQNTPGSFSCECQRGFSLDA----SG 2602

  Fly  2054 QLARDADNCRGYFTCVHG-QQIDG----ECAQGEFFNRLTNCCEVDALQQCKS 2101
            ....|.|.|.|...|.|| |.|.|    .|.||...:...|.| ||. .:|.:
  Rat  2603 LNCEDVDECDGNHRCQHGCQNILGGYRCGCPQGYVQHYQWNQC-VDE-NECSN 2653

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG43896NP_001261732.1 CBM_14 96..147 CDD:426342 17/72 (24%)
CBM_14 153..206 CDD:426342 17/78 (22%)
CBM_14 211..257 CDD:426342 12/125 (10%)
CBM_14 294..343 CDD:426342 14/54 (26%)
CBM_14 354..395 CDD:426342 15/85 (18%)
CBM_14 427..468 CDD:426342 12/46 (26%)
ChtBD2 488..535 CDD:214696 13/70 (19%)
CBM_14 544..593 CDD:426342 12/53 (23%)
CBM_14 <1320..1361 CDD:426342 10/65 (15%)
ChtBD2 1627..1675 CDD:214696 13/50 (26%)
CBM_14 1684..1733 CDD:426342 10/56 (18%)
ChtBD2 1752..1797 CDD:214696 9/50 (18%)
CBM_14 1820..1868 CDD:426342 17/70 (24%)
CBM_14 1896..1939 CDD:426342 14/82 (17%)
ChtBD2 2046..2091 CDD:214696 15/49 (31%)
Fbn2NP_114014.2 Fibrillin_U_N 76..112 CDD:436338
TB 224..>257 CDD:459903 3/32 (9%)
EGF_CA 276..>306 CDD:214542 11/36 (31%)
TB 373..417 CDD:459903 5/43 (12%)
EGF_CA 528..558 CDD:214542 8/33 (24%)
EGF_CA 568..609 CDD:214542 14/50 (28%)
EGF_CA 610..650 CDD:214542 9/45 (20%)
EGF_CA 651..691 CDD:214542 5/39 (13%)
TB 706..749 CDD:459903 8/45 (18%)
EGF_CA 761..802 CDD:214542 12/46 (26%)
EGF_CA 803..838 CDD:238011 5/34 (15%)
TB 898..>928 CDD:459903 7/31 (23%)
EGF_CA 948..978 CDD:429571 12/42 (29%)
TB 1003..1045 CDD:459903 11/53 (21%)
EGF_CA 1108..1142 CDD:214542 8/33 (24%)
EGF_CA 1151..1183 CDD:214542 8/31 (26%)
EGF_CA 1193..1225 CDD:214542 8/31 (26%)
FXa_inhibition 1239..1274 CDD:464251 9/40 (23%)
EGF_3 1364..1399 CDD:463759 11/45 (24%)
EGF_3 1405..1440 CDD:463759 14/49 (29%)
EGF_CA 1484..1519 CDD:214542 5/34 (15%)
EGF_CA 1525..1556 CDD:214542 8/37 (22%)
TB 1586..1626 CDD:459903 10/48 (21%)
EGF_CA 1643..1675 CDD:214542 6/34 (18%)
TB 1741..1784 CDD:459903 6/49 (12%)
EGF_CA 1801..1833 CDD:214542 11/39 (28%)
EGF_CA 1843..>1875 CDD:214542 0/31 (0%)
vWFA <1880..1922 CDD:469594 8/41 (20%)
EGF_CA 1927..1961 CDD:214542 7/36 (19%)
EGF_CA 1966..2008 CDD:214542 8/41 (20%)
EGF_CA 2009..2041 CDD:214542 10/35 (29%)
EGF_CA 2049..2089 CDD:214542 13/47 (28%)
TB 2104..2148 CDD:459903 9/43 (21%)
EGF_CA 2164..2205 CDD:214542 13/48 (27%)
EGF_CA 2206..2245 CDD:214542 9/51 (18%)
EGF_CA 2287..2319 CDD:429571 5/31 (16%)
TB 2387..2430 CDD:459903 9/50 (18%)
EGF_CA 2442..2475 CDD:214542 4/32 (13%)
FXa_inhibition 2488..2523 CDD:464251 9/59 (15%)
EGF_CA 2525..2554 CDD:429571 1/28 (4%)
EGF_CA 2564..2606 CDD:214542 10/49 (20%)
EGF_CA 2607..>2636 CDD:214542 12/28 (43%)
Cadherin_repeat <2821..>2868 CDD:206637
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.