DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG6052 and Abca4

DIOPT Version :10

Sequence 1:NP_649002.3 Gene:CG6052 / 39971 FlyBaseID:FBgn0036747 Length:1725 Species:Drosophila melanogaster
Sequence 2:NP_031404.1 Gene:Abca4 / 11304 MGIID:109424 Length:2310 Species:Mus musculus


Alignment Length:1977 Identity:567/1977 - (28%)
Similarity:874/1977 - (44%) Gaps:483/1977 - (24%)


- Green bases have known domain annotations that are detailed below.


  Fly    67 DEIRYPTVSISDLNLYMRMVPLGNRILHENG-SLNIPRNFLCYTPN----NPINS----AIVGAT 122
            |.:::|||.           ...||.|.|.| :.....||....|.    :.:.|    .|...|
Mouse   453 DTLQHPTVK-----------DFINRQLGEEGITTEAVLNFFSNGPQEKQADDMTSFDWRDIFNIT 506

  Fly   123 AIRLRLLGT----------RAYDTALHMEQDMVL----HNFLAGVQFEDNENVNTNDAGYPLNLN 173
            ...|||...          .:||..:.:.|..:.    :.|.|||.|   ..:....:..|.::.
Mouse   507 DRFLRLANQYLECLVLDKFESYDDEVQLTQRALSLLEENRFWAGVVF---PGMYPWASSLPPHVK 568

  Fly   174 YSLRFPSELRTMQGPIIDTWRTSRLFLSYDTSGSRNRLDNDGGVPV---GYIREGFLPIQHALTM 235
            |.:|       |...:::  :|:::...|..||.|       ..||   .||..||..:|.    
Mouse   569 YKIR-------MDIDVVE--KTNKIKDRYWDSGPR-------ADPVEDFRYIWGGFAYLQD---- 613

  Fly   236 SWLALASGVTDTGIPA-----IHLQRFPYRAYTYDQLLSGLRQLLPFVILLSFIYPASTVTKYVT 295
               .:..|:..:.:.|     ::||:.||..:..|..:..|.:..|..::|::||..|...|.:.
Mouse   614 ---MVEQGIVKSQMQAEPPIGVYLQQMPYPCFVDDSFMIILNRCFPIFMVLAWIYSVSMTVKGIV 675

  Fly   296 SEKELQLKEIMKLIGVHNWLHWVAWFVKSYIMLMLVVFLIMSLIMVKFYASVAVLTFSSWVPVLL 360
            .||||:|||.:|..||.|.:.|..||:.|:.::.|.:||:...||     ...:|.:|....:.|
Mouse   676 LEKELRLKETLKNQGVSNAVIWCTWFLDSFSIMALSIFLLTLFIM-----HGRILHYSDPFILFL 735

  Fly   361 FLHTYVVTSVCLCFMLAVLFSKASTASAVAAIFWFLTYIPYSFGYYYYERLSLMSKLLISLIFSN 425
            ||..:...::...|:|:.||||||.|:|.:.:.:|..|:|:...:.:.:|::...|..:||: |:
Mouse   736 FLLAFATATIMQSFLLSTLFSKASLAAACSGVIYFTLYLPHVLCFAWQDRMTADLKTTVSLL-SS 799

  Fly   426 SALGFGIHVIVMWEGTGEGITWRNMFHPVSTDDSLTLFYIIMTMSFGSIMFISICLYVEQVFPGE 490
            .|.|||...:|.:|..|.|:.|.|:.......|..:....:..|...:.::..:..|::|||||:
Mouse   800 VAFGFGTEYLVRFEEQGLGLQWSNIGKSPLEGDEFSFLLSMKMMLLDAALYGLLAWYLDQVFPGD 864

  Fly   491 YGVPRRWNFMCHKNYWRQYVPSLNIVPSFQTILHGSAKAKSCRRA-------------------- 535
            ||.|..|.|:..::||                |.|...:....||                    
Mouse   865 YGTPLPWYFLLQESYW----------------LGGEGCSTREERALEKTEPLTEEMEDPEHPEGM 913

  Fly   536 ------REV-----GIQLFNLQKNY---GKLKAVKGISLKMHRNEITVLLGHNGAGKTTTINMIT 586
                  ||:     |:.:.||.|.:   |: .||..:::..:.|:||..|||||||||||::::|
Mouse   914 NDSFFERELPGLVPGVCVKNLVKVFEPSGR-PAVDRLNITFYENQITAFLGHNGAGKTTTLSILT 977

  Fly   587 GIVKPTSGTAIVNGYDIRTHLAKARESLGICPQNNILFKEMSVRDHIIFFSKLKGIRGTKAVENE 651
            |::.|||||.::.|.||.|:|...|:|||:|||:||||..::|.:||:|:::||| |..:..:.|
Mouse   978 GLLPPTSGTVLIGGKDIETNLDVVRQSLGMCPQHNILFHHLTVAEHILFYAQLKG-RSWEEAQLE 1041

  Fly   652 VGKYMTMLKLQDKSYVAAKNLSGGMKRKLSLCCALCGNAKVVLCDEPSSGIDAAGRRSLWDLLQS 716
            :...:....|..|....|::|||||:||||:..|..|::|||:.|||:||:|...|||:||||..
Mouse  1042 MEAMLEDTGLHHKRNEEAQDLSGGMQRKLSVAIAFVGDSKVVVLDEPTSGVDPYSRRSIWDLLLK 1106

  Fly   717 EKDGRTILLTTHYMDEADVLGDRIAILSEGKLQCQGTSFYLKKRFGTGYLLVCI--------MQS 773
            .:.||||:::||:|||||:|||||||:|:|:|.|.||..:||..||||:.|..:        .:.
Mouse  1107 YRSGRTIIMSTHHMDEADLLGDRIAIISQGRLYCSGTPLFLKNCFGTGFYLTLVRKMKNIQSQRG 1171

  Fly   774 GC------------------------------DVGAVTQLIRKYVPPIKPERVLGTELTYRLPTE 808
            ||                              ||..:..|:..:||..|....:|.||.:.||.:
Mouse  1172 GCEGVCSCTSKGFSTRCPTRVDEITEEQVLDGDVQELMDLVYHHVPEAKLVECIGQELIFLLPNK 1236

  Fly   809 YSKK--FAELLQDLDEKCAQLQLVGYGLSGATLEDVFMAVNTD-------------KRVQGGAEG 858
            ..|:  :|.|.::|:|..|.|.|..:|:|...||::|:.|..|             ||.|.|...
Mouse  1237 NFKQRAYASLFRELEETLADLGLSSFGISDTPLEEIFLKVTEDAGAGSMFVGGAQQKREQAGLRH 1301

  Fly   859 P---------------------------------PVDGSIDFKELVFDSKTREKRRIRRCFMFWQ 890
            |                                 |.|..:.|           ....|......|
Mouse  1302 PCSAPTEKLRQYAQAPHTCSPGQVDPPKGQPSPEPEDPGVPF-----------NTGARLILQHVQ 1355

  Fly   891 ALFLKKFYTTTRNYWLLGIQLVLPIAVMALTILNSRGGRIYYELPAMPISINQYSSAYV------ 949
            ||.:|:|:.|.|:......|:|||...:.|.::.|.....:.|.||:.:....|...|.      
Mouse  1356 ALLVKRFHHTIRSRKDFVAQIVLPATFVFLALMLSIIVPPFGEFPALTLHPWMYGHQYTFFSMDE 1420

  Fly   950 -------VLED------------------------NTTD-KTSSLADAYSKHL--------EHYA 974
                   ||.|                        |.|. ||.|::...: ||        .|.:
Mouse  1421 PNNEHLEVLADVLLNRPGFGNRCLKEEWLPEYPCINATSWKTPSVSPNIT-HLFQKQKWTAAHPS 1484

  Fly   975 RRC--------TLL--------------RT-------GDL---KFEDYIL-------------SH 994
            ..|        |:|              ||       .||   ...||::             ..
Mouse  1485 PSCKCSTREKLTMLPECPEGAGGLPPPQRTQRSTEVLQDLTNRNISDYLVKTYPALIRSSLKSKF 1549

  Fly   995 DVNHSR-----------------RIDFHFLAGLTVSEN-------------------------NF 1017
            .||..|                 .....||:||....|                         |.
Mouse  1550 WVNEQRYGGISIGGKLPAIPISGEALVGFLSGLGQMMNVSGGPVTREASKEMLDFLKHLETTDNI 1614

  Fly  1018 IVWLNNKPLHTAPLTLNLLHNALAIKLLGQDASTYVTNEPLPYSDD--TRTLRLNKGQV------ 1074
            .||.|||..|.....||:.|||:....|.:|      .:|..|...  ::.|.|.|.|:      
Mouse  1615 KVWFNNKGWHALVSFLNVAHNAILRASLPRD------RDPEEYGITVISQPLNLTKEQLSDITVL 1673

  Fly  1075 ---LGAEISINLSLTMCFITAFYAIPIIRERETRAKLLQFLSGVDVCAYWTSHIVWDYLVFVLSA 1136
               :.|.::|.:...|.|:.|.:.:.:|:||.|:||.|||:|||....||.::.:||.:.:.:||
Mouse  1674 TTSVDAVVAICVIFAMSFVPASFVLYLIQERVTKAKHLQFISGVSPTTYWLTNFLWDIMNYAVSA 1738

  Fly  1137 LSSILTIAAFKEIGYITPLDLSRYFYMLLIFGFPGIMLSYAASGCFSDAATGFTRISIINTLMGT 1201
            ...:.....|::..|.:|.:|.....:|:::|:..|.:.|.||..|...:|.:..:|..|..:|.
Mouse  1739 GLVVGIFIGFQKKAYTSPDNLPALVSLLMLYGWAVIPMMYPASFLFEVPSTAYVALSCANLFIGI 1803

  Fly  1202 GLFLMFMTL-----NFEAFQLKDVAEKLAWYFRLSPHYSLASSTHSIHIGYNIRRGCSIGGIRKL 1261
            ....:...|     |....:...:..||...|   ||:.|......:.:...:            
Mouse  1804 NSSAITFVLELFENNRTLLRFNAMLRKLLIVF---PHFCLGRGLIDLALSQAV------------ 1853

  Fly  1262 PKQLRCRNVPICCDIPGYYG---------WRKPGVLVEITYMIMLGSTLFL--LIVMHDAKVCNL 1315
                        .|:...:|         |...|  ..:..|.:.|...||  |::.|...:...
Mouse  1854 ------------TDVYAQFGEEYSANPFQWDLIG--KNLVAMAIEGVVYFLLTLLIQHHFFLTRW 1904

  Fly  1316 IAEKLGNCFSKRKRVEGGTSIENDSVVAEQRVVREMINSGRKDVPLLVYKISKRY--RSKLAVKA 1378
            |||      ..|:.|     .:.|..|||:|  :.:::.|.|...|.:.:::|.|  .|..||..
Mouse  1905 IAE------PAREPV-----FDEDDDVAEER--QRVMSGGNKTDILKLNELTKVYSGSSSPAVDR 1956

  Fly  1379 ISFHVPHAECFGLLGINGAGKTSTFKMLAGDEKITSGEAYIDGTNI--STHKVYRKIGYCPQFDA 1441
            :...|...|||||||:||||||:|||||.||..:|||:|.:.|.:|  |...|::.:||||||||
Mouse  1957 LCVGVRPGECFGLLGVNGAGKTTTFKMLTGDTTVTSGDATVAGKSILTSISDVHQNMGYCPQFDA 2021

  Fly  1442 LFEDLTGRETLNIYCLLRGVQRRHVTPIC-WGLAISFGFAKHMDKQTKHYSGGNRRKLSTAISVL 1505
            :.:.|||||.|.:|..||||..:.:..:. ||:. |.|.:.:.|:....|||||:|||||||::.
Mouse  2022 IDDLLTGREHLYLYARLRGVPSKEIEKVANWGIQ-SLGLSLYADRLAGTYSGGNKRKLSTAIALT 2085

  Fly  1506 GNPSVLYLDEPTSGMDPAARRQLWQIIGLIRTAGKSIVLTSHSMDECEALCSRLAIMVDGEFKCL 1570
            |.|.:|.|||||:||||.|||.||..|..|...|:::|||||||:||||||:||||||.|.|:||
Mouse  2086 GCPPLLLLDEPTTGMDPQARRMLWNTIVSIIREGRAVVLTSHSMEECEALCTRLAIMVKGTFQCL 2150

  Fly  1571 GSVQSLKNQFSKGLILKVKVKHKKKTFQRVVEDSSSSNDKKSISETDLKFLQMASVMESSQADRI 1635
            |::|.||.:|..|.|:.:|:|..|                                      |.:
Mouse  2151 GTIQHLKYKFGDGYIVTMKIKSPK--------------------------------------DDL 2177

  Fly  1636 L----KVNRFISKEIPDAELKEEYNGLITYYIPHSKTLSKIFQLLETNSHKLNIEDYLIMQTRLE 1696
            |    .|.:|.....|.:..:|.::.::.:.:| |.:|::|||||.::...|.||:|.:.||.|:
Mouse  2178 LPDLNPVEQFFQGNFPGSVQRERHHSMLQFQVP-SSSLARIFQLLISHKDSLLIEEYSVTQTTLD 2241

  Fly  1697 EIFLDFASKR-DSSDI-FTKRIYSCSW 1721
            ::|::||.:: ::.|: ...|....||
Mouse  2242 QVFVNFAKQQTETYDLPLHPRAAGASW 2268

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG6052NP_649002.3 rim_protein 21..>54 CDD:130324
rim_protein <149..1709 CDD:130324 542/1857 (29%)
Abca4NP_031404.1 rim_protein 1..2271 CDD:130324 567/1977 (29%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 891..910 2/18 (11%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 1311..1344 3/43 (7%)
Essential for ATP binding and ATPase activity. /evidence=ECO:0000250|UniProtKB:P78363 2243..2248 1/4 (25%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 2266..2310 2/3 (67%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.