DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment Megf8 and Atrn

DIOPT Version :10

Sequence 1:NP_609180.3 Gene:Megf8 / 34099 FlyBaseID:FBgn0031981 Length:2892 Species:Drosophila melanogaster
Sequence 2:NP_112641.1 Gene:Atrn / 83526 RGDID:69063 Length:1432 Species:Rattus norvegicus


Alignment Length:1822 Identity:377/1822 - (20%)
Similarity:563/1822 - (30%) Gaps:740/1822 - (40%)


- Green bases have known domain annotations that are detailed below.


  Fly  1280 SEGEKCERCRQGSYGNATAPHGCLPCECNGHGNQDLGVCNVSNGECYCKDNTQGLNCELCAPGYY 1344
            :|.::|:|                || .||      |.||...|:|.|.....|..|:.|     
  Rat    99 AEAKECDR----------------PC-VNG------GRCNPGTGQCVCPTGWVGEQCQHC----- 135

  Fly  1345 GDPRGGGKCYYQCESRGILTNIGKSAIGSYQSYRSPWGASLEVKECLWIL--QPKTLQAEKSLLQ 1407
                 ||: :....|.|.:|:    ..|:|: |::         :|.|::  ||      ..:::
  Rat   136 -----GGR-FRLTGSSGFVTD----GPGNYK-YKT---------KCTWLIEGQP------NKIMR 174

  Fly  1408 LEFQWQSLAMDCDENAVYIYDSLPDLTGATQQNQLLAVVCAPYSSPRIIEARSSHVT---VHYKQ 1469
            |.|  ...|.:|..:.:|:||.          :.:.|.:.|.:|. .|:..|..:.|   |....
  Rat   175 LRF--NHFATECSWDHLYVYDG----------DSIYAPLIAAFSG-LIVPERDGNETAPEVTVTS 226

  Fly  1470 GSERRHF---------GFNALYSV----MNCVA-GSCISPHICDAQQRCVCPAGYVGASCEIEIC 1520
            |....||         |||..|:.    .||.. |.|.|.: ..:...|.|...:.|.||:|..|
  Rat   227 GYALLHFFSDAAYNLTGFNITYNFDMCPNNCSGRGECKSSN-SSSTVECECSENWKGESCDIPHC 290

  Fly  1521 PSNCNAKRMQGFCDTEYGR-CICSNANYAGADCGTLVQRNHLVMTELFNTQLLSESLEHLRKTIP 1584
            ..||.... :|.|:....| |.|. .::.|..|...|..|....|.        |....|:  :|
  Rat   291 TDNCGFPH-RGICNASDTRGCSCF-PHWQGPGCSIPVPANQSFWTR--------EEYSDLK--LP 343

  Fly  1585 RFGHS--VNADRRGSLWMFGGYSPNHGPLNDFRQFDTKNSTWLQV--TVESSTPEDRMPLGRYFH 1645
            |..|.  ||.:   .:|:.|||..||...:....:|..:..||.:  :|.|       .:.||.|
  Rat   344 RASHKAVVNGN---IMWVVGGYMFNHSDYSMVLAYDLASREWLSLNHSVNS-------VVVRYGH 398

  Fly  1646 ASEIYVKKQIIYIYGG-IGANSQLLNDFWMFSIQNQRWSQIKVEVEPPEAD-YEVDVPPPLAGHT 1708
            :..::  |..||:||| |.:...:.|:..:|.|.|:.|    |.:.|...| |.|      .||:
  Rat   399 SLALH--KDKIYMYGGKIDSTGNVTNELRVFHIHNESW----VLLTPKAKDQYAV------VGHS 451

  Fly  1709 LTHIRYQEHESLILL---------GGLSLNKSRPLELWEFNLDTGRWQQLAAVGARMPVLYGHTS 1764
             .||.......:::|         |.:|:       :.|::|:...|..|...||.:...|||:|
  Rat   452 -AHIVTLSSGRVVMLVIFGHCPLYGYISV-------VQEYDLEKNTWSILQTQGALVQGGYGHSS 508

  Fly  1765 VYHQETNSVYLFGGYST------EPQSNLYALDLQKLSWTELPSFRELNSPASLLPRARYFHSAV 1823
            ||...|.::|:.|||..      ....:||...:....||.|...|..          ||.|:||
  Rat   509 VYDHRTKALYVHGGYKAFSANKYRLADDLYRYHVDTQMWTILKDSRFF----------RYLHTAV 563

  Fly  1824 TTEHYMILYGGRTQPFNGTDV----------LIAYVYACNQW-----VRLTEDVELIGRVPASSY 1873
            .....|:::||.|.  |.|.:          .:||..||::|     ..|..||...|.   |:.
  Rat   564 IVSGTMLVFGGNTH--NDTSMSHGAKCFSSDFMAYDIACDRWSVLPRPELHHDVNRFGH---SAV 623

  Fly  1874 AEDMAIDPDTGAIYVIGGWDG--------------SSTHSHVTKITLPDDICQLWSNGKYQCRHY 1924
            ..:       ..:||.||::.              .:..|....:.....|..||.....:|..:
  Rat   624 LHN-------STMYVFGGFNSLLLSDVLVFTSEQCDAHRSEAACVAAGPGIRCLWDTQSSRCTSW 681

  Fly  1925 MGCSYCTIQNTYSYSSHCFSHGRTPCANHNGTLVVNNGAACDDDWMASRNCSSFATCGACLAAWP 1989
               ...|.:......|.|||. ||                .|.|     .|.....|.:|.|  .
  Rat   682 ---ELATEEQAEKLKSECFSK-RT----------------LDHD-----RCDQHTDCYSCTA--N 719

  Fly  1990 THQEVAPVCHWCDDCGIRGRCVPAGVDCGRRSAWCNKELSVGVLGLCPL--PQCY---QLSCESC 2049
            |:.     ||||:|     .|||....|      ...::|:.....||.  |..|   :.||.||
  Rat   720 TND-----CHWCND-----HCVPVNHSC------TEGQISIAKYDNCPKDNPMYYCNKKTSCRSC 768

  Fly  2050 MLQPQCNW-ARNELGTVECI-----------------------AKELVEK--------------- 2075
            .|...|.| .||:    |||                       |||..:.               
  Rat   769 ALDQNCQWEPRNQ----ECIALPENICGIGWHLVGNSCLKITTAKENYDNAKLSCRNHNAFLASL 829

  Fly  2076 ----------NQYRVVES----------------------------------------------- 2083
                      .|.|:::|                                               
  Rat   830 TSQKKVEFVLKQLRLMQSSQSTSKLTLTPWVGLRKINVSYWCWEDMSPFTNSLLQWMPSEPSDAG 894

  Fly  2084 -------------------------------------CPLPCHTYENCSLCLSQTPTQDHQECKW 2111
                                                 |..||.....|..|     |....||.|
  Rat   895 FCGILSEPSTRGLKAATCINPLNGSVCERPANHSAKQCRTPCALRTACGEC-----TSSSSECMW 954

  Fly  2112 STMLNLCLTPSSQPLLCAGGVCGLVLEASELQRC-PEPCHVYTQCSSCLEHAHCGWCAREGFNGD 2175
            .:.:..|:..::.......|.|   :|...:..| ||.|..|..||.|||...||||......|.
  Rat   955 CSNMKQCVDSNAYVASFPFGQC---MEWYTMSSCPPENCSGYCTCSHCLEQPGCGWCTDPSNTGK 1016

  Fly  2176 GICTEGALEHKQEHPSGSTCDLIYAS-WRNDSQLTHADVVSWHYVQCPAENECINGHHNCDTVSE 2239
            |.|.||:.:...:.||.::...:|.. ..|.|........:|.::.|||                
  Rat  1017 GKCIEGSYKGPVKMPSHASTGNVYPQPLLNSSMCLEDSRYNWSFIHCPA---------------- 1065

  Fly  2240 QCIDLDTAVGYKCVCAQGYREEQGACLPVCSQGCVRGNCVSPDQCQCDFGYVGANCSIQCLCNGH 2304
                                                                       |.||||
  Rat  1066 -----------------------------------------------------------CQCNGH 1071

  Fly  2305 SNCESSSRLDICLKCHNNTMGEQCEKCQPLFVGNPREGHACQPCLDYCHGHSDVCVAYDADPAVF 2369
            |.|.:.|   ||.||.:.|.|:.||.|...|.|:|..|..||                       
  Rat  1072 SKCINQS---ICEKCEDLTTGKHCETCISGFYGDPTNGGKCQ----------------------- 1110

  Fly  2370 NMTRSELERILQEGPAYNATCLRCGNHTAGDRCDSCLTGYFRGSEDLHKECRPCQCHGHGNICDP 2434
                                                                ||:|:||.::|:.
  Rat  1111 ----------------------------------------------------PCKCNGHASLCNT 1123

  Fly  2435 VTGEKCNCANNTESDATCTAGGGKNSAQLCWMVQCSKCRDSYAGNPTDGHQCYKQITVESRMCFD 2499
            .|| ||.|.         |.|......|||      :..:.|.|||..| .||..:.::.:..|.
  Rat  1124 NTG-KCFCT---------TKGVKGEECQLC------EVENRYQGNPLKG-TCYYTLLIDYQFTFS 1171

  Fly  2500 AKPIEECKSKPAALKPGQTVFFVIQPRFMNVDIRIIIDVTQG-ELDVFMSPQ--------DDSFI 2555
            ....::        :....:.||..|...|.|:.:.|:.::. .|::..:..        ::..:
  Rat  1172 LSQEDD--------RYYTAINFVATPDEQNRDLDMFINASKNFNLNITWATSFPAGTQTGEEVPV 1228

  Fly  2556 VETNETTGYHEIFLDNRYNWGPKIKREHPLNVALPRHDNVTIQKLFSPERRIGGGGLGGGGGERI 2620
            |.......|.:.|.:.::::     |.||         |:|                        
  Rat  1229 VSKTNIKEYKDSFSNEKFDF-----RNHP---------NIT------------------------ 1255

  Fly  2621 GANTYYVPQLQDCKSHGGHNFIVKDQHAKDLSTHVTLNHCNTLLRLFGLKNRLVLTLPQHAHNLS 2685
                                                                             
  Rat  1256 ----------------------------------------------------------------- 1255

  Fly  2686 ATRFFIALRASSGPEPSYGSVVFRQDQLHIDLFVFFSVFFSCFFLFLAVCVIVWKVKQAADLRRA 2750
               ||:.:  |:...|....:.|.|....:||..||..|||||...|.|..:|||:||:....|.
  Rat  1256 ---FFVYV--SNFTWPIKIQIAFSQHSNFMDLVQFFVTFFSCFLSLLLVAAVVWKIKQSCWASRR 1315

  Fly  2751 RRQHVVEMLHLAKRPFAQIFLASNGLDMDSPQPTSSSSSARAMRQRARQALLLQEQSAGDSQSVM 2815
            |.|.:.||..:|.||||.:.:|   |:.|...|.....|.:.:.:                    
  Rat  1316 REQLLREMQQMASRPFASVNVA---LETDEEPPDLIGGSIKTVPK-------------------- 1357

  Fly  2816 HHSTRRQSSRIMMVAIEPTFDNLAAVGTVFISL---------PGRSRAPLSIALGSTLISYSRQY 2871
                        .:|:||.|.|.|||.:||:.|         ||:|    .:|:.|.|:..|:|.
  Rat  1358 ------------PIALEPCFGNKAAVLSVFVRLPRGLGGIPPPGQS----GLAVASALVDISQQM 1406

  Fly  2872 PL 2873
            |:
  Rat  1407 PI 1408

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
Megf8NP_609180.3 CUB 33..141 CDD:238001
NanM <210..455 CDD:442289
KELCH repeat 228..268 CDD:276965
KELCH repeat 370..420 CDD:276965
KELCH repeat 425..476 CDD:276965
NanM 431..>630 CDD:442289
KELCH repeat 480..540 CDD:276965
EGF_3 1186..1222 CDD:463759
EGF_Lam 1257..1302 CDD:238012 3/21 (14%)
EGF_Lam 1304..1354 CDD:238012 15/49 (31%)
KELCH repeat 1585..1638 CDD:276965 15/56 (27%)
NanM 1596..1892 CDD:442289 85/329 (26%)
KELCH repeat 1642..1685 CDD:276965 15/43 (35%)
KELCH repeat 1760..1813 CDD:276965 17/58 (29%)
KELCH repeat 1817..1867 CDD:276965 19/64 (30%)
EGF_Lam 2299..2346 CDD:238012 21/46 (46%)
EGF_Lam 2422..2487 CDD:238012 21/64 (33%)
AtrnNP_112641.1 CUB 135..250 CDD:238001 34/158 (22%)
NanM 332..637 CDD:442289 94/366 (26%)
KELCH repeat 344..385 CDD:276965 13/43 (30%)
Kelch 1. /evidence=ECO:0000255 355..405 14/58 (24%)
KELCH repeat 395..447 CDD:276965 19/57 (33%)
Kelch 2. /evidence=ECO:0000255 406..454 18/58 (31%)
KELCH repeat 450..493 CDD:276965 9/50 (18%)
Kelch 3. /evidence=ECO:0000255 464..511 14/53 (26%)
KELCH repeat 504..553 CDD:276965 16/48 (33%)
Kelch 4. /evidence=ECO:0000255 516..567 15/60 (25%)
KELCH repeat 557..612 CDD:276965 16/56 (29%)
Kelch 5. /evidence=ECO:0000255 569..627 16/69 (23%)
Kelch 6. /evidence=ECO:0000255 628..674 8/45 (18%)
PSI 650..696 CDD:214655 7/48 (15%)
CLECT 791..923 CDD:470576 6/131 (5%)
PSI 935..986 CDD:396154 12/58 (21%)
EGF_Lam 1065..1110 CDD:238012 22/122 (18%)
EGF_Lam 1111..1156 CDD:238012 20/60 (33%)
Blue background indicates that the domain is not in the aligned region.

Return to query results.
Submit another query.