DRSC/TRiP Functional Genomics Resources

Protein Alignment CG41520 and Tnr

DIOPT Version: 10

Sequence 1:NP_001104019.1 Gene:CG41520 / 5740294 FlyBaseID:FBgn0087011 Length:745 Species:Drosophila melanogaster
Sequence 2:NP_037177.2 Gene:Tnr / 25567 RGDID:3886 Length:1358 Species:Rattus norvegicus


Alignment Length:699 Identity:168/699 - (24%)
Similarity:273/699 - (39%) Gaps:202/699 - (28%)
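The summary percentages above can be rechecked from the raw counts. The following is a minimal sketch, not part of DIOPT: the helper name `parse_stats` and the single-line `summary` string are assumptions made for illustration. It recomputes each ratio so the displayed values can be compared with the unrounded ones; note that 202/699 is about 28.9%, so the displayed 28% for gaps appears to be truncated rather than rounded.

```python
import re

def parse_stats(text):
    """Extract 'Name:count/total' pairs from an alignment summary string.

    Hypothetical helper for illustration; it is not a DIOPT function.
    """
    stats = {}
    for name, count, total in re.findall(r"(\w+):(\d+)/(\d+)", text):
        stats[name] = (int(count), int(total))
    return stats

# Summary line copied from the alignment header, joined onto one line.
summary = (
    "Alignment Length:699 Identity:168/699 - (24%) "
    "Similarity:273/699 - (39%) Gaps:202/699 - (28%)"
)

stats = parse_stats(summary)

# Unrounded percentages, for comparison with the values shown on the page.
percentages = {name: 100 * count / total for name, (count, total) in stats.items()}

for name, pct in percentages.items():
    print(f"{name}: {pct:.1f}%")
```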


- Green residues have known domain annotations that are detailed below.


  Fly   201 KVQLDNNLLHVEDFNAEASEKKPVTINVNDVTKA--------LSYVAMTQVTSELSELRDSTD-- 255
            :.:||:    ..|....||.:..:::   ..|||        :::...:.::||::..||.|.  
  Rat   683 RTELDS----PRDLMVTASSETSISL---IWTKASGPIDHYRITFTPSSGISSEVTVPRDRTSYT 740

  Fly   256 --NIDKKLQFYINIASENIGKMTSM------------MHEIHCAVVHHNNQFKFLNVTTHSKSNN 306
              :::...::.|:|.:|. |:..|:            :..:|.:.|..::    :|:|....|..
  Rat   741 LTDLEPGAEYIISITAER-GRQQSLESTVDAFTGFRPISHLHFSHVTSSS----VNITWSDPSPP 800

  Fly   307 INKLVKEIRPMGVVSEQNDNVWDVGVDTKSTDGHLLSKSTAPLSQ----------TQRQERAIDE 361
            .::|:....|    .::.:.:.:|.:|  :|..|.:.....|.::          |...|..:..
  Rat   801 ADRLILNYSP----RDEEEEMMEVLLD--ATKRHAVLMGLQPATEYIVNLVAVHGTVTSEPIVGS 859

  Fly   362 IHHDKKPKSYLNINN----------------FDMVK-----KQFEKQENFKPPEEISE---TRIK 402
            |.....|...:.|:|                ||..:     .|..:.::...|..::|   ||:.
  Rat   860 ITTGIDPPKNITISNVTKDSLTVSWSPPVAPFDYYRVSYRPTQVGRLDSSVVPNTVTEFTITRLY 924

  Fly   403 EALEYE--SKNYTDTDENSRTEILL---------LSLSNIQPT------------IVNKTITTPF 444
            .|.|||  ..:....:|:.|...|:         |..:||.||            :.|..|....
  Rat   925 PATEYEISLNSVRGREESERICTLVHTAMDSPMDLIATNITPTEALLQWKAPMGEVENYVIVLTH 989

  Fly   445 TSTPGDGDVLGPLHSE--SLD-SNRTHY-------NGPAVNGLKSANRKDRIIFPSIKKKPAFLN 499
            .:..|:..::..:..|  .:| ..||||       :||.|:|..:.|      |.::...||.|.
  Rat   990 FAMAGETILVDGVSEEFQLVDLLPRTHYTVTMYATSGPLVSGTIATN------FSTLLDPPANLT 1048

  Fly   500 TT-----------------ISNDFWALKDVKG--------------------------------- 514
            .:                 |.|.....|...|                                 
  Rat  1049 ASEVTRQSALISWQPPRAAIENYVLTYKSTDGSRKELIVDAEDTWIRLEGLSENTDYTVLLQAAQ 1113

  Fly   515 ------------------FS----CVDILNAGMKQSGVFYLQIRGTTYWFLKVYCDQETTDGGWT 557
                              ||    |...|..|...|||:.:.:.|.....|:||||..|..|||.
  Rat  1114 EATRSSLTSTIFTTGGRVFSHPQDCAQHLMNGDTLSGVYTIFLNGELSHKLQVYCDMTTDGGGWI 1178

  Fly   558 VIQRRDDFKDSRENFNRDWADYKNGFGEPSKDFWLGNENIYMLTNNEEYSLRVELEDFEGNKR-Y 621
            |.|||   ::.:.:|.|.||||:.|||....:||||.:||:.:|....|.|||::.|  |.:. :
  Rat  1179 VFQRR---QNGQTDFFRKWADYRVGFGNLEDEFWLGLDNIHRITAQGRYELRVDMRD--GQEAVF 1238

  Fly   622 AQYSHFKIHSEADYYKLEIDGYEGNAGDSLNDPWYGSNNSPFSTYNKDNDRSSLNCASMLKGGWW 686
            |.|..|.:......|||.|.||.|.|||||:   | ....||||.::|||.:..|||...||.||
  Rat  1239 AYYDKFAVEDSRSLYKLRIGGYNGTAGDSLS---Y-HQGRPFSTEDRDNDVAVTNCAMSYKGAWW 1299

  Fly   687 WKSCGR-GLNGLYLHDPQDITARQGIVWFRWRGWDYTLKKSKMMIRPRI 734
            :|:|.| .|||.|    .:....|||.|:.|:|.::::...:|.:||.|
  Rat  1300 YKNCHRTNLNGKY----GESRHSQGINWYHWKGHEFSIPFVEMKMRPYI 1344

Known Domains:


Indicated by green residues in the alignment.

Gene     Sequence        Domain           Region      External ID  Identity
CG41520  NP_001104019.1  FReD             516..733    CDD:238040   88/222 (40%)
Tnr      NP_037177.2     EGF_Tenascin     203..231    CDD:376143
                         C_rich_MXAN6577  207..317    CDD:469225
                         EGF_Tenascin     235..261    CDD:480934
                         FN3              355..934    CDD:442628   47/268 (18%)
                         fn3              954..1026   CDD:394996   14/71 (20%)
                         FN3              1042..1127  CDD:238020   7/84 (8%)
                         FReD             1134..1342  CDD:238040   86/220 (39%)
Blue background indicates that the domain is not in the aligned region.
