DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment CG40470 and Anpep

DIOPT Version :10

Sequence 1:NP_001036635.1 Gene:CG40470 / 3355093 FlyBaseID:FBgn0058470 Length:941 Species:Drosophila melanogaster
Sequence 2:NP_032512.2 Gene:Anpep / 16790 MGIID:5000466 Length:966 Species:Mus musculus


Alignment Length:1012 Identity:241/1012 - (23%)
Similarity:430/1012 - (42%) Gaps:153/1012 - (15%)


- Green bases have known domain annotations that are detailed below.


  Fly    16 MASG---TRTLFILTILL---------------------------TAPGLEAFVNDFSSLHV--- 47
            ||.|   ::||.||.|||                           |||.|....:..::...   
Mouse     1 MAKGFYISKTLGILGILLGVAAVCTIIALSVVYAQEKNRNAENSATAPTLPGSTSATTATTTPAV 65

  Fly    48 -----ISEVRLPKEVLPLSYEVLIEPHMDNQN-----FEGSIRMHLRWIGDSKKVYFHAH---DT 99
                 .::.||||.::|.||.|::.|::...|     |:|           :..|.|..:   |.
Mouse    66 DESKPWNQYRLPKTLIPDSYRVILRPYLTPNNQGLYIFQG-----------NSTVRFTCNQTTDV 119

  Fly   100 LLIDVSQINLT-------TLNMGDGT----LDKNVIILRGVRLPRKPVFVLYLKDKIKKGSECLL 153
            ::|...::|.|       .|...|||    :||..::      .|....|::|:..:.:|.:..:
Mouse   120 IIIHSKKLNYTLKGNHRVVLRTLDGTPAPNIDKTELV------ERTEYLVVHLQGSLVEGRQYEM 178

  Fly   154 DIYFQGNISETEEGLFRSYYTNSGNDGEEIYLATNLKPNNARRLFPCFDEPGIKVPFNVSIARPK 218
            |..|||.:::...|.:||.|..  .|.:::...|.::..:||:.|||||||.:|..||:::..|.
Mouse   179 DSQFQGELADDLAGFYRSEYME--GDVKKVVATTQMQAADARKSFPCFDEPAMKAMFNITLIYPN 241

  Fly   219 GYITLFNTPLHNTINHPKLRSYSLDFFHTTAPMSTHAFGFVILKLHMWNEHKIVK--SSDIPAIN 281
            ..|.|.|.....:..:|:..|.::..||:|..|||:...:::      :|.|.:.  |::...|.
Mouse   242 NLIALSNMLPKESKPYPEDPSCTMTEFHSTPKMSTYLLAYIV------SEFKNISSVSANGVQIG 300

  Fly   282 IWSNNLSSTNLLDIQNKLNVAHTTIQHF---FNIPLPLTKLDVIAIPSL--------ATLPFISA 335
            ||:.. |:.:.......|||....:..|   :|...||.|.|.||:|..        ..:.:..:
Mouse   301 IWARP-SAIDEGQGDYALNVTGPILNFFAQHYNTSYPLPKSDQIALPDFNAGAMENWGLVTYRES 364

  Fly   336 SGILIARESEILKKD--VFEISRELIYQWIGIWITPEWWTDANVNKALISFIA------SEIVFE 392
            |.:..::.|.|..|:  |..|:.||.:||.|..:|..||.|..:|:...|::.      :|..:.
Mouse   365 SLVFDSQSSSISNKERVVTVIAHELAHQWFGNLVTVAWWNDLWLNEGFASYVEYLGADYAEPTWN 429

  Fly   393 INGGIEFNGKYPMTILYSLYYELSKRYPNSHI---TGIKHEFASI---KVQLIIRMLSLTVGKYT 451
            :...:..|..|.:..:.:|........|...|   ..|...|.||   |...:|||||..:.:..
Mouse   430 LKDLMVLNDVYRVMAVDALASSHPLSSPADEIKTPDQIMELFDSITYSKGASVIRMLSSFLTEDL 494

  Fly   452 FRLGIQSFICDYKFKTYKSSDFWNAITTQAKADNSLDSDLSILSIAESWLEHSRLPLVTIIRDYD 516
            |:.|:.|::..|::......|.|..:........::....::.:|.:.|:.....|::|:    :
Mouse   495 FKKGLSSYLHTYQYSNTVYLDLWEHLQKAVNQQTAVQPPATVRTIMDRWILQMGFPVITV----N 555

  Fly   517 SETAIVQQKVYLRERLHDV--PDQDNMLWWIPIALKRQDSLSFVNTG--SFKWMNKTRQMLISNL 577
            :.|..:.||.:|.:...:|  |.:.|.:|..||        .|:.:|  ...|::..:.......
Mouse   556 TNTGEISQKHFLLDSKSNVTRPSEFNYIWIAPI--------PFLKSGQEDHYWLDVEKNQSAKFQ 612

  Fly   578 PSKNMFIIVNEEEIGPFPVNYDDNNWNMLSKYLRTEEKRESIPVYTRAKLLHDAWNLAYAGELNF 642
            .|.|.:|::|....|.:.||||:|||..|...|:|:  ...|||..||:::||::|||.|..:..
Mouse   613 TSSNEWILLNINVTGYYLVNYDENNWKKLQNQLQTD--LSVIPVINRAQIIHDSFNLASAKMIPI 675

  Fly   643 STALNVTLFLKYERNHIVWSPVFTFLDQVGKRLEKSSINKKFELYIIELLAPLYEYLGTAHFN-- 705
            :.||:.||||..|..::.|....:.|:......::|.:....:.|:.:.:.||:.|......|  
Mouse   676 TLALDNTLFLVKEAEYMPWQAALSSLNYFTLMFDRSEVYGPMKRYLKKQVTPLFFYFQNRTNNWV 740

  Fly   706 -------EDINITELRKLTTSFLCKAGYFPCFKEARRAFNIWINSSFPNFETPVPN---EYICSI 760
                   |..|  |:..::|:  |.:|...|.......::.|:.:  ||..|..||   ...|:.
Mouse   741 NRPPTLMEQYN--EINAISTA--CSSGLKECRDLVVELYSQWMKN--PNNNTIHPNLRSTVYCNA 799

  Fly   761 FKWGSMKEWMFGLDRLCEFPKSRIQSDRTHLLKMLAGCPAQRDKIFILLELAILKNISIFSDTDK 825
            ..:|..:||.|..:   :|..:.:.::...|...||   ..:|...:...|:...|.......|.
Mouse   800 IAFGGEEEWNFAWE---QFRNATLVNEADKLRSALA---CSKDVWILNRYLSYTLNPDYIRKQDT 858

  Fly   826 MLIISTVTSRSIGYTTLLDFLSNNWDDIHHKFYNNTNIWTKLISSATGMFSTQEGYDLVKKF-YD 889
            ...|.::.|...|:..:.||:.:||..:...:...:..:..||...|..||::.....:::| .|
Mouse   859 TSTIISIASNVAGHPLVWDFVRSNWKKLFENYGGGSFSFANLIQGVTRRFSSEFELQQLEQFKAD 923

  Fly   890 EHYGHFGRAQHIIEKSLRNIKEGIQWSKQNIPVIEEW 926
            .....||.....:|::|...:..|.|.|:|...:.:|
Mouse   924 NSATGFGTGTRALEQALEKTRANIDWVKENKDAVFKW 960

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
CG40470NP_001036635.1 GluZincin 60..502 CDD:472708 117/487 (24%)
ERAP1_C 583..910 CDD:463368 83/339 (24%)
AnpepNP_032512.2 Cytosolic Ser/Thr-rich junction 33..68 4/34 (12%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 42..64 4/21 (19%)
Metalloprotease 69..966 227/944 (24%)
M1_APN-Q_like 84..545 CDD:341064 117/486 (24%)
ERAP1_C 618..945 CDD:463368 83/340 (24%)

Return to query results.
Submit another query.