DRSC/TRiP Functional Genomics Resources

powered by:
logo

back to: DIOPT - Ortholog Prediction Tool / DIOPT for Diseases and Traits


Protein Alignment U4-U6-60K and Prpf4

DIOPT Version :10

Sequence 1:NP_648990.1 Gene:U4-U6-60K / 39955 FlyBaseID:FBgn0036733 Length:553 Species:Drosophila melanogaster
Sequence 2:NP_081573.1 Gene:Prpf4 / 70052 MGIID:1917302 Length:521 Species:Mus musculus


Alignment Length:548 Identity:276/548 - (50%)
Similarity:367/548 - (66%) Gaps:48/548 - (8%)


- Green bases have known domain annotations that are detailed below.


  Fly     4 DDDIQYIKRQRTLHYGSLEESERKRQNAAASGAAATTTSGTTASSGAGTTTTGTGGQLEDIDSDE 68
            ||.:..:.::..::||||||.||:|.....||...  ..|..|...||.....:|          
Mouse    17 DDLVAPVVKKPHIYYGSLEEKERERLAKGESGILG--KEGLKAGIEAGNINITSG---------- 69

  Fly    69 DYEESTKKTSNAKQAGAPPPTAATLANKIDDDYFDLEMEMERDKVALLEEFERKKRARQINVSTD 133
                                           :.|::|..:...:..:|.||||:|||||||||||
Mouse    70 -------------------------------EVFEIEEHISERQAEVLAEFERRKRARQINVSTD 103

  Fly   134 DTEIKSNLRQLNEPICYFGEGPAERRRRLKELLAGLGENAINKRQYEDEERKQQQREQDQATWYH 198
            |:|:|:.||.|.|||..|||||||||.||:.:|:.:|.:|:.|.: :|:|:.::.:|:.|.||||
Mouse   104 DSEVKACLRALGEPITLFGEGPAERRERLRNILSVVGTDALKKTK-KDDEKSKKSKEEYQQTWYH 167

  Fly   199 EGPDSLRIARLWLADYSLPRAKDRLVRAREALEVPSAARAGRMVEMQKKLQSLAPLCSQVGDTRP 263
            |||:||::||||:|:||||||..||..||...|:|...|..:|.|:.|.|:||...|||:||.||
Mouse   168 EGPNSLKVARLWIANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRP 232

  Fly   264 VSSAAFSEDSSLLLTSSWSGLCKLWSVPDCELKQTLRGHASYVGGVALRP--GVKADEENVVAMA 326
            :|...||.:|.:|.|:.|||||||||||||.|..|||||.:.||.:...|  .|..|::: |.:|
Mouse   233 ISYCHFSPNSKMLATACWSGLCKLWSVPDCSLLHTLRGHNTNVGAIVFHPKSTVSLDQKD-VNLA 296

  Fly   327 SGGHDGAVKLWGFNNEESIADITGHMPHRVSKVAFHPSGRFLATACYDSSWRLWDLEQKTEVLHQ 391
            |...||:||||..:::|.:|||.||.. ||::|.:|||||||.|.|||.||||||||.:.|:|||
Mouse   297 SCAADGSVKLWSLDSDEPVADIEGHTV-RVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILHQ 360

  Fly   392 EGHAKPVHCLSYHSDGSVLVTGGLDAFGRVWDLRTGRCIMFLEGHLGAVFGVDFSPNGFHIATGS 456
            |||:..|:.:::|.|||:..||||||||||||||||||||||||||..::|::|||||:||||||
Mouse   361 EGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGYHIATGS 425

  Fly   457 QDNTCKIWDLRRRQPVYTIPAHTNLISDVKYQQECGSFLVTCSYDSTTKIWSNKTWQPLKTLQGH 521
            .|||||:||||:|:.|||||||.||::.||::...|.||:|.:||:|.|||::..|.|||||.||
Mouse   426 GDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGDFLLTGAYDNTAKIWTHPGWSPLKTLAGH 490

  Fly   522 DNKVISVDIAPNSQYIATTSFDRTFKLW 549
            :.||:.:||:.:.|.|||.|:|||||||
Mouse   491 EGKVMGLDISSDGQLIATCSYDRTFKLW 518

Known Domains:


Indicated by green bases in alignment.

GeneSequenceDomainRegion External IDIdentity
U4-U6-60KNP_648990.1 CCDC66 <113..>194 CDD:434558 43/80 (54%)
SFM 132..>166 CDD:128776 22/33 (67%)
WD40 repeat 264..301 CDD:293791 22/36 (61%)
WD40 <286..553 CDD:441893 161/266 (61%)
WD40 repeat 357..393 CDD:293791 24/35 (69%)
WD40 repeat 398..434 CDD:293791 26/35 (74%)
WD40 repeat 441..476 CDD:293791 25/34 (74%)
WD40 repeat 482..519 CDD:293791 17/36 (47%)
WD40 repeat 525..551 CDD:293791 15/25 (60%)
Prpf4NP_081573.1 SFM 102..140 CDD:128776 23/37 (62%)
WD40 <228..518 CDD:441893 174/291 (60%)
WD 1 228..267 24/38 (63%)
WD40 repeat 233..270 CDD:293791 22/36 (61%)
WD 2 270..317 17/47 (36%)
WD40 repeat 284..320 CDD:293791 15/36 (42%)
WD 3 320..359 25/39 (64%)
WD40 repeat 325..361 CDD:293791 24/35 (69%)
WD 4 362..401 26/38 (68%)
WD40 repeat 368..403 CDD:293791 25/34 (74%)
WD 5 404..443 26/38 (68%)
WD40 repeat 409..443 CDD:293791 23/33 (70%)
WD 6 446..486 19/39 (49%)
WD40 repeat 451..488 CDD:293791 17/36 (47%)
WD 7 489..521 18/30 (60%)
WD40 repeat 494..518 CDD:293791 13/23 (57%)

Return to query results.
Submit another query.