DRSC/TRiP Functional Genomics Resources

Protein Alignment htl and Flt4

DIOPT Version: 10

Sequence 1: NP_524394.2  Gene: htl / 42160  FlyBaseID: FBgn0010389  Length: 729  Species: Drosophila melanogaster
Sequence 2: NP_446104.2  Gene: Flt4 / 114110  RGDID: 621737  Length: 1363  Species: Rattus norvegicus


Alignment Length: 834  Identity: 229/834 (27%)
Similarity: 339/834 (40%)  Gaps: 254/834 (30%)
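
The summary percentages can be rechecked from the raw counts. The sketch below is only an illustration in Python; the function name `percent` is hypothetical and nothing here is taken from DIOPT itself. It shows that the three header values match integer truncation (339/834 is about 40.6%, displayed as 40%), while some rows in the domain table further down (for example 26/105, displayed as 25%) match ordinary rounding. This is an observation about the displayed numbers, not a statement about how DIOPT computes them.

```python
def percent(numerator, denominator):
    """Return numerator/denominator as a percentage (float)."""
    return 100.0 * numerator / denominator

# Counts copied from the alignment summary above.
summary = {
    "Identity": (229, 834),
    "Similarity": (339, 834),
    "Gaps": (254, 834),
}

for name, (count, length) in summary.items():
    value = percent(count, length)
    # int() truncates toward zero; round() rounds to the nearest integer.
    print(f"{name}: {count}/{length} = {value:.2f}% "
          f"(truncated {int(value)}%, rounded {round(value)}%)")
```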


- Residues shown in green on the original page have known domain annotations, which are detailed below.


  Fly     1 MAAAWSWRASHSTITMTSGSLVVLFLLLSIWQP----AVQVEGRRQMANSQEMIKDHLGARSQ-- 59
            ::..|.||.                     |.|    |.:...|||..:.....:|.....:|  
  Rat   454 LSVQWHWRP---------------------WTPCKTFAQRSLRRRQPRDGMPQCRDWKEVTTQDA 497

  Fly    60 -----------------NKTPA--ITNNANQSSTSSADLDDGAADDDD----NKADLPVNVSSKP 101
                             |||.:  :..:||.|:.....:.:....|:.    ....:|...|   
  Rat   498 VNPIESLDTWTESVEGKNKTVSKLVIQDANVSAMYKCVVFNKVGQDERLIYFYVTTIPDGFS--- 559

  Fly   102 YWRNPKKMSFLQTRPS-----GSLLTLNCHALGNPEPNITWYR--NGTVDWTRGYGSL------- 152
                      :::.||     |..:.|:|.|......::.|||  ..|:...:|...|       
  Rat   560 ----------IESEPSEDPLEGQSVRLSCRADNYTYEHLRWYRLNLSTLHDAQGNPLLLDCKNVH 614

  Fly   153 -----------------KRNRWTLTMEDLVPGDCGNYTCKVCNSLGCIRHDTQVIVSDRVN---- 196
                             :....:|.:..:.|.|.|:|.|:               |.||.|    
  Rat   615 LFATPLEANLEEAEPGARHATLSLNIPRVAPEDEGDYVCE---------------VQDRRNQDKH 664

  Fly   197 -HKPILMTGPL-------NLT-LVVNSTGS--MHCKYLSDLTSKKAWIFVPCHGMTNCSNNRSI- 249
             ||..|....|       ||| |:||.:.|  |.|...........|.          .:.|.: 
  Rat   665 CHKKYLSVQALEAPRLTQNLTDLLVNVSDSLEMRCPVAGAHVPSIVWY----------KDERLLE 719

  Fly   250 ------IAEDKDQLDFVNVRMEQEGWYTCVESNSLGQSNSTAYLRVVRSLHVLEAGVASGSLHST 308
                  :|:...:|....||.|..|.|.|...|:.|..||:|.:.|       |.....||:.  
  Rat   720 KESGIDLADSNQRLSIQRVREEDAGRYLCSVCNAKGCVNSSASVAV-------EGSEDKGSME-- 775

  Fly   309 SFVYIFVFGGLI--FIFMTTLFVFYAIRKMKHEKVLKQRIETVHQWTKKVIIFKPEGGGDSSGSM 371
              :.|.:..|:|  |.::..|.:|..:::..|..:....:..:                     |
  Rat   776 --IVILIGTGVIAVFFWVLLLLIFCNMKRPAHADIKTGYLSII---------------------M 817

  Fly   372 DTMIMPVVRIQKQRTTVLQNGNEPAPFNEYEFPLDSNWELPRSHLVLGATLGEGAFGRVVMAE-- 434
            |...:|:              .|...:..|:.   |.||.||..|.||..||.||||:||.|.  
  Rat   818 DPGEVPL--------------EEQCEYLSYDV---SQWEFPRERLHLGRVLGHGAFGKVVEASAF 865

  Fly   435 -VNNA----IVAVKMVKEGHTDDDIASLVREMEVMKIIGRHINIINLLGCCSQ-NGPLYVIVEYA 493
             :|..    .|||||:|||.|..:..:|:.|::::..||.|:|::||||.|:: ||||.||||:.
  Rat   866 GINKGSSCDTVAVKMLKEGATASEHRALMSELKILIHIGNHLNVVNLLGACTKPNGPLMVIVEFC 930

  Fly   494 PHGNLKDFLYKNR----PF----------------GRDQDR------------------DSSQPP 520
            .:|||.:||...|    |:                |...||                  .|::..
  Rat   931 KYGNLSNFLRVKRETFDPYAEKSPEQRRRFRAMVEGAKADRRRLGSTDRALFTRFLMGKGSARRA 995

  Fly   521 P----------SPPAHVITEKDLIKFAHQIARGMDYLASRRCIHRDLAARNVLVSDDYVLKIADF 575
            |          ||    :|.:||:.::.|:|.||::||||:||||||||||:|:|:..::||.||
  Rat   996 PFVQEAEDLWLSP----LTMEDLVCYSFQVALGMEFLASRKCIHRDLAARNILLSESDIVKICDF 1056

  Fly   576 GLARDI-QSTDYYRKNTNGRLPIKWMAPESLQEKFYDSKSDVWSYGILLWEIMTYGQQPYPTIMS 639
            |||||| :..||.||. :.|||:|||||||:.:|.|.::|||||:|:|||||.:.|..|||.:..
  Rat  1057 GLARDIYKDPDYVRKG-SARLPLKWMAPESIFDKVYTTQSDVWSFGVLLWEIFSLGASPYPGVQI 1120

  Fly   640 AEELYTYLMSGQRMEKPAKCSMNIYILMRQCWHFNADDRPPFTEIVEYMDKLLQ 693
            .||....|..|.||..|...:..|..:|:.||..:...||.|:::||.:..|||
  Rat  1121 NEEFCQRLKDGTRMRAPELATPAIRHIMQSCWSGDPKARPAFSDLVEILGDLLQ 1174

Known Domains:


Indicated by green residues in the alignment on the original page.

Gene  Sequence     Domain                                             Region      External ID Identity
htl   NP_524394.2  Ig                                                 101..191    CDD:472250  18/120 (15%)
                     Ig strand B                                      121..125    CDD:409353  1/3 (33%)
                     Ig strand C                                      134..138    CDD:409353  0/3 (0%)
                     Ig strand E                                      157..161    CDD:409353  1/3 (33%)
                     Ig strand F                                      171..176    CDD:409353  2/4 (50%)
                     Ig strand G                                      184..187    CDD:409353  0/2 (0%)
                   Ig                                                 200..289    CDD:472250  26/105 (25%)
                     Ig strand B                                      216..220    CDD:409353  2/5 (40%)
                     Ig strand C                                      229..233    CDD:409353  0/3 (0%)
                     Ig strand E                                      255..259    CDD:409353  1/3 (33%)
                     Ig strand F                                      269..274    CDD:409353  2/4 (50%)
                     Ig strand G                                      282..285    CDD:409353  2/2 (100%)
                   Protein Kinases, catalytic domain                  404..692    CDD:473864  140/344 (41%)
Flt4  NP_446104.2  Ig_2                                               137..224    CDD:464026
                   IgI_VEGFR                                          230..329    CDD:409448
                     Ig strand A                                      230..234    CDD:409448
                     Ig strand A'                                     240..244    CDD:409448
                     Ig strand B                                      246..255    CDD:409448
                     Ig strand C                                      262..267    CDD:409448
                     Ig strand C'                                     270..272    CDD:409448
                     Ig strand D                                      276..284    CDD:409448
                     Ig strand E                                      289..297    CDD:409448
                     Ig strand F                                      307..313    CDD:409448
                     Ig strand G                                      318..324    CDD:409448
                   Ig                                                 331..418    CDD:472250
                     Ig strand B                                      352..356    CDD:409353
                     Ig strand C                                      365..369    CDD:409353
                     Ig strand E                                      382..386    CDD:409353
                     Ig strand F                                      396..401    CDD:409353
                     Ig strand G                                      411..414    CDD:409353
                   IG_like                                            570..663    CDD:214653  20/107 (19%)
                     Ig strand B                                      574..578    CDD:409353  1/3 (33%)
                     Ig strand C                                      587..591    CDD:409353  0/3 (0%)
                     Ig strand E                                      636..640    CDD:409353  1/3 (33%)
                     Ig strand F                                      650..655    CDD:409353  2/19 (11%)
                   Ig                                                 678..765    CDD:472250  24/96 (25%)
                     Ig strand B                                      695..699    CDD:409353  1/3 (33%)
                     Ig strand C                                      708..712    CDD:409353  0/3 (0%)
                     Ig strand E                                      731..735    CDD:409353  1/3 (33%)
                     Ig strand F                                      745..750    CDD:409353  2/4 (50%)
                   VEGFR-2_TMD                                        770..804    CDD:375470  8/37 (22%)
                   Protein Kinases, catalytic domain                  837..1175   CDD:473864  142/343 (41%)
                   Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite  1289..1330
Rows without an Identity value (blue background on the original page) are domains that fall outside the aligned region.
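
For downstream use, the domain rows above can be split into fields. The following is a minimal sketch in Python, not part of DIOPT: the pattern `ROW` and the function `parse_domain_row` are hypothetical names, and the code assumes each row has already had any leading gene symbol and accession removed, leaving the form `Domain start..end [External ID] [matches/total (pct%)]`.

```python
import re

# Assumed row layout: domain name, residue range, optional CDD identifier,
# optional identity count with percentage.
ROW = re.compile(
    r"^(?P<domain>.+?)\s+(?P<start>\d+)\.\.(?P<end>\d+)"
    r"(?:\s+(?P<ext_id>CDD:\d+))?"
    r"(?:\s+(?P<ident>\d+)/(?P<total>\d+)\s+\((?P<pct>\d+)%\))?$"
)

def parse_domain_row(line):
    """Return a dict of fields for one domain row, or None if it does not match."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    row = match.groupdict()
    # Convert numeric fields; fields absent from the row stay None.
    for key in ("start", "end", "ident", "total", "pct"):
        if row[key] is not None:
            row[key] = int(row[key])
    return row

aligned = parse_domain_row("Ig 101..191 CDD:472250 18/120 (15%)")
unaligned = parse_domain_row("Ig_2 137..224 CDD:464026")
print(aligned["domain"], aligned["start"], aligned["end"], aligned["ident"])
print(unaligned["ident"] is None)  # no identity: domain outside the aligned region
```

Rows with no identity figures come back with `ident`, `total`, and `pct` set to `None`, which gives a simple way to filter out domains that lie outside the aligned region.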