DRSC/TRiP Functional Genomics Resources

Protein Alignment: htl and Flt1

DIOPT Version: 10

Sequence 1: NP_524394.2  Gene: htl / 42160  FlyBaseID: FBgn0010389  Length: 729  Species: Drosophila melanogaster
Sequence 2: NP_034358.2  Gene: Flt1 / 14254  MGIID: 95558  Length: 1333  Species: Mus musculus


Alignment Length: 843  Identity: 236/843 (27%)
Similarity: 342/843 (40%)  Gaps: 282/843 (33%)
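The percentages above look truncated rather than rounded: 236/843 is about 27.995%, reported here as 27%, while the domain table further down lists the same fraction as 28%. The following is a minimal sketch of both conventions. The function names `pct_floor` and `pct_round` are hypothetical, and which convention DIOPT actually applies in each place is an assumption inferred only from the numbers on this page.

```python
def pct_floor(numerator: int, denominator: int) -> int:
    """Percentage with the fractional part dropped (integer floor)."""
    return (100 * numerator) // denominator


def pct_round(numerator: int, denominator: int) -> int:
    """Percentage rounded to the nearest whole number."""
    return round(100 * numerator / denominator)


# Counts copied from the alignment summary; the alignment length is 843.
summary = {"Identity": 236, "Similarity": 342, "Gaps": 282}

for label, count in summary.items():
    print(f"{label}: {count}/843 floor={pct_floor(count, 843)}% "
          f"round={pct_round(count, 843)}%")
```

Under these assumptions the floor variant reproduces the 27%, 40% and 33% shown in the summary, and the rounded variant gives the 28% that appears beside 236/843 in the domain table.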


- Green residues have known domain annotations, which are detailed below.


  Fly    91 ADLPVNVSSKPYWRNPKKMSFLQTRPSGSLLTLNCHALGNPEPNITW-----YRNGT-------- 142
            |.|.|||..:.|.::...:......|.||...|.|...|.|.|.|||     :.|.:        
Mouse   421 ATLIVNVKPQIYEKSVSSLPSPPLYPLGSRQVLTCTVYGIPRPTITWLWHPCHHNHSKERYDFCT 485

  Fly   143 -------VDWTRGYGSLKRN---RWTL------TMEDLVPGDC---GNYTCKVCNSLGCIRHDTQ 188
                   :|.:...|:...:   |.|:      |:..||..|.   |.|:|:..|.:|.:..:.:
Mouse   486 ENEESFILDPSSNLGNRIESISQRMTVIEGTNKTVSTLVVADSQTPGIYSCRAFNKIGTVERNIK 550

  Fly   189 VIVSD-----RVNHKPILMTG-PLNLTLVVNSTGSMHCKYL-SDLTSKKAWIFVPCHGMTNCSNN 246
            ..|:|     .|:.:.:...| .|.|:.|||       |:| .|:|    ||      :....||
Mouse   551 FYVTDVPNGFHVSLEKMPAEGEDLKLSCVVN-------KFLYRDIT----WI------LLRTVNN 598

  Fly   247 RSI--------IAEDKD-----QLDFVNVRMEQEGWYTCVESN---------------------- 276
            |::        :|..:|     .|...||.:|..|.|.|...|                      
Mouse   599 RTMHHSISKQKMATTQDYSITLNLVIKNVSLEDSGTYACRARNIYTGEDILRKTEVLVRDSEAPH 663

  Fly   277 ---------------------------------------------SLGQSNSTAYLRVV----RS 292
                                                         .||..|||.::..|    ..
Mouse   664 LLQNLSDYEVSISGSTTLDCQARGVPAPQITWFKNNHKIQQEPGIILGPGNSTLFIERVTEEDEG 728

  Fly   293 LHVLEAGVASGSLHSTSFVYIFVFG------------------GLIFIFMTTLFVFYAIRKMKHE 339
            ::...|....|::.|.:  |:.|.|                  ..:|..:.|||    |||:|. 
Mouse   729 VYRCRATNQKGAVESAA--YLTVQGTSDKSNLELITLTCTCVAATLFWLLLTLF----IRKLKR- 786

  Fly   340 KVLKQRIETVHQWTKKVIIFKPEGGGDSSGSMDTMIMPVVRIQKQRTTVLQNGNEPAPFNEY--E 402
                                       ||..:.|..:.::.           ..:..|.:|.  .
Mouse   787 ---------------------------SSSEVKTDYLSIIM-----------DPDEVPLDEQCER 813

  Fly   403 FPLD-SNWELPRSHLVLGATLGEGAFGRVVMAEV-------NNAIVAVKMVKEGHTDDDIASLVR 459
            .|.| |.||..|..|.||.:||.||||:||.|..       ....|||||:|||.|..:..:|:.
Mouse   814 LPYDASKWEFARERLKLGKSLGRGAFGKVVQASAFGIKKSPTCRTVAVKMLKEGATASEYKALMT 878

  Fly   460 EMEVMKIIGRHINIINLLGCCS-QNGPLYVIVEYAPHGNLKDFLYKNR----------------- 506
            |::::..||.|:|::||||.|: |.|||.|||||..:|||.::|...|                 
Mouse   879 ELKILTHIGHHLNVVNLLGACTKQGGPLMVIVEYCKYGNLSNYLKSKRDLFCLNKDAALHMELKK 943

  Fly   507 --------------------------PFGRDQ-------DRDSSQPPPSPPAHVITEKDLIKFAH 538
                                      .|..|:       |.|.|:....|    :|.:|||.::.
Mouse   944 ESLEPGLEQGQKPRLDSVSSSSVTSSSFPEDRSVSDVEGDEDYSEISKQP----LTMEDLISYSF 1004

  Fly   539 QIARGMDYLASRRCIHRDLAARNVLVSDDYVLKIADFGLARDI-QSTDYYRKNTNGRLPIKWMAP 602
            |:||||::|:||:||||||||||:|:|::.|:||.|||||||| ::.||.|:. :.|||:|||||
Mouse  1005 QVARGMEFLSSRKCIHRDLAARNILLSENNVVKICDFGLARDIYKNPDYVRRG-DTRLPLKWMAP 1068

  Fly   603 ESLQEKFYDSKSDVWSYGILLWEIMTYGQQPYPTIMSAEELYTYLMSGQRMEKPAKCSMNIYILM 667
            ||:.:|.|.:||||||||:|||||.:.|..|||.:...|:..:.|..|.||..|...:..||.:|
Mouse  1069 ESIFDKVYSTKSDVWSYGVLLWEIFSLGGSPYPGVQMDEDFCSRLKEGMRMRTPEYATPEIYQIM 1133

  Fly   668 RQCWHFNADDRPPFTEIVEYMDKLLQTK-----EDYLDVDIA-------NLDTPPSTSDEEED 718
            ..|||.:..:||.|.|:||.:..|||..     :||:.::..       ...||..:.|..:|
Mouse  1134 LDCWHKDPKERPRFAELVEKLGDLLQANVQQDGKDYIPLNAILTRNSGFTYSTPTFSEDLFKD 1196
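In the blocks above, the line between each Fly and Mouse row appears to mark identical residues with `|`. The sketch below counts identical positions directly from two aligned strings and compares the result with the number of `|` characters. The helper name `count_identical` is hypothetical, and the sample strings are the first seven columns of the first alignment block. What `:` and `.` mean is not stated on this page, so the sketch does not interpret them.

```python
def count_identical(seq_a: str, seq_b: str) -> int:
    """Count columns where both aligned sequences carry the same residue.

    Gap characters ('-') never count as identical.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != "-")


# First seven aligned columns of the first block (Fly 91.., Mouse 421..).
fly = "ADLPVNV"
mouse = "ATLIVNV"
match_line = "|.|.|||"

identical = count_identical(fly, mouse)
print(identical, match_line.count("|"))
```

Applied to every block and summed, the same count could be checked against the Identity figure in the summary, assuming the match line is copied without losing its leading spaces.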

Known Domains:


Indicated by green residues in the alignment.

Gene  Sequence  Domain  Region  External ID  Identity
htl NP_524394.2 Ig 101..191 CDD:472250 26/121 (21%)
Ig strand B 121..125 CDD:409353 1/3 (33%)
Ig strand C 134..138 CDD:409353 3/8 (38%)
Ig strand E 157..161 CDD:409353 1/9 (11%)
Ig strand F 171..176 CDD:409353 2/4 (50%)
Ig strand G 184..187 CDD:409353 0/2 (0%)
Ig 200..289 CDD:472250 30/170 (18%)
Ig strand B 216..220 CDD:409353 0/3 (0%)
Ig strand C 229..233 CDD:409353 0/3 (0%)
Ig strand E 255..259 CDD:409353 2/8 (25%)
Ig strand F 269..274 CDD:409353 2/4 (50%)
Ig strand G 282..285 CDD:409353 2/2 (100%)
Protein Kinases, catalytic domain 404..692 CDD:473864 143/347 (41%)
Flt1 NP_034358.2 Ig_3 32..109 CDD:464046
IG_like 144..220 CDD:214653
Ig 232..331 CDD:472250
Ig strand B 249..253 CDD:409448
Ig strand C 264..268 CDD:409448
Ig strand E 295..299 CDD:409448
Ig strand F 309..314 CDD:409448
Ig strand G 322..325 CDD:409448
Ig 334..425 CDD:472250 2/3 (67%)
Ig strand B 354..358 CDD:409353
Ig strand C 367..371 CDD:409353
Ig strand E 389..393 CDD:409353
Ig strand F 403..408 CDD:409353
Ig strand G 418..421 CDD:409353 236/843 (28%)
Ig 447..553 CDD:472250 24/105 (23%)
Ig strand B 451..455 CDD:409353 1/3 (33%)
Ig strand C 464..468 CDD:409353 2/3 (67%)
Ig strand E 515..520 CDD:409353 0/4 (0%)
Ig strand F 533..538 CDD:409353 2/4 (50%)
Ig strand G 546..549 CDD:409353 0/2 (0%)
IG_like 570..641 CDD:214653 24/87 (28%)
Ig strand B 574..578 CDD:409353 2/3 (67%)
Ig strand C 587..595 CDD:409353 4/17 (24%)
Ig strand E 620..624 CDD:409353 1/3 (33%)
Ig strand F 634..639 CDD:409353 2/4 (50%)
I-set 662..749 CDD:400151 10/88 (11%)
Ig strand B 680..683 CDD:409353 0/2 (0%)
Ig strand C 692..696 CDD:409353 0/3 (0%)
Ig strand E 715..719 CDD:409353 2/3 (67%)
Ig strand F 729..734 CDD:409353 0/4 (0%)
Protein Kinases, catalytic domain 820..1157 CDD:473864 140/341 (41%)
Disordered. /evidence=ECO:0000256|SAM:MobiDB-lite 947..983 2/35 (6%)
Blue background indicates that the domain is not in the aligned region.
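Whether a domain falls inside the aligned region can be approximated from the coordinates alone. The sketch below parses a `start..end` region string from the table and tests it for overlap with the aligned span of the mouse sequence, taken from the alignment as residues 421 to 1196. The helper names and the simple overlap rule are assumptions for illustration, not DIOPT's documented logic.

```python
def parse_region(text: str) -> tuple[int, int]:
    """Turn a region string such as '334..425' into (start, end)."""
    start, end = text.split("..")
    return int(start), int(end)


def overlaps(region: tuple[int, int], aligned: tuple[int, int]) -> bool:
    """True if the two inclusive ranges share at least one position."""
    return region[0] <= aligned[1] and aligned[0] <= region[1]


# Aligned span of Flt1, read from the first and last Mouse rows.
mouse_aligned = (421, 1196)

# A few Flt1 domain rows copied from the table.
flt1_domains = [
    ("Ig_3", "32..109"),
    ("Ig", "334..425"),
    ("Protein Kinases, catalytic domain", "820..1157"),
]

for name, region_text in flt1_domains:
    inside = overlaps(parse_region(region_text), mouse_aligned)
    print(f"{name} {region_text}: overlaps aligned region = {inside}")
```

Under this rule, Ig_3 at 32..109 lies wholly outside the aligned span, which is consistent with that row carrying no identity value, while Ig at 334..425 overlaps it only at residues 421 to 425.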
