HG10021801 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10021801
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SAGA-associated factor 11
Location: Chr05: 16928240 .. 16937180 (+)
RNA-Seq Expression: HG10021801
Synteny: HG10021801
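
The Location field can be turned into a locus length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations; the page itself does not state it). The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr05: 16928240 .. 16937180 (+)'.

    Returns (chromosome, start, end, strand). Hypothetical helper for
    illustration only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr05: 16928240 .. 16937180 (+)")

# Assumption: both endpoints are included in the feature (1-based, inclusive).
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 8941 bp on the plus strand of Chr05.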
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTACAATTCTTCAATTTCGATTCTGCTCGTCAGTTGGGTCGTTTGGCCCTTTCTTTCCCTTCAGCTCCCCCAATTTTCGCTCTCGATTCATACCAGTTTCCGTCTCTTTCAACTCCTTTTCTACGCCAATCTTCCGCTCCAGTAACTTCGCTTTGAGATTTCTGTCCAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTAAAAATCCGGTGGTCTCCGACATCTGCGCCACCGCCTTGTCCGGCGTAGTCGCCTTGTCTTTGCTTCGATTATGGGCAGAAACGGCGAAACGTGGCCTCGACCAGGTTCTATTCCTTTTATCCACTGATTTTCCATTTTTTTCTTGTCTTGAGGAAAAGAAAGTTATCCAAGAAAGGAATCCAATGAGACAAACTCATAACTGCAGGGGAAATGTTTTAGTTAATGCTTAACCTTTGGGGTTTGTGTTGGAGAGTGGTTTTTTAATTTATTTTAGGAAAAAGTGCTTAGTAACAAGTATGTTTGATTCTACTGCTGCTTTTGGACCAATAACGCTCTAGTTTGTGGAGGAGTTCTTTCGGATAAGGCACTTGAAGTCTTCAAGGATTTTGACTGTCTTCAAAGGTGTAAATGATAAAAGCACCTATTATCGTCTCAAACGTTGATGATCTGATCCTCACCACAATCGTTTATCTTAAAAAAACCTTAATCTTAATAACTTTTAGATGAATTATAACAAGTTATCATATGATTTTTTTAGAAGGATAATCGAAAAAATTAAACTCAAATAAATAAAAAGAGGATAACTTAGAAAGGATTGCCTCCAAATTGATTATGGTCGTTGATATTCTATTCTTAACATAAGAAAGTGATTGGTTCAGTTCAACAAGGCGCATGTTCTGTGTACAGTGCTGATAAATAATCAATGTTATGCATATTCTGACAATGTAAAAATGCTTATCAGTGGAAATGTCAAGGCTTCAAATTGTGAAACAATTCTGATCTTTTATCAAAGGTGTGTACTTACCTATAATTGGTTCAGAAATTGAACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCTTTCATGCTTTGCTGGCCTATGTTCAGGTGAACTTAGTTTGAAAAAAGTTAATTCTGACCCTTCTTCTACTCCTATATTTGCACTAATATGCTTGAATCAAACGATGCTACTCCTTAATTGTAGTTCTGGTTATCGAGGAGCAATACTTGCGTCTCTAATTCCCGGTGTCAATATAATACGAATGCTCGTCTTGGGATTCGGGATATTGAAAGACGAGGCTACAGTGAAATCAATGAGCAGATATGGAGACTACAGGTACAACTTCGTAAAGAACATAATCAAGTAAAATAAGTTTTTGGGGTGCCTGTGTTTTAGTTGATTTTTAGCTTCCAATCAAGATCAATCTCGGACTATACATACAGTTAACCAAATGAATGTTGATTCTAAGGATAACTATCCTCCTATTCCTCTCCATTTGTTAGGTTCTTCTTCCATTCATTGTTGCCATTTGGAATTCTTTTGGGCTCAATTGTTATATCATTTAGCTGGTTTCCCATAGCTGAAAATTAATGGTTGATGATACTCACAAAACAAAAGCATAGCGCCCATTATTTCGCTCATTAATGCTTTATTTTAAAATTTGACAAGCATGTTTTCATCCATCAGGGAGCTTTTGAAGGGGCCTTTGTATTATGTTGCAACAATTACATTAGTTTGTATATTCTACTGGAGGACGTCCCCCATTTCAATTGCTCTGATATGCAACTTATGTGCCGGAGATGGTACTTTTCACATTTGTTCATCTACTTGTTACCTCTTCCCAAACACAAATTTAAAGCTTACCCATAACAACTTTCACACTTTACTATTTCAGGGTTGGCTGATATCGTTGGAAGACGATTTGGAAGTAAAAAGATCTTTTACAACAGAAACAAGTCTCTAGTTGGTAGTGTGGCAATGGCATCTGCTGGTTTTCTT
GCATCTGTTGGGTAAAGTCTCTCTTCGTTTCGGATTCAGGCATCAAGTATAACAAGTTACTGTGTTTGTGAACTTTCTGAGTTGGTTTAGAAATAAGAGGCTGTATTGTGATTTGTGAATGGCAGGTATATGTACTATTTCTCATCATTTGGGTATGTTGTGGGAAGCAACAGAATGGTTTTGGGATTCTTAGTTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCATAAGCACTGAGATTGATGACAACCTTACAGTTCCACTCACTTCTTTGCTGGTAGGGAGCCTAGTTTTTTAGTATTTGGGAGCCAAAAACACAAAGTTGCTACTATTAGAAGGATAGGAAATTGAGGATTTCTTACAAATCCTGCTTGTAATGTTGTATTATTATGTGGCTTGAATAGATTCTACTGAACCAGCTTGATATTAGACTTTGCTAATGAAATTTTATTGATGCCATTGTTATGAAATCCATTGTTTAGTGAAGTGACTATGAAACAATTATTGAAACTTCAAGATGCAGAGAAGCTAAAATCAGCTGAAGATGAATCCAAGCTGCCCCATGGGAACAAACAAGATCACCAGACAGAGAACAGAGGAGGGTTTAATTTTTAAAATAAAAATTTTGAGGGCCATTTTACAATTAGTTAAAAAGGGGTTTGATTTTAAAAATAAAAATTGGCGGCCCAGTTTCTATGACGTAAATTTGAGGGACCAATTTTAACTATTGTGTAATTTTGGGAAAAATTTTAAAAATCACCCCAGATGTGTAATGTTAGTTGCTATTACACTCTCAATCTTTTAATGGTAAAAATCATGCCCTCAAACTTCTATAAGTGTTAAAATTGGGCCCTTAAATTTATTTTAAGAGATTTTTACATAAATGATCATTTTTGGGCCTTATATTTCAAAAATGTCTTTAAATTTGAATTTTTTCAAGAAAGACTTTTAGTGTTATTTTTGGACAAATTTACCCTTTTTTATTTTTTTATTTTTGAATTTTATATTTATTTTTCCTATTTTTAAAGTTTCCTTAATGATGGGGAGAGAAAATACATTTAATATGATTTTCTTCTAATTTTCAATTGGAGAGAATCATTTTAATATTTTTTTCTATTATCAATTGGAGAGAGAAAATAAATTTAATATTTTTTTTCCTTTTCACATATCAAACATTTCTTTGATATTGTTTGATATTATTTAACGTTTGCTTTTTGATATTATGTGATATATTACACTGTTTGTTTTGAGTTCTATTTGTTTTCATGTAATATTTCATTGTTTATTTGGTATATATATGAGGTATTCTGATATGTGCAACACTTAGTGCTTGTAATATATTCATCGAATGTCTGATATATATGTCACTCTAATGCAATTTTTATATGATGGTTTGTCTTGTTTGAGTTTGTCTATTATAGTTCCTTGTTTATTTTGAGTTCATTTGTATTTTGCGTTATTATCTCATATATACTATAATTTAGATTCTATCTATCATGGTTTATAGATCTCAAATGAACTTAATGTTCAACATATATTAGTGTATATATCATAAACAGAAAGTCTCAATAAAAAATTAAAGATATATATCACAATATATATCGAAAAATCACTCTATATATCGAAAATGATTATATACTTGTAAATATTCTAACCAATATATGAAATATAGTAGTAAGATTGACTAAATTCCATTTAAGTAAATTTAACTGAAGCTCCCTAATGATTTTCAAATAAACAAAGCTATATAAATATCAAATGAACTTAATATTCAACATATTAGTGTATATAACATAAAACAGAAAGTATCAATAAAAAATTGAAGATATATATCATATATCGAAAAATCAACCTATATATCAAAAATAAATAAATAAATAAATAAATATATAAATATACACTAATATGTTGAATAAATAAATATATATATATATATATATATATATCGAAAAAATATAAAAGTATATATAAATCCACGACGACGGTGGCGATTGG
TGATAGATGTGAGATCTTGGTGGCGCGCGAGTTGGAATGAGATCGGAGAAGTGTGGCAAAAAAAAATCGAGAAAAAAACTCACTGTCGGTCGAAACTTGAACGACGGGAACGACCGCCGAAACTTGAACGGTGACCACGATAGTGGCCAAGAACGTCACATCACAATATTGAAGAAAATCAGAACTAATAACCTAGGTAGAAATGTAGAATTGTCAAGTTGCCAATGACATTTACCCCTAATTTCTTCAAACTAATGTAGATATTTTGAAACTTAATGTAGATATTTTTTGTGATGACAAATCGAAGTAGGACAATCTAGAGTATTAACGTGGGATCATTGAGTGATGTTCTGTTAAGAAGCTTTGAAAAGATCATGATTATTGCATGGCAGATTTGGGATGCCAGATAATAAGATACTTTTTTTAGAGAGGTCCAAACGATTCGGATAAAATTAGTGCATCGATTGACAAATATATGAATGAAATGATATCGTTCTATCATATTCCTTATGGAAAGGTAATTGTGGAGAATCAAATATATCCTGCCTCATCTATCATTGGTGGCTAGACACCTCTTCCTATGAGCTGGTGGAAATTGAACATCGACGTAGTGTGAAACCCTATTAGCCAGTCGAGTGGTGTTGGTTAGGTGCTTCGAGATTAGGTGAACCATGTTCGTTTGGCTGGGTTTAAGTTCATCCCAAGATGCATTCTGGTCAAGATTCTTGCAGTGAGGATCTGGCTCGTATTTCACTCAATTAATGTGTGTAATATTTGGATCACATCTGACTATTTTGATAGTGGTTATCTTATTAATTAGGTTTCCTTGGATCTCTCTAAGGTTACTTTTGTTTTGTTTTGTTTTTTTTTTTTTTAATTAAGGAAGCTAGGTTGCTTCTGATTGATTTGGGTATTGTTTCTTTCTCTCATGTGAGGTGGAGCTACAATTTCTTGGCACGCTCTCTTGCCGGTAAGGCGTTGGTTGAGCATGATTCGTCTTTTATGCTTTCTCCTTACTCTAAATAGTTTGTTTACTTGTGTCTAAAGATGTTGGTTTATAATTCATCGTTGGTTTGTTAGCTCTAAAAAAGAAAGTACAATTGTCTCGAAATTTCTGACCTCTTGAACAATAATAAATTTTGTCAAATCTCATTTGGATGCTTAACATATGAAACCCAATTACTCCATTTATAAAAATCTTACCACCTATCAACACATTTTCCTTCTTCAAATGATGTGTGAGAATATGTTTATAAGACAAAACTTGACACATTGTATCAAAAGTTCACACTATATTTTTAATTTCATAAAATTAGACACTAGACTTATTTAGATGTTACCATTTATATCATCTATTTAACTTTCGCTAATTCATCACAAATTCTAACCATGCAAAAGCTTTTGTAAATTTATTAATTTCAATAATATCAAGTTTTGCACCAATTTAAATTTGATGAATATAGAGTTTAATTAATAATTTTTGCAAAACTCAAAATTTAGCCTTTATATCATTTTTTTCTTTGAAATCTTATCAATTTTGTATCATAACCAAAATTTACACTTTTTCTTTCATCGAATTTTGAACCAAAAGTTTATCAAAATGTTAATTGCAACATTTTTTAAGTTTAAAGTTAAAATTAAACACTTATCTAAATTTAGAGGTACAATTTGATAAAATAAAAAATTTACGGTAAAGTTCATTCAACTGAATAACTAGAGGCTAGAGGCTAGTATGTACATTTAACATATTTTTAGGGAGAAAAATTTAATTAAAAAAAAACAATTTGCTTAATTCACTCGTCAATACTTCAATATTGCTTTTTAAATCAACTCTTGTAGTTTTAAAAGATACGAGTAATTGATCAAAATAAGAATCAAAGAATAATAATAATAAAATTTGAAAGCATGTAAGTTTAATTTTAAATTTAAAGAAAAAAAGTTGTCAAAAATATATATATAAATACCATTAAAAAGAAAACCCAAATTTGCAACAACCTCCG
TTGTCGTCCAGCGGTTAGGATATCTGGCTTTCACCCAGGAGACCCGCGTTCGATTCCCGGCAACGGAATTTTATTTTTCCTTTTTCTTATTTTATTTTATTTTATTTTTATTATTATTGAACATATTTTCGATGCCCTCTTCTTCTCCTCGATGAATCTTTCCCTCTTTCTGCAGTCTTCTCGGATCCCACTGTGTTTGGATTTGTAGACTCTGTCCATTTTTGCTGCGAATTCAACTTTATATAAGTAAATACAGCTATATCTTTCTGCATTTTTCTGTGTTCCTCATTGATTTTGCCAATTTTACGAACATTTTTTCAGTAAACAATTTCTAAGAAATGTTGCGTTTGATGCCAGTTTGCAATATTTTGAATGTTAGGAGATTCTTTTTTTTTTTTTTTTTTTCTTTTTTATATTTTGTATGATCGGTTACTTTTACAAGTTTGATTAATTTCAGAATGTGTAAATCTTCTCGTTAGTCTCTTATGCAACTTCAGTTTTTGAATAGAATGATGATCCTTGATATTGGATTTATTTTCTGTACTGTGTTGCTTGAAATTGGTTAAGTCGTGTTTCTGATGACATTTGTCCTGGTTTTTATGTTTCTAGGCCTATATTGAAACTCGTCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGACAATGCATCTTCACAAACTCAGGTAGCCTTTCCCTTTATATTTCTTCTTCTTTTTCTTATTTGCATTTAGCACCTTAGTATATAAACAATGGGAAATGAAGGTAACAGCTATAATGGTTTTGTTTACGGTGAAGAACAAAGTCTTGAATTAATTATTGATATGGAATATGCTAGTTACAAGAGAACAATGGCATTATAACGTTGATGTGCCCTGAAGTATTTTGAGCAGCTGCAATCCGTCGGTTCTCGAATTAGTTACCTCCTGAAACAAGAACAGTGGTACTGTACCACTGTTATAAACGCTTTACAAGCCTGACAGTCTTACTTGCTTAACTTACAAGCTATTAAATAATCCTAACCTAATCAACCCAACCGGTTACTAACCACAAAGAAGCTGCTCAATCAGTAAGCTTTCTTTCACAGACTTCCCCCCCTCAGTTTAATGATACTTAACCAGATTGTTCCCTCGAAATTGTATTTTCATTATCTGTGAGATTTCTCTGCGGTAAGTTTTTTTTTTTTTTTTTTTTTTTCATTAGTATGTTCCAGAAATTTAATTGTAAAAAATAGGAAGTCGTGTGTGTACAATATATGATAGTTCTCCCTGGATTGTTAAGAAGATTGAGATGCATCTGGCTGGTTCTTTTGAAAGATTTTTTTAGCAATGGTCTGTCAACACATGTTTTAAAAGTCTAAAGGCGTCTCAAGCTTTTCCTCTTAAAAGAGGCGAGCCTCAAGGTGGCATGAGATATTAGCTTCAATGTTAATAAAAGTTTACACAAATCCTAAATTTTTATGATCGCATATATATAATTTGGTAAGACACCAAGCTTTCATTGAGAAAATGGAAGATTACGAGGGAATACATAAAAACATGTCTGAACAAAAGAGTCCCAAAGAAAAGAAAAATTACAGGAAAGAACTCCAATCCAAAGAAGAAAAAAAAGAAGACAATGGTCATAATGACAAAGAGGCCTTGTAACCGAGGCCTAAAGGAAGGCATAAAACCTAGCCATCTCCCAAACCTCTCATTCTAATGCTCAACCTCTAGAAAGTCCCACAAGTCACAACAAGTAAATAAAAGATAATGTTTATATAGGTACAAAAATGGATTACACTCTGGTAAATGGCCACTCTAATAAAAAATTTATCTTGGTTATTCTTATTGCCACCACTATCATCACTAAGCCTAGATCGTAAAGATAGTAATAGGTTAAACTGCCACAAGTCATGTATTAGCATTACATGCACACAAAACTTTTATATTTCTTAATAATGGGGCAAATTAATATTATTCTTCTAATTAATATATGGGGTATAAAGACTTTGAACCTTTGG
TGTGTTTGCATTGAGTCATAATGCCTCTTATACTCCAAAGAAGGTATACGAGTTTCAAAGCTAAGAGTGAGAGCCTTGTCTCAAGGTGTAATCCTCAAAAGTATTTATTCATTAGTTAGCAAGCACCTAAACCAACATGTCTAGCCCAATGAATTATTTGAAAGCTTCGTCTAACTTGCTTAATGTGAAAAAAAGGCTTAGTCTGAGTATGGGTGATGTGTCAAACATGTCATGGAAACCCAAGAAGTATAGTAGTAATAGTGATCAATTGTGACTTCCTTATTAGAGGACACATAACTAATGGAACTGGTACTTCTGTTGCTTTCTGCTCATATTTGCAGCTTTCATCTAATTTGTTTGGAGATCTTCTGGATTCTGTGATTGTTGATATTGCATCAGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGAAGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTGTTGCGAATGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATGGCTGGGAGATTTGCTCCTCATTTAGAGAAATGCATGGGAAGGGTAACTCTCTGCTCCTTAATTAATATATGCTTTCCAGACACATTCTGTTTCTATTTTGTCAAATCTCTTTCCACTTTGTCTCATCAACTGTTTACTCTACTATAGGGTAGAAAGGCTCGTCCCAAAGTAACAAGAAGTAGTACAGCTGCCCAGAGCCGGTATTCACGAGGCAATCCTGTTTCTGCATATTCCCCTTACCCTAATTCCACCAGCACGAATCGCTTACCTAATGGAACGTCTAGTCTTGCAGGGGAGGAGTACTCAAATGGTACATCTGAAGACCCATGA

mRNA sequence

ATGGCTACAATTCTTCAATTTCGATTCTGCTCGTCAGTTGGGTCGTTTGGCCCTTTCTTTCCCTTCAGCTCCCCCAATTTTCGCTCTCGATTCATACCAGTTTCCGTCTCTTTCAACTCCTTTTCTACGCCAATCTTCCGCTCCAGTAACTTCGCTTTGAGATTTCTGTCCAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTAAAAATCCGGTGGTCTCCGACATCTGCGCCACCGCCTTGTCCGGCGTAGTCGCCTTGTCTTTGCTTCGATTATGGGCAGAAACGGCGAAACGTGGCCTCGACCAGAAATTGAACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCTTTCATGCTTTGCTGGCCTATGTTCAGTTCTGGTTATCGAGGAGCAATACTTGCGTCTCTAATTCCCGGTGTCAATATAATACGAATGCTCGTCTTGGGATTCGGGATATTGAAAGACGAGGCTACAGTGAAATCAATGAGCAGATATGGAGACTACAGGGAGCTTTTGAAGGGGCCTTTGTATTATGTTGCAACAATTACATTAGTTTGTATATTCTACTGGAGGACGTCCCCCATTTCAATTGCTCTGATATGCAACTTATGTGCCGGAGATGGGTTGGCTGATATCGTTGGAAGACGATTTGGAAGTAAAAAGATCTTTTACAACAGAAACAAGTCTCTAGTTGGTAGTGTGGCAATGGCATCTGCTGGTTTTCTTGCATCTGTTGGGTATATGTACTATTTCTCATCATTTGGGTATGTTGTGGGAAGCAACAGAATGGTTTTGGGATTCTTAGTTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCATAAGCACTGAGATTGATGACAACCTTACAGTTCCACTCACTTCTTTGCTGGCCTATATTGAAACTCGTCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGACAATGCATCTTCACAAACTCAGCTTTCATCTAATTTGTTTGGAGATCTTCTGGATTCTGTGATTGTTGATATTGCATCAGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGAAGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTGTTGCGAATGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATGGCTGGGAGATTTGCTCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCCCAAAGTAACAAGAAGTAGTACAGCTGCCCAGAGCCGGTATTCACGAGGCAATCCTGTTTCTGCATATTCCCCTTACCCTAATTCCACCAGCACGAATCGCTTACCTAATGGAACGTCTAGTCTTGCAGGGGAGGAGTACTCAAATGGTACATCTGAAGACCCATGA

Coding sequence (CDS)

ATGGCTACAATTCTTCAATTTCGATTCTGCTCGTCAGTTGGGTCGTTTGGCCCTTTCTTTCCCTTCAGCTCCCCCAATTTTCGCTCTCGATTCATACCAGTTTCCGTCTCTTTCAACTCCTTTTCTACGCCAATCTTCCGCTCCAGTAACTTCGCTTTGAGATTTCTGTCCAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTAAAAATCCGGTGGTCTCCGACATCTGCGCCACCGCCTTGTCCGGCGTAGTCGCCTTGTCTTTGCTTCGATTATGGGCAGAAACGGCGAAACGTGGCCTCGACCAGAAATTGAACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCTTTCATGCTTTGCTGGCCTATGTTCAGTTCTGGTTATCGAGGAGCAATACTTGCGTCTCTAATTCCCGGTGTCAATATAATACGAATGCTCGTCTTGGGATTCGGGATATTGAAAGACGAGGCTACAGTGAAATCAATGAGCAGATATGGAGACTACAGGGAGCTTTTGAAGGGGCCTTTGTATTATGTTGCAACAATTACATTAGTTTGTATATTCTACTGGAGGACGTCCCCCATTTCAATTGCTCTGATATGCAACTTATGTGCCGGAGATGGGTTGGCTGATATCGTTGGAAGACGATTTGGAAGTAAAAAGATCTTTTACAACAGAAACAAGTCTCTAGTTGGTAGTGTGGCAATGGCATCTGCTGGTTTTCTTGCATCTGTTGGGTATATGTACTATTTCTCATCATTTGGGTATGTTGTGGGAAGCAACAGAATGGTTTTGGGATTCTTAGTTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCATAAGCACTGAGATTGATGACAACCTTACAGTTCCACTCACTTCTTTGCTGGCCTATATTGAAACTCGTCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGACAATGCATCTTCACAAACTCAGCTTTCATCTAATTTGTTTGGAGATCTTCTGGATTCTGTGATTGTTGATATTGCATCAGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGAAGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTGTTGCGAATGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATGGCTGGGAGATTTGCTCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCCCAAAGTAACAAGAAGTAGTACAGCTGCCCAGAGCCGGTATTCACGAGGCAATCCTGTTTCTGCATATTCCCCTTACCCTAATTCCACCAGCACGAATCGCTTACCTAATGGAACGTCTAGTCTTGCAGGGGAGGAGTACTCAAATGGTACATCTGAAGACCCATGA

Protein sequence

MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
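
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is illustrative, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGGCTACAATTCTTCAATTTCGATTCTGC"))  # MATILQFRFC
# Last four codons of the CDS, ending in the TGA stop.
print(translate("GAAGACCCATGA"))  # EDP
```

Both fragments agree with the start (MATILQFRFC) and end (EDP) of the listed protein sequence.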
Homology
BLAST of HG10021801 vs. NCBI nr
Match: KAG7019118.1 (Farnesol kinase, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 838.6 bits (2165), Expect = 2.8e-239
Identity = 446/493 (90.47%), Positives = 456/493 (92.49%), Query Frame = 0

Query: 1   MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIR 60
           MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIR
Sbjct: 1   MAANLQFLFRS--GSFGPLFPFSSPTLISRFRPVSVSFNSIFAPIFRSDAFTLRFGTKIR 60

Query: 61  REHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIG 120
           RE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW ETAKRGLDQKLNRKLVH SIG
Sbjct: 61  REQCRVAAVMLLPDNPVVSDICATAVSGGVALSLLRLWTETAKRGLDQKLNRKLVHTSIG 120

Query: 121 LAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKG 180
           LAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKG
Sbjct: 121 LAFMLCWPMFSSGHRGALLASLIPGVNIIRMLVLGLGILKDEATVKSMSRNGDCRELLKG 180

Query: 181 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSV 240
           PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSV
Sbjct: 181 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSRKISYNKNKSLAGSV 240

Query: 241 AMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL 300
           AM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Sbjct: 241 AMVSAGFLASVGYMYYFSSFGYVEGSNRMVLGFLVVSIASALVESLPISTEIDDNLTVPL 300

Query: 301 TSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDR 360
           TSLLAYIETRPASRSMSMP+ED+ASS TQLS NLFGDLLDSVI D+ASECHRIARLGLDR
Sbjct: 301 TSLLAYIETRPASRSMSMPHEDSASSHTQLSFNLFGDLLDSVIADVASECHRIARLGLDR 360

Query: 361 NLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA 420
           NLEEEEEELRLSAQAR RVADS NSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA
Sbjct: 361 NLEEEEEELRLSAQARERVADSCNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA 420

Query: 421 GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSL 480
           GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVS YSPYPNSTSTNRLPNGTSSL
Sbjct: 421 GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVS-YSPYPNSTSTNRLPNGTSSL 480

Query: 481 AGEEYSNGTSEDP 494
           AGEEYSNGTSEDP
Sbjct: 481 AGEEYSNGTSEDP 490
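
The percentages on each HSP statistics line follow directly from the match counts. As a rough sketch, the line can be parsed and the percentages recomputed as below; `parse_hsp_stats` is a hypothetical helper, and the pattern accepts either spelling of the positives label ("Positives" or "Postives").

```python
import re

# Pattern for lines like:
#   Identity = 446/493 (90.47%), Positives = 456/493 (92.49%), Query Frame = 0
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return identity/positive counts, alignment length and recomputed percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, _, pos, pos_length, _ = m.groups()
    ident, length, pos, pos_length = map(int, (ident, length, pos, pos_length))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": length,
        "pct_identity": round(100 * ident / length, 2),
        "pct_positives": round(100 * pos / pos_length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 446/493 (90.47%), Positives = 456/493 (92.49%), Query Frame = 0"
)
print(stats["pct_identity"], stats["pct_positives"])  # 90.47 92.49
```

For the first hit, 446 identical positions over an alignment length of 493 gives 90.47% identity, matching the reported value.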

BLAST of HG10021801 vs. NCBI nr
Match: KAA0060025.1 (farnesol kinase [Cucumis melo var. makuwa] >TYJ97282.1 farnesol kinase [Cucumis melo var. makuwa])

HSP 1 Score: 754.6 bits (1947), Expect = 5.4e-214
Identity = 390/418 (93.30%), Positives = 401/418 (95.93%), Query Frame = 0

Query: 70  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPM 129
           ML P+NPVVSDICATALS  VALSLL+LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPM
Sbjct: 1   MLFPENPVVSDICATALSSGVALSLLQLWVETAKRGLDQKLNRKLVHISIGLAFMLCWPM 60

Query: 130 FSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATIT 189
           FSSGY+GAI ASLIPG N+IRMLVLGFGILKDEAT+KSMSRYGDYRELLKGPLYYVATIT
Sbjct: 61  FSSGYQGAIFASLIPGANVIRMLVLGFGILKDEATLKSMSRYGDYRELLKGPLYYVATIT 120

Query: 190 LVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLA 249
           LVCIFYWRTSPISIALICNLCAGDG ADIVGRRFGS+KIFYN NKSL GSVAMA+AGFLA
Sbjct: 121 LVCIFYWRTSPISIALICNLCAGDGFADIVGRRFGSEKIFYNENKSLAGSVAMATAGFLA 180

Query: 250 SVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIET 309
           S+GYMYYFSSFGYV  S  M L FL+VSLASALVESLPISTE+DDNLTVPLTSLLAYIET
Sbjct: 181 SIGYMYYFSSFGYVEASIGMALRFLIVSLASALVESLPISTELDDNLTVPLTSLLAYIET 240

Query: 310 RPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL 369
             ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEEL
Sbjct: 241 SSASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEEL 300

Query: 370 RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEK 429
           RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEK
Sbjct: 301 RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEK 360

Query: 430 CMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN 488
           CMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN
Sbjct: 361 CMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN 418

BLAST of HG10021801 vs. NCBI nr
Match: KAG6583348.1 (hypothetical protein SDJN03_19280, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 748.4 bits (1931), Expect = 3.8e-212
Identity = 405/481 (84.20%), Positives = 415/481 (86.28%), Query Frame = 0

Query: 1   MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIR 60
           MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIR
Sbjct: 1   MAANLQFLFRS--GSFGPLFPFSSPTLISRFRPVSVSFNSIFAPIFRSDAFTLRFGTKIR 60

Query: 61  REHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIG 120
           RE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW ETAKRGLDQKLNRKLVH SIG
Sbjct: 61  REQCRVAAVMLLPDNPVVSDICATAVSGGVALSLLRLWTETAKRGLDQKLNRKLVHTSIG 120

Query: 121 LAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKG 180
           LAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKG
Sbjct: 121 LAFMLCWPMFSSGHRGALLASLIPGVNIIRMLVLGLGILKDEATVKSMSRNGDCRELLKG 180

Query: 181 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSV 240
           PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSV
Sbjct: 181 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSRKISYNKNKSLAGSV 240

Query: 241 AMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL 300
           AM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Sbjct: 241 AMVSAGFLASVGYMYYFSSFGYVEGSNRMVLGFLVVSIASALVESLPISTEIDDNLTVPL 300

Query: 301 TSLL------------------------------AYIETRPASRSMSMPNEDNASSQTQL 360
           TSLL                              AYIETRPASRSMSMP+ED+ASS TQL
Sbjct: 301 TSLLVGSLVFYFVGSLFLWICKLCPFCCEFDFISAYIETRPASRSMSMPHEDSASSHTQL 360

Query: 361 SSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANG 420
           S NLFGDLLDSVI D+ASECHRIARLGLDRNLEEEEEELRLSAQAR RVADS NSSEANG
Sbjct: 361 SFNLFGDLLDSVIADVASECHRIARLGLDRNLEEEEEELRLSAQARERVADSCNSSEANG 420

Query: 421 KYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQS 452
           KYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQS
Sbjct: 421 KYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQS 479

BLAST of HG10021801 vs. NCBI nr
Match: KAG6607329.1 (hypothetical protein SDJN03_00671, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 707.6 bits (1825), Expect = 7.5e-200
Identity = 385/483 (79.71%), Positives = 408/483 (84.47%), Query Frame = 0

Query: 20  FPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKN 79
           +P  SP+F SRF  +SVS NS S P  RS +F  RF S    KIRR+  PVAA MLLP N
Sbjct: 12  YPLRSPSFLSRFRTLSVSLNSISAPNLRSGSFVFRFRSNFNLKIRRKQYPVAAAMLLPDN 71

Query: 80  PVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYR 139
           PVVSDICA+ LSG VA SLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSG R
Sbjct: 72  PVVSDICASVLSGAVAFSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGRR 131

Query: 140 GAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFY 199
           GAILASL+PGVNIIRMLV G GI+KDEATVKSM+RYGDYRELLKGPLYYVATITLVCIFY
Sbjct: 132 GAILASLVPGVNIIRMLVFGLGIVKDEATVKSMTRYGDYRELLKGPLYYVATITLVCIFY 191

Query: 200 WRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMY 259
           WRTSPISIALICNLCAGDG+ADI+GRRFG++KI YN+NKS+VGSVAMASAGFLASVGYMY
Sbjct: 192 WRTSPISIALICNLCAGDGVADIIGRRFGTQKISYNKNKSIVGSVAMASAGFLASVGYMY 251

Query: 260 YFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAY-----IETR 319
           YFSSFGYV GS+RMVLGFLVVSLASALVESLPISTEIDDNLTVPLTS L+      IET 
Sbjct: 252 YFSSFGYVEGSSRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSFLSLSTSVCIETL 311

Query: 320 PASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELR 379
           P+SRSMSMP+EDNASS  QLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELR
Sbjct: 312 PSSRSMSMPDEDNASS--QLSSNFFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELR 371

Query: 380 LSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKC 439
           LSAQARVRVADSSNSSEANGKYVVDIFGQ HPSVA+EIFDCMNCGRSI+AGRFAPHLEKC
Sbjct: 372 LSAQARVRVADSSNSSEANGKYVVDIFGQNHPSVASEIFDCMNCGRSIVAGRFAPHLEKC 431

Query: 440 MGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTS 494
           MGRGRKAR KVTRSSTA QSR                            LAG EYSNG S
Sbjct: 432 MGRGRKARLKVTRSSTATQSR----------------------------LAGGEYSNGPS 464

BLAST of HG10021801 vs. NCBI nr
Match: KAG8471435.1 (hypothetical protein CXB51_036412 [Gossypium anomalum])

HSP 1 Score: 610.9 bits (1574), Expect = 9.6e-171
Identity = 326/463 (70.41%), Positives = 369/463 (79.70%), Query Frame = 0

Query: 40  SFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWA 99
           SF+     ++NF L +  K R      AA ML P+N + SD CA  +SG +ALS+LRLW 
Sbjct: 36  SFAVASNTTTNFTL-WSRKPRGNPTSPAAAMLFPQNLLFSDTCAAVISGSIALSVLRLWQ 95

Query: 100 ETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGI 159
           ETAKRGL DQKLNRKLVHISIGL FMLCWP++SSGYRGAILA++ PGVNIIRM+++G G+
Sbjct: 96  ETAKRGLFDQKLNRKLVHISIGLVFMLCWPLYSSGYRGAILAAITPGVNIIRMILIGSGL 155

Query: 160 LKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADI 219
            KDEATVKSMSRYGDYRELLKGPLYY  TITL C FYWRTSPI+IA ICNLCAGDG ADI
Sbjct: 156 WKDEATVKSMSRYGDYRELLKGPLYYATTITLACAFYWRTSPIAIAAICNLCAGDGFADI 215

Query: 220 VGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSL 279
           VGR+FG +K+ YN+NKS+ GSVAMA AGFL SVGYMYYFS FGY+  S  +V GFL+VSL
Sbjct: 216 VGRQFGGQKLPYNKNKSIAGSVAMAIAGFLTSVGYMYYFSYFGYLKESTEIVFGFLIVSL 275

Query: 280 ASALVESLPISTEIDDNLTVPLTSLL------------AYIETRPASR-----SMSMPNE 339
           ASALVESLP+STE+DDNLTV LTS+L             +    P SR      +     
Sbjct: 276 ASALVESLPVSTELDDNLTVTLTSILVGSLFSATVVSAGFCRILPVSRFGIRIGLCRQGY 335

Query: 340 DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVAD 399
              +   +LSS+ FGDLLDS+IVD+ASECHRIA+LGLDRNLEEEEEE+RLS QAR RVAD
Sbjct: 336 CAPNCGVRLSSHFFGDLLDSIIVDVASECHRIAKLGLDRNLEEEEEEMRLSVQARARVAD 395

Query: 400 SSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKV 459
            SNSSE N KYVVDIFGQTHPSVA EIF+CMNCGRSI AGRFAPHLEKCMG+GRKAR KV
Sbjct: 396 PSNSSETNTKYVVDIFGQTHPSVATEIFECMNCGRSIAAGRFAPHLEKCMGKGRKARLKV 455

Query: 460 TRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEE 485
           TRSSTAAQ+RYSRG+PVSAYSPY NSTSTNRLPNGT S+AGEE
Sbjct: 456 TRSSTAAQNRYSRGSPVSAYSPYSNSTSTNRLPNGTPSVAGEE 497

BLAST of HG10021801 vs. ExPASy Swiss-Prot
Match: Q2N2K0 (Probable phytol kinase 3, chloroplastic OS=Glycine max OX=3847 PE=2 SV=1)

HSP 1 Score: 342.0 bits (876), Expect = 1.1e-92
Identity = 193/288 (67.01%), Positives = 226/288 (78.47%), Query Frame = 0

Query: 18  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPV 77
           P F F SP F S+  P  + F SFS+    SS+F   F S       P  + M L  +P+
Sbjct: 37  PTFHFPSP-FLSKPKPTYL-FTSFSSSSSSSSSF---FSST-----TPPRSTM-LHHDPL 96

Query: 78  VSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRG 137
           VSD+ ATA+SGVVALS LRL+ ETAKR L DQKLNRKLVHISIGL FMLC P+FS+    
Sbjct: 97  VSDVYATAISGVVALSFLRLFQETAKRDLFDQKLNRKLVHISIGLIFMLCXPLFSTETWA 156

Query: 138 AILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYW 197
           +  A+LIPG+NI RMLV+G GILKDEATVKSMSR+GDYRELLKGPLYY ATITL  I YW
Sbjct: 157 SFFAALIPGINIFRMLVIGLGILKDEATVKSMSRFGDYRELLKGPLYYAATITLAAIIYW 216

Query: 198 RTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYY 257
           RTSPISIA ICNLCAGDG+ADIVGRR G +KI YN+NKS  GS+AMA+AGFL S+GYM+Y
Sbjct: 217 RTSPISIAAICNLCAGDGMADIVGRRLGGEKIPYNKNKSFAGSIAMATAGFLTSIGYMWY 276

Query: 258 FSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL 305
           FSSFG++ GS ++VLGFL+VS+ +A VESLPISTE+DDNLTVPLTS+L
Sbjct: 277 FSSFGFIEGSWKLVLGFLLVSIVTAFVESLPISTELDDNLTVPLTSIL 313

BLAST of HG10021801 vs. ExPASy Swiss-Prot
Match: Q67ZM7 (Farnesol kinase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=FOLK PE=1 SV=2)

HSP 1 Score: 335.9 bits (860), Expect = 7.8e-91
Identity = 178/289 (61.59%), Positives = 222/289 (76.82%), Query Frame = 0

Query: 18  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPV 77
           P   F SP  R   + ++ SF S       SS F     +KIR+    +AAVM  P+N V
Sbjct: 27  PSLAFFSPIPRFLTVRIATSFRS-------SSRFP---ATKIRKS--SLAAVM-FPENSV 86

Query: 78  VSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRG 137
           +SD+CA  ++ +VA S L  W E  KRG+ DQKL RKLVHI+IGL FMLCWP+FSSG +G
Sbjct: 87  LSDVCAFGVTSIVAFSCLGFWGEIGKRGIFDQKLIRKLVHINIGLVFMLCWPLFSSGIQG 146

Query: 138 AILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYW 197
           A+ ASL+PG+NI+RML+LG G+  DE T+KSMSR+GD RELLKGPLYYV +IT  CI+YW
Sbjct: 147 ALFASLVPGLNIVRMLLLGLGVYHDEGTIKSMSRHGDRRELLKGPLYYVLSITSACIYYW 206

Query: 198 RTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYY 257
           ++SPI+IA+ICNLCAGDG+ADIVGRRFG++K+ YN+NKS  GS+ MA+AGFLASV YMYY
Sbjct: 207 KSSPIAIAVICNLCAGDGMADIVGRRFGTEKLPYNKNKSFAGSIGMATAGFLASVAYMYY 266

Query: 258 FSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA 306
           F+SFGY+  S  M+L FLV+S+ASALVESLPIST+IDDNLT+ LTS LA
Sbjct: 267 FASFGYIEDSGGMILRFLVISIASALVESLPISTDIDDNLTISLTSALA 302

BLAST of HG10021801 vs. ExPASy Swiss-Prot
Match: Q5N9J9 (Probable phytol kinase 2, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=Os01g0832000 PE=2 SV=3)

HSP 1 Score: 288.5 bits (737), Expect = 1.4e-76
Identity = 151/252 (59.92%), Positives = 191/252 (75.79%), Query Frame = 0

Query: 54  RFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNR 113
           R ++   R    +AA +    + +  D+ + A++  VAL+LLR + E AKRG+ +QKLNR
Sbjct: 47  RLVADGSRRKGTMAAAIPPEASGLAHDLGSAAVTAGVALALLRFFEELAKRGVFEQKLNR 106

Query: 114 KLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYG 173
           KLVHI+IG+ F+L WP+FSSG     LA++ PG+NIIRML+LG G++K+EA VKSMSR G
Sbjct: 107 KLVHITIGMVFLLFWPLFSSGSYAPFLAAVAPGINIIRMLLLGLGVMKNEAMVKSMSRSG 166

Query: 174 DYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNR 233
           D RELLKGPLYY  TIT     +WRTSPI+IALICNLCAGDG+ADIVGRR G +K+ YN 
Sbjct: 167 DPRELLKGPLYYATTITFATSIFWRTSPIAIALICNLCAGDGIADIVGRRLGQEKLPYNP 226

Query: 234 NKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEI 293
           NKS  GS+AMA AGF+AS+GYM+YF SFG++  S  +  GFLVVS+ +ALVES PIST +
Sbjct: 227 NKSYAGSIAMALAGFMASIGYMHYFQSFGFIEESWSLAFGFLVVSVTAALVESHPISTHL 286

Query: 294 DDNLTVPLTSLL 305
           DDNLTVPLTS L
Sbjct: 287 DDNLTVPLTSFL 298

BLAST of HG10021801 vs. ExPASy Swiss-Prot
Match: Q94BV2 (SAGA-associated factor 11 OS=Arabidopsis thaliana OX=3702 GN=SGF11 PE=1 SV=1)

HSP 1 Score: 227.6 bits (579), Expect = 3.0e-58
Identity = 115/169 (68.05%), Positives = 139/169 (82.25%), Query Frame = 0

Query: 321 EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVA 380
           EDN SS  QLSS +F DL+DSVI D+ASECHR+ARLGLDR+L+  EEELRLS +AR ++A
Sbjct: 5   EDNKSSHAQLSSQIFLDLVDSVIADVASECHRVARLGLDRDLDIVEEELRLSVEARAKIA 64

Query: 381 DSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPK 440
           D SN+ E N KYVVDIFGQTHP VA+E+F+CMNCGR I+AGRFAPHLEKCMG+GRKAR K
Sbjct: 65  DPSNNLETNTKYVVDIFGQTHPPVASEVFNCMNCGRQIVAGRFAPHLEKCMGKGRKARAK 124

Query: 441 VTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT 490
            TRS+TAAQ+R +R +P   YSPYPNS S N+L +G+  +AGE+ SN T
Sbjct: 125 TTRSTTAAQNRNARRSPNPRYSPYPNSASENQLASGSPGVAGEDCSNFT 173

BLAST of HG10021801 vs. ExPASy Swiss-Prot
Match: Q2N2K1 (Probable phytol kinase 1, chloroplastic OS=Glycine max OX=3847 PE=2 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 2.0e-46
Identity = 98/213 (46.01%), Positives = 142/213 (66.67%), Query Frame = 0

Query: 93  SLLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRM 152
           +L+R + E  +R  L Q L+RKLVHI  GL F++ WP+FS+  +    A+ +P VN +R+
Sbjct: 81  ALVRAFDELTRRNILQQGLSRKLVHILSGLLFLVSWPIFSNSPKARYFAAFVPLVNCLRL 140

Query: 153 LVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCA 212
           LV G  +  DE  +KS++R GD  ELL+GPLYYV  + L  + +WR SPI +  +  +CA
Sbjct: 141 LVNGLSLASDEGLIKSVTREGDPLELLRGPLYYVLILILSALVFWRESPIGVISLAMMCA 200

Query: 213 GDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVL 272
           GDG+ADI+GRR+GS KI YN +KSL GS++M   GFL S+G +YY+S  G+V       L
Sbjct: 201 GDGIADIIGRRYGSMKIPYNEHKSLAGSMSMLVFGFLVSIGMLYYYSVLGHVQLDWASTL 260

Query: 273 GFLV-VSLASALVESLPISTEIDDNLTVPLTSL 304
             +  +S  + LVESLPI+  +DDN++VPL ++
Sbjct: 261 PRVAFISFVATLVESLPITKVVDDNISVPLATM 293

BLAST of HG10021801 vs. ExPASy TrEMBL
Match: A0A5D3BBT0 (SAGA-associated factor 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G00760 PE=3 SV=1)

HSP 1 Score: 754.6 bits (1947), Expect = 2.6e-214
Identity = 390/418 (93.30%), Positives = 401/418 (95.93%), Query Frame = 0

Query: 70  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPM 129
           ML P+NPVVSDICATALS  VALSLL+LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPM
Sbjct: 1   MLFPENPVVSDICATALSSGVALSLLQLWVETAKRGLDQKLNRKLVHISIGLAFMLCWPM 60

Query: 130 FSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATIT 189
           FSSGY+GAI ASLIPG N+IRMLVLGFGILKDEAT+KSMSRYGDYRELLKGPLYYVATIT
Sbjct: 61  FSSGYQGAIFASLIPGANVIRMLVLGFGILKDEATLKSMSRYGDYRELLKGPLYYVATIT 120

Query: 190 LVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLA 249
           LVCIFYWRTSPISIALICNLCAGDG ADIVGRRFGS+KIFYN NKSL GSVAMA+AGFLA
Sbjct: 121 LVCIFYWRTSPISIALICNLCAGDGFADIVGRRFGSEKIFYNENKSLAGSVAMATAGFLA 180

Query: 250 SVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIET 309
           S+GYMYYFSSFGYV  S  M L FL+VSLASALVESLPISTE+DDNLTVPLTSLLAYIET
Sbjct: 181 SIGYMYYFSSFGYVEASIGMALRFLIVSLASALVESLPISTELDDNLTVPLTSLLAYIET 240

Query: 310 RPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL 369
             ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEEL
Sbjct: 241 SSASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEEL 300

Query: 370 RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEK 429
           RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEK
Sbjct: 301 RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEK 360

Query: 430 CMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN 488
           CMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN
Sbjct: 361 CMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSN 418

BLAST of HG10021801 vs. ExPASy TrEMBL
Match: A0A6J1HM59 (farnesol kinase, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111464846 PE=3 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 4.4e-137
Identity = 266/304 (87.50%), Positives = 273/304 (89.80%), Query Frame = 0

Query: 1   MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIR 60
           MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIR
Sbjct: 1   MAANLQFLFRS--GSFGPLFPFSSPTLISRFRPVSVSFNSIFAPIFRSDAFTLRFGTKIR 60

Query: 61  REHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIG 120
           RE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW ETAKRGLDQKLNRKLVH SIG
Sbjct: 61  REQCRVAAVMLLPDNPVVSDICATAVSGGVALSLLRLWTETAKRGLDQKLNRKLVHTSIG 120

Query: 121 LAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKG 180
           LAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKG
Sbjct: 121 LAFMLCWPMFSSGHRGALLASLIPGVNIIRMLVLGLGILKDEATVKSMSRNGDCRELLKG 180

Query: 181 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSV 240
           PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSV
Sbjct: 181 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSRKISYNKNKSLAGSV 240

Query: 241 AMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL 300
           AM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Sbjct: 241 AMVSAGFLASVGYMYYFSSFGYVEGSNRMVLGFLVVSVASALVESLPISTEIDDNLTVPL 300

Query: 301 TSLL 305
           TSLL
Sbjct: 301 TSLL 302

BLAST of HG10021801 vs. ExPASy TrEMBL
Match: A0A6J1I3Z6 (farnesol kinase, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111469454 PE=3 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 8.3e-136
Identity = 264/304 (86.84%), Positives = 271/304 (89.14%), Query Frame = 0

Query: 1   MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIR 60
           MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFR   F LRF +KIR
Sbjct: 1   MAANLQFLFHS--GSFGPSFPFSSPTLISRFRPVSVSFNSIFAPIFRFDAFTLRFGTKIR 60

Query: 61  REHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIG 120
           RE C VAA MLLP NPVVSDICATA+SG VALSLLRLW ETAKRGLDQKLNRKLVH SIG
Sbjct: 61  RERCRVAAAMLLPDNPVVSDICATAVSGGVALSLLRLWTETAKRGLDQKLNRKLVHTSIG 120

Query: 121 LAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKG 180
           LAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKG
Sbjct: 121 LAFMLCWPMFSSGHRGALLASLIPGVNIIRMLVLGLGILKDEATVKSMSRNGDCRELLKG 180

Query: 181 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSV 240
           PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSV
Sbjct: 181 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSRKISYNKNKSLAGSV 240

Query: 241 AMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL 300
           AM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Sbjct: 241 AMVSAGFLASVGYMYYFSSFGYVEGSNRMVLGFLVVSIASALVESLPISTEIDDNLTVPL 300

Query: 301 TSLL 305
           TSLL
Sbjct: 301 TSLL 302

BLAST of HG10021801 vs. ExPASy TrEMBL
Match: D7MQE9 (SAGA-associated factor 11 OS=Arabidopsis lyrata subsp. lyrata OX=81972 GN=ARALYDRAFT_332145 PE=3 SV=1)

HSP 1 Score: 485.3 bits (1248), Expect = 2.9e-133
Identity = 279/507 (55.03%), Positives = 347/507 (68.44%), Query Frame = 0

Query: 3   TILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRF-LSKIRR 62
           T L    CS + S     P S  N    F P+      F T  FRSS+   RF  ++IR+
Sbjct: 5   TKLSVLCCSFISSPLVDSPPSLANIPRFFSPIP----RFLTTSFRSSS---RFPATEIRK 64

Query: 63  EHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRG-LDQKLNRKLVHISIG 122
                +   L   +      CA  ++ +VA S L  W E  KR  LDQKL RKLVHI+IG
Sbjct: 65  -----STRSLTTSSSTRRHACAFGITSIVAFSCLGFWGEIGKRDLLDQKLIRKLVHINIG 124

Query: 123 LAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKG 182
           L FMLCWP+FSSG +GA+ ASL+PG+NIIRML+LG G+  DE T+KSMSR+GD RELLKG
Sbjct: 125 LVFMLCWPLFSSGIQGALFASLVPGLNIIRMLLLGLGVYHDEGTIKSMSRHGDRRELLKG 184

Query: 183 PLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSV 242
           PLYY  +IT  CIFYW++SPI+IA+ICNLCAGDG+ADIVGRRFG++K+ YN+NKS  GS+
Sbjct: 185 PLYYALSITSACIFYWKSSPIAIAVICNLCAGDGMADIVGRRFGTEKLPYNKNKSFAGSI 244

Query: 243 AMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPIST---------- 302
            MA+AGFLASVGYMYYF+SFGY+  S  M+L FL++SLASALV  + +S           
Sbjct: 245 GMATAGFLASVGYMYYFASFGYIEDSGGMILRFLIISLASALVGLVTVSAFQTRKQYKEQ 304

Query: 303 -----EIDDNLTVPLTSL-LAYIETRPASRSMSMPNE--DNASSQTQLSSNLFGDLLDSV 362
                  D N    LT   +  + +      +S+P      A+    LSS +F DL+DSV
Sbjct: 305 EKDGKPDDKNYRQDLTMHGVVSVSSHDWLEKLSLPPPPFSVATRLYSLSSQVFLDLVDSV 364

Query: 363 IVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHP 422
           I D+ASECHR+ARLGLDR+LE  EEELRLS +AR +VAD SN+ E N K+VVDIFGQTHP
Sbjct: 365 IADVASECHRVARLGLDRDLEVVEEELRLSVEARAKVADPSNNLETNTKFVVDIFGQTHP 424

Query: 423 SVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYS 482
            VA E+F+CMNCGR I+AGRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R +P   YS
Sbjct: 425 PVATEVFNCMNCGRQIVAGRFAPHLEKCMGKGRKARAKTTRSTTAAQNRNARRSPNPRYS 484

Query: 483 PYPNSTSTNRLPNGTSSLAGEEYSNGT 490
           PYPNS S N+L +G+  +AGE+ SNGT
Sbjct: 485 PYPNSASENQLASGSPGVAGEDCSNGT 499

BLAST of HG10021801 vs. ExPASy TrEMBL
Match: A0A6J1DBM3 (probable phytol kinase 3, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018992 PE=3 SV=1)

HSP 1 Score: 471.5 bits (1212), Expect = 4.4e-129
Identity = 249/308 (80.84%), Positives = 265/308 (86.04%), Query Frame = 0

Query: 1   MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS--- 60
           MA ILQ RF SS+G F P F    P F  +F PVSVSFN  S P      F LRF S   
Sbjct: 1   MAAILQVRFRSSIGLFDPSFSARFPKFLPQFKPVSVSFNPISAPTLCCHRFVLRFGSTSA 60

Query: 61  -KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVH 120
            KIRR   PVAAVMLLP NPVVSDICATA++G +ALSLLRLW ETAKRGLDQKLNRKLVH
Sbjct: 61  PKIRRNQYPVAAVMLLPDNPVVSDICATAVAGGIALSLLRLWQETAKRGLDQKLNRKLVH 120

Query: 121 ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRE 180
           ISIGLAFMLCWPMFSSG RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSRYGDYRE
Sbjct: 121 ISIGLAFMLCWPMFSSGQRGALLASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRE 180

Query: 181 LLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSL 240
           LLKGPLYYV TITL CI YWRTSPISIAL+CNLCAGDGLAD++GRRFGS+KI YN+NKSL
Sbjct: 181 LLKGPLYYVTTITLACIIYWRTSPISIALVCNLCAGDGLADVIGRRFGSRKISYNKNKSL 240

Query: 241 VGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNL 300
            GSVAMASAGFLASVGYMYYFSSFGY+ GS+RM+LGFLVVS+ASALVESLPISTEIDDNL
Sbjct: 241 AGSVAMASAGFLASVGYMYYFSSFGYLEGSSRMILGFLVVSVASALVESLPISTEIDDNL 300

Query: 301 TVPLTSLL 305
           +VPLTSLL
Sbjct: 301 SVPLTSLL 308

BLAST of HG10021801 vs. TAIR 10
Match: AT5G58560.1 (Phosphatidate cytidylyltransferase family protein )

HSP 1 Score: 335.9 bits (860), Expect = 5.5e-92
Identity = 178/289 (61.59%), Positives = 222/289 (76.82%), Query Frame = 0

Query: 18  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPV 77
           P   F SP  R   + ++ SF S       SS F     +KIR+    +AAVM  P+N V
Sbjct: 27  PSLAFFSPIPRFLTVRIATSFRS-------SSRFP---ATKIRKS--SLAAVM-FPENSV 86

Query: 78  VSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRG 137
           +SD+CA  ++ +VA S L  W E  KRG+ DQKL RKLVHI+IGL FMLCWP+FSSG +G
Sbjct: 87  LSDVCAFGVTSIVAFSCLGFWGEIGKRGIFDQKLIRKLVHINIGLVFMLCWPLFSSGIQG 146

Query: 138 AILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYW 197
           A+ ASL+PG+NI+RML+LG G+  DE T+KSMSR+GD RELLKGPLYYV +IT  CI+YW
Sbjct: 147 ALFASLVPGLNIVRMLLLGLGVYHDEGTIKSMSRHGDRRELLKGPLYYVLSITSACIYYW 206

Query: 198 RTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYY 257
           ++SPI+IA+ICNLCAGDG+ADIVGRRFG++K+ YN+NKS  GS+ MA+AGFLASV YMYY
Sbjct: 207 KSSPIAIAVICNLCAGDGMADIVGRRFGTEKLPYNKNKSFAGSIGMATAGFLASVAYMYY 266

Query: 258 FSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA 306
           F+SFGY+  S  M+L FLV+S+ASALVESLPIST+IDDNLT+ LTS LA
Sbjct: 267 FASFGYIEDSGGMILRFLVISIASALVESLPISTDIDDNLTISLTSALA 302

BLAST of HG10021801 vs. TAIR 10
Match: AT5G58575.1 (CONTAINS InterPro DOMAIN/s: Sgf11, transcriptional regulation (InterPro:IPR013246); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 227.6 bits (579), Expect = 2.1e-59
Identity = 115/169 (68.05%), Positives = 139/169 (82.25%), Query Frame = 0

Query: 321 EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVA 380
           EDN SS  QLSS +F DL+DSVI D+ASECHR+ARLGLDR+L+  EEELRLS +AR ++A
Sbjct: 5   EDNKSSHAQLSSQIFLDLVDSVIADVASECHRVARLGLDRDLDIVEEELRLSVEARAKIA 64

Query: 381 DSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPK 440
           D SN+ E N KYVVDIFGQTHP VA+E+F+CMNCGR I+AGRFAPHLEKCMG+GRKAR K
Sbjct: 65  DPSNNLETNTKYVVDIFGQTHPPVASEVFNCMNCGRQIVAGRFAPHLEKCMGKGRKARAK 124

Query: 441 VTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT 490
            TRS+TAAQ+R +R +P   YSPYPNS S N+L +G+  +AGE+ SN T
Sbjct: 125 TTRSTTAAQNRNARRSPNPRYSPYPNSASENQLASGSPGVAGEDCSNFT 173

BLAST of HG10021801 vs. TAIR 10
Match: AT5G04490.1 (vitamin E pathway gene 5 )

HSP 1 Score: 181.4 bits (459), Expect = 1.7e-45
Identity = 98/234 (41.88%), Positives = 147/234 (62.82%), Query Frame = 0

Query: 75  NPVVSDICAT--ALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSS 134
           N ++ D+ AT   L G  AL +L   + T +  + Q L+RKLVHI  GL F+L WP+FS 
Sbjct: 65  NSLLHDVGATVAVLGGAYAL-VLSFESLTKRNVIQQSLSRKLVHILSGLLFVLAWPIFSG 124

Query: 135 GYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVC 194
                  A+ +P VN +R+++ G  I  +   +KS++R G   ELLKGPL+YV  +    
Sbjct: 125 STEARYFAAFVPLVNGLRLVINGLSISPNSMLIKSVTREGRAEELLKGPLFYVLALLFSA 184

Query: 195 IFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVG 254
           +F+WR SPI +  +  +C GDG+ADI+GR+FGS KI YN  KS  GS++M   GF  S+ 
Sbjct: 185 VFFWRESPIGMISLAMMCGGDGIADIMGRKFGSTKIPYNPRKSWAGSISMFIFGFFISIA 244

Query: 255 YMYYFSSFGYV-VGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA 306
            +YY+SS GY+ +     +    +VS+ + +VESLPI+ ++DDN++VPL ++LA
Sbjct: 245 LLYYYSSLGYLHMNWETTLQRVAMVSMVATVVESLPITDQLDDNISVPLATILA 297
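
The percentages in each HSP summary line above are the match counts divided by the alignment length. A minimal Python sketch, assuming the summary line keeps the `label = matches/length (percent%)` layout shown in this record (the function name `hsp_percentages` is hypothetical and not part of any BLAST tool), recomputes them:

```python
import re


def hsp_percentages(summary_line):
    """Recompute the percentages reported in a BLAST HSP summary line.

    Returns a list of (label, matches, length, percent) tuples, one for
    each "label = matches/length (" field found in the line.
    """
    # Fields without a matches/length ratio (e.g. "Query Frame = 0") are skipped.
    pattern = re.compile(r"(\w+) = (\d+)/(\d+) \(")
    results = []
    for label, matches, length in pattern.findall(summary_line):
        percent = round(100 * int(matches) / int(length), 2)
        results.append((label, int(matches), int(length), percent))
    return results


# Summary line of the AT5G04490.1 hit, taken from the record above.
line = "Identity = 98/234 (41.88%), Positives = 147/234 (62.82%), Query Frame = 0"
for label, matches, length, percent in hsp_percentages(line):
    print(f"{label}: {matches}/{length} = {percent:.2f}%")
```

The pattern matches any word as the label, so it does not depend on how the label is spelled in the source line.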

The following BLAST results are available for this feature:
Match Name      E-value     Identity (%)   Description
KAG7019118.1    2.8e-239    90.47          Farnesol kinase, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]
KAA0060025.1    5.4e-214    93.30          farnesol kinase [Cucumis melo var. makuwa] >TYJ97282.1 farnesol kinase [Cucumis ...
KAG6583348.1    3.8e-212    84.20          hypothetical protein SDJN03_19280, partial [Cucurbita argyrosperma subsp. sorori...
KAG6607329.1    7.5e-200    79.71          hypothetical protein SDJN03_00671, partial [Cucurbita argyrosperma subsp. sorori...
KAG8471435.1    9.6e-171    70.41          hypothetical protein CXB51_036412 [Gossypium anomalum]

Match Name      E-value     Identity (%)   Description
Q2N2K0          1.1e-92     67.01          Probable phytol kinase 3, chloroplastic OS=Glycine max OX=3847 PE=2 SV=1
Q67ZM7          7.8e-91     61.59          Farnesol kinase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=FOLK PE=1 SV=2
Q5N9J9          1.4e-76     59.92          Probable phytol kinase 2, chloroplastic OS=Oryza sativa subsp. japonica OX=39947...
Q94BV2          3.0e-58     68.05          SAGA-associated factor 11 OS=Arabidopsis thaliana OX=3702 GN=SGF11 PE=1 SV=1
Q2N2K1          2.0e-46     46.01          Probable phytol kinase 1, chloroplastic OS=Glycine max OX=3847 PE=2 SV=1

Match Name      E-value     Identity (%)   Description
A0A5D3BBT0      2.6e-214    93.30          SAGA-associated factor 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffo...
A0A6J1HM59      4.4e-137    87.50          farnesol kinase, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC11146484...
A0A6J1I3Z6      8.3e-136    86.84          farnesol kinase, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111469454 PE=3 ...
D7MQE9          2.9e-133    55.03          SAGA-associated factor 11 OS=Arabidopsis lyrata subsp. lyrata OX=81972 GN=ARALYD...
A0A6J1DBM3      4.4e-129    80.84          probable phytol kinase 3, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111...

Match Name      E-value     Identity (%)   Description
AT5G58560.1     5.5e-92     61.59          Phosphatidate cytidylyltransferase family protein
AT5G58575.1     2.1e-59     68.05          CONTAINS InterPro DOMAIN/s: Sgf11, transcriptional regulation (InterPro:IPR01324...
AT5G04490.1     1.7e-45     41.88          vitamin E pathway gene 5
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term    IPR Description              Source       Source Term     Source Description               Alignment
None        No IPR available             GENE3D       3.30.160.60     Classic Zinc Finger              coord: 408..436; e-value: 1.0E-9; score: 39.9
None        No IPR available             MOBIDB_LITE  mobidb-lite     disorder_prediction              coord: 442..487
None        No IPR available             MOBIDB_LITE  mobidb-lite     disorder_prediction              coord: 434..493
None        No IPR available             PANTHER      PTHR32523:SF7   FARNESOL KINASE, CHLOROPLASTIC   coord: 67..305
IPR013246   SAGA complex, Sgf11 subunit  PFAM         PF08209         Sgf11                            coord: 406..436; e-value: 2.6E-16; score: 58.9
IPR039606   Phytol/farnesol kinase       PANTHER      PTHR32523       PHYTOL KINASE 1, CHLOROPLASTIC   coord: 67..305

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
HG10021801.1    HG10021801.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0048440       carpel development
biological_process   GO:0006325       chromatin organization
biological_process   GO:0016487       farnesol metabolic process
biological_process   GO:0016310       phosphorylation
biological_process   GO:0009737       response to abscisic acid
cellular_component   GO:0031969       chloroplast membrane
cellular_component   GO:0016021       integral component of membrane
cellular_component   GO:0070461       SAGA-type complex
molecular_function   GO:0052668       CTP:farnesol kinase activity
molecular_function   GO:0052670       geraniol kinase activity
molecular_function   GO:0052671       geranylgeraniol kinase activity
molecular_function   GO:0016301       kinase activity
molecular_function   GO:0046872       metal ion binding