HG10021418 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10021418
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: homeobox-leucine zipper protein HOX32-like
Location: Chr05: 8892901 .. 8899838 (+)
RNA-Seq Expression: HG10021418
Synteny: HG10021418
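
The Location field gives the start and end coordinates of the gene on Chr05. The following is a minimal sketch of how the genomic span follows from them. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly), and the function name is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end coordinate must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of HG10021418.
print(feature_length(8892901, 8899838))  # 6938
```

Under a 0-based, half-open convention (as in BED files) the `+ 1` would be dropped, so the result is only as reliable as the assumption about the coordinate system.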
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCTTGTCATGCATAGAGATTCCTTGAATAAGCAGATGGATACTAGCAAGTATGTTAGATATACGCCTGAACAAGTCGATGCATTAGAAAGGGTTTATGCAGAATGCCCAAAACCCAGTTCGTTGAGGAGGCAACAGCTCATTCGGGAGTGTCCTATTCTCTCTAATATTGAGCCTAAGCAGATCAAAGTTTGGTTCCAAAATCGCAGGTAAATGGCTTAAGCTCTGAGATTAATGGAAATTTGACAGTGGAATTTCTTCTTCCTGTTCTTTGACGCTTTCAAGCTTGTGGCTCTTGACAGAACGCATGCCATTTGGTTTTTTTTTTTTTTTTCCTCTCTTCTTTTATCTGTCTAGGAAGTGAACGTTGAAGAGCTGGAAGGATTTTGTGGCATTCATTTAAGTTTGTTTATGTTTTTCTTGTCTTCTCTGTTTTCATAACATATTTTTCCTCCATTTTCCTGAAAATCGAAACTGTATCAACAAGAGAATGATAAATGGTATCTGGGTTTAGCTTTGCAAGGCTAATTGTGTACATTCTTGTCGAGATAGTTTGTATCTAACTTTTTAGATCTTGTCGAATGCATACAAGAAATGTGTCTGCTTTAGTAGGTATTCTTTGATTCTATATATTTATAAAAAGCATAGATTTATTGATTAATAAAATAAAAGCTTCCCTTTTCAAATTTCATACATGATTTGTGTTTCTAGCTTGCTTGTCATTTGATTTTTTTTAATAGTTTCTTTTGAGTTTTTCTCTGTTAAAAAATACTTGGTATGGAAGCTTTCTATTGTAGGCGTGCAGTGAAGCCTTGTATCCTAATTATTTTGTCTCTCTTTCATTTTTGACTTTGTATTATTTACTTTTCTTGCATTCTGTTTTGGGAAATCTTGTATTGCACTTTAGCTCCTCAACCCCACCACACCCTCCACTTTTTCCCACTTCTATTCTGAGGACAATGTTCTTGGAATTATTTTCTTTGAATAAGTTTATCTATGATATTGTTGTCTGCTCTGGAAAAATGTTTAGTGTTTTCCCACAGTGTTGCATCTACTTGAAAGATAATCTGATCTAATCAAACTTTGTTTAATTTTCTTGTGTCTTTTCGCTTTTTATTTTATTAATATCATCATCATCATCATTACTATTTTTGGGGGAGGGGGAAGTCCTTTTGCAAACATGTGCAAGTTTGACTGACATGAAAATTCTTTAATGGATTTTCTATGTTGTCTCGGACATTTTGATGGTTTGATATTAGTCAAACGTTTTGAAATCTGCTATTATCAATTTTCTAGTTAGGACTTTATTTCTCCATTTCTTTAAGACTTGAATTTTTCATTTGGTAGGAAGAATAAAATGAATTTTATTTGGTCATGCTTGGGAAAACTTCAGTTTAGTTGCAATTTTAGACATTTATCCAACTAGAAAATAAGCTTGTCTTCCATCTTTGGTTCCAACAATATTTCCTAGTGCAGGTTGCACTATCATTTTTTAGTGGGGTGGTTATTCAACCGCTTTGTAATTAACAAAGTTTAGTTGATTTTAAGAGTCCTTTTCAAGTTCAAATGTTTCCAAATCTATAAGTTTAGTTGAAAGCCATAGTGATTCGTCGTGCATTATTATAGTCGTAATTCTACTGGACTTAGAATGTAACCCCGTTTAAAGTTTTTAATAGCAATTACTATATCAGGAAATATGATTAGTCTATAAGAAAGAATGATCTTTACACACTGTCTCAGCGTCTTCAATTATTTAAATGAACATAACTTTATCCTACTAATTATTTAGTTTTTAATGAACATTTGTCACCAATTTGACCGGGCAAGAAGGAATCCTTTCCTACCCTTTTATGTTCCTAAGCAAAAATCACAATATCTCCAAGGCTAGACTTATTATATGTTGTATTGAAAATGCAAAAAGGAGGATCTAAGGGTTTTTTTTTTTTTTTTTTTGTAATTTATTTATTTATTGATAAGTCGATGTTTATATTGTTAGA
TGTCGTGAAAAGCAAAGGAAGGAAGCTTCTCGTCTTCAAACAGTTAATAGGAAGCTAACTGCGATGAACAAGCTGTTGATGGAAGAGAACGACCGGTTGCAGACGCAGGTCTCTCGTTTGGTTTACGAGAACGGATACATGAGACAGCAACTGCATACTGTGAGTTCCTTTAAGCTTCTTTATGTCTGTTTGGATCCGTTTAAGTTTTTATGGAATCCACTTGACCGAATTCATTGAATTGGTAATTCTATTACTCATCTGAATAATGGATCAAAACTTGAATAGAACAGAATTTAGTTGAATTTTATTGTTCAGAATCGGGACGGTAATATTCATTTAAGAAATCTCCAAGTTTGTCCTTGTAGAAAAACTTCCATGAGATATATATTTTACCACATTTATCTCATTTGAACTGCCTTATTTCTCCATATTATATCATTCCTCAGTGTGAATTTGTTTACTTCCTTAATTTTCTAAACTCTTTTTCCATGATCTTGGTTTGATTACTAATCAGATCAGACCATACCCTTATTGGTTTTGATACTCTAAAAAGTATTTTAGAATTCCAAAATTACAAATATCTACTTGGTTGTCCTACTGATTCTTGATTTTATGAGTTTCCATGAATTGTGTGGTGTTTATTTTCAGGCATCTGGAACAACCACAGACAATAGCTGTGAGTCTGTGGTCATGAGTGGTCAACAGCATCAACAGCAAAACCCAACTAAGCATACTCAAAAGGATGCTAATAACCCAGCTGGGTAAGCTCACGTTCTGAAACTAACAGTTTCTATGACTTGTTTTGTTTCGAGTACAATTTATTACTTAGTCTCATCAATTCATATTTCTATTCTTTCTGTTGACAGTCTTCTTGCAATAGCCGAGGAGACCTTGGCAGAGTTCCTTTCCAAGGCAACAGGAACTGCTGTCGACTGGGTACAAATGATTGGGATGAAGGTATGTCTGAGAATCGTTCATGACCAAAATTGATGGTTGAAGGTCGTAGGGAGTATTTCAACAGTATTCTTTTATCTGTTCAAAAGTTTGATTTCTTTGTACCTTATGCAGCCTGGTCCGGATTCAATTGGGATTGTTGCTGTTTCCCGCAATTGCAATGGGGTAGCAGCACGAGCCTGTGGTTTAGTGAGTCTTGAGCCCATGAAGGTACTTTGGATCTTACAATGGACCTCTATTCAGTTTCGTGTATCTGCCTGTATGTTAAAAATATATATTTATTGCAGGTGGCCGAAATTCTCAAAGATCGTCTGTCATGGTTTCGTGACTGCCGTTGTGTTGATGTACTAAGTGCAATCTCTACAGGAAATGGGGGGACCATAGAACTCTTATATATGCAGGTATCCTCTTATCATTGTGTTCTACAAAAATTGGTTTTTGTTATCCTATTTAGCCTTTTTTGCTTTTCAAACCAGACATATGCACCTACAACTTTGGCCGCAGCGCGTGACTTTTGGACTCTTCGATACACTACAAGCTTGGAAGATGGCAGCCTTGTGGTAAATATTTCTCCTGAACATAGTTACTCTCTTTATGTTTTGATGCCCATGCATTATGTAATTTTTGAGCTCTTATGGTTTTACTCTATGGAGCAAAATAAAAGGCCACATATATTATTTTCAACATGTAGCAATGTAGTTAGTCATTCATACTTGACAACTCATATTTCAATGATCACTTATTTGCAGATATGTGAGAGATCGTTAACGTCATCTACTGGTGGCCCATCAGGGCCCCCTCCATCCAGTTTCATAAGAGCTGAAATGCTTCCAAGCGGGTATCTTATTCGAGCTTGTGAGGGTGGTTCCCTTATTCACATTGTCGATCATGTTGATTTAGATGTAAGTATGCTCTCTGATTGTTTCAAAATAAATAAATAACTAAATAAATGCTTGCTGATGATTTATCTGTTTTCTCCCATGGTCAAATCAATATTATATCGTTCTATCTCCCACTGTATCGTAAAATGTTTTCTTCACGGTCGG
TCAGTTTTGGAGTGTTCCTGAAGTTCTAAGGCCACTTTATGAATCAACAAAGATTCTAGCTCAGAGAATGACTGTTGCTGTAAGTATTGTTGCTTGATAGTTCATCCTTTCCTTGTGTCATATGTGTTCACAATATTGTGCTTAACAGGCCTTACGCTACGTTCGGCAAATTGCTCAAGAGGCGAGTGGAGAAATTCAATTGGGTGGTGGCCGCCAACCGGCCGTTTTAAGGACTTTCAGTCAAAGACTTTGTAGGTAAGGCGCTAAACTTATTGTCTCAAAGTTAGGCCATGCTGGACATTTTTGGTTAAGGCTTGACGTGTTTGAACTTTGAAGAATGCCTCCAAACTCTAAACTTTAAGTTCAAACTTCCAAGTTGAAAATTTATTAACTTAAAATTCTCGTTCTCTTGAAGCTAGTTTTTGGATGGAAAAACCTCTACAGATGAGTGGAGTGGACTATGAGAGTGGGTTGTCCAATATCTTGATTGTCAAAACATTATATGCCCTCATGTCTCGCAATTTTAAAATTCTCTTTCTGGGACTTAGTTGTTTGTGTTTTCGTGACACCAATCTTATTAATACACCCTTTTTTTTTTTAATTACATTAGTTTTGTTACTTATAGTTTATGGAATTGGTTTTAGGGTTTTCAATGATGCAGTAAATGGATTTGTGGATGATGGTTGGTCGCTTATGGATAGCGATGGAGTAGAGGATGTGACAGTTGTTATTAATTCATCTCCAAACAAATTCCTCGGTTCCCAATATAACACATCATTGTATCCTACTTTTGGAGGTGTATTGTGTGCCAAGGCGTCAATGTTGCTCCAGGTACTCTATTAGTCTATTGTATCCTTCAAATGCTAGGCTAGTTGACATTAAGTATGAACTTTTCAACTACTTTGTGGTTCAGAATGTCCCCCCTGCTTTGCTTGTTCGTTTCCTACGGGAGCATCGTTCAGAATGGGCTGATTATGGGGTTGATGCATACTCTGCTGCATGTCTTAAAGCCAGTGCATATGCTGTTCCGTGTGCGAGACCCGGTGGCTTCCCTAGTGGTCAAGTCATTTTACCACTTGCACATACAGTTGAGAATGAGGAGGTAACAACCTTTTATTTTATCAGCATTTTTTTCCCTTGAAGTTTATTCACCTTATGCTATTCTTGTGTAGTTCCTAGAGGTCGTTCGGTTAGAGGGCCACGCTATGTTCCCTGAAGAAGCTGCTTTAGGAGGACGGGATATGTATTTATTGCAGGTTAGTTTTGCTTCATCTACCAGGAGCTCTTTTATAGATGTCTTTGATTTCCATTGAAAATCGTTGATCAAACGCTTCTTTGCAGCTGTGTAGTGGTGTTGATGAAAATACAGTTGGTGCCTGTGCTCAGCTTGTATTTGCACCTATAGACGAATCATTTGCTGATGATGCTCCCTTGCTGCCATCTGGTTTTCGTGTCATACCATTGGAGTCCAAAACAGTTTGTGACTTTCCTTTCTGCTTTCATTCGAAAATTTGCAATTTCAATAGAATATGAACATCTTTAAAGGCCACTTTGCAACTTGAAAGCTCAGATCGGAGGAACTGACTTTTTGCACAATCTTTGTTCATGTAGTATATCTTCTCTTTTGTATGTTGTAGGATGCGCCTGCAGCTACTCGAACTTTGGATTTGGCATCTACTCTAGAAGTCCGACCTGGAACTACCCGCCCAGGAGGTGAAACTGATGTTACAAACTACAACCTCAGGTCAGTCTTGACTATTGCATTCCAATTCACTTTCGAGAACCACATGCGGGATAGTGTGGCTGCTATGGCGAGACAATATGTGCGGACTGTTGTGGGTTCAGTTCAAAGGGTTGCCATGGCTATCGCCCCCTCACAGCTTGGCTCCCAAATCGGTCCAAAAAGCCTTCCCGGCTCCCCAGAGGCTCTCACATTAGCTCAGTGGATCGCACGAAGCTATAGGTATGACTTCTTATCTTTAACTGTTACATTCTCAATG
AAGTGTATTGTTTGGATCTATATCATTGAGAATTAAAGCACAGAATTGTCAATATTTGTCCCTCCTTTTCACGCAAGGGCTAATGAGACCATCGACGTCATGTATCCGAAACCCAACCCCTTGAACGTCATGTATCCGAAACCCGACCCCTTGAATTTTTAACCTTGCACAATATTTATTCAAGTTTTGTGCTCTTGGTGTCCCACACCTGTCTTTTTATTCTTTGGTTCTTATAACATATGTTGAATTTCAGGATCCACTCAGGAACAGAGCTCTTTCAGGTTGAATCCCAATCTGATGCTATCTTGAAGCAGCTTTGGCACCACTCAGATACAATCTTGTGCTGCTCTGTCAAAACTAATGTAATAAGAACTGTCTTCTTCTCTTCCATTTCATGAATTTCATGTTTGTTCGGTCTACGACTTAGCAGTTTATTCAAACTACTCGATGTCTGGCTAGGATAAACATAATTCTGCTATGGTTTTAAAGCCGATGCAGCAAATACTTACACGATCAATGAACTTTGCAGGCATCTCCTGTTTTCACATTTGCGAACCAAGCTGGACTTGACATGCTCGAAACCACCCTTGTTTCTCTTCAAGATATAACACTCGACAAGATCCTCGATGATGCTGGTCGGAAGATACTGTGCTCCGAATTCTCCAAGATAATGCAACAGGTAGATTTCTGTTCCTATTGAACTGCACTTGCAGTTGACTTCATTTTAATGCTGCAACTGCTCTTAGTCTACTCGAGTACGTAAAATCTAAAGGTGTACGATCATTATGCAGGGATTTGCATACCTTCCAGCAGGGATATGCGTCTCGAGTATGGGAAGGCCGGTGTCGTATGAACAGGCAATTGCATGGAAAGTTCTGAATGATGATGATGTCCATCACTGTCTGGCTTTCATGTTTGTTAACTGGTCCTTTATGTGA

mRNA sequence

ATGGCTCTTGTCATGCATAGAGATTCCTTGAATAAGCAGATGGATACTAGCAAGTATGTTAGATATACGCCTGAACAAGTCGATGCATTAGAAAGGGTTTATGCAGAATGCCCAAAACCCAGTTCGTTGAGGAGGCAACAGCTCATTCGGGAGTGTCCTATTCTCTCTAATATTGAGCCTAAGCAGATCAAAGTTTGGTTCCAAAATCGCAGATGTCGTGAAAAGCAAAGGAAGGAAGCTTCTCGTCTTCAAACAGTTAATAGGAAGCTAACTGCGATGAACAAGCTGTTGATGGAAGAGAACGACCGGTTGCAGACGCAGGTCTCTCGTTTGGTTTACGAGAACGGATACATGAGACAGCAACTGCATACTGCATCTGGAACAACCACAGACAATAGCTGTGAGTCTGTGGTCATGAGTGGTCAACAGCATCAACAGCAAAACCCAACTAAGCATACTCAAAAGGATGCTAATAACCCAGCTGGTCTTCTTGCAATAGCCGAGGAGACCTTGGCAGAGTTCCTTTCCAAGGCAACAGGAACTGCTGTCGACTGGGTACAAATGATTGGGATGAAGCCTGGTCCGGATTCAATTGGGATTGTTGCTGTTTCCCGCAATTGCAATGGGGTAGCAGCACGAGCCTGTGGTTTAGTGAGTCTTGAGCCCATGAAGGTGGCCGAAATTCTCAAAGATCGTCTGTCATGGTTTCGTGACTGCCGTTGTGTTGATGTACTAAGTGCAATCTCTACAGGAAATGGGGGGACCATAGAACTCTTATATATGCAGACATATGCACCTACAACTTTGGCCGCAGCGCGTGACTTTTGGACTCTTCGATACACTACAAGCTTGGAAGATGGCAGCCTTGTGATATGTGAGAGATCGTTAACGTCATCTACTGGTGGCCCATCAGGGCCCCCTCCATCCAGTTTCATAAGAGCTGAAATGCTTCCAAGCGGGTATCTTATTCGAGCTTGTGAGGGTGGTTCCCTTATTCACATTGTCGATCATGTTGATTTAGATTTTTGGAGTGTTCCTGAAGTTCTAAGGCCACTTTATGAATCAACAAAGATTCTAGCTCAGAGAATGACTGTTGCTGCCTTACGCTACGTTCGGCAAATTGCTCAAGAGGCGAGTGGAGAAATTCAATTGGGTGGTGGCCGCCAACCGGCCGTTTTAAGGACTTTCAGTCAAAGACTTTGTAGGGTTTTCAATGATGCAGTAAATGGATTTGTGGATGATGGTTGGTCGCTTATGGATAGCGATGGAGTAGAGGATGTGACAGTTGTTATTAATTCATCTCCAAACAAATTCCTCGGTTCCCAATATAACACATCATTGTATCCTACTTTTGGAGGTGTATTGTGTGCCAAGGCGTCAATGTTGCTCCAGAATGTCCCCCCTGCTTTGCTTGTTCGTTTCCTACGGGAGCATCGTTCAGAATGGGCTGATTATGGGGTTGATGCATACTCTGCTGCATGTCTTAAAGCCAGTGCATATGCTGTTCCGTGTGCGAGACCCGGTGGCTTCCCTAGTGGTCAAGTCATTTTACCACTTGCACATACAGTTGAGAATGAGGAGTTCCTAGAGGTCGTTCGGTTAGAGGGCCACGCTATGTTCCCTGAAGAAGCTGCTTTAGGAGGACGGGATATGTATTTATTGCAGCTGTGTAGTGGTGTTGATGAAAATACAGTTGGTGCCTGTGCTCAGCTTGTATTTGCACCTATAGACGAATCATTTGCTGATGATGCTCCCTTGCTGCCATCTGGTTTTCGTGTCATACCATTGGAGTCCAAAACAGATGCGCCTGCAGCTACTCGAACTTTGGATTTGGCATCTACTCTAGAAGTCCGACCTGGAACTACCCGCCCAGGAGGTGAAACTGATGTTACAAACTACAACCTCAGGTCAGTCTTGACTATTGCATTCCAATTCACTTTCGAGAACCACATGCGGGATAGTGTGGCTGCTATGGCGAGACAATATGTGCGGACTGTTGT
GGGTTCAGTTCAAAGGGTTGCCATGGCTATCGCCCCCTCACAGCTTGGCTCCCAAATCGGTCCAAAAAGCCTTCCCGGCTCCCCAGAGGCTCTCACATTAGCTCAGTGGATCGCACGAAGCTATAGGATCCACTCAGGAACAGAGCTCTTTCAGGTTGAATCCCAATCTGATGCTATCTTGAAGCAGCTTTGGCACCACTCAGATACAATCTTGTGCTGCTCTGTCAAAACTAATGCATCTCCTGTTTTCACATTTGCGAACCAAGCTGGACTTGACATGCTCGAAACCACCCTTGTTTCTCTTCAAGATATAACACTCGACAAGATCCTCGATGATGCTGGTCGGAAGATACTGTGCTCCGAATTCTCCAAGATAATGCAACAGGGATTTGCATACCTTCCAGCAGGGATATGCGTCTCGAGTATGGGAAGGCCGGTGTCGTATGAACAGGCAATTGCATGGAAAGTTCTGAATGATGATGATGTCCATCACTGTCTGGCTTTCATGTTTGTTAACTGGTCCTTTATGTGA

Coding sequence (CDS)

ATGGCTCTTGTCATGCATAGAGATTCCTTGAATAAGCAGATGGATACTAGCAAGTATGTTAGATATACGCCTGAACAAGTCGATGCATTAGAAAGGGTTTATGCAGAATGCCCAAAACCCAGTTCGTTGAGGAGGCAACAGCTCATTCGGGAGTGTCCTATTCTCTCTAATATTGAGCCTAAGCAGATCAAAGTTTGGTTCCAAAATCGCAGATGTCGTGAAAAGCAAAGGAAGGAAGCTTCTCGTCTTCAAACAGTTAATAGGAAGCTAACTGCGATGAACAAGCTGTTGATGGAAGAGAACGACCGGTTGCAGACGCAGGTCTCTCGTTTGGTTTACGAGAACGGATACATGAGACAGCAACTGCATACTGCATCTGGAACAACCACAGACAATAGCTGTGAGTCTGTGGTCATGAGTGGTCAACAGCATCAACAGCAAAACCCAACTAAGCATACTCAAAAGGATGCTAATAACCCAGCTGGTCTTCTTGCAATAGCCGAGGAGACCTTGGCAGAGTTCCTTTCCAAGGCAACAGGAACTGCTGTCGACTGGGTACAAATGATTGGGATGAAGCCTGGTCCGGATTCAATTGGGATTGTTGCTGTTTCCCGCAATTGCAATGGGGTAGCAGCACGAGCCTGTGGTTTAGTGAGTCTTGAGCCCATGAAGGTGGCCGAAATTCTCAAAGATCGTCTGTCATGGTTTCGTGACTGCCGTTGTGTTGATGTACTAAGTGCAATCTCTACAGGAAATGGGGGGACCATAGAACTCTTATATATGCAGACATATGCACCTACAACTTTGGCCGCAGCGCGTGACTTTTGGACTCTTCGATACACTACAAGCTTGGAAGATGGCAGCCTTGTGATATGTGAGAGATCGTTAACGTCATCTACTGGTGGCCCATCAGGGCCCCCTCCATCCAGTTTCATAAGAGCTGAAATGCTTCCAAGCGGGTATCTTATTCGAGCTTGTGAGGGTGGTTCCCTTATTCACATTGTCGATCATGTTGATTTAGATTTTTGGAGTGTTCCTGAAGTTCTAAGGCCACTTTATGAATCAACAAAGATTCTAGCTCAGAGAATGACTGTTGCTGCCTTACGCTACGTTCGGCAAATTGCTCAAGAGGCGAGTGGAGAAATTCAATTGGGTGGTGGCCGCCAACCGGCCGTTTTAAGGACTTTCAGTCAAAGACTTTGTAGGGTTTTCAATGATGCAGTAAATGGATTTGTGGATGATGGTTGGTCGCTTATGGATAGCGATGGAGTAGAGGATGTGACAGTTGTTATTAATTCATCTCCAAACAAATTCCTCGGTTCCCAATATAACACATCATTGTATCCTACTTTTGGAGGTGTATTGTGTGCCAAGGCGTCAATGTTGCTCCAGAATGTCCCCCCTGCTTTGCTTGTTCGTTTCCTACGGGAGCATCGTTCAGAATGGGCTGATTATGGGGTTGATGCATACTCTGCTGCATGTCTTAAAGCCAGTGCATATGCTGTTCCGTGTGCGAGACCCGGTGGCTTCCCTAGTGGTCAAGTCATTTTACCACTTGCACATACAGTTGAGAATGAGGAGTTCCTAGAGGTCGTTCGGTTAGAGGGCCACGCTATGTTCCCTGAAGAAGCTGCTTTAGGAGGACGGGATATGTATTTATTGCAGCTGTGTAGTGGTGTTGATGAAAATACAGTTGGTGCCTGTGCTCAGCTTGTATTTGCACCTATAGACGAATCATTTGCTGATGATGCTCCCTTGCTGCCATCTGGTTTTCGTGTCATACCATTGGAGTCCAAAACAGATGCGCCTGCAGCTACTCGAACTTTGGATTTGGCATCTACTCTAGAAGTCCGACCTGGAACTACCCGCCCAGGAGGTGAAACTGATGTTACAAACTACAACCTCAGGTCAGTCTTGACTATTGCATTCCAATTCACTTTCGAGAACCACATGCGGGATAGTGTGGCTGCTATGGCGAGACAATATGTGCGGACTGTTGT
GGGTTCAGTTCAAAGGGTTGCCATGGCTATCGCCCCCTCACAGCTTGGCTCCCAAATCGGTCCAAAAAGCCTTCCCGGCTCCCCAGAGGCTCTCACATTAGCTCAGTGGATCGCACGAAGCTATAGGATCCACTCAGGAACAGAGCTCTTTCAGGTTGAATCCCAATCTGATGCTATCTTGAAGCAGCTTTGGCACCACTCAGATACAATCTTGTGCTGCTCTGTCAAAACTAATGCATCTCCTGTTTTCACATTTGCGAACCAAGCTGGACTTGACATGCTCGAAACCACCCTTGTTTCTCTTCAAGATATAACACTCGACAAGATCCTCGATGATGCTGGTCGGAAGATACTGTGCTCCGAATTCTCCAAGATAATGCAACAGGGATTTGCATACCTTCCAGCAGGGATATGCGTCTCGAGTATGGGAAGGCCGGTGTCGTATGAACAGGCAATTGCATGGAAAGTTCTGAATGATGATGATGTCCATCACTGTCTGGCTTTCATGTTTGTTAACTGGTCCTTTATGTGA

Protein sequence

MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFIRAEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQSDAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM
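
The protein sequence above is expected to be the conceptual translation of the CDS. The following is a minimal sketch of that translation using the standard genetic code (NCBI translation table 1). It assumes the CDS is in frame from its first base, and the helper name `translate` is hypothetical, not part of any tool used to build this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS listed above.
print(translate("ATGGCTCTTGTCATGCATAGAGATTCCTTGAATAAG"))  # MALVMHRDSLNK
```

Running `translate` on the full CDS and comparing the result with the listed protein is a quick consistency check. Because the CDS is wrapped over more than one line on this page, the function strips whitespace before translating.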
Homology
BLAST of HG10021418 vs. NCBI nr
Match: XP_008445207.1 (PREDICTED: homeobox-leucine zipper protein ATHB-14-like [Cucumis melo])

HSP 1 Score: 1647.9 bits (4266), Expect = 0.0e+00
Identity = 829/844 (98.22%), Positives = 834/844 (98.82%), Query Frame = 0

Query: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120
           KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ
Sbjct: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120

Query: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180
           QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG
Sbjct: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180

Query: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240
           TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR
Sbjct: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240

Query: 241 CVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300
           CVDVLS ISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST
Sbjct: 241 CVDVLSVISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300

Query: 301 GGPSGPPPSSFIRAEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILA 360
           GGPSGPPPSSF+RAEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILA
Sbjct: 301 GGPSGPPPSSFVRAEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILA 360

Query: 361 QRMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMD 420
           QRMTVAALRYVRQIAQEASGE+QLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMD
Sbjct: 361 QRMTVAALRYVRQIAQEASGEVQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMD 420

Query: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480
           SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS
Sbjct: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480

Query: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 540
           EWADYGVDAYSAACLKASAYAVPCARPGGFP GQVILPLAHTVENEEFLEVVRLEGHAMF
Sbjct: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPGGQVILPLAHTVENEEFLEVVRLEGHAMF 540

Query: 541 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600
           PEEAALGGRDMYLLQLCSGV+ENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT
Sbjct: 541 PEEAALGGRDMYLLQLCSGVEENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600

Query: 601 DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660
           + P ATRTLDLASTLEVRPGT RPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR
Sbjct: 601 EMPGATRTLDLASTLEVRPGTNRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660

Query: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVE 720
           QYV+TVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWI RSYRIHSG ELFQVE
Sbjct: 661 QYVQTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWITRSYRIHSGAELFQVE 720

Query: 721 SQS-DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD 780
           SQS DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD
Sbjct: 721 SQSGDAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD 780

Query: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840
           AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN
Sbjct: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840

Query: 841 WSFM 844
           WSFM
Sbjct: 841 WSFM 844
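
The percentages in each HSP summary line are the identical and positive-scoring position counts divided by the alignment length. The following is a minimal sketch that parses such a line and recomputes both values. The regular expression and the function name are assumptions made for illustration, not part of BLAST itself.

```python
import re

# Matches the count/length pairs; "Posi?tives" also accepts the
# "Postives" spelling that appears in some report exports.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) for an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


line = "Identity = 829/844 (98.22%), Positives = 834/844 (98.82%), Query Frame = 0"
print(hsp_percentages(line))  # (98.22, 98.82)
```

Recomputing the percentages from the raw counts gives a simple check that a report line was not corrupted when it was copied or extracted.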

BLAST of HG10021418 vs. NCBI nr
Match: XP_038895414.1 (homeobox-leucine zipper protein ATHB-14-like [Benincasa hispida])

HSP 1 Score: 1645.2 bits (4259), Expect = 0.0e+00
Identity = 829/844 (98.22%), Positives = 834/844 (98.82%), Query Frame = 0

Query: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120
           KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ
Sbjct: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120

Query: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180
           QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG
Sbjct: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180

Query: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240
           TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR
Sbjct: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240

Query: 241 CVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300
           CVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST
Sbjct: 241 CVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300

Query: 301 GGPSGPPPSSFIRAEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILA 360
           GGPSGPPPSSF+RAEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILA
Sbjct: 301 GGPSGPPPSSFVRAEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILA 360

Query: 361 QRMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMD 420
           QRMTVAALRYVRQI+QEASGEIQLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMD
Sbjct: 361 QRMTVAALRYVRQISQEASGEIQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMD 420

Query: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480
           SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS
Sbjct: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480

Query: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 540
           EWADYGVDAYSAACLKAS YAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF
Sbjct: 481 EWADYGVDAYSAACLKASGYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 540

Query: 541 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600
           PEEAALGGRDMYLLQLCSGVDE+TVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT
Sbjct: 541 PEEAALGGRDMYLLQLCSGVDESTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600

Query: 601 DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660
           D PAATRTLDLASTLEVR GTTR GGE DVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR
Sbjct: 601 DVPAATRTLDLASTLEVRSGTTRTGGEIDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660

Query: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVE 720
           QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWI RSYRIHSG ELFQVE
Sbjct: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIVRSYRIHSGAELFQVE 720

Query: 721 SQS-DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD 780
           SQS DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLV+LQDITLDKILDD
Sbjct: 721 SQSGDAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVALQDITLDKILDD 780

Query: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840
           AGRKILCSEFSKIMQQGFAYLPAGIC+SSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN
Sbjct: 781 AGRKILCSEFSKIMQQGFAYLPAGICISSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840

Query: 841 WSFM 844
           WSFM
Sbjct: 841 WSFM 844

BLAST of HG10021418 vs. NCBI nr
Match: KGN65925.1 (hypothetical protein Csa_023222 [Cucumis sativus])

HSP 1 Score: 1639.4 bits (4244), Expect = 0.0e+00
Identity = 826/844 (97.87%), Positives = 830/844 (98.34%), Query Frame = 0

Query: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120
           KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ
Sbjct: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120

Query: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180
           QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG
Sbjct: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180

Query: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240
           TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR
Sbjct: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240

Query: 241 CVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300
           CVDVLS ISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST
Sbjct: 241 CVDVLSVISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300

Query: 301 GGPSGPPPSSFIRAEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILA 360
           GGPSGPPPSSF+RAEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILA
Sbjct: 301 GGPSGPPPSSFVRAEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILA 360

Query: 361 QRMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMD 420
           QR TVAALRYVRQIAQEASGE+QLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMD
Sbjct: 361 QRTTVAALRYVRQIAQEASGEVQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMD 420

Query: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480
           SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS
Sbjct: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480

Query: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 540
           EWADYGVDAYSAACLKASAYAVPCARPGGFP GQVILPLAHTVENEEFLEVVRLEGHAMF
Sbjct: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPGGQVILPLAHTVENEEFLEVVRLEGHAMF 540

Query: 541 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600
           PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESK 
Sbjct: 541 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKA 600

Query: 601 DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660
           + P ATRTLDLASTLEVRPGT RPG ETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR
Sbjct: 601 EMPGATRTLDLASTLEVRPGTNRPGCETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660

Query: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVE 720
           QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLP SPEALTLAQWI RSYRIHSG ELFQVE
Sbjct: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPASPEALTLAQWITRSYRIHSGAELFQVE 720

Query: 721 SQS-DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD 780
           SQS DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITL+KILDD
Sbjct: 721 SQSGDAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLEKILDD 780

Query: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840
           AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN
Sbjct: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840

Query: 841 WSFM 844
           WSFM
Sbjct: 841 WSFM 844

BLAST of HG10021418 vs. NCBI nr
Match: XP_031741286.1 (homeobox-leucine zipper protein ATHB-14 isoform X2 [Cucumis sativus])

HSP 1 Score: 1627.8 bits (4214), Expect = 0.0e+00
Identity = 820/838 (97.85%), Positives = 824/838 (98.33%), Query Frame = 0

Query: 7   RDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVW 66
           RDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVW
Sbjct: 57  RDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVW 116

Query: 67  FQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTAS 126
           FQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTAS
Sbjct: 117 FQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTAS 176

Query: 127 GTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWV 186
           GTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWV
Sbjct: 177 GTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWV 236

Query: 187 QMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLS 246
           QMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLS
Sbjct: 237 QMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLS 296

Query: 247 AISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGP 306
            ISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGP
Sbjct: 297 VISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGP 356

Query: 307 PPSSFIRAEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVA 366
           PPSSF+RAEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILAQR TVA
Sbjct: 357 PPSSFVRAEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILAQRTTVA 416

Query: 367 ALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVED 426
           ALRYVRQIAQEASGE+QLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMDSDGVED
Sbjct: 417 ALRYVRQIAQEASGEVQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMDSDGVED 476

Query: 427 VTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYG 486
           VTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYG
Sbjct: 477 VTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYG 536

Query: 487 VDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAAL 546
           VDAYSAACLKASAYAVPCARPGGFP GQVILPLAHTVENEEFLEVVRLEGHAMFPEEAAL
Sbjct: 537 VDAYSAACLKASAYAVPCARPGGFPGGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAAL 596

Query: 547 GGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAAT 606
           GGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESK + P AT
Sbjct: 597 GGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKAEMPGAT 656

Query: 607 RTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTV 666
           RTLDLASTLEVRPGT RPG ETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTV
Sbjct: 657 RTLDLASTLEVRPGTNRPGCETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTV 716

Query: 667 VGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQS-DA 726
           VGSVQRVAMAIAPSQLGSQIGPKSLP SPEALTLAQWI RSYRIHSG ELFQVESQS DA
Sbjct: 717 VGSVQRVAMAIAPSQLGSQIGPKSLPASPEALTLAQWITRSYRIHSGAELFQVESQSGDA 776

Query: 727 ILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKIL 786
           ILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITL+KILDDAGRKIL
Sbjct: 777 ILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLEKILDDAGRKIL 836

Query: 787 CSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           CSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM
Sbjct: 837 CSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 894

BLAST of HG10021418 vs. NCBI nr
Match: TYK05546.1 (homeobox-leucine zipper protein ATHB-14-like [Cucumis melo var. makuwa])

HSP 1 Score: 1624.8 bits (4206), Expect = 0.0e+00
Identity = 817/831 (98.32%), Positives = 821/831 (98.80%), Query Frame = 0

Query: 14  MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR 73
           MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR
Sbjct: 1   MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR 60

Query: 74  EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS 133
           EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS
Sbjct: 61  EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS 120

Query: 134 CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP 193
           CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP
Sbjct: 121 CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP 180

Query: 194 GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTGNG 253
           GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLS ISTGNG
Sbjct: 181 GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSVISTGNG 240

Query: 254 GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFIR 313
           GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSF+R
Sbjct: 241 GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFVR 300

Query: 314 AEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRYVRQ 373
           AEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILAQRMTVAALRYVRQ
Sbjct: 301 AEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILAQRMTVAALRYVRQ 360

Query: 374 IAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTVVINS 433
           IAQEASGE+QLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMDSDGVEDVTVVINS
Sbjct: 361 IAQEASGEVQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMDSDGVEDVTVVINS 420

Query: 434 SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA 493
           SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA
Sbjct: 421 SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA 480

Query: 494 CLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL 553
           CLKASAYAVPCARPGGFP GQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL
Sbjct: 481 CLKASAYAVPCARPGGFPGGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL 540

Query: 554 LQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAATRTLDLAS 613
           LQLCSGV+ENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT+ P ATRTLDLAS
Sbjct: 541 LQLCSGVEENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTEMPGATRTLDLAS 600

Query: 614 TLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQRV 673
           TLEVRPGT RPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQRV
Sbjct: 601 TLEVRPGTNRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQRV 660

Query: 674 AMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQS-DAILKQLWH 733
           AMAIAPSQLGSQIGPKSLPGSPEALTLAQWI RSYRIHSG ELFQVESQS DAILKQLWH
Sbjct: 661 AMAIAPSQLGSQIGPKSLPGSPEALTLAQWITRSYRIHSGAELFQVESQSGDAILKQLWH 720

Query: 734 HSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSKI 793
           HSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSKI
Sbjct: 721 HSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSKI 780

Query: 794 MQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           MQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM
Sbjct: 781 MQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 831

BLAST of HG10021418 vs. ExPASy Swiss-Prot
Match: O04291 (Homeobox-leucine zipper protein ATHB-14 OS=Arabidopsis thaliana OX=3702 GN=ATHB-14 PE=1 SV=1)

HSP 1 Score: 1326.2 bits (3431), Expect = 0.0e+00
Identity = 672/847 (79.34%), Positives = 744/847 (87.84%), Query Frame = 0

Query: 4   VMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQI 63
           +M+R+S +K +D+ KYVRYTPEQV+ALERVY ECPKPSSLRRQQLIRECPILSNIEPKQI
Sbjct: 11  MMNRESPDKGLDSGKYVRYTPEQVEALERVYTECPKPSSLRRQQLIRECPILSNIEPKQI 70

Query: 64  KVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLH 123
           KVWFQNRRCREKQRKEA+RLQTVNRKL AMNKLLMEENDRLQ QVS LVYENG+M+ QLH
Sbjct: 71  KVWFQNRRCREKQRKEAARLQTVNRKLNAMNKLLMEENDRLQKQVSNLVYENGHMKHQLH 130

Query: 124 TASGTTTDNSCESVVMSGQQHQQQNPT-KHTQKDANNPAGLLAIAEETLAEFLSKATGTA 183
           TASGTTTDNSCESVV+SGQQHQQQNP  +H Q+DANNPAGLL+IAEE LAEFLSKATGTA
Sbjct: 131 TASGTTTDNSCESVVVSGQQHQQQNPNPQHQQRDANNPAGLLSIAEEALAEFLSKATGTA 190

Query: 184 VDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCV 243
           VDWVQMIGMKPGPDSIGIVA+SRNC+G+AARACGLVSLEPMKVAEILKDR SW RDCR V
Sbjct: 191 VDWVQMIGMKPGPDSIGIVAISRNCSGIAARACGLVSLEPMKVAEILKDRPSWLRDCRSV 250

Query: 244 DVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGG 303
           D LS I  GNGGTIEL+Y Q YAPTTLAAARDFWTLRY+T LEDGS V+CERSLTS+TGG
Sbjct: 251 DTLSVIPAGNGGTIELIYTQMYAPTTLAAARDFWTLRYSTCLEDGSYVVCERSLTSATGG 310

Query: 304 PSGPPPSSFIRAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQ 363
           P+GPP S+F+RAEM PSG+LIR C+ GGS++HIVDHVDLD WSVPEV+RPLYES+KILAQ
Sbjct: 311 PTGPPSSNFVRAEMKPSGFLIRPCDGGGSILHIVDHVDLDAWSVPEVMRPLYESSKILAQ 370

Query: 364 RMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDS 423
           +MTVAALR+VRQIAQE SGE+Q GGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWS M S
Sbjct: 371 KMTVAALRHVRQIAQETSGEVQYGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSPMGS 430

Query: 424 DGVEDVTVVINSSPNKFLGSQYNTSLYPTFG-GVLCAKASMLLQNVPPALLVRFLREHRS 483
           DG EDVTV+IN SP KF GSQY  S  P+FG GVLCAKASMLLQNVPPA+LVRFLREHRS
Sbjct: 431 DGAEDVTVMINLSPGKFGGSQYGNSFLPSFGSGVLCAKASMLLQNVPPAVLVRFLREHRS 490

Query: 484 EWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 543
           EWADYGVDAY+AA L+AS +AVPCAR GGFPS QVILPLA TVE+EE LEVVRLEGHA  
Sbjct: 491 EWADYGVDAYAAASLRASPFAVPCARAGGFPSNQVILPLAQTVEHEESLEVVRLEGHAYS 550

Query: 544 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 603
           PE+  L  RDMYLLQLCSGVDEN VG CAQLVFAPIDESFADDAPLLPSGFR+IPLE K+
Sbjct: 551 PEDMGL-ARDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRIIPLEQKS 610

Query: 604 --DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAM 663
             +  +A RTLDLAS LE   G+TR  GE D    N RSVLTIAFQFTF+NH RDSVA+M
Sbjct: 611 TPNGASANRTLDLASALE---GSTRQAGEADPNGCNFRSVLTIAFQFTFDNHSRDSVASM 670

Query: 664 ARQYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQ 723
           ARQYVR++VGS+QRVA+AIAP + GS I P S+P SPEALTL +WI+RSY +H+G +LF 
Sbjct: 671 ARQYVRSIVGSIQRVALAIAP-RPGSNISPISVPTSPEALTLVRWISRSYSLHTGADLFG 730

Query: 724 VESQS--DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKI 783
            +SQ+  D +L QLW+HSD ILCCS+KTNASPVFTFANQ GLDMLETTLV+LQDI LDK 
Sbjct: 731 SDSQTSGDTLLHQLWNHSDAILCCSLKTNASPVFTFANQTGLDMLETTLVALQDIMLDKT 790

Query: 784 LDDAGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFM 843
           LD+ GRK LCSEF KIMQQG+A+LPAG+C SSMGR VSYEQA  WKVL DD+ +HCLAFM
Sbjct: 791 LDEPGRKALCSEFPKIMQQGYAHLPAGVCASSMGRMVSYEQATVWKVLEDDESNHCLAFM 850

BLAST of HG10021418 vs. ExPASy Swiss-Prot
Match: A2XK30 (Homeobox-leucine zipper protein HOX32 OS=Oryza sativa subsp. indica OX=39946 GN=HOX32 PE=2 SV=1)

HSP 1 Score: 1318.9 bits (3412), Expect = 0.0e+00
Identity = 675/837 (80.65%), Positives = 735/837 (87.81%), Query Frame = 0

Query: 13  QMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 72
           Q+DT KYVRYTPEQV+ALERVY ECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC
Sbjct: 27  QVDTGKYVRYTPEQVEALERVYGECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 86

Query: 73  REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDN 132
           REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQ QVSRLVYENGYMRQQLH  S  TTD 
Sbjct: 87  REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQKQVSRLVYENGYMRQQLHNPSVATTDT 146

Query: 133 SCESVVMSGQQHQQQNP-TKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGM 192
           SCESVV SGQ HQQQNP     Q+DANNPAGLLAIAEETLAEFLSKATGTAVDWVQM+GM
Sbjct: 147 SCESVVTSGQHHQQQNPAATRPQRDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMVGM 206

Query: 193 KPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTG 252
           KPGPDSIGI+AVS NC+GVAARACGLVSLEP KVAEILKDR SW+RDCRCVDVL  I TG
Sbjct: 207 KPGPDSIGIIAVSHNCSGVAARACGLVSLEPTKVAEILKDRPSWYRDCRCVDVLHVIPTG 266

Query: 253 NGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSF 312
           NGGTIEL+YMQTYAPTTLAA RDFW LRYT+ LEDGSLVICERSLT STGGPSGP   +F
Sbjct: 267 NGGTIELIYMQTYAPTTLAAPRDFWILRYTSGLEDGSLVICERSLTQSTGGPSGPNTPNF 326

Query: 313 IRAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRY 372
           +RAE+LPSGYLIR CE GGS+IHIVDHVDLD WSVPEVLRPLYES KILAQ+MT+AALR+
Sbjct: 327 VRAEVLPSGYLIRPCEGGGSMIHIVDHVDLDAWSVPEVLRPLYESPKILAQKMTIAALRH 386

Query: 373 VRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTVV 432
           +RQIA E+SGE+  GGGRQPAVLRTFSQRL R FNDAVNGF DDGWSLM SDG EDVT+ 
Sbjct: 387 IRQIAHESSGEMPYGGGRQPAVLRTFSQRLSRGFNDAVNGFPDDGWSLMSSDGAEDVTIA 446

Query: 433 INSSPNKFLGSQYNTS-LYPTF-GGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVD 492
            NSSPNK +GS  N+S L+    GG+LCAKASMLLQNVPPALLVRFLREHRSEWAD GVD
Sbjct: 447 FNSSPNKLVGSHVNSSQLFSAIGGGILCAKASMLLQNVPPALLVRFLREHRSEWADPGVD 506

Query: 493 AYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGG 552
           AYSAA L+AS YAVP  R GGF   QVILPLAHT+E+EEFLEV+RLEGH++  +E  L  
Sbjct: 507 AYSAAALRASPYAVPGLRAGGFMGSQVILPLAHTLEHEEFLEVIRLEGHSLCHDEVVL-S 566

Query: 553 RDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAATRT 612
           RDMYLLQLCSGVDEN  GACAQLVFAPIDESFADDAPLLPSGFRVIPL+ KTDAP+ATRT
Sbjct: 567 RDMYLLQLCSGVDENAAGACAQLVFAPIDESFADDAPLLPSGFRVIPLDGKTDAPSATRT 626

Query: 613 LDLASTLEV-RPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVV 672
           LDLASTLEV   GTTR   +T  T  N RSVLTIAFQF++ENH+R+SVAAMARQYVRTVV
Sbjct: 627 LDLASTLEVGSGGTTRASSDTSST-CNTRSVLTIAFQFSYENHLRESVAAMARQYVRTVV 686

Query: 673 GSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQS-DAI 732
            SVQRVAMAIAPS+LG QI  K+ PGSPEA TLA+WI RSYR H+G +L + +SQS D+ 
Sbjct: 687 ASVQRVAMAIAPSRLGGQIETKNPPGSPEAHTLARWIGRSYRFHTGADLLRTDSQSMDSS 746

Query: 733 LKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILC 792
           LK +W HSD+I+CCS+K  A+PVFTFANQAGLDMLETTL++LQDI+L+KILDD GRK LC
Sbjct: 747 LKAMWQHSDSIMCCSLK--AAPVFTFANQAGLDMLETTLIALQDISLEKILDDDGRKALC 806

Query: 793 SEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           +EF KIMQQGFAYLP G+CVSSMGRPVSYEQA+AWKVL+DDD  HCLAFMFVNWSF+
Sbjct: 807 TEFPKIMQQGFAYLPGGVCVSSMGRPVSYEQAVAWKVLSDDDTPHCLAFMFVNWSFV 859

BLAST of HG10021418 vs. ExPASy Swiss-Prot
Match: Q6AST1 (Homeobox-leucine zipper protein HOX32 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX32 PE=2 SV=1)

HSP 1 Score: 1318.9 bits (3412), Expect = 0.0e+00
Identity = 675/837 (80.65%), Positives = 735/837 (87.81%), Query Frame = 0

Query: 13  QMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 72
           Q+DT KYVRYTPEQV+ALERVY ECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC
Sbjct: 27  QVDTGKYVRYTPEQVEALERVYGECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 86

Query: 73  REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDN 132
           REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQ QVSRLVYENGYMRQQLH  S  TTD 
Sbjct: 87  REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQKQVSRLVYENGYMRQQLHNPSVATTDT 146

Query: 133 SCESVVMSGQQHQQQNP-TKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGM 192
           SCESVV SGQ HQQQNP     Q+DANNPAGLLAIAEETLAEFLSKATGTAVDWVQM+GM
Sbjct: 147 SCESVVTSGQHHQQQNPAATRPQRDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMVGM 206

Query: 193 KPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTG 252
           KPGPDSIGI+AVS NC+GVAARACGLVSLEP KVAEILKDR SW+RDCRCVDVL  I TG
Sbjct: 207 KPGPDSIGIIAVSHNCSGVAARACGLVSLEPTKVAEILKDRPSWYRDCRCVDVLHVIPTG 266

Query: 253 NGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSF 312
           NGGTIEL+YMQTYAPTTLAA RDFW LRYT+ LEDGSLVICERSLT STGGPSGP   +F
Sbjct: 267 NGGTIELIYMQTYAPTTLAAPRDFWILRYTSGLEDGSLVICERSLTQSTGGPSGPNTPNF 326

Query: 313 IRAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRY 372
           +RAE+LPSGYLIR CE GGS+IHIVDHVDLD WSVPEVLRPLYES KILAQ+MT+AALR+
Sbjct: 327 VRAEVLPSGYLIRPCEGGGSMIHIVDHVDLDAWSVPEVLRPLYESPKILAQKMTIAALRH 386

Query: 373 VRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTVV 432
           +RQIA E+SGE+  GGGRQPAVLRTFSQRL R FNDAVNGF DDGWSLM SDG EDVT+ 
Sbjct: 387 IRQIAHESSGEMPYGGGRQPAVLRTFSQRLSRGFNDAVNGFPDDGWSLMSSDGAEDVTIA 446

Query: 433 INSSPNKFLGSQYNTS-LYPTF-GGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVD 492
            NSSPNK +GS  N+S L+    GG+LCAKASMLLQNVPPALLVRFLREHRSEWAD GVD
Sbjct: 447 FNSSPNKLVGSHVNSSQLFSAIGGGILCAKASMLLQNVPPALLVRFLREHRSEWADPGVD 506

Query: 493 AYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGG 552
           AYSAA L+AS YAVP  R GGF   QVILPLAHT+E+EEFLEV+RLEGH++  +E  L  
Sbjct: 507 AYSAAALRASPYAVPGLRAGGFMGSQVILPLAHTLEHEEFLEVIRLEGHSLCHDEVVL-S 566

Query: 553 RDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAATRT 612
           RDMYLLQLCSGVDEN  GACAQLVFAPIDESFADDAPLLPSGFRVIPL+ KTDAP+ATRT
Sbjct: 567 RDMYLLQLCSGVDENAAGACAQLVFAPIDESFADDAPLLPSGFRVIPLDGKTDAPSATRT 626

Query: 613 LDLASTLEV-RPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVV 672
           LDLASTLEV   GTTR   +T  T  N RSVLTIAFQF++ENH+R+SVAAMARQYVRTVV
Sbjct: 627 LDLASTLEVGSGGTTRASSDTSST-CNTRSVLTIAFQFSYENHLRESVAAMARQYVRTVV 686

Query: 673 GSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQS-DAI 732
            SVQRVAMAIAPS+LG QI  K+ PGSPEA TLA+WI RSYR H+G +L + +SQS D+ 
Sbjct: 687 ASVQRVAMAIAPSRLGGQIETKNPPGSPEAHTLARWIGRSYRFHTGADLLRTDSQSTDSS 746

Query: 733 LKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILC 792
           LK +W HSD+I+CCS+K  A+PVFTFANQAGLDMLETTL++LQDI+L+KILDD GRK LC
Sbjct: 747 LKAMWQHSDSIMCCSLK--AAPVFTFANQAGLDMLETTLIALQDISLEKILDDDGRKALC 806

Query: 793 SEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           +EF KIMQQGFAYLP G+CVSSMGRPVSYEQA+AWKVL+DDD  HCLAFMFVNWSF+
Sbjct: 807 TEFPKIMQQGFAYLPGGVCVSSMGRPVSYEQAVAWKVLSDDDTPHCLAFMFVNWSFV 859

BLAST of HG10021418 vs. ExPASy Swiss-Prot
Match: O04292 (Homeobox-leucine zipper protein ATHB-9 OS=Arabidopsis thaliana OX=3702 GN=ATHB-9 PE=1 SV=1)

HSP 1 Score: 1280.8 bits (3313), Expect = 0.0e+00
Identity = 656/846 (77.54%), Positives = 732/846 (86.52%), Query Frame = 0

Query: 7   RDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVW 66
           RDS +K  D+ KYVRYTPEQV+ALERVYAECPKPSSLRRQQLIRECPIL NIEP+QIKVW
Sbjct: 10  RDSPDKGFDSGKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILCNIEPRQIKVW 69

Query: 67  FQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTAS 126
           FQNRRCREKQRKE++RLQTVNRKL+AMNKLLMEENDRLQ QVS LVYENG+M+ ++HTAS
Sbjct: 70  FQNRRCREKQRKESARLQTVNRKLSAMNKLLMEENDRLQKQVSNLVYENGFMKHRIHTAS 129

Query: 127 GTTTDNSCESVVMSGQQHQQQNPT-KHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDW 186
           GTTTDNSCESVV+SGQQ QQQNPT +H Q+D NNPA LL+IAEETLAEFL KATGTAVDW
Sbjct: 130 GTTTDNSCESVVVSGQQRQQQNPTHQHPQRDVNNPANLLSIAEETLAEFLCKATGTAVDW 189

Query: 187 VQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVL 246
           VQMIGMKPGPDSIGIVAVSRNC+G+AARACGLVSLEPMKVAEILKDR SWFRDCRCV+ L
Sbjct: 190 VQMIGMKPGPDSIGIVAVSRNCSGIAARACGLVSLEPMKVAEILKDRPSWFRDCRCVETL 249

Query: 247 SAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSG 306
           + I TGNGGTIEL+  Q YAPTTLAAARDFWTLRY+TSLEDGS V+CERSLTS+TGGP+G
Sbjct: 250 NVIPTGNGGTIELVNTQIYAPTTLAAARDFWTLRYSTSLEDGSYVVCERSLTSATGGPNG 309

Query: 307 PPPSSFIRAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMT 366
           P  SSF+RA+ML SG+LIR C+ GGS+IHIVDHVDLD  SVPEVLRPLYES+KILAQ+MT
Sbjct: 310 PLSSSFVRAKMLSSGFLIRPCDGGGSIIHIVDHVDLDVSSVPEVLRPLYESSKILAQKMT 369

Query: 367 VAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGV 426
           VAALR+VRQIAQE SGE+Q  GGRQPAVLRTFSQRLCR FNDAVNGFVDDGWS M SDG 
Sbjct: 370 VAALRHVRQIAQETSGEVQYSGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSPMSSDGG 429

Query: 427 EDVTVVINSSPNKFLGSQYNTSLYPTFG-GVLCAKASMLLQNVPPALLVRFLREHRSEWA 486
           ED+T++INSS  KF GSQY +S  P+FG GVLCAKASMLLQNVPP +L+RFLREHR+EWA
Sbjct: 430 EDITIMINSSSAKFAGSQYGSSFLPSFGSGVLCAKASMLLQNVPPLVLIRFLREHRAEWA 489

Query: 487 DYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEE 546
           DYGVDAYSAA L+A+ YAVPC R GGFPS QVILPLA T+E+EEFLEVVRL GHA  PE+
Sbjct: 490 DYGVDAYSAASLRATPYAVPCVRTGGFPSNQVILPLAQTLEHEEFLEVVRLGGHAYSPED 549

Query: 547 AALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT--- 606
             L  RDMYLLQLCSGVDEN VG CAQLVFAPIDESFADDAPLLPSGFRVIPL+ KT   
Sbjct: 550 MGL-SRDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRVIPLDQKTNPN 609

Query: 607 DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 666
           D  +A+RT DLAS+L+   G+T+   ET     N R VLTIAFQFTF+NH RD+VA MAR
Sbjct: 610 DHQSASRTRDLASSLD---GSTKTDSET-----NSRLVLTIAFQFTFDNHSRDNVATMAR 669

Query: 667 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVE 726
           QYVR VVGS+QRVA+AI P     + G   LP SPEALTL +WI RSY IH+G +LF  +
Sbjct: 670 QYVRNVVGSIQRVALAITP-----RPGSMQLPTSPEALTLVRWITRSYSIHTGADLFGAD 729

Query: 727 SQS---DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKIL 786
           SQS   D +LKQLW HSD ILCCS+KTNASPVFTFANQAGLDMLETTLV+LQDI LDK L
Sbjct: 730 SQSCGGDTLLKQLWDHSDAILCCSLKTNASPVFTFANQAGLDMLETTLVALQDIMLDKTL 789

Query: 787 DDAGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMF 844
           DD+GR+ LCSEF+KIMQQG+A LPAGICVSSMGRPVSYEQA  WKV++D++ +HCLAF  
Sbjct: 790 DDSGRRALCSEFAKIMQQGYANLPAGICVSSMGRPVSYEQATVWKVVDDNESNHCLAFTL 841

BLAST of HG10021418 vs. ExPASy Swiss-Prot
Match: A2ZMN9 (Homeobox-leucine zipper protein HOX33 OS=Oryza sativa subsp. indica OX=39946 GN=HOX33 PE=2 SV=2)

HSP 1 Score: 1250.3 bits (3234), Expect = 0.0e+00
Identity = 643/837 (76.82%), Positives = 710/837 (84.83%), Query Frame = 0

Query: 13  QMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 72
           Q+D  KYVRYTPEQV+ALERVY ECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC
Sbjct: 24  QVDAGKYVRYTPEQVEALERVYTECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 83

Query: 73  REKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDN 132
           REKQRKEASRLQTVNRKL AMNKLLMEENDRLQ QVSRLVYENGYMR QLH  S  TTD 
Sbjct: 84  REKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSRLVYENGYMRTQLHNPSAATTDT 143

Query: 133 SCESVVMSGQQHQQQNP-TKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGM 192
           SCESVV SGQ HQQQNP   H Q+DANNPAGLLAIAEETLAEF+SKATGTAV+WVQM+GM
Sbjct: 144 SCESVVTSGQHHQQQNPAVLHPQRDANNPAGLLAIAEETLAEFMSKATGTAVEWVQMVGM 203

Query: 193 KPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTG 252
           KPGPDSIGI+AVS NC+GVAARACGLVSLEP KVAEILKDR SW+RDCRCVD++  I TG
Sbjct: 204 KPGPDSIGIIAVSHNCSGVAARACGLVSLEPTKVAEILKDRPSWYRDCRCVDIIHVIPTG 263

Query: 253 NGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSF 312
           NGGTIEL+YMQTYAPTTLAA RDFWTLRYT+ LEDGSLVICERSLT STGGPSGP   +F
Sbjct: 264 NGGTIELIYMQTYAPTTLAAPRDFWTLRYTSGLEDGSLVICERSLTQSTGGPSGPNTPNF 323

Query: 313 IRAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRY 372
           IRAE+LPSGYLIR CE GGS+I+IVDHVDLD WSVPEVLRPLYES KILAQ+MT+AALR+
Sbjct: 324 IRAEVLPSGYLIRPCEGGGSMIYIVDHVDLDAWSVPEVLRPLYESPKILAQKMTIAALRH 383

Query: 373 VRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTVV 432
           +RQIA E+SGEI  G GRQPAV RTFSQRL R FNDAV+GF DDGWSL+ SDG ED+T+ 
Sbjct: 384 IRQIAHESSGEIPYGAGRQPAVFRTFSQRLSRGFNDAVSGFPDDGWSLLSSDGSEDITIS 443

Query: 433 INSSPNKFLGSQYNTS-LYPTF-GGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVD 492
           +NSSPNK +GS  + + L+ T  GG+LCAKASMLLQNVPPALLVRFLREHRSEWAD GVD
Sbjct: 444 VNSSPNKLVGSHVSPNPLFSTVGGGILCAKASMLLQNVPPALLVRFLREHRSEWADPGVD 503

Query: 493 AYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGG 552
           AYSAA L+AS YAVP  R  GF   QVILPLAHT+E+EEFLEV+RLEGH  F  +  L  
Sbjct: 504 AYSAASLRASPYAVPGLRTSGFMGSQVILPLAHTLEHEEFLEVIRLEGHG-FSHDEVLLS 563

Query: 553 RDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAATRT 612
           RDMYLLQLCSGVDEN   A AQLVFAPIDESFADDAPLLPSGFRVIPL++K D P+ATRT
Sbjct: 564 RDMYLLQLCSGVDENATSASAQLVFAPIDESFADDAPLLPSGFRVIPLDTKMDGPSATRT 623

Query: 613 LDLASTLEVRP-GTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVV 672
           LDLAS LEV P G +R   E   T    RSVLTIAFQF++ENH+R+SVAAMAR YVR V+
Sbjct: 624 LDLASALEVGPGGASRASVEASGTCN--RSVLTIAFQFSYENHLRESVAAMARSYVRAVM 683

Query: 673 GSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQ-SDAI 732
            SVQRVA+AIAPS+LG QIG K  P SPEALTLA WI RSYR H+G ++   +++ +D+ 
Sbjct: 684 ASVQRVAVAIAPSRLGPQIGMKHPPASPEALTLASWIGRSYRAHTGADIRWSDTEDADSP 743

Query: 733 LKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILC 792
           L  LW HSD ILCCS+K   +P+FTFAN AGLD+LETTLV+LQDI+L+ ILDD GRK LC
Sbjct: 744 LALLWKHSDAILCCSLK--PAPMFTFANNAGLDILETTLVNLQDISLEMILDDEGRKALC 803

Query: 793 SEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           SEF KIMQQGF YLP G+C SSMGR  SYEQA+AWKVL+DDD  HCLAFM VNW+FM
Sbjct: 804 SEFPKIMQQGFTYLPGGVCKSSMGRQASYEQAVAWKVLSDDDAPHCLAFMLVNWTFM 855

BLAST of HG10021418 vs. ExPASy TrEMBL
Match: A0A1S3BC42 (homeobox-leucine zipper protein ATHB-14-like OS=Cucumis melo OX=3656 GN=LOC103488304 PE=3 SV=1)

HSP 1 Score: 1647.9 bits (4266), Expect = 0.0e+00
Identity = 829/844 (98.22%), Positives = 834/844 (98.82%), Query Frame = 0

Query: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120
           KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ
Sbjct: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120

Query: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180
           QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG
Sbjct: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180

Query: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240
           TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR
Sbjct: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240

Query: 241 CVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300
           CVDVLS ISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST
Sbjct: 241 CVDVLSVISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300

Query: 301 GGPSGPPPSSFIRAEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILA 360
           GGPSGPPPSSF+RAEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILA
Sbjct: 301 GGPSGPPPSSFVRAEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILA 360

Query: 361 QRMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMD 420
           QRMTVAALRYVRQIAQEASGE+QLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMD
Sbjct: 361 QRMTVAALRYVRQIAQEASGEVQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMD 420

Query: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480
           SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS
Sbjct: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480

Query: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 540
           EWADYGVDAYSAACLKASAYAVPCARPGGFP GQVILPLAHTVENEEFLEVVRLEGHAMF
Sbjct: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPGGQVILPLAHTVENEEFLEVVRLEGHAMF 540

Query: 541 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600
           PEEAALGGRDMYLLQLCSGV+ENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT
Sbjct: 541 PEEAALGGRDMYLLQLCSGVEENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600

Query: 601 DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660
           + P ATRTLDLASTLEVRPGT RPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR
Sbjct: 601 EMPGATRTLDLASTLEVRPGTNRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660

Query: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVE 720
           QYV+TVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWI RSYRIHSG ELFQVE
Sbjct: 661 QYVQTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWITRSYRIHSGAELFQVE 720

Query: 721 SQS-DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD 780
           SQS DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD
Sbjct: 721 SQSGDAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD 780

Query: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840
           AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN
Sbjct: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840

Query: 841 WSFM 844
           WSFM
Sbjct: 841 WSFM 844

BLAST of HG10021418 vs. ExPASy TrEMBL
Match: A0A0A0LVI5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G538230 PE=3 SV=1)

HSP 1 Score: 1639.4 bits (4244), Expect = 0.0e+00
Identity = 826/844 (97.87%), Positives = 830/844 (98.34%), Query Frame = 0

Query: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120
           KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ
Sbjct: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120

Query: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180
           QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG
Sbjct: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180

Query: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240
           TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR
Sbjct: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240

Query: 241 CVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300
           CVDVLS ISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST
Sbjct: 241 CVDVLSVISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300

Query: 301 GGPSGPPPSSFIRAEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILA 360
           GGPSGPPPSSF+RAEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILA
Sbjct: 301 GGPSGPPPSSFVRAEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILA 360

Query: 361 QRMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMD 420
           QR TVAALRYVRQIAQEASGE+QLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMD
Sbjct: 361 QRTTVAALRYVRQIAQEASGEVQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMD 420

Query: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480
           SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS
Sbjct: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480

Query: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 540
           EWADYGVDAYSAACLKASAYAVPCARPGGFP GQVILPLAHTVENEEFLEVVRLEGHAMF
Sbjct: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPGGQVILPLAHTVENEEFLEVVRLEGHAMF 540

Query: 541 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600
           PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESK 
Sbjct: 541 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKA 600

Query: 601 DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660
           + P ATRTLDLASTLEVRPGT RPG ETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR
Sbjct: 601 EMPGATRTLDLASTLEVRPGTNRPGCETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660

Query: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVE 720
           QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLP SPEALTLAQWI RSYRIHSG ELFQVE
Sbjct: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPASPEALTLAQWITRSYRIHSGAELFQVE 720

Query: 721 SQS-DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD 780
           SQS DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITL+KILDD
Sbjct: 721 SQSGDAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLEKILDD 780

Query: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840
           AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN
Sbjct: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840

Query: 841 WSFM 844
           WSFM
Sbjct: 841 WSFM 844

BLAST of HG10021418 vs. ExPASy TrEMBL
Match: A0A5D3C592 (Homeobox-leucine zipper protein ATHB-14-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold178G00540 PE=3 SV=1)

HSP 1 Score: 1624.8 bits (4206), Expect = 0.0e+00
Identity = 817/831 (98.32%), Positives = 821/831 (98.80%), Query Frame = 0

Query: 14  MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR 73
           MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR
Sbjct: 1   MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR 60

Query: 74  EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS 133
           EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS
Sbjct: 61  EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS 120

Query: 134 CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP 193
           CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP
Sbjct: 121 CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP 180

Query: 194 GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTGNG 253
           GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLS ISTGNG
Sbjct: 181 GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSVISTGNG 240

Query: 254 GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFIR 313
           GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSF+R
Sbjct: 241 GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFVR 300

Query: 314 AEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRYVRQ 373
           AEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILAQRMTVAALRYVRQ
Sbjct: 301 AEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILAQRMTVAALRYVRQ 360

Query: 374 IAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTVVINS 433
           IAQEASGE+QLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMDSDGVEDVTVVINS
Sbjct: 361 IAQEASGEVQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMDSDGVEDVTVVINS 420

Query: 434 SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA 493
           SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA
Sbjct: 421 SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA 480

Query: 494 CLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL 553
           CLKASAYAVPCARPGGFP GQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL
Sbjct: 481 CLKASAYAVPCARPGGFPGGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL 540

Query: 554 LQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAATRTLDLAS 613
           LQLCSGV+ENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT+ P ATRTLDLAS
Sbjct: 541 LQLCSGVEENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTEMPGATRTLDLAS 600

Query: 614 TLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQRV 673
           TLEVRPGT RPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQRV
Sbjct: 601 TLEVRPGTNRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQRV 660

Query: 674 AMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQS-DAILKQLWH 733
           AMAIAPSQLGSQIGPKSLPGSPEALTLAQWI RSYRIHSG ELFQVESQS DAILKQLWH
Sbjct: 661 AMAIAPSQLGSQIGPKSLPGSPEALTLAQWITRSYRIHSGAELFQVESQSGDAILKQLWH 720

Query: 734 HSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSKI 793
           HSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSKI
Sbjct: 721 HSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSKI 780

Query: 794 MQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           MQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM
Sbjct: 781 MQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 831

BLAST of HG10021418 vs. ExPASy TrEMBL
Match: A0A6J1CFI1 (homeobox-leucine zipper protein HOX32-like OS=Momordica charantia OX=3673 GN=LOC111010757 PE=3 SV=1)

HSP 1 Score: 1621.3 bits (4197), Expect = 0.0e+00
Identity = 814/844 (96.45%), Positives = 825/844 (97.75%), Query Frame = 0

Query: 1   MALVMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60
           MAL +HRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP
Sbjct: 1   MALAIHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEP 60

Query: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQ 120
           KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQ QVSRLVY+NGYM+Q
Sbjct: 61  KQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQKQVSRLVYDNGYMKQ 120

Query: 121 QLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180
           QLHTASGTTTDNSCESVVMSGQ HQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG
Sbjct: 121 QLHTASGTTTDNSCESVVMSGQHHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATG 180

Query: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCR 240
           TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEP KVAEILKDRLSWFRDCR
Sbjct: 181 TAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPTKVAEILKDRLSWFRDCR 240

Query: 241 CVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300
           CVD+LS ISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST
Sbjct: 241 CVDILSVISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSST 300

Query: 301 GGPSGPPPSSFIRAEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILA 360
           GGPSGPPPSSF+RAEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILA
Sbjct: 301 GGPSGPPPSSFVRAEMLPSGYLIRACEGGSLIHIVDHVDLDAWSVPEVLRPLYESTKILA 360

Query: 361 QRMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMD 420
           QRMTVAALRYVRQI+QEASGEIQLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMD
Sbjct: 361 QRMTVAALRYVRQISQEASGEIQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMD 420

Query: 421 SDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480
           SDGVEDVTVVINSSPNKFLG QYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS
Sbjct: 421 SDGVEDVTVVINSSPNKFLGLQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRS 480

Query: 481 EWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 540
           EWADYGVDAYSA CLKASAYAVPCARPGGF SGQVILPLAHTVENEEFLEVVRLEGHAMF
Sbjct: 481 EWADYGVDAYSATCLKASAYAVPCARPGGFSSGQVILPLAHTVENEEFLEVVRLEGHAMF 540

Query: 541 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600
           PEEAAL GRDMYLLQLCSG+DEN VG+CAQLVFAPIDESFADDAPLLPSGFRVIPLESKT
Sbjct: 541 PEEAALAGRDMYLLQLCSGIDENAVGSCAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 600

Query: 601 DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660
           DAPAATRTLDLASTLEVRPGT RPG ETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR
Sbjct: 601 DAPAATRTLDLASTLEVRPGTARPGSETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 660

Query: 661 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVE 720
           QYVRTVVGSVQRVAMAI+PSQLGSQIGPKSLPGSPEALTLAQWI RSYRIH+G ELFQVE
Sbjct: 661 QYVRTVVGSVQRVAMAISPSQLGSQIGPKSLPGSPEALTLAQWICRSYRIHTGAELFQVE 720

Query: 721 SQS-DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDD 780
           SQS DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLV+LQDI LDKILDD
Sbjct: 721 SQSDDAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVALQDIMLDKILDD 780

Query: 781 AGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840
           AGRKILCSEFSKIMQQGF YLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN
Sbjct: 781 AGRKILCSEFSKIMQQGFIYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVN 840

Query: 841 WSFM 844
           WSFM
Sbjct: 841 WSFM 844

BLAST of HG10021418 vs. ExPASy TrEMBL
Match: A0A5A7T0F7 (Homeobox-leucine zipper protein ATHB-14-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold9475G00140 PE=3 SV=1)

HSP 1 Score: 1620.1 bits (4194), Expect = 0.0e+00
Identity = 817/832 (98.20%), Positives = 821/832 (98.68%), Query Frame = 0

Query: 14  MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR 73
           MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR
Sbjct: 1   MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR 60

Query: 74  EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS 133
           EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS
Sbjct: 61  EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS 120

Query: 134 CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP 193
           CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP
Sbjct: 121 CESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMKP 180

Query: 194 GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTGNG 253
           GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLS ISTGNG
Sbjct: 181 GPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSVISTGNG 240

Query: 254 GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFIR 313
           GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSF+R
Sbjct: 241 GTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFVR 300

Query: 314 AEMLPSGYLIRACEGGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRYVRQ 373
           AEMLPSGYLIRACEGGSLIHIVDHVDLD WSVPEVLRPLYESTKILAQRMTVAALRYVRQ
Sbjct: 301 AEMLPSGYLIRACEGGSLIHIVDHVDLDVWSVPEVLRPLYESTKILAQRMTVAALRYVRQ 360

Query: 374 IAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTVVINS 433
           IAQEASGE+QLGGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWSLMDSDGVEDVTVVINS
Sbjct: 361 IAQEASGEVQLGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSLMDSDGVEDVTVVINS 420

Query: 434 SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA 493
           SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA
Sbjct: 421 SPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGVDAYSAA 480

Query: 494 CLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL 553
           CLKASAYAVPCARPGGFP GQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL
Sbjct: 481 CLKASAYAVPCARPGGFPGGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAALGGRDMYL 540

Query: 554 L-QLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAATRTLDLA 613
           L QLCSGV+ENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT+ P ATRTLDLA
Sbjct: 541 LQQLCSGVEENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTEMPGATRTLDLA 600

Query: 614 STLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQR 673
           STLEVRPGT RPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQR
Sbjct: 601 STLEVRPGTNRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTVVGSVQR 660

Query: 674 VAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQS-DAILKQLW 733
           VAMAIAPSQLGSQIGPKSLPGSPEALTLAQWI RSYRIHSG ELFQVESQS DAILKQLW
Sbjct: 661 VAMAIAPSQLGSQIGPKSLPGSPEALTLAQWITRSYRIHSGAELFQVESQSGDAILKQLW 720

Query: 734 HHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSK 793
           HHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSK
Sbjct: 721 HHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKILCSEFSK 780

Query: 794 IMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           IMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM
Sbjct: 781 IMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 832

BLAST of HG10021418 vs. TAIR 10
Match: AT2G34710.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 1326.2 bits (3431), Expect = 0.0e+00
Identity = 672/847 (79.34%), Positives = 744/847 (87.84%), Query Frame = 0

Query: 4   VMHRDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQI 63
           +M+R+S +K +D+ KYVRYTPEQV+ALERVY ECPKPSSLRRQQLIRECPILSNIEPKQI
Sbjct: 11  MMNRESPDKGLDSGKYVRYTPEQVEALERVYTECPKPSSLRRQQLIRECPILSNIEPKQI 70

Query: 64  KVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLH 123
           KVWFQNRRCREKQRKEA+RLQTVNRKL AMNKLLMEENDRLQ QVS LVYENG+M+ QLH
Sbjct: 71  KVWFQNRRCREKQRKEAARLQTVNRKLNAMNKLLMEENDRLQKQVSNLVYENGHMKHQLH 130

Query: 124 TASGTTTDNSCESVVMSGQQHQQQNPT-KHTQKDANNPAGLLAIAEETLAEFLSKATGTA 183
           TASGTTTDNSCESVV+SGQQHQQQNP  +H Q+DANNPAGLL+IAEE LAEFLSKATGTA
Sbjct: 131 TASGTTTDNSCESVVVSGQQHQQQNPNPQHQQRDANNPAGLLSIAEEALAEFLSKATGTA 190

Query: 184 VDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCV 243
           VDWVQMIGMKPGPDSIGIVA+SRNC+G+AARACGLVSLEPMKVAEILKDR SW RDCR V
Sbjct: 191 VDWVQMIGMKPGPDSIGIVAISRNCSGIAARACGLVSLEPMKVAEILKDRPSWLRDCRSV 250

Query: 244 DVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGG 303
           D LS I  GNGGTIEL+Y Q YAPTTLAAARDFWTLRY+T LEDGS V+CERSLTS+TGG
Sbjct: 251 DTLSVIPAGNGGTIELIYTQMYAPTTLAAARDFWTLRYSTCLEDGSYVVCERSLTSATGG 310

Query: 304 PSGPPPSSFIRAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQ 363
           P+GPP S+F+RAEM PSG+LIR C+ GGS++HIVDHVDLD WSVPEV+RPLYES+KILAQ
Sbjct: 311 PTGPPSSNFVRAEMKPSGFLIRPCDGGGSILHIVDHVDLDAWSVPEVMRPLYESSKILAQ 370

Query: 364 RMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDS 423
           +MTVAALR+VRQIAQE SGE+Q GGGRQPAVLRTFSQRLCR FNDAVNGFVDDGWS M S
Sbjct: 371 KMTVAALRHVRQIAQETSGEVQYGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSPMGS 430

Query: 424 DGVEDVTVVINSSPNKFLGSQYNTSLYPTFG-GVLCAKASMLLQNVPPALLVRFLREHRS 483
           DG EDVTV+IN SP KF GSQY  S  P+FG GVLCAKASMLLQNVPPA+LVRFLREHRS
Sbjct: 431 DGAEDVTVMINLSPGKFGGSQYGNSFLPSFGSGVLCAKASMLLQNVPPAVLVRFLREHRS 490

Query: 484 EWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMF 543
           EWADYGVDAY+AA L+AS +AVPCAR GGFPS QVILPLA TVE+EE LEVVRLEGHA  
Sbjct: 491 EWADYGVDAYAAASLRASPFAVPCARAGGFPSNQVILPLAQTVEHEESLEVVRLEGHAYS 550

Query: 544 PEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT 603
           PE+  L  RDMYLLQLCSGVDEN VG CAQLVFAPIDESFADDAPLLPSGFR+IPLE K+
Sbjct: 551 PEDMGL-ARDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRIIPLEQKS 610

Query: 604 --DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAM 663
             +  +A RTLDLAS LE   G+TR  GE D    N RSVLTIAFQFTF+NH RDSVA+M
Sbjct: 611 TPNGASANRTLDLASALE---GSTRQAGEADPNGCNFRSVLTIAFQFTFDNHSRDSVASM 670

Query: 664 ARQYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQ 723
           ARQYVR++VGS+QRVA+AIAP + GS I P S+P SPEALTL +WI+RSY +H+G +LF 
Sbjct: 671 ARQYVRSIVGSIQRVALAIAP-RPGSNISPISVPTSPEALTLVRWISRSYSLHTGADLFG 730

Query: 724 VESQS--DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKI 783
            +SQ+  D +L QLW+HSD ILCCS+KTNASPVFTFANQ GLDMLETTLV+LQDI LDK 
Sbjct: 731 SDSQTSGDTLLHQLWNHSDAILCCSLKTNASPVFTFANQTGLDMLETTLVALQDIMLDKT 790

Query: 784 LDDAGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFM 843
           LD+ GRK LCSEF KIMQQG+A+LPAG+C SSMGR VSYEQA  WKVL DD+ +HCLAFM
Sbjct: 791 LDEPGRKALCSEFPKIMQQGYAHLPAGVCASSMGRMVSYEQATVWKVLEDDESNHCLAFM 850

BLAST of HG10021418 vs. TAIR 10
Match: AT1G30490.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 1280.8 bits (3313), Expect = 0.0e+00
Identity = 656/846 (77.54%), Positives = 732/846 (86.52%), Query Frame = 0

Query: 7   RDSLNKQMDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVW 66
           RDS +K  D+ KYVRYTPEQV+ALERVYAECPKPSSLRRQQLIRECPIL NIEP+QIKVW
Sbjct: 10  RDSPDKGFDSGKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILCNIEPRQIKVW 69

Query: 67  FQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTAS 126
           FQNRRCREKQRKE++RLQTVNRKL+AMNKLLMEENDRLQ QVS LVYENG+M+ ++HTAS
Sbjct: 70  FQNRRCREKQRKESARLQTVNRKLSAMNKLLMEENDRLQKQVSNLVYENGFMKHRIHTAS 129

Query: 127 GTTTDNSCESVVMSGQQHQQQNPT-KHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDW 186
           GTTTDNSCESVV+SGQQ QQQNPT +H Q+D NNPA LL+IAEETLAEFL KATGTAVDW
Sbjct: 130 GTTTDNSCESVVVSGQQRQQQNPTHQHPQRDVNNPANLLSIAEETLAEFLCKATGTAVDW 189

Query: 187 VQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVL 246
           VQMIGMKPGPDSIGIVAVSRNC+G+AARACGLVSLEPMKVAEILKDR SWFRDCRCV+ L
Sbjct: 190 VQMIGMKPGPDSIGIVAVSRNCSGIAARACGLVSLEPMKVAEILKDRPSWFRDCRCVETL 249

Query: 247 SAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSG 306
           + I TGNGGTIEL+  Q YAPTTLAAARDFWTLRY+TSLEDGS V+CERSLTS+TGGP+G
Sbjct: 250 NVIPTGNGGTIELVNTQIYAPTTLAAARDFWTLRYSTSLEDGSYVVCERSLTSATGGPNG 309

Query: 307 PPPSSFIRAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMT 366
           P  SSF+RA+ML SG+LIR C+ GGS+IHIVDHVDLD  SVPEVLRPLYES+KILAQ+MT
Sbjct: 310 PLSSSFVRAKMLSSGFLIRPCDGGGSIIHIVDHVDLDVSSVPEVLRPLYESSKILAQKMT 369

Query: 367 VAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGV 426
           VAALR+VRQIAQE SGE+Q  GGRQPAVLRTFSQRLCR FNDAVNGFVDDGWS M SDG 
Sbjct: 370 VAALRHVRQIAQETSGEVQYSGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSPMSSDGG 429

Query: 427 EDVTVVINSSPNKFLGSQYNTSLYPTFG-GVLCAKASMLLQNVPPALLVRFLREHRSEWA 486
           ED+T++INSS  KF GSQY +S  P+FG GVLCAKASMLLQNVPP +L+RFLREHR+EWA
Sbjct: 430 EDITIMINSSSAKFAGSQYGSSFLPSFGSGVLCAKASMLLQNVPPLVLIRFLREHRAEWA 489

Query: 487 DYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEE 546
           DYGVDAYSAA L+A+ YAVPC R GGFPS QVILPLA T+E+EEFLEVVRL GHA  PE+
Sbjct: 490 DYGVDAYSAASLRATPYAVPCVRTGGFPSNQVILPLAQTLEHEEFLEVVRLGGHAYSPED 549

Query: 547 AALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKT--- 606
             L  RDMYLLQLCSGVDEN VG CAQLVFAPIDESFADDAPLLPSGFRVIPL+ KT   
Sbjct: 550 MGL-SRDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRVIPLDQKTNPN 609

Query: 607 DAPAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMAR 666
           D  +A+RT DLAS+L+   G+T+   ET     N R VLTIAFQFTF+NH RD+VA MAR
Sbjct: 610 DHQSASRTRDLASSLD---GSTKTDSET-----NSRLVLTIAFQFTFDNHSRDNVATMAR 669

Query: 667 QYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVE 726
           QYVR VVGS+QRVA+AI P     + G   LP SPEALTL +WI RSY IH+G +LF  +
Sbjct: 670 QYVRNVVGSIQRVALAITP-----RPGSMQLPTSPEALTLVRWITRSYSIHTGADLFGAD 729

Query: 727 SQS---DAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKIL 786
           SQS   D +LKQLW HSD ILCCS+KTNASPVFTFANQAGLDMLETTLV+LQDI LDK L
Sbjct: 730 SQSCGGDTLLKQLWDHSDAILCCSLKTNASPVFTFANQAGLDMLETTLVALQDIMLDKTL 789

Query: 787 DDAGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMF 844
           DD+GR+ LCSEF+KIMQQG+A LPAGICVSSMGRPVSYEQA  WKV++D++ +HCLAF  
Sbjct: 790 DDSGRRALCSEFAKIMQQGYANLPAGICVSSMGRPVSYEQATVWKVVDDNESNHCLAFTL 841

BLAST of HG10021418 vs. TAIR 10
Match: AT5G60690.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 1112.1 bits (2875), Expect = 0.0e+00
Identity = 575/860 (66.86%), Positives = 694/860 (80.70%), Query Frame = 0

Query: 1   MALVMHR----DSLNKQMDTS-KYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPIL 60
           MA+  HR    DS+N+ +D+S KYVRYT EQV+ALERVYAECPKPSSLRRQQLIREC IL
Sbjct: 3   MAVANHRERSSDSMNRHLDSSGKYVRYTAEQVEALERVYAECPKPSSLRRQQLIRECSIL 62

Query: 61  SNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYEN 120
           +NIEPKQIKVWFQNRRCR+KQRKEASRLQ+VNRKL+AMNKLLMEENDRLQ QVS+LV EN
Sbjct: 63  ANIEPKQIKVWFQNRRCRDKQRKEASRLQSVNRKLSAMNKLLMEENDRLQKQVSQLVCEN 122

Query: 121 GYMRQQLHTASGTTTDNSCESVVMSGQQHQQQNPTKHTQKDANNPAGLLAIAEETLAEFL 180
           GYM+QQL T      D SCESVV + Q         H+ +DAN+PAGLL+IAEETLAEFL
Sbjct: 123 GYMKQQLTT---VVNDPSCESVVTTPQ---------HSLRDANSPAGLLSIAEETLAEFL 182

Query: 181 SKATGTAVDWVQMIGMKPGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSW 240
           SKATGTAVDWVQM GMKPGPDS+GI A+S+ CNGVAARACGLVSLEPMK+AEILKDR SW
Sbjct: 183 SKATGTAVDWVQMPGMKPGPDSVGIFAISQRCNGVAARACGLVSLEPMKIAEILKDRPSW 242

Query: 241 FRDCRCVDVLSAISTGNGGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERS 300
           FRDCR ++V +    GNGGTIEL+YMQTYAPTTLA ARDFWTLRYTTSL++GS V+CERS
Sbjct: 243 FRDCRSLEVFTMFPAGNGGTIELVYMQTYAPTTLAPARDFWTLRYTTSLDNGSFVVCERS 302

Query: 301 LTSSTGGPSGPPPSSFIRAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYE 360
           L+ S  GP+    S F+RAEML SGYLIR C+ GGS+IHIVDH++L+ WSVP+VLRPLYE
Sbjct: 303 LSGSGAGPNAASASQFVRAEMLSSGYLIRPCDGGGSIIHIVDHLNLEAWSVPDVLRPLYE 362

Query: 361 STKILAQRMTVAALRYVRQIAQEASGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDD 420
           S+K++AQ+MT++ALRY+RQ+AQE++GE+  G GRQPAVLRTFSQRL R FNDAVNGF DD
Sbjct: 363 SSKVVAQKMTISALRYIRQLAQESNGEVVYGLGRQPAVLRTFSQRLSRGFNDAVNGFGDD 422

Query: 421 GWSLMDSDGVEDVTVVINSSPNKFLGSQYNTSLYPTFGGVLCAKASMLLQNVPPALLVRF 480
           GWS M  DG ED+ V INS+  K L +  N+  +   GGVLCAKASMLLQNVPPA+L+RF
Sbjct: 423 GWSTMHCDGAEDIIVAINST--KHLNNISNSLSF--LGGVLCAKASMLLQNVPPAVLIRF 482

Query: 481 LREHRSEWADYGVDAYSAACLKASAYAVPCARPGGFPSGQVILPLAHTVENEEFLEVVRL 540
           LREHRSEWAD+ VDAYSAA LKA ++A P  RP  F   Q+I+PL HT+E+EE LEVVRL
Sbjct: 483 LREHRSEWADFNVDAYSAATLKAGSFAYPGMRPTRFTGSQIIMPLGHTIEHEEMLEVVRL 542

Query: 541 EGHAMFPEEAALGGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVI 600
           EGH++  E+A +  RD++LLQ+C+G+DEN VGAC++L+FAPI+E F DDAPL+PSGFRVI
Sbjct: 543 EGHSLAQEDAFM-SRDVHLLQICTGIDENAVGACSELIFAPINEMFPDDAPLVPSGFRVI 602

Query: 601 PLESKTD-----APAATRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFEN 660
           P+++KT        A  RTLDL S+LEV P      G +  ++ + R +LTIAFQF FEN
Sbjct: 603 PVDAKTGDVQDLLTANHRTLDLTSSLEVGPSPENASGNS-FSSSSSRCILTIAFQFPFEN 662

Query: 661 HMRDSVAAMARQYVRTVVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYR 720
           +++++VA MA QYVR+V+ SVQRVAMAI+PS +   +G K  PGSPEA+TLAQWI++SY 
Sbjct: 663 NLQENVAGMACQYVRSVISSVQRVAMAISPSGISPSLGSKLSPGSPEAVTLAQWISQSYS 722

Query: 721 IHSGTELFQVES--QSDAILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVS 780
            H G+EL  ++S    D++LK LW H D ILCCS+K    PVF FANQAGLDMLETTLV+
Sbjct: 723 HHLGSELLTIDSLGSDDSVLKLLWDHQDAILCCSLK--PQPVFMFANQAGLDMLETTLVA 782

Query: 781 LQDITLDKILDDAGRKILCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVL--- 840
           LQDITL+KI D++GRK +CS+F+K+MQQGFA LP+GICVS+MGR VSYEQA+AWKV    
Sbjct: 783 LQDITLEKIFDESGRKAICSDFAKLMQQGFACLPSGICVSTMGRHVSYEQAVAWKVFAAS 842

Query: 841 -NDDDVHHCLAFMFVNWSFM 844
             +++  HCLAF FVNWSF+
Sbjct: 843 EENNNNLHCLAFSFVNWSFV 842

BLAST of HG10021418 vs. TAIR 10
Match: AT1G52150.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 1070.5 bits (2767), Expect = 7.2e-313
Identity = 555/838 (66.23%), Positives = 667/838 (79.59%), Query Frame = 0

Query: 14  MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR 73
           +D  KYVRYTPEQV+ALER+Y +CPKPSS+RRQQLIRECPILSNIEPKQIKVWFQNRRCR
Sbjct: 13  LDNGKYVRYTPEQVEALERLYHDCPKPSSIRRQQLIRECPILSNIEPKQIKVWFQNRRCR 72

Query: 74  EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS 133
           EKQRKEASRLQ VNRKLTAMNKLLMEENDRLQ QVS+LV+EN Y RQ     S    D S
Sbjct: 73  EKQRKEASRLQAVNRKLTAMNKLLMEENDRLQKQVSQLVHENSYFRQHTPNPSLPAKDTS 132

Query: 134 CESVVMSGQ-QHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMK 193
           CESVV SGQ Q   QNP    Q+DA +PAGLL+IAEETLAEFLSKATGTAV+WVQM GMK
Sbjct: 133 CESVVTSGQHQLASQNP----QRDA-SPAGLLSIAEETLAEFLSKATGTAVEWVQMPGMK 192

Query: 194 PGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTGN 253
           PGPDSIGI+A+S  C GVAARACGLV LEP +VAEI+KDR SWFR+CR V+V++ + T N
Sbjct: 193 PGPDSIGIIAISHGCTGVAARACGLVGLEPTRVAEIVKDRPSWFRECRAVEVMNVLPTAN 252

Query: 254 GGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFI 313
           GGT+ELLYMQ YAPTTLA  RDFW LRYT+ LEDGSLV+CERSL S+  GPS P   +F+
Sbjct: 253 GGTVELLYMQLYAPTTLAPPRDFWLLRYTSVLEDGSLVVCERSLKSTQNGPSMPLVQNFV 312

Query: 314 RAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRYV 373
           RAEML SGYLIR C+ GGS+IHIVDH+DL+  SVPEVLRPLYES K+LAQ+ T+AALR +
Sbjct: 313 RAEMLSSGYLIRPCDGGGSIIHIVDHMDLEACSVPEVLRPLYESPKVLAQKTTMAALRQL 372

Query: 374 RQIAQEA--SGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTV 433
           +QIAQE   +     G GR+PA LR  SQRL R FN+AVNGF D+GWS++  D ++DVT+
Sbjct: 373 KQIAQEVTQTNSSVNGWGRRPAALRALSQRLSRGFNEAVNGFTDEGWSVI-GDSMDDVTI 432

Query: 434 VINSSPNKFLGSQ--YNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGV 493
            +NSSP+K +G    +     P    VLCAKASMLLQNVPPA+L+RFLREHRSEWAD  +
Sbjct: 433 TVNSSPDKLMGLNLTFANGFAPVSNVVLCAKASMLLQNVPPAILLRFLREHRSEWADNNI 492

Query: 494 DAYSAACLKASAYAVPC-ARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAAL 553
           DAY AA +K      PC AR GGF  GQVILPLAHT+E+EEF+EV++LEG    PE+A +
Sbjct: 493 DAYLAAAVKVG----PCSARVGGF-GGQVILPLAHTIEHEEFMEVIKLEGLGHSPEDAIV 552

Query: 554 GGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLESKTDAPAAT 613
             RD++LLQLCSG+DEN VG CA+L+FAPID SFADDAPLLPSGFR+IPL+S  +  +  
Sbjct: 553 -PRDIFLLQLCSGMDENAVGTCAELIFAPIDASFADDAPLLPSGFRIIPLDSAKEVSSPN 612

Query: 614 RTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRTV 673
           RTLDLAS LE+    T+   +    +   RSV+TIAF+F  E+HM++ VA+MARQYVR +
Sbjct: 613 RTLDLASALEIGSAGTKASTDQSGNSTCARSVMTIAFEFGIESHMQEHVASMARQYVRGI 672

Query: 674 VGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQ-SDA 733
           + SVQRVA+A++PS + SQ+G ++  G+PEA TLA+WI +SYR + G EL +  S  +++
Sbjct: 673 ISSVQRVALALSPSHISSQVGLRTPLGTPEAQTLARWICQSYRGYMGVELLKSNSDGNES 732

Query: 734 ILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKIL 793
           ILK LWHH+D I+CCS+K  A PVFTFANQAGLDMLETTLV+LQDI+L+KI DD GRK L
Sbjct: 733 ILKNLWHHTDAIICCSMK--ALPVFTFANQAGLDMLETTLVALQDISLEKIFDDNGRKTL 792

Query: 794 CSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           CSEF +IMQQGFA L  GIC+SSMGRPVSYE+A+AWKVLN+++  HC+ F+F+NWSF+
Sbjct: 793 CSEFPQIMQQGFACLQGGICLSSMGRPVSYERAVAWKVLNEEENAHCICFVFINWSFV 836

BLAST of HG10021418 vs. TAIR 10
Match: AT1G52150.2 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )

HSP 1 Score: 1068.1 bits (2761), Expect = 3.5e-312
Identity = 556/839 (66.27%), Positives = 668/839 (79.62%), Query Frame = 0

Query: 14  MDTSKYVRYTPEQVDALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCR 73
           +D  KYVRYTPEQV+ALER+Y +CPKPSS+RRQQLIRECPILSNIEPKQIKVWFQNRRCR
Sbjct: 13  LDNGKYVRYTPEQVEALERLYHDCPKPSSIRRQQLIRECPILSNIEPKQIKVWFQNRRCR 72

Query: 74  EKQRKEASRLQTVNRKLTAMNKLLMEENDRLQTQVSRLVYENGYMRQQLHTASGTTTDNS 133
           EKQRKEASRLQ VNRKLTAMNKLLMEENDRLQ QVS+LV+EN Y RQ     S    D S
Sbjct: 73  EKQRKEASRLQAVNRKLTAMNKLLMEENDRLQKQVSQLVHENSYFRQHTPNPSLPAKDTS 132

Query: 134 CESVVMSGQ-QHQQQNPTKHTQKDANNPAGLLAIAEETLAEFLSKATGTAVDWVQMIGMK 193
           CESVV SGQ Q   QNP    Q+DA +PAGLL+IAEETLAEFLSKATGTAV+WVQM GMK
Sbjct: 133 CESVVTSGQHQLASQNP----QRDA-SPAGLLSIAEETLAEFLSKATGTAVEWVQMPGMK 192

Query: 194 PGPDSIGIVAVSRNCNGVAARACGLVSLEPMKVAEILKDRLSWFRDCRCVDVLSAISTGN 253
           PGPDSIGI+A+S  C GVAARACGLV LEP +VAEI+KDR SWFR+CR V+V++ + T N
Sbjct: 193 PGPDSIGIIAISHGCTGVAARACGLVGLEPTRVAEIVKDRPSWFRECRAVEVMNVLPTAN 252

Query: 254 GGTIELLYMQTYAPTTLAAARDFWTLRYTTSLEDGSLVICERSLTSSTGGPSGPPPSSFI 313
           GGT+ELLYMQ YAPTTLA  RDFW LRYT+ LEDGSLV+CERSL S+  GPS P   +F+
Sbjct: 253 GGTVELLYMQLYAPTTLAPPRDFWLLRYTSVLEDGSLVVCERSLKSTQNGPSMPLVQNFV 312

Query: 314 RAEMLPSGYLIRACE-GGSLIHIVDHVDLDFWSVPEVLRPLYESTKILAQRMTVAALRYV 373
           RAEML SGYLIR C+ GGS+IHIVDH+DL+  SVPEVLRPLYES K+LAQ+ T+AALR +
Sbjct: 313 RAEMLSSGYLIRPCDGGGSIIHIVDHMDLEACSVPEVLRPLYESPKVLAQKTTMAALRQL 372

Query: 374 RQIAQEA--SGEIQLGGGRQPAVLRTFSQRLCRVFNDAVNGFVDDGWSLMDSDGVEDVTV 433
           +QIAQE   +     G GR+PA LR  SQRL R FN+AVNGF D+GWS++  D ++DVT+
Sbjct: 373 KQIAQEVTQTNSSVNGWGRRPAALRALSQRLSRGFNEAVNGFTDEGWSVI-GDSMDDVTI 432

Query: 434 VINSSPNKFLGSQ--YNTSLYPTFGGVLCAKASMLLQNVPPALLVRFLREHRSEWADYGV 493
            +NSSP+K +G    +     P    VLCAKASMLLQNVPPA+L+RFLREHRSEWAD  +
Sbjct: 433 TVNSSPDKLMGLNLTFANGFAPVSNVVLCAKASMLLQNVPPAILLRFLREHRSEWADNNI 492

Query: 494 DAYSAACLKASAYAVPC-ARPGGFPSGQVILPLAHTVENEEFLEVVRLEGHAMFPEEAAL 553
           DAY AA +K      PC AR GGF  GQVILPLAHT+E+EEF+EV++LEG    PE+A +
Sbjct: 493 DAYLAAAVKVG----PCSARVGGF-GGQVILPLAHTIEHEEFMEVIKLEGLGHSPEDAIV 552

Query: 554 GGRDMYLLQLCSGVDENTVGACAQLVFAPIDESFADDAPLLPSGFRVIPLES-KTDAPAA 613
             RD++LLQLCSG+DEN VG CA+L+FAPID SFADDAPLLPSGFR+IPL+S K +  + 
Sbjct: 553 -PRDIFLLQLCSGMDENAVGTCAELIFAPIDASFADDAPLLPSGFRIIPLDSAKQEVSSP 612

Query: 614 TRTLDLASTLEVRPGTTRPGGETDVTNYNLRSVLTIAFQFTFENHMRDSVAAMARQYVRT 673
            RTLDLAS LE+    T+   +    +   RSV+TIAF+F  E+HM++ VA+MARQYVR 
Sbjct: 613 NRTLDLASALEIGSAGTKASTDQSGNSTCARSVMTIAFEFGIESHMQEHVASMARQYVRG 672

Query: 674 VVGSVQRVAMAIAPSQLGSQIGPKSLPGSPEALTLAQWIARSYRIHSGTELFQVESQ-SD 733
           ++ SVQRVA+A++PS + SQ+G ++  G+PEA TLA+WI +SYR + G EL +  S  ++
Sbjct: 673 IISSVQRVALALSPSHISSQVGLRTPLGTPEAQTLARWICQSYRGYMGVELLKSNSDGNE 732

Query: 734 AILKQLWHHSDTILCCSVKTNASPVFTFANQAGLDMLETTLVSLQDITLDKILDDAGRKI 793
           +ILK LWHH+D I+CCS+K  A PVFTFANQAGLDMLETTLV+LQDI+L+KI DD GRK 
Sbjct: 733 SILKNLWHHTDAIICCSMK--ALPVFTFANQAGLDMLETTLVALQDISLEKIFDDNGRKT 792

Query: 794 LCSEFSKIMQQGFAYLPAGICVSSMGRPVSYEQAIAWKVLNDDDVHHCLAFMFVNWSFM 844
           LCSEF +IMQQGFA L  GIC+SSMGRPVSYE+A+AWKVLN+++  HC+ F+F+NWSF+
Sbjct: 793 LCSEFPQIMQQGFACLQGGICLSSMGRPVSYERAVAWKVLNEEENAHCICFVFINWSFV 837

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_008445207.1	0.0e+00	98.22	PREDICTED: homeobox-leucine zipper protein ATHB-14-like [Cucumis melo]
XP_038895414.1	0.0e+00	98.22	homeobox-leucine zipper protein ATHB-14-like [Benincasa hispida]
KGN65925.1	0.0e+00	97.87	hypothetical protein Csa_023222 [Cucumis sativus]
XP_031741286.1	0.0e+00	97.85	homeobox-leucine zipper protein ATHB-14 isoform X2 [Cucumis sativus]
TYK05546.1	0.0e+00	98.32	homeobox-leucine zipper protein ATHB-14-like [Cucumis melo var. makuwa]

Match Name	E-value	Identity	Description
O04291	0.0e+00	79.34	Homeobox-leucine zipper protein ATHB-14 OS=Arabidopsis thaliana OX=3702 GN=ATHB-...
A2XK30	0.0e+00	80.65	Homeobox-leucine zipper protein HOX32 OS=Oryza sativa subsp. indica OX=39946 GN=...
Q6AST1	0.0e+00	80.65	Homeobox-leucine zipper protein HOX32 OS=Oryza sativa subsp. japonica OX=39947 G...
O04292	0.0e+00	77.54	Homeobox-leucine zipper protein ATHB-9 OS=Arabidopsis thaliana OX=3702 GN=ATHB-9...
A2ZMN9	0.0e+00	76.82	Homeobox-leucine zipper protein HOX33 OS=Oryza sativa subsp. indica OX=39946 GN=...

Match Name	E-value	Identity	Description
A0A1S3BC42	0.0e+00	98.22	homeobox-leucine zipper protein ATHB-14-like OS=Cucumis melo OX=3656 GN=LOC10348...
A0A0A0LVI5	0.0e+00	97.87	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G538230 PE=3 SV=1
A0A5D3C592	0.0e+00	98.32	Homeobox-leucine zipper protein ATHB-14-like OS=Cucumis melo var. makuwa OX=1194...
A0A6J1CFI1	0.0e+00	96.45	homeobox-leucine zipper protein HOX32-like OS=Momordica charantia OX=3673 GN=LOC...
A0A5A7T0F7	0.0e+00	98.20	Homeobox-leucine zipper protein ATHB-14-like OS=Cucumis melo var. makuwa OX=1194...

Match Name	E-value	Identity	Description
AT2G34710.1	0.0e+00	79.34	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT1G30490.1	0.0e+00	77.54	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT5G60690.1	0.0e+00	66.86	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT1G52150.1	7.2e-313	66.23	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT1G52150.2	3.5e-312	66.27	Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 87..107
None	No IPR available	GENE3D	1.10.10.60		coord: 10..84; e-value: 3.9E-21; score: 76.3
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 126..156
None	No IPR available	PANTHER	PTHR45950:SF7	HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-14	coord: 6..843
None	No IPR available	CDD	cd14686	bZIP	coord: 70..109; e-value: 1.85252E-6; score: 43.6881
None	No IPR available	SUPERFAMILY	55961	Bet v1-like	coord: 166..376
None	No IPR available	SUPERFAMILY	55961	Bet v1-like	coord: 413..598
IPR001356	Homeobox domain	SMART	SM00389	HOX_1	coord: 15..81; e-value: 7.1E-16; score: 68.7
IPR001356	Homeobox domain	PFAM	PF00046	Homeodomain	coord: 18..76; e-value: 3.6E-16; score: 58.8
IPR001356	Homeobox domain	PROSITE	PS50071	HOMEOBOX_2	coord: 13..77; score: 15.596498
IPR001356	Homeobox domain	CDD	cd00086	homeodomain	coord: 18..78; e-value: 2.94965E-17; score: 74.5872
IPR002913	START domain	SMART	SM00234	START_1	coord: 165..374; e-value: 1.6E-49; score: 180.5
IPR002913	START domain	PFAM	PF01852	START	coord: 166..373; e-value: 3.1E-50; score: 170.5
IPR002913	START domain	PROSITE	PS50848	START	coord: 156..383; score: 25.068983
IPR023393	START-like domain superfamily	GENE3D	3.30.530.20		coord: 185..377; e-value: 9.0E-20; score: 73.0
IPR013978	MEKHLA	PFAM	PF08670	MEKHLA	coord: 699..842; e-value: 2.1E-50; score: 170.3
IPR044830	Class III homeodomain-leucine zipper family	PANTHER	PTHR45950	HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-14	coord: 6..843
IPR009057	Homeobox-like domain superfamily	SUPERFAMILY	46689	Homeodomain-like	coord: 18..80

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10021418.1	HG10021418.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0030154	cell differentiation
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005634	nucleus
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0003700	DNA-binding transcription factor activity
molecular_function	GO:0008289	lipid binding