HG10022254 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022254
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DUF4378 domain-containing protein
Location: Chr05: 22373790 .. 22380620 (+)
RNA-Seq Expression: HG10022254
Synteny: HG10022254
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTACTGTTACCGTATTCAGGAAGCAGAGAGATTGAGGAACCTAGAAATCACGGGAAGACAAAGTAAGCTTTTCAATGTTCTGCTTTTGTAATTGATACGTTTTTATAATTCACTTTGGTAGTTAATAAAGCTTTATGTCTCAAGAGGGGACAATTGGTTGTTGAACTTAAGACACATGAACTTTCTCTCTCTCTCTCTCTCTCTCTCTCTTTTCCTCCTTTTCCTGAAATATTCTTAATGATTTTTTGCAGACAATAATTTTATTTTCAGTAAACTACTTGAGGTCGGTGGTTGTCTACCATAAAGTTTTGTCGGTAGAGTTTCTCCAGATGGAACCTAGACAGTCTACAGCTAGTGTTCTTGAAGTATTGATGGGCTTTGATGAGCGGCAATCTCAGCACCGTGCTCCAAGGCATCCTAAAGTTTTTTCTGACGATTATTTACAAAGGGTTGCATCCATTGGAATCTTGAAGAAGAAATCCTCCTCCAGATGTCATCCATTTAGGATGATAGTCGAAGAGTCAACAGAACTCTTCAATTCTCTCAAAGTAGAAAATAACTTTAGCCGCTGCAACGAGTTATGGGAACGGGAGAAGGTCGATTCTACTTTATCAGCAGCGTATATGCCACTTACAAGACACACCATTATGAATGAAAAGCACTTTTCAACAGGTAAGGTGATACAGACTTCAAAGGATTTTCAAGATTTACCAGTGGTTCTTGATTCTATGGACATCTCACCAAGACCTACAAGAGGAAAAAATTATATATTCAACCAGGCCAAAAATGAACCAAGTGTTTCAAAAGCACATTATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGGCAGGCACACTCGTCAGAGGATCTAGATTTTTTGATGCCTTCAAGACCTTTTTTAGAGTGGAGAGATAAACTACGTTTTTCCTCCTCTTCATCAACTTCTTTGAAAGGCTCACATTTAATTAATGATAAATGCAAAGATTGTCATAGTTCTCGAAATGGAAAGCTTATTGCTAAAGAAAAAAAAAGGACTATGGAGTATGCACTGCAGCCCATCAAGCAACCATCTCGAGTTTCAAGTATTCTGGATGGAAGTAGGAGAACAACGAAGCATAATTTTGTTAATTTACATTTGAAGACCTCAAGAACAGAAACCATATATGACGATGTGCGCAGAAAAGAAACCAAGTGCAAAAGGAATTCTTCCCCCCGTTTATCTAATTGCACGGCGGAATACAAGCATTCCTGCTTCTTTTCAGTTGAGTCATACGAGGCTAGAGAATCCAGGGAGGAAGTAATAGAAGAAGAAAGGAAGACGGAGAACTTGATGCTATTTACACGAGGTAGGAAAATGAATGAAATGCCTACACTGCCTCATTATGCAATTTTGCCCAGCGATTTGAATTGCAAACCTGTCAATTATGATTTCCAGAAGCATGATTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCCAGAAGGTTAAGAGACTAGATCAACTCAGTAAAAATTCCCATAGATCGAGATTTGATTCTACTTCTGTGGTGACTACAAGATCTAGAACCAGGAGCAGATACGAGGCCCTTCGAAATACATGGTTCTTAAAGCATGAAGGTCCTGGTACTTGGCTACAATGCACGCCATTGAATAGAAGTTCCAATAAAAAGGATGCTTCAGAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAGATTTTTCCTTGCCCTGATTCAGCAAGCGATCATGTTGACAACGATGGCTGTATGGTTGGTGATGATCTGAAGACCACAGTTGAGAAGAAAGACCTTTGTGATCAGAAATCTTTAAACTGTTTATCACCAAGGAGCAAAGTTGTTTTCTGCACACAAAACAATGCCGTCAAACGAGAAAATCAAGCCATAGAGTGTGCTTTGAAGAGTGATTTTCAAGATAATCTTTCAGGTATGGCTTCTAATTCATTGGCTGTAAAGACTG
ATGATGTAGCGAACCCTACTGTGGAGAAGCAAGAACCTAATTCATTGTCTGGCATCCTTTTAGAGACTCGTGGTGATTCATCTACCAACTCTTGCCGTGCCATGTATACTTCTATTCAACAGGTTCCATATTTCCCCTCATTAAGTTGTTCAGAATTAATTTTGCACGAAAGTAGAATAGTTAGGAGCATACTCTCTTCTAATGCAGTCTTGTATTCTATAATCTTATTGGCTAATTAACTTTGAATTTACTGAAATTGCTCAATTTTACTTCTGGCTAGTTGGAGTATATTTTTACTTAATTATGCAACATACAGTCAGACAAAGCTAAATGGTTTAAGCATACCTTCCAATTGAAGGTGGGAATACAGAATTATGATAAGAGAAATAAAATAATTTAGTTGTTTTTCTCCCACGATGAACCTTTGGATGTCTTTCAAAGATGATATAAATGCATAGCCAAGTTTATAAGCTATAGATATATGTGCCTTAGTTTCTTTTTCATGCATGCAAAAAGACTAATCACTAAGAACTTTGGTGTCAGAAGAAATGCCCACTTGAGGAAGTGAGAATTGGAAATTCAACTGAACTAGTTAACTTTCTGGTGTAAAAAAAATTCACTTGGTGTAAATTTGAGGGACTAGTCTCAAATTTTTTAATAGACCAAGCCCTTTAGACAGCTATTAATATTTCATAGTCTTGGTTTGGTTCTAATTTCTTGGGCTATAGACATGGTAAAGTAGTGTACTCAAGCTGCTGTAGAGTAATGGTCATCTGGGTGTATTTGACTTCTCTTTACATTACAAAGCACTAAGGTTCTTACTTCTACTCTAATTAAATGGAACATTTCAGAACCTAGATTTATGTTATACCAACGGGTATTCTACTTCATTCTATGCTCATTGGCGATTGGAGAACTTTCAAAGTCCACGTATAACAACTATAAATACACCTTTTTCCAATTGGAAGTGCTTTTTGCATTCCCGTTTGGCTTGGGAGGACTTGAATTTCTTCCCTTCTGTAATTTCAGATCATTTATGAAAAATTTTCTCGTGTAGAAAGACAAATTTTCTAAATAGTTGGTAATTTCTCTGTGTCAAGGTATTCATTCATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCCTGATATGTGCTTTGTATCATACCTCGTTTGTAGTTAGTACATCTTCTTTCGTTAGAGTATTTCCCTGTTTCTTGTAGTGGCATTTTTCCATTGTACGCCAATACTTCTTTATCCCCATCAAGTTTCTATATCCTTTATGGTTTAACATTTCTTAACCTATAAAAAAATTGCAGGAAGGTCTTGCCTTTGAACACTACCCTAGCAAAGAGCAAGATTCTATTGTGAATTTGGAGGAGGCTTATCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCTCAGGCATCAACAGTAGAGGTGACATATCTTATGACTATATTCTCACTTTCAAATTTATGCCTTCTTAAACACTCTCTCCATCCTCATCTAACATATTCTGGTTGTAGATTTAGTGATGCAACTTGAACTTCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCTATATGCGGTTCTGATGAAATTAATGACATCATGAGCACATTCAAATTCAAAGATAGTAGAGATTTTTCATACCTTGTTGATGTATTGAACGAGGCAAGCTTACATTGTAAAAACCTGGAGACGGGTTCTGTTCCGTTACATGGTCAGGAACATCAGGTAATCAGCCCCACAGTCTTTGAGATCTTAGAGAAGAAGTTTGGGGAACAAATTTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGACAGGATAAATTCTGGGTTAGTAGAACTCTTTCAGTCATTTGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCATTG
CTTAACCATGAAATGATCGAGGAAGAACTATGGATCTTGCTGGATAGCCAAGAAAGGGAAGTGAACAAGGATTTAGTAGATAAGCAGTTTGGAAAGGAGATTGGATGGATAGATCTTGGAGATGAGATTGATTCTATTTGTAGAGAAGTAGAGAGATTGTTGTTTAATGAGCTTGTTGCAGAGTTTGGTAGCATTGAATTATTTTGAATGGTATGATTTATAGTAGTCATGGATAAACATTAGCATACAAAAACAGAGAGATTTGTTTCTTTCTTTTCTTTTCTTTTTTAAAAGAAAAAGAGAGTCATATATTTGTGTATAGGAAACAGTTTTGATCCAGTTGAATTGTTTGAGGAGGATGTATTCTTTTATTCATGAGTGGAACCATAGTTTATTGTTGTCAAGTGTGACAGATTGATATTATTTTTATAAATCAAAGTCTGTGATGCCAATTTTGGCAGGTTGAGTTATATATATTTGTGTATGAACCTTTGTGTTTTTGTGTATAAACATGGACTTCTTCTTAGAGGTTGGAGGTTCGATCCTTATTATTGCAATTGTGTATTAAAAAAAAAAGACACATTAACCAAATAGATGTTTATTTATTTTAAAATAGATGCTACTCCACGTTTAAAAGTATTAACTGTTGGGATTGAAATAATCGTTGTTTACTTCTAGAGACTAAAATAGCCGTTGTAAAAGTTTAAGGTCTTATATTAGAACAAGACTCAAAGTTTAAAGATCAAAATAAAATTTAAATCTAAGGTTATATTACAAAAAGTTTACTCGAAGTGTATTTTATTTTATTTTATTTTGAATTTGGTTTGATTAATTATTTATTTAAATTTTCAGATGTAGGGAGAGCATTTAAATAATGCAATAATCAAAAGAGTACTTTTGTAAAAACTATAAAGAAACAAACCTAACAACGATTTTTAAATCAATTCCACGATCTCAAATTAAAGGGTTAATTAGATGAATATTAGCATTAAAAATATTGGTTATAAGGTCTTGGTTTGAAATTAATGATTGTATATACCCATTCATGATTTTTTTTAGATAAAGGATTAGACAACTAATGGCTTTTTTAGAATGACATTCTACCCTTTCTTTCAAAATCTTTATTTTACGACATATTTTCTTAATTAATTTAAAACCAACACTTTTCTACCAATTTTTTTTTTTAAATACTAATATTTTTTCTAATTAACCCCAAATGAAGTCGTAGCCTAATTGGGCCGATATAGTTAGTGGGCTAAAACCACCCATATTTTCAGGCCCAAAAAGGGCAAATTTGGCAAATCGAAAATTATACTTATAAGATCACAACCTTTTTCTCGTACACGAAAACCCTAGTTTTCTCAGTAGCAGCTCTTCCACCAGGTTCCATCTTCTCTTCTAGGGTTTCGCCGAGCGCGGCGCAGAGTCTAAGAGCACACGAGAAGAAATGGGTAAATTCTCAATCACATCATATCATTAACGAAATGGGACATTCTCAATCTAGTTTTTGGCTCTCTTGTTTTTCAAGGAAAACGATTAAATTTTTGTTATTGAATTTGTTGTTTTTGGTTTCTAGCGAGAATCAAGGTTCATGAGTTGAGGCAGAAATCGAAGGCGGATCTGTTGTTGCAGCTTAAGGATCTTAAGGCGGAGCTTGCTCTCCTTCGAGTCGCCAAAGTCACCGGCGGTGCCCTGAACAAGCGATCCAAAATGTAATCTCTCTCTCTCTTTTTTTTTTTTCTTTCTTTTCCGCCGCGATGTATCTCATTGTTTTGTTGATCTTGTTGTTTTATGTGGTTGATTTTGCAGCAAGGTAGTGAGATTGTCAATCGCTCAAGTCTTGACAGTAATTTCTCAGAAGCAGAAGTCTGCGTTGAGGGAAGCTTACAAGAAGAAGAAGCTATTGCCTCTCGATCTTCGCCCAAAGAAGACCAGGGCCATTCGTAGAAGGCTCACCAAGTACCAGGTAATTGTTTCAACTTTATTCTGTCCTGTTTCTT
TGTTTAATTTTTCAGTACAGTTTGGTTTTGTGTGTTCCTCTTTTAACTGTTTAATTAAATTTCAGTACGGATTATATGTCTAAATGTTGCCAATGTCTGTGCACGATACCTGTCGAGTTTTTCTATGTTGCAAAAGAATAAAAAAGTGCCGAACTATTTGATGAGGATGGGTTTGTTGGTTTTACAACCTTGATATCTATTGTGGAAGTTAGTATTGTTTCATAATCATTTCATTTTTTTAATTTCTGCTTGTTTGTGTCCAATTTCCTTACGAAGAATTTTTGGTCAAATTTCAGAAACAAAAACTACTTTTATGTGTTTTTGAAAGCATTGGTAGGGAGAGAATTACAATATACAGAGAGACACACCCATGGGATTACTGCTAAGAAGCTTAATTTTCCAAAACTAAAATCAAATTATTATCGAATTGGGCCTCCATTCATCAGAATCATGATTTTGAAACTTGAAGTTTTCAAGTAATTTTGTATATAATCAACATGAGCATAACTAGACTGACATGAACTTGTACCCTCAACTTTGAGGTATGAGGTTGGATATTGTAGGATATCAGTAGGTTTTGGCTTTAAGCCATCTAATATATTTAAAAAAAAACTTTACCTCTCATTCGTTCTTGATTCACTTTGATGCAATTGTCATGCTTCGAACTGGTTGGAAGTCCATTTATTTATCTTTAGTTTGCTTTCGAAATTCGACCTTTTTGTTTAATTGTTCCATATTTTTTTTTTCCTTATGCAGGCTTCTTTGAAAACTGAGAGACAGAAGAAGAAAGAAATGTACTTTCCATTGAGGAAATATGCTATTAAGGTGTAG

mRNA sequence

ATGTACTGTTACCGTATTCAGGAAGCAGAGAGATTGAGGAACCTAGAAATCACGGGAAGACAAATAAACTACTTGAGGTCGGTGGTTGTCTACCATAAAGTTTTGTCGGTAGAGTTTCTCCAGATGGAACCTAGACAGTCTACAGCTAGTGTTCTTGAAGTATTGATGGGCTTTGATGAGCGGCAATCTCAGCACCGTGCTCCAAGGCATCCTAAAGTTTTTTCTGACGATTATTTACAAAGGGTTGCATCCATTGGAATCTTGAAGAAGAAATCCTCCTCCAGATGTCATCCATTTAGGATGATAGTCGAAGAGTCAACAGAACTCTTCAATTCTCTCAAAGTAGAAAATAACTTTAGCCGCTGCAACGAGTTATGGGAACGGGAGAAGGTCGATTCTACTTTATCAGCAGCGTATATGCCACTTACAAGACACACCATTATGAATGAAAAGCACTTTTCAACAGGTAAGGTGATACAGACTTCAAAGGATTTTCAAGATTTACCAGTGGTTCTTGATTCTATGGACATCTCACCAAGACCTACAAGAGGAAAAAATTATATATTCAACCAGGCCAAAAATGAACCAAGTGTTTCAAAAGCACATTATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGGCAGGCACACTCGTCAGAGGATCTAGATTTTTTGATGCCTTCAAGACCTTTTTTAGAGTGGAGAGATAAACTACGTTTTTCCTCCTCTTCATCAACTTCTTTGAAAGGCTCACATTTAATTAATGATAAATGCAAAGATTGTCATAGTTCTCGAAATGGAAAGCTTATTGCTAAAGAAAAAAAAAGGACTATGGAGTATGCACTGCAGCCCATCAAGCAACCATCTCGAGTTTCAAGTATTCTGGATGGAAGTAGGAGAACAACGAAGCATAATTTTGTTAATTTACATTTGAAGACCTCAAGAACAGAAACCATATATGACGATGTGCGCAGAAAAGAAACCAAGTGCAAAAGGAATTCTTCCCCCCGTTTATCTAATTGCACGGCGGAATACAAGCATTCCTGCTTCTTTTCAGTTGAGTCATACGAGGCTAGAGAATCCAGGGAGGAAGTAATAGAAGAAGAAAGGAAGACGGAGAACTTGATGCTATTTACACGAGGTAGGAAAATGAATGAAATGCCTACACTGCCTCATTATGCAATTTTGCCCAGCGATTTGAATTGCAAACCTGTCAATTATGATTTCCAGAAGCATGATTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCCAGAAGGTTAAGAGACTAGATCAACTCAGTAAAAATTCCCATAGATCGAGATTTGATTCTACTTCTGTGGTGACTACAAGATCTAGAACCAGGAGCAGATACGAGGCCCTTCGAAATACATGGTTCTTAAAGCATGAAGGTCCTGGTACTTGGCTACAATGCACGCCATTGAATAGAAGTTCCAATAAAAAGGATGCTTCAGAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAGATTTTTCCTTGCCCTGATTCAGCAAGCGATCATGTTGACAACGATGGCTGTATGGTTGGTGATGATCTGAAGACCACAGTTGAGAAGAAAGACCTTTGTGATCAGAAATCTTTAAACTGTTTATCACCAAGGAGCAAAGTTGTTTTCTGCACACAAAACAATGCCGTCAAACGAGAAAATCAAGCCATAGAGTGTGCTTTGAAGAGTGATTTTCAAGATAATCTTTCAGGTATGGCTTCTAATTCATTGGCTGTAAAGACTGATGATGTAGCGAACCCTACTGTGGAGAAGCAAGAACCTAATTCATTGTCTGGCATCCTTTTAGAGACTCGTGGTGATTCATCTACCAACTCTTGCCGTGCCATGTATACTTCTATTCAACAGGAAGGTCTTGCCTTTGAACACTACCCTAGCAAAGAGCAAGATTCTATTGTGAATTTGGAGGAGGCTTATCAACCTAGCCCAGTTTC
AGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCTCAGGCATCAACAGTAGAGATTTAGTGATGCAACTTGAACTTCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCTATATGCGGTTCTGATGAAATTAATGACATCATGAGCACATTCAAATTCAAAGATAGTAGAGATTTTTCATACCTTGTTGATGTATTGAACGAGGCAAGCTTACATTGTAAAAACCTGGAGACGGGTTCTGTTCCGTTACATGGTCAGGAACATCAGGTAATCAGCCCCACAGTCTTTGAGATCTTAGAGAAGAAGTTTGGGGAACAAATTTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGACAGGATAAATTCTGGGTTAGTAGAACTCTTTCAGTCATTTGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCATTGCTTAACCATGAAATGATCGAGGAAGAACTATGGATCTTGCTGGATAGCCAAGAAAGGGAAGTGAACAAGGATTTAGTAGATAAGCAGTTTGGAAAGGAGATTGGATGGATAGATCTTGGAGATGAGATTGATTCTATTTGTAGAGAAGTAGAGAGATTGTTGTTTAATGAGCTTGTTGCAGAGTTTGCGAGAATCAAGGTTCATGAGTTGAGGCAGAAATCGAAGGCGGATCTGTTGTTGCAGCTTAAGGATCTTAAGGCGGAGCTTGCTCTCCTTCGAGTCGCCAAAGTCACCGGCGGTGCCCTGAACAAGCGATCCAAAATCAAGGTAGTGAGATTGTCAATCGCTCAAGTCTTGACAGTAATTTCTCAGAAGCAGAAGTCTGCGTTGAGGGAAGCTTACAAGAAGAAGAAGCTATTGCCTCTCGATCTTCGCCCAAAGAAGACCAGGGCCATTCGTAGAAGGCTCACCAAGTACCAGGCTTCTTTGAAAACTGAGAGACAGAAGAAGAAAGAAATGTACTTTCCATTGAGGAAATATGCTATTAAGGTGTAG

Coding sequence (CDS)

ATGTACTGTTACCGTATTCAGGAAGCAGAGAGATTGAGGAACCTAGAAATCACGGGAAGACAAATAAACTACTTGAGGTCGGTGGTTGTCTACCATAAAGTTTTGTCGGTAGAGTTTCTCCAGATGGAACCTAGACAGTCTACAGCTAGTGTTCTTGAAGTATTGATGGGCTTTGATGAGCGGCAATCTCAGCACCGTGCTCCAAGGCATCCTAAAGTTTTTTCTGACGATTATTTACAAAGGGTTGCATCCATTGGAATCTTGAAGAAGAAATCCTCCTCCAGATGTCATCCATTTAGGATGATAGTCGAAGAGTCAACAGAACTCTTCAATTCTCTCAAAGTAGAAAATAACTTTAGCCGCTGCAACGAGTTATGGGAACGGGAGAAGGTCGATTCTACTTTATCAGCAGCGTATATGCCACTTACAAGACACACCATTATGAATGAAAAGCACTTTTCAACAGGTAAGGTGATACAGACTTCAAAGGATTTTCAAGATTTACCAGTGGTTCTTGATTCTATGGACATCTCACCAAGACCTACAAGAGGAAAAAATTATATATTCAACCAGGCCAAAAATGAACCAAGTGTTTCAAAAGCACATTATAATGATGCAGGCACTAAATTTAAGGACAGGAAACAAGGGCAGGCACACTCGTCAGAGGATCTAGATTTTTTGATGCCTTCAAGACCTTTTTTAGAGTGGAGAGATAAACTACGTTTTTCCTCCTCTTCATCAACTTCTTTGAAAGGCTCACATTTAATTAATGATAAATGCAAAGATTGTCATAGTTCTCGAAATGGAAAGCTTATTGCTAAAGAAAAAAAAAGGACTATGGAGTATGCACTGCAGCCCATCAAGCAACCATCTCGAGTTTCAAGTATTCTGGATGGAAGTAGGAGAACAACGAAGCATAATTTTGTTAATTTACATTTGAAGACCTCAAGAACAGAAACCATATATGACGATGTGCGCAGAAAAGAAACCAAGTGCAAAAGGAATTCTTCCCCCCGTTTATCTAATTGCACGGCGGAATACAAGCATTCCTGCTTCTTTTCAGTTGAGTCATACGAGGCTAGAGAATCCAGGGAGGAAGTAATAGAAGAAGAAAGGAAGACGGAGAACTTGATGCTATTTACACGAGGTAGGAAAATGAATGAAATGCCTACACTGCCTCATTATGCAATTTTGCCCAGCGATTTGAATTGCAAACCTGTCAATTATGATTTCCAGAAGCATGATTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCCAGAAGGTTAAGAGACTAGATCAACTCAGTAAAAATTCCCATAGATCGAGATTTGATTCTACTTCTGTGGTGACTACAAGATCTAGAACCAGGAGCAGATACGAGGCCCTTCGAAATACATGGTTCTTAAAGCATGAAGGTCCTGGTACTTGGCTACAATGCACGCCATTGAATAGAAGTTCCAATAAAAAGGATGCTTCAGAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAGATTTTTCCTTGCCCTGATTCAGCAAGCGATCATGTTGACAACGATGGCTGTATGGTTGGTGATGATCTGAAGACCACAGTTGAGAAGAAAGACCTTTGTGATCAGAAATCTTTAAACTGTTTATCACCAAGGAGCAAAGTTGTTTTCTGCACACAAAACAATGCCGTCAAACGAGAAAATCAAGCCATAGAGTGTGCTTTGAAGAGTGATTTTCAAGATAATCTTTCAGGTATGGCTTCTAATTCATTGGCTGTAAAGACTGATGATGTAGCGAACCCTACTGTGGAGAAGCAAGAACCTAATTCATTGTCTGGCATCCTTTTAGAGACTCGTGGTGATTCATCTACCAACTCTTGCCGTGCCATGTATACTTCTATTCAACAGGAAGGTCTTGCCTTTGAACACTACCCTAGCAAAGAGCAAGATTCTATTGTGAATTTGGAGGAGGCTTATCAACCTAGCCCAGTTTC
AGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCTCAGGCATCAACAGTAGAGATTTAGTGATGCAACTTGAACTTCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGATTTGTTCGTATCAAGTGATGATGATGGTGGAGAAGGATCTATATGCGGTTCTGATGAAATTAATGACATCATGAGCACATTCAAATTCAAAGATAGTAGAGATTTTTCATACCTTGTTGATGTATTGAACGAGGCAAGCTTACATTGTAAAAACCTGGAGACGGGTTCTGTTCCGTTACATGGTCAGGAACATCAGGTAATCAGCCCCACAGTCTTTGAGATCTTAGAGAAGAAGTTTGGGGAACAAATTTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGACAGGATAAATTCTGGGTTAGTAGAACTCTTTCAGTCATTTGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCATTGCTTAACCATGAAATGATCGAGGAAGAACTATGGATCTTGCTGGATAGCCAAGAAAGGGAAGTGAACAAGGATTTAGTAGATAAGCAGTTTGGAAAGGAGATTGGATGGATAGATCTTGGAGATGAGATTGATTCTATTTGTAGAGAAGTAGAGAGATTGTTGTTTAATGAGCTTGTTGCAGAGTTTGCGAGAATCAAGGTTCATGAGTTGAGGCAGAAATCGAAGGCGGATCTGTTGTTGCAGCTTAAGGATCTTAAGGCGGAGCTTGCTCTCCTTCGAGTCGCCAAAGTCACCGGCGGTGCCCTGAACAAGCGATCCAAAATCAAGGTAGTGAGATTGTCAATCGCTCAAGTCTTGACAGTAATTTCTCAGAAGCAGAAGTCTGCGTTGAGGGAAGCTTACAAGAAGAAGAAGCTATTGCCTCTCGATCTTCGCCCAAAGAAGACCAGGGCCATTCGTAGAAGGCTCACCAAGTACCAGGCTTCTTTGAAAACTGAGAGACAGAAGAAGAAAGAAATGTACTTTCCATTGAGGAAATATGCTATTAAGGTGTAG

Protein sequence

MYCYRIQEAERLRNLEITGRQINYLRSVVVYHKVLSVEFLQMEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRMIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHYNDAGTKFKDRKQGQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTSIQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVISQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAIKV
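
The protein sequence above should correspond to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code applies to this gene, the relationship can be checked with a small translator. The function name `translate` and the table construction are illustrative and not part of the database's tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace is removed first, so a sequence wrapped over several
    lines can be pasted in directly. Unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGTACTGTTACCGTATTCAG"))
# Last seven codons of the CDS, including the TAG stop.
print(translate("AAATATGCTATTAAGGTGTAG"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on a record like this one.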
Homology
BLAST of HG10022254 vs. NCBI nr
Match: XP_038889736.1 (uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida] >XP_038889737.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida] >XP_038889738.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida] >XP_038889739.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida])

HSP 1 Score: 1314.7 bits (3401), Expect = 0.0e+00
Identity = 688/875 (78.63%), Positives = 720/875 (82.29%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           ME RQ T SVLE LMGFDERQ QH APRH +V SDDYLQRVASIGI KKK  SRCHPFRM
Sbjct: 1   MELRQCTTSVLEALMGFDERQPQHHAPRHSEVLSDDYLQRVASIGISKKKYPSRCHPFRM 60

Query: 102 IVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQT 161
            VEE TELFNS KVENNFSRCNELWE EK DS+LSA  MPLTRHTIM EKHFSTGKVIQT
Sbjct: 61  TVEEPTELFNSFKVENNFSRCNELWEWEKADSSLSAGCMPLTRHTIMTEKHFSTGKVIQT 120

Query: 162 SKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQ 221
           SKDFQ+LP VLDSMDISPRPTRGKN IFNQAKN PSVSK HY      NDAGTK KDRK 
Sbjct: 121 SKDFQNLPEVLDSMDISPRPTRGKNSIFNQAKNGPSVSKEHYSSTERNNDAGTKLKDRKL 180

Query: 222 GQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSS-------RN 281
           GQ HSSEDLDFL  SRP LEWRDKL FSSSS TSL+GSHL+NDKCKDC SS       +N
Sbjct: 181 GQTHSSEDLDFLKSSRPLLEWRDKLCFSSSSPTSLRGSHLVNDKCKDCLSSQNGKNIAQN 240

Query: 282 GKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRK 341
           GK IAKE +RTMEYALQPIKQ S+VSSILD SRRTT+H FVNLHLK SR  TIYDDV R 
Sbjct: 241 GKNIAKENQRTMEYALQPIKQSSQVSSILDESRRTTRHGFVNLHLKNSRLGTIYDDVCRN 300

Query: 342 ETKCKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNE 401
           ETK +RNSSP LSN TA+YKHSCFFSVESY+ARESRE+V EE+RKTENL+  T+GR+MNE
Sbjct: 301 ETKYRRNSSPSLSNWTAKYKHSCFFSVESYKARESREKVTEEQRKTENLLPSTQGRQMNE 360

Query: 402 MPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSR 461
           MPTLPH+A LPSDLNCKPV +DFQKH CSNKEH HSGSPLCLS KVKRLDQL KNSHR R
Sbjct: 361 MPTLPHFASLPSDLNCKPVKFDFQKHVCSNKEHFHSGSPLCLSWKVKRLDQLCKNSHRLR 420

Query: 462 FDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKK 521
           FDSTS VTTRSRTRSRYEALRNTWFLKHEGPG WLQC P NRSSNKKDASEP+LKLSSKK
Sbjct: 421 FDSTSAVTTRSRTRSRYEALRNTWFLKHEGPGAWLQCKPSNRSSNKKDASEPSLKLSSKK 480

Query: 522 LKIFPCPDSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKR 581
           LKIFPCPDSASDHVDND CMVGDDLKT VEKKD CDQ SLNCLSPRSK VFCTQN  VK+
Sbjct: 481 LKIFPCPDSASDHVDNDDCMVGDDLKTKVEKKDHCDQHSLNCLSPRSKGVFCTQNIPVKQ 540

Query: 582 ENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNS 641
            NQA                                                        
Sbjct: 541 GNQA-------------------------------------------------------- 600

Query: 642 CRAMYTSIQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINS 701
                TSIQQEGL FEHYPSKEQDSIV+LEEA+QPSPVSVLEPLFK+ETLFSSES GIN 
Sbjct: 601 -----TSIQQEGLPFEHYPSKEQDSIVSLEEAFQPSPVSVLEPLFKDETLFSSESPGING 660

Query: 702 RDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSY 761
           RDL+MQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC S+EI+DIMSTFKFKDSRDFSY
Sbjct: 661 RDLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSNEIDDIMSTFKFKDSRDFSY 720

Query: 762 LVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRI 821
           LVDVL+EASLHCK+LETGSV  H QEHQVISP VFE LEKKFGEQ SWRRSERKLLFDRI
Sbjct: 721 LVDVLSEASLHCKSLETGSVSCHNQEHQVISPAVFETLEKKFGEQNSWRRSERKLLFDRI 780

Query: 822 NSGLVELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGK 881
           NSGLVELFQSF GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGK
Sbjct: 781 NSGLVELFQSFDGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGK 814

Query: 882 EIGWIDLGDEIDSICREVERLLFNELVAEFARIKV 904
           EIGWIDLGDEIDSICRE+ERLL NELVAEF  I++
Sbjct: 841 EIGWIDLGDEIDSICRELERLLVNELVAEFGSIEL 814
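
The Identity and Positives figures reported for each HSP are counts over the alignment length, expressed as percentages. A minimal sketch of that arithmetic follows, using the counts printed for this hit; the helper name `percent` is an assumption made for illustration.

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage, rounded to 2 places."""
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the HSP header of the XP_038889736.1 hit.
identity = percent(688, 875)
positives = percent(720, 875)
print(identity, positives)
```

The same helper reproduces the Swiss-Prot figures further down the page, for example `percent(110, 122)` for the ribosomal protein L35 hits.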

BLAST of HG10022254 vs. NCBI nr
Match: XP_038889740.1 (uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida])

HSP 1 Score: 1307.7 bits (3383), Expect = 0.0e+00
Identity = 687/875 (78.51%), Positives = 719/875 (82.17%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           ME RQ T SVLE LMGFDERQ QH APRH +V SDDYLQRVASIGI KKK  SRCHPFRM
Sbjct: 1   MELRQCTTSVLEALMGFDERQPQHHAPRHSEVLSDDYLQRVASIGISKKKYPSRCHPFRM 60

Query: 102 IVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQT 161
            VEE TELFNS KVENNFSRCNELWE EK DS+LSA  MPLTRHTIM EKHFSTGKVIQT
Sbjct: 61  TVEEPTELFNSFKVENNFSRCNELWEWEKADSSLSAGCMPLTRHTIMTEKHFSTGKVIQT 120

Query: 162 SKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQ 221
           SKDFQ+LP VLDSMDISPRPTRGKN IFNQAKN PSVSK HY      NDAGTK KDRK 
Sbjct: 121 SKDFQNLPEVLDSMDISPRPTRGKNSIFNQAKNGPSVSKEHYSSTERNNDAGTKLKDRKL 180

Query: 222 GQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSS-------RN 281
           GQ HSSEDLDFL  SRP LEWRDKL FSSSS TSL+GSHL+NDKCKDC SS       +N
Sbjct: 181 GQTHSSEDLDFLKSSRPLLEWRDKLCFSSSSPTSLRGSHLVNDKCKDCLSSQNGKNIAQN 240

Query: 282 GKLIAKEKKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRK 341
           GK IAKE +RTMEYALQPIKQ S+VSSILD SRRTT+H FVNLHLK SR  TIYDDV R 
Sbjct: 241 GKNIAKENQRTMEYALQPIKQSSQVSSILDESRRTTRHGFVNLHLKNSRLGTIYDDVCRN 300

Query: 342 ETKCKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNE 401
           ETK +RNSSP LSN TA+YKHSCFFSVESY+ARESRE+V EE+RKTENL+  T+GR+MNE
Sbjct: 301 ETKYRRNSSPSLSNWTAKYKHSCFFSVESYKARESREKVTEEQRKTENLLPSTQGRQMNE 360

Query: 402 MPTLPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSR 461
           MPTLPH+A LPSDLNCKPV +DFQKH CSNKEH HSGSPLCLS KVKRLDQL KNSHR R
Sbjct: 361 MPTLPHFASLPSDLNCKPVKFDFQKHVCSNKEHFHSGSPLCLSWKVKRLDQLCKNSHRLR 420

Query: 462 FDSTSVVTTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKK 521
           FDSTS VTTRSRTRSRYEALRNTWFLKHEGPG WLQC P NRSSNKKDASEP+LKLSSKK
Sbjct: 421 FDSTSAVTTRSRTRSRYEALRNTWFLKHEGPGAWLQCKPSNRSSNKKDASEPSLKLSSKK 480

Query: 522 LKIFPCPDSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKR 581
           LKIFPCPDSASDHVDND CMVGDDLKT VEKKD CDQ SLNCLSPRSK VFCTQN  VK+
Sbjct: 481 LKIFPCPDSASDHVDNDDCMVGDDLKTKVEKKDHCDQHSLNCLSPRSKGVFCTQNIPVKQ 540

Query: 582 ENQAIECALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNS 641
            NQA                                                        
Sbjct: 541 GNQA-------------------------------------------------------- 600

Query: 642 CRAMYTSIQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINS 701
                TSIQQEGL FEHYPSKEQDSIV+LEEA+QPSPVSVLEPLFK+ETLFSSES GIN 
Sbjct: 601 -----TSIQQEGLPFEHYPSKEQDSIVSLEEAFQPSPVSVLEPLFKDETLFSSESPGIN- 660

Query: 702 RDLVMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSY 761
            DL+MQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSIC S+EI+DIMSTFKFKDSRDFSY
Sbjct: 661 -DLMMQLELLMSDSPGTNSEGHDLFVSSDDDGGEGSICSSNEIDDIMSTFKFKDSRDFSY 720

Query: 762 LVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRI 821
           LVDVL+EASLHCK+LETGSV  H QEHQVISP VFE LEKKFGEQ SWRRSERKLLFDRI
Sbjct: 721 LVDVLSEASLHCKSLETGSVSCHNQEHQVISPAVFETLEKKFGEQNSWRRSERKLLFDRI 780

Query: 822 NSGLVELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGK 881
           NSGLVELFQSF GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGK
Sbjct: 781 NSGLVELFQSFDGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGK 812

Query: 882 EIGWIDLGDEIDSICREVERLLFNELVAEFARIKV 904
           EIGWIDLGDEIDSICRE+ERLL NELVAEF  I++
Sbjct: 841 EIGWIDLGDEIDSICRELERLLVNELVAEFGSIEL 812

BLAST of HG10022254 vs. NCBI nr
Match: TYK05761.1 (uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa])

HSP 1 Score: 1245.7 bits (3222), Expect = 0.0e+00
Identity = 656/868 (75.58%), Positives = 704/868 (81.11%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           MEPR+ TASVLE LMGFDE QSQH  PRH KVFSDDYLQR ASIGI KKK  SRCHPFRM
Sbjct: 1   MEPREYTASVLEGLMGFDESQSQHPVPRHSKVFSDDYLQRAASIGISKKKCPSRCHPFRM 60

Query: 102 IVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQT 161
            +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAA +PLTRH IM EKHFSTGKVIQT
Sbjct: 61  TIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAACIPLTRH-IMYEKHFSTGKVIQT 120

Query: 162 SKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQ 221
           SK FQDLP VLDSMDISPRP+RGKN IF+ A+N PSVSKA+Y      NDAGTKFKDR+Q
Sbjct: 121 SKGFQDLPEVLDSMDISPRPSRGKNSIFHHAENGPSVSKANYNLTEGNNDAGTKFKDRRQ 180

Query: 222 GQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKE 281
           GQAH SEDL  L  SRPFLEW +KL FSSS  TSLKGSHL+ DKCK CH+S+NGK I KE
Sbjct: 181 GQAHLSEDLCLLKSSRPFLEWSNKLGFSSSPPTSLKGSHLVTDKCKGCHNSQNGKNITKE 240

Query: 282 KKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN 341
           K+R+   +L+PIKQ S+VSSILDGSRRT  H F+NL LKTSR+ETIYD++ R E      
Sbjct: 241 KERS-TVSLEPIKQLSQVSSILDGSRRTMSHEFINLPLKTSRSETIYDNMCRNEAS---- 300

Query: 342 SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHY 401
               LSN TAE KHSC FSVESY+ARES E+VIEE+RKTE+LM   RGRKMNEMPT+PHY
Sbjct: 301 ----LSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTESLMPSIRGRKMNEMPTVPHY 360

Query: 402 AILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDSTSVV 461
           A LPSDLNCKPV YDFQKH CS+ EHLHSGSPLCLS KVKRLD+L K  HR RFDST+ V
Sbjct: 361 ATLPSDLNCKPVKYDFQKHSCSDMEHLHSGSPLCLSWKVKRLDELGKKLHRLRFDSTTTV 420

Query: 462 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCP 521
           TTRSRTRSRYEALRNTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCP
Sbjct: 421 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCKPLNRSSNKKDAAKPTLKLSSKKLKIFPCP 480

Query: 522 DSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIEC 581
           DSAS HVDNDGCMVG DLKTTVEKKD CDQ S NCL PRSKVVFCTQN  VK+ NQA   
Sbjct: 481 DSASHHVDNDGCMVGGDLKTTVEKKDPCDQHSSNCLPPRSKVVFCTQNIPVKQGNQA--- 540

Query: 582 ALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS 641
                                                                     TS
Sbjct: 541 ----------------------------------------------------------TS 600

Query: 642 IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 701
           IQQEGLAFEHYPSKE+DSIV+LEE +QPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL
Sbjct: 601 IQQEGLAFEHYPSKERDSIVSLEETFQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 660

Query: 702 ELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNE 761
           ELLM DSPGTNSEGHDLFVSSDDDGGEGSIC SD+I+DIMSTFKFKDSR FSYLVDVL+E
Sbjct: 661 ELLMLDSPGTNSEGHDLFVSSDDDGGEGSICNSDKIDDIMSTFKFKDSRAFSYLVDVLSE 720

Query: 762 ASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVEL 821
           ASL CKNLETGSV  + QEH VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL EL
Sbjct: 721 ASLDCKNLETGSVSWYNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAEL 780

Query: 822 FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDL 881
           FQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNK+L+DKQFGKEI WIDL
Sbjct: 781 FQSFVGVPEWAKPVSRRFRPLVNHEMIEEELWILLDSQEREVNKELIDKQFGKEIEWIDL 797

Query: 882 GDEIDSICREVERLLFNELVAEFARIKV 904
           GDEIDSIC+E+ERLL NELVAEF  I++
Sbjct: 841 GDEIDSICKELERLLVNELVAEFGSIEL 797

BLAST of HG10022254 vs. NCBI nr
Match: XP_008463525.1 (PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo] >XP_016903042.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo])

HSP 1 Score: 1243.8 bits (3217), Expect = 0.0e+00
Identity = 655/868 (75.46%), Positives = 703/868 (80.99%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           MEPR+ TASVLE LMGFDE QSQH  PRH KVFSDDYLQR ASIGI KKK  SRCHPFRM
Sbjct: 1   MEPREYTASVLEGLMGFDESQSQHPVPRHSKVFSDDYLQRAASIGISKKKCPSRCHPFRM 60

Query: 102 IVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQT 161
            +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAA +PLTRH IM EKHFSTGKVIQT
Sbjct: 61  TIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAACIPLTRH-IMYEKHFSTGKVIQT 120

Query: 162 SKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQ 221
           SK FQDLP VLDSMDISPRP+RGKN IF+ A+N PSVSKA+Y      NDAGTKFKDR+Q
Sbjct: 121 SKGFQDLPEVLDSMDISPRPSRGKNSIFHHAENGPSVSKANYNLTEGNNDAGTKFKDRRQ 180

Query: 222 GQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKE 281
           GQAH SEDL  L  SRPFLEW +KL FSSS  TSLKGSHL+ DKCK CH+S+NGK I KE
Sbjct: 181 GQAHLSEDLCLLKSSRPFLEWSNKLGFSSSPPTSLKGSHLVTDKCKGCHNSQNGKNITKE 240

Query: 282 KKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN 341
           K+R+   +L+PIKQ S+VSSILDGSRRT  H F+NL LKTSR+E IYD++ R E      
Sbjct: 241 KERS-TVSLEPIKQLSQVSSILDGSRRTMSHEFINLPLKTSRSEAIYDNMCRNEAS---- 300

Query: 342 SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHY 401
               LSN TAE KHSC FSVESY+ARES E+VIEE+RKTE+LM   RGRKMNEMPT+PHY
Sbjct: 301 ----LSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTESLMPSIRGRKMNEMPTVPHY 360

Query: 402 AILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDSTSVV 461
           A LPSDLNCKPV YDFQKH CS+ EHLHSGSPLCLS KVKRLD+L K  HR RFDST+ V
Sbjct: 361 ATLPSDLNCKPVKYDFQKHSCSDMEHLHSGSPLCLSWKVKRLDELGKKLHRLRFDSTTTV 420

Query: 462 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCP 521
           TTRSRTRSRYEALRNTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCP
Sbjct: 421 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCKPLNRSSNKKDAAKPTLKLSSKKLKIFPCP 480

Query: 522 DSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIEC 581
           DSAS HVDNDGCMVG DLKTTVEKKD CDQ S NCL PRSKVVFCTQN  VK+ NQA   
Sbjct: 481 DSASHHVDNDGCMVGGDLKTTVEKKDPCDQHSSNCLPPRSKVVFCTQNIPVKQGNQA--- 540

Query: 582 ALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS 641
                                                                     TS
Sbjct: 541 ----------------------------------------------------------TS 600

Query: 642 IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 701
           IQQEGLAFEHYPSKE+DSIV+LEE +QPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL
Sbjct: 601 IQQEGLAFEHYPSKERDSIVSLEETFQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 660

Query: 702 ELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNE 761
           ELLM DSPGTNSEGHDLFVSSDDDGGEGSIC SD+I+DIMSTFKFKDSR FSYLVDVL+E
Sbjct: 661 ELLMLDSPGTNSEGHDLFVSSDDDGGEGSICNSDKIDDIMSTFKFKDSRAFSYLVDVLSE 720

Query: 762 ASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVEL 821
           ASL CKNLETGSV  + QEH VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL EL
Sbjct: 721 ASLDCKNLETGSVSWYNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAEL 780

Query: 822 FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDL 881
           FQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNK+L+DKQFGKEI WIDL
Sbjct: 781 FQSFVGVPEWAKPVSRRFRPLVNHEMIEEELWILLDSQEREVNKELIDKQFGKEIEWIDL 797

Query: 882 GDEIDSICREVERLLFNELVAEFARIKV 904
           GDEIDSIC+E+ERLL NELVAEF  I++
Sbjct: 841 GDEIDSICKELERLLVNELVAEFGSIEL 797

BLAST of HG10022254 vs. NCBI nr
Match: XP_011655343.1 (uncharacterized protein LOC101203594 [Cucumis sativus] >XP_031741594.1 uncharacterized protein LOC101203594 [Cucumis sativus] >KGN51260.1 hypothetical protein Csa_007976 [Cucumis sativus])

HSP 1 Score: 1234.9 bits (3194), Expect = 0.0e+00
Identity = 658/868 (75.81%), Positives = 700/868 (80.65%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           MEPRQ TASVLE LMGFDE QSQH A RH KVFSDDYLQRVASIGI KKK  SRCHPFRM
Sbjct: 1   MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRM 60

Query: 102 IVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQT 161
            +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAAY PLTRH    EKHFSTGKVIQT
Sbjct: 61  TIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTPLTRH----EKHFSTGKVIQT 120

Query: 162 SKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQ 221
           SK FQDLP VLDSMDISPRPTRGKN +F+QAK+  SVS AHY      NDAGTKFKDRKQ
Sbjct: 121 SKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQ 180

Query: 222 GQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKE 281
           GQAH SEDL  L  SRPFLEW +KL FSSS   SLKGSHL+ DKCK CH+S+NGK IAKE
Sbjct: 181 GQAHLSEDLCLLKSSRPFLEWSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKE 240

Query: 282 KKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN 341
           K+RT   +L+PIKQ S+VSSILDGSRRT +  F NLHLKTSR+ETIYD+V      C+  
Sbjct: 241 KERT-TVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNV------CRNK 300

Query: 342 SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHY 401
           +S  LSN TAE KHSC FSVESY+ARES E+VIEE+RKT NLM  T+GRKMNEMPT+P Y
Sbjct: 301 AS--LSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRY 360

Query: 402 AILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDSTSVV 461
           A LPSDLNCKPV YDFQKH CS+KEHLHSGSPLCLS KVKRLD+L K  HR RFDSTS V
Sbjct: 361 ATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLRFDSTSTV 420

Query: 462 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCP 521
           TTRSRTRSRYEAL NTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCP
Sbjct: 421 TTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCP 480

Query: 522 DSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIEC 581
           DSAS H DNDGCMVG D KTTV+KKD CDQ SLNCL PRSKVVFCTQN  VK+ NQA   
Sbjct: 481 DSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQA--- 540

Query: 582 ALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS 641
                                                                     TS
Sbjct: 541 ----------------------------------------------------------TS 600

Query: 642 IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 701
           IQQEGLAF+HYPSKE+DSIV+LEEA+QPSPVSVLEPLFKEETLFSSES GINSRDLVMQL
Sbjct: 601 IQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQL 660

Query: 702 ELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNE 761
           ELLMSDSPGTNSEGHDLFVSSDDD GEGSIC SD+I+DIMSTFKFKDSR FSYLVDVL+E
Sbjct: 661 ELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSE 720

Query: 762 ASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVEL 821
           ASLHCKNLE GSV  H QE  VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL EL
Sbjct: 721 ASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAEL 780

Query: 822 FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDL 881
           FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNK+LVDKQFGKEI WIDL
Sbjct: 781 FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDL 793

Query: 882 GDEIDSICREVERLLFNELVAEFARIKV 904
           GDEI+SICRE+E LL NELVAEF  I++
Sbjct: 841 GDEINSICRELEILLVNELVAEFGSIEL 793

BLAST of HG10022254 vs. ExPASy Swiss-Prot
Match: Q9M5L0 (60S ribosomal protein L35 OS=Euphorbia esula OX=3993 GN=RPL35 PE=2 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 3.8e-47
Identity = 110/122 (90.16%), Positives = 115/122 (94.26%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELRQK+KA+LL QLKDLKAELALLRVAKVTGGA NK SKIKVVRLSIAQVLTVI
Sbjct: 2    ARIKVHELRQKTKAELLNQLKDLKAELALLRVAKVTGGAPNKLSKIKVVRLSIAQVLTVI 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQK ALREAYK KK LPLDLRPKKTRAIRRRLTK+Q SLKTER+KKKEMYFP+RKYAI
Sbjct: 62   SQKQKLALREAYKNKKFLPLDLRPKKTRAIRRRLTKHQQSLKTEREKKKEMYFPMRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. ExPASy Swiss-Prot
Match: O80626 (60S ribosomal protein L35-2 OS=Arabidopsis thaliana OX=3702 GN=RPL35B PE=2 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 6.4e-47
Identity = 110/122 (90.16%), Positives = 115/122 (94.26%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR+KSKADL  QLK+ KAELALLRVAKVTGGA NK SKIKVVR SIAQVLTVI
Sbjct: 2    ARIKVHELREKSKADLSGQLKEFKAELALLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVI 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKKEMYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKEMYFPIRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. ExPASy Swiss-Prot
Match: Q9SF53 (60S ribosomal protein L35-1 OS=Arabidopsis thaliana OX=3702 GN=RPL35A PE=2 SV=1)

HSP 1 Score: 189.5 bits (480), Expect = 1.9e-46
Identity = 109/122 (89.34%), Positives = 115/122 (94.26%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR+KSK+DL  QLK+LKAELALLRVAKVTGGA NK SKIKVVR SIAQVLTV 
Sbjct: 2    ARIKVHELREKSKSDLQNQLKELKAELALLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVS 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKKEMYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKEMYFPIRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. ExPASy Swiss-Prot
Match: Q9LZ41 (60S ribosomal protein L35-4 OS=Arabidopsis thaliana OX=3702 GN=RPL35D PE=2 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 3.2e-46
Identity = 109/122 (89.34%), Positives = 113/122 (92.62%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR KSK DL  QLK+ KAELALLRVAKVTGGA NK SKIKVVR SIAQVLTVI
Sbjct: 2    ARIKVHELRDKSKTDLQNQLKEFKAELALLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVI 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKKEMYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKEMYFPVRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. ExPASy Swiss-Prot
Match: Q9M3D2 (60S ribosomal protein L35-3 OS=Arabidopsis thaliana OX=3702 GN=RPL35C PE=2 SV=1)

HSP 1 Score: 186.0 bits (471), Expect = 2.1e-45
Identity = 107/122 (87.70%), Positives = 113/122 (92.62%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR KSK+DL  QLK+LKAELA LRVAKVTGGA NK SKIKVVR SIAQVLTV 
Sbjct: 2    ARIKVHELRDKSKSDLSTQLKELKAELASLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVS 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKK+MYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKDMYFPIRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. ExPASy TrEMBL
Match: A0A5D3C1E7 (DUF4378 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G002500 PE=4 SV=1)

HSP 1 Score: 1245.7 bits (3222), Expect = 0.0e+00
Identity = 656/868 (75.58%), Positives = 704/868 (81.11%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           MEPR+ TASVLE LMGFDE QSQH  PRH KVFSDDYLQR ASIGI KKK  SRCHPFRM
Sbjct: 1   MEPREYTASVLEGLMGFDESQSQHPVPRHSKVFSDDYLQRAASIGISKKKCPSRCHPFRM 60

Query: 102 IVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQT 161
            +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAA +PLTRH IM EKHFSTGKVIQT
Sbjct: 61  TIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAACIPLTRH-IMYEKHFSTGKVIQT 120

Query: 162 SKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQ 221
           SK FQDLP VLDSMDISPRP+RGKN IF+ A+N PSVSKA+Y      NDAGTKFKDR+Q
Sbjct: 121 SKGFQDLPEVLDSMDISPRPSRGKNSIFHHAENGPSVSKANYNLTEGNNDAGTKFKDRRQ 180

Query: 222 GQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKE 281
           GQAH SEDL  L  SRPFLEW +KL FSSS  TSLKGSHL+ DKCK CH+S+NGK I KE
Sbjct: 181 GQAHLSEDLCLLKSSRPFLEWSNKLGFSSSPPTSLKGSHLVTDKCKGCHNSQNGKNITKE 240

Query: 282 KKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN 341
           K+R+   +L+PIKQ S+VSSILDGSRRT  H F+NL LKTSR+ETIYD++ R E      
Sbjct: 241 KERS-TVSLEPIKQLSQVSSILDGSRRTMSHEFINLPLKTSRSETIYDNMCRNEAS---- 300

Query: 342 SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHY 401
               LSN TAE KHSC FSVESY+ARES E+VIEE+RKTE+LM   RGRKMNEMPT+PHY
Sbjct: 301 ----LSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTESLMPSIRGRKMNEMPTVPHY 360

Query: 402 AILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDSTSVV 461
           A LPSDLNCKPV YDFQKH CS+ EHLHSGSPLCLS KVKRLD+L K  HR RFDST+ V
Sbjct: 361 ATLPSDLNCKPVKYDFQKHSCSDMEHLHSGSPLCLSWKVKRLDELGKKLHRLRFDSTTTV 420

Query: 462 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCP 521
           TTRSRTRSRYEALRNTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCP
Sbjct: 421 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCKPLNRSSNKKDAAKPTLKLSSKKLKIFPCP 480

Query: 522 DSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIEC 581
           DSAS HVDNDGCMVG DLKTTVEKKD CDQ S NCL PRSKVVFCTQN  VK+ NQA   
Sbjct: 481 DSASHHVDNDGCMVGGDLKTTVEKKDPCDQHSSNCLPPRSKVVFCTQNIPVKQGNQA--- 540

Query: 582 ALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS 641
                                                                     TS
Sbjct: 541 ----------------------------------------------------------TS 600

Query: 642 IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 701
           IQQEGLAFEHYPSKE+DSIV+LEE +QPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL
Sbjct: 601 IQQEGLAFEHYPSKERDSIVSLEETFQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 660

Query: 702 ELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNE 761
           ELLM DSPGTNSEGHDLFVSSDDDGGEGSIC SD+I+DIMSTFKFKDSR FSYLVDVL+E
Sbjct: 661 ELLMLDSPGTNSEGHDLFVSSDDDGGEGSICNSDKIDDIMSTFKFKDSRAFSYLVDVLSE 720

Query: 762 ASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVEL 821
           ASL CKNLETGSV  + QEH VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL EL
Sbjct: 721 ASLDCKNLETGSVSWYNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAEL 780

Query: 822 FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDL 881
           FQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNK+L+DKQFGKEI WIDL
Sbjct: 781 FQSFVGVPEWAKPVSRRFRPLVNHEMIEEELWILLDSQEREVNKELIDKQFGKEIEWIDL 797

Query: 882 GDEIDSICREVERLLFNELVAEFARIKV 904
           GDEIDSIC+E+ERLL NELVAEF  I++
Sbjct: 841 GDEIDSICKELERLLVNELVAEFGSIEL 797

BLAST of HG10022254 vs. ExPASy TrEMBL
Match: A0A1S4E497 (uncharacterized protein LOC103501659 OS=Cucumis melo OX=3656 GN=LOC103501659 PE=4 SV=1)

HSP 1 Score: 1243.8 bits (3217), Expect = 0.0e+00
Identity = 655/868 (75.46%), Positives = 703/868 (80.99%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           MEPR+ TASVLE LMGFDE QSQH  PRH KVFSDDYLQR ASIGI KKK  SRCHPFRM
Sbjct: 1   MEPREYTASVLEGLMGFDESQSQHPVPRHSKVFSDDYLQRAASIGISKKKCPSRCHPFRM 60

Query: 102 IVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQT 161
            +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAA +PLTRH IM EKHFSTGKVIQT
Sbjct: 61  TIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAACIPLTRH-IMYEKHFSTGKVIQT 120

Query: 162 SKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQ 221
           SK FQDLP VLDSMDISPRP+RGKN IF+ A+N PSVSKA+Y      NDAGTKFKDR+Q
Sbjct: 121 SKGFQDLPEVLDSMDISPRPSRGKNSIFHHAENGPSVSKANYNLTEGNNDAGTKFKDRRQ 180

Query: 222 GQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKE 281
           GQAH SEDL  L  SRPFLEW +KL FSSS  TSLKGSHL+ DKCK CH+S+NGK I KE
Sbjct: 181 GQAHLSEDLCLLKSSRPFLEWSNKLGFSSSPPTSLKGSHLVTDKCKGCHNSQNGKNITKE 240

Query: 282 KKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN 341
           K+R+   +L+PIKQ S+VSSILDGSRRT  H F+NL LKTSR+E IYD++ R E      
Sbjct: 241 KERS-TVSLEPIKQLSQVSSILDGSRRTMSHEFINLPLKTSRSEAIYDNMCRNEAS---- 300

Query: 342 SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHY 401
               LSN TAE KHSC FSVESY+ARES E+VIEE+RKTE+LM   RGRKMNEMPT+PHY
Sbjct: 301 ----LSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTESLMPSIRGRKMNEMPTVPHY 360

Query: 402 AILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDSTSVV 461
           A LPSDLNCKPV YDFQKH CS+ EHLHSGSPLCLS KVKRLD+L K  HR RFDST+ V
Sbjct: 361 ATLPSDLNCKPVKYDFQKHSCSDMEHLHSGSPLCLSWKVKRLDELGKKLHRLRFDSTTTV 420

Query: 462 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCP 521
           TTRSRTRSRYEALRNTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCP
Sbjct: 421 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCKPLNRSSNKKDAAKPTLKLSSKKLKIFPCP 480

Query: 522 DSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIEC 581
           DSAS HVDNDGCMVG DLKTTVEKKD CDQ S NCL PRSKVVFCTQN  VK+ NQA   
Sbjct: 481 DSASHHVDNDGCMVGGDLKTTVEKKDPCDQHSSNCLPPRSKVVFCTQNIPVKQGNQA--- 540

Query: 582 ALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS 641
                                                                     TS
Sbjct: 541 ----------------------------------------------------------TS 600

Query: 642 IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 701
           IQQEGLAFEHYPSKE+DSIV+LEE +QPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL
Sbjct: 601 IQQEGLAFEHYPSKERDSIVSLEETFQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 660

Query: 702 ELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNE 761
           ELLM DSPGTNSEGHDLFVSSDDDGGEGSIC SD+I+DIMSTFKFKDSR FSYLVDVL+E
Sbjct: 661 ELLMLDSPGTNSEGHDLFVSSDDDGGEGSICNSDKIDDIMSTFKFKDSRAFSYLVDVLSE 720

Query: 762 ASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVEL 821
           ASL CKNLETGSV  + QEH VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL EL
Sbjct: 721 ASLDCKNLETGSVSWYNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAEL 780

Query: 822 FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDL 881
           FQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNK+L+DKQFGKEI WIDL
Sbjct: 781 FQSFVGVPEWAKPVSRRFRPLVNHEMIEEELWILLDSQEREVNKELIDKQFGKEIEWIDL 797

Query: 882 GDEIDSICREVERLLFNELVAEFARIKV 904
           GDEIDSIC+E+ERLL NELVAEF  I++
Sbjct: 841 GDEIDSICKELERLLVNELVAEFGSIEL 797

BLAST of HG10022254 vs. ExPASy TrEMBL
Match: A0A0A0KNN6 (DUF4378 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G505210 PE=4 SV=1)

HSP 1 Score: 1234.9 bits (3194), Expect = 0.0e+00
Identity = 658/868 (75.81%), Positives = 700/868 (80.65%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           MEPRQ TASVLE LMGFDE QSQH A RH KVFSDDYLQRVASIGI KKK  SRCHPFRM
Sbjct: 1   MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRM 60

Query: 102 IVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQT 161
            +EE TELFNSLKVENNFSRC +LWERE+ DSTLSAAY PLTRH    EKHFSTGKVIQT
Sbjct: 61  TIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTPLTRH----EKHFSTGKVIQT 120

Query: 162 SKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKDRKQ 221
           SK FQDLP VLDSMDISPRPTRGKN +F+QAK+  SVS AHY      NDAGTKFKDRKQ
Sbjct: 121 SKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQ 180

Query: 222 GQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKE 281
           GQAH SEDL  L  SRPFLEW +KL FSSS   SLKGSHL+ DKCK CH+S+NGK IAKE
Sbjct: 181 GQAHLSEDLCLLKSSRPFLEWSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKE 240

Query: 282 KKRTMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRN 341
           K+RT   +L+PIKQ S+VSSILDGSRRT +  F NLHLKTSR+ETIYD+V      C+  
Sbjct: 241 KERT-TVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNV------CRNK 300

Query: 342 SSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHY 401
           +S  LSN TAE KHSC FSVESY+ARES E+VIEE+RKT NLM  T+GRKMNEMPT+P Y
Sbjct: 301 AS--LSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRY 360

Query: 402 AILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDSTSVV 461
           A LPSDLNCKPV YDFQKH CS+KEHLHSGSPLCLS KVKRLD+L K  HR RFDSTS V
Sbjct: 361 ATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLRFDSTSTV 420

Query: 462 TTRSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCP 521
           TTRSRTRSRYEAL NTWFLKHEGPGTWLQC PLNRSSNKKDA++PTLKLSSKKLKIFPCP
Sbjct: 421 TTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCP 480

Query: 522 DSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIEC 581
           DSAS H DNDGCMVG D KTTV+KKD CDQ SLNCL PRSKVVFCTQN  VK+ NQA   
Sbjct: 481 DSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQA--- 540

Query: 582 ALKSDFQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTS 641
                                                                     TS
Sbjct: 541 ----------------------------------------------------------TS 600

Query: 642 IQQEGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQL 701
           IQQEGLAF+HYPSKE+DSIV+LEEA+QPSPVSVLEPLFKEETLFSSES GINSRDLVMQL
Sbjct: 601 IQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQL 660

Query: 702 ELLMSDSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNE 761
           ELLMSDSPGTNSEGHDLFVSSDDD GEGSIC SD+I+DIMSTFKFKDSR FSYLVDVL+E
Sbjct: 661 ELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSE 720

Query: 762 ASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVEL 821
           ASLHCKNLE GSV  H QE  VISP VFEILEKKFGEQISWRRSERKLLFDRINSGL EL
Sbjct: 721 ASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAEL 780

Query: 822 FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDL 881
           FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNK+LVDKQFGKEI WIDL
Sbjct: 781 FQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDL 793

Query: 882 GDEIDSICREVERLLFNELVAEFARIKV 904
           GDEI+SICRE+E LL NELVAEF  I++
Sbjct: 841 GDEINSICRELEILLVNELVAEFGSIEL 793

BLAST of HG10022254 vs. ExPASy TrEMBL
Match: A0A6J1BX36 (uncharacterized protein LOC111006294 OS=Momordica charantia OX=3673 GN=LOC111006294 PE=4 SV=1)

HSP 1 Score: 1134.4 bits (2933), Expect = 0.0e+00
Identity = 604/881 (68.56%), Positives = 699/881 (79.34%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASIGILKKKSSSRCHPFRM 101
           M  +Q TASVLE LMGF+E+QS H   RH +V S+ YLQR ASIG+ KKK  S+CHPFR 
Sbjct: 1   MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRT 60

Query: 102 IVEESTELFNSLKVENNFS---RCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKV 161
            VEE  ELFN+L V ++F     CNEL  REK  S LS+A MPLTRH  M  +HF T K+
Sbjct: 61  TVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSACMPLTRHNFMRVEHFPTDKM 120

Query: 162 IQTSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHY------NDAGTKFKD 221
           IQTS D Q+LP V DSMDISPRPTR K YIFN  +N  S+SK+H+      NDAGTKF +
Sbjct: 121 IQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTN 180

Query: 222 RKQGQAHSSEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLI 281
           RKQGQA + +D D L  S P LEW+DKL FSSSS TSLKGSHL+++KCK  H S+NGK +
Sbjct: 181 RKQGQACAYDDFDLLKSSIPLLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHM 240

Query: 282 AKEKKR-TMEYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETK 341
           AKEK+R TM   ++PIKQPS+VS ILD S R T+H+FVNL +K SR+E+IYDDV RKET+
Sbjct: 241 AKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE 300

Query: 342 CKRNSSPRLSNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPT 401
            +   SP LSN  AEYKHSC FSVESY+AR  RE+ IEE+++T+ L+L  +G    EMP 
Sbjct: 301 FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPI 360

Query: 402 LPHYAILPSDLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDS 461
           L H+A LP+DLNCKPV YDFQKH CSNKEHLHSGSPLCLS K +RLDQ+SKNSHR RF S
Sbjct: 361 LHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSCKDERLDQVSKNSHRLRFCS 420

Query: 462 TSVVTT-RSRTRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLK 521
            + VTT RSRTRSRYE+LRNTWFLK EG  TWLQC P ++SS+ KDAS+PTLKL SKKL+
Sbjct: 421 AATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLR 480

Query: 522 IFPCPDSASDHVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKREN 581
           IFPCP+SAS H+ +DGC+V   L+T VEKK LC+Q+S+N LS R+ VVFC +NN     N
Sbjct: 481 IFPCPESASGHIVDDGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENN----PN 540

Query: 582 QAIECALKSDF-QDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSC 641
           +AIEC+LKSD+  DN SGMASN LAVKTDD   PTV+KQEP+S+S  + ET GDSSTNS 
Sbjct: 541 KAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSF 600

Query: 642 RAMYTSIQQ--------EGLAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSS 701
           R    SIQQ        EG  FEHYP KE DSIV+LEEAYQPSPVSVLEPLFKEET+ SS
Sbjct: 601 RTTCRSIQQEASTIFDKEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSS 660

Query: 702 ESSGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDD-GGEGSICGSDEINDIMSTFKF 761
           ESSGINSRDL+MQLELLMSDSPG+NSEGH++FVSSDDD GGEGS C S+EI+DIMSTFKF
Sbjct: 661 ESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKF 720

Query: 762 KDSRDFSYLVDVLNEASLHCKNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSE 821
           KDSRDFSYL+DVL+EA L+C NL+ G V   GQE  VISP+VFE LEKKFGEQ SWRRSE
Sbjct: 721 KDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSE 780

Query: 822 RKLLFDRINSGLVELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKD 881
           RKLLFDRINSGL+ELFQS VGVPEWAKPVSRRFRPLL+ EM+EEELWILLDSQERE+NKD
Sbjct: 781 RKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWILLDSQERELNKD 840

Query: 882 LVDKQFGKEIGWIDLGDEIDSICREVERLLFNELVAEFARI 902
           LVDKQFGKEIGWIDLG+EI+SICRE+ERLL  EL+AEF  I
Sbjct: 841 LVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII 876

BLAST of HG10022254 vs. ExPASy TrEMBL
Match: A0A6J1JSS4 (uncharacterized protein LOC111487197 OS=Cucurbita maxima OX=3661 GN=LOC111487197 PE=4 SV=1)

HSP 1 Score: 778.9 bits (2010), Expect = 2.7e-221
Identity = 468/857 (54.61%), Positives = 518/857 (60.44%), Query Frame = 0

Query: 42  MEPRQSTASVLEVLMGFDERQSQHRAPRHPKVFSDDYLQRVASI-GILKKKSSSRCHPFR 101
           ME  Q +ASVLE LMGFDE QS+HRA    +  S+ YLQRVASI G  KKKS SRC PFR
Sbjct: 1   MESTQCSASVLEALMGFDELQSEHRASGRSRGLSERYLQRVASIGGTQKKKSPSRCQPFR 60

Query: 102 MIVEESTELFNSLKVENNFSRCNELWEREKVDSTLSAAYMPLTRHTIMNEKHFSTGKVIQ 161
           M +EE  E+         FS  N LWERE             +    MNEKHFST ++I 
Sbjct: 61  MTIEEPPEV---------FSIRNVLWEREH-----------FSIRNFMNEKHFSTDEIIP 120

Query: 162 TSKDFQDLPVVLDSMDISPRPTRGKNYIFNQAKNEPSVSKAHYNDAGTKFKDRKQGQAHS 221
           TSKDF DLP  +DSMDISPR TR K+  FN  +N P++SK                    
Sbjct: 121 TSKDFHDLPEAVDSMDISPRHTRTKDNTFNHVENGPNLSK-------------------- 180

Query: 222 SEDLDFLMPSRPFLEWRDKLRFSSSSSTSLKGSHLINDKCKDCHSSRNGKLIAKEKKRTM 281
                                        L  +H                          
Sbjct: 181 ----------------------------PLNNAH-------------------------- 240

Query: 282 EYALQPIKQPSRVSSILDGSRRTTKHNFVNLHLKTSRTETIYDDVRRKETKCKRNSSPRL 341
                                                         RK+           
Sbjct: 241 ----------------------------------------------RKD----------- 300

Query: 342 SNCTAEYKHSCFFSVESYEARESREEVIEEERKTENLMLFTRGRKMNEMPTLPHYAILPS 401
                EYK SCF SVESY+  ESRE+VIEE+RK  NLML  +GR MNEM  LPHYA  PS
Sbjct: 301 -----EYKRSCFISVESYKGGESREKVIEEQRKNGNLMLAKQGRNMNEMFILPHYATFPS 360

Query: 402 DLNCKPVNYDFQKHDCSNKEHLHSGSPLCLSQKVKRLDQLSKNSHRSRFDSTSVVTTRSR 461
           DLNCKPV YDF K  C NK+HLHSGSPLCLS K +R D+LSK  HRSR DS   V  RSR
Sbjct: 361 DLNCKPVEYDFPKRICLNKDHLHSGSPLCLSCKDRRFDRLSKKPHRSRLDSAYTVIARSR 420

Query: 462 TRSRYEALRNTWFLKHEGPGTWLQCTPLNRSSNKKDASEPTLKLSSKKLKIFPCPDSASD 521
            RSRYEALRNTWFLK EG GTWLQ  PLN  SNKK+ASEP+ KLSSKKL+IFPCPDS SD
Sbjct: 421 IRSRYEALRNTWFLKPEGLGTWLQYKPLNTRSNKKNASEPSSKLSSKKLRIFPCPDSVSD 480

Query: 522 HVDNDGCMVGDDLKTTVEKKDLCDQKSLNCLSPRSKVVFCTQNNAVKRENQAIECALKSD 581
           HVDNDGC+VG+DLKT VEK  LCDQ S+N LS  S              N AIE      
Sbjct: 481 HVDNDGCIVGNDLKTRVEKNGLCDQHSVNLLSSNS--------------NLAIE------ 540

Query: 582 FQDNLSGMASNSLAVKTDDVANPTVEKQEPNSLSGILLETRGDSSTNSCRAMYTSIQQEG 641
                                       +P SLS I+ ET G SST SCRA  TSIQQ+G
Sbjct: 541 ----------------------------QP-SLSSIVPETDGHSSTISCRATCTSIQQDG 600

Query: 642 LAFEHYPSKEQDSIVNLEEAYQPSPVSVLEPLFKEETLFSSESSGINSRDLVMQLELLMS 701
           L+F+ Y SKE DSIV LEE YQPSPVSVLE  FKEET  S ESSGINSR    +LELLM 
Sbjct: 601 LSFDRYDSKELDSIVRLEEFYQPSPVSVLERHFKEETFSSFESSGINSR----ELELLMW 648

Query: 702 DSPGTNSEGHDLFVSSDDDGGEGSICGSDEINDIMSTFKFKDSRDFSYLVDVLNEASLHC 761
           DSPGTNS+ H+LFVSS++DGGEGSIC SDEI DIMSTFKFKDSRDFSYLVDV++EA LH 
Sbjct: 661 DSPGTNSDEHELFVSSEEDGGEGSICNSDEIYDIMSTFKFKDSRDFSYLVDVISEAGLHH 648

Query: 762 KNLETGSVPLHGQEHQVISPTVFEILEKKFGEQISWRRSERKLLFDRINSGLVELFQSFV 821
           +NLE G V  H QE  VISP+VFE LEKKFGEQ+SWRRSERKLLFDRINSGL ELFQSFV
Sbjct: 721 RNLEKGCVLWHDQERYVISPSVFEALEKKFGEQVSWRRSERKLLFDRINSGLAELFQSFV 648

Query: 822 GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKDLVDKQFGKEIGWIDLGDEID 881
           GVPEWAKPVSRRFRPLL+ EM+E++LW LLDSQE+E NKDLVDKQFGKEIGWIDL DEI 
Sbjct: 781 GVPEWAKPVSRRFRPLLDQEMVEDKLWTLLDSQEKEGNKDLVDKQFGKEIGWIDLEDEIG 648

Query: 882 SICREVERLLFNELVAE 898
           SICRE+E LL  ELVAE
Sbjct: 841 SICRELEGLLIVELVAE 648

BLAST of HG10022254 vs. TAIR 10
Match: AT2G39390.1 (Ribosomal L29 family protein)

HSP 1 Score: 191.0 bits (484), Expect = 4.6e-48
Identity = 110/122 (90.16%), Positives = 115/122 (94.26%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR+KSKADL  QLK+ KAELALLRVAKVTGGA NK SKIKVVR SIAQVLTVI
Sbjct: 2    ARIKVHELREKSKADLSGQLKEFKAELALLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVI 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKKEMYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKEMYFPIRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. TAIR 10
Match: AT3G09500.1 (Ribosomal L29 family protein)

HSP 1 Score: 189.5 bits (480), Expect = 1.3e-47
Identity = 109/122 (89.34%), Positives = 115/122 (94.26%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR+KSK+DL  QLK+LKAELALLRVAKVTGGA NK SKIKVVR SIAQVLTV 
Sbjct: 2    ARIKVHELREKSKSDLQNQLKELKAELALLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVS 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKKEMYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKEMYFPIRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. TAIR 10
Match: AT5G02610.1 (Ribosomal L29 family protein)

HSP 1 Score: 188.7 bits (478), Expect = 2.3e-47
Identity = 109/122 (89.34%), Positives = 113/122 (92.62%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR KSK DL  QLK+ KAELALLRVAKVTGGA NK SKIKVVR SIAQVLTVI
Sbjct: 2    ARIKVHELRDKSKTDLQNQLKEFKAELALLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVI 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKKEMYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKEMYFPVRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. TAIR 10
Match: AT3G55170.1 (Ribosomal L29 family protein)

HSP 1 Score: 186.0 bits (471), Expect = 1.5e-46
Identity = 107/122 (87.70%), Positives = 113/122 (92.62%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR KSK+DL  QLK+LKAELA LRVAKVTGGA NK SKIKVVR SIAQVLTV 
Sbjct: 2    ARIKVHELRDKSKSDLSTQLKELKAELASLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVS 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKK+MYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKDMYFPIRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

BLAST of HG10022254 vs. TAIR 10
Match: AT3G55170.2 (Ribosomal L29 family protein)

HSP 1 Score: 186.0 bits (471), Expect = 1.5e-46
Identity = 107/122 (87.70%), Positives = 113/122 (92.62%), Query Frame = 0

Query: 899  ARIKVHELRQKSKADLLLQLKDLKAELALLRVAKVTGGALNKRSKIKVVRLSIAQVLTVI 958
            ARIKVHELR KSK+DL  QLK+LKAELA LRVAKVTGGA NK SKIKVVR SIAQVLTV 
Sbjct: 2    ARIKVHELRDKSKSDLSTQLKELKAELASLRVAKVTGGAPNKLSKIKVVRKSIAQVLTVS 61

Query: 959  SQKQKSALREAYKKKKLLPLDLRPKKTRAIRRRLTKYQASLKTERQKKKEMYFPLRKYAI 1018
            SQKQKSALREAYK KKLLPLDLRPKKTRAIRRRLTK+QASLKTER+KKK+MYFP+RKYAI
Sbjct: 62   SQKQKSALREAYKNKKLLPLDLRPKKTRAIRRRLTKHQASLKTEREKKKDMYFPIRKYAI 121

Query: 1019 KV 1021
            KV
Sbjct: 122  KV 123

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038889736.1  0.0e+00   78.63     uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida] >XP_03888973...
XP_038889740.1  0.0e+00   78.51     uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]
TYK05761.1      0.0e+00   75.58     uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]
XP_008463525.1  0.0e+00   75.46     PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo] >XP_016903042.1 P...
XP_011655343.1  0.0e+00   75.81     uncharacterized protein LOC101203594 [Cucumis sativus] >XP_031741594.1 uncharact...
Match Name      E-value   Identity  Description
Q9M5L0          3.8e-47   90.16     60S ribosomal protein L35 OS=Euphorbia esula OX=3993 GN=RPL35 PE=2 SV=1
O80626          6.4e-47   90.16     60S ribosomal protein L35-2 OS=Arabidopsis thaliana OX=3702 GN=RPL35B PE=2 SV=1
Q9SF53          1.9e-46   89.34     60S ribosomal protein L35-1 OS=Arabidopsis thaliana OX=3702 GN=RPL35A PE=2 SV=1
Q9LZ41          3.2e-46   89.34     60S ribosomal protein L35-4 OS=Arabidopsis thaliana OX=3702 GN=RPL35D PE=2 SV=1
Q9M3D2          2.1e-45   87.70     60S ribosomal protein L35-3 OS=Arabidopsis thaliana OX=3702 GN=RPL35C PE=2 SV=1
Match Name      E-value   Identity  Description
A0A5D3C1E7      0.0e+00   75.58     DUF4378 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S4E497      0.0e+00   75.46     uncharacterized protein LOC103501659 OS=Cucumis melo OX=3656 GN=LOC103501659 PE=...
A0A0A0KNN6      0.0e+00   75.81     DUF4378 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G505210 PE=...
A0A6J1BX36      0.0e+00   68.56     uncharacterized protein LOC111006294 OS=Momordica charantia OX=3673 GN=LOC111006...
A0A6J1JSS4      2.7e-221  54.61     uncharacterized protein LOC111487197 OS=Cucurbita maxima OX=3661 GN=LOC111487197...
Match Name      E-value   Identity  Description
AT2G39390.1     4.6e-48   90.16     Ribosomal L29 family protein
AT3G09500.1     1.3e-47   89.34     Ribosomal L29 family protein
AT5G02610.1     2.3e-47   89.34     Ribosomal L29 family protein
AT3G55170.1     1.5e-46   87.70     Ribosomal L29 family protein
AT3G55170.2     1.5e-46   87.70     Ribosomal L29 family protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                        Source       Source Term    Source Description            Alignment
None       No IPR available                       COILS        Coil           Coil                          coord: 911..931
None       No IPR available                       GENE3D       6.10.250.3450  -                             coord: 975..1020; e-value: 5.9E-20; score: 73.3
None       No IPR available                       PANTHER      PTHR46836      AFADIN                        coord: 42..900
None       No IPR available                       PANTHER      PTHR46836:SF7  DUF4378 DOMAIN PROTEIN        coord: 42..900
None       No IPR available                       CDD          cd00427        Ribosomal_L29_HIP             coord: 905..961; e-value: 4.57047E-8; score: 48.6379
IPR036049  Ribosomal protein L29/L35 superfamily  GENE3D       1.10.287.310   -                             coord: 898..974; e-value: 1.6E-30; score: 107.3
IPR036049  Ribosomal protein L29/L35 superfamily  SUPERFAMILY  46561          Ribosomal protein L29 (L29p)  coord: 901..962
IPR001854  Ribosomal protein L29/L35              PFAM         PF00831        Ribosomal_L29                 coord: 905..961; e-value: 5.7E-15; score: 55.0
IPR001854  Ribosomal protein L29/L35              TIGRFAM      TIGR00012      TIGR00012                     coord: 905..961; e-value: 5.3E-15; score: 53.2
IPR001854  Ribosomal protein L29/L35              HAMAP        MF_00374       Ribosomal_L29                 coord: 901..961; score: 12.540656
IPR025486  Domain of unknown function DUF4378     PFAM         PF14309        DUF4378                       coord: 745..897; e-value: 1.8E-27; score: 96.7

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10022254.1  HG10022254.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006412      translation
cellular_component  GO:0005840      ribosome
molecular_function  GO:0003735      structural constituent of ribosome