Cla97C11G225350 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C11G225350
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Location: Cla97Chr11: 30791278 .. 30796055 (-)
RNA-Seq Expression: Cla97C11G225350
Synteny: Cla97C11G225350
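
The Location field above packs a chromosome name, a start, an end, and a strand into one string. The following is a minimal sketch of reading it, assuming the coordinates are 1-based and inclusive (a common convention for this kind of record, but not stated on this page). The helper name `parse_location` is hypothetical and is not part of any database API.

```python
import re

def parse_location(loc):
    """Split a 'Chr: start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    strand = m.group(4)
    return chrom, start, end, strand, end - start + 1

chrom, start, end, strand, span = parse_location("Cla97Chr11: 30791278 .. 30796055 (-)")
print(chrom, strand, span)  # Cla97Chr11 - 4778
```

Under that assumption the gene spans 4778 bp on the minus strand of Cla97Chr11.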
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTCCTGCTGTTAATGCTTTGAAGTTGAACATATGGACTCCAATGATTGAGAAATCTAGCATGAACTGGATATGTGGAAAGTTCCTTTCCTTTCAGAAGGGTGGCTGTTTATCTGTTGCTCAAGCTTCTTGGCCAGTATCCCAACCTACTTTCTCCTTTTAAGTTTCCTACCAAGGTTAGATCCTCAATTTATCAAGTGGGAATCGGATATCATTTTGTTGCACTAAGATGAGGGATTGATAGACTTTTGGGTTATTTCAAGAATGCGCTTTGCCTCCTATTGGGTTTGACATTTTCCTCTTCTGAGAAGTTGCTTGTGGTACTCTGTGATTGCCGACTATTTAAGGTGGTGAAAGCCTTCTTTTGAATGTTTGGTTGGAAATTGGTCAGAGAATCTTTGGTGAAAAGCTAGACATTAGCCAGCATTTGGGAGGGAATCAAAACTTTAGGCTGGTCTTTATAATATCTGTTATGGCTTCTGTATTTACGATGTTTTGTCCATTCTTTCCAAGGAGAGCAGTTCTTTTACACGTGCTTTTCATTTTCAATAATAAAAAATGCTTTTTCCTATTTTCAATTGAAAAAATACAAACAAGAAACAGAAAAGATACGAAACATGTATCTAGAAAAAAATTGAATTCAAGGATAAGGAAAAACAATGTCAGAGAAATAAAAGACTAATTGAGAACCCAAAAAAAACCACCTGAAAGGAAGAAGCATAGTAAGAAAATAAATGGATATCAGAACGCATGTGAAGAATGAGCAGAACAATGGACTCTTCAACAATAAAGCTAGATCTATGGATAGCCTTTGGGACCACGTTCTTTTGGTTGGGGTATTGTGGTCAAATATTTTTATTTCTTTTAAATCTTATAGTCTCTTTTTATTTCTTTTTTCTTTTATGCCAGTTGGAAGAGGTGTAGCTTCCTTAGCTTGGTGTTTGGCTTTCTTCCCCTTTCTTTTCTCATTGGGAGAAAAAAGAAAACAAAAAAAAAACCAAGAAAAAGTTGCTGCAACAGTAGCAACAGATTTGGAGATGAAGAAGCAATTTAAGAACATATTTAGGATGAAAGAGAATTTAATAGGAATGTGAAAGATGGAAGATTTAAGACAGAAAAGAGATGCTGTTATTTTCAGAAACACAGAAGATAACTTTAGTCACTTTCCAAACTTTTCATAGTATATTAGTTCTAGACATCGTTTTTCATAATCAAATGACTTAATGTGCTTTCTTTTTTTTCCTTGTATTTTCATTGTTTTGTTAATTAAGACTTACACCAAAACTCTGAAATTTCTTACAAGTATTCTGACTTTTAATTTCAACAGTGCAGAGTTGTAATTTTAAATGTCTAGTGGATGTCGGTTAGTTTTTTCTCTCTTTTTTTAATTTATTATTTTAATATTTTATTTTCATTGGAAGACGAGGAATTAAAAGCTGTGGTATTCTGCACATAATAAACTTCTGTTGAACACCAGCATTAGATTTTCCTGCTTGTGCTTATCTTTTTATTTTTTGTTCTAATGTTGTTTGATTACGTATTTATAAAATATAATTACTGCCGTCGATATTTGATTTATTGATCCTCACTTTAAGGCGCTTCTTCTGTGCTTAAGTGGCCTACTTCCATTAACAACATGCTCAGATCTAGAAGGCACTTTAACTCATGCTAAAACTGAATTTTGGACTTTTTTATGGATCATTTTGAACTGGCCATCTCCGCCAGCATGAATTTGTACAAAAATTGTGAAAGTGGGTGGTTCTAAACTTTTCTATTCTTTAATGAGTTGTCATGATTACCTTCTAATACTTTCCTCTATACCAGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAGGGAGAAAAGCGCATCGTACCTGCATCTCGTTTACCTGAGGGCAATGCCGTGACAACCCAGCCTAATGGACCTCAAGCAGCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTT
CTAGCCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCACTGTCTGCAAACTCACCTGGAGGTCCTTCATCCACAATGTTTGCTACAGGGCCATATGCGCATGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCCGAACTAGCTCATCTAACCACGCCTTCTTCCCCTGATGTGCCCTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCCTCATTTCCTGAAAGGGACTTCCCATCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTGGTACATCGTTGGCATCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCGAAGGATTCAGATGTCTACTCTTCTAGTGGGAATGGATACCAGAACCGGCACAGTAAATCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACAGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACAGTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTATCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAACTAAAATCCACGCATACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGCTGGCATTATGCAATGGTTATAAAGGTGAGCCGCTGTTCCCTGTTACAACAAGCACTGAAGCACACACTTGTTTCATAAAATGAGAAATCAAAAGAAAATAAAAATCAGATATGGTTTAGAAAACGTGTTTAAAATGTCATTATTAGAATTTTATACTTTTCTTTTTTATTTTACTATTATTGTTATTCTTTTTTTTTTTTTTTTTTGGGGGGGGGGGGGGGGGGATGGAATTTTGGAGGTATTATTTTATCCGTGACCCACAAGTCTGATACTATAAATAGTTAAGATTCTCCACTGTACACCTCCGGTTACAGTTGTTACCTGTTTGATGCAGACAATAAATTGCAAAGACAACCTGGTAATATGTCAGGATCAAGTACTTCAAACCAAGTTGAAAAAGACGTATTCTCAAGGACAGGGTCATCCAAAAATAGTCGCAAGTATAATCTTGGCTTATCCTGCTCTGATGCAGAGGTTGACTACGGAAGGGGAAGGAGCCCAAGGGAGGCCAGGGAAGATTTTTCATGGCATGACTAAGACATCCTCTGGAATAGTTTACAGTGTGTTTGTGTATCTGTTTTTCTGCTTTACAGGTTTCCATGGAATGTCTAACGTATGATCTAACCTGTTGTCTCCCTGAATTATGACTTGGCCACTGGATGGAAGAACGATTATAATTTGTGTTCTTTATCATGCCAATTTAGATATGGAATGGGTCGAATTTCTGTATTTGGTAAACGAAATTGACTTTTTTCAATGAATTATATACGTTGTATAATCTGCTTGCACGTTCACATCTGGTCAGCGTATCCAAACATCTGTTCTTTGAGGTCATTGATCGGGCTTTCTATAGAAACTAAGATTGCTAAAATTTCATGTAAAAAGGTTGGTGTTACACCAGAAACAGGTGGCTGTCGTTGTAAACAAATGTTAGATATTTCTTGCTCTTCTAAGATCCGGAATTGGTGTGGCAAAATGAGAGAAAACCGTGGAATGAACCTCTAGCAGCGATGGT
CGTCTTGCCCACCTGACTGGTTGGAAACTGGGAATTGAAATTGATAATTAGAATGCCCACCCTGTTGAGGTCAGGTCTCTACTTCGATTAGGGCAGGAATCCTTGGTTTGACAGGGGATAGGAGTGGATTAAAGTAGGGTCCTAGGGATGGGGCGGGAATGGCGAGAGTATCCCTCCTTGCCCCCTACCTCGAACCCATCCCATTTGGTTTTTTAATATATTTTTAAAATATATAAATATATTAAATATGTTTTATTTATGTGAAGTGGAAAATTTGTTTTCTTTTGCATTTTATTTTTGCTTGAGTTTTTAAAATTTTATTTCATTTTTTTTAACTGAAATTAACTTTATTAACTGTTTTTAATAAAGAAAATTCAATATGTAAGATTAATAAAGGTGTCTAAAGTATTTCGGGGATGTTCTCTGATGGGGAATTCTTGTCCATGCCTTCAATTTGCAGGGATGGGTATTAATTTCCCCCCGCAGATGGGAACAAGGAACCCCTCCCCCCAACCCGCGCCGTCCTGTTGCCAACCTTGAAATACTTAATTGCCTCAAATTAACCTTGAATGGTGTTTAAAACTCTGAAATGCATGTATTACTCTGGATTTTACAAACAAGTGAGATTAGACACTTAGGTAGCTCATATCACTTCTTCCCTTAGTTTATCTCTTACCATGTCGTTTCATTTGGCCCTTGTTACACGGATGCTGGTTGGGCTAGTTGCCCGAATGATCGTCGTAGGAGTAGTGGTTTAGTGGTTTTTGCAATTTTCTAG

mRNA sequence

ATGTCTCCTGCTGTTAATGCTTTGAAGTTGAACATATGGACTCCAATGATTGAGAAATCTAGCATGAACTGGATATGTGGAAAGTTCCTTTCCTTTCAGAAGGGTGGCTGTTTATCTGTTGCTCAAGCTTCTTGGCCAGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAGGGAGAAAAGCGCATCGTACCTGCATCTCGTTTACCTGAGGGCAATGCCGTGACAACCCAGCCTAATGGACCTCAAGCAGCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCACTGTCTGCAAACTCACCTGGAGGTCCTTCATCCACAATGTTTGCTACAGGGCCATATGCGCATGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCCGAACTAGCTCATCTAACCACGCCTTCTTCCCCTGATGTGCCCTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCCTCATTTCCTGAAAGGGACTTCCCATCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTGGTACATCGTTGGCATCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCGAAGGATTCAGATGTCTACTCTTCTAGTGGGAATGGATACCAGAACCGGCACAGTAAATCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACAGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACAGTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTATCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAACTAAAATCCACGCATACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGCTGGCATTATGCAATGGTTATAAAGACAATAAATTGCAAAGACAACCTGGTAATATGTCAGGATCAAGTACTTCAAACCAAAGGTTGACTACGGAAGGGGAAGGAGCCCAAGGGAGGCCAGGGAAGATTTTTCATGGCATGACTAAGACATCCTCTGGAATAGTTTACAGTGTGTTTGTGTATCTGTTTTTCTGCTTTACAGGCCCTTGTTACACGGATGCTGGTTGGGCTAGTTGCCCGAATGATCGTCGTAGGAGTAGTGGTTTAGTGGTTTTTGCAATTTTCTAG

Coding sequence (CDS)

ATGTCTCCTGCTGTTAATGCTTTGAAGTTGAACATATGGACTCCAATGATTGAGAAATCTAGCATGAACTGGATATGTGGAAAGTTCCTTTCCTTTCAGAAGGGTGGCTGTTTATCTGTTGCTCAAGCTTCTTGGCCAGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAGGGAGAAAAGCGCATCGTACCTGCATCTCGTTTACCTGAGGGCAATGCCGTGACAACCCAGCCTAATGGACCTCAAGCAGCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCACTGTCTGCAAACTCACCTGGAGGTCCTTCATCCACAATGTTTGCTACAGGGCCATATGCGCATGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCCGAACTAGCTCATCTAACCACGCCTTCTTCCCCTGATGTGCCCTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCCTCATTTCCTGAAAGGGACTTCCCATCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTGGTACATCGTTGGCATCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCGAAGGATTCAGATGTCTACTCTTCTAGTGGGAATGGATACCAGAACCGGCACAGTAAATCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACAGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACAGTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTATCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAACTAAAATCCACGCATACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGCTGGCATTATGCAATGGTTATAAAGACAATAAATTGCAAAGACAACCTGGTAATATGTCAGGATCAAGTACTTCAAACCAAAGGTTGACTACGGAAGGGGAAGGAGCCCAAGGGAGGCCAGGGAAGATTTTTCATGGCATGACTAAGACATCCTCTGGAATAGTTTACAGTGTGTTTGTGTATCTGTTTTTCTGCTTTACAGGCCCTTGTTACACGGATGCTGGTTGGGCTAGTTGCCCGAATGATCGTCGTAGGAGTAGTGGTTTAGTGGTTTTTGCAATTTTCTAG

Protein sequence

MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQRLTTEGEGAQGRPGKIFHGMTKTSSGIVYSVFVYLFFCFTGPCYTDAGWASCPNDRRRSSGLVVFAIF
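
The protein sequence above is the conceptual translation of the CDS. As a small illustration, the sketch below translates only the first 30 bases of the CDS with the standard genetic code and reproduces the first ten residues of the protein. The function name `translate` and the choice to render unknown codons as `X` are assumptions made for this example; a real pipeline would more likely use an established library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
cds_prefix = "ATGTCTCCTGCTGTTAATGCTTTGAAGTTG"
print(translate(cds_prefix))  # MSPAVNALKL
```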
Homology
BLAST of Cla97C11G225350 vs. NCBI nr
Match: XP_038899313.1 (uncharacterized protein At1g76660 [Benincasa hispida])

HSP 1 Score: 788.5 bits (2035), Expect = 3.6e-224
Identity = 406/413 (98.31%), Positives = 409/413 (99.03%), Query Frame = 0

Query: 47  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLA 106
           GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLA
Sbjct: 14  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLA 73

Query: 107 PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT 166
           PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT
Sbjct: 74  PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT 133

Query: 167 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG 226
           TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG
Sbjct: 134 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG 193

Query: 227 SPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTS 286
           SPASSLVSPISRTSGDCLSSSFPERDFP QWNPSASLQDGKYPRSGSGRLFGNEKA  TS
Sbjct: 194 SPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKA-DTS 253

Query: 287 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 346
           LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD YSSSGNGYQNRH+KSPKQDVE
Sbjct: 254 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDAYSSSGNGYQNRHNKSPKQDVE 313

Query: 347 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 406
           EIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTST+LSAEESIEPPLLGEKLKS
Sbjct: 314 EIEAYRASFGFSADEIISTTQYVEISDVMEDSFTMRPFTSTNLSAEESIEPPLLGEKLKS 373

Query: 407 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ
Sbjct: 374 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 425
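
Each HSP in this section reports identical and positive-scoring positions as fractions of the alignment length. The sketch below recomputes the percentages for the first hit from those counts. The function name `parse_hsp_stats` is hypothetical, and the regular expression is written for the summary-line layout used on this page only; it also tolerates the spelling variant "Postives".

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from an HSP summary line and
    recompute the percentages, rounded to two decimals."""
    m = re.search(
        r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)",
        line,
    )
    if m is None:
        raise ValueError(f"unrecognized HSP summary line: {line!r}")
    identical, length, positives, length2 = map(int, m.groups())
    if length != length2:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 406/413 (98.31%), Positives = 409/413 (99.03%), Query Frame = 0"
)
print(stats)
# {'alignment_length': 413, 'identity_pct': 98.31, 'positives_pct': 99.03}
```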

BLAST of Cla97C11G225350 vs. NCBI nr
Match: KAG6591175.1 (hypothetical protein SDJN03_13521, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 772.7 bits (1994), Expect = 2.0e-219
Identity = 404/489 (82.62%), Positives = 423/489 (86.50%), Query Frame = 0

Query: 1   MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQAS---------------- 60
           M PAVN L L++WTP+IEKSSMNWICGKFLSFQKGGCLSV Q +                
Sbjct: 97  MPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQAILSRIVRLLDNFENDP 156

Query: 61  --------------WPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQA 120
                           GK+WGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPN P A
Sbjct: 157 CFLVVLTFLPSENLLVGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPA 216

Query: 121 AGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPY 180
           AGM  QATVI PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPY
Sbjct: 217 AGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPY 276

Query: 181 AHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKAN 240
           AHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSSVDLKGTGK N
Sbjct: 277 AHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKEN 336

Query: 241 YIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPR 300
           YIASNDLQ AYSLYPGSP+SSLVSPISRTSGDCL SSFPERDFP QWNPS S QDGKYPR
Sbjct: 337 YIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCL-SSFPERDFPPQWNPSVSPQDGKYPR 396

Query: 301 SGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSG 360
           +GSGRLFG+EKA GTSL SQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYS  G
Sbjct: 397 TGSGRLFGHEKA-GTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGG 456

Query: 361 NGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLS 420
           N  QNRH+KSPKQDVEE+EAYRASFGFSADEII TTQYVEIS VMEDSFTM+PFTSTSLS
Sbjct: 457 NVLQNRHNKSPKQDVEELEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLS 516

Query: 421 AEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGN 460
           AEES EPPLL E LKS HTTLQSQR IKS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGN
Sbjct: 517 AEESFEPPLLAENLKSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEDNKLQRQPGN 576

BLAST of Cla97C11G225350 vs. NCBI nr
Match: XP_022975613.1 (uncharacterized protein At1g76660-like isoform X1 [Cucurbita maxima] >XP_022975614.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 763.8 bits (1971), Expect = 9.5e-217
Identity = 394/459 (85.84%), Positives = 414/459 (90.20%), Query Frame = 0

Query: 1   MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCF 60
           M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCF
Sbjct: 11  MPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK------------GKKWGGCWGALSCF 70

Query: 61  HSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALP 120
           HSQKGEKRIVPASRLPEGN VTTQPN P  AGM  QATVI PSLLAPPSSPASFTNSALP
Sbjct: 71  HSQKGEKRIVPASRLPEGNVVTTQPNRPPEAGMAIQATVIDPSLLAPPSSPASFTNSALP 130

Query: 121 STVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 180
           ST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL
Sbjct: 131 STAQSPSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 190

Query: 181 AHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTS 240
           AHLTTPSSPDVPFA+FLSSS+DLKG GK NYIASNDLQ AYSLYPGSP+SSLVSPISRTS
Sbjct: 191 AHLTTPSSPDVPFAEFLSSSMDLKGAGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS 250

Query: 241 GDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF 300
           GDCL SSFPERDFP QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Sbjct: 251 GDCL-SSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKA-GTSLVSQDSNFFCPATF 310

Query: 301 AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSAD 360
           AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN +QNRH+KSPKQDVEE+EAYRASFGFSAD
Sbjct: 311 AQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVFQNRHNKSPKQDVEELEAYRASFGFSAD 370

Query: 361 EIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSA 420
           EIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL E L S HTTLQSQR IKS 
Sbjct: 371 EIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSP 430

Query: 421 PEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Sbjct: 431 PDVVQKDTCTEVLALCSVYEDNKLQRQPGNMSGSSTLNQ 455

BLAST of Cla97C11G225350 vs. NCBI nr
Match: XP_023521113.1 (uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023521114.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 763.8 bits (1971), Expect = 9.5e-217
Identity = 396/459 (86.27%), Positives = 414/459 (90.20%), Query Frame = 0

Query: 1   MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCF 60
           M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCF
Sbjct: 11  MPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK-----------QGKKWGGCWGALSCF 70

Query: 61  HSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALP 120
           HSQKGEKRIVPASRLPEGN VTTQPN P AAGM  QATVI PSLLAPPSSPASFTNSALP
Sbjct: 71  HSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALP 130

Query: 121 STVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 180
           ST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL
Sbjct: 131 STAQSPSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 190

Query: 181 AHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTS 240
           AHLTTPSSPDVPFA+FLSSSVDLKGTGK NYIASNDLQ AYSLYPGSP+SSLVSPISRTS
Sbjct: 191 AHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS 250

Query: 241 GDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF 300
           GDCL SSFPERDF  QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Sbjct: 251 GDCL-SSFPERDFLPQWNPSVSPQDGKYPRTGSGRLFGHEKA-GTSLVSQDSNFFCPATF 310

Query: 301 AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSAD 360
           AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN  QNRH+KSPKQDVEE+EAYRASFGFSAD
Sbjct: 311 AQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFGFSAD 370

Query: 361 EIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSA 420
           EIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL E L S HTTLQSQR IKS 
Sbjct: 371 EIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSP 430

Query: 421 PEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Sbjct: 431 PDVVQKDTCTEVLALCSVYEDNKLQRQPGNMSGSSTLNQ 456

BLAST of Cla97C11G225350 vs. NCBI nr
Match: XP_023521115.1 (uncharacterized protein At1g76660-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 763.5 bits (1970), Expect = 1.2e-216
Identity = 396/459 (86.27%), Positives = 414/459 (90.20%), Query Frame = 0

Query: 1   MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCF 60
           M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCF
Sbjct: 11  MPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK------------GKKWGGCWGALSCF 70

Query: 61  HSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALP 120
           HSQKGEKRIVPASRLPEGN VTTQPN P AAGM  QATVI PSLLAPPSSPASFTNSALP
Sbjct: 71  HSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALP 130

Query: 121 STVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 180
           ST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL
Sbjct: 131 STAQSPSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 190

Query: 181 AHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTS 240
           AHLTTPSSPDVPFA+FLSSSVDLKGTGK NYIASNDLQ AYSLYPGSP+SSLVSPISRTS
Sbjct: 191 AHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS 250

Query: 241 GDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF 300
           GDCL SSFPERDF  QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Sbjct: 251 GDCL-SSFPERDFLPQWNPSVSPQDGKYPRTGSGRLFGHEKA-GTSLVSQDSNFFCPATF 310

Query: 301 AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSAD 360
           AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN  QNRH+KSPKQDVEE+EAYRASFGFSAD
Sbjct: 311 AQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFGFSAD 370

Query: 361 EIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSA 420
           EIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL E L S HTTLQSQR IKS 
Sbjct: 371 EIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSP 430

Query: 421 PEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Sbjct: 431 PDVVQKDTCTEVLALCSVYEDNKLQRQPGNMSGSSTLNQ 455

BLAST of Cla97C11G225350 vs. ExPASy Swiss-Prot
Match: Q9SRE5 (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 2.2e-115
Identity = 246/402 (61.19%), Positives = 284/402 (70.65%), Query Frame = 0

Query: 48  KRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVITPSL 107
           KRWGGC G  SCF SQKG KRIVPASR+PE GN   +QPNG   AG+ N   A  I  SL
Sbjct: 9   KRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINLSL 68

Query: 108 LAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSA 167
           LAPPSSPASFTNSALPST QSP+C++SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS 
Sbjct: 69  LAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPYAHETQLVSPPVFST 128

Query: 168 FTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLY 227
           FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLY
Sbjct: 129 FTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY---NDLQATYSLY 188

Query: 228 PGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGG 287
           PGSPAS+L SPISR SGD L S                 Q+GK  RS SG  FG +   G
Sbjct: 189 PGSPASALRSPISRASGDGLLSP----------------QNGKCSRSDSGNTFGYD-TNG 248

Query: 288 TSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSS--GNGYQNRHSKSP 347
            S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY ++  GNG QNR ++SP
Sbjct: 249 VSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNGYGNGNQNRQNRSP 308

Query: 348 KQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLG 407
           KQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G
Sbjct: 309 KQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS------------PSDG 368

Query: 408 EKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNK 444
           +KL      L SQ S KS  ++  +    +     N YKD+K
Sbjct: 369 QKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKDHK 378

BLAST of Cla97C11G225350 vs. ExPASy TrEMBL
Match: A0A6J1IL36 (uncharacterized protein At1g76660-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111475260 PE=4 SV=1)

HSP 1 Score: 763.8 bits (1971), Expect = 4.6e-217
Identity = 394/459 (85.84%), Positives = 414/459 (90.20%), Query Frame = 0

Query: 1   MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCF 60
           M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCF
Sbjct: 11  MPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK------------GKKWGGCWGALSCF 70

Query: 61  HSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALP 120
           HSQKGEKRIVPASRLPEGN VTTQPN P  AGM  QATVI PSLLAPPSSPASFTNSALP
Sbjct: 71  HSQKGEKRIVPASRLPEGNVVTTQPNRPPEAGMAIQATVIDPSLLAPPSSPASFTNSALP 130

Query: 121 STVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 180
           ST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL
Sbjct: 131 STAQSPSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 190

Query: 181 AHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTS 240
           AHLTTPSSPDVPFA+FLSSS+DLKG GK NYIASNDLQ AYSLYPGSP+SSLVSPISRTS
Sbjct: 191 AHLTTPSSPDVPFAEFLSSSMDLKGAGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS 250

Query: 241 GDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF 300
           GDCL SSFPERDFP QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Sbjct: 251 GDCL-SSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKA-GTSLVSQDSNFFCPATF 310

Query: 301 AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSAD 360
           AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN +QNRH+KSPKQDVEE+EAYRASFGFSAD
Sbjct: 311 AQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVFQNRHNKSPKQDVEELEAYRASFGFSAD 370

Query: 361 EIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSA 420
           EIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL E L S HTTLQSQR IKS 
Sbjct: 371 EIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSP 430

Query: 421 PEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Sbjct: 431 PDVVQKDTCTEVLALCSVYEDNKLQRQPGNMSGSSTLNQ 455

BLAST of Cla97C11G225350 vs. ExPASy TrEMBL
Match: A0A6J1F9B9 (uncharacterized protein At1g76660-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111443280 PE=4 SV=1)

HSP 1 Score: 761.1 bits (1964), Expect = 3.0e-216
Identity = 395/459 (86.06%), Positives = 412/459 (89.76%), Query Frame = 0

Query: 1   MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCF 60
           M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCF
Sbjct: 11  MPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK------------GKKWGGCWGALSCF 70

Query: 61  HSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALP 120
           HSQKGEKRIVPASRLPEGN VTTQPN P AAGM  QATVI PSLLAPPSSPASFTNSALP
Sbjct: 71  HSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALP 130

Query: 121 STVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 180
           ST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL
Sbjct: 131 STAQSPSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPEL 190

Query: 181 AHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTS 240
           AHLTTPSSPDVPFA+FLSSSVDLKGTGK NYIASNDLQ AYSLYPGSP+SSLVSPISRTS
Sbjct: 191 AHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS 250

Query: 241 GDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF 300
           GDCL SSFPERDFP QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Sbjct: 251 GDCL-SSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKA-GTSLVSQDSNFFCPATF 310

Query: 301 AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSAD 360
           AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN  QNRH+KSPKQDVEE+EAYRASFGFSAD
Sbjct: 311 AQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFGFSAD 370

Query: 361 EIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSA 420
           EIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EP LL E L S HTTLQS R IKS 
Sbjct: 371 EIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSP 430

Query: 421 PEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           P+VV+K+TCTEVLALC  Y+DNKLQRQPGNMSGSST NQ
Sbjct: 431 PDVVQKDTCTEVLALCTVYEDNKLQRQPGNMSGSSTLNQ 455

BLAST of Cla97C11G225350 vs. ExPASy TrEMBL
Match: A0A5A7VFM0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold255G004100 PE=4 SV=1)

HSP 1 Score: 756.1 bits (1951), Expect = 9.5e-215
Identity = 392/413 (94.92%), Positives = 400/413 (96.85%), Query Frame = 0

Query: 47  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLA 106
           GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLA
Sbjct: 15  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLA 74

Query: 107 PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT 166
           PPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++ATGPYAHETQ VSPPVFSAFT
Sbjct: 75  PPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIYATGPYAHETQPVSPPVFSAFT 134

Query: 167 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG 226
           TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPG
Sbjct: 135 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPG 194

Query: 227 SPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTS 286
           SPASSLVSPISRTSGDCLSSSFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKA GTS
Sbjct: 195 SPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKA-GTS 254

Query: 287 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 346
           LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Sbjct: 255 LASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE 314

Query: 347 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 406
           EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS
Sbjct: 315 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS 374

Query: 407 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           +HTTLQ+QRSIKSAPEVVEKETCTEV ALCNGYKDNKLQRQPG++ GSSTS+Q
Sbjct: 375 SHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQRQPGDILGSSTSDQ 426

BLAST of Cla97C11G225350 vs. ExPASy TrEMBL
Match: A0A5D3D8J8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G00170 PE=4 SV=1)

HSP 1 Score: 756.1 bits (1951), Expect = 9.5e-215
Identity = 392/413 (94.92%), Positives = 400/413 (96.85%), Query Frame = 0

Query: 47  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLA 106
           GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLA
Sbjct: 8   GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLA 67

Query: 107 PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT 166
           PPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++ATGPYAHETQ VSPPVFSAFT
Sbjct: 68  PPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIYATGPYAHETQPVSPPVFSAFT 127

Query: 167 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG 226
           TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPG
Sbjct: 128 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPG 187

Query: 227 SPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTS 286
           SPASSLVSPISRTSGDCLSSSFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKA GTS
Sbjct: 188 SPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKA-GTS 247

Query: 287 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 346
           LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Sbjct: 248 LASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE 307

Query: 347 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 406
           EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS
Sbjct: 308 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS 367

Query: 407 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           +HTTLQ+QRSIKSAPEVVEKETCTEV ALCNGYKDNKLQRQPG++ GSSTS+Q
Sbjct: 368 SHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQRQPGDILGSSTSDQ 419

BLAST of Cla97C11G225350 vs. ExPASy TrEMBL
Match: A0A1S3BV86 (uncharacterized protein At1g76660 OS=Cucumis melo OX=3656 GN=LOC103493867 PE=4 SV=1)

HSP 1 Score: 756.1 bits (1951), Expect = 9.5e-215
Identity = 392/413 (94.92%), Positives = 400/413 (96.85%), Query Frame = 0

Query: 47  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLA 106
           GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLA
Sbjct: 14  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLA 73

Query: 107 PPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFT 166
           PPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++ATGPYAHETQ VSPPVFSAFT
Sbjct: 74  PPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIYATGPYAHETQPVSPPVFSAFT 133

Query: 167 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPG 226
           TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPG
Sbjct: 134 TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPG 193

Query: 227 SPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTS 286
           SPASSLVSPISRTSGDCLSSSFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKA GTS
Sbjct: 194 SPASSLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKA-GTS 253

Query: 287 LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE 346
           LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Sbjct: 254 LASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE 313

Query: 347 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKS 406
           EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS
Sbjct: 314 EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS 373

Query: 407 THTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ 460
           +HTTLQ+QRSIKSAPEVVEKETCTEV ALCNGYKDNKLQRQPG++ GSSTS+Q
Sbjct: 374 SHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQRQPGDILGSSTSDQ 425

BLAST of Cla97C11G225350 vs. TAIR 10
Match: AT1G76660.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 353 Blast hits to 231 proteins in 60 species: Archae - 0; Bacteria - 6; Metazoa - 57; Fungi - 22; Plants - 125; Viruses - 4; Other Eukaryotes - 139 (source: NCBI BLink). )

HSP 1 Score: 417.5 bits (1072), Expect = 1.5e-116
Identity = 246/402 (61.19%), Positives = 284/402 (70.65%), Query Frame = 0

Query: 48  KRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVITPSL 107
           KRWGGC G  SCF SQKG KRIVPASR+PE GN   +QPNG   AG+ N   A  I  SL
Sbjct: 9   KRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGINLSL 68

Query: 108 LAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSA 167
           LAPPSSPASFTNSALPST QSP+C++SL+ANSPGGPSS+M+ATGPYAHETQLVSPPVFS 
Sbjct: 69  LAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPYAHETQLVSPPVFST 128

Query: 168 FTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLY 227
           FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLY
Sbjct: 129 FTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGHY---NDLQATYSLY 188

Query: 228 PGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGG 287
           PGSPAS+L SPISR SGD L S                 Q+GK  RS SG  FG +   G
Sbjct: 189 PGSPASALRSPISRASGDGLLSP----------------QNGKCSRSDSGNTFGYD-TNG 248

Query: 288 TSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSS--GNGYQNRHSKSP 347
            S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY ++  GNG QNR ++SP
Sbjct: 249 VSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNGYGNGNQNRQNRSP 308

Query: 348 KQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLG 407
           KQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G
Sbjct: 309 KQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS------------PSDG 368

Query: 408 EKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNK 444
           +KL      L SQ S KS  ++  +    +     N YKD+K
Sbjct: 369 QKLLRREANLLSQTSPKSEADLDSQVVDFQSPKSSNSYKDHK 378

BLAST of Cla97C11G225350 vs. TAIR 10
Match: AT5G52430.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 134.4 bits (337), Expect = 2.6e-31
Identity = 102/231 (44.16%), Positives = 133/231 (57.58%), Query Frame = 0

Query: 49  RWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPP 108
           RWG CW   SCF +QK  KRI  A  +PE   VT+              TV+ P  +APP
Sbjct: 35  RWGKCWSLYSCFGTQKNNKRIGNAVLVPE--PVTSGVPVVTVQNSATSTTVVLP-FIAPP 94

Query: 109 SSPASFTNSALPSTVQSPSCFMSLSAN--SPGGPSSTMFATGPYAHETQLVSPPVFSAFT 168
           SSPASF  S   S   SP   +SL++N  SP  P S +F  GPYA+ETQ V+PPVFSAF 
Sbjct: 95  SSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQS-VFTVGPYANETQPVTPPVFSAFI 154

Query: 169 TEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDL----KGTGKANYIASNDLQ-AA 228
           TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      +G     +S+  +  +
Sbjct: 155 TEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKFSSSHYEFRS 214

Query: 229 YSLYPGSP-ASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPR 271
             + PGSP   +L+SP S  S    SS +P +      +P    + G+ P+
Sbjct: 215 NQVCPGSPGGGNLISPGSVISNSGTSSPYPGK------SPMVEFRIGEPPK 255

BLAST of Cla97C11G225350 vs. TAIR 10
Match: AT1G63720.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132; Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes - 79 (source: NCBI BLink). )

HSP 1 Score: 124.0 bits (310), Expect = 3.5e-28
Identity = 86/207 (41.55%), Positives = 120/207 (57.97%), Query Frame = 0

Query: 48  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAP 107
           ++W   W  L CF S +  KRI  +  +PE  ++++  +    +G   ++ + T   +AP
Sbjct: 38  RKWWNRWSLLKCFGSSRQRKRIGNSVLVPEPVSMSSSNSTTSNSGY--RSVITTLPFIAP 97

Query: 108 PSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTT 167
           PSSPASF  S  PS  QSP   +S S   P     ++FA GPYAHETQLVSPPVFS +TT
Sbjct: 98  PSSPASFFQSEPPSATQSPVGILSFSP-LPCNNRPSIFAIGPYAHETQLVSPPVFSTYTT 157

Query: 168 EPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSL 227
           EPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L
Sbjct: 158 EPSSAPITPPLDDSSIYLTTTTPSSPEVPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQL 217

Query: 228 YPGSPASSLVSPISRTSGDCLSSSFPE 251
            PGSP   L+SP   + G   +S FP+
Sbjct: 218 PPGSPLGQLISP---SPGSGPTSPFPD 238

BLAST of Cla97C11G225350 vs. TAIR 10
Match: AT4G25620.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 122.9 bits (307), Expect = 7.9e-28
Identity = 105/263 (39.92%), Positives = 133/263 (50.57%), Query Frame = 0

Query: 39  SVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQAT 98
           S  Q S   K+ G  W    CF S+K  KRI  A  +PE  A +     P     +N  +
Sbjct: 24  SRTQPSSVQKKRGSWWSLYWCFGSKKNNKRIGHAVLVPE-PAASGAAVAPVQNSSSNSTS 83

Query: 99  VITPSLLAPPSSPASFTNSALPST--VQSPSCFMSLSANSPGGPSSTMFATGPYAHETQL 158
           +  P  +APPSSPASF  S  PS      P    SL+ N P  PS+  F  GPYAHETQ 
Sbjct: 84  IFMP-FIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEP--PSA--FTIGPYAHETQP 143

Query: 159 VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANY 218
           V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     +
Sbjct: 144 VTPPVFSAFTTEPSTAPFTPPPE-----SPSSPEVPFAQLLTSSLERARRNSGGGMNQKF 203

Query: 219 IASNDLQAAYSLYPGSPASSLVSPISRTS----GDCL--------------SSSFPERDF 277
            A++    +  +YPGSP  +L+SP S TS    G C                  F  R +
Sbjct: 204 SAAHYEFKSCQVYPGSPGGNLISPGSGTSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKW 263

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038899313.1  3.6e-224  98.31     uncharacterized protein At1g76660 [Benincasa hispida]
KAG6591175.1    2.0e-219  82.62     hypothetical protein SDJN03_13521, partial [Cucurbita argyrosperma subsp. sorori...
XP_022975613.1  9.5e-217  85.84     uncharacterized protein At1g76660-like isoform X1 [Cucurbita maxima] >XP_0229756...
XP_023521113.1  9.5e-217  86.27     uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo] >...
XP_023521115.1  1.2e-216  86.27     uncharacterized protein At1g76660-like isoform X2 [Cucurbita pepo subsp. pepo]

Match Name      E-value   Identity  Description
Q9SRE5          2.2e-115  61.19     Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 P...

Match Name      E-value   Identity  Description
A0A6J1IL36      4.6e-217  85.84     uncharacterized protein At1g76660-like isoform X1 OS=Cucurbita maxima OX=3661 GN...
A0A6J1F9B9      3.0e-216  86.06     uncharacterized protein At1g76660-like isoform X1 OS=Cucurbita moschata OX=3662 ...
A0A5A7VFM0      9.5e-215  94.92     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3D8J8      9.5e-215  94.92     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BV86      9.5e-215  94.92     uncharacterized protein At1g76660 OS=Cucumis melo OX=3656 GN=LOC103493867 PE=4 S...

Match Name      E-value   Identity  Description
AT1G76660.1     1.5e-116  61.19     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT5G52430.1     2.6e-31   44.16     hydroxyproline-rich glycoprotein family protein
AT1G63720.1     3.5e-28   41.55     BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam...
AT4G25620.1     7.9e-28   39.92     hydroxyproline-rich glycoprotein family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term    IPR Description                          Source        Source Term     Source Description                      Alignment
None        No IPR available                         MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 445..474
None        No IPR available                         MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 244..284
None        No IPR available                         MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 445..465
None        No IPR available                         MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 320..338
None        No IPR available                         MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 315..344
None        No IPR available                         MOBIDB_LITE   mobidb-lite     disorder_prediction                     coord: 244..267
None        No IPR available                         PANTHER       PTHR31798:SF3   OS01G0103800 PROTEIN                    coord: 47..473
IPR040420   Uncharacterized protein At1g76660-like   PANTHER       PTHR31798       HYDROXYPROLINE-RICH GLYCOPROTEIN-LIKE   coord: 47..473
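
Several of the mobidb-lite disorder_prediction coordinates listed above overlap or are nested within each other. A minimal Python sketch of collapsing them into distinct regions is given below; it assumes the coordinates are 1-based, inclusive residue positions on the protein, and the function name merge_intervals is illustrative rather than part of InterProScan.

```python
def merge_intervals(intervals):
    """Merge overlapping or nested (start, end) ranges.

    Assumption: coordinates are 1-based and inclusive.
    """
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates copied from the InterPro table above.
mobidb = [(445, 474), (244, 284), (445, 465),
          (320, 338), (315, 344), (244, 267)]

regions = merge_intervals(mobidb)
total = sum(end - start + 1 for start, end in regions)
print(regions)  # [(244, 284), (315, 344), (445, 474)]
print(total)    # 101
```

Under that assumption the six rows reduce to three predicted disordered regions covering 101 residues in total.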

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C11G225350.2  Cla97C11G225350.2  mRNA