Cp4.1LG20g02030 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g02030
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Protein of unknown function (DUF789)
Location: Cp4.1LG20: 1093629 .. 1097908 (-)
RNA-Seq Expression: Cp4.1LG20g02030
Synteny: Cp4.1LG20g02030
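
The Location field can be turned into a span length directly. The following is a minimal Python sketch, assuming the field always follows the `chrom: start .. end (strand)` pattern shown above and that coordinates are 1-based and inclusive (as in GFF3). The function name `parse_location` is illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'chrom: start .. end (strand)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cp4.1LG20: 1093629 .. 1097908 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 4280 bp on the minus strand of Cp4.1LG20.
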
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTTCAGTTCCCATTAAAACCTTTTTTCTCTCTGAAACAAATTCTCAGTCAAAACCTCTGCTCGAATTTCAGCGTTATCCTCAACTCTTCGTCTTCTTCATCTCTCGATTCGACCTTTTGCTTTCTTCCTCCGTCTTCTTCTTCCACCATTTCGGAAAACCACGACTCTACTGAACCAGAGATTCGCCGGTGGTTTGATTGCTAATTAAGAGTATGTTTTCATCTTCTTCTCTCTTTTTTCCTTTCCTTTTATCTTCATTGCTCGTTCTCTGTAATCGATACTTCCACTGTGCCGCTGAATTTGCTTTTCCTTTTTTATTTCTCTCAATTTTCAATTCCTTCTGTTATTTCCTTTTTCACCAGCCGGATTTTCCGATCAGGTGCCTCATTCTTCTTTATATTCAACCAATTCGTCAATCATCCACCACCGGAAACGCACATTCCTCTGAACGAAACGAAGACCTAGCCGGAATTTCGATCAGAGGAGGCGAAAAGAACGAGAAAATCGATGGCTCAAACGGCGCCGTTTAGTTCCTCTAATTTGGAACGGTTTCTTCAATGCGTTACTCCTTGCGTTCCTTGGCGAACTCTTCCTCGGGTAAAAATAAATAAATAAATATATATTCAGCATTTTATTTTCTTTAATTATTTAATTATTATTTATTATTTCATTATAAATCCTACATTTTCCCCCTTTCCCTCCATTATAATTTTTCATTTATTTATTTCTGAATTTGTATCTGCAAAACCATGTGACTCACGATGTGGATAATTTTTATTATATAATTATTTTAATTTCCACCTCAACCCTAATTTTGGAAATTGAAAATTCCCCCATTTTCCACCACCAATATAATTTTCTCCATTTCCCCATAGTCAGCTTTATTTATTTATTTATTAATTAAATAAAATAAAACTAAATCATGGAAAGGAATTAACAAAAGGTTCATTCTTAAATTTCTTACCATATCTTAATTTTACGCAAATTTCTTTCCATAACAAATAATTAATGTAGTGGGCCCTTAATGGGCCGACGTAAGTCATAATTTATTATTTATTTAAAGATTTAATGTCATTTTTTGAAGCTCAATTTAATTTAGATAAATAAGCAAATTTATTATTTTCGTCGAAGTTGGCCCCACCCTTATCTTCAGAATTTCTTATTTTTGTCATTTGCTTGAAAAATAAATTAAATAGATAGGACATACGACATGTTTTCAAATGTCTTGTCACTATAAGTCTTCCCTCTGCGCCACGTGGCATTTTTTTATTATTTTAATTTTAATTTTTTTTTTCGGCCATGAATAAATACTTTCATAAATTAAATTTCATTGTTGTGGGTCATCAGAAAAATTGTCAAGTGTCTTTAACGTTAAAGTCTTCAAAATGAGATTTAATGGTTTTTTTTATCTTTTTTGCTCATTTATTGATGATATCGCTGCATGTTATCCCTTTGATTGCTTCCTCTAAGCCTATGATGTTTCTTTTCCTTATAGAGTTGTCTTCATGATCTCAATAGTCAGTGGCAACAACCTGATAAAGATACCATTCGGTACTTCACGCTTGGTGAGCTGTGGGATTACTACGATGAATGGAGTGCATATGGTGCTGGAATACCGTTACAATTGAATGCTTTTGAAACTGCTACTCAATTCTACGTGCCTTATCTGTCTGCAATTCAGATCTACACAAGCAAGTCTGTTCCACCTTCCAGGTATACTGATGCAACTTTCATTTCCTTCTGCTGCCTTTGAAGTTTGAAAACTGTGAGATCCTATGCTAGTTGGAGAGGGGAACGAAGCACTCCTTGTAAGGGTATGGAAACCTCTCCTTAACAGACGTGTTTTAAAATCGTGAGACTAACGGCAATACGTAACGAGGCAAAGTGGACAATATCTGCTAGCGGTGGGGTTGGGCTGTTATAAATGGTATCAGAGCCAGACATTTAGCAGTGTGCCAGTGAGGACGCTAGGCCCTCAAGGGGGTGGATTGTAGTATC
ACACATTGGTTGAAGAGGAGAACAAAACACTCCTTATAAGGGTGTGGAAACCTCTTCCTAGCAGACGCGTTTTGAAACCGTGAGGCTGATGGTGACACGTAACGAGCCAAAGCGAACAATATCTTCTAGCGGTGGGCTTGAGCTGTTACAAATGGTATTAGAGTTAGTCATTGGATGGTGCAAGACACTGGTCGGAGTAGGGCTAGATCCTCTTCATAGTAGACGCGTTTTAAAACCTTGAGGGGAAACCTAGAAGGGAAAGCCCAAAGAAGACAATATCTGCTAGCTAGCGGTGGGCTTGGGCTGTTACAAATGGTCTCTAACGGTGTGCCAACGAGAACACTGGTCCCTGAGGGGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGGGGAACGGAGCACTCCTTATAAGGGTGTGGAAACCTCTCCGTAGTAGACGTGTTTTAAAATCGTGAAGCTGACAGCGATACGTAACGGGTCAAAGCGGATAATATCTGCTAGCGGTGAGCTTGAACTGTTTTGTAACAGATATGTTTCTTTTTGACAAATGGTTGTTTATTGGAATTTATGAAAGAGGGAATATTAGTGTGAGTTTTAAAGGAAATGTTTGATATACAATTTAGTTTTTAGTGCATTGTGTTGCAAATAAGCTATATTTGTTCGAGTTGTGTGTGTAGAAGCCGAAGATATGATAGTGACATGGCAGAATGTGAAAGCGATTCTTGGAGTGACGATAGCTGGAGTGACATCTTGTCAAGATCCGTGAGCAACAACTCTAGCAGGACATGGGATGCTGTTTCTGAAGATTCGATCTTCGACCAGGAGGGATCGTGGCCGCTGAGGGAAAAACTTGGCTACCTCTCTTTACAGTATATGGAGATGTCTTCTCCTTATTGGAGAGTGCCTTTTATGGACAAGGTAGATCCTTTGTTGTTCTAAATTGATCATGGTATTTATCTTCACGGATAGAAACTGATGTTTTCAAATTTAACAGCTCTATATTTGTCGATAAGCCTTCGATAGAAGATCGAATGATAGTTTTGATATTCAAATGAAAAAAGAAAATAGGTGGAATTTAGATGGTCGAGTACTGCAAGAACACATAAAGCATTGAAGAAGTGTAAAATGATATTATCAGCATCTTCTTCTTTGCTTATGTTAGACGATCTGCATTTCAGATAAGGGAACTAACTCAAAACTATCCAGCACTAACTACACTTAGGAGTGTGGATCTTTCTCCAGCTAGTTGGATGGCTGTTTCCTGGTAAAATTCGCCTTCCCTATCGACCGCTCCACGCAAAAGCTTATTTGCCGTTTCCTTTTGCTTATTGGTTCTGTTCAATGCTTTTAACTTGATTCAGGTACCCTATTTATCATATCCCTAGTCAGAAAAACGACAAAGACTTCGCGACGTGCTTTCTAACGTATCATACATTGTCATCATCTTTTCAAGGTGTGGGACTATCAATGTTTCTTTTCAGAAATGTTATGAACCAATATCTGTCTATACACTACAAAGTTAAATGTTTGATGACTGCAGATTGCTCTATGGATCATGAAGGGAAAGATGGATGTAGTTGTTCCGAATCCGATACCGGGGGTCAAATCGATGGCAAAACTAGTACTTCTACTTCTACTAGCCTTTCTCCATTTGGGTTAGCCACTTATAGAATGCAGGGAGACCTCTGGTTGAAGCCAGACACATCTGACTACGACCGAATCGTCGATCTCTATCATGCAGCTGATTCATGGTTGAAGCAGCTCGGCGTCCAGCATCACGACTTCAACTTCTTTTCCCTGCACTCGAGCATGTAAATAATCAGTTTCCATTTCGTAGGCTGGCTTACTGATTCTCCTTCATGCAGGTTGTATATTTCTGTTCATGCTTTTGCAACCAACATGAGAAGCTCTTACCTCTGCCTCTGCTCACTGCAGTTCATTTCACAGTTAGAAAACCAGGAGTTTTGATTCAAAACCAACGCTTTTGTACTTTTTTGC
TCATGAAAAATGATGAACATTGCTGTGGTTTAAAAGCAGAGGCATGGGTGAGGGGTTTTTTCTTTGGATTCAGGCTTCAGTTTTTCTCCCCAGTCTGCTTTTAACCTTTTATGGTATGAACTTCCCTAGACATGTAGGTTGCTATTAAGACATGCCTGATTGGGAGTTGTTTTGTAATTGAGTGTCAATTTTGTTTATCTGGTGGAGTTGTAATGTAAGTTTTTGTTCTTGTTGTTCTTGTTGTTCTCTGTCTTTACTTTAATCGTTCTTCAATGGTTGC

mRNA sequence

CTCTTCAGTTCCCATTAAAACCTTTTTTCTCTCTGAAACAAATTCTCAGTCAAAACCTCTGCTCGAATTTCAGCGTTATCCTCAACTCTTCGTCTTCTTCATCTCTCGATTCGACCTTTTGCTTTCTTCCTCCGTCTTCTTCTTCCACCATTTCGGAAAACCACGACTCTACTGAACCAGAGATTCGCCGGTGGTTTGATTGCTAATTAAGACCGGATTTTCCGATCAGGTGCCTCATTCTTCTTTATATTCAACCAATTCGTCAATCATCCACCACCGGAAACGCACATTCCTCTGAACGAAACGAAGACCTAGCCGGAATTTCGATCAGAGGAGGCGAAAAGAACGAGAAAATCGATGGCTCAAACGGCGCCGTTTAGTTCCTCTAATTTGGAACGGTTTCTTCAATGCGTTACTCCTTGCGTTCCTTGGCGAACTCTTCCTCGGAGTTGTCTTCATGATCTCAATAGTCAGTGGCAACAACCTGATAAAGATACCATTCGGTACTTCACGCTTGGTGAGCTGTGGGATTACTACGATGAATGGAGTGCATATGGTGCTGGAATACCGTTACAATTGAATGCTTTTGAAACTGCTACTCAATTCTACGTGCCTTATCTGTCTGCAATTCAGATCTACACAAGCAAAAGCCGAAGATATGATAGTGACATGGCAGAATGTGAAAGCGATTCTTGGAGTGACGATAGCTGGAGTGACATCTTGTCAAGATCCGTGAGCAACAACTCTAGCAGGACATGGGATGCTGTTTCTGAAGATTCGATCTTCGACCAGGAGGGATCGTGGCCGCTGAGGGAAAAACTTGGCTACCTCTCTTTACAGTATATGGAGATGTCTTCTCCTTATTGGAGAGTGCCTTTTATGGACAAGATAAGGGAACTAACTCAAAACTATCCAGCACTAACTACACTTAGGAGTGTGGATCTTTCTCCAGCTAGTTGGATGGCTGTTTCCTGGTACCCTATTTATCATATCCCTAGTCAGAAAAACGACAAAGACTTCGCGACGTGCTTTCTAACGTATCATACATTGTCATCATCTTTTCAAGATTGCTCTATGGATCATGAAGGGAAAGATGGATGTAGTTGTTCCGAATCCGATACCGGGGGTCAAATCGATGGCAAAACTAGTACTTCTACTTCTACTAGCCTTTCTCCATTTGGGTTAGCCACTTATAGAATGCAGGGAGACCTCTGGTTGAAGCCAGACACATCTGACTACGACCGAATCGTCGATCTCTATCATGCAGCTGATTCATGGTTGAAGCAGCTCGGCGTCCAGCATCACGACTTCAACTTCTTTTCCCTGCACTCGAGCATGTAAATAATCAGTTTCCATTTCGTAGGCTGGCTTACTGATTCTCCTTCATGCAGGTTGTATATTTCTGTTCATGCTTTTGCAACCAACATGAGAAGCTCTTACCTCTGCCTCTGCTCACTGCAGTTCATTTCACAGTTAGAAAACCAGGAGTTTTGATTCAAAACCAACGCTTTTGTACTTTTTTGCTCATGAAAAATGATGAACATTGCTGTGGTTTAAAAGCAGAGGCATGGGTGAGGGGTTTTTTCTTTGGATTCAGGCTTCAGTTTTTCTCCCCAGTCTGCTTTTAACCTTTTATGGTATGAACTTCCCTAGACATGTAGGTTGCTATTAAGACATGCCTGATTGGGAGTTGTTTTGTAATTGAGTGTCAATTTTGTTTATCTGGTGGAGTTGTAATGTAAGTTTTTGTTCTTGTTGTTCTTGTTGTTCTCTGTCTTTACTTTAATCGTTCTTCAATGGTTGC

Coding sequence (CDS)

ATGGCTCAAACGGCGCCGTTTAGTTCCTCTAATTTGGAACGGTTTCTTCAATGCGTTACTCCTTGCGTTCCTTGGCGAACTCTTCCTCGGAGTTGTCTTCATGATCTCAATAGTCAGTGGCAACAACCTGATAAAGATACCATTCGGTACTTCACGCTTGGTGAGCTGTGGGATTACTACGATGAATGGAGTGCATATGGTGCTGGAATACCGTTACAATTGAATGCTTTTGAAACTGCTACTCAATTCTACGTGCCTTATCTGTCTGCAATTCAGATCTACACAAGCAAAAGCCGAAGATATGATAGTGACATGGCAGAATGTGAAAGCGATTCTTGGAGTGACGATAGCTGGAGTGACATCTTGTCAAGATCCGTGAGCAACAACTCTAGCAGGACATGGGATGCTGTTTCTGAAGATTCGATCTTCGACCAGGAGGGATCGTGGCCGCTGAGGGAAAAACTTGGCTACCTCTCTTTACAGTATATGGAGATGTCTTCTCCTTATTGGAGAGTGCCTTTTATGGACAAGATAAGGGAACTAACTCAAAACTATCCAGCACTAACTACACTTAGGAGTGTGGATCTTTCTCCAGCTAGTTGGATGGCTGTTTCCTGGTACCCTATTTATCATATCCCTAGTCAGAAAAACGACAAAGACTTCGCGACGTGCTTTCTAACGTATCATACATTGTCATCATCTTTTCAAGATTGCTCTATGGATCATGAAGGGAAAGATGGATGTAGTTGTTCCGAATCCGATACCGGGGGTCAAATCGATGGCAAAACTAGTACTTCTACTTCTACTAGCCTTTCTCCATTTGGGTTAGCCACTTATAGAATGCAGGGAGACCTCTGGTTGAAGCCAGACACATCTGACTACGACCGAATCGTCGATCTCTATCATGCAGCTGATTCATGGTTGAAGCAGCTCGGCGTCCAGCATCACGACTTCAACTTCTTTTCCCTGCACTCGAGCATGTAA

Protein sequence

MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYYDEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSRRYDSDMAECESDSWSDDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPFMDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDYDRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM
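
The protein sequence corresponds to a frame-0 translation of the CDS under the standard genetic code. The following Python sketch illustrates this on the first 30 and last 15 nucleotides of the CDS listed above. The helper `translate` and the table construction are illustrative, not the pipeline that produced this annotation.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, with codons enumerated in TCAG order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS, then its last 15 nt (ending in the TAA stop codon).
print(translate("ATGGCTCAAACGGCGCCGTTTAGTTCCTCT"))
print(translate("CACTCGAGCATGTAA"))
```

The two fragments translate to MAQTAPFSSS and HSSM, matching the start and end of the protein sequence above.
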
Homology
BLAST of Cp4.1LG20g02030 vs. NCBI nr
Match: XP_023519160.1 (uncharacterized protein LOC111782609 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023519161.1 uncharacterized protein LOC111782609 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 672 bits (1734), Expect = 6.88e-244
Identity = 327/333 (98.20%), Positives = 327/333 (98.20%), Query Frame = 0

Query: 1   MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60
           MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY
Sbjct: 1   MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60

Query: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS------RRYDSDMAECESDSWS 120
           DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS      RRYDSDMAECESDSWS
Sbjct: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSVPPSRSRRYDSDMAECESDSWS 120

Query: 121 DDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180
           DDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF
Sbjct: 121 DDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180

Query: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240
           MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS
Sbjct: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240

Query: 241 FQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300
           FQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY
Sbjct: 241 FQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300

Query: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM
Sbjct: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 333
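
The Identity figure follows the usual BLAST convention: identical columns divided by the alignment length, with gap columns counted in the length. In the alignment above, the 6 non-identical columns are the gap opposite VPPSRS in the subject. The following Python sketch, with illustrative helper names, recomputes these counts for the one alignment block that contains gaps and then the overall percentage.

```python
def alignment_stats(query_row, sbjct_row):
    """Count identical columns and gap columns in two equal-length aligned rows."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(query_row, sbjct_row))
    identities = sum(q == s and q != "-" for q, s in pairs)
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return identities, gaps, len(pairs)

def percent(count, alignment_length):
    """Percentage rounded to two decimals, as shown in the report."""
    return round(100.0 * count / alignment_length, 2)

# Second block of the alignment above (the only block with gaps).
q = "DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS------RRYDSDMAECESDSWS"
s = "DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSVPPSRSRRYDSDMAECESDSWS"
print(alignment_stats(q, s))
print(percent(327, 333))
```
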

BLAST of Cp4.1LG20g02030 vs. NCBI nr
Match: KAG6584275.1 (hypothetical protein SDJN03_20207, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 668 bits (1724), Expect = 2.30e-242
Identity = 324/333 (97.30%), Positives = 327/333 (98.20%), Query Frame = 0

Query: 1   MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60
           M+QTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY
Sbjct: 1   MSQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60

Query: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS------RRYDSDMAECESDSWS 120
           DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS      RRYDSDMAECESDSWS
Sbjct: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSVPPSRSRRYDSDMAECESDSWS 120

Query: 121 DDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180
           DDSWSDILSRS+SNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF
Sbjct: 121 DDSWSDILSRSLSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180

Query: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240
           MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS
Sbjct: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240

Query: 241 FQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300
           FQDCSMDHEG+DGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY
Sbjct: 241 FQDCSMDHEGQDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300

Query: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM
Sbjct: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 333

BLAST of Cp4.1LG20g02030 vs. NCBI nr
Match: XP_022923969.1 (uncharacterized protein LOC111431528 isoform X1 [Cucurbita moschata] >XP_022923970.1 uncharacterized protein LOC111431528 isoform X1 [Cucurbita moschata])

HSP 1 Score: 662 bits (1709), Expect = 4.46e-240
Identity = 322/333 (96.70%), Positives = 325/333 (97.60%), Query Frame = 0

Query: 1   MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60
           M+QTAPFSSSNLERFL CVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY
Sbjct: 1   MSQTAPFSSSNLERFLHCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60

Query: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS------RRYDSDMAECESDSWS 120
           DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS      RRYDSDMAECESDSWS
Sbjct: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSVPPSRSRRYDSDMAECESDSWS 120

Query: 121 DDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180
           DDSWSDILSRS+SNNSSRTWDAVSEDSIFDQEGSW LREKLGYLSLQYMEMSSPYWRVPF
Sbjct: 121 DDSWSDILSRSLSNNSSRTWDAVSEDSIFDQEGSWLLREKLGYLSLQYMEMSSPYWRVPF 180

Query: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240
           MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS
Sbjct: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240

Query: 241 FQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300
           FQDCSMDHEG+DGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY
Sbjct: 241 FQDCSMDHEGQDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300

Query: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM
Sbjct: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 333

BLAST of Cp4.1LG20g02030 vs. NCBI nr
Match: XP_023001164.1 (uncharacterized protein LOC111495384 isoform X1 [Cucurbita maxima] >XP_023001165.1 uncharacterized protein LOC111495384 isoform X1 [Cucurbita maxima])

HSP 1 Score: 660 bits (1702), Expect = 5.20e-239
Identity = 321/333 (96.40%), Positives = 325/333 (97.60%), Query Frame = 0

Query: 1   MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60
           M+QTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTI YFTLGELWDYY
Sbjct: 1   MSQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIGYFTLGELWDYY 60

Query: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS------RRYDSDMAECESDSWS 120
           DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS      RRYDSDMAECESDSWS
Sbjct: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSVPPSRSRRYDSDMAECESDSWS 120

Query: 121 DDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180
           DDSWSDILSRS+SNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF
Sbjct: 121 DDSWSDILSRSLSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180

Query: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240
           MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS
Sbjct: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240

Query: 241 FQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300
           FQDCSMDHEG+DGCS SESDTGGQIDGKTSTSTSTSLSPFGLATYR+QGDLWLKPDTSDY
Sbjct: 241 FQDCSMDHEGQDGCSYSESDTGGQIDGKTSTSTSTSLSPFGLATYRIQGDLWLKPDTSDY 300

Query: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM
Sbjct: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 333

BLAST of Cp4.1LG20g02030 vs. NCBI nr
Match: KAG7019872.1 (hypothetical protein SDJN02_18837, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 651 bits (1679), Expect = 9.52e-236
Identity = 315/327 (96.33%), Positives = 318/327 (97.25%), Query Frame = 0

Query: 1   MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60
           M+QTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY
Sbjct: 1   MSQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60

Query: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSRRYDSDMAECESDSWSDDSWSD 120
           DEWSAYGAGIPLQLNAFETATQFYVPYLS         RRYDSDMAECESDSWSDDSWSD
Sbjct: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLS---------RRYDSDMAECESDSWSDDSWSD 120

Query: 121 ILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPFMDKIRE 180
           ILSRS+SNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPFMDKIRE
Sbjct: 121 ILSRSLSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPFMDKIRE 180

Query: 181 LTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSM 240
           LTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSM
Sbjct: 181 LTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSM 240

Query: 241 DHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDYDRIVDL 300
           DHEG+DGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDYDRIVDL
Sbjct: 241 DHEGQDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDYDRIVDL 300

Query: 301 YHAADSWLKQLGVQHHDFNFFSLHSSM 327
           YHAADSWLKQLGVQHHDFNFFSLHSSM
Sbjct: 301 YHAADSWLKQLGVQHHDFNFFSLHSSM 318

BLAST of Cp4.1LG20g02030 vs. ExPASy TrEMBL
Match: A0A6J1EDG5 (uncharacterized protein LOC111431528 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111431528 PE=4 SV=1)

HSP 1 Score: 662 bits (1709), Expect = 2.16e-240
Identity = 322/333 (96.70%), Positives = 325/333 (97.60%), Query Frame = 0

Query: 1   MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60
           M+QTAPFSSSNLERFL CVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY
Sbjct: 1   MSQTAPFSSSNLERFLHCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60

Query: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS------RRYDSDMAECESDSWS 120
           DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS      RRYDSDMAECESDSWS
Sbjct: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSVPPSRSRRYDSDMAECESDSWS 120

Query: 121 DDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180
           DDSWSDILSRS+SNNSSRTWDAVSEDSIFDQEGSW LREKLGYLSLQYMEMSSPYWRVPF
Sbjct: 121 DDSWSDILSRSLSNNSSRTWDAVSEDSIFDQEGSWLLREKLGYLSLQYMEMSSPYWRVPF 180

Query: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240
           MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS
Sbjct: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240

Query: 241 FQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300
           FQDCSMDHEG+DGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY
Sbjct: 241 FQDCSMDHEGQDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300

Query: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM
Sbjct: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 333

BLAST of Cp4.1LG20g02030 vs. ExPASy TrEMBL
Match: A0A6J1KKF0 (uncharacterized protein LOC111495384 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495384 PE=4 SV=1)

HSP 1 Score: 660 bits (1702), Expect = 2.52e-239
Identity = 321/333 (96.40%), Positives = 325/333 (97.60%), Query Frame = 0

Query: 1   MAQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYY 60
           M+QTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTI YFTLGELWDYY
Sbjct: 1   MSQTAPFSSSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIGYFTLGELWDYY 60

Query: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS------RRYDSDMAECESDSWS 120
           DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKS      RRYDSDMAECESDSWS
Sbjct: 61  DEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSKSVPPSRSRRYDSDMAECESDSWS 120

Query: 121 DDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180
           DDSWSDILSRS+SNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF
Sbjct: 121 DDSWSDILSRSLSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPF 180

Query: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240
           MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS
Sbjct: 181 MDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSS 240

Query: 241 FQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDY 300
           FQDCSMDHEG+DGCS SESDTGGQIDGKTSTSTSTSLSPFGLATYR+QGDLWLKPDTSDY
Sbjct: 241 FQDCSMDHEGQDGCSYSESDTGGQIDGKTSTSTSTSLSPFGLATYRIQGDLWLKPDTSDY 300

Query: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM
Sbjct: 301 DRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 333

BLAST of Cp4.1LG20g02030 vs. ExPASy TrEMBL
Match: A0A5D3BL53 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002890 PE=4 SV=1)

HSP 1 Score: 604 bits (1558), Expect = 2.80e-217
Identity = 296/339 (87.32%), Positives = 309/339 (91.15%), Query Frame = 0

Query: 1   MAQTAPFS------SSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLG 60
           MAQT PFS      +SNLERFLQCVTP VPWRTLPRSCLHDLNSQWQQPDKDT++YFTLG
Sbjct: 1   MAQTTPFSFTDHKTTSNLERFLQCVTPSVPWRTLPRSCLHDLNSQWQQPDKDTVQYFTLG 60

Query: 61  ELWDYYDEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSK------SRRYDSDMAEC 120
           ELWDYYDEWSAYGAGIP+QLN  ETATQFYVPYLSAIQIYTSK      +RRYD DMAEC
Sbjct: 61  ELWDYYDEWSAYGAGIPIQLNDSETATQFYVPYLSAIQIYTSKPVAPSRNRRYDIDMAEC 120

Query: 121 ESDSWSDDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSP 180
           ESDSWSDDSWSD +SRS+SNNSSRTWDAVSEDS FDQEGSWPLREKLGYLSLQYMEMSSP
Sbjct: 121 ESDSWSDDSWSDNMSRSLSNNSSRTWDAVSEDSSFDQEGSWPLREKLGYLSLQYMEMSSP 180

Query: 181 YWRVPFMDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTY 240
           YWRVPFMDKI ELTQ YPAL TLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTY
Sbjct: 181 YWRVPFMDKITELTQTYPALNTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTY 240

Query: 241 HTLSSSFQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLK 300
           HTLSSSFQDCSM+HEGKDGCS S+S+TGGQ   KTS STST LSPFGLATYRMQGDLWL 
Sbjct: 241 HTLSSSFQDCSMNHEGKDGCSGSKSETGGQTYAKTSASTSTCLSPFGLATYRMQGDLWLM 300

Query: 301 PDTSDYDRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           P+TSDY+RIVDLYHAADSWLKQLGV HHDFNFFSLHSS+
Sbjct: 301 PETSDYERIVDLYHAADSWLKQLGVHHHDFNFFSLHSSL 339

BLAST of Cp4.1LG20g02030 vs. ExPASy TrEMBL
Match: A0A1S3ATR9 (uncharacterized protein LOC103482746 OS=Cucumis melo OX=3656 GN=LOC103482746 PE=4 SV=1)

HSP 1 Score: 604 bits (1558), Expect = 2.80e-217
Identity = 296/339 (87.32%), Positives = 309/339 (91.15%), Query Frame = 0

Query: 1   MAQTAPFS------SSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLG 60
           MAQT PFS      +SNLERFLQCVTP VPWRTLPRSCLHDLNSQWQQPDKDT++YFTLG
Sbjct: 1   MAQTTPFSFTDHKTTSNLERFLQCVTPSVPWRTLPRSCLHDLNSQWQQPDKDTVQYFTLG 60

Query: 61  ELWDYYDEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSK------SRRYDSDMAEC 120
           ELWDYYDEWSAYGAGIP+QLN  ETATQFYVPYLSAIQIYTSK      +RRYD DMAEC
Sbjct: 61  ELWDYYDEWSAYGAGIPIQLNDSETATQFYVPYLSAIQIYTSKPVAPSRNRRYDIDMAEC 120

Query: 121 ESDSWSDDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSP 180
           ESDSWSDDSWSD +SRS+SNNSSRTWDAVSEDS FDQEGSWPLREKLGYLSLQYMEMSSP
Sbjct: 121 ESDSWSDDSWSDNMSRSLSNNSSRTWDAVSEDSSFDQEGSWPLREKLGYLSLQYMEMSSP 180

Query: 181 YWRVPFMDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTY 240
           YWRVPFMDKI ELTQ YPAL TLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTY
Sbjct: 181 YWRVPFMDKITELTQTYPALNTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTY 240

Query: 241 HTLSSSFQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLK 300
           HTLSSSFQDCSM+HEGKDGCS S+S+TGGQ   KTS STST LSPFGLATYRMQGDLWL 
Sbjct: 241 HTLSSSFQDCSMNHEGKDGCSGSKSETGGQTYAKTSASTSTCLSPFGLATYRMQGDLWLM 300

Query: 301 PDTSDYDRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           P+TSDY+RIVDLYHAADSWLKQLGV HHDFNFFSLHSS+
Sbjct: 301 PETSDYERIVDLYHAADSWLKQLGVHHHDFNFFSLHSSL 339

BLAST of Cp4.1LG20g02030 vs. ExPASy TrEMBL
Match: A0A0A0LW14 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G044320 PE=4 SV=1)

HSP 1 Score: 603 bits (1556), Expect = 5.64e-217
Identity = 294/339 (86.73%), Positives = 309/339 (91.15%), Query Frame = 0

Query: 1   MAQTAPFS------SSNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLG 60
           MAQT PFS      +SNLERFL CVTP VPWRTLPRSCLHDLNSQWQQPDKDT++YFTLG
Sbjct: 1   MAQTTPFSFTNHKTTSNLERFLHCVTPSVPWRTLPRSCLHDLNSQWQQPDKDTVQYFTLG 60

Query: 61  ELWDYYDEWSAYGAGIPLQLNAFETATQFYVPYLSAIQIYTSK------SRRYDSDMAEC 120
           ELWDYYDEWSAYGAGIP+QLN  ETATQFYVPYLSAIQIYTSK      +RRYDSDMAEC
Sbjct: 61  ELWDYYDEWSAYGAGIPIQLNDLETATQFYVPYLSAIQIYTSKPVAPSRNRRYDSDMAEC 120

Query: 121 ESDSWSDDSWSDILSRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSP 180
           ESDSWSDDSWSD +SRS+SNNSSRTWDAVSEDS FDQEGSWPLREKLGYLSLQYME SSP
Sbjct: 121 ESDSWSDDSWSDNMSRSLSNNSSRTWDAVSEDSSFDQEGSWPLREKLGYLSLQYMETSSP 180

Query: 181 YWRVPFMDKIRELTQNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTY 240
           YWRVPFMDKI ELTQ YPALTTLRSVDLSPASWMA+SWYPIYHIPSQKNDKDFATCFLTY
Sbjct: 181 YWRVPFMDKITELTQTYPALTTLRSVDLSPASWMAISWYPIYHIPSQKNDKDFATCFLTY 240

Query: 241 HTLSSSFQDCSMDHEGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLK 300
           HTLSSSFQDCSM+HEG+DGCS SE++TGGQ   KTS STST LSPFGLATYRMQGDLWL 
Sbjct: 241 HTLSSSFQDCSMNHEGQDGCSGSETETGGQTYAKTSASTSTCLSPFGLATYRMQGDLWLM 300

Query: 301 PDTSDYDRIVDLYHAADSWLKQLGVQHHDFNFFSLHSSM 327
           P+TSDY+RIVDLYHAADSWLKQLGV HHDFNFFSLHSS+
Sbjct: 301 PETSDYERIVDLYHAADSWLKQLGVHHHDFNFFSLHSSL 339

BLAST of Cp4.1LG20g02030 vs. TAIR 10
Match: AT1G17830.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 322.0 bits (824), Expect = 5.5e-88
Identity = 172/323 (53.25%), Positives = 225/323 (69.66%), Query Frame = 0

Query: 10  SNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYYDEWSAYGAG 69
           SNLERFL+ +TP  P  +L +SC +DLNS W   +KD I YF L +LWD +DE SAYG G
Sbjct: 18  SNLERFLRGITPKPPSFSLSQSCKNDLNSLWIHENKDEIEYFRLSDLWDCFDEPSAYGLG 77

Query: 70  IPLQLNAFETATQFYVPYLSAIQIYTSKS---RRYDSDMAECESDSWSDDSWSDILSRSV 129
             + LN  E+  Q+YVPYLSAIQIYT+KS    R  SD+ +CES+ WSDDS  + LSRS+
Sbjct: 78  SKVDLNNGESVMQYYVPYLSAIQIYTNKSTAISRIHSDVVDCESECWSDDSEIEKLSRSM 137

Query: 130 SNNSSRTWDAVSEDSIFDQEGSWPL-REKLGYLSLQYMEMSSPYWRVPFMDKIRELTQNY 189
           S+ SS+ WD+VS+DS ++ +G+  L R+KLG +  QY E   P+ RVP   K+ EL + Y
Sbjct: 138 SSGSSKIWDSVSDDSGYEIDGTSSLMRDKLGSIDFQYFESVKPHLRVPLTAKVNELAEKY 197

Query: 190 PALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSMDHEGK 249
           P L+TLRSVDLSPASW+A++WYPIYHIPS+K DKD +TCFL+YHTLSS+FQ   +  EG 
Sbjct: 198 PGLSTLRSVDLSPASWLAIAWYPIYHIPSRKTDKDLSTCFLSYHTLSSAFQGNLI--EGD 257

Query: 250 DGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDYDRIVDLYHAAD 309
           D  + +  +     D +   + S  L+PFGL +Y++QGDLW   +  D  RIV L  AAD
Sbjct: 258 DEINETMKEETLCFD-EGPVTKSIPLAPFGLVSYKLQGDLWRNQECGDQGRIVYLRSAAD 317

Query: 310 SWLKQLGVQ-HHDFNFFSLHSSM 328
           SWLKQL VQ HHD +FFS++ S+
Sbjct: 318 SWLKQLNVQDHHDHSFFSMNMSL 337

BLAST of Cp4.1LG20g02030 vs. TAIR 10
Match: AT4G03420.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 306.6 bits (784), Expect = 2.4e-83
Identity = 165/322 (51.24%), Positives = 211/322 (65.53%), Query Frame = 0

Query: 10  SNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYYDEWSAYGAG 69
           SNL+RFL C TP VP ++L ++ +  LN  W   ++  + +F L +LWD YDEWSAYGAG
Sbjct: 7   SNLDRFLHCTTPVVPPQSLSKAEIRSLNRIWHPWERQKVEFFRLSDLWDCYDEWSAYGAG 66

Query: 70  IPLQLNAFETATQFYVPYLSAIQIYTSKSR----RYDSDMAECE---SDSWSDDSWSDIL 129
           +P++L+  E+  Q+YVPYLSAIQI+TS+S     R DS+  E     SDS+SD+S SD L
Sbjct: 67  VPIRLSNGESLVQYYVPYLSAIQIFTSRSSLIRLRDDSEDGESRDSFSDSYSDESESDKL 126

Query: 130 SRSVSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPFMDKIRELT 189
           SR  S+      + +  D++          ++LGYL LQY E S+PY RVP MDKI EL 
Sbjct: 127 SRCASD------EGLEHDALLHP------NDRLGYLYLQYFERSAPYARVPLMDKINELA 186

Query: 190 QNYPALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSMDH 249
           Q YP L +LRSVDLSPASWMAV+WYPIYHIP  +  KD +TCFLTYHTLSSSFQD  M+ 
Sbjct: 187 QRYPGLMSLRSVDLSPASWMAVAWYPIYHIPMGRTIKDLSTCFLTYHTLSSSFQD--MEP 246

Query: 250 EGKDGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDT--SDYDRIVDL 309
           E          + GG+ +         +L PFGLATY+MQG++WL  D    D +R++ L
Sbjct: 247 E----------ENGGEKERIRKEGEGVTLLPFGLATYKMQGNVWLSEDDQGQDQERVLSL 304

Query: 310 YHAADSWLKQLGVQHHDFNFFS 323
              ADSWLKQL VQHHDFN+FS
Sbjct: 307 LSVADSWLKQLRVQHHDFNYFS 304

BLAST of Cp4.1LG20g02030 vs. TAIR 10
Match: AT1G03610.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 303.9 bits (777), Expect = 1.5e-82
Identity = 159/316 (50.32%), Positives = 202/316 (63.92%), Query Frame = 0

Query: 10  SNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYYDEWSAYGAG 69
           SNL+RFL C+TP VP ++LP++ +  LN  W   ++  + +F L +LWD YDEWSAYGA 
Sbjct: 11  SNLDRFLHCITPLVPPQSLPKTEIRTLNRLWHPWERQKVEFFRLSDLWDCYDEWSAYGAS 70

Query: 70  IPLQLNAFETATQFYVPYLSAIQIYTSKSR----RYDSDMAECESDSWSDDSWSDILSRS 129
           +P+ +   E+  Q+YVPYLSAIQI+TS S     R +S+  ECE      DS SD     
Sbjct: 71  VPIHVTNGESLVQYYVPYLSAIQIFTSHSSLIRLREESEDGECEGRDPFSDSGSD----- 130

Query: 130 VSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPFMDKIRELTQNY 189
                    ++VSE+ + +     P  ++LGYL LQY E S+PY RVP MDKI EL Q Y
Sbjct: 131 ---------ESVSEEGLENNTLLHP-SDRLGYLYLQYFERSAPYTRVPLMDKINELAQRY 190

Query: 190 PALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSMDHEGK 249
           P L +LRSVDLSPASWM+V+WYPIYHIP  +  KD +TCFLTYHTLSSSFQD  M+ E  
Sbjct: 191 PGLMSLRSVDLSPASWMSVAWYPIYHIPMGRTIKDLSTCFLTYHTLSSSFQD--MEPE-- 250

Query: 250 DGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDYDRIVDLYHAAD 309
                   + GG  +         +L PFG+ATY+MQGD+WL  D  D +R+  LY  AD
Sbjct: 251 --------ENGGDKERVRREGEDITLLPFGMATYKMQGDVWLSQDHDDQERLASLYSVAD 299

Query: 310 SWLKQLGVQHHDFNFF 322
           SWLKQL VQHHDFN+F
Sbjct: 311 SWLKQLRVQHHDFNYF 299

BLAST of Cp4.1LG20g02030 vs. TAIR 10
Match: AT1G73210.1 (Protein of unknown function (DUF789) )

HSP 1 Score: 303.1 bits (775), Expect = 2.6e-82
Identity = 169/321 (52.65%), Positives = 214/321 (66.67%), Query Frame = 0

Query: 10  SNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYYDEWSAYGAG 69
           SNLERFL  +TP  P  +LP           Q+  K+ I YF LG+LWD YDE SAYG G
Sbjct: 12  SNLERFLLGITPKPPSFSLP-----------QEQGKEEIEYFRLGDLWDCYDEMSAYGFG 71

Query: 70  IPLQLNAFETATQFYVPYLSAIQIYTSKS---RRYDSDMAECE-SDSWSDDSWSDILSRS 129
             + LN  ET  Q+YVPYLSAIQI+T+K     R  +++AE E S+ WSD     +LSRS
Sbjct: 72  TQVDLNNGETVMQYYVPYLSAIQIHTNKPALLSRNQNEVAESESSEGWSDSESEKLLSRS 131

Query: 130 VSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPFMDKIRELTQNY 189
           +SN+SS+TWDAVSEDS+FD +GS  L+++LG L  +Y+E   P+ R+P  DKI  L + Y
Sbjct: 132 MSNDSSKTWDAVSEDSVFDPDGSPLLKDRLGNLDFKYIERDPPHKRIPLTDKINVLVEKY 191

Query: 190 PALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSMDHEGK 249
           P L TLRSVD+SPASWMAV+WYPIYHIP+ +N+KD  T FLTYHTLSSSFQD  ++ +  
Sbjct: 192 PGLMTLRSVDMSPASWMAVAWYPIYHIPTCRNEKDLTTGFLTYHTLSSSFQDNVVEGDQS 251

Query: 250 DGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDYDRIVDLYHAAD 309
           +    +E      I+ +        L PFG+ TY+MQGDLW K    D DR++ L  AAD
Sbjct: 252 NNNEETEFCEDSVINKR------MPLPPFGVTTYKMQGDLWGKTG-FDQDRLLYLQSAAD 311

Query: 310 SWLKQLGVQHHDFNFFSLHSS 327
           SWLKQL V HHD+NFF L+SS
Sbjct: 312 SWLKQLNVDHHDYNFF-LNSS 313

BLAST of Cp4.1LG20g02030 vs. TAIR 10
Match: AT1G73210.2 (Protein of unknown function (DUF789) )

HSP 1 Score: 300.8 bits (769), Expect = 1.3e-81
Identity = 168/321 (52.34%), Positives = 213/321 (66.36%), Query Frame = 0

Query: 10  SNLERFLQCVTPCVPWRTLPRSCLHDLNSQWQQPDKDTIRYFTLGELWDYYDEWSAYGAG 69
           SNLERFL  +TP  P  +LP+              K+ I YF LG+LWD YDE SAYG G
Sbjct: 12  SNLERFLLGITPKPPSFSLPQG-------------KEEIEYFRLGDLWDCYDEMSAYGFG 71

Query: 70  IPLQLNAFETATQFYVPYLSAIQIYTSKS---RRYDSDMAECE-SDSWSDDSWSDILSRS 129
             + LN  ET  Q+YVPYLSAIQI+T+K     R  +++AE E S+ WSD     +LSRS
Sbjct: 72  TQVDLNNGETVMQYYVPYLSAIQIHTNKPALLSRNQNEVAESESSEGWSDSESEKLLSRS 131

Query: 130 VSNNSSRTWDAVSEDSIFDQEGSWPLREKLGYLSLQYMEMSSPYWRVPFMDKIRELTQNY 189
           +SN+SS+TWDAVSEDS+FD +GS  L+++LG L  +Y+E   P+ R+P  DKI  L + Y
Sbjct: 132 MSNDSSKTWDAVSEDSVFDPDGSPLLKDRLGNLDFKYIERDPPHKRIPLTDKINVLVEKY 191

Query: 190 PALTTLRSVDLSPASWMAVSWYPIYHIPSQKNDKDFATCFLTYHTLSSSFQDCSMDHEGK 249
           P L TLRSVD+SPASWMAV+WYPIYHIP+ +N+KD  T FLTYHTLSSSFQD  ++ +  
Sbjct: 192 PGLMTLRSVDMSPASWMAVAWYPIYHIPTCRNEKDLTTGFLTYHTLSSSFQDNVVEGDQS 251

Query: 250 DGCSCSESDTGGQIDGKTSTSTSTSLSPFGLATYRMQGDLWLKPDTSDYDRIVDLYHAAD 309
           +    +E      I+ +        L PFG+ TY+MQGDLW K    D DR++ L  AAD
Sbjct: 252 NNNEETEFCEDSVINKR------MPLPPFGVTTYKMQGDLWGKTG-FDQDRLLYLQSAAD 311

Query: 310 SWLKQLGVQHHDFNFFSLHSS 327
           SWLKQL V HHD+NFF L+SS
Sbjct: 312 SWLKQLNVDHHDYNFF-LNSS 311

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value    Identity  Description
XP_023519160.1  6.88e-244  98.20     uncharacterized protein LOC111782609 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
KAG6584275.1    2.30e-242  97.30     hypothetical protein SDJN03_20207, partial [Cucurbita argyrosperma subsp. sorori...
XP_022923969.1  4.46e-240  96.70     uncharacterized protein LOC111431528 isoform X1 [Cucurbita moschata] >XP_0229239...
XP_023001164.1  5.20e-239  96.40     uncharacterized protein LOC111495384 isoform X1 [Cucurbita maxima] >XP_023001165...
KAG7019872.1    9.52e-236  96.33     hypothetical protein SDJN02_18837, partial [Cucurbita argyrosperma subsp. argyro...

ExPASy TrEMBL
Match Name      E-value    Identity  Description
A0A6J1EDG5      2.16e-240  96.70     uncharacterized protein LOC111431528 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1KKF0      2.52e-239  96.40     uncharacterized protein LOC111495384 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A5D3BL53      2.80e-217  87.32     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3ATR9      2.80e-217  87.32     uncharacterized protein LOC103482746 OS=Cucumis melo OX=3656 GN=LOC103482746 PE=...
A0A0A0LW14      5.64e-217  86.73     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G044320 PE=4 SV=1

TAIR 10
Match Name      E-value    Identity  Description
AT1G17830.1     5.5e-88    53.25     Protein of unknown function (DUF789)
AT4G03420.1     2.4e-83    51.24     Protein of unknown function (DUF789)
AT1G03610.1     1.5e-82    50.32     Protein of unknown function (DUF789)
AT1G73210.1     2.6e-82    52.65     Protein of unknown function (DUF789)
AT1G73210.2     1.3e-81    52.34     Protein of unknown function (DUF789)

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source       Source Term     Source Description    Alignment
IPR008507  Protein of unknown function DUF789  PFAM         PF05623         DUF789                coord: 10..321; e-value: 3.7E-98; score: 328.8
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 246..270
None       No IPR available                    MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 251..270
None       No IPR available                    PANTHER      PTHR31343:SF29  PLANT/F9H3-4 PROTEIN  coord: 8..326
None       No IPR available                    PANTHER      PTHR31343       T15D22.8              coord: 8..326

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g02030.1  Cp4.1LG20g02030.1  mRNA