Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
CGAAGAAATATGGTCAAGATTTTAAAGCTATCTATAGAAATTAAGGCATAATTATGAAATGAAGGACTTGTTAGAATTTTTGTTGGAGTAGAGAGATCAACTGATCAAGCGCTACGGGGTTTGTGTAGCGGGTCACAGTTACGGGGCGTGCGGATCTACCATGACCCGAACTGGATCGTTCTCATCTAGCATCTCATTTGCTCGAGACGTTGCGGGCTTCGATTTCAATCTGGAGAGGATAACTTGACCATTACAGAGAAAAAAATCGTATAAGGTACAAAGATATTTTGTTCTTGAAATTTTTTGCGTTTGATCGAAACTCTTGCTATACCTGCAATACTACTGTTACGTGTCGTTCTACATGAGATGTTATCGGTCTAGAGCGATGATTTGGCTCGTAGTTGCTCGTCTGCCGCCTCTTCGTCAATGATTTCACGTTTTTCCATAGCTATTTTTATCTGTTAAGGTATTAAAACTGGCGTTAGTGCAAGAGTTGGTGTGTATAATCTTCAGAGAGTCAGCTAGAGGTATCTGGATATGGCGTCACGGCCACTTCAGTTTCAAGAACACTTAAAATTTTGGTCTGTTTTTGATTTGGTTTAAATTATCTGAGTTTGAGAATGCTGTTCCCGAATACTTGGGTTGCGTGGTAGGTTGCAAAAGGAATAATGTTGTATACGAGCTGCCACCATTAGAGAGTGCTTGAAGGCACAGTAGGACCTAAATATGTTTGTTGTAGAGAGGGGCTGGGGTGTTGGTGTTGGTCTTGGGATTGGGAATGGTTTGTTTTCAAATTTTGATTGATATGATCTTGAAAATGTCACTTGAACTTGGGAATGGTTGGATTTCAAATTTTGGTTAAAATTTGGTGGTTTTTACAGGTGGGCGTAACAATGAGACAGCAGGGTCAATATTCTGATTCAGGACTTGGCTCATATTCAGTTTCTCAAATGCATCAAGTTCCCAGTCAAAGGATGGAACAAAGCCACCCTGATCCTTTTGAAGGACGGCTGGAAGCCTTCACTCCAGAGAGAGAGAATTCATATCTAGCTTCAAAAAATGAGGATCAATGGAGATGGGAAAGAGATGAGTCGAAGATGCCAAATTCCATGGCCTCTCACATGTTCAACGAAGGCAAGCATTAATAGTGTGTGTATTGCATCCATCACTTATTGTTGCTTGCACCTGTTTTCTTATGAAGTGTTTTATTCTCTTGTACATTACAATATCCCTTTCCTAGACTATGTAAAATAAACCATATTAGGTACTTGTTTATTGATATCAACCTCTTGGTCGATGGCGCAGTTGAGCTATGTTTAGGTTGACTCATTTGATTCTTTTGTTGGTTCTAGTCTACTTGGTTATATGATCTGAAATTCTCACCTGTGCTCCCTAAGCTACATTGAGTATTTTCCTTGGGAAACTGAAAAGAGAAATTAACAACATAACTATAATTGCCTTATGGCATTTACACTCTTACAAAGTTTAATTTCACATGATATAGTTCTTGGATATAAGATGATTAGTCAGACCGTTAACGAAAAAGAAATCGTACTATTAATTCTTTTAGATTCATCATTCCCCATTCAAAAGCTATCAAGGTCAGATGATAGTTGAATTAGCTAGGTATCCACACATGTGGTTATCAAGAATAAAGCGATTAATTGATGCCAAAACAGTATCCATTTCATAATGGTTGGGTGATGTGATATTCCTCATAGTATCAAGCTGTTAATTGTTTATTGACAGAGAGAATTTCTATTCATTGTTATTCTATTTCTTTTTTACCCTTTCCCTTGTTTATTTTCCCTTTAAGATGTTTATATACTTCTCCCATGCAAAGAATTTCTGCCTTCTCTTGTTAAACGAAGTTTTCAGAATTTCTGGACTGAGAAATATGCTTGTTTAAACATAGGTCAAGGGGGTGATCTCAGAAGATCATACTTTCAAGGTCAGAGACCAAATCCAAAATTGGTTCTTGAGAAAGGAAGCAACAGCGA
TCTCAGATTTCAATCCCATGGAAAAAACATGGAAAGTAGGTTTGGAGATGGCCTGCTGCCACAGAATTTTGACGGTCTCGAGCAGAAATTCATTGATGACATCATTAACTTTTCCAAGGAACAAAATGATGCAGAGGATGAGGAAAATGCTCGACATAGAGAGGTATATTTGCACATTCCTATTTCAGTACTTGCATTCTCGTTTTTCACATCATCTTTACTGAACATACTTCATTTTGATGCATATCTTCAGAGAATTATCGCTATCAATTCTCAGTACGAGGAGCAATTAGCAGCACTTCGAGCTCAGCATGCTGGTCGTCGTGATGAGCTGCTACGAAGGGAATCAAGTGCACGGCACCATCAGTATCAGAAGGGAATAAGGGACCATTACCCCAATGGGGGCATTGGCCCAGGTAATCCTCGTGGCAGTAGTGGAGTTACAACCTTAGCTGCTTCTGGACGAGCACATCAAAATTATGAATCTGAACACTTCAATTCCTACAGAGAATGAACTCGGTTTCTTGGGAATTCTATGCGGAATTCTAATTTGGATTCGAGTTTATGACACTGGCTCACGGTACTACTAATCAAACTTGCTTAATTTTCAGTATAAAATTACGCATTTTTACATTGTCCGATTAGGCTTCCTTAACAAACCTTTAAGCTGTTTTTTCATCCTAAAGTTATGCACTTTTCTGTTGCACACCCTTCTCTATAGAGCCCATGATCTATGGTATCTCTGGATCCCAGTTTTTGCCGACTTGAACATATCAATGTTTAAGTTGTAGGATCCTCGAATTTTTAGGAGCTTTGGAGTTGGAGATAGTGGCTTACCATGATCTGATTGATGAGCTTGTATCGGAATTACGTTGCCACTTTATATAAAAGCTTCTTCTTTAATGCTTTACATAAAAGTTTTCTTTATGTGAATTTGTATGGTTTCCTTTCCTCACTGGTTCACACTCTCTTGATTTGCGGACTCCAACTGTTAAGAATATATATATATAATCTTGCGGTAGAACTGATTAAACTTATTATTTAATATAGAGTTGTCTTCTAATCCTAGTTCAAGACTATATTCAATGTACTAGTTTATAAATGAGGTATTTTTCTGTGTGATGATGTAACAGCTTAAGCCTACCGTTAGCAATATTGTTCTTTTTGGGCTTCCCCTCAAGATTTTTAAAGCACGTTTGTTAGGGAGAGGTTTCCACACCATTATAAAGATTGATTTGTTTCCCTCTCCAACGATGTAAGATCTCATAGATGATCTTCAATCTTCATTTAAACAAAAAATCATGTTATTCATAAGCGATTAGAACACTCATACTTTTGCCCTTTCTACCAGCTTAGCAACTTGAGTATTGTACAATCCAGTAGTTAATACATTTATACTCCTACTGTGACCAGAAGTATAAAGATGGTGTAAAAGAATATGCATATATATACATGAGTTTCTTTTCAGGAAAGCATTTATTCAATGTATAATGCTTTGTTTTCTTATAAATATTCTTTGGTGGTTACTTTAGGTTAGGTTCCAATGGCAACATCACGTGGCATGTGAGCACATATCCATTGCCTCCAGATCTTGACCAGCAATTGCAATACGGTGGTCTCACTTTTCTGTCTCTCTTCTTTTGTGTTATTATTATCAATGATATCTGTTAGGGCCTTGCTCTGAAACTGCCTGCTCAATATTCACAGCATACACACTTTTACTTGCTGGGAAAAAGAAATCATGTTGTTCTTCGCTTTAATATTTCACTTTGTTAGTGATTCTGTTAGAATTGGATGCCATTCCCTTTCTCATTCGCCTGGGTTTGGTTTGGTTTCCTTGAAGCCAAATAGAATCCTACCTTCATTCCCAAATAAAGTTACTGGTGTTTAAG
mRNA sequence
CGAAGAAATATGGTCAAGATTTTAAAGCTATCTATAGAAATTAAGGCATAATTATGAAATGAAGGACTTGTTAGAATTTTTGTTGGAGTAGAGAGATCAACTGATCAAGCGCTACGGGGTTTGTGTAGCGGGTCACAGTTACGGGGCGTGCGGATCTACCATGACCCGAACTGGATCGTTCTCATCTAGCATCTCATTTGCTCGAGACGTTGCGGGCTTCGATTTCAATCTGGAGAGGATAACTTGACCATTACAGAGAAAAAAATCGTATAAGGTACAAAGATATTTTGTTCTTGAAATTTTTTGCGTTTGATCGAAACTCTTGCTATACCTGCAATACTACTGTTACGTGTCGTTCTACATGAGATGTTATCGGTCTAGAGCGATGATTTGGCTCGTAGTTGCTCGTCTGCCGCCTCTTCGTCAATGATTTCACGTTTTTCCATAGCTATTTTTATCTGTTAAGGTGGGCGTAACAATGAGACAGCAGGGTCAATATTCTGATTCAGGACTTGGCTCATATTCAGTTTCTCAAATGCATCAAGTTCCCAGTCAAAGGATGGAACAAAGCCACCCTGATCCTTTTGAAGGACGGCTGGAAGCCTTCACTCCAGAGAGAGAGAATTCATATCTAGCTTCAAAAAATGAGGATCAATGGAGATGGGAAAGAGATGAGTCGAAGATGCCAAATTCCATGGCCTCTCACATGTTCAACGAAGGTCAAGGGGGTGATCTCAGAAGATCATACTTTCAAGGTCAGAGACCAAATCCAAAATTGGTTCTTGAGAAAGGAAGCAACAGCGATCTCAGATTTCAATCCCATGGAAAAAACATGGAAAGTAGGTTTGGAGATGGCCTGCTGCCACAGAATTTTGACGGTCTCGAGCAGAAATTCATTGATGACATCATTAACTTTTCCAAGGAACAAAATGATGCAGAGGATGAGGAAAATGCTCGACATAGAGAGAGAATTATCGCTATCAATTCTCAGTACGAGGAGCAATTAGCAGCACTTCGAGCTCAGCATGCTGGTCGTCGTGATGAGCTGCTACGAAGGGAATCAAGTGCACGGCACCATCAGTATCAGAAGGGAATAAGGGACCATTACCCCAATGGGGGCATTGGCCCAGAGAATGAACTCGGTTTCTTGGGAATTCTATGCGGAATTCTAATTTGGATTCGAGTTTATGACACTGGCTCACGGTTAGGTTCCAATGGCAACATCACGTGGCATGTGAGCACATATCCATTGCCTCCAGATCTTGACCAGCAATTGCAATACGGTGGTCTCACTTTTCTGTCTCTCTTCTTTTGTGTTATTATTATCAATGATATCTGTTAGGGCCTTGCTCTGAAACTGCCTGCTCAATATTCACAGCATACACACTTTTACTTGCTGGGAAAAAGAAATCATGTTGTTCTTCGCTTTAATATTTCACTTTGTTAGTGATTCTGTTAGAATTGGATGCCATTCCCTTTCTCATTCGCCTGGGTTTGGTTTGGTTTCCTTGAAGCCAAATAGAATCCTACCTTCATTCCCAAATAAAGTTACTGGTGTTTAAG
Coding sequence (CDS)
ATGAGACAGCAGGGTCAATATTCTGATTCAGGACTTGGCTCATATTCAGTTTCTCAAATGCATCAAGTTCCCAGTCAAAGGATGGAACAAAGCCACCCTGATCCTTTTGAAGGACGGCTGGAAGCCTTCACTCCAGAGAGAGAGAATTCATATCTAGCTTCAAAAAATGAGGATCAATGGAGATGGGAAAGAGATGAGTCGAAGATGCCAAATTCCATGGCCTCTCACATGTTCAACGAAGGTCAAGGGGGTGATCTCAGAAGATCATACTTTCAAGGTCAGAGACCAAATCCAAAATTGGTTCTTGAGAAAGGAAGCAACAGCGATCTCAGATTTCAATCCCATGGAAAAAACATGGAAAGTAGGTTTGGAGATGGCCTGCTGCCACAGAATTTTGACGGTCTCGAGCAGAAATTCATTGATGACATCATTAACTTTTCCAAGGAACAAAATGATGCAGAGGATGAGGAAAATGCTCGACATAGAGAGAGAATTATCGCTATCAATTCTCAGTACGAGGAGCAATTAGCAGCACTTCGAGCTCAGCATGCTGGTCGTCGTGATGAGCTGCTACGAAGGGAATCAAGTGCACGGCACCATCAGTATCAGAAGGGAATAAGGGACCATTACCCCAATGGGGGCATTGGCCCAGAGAATGAACTCGGTTTCTTGGGAATTCTATGCGGAATTCTAATTTGGATTCGAGTTTATGACACTGGCTCACGGTTAGGTTCCAATGGCAACATCACGTGGCATGTGAGCACATATCCATTGCCTCCAGATCTTGACCAGCAATTGCAATACGGTGGTCTCACTTTTCTGTCTCTCTTCTTTTGTGTTATTATTATCAATGATATCTGTTAG
Protein sequence
MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGILCGILIWIRVYDTGSRLGSNGNITWHVSTYPLPPDLDQQLQYGGLTFLSLFFCVIIINDIC
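The relationship between the coding sequence and the protein sequence above can be checked with a short script. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the variable names are illustrative and not part of the database record. Only the first 30 and last 12 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
cds_start = "ATGAGACAGCAGGGTCAATATTCTGATTCA"
cds_end = "GATATCTGTTAG"

print(translate(cds_start))  # MRQQGQYSDS
print(translate(cds_end))    # DIC
```

The two outputs correspond to the first ten and last three residues of the protein sequence shown above.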
Homology
BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match:
XP_023512046.1 (uncharacterized protein LOC111776883 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 444 bits (1141), Expect = 1.48e-155
Identity = 220/226 (97.35%), Positives = 221/226 (97.79%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW
Sbjct: 1 MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
Query: 61 RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME
Sbjct: 61 RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR
Sbjct: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N G G+
Sbjct: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPGNPRGSSGV 226
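The identity figure reported for this hit can be reproduced from the alignment itself. The sketch below is a hedged illustration in Python, not the code BLAST uses: it counts identical positions in the final alignment block (residues 181 to 226) and adds the three preceding 60-residue blocks, which are fully identical according to their midlines. The helper name `count_identities` is hypothetical.

```python
def count_identities(query, subject):
    """Count positions where two aligned strings carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Final alignment block of the XP_023512046.1 hit (residues 181-226).
last_query = "AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI"
last_sbjct = "AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPGNPRGSSGV"

# The three earlier blocks (60 residues each) are identical in query and subject.
identical_in_full_blocks = 3 * 60
identities = identical_in_full_blocks + count_identities(last_query, last_sbjct)
alignment_length = 226

percent_identity = round(100 * identities / alignment_length, 2)
print(identities, alignment_length, percent_identity)  # 220 226 97.35
```

The result agrees with the reported "Identity = 220/226 (97.35%)".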
BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match:
XP_022986988.1 (uncharacterized protein LOC111484575 [Cucurbita maxima])
HSP 1 Score: 429 bits (1104), Expect = 6.40e-150
Identity = 214/226 (94.69%), Positives = 217/226 (96.02%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
MRQQ QYSDSGLGSYSVSQMHQVPSQR+EQSHPDPFEGRLEAFTPERENSY+ASKNEDQW
Sbjct: 1 MRQQRQYSDSGLGSYSVSQMHQVPSQRIEQSHPDPFEGRLEAFTPERENSYIASKNEDQW 60
Query: 61 RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
R ERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPK VLEKGSNSDLRFQSHGKNME
Sbjct: 61 RRERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKFVLEKGSNSDLRFQSHGKNME 120
Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
SRFGDGLLPQNFDGLEQKFIDDIINFSKEQND EDEENARHRERIIAINSQYEEQLAALR
Sbjct: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDVEDEENARHRERIIAINSQYEEQLAALR 180
Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N G G+
Sbjct: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPGNPRGNSGV 226
BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match:
KAG6570459.1 (hypothetical protein SDJN03_29374, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 420 bits (1080), Expect = 1.36e-146
Identity = 206/209 (98.56%), Positives = 207/209 (99.04%), Query Frame = 0
Query: 79 NEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQK 138
N GQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQK
Sbjct: 20 NIGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQK 79
Query: 139 FIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSAR 198
FIDDIINFSKEQ+DAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSAR
Sbjct: 80 FIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSAR 139
Query: 199 HHQYQKGIRDHYPNGGIGPENELGFLGILCGILIWIRVYDTGSRLGSNGNITWHVSTYPL 258
HHQYQKGIRDHYPNGGIGPENELGFLGIL GILIWIRVYDTGSRLGSNGNITWHVSTYPL
Sbjct: 140 HHQYQKGIRDHYPNGGIGPENELGFLGILRGILIWIRVYDTGSRLGSNGNITWHVSTYPL 199
Query: 259 PPDLDQQLQYGGLTFLSLFFCVIIINDIC 287
PPDLDQQLQYGGLTFLSLFFCVIIINDIC
Sbjct: 200 PPDLDQQLQYGGLTFLSLFFCVIIINDIC 228
BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match:
XP_004140174.2 (uncharacterized protein LOC101221687 isoform X1 [Cucumis sativus] >XP_031744068.1 uncharacterized protein LOC101221687 isoform X1 [Cucumis sativus] >KAE8647365.1 hypothetical protein Csa_004279 [Cucumis sativus])
HSP 1 Score: 420 bits (1079), Expect = 4.47e-145
Identity = 223/308 (72.40%), Positives = 236/308 (76.62%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
MRQQGQYSDSGLG+YSVSQMH VPSQ +EQSHPDPFEGRLEAFTPERE+SY+ASKNEDQW
Sbjct: 1 MRQQGQYSDSGLGAYSVSQMHHVPSQMVEQSHPDPFEGRLEAFTPEREHSYVASKNEDQW 60
Query: 61 RWERDESKMPNSMASHMFNEGQG--GDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
RWERDESKMPNSM SHMFNEGQG GD RSYFQGQRPNPKL LEKGSN+D R QSHGKN
Sbjct: 61 RWERDESKMPNSMTSHMFNEGQGQGGDATRSYFQGQRPNPKLGLEKGSNNDPRSQSHGKN 120
Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
MESRFGDG LPQNFDGLEQKFIDDII +KEQNDAEDEENARHRERI+AIN+QYEEQLAA
Sbjct: 121 MESRFGDGPLPQNFDGLEQKFIDDIIKLTKEQNDAEDEENARHRERILAINAQYEEQLAA 180
Query: 181 LRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELG---------------- 240
LR +HAGRRDELLRRES+AR HQYQKGI DHYPNGGIGP + G
Sbjct: 181 LRVRHAGRRDELLRRESTARQHQYQKGIMDHYPNGGIGPGDPRGNSGVTNLAASGQAHQN 240
Query: 241 --------------FLGILC--------GILIWIRVYDTGSRLGSNGNITWHVST-YPLP 267
FLG G RVYDT SRLGSNGNITWHV + LP
Sbjct: 241 YESEHFDSFRERARFLGNSARDPNLDPRGSYPGGRVYDTASRLGSNGNITWHVVVHFALP 300
BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match:
XP_022943334.1 (uncharacterized protein LOC111448131 [Cucurbita moschata] >XP_022943335.1 uncharacterized protein LOC111448131 [Cucurbita moschata])
HSP 1 Score: 405 bits (1040), Expect = 1.80e-140
Identity = 199/207 (96.14%), Positives = 202/207 (97.58%), Query Frame = 0
Query: 20 MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQWRWERDESKMPNSMASHMFN 79
MHQVPSQRMEQSHPDPFEGRLEAFTPERENSY+ASKNEDQWRWERDESKMPNSMASHMFN
Sbjct: 1 MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFN 60
Query: 80 EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF 139
EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF
Sbjct: 61 EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF 120
Query: 140 IDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH 199
IDDIINFSKEQ+DAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH
Sbjct: 121 IDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH 180
Query: 200 HQYQKGIRDHYPNGGIGPENELGFLGI 226
HQYQKGIRDHYPNGGIGP N G G+
Sbjct: 181 HQYQKGIRDHYPNGGIGPGNPRGSSGV 207
BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match:
A0A6J1JI55 (uncharacterized protein LOC111484575 OS=Cucurbita maxima OX=3661 GN=LOC111484575 PE=4 SV=1)
HSP 1 Score: 429 bits (1104), Expect = 3.10e-150
Identity = 214/226 (94.69%), Positives = 217/226 (96.02%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
MRQQ QYSDSGLGSYSVSQMHQVPSQR+EQSHPDPFEGRLEAFTPERENSY+ASKNEDQW
Sbjct: 1 MRQQRQYSDSGLGSYSVSQMHQVPSQRIEQSHPDPFEGRLEAFTPERENSYIASKNEDQW 60
Query: 61 RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
R ERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPK VLEKGSNSDLRFQSHGKNME
Sbjct: 61 RRERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKFVLEKGSNSDLRFQSHGKNME 120
Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
SRFGDGLLPQNFDGLEQKFIDDIINFSKEQND EDEENARHRERIIAINSQYEEQLAALR
Sbjct: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDVEDEENARHRERIIAINSQYEEQLAALR 180
Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N G G+
Sbjct: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPGNPRGNSGV 226
BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match:
A0A6J1FXP5 (uncharacterized protein LOC111448131 OS=Cucurbita moschata OX=3662 GN=LOC111448131 PE=4 SV=1)
HSP 1 Score: 405 bits (1040), Expect = 8.72e-141
Identity = 199/207 (96.14%), Positives = 202/207 (97.58%), Query Frame = 0
Query: 20 MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQWRWERDESKMPNSMASHMFN 79
MHQVPSQRMEQSHPDPFEGRLEAFTPERENSY+ASKNEDQWRWERDESKMPNSMASHMFN
Sbjct: 1 MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFN 60
Query: 80 EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF 139
EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF
Sbjct: 61 EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF 120
Query: 140 IDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH 199
IDDIINFSKEQ+DAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH
Sbjct: 121 IDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH 180
Query: 200 HQYQKGIRDHYPNGGIGPENELGFLGI 226
HQYQKGIRDHYPNGGIGP N G G+
Sbjct: 181 HQYQKGIRDHYPNGGIGPGNPRGSSGV 207
BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match:
A0A1S3BME9 (uncharacterized protein LOC103491440 OS=Cucumis melo OX=3656 GN=LOC103491440 PE=4 SV=1)
HSP 1 Score: 406 bits (1044), Expect = 5.07e-140
Identity = 217/296 (73.31%), Positives = 227/296 (76.69%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
MRQQGQYSDSGLG+YSVSQMH VPSQ +EQSHPDPFEGRLEAFTPERE+SY+ASKNEDQW
Sbjct: 1 MRQQGQYSDSGLGAYSVSQMHHVPSQMVEQSHPDPFEGRLEAFTPEREHSYVASKNEDQW 60
Query: 61 RWERDESKMPNSMASHMFNEGQG--GDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
RWERDESKMPNSM SHMFNEGQG GD RSYFQGQRPNPKL LEKGSNSD R QSHGKN
Sbjct: 61 RWERDESKMPNSMTSHMFNEGQGQGGDATRSYFQGQRPNPKLGLEKGSNSDPRSQSHGKN 120
Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
MESRFGDG LPQNFDGLEQKFIDDII +KEQNDAEDEENARHRERI+AIN+QYEEQLAA
Sbjct: 121 MESRFGDGPLPQNFDGLEQKFIDDIIKLTKEQNDAEDEENARHRERILAINAQYEEQLAA 180
Query: 181 LRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELG---------------- 240
LR +HAGRRDELLRRES+AR HQYQKGI DHYPNGGIGP + G
Sbjct: 181 LRVRHAGRRDELLRRESTARQHQYQKGIMDHYPNGGIGPGDPRGNSGVTNLAASGQAHQN 240
Query: 241 --------------FLGILC--------GILIWIRVYDTGSRLGSNGNITWHVSTY 256
FLG G RVYDT SRL SNGNITWHV Y
Sbjct: 241 YESEHFDSFRERARFLGNSARDPNLDPRGSYPGGRVYDTVSRLCSNGNITWHVVVY 296
BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match:
A0A6J1K145 (uncharacterized protein LOC111490759 OS=Cucurbita maxima OX=3661 GN=LOC111490759 PE=4 SV=1)
HSP 1 Score: 396 bits (1017), Expect = 1.76e-136
Identity = 197/226 (87.17%), Positives = 208/226 (92.04%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
MRQQGQYSDSGL SYS SQMH VPSQR+EQSHPDPFEGRLEAFTPERENS++ASKNEDQW
Sbjct: 1 MRQQGQYSDSGLNSYSGSQMHHVPSQRVEQSHPDPFEGRLEAFTPERENSFVASKNEDQW 60
Query: 61 RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
RWERDESKMPNSMASHMFNEGQGGD RSYFQGQRPN KLVLEKGSNSD R QSHGKNME
Sbjct: 61 RWERDESKMPNSMASHMFNEGQGGDATRSYFQGQRPNSKLVLEKGSNSDPRSQSHGKNME 120
Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
+RFGDGLLPQNFDGLEQKFIDDII +KEQNDAEDEENARHRERI+AIN+QYEEQLAALR
Sbjct: 121 NRFGDGLLPQNFDGLEQKFIDDIIKLAKEQNDAEDEENARHRERIMAINAQYEEQLAALR 180
Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
A+HAGRRDELL+RESSAR HQYQKGI DHY NGGIGP + G G+
Sbjct: 181 ARHAGRRDELLQRESSARQHQYQKGIMDHYMNGGIGPGDPRGNSGV 226
BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match:
A0A6J1H5U1 (uncharacterized protein LOC111459838 OS=Cucurbita moschata OX=3662 GN=LOC111459838 PE=4 SV=1)
HSP 1 Score: 396 bits (1017), Expect = 1.76e-136
Identity = 197/226 (87.17%), Positives = 208/226 (92.04%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
MRQQGQYSDSGL SYS SQMH VPSQR+EQSHPDPFEGRLEAFTPERENS++ASKNEDQW
Sbjct: 1 MRQQGQYSDSGLNSYSGSQMHHVPSQRVEQSHPDPFEGRLEAFTPERENSFVASKNEDQW 60
Query: 61 RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
RWERDESKMPNSMASHMFNEGQGGD RSYFQGQRPN KLVLEKGSNSD R QSHGKNME
Sbjct: 61 RWERDESKMPNSMASHMFNEGQGGDATRSYFQGQRPNSKLVLEKGSNSDPRSQSHGKNME 120
Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
+RFGDGLLPQNFDGLEQKFIDDII +KEQNDAEDEENARHRERI+AIN+QYEEQLAALR
Sbjct: 121 NRFGDGLLPQNFDGLEQKFIDDIIKLAKEQNDAEDEENARHRERIMAINAQYEEQLAALR 180
Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
A+HAGRRDELL+RESSAR HQYQKGI DHY NGGIGP + G G+
Sbjct: 181 ARHAGRRDELLQRESSARQHQYQKGIMDHYMNGGIGPGDPRGNSGV 226
BLAST of Cp4.1LG16g08230 vs. TAIR 10
Match:
AT5G22040.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 162.2 bits (409), Expect = 6.4e-40
Identity = 99/224 (44.20%), Positives = 141/224 (62.95%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGS-YSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQ 60
MR+QG ++DS S Y Q ++ H D F+G+LEAFTPER+ Y S+ E Q
Sbjct: 1 MRRQGNFADSSPASAYGAGQ--------IQDPHSD-FQGQLEAFTPERDQPYSDSQAEGQ 60
Query: 61 WRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
WRWERD M MA+ ++NEGQ G D R+Y++GQ +PK +EK + H +N
Sbjct: 61 WRWERDGPNMSRPMATAVYNEGQQGVDSSRTYYRGQ-IDPKSGMEKQGSDPRAQPQHQEN 120
Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
++ + + Q F+GLEQKF+DDI +K+Q +AED E ARHRE+I IN++YEEQLA
Sbjct: 121 PKTGYDNNRGVQTFEGLEQKFMDDITRLAKDQIEAEDAEIARHREKINTINARYEEQLAT 180
Query: 181 LRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN 220
LRA+H G+R+E++R+ES AR Q+++ G+ D Y +G N
Sbjct: 181 LRARHTGKREEIMRKESLARQQQFKQQTMGMMDQYHPNVVGQAN 214
BLAST of Cp4.1LG16g08230 vs. TAIR 10
Match:
AT5G22040.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 162.2 bits (409), Expect = 6.4e-40
Identity = 99/224 (44.20%), Positives = 141/224 (62.95%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGS-YSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQ 60
MR+QG ++DS S Y Q ++ H D F+G+LEAFTPER+ Y S+ E Q
Sbjct: 1 MRRQGNFADSSPASAYGAGQ--------IQDPHSD-FQGQLEAFTPERDQPYSDSQAEGQ 60
Query: 61 WRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
WRWERD M MA+ ++NEGQ G D R+Y++GQ +PK +EK + H +N
Sbjct: 61 WRWERDGPNMSRPMATAVYNEGQQGVDSSRTYYRGQ-IDPKSGMEKQGSDPRAQPQHQEN 120
Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
++ + + Q F+GLEQKF+DDI +K+Q +AED E ARHRE+I IN++YEEQLA
Sbjct: 121 PKTGYDNNRGVQTFEGLEQKFMDDITRLAKDQIEAEDAEIARHREKINTINARYEEQLAT 180
Query: 181 LRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN 220
LRA+H G+R+E++R+ES AR Q+++ G+ D Y +G N
Sbjct: 181 LRARHTGKREEIMRKESLARQQQFKQQTMGMMDQYHPNVVGQAN 214
BLAST of Cp4.1LG16g08230 vs. TAIR 10
Match:
AT5G22040.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown. )
HSP 1 Score: 162.2 bits (409), Expect = 6.4e-40
Identity = 99/224 (44.20%), Positives = 141/224 (62.95%), Query Frame = 0
Query: 1 MRQQGQYSDSGLGS-YSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQ 60
MR+QG ++DS S Y Q ++ H D F+G+LEAFTPER+ Y S+ E Q
Sbjct: 1 MRRQGNFADSSPASAYGAGQ--------IQDPHSD-FQGQLEAFTPERDQPYSDSQAEGQ 60
Query: 61 WRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
WRWERD M MA+ ++NEGQ G D R+Y++GQ +PK +EK + H +N
Sbjct: 61 WRWERDGPNMSRPMATAVYNEGQQGVDSSRTYYRGQ-IDPKSGMEKQGSDPRAQPQHQEN 120
Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
++ + + Q F+GLEQKF+DDI +K+Q +AED E ARHRE+I IN++YEEQLA
Sbjct: 121 PKTGYDNNRGVQTFEGLEQKFMDDITRLAKDQIEAEDAEIARHREKINTINARYEEQLAT 180
Query: 181 LRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN 220
LRA+H G+R+E++R+ES AR Q+++ G+ D Y +G N
Sbjct: 181 LRARHTGKREEIMRKESLARQQQFKQQTMGMMDQYHPNVVGQAN 214
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023512046.1 | 1.48e-155 | 97.35 | uncharacterized protein LOC111776883 [Cucurbita pepo subsp. pepo]
XP_022986988.1 | 6.40e-150 | 94.69 | uncharacterized protein LOC111484575 [Cucurbita maxima]
KAG6570459.1 | 1.36e-146 | 98.56 | hypothetical protein SDJN03_29374, partial [Cucurbita argyrosperma subsp. sorori...
XP_004140174.2 | 4.47e-145 | 72.40 | uncharacterized protein LOC101221687 isoform X1 [Cucumis sativus] >XP_031744068....
XP_022943334.1 | 1.80e-140 | 96.14 | uncharacterized protein LOC111448131 [Cucurbita moschata] >XP_022943335.1 unchar...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1JI55 | 3.10e-150 | 94.69 | uncharacterized protein LOC111484575 OS=Cucurbita maxima OX=3661 GN=LOC111484575...
A0A6J1FXP5 | 8.72e-141 | 96.14 | uncharacterized protein LOC111448131 OS=Cucurbita moschata OX=3662 GN=LOC1114481...
A0A1S3BME9 | 5.07e-140 | 73.31 | uncharacterized protein LOC103491440 OS=Cucumis melo OX=3656 GN=LOC103491440 PE=...
A0A6J1K145 | 1.76e-136 | 87.17 | uncharacterized protein LOC111490759 OS=Cucurbita maxima OX=3661 GN=LOC111490759...
A0A6J1H5U1 | 1.76e-136 | 87.17 | uncharacterized protein LOC111459838 OS=Cucurbita moschata OX=3662 GN=LOC1114598...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G22040.1 | 6.4e-40 | 44.20 | unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
AT5G22040.2 | 6.4e-40 | 44.20 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G22040.3 | 6.4e-40 | 44.20 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
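For downstream use, the pipe-delimited summary rows can be read into records and ranked by E-value. The following is a minimal Python sketch under the assumption that each row carries the match name, E-value, identity, and description as its first four pipe-separated fields; the function name `parse_row` and the dictionary keys are illustrative choices, not part of the database export.

```python
def parse_row(row):
    """Split one summary row into a record; extra trailing fields are ignored."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),      # e.g. "1.48e-155" parses as a float
        "identity": float(identity),  # percent identity
        "description": description,
    }


# Two rows copied from the NCBI nr summary above.
rows = [
    "XP_022986988.1 | 6.40e-150 | 94.69 | uncharacterized protein LOC111484575 [Cucurbita maxima]",
    "XP_023512046.1 | 1.48e-155 | 97.35 | uncharacterized protein LOC111776883 [Cucurbita pepo subsp. pepo]",
]

records = [parse_row(row) for row in rows]
# A lower E-value indicates a more significant hit.
best = min(records, key=lambda record: record["evalue"])
print(best["match"], best["evalue"], best["identity"])
```

Sorting on the parsed float rather than on the text avoids misordering values such as 6.4e-40 and 1.48e-155.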