Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
TATGTATAGTCAAGTAGGCGTGCTTCCTCAAGGTTGGGCCCGCGGGTTAAATGGGTTGCAAAAGGACTAATTTCTTTATGATTACCCTTCATTTTCTCCGATTTCCCTCTTGGAAACCCCTTGTTTAATTGGTCATGAAGTATGAACCCACGAAGTACAAAAGTTCATACTGAGCTGCAATTCGCATTTCTGTTTGGAATTATAATCTGAATCTCTACACCAGATTGTGAAGCAACAGAACAATGAGGCATGGTGGATCCAGGAAGAAAAGATCGTCATCGTTTGCGCGATATGTCGTTGTTCTATGTGCCGTCGGTGCTTCAATTGGATTTTTGATGCTCAATTTTCTTATGAGGATGGAAGCTCAAGAATCAGAATCGTCCTCTGATCAGTTAGGTAATGGCGATGACGTTGAAGAGAGTCGGGTTCTGAGTGAAATGGACGGAAGGCGGAGCTGCGCGACGGTGGAGCAGATGGGAGAGGCCTTCAAAGATGGTGTCTGGAAGGAAAGCCTGAGAGTAAGAACAATTATTCAAAATCACTTTTATTTGAATGGTAATATTTCCCTATATGCTTTTCTGATAACCATCTGCCCTCCCCTTGTTGATCGTTTTATAGTTAAATGAATGATTAATAGCTACTGTCAAGCAGCTCTTCTTTTGTATATTAGGACTGTTTAATGGAGTCATGATTCGGCTCACGATCCCTTTTTTGCCATTATAATAATCGTTTGATGTTCCATAGTGAGTGTGTTGGCTGTATCGATTGAATATAAAATACTGATTATTTGGGGGTGTTGAGTAAAATGTCGTAGATCAGAAACTACACTTGTCAACCTCCGATCGATCTACGATCACGAGTAGGTAGGCTGCTAGGGGGAGGGGCGGCTGGGCGGCCTTTTCAATCAATATCAGACTGACCCCTAATTACTAGACCAACCTGGTGCGAGTTGTAAGGAGTTTTGGTCTCGAGTTGAAGGAATGTGGAAGATACAGGTTGTAAACAGGGGAAAGTAGTTGGAATTTGGAGAAGGATGGTGGATGAAACTAGAGAAAACCTTAGGCTTCCAAGATGTAATTTGGATGGTCAAAATTCCCAAAGAGATTCGATATTTTTTCACCTGGCTATAAGTTCATAGGAATATAGTTGATAATGGATTGGTGCAATAGAAATTCTTGGTCGACTGTTCCNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATGTTTCTTTTGCTTAATGATAGATCCAGCTTCTTGTTAAATAATTAAAAAAAGAAAACTGCAGAATGGTGGAATATTATTATTATTGTTCTTTCTCTAGTAGGTCAAAGAGACCATTCCACTTGGTCAGTGGGGATAATAAGGCCAGAACAATAGTAGGATATCATTGTGTGGTTAAAAGTAATCTTGATTGCATTATGGTGCCATATGAGCGTGAAGAATGGTCTCCTTGAGCTTCTTTCATATTAAGAATGAGGATTCGAGGTTCTTCAAGCTGCCAACTCTATAATAAAGAAGATAGTTGTTGGAGTGAATTGAAAATGAACTGCTTGTTGGTTTGGTTTAGCTCAATTCTTTGTAGTCAGTAGTTGTTAATAGTTAAATTCCTGGGTTCTTAAGTAGTCTGCTTTGTTAGTAAAAGAGATTGTTTTGAATTTAATAAAGATTCCACGAGGACATTTTGTGATGCCAAGCAGTTGTATTAAAGACTTCTCGAGTCTTACGAGTTATGGTGGATCTCTTGGCGAAAGAACACATAAAGTGAGAAAGATGAAAACTCACGTATACATATTCGTTCTTTAGAATAGGTTATAGGGAAGTTCATATTGGTTTTTTTTTCTGATGGTGTAGTTTGTGCTCTTCTTCTTATTCAAATAACAATTTTAAAACTAATCTTCTTCATCTTGGCTTTTCCTTTACTGTTTAATCTTTTGTCATAACTGTTTTCTGGTGGAAAACC
TGTTAGCTCTCTGAGATTTATAAAAACTGACATGGTTAATTTCATTTTAGGTGCTTCAAGAGTGCGACAACTTCCTCCGGAGCAGTTTTGCAAACACGGTTTTGTCATGGGCAAATCTTCAGAGGCAGGCTTTGGGAATGAGATGTACAAGATTCTAACTGCTGGAGCTTTAAGTATAATGTTGAACAGATCCTTGATTATCGGGCAAACCAGGCATGTTGGTTCTATATCAAGCCTCTTTATTTCTATTTAAATATTGGTTGTGAATACTATAATGTATCGAGTCGTTAGAAGCATTTTGTGTCTGTGATAGTTCCATTTCACACTCACATTTCAAGTTATAGGCTAAGTAATGATGCCTTGGATCTGGTATATAAAAACTTGTTAGCTTTTCTCAGCTAAAATCTCAATAAATGGATGGAATAACTTCAAAGAAGCCATCAGATAAATTTCAGAAGAAACTATGCAAATCTATGCAATAGGATTATAGTCGTGCATTGATGGAATAACTTTCATAGTTCTTATGTTATTACTTCTGTTTTCAGGGGCAAGTTTCCATTTGGAGACTACATTTCTTATTCCAATGTCACGTTTACCATGAAAGAAATCAAGCATTTGTGGAGACTTAAAGGTTGTATTAGGAAATTCAATAGGCATTTGATTATGCGAACTGATGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTATAGTTAACTTTCAGTTTACTTTAAACCGGTACTCAATTCGTACTCTGTTTATGCTGACTGATGTGCATCAGGTTCCAAGGTACAACAGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGCCATGAGGGCCGCTGCTTCTAATTTGTTTGGACATCCAGAGGTTTTGGAATCTAGACCTAATGTATTCGGAGAGCTGATGAGAGTTCTTATATCTCCGTCAAAGGATGTTGAAGAAGCAGTGTTCTCAGTTCTTAAAAGTGGGGATGATCCTGATATTTCATTGCACATGAGGATGCTTATGAATAGGTAGTGTTACTACTTATTTACCGTTATTTCCTTCTCTCTCTACTTCCACTAGCTATAAGAATTGAATCCACAATGTCTATTTCCAATAAATTCCATTTAGTTGATAGAAATCCTGGATATTAGTCACTTTTGTAATCCTTTCCCTGAGTAATGTGCATCTGCATTTCAGGTCTGTGAGAGGCTTACAGGCTGCATTGCAGTGCATCAGAAAAGGCATACGCAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAAATTTTGTAAAAAGTATTGTGCCCATGCTTGGTGAATTTGCAGAGGTAACTGCCTCATTGTGTGATTCTATTCTTCAAATGTAGTCCTGATTTCAAATACATGCTTAACGGAATAATTGTTCAGTCAAAGACCCCCATAACTTCTGTATGCCTGTAAAACAATTTCGGACCAGTTTCATTTGTGACTCTGTTTTCAGGTTATTCATTTTGATTATGAACACTTCAGAGGAATCATTTCTGGAACGCACGATGAATTTCATAAATTGGATTTCAGAGTGAAGGACTGGGGCCCTTCACCAAGATGGGTTGCCTTTGTGGATTTTTTTCTTGCATCCCGTGCCAAGCGTGCTGTTATTTCTGGTGCTCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCATTGGCTGCAGCACACAATCTCGACAATCTCGGTATCTTTGAATACTCTATCTATCATTTTACATCATGTAGGAACATAATTTGTTTCTTTGAGGAACATCTGTACAACTGTTCCTAACATAACATTACAAATATGCTCAGGGAAAAATTCTACTGGTTCAGACTTCTTCTTCTTGAGTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGTCATATCTGGAACAGATTTGCAGG
CCCTTTAAGCTGCCCTAGCCAGCCTAACCAGTGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATCAAAAGAATGGAAAATTATGGAGTTCATTTATCGGGCTTTGGCACGATTGATGAAGACAGCCTTCGATCGTTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATATTATAGTCATCTTATGCTTCTGGTTTGTCCTGTAAGCTTTATAACTAACATAGTTCTTATAACTTTTGCCAAACTCTGCTTATTTTGTCAATTTTGCTCGGTCCAGGAATTTGTTTCTGTTAAATCAAACGTCCCAGTAATGTCTTCTTGTATTTATTGGGCCATTTGATTTACAAAACAAAGTCTTATATGCTGTCAAGAGAGTCCAATTTCATTTTCAGTCGATACCATTGGATCAAATATGCATAAGACGGCATTTTCCCTTTTTGCTCCCCCAGTGAACAGTTTAGTATTGGCTTATAGTTCCAAGGATGTCTTGAATAGTATTAGTGGGATATTGAATGATTGGGTTGTACTTGTTCTTGGTTGTTGATTGCCACCAAAGGTAGGTGGCGGTAGATTTACACT
mRNA sequence
TATGTATAGTCAAGTAGGCGTGCTTCCTCAAGGTTGGGCCCGCGGGTTAAATGGGTTGCAAAAGGACTAATTTCTTTATGATTACCCTTCATTTTCTCCGATTTCCCTCTTGGAAACCCCTTGTTTAATTGGTCATGAAGTATGAACCCACGAAGTACAAAAGTTCATACTGAGCTGCAATTCGCATTTCTGTTTGGAATTATAATCTGAATCTCTACACCAGATTGTGAAGCAACAGAACAATGAGGCATGGTGGATCCAGGAAGAAAAGATCGTCATCGTTTGCGCGATATGTCGTTGTTCTATGTGCCGTCGGTGCTTCAATTGGATTTTTGATGCTCAATTTTCTTATGAGGATGGAAGCTCAAGAATCAGAATCGTCCTCTGATCAGTTAGGTAATGGCGATGACGTTGAAGAGAGTCGGGTTCTGAGTGAAATGGACGGAAGGCGGAGCTGCGCGACGGTGGAGCAGATGGGAGAGGCCTTCAAAGATGGTGTCTGGAAGGAAAGCCTGAGAGTAAGAACAATTATTCAAAATCACTTTTATTTGAATGGTGCTTCAAGAGTGCGACAACTTCCTCCGGAGCAGTTTTGCAAACACGGTTTTGTCATGGGCAAATCTTCAGAGGCAGGCTTTGGGAATGAGATGTACAAGATTCTAACTGCTGGAGCTTTAAGTATAATGTTGAACAGATCCTTGATTATCGGGCAAACCAGGCATGCTAAGGGCAAGTTTCCATTTGGAGACTACATTTCTTATTCCAATGTCACGTTTACCATGAAAGAAATCAAGCATTTGTGGAGACTTAAAGGTTGTATTAGGAAATTCAATAGGCATTTGATTATGCGAACTGATGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTTCCAAGGTACAACAGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGCCATGAGGGCCGCTGCTTCTAATTTGTTTGGACATCCAGAGGTTTTGGAATCTAGACCTAATGTATTCGGAGAGCTGATGAGAGTTCTTATATCTCCGTCAAAGGATGTTGAAGAAGCAGTGTTCTCAGTTCTTAAAAGTGGGGATGATCCTGATATTTCATTGCACATGAGGATGCTTATGAATAGGTCTGTGAGAGGCTTACAGGCTGCATTGCAGTGCATCAGAAAAGGCATACGCAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAAATTTTGTAAAAAGTATTGTGCCCATGCTTGGTGAATTTGCAGAGGTTATTCATTTTGATTATGAACACTTCAGAGGAATCATTTCTGGAACGCACGATGAATTTCATAAATTGGATTTCAGAGTGAAGGACTGGGGCCCTTCACCAAGATGGGTTGCCTTTGTGGATTTTTTTCTTGCATCCCGTGCCAAGCGTGCTGTTATTTCTGGTGCTCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCATTGGCTGCAGCACACAATCTCGACAATCTCGGGAAAAATTCTACTGGTTCAGACTTCTTCTTCTTGAGTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGTCATATCTGGAACAGATTTGCAGGCCCTTTAAGCTGCCCTAGCCAGCCTAACCAGTGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATCAAAAGAATGGAAAATTATGGAGTTCATTTATCGGGCTTTGGCACGATTGATGAAGACAGCCTTCGATCGTTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATATTATAGTCATCTTATGCTTCTGGTTTGTCCTGTAAGCTTTATAACTAACATAGTTCTTATAACTTTTGCCAAACTCTGCTTATTTTGTCAATTTTGCTCGGTCCA
GGAATTTGTTTCTGTTAAATCAAACGTCCCAGTAATGTCTTCTTGTATTTATTGGGCCATTTGATTTACAAAACAAAGTCTTATATGCTGTCAAGAGAGTCCAATTTCATTTTCAGTCGATACCATTGGATCAAATATGCATAAGACGGCATTTTCCCTTTTTGCTCCCCCAGTGAACAGTTTAGTATTGGCTTATAGTTCCAAGGATGTCTTGAATAGTATTAGTGGGATATTGAATGATTGGGTTGTACTTGTTCTTGGTTGTTGATTGCCACCAAAGGTAGGTGGCGGTAGATTTACACT
Coding sequence (CDS)
ATGAGGCATGGTGGATCCAGGAAGAAAAGATCGTCATCGTTTGCGCGATATGTCGTTGTTCTATGTGCCGTCGGTGCTTCAATTGGATTTTTGATGCTCAATTTTCTTATGAGGATGGAAGCTCAAGAATCAGAATCGTCCTCTGATCAGTTAGGTAATGGCGATGACGTTGAAGAGAGTCGGGTTCTGAGTGAAATGGACGGAAGGCGGAGCTGCGCGACGGTGGAGCAGATGGGAGAGGCCTTCAAAGATGGTGTCTGGAAGGAAAGCCTGAGAGTAAGAACAATTATTCAAAATCACTTTTATTTGAATGGTGCTTCAAGAGTGCGACAACTTCCTCCGGAGCAGTTTTGCAAACACGGTTTTGTCATGGGCAAATCTTCAGAGGCAGGCTTTGGGAATGAGATGTACAAGATTCTAACTGCTGGAGCTTTAAGTATAATGTTGAACAGATCCTTGATTATCGGGCAAACCAGGCATGCTAAGGGCAAGTTTCCATTTGGAGACTACATTTCTTATTCCAATGTCACGTTTACCATGAAAGAAATCAAGCATTTGTGGAGACTTAAAGGTTGTATTAGGAAATTCAATAGGCATTTGATTATGCGAACTGATGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTTCCAAGGTACAACAGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGCCATGAGGGCCGCTGCTTCTAATTTGTTTGGACATCCAGAGGTTTTGGAATCTAGACCTAATGTATTCGGAGAGCTGATGAGAGTTCTTATATCTCCGTCAAAGGATGTTGAAGAAGCAGTGTTCTCAGTTCTTAAAAGTGGGGATGATCCTGATATTTCATTGCACATGAGGATGCTTATGAATAGGTCTGTGAGAGGCTTACAGGCTGCATTGCAGTGCATCAGAAAAGGCATACGCAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAAATTTTGTAAAAAGTATTGTGCCCATGCTTGGTGAATTTGCAGAGGTTATTCATTTTGATTATGAACACTTCAGAGGAATCATTTCTGGAACGCACGATGAATTTCATAAATTGGATTTCAGAGTGAAGGACTGGGGCCCTTCACCAAGATGGGTTGCCTTTGTGGATTTTTTTCTTGCATCCCGTGCCAAGCGTGCTGTTATTTCTGGTGCTCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCATTGGCTGCAGCACACAATCTCGACAATCTCGGGAAAAATTCTACTGGTTCAGACTTCTTCTTCTTGAGTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGTCATATCTGGAACAGATTTGCAGGCCCTTTAAGCTGCCCTAGCCAGCCTAACCAGTGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATCAAAAGAATGGAAAATTATGGAGTTCATTTATCGGGCTTTGGCACGATTGATGAAGACAGCCTTCGATCGTTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATATTATAG
Protein sequence
MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEESRVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNAKKNVVRTIPFIL
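The protein sequence above is the frame-0 translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the relationship can be checked on the first and last few codons of the CDS. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGGCATGGTGGATCC"))  # first 18 bases of the CDS -> MRHGGS
print(translate("CCTTTCATATTATAG"))     # last 15 bases, ending in TAG -> PFIL
```

Passing the full CDS string to `translate` in the same way should reproduce the complete protein sequence, provided the CDS has been copied without line breaks.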
Homology
BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match:
XP_023532982.1 (uncharacterized protein LOC111794994 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1118 bits (2892), Expect = 0.0
Identity = 549/552 (99.46%), Positives = 549/552 (99.46%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES
Sbjct: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
Query: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH
Sbjct: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
Query: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTM 180
GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDYISYSNVTFTM
Sbjct: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSNVTFTM 180
Query: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ
Sbjct: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
Query: 241 FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD
Sbjct: 241 FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
Query: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE
Sbjct: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
Query: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV 420
VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV
Sbjct: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV 420
Query: 421 GTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
GTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP
Sbjct: 421 GTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
Query: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA
Sbjct: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
Query: 541 KKNVVRTIPFIL 552
KKNVVRTIPFIL
Sbjct: 541 KKNVVRTIPFIL 549
BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match:
XP_022922653.1 (uncharacterized protein LOC111430593 [Cucurbita moschata])
HSP 1 Score: 1112 bits (2875), Expect = 0.0
Identity = 546/552 (98.91%), Positives = 546/552 (98.91%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGGSRKKR SSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES
Sbjct: 1 MRHGGSRKKRWSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
Query: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH
Sbjct: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
Query: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTM 180
GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDYISYSNVTFTM
Sbjct: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSNVTFTM 180
Query: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ
Sbjct: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
Query: 241 FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD
Sbjct: 241 FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
Query: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE
Sbjct: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
Query: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV 420
VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHRRV
Sbjct: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRV 420
Query: 421 GTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
GTTYAQLIAALAAAHNLDNLG NSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP
Sbjct: 421 GTTYAQLIAALAAAHNLDNLGNNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
Query: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA
Sbjct: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
Query: 541 KKNVVRTIPFIL 552
KKNVVRTIPFIL
Sbjct: 541 KKNVVRTIPFIL 549
BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match:
KAG7032713.1 (hypothetical protein SDJN02_06763 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1111 bits (2873), Expect = 0.0
Identity = 546/562 (97.15%), Positives = 548/562 (97.51%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES
Sbjct: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
Query: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH
Sbjct: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
Query: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHA----------KGKFPFGDY 180
GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRH+ GKFPFGDY
Sbjct: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHSCIDGITFVVLMGKFPFGDY 180
Query: 181 ISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWF 240
ISYSNV FTMKEIKHLWRLKGCIR+FNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWF
Sbjct: 181 ISYSNVMFTMKEIKHLWRLKGCIREFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWF 240
Query: 241 QGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVF 300
QGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVF
Sbjct: 241 QGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVF 300
Query: 301 SVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKS 360
SVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKS
Sbjct: 301 SVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKS 360
Query: 361 IVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKR 420
IVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK
Sbjct: 361 IVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKH 420
Query: 421 AVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGW 480
AVISGAHRRVGTTYAQLIAALAAAHNLDNLG NSTGSDFFFLSSFQSNLLREGLKNQVGW
Sbjct: 421 AVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFFFLSSFQSNLLREGLKNQVGW 480
Query: 481 GHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTID 540
GHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTID
Sbjct: 481 GHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTID 540
Query: 541 EDSLRSFCNAKKNVVRTIPFIL 552
EDSLRSFCNAKKNVVRTIPFIL
Sbjct: 541 EDSLRSFCNAKKNVVRTIPFIL 562
BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match:
XP_022990922.1 (uncharacterized protein LOC111487669 [Cucurbita maxima])
HSP 1 Score: 1098 bits (2841), Expect = 0.0
Identity = 538/552 (97.46%), Positives = 544/552 (98.55%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGGSRKKRSSSFARYVVVLCAVGASIGF MLNFLMRMEA+ESESSSDQLGNGDDVEES
Sbjct: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFFMLNFLMRMEARESESSSDQLGNGDDVEES 60
Query: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
RVL+EMDGRRSCATVE+MGEAFKDGVWKESLRVR IIQNHFYLNGASRVRQLPPEQFCKH
Sbjct: 61 RVLNEMDGRRSCATVEKMGEAFKDGVWKESLRVRKIIQNHFYLNGASRVRQLPPEQFCKH 120
Query: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTM 180
GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDYISYS+VTFTM
Sbjct: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDVTFTM 180
Query: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ
Sbjct: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
Query: 241 FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
FFLKNVHPAMRAAASNLFGHPE+LESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD
Sbjct: 241 FFLKNVHPAMRAAASNLFGHPELLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
Query: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTT+SKPRLVLVSDTPNFVKSIVPMLGEFAE
Sbjct: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTNSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
Query: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV 420
VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV
Sbjct: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV 420
Query: 421 GTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
GTTYAQLIAALAAAHNLDNLG NSTG DFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP
Sbjct: 421 GTTYAQLIAALAAAHNLDNLGNNSTGPDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
Query: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA
Sbjct: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
Query: 541 KKNVVRTIPFIL 552
KKNVVRT PFIL
Sbjct: 541 KKNVVRTPPFIL 549
BLAST of Cp4.1LG01g19760 vs. NCBI nr
Match:
KAG6602018.1 (hypothetical protein SDJN03_07251, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1077 bits (2785), Expect = 0.0
Identity = 535/562 (95.20%), Positives = 538/562 (95.73%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES
Sbjct: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
Query: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTI----------IQNHFYLNGASRVR 120
RVLSEMDGRRSCATVEQMGEAFKDGVWKESLR + + +N ASRVR
Sbjct: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRGKVVGIWRRMVDETRENLRLPRCASRVR 120
Query: 121 QLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDY 180
QLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDY
Sbjct: 121 QLPPEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDY 180
Query: 181 ISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWF 240
ISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWF
Sbjct: 181 ISYSNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWF 240
Query: 241 QGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVF 300
QGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVF
Sbjct: 241 QGTTDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVF 300
Query: 301 SVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKS 360
SVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKS
Sbjct: 301 SVLKSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKS 360
Query: 361 IVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKR 420
IVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK
Sbjct: 361 IVPMLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKH 420
Query: 421 AVISGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGW 480
AVISGAHRRVGTTYAQLIAALAAAHNLDNLG NSTGSDFFFLSSFQSNLLREGLKNQVGW
Sbjct: 421 AVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFFFLSSFQSNLLREGLKNQVGW 480
Query: 481 GHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTID 540
GHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTID
Sbjct: 481 GHIWNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTID 540
Query: 541 EDSLRSFCNAKKNVVRTIPFIL 552
EDSLRSFCNAKKNVVRTIPFIL
Sbjct: 541 EDSLRSFCNAKKNVVRTIPFIL 559
BLAST of Cp4.1LG01g19760 vs. ExPASy TrEMBL
Match:
A0A6J1E7F2 (uncharacterized protein LOC111430593 OS=Cucurbita moschata OX=3662 GN=LOC111430593 PE=4 SV=1)
HSP 1 Score: 1112 bits (2875), Expect = 0.0
Identity = 546/552 (98.91%), Positives = 546/552 (98.91%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGGSRKKR SSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES
Sbjct: 1 MRHGGSRKKRWSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
Query: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH
Sbjct: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
Query: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTM 180
GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDYISYSNVTFTM
Sbjct: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSNVTFTM 180
Query: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ
Sbjct: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
Query: 241 FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD
Sbjct: 241 FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
Query: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE
Sbjct: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
Query: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV 420
VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHRRV
Sbjct: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRV 420
Query: 421 GTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
GTTYAQLIAALAAAHNLDNLG NSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP
Sbjct: 421 GTTYAQLIAALAAAHNLDNLGNNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
Query: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA
Sbjct: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
Query: 541 KKNVVRTIPFIL 552
KKNVVRTIPFIL
Sbjct: 541 KKNVVRTIPFIL 549
BLAST of Cp4.1LG01g19760 vs. ExPASy TrEMBL
Match:
A0A6J1JK88 (uncharacterized protein LOC111487669 OS=Cucurbita maxima OX=3661 GN=LOC111487669 PE=4 SV=1)
HSP 1 Score: 1098 bits (2841), Expect = 0.0
Identity = 538/552 (97.46%), Positives = 544/552 (98.55%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGGSRKKRSSSFARYVVVLCAVGASIGF MLNFLMRMEA+ESESSSDQLGNGDDVEES
Sbjct: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFFMLNFLMRMEARESESSSDQLGNGDDVEES 60
Query: 61 RVLSEMDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCKH 120
RVL+EMDGRRSCATVE+MGEAFKDGVWKESLRVR IIQNHFYLNGASRVRQLPPEQFCKH
Sbjct: 61 RVLNEMDGRRSCATVEKMGEAFKDGVWKESLRVRKIIQNHFYLNGASRVRQLPPEQFCKH 120
Query: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFTM 180
GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDYISYS+VTFTM
Sbjct: 121 GFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDVTFTM 180
Query: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ
Sbjct: 181 KEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQ 240
Query: 241 FFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
FFLKNVHPAMRAAASNLFGHPE+LESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD
Sbjct: 241 FFLKNVHPAMRAAASNLFGHPELLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPD 300
Query: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTT+SKPRLVLVSDTPNFVKSIVPMLGEFAE
Sbjct: 301 ISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTNSKPRLVLVSDTPNFVKSIVPMLGEFAE 360
Query: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV 420
VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV
Sbjct: 361 VIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRRV 420
Query: 421 GTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
GTTYAQLIAALAAAHNLDNLG NSTG DFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP
Sbjct: 421 GTTYAQLIAALAAAHNLDNLGNNSTGPDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGP 480
Query: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA
Sbjct: 481 LSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNA 540
Query: 541 KKNVVRTIPFIL 552
KKNVVRT PFIL
Sbjct: 541 KKNVVRTPPFIL 549
BLAST of Cp4.1LG01g19760 vs. ExPASy TrEMBL
Match:
A0A6J1JUE3 (uncharacterized protein LOC111488989 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488989 PE=4 SV=1)
HSP 1 Score: 1006 bits (2602), Expect = 0.0
Identity = 492/553 (88.97%), Positives = 513/553 (92.77%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGG ++KRSSS RYVVVLCAVGA+IGFLMLN L R+E++ SE SSDQ GNGDDVEES
Sbjct: 19 MRHGGLKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 78
Query: 61 RVLSEMDGRR-SCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCK 120
S ++GRR SCATVEQMGE F DGVWKESLRVRTIIQNHFYLNGASRVR LPPEQFCK
Sbjct: 79 FARSGIEGRRGSCATVEQMGEVFNDGVWKESLRVRTIIQNHFYLNGASRVRHLPPEQFCK 138
Query: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFT 180
HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDYISYS+++FT
Sbjct: 139 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDISFT 198
Query: 181 MKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA 240
+KEIKHLWRLKGC+RKF RHLIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA
Sbjct: 199 LKEIKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA 258
Query: 241 QFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDP 300
QFFLKNVHPAMRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSG DP
Sbjct: 259 QFFLKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADP 318
Query: 301 DISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFA 360
DISLHMRMLMNRS+RGLQAA+QCIRK I NLTT KPRLVLVSDTP+FV SI+P+LGEFA
Sbjct: 319 DISLHMRMLMNRSIRGLQAAVQCIRKAILNLTTVPKPRLVLVSDTPDFVTSIMPILGEFA 378
Query: 361 EVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRR 420
EVIHFDYEHFRG IS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHRR
Sbjct: 379 EVIHFDYEHFRGNISRTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR 438
Query: 421 VGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAG 480
+GTTYAQLIAALAAAHNLDN G NSTGSDF FLSSFQSNLL EGLKNQVGWGHIWNRFAG
Sbjct: 439 IGTTYAQLIAALAAAHNLDNFGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAG 498
Query: 481 PLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCN 540
PLSCP QPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLS G +DEDSLRSFCN
Sbjct: 499 PLSCPGQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSSSGIVDEDSLRSFCN 558
Query: 541 AKKNVVRTIPFIL 552
AKKNVVRTIPFIL
Sbjct: 559 AKKNVVRTIPFIL 568
BLAST of Cp4.1LG01g19760 vs. ExPASy TrEMBL
Match:
A0A6J1FF37 (uncharacterized protein LOC111444894 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444894 PE=4 SV=1)
HSP 1 Score: 1005 bits (2598), Expect = 0.0
Identity = 492/553 (88.97%), Positives = 514/553 (92.95%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGGS++KRSSS RYVVVLCAVGA+IGFLMLN L R+E++ SE SSDQ GNGDDVEES
Sbjct: 1 MRHGGSKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 60
Query: 61 RVLSEMDGRR-SCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCK 120
S ++GRR SCATVE+MGE F DGVWKESLRVRTIIQNHF LNGASRVR LPPEQFCK
Sbjct: 61 FARSGIEGRRGSCATVERMGEVFNDGVWKESLRVRTIIQNHFCLNGASRVRHLPPEQFCK 120
Query: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFT 180
HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDYISYS+++FT
Sbjct: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDISFT 180
Query: 181 MKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA 240
+KEIKHLWRLKGC+RKF RHLIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA
Sbjct: 181 LKEIKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA 240
Query: 241 QFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDP 300
QFFLKNVHPAMRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSG DP
Sbjct: 241 QFFLKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADP 300
Query: 301 DISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFA 360
DISLHMRMLMNRS+RGLQAA+QCIRK + NLTT KPRLVLVSDTP+FVKSI+P+LGEFA
Sbjct: 301 DISLHMRMLMNRSIRGLQAAVQCIRKAMLNLTTVPKPRLVLVSDTPDFVKSIMPILGEFA 360
Query: 361 EVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRR 420
EVIHFDYEHFRG IS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHRR
Sbjct: 361 EVIHFDYEHFRGNISATHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR 420
Query: 421 VGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAG 480
+GTTYAQLIAALAAAHNLDN G NSTGSDF FLSSFQSNLL EGLKNQVGWGHIWNRFAG
Sbjct: 421 IGTTYAQLIAALAAAHNLDNPGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAG 480
Query: 481 PLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCN 540
PLSCP QPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLS G IDEDSLRSFCN
Sbjct: 481 PLSCPGQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSSSGIIDEDSLRSFCN 540
Query: 541 AKKNVVRTIPFIL 552
AKKNVVRTIPFIL
Sbjct: 541 AKKNVVRTIPFIL 550
BLAST of Cp4.1LG01g19760 vs. ExPASy TrEMBL
Match:
A0A6J1JQR8 (uncharacterized protein LOC111488989 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111488989 PE=4 SV=1)
HSP 1 Score: 991 bits (2562), Expect = 0.0
Identity = 484/545 (88.81%), Positives = 505/545 (92.66%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
MRHGG ++KRSSS RYVVVLCAVGA+IGFLMLN L R+E++ SE SSDQ GNGDDVEES
Sbjct: 19 MRHGGLKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 78
Query: 61 RVLSEMDGRR-SCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCK 120
S ++GRR SCATVEQMGE F DGVWKESLRVRTIIQNHFYLNGASRVR LPPEQFCK
Sbjct: 79 FARSGIEGRRGSCATVEQMGEVFNDGVWKESLRVRTIIQNHFYLNGASRVRHLPPEQFCK 138
Query: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISYSNVTFT 180
HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR GKFPFGDYISYS+++FT
Sbjct: 139 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTR---GKFPFGDYISYSDISFT 198
Query: 181 MKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA 240
+KEIKHLWRLKGC+RKF RHLIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA
Sbjct: 199 LKEIKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAA 258
Query: 241 QFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDP 300
QFFLKNVHPAMRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSG DP
Sbjct: 259 QFFLKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADP 318
Query: 301 DISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFA 360
DISLHMRMLMNRS+RGLQAA+QCIRK I NLTT KPRLVLVSDTP+FV SI+P+LGEFA
Sbjct: 319 DISLHMRMLMNRSIRGLQAAVQCIRKAILNLTTVPKPRLVLVSDTPDFVTSIMPILGEFA 378
Query: 361 EVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVISGAHRR 420
EVIHFDYEHFRG IS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAK AVISGAHRR
Sbjct: 379 EVIHFDYEHFRGNISRTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR 438
Query: 421 VGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAG 480
+GTTYAQLIAALAAAHNLDN G NSTGSDF FLSSFQSNLL EGLKNQVGWGHIWNRFAG
Sbjct: 439 IGTTYAQLIAALAAAHNLDNFGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAG 498
Query: 481 PLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCN 540
PLSCP QPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLS G +DEDSLRSFCN
Sbjct: 499 PLSCPGQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSSSGIVDEDSLRSFCN 558
Query: 541 AKKNV 544
AKKNV
Sbjct: 559 AKKNV 560
BLAST of Cp4.1LG01g19760 vs. TAIR 10
Match:
AT3G26950.1 (unknown protein; Has 27 Blast hits to 27 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 27; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 688.3 bits (1775), Expect = 4.9e-198
Identity = 341/558 (61.11%), Positives = 415/558 (74.37%), Query Frame = 0
Query: 1 MRHGGSRKKRSSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
M+ GG+R+KR F + ++L +V IGF +L +R S D + E S
Sbjct: 1 MKRGGTRRKR--LFGK-TILLSSVVFFIGFGLLLLTLRSVDPNSSFIDDDDDESESEEAS 60
Query: 61 RVLSE-------MDGRRSCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLP 120
R + +DG + CATVE+MG F G +SLRVR +I HF +NGAS +R+LP
Sbjct: 61 RWSNSSSIGEAMVDGAKLCATVEEMGSEFDGGFVDQSLRVRDVIHRHFQINGASAIRELP 120
Query: 121 PEQFCKHGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRHAKGKFPFGDYISY 180
PEQFC+HG+V+GK++EAGFGNEMYKILT+ ALSIMLNRSLIIGQTR GK+PFGDYI+Y
Sbjct: 121 PEQFCRHGYVLGKTAEAGFGNEMYKILTSAALSIMLNRSLIIGQTR---GKYPFGDYIAY 180
Query: 181 SNVTFTMKEIKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGT 240
SN TFTM E+KHLWR GC++K+ R L+MR DDFEKPA++NVLCSNWK+WE IIWFQGT
Sbjct: 181 SNATFTMSEVKHLWRQNGCVKKYKRRLVMRLDDFEKPAKSNVLCSNWKKWEEAIIWFQGT 240
Query: 241 TDAVAAQFFLKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVL 300
TDAVAAQFFLKNVHP MRAAA LFG R NVFGELM LISP+KDV+EAV VL
Sbjct: 241 TDAVAAQFFLKNVHPEMRAAAFELFGEQGNSAPRGNVFGELMMSLISPTKDVKEAVDWVL 300
Query: 301 KSGDDPDISLHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVP 360
DPDIS+HMRMLM++SVR ++AA+ C+ K I L + PR+V+VSDTP+ VK I
Sbjct: 301 HETGDPDISVHMRMLMSKSVRPMRAAINCLGKAINRLGIPN-PRVVIVSDTPSVVKIIKT 360
Query: 361 MLGEFAEVIHFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKRAVI 420
+ AEV+HFDY+ FRG I+ LDFR+KDWGP+PRWVAFVDFFLA RAK AVI
Sbjct: 361 NISTIAEVLHFDYKLFRGDIAQRGRGLPMLDFRIKDWGPAPRWVAFVDFFLACRAKHAVI 420
Query: 421 SGAHRRVGTTYAQLIAALAAAHNLDNLGKNSTGSDFFFLSSFQSNLLREGLKNQVGWGHI 480
SGA+RRVGTTYAQL+AALAAA++L + S+ S F FLSSFQSNLL +GLKNQVGWGH+
Sbjct: 421 SGANRRVGTTYAQLVAALAAANSLKD---GSSNSSFAFLSSFQSNLLADGLKNQVGWGHV 480
Query: 481 WNRFAGPLSCPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDS 540
WNR+AGPLSCP QPNQCA TPL PP WWDG+WQSPIPRD +R+ +G+ LSGFGT++ED
Sbjct: 481 WNRYAGPLSCPKQPNQCAFTPLAPPGWWDGIWQSPIPRDTRRLAAFGIELSGFGTVNEDR 540
Query: 541 LRSFCNAKKNVVRTIPFI 552
++C+AKK V T+ I
Sbjct: 541 FHAYCSAKKEYVSTVTII 548
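Each HSP summary line above reports identity and positives as a count over the alignment length, followed by the same value as a percentage. The following is a small sketch, not part of the BLAST tooling, that recomputes those percentages from the counts. The regular expression and the function name `check_hsp_line` are assumptions made for illustration.

```python
import re

# Matches fields of the form "Label = matches/length (pct%)".
# "Query Frame = 0" has no slash, so it is skipped.
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_line(line: str) -> dict[str, float]:
    """Recompute each percentage on an HSP summary line from its counts."""
    recomputed = {}
    for label, matches, length, reported in HSP_FIELD.findall(line):
        pct = round(100.0 * int(matches) / int(length), 2)
        # Raise if the recomputed value disagrees with the reported one.
        if abs(pct - float(reported)) >= 0.01:
            raise ValueError(f"{label}: computed {pct}, reported {reported}")
        recomputed[label] = pct
    return recomputed


# Summary line of the AT3G26950.1 hit (TAIR 10).
line = "Identity = 341/558 (61.11%), Positives = 415/558 (74.37%), Query Frame = 0"
print(check_hsp_line(line))
```

The denominator is the alignment length including gap columns, which is why it can exceed the 552-residue query (for example 558 for the TAIR hit and 562 for the Cucurbita argyrosperma hits).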
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_023532982.1 | 0.0 | 99.46 | uncharacterized protein LOC111794994 [Cucurbita pepo subsp. pepo]
XP_022922653.1 | 0.0 | 98.91 | uncharacterized protein LOC111430593 [Cucurbita moschata]
KAG7032713.1 | 0.0 | 97.15 | hypothetical protein SDJN02_06763 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022990922.1 | 0.0 | 97.46 | uncharacterized protein LOC111487669 [Cucurbita maxima]
KAG6602018.1 | 0.0 | 95.20 | hypothetical protein SDJN03_07251, partial [Cucurbita argyrosperma subsp. sorori...
Match Name | E-value | Identity | Description
A0A6J1E7F2 | 0.0 | 98.91 | uncharacterized protein LOC111430593 OS=Cucurbita moschata OX=3662 GN=LOC1114305...
A0A6J1JK88 | 0.0 | 97.46 | uncharacterized protein LOC111487669 OS=Cucurbita maxima OX=3661 GN=LOC111487669...
A0A6J1JUE3 | 0.0 | 88.97 | uncharacterized protein LOC111488989 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FF37 | 0.0 | 88.97 | uncharacterized protein LOC111444894 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JQR8 | 0.0 | 88.81 | uncharacterized protein LOC111488989 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity | Description
AT3G26950.1 | 4.9e-198 | 61.11 | unknown protein; Has 27 Blast hits to 27 proteins in 8 species: Archae - 0; Bact...