Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GCATCGATATTCCATCCCTGATTTTGAATCAGAGTAAGTCCATCTCTCCACATTCCACTCTTTTTCTCTCCCACAGTTGTTTCTCTCTGCTATTTCTGTGTCAACTCATTTCTGTGTCGACTTTTTGGTCTCTGAGAGTTGAAATTTCTTCAACTTAGCTTACTGAACTCAAAATTCTGGATTCCCCCATTTTAGCTTTCACTTCTCTCTGAGCATTCACTCCATTAATGGCTCAAAAACACTTACACGAGCTTCTTAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATAGCCGACAGACGTGTTCTTAAGCGCCCTTCCCCCAAATCTCACCTTCTTCACCTCAATAAACGAAAACCCATTTCCCATTTCGCTGATTTTCCGGCGAGTTTTTGTAAGGGTGCTTGTTTTTTATCGTTTAATGATTCTCCTGATCTTAGAAACCCTTCGCCGCTCTTTCAATTTCAGTCTCCGGTGAAGAGTCCTTGCCGGAATTCCAATGCTGTGTTCCTCCATGTTCCGGCTACAACGGCGGGGCTTCTCCTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCCGCGAGATCGAATGGATTTGGGCTTTTGGGTTCTTTTCTTAAGCGGTTTACTCATCGTGGCCGTTCTCGGAAGCGGGAGATTGACGGTGGTTGCCGGAAAAATGACCCCTGCGACGACCACCTATTGCCGGCGAAAGTGGCGATTAACGAGAACAAGAACGACTCTGTTTCTCGGCAGAGTAATGTAACGAGCTCGGACTTCTGCGATAGCCCTTTTCGATTTGTGCTTCAATCAAGTCCCTCCGCCGGTCACCGGACGCCGGAATTTTCTTCTCCGCCGTCTTCTCCGGCTCGACACGACCATCAGGTTGAAATACTCATTCTATTACTGTATTTTTGCAATTTTATGACATGGGTTCATCGGAAAAACAGCAACTGTCGGCAATTTTGTGTAAATTTTCTGAAATTACCGCCGGAATTAGTTTCAGGCGTCAAAACACGACAACCCCACCAAAACGACAGTGGGTTATGAGGAACAAACTGCTAAATCCCTCATTTTCTATTCCACACTCTTCTTTTTTCCCGGAAAATTTTTGACCTTTCAAGAAAAGGACAAACCCCTTTTTGTATTTTCCTTTGTATTTTAATTCCCATAGGTACCGACATCCTCCTTATAATTAAACATTTCCCCCTTTGTTGGTTGGCATAATTATAAGAACAGAAAAAGAACCAACTTTTTCACTGTTGTTTGCCTTGTTTTCTTTGACAAATTTACAGGTCAATGACGTTGAGAGCTTGAAGAAATTGCCTGTTCAGGATGAGGAGGAAGAGAAAGAACAAAGCAGTCCTGTGTCTGTGTTGGATCCTCCGTTCCAGGACGACGAAGAGGGTCGCTATGAGGACGGTGAGGACGACGACGATTACAAAATGGAGCGCAGCTACGCCATTGTCGAAAGTAAGCATGATTCAATTCAATTGCTAATCATATTAGGGAAATATATACTTGTTTTGTCTGTTGCATTGATTCGAGTTTGTTGTTCGATCGTTATCGGTGACATTTGCGTATAGATACTGTCTGAATTGATAGTTTTTACTGATTAGACATGGTTAATAGAGCTTTAAAACTCATGAAGCTTGTTGCTGTACTAGATATGAACACATGAATCCTTGGTGGGTTCCATTGGGAAGATTTCAATTATCAGATGGGTGGGGGTGAAACCAAAACAAGCTTTTGAATTTAGGTAAAATCTGTAATAGAACAAAGGGGAAAGAGGATAGACGATGACAATACGAGGAAATCTCTCCTCGGCATTTGTTTTCTACAACAGCCCAAGCCTACCACTAGTAGATATTGTCTGCTTTGACCCGTTATGTATCGCTGTCAGCCTCACAGTTTTAAAACATGTCTACTAGGGGGAGGTTTCCACACCTTTATAAATAACACTCCGTTCCCCTCTCTAACCGATGTGGGATC
TCATAATTCACTCCCCTTGAGGACTAGCCTCCTTGCTCGTATACCCCCCGGTGTCTAGCTCTAATATCGTTTGTAACAGCCCAAGCCCATTACTAATAGATATTGTTCACTATGACCCGTTACATATCGCCGTCAACCTCACGATTTTAAAACGCATCTACTAGGGAGAGATTTCCACACCCTTATAAAAAATACTTCATTCACCTCTCCAACCGACACCGACGTGGGATCTCACAATCCACCCCTTTGGGTACCAGTCTCCTCGTTGGCACACCGCCCAATATCTGTCACTAATACTTCATTCCCCCCTCCCCAATCGATGTAGGATCTCACAATCCACCTCCCTTGGGGACCAACCTCCTCACTAGCACACTGCTCGATGTTTGTCACTGATACCATTTGTAATAGCCCAAGTTCACCACTAGTAGATATTGTCTGTTTTGGCCCGTTACGTATCCTCATCAGCCTCACGGTTTTAAAACGTGTCTACGAGGGGGAGGTGTCCACACCCTTATAAAGAATACTTTGTTCTCCTCTGTAACCGATGTGAGATCTCAAATTTTCCGCCCAAATGACCTATTGTGGCTCTCGTATTTTCTTGTTCTTCAAAACTTTAAATCTTACCTACATTTTGCAGCAAGAGAAGTGTTATAATTATGGGAGTCGTCTTCAATTTATTAGGCTATTCAATCATTGAACCAGTTCCTTCATGGTTTTGGAAGGGCATGGTCGTTTTCTATACCTCTGGACTAGGTCCCCCCCCACCCCCTACCGGTCCACTCATCTTTTGAACTGTTTGAGCATTCAATACGGGATTTTAAACCATTTCAGCAATGAGTAGCTCTAGTAATTATACGCGAATTGACCGATTCGTTCTTACCAAAAACGAATCATTTAGTCGAACTTCGCTACCTTCACCTGTTTCTAATCTGTTCATGTTAACCTTGTGGTGTTTTTCAGAGGCAAAGCATCAGCTACTCAAAAAGCTTCGGAGATTCGAGAGACTAGCAGAGCTAGACCCGGTAGAACTCGAGACGTTTCTACTAGAAGATGAGGAAGGCGAACTTGACGACAACGACATTGATCATCTCAAGGAAGAAGAGTGCGAAAGCCATAACTTAGATCGGTCTAACAACGAAAAGGACATGAAACAACACGGCATAGATGGCAATGTCGAGAGAGTTTACATGAGATGGGATTTGTGGAAAGAGGTGGAGTCGAGCGCCATCGACGTGATGGCGGAGGAAGATTTGAGAGCGGAGGTTGACGACGGGTGGAAGAGAAATGGGGAGGAAAGAGGAGACATAGCCATAGAAATAGAGGTTGAGATCTTCAGGTTGCTGGTGGAGGAAATGCAAACAGAAGTAGATTGGTTCATTAAGTGATGGAAATGATTAAACTTCATAGATAATATTTAAATTTGCATAAATAATGTTTAGATTATAAATTAGATTAGGAATATAATCTGACTTTAAGAGAATAGGCTTAAGTTTAACTTTACCTCCCTCTTTAAGTACAAAGATATTAGCACTATGATTTGTAAATTTATTATCCTTCCATATATTATCATCATCTACCTTTTCAAAT
mRNA sequence
GCATCGATATTCCATCCCTGATTTTGAATCAGAGTAAGTCCATCTCTCCACATTCCACTCTTTTTCTCTCCCACAGTTGTTTCTCTCTGCTATTTCTGTGTCAACTCATTTCTGTGTCGACTTTTTGGTCTCTGAGAGTTGAAATTTCTTCAACTTAGCTTACTGAACTCAAAATTCTGGATTCCCCCATTTTAGCTTTCACTTCTCTCTGAGCATTCACTCCATTAATGGCTCAAAAACACTTACACGAGCTTCTTAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATAGCCGACAGACGTGTTCTTAAGCGCCCTTCCCCCAAATCTCACCTTCTTCACCTCAATAAACGAAAACCCATTTCCCATTTCGCTGATTTTCCGGCGAGTTTTTGTAAGGGTGCTTGTTTTTTATCGTTTAATGATTCTCCTGATCTTAGAAACCCTTCGCCGCTCTTTCAATTTCAGTCTCCGGTGAAGAGTCCTTGCCGGAATTCCAATGCTGTGTTCCTCCATGTTCCGGCTACAACGGCGGGGCTTCTCCTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCCGCGAGATCGAATGGATTTGGGCTTTTGGGTTCTTTTCTTAAGCGGTTTACTCATCGTGGCCGTTCTCGGAAGCGGGAGATTGACGGTGGTTGCCGGAAAAATGACCCCTGCGACGACCACCTATTGCCGGCGAAAGTGGCGATTAACGAGAACAAGAACGACTCTGTTTCTCGGCAGAGTAATGTAACGAGCTCGGACTTCTGCGATAGCCCTTTTCGATTTGTGCTTCAATCAAGTCCCTCCGCCGGTCACCGGACGCCGGAATTTTCTTCTCCGCCGTCTTCTCCGGCTCGACACGACCATCAGGTCAATGACGTTGAGAGCTTGAAGAAATTGCCTGTTCAGGATGAGGAGGAAGAGAAAGAACAAAGCAGTCCTGTGTCTGTGTTGGATCCTCCGTTCCAGGACGACGAAGAGGGTCGCTATGAGGACGGTGAGGACGACGACGATTACAAAATGGAGCGCAGCTACGCCATTGTCGAAAAGGCAAAGCATCAGCTACTCAAAAAGCTTCGGAGATTCGAGAGACTAGCAGAGCTAGACCCGGTAGAACTCGAGACGTTTCTACTAGAAGATGAGGAAGGCGAACTTGACGACAACGACATTGATCATCTCAAGGAAGAAGAGTGCGAAAGCCATAACTTAGATCGGTCTAACAACGAAAAGGACATGAAACAACACGGCATAGATGGCAATGTCGAGAGAGTTTACATGAGATGGGATTTGTGGAAAGAGGTGGAGTCGAGCGCCATCGACGTGATGGCGGAGGAAGATTTGAGAGCGGAGGTTGACGACGGGTGGAAGAGAAATGGGGAGGAAAGAGGAGACATAGCCATAGAAATAGAGGTTGAGATCTTCAGGTTGCTGGTGGAGGAAATGCAAACAGAAGTAGATTGGTTCATTAAGTGATGGAAATGATTAAACTTCATAGATAATATTTAAATTTGCATAAATAATGTTTAGATTATAAATTAGATTAGGAATATAATCTGACTTTAAGAGAATAGGCTTAAGTTTAACTTTACCTCCCTCTTTAAGTACAAAGATATTAGCACTATGATTTGTAAATTTATTATCCTTCCATATATTATCATCATCTACCTTTTCAAAT
Coding sequence (CDS)
ATGGCTCAAAAACACTTACACGAGCTTCTTAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATAGCCGACAGACGTGTTCTTAAGCGCCCTTCCCCCAAATCTCACCTTCTTCACCTCAATAAACGAAAACCCATTTCCCATTTCGCTGATTTTCCGGCGAGTTTTTGTAAGGGTGCTTGTTTTTTATCGTTTAATGATTCTCCTGATCTTAGAAACCCTTCGCCGCTCTTTCAATTTCAGTCTCCGGTGAAGAGTCCTTGCCGGAATTCCAATGCTGTGTTCCTCCATGTTCCGGCTACAACGGCGGGGCTTCTCCTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCCGCGAGATCGAATGGATTTGGGCTTTTGGGTTCTTTTCTTAAGCGGTTTACTCATCGTGGCCGTTCTCGGAAGCGGGAGATTGACGGTGGTTGCCGGAAAAATGACCCCTGCGACGACCACCTATTGCCGGCGAAAGTGGCGATTAACGAGAACAAGAACGACTCTGTTTCTCGGCAGAGTAATGTAACGAGCTCGGACTTCTGCGATAGCCCTTTTCGATTTGTGCTTCAATCAAGTCCCTCCGCCGGTCACCGGACGCCGGAATTTTCTTCTCCGCCGTCTTCTCCGGCTCGACACGACCATCAGGTCAATGACGTTGAGAGCTTGAAGAAATTGCCTGTTCAGGATGAGGAGGAAGAGAAAGAACAAAGCAGTCCTGTGTCTGTGTTGGATCCTCCGTTCCAGGACGACGAAGAGGGTCGCTATGAGGACGGTGAGGACGACGACGATTACAAAATGGAGCGCAGCTACGCCATTGTCGAAAAGGCAAAGCATCAGCTACTCAAAAAGCTTCGGAGATTCGAGAGACTAGCAGAGCTAGACCCGGTAGAACTCGAGACGTTTCTACTAGAAGATGAGGAAGGCGAACTTGACGACAACGACATTGATCATCTCAAGGAAGAAGAGTGCGAAAGCCATAACTTAGATCGGTCTAACAACGAAAAGGACATGAAACAACACGGCATAGATGGCAATGTCGAGAGAGTTTACATGAGATGGGATTTGTGGAAAGAGGTGGAGTCGAGCGCCATCGACGTGATGGCGGAGGAAGATTTGAGAGCGGAGGTTGACGACGGGTGGAAGAGAAATGGGGAGGAAAGAGGAGACATAGCCATAGAAATAGAGGTTGAGATCTTCAGGTTGCTGGTGGAGGAAATGCAAACAGAAGTAGATTGGTTCATTAAGTGA
Protein sequence
MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKNDSVSRQSNVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDWFIK
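The protein sequence is the CDS read three bases at a time: its first five codons (ATG GCT CAA AAA CAC) give MAQKH, and the final TGA is a stop codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and not part of this database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code (assumed: NCBI translation table 1),
# listed in TCAG order for the first, second and third codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 0 and stop at the first stop codon.

    Hypothetical helper for illustration; codons containing ambiguous
    bases (for example N) raise KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)
```

Passing the full CDS string listed above to `translate` should give the listed protein sequence if the annotation uses the standard code.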
Homology
BLAST of Cp4.1LG08g10310 vs. NCBI nr
Match:
XP_023539063.1 (uncharacterized protein LOC111799817 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 825 bits (2132), Expect = 1.98e-301
Identity = 422/422 (100.00%), Positives = 422/422 (100.00%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKG 60
MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKG
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKG 60
Query: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA
Sbjct: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
Query: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKNDSVSRQS 180
ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKNDSVSRQS
Sbjct: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKNDSVSRQS 180
Query: 181 NVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE 240
NVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE
Sbjct: 181 NVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE 240
Query: 241 KEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAEL 300
KEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAEL
Sbjct: 241 KEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAEL 300
Query: 301 DPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYMRW 360
DPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYMRW
Sbjct: 301 DPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYMRW 360
Query: 361 DLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDWF 420
DLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDWF
Sbjct: 361 DLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDWF 420
Query: 421 IK 422
IK
Sbjct: 421 IK 422
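Each hit opens with an HSP line (bit score, raw score in parentheses, E-value) and an identity line (identical residues over alignment length). The sketch below pulls those numbers out of the two lines as they are printed on this page; the function name `parse_hsp` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Patterns written against the line layout shown on this page (an assumption;
# other BLAST report formats print these fields differently).
HSP_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
IDENT_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+)")


def parse_hsp(score_line, identity_line):
    """Return bit score, raw score, E-value and percent identity for one HSP."""
    s = HSP_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("line does not match the expected HSP layout")
    identical, aligned = int(i.group(1)), int(i.group(2))
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": identical,
        "aligned": aligned,
        # Percent identity is recomputed from the counts, not read from the text.
        "identity_pct": round(100 * identical / aligned, 2),
    }
```

Recomputing the percentage from the counts gives a quick consistency check against the value printed in parentheses.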
BLAST of Cp4.1LG08g10310 vs. NCBI nr
Match:
XP_022945267.1 (uncharacterized protein LOC111449564 [Cucurbita moschata] >KAG7028088.1 hypothetical protein SDJN02_09268 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 775 bits (2002), Expect = 1.06e-281
Identity = 403/422 (95.50%), Positives = 410/422 (97.16%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKG 60
MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHF+DFPASFCKG
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKG 60
Query: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA
Sbjct: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
Query: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKNDSVSRQS 180
ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCR+NDP DDHLLP INE DSVSRQS
Sbjct: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP---INEK--DSVSRQS 180
Query: 181 NVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE 240
NVTSSDFC+SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE
Sbjct: 181 NVTSSDFCESPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE 240
Query: 241 KEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAEL 300
KEQSSPVSVLDPPF+DDEEGRYEDGEDDDDY+MERSYAIVEKAKHQLLKKLRRFERLAEL
Sbjct: 241 KEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAEL 300
Query: 301 DPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYMRW 360
DPVELETFLL+DEEGELDD+DIDHLKEEECESHN DRSNNEKDMKQHGIDGNVERVYMRW
Sbjct: 301 DPVELETFLLKDEEGELDDDDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRW 360
Query: 361 DLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDWF 420
DLWKEVESSAIDVMA EDLRAEVDDGWKRNGE RGDIAIEIEVEIFRLLVEEMQTEVD F
Sbjct: 361 DLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCF 417
Query: 421 IK 422
IK
Sbjct: 421 IK 417
BLAST of Cp4.1LG08g10310 vs. NCBI nr
Match:
KAG6596552.1 (hypothetical protein SDJN03_09732, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 773 bits (1996), Expect = 9.42e-281
Identity = 402/421 (95.49%), Positives = 408/421 (96.91%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKG 60
MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHF+DFPASFCKG
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKG 60
Query: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA
Sbjct: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
Query: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKNDSVSRQS 180
ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCR+NDP DDH LP INE DSVSRQS
Sbjct: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHQLPP---INEK--DSVSRQS 180
Query: 181 NVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE 240
NVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE
Sbjct: 181 NVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE 240
Query: 241 KEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAEL 300
KEQSSPVSVLDPPF+DDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAEL
Sbjct: 241 KEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAEL 300
Query: 301 DPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYMRW 360
DPVELETFLL+DEEGELDD+DIDHLKEEECESHN DRSNNEKDMKQHGIDGNVERVYMRW
Sbjct: 301 DPVELETFLLKDEEGELDDDDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRW 360
Query: 361 DLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDWF 420
DLWKEVESSAIDVMA EDLRAEVDDGWKRNGE RGD+AIEIEVEIFRLLVEEMQTEVD F
Sbjct: 361 DLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDMAIEIEVEIFRLLVEEMQTEVDCF 416
BLAST of Cp4.1LG08g10310 vs. NCBI nr
Match:
XP_023005858.1 (uncharacterized protein LOC111498735 [Cucurbita maxima])
HSP 1 Score: 741 bits (1914), Expect = 2.83e-268
Identity = 391/424 (92.22%), Positives = 404/424 (95.28%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKG 60
MAQKHLHELLKEDQEPFLLTNFIA+RRVLKRPSPKSHLLHLNK KPISHFADFPASFCKG
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIANRRVLKRPSPKSHLLHLNKPKPISHFADFPASFCKG 60
Query: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
ACFLSFN SPDLRNPSPLFQFQSPVKSPCRNSNA+FLHVPATTA LLLEAALRIQKQST
Sbjct: 61 ACFLSFNHSPDLRNPSPLFQFQSPVKSPCRNSNAMFLHVPATTARLLLEAALRIQKQSTP 120
Query: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKN--DSVSR 180
ARSNGFGLLGSFLKRFT+RGRSRKREIDGGCR+NDP AK+AINEN+N DSVSR
Sbjct: 121 ARSNGFGLLGSFLKRFTYRGRSRKREIDGGCRRNDPS-----TAKMAINENENGNDSVSR 180
Query: 181 QSNVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEE 240
QSNVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPAR DHQVNDVESLKKLPVQDEE
Sbjct: 181 QSNVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARDDHQVNDVESLKKLPVQDEE 240
Query: 241 EEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLA 300
EEKEQSSPVSVLDPPF+DDEEGRYEDGEDDDDYKMERSYAIV+KAKHQLLKKLRRFERLA
Sbjct: 241 EEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYKMERSYAIVQKAKHQLLKKLRRFERLA 300
Query: 301 ELDPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYM 360
ELDPVELETFLL+DEEG+LDD D DHL+EEEC+SHN DRSNNEKDMKQHGI+ NVERVYM
Sbjct: 301 ELDPVELETFLLKDEEGKLDD-DGDHLEEEECKSHNFDRSNNEKDMKQHGIESNVERVYM 360
Query: 361 RWDLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVD 420
RWDLWKEVESSAIDVMAEEDLRAEVD GWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVD
Sbjct: 361 RWDLWKEVESSAIDVMAEEDLRAEVDVGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVD 418
Query: 421 WFIK 422
FIK
Sbjct: 421 CFIK 418
BLAST of Cp4.1LG08g10310 vs. NCBI nr
Match:
XP_038903007.1 (uncharacterized protein LOC120089713 [Benincasa hispida])
HSP 1 Score: 545 bits (1405), Expect = 3.95e-190
Identity = 316/471 (67.09%), Positives = 354/471 (75.16%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFADFPASFCK 60
MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPS KSH HLN KPISH +DFPA FC+
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHF-HLNNPKPISHSSDFPAKFCR 60
Query: 61 GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST 120
ACF SFN SPDL N SPLF FQSPVK+PCRN N +FLHVPA TAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFNHSPDLINSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARS------NGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAI--NEN 180
ARS NG G+LGSFLKR THRGR+RKREIDG RKNDP D LPAK+AI NEN
Sbjct: 121 VARSKSLGKSNGLGVLGSFLKRLTHRGRARKREIDGDGRKNDPRDGPPLPAKMAIEENEN 180
Query: 181 KNDSVSRQSNVTSSDFCDS-----PFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDV 240
+NDSVSR SNVT DFCDS PFRFVLQSSPS GH+TPE +SP SSPAR DHQ NDV
Sbjct: 181 ENDSVSRLSNVTGFDFCDSNLCDSPFRFVLQSSPSPGHQTPELASPASSPARLDHQANDV 240
Query: 241 ESLKKLPVQDEEEEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQ 300
E LKKLPV+DEEEEKEQSSPVSVLDPPF+DD+EG YEDGED+DDY +ERS+AIV++AKHQ
Sbjct: 241 EGLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQQAKHQ 300
Query: 301 LLKKLRRFERLAELDPVELETFLL--EDEEGELDDNDIDHLKEEECESHNLDRSNNEKDM 360
LLKKLRRFERLAELDPVELETFLL EDE+ + DD+DIDHLKEEE + +KD+
Sbjct: 301 LLKKLRRFERLAELDPVELETFLLKDEDEDEDEDDDDIDHLKEEE---------DYKKDI 360
Query: 361 KQHGIDGN---------------------------------------VERVYMRWDLWKE 416
K+H I+ N ++ +Y+R DLWK
Sbjct: 361 KEHDIEANDSSRFQIPHRPARDMTTLVCNLVTEEERDLVVIEKREEMMKGMYVRSDLWKR 420
BLAST of Cp4.1LG08g10310 vs. ExPASy TrEMBL
Match:
A0A6J1G0G0 (uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC111449564 PE=4 SV=1)
HSP 1 Score: 775 bits (2002), Expect = 5.15e-282
Identity = 403/422 (95.50%), Positives = 410/422 (97.16%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKG 60
MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHF+DFPASFCKG
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKG 60
Query: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA
Sbjct: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
Query: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKNDSVSRQS 180
ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCR+NDP DDHLLP INE DSVSRQS
Sbjct: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP---INEK--DSVSRQS 180
Query: 181 NVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE 240
NVTSSDFC+SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE
Sbjct: 181 NVTSSDFCESPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEEEE 240
Query: 241 KEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLAEL 300
KEQSSPVSVLDPPF+DDEEGRYEDGEDDDDY+MERSYAIVEKAKHQLLKKLRRFERLAEL
Sbjct: 241 KEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAEL 300
Query: 301 DPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYMRW 360
DPVELETFLL+DEEGELDD+DIDHLKEEECESHN DRSNNEKDMKQHGIDGNVERVYMRW
Sbjct: 301 DPVELETFLLKDEEGELDDDDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRW 360
Query: 361 DLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDWF 420
DLWKEVESSAIDVMA EDLRAEVDDGWKRNGE RGDIAIEIEVEIFRLLVEEMQTEVD F
Sbjct: 361 DLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCF 417
Query: 421 IK 422
IK
Sbjct: 421 IK 417
BLAST of Cp4.1LG08g10310 vs. ExPASy TrEMBL
Match:
A0A6J1L3C1 (uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735 PE=4 SV=1)
HSP 1 Score: 741 bits (1914), Expect = 1.37e-268
Identity = 391/424 (92.22%), Positives = 404/424 (95.28%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASFCKG 60
MAQKHLHELLKEDQEPFLLTNFIA+RRVLKRPSPKSHLLHLNK KPISHFADFPASFCKG
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIANRRVLKRPSPKSHLLHLNKPKPISHFADFPASFCKG 60
Query: 61 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQSTA 120
ACFLSFN SPDLRNPSPLFQFQSPVKSPCRNSNA+FLHVPATTA LLLEAALRIQKQST
Sbjct: 61 ACFLSFNHSPDLRNPSPLFQFQSPVKSPCRNSNAMFLHVPATTARLLLEAALRIQKQSTP 120
Query: 121 ARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENKN--DSVSR 180
ARSNGFGLLGSFLKRFT+RGRSRKREIDGGCR+NDP AK+AINEN+N DSVSR
Sbjct: 121 ARSNGFGLLGSFLKRFTYRGRSRKREIDGGCRRNDPS-----TAKMAINENENGNDSVSR 180
Query: 181 QSNVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVESLKKLPVQDEE 240
QSNVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPAR DHQVNDVESLKKLPVQDEE
Sbjct: 181 QSNVTSSDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARDDHQVNDVESLKKLPVQDEE 240
Query: 241 EEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKHQLLKKLRRFERLA 300
EEKEQSSPVSVLDPPF+DDEEGRYEDGEDDDDYKMERSYAIV+KAKHQLLKKLRRFERLA
Sbjct: 241 EEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYKMERSYAIVQKAKHQLLKKLRRFERLA 300
Query: 301 ELDPVELETFLLEDEEGELDDNDIDHLKEEECESHNLDRSNNEKDMKQHGIDGNVERVYM 360
ELDPVELETFLL+DEEG+LDD D DHL+EEEC+SHN DRSNNEKDMKQHGI+ NVERVYM
Sbjct: 301 ELDPVELETFLLKDEEGKLDD-DGDHLEEEECKSHNFDRSNNEKDMKQHGIESNVERVYM 360
Query: 361 RWDLWKEVESSAIDVMAEEDLRAEVDDGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVD 420
RWDLWKEVESSAIDVMAEEDLRAEVD GWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVD
Sbjct: 361 RWDLWKEVESSAIDVMAEEDLRAEVDVGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVD 418
Query: 421 WFIK 422
FIK
Sbjct: 421 CFIK 418
BLAST of Cp4.1LG08g10310 vs. ExPASy TrEMBL
Match:
A0A6J1CUE0 (uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014376 PE=4 SV=1)
HSP 1 Score: 524 bits (1350), Expect = 3.74e-182
Identity = 304/460 (66.09%), Positives = 351/460 (76.30%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFADFPASFCK 60
M QKHLHELLKEDQEPF+LTNFIADRR +LKRPSPKS+L HL +RKPIS DFP FCK
Sbjct: 2 MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNL-HLKRRKPISETLDFPGKFCK 61
Query: 61 GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST 120
ACF SF++SPDLR SPLF+FQSPV RN NA+FLHVPA TAG+LLEAALRIQKQST
Sbjct: 62 SACFFSFHESPDLRK-SPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQST 121
Query: 121 AARS------NGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENK- 180
AARS NG GLLGSFLKR THRGR+RKREIDG R+ND LPAK+AI EN+
Sbjct: 122 AARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENED 181
Query: 181 -----NDSVSRQSNVTS-----SDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQ 240
N SVS Q+N+TS S+FCDSPFRFVLQSSPS+GHRTPEFSSP +SP R DHQ
Sbjct: 182 ENVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ 241
Query: 241 VNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEK 300
NDVESLKKLPV+DEEEEKEQSSPVS+LDPPF+DD+EG YEDGED+D Y +ERSY IV+K
Sbjct: 242 DNDVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQK 301
Query: 301 AKHQLLKKLRRFERLAELDPVELETFLLEDEEGELDDND-IDHLKEEECESHNLDRSNNE 360
AKHQLLKKLRRFE+LAELDPVELE+FLL+ EE ELDD+D IDHLKEEE ESHN ++ + E
Sbjct: 302 AKHQLLKKLRRFEKLAELDPVELESFLLKGEEDELDDDDDIDHLKEEEYESHNFEQHDVE 361
Query: 361 K-------------------DMKQHGIDGNVER----VYMRWDLWKEVESSAIDVMAEED 418
+ + + N E VY+R DLWK V+S+AID +D
Sbjct: 362 ANGSSSFQIPHRLVRNRITGEQRDQAVTDNREEMTKGVYVRSDLWKRVDSNAIDATVGQD 421
BLAST of Cp4.1LG08g10310 vs. ExPASy TrEMBL
Match:
A0A5D3DNQ5 (Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003580 PE=4 SV=1)
HSP 1 Score: 512 bits (1318), Expect = 3.82e-177
Identity = 310/472 (65.68%), Positives = 340/472 (72.03%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFADFPASFC 60
MA+K HLHELLK+DQEPFLL+NFI DRR +LKR S KSH HL KPISH DF A FC
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPISHSPDFSAKFC 60
Query: 61 KGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQS 120
+ CF SFN SPDL N SPLF FQSPVK+PCR+ N VF HVPA TAGLLLEAALRIQKQS
Sbjct: 61 RSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQS 120
Query: 121 TAARS------NGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENK 180
TAARS NG GLLGSFLKR THR RSRKREI G R NDP D LPAK+AI EN+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE 180
Query: 181 --NDSVSRQSNVTSSDFC-----DSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVND 240
NDSV R SNVT DFC DSPFRFVLQSS S GHRTPE SSP SSPAR DHQ ND
Sbjct: 181 KENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAND 240
Query: 241 VESLKKLPVQDEEEEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKH 300
VESL+KLP +DEEEEKEQSSPVSVLDPPF+DD+EG +EDGED+DDY +ERS+AIV+KAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKH 300
Query: 301 QLLKKLRRFERLAELDPVELETFLLEDE---EGELDD-NDIDHLKEEECESHNLDRSNNE 360
QLLKKLRRFERLAELDP+ELETFLL DE E EL D +DIDHLKEE E E
Sbjct: 301 QLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY--------E 360
Query: 361 KDMKQHGIDGN-------------------------------------VERVYMRWDLWK 416
KD+KQH +GN ++RVYMR DLWK
Sbjct: 361 KDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWK 420
BLAST of Cp4.1LG08g10310 vs. ExPASy TrEMBL
Match:
A0A0A0LAR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1)
HSP 1 Score: 510 bits (1314), Expect = 1.78e-176
Identity = 307/476 (64.50%), Positives = 341/476 (71.64%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFADFPASFC 60
MA+K HLHELLK+DQEPFLL+NFI DRR +LKR S KSH HL KPI H +DF A FC
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPIPHSSDFSAKFC 60
Query: 61 KGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQS 120
+ CF SFN SPDL N SP F FQSPVK+PCRN N VF HVPA TAGLLLEAALRIQKQS
Sbjct: 61 RSTCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQS 120
Query: 121 TAARS------NGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENK 180
TAARS NG GLLGSFLKR THR R+RKREI G R NDP D LPAK+AI EN+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENE 180
Query: 181 --NDSVSRQSNVTSSDFC-----DSPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVND 240
NDSV R SNVT DFC DSPFRFVLQSSPS GHRTPE SSP SSPAR DHQ ND
Sbjct: 181 TENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAND 240
Query: 241 VESLKKLPVQDEEEEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAKH 300
VESL+KLP +DEEEEKEQSSPVSVLDPPF+DD+EG +EDGED+DDY +ERS+AIV+KAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKH 300
Query: 301 QLLKKLRRFERLAELDPVELETFLLEDE---EGELDD---NDIDHLKEEECESHNLDRSN 360
QLLKKLRRFERLAELDP+ELETFLL DE E EL D +DIDHLKEE E +
Sbjct: 301 QLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEE-VEQY------ 360
Query: 361 NEKDMKQHGIDGN---------------------------------------VERVYMRW 416
EKD+KQH +GN ++RVYMR
Sbjct: 361 -EKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMRQ 420
BLAST of Cp4.1LG08g10310 vs. TAIR 10
Match:
AT5G03670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins in 104 species: Archae - 0; Bacteria - 18; Metazoa - 333; Fungi - 60; Plants - 73; Viruses - 24; Other Eukaryotes - 192 (source: NCBI BLink). )
HSP 1 Score: 245.0 bits (624), Expect = 1.1e-64
Identity = 196/527 (37.19%), Positives = 266/527 (50.47%), Query Frame = 0
Query: 2 AQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHL--NKRKPISHFADFPASFCK 61
+Q+HL +LL+EDQEPF L ++I+DRR +H+ HL KR+PIS A P+ FC+
Sbjct: 3 SQRHLKDLLEEDQEPFQLQSYISDRRC----QINAHVTHLQVKKRRPISQNAGLPSRFCR 62
Query: 62 GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQS- 121
ACF S +SPD + SPLF+ +KSP R+ NA+F+++PA TA +LLEAA+RIQKQS
Sbjct: 63 NACFFSLRESPDPKK-SPLFE----LKSPNRSQNAIFVNIPARTASILLEAAVRIQKQSS 122
Query: 122 ------TAARSNGFGLLGSFLKRFTHRGRSRKREIDGG--------------CRKNDPCD 181
T N FG+ GS LK+ T+R +KREI GG R P
Sbjct: 123 EVSKTRTRNAGNAFGIFGSVLKKLTNR---KKREISGGKEAGRVSSSSVKDMLRWESPVV 182
Query: 182 DHLLPAKVAINENKN----------------------------------------DSVSR 241
++ K NE +N S+S
Sbjct: 183 RKIVTRKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFETSIST 242
Query: 242 QSNVTSSD----------------FCDSPFRFVLQSSPS-AGHRTPEFSSPPSSPARHDH 301
S SD FC+SPF FVLQ+ PS G RTP FSSP +SP H
Sbjct: 243 SSRSNGSDEFAMMMNGQDLSEDKRFCESPFHFVLQTMPSNGGFRTPNFSSPAASPRHDCH 302
Query: 302 QVN----DVESLKKLPVQDEEEEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSY 361
++ +VE LKKL +++EEEEKEQSSPVSVLDPPFQDD+E + DD + S+
Sbjct: 303 EMEKESYEVEKLKKLEMEEEEEEKEQSSPVSVLDPPFQDDDEDIHM-----DDNNIPSSF 362
Query: 362 AIVEKAKHQLLKKLRRFERLAELDPVELETFLLEDEEGELDDNDIDHLKE---------- 418
V+KAKH LL+KL RFE+LA LDP+ELE + + E E ++ + + +K
Sbjct: 363 RSVQKAKHLLLQKLCRFEQLAGLDPMELEKRMSDQETEEEEEEEEEEMKSLYHCEIITQR 422
BLAST of Cp4.1LG08g10310 vs. TAIR 10
Match:
AT2G36420.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606 proteins in 440 species: Archae - 8; Bacteria - 365; Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses - 212; Other Eukaryotes - 4176 (source: NCBI BLink). )
HSP 1 Score: 203.8 bits (517), Expect = 2.8e-52
Identity = 165/454 (36.34%), Positives = 239/454 (52.64%), Query Frame = 0
Query: 3 QKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFADFPASF--CKG 62
+KHLHE L++DQEPF L ++I + R S + + KRK + A FP C+
Sbjct: 7 KKHLHEFLEDDQEPFHLNHYIGNLRSQMGCSD----MRVKKRKS-DNVATFPPGLFSCEN 66
Query: 63 ACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST- 122
+CF + + SPD R SPLF+ +SP K R+ VFL +PA TA +LL+AA RIQKQ +
Sbjct: 67 SCFFAAHKSPDPRK-SPLFELRSPGKKKIRDGR-VFLQIPARTAAILLDAAARIQKQQSE 126
Query: 123 -------AARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRKNDPCDDHLLPAKVAINENK 182
R NGFG+ GS LK T+R + D D + + +
Sbjct: 127 KAKTNKARTRGNGFGMFGSVLKLLTYRITKPRL---------DNADGNAVSLERGSEPTS 186
Query: 183 NDSVSRQSNVTSSDFCDSPFRFVLQSSP-SAGHRTPEFSSPPSSPAR---HDHQVNDVES 242
+ R ++ FC+SPF FVLQ++P S+GH+TP F+S +SPAR D ++ ES
Sbjct: 187 SSRRERIVEISDKCFCESPFHFVLQTTPSSSGHQTPHFTSTATSPARRSTEDEDSDETES 246
Query: 243 LKKLPVQD----EEEEKEQSSPVSVLDPPFQDDEEGRYEDGEDDDDYKMERSYAIVEKAK 302
L+K+ Q+ EEE+KEQ SPVSVLDP +++E+ + E D + S+ IV++AK
Sbjct: 247 LEKVRGQEEEDKEEEDKEQCSPVSVLDPLEEEEEDEDHHQHEPDPPNNLSCSFEIVQRAK 306
Query: 303 HQLLKKLRRFERLAELDPVELETFLLED--------EEGELDDN-----------DIDH- 362
+LLKKLRRFE+LA LDPVELE + E+ EE E DDN D+D
Sbjct: 307 RRLLKKLRRFEKLAGLDPVELEGKMSEEEDEEEEEYEESEEDDNIRIYDSDEEYEDVDEA 366
Query: 363 -LKEEECESHNLDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAEEDLRAEV 418
+E C + N+E+ K R+ W + E +D + +DLR E
Sbjct: 367 MARESRCAEDEKRKKNDERQKKW--------RMMNAWRVGLGAEED-VDAVVRKDLREEA 426
The following BLAST results are available for this feature:
vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023539063.1 | 1.98e-301 | 100.00 | uncharacterized protein LOC111799817 [Cucurbita pepo subsp. pepo]
XP_022945267.1 | 1.06e-281 | 95.50 | uncharacterized protein LOC111449564 [Cucurbita moschata] >KAG7028088.1 hypothet...
KAG6596552.1 | 9.42e-281 | 95.49 | hypothetical protein SDJN03_09732, partial [Cucurbita argyrosperma subsp. sorori...
XP_023005858.1 | 2.83e-268 | 92.22 | uncharacterized protein LOC111498735 [Cucurbita maxima]
XP_038903007.1 | 3.95e-190 | 67.09 | uncharacterized protein LOC120089713 [Benincasa hispida]
vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1G0G0 | 5.15e-282 | 95.50 | uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC1114495...
A0A6J1L3C1 | 1.37e-268 | 92.22 | uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735...
A0A6J1CUE0 | 3.74e-182 | 66.09 | uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A5D3DNQ5 | 3.82e-177 | 65.68 | Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. m...
A0A0A0LAR8 | 1.78e-176 | 64.50 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1
vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G03670.1 | 1.1e-64 | 37.19 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G36420.1 | 2.8e-52 | 36.34 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...