Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CACCGTAATTTCGTTTGACGTGGACCAAATTATCCGCAAACCGCTAGTCCTCTCCGGGTCTCGTCTTCTCTCTCGGACATAACCCCAAATCCCGATTCCCAACTCCTAAGAAACCGTTTTCCTTTCTTCTCCGCCGCCGTCGCCCTATCACCGACCGGTGCATCTACATTCCGGCGATCCTTTTACCTCACTCTCTCTAGGGTTTTTCTGGTTTTCTGGTTTTCTTTTTGTATTTCGTGTTCCGCTTGAGACCTCTGCTATTTCAGTGTGAATCCCTGATTTGGGTGTTTTTGTTTTATCTTTGTGGGGAACGGCGTTGCTGTTAGGGTTTTGATTTTCCCCAGTCGAGCTGTGGGATACGGCTAGAAGATTTGAAATTTAGGGTTCTTTTTAATTTACTTGGGGGTTTGTTTTGAAATTAGGGTTTTCAGTGTGATTCAGTATGCCGAGGGGTTCGAGGCACAAATCTAGTAGACATGGTCTGAAGGATGCTAAGGAATCTTCGGACTCGGAAAATGATTCCACTCTGAGAGATCGGAAGGGCAAAGAGAGTGGGAGTAGGGTAACGAAGGACTCTGCTTCTAGTGAGAAGCGCAGATTCGAATCGAAGGATTCAAAAGAGTTCTACGGTTCAGAGAATCTGGAGATGGAAGAGCATGGACATTCGAAGCGGCGTAAGGAGAGGTATGATGAGGGAACGACTGATAGGTGGAATGGGGGAAGCAACGATGAGCTTGGTGTTCCTTCCAAAAAGTCAAAAACGTTGGTGGATTCCAAGAGCAAGAGAAGGGACGAGAGTGTGGGATTTCAGGGGGATGGCGAAGAACACAAGAAGAGTAGTGGAAAGGGCGAGGGAAGGCACCGGGAGTCGAGCCGAAAGGAGGGTAGGAGTGGTGGAGGGGAAAGAGAAAGGGAAAGGGAAAGGGAGAGGGAGAAGGATAGGAAAGGTAGAGAAGGAAGAAGTGACAGGGGGGTTGCAAGTGAAGATCTCCGTGTTGAAAAACAAGTGGAAAAGAACTCAGGTCAGGCAATGGGTTACACTTAGTTTTTTTCATCACAGGCCAACTTGTGTATCTATTGAACTTGCTCTTTGTCTTTACTGATTAGTGTCCACATTTTTCTTAGCCTCATTTTTAGGATTAACGGGTTGATTTTGTAGTGTTTTCTGTTGCGTTGGAGATCTTGTTCCCGTCATTGACTCATTTGGTTGGACTATTTATTTTCTATTTTTCTCTATTATAATCTGGTCATATTTTGTTCTCTTAAAAACTTTTTCAAGTACCATTTCTGTCTTGTTTGATAGATTCTCTGCTTGAAGATGTTCTGTATGAAATGACAGTGAGGATAGTGGTTTTTGTATTTCTACGCTGCCACATTTTAGACTATACCCGATCATTCTTTTTTTGAGTACTAAAGGTTGCAATGGCGTTAATCTCCTTGTTTCTCTTCAGGATCGTCTTACACCCCACACCTAAGATGGGTTTTGCCTTTTCTTAATATCAATATGAAATTTTTTATTTTCTGGAAAAATTGGTGGTAATCATTTTCTTTCTTTGTTGTTATTGTTTGTTATTGTTATTATCATTATTATTATCTTTGTGTAGTTGTGTGTGAAGTTAAAAGGATAATACAATGATAATTAGATGCTTTCATTTTGCTGATTTGGTGAAGAATGAATGTTAAATTTTCCCAAGTGCGTGATTTGGTGAGATTCCGCCGTGATGACTGTCACAGAATTTCAAGGAAGAGTTACACAAGGATTACTAAACCCTCTTGCCTTGTCTGCATTTCCATGAGAATTAAGTTCTCTGTCTGGTTGAAACAAATTAATGCCACATCTTAGATTACTCTTTCTCTTGCATCAAATTTCATGTAAAAGACTAACCGACATTGAAAGATTTTCCAGTGATTCCAATTGTTTTCTCCAATCAACGATGACCACCTCATCTCTACTCATATCAAGGACCACAACCTCAGCTTAAGATCGCCCCTTCAAACTTCGAGGCATGGAAGATATGTGTATCAAAGAGGACAGGTCCTTTTACCCCTTTAGGTCCCATCTGCTGGTATGTAATAATTTACATCCTTCCTCTTCGTTCATGGAGAGGCCTTCCTTGGAACAACATCTCCTATTCAAGACCTTCTGCACCCAAAGAATGTCATCTTCCTATCTCATTTCCTGAAAAAGCCAAGGTTTTATTAGTGAAAAGTAGGTCAGCAAAACAACCTACCACCCATACTACGAGCCCTTAGTTCAAGGGAATTTGGAGCTGTCTGTATTCACTACTGTCCTATATGTTGACTTTCCCACTCACTCCTTGATCAACTACTTTAGAATCAAAGGTTTTGTGACAATCCTGCACAAAAACAATGGAGTTACACTACCCTCCCTTTCACAAATCACCAGTGACGCCTTTCTGAATACACAACTTTTTCTGCCCCTAATCGTGCAGTTTAATGACAGTATTACTTATATATCAGATTTTGGCGTACCTTATGCCAGCAAATGGCTTGATTGATTAGTAGTGATTTTTTAGTTGGTTTTAGTTAGGATAGGATATAAGTTCCTAGATTTTATTCTATTTCTCCTTTGTCTCAATTTGCTAACTTATTGTACACTGATGTTTAAGGGTTTAAATTCTATTTTTCTTTTTCACTATGTTTCCAAATTCTTTTTTGTTTGTTTTGTTTTTTCTTTTTTGTCTGTTTTTGTTTTCTTTGAATTATGTTTTGGTTCTAGAATTCTGATTTTATTTGATATTCTTGTCATTTTACAATTCTTAGTCTTCGTTCATCAGATTTTAGCTGTATATATGCATGATTTGCTAAATTGGTTAATCTTAAGAATTAGATTATACTTGCTTTTGGAGAAGTATTTTTTTGTTTTTCTTTCAATTTGTACTACTAGTATTCTAATTTTGTGCAAATTTTGGATGTGAAGTTCAATTTGGTTTGCTTTCAATCATGATAACTAATTATTCCCTTCTCTCTCTCTCTCTCTCAGACGCACATATTCAGGCAATCATGTTTAGGTGAACAAATTTTCTTTATTGAGTGTTACGTTAGCGAGAGCTGATTGTGCTTGTTAGGTCATTTTCTTATAGTATAGATGGATGTTTGGAACTTTACTAGACATATTTATTTTCTTTTGGAATCTATTTAAACAATCAAGTCTAGTTTGAAAATTGTCATGCTGAGTATTTTAATATTTGATACTTGGTGACTTGGAGTTAGTGTCATACTAGATATTAACTATGAAATTGTTCATCGTATGATAATTTTTTTTTTTAATTTAATGGAAATATTAGTCATCTTCTTGGCATTTTTGTAGCATCTTTTACACTTTTTTAATGATCTATTTTTGGTGCTGGTTTATGTACTAATGAAAAGCTTAACTTTTCTTTTTCCCTATGGTCTCATCATTAGATGAGTTGTGATCCCCTTGAACAAATAGATGGTTTAGCTTTCCCTAATACCATTGTATTGTTGGCTCATTATTAGTTTTCAACTTGTTTTTATCATTTTGTGGAATTTTGTGTAAATATGTTAGTTTGTAACATGTTGCACTTTAATGCAGAAAATGTGTTGCATAGCCCTGGATTAGAGAATCACCTGGAGATCCGAGTTAGGAAGAGAACTGGTTCTTTTGATGGGGATAAACATAAAGATGATATAGGGGATGTCGATAATAGACAGCTATCTTCAAAGAATGATACTGTGAAGGATGGAAGACGAAAGAGTGAGAAATACAAGGATGAGAGAAATAGGGAGAAGTACCGGGAAGATGTTGATAGGGATGGCAAGGAAAGAAATGAACTTGTTAAAGATCACATCAGCAGGTCAAATGACAGAGATTTGAGAGATGAGAAGGATGCAGTGGATATGCATCATAAGAGAAATAAGCCTCAAGATAGCGATCCTGATCGAGAGGTAACCAAGGCCAAACGTGAAGGTGATATAGATGCTATGAGAGATCAAGATCATGATCGCCACCATGCGTATGAACGTGATCATGAACAAGAGAGTAGGCGTAGACGTGATCGTGATCGTGGTCGGGATCGTGACCGTGACCGTGACCATGATCGGGATAGTAGACGACATCGTAGTCGAAGCCGTGCACGTGACCGTTACTCTGATTATGAATGTGATGTTGACCGTGATGGTTCACATTTTGATGATCAATACACAAAATATGTTGATAGTAGGGGAAGGAAAAGATCTCCTAATGATCATGATGATTCTGTTGATGCTAGATCTAAAAGTTTGAAGAATAGTCACCATGCAAATGATGAAAAGAAGTCTTTGAGCAATGATAAAGTGGATTCAGATGCTGAGAGAGGAAGGTCTCAATCACGATCCCGGCATGGAGATGTTAGTTTAAGTAGCCATAGACGGAAGAGTTCGCCCAGTTCTCATTCACGTGTTGTCACAGATGAATACAGGTTTCAACTCTTTTTCTTTATCGTTACAGTTGGTCTATGGTGTGTTTGAAGTCCTTGATGCATAGATAAATTTTTTGTGCAATGATGTTATCTCTGTTGAATTGCATAAATAGGTTAATAAGCAACTATATATTGGTAGCTATCATCTTTATTGCTTTTAAAGTATGGTTAAGTAGTTACATCTATGGTTAAAATACCATTTTGCACTTACGTATTGGTAGCTCTCATCTCTTATTCTTTTAAAGCATAGTTAAGCACTCACATCTTAAGTTAAAATAGAATTTTGGTCCTTGTACTTTATTTCTCATTCTTTTAAAGCATGGTTAAGCACTCACATCTTTAGTTAAAATAGAATTTTGGACCTTGTACTTTATATTTCTCATTCTATTTTTGTTCCTATACTTCCAAATGCTCAAAATTAATCCCATACGCTTAATAAATCTTGAAATTGAACCTTAGTGTTAGTTTGAAGTTAATATTTATTGAAATTGGTAGAATAATAATAGTTTCCATTCAAGAAAAGGCAATGTGAATATGTTTCCGAAATTGACATAAAGAAAGGNGGGGGGGGGGGGGGGGGGATTGACTAATACGACTAATCTTGAAATTTATTGAAAGTATGTGGATTAAGTTTGAATATTTGAAAGTACGTGGGCTAAGATAGAACGAAATGTTGTAGATGGATCAAAATGGTATTTTAACTACTTCTCAAATTTTCATTCAAGGAATTTTTTTGAAAGGAGGAGAGAGCACCAGACTATGAGCAATTTCTTCTCCACGTCTTTTCATCTGCATTATTATTAATGGTCGTAGCTGTGAATTCTATTATGATTAAGTGGTGATTCTTTTTATGAATAACTATTGTTGTGAATTCTATTATAATACCTGGCAATTTTTCTTGGCTGAGTTTTCATTTTACCTTATGAATGGCGAAGGCATTATGGCACACTAGTTGGGTCCAATAAAAGGTACTGACAAGGGAGCCTCACAGACTAGTATCAGTTAAAAAAATTTCCGCCACAAAGAAACTCTTTAATATTGTGTGGTGCAACTTCGGTTTATTTCTTTTTCTGAAAGGAGTTCTCACTGCATCTTCCTGCATCATGTTATTTTTTTCTTATATTTTTTAGGATTGATAGTGCGGAAACTTAGGGTTCCATACATTCATTTGTTTGTCTTAATTCTTGTGTAGTGATTTCCTTTTTCCTTTTCTCCTTTTTGGAAGAATTTCTTCTTGATAGGAAATGATGCAACAACTGATGACGGTTATGTGGTCTTGATTTTTATTAAACTTCTTAACTATGTATGTCGTCAAGAGGATTTATTATATTTAAAGGATTATAATTTGTTTGTAGTAAGTGGGTAACTGACTTCATGAATTGCAGGAGTTAACAGAGCAATGAGATGTTCATCTAATATTTATCTTTTTCCATTCTAGGCATCAAGATCAGGAAGATTTGAGAGACCGATACCCAAAAAAGGAAGAAAGGTCCAAATCCATTTCTACTAGAGATAAAGGTGTTCTTTCGGTAGTACAAGAAAAGGGTTCCAAATACACTTATTCGGAGAAACCCAGTGAAATAGAGGGTGGCAATGCTACTGAGATGTTACGAGACAGGACTTTAAATTCTAAGGTATCTATCAGCATTGAGCTTGTCCAAAAGATCTGTTACATTTATGAATGACTTTGGACATGCTAACAGTTCTTATGTTTCTGCAGAATGTTGACATTGAAGAAAGTGGACGAAGGCACAATAATTCTATTGATGCCAAAGACCTCTCTTCCAATAAGGATAGGCATAGCTGGGATATACAAGGAGAGAAGCCTGTGATGGATGATTCATCTCAGGTAGAGTCTTATTATAGCAAAGGTAGTCAGAGCAATCCATCACCATTCCATCCTCGCCCTGCTTTTAGGGGTGGAGTTGATATTCCTTTTGATGGTTCACTAGATGATGATGGCAGACTTAATTCTAATAGCCGTTTTCGAAGGGGTAATGATCCAAATATGGGTAGAGTACATGGCAACACTTGGAGAGGTGTTCCAAACTGGACAGCACCACTACCAAATGGCTTTATCCCTTTCCAGCATGGACCTCCTCCTCATGGAAGTTTCCAATCACTTATGCCACAGTTTCCAGCACCGCCTATGTTTGGTATCAGACCCCCACTTGATATCAATCACTCGGGAATTCATTATCGGATGCCTGATGCTGACAGATTTTCAAGTCACATGCATCCACTAGGGTGGCAGAATATGTTGGATGGTTCAAGCCCTTCTCACTTGCATGGATGGGATGCAAACAACGGTATCTTTAGGGATGAATCTCATATTTATAATGGAGCTGAATGGGATGAGAACAGGCAGATGGTGAATGGTCGAGGTTGGGACTCCAAAGCTGAAATGTGGAAGAGACAGAGTGGCTCCCTGAAAAGGGAAATACCTTCCCAATTTCAGAAAGATGAGCGTTCGGTGCAAGATCCTGTTGATGATGTATCAAGTAAAGAGATATGTGATGAGAATGCTGATACTGTTTTGACAAAAACTGCTGAAATAAGGCCTAATATCCCTTCTGCAAAAGAAAGCCCCAACACCCCCGAACTCTTATCTGAAACACCAGGTCCTCTTAGTCGGTCAATGGATGATAATTCTAAACTTAGTTGTTCTTACCTTTCTAAGCTTAAGATTTCCACAGAACTTGCACTTCCTGATTTGTACCAACAGTGTCAGAGATTGATGGACATTGAGCACTGTGCAACTGCAGATGAGGAAACTGCTGCTTATATAGTTCTTGAGGTAAAATCCTGGATTAAGTTTCATATTTCACCTATGTCTGTTTATGCCTTCCTCATAATTCTATGAATTTAATTATTATTCATGATTTGCATCGACTTTTTGACAGGGTGGTATGAGAGCTGTGTCCATCTCTTCAAATAGTGCACAAATATCTCTTTTCCGTCCAAACAAGAACTCAGTTTTTCAGGTATAATACTTGGCTGCGTAAGTGTAGGAAATATAAGTTGTCTGTGGTCTTTTAGTATTCACATTAGGTTTTAGATATTTGATGCTCGGGACTCGCAAGATGGTACAAATTTGGACTGTCCTTTTTTTTCATGTATTAAAATACTATACTACTATGAAACGTTGTGTTAGTTATAGTTTCTGATATTTAATCATTGTGTCTTGGCAATTCTGCTGGATTTCATGGCCCTTTCTGTTCTCTTTTGATAAAATATGGGGGAAAACTTGTAAAGTACCATGCCATTTGTATGTTAGTAAAACTTTATCCTTGTAGATTCTTAGCGACAAATGACGCTCAAAAGTTAATAGACCATGTAATAATTGTGGTATTGATATTTTGCAGCATGCAATGGACTTGTACAAAAAGCAGAGAACGGAAATGAAGGAGATGCAAGCTAGTTCTAGGGAAATGCCCTCCTCTGAGAGGATGCTTGAAGAAGAGCAGCAGGGGATGCAAGTTGTTTCCGGGGGAATGGCTTTCTCGGAGAGGAAACATGAAGAGAAGGGCTTTACTTTCAATAATGAAGACGTTAAGGCTCCTGTTTCAACTGTTGATGCGGAAATGACACAGGCACCCATCAAAACCACTGGTGTTGATAAGGCAATTGAGGCAGATGCTGCTTTGGGGAAATTGGAGGATTTGGCAGTTGAGGCGGATGCTGCTTTGGGGGAACTGGAGGATTTGGCTTCTCCTGCCACTCGGGAGGTTAAGTGTCTTGAGAACTCAGAGGAGTCGGTGCCGATTACCAATTCAACAGAAGTGGATATGATGGATTCAGAGCAGCAGGCGAACCTAGACGCTGAGAAGGATACCATTGTCATAGCAAATGACAACACACCAGTTAATAACATCAATGAATCCAGTAACGACGACATGAAGGGGATTGTCAATGGCAAAGACTCTCCACGATGCGATGAATTCAGTAACAACAACGACATAAAGGGGATTGTGAATGGCAAAGAATCTCCAGGATGCGACGAATTGAGTAAGAACGAAAATGGCAAAGAATCTCCAGGATGTGGAGTTGGTAATTCTTGTTTTGACAAAGCAGTGAGTGGTCCTTTATCTTTAGCAGGAGGAGATGAAATAGGGGGGGAGAGTTGTGAGGAGGTGGGGTTAATGGGTGGTGGTGGTGGTGGTGGTGGTGTGCCAATAGGGTCAGAGTCTTTAATTTTGAGTCAGCAGATACATCATTCTCCTGAAAGTACACATTGAAACAATTTTAATATCGCTTCTGCATTTCATATTGCTTAGTTTTAAGTTATTGATTTTTCTATATTCATGTTGCTTCATCTTCCTGCAAGGAATAAAATTTCCTACTGTTCTGCACCTTGGTGCATAGTGTGTCTTGAGGTTGCTTGC
mRNA sequence
CACCGTAATTTCGTTTGACGTGGACCAAATTATCCGCAAACCGCTAGTCCTCTCCGGGTCTCGTCTTCTCTCTCGGACATAACCCCAAATCCCGATTCCCAACTCCTAAGAAACCGTTTTCCTTTCTTCTCCGCCGCCGTCGCCCTATCACCGACCGGTGCATCTACATTCCGGCGATCCTTTTACCTCACTCTCTCTAGGGTTTTTCTGGTTTTCTGGTTTTCTTTTTGTATTTCGTGTTCCGCTTGAGACCTCTGCTATTTCAGTGTGAATCCCTGATTTGGGTGTTTTTGTTTTATCTTTGTGGGGAACGGCGTTGCTGTTAGGGTTTTGATTTTCCCCAGTCGAGCTGTGGGATACGGCTAGAAGATTTGAAATTTAGGGTTCTTTTTAATTTACTTGGGGGTTTGTTTTGAAATTAGGGTTTTCAGTGTGATTCAGTATGCCGAGGGGTTCGAGGCACAAATCTAGTAGACATGGTCTGAAGGATGCTAAGGAATCTTCGGACTCGGAAAATGATTCCACTCTGAGAGATCGGAAGGGCAAAGAGAGTGGGAGTAGGGTAACGAAGGACTCTGCTTCTAGTGAGAAGCGCAGATTCGAATCGAAGGATTCAAAAGAGTTCTACGGTTCAGAGAATCTGGAGATGGAAGAGCATGGACATTCGAAGCGGCGTAAGGAGAGGTATGATGAGGGAACGACTGATAGGTGGAATGGGGGAAGCAACGATGAGCTTGGTGTTCCTTCCAAAAAGTCAAAAACGTTGGTGGATTCCAAGAGCAAGAGAAGGGACGAGAGTGTGGGATTTCAGGGGGATGGCGAAGAACACAAGAAGAGTAGTGGAAAGGGCGAGGGAAGGCACCGGGAGTCGAGCCGAAAGGAGGGTAGGAGTGGTGGAGGGGAAAGAGAAAGGGAAAGGGAAAGGGAGAGGGAGAAGGATAGGAAAGGTAGAGAAGGAAGAAGTGACAGGGGGGTTGCAAGTGAAGATCTCCGTGTTGAAAAACAAGTGGAAAAGAACTCAGAAAATGTGTTGCATAGCCCTGGATTAGAGAATCACCTGGAGATCCGAGTTAGGAAGAGAACTGGTTCTTTTGATGGGGATAAACATAAAGATGATATAGGGGATGTCGATAATAGACAGCTATCTTCAAAGAATGATACTGTGAAGGATGGAAGACGAAAGAGTGAGAAATACAAGGATGAGAGAAATAGGGAGAAGTACCGGGAAGATGTTGATAGGGATGGCAAGGAAAGAAATGAACTTGTTAAAGATCACATCAGCAGGTCAAATGACAGAGATTTGAGAGATGAGAAGGATGCAGTGGATATGCATCATAAGAGAAATAAGCCTCAAGATAGCGATCCTGATCGAGAGGTAACCAAGGCCAAACGTGAAGGTGATATAGATGCTATGAGAGATCAAGATCATGATCGCCACCATGCGTATGAACGTGATCATGAACAAGAGAGTAGGCGTAGACGTGATCGTGATCGTGGTCGGGATCGTGACCGTGACCGTGACCATGATCGGGATAGTAGACGACATCGTAGTCGAAGCCGTGCACGTGACCGTTACTCTGATTATGAATGTGATGTTGACCGTGATGGTTCACATTTTGATGATCAATACACAAAATATGTTGATAGTAGGGGAAGGAAAAGATCTCCTAATGATCATGATGATTCTGTTGATGCTAGATCTAAAAGTTTGAAGAATAGTCACCATGCAAATGATGAAAAGAAGTCTTTGAGCAATGATAAAGTGGATTCAGATGCTGAGAGAGGAAGGTCTCAATCACGATCCCGGCATGGAGATGTTAGTTTAAGTAGCCATAGACGGAAGAGTTCGCCCAGTTCTCATTCACGTGTTGTCACAGATGAATACAGGCATCAAGATCAGGAAGATTTGAGAGACCGATACCCAAAAAAGGAAGAAAGGTCCAAATCCATTTCTACTAGAGATAAAGGTGTTCTTTCGGTAGTACAAGAAAAGGGTTCCAAATACACTTATTCGGAGAAACCCAGTGAAATAGAGGGTGGCAATGCTACTGAGATGTTACGAGACAGGACTTTAAATTCTAAGAATGTTGACATTGAAGAAAGTGGACGAAGGCACAATAATTCTATTGATGCCAAAGACCTCTCTTCCAATAAGGATAGGCATAGCTGGGATATACAAGGAGAGAAGCCTGTGATGGATGATTCATCTCAGGTAGAGTCTTATTATAGCAAAGGTAGTCAGAGCAATCCATCACCATTCCATCCTCGCCCTGCTTTTAGGGGTGGAGTTGATATTCCTTTTGATGGTTCACTAGATGATGATGGCAGACTTAATTCTAATAGCCGTTTTCGAAGGGGTAATGATCCAAATATGGGTAGAGTACATGGCAACACTTGGAGAGGTGTTCCAAACTGGACAGCACCACTACCAAATGGCTTTATCCCTTTCCAGCATGGACCTCCTCCTCATGGAAGTTTCCAATCACTTATGCCACAGTTTCCAGCACCGCCTATGTTTGGTATCAGACCCCCACTTGATATCAATCACTCGGGAATTCATTATCGGATGCCTGATGCTGACAGATTTTCAAGTCACATGCATCCACTAGGGTGGCAGAATATGTTGGATGGTTCAAGCCCTTCTCACTTGCATGGATGGGATGCAAACAACGGTATCTTTAGGGATGAATCTCATATTTATAATGGAGCTGAATGGGATGAGAACAGGCAGATGGTGAATGGTCGAGGTTGGGACTCCAAAGCTGAAATGTGGAAGAGACAGAGTGGCTCCCTGAAAAGGGAAATACCTTCCCAATTTCAGAAAGATGAGCGTTCGGTGCAAGATCCTGTTGATGATGTATCAAGTAAAGAGATATGTGATGAGAATGCTGATACTGTTTTGACAAAAACTGCTGAAATAAGGCCTAATATCCCTTCTGCAAAAGAAAGCCCCAACACCCCCGAACTCTTATCTGAAACACCAGGTCCTCTTAGTCGGTCAATGGATGATAATTCTAAACTTAGTTGTTCTTACCTTTCTAAGCTTAAGATTTCCACAGAACTTGCACTTCCTGATTTGTACCAACAGTGTCAGAGATTGATGGACATTGAGCACTGTGCAACTGCAGATGAGGAAACTGCTGCTTATATAGTTCTTGAGGGTGGTATGAGAGCTGTGTCCATCTCTTCAAATAGTGCACAAATATCTCTTTTCCGTCCAAACAAGAACTCAGTTTTTCAGCATGCAATGGACTTGTACAAAAAGCAGAGAACGGAAATGAAGGAGATGCAAGCTAGTTCTAGGGAAATGCCCTCCTCTGAGAGGATGCTTGAAGAAGAGCAGCAGGGGATGCAAGTTGTTTCCGGGGGAATGGCTTTCTCGGAGAGGAAACATGAAGAGAAGGGCTTTACTTTCAATAATGAAGACGTTAAGGCTCCTGTTTCAACTGTTGATGCGGAAATGACACAGGCACCCATCAAAACCACTGGTGTTGATAAGGCAATTGAGGCAGATGCTGCTTTGGGGAAATTGGAGGATTTGGCAGTTGAGGCGGATGCTGCTTTGGGGGAACTGGAGGATTTGGCTTCTCCTGCCACTCGGGAGGTTAAGTGTCTTGAGAACTCAGAGGAGTCGGTGCCGATTACCAATTCAACAGAAGTGGATATGATGGATTCAGAGCAGCAGGCGAACCTAGACGCTGAGAAGGATACCATTGTCATAGCAAATGACAACACACCAGTTAATAACATCAATGAATCCAGTAACGACGACATGAAGGGGATTGTCAATGGCAAAGACTCTCCACGATGCGATGAATTCAGTAACAACAACGACATAAAGGGGATTGTGAATGGCAAAGAATCTCCAGGATGCGACGAATTGAGTAAGAACGAAAATGGCAAAGAATCTCCAGGATGTGGAGTTGGTAATTCTTGTTTTGACAAAGCAGTGAGTGGTCCTTTATCTTTAGCAGGAGGAGATGAAATAGGGGGGGAGAGTTGTGAGGAGGTGGGGTTAATGGGTGGTGGTGGTGGTGGTGGTGGTGTGCCAATAGGGTCAGAGTCTTTAATTTTGAGTCAGCAGATACATCATTCTCCTGAAAGTACACATTGAAACAATTTTAATATCGCTTCTGCATTTCATATTGCTTAGTTTTAAGTTATTGATTTTTCTATATTCATGTTGCTTCATCTTCCTGCAAGGAATAAAATTTCCTACTGTTCTGCACCTTGGTGCATAGTGTGTCTTGAGGTTGCTTGC
Coding sequence (CDS)
ATGCCGAGGGGTTCGAGGCACAAATCTAGTAGACATGGTCTGAAGGATGCTAAGGAATCTTCGGACTCGGAAAATGATTCCACTCTGAGAGATCGGAAGGGCAAAGAGAGTGGGAGTAGGGTAACGAAGGACTCTGCTTCTAGTGAGAAGCGCAGATTCGAATCGAAGGATTCAAAAGAGTTCTACGGTTCAGAGAATCTGGAGATGGAAGAGCATGGACATTCGAAGCGGCGTAAGGAGAGGTATGATGAGGGAACGACTGATAGGTGGAATGGGGGAAGCAACGATGAGCTTGGTGTTCCTTCCAAAAAGTCAAAAACGTTGGTGGATTCCAAGAGCAAGAGAAGGGACGAGAGTGTGGGATTTCAGGGGGATGGCGAAGAACACAAGAAGAGTAGTGGAAAGGGCGAGGGAAGGCACCGGGAGTCGAGCCGAAAGGAGGGTAGGAGTGGTGGAGGGGAAAGAGAAAGGGAAAGGGAAAGGGAGAGGGAGAAGGATAGGAAAGGTAGAGAAGGAAGAAGTGACAGGGGGGTTGCAAGTGAAGATCTCCGTGTTGAAAAACAAGTGGAAAAGAACTCAGAAAATGTGTTGCATAGCCCTGGATTAGAGAATCACCTGGAGATCCGAGTTAGGAAGAGAACTGGTTCTTTTGATGGGGATAAACATAAAGATGATATAGGGGATGTCGATAATAGACAGCTATCTTCAAAGAATGATACTGTGAAGGATGGAAGACGAAAGAGTGAGAAATACAAGGATGAGAGAAATAGGGAGAAGTACCGGGAAGATGTTGATAGGGATGGCAAGGAAAGAAATGAACTTGTTAAAGATCACATCAGCAGGTCAAATGACAGAGATTTGAGAGATGAGAAGGATGCAGTGGATATGCATCATAAGAGAAATAAGCCTCAAGATAGCGATCCTGATCGAGAGGTAACCAAGGCCAAACGTGAAGGTGATATAGATGCTATGAGAGATCAAGATCATGATCGCCACCATGCGTATGAACGTGATCATGAACAAGAGAGTAGGCGTAGACGTGATCGTGATCGTGGTCGGGATCGTGACCGTGACCGTGACCATGATCGGGATAGTAGACGACATCGTAGTCGAAGCCGTGCACGTGACCGTTACTCTGATTATGAATGTGATGTTGACCGTGATGGTTCACATTTTGATGATCAATACACAAAATATGTTGATAGTAGGGGAAGGAAAAGATCTCCTAATGATCATGATGATTCTGTTGATGCTAGATCTAAAAGTTTGAAGAATAGTCACCATGCAAATGATGAAAAGAAGTCTTTGAGCAATGATAAAGTGGATTCAGATGCTGAGAGAGGAAGGTCTCAATCACGATCCCGGCATGGAGATGTTAGTTTAAGTAGCCATAGACGGAAGAGTTCGCCCAGTTCTCATTCACGTGTTGTCACAGATGAATACAGGCATCAAGATCAGGAAGATTTGAGAGACCGATACCCAAAAAAGGAAGAAAGGTCCAAATCCATTTCTACTAGAGATAAAGGTGTTCTTTCGGTAGTACAAGAAAAGGGTTCCAAATACACTTATTCGGAGAAACCCAGTGAAATAGAGGGTGGCAATGCTACTGAGATGTTACGAGACAGGACTTTAAATTCTAAGAATGTTGACATTGAAGAAAGTGGACGAAGGCACAATAATTCTATTGATGCCAAAGACCTCTCTTCCAATAAGGATAGGCATAGCTGGGATATACAAGGAGAGAAGCCTGTGATGGATGATTCATCTCAGGTAGAGTCTTATTATAGCAAAGGTAGTCAGAGCAATCCATCACCATTCCATCCTCGCCCTGCTTTTAGGGGTGGAGTTGATATTCCTTTTGATGGTTCACTAGATGATGATGGCAGACTTAATTCTAATAGCCGTTTTCGAAGGGGTAATGATCCAAATATGGGTAGAGTACATGGCAACACTTGGAGAGGTGTTCCAAACTGGACAGCACCACTACCAAATGGCTTTATCCCTTTCCAGCATGGACCTCCTCCTCATGGAAGTTTCCAATCACTTATGCCACAGTTTCCAGCACCGCCTATGTTTGGTATCAGACCCCCACTTGATATCAATCACTCGGGAATTCATTATCGGATGCCTGATGCTGACAGATTTTCAAGTCACATGCATCCACTAGGGTGGCAGAATATGTTGGATGGTTCAAGCCCTTCTCACTTGCATGGATGGGATGCAAACAACGGTATCTTTAGGGATGAATCTCATATTTATAATGGAGCTGAATGGGATGAGAACAGGCAGATGGTGAATGGTCGAGGTTGGGACTCCAAAGCTGAAATGTGGAAGAGACAGAGTGGCTCCCTGAAAAGGGAAATACCTTCCCAATTTCAGAAAGATGAGCGTTCGGTGCAAGATCCTGTTGATGATGTATCAAGTAAAGAGATATGTGATGAGAATGCTGATACTGTTTTGACAAAAACTGCTGAAATAAGGCCTAATATCCCTTCTGCAAAAGAAAGCCCCAACACCCCCGAACTCTTATCTGAAACACCAGGTCCTCTTAGTCGGTCAATGGATGATAATTCTAAACTTAGTTGTTCTTACCTTTCTAAGCTTAAGATTTCCACAGAACTTGCACTTCCTGATTTGTACCAACAGTGTCAGAGATTGATGGACATTGAGCACTGTGCAACTGCAGATGAGGAAACTGCTGCTTATATAGTTCTTGAGGGTGGTATGAGAGCTGTGTCCATCTCTTCAAATAGTGCACAAATATCTCTTTTCCGTCCAAACAAGAACTCAGTTTTTCAGCATGCAATGGACTTGTACAAAAAGCAGAGAACGGAAATGAAGGAGATGCAAGCTAGTTCTAGGGAAATGCCCTCCTCTGAGAGGATGCTTGAAGAAGAGCAGCAGGGGATGCAAGTTGTTTCCGGGGGAATGGCTTTCTCGGAGAGGAAACATGAAGAGAAGGGCTTTACTTTCAATAATGAAGACGTTAAGGCTCCTGTTTCAACTGTTGATGCGGAAATGACACAGGCACCCATCAAAACCACTGGTGTTGATAAGGCAATTGAGGCAGATGCTGCTTTGGGGAAATTGGAGGATTTGGCAGTTGAGGCGGATGCTGCTTTGGGGGAACTGGAGGATTTGGCTTCTCCTGCCACTCGGGAGGTTAAGTGTCTTGAGAACTCAGAGGAGTCGGTGCCGATTACCAATTCAACAGAAGTGGATATGATGGATTCAGAGCAGCAGGCGAACCTAGACGCTGAGAAGGATACCATTGTCATAGCAAATGACAACACACCAGTTAATAACATCAATGAATCCAGTAACGACGACATGAAGGGGATTGTCAATGGCAAAGACTCTCCACGATGCGATGAATTCAGTAACAACAACGACATAAAGGGGATTGTGAATGGCAAAGAATCTCCAGGATGCGACGAATTGAGTAAGAACGAAAATGGCAAAGAATCTCCAGGATGTGGAGTTGGTAATTCTTGTTTTGACAAAGCAGTGAGTGGTCCTTTATCTTTAGCAGGAGGAGATGAAATAGGGGGGGAGAGTTGTGAGGAGGTGGGGTTAATGGGTGGTGGTGGTGGTGGTGGTGGTGTGCCAATAGGGTCAGAGTCTTTAATTTTGAGTCAGCAGATACATCATTCTCCTGAAAGTACACATTGA
Protein sequence
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKEFYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESVGFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQSGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTPELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEETAAYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSERMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQQANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGGGVPIGSESLILSQQIHHSPESTH
Homology
BLAST of Cp4.1LG13g03060 vs. NCBI nr
Match:
XP_023551223.1 (uncharacterized protein LOC111809105 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2316 bits (6001), Expect = 0.0
Identity = 1223/1223 (100.00%), Postives = 1223/1223 (100.00%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVAS 180
GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVAS
Sbjct: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVAS 180
Query: 181 EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT 240
EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT
Sbjct: 181 EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT 240
Query: 241 VKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKR 300
VKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKR
Sbjct: 241 VKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKR 300
Query: 301 NKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRD 360
NKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRD
Sbjct: 301 NKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRD 360
Query: 361 HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS 420
HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS
Sbjct: 361 HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS 420
Query: 421 KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDE 480
KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDE
Sbjct: 421 KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDE 480
Query: 481 YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLR 540
YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLR
Sbjct: 481 YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLR 540
Query: 541 DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ 600
DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ
Sbjct: 541 DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ 600
Query: 601 SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTA 660
SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTA
Sbjct: 601 SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTA 660
Query: 661 PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH 720
PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH
Sbjct: 661 PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH 720
Query: 721 PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ 780
PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ
Sbjct: 721 PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ 780
Query: 781 SGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTP 840
SGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTP
Sbjct: 781 SGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTP 840
Query: 841 ELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEETA 900
ELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEETA
Sbjct: 841 ELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEETA 900
Query: 901 AYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSE 960
AYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSE
Sbjct: 901 AYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSE 960
Query: 961 RMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKA 1020
RMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKA
Sbjct: 961 RMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKA 1020
Query: 1021 IEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQ 1080
IEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQ
Sbjct: 1021 IEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQ 1080
Query: 1081 QANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKE 1140
QANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKE
Sbjct: 1081 QANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKE 1140
Query: 1141 SPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGG 1200
SPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGG
Sbjct: 1141 SPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGG 1200
Query: 1201 GVPIGSESLILSQQIHHSPESTH 1223
GVPIGSESLILSQQIHHSPESTH
Sbjct: 1201 GVPIGSESLILSQQIHHSPESTH 1223
BLAST of Cp4.1LG13g03060 vs. NCBI nr
Match:
XP_023551224.1 (uncharacterized protein LOC111809105 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2255 bits (5843), Expect = 0.0
Identity = 1198/1223 (97.96%), Postives = 1198/1223 (97.96%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVAS 180
GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVAS
Sbjct: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVAS 180
Query: 181 EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT 240
EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT
Sbjct: 181 EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT 240
Query: 241 VKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKR 300
VKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKR
Sbjct: 241 VKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKR 300
Query: 301 NKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRD 360
NKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRD
Sbjct: 301 NKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRD 360
Query: 361 HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS 420
HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS
Sbjct: 361 HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS 420
Query: 421 KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDE 480
KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDE
Sbjct: 421 KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDE 480
Query: 481 YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLR 540
YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLR
Sbjct: 481 YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLR 540
Query: 541 DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ 600
DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ
Sbjct: 541 DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ 600
Query: 601 SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTA 660
SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTA
Sbjct: 601 SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTA 660
Query: 661 PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH 720
PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH
Sbjct: 661 PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH 720
Query: 721 PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ 780
PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ
Sbjct: 721 PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ 780
Query: 781 SGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTP 840
SGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTP
Sbjct: 781 SGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTP 840
Query: 841 ELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEETA 900
ELLSETP ELALPDLYQQCQRLMDIEHCATADEETA
Sbjct: 841 ELLSETP-------------------------ELALPDLYQQCQRLMDIEHCATADEETA 900
Query: 901 AYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSE 960
AYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSE
Sbjct: 901 AYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSE 960
Query: 961 RMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKA 1020
RMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKA
Sbjct: 961 RMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKA 1020
Query: 1021 IEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQ 1080
IEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQ
Sbjct: 1021 IEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQ 1080
Query: 1081 QANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKE 1140
QANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKE
Sbjct: 1081 QANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKE 1140
Query: 1141 SPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGG 1200
SPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGG
Sbjct: 1141 SPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGG 1198
Query: 1201 GVPIGSESLILSQQIHHSPESTH 1223
GVPIGSESLILSQQIHHSPESTH
Sbjct: 1201 GVPIGSESLILSQQIHHSPESTH 1198
BLAST of Cp4.1LG13g03060 vs. NCBI nr
Match:
XP_022922430.1 (uncharacterized protein LOC111430427 isoform X1 [Cucurbita moschata])
HSP 1 Score: 2253 bits (5839), Expect = 0.0
Identity = 1197/1225 (97.71%), Postives = 1206/1225 (98.45%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRV KDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGS+DELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGERERERERERE--KDRKGREGRSDRGV 180
GFQGDGEEHKKSSGKGEGRHRESSRKEGR+GGGERERERERERE KDRKGREGRSDRGV
Sbjct: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKDRKGREGRSDRGV 180
Query: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN
Sbjct: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
Query: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHH 300
DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDA+DMHH
Sbjct: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAMDMHH 300
Query: 301 KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRD 360
KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDR R RDRD
Sbjct: 301 KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRDRGRDRD 360
Query: 361 RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA 420
RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA
Sbjct: 361 RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA 420
Query: 421 RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT 480
RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT
Sbjct: 421 RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT 480
Query: 481 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEM 540
DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE+
Sbjct: 481 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEL 540
Query: 541 LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG 600
LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG
Sbjct: 541 LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG 600
Query: 601 SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW 660
SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW
Sbjct: 601 SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW 660
Query: 661 TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH 720
TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH
Sbjct: 661 TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH 720
Query: 721 MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK 780
MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK
Sbjct: 721 MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK 780
Query: 781 RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPN 840
RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEI DENADTVLTKT+EIRPNIPSAKESPN
Sbjct: 781 RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEIFDENADTVLTKTSEIRPNIPSAKESPN 840
Query: 841 TPELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEE 900
TPELLSETP PLSRSMDDNSKLSCSYLSKL ISTELALPDLYQQCQRLMDIEHCATADEE
Sbjct: 841 TPELLSETPAPLSRSMDDNSKLSCSYLSKLNISTELALPDLYQQCQRLMDIEHCATADEE 900
Query: 901 TAAYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPS 960
TAAYIVLEGGMRAVS+SSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQA SREMPS
Sbjct: 901 TAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQAISREMPS 960
Query: 961 SERMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVD 1020
SERMLEEEQQGMQVVS GMAFSERKHEE G F NE+VKAPVSTVDAEMTQAPIKTTGVD
Sbjct: 961 SERMLEEEQQGMQVVSRGMAFSERKHEEMGLNFKNEEVKAPVSTVDAEMTQAPIKTTGVD 1020
Query: 1021 KAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS 1080
AIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS
Sbjct: 1021 NAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS 1080
Query: 1081 EQQANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNG 1140
EQ ANLDAEKDTIVIA+DNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNND+KGI NG
Sbjct: 1081 EQPANLDAEKDTIVIASDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDMKGIENG 1140
Query: 1141 KESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGG 1200
KESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEE GLMGGGGG
Sbjct: 1141 KESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEGGLMGGGGG 1200
Query: 1201 GGGVPIGSESLILSQQIHHSPESTH 1223
GG VPIGSESLILSQQIHHSPESTH
Sbjct: 1201 GG-VPIGSESLILSQQIHHSPESTH 1224
BLAST of Cp4.1LG13g03060 vs. NCBI nr
Match:
KAG6579324.1 (hypothetical protein SDJN03_23772, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2235 bits (5791), Expect = 0.0
Identity = 1186/1223 (96.97%), Postives = 1199/1223 (98.04%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRV KDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGS+DELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVAS 180
GFQGDGEEHKKSSGKGEGRHRESSRKEGR+GGGEREREREREREKDRKGREGRSDRGVAS
Sbjct: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREKDRKGREGRSDRGVAS 180
Query: 181 EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT 240
EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT
Sbjct: 181 EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT 240
Query: 241 VKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKR 300
VKDGRRKSEKYK+ERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDA+DMHHKR
Sbjct: 241 VKDGRRKSEKYKEERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAMDMHHKR 300
Query: 301 NKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRD 360
NK QDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDR GRDRDRDRD
Sbjct: 301 NKLQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDR--GRDRDRDRD 360
Query: 361 HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS 420
HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS
Sbjct: 361 HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS 420
Query: 421 KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDE 480
KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGD+SLSSHRRKSSPSSHSRVVTDE
Sbjct: 421 KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDISLSSHRRKSSPSSHSRVVTDE 480
Query: 481 YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLR 540
YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE+LR
Sbjct: 481 YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATELLR 540
Query: 541 DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ 600
DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ
Sbjct: 541 DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ 600
Query: 601 SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTA 660
SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPN+GRVHGNTWRGVPNWTA
Sbjct: 601 SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNIGRVHGNTWRGVPNWTA 660
Query: 661 PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH 720
PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH+H
Sbjct: 661 PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHIH 720
Query: 721 PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ 780
PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ
Sbjct: 721 PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ 780
Query: 781 SGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTP 840
SGSLKREIPSQFQKDERSVQDPVDDVSSKEI DEN DTVLTKT+EIRPNIP AKESPNTP
Sbjct: 781 SGSLKREIPSQFQKDERSVQDPVDDVSSKEIFDENGDTVLTKTSEIRPNIPPAKESPNTP 840
Query: 841 ELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEETA 900
ELLSETP PLSRSMDDNSKLSCSYLSKL ISTELALPDLYQQCQRLMDIEHCATADEETA
Sbjct: 841 ELLSETPAPLSRSMDDNSKLSCSYLSKLNISTELALPDLYQQCQRLMDIEHCATADEETA 900
Query: 901 AYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSE 960
AYIVLEGGMRAVS+SSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQA SREMPSSE
Sbjct: 901 AYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQAISREMPSSE 960
Query: 961 RMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKA 1020
RMLEEEQQGMQVVSGGMAFSERKHEE GF FNNE+VKAPVSTVDAEMTQAPIKTTGVD A
Sbjct: 961 RMLEEEQQGMQVVSGGMAFSERKHEEMGFNFNNEEVKAPVSTVDAEMTQAPIKTTGVDNA 1020
Query: 1021 IEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQ 1080
EADAALGKLEDLAVEADAALGE EDLASPATREVK LENSEESVPITNSTEVDMMDSEQ
Sbjct: 1021 TEADAALGKLEDLAVEADAALGEPEDLASPATREVKSLENSEESVPITNSTEVDMMDSEQ 1080
Query: 1081 QANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKE 1140
ANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNND+KGI NGKE
Sbjct: 1081 PANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDMKGIENGKE 1140
Query: 1141 SPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGG 1200
SPGCDEL+KNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEE GLMGG G
Sbjct: 1141 SPGCDELNKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEGGLMGG----G 1200
Query: 1201 GVPIGSESLILSQQIHHSPESTH 1223
GV IGSESLILSQQIHHSPESTH
Sbjct: 1201 GVTIGSESLILSQQIHHSPESTH 1217
BLAST of Cp4.1LG13g03060 vs. NCBI nr
Match:
KAG7016825.1 (hypothetical protein SDJN02_21936, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2231 bits (5780), Expect = 0.0
Identity = 1184/1223 (96.81%), Postives = 1198/1223 (97.96%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRV KDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGS+DELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGRSDRGVAS 180
GFQGDGEEHKKSSGKGEGRHRESSRKEGR+GGGEREREREREREKDRKGREGRSDRGVAS
Sbjct: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREKDRKGREGRSDRGVAS 180
Query: 181 EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT 240
EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT
Sbjct: 181 EDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKNDT 240
Query: 241 VKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHHKR 300
VKDGRRKSEKYK+ERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDA+DMHHKR
Sbjct: 241 VKDGRRKSEKYKEERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAMDMHHKR 300
Query: 301 NKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRDRD 360
NK QDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDR GRDRDRDRD
Sbjct: 301 NKLQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDR--GRDRDRDRD 360
Query: 361 HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS 420
HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS
Sbjct: 361 HDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDARS 420
Query: 421 KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDE 480
KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGD+SLSSHRRKSSPSSHSRVVTDE
Sbjct: 421 KSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDISLSSHRRKSSPSSHSRVVTDE 480
Query: 481 YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLR 540
YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE+LR
Sbjct: 481 YRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATELLR 540
Query: 541 DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ 600
DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ
Sbjct: 541 DRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQ 600
Query: 601 SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNWTA 660
SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPN+GRVHGNTWRGVPNWTA
Sbjct: 601 SNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNIGRVHGNTWRGVPNWTA 660
Query: 661 PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH 720
PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH
Sbjct: 661 PLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMH 720
Query: 721 PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ 780
PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ
Sbjct: 721 PLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQ 780
Query: 781 SGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPNTP 840
SGSLKREIPSQFQKDERSVQDPVDDVSSKEI DEN DTVLTKT+EIRPNIP AKESPNTP
Sbjct: 781 SGSLKREIPSQFQKDERSVQDPVDDVSSKEIFDENGDTVLTKTSEIRPNIPPAKESPNTP 840
Query: 841 ELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEETA 900
ELLSETP PLSRSMDDNSKLSCSYLSKL ISTELALPDLYQQCQRLMDIEHCATADEETA
Sbjct: 841 ELLSETPAPLSRSMDDNSKLSCSYLSKLNISTELALPDLYQQCQRLMDIEHCATADEETA 900
Query: 901 AYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPSSE 960
AYIVLEGGMRAVS+SSNSAQISLFRP+KNSVFQHAMDLYKKQRTEMKEMQA SREMPSSE
Sbjct: 901 AYIVLEGGMRAVSVSSNSAQISLFRPDKNSVFQHAMDLYKKQRTEMKEMQAISREMPSSE 960
Query: 961 RMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVDKA 1020
RMLEEEQQGMQVVSGGMAFSERKHEE GF FNNE+VKAPVSTVDAEMTQAPIKTTGVD A
Sbjct: 961 RMLEEEQQGMQVVSGGMAFSERKHEEMGFNFNNEEVKAPVSTVDAEMTQAPIKTTGVDNA 1020
Query: 1021 IEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDSEQ 1080
EADAALGKLEDLAVEADAALGE EDLASPA +EVK LENSEESVPITNSTEVDMMDSEQ
Sbjct: 1021 TEADAALGKLEDLAVEADAALGEPEDLASPAIQEVKSLENSEESVPITNSTEVDMMDSEQ 1080
Query: 1081 QANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNGKE 1140
ANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNND+KGI NGKE
Sbjct: 1081 PANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDMKGIENGKE 1140
Query: 1141 SPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGGGG 1200
SPGCDEL+KNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEE GLMGG G
Sbjct: 1141 SPGCDELNKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEGGLMGG----G 1200
Query: 1201 GVPIGSESLILSQQIHHSPESTH 1223
GV IGSESLILSQQIHHSPESTH
Sbjct: 1201 GVTIGSESLILSQQIHHSPESTH 1217
BLAST of Cp4.1LG13g03060 vs. ExPASy TrEMBL
Match:
A0A6J1E8Q7 (uncharacterized protein LOC111430427 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430427 PE=4 SV=1)
HSP 1 Score: 2253 bits (5839), Expect = 0.0
Identity = 1197/1225 (97.71%), Postives = 1206/1225 (98.45%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRV KDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGS+DELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGERERERERERE--KDRKGREGRSDRGV 180
GFQGDGEEHKKSSGKGEGRHRESSRKEGR+GGGERERERERERE KDRKGREGRSDRGV
Sbjct: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKDRKGREGRSDRGV 180
Query: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN
Sbjct: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
Query: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHH 300
DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDA+DMHH
Sbjct: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAMDMHH 300
Query: 301 KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRD 360
KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDR R RDRD
Sbjct: 301 KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRDRGRDRD 360
Query: 361 RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA 420
RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA
Sbjct: 361 RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA 420
Query: 421 RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT 480
RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT
Sbjct: 421 RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT 480
Query: 481 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEM 540
DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE+
Sbjct: 481 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEL 540
Query: 541 LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG 600
LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG
Sbjct: 541 LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG 600
Query: 601 SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW 660
SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW
Sbjct: 601 SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW 660
Query: 661 TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH 720
TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH
Sbjct: 661 TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH 720
Query: 721 MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK 780
MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK
Sbjct: 721 MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK 780
Query: 781 RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPN 840
RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEI DENADTVLTKT+EIRPNIPSAKESPN
Sbjct: 781 RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEIFDENADTVLTKTSEIRPNIPSAKESPN 840
Query: 841 TPELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEE 900
TPELLSETP PLSRSMDDNSKLSCSYLSKL ISTELALPDLYQQCQRLMDIEHCATADEE
Sbjct: 841 TPELLSETPAPLSRSMDDNSKLSCSYLSKLNISTELALPDLYQQCQRLMDIEHCATADEE 900
Query: 901 TAAYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPS 960
TAAYIVLEGGMRAVS+SSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQA SREMPS
Sbjct: 901 TAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQAISREMPS 960
Query: 961 SERMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVD 1020
SERMLEEEQQGMQVVS GMAFSERKHEE G F NE+VKAPVSTVDAEMTQAPIKTTGVD
Sbjct: 961 SERMLEEEQQGMQVVSRGMAFSERKHEEMGLNFKNEEVKAPVSTVDAEMTQAPIKTTGVD 1020
Query: 1021 KAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS 1080
AIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS
Sbjct: 1021 NAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS 1080
Query: 1081 EQQANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNG 1140
EQ ANLDAEKDTIVIA+DNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNND+KGI NG
Sbjct: 1081 EQPANLDAEKDTIVIASDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDMKGIENG 1140
Query: 1141 KESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGG 1200
KESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEE GLMGGGGG
Sbjct: 1141 KESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEGGLMGGGGG 1200
Query: 1201 GGGVPIGSESLILSQQIHHSPESTH 1223
GG VPIGSESLILSQQIHHSPESTH
Sbjct: 1201 GG-VPIGSESLILSQQIHHSPESTH 1224
BLAST of Cp4.1LG13g03060 vs. ExPASy TrEMBL
Match:
A0A6J1I6E2 (uncharacterized protein LOC111471538 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111471538 PE=4 SV=1)
HSP 1 Score: 2181 bits (5652), Expect = 0.0
Identity = 1178/1227 (96.01%), Postives = 1185/1227 (96.58%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSR GLKDAKESSDSENDSTLRDRKGKESGSRV KDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRQGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGS+DELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGERERERERERE--KDRKGREGRSDRGV 180
GF GDGEEHKKSSGKGEGRHRESSRKEGR+GGGERERERERERE KDRKGREGRSDRGV
Sbjct: 121 GFHGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKDRKGREGRSDRGV 180
Query: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN
Sbjct: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
Query: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNE-LVKDHISRSNDRDLRDEKDAVDMH 300
DTVKDGRRKSEKYKDERNREKYREDVDRDGKER+E LVKDHISRSNDRDLRDEKDA+DMH
Sbjct: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERHEQLVKDHISRSNDRDLRDEKDAMDMH 300
Query: 301 HKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDR 360
HKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDR GRDRDR
Sbjct: 301 HKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDR--GRDRDR 360
Query: 361 DRDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVD 420
DRD RDSRRHRSRSRARDRYSDYECDVDRDG HFDDQYTKYVDSRGRKRSPNDHDDSVD
Sbjct: 361 DRD--RDSRRHRSRSRARDRYSDYECDVDRDGYHFDDQYTKYVDSRGRKRSPNDHDDSVD 420
Query: 421 ARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVV 480
ARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVV
Sbjct: 421 ARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVV 480
Query: 481 TDEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE 540
TDEYRHQDQEDLRDRYPKKE+RSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE
Sbjct: 481 TDEYRHQDQEDLRDRYPKKEDRSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE 540
Query: 541 MLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSK 600
MLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSK
Sbjct: 541 MLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSK 600
Query: 601 GSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPN 660
GSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNS FRRGNDPNMGRVHGNTWRGVPN
Sbjct: 601 GSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSHFRRGNDPNMGRVHGNTWRGVPN 660
Query: 661 WTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSS 720
WTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSS
Sbjct: 661 WTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSS 720
Query: 721 HMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMW 780
HMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMW
Sbjct: 721 HMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMW 780
Query: 781 KRQSGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESP 840
KRQSGSLKREIPSQFQKDER VQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESP
Sbjct: 781 KRQSGSLKREIPSQFQKDERLVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESP 840
Query: 841 NTPELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADE 900
NTPELLSETP PLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADE
Sbjct: 841 NTPELLSETPAPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADE 900
Query: 901 ETAAYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMP 960
ETAAYIVLEGGMRAVS+SSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQA SREMP
Sbjct: 901 ETAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQAISREMP 960
Query: 961 SSERMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGV 1020
SERML EEQ GMQVVSGGMAFSERKHEEKGF FNNE+VKAPVSTVDAEMTQAPIKTTGV
Sbjct: 961 FSERMLVEEQ-GMQVVSGGMAFSERKHEEKGFNFNNEEVKAPVSTVDAEMTQAPIKTTGV 1020
Query: 1021 DKAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMD 1080
DKAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVP TNSTEV MMD
Sbjct: 1021 DKAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPTTNSTEVVMMD 1080
Query: 1081 SEQQANLDAEKDTIVIANDNTPVNNINESSNDD-MKGIVNGKDSPRCDEFSNNNDIKGIV 1140
SEQQANLDAEKDTIVIANDNTPVNNINESSNDD MKGIVNGKDSPRCDE SNNNDIKGIV
Sbjct: 1081 SEQQANLDAEKDTIVIANDNTPVNNINESSNDDDMKGIVNGKDSPRCDELSNNNDIKGIV 1140
Query: 1141 NGKESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGG 1200
NGKESPGC GVGNSCFDKAVSGPLS AGGDEIGGESCEE GLMGGG
Sbjct: 1141 NGKESPGC---------------GVGNSCFDKAVSGPLSFAGGDEIGGESCEEGGLMGGG 1200
Query: 1201 GGGGGVPIGSESLILSQQIHHSPESTH 1223
GGGGGVPIGSESLILSQ I HSPESTH
Sbjct: 1201 GGGGGVPIGSESLILSQ-IRHSPESTH 1206
BLAST of Cp4.1LG13g03060 vs. ExPASy TrEMBL
Match:
A0A6J1I7J4 (uncharacterized protein LOC111471538 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111471538 PE=4 SV=1)
HSP 1 Score: 2172 bits (5627), Expect = 0.0
Identity = 1178/1241 (94.92%), Postives = 1185/1241 (95.49%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSR GLKDAKESSDSENDSTLRDRKGKESGSRV KDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRQGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGS+DELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGERERERERERE--KDRKGREGRSDRGV 180
GF GDGEEHKKSSGKGEGRHRESSRKEGR+GGGERERERERERE KDRKGREGRSDRGV
Sbjct: 121 GFHGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKDRKGREGRSDRGV 180
Query: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN
Sbjct: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
Query: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNE-LVKDHISRSNDRDLRDEKDAVDMH 300
DTVKDGRRKSEKYKDERNREKYREDVDRDGKER+E LVKDHISRSNDRDLRDEKDA+DMH
Sbjct: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERHEQLVKDHISRSNDRDLRDEKDAMDMH 300
Query: 301 HKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDR 360
HKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDR GRDRDR
Sbjct: 301 HKRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDR--GRDRDR 360
Query: 361 DRDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVD 420
DRD RDSRRHRSRSRARDRYSDYECDVDRDG HFDDQYTKYVDSRGRKRSPNDHDDSVD
Sbjct: 361 DRD--RDSRRHRSRSRARDRYSDYECDVDRDGYHFDDQYTKYVDSRGRKRSPNDHDDSVD 420
Query: 421 ARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVV 480
ARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVV
Sbjct: 421 ARSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVV 480
Query: 481 TDEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE 540
TDEYRHQDQEDLRDRYPKKE+RSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE
Sbjct: 481 TDEYRHQDQEDLRDRYPKKEDRSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE 540
Query: 541 MLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSK 600
MLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSK
Sbjct: 541 MLRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSK 600
Query: 601 GSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPN 660
GSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNS FRRGNDPNMGRVHGNTWRGVPN
Sbjct: 601 GSQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSHFRRGNDPNMGRVHGNTWRGVPN 660
Query: 661 WTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSS 720
WTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSS
Sbjct: 661 WTAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSS 720
Query: 721 HMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMW 780
HMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMW
Sbjct: 721 HMHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMW 780
Query: 781 KRQSGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESP 840
KRQSGSLKREIPSQFQKDER VQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESP
Sbjct: 781 KRQSGSLKREIPSQFQKDERLVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESP 840
Query: 841 NTPELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADE 900
NTPELLSETP PLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADE
Sbjct: 841 NTPELLSETPAPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADE 900
Query: 901 ETAAYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMP 960
ETAAYIVLEGGMRAVS+SSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQA SREMP
Sbjct: 901 ETAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQAISREMP 960
Query: 961 SSERMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGV 1020
SERML EEQ GMQVVSGGMAFSERKHEEKGF FNNE+VKAPVSTVDAEMTQAPIKTTGV
Sbjct: 961 FSERMLVEEQ-GMQVVSGGMAFSERKHEEKGFNFNNEEVKAPVSTVDAEMTQAPIKTTGV 1020
Query: 1021 DKAIEADAALGKLEDLAVEADAALGELEDLA--------------SPATREVKCLENSEE 1080
DKAIEADAALGKLEDLAVEADAALGELEDLA SPATREVKCLENSEE
Sbjct: 1021 DKAIEADAALGKLEDLAVEADAALGELEDLAVEADSALGELEDLASPATREVKCLENSEE 1080
Query: 1081 SVPITNSTEVDMMDSEQQANLDAEKDTIVIANDNTPVNNINESSNDD-MKGIVNGKDSPR 1140
SVP TNSTEV MMDSEQQANLDAEKDTIVIANDNTPVNNINESSNDD MKGIVNGKDSPR
Sbjct: 1081 SVPTTNSTEVVMMDSEQQANLDAEKDTIVIANDNTPVNNINESSNDDDMKGIVNGKDSPR 1140
Query: 1141 CDEFSNNNDIKGIVNGKESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEI 1200
CDE SNNNDIKGIVNGKESPGC GVGNSCFDKAVSGPLS AGGDEI
Sbjct: 1141 CDELSNNNDIKGIVNGKESPGC---------------GVGNSCFDKAVSGPLSFAGGDEI 1200
Query: 1201 GGESCEEVGLMGGGGGGGGVPIGSESLILSQQIHHSPESTH 1223
GGESCEE GLMGGGGGGGGVPIGSESLILSQ I HSPESTH
Sbjct: 1201 GGESCEEGGLMGGGGGGGGVPIGSESLILSQ-IRHSPESTH 1220
BLAST of Cp4.1LG13g03060 vs. ExPASy TrEMBL
Match:
A0A6J1E442 (uncharacterized protein LOC111430427 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111430427 PE=4 SV=1)
HSP 1 Score: 2165 bits (5610), Expect = 0.0
Identity = 1163/1225 (94.94%), Postives = 1171/1225 (95.59%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRV KDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGS+DELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGERERERERERE--KDRKGREGRSDRGV 180
GFQGDGEEHKKSSGKGEGRHRESSRKEGR+GGGERERERERERE KDRKGREGRSDRGV
Sbjct: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKDRKGREGRSDRGV 180
Query: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN
Sbjct: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
Query: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHH 300
DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDA+DMHH
Sbjct: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAMDMHH 300
Query: 301 KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRD 360
KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDR R RDRD
Sbjct: 301 KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRDRGRDRD 360
Query: 361 RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA 420
RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA
Sbjct: 361 RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA 420
Query: 421 RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT 480
RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT
Sbjct: 421 RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT 480
Query: 481 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEM 540
DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE+
Sbjct: 481 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEL 540
Query: 541 LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG 600
LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG
Sbjct: 541 LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG 600
Query: 601 SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW 660
SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW
Sbjct: 601 SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW 660
Query: 661 TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH 720
TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH
Sbjct: 661 TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH 720
Query: 721 MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK 780
MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK
Sbjct: 721 MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK 780
Query: 781 RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPN 840
RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEI DENADTVLTKT+EIRPNIPSAKESPN
Sbjct: 781 RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEIFDENADTVLTKTSEIRPNIPSAKESPN 840
Query: 841 TPELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEE 900
TPELLSETP PLSRSMDDNSKLSCSYLSKL ISTELALPDLYQQCQRLMDIEHCATADEE
Sbjct: 841 TPELLSETPAPLSRSMDDNSKLSCSYLSKLNISTELALPDLYQQCQRLMDIEHCATADEE 900
Query: 901 TAAYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPS 960
TAAYIVLEGGMRAVS+SSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQA SREMPS
Sbjct: 901 TAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQAISREMPS 960
Query: 961 SERMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVD 1020
SERMLEEEQQGMQVVS GMAFSERKHEE G F NE+VKAPVSTVDAEMTQAPIKTTGVD
Sbjct: 961 SERMLEEEQQGMQVVSRGMAFSERKHEEMGLNFKNEEVKAPVSTVDAEMTQAPIKTTGVD 1020
Query: 1021 KAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS 1080
AIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS
Sbjct: 1021 NAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS 1080
Query: 1081 EQQANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVNG 1140
EQ ANLDAEKDTIVIA+DNTPVNNINESSNDDMKGIVNGK
Sbjct: 1081 EQPANLDAEKDTIVIASDNTPVNNINESSNDDMKGIVNGK-------------------- 1140
Query: 1141 KESPGCDELSKNENGKESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEVGLMGGGGG 1200
ESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEE GLMGGGGG
Sbjct: 1141 ----------------ESPGCGVGNSCFDKAVSGPLSLAGGDEIGGESCEEGGLMGGGGG 1188
Query: 1201 GGGVPIGSESLILSQQIHHSPESTH 1223
GG VPIGSESLILSQQIHHSPESTH
Sbjct: 1201 GG-VPIGSESLILSQQIHHSPESTH 1188
BLAST of Cp4.1LG13g03060 vs. ExPASy TrEMBL
Match:
A0A6J1E3D0 (zinc finger CCCH domain-containing protein 13-like isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111430427 PE=4 SV=1)
HSP 1 Score: 2091 bits (5418), Expect = 0.0
Identity = 1113/1139 (97.72%), Postives = 1122/1139 (98.51%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVTKDSASSEKRRFESKDSKE 60
MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRV KDSASSEKRRFESKDSKE
Sbjct: 1 MPRGSRHKSSRHGLKDAKESSDSENDSTLRDRKGKESGSRVMKDSASSEKRRFESKDSKE 60
Query: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKSKRRDESV 120
FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGS+DELGVPSKKSKTLVDSKSKRRDESV
Sbjct: 61 FYGSENLEMEEHGHSKRRKERYDEGTTDRWNGGSDDELGVPSKKSKTLVDSKSKRRDESV 120
Query: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGERERERERERE--KDRKGREGRSDRGV 180
GFQGDGEEHKKSSGKGEGRHRESSRKEGR+GGGERERERERERE KDRKGREGRSDRGV
Sbjct: 121 GFQGDGEEHKKSSGKGEGRHRESSRKEGRNGGGEREREREREREREKDRKGREGRSDRGV 180
Query: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN
Sbjct: 181 ASEDLRVEKQVEKNSENVLHSPGLENHLEIRVRKRTGSFDGDKHKDDIGDVDNRQLSSKN 240
Query: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAVDMHH 300
DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDA+DMHH
Sbjct: 241 DTVKDGRRKSEKYKDERNREKYREDVDRDGKERNELVKDHISRSNDRDLRDEKDAMDMHH 300
Query: 301 KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRGRDRDRD 360
KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDR R RDRD
Sbjct: 301 KRNKPQDSDPDREVTKAKREGDIDAMRDQDHDRHHAYERDHEQESRRRRDRDRDRGRDRD 360
Query: 361 RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA 420
RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA
Sbjct: 361 RDHDRDSRRHRSRSRARDRYSDYECDVDRDGSHFDDQYTKYVDSRGRKRSPNDHDDSVDA 420
Query: 421 RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT 480
RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT
Sbjct: 421 RSKSLKNSHHANDEKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVT 480
Query: 481 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEM 540
DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATE+
Sbjct: 481 DEYRHQDQEDLRDRYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEL 540
Query: 541 LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG 600
LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG
Sbjct: 541 LRDRTLNSKNVDIEESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKG 600
Query: 601 SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW 660
SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW
Sbjct: 601 SQSNPSPFHPRPAFRGGVDIPFDGSLDDDGRLNSNSRFRRGNDPNMGRVHGNTWRGVPNW 660
Query: 661 TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH 720
TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH
Sbjct: 661 TAPLPNGFIPFQHGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSH 720
Query: 721 MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK 780
MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK
Sbjct: 721 MHPLGWQNMLDGSSPSHLHGWDANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWK 780
Query: 781 RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPSAKESPN 840
RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEI DENADTVLTKT+EIRPNIPSAKESPN
Sbjct: 781 RQSGSLKREIPSQFQKDERSVQDPVDDVSSKEIFDENADTVLTKTSEIRPNIPSAKESPN 840
Query: 841 TPELLSETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEE 900
TPELLSETP PLSRSMDDNSKLSCSYLSKL ISTELALPDLYQQCQRLMDIEHCATADEE
Sbjct: 841 TPELLSETPAPLSRSMDDNSKLSCSYLSKLNISTELALPDLYQQCQRLMDIEHCATADEE 900
Query: 901 TAAYIVLEGGMRAVSISSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQASSREMPS 960
TAAYIVLEGGMRAVS+SSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQA SREMPS
Sbjct: 901 TAAYIVLEGGMRAVSVSSNSAQISLFRPNKNSVFQHAMDLYKKQRTEMKEMQAISREMPS 960
Query: 961 SERMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGVD 1020
SERMLEEEQQGMQVVS GMAFSERKHEE G F NE+VKAPVSTVDAEMTQAPIKTTGVD
Sbjct: 961 SERMLEEEQQGMQVVSRGMAFSERKHEEMGLNFKNEEVKAPVSTVDAEMTQAPIKTTGVD 1020
Query: 1021 KAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS 1080
AIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS
Sbjct: 1021 NAIEADAALGKLEDLAVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMMDS 1080
Query: 1081 EQQANLDAEKDTIVIANDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDIKGIVN 1137
EQ ANLDAEKDTIVIA+DNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNND+KGI N
Sbjct: 1081 EQPANLDAEKDTIVIASDNTPVNNINESSNDDMKGIVNGKDSPRCDEFSNNNDMKGIEN 1139
BLAST of Cp4.1LG13g03060 vs. TAIR 10
Match:
AT5G53440.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cytosol; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 426.8 bits (1096), Expect = 5.9e-119
Identity = 412/1149 (35.86%), Postives = 610/1149 (53.09%), Query Frame = 0
Query: 1 MPRGSRHKSSRHGLKDA-KESSDSENDSTLRDRKGKESGS---RVTKDSASSEKRRFESK 60
MPR +RHKSS+H KDA KE SDSE +++L+++K KE S RV+K+S S +KR
Sbjct: 1 MPRSTRHKSSKH--KDATKEYSDSEKETSLKEKKSKEESSTTVRVSKESGSGDKR----- 60
Query: 61 DSKEFYGSENLEMEEH---GHSKRRKERYDEGTTDRWNGGSNDELGVPSKKSKTLVDSKS 120
KE+Y S N E E SKRRK + E +DRWN G +D+ G SKK+K + KS
Sbjct: 61 --KEYYDSVNGEYYEEYTSSSSKRRKGKSGESGSDRWN-GKDDDKGESSKKTK-VSSEKS 120
Query: 121 KRRDESVGFQGDGEEHKKSSGKGEGRHRESSRKEGRSGGGEREREREREREKDRKGREGR 180
++RDE GDGEE KKSSGK +G+HRESSR+E ++ ++EKDRK +EG+
Sbjct: 121 RKRDE-----GDGEETKKSSGKSDGKHRESSRRE----------SKDVDKEKDRKYKEGK 180
Query: 181 SDRGVASEDLRVEK----QVEKNSENVLHSPGLENHLEIRV-RKRTGSFDGDKHKDDIGD 240
SD+ +D K + E +++ SPG EN+ E R RKR GDKH D+ D
Sbjct: 181 SDKFYDGDDHHKSKAGSDKTESKAQDHARSPGTENYTEKRSRRKRDDHGTGDKHHDNSDD 240
Query: 241 VDNRQLSSKNDTVKDGRRKSEKYKDERNREKYREDVDRDG-KERNEL-VKDHISRSNDRD 300
V +R L+S +D +KDG+ K EK +D+ +K ED+ + G K+R++ K+H+ RS+++
Sbjct: 241 VGDRVLTSGDDYIKDGKHKGEKSRDKYREDKEEEDIKQKGDKQRDDRPTKEHL-RSDEKL 300
Query: 301 LRDEK----------------DAVDMHHKRNKPQDSDPDREVTKAKREGDIDAMRDQDHD 360
RDE +D +H+R + +D D + + + RE D RD + D
Sbjct: 301 TRDESKKKSKFQDNDHGHEPDSELDGYHERERNRDYDRESDRNERDRERTRDRDRDYERD 360
Query: 361 RHHAYERDHEQESRRR---------RDRDRGRDRDRDRDHDRDSRRHRSRSRARDRY--- 420
R +RD E++ RR RD DR R RDRDRDH+RD R + R+RD Y
Sbjct: 361 RDRDRDRDRERDRDRRDYEHDRYHDRDWDRDRSRDRDRDHERDRTHDREKDRSRDYYHDG 420
Query: 421 ----SDYECDVDRDGSHFDDQYTKYVDSRGRKRSPN--DHDDSV-DARSKSLKNSHHAND 480
SD E D DRD S DDQ +Y D R +RSP+ D+ D + +RS ++
Sbjct: 421 KRSKSDRERDNDRDVSRLDDQSGRYKDRRDGRRSPDYQDYQDVITGSRSSRVEPDGDMTR 480
Query: 481 EKKSLSNDKVDSDAERGRSQSRSRHGDVSLSSHRRKSSPSSHSRVVTDEYRHQDQEDLRD 540
++ LS+ V E G + + G S +R E ++ +
Sbjct: 481 PERQLSSSVVQE--ENGNASDQITKGASSREVAELSGGSERGTRQKVSEKTANMEDGVLG 540
Query: 541 RYPKKEERSKSISTRDKGVLSVVQEKGSKYTYSEKPSEIEGGNATEMLRDRTLNSKNVDI 600
+P + + S R + E+ T E+ GG +++++
Sbjct: 541 EFPAERSFAAKASPRP------MVERSPSSTSLERRYNNRGG-----------ARRSIEV 600
Query: 601 EESGRRHNNSIDAKDLSSNKDRHSWDIQGEKPVMDDSSQVESYYSKGSQSNPSPFHPRPA 660
EE+G R+N A+D S+ ++ E+ ++D++SQ E ++ + N S F PRP
Sbjct: 601 EETGHRNN----ARDYSATEE--------ERHLVDETSQAELSFNNKANQNNSSFPPRPE 660
Query: 661 FRGGVDIPFDGSLDDDGRLNSNSRFRRGN-DPNMGRVHGNTWRGVPNWTAPLPNGFIPFQ 720
R GV P G ++D R+N+ R++RG D MGR N WRGVP+W +PL NG+ PFQ
Sbjct: 661 SRSGVSSPRVGPREEDNRVNTGGRYKRGGVDAMMGRGQSNMWRGVPSWPSPLSNGYFPFQ 720
Query: 721 HGPPPHGSFQSLMPQFPAPPMFGIRPPLDINHSGIHYRMPDADRFSSHMHPLGWQNMLDG 780
H PPHG+FQ++MPQFP+P +FG+RP +++NH GI Y +PDA+RFS HM PLGWQNM+D
Sbjct: 721 H-VPPHGAFQTMMPQFPSPALFGVRPSMEMNHQGISYHIPDAERFSGHMRPLGWQNMMDS 780
Query: 781 SSPSHLHGW--DANNGIFRDESHIYNGAEWDENRQMVNGRGWDSKAEMWKRQSGSLKREI 840
S SH+HG+ D +N + RDES++Y G+EWD+NR+M NGRGW+S A+ WK ++G E+
Sbjct: 781 SGASHMHGFFGDMSNSV-RDESNMYGGSEWDQNRRM-NGRGWESGADEWKSRNGDASMEV 840
Query: 841 PSQFQKDERSVQDPVDDVSSKEICDENADTVLTKTAEIRPNIPS-AKE----SPNTPELL 900
S KD+ S Q V D S ++D K+ E N+ S AKE SP T E +
Sbjct: 841 SSMSVKDDNSAQ--VADDESLGGQTSHSDNNRAKSVEAGSNLTSPAKELHASSPKTMEEV 900
Query: 901 SETPGPLSRSMDDNSKLSCSYLSKLKISTELALPDLYQQCQRLMDIEHCATADEETAAYI 960
+ P+S ++D+ + YLSKL +S LA +L + L+ EH A D+ TA ++
Sbjct: 901 A-ADDPVSETIDNTERYCRHYLSKLDVSAGLADAELRKCISLLIGEEHLA-MDDGTAVFV 960
Query: 961 VL-EGGMRAVSISSNSAQ-ISLFRPNKNSVFQHAMDLYKKQRTEMKEM----QASSREMP 1020
L EGG R +SNS + +SLF +SVFQ AMD YK+QR E+K + + ++P
Sbjct: 961 NLKEGGKRVTKSNSNSLKALSLFPSQNSSVFQIAMDFYKEQRFEIKGLPNVKNHEAPQVP 1020
Query: 1021 SSERMLEEEQQGMQVVSGGMAFSERKHEEKGFTFNNEDVKAPVSTVDAEMTQAPIKTTGV 1080
S + E + G + E + +++ + + V + A ++T
Sbjct: 1021 PSNLVKVENNDDLNDARNGNSSIEATDMKIADVSDSDTSQKELQKVSSN-AGAKMETETR 1072
Query: 1081 DKAIEADAALGKLEDL-AVEADAALGELEDLASPATREVKCLENSEESVPITNSTEVDMM 1086
D+ + E L AV +D G E +AS +E SEE+V + + +
Sbjct: 1081 DEGSSSPNPDNSPEALNAVSSDHIEGSEEAMASDH------IEGSEEAVALDH-----IE 1072
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023551223.1 | 0.0 | 100.00 | uncharacterized protein LOC111809105 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_023551224.1 | 0.0 | 97.96 | uncharacterized protein LOC111809105 isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
XP_022922430.1 | 0.0 | 97.71 | uncharacterized protein LOC111430427 isoform X1 [Cucurbita moschata] | [more] |
KAG6579324.1 | 0.0 | 96.97 | hypothetical protein SDJN03_23772, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7016825.1 | 0.0 | 96.81 | hypothetical protein SDJN02_21936, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1E8Q7 | 0.0 | 97.71 | uncharacterized protein LOC111430427 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1I6E2 | 0.0 | 96.01 | uncharacterized protein LOC111471538 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1I7J4 | 0.0 | 94.92 | uncharacterized protein LOC111471538 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1E442 | 0.0 | 94.94 | uncharacterized protein LOC111430427 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1E3D0 | 0.0 | 97.72 | zinc finger CCCH domain-containing protein 13-like isoform X4 OS=Cucurbita mosch... | [more] |
Match Name | E-value | Identity | Description | |
AT5G53440.1 | 5.9e-119 | 35.86 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |