CcUC05G103030 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC05G103030
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Myb_DNA-binding domain-containing protein
Location: CicolChr05: 30802694 .. 30821079 (-)
RNA-Seq Expression: CcUC05G103030
Synteny: CcUC05G103030
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGGATGCCCATCTCTCTCTCTCTCTCTCTCTGCTCAAGGATCTATTTTGTTCCTTATCGTCTTTCTTGCGCGTTCTCTTTTGAAGCAGATTTTCCACCAGCCAAGAAAGCGAGATAAAGGAGTCATTAAGAAGATTGCAAACCCTAGAACGTGGAAGAAGAAGAAGAACTCTTTTAGTCATCGTATTTTGTTTTCTCTATCTGATTCAAGCCCTAGAATTCATTTGCTTCGCCGAGGATTGTGTGTTTGAGGGTTTAAGGAAGGGTATATTGGTTGCGAGAGTTAGTTTGTGGGGTTATTCAACTCAGTGCTCAAATCTTTTTTGAGCCATCGTTTGTGTGATTATTTGCCTTGCGAACCACAGTTCTGGTCTGTGGGGTGGGTTTTTGGTTCGTGATATTGGAATTGGAATTGGCCGCGGTGGTGGGGGCGAGGTGCTAGGGTTTTTATACTGGTTTTTGTGTGTGTGTTTGTGGGGCATCCGTTGTTCGTTGATGCTCATGCAATTTTCTTGCTTTTCATGCCGCCAGAACCTTTGCCGTGGGACAGGAAAGACCTCTTCAAGGAGAGGAAACACGAGAAGTCGGAGGCCATAGGGTCTGCGGCCAGATGGCGGGACTCTCATCATGGATCTCGCGAGTTCAATCGGTGGGGTTCTGCTGACTTTCGAAGGCCTACTGGTGAGTTTTGATTTTCCTCTACTGTTTCCTCCTTTTGAACTTCTTAGATCGTAGGGCGTTCAAGCAGGTATATCATCATCGTGTTTATTTTTTAATATTCTTGAGGTTTTGGCCGGGGTATTTTATGAAACTTCGGATGATAAAGGATTCCTGTTCTTATCCCTGCTGAGATTTGCCCTCCCATCCCTGCATATATATTTCTTTTAATCTTCTGGTATTGATCCTCTTTGTTGTCTTTTTTCCCCCCGTCGAAGGTCATGGTAAGCAGGGTAGTTGGCACCAGTTTTCTGAAGAATCTAGTCACGGTTATGGGCCTTCTCGGTCATTCAGTGACAGGGTGTTAGAAGATGAGAGCTTCCGGCCGTCAGTTCCTCGTGGAGATGGAAGATATATTAGAATCGGTAGAGAAATTAGAGGTTCTTTTAGTCATAGAGACTGGAGAAGTCACTCCAGGGAGACCAACAATGGATTTGGGAACCCATCGCGAAGGCCATCATCGGCATCGCAGGATGTGAGTTCTGATCAGAGGTCAGTAGATGATACGGTGACATATTCCTCTCCTCAATCTGTTCATGGGTTAGAAAATGGCCCGAGGGCCGATGTGGAAGTTTCCCTTGGCTCCACTGATTGGAAGCCACTTAAGTGGTCCCGATCTGGGAGTTTGTCTTCCCGGGGATCTGCTTACAGCAGTTCGACAAACTCGAAGAATGAAAAGGCTGATTTACCTCTTAGAGTTGCATCTCCTATAGAAAGCCCTTCTGCCGAAACTACTGCCTGTGTGACATCTTCTCTGCCTTCTGAAGATGCAATTTCTAGGAAGAAGCCAAGGCTTGGATGGGGTGATGGATTAGCCAAATACGAGAAAGAAAAAGTTGAGGTTCCTGATGGAAGCCTGAAAAAAGAAGTGGCTCTTCTTTCAAGCGCCAGTGCTGAATTAACTCATTCCCTTGGTTCAAACTTTGCTGAGAAAAGTCCCAAAACTTTGCCCTTTTCAGATTGTGCATCTCCTGCAACTCCATCCTCTTTTGCCTGCAGTTCATCATCAGGTAAACTTTTTCCATAAAAAAAATAGACGGGATAATTTAGTGCCTTGAAGTTCTTTATACTTGTAATTATTTTAATGTGTGTGATAATCACGACAAGATTGTATGCTTAACATATTCCTACTACGTTTTTTATTAATTTCTTATCAATTGTTCTAAGCTCATGTTGGCACAGTCAATTTAAAGATTTAAGAATGAGGAATGGGATAGTAGTCTACTATTGAATTTTTGCATGGCTTATATGTGATGTATGCTGTACTGTAGTTTTGATAATTTT
TTTAGTACAAAAGCGGTGATGGGGATCGAACCTCTGATCTTAAGGAAGGAAAGCTATGCCAATTACCGTTGAGCTAATCTCACTTTGGTGTATTGTAGTTTTGCTATGGCACGTGCTATTATTAGGTGGACACTTTTGCATGTGTGGTTAAAGGTTTTGAGGGTTGTGAAGAGTAAGCTTTAGTTGAACCTGTTTAAAGCAAGTTGCATAGAGTATAAACTTTGAATTAAAGGAAAGAAGTTTGAATTAGCCGATGGAGAACCACATGTTTTGTGGTAAAATTCCATGTATTTAGCTTGCATTGTTTTCTCTGATGTTGGACTTTTTGTTTCTATTCCTTCAACTAAATCTCACTATTAAACAATGAAAAATTGGTTTCTATTAAGGTTAAGGGCAGTTTCTTTCTGATTTACCTGTAGACCTTTTATTTTCCTATTTTTTGTGATCACCTTGGGACTAAACTTTTTGTTACTTGATTGATTGATACATGCCCTCCTGAACTCTCTAATTATAATTTTCTTAACCAATTTTGGTCCTTCATTATTTTGATTCAGGGGATTTTTGGAATATTTCCCCATAGCCCGTCAGTTTGGTTGTGGTTTTGTGGACTGCTTTTGTTTTTCTGTGTTTATTTCATTATAGTTGCCTTGTATCTAGAAAAAGCGATTGGGTGGGTGTTTTTAATTCTTTTAGTAGCTGTTGTGATTGAGAAGTAACTCTGTACTTGAATTTCACTTTTAGGAATTCTAGTATCCGTTTTGTTTCTTTCTTTCTTGCCACTGTATATGCTGAACAATCCTTTTTATTTCTTCTTACTATTTTCTTTTGATTTAAATGCAATAGGCTTGGAGGATAAACCATTTAGTAAGGGAGCAAGTCTGGATGGCATGATATGTAGTTCACCTGGGTCCAGTTCTCAAAATCTTCAAAAATTATTGTCTAGTATAGAAATGATGGAGATCTGTTCAATTGCTAATTTAGGATCGTCACTTGTCGAACTGTTTCATTCTGATGATCCGAGTACAGTAGAATCATGTTTTGGGAAGTCTACATTGAATAAGCTGCTAGCATATAAAGGTGAAATTTCAAAGAAATTGGAGACTACAGAGTCTGAAATTGATTCTCTTGAAAATGAACTTAAATCTTTGAAATCTGGAAATGGAGGCAATGTTTCTCATAAAAAATCTTGCAGTGTCACACATTTGGTGGAGAATGTGACATATTTCAAAGAACAAGATGGTGTCTCTTGTGTTGCCCCTCGTCCTGCTCCCTTGGTGATTGTTTCGTCTTCTGATGCAACAGTTGAGAAGATGCCAGTCTGCAAGGGTGACATGGGAGTTGAAGATGTTGATACAAAGGCTGATGAAATCGATAGTCCTGGAACTGTGACATCAAAATTTAACGAACCATCCCGAGTGGTAAAGACTCTTGCTTCTGATCTTGTGGTAAATGGTCATTGCTCTGAAGTTACAGATGTAATTGTCCCTGACAAGATGGAAGGGAATTTTCCTGTATCTAGGTCGTTTGTGGACGAACATAAAACAATTGGCTCTGACAATGAATGCATTCTTGCTAAGAGTTGTACCAAGGAATCTATTTATGGTGATTTGATGGCCCAAGCTGGCAGTAGATCATCTCTTTGTGATCATATTTTTGTGTGTAATAAAGAATATGCAAGTAGAGCTGCAGAAGTAATTTTTAAGAAATTACCAGTGGAAATGTGCAAGATCAGCAGTAAAAGCACCAAAATTGTGTCCTGCTCAGAGATTGAGAAACTTGTTAAAGAGAAATTTCTAATGAGGAGGCAGTTCTTAAAATTTAAGGAGAGTGCATTAACCCTCAGATTTAAAGCCTTGCAACAATCATGGAAAGAAGGTTTGCTGCATTCTGTGAAGAAATGTCGCTCAAGGCCACAAAAAAAGGAGTTGAGTCTAAGGGTGTCACATTCTGGCCATCAGAAGTACAGGTCTTCAACTCGCTCCCGTTTGGTTCAGCAAGGT
AAGATTACTTTCATCTCTCCTTAGCAGTATATCTTTTCGTTTCCACTCCAGACTTACAAATTCGTTTTGAAGTCAAACAAATTCTTCAAATGCTATATTGTCTATTTTCTACATCAAAAATGGTTTCCTCTATTTGTGTTGTTGACAAAACTACAATGTGTATGTATATTTAAATGAGTTAGAATGTATACGATGAAATTCTTGGAATATTCTCATCTGCAATGATGTATTAAGTTTTATTGTATTATTGGAACATTTTTCAATTAAAGAAAGCAAGCCTGTCTCTACACTTCTTGTTGTGATCGACAATTCATGGGAGTTATCGTGTATTTTATACTTCCTTAATTTTACTGTTGTTGGGTTGGTTGATTTTAATTTGTGGATTGTGTAACTAGGCTTGATATGTTCTATTGAGTTGAGTTCATTGAGAATTGTGTTATGTTAGGGATTTTTTTCTTTTTCTTTATTCGACAGAACATTTTAGTTTCCATGATCCCGGTTATGCCTCAAGTAAGACAGGTTCAAAAGATTTAGTGGATGTGTTCAATATTTCCCTCTTGTAGACAGAGATCTCGGATGAAACATCTTCAAAGATATGCATGTATTTTTTTTTATCATTTATTTTGTTGAAGTTTACATTTTTTTTTTTTTTGACATCTTCAATGTTGTTTGTGGGTCACTAGATTCAAATCTTCGTAGTAGTCCCTGATGTGTATATTTTTCTCTTCCCCTTTGTGATTTGCTTGGCAGCCAACTATTAAGCTATCTCATTCCAAAACTGCCGCGATTATGATTTGTTGGATAGTTGGAACACGAGGCTGTCATTAAATTTTCTCTAGCGATCATGATATGCCATTATGATATAATGAATACTTAAATTTTGAATTTTCAGGAGCATGTCAGAACCCTACCCTTAACACAGAAATTGCTGTTCGTTACTCCAGTAAGCTGCTGTTGAATCCTCAAATTAAGCTTTACAGGAATAGTTTAAAGATGCCAGCTATGATTTTGGACAAGAAGGAAAAGATGGCATTAAGGTTCATCTCTCATAATGGGTTGGTTGAAGATCCCTGTGCTGTTGAGAAGGAAAGGAACATGATAAACCCTTGGACTTCAGCCGAGAGAGAGATATTCTGGGAGAAACTATCCTTGTTTGGAAAGGATTTTAAGAAAATTTCTTCATTTCTCGACCTCAAAACCACAGCTGACTGTATCCAGTTCTATTACAAGAACCACAAATCTGATAGTTTTAAGAAGAATAAAAATTTGGAGTTGGGCAAGCAAGTGAAATCTTCTGCCATCACATACTTGGTTACATCAGGGAAAAAATGGAATCCAGACATGAATGCTACTTCCCTCGATATCTTAGGTGTTGCTTCAATAATGGCAGCACAAGCAGACTACGATATTGGAAACCAGCAAAAATGTACTCGCCATTTGGGTATGGGAAGGGATGTTGAGTCAAAAGTATCATTTAGTGCTAGCACTCCTTCAAATAAAAACAATTTGGATGCTCTTCAGACTGAAAAAGAAACGGTTGCTGCTGATGTGCTTGCTGGTATATGTGGTTCAATATCTTCAGAGGCCCTGAGTTCTTGCATTACAAGTGCTATTGATCCCAGTGAGGACCACTGGGAGCGGAAGTGTTATAAAGTGGATTCTGCAGTGAAATTGCCTTCGTTGTCTGACGTCATACAGAAAACTGATAATGAGGAACCTTGTTCAGATGATAGTTCTGAGGATGTAGATTCTTCAAATTGGACAGATGAGGAGAAGTCGATATTCATGCAGGCTGTGTCGTCCTATGGTAAGGATTTTGATATGATCTCTAGATGTATCAGGTCAAAGTCTAGGGACCAGTGCAAGGTTTTCTTCAGCAAAGCTCGGAAATGCCTTGGACTGGATTTGATGCATACTTCTGGAGATGTAGGCGAAACACCTGGGAGTGGTAATGACGCCAGTGGGAGTGGGACTGACACAGAAGATCACTGTATTGTTGA
AATCTGTGGAGCCCATGGTAGTGATGAATTTGTCTCCAAGTCAGTCAACGGTGTATCAACATCTGTTAACATAAATCATGAAGAATCTGTTTCTGCTGTGACTGTCAACATGCGGACCAGTAGTGAATTTGAGGAAAATACAGTATTGCAACAGTCGGATGAGAAATGTGCTGAGGCTGTTGGAAACTTGATTTCTGAGATATCGAAGGAAGAGGATTTGCCTAGTCCAGATTCTCATTCTGCCTACAATCTCACAAATGCAGCTGCTTCTTTGAGCCAGCCCGTGCATGACCACAAAATTGAAGGCTCTTCTGAAAATACCGAAGGTGGAAGCAAGTGCTGTAATGAACCTGACATTCTGAGATCTGAATCGGTCTCCACTGTTGATGAAAATTCAGCTGCTGTGAGCGAGAGCAGAGCTACAGCGAAGCTTGCATTTGGAGGAGAAGAAGAAGGAAGGAACACTAATTTACATGTTCAGAGTATATTGCAGTGCTCTGTTCAGAATTCAACTGGGTTTGATTCCAAACTTGCTTTAGAGGGCAGCTCCTTAGGACTTGATCCACAAATCTTGCATCCAACCGTTCTTAAAGTGGAACATGTAGAGAAGTCTTGTGTTGAGTCTGAGAACTCTCTTGCTGTCGGGAATTCTGAACCTGGTGTCATTGGAAGGGAACAGATGCTTAACCAATATATGTTGTCATCAACAGCTGTCTTGCAGGAGGTTAGTGATGCGCATCAGAAGCCTATGAATAGAGATGACTATGCTGAGCATCAAAATAATTTGTCGCACGATAGTGAATCCAAGTTTCCAAGAAGCTATCCTTTCAACAAACAAATCTTTGAGGACATCAATAGAAATATCAATCGCACATATTTTCCTGTTGTTCAAGGGCTGTCAAAGCCAGACATCAATTGTAGCAGTTCATATGTTTCTGAGGGCCACTATCTTCAGAATTGTAACAGTTCCAAGCCGCACAACCCGGCGGAGCTTCCTTTTTTGCCTCAGAATGTAGACTTGGGTCATGATCGTCAGAAGAAAGCTTTATGCAGTGGCAGTGCTTCAGATTCTGATGTTCCACGCAGGAAAGGTGATGTGAAACTGTTTGGTCAGATATTAAGTCATGCCCCTTCCAAGCAAAATTCGAGTTCTGGTTCGAACGAGGGTGGAGAGGAGAAGGGACTTCACAAATCCAGCAGCAAATCATGCGACATCGGAGAAAATGTTCCGTTAAGGAGTTACGGTTTTTGGGATGGAAGCAGAATACAGACGGGTTTGTCTGCTTTGCCGGATTCTGCCATTTTACAAGCCAAGTATCCTGCTGCATTCAGTGGCTACTCTGCTACGTCTGTTAAAACTGAACAGCAGCCATTGCAGGCACTCACAAATAATGGTGACCGAAGTCTTAATGGACTAGTGTCCGCTTTTCCAACCAAGGATGGAGTTGTAGATTATCATTCGTATAGGAGTCGAGATGGAGTTAAGTTGCGACCTTTCCCAGTTGATATATTTTCTGAGATGCAAAGAAGAAATGGCTTCGATGCTGTGTCCTTGTCAAGTTTACAGCAGCAGGGAAGGGTGCTAGTTGGAATGAATGTTGTTGGAAGGGGAGGGATTCTCATGGGTGGTTCTTGTACTGGTGTTTCAGATCCTGTAGCAGCCATTAAAATGCACTATTCCAAGGCCGAGCAATACGTTGGGCAACCTGGTAGTACATTCACTAGAGAAGATGGGAGTTGGAGAGGAGGTAATGGTGGAGATTTAGGCAGCAGGTAGTAGATACGCATTGGGGGCCTATGCCTGGCCGGCCAGGGAGGCCCTCGCCTGTATCATAATTAGTCTGTTTCAAGTTTTCTTAAAAAAGGAAGTGTAGGGTAGGAGTAATTTGAACCAATGGGTTCTGAAAAATCCATCTTTTTTGGTTGAAGAAAAAGGGAAGAGATTTTAGGGGATTGTAATATTGTAGCAGTCTTTTGTATTTGTTGTATTTGATTGCAAC
ACAGAAATAATCTTCTGAAAATGGGAGCGGGCATAGTGTTACACCTCAGCTCCTCTTACAGGTCGATCCTTCAACTTTGATAACCTAACCTACCTGACTCGTGACATTATCCAGTGGAGGTTAGTTGTTTTGGTCAGCTTCTCCTTTTTTTTTTTTTTTTTTTTGACATCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNACCCCCCCCCCCCCCCCCCATACTTGTTTTGTATGATTAGGAAATTAGGAGTATTTTGTAGAATTTTGCTTATAATATAGTTATCAAAATTTGTATTGTATCATATTTACTTGTTTGTATTTGAGTTTATGACTAATTTGAAGAATGATTAGTCTTGGAAATGGGGCAAATTATTGAATGTTGCTTTCACCTTCACCGTTATAGCCAATAAATGTATGAAAAAAGAAGAATTCTTTCAATCATATGTTTAACGGCTGGATGTGATGGTTACTTTCAATAATTACTTCAAAATAATTGTTTACTTGAATGAGACCCGTGGGGTCTTAACATCTTTAGAATTATTCTCAAGGCATATTCACAAATTTCGGTGTGTTATGAATGGATAATTTTAGTTAAATAATCTTTTGAAAAAACTAGATCAGCGTTGATGTTGGTACTCTAGATCGGGTTAAAAGATCAATTTTTCTAGCATCTACTTTATTTATTTCAATATTTTTAGATTAGTTTTCTAATTTAAAACATATACAAGAAGGTTTGATTTCGAAGCATTCTCGTTAATTTTTCAAATAAAAGAATAAAAATAAAAATAATTTTTGTAAATAGAAAGTTTATCTATTTGTTTATTTTTAAAATTAGAGTTTAATAATTGAGTCAACCGATATTCAATCATGAATGAGGATTTTAGGGTGTGCCGGCAAATTATTGAGCGAACATGGCATGATTTCATTAGATTGTGTGCTAATATAATATTTTTTCTTTAAGCCTTAAGACAGTTGGAATGCTTTGTCAATCTTATCAAAATCATTTTTCAATGTTCTTTTTCTTTTTTATTTTTTTATTTTTTTATTTCAAATAATATGTTAAGAGGTAGAGGTTATCTTTAATAAGTTTAGCACTATAATGATTAATTTCTGCTATCTATATGTATGGATAATGATTATGTTTTTTTATCGCCAAATGTTATTCTTATGTACTAAAAAACTTTGTGGCTTCCCCGTATTATTTGCACACGTTTGACTAATTTTTTTTTTTTTTTTTTAATTCAAGTTCGGAAATAAAACTTCAGCATAAAAGCAAAAATAAATACTTTTTACTAGCCAATCCAATAAAATTGTCTAAACGGAGCATAAATTTTGTATATTTTTTACTAGCCAATCCAATAAATACTTAAATGGGTTAATATATTTACATAAGATTTACACACTTTTAGATTCTTAATTTTATCTTTGATAAACAAATCTTAACCGAGCCCCAATGTGTAACAATTTAATTTGTCAAGTTGAACATAT
GCTAAGACGTGTGTCATTTCAAGTTCTTACCATTCATTATTGAAATATAAAAGATAGTTTGTTGAGACGAACATCTTTAAATCATTATTTTATTAATATATTTGAATGCATGAACATGATATGCCAAATAGTGTTCTAGTATCGTGATTTCATGTTCAAATATGTTAAAAAACAATATGATAAGTCAGCTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGGCAGCTCAGCCATGATTGAAACTTAAACTATTTTAAGGCATCAAGAAAAGAGTTGTAAATGTAACGACCAACGAGATGAAAACCAACCAATGGCTTATAGTTATGAGGTCGTGTCAACGGCGGAGCAAAAAGTTCCAATGCGCCTATCTGTATCAACATCATCGGCCTTCTTAAGATCCAGCAGTTATATTTGCTTTCTGGTAATGATTCGATTCATTGTTAGGTTCTAATCATTGTGAAATTCTTCTTCCTTTTTTTTGGGAAAATGGAATAGGATAGCAGCTGAAATTAGATTTTTTTTTTTGTTGTTGTTCATGATTGAGATTGAAGAAGAGGACGACGACGATGATGATGCTAATTTGATTTGATGTTCTGTAAATAATAATTGATCAGAATTCGTAGGCCCCATGTTTCCCCTGCTACCGCCAAAATTCGAAGACTCGTTAATTTTCCAACACTACTGTTGAATTTCTCTGGTTTTTCTCTCAAGATCCGGTCCACCTTCTCCCTCTTCCTCTATTTTATCTATAGGCCTGTTCTAATCGTTTTTCTTTGGGGGAATATTTGTGCCCTTTAAGAAAAAAATTTCGCTTATGGTTCATGACGATTTGGAACTTCATACGTAGGTATTGAAATGATAGCTCGTTTCTTGTGGATTTTTTTTTTTAGTTATTATTTCTTCTGCTTTTGAGTCTTTGCCGTGGCATGAACGCTCTCTAAATGGTGAAGTTCGTTTCATCGGAAAATTTCGATATATGTTGATCCTTCTTTGGTTAAGATTTTTGCAGTTTTATGATATCGGGTGTTGCATNTGATAGCTCGTTTCTTGTGGATTTTTTTTTTTAGTTATTATTTCTTCTGCTTTTGAGTCTTTGCCGTGGCATGAACGCTCTCTAAATGGTGAAGTTCGTTTCATCGGAAAATTTCGATATATGTTGATCCTTCTTTGGTTAAGATTTTTGCAGTTTTATGATATCGGGTGTTGCATCATTTACTGTATCTCTTACCAGAGACTCCCTAAATTTTTTGTTTCAATTGGTGAATTGATTTTTTTTTTTTTTTTTTTTTTTAAAAAAAAGTAATTTGATGAACTATTATCTATGAATAATTTAATCGAATATCAATTGACATGCCATAAATTTTATTGTCTTTTTAGCAGTTGATGTCAAGTATATGGTGTTTTTAGTTTCATTTTGGTTTGGAAATTCATCCGTTCTTTTCTCCCGGCGACCAGATAAAGAGAAAGAAGACATATGAATGAATAAGAAATGGGGAAGAAAGGGAGTTGGTTTTCTGCGGTGAAGAGGGTTCTCAGTCAGCCTTCTGAGAAGAAAGACAAGGTTTTAATTGAGCTACTTCTGTTCAATTATATTTTGTACTGTTTTTACAATCACATGGCGGTGGCCTTTTAACTTAAGATATATTATTTCAATTTTCAACAGAAACCAGACAAATCTAAGAAAAAATGGTTTCAAAAGGAGGAGAGTGTGGATGTGATTTCCATTTTGGAACAATCTCCATTGGACGTTCCTGCACAACCTCCAATAGAAGATGATGTCAAACAAACCGAACCGGAGAGTGAACCAAGCGAGCTTGCGCATTTGGAGGCTGCGGAGTCGGCTGTTGCTGAAGCTCAGCTGGCTGTGGTGGTTGAATATCCACCTAGTCCTATCTCCTGTCGGCCAGAAATGTCG
GAGGAAACAGCAGCTAGTGTGATTCAAACTGCATTTCGTGGATATACGGTACTTCTATATTCAGTTTACATTAAGTATAAAATTTTTAAGCGATTAGAAAAATCAGTATCTGTTGTTAATGACATAAAGCCTTAAGATTATTTGTTACTCTTTAAGACTACTTCTATCTAGCAGTATTTTAAGAAATTTATAGAATTCATAACCAAGTTTTATCTTTGAGTAGCATATGGATTGGAATTGGTCCCACGTATTTTCTGCAGAATCGGTTTATTGTTTCTTCAAAACAAGTTCTTTTGGAATACATTAGCTTTCAACTTCGATGAAGCATCGGCTGAAAAACCATTTCATATGTCAGTCAAGAGGCTTATTTATGGTCAACGTTGCTGATTGGGAATGAGAACCTCGATATTGGAATTTTAGGCTCATTCTTCTTTGTGTTGTTCCTAAAATTTTGTGAAAGCATTTATAGATATCGTTCAACTCTGTTCTGGATTTTGAAGAGTTTTTTTTTTCTTTTTTTTCCTACTGGATATAGGAAAGCATCAGTAATGGTCTGCACTATGATTTTCCCCTAGTATTTTACATTTCAAACATGACATGTACGAATGTCTTTTTTTTAATTTATTCTTTTTCATTTCTCAATACAGGCAAGGAGAGCATTGAGAGCTCTCAAAGCGTTGATGAGATTGAAGACATTGGTACAAGGGCAATCTGTCAAACGGCAAGTGGCCTCTACTTTAAAATGCATGCAAACCCTTACTCATTTACAGTCAGAGATTCGTGCAAGAAGAATAAGAATGTCGGAAGAAAATCAGGCTCTTCAGCGACAACTTCGTAACAAACGTGAGAAAGATCTCGAGAAGTTAAAATTTGCTGTAAGTTCTCTTTGTTCAAACTTTTCTGAATATAACCTGCTGTGTCTGTAGCGCTTGTACAAACAAGTCTTTCAATGTGCTGTGTCAATATGAGCTTTTGACCTTTATTGATTTGCGATCATTACAACTTGCACTGCAGATGGATGATGACTGGAATGATAGCACCCAATCTAAAGCACAAATTGAAGCTAAACTATTGAACAAGCATGAAGCTGCAATAAGAAGAGAAAGGGCTCTGGCTTATGCTTACTCTCATCAGGTACCAAAATTTTATATTTTCTTTATCCCATAGTACGTGCTTTCTGAGTTTAATTCCTGACTCTGGATCGCAATGTACTTATTGGTCCGAACTTTCTGGGGGACATTCTACCTGGTGAATTGAAGGGTACTTCAACAGGTGCATATAATTTGCACTCTCGATAGTATTCAATCATACCTAATCGCTTTAGTTCTATGGTTGCATTGGCATCATCTGTCGTTATGAGTTCTGCTGATTGTCTTTCTTTGTGATAGAAAAAAGTAACTGGTATCAATAGAATATCTCACTTCACTAACTTTCTCTATAAGGTGGTTTTTAGAAAGTCTCTTGTAGATACCATTAGCAACTCCTTGCAGTTTTTGTTCTGCTCATAGTAGGATTGTTTCTTAAGTTCCGTAGACAATAATTTCCATTGAGTATCGTATCCTTTTAGAATGTCACATGCAAATAAGTTTTTATCTACTATTAACAGCAAACATGGAAGAACTCTTCAAAAACTGCAACCCCCACTGTCATGGATCCAAACAATCCCCACTGGGGTTGGAGTTGGTTGGAGAGGTGGATGGCAGCTCGTCCATGGGAATCTCAAAGCACCACTGATCAGCCGGATCACATTTCTGTCACGAGTGTTGCAACTCGTGCATCTGTGGTCGACATTCTCCAAATTTATGCTCGGCGTGATCAGAACCCTTCTACCAAGCTTTCTCCAAAGTCTCCTACAAGCCAAAAGTCAAGCCAATTGCCTAAGTATCATTCACCTTCAACTCCCAAGGCACTATTTTCATCATCTGGCAGAAAGAAAACAAATGCAGCCAACTCAAGGATAGGCAGCTGGGGCAGAGACGAAGACATAAGGACCATCACTA
GTGACAAATCAAAGCTCTCCCGGAGGCATACTATTGCAGGATCATCATTCAGGGATGATGAAAGCCTTGCCAGTTTACCTTCAGTTTCAAGTTACACGACGCCATCGAAGGCTGTGAAGAATCGGTCCCGGTTGGCAAGTTCATCTAGAACAGAGAAGAAAGGAACAATGGAAAAAGGATCTGCACGTGCAGGTTCTGCAAAAAAACAACTTTCCTTTTCAGCTTTTCCTGTTAAGCCAAGGAGGCAATCTAGTCCTCCAATTGTGAATACTAGCTAAAGTACATTAAAAGATAAGAAAGTTTGCATCAGAGAGAATGATAGTATCATGGATGAAACAAATTGATCGATCATTGAGTTCGTGGATTTGAGGATCCGTACTTCCATTATCTGCTTTCTTCAACAGCTTCTACTTACCACGTGAGTGAACAAATATATATATATATATATCTTGCAAGCGCCAATCCATATCCATGGTCTTTTTAGTTTCTTTCTTTCTTTTTCTTTTCCATGCCATTGATCTTCCTGAAATTCTTATCTGTATGCAATAAGATTTGTTTTTGGGTTACATTTATCTATTGAAACCATTTGTAAATCGGTTTGTCTATAAGATATTGTGAGGATTCTTCTGGGTCTTCTTTTGAGACAATTCTCTGACTGGTTCATCTTGTATATTTCAAATTCCATTTGTTTAAAATAACAATATTTTGGAGTAGTGTTCTTGTAGTTTCTTTTCTCACTGTAGATTCCCCTATATCTCTTTTCTTTGTCCTTTAAGCAACGAATTTTTGAGTAGCTGTCAAAGTGGGAATTGGTATGTGTCCCTTGACTTTTCTGAGAAAATTAGATGTATATGTATAGAAAACTATATTAAAAGTATATAGTCTCAGTTTCATCATTTTAATTTGCAGTTAAATTATCCGCAAACAAAGAATGAGAACTTCATGTCAAACAACAAAGTAGAATCAAAATTTACATCTTTGTACAAGTCATTAATGCTTCAGCTAAGTGAGTTGAAACATGTATAATTTATTAATATTAATTACCTTCATTACTTTGTAGTTGAAGTATTGGAATTAATCATCAAATTTAGTAAACATATATATAAGTTATCATGAAATTATCTTTGAATAAGATAATCTACACAAACACAAAAATTTATGGTATATGACTTAGCAGTAATCGATATACTTATTTTCTAGAGGTTCAAATTATCATACGTATTATTTTATGATTAAAAAACATACCTATAAACAATAAGTATAGTTTCAATTACACTCCAAACATTAAAAAGTTTCATTTTAACATTTCAAAATAACCTTTAATAGATTTTAAAAAACAAAGACACATGACACATATGTGATCAAAAAACGTTCTCTCCCATATTTTGTTTATAATTTTTTTTAACCTCAACTCCTAAATTTACTTTTCAGGCGTATTTAGATTTGTTCACTTGAACAAGATATAAGTAAGTACAAAATTTTCTGTCCATGAATTTAACCATAAAAGCCTCTCACAGAGTTGTTTGAGGAAGAAGAAAAAGAAAAGGAAAATACCCTAATTTTTAATTTGTTGAAATTGTAAATTCTTGGACGAACCCATTTTTAAGCCCAAAAAAAAAAGAAGACCTTTCATCATCGTCCCCTGAAAAGTGAGAGCCAAAGGTGCTTTGCTTCGTCATTGTAATTCCGATAACTGAATCGGGTTTGATTTTAAATCGAAAATTGGATCCGGATTTCTGAATGAAATTCCACATTCTCTCATCTTTCACTGGAATTTTACTCAAATGGTGAGCAAAATCCTTCTCTCTCTCTCTCTCTCTCTCTCTGTTCTACGTTCTTATGATCCACATTTGTTTTTCGTTTGTAATTGATGTTACCTTTGATCCTTGCGAGGAGTTACTACTTGCTTCTTATTGGACTATAGTGTAAATCATTTCAATTTTGCCCTTGAAATTTTATTGTGGATATACTGTATTATGGTTTCGTGTTGCTGATATTTC
ATTGAGTTTACATCGCCTGGGTTTTGATTTTGGAGTCGTGGCCAAAATATTTTGATCGACGATTTTCAATGGATGCGAGGTCAACGCACTATATATGGAAATGGAATCAGTGCATCTTTAAATACTAATTATAGTGGGTTATTTGATAAAACTTGTCTGGTGGTGGTGAACTTAAGACTGGGAAAGTGCTCTATTGAACGAGATTTTGCTAATTGGCAGCATTGAACAATTGCACAATATTGATTAGGATCTTGGAGAAAAGTGATGGAACTCAATATTGTTTTTCACGGTACAAACTGAAACTGGGGAAGGATTGGAGAGCTCTTCCTCTCACAGACTTGGAGTATCTCAGTTCACGAAGGGATACCCTCTTTCTTACACAGATCCCCTAAATACCCCATTACTCAAGACCTGTTGCATCCAATGACTCAGTTACAAAAATGATCCCCTTCTAACTAACTCGCAATTCTTTGATGAACAGCTATTATTTAATGCCACCTTACAAGCGTCGGTTACTAGACCTTTTTGTAATTATGAGCTTATTCTTATTCTTTTCGATTGGAGCCCGTTTTTGAAGCTTGTGTCATACCCCATTCCTTTTCATCAGGCTTGTTTTTTTGTATACCCTGTATATGCTTTCATTCTAGTTTATTATTTAGAAAGAAAAAAAAAAAAAAGGAAGGGTATGGGTTTCAAGTTTAACTCGCCTGTAGTTATTTGGATTTGTAAATGATTTTTACGTTCAACTACGAGTTTCAAGTCCTGATGTTGCAATAAGACTAACAAACTTCTAGGTGTGGATTAGGCTACACCATTCCTCGCTGGCCTTGCGGTAGCTGCTGCAGCTCTAGCTGGTCGATATGGAATTCGAGCTTGGCAAGCATTCAAGACACGGCCACCACAAGCCAGATCACGTAAATTCTATGAAGGTGGTTTCTATCCTACGATGACAAGGAGGGAAGCAGCTCTTATTCTTGGTATTAGGTAGTTCAACATACTTGAATTTGATCTTCGTAAATAAATAATCAAATTTTAGCTGGTTAGATCTTCCATCACTCATGATGTTACATTGCATCTTATTTATTTTGATAATCCAATTTTTTTAAGTTCATAACTTGGTGCACAGTTTACTCGAGTCGTAGGAATCTTAAAGACATTCAGGTTTTTTTTTTTTGGCAGTTAATGGTCACGTTCATATTCACATAACAGAACCCAATTAGTAAACTACTTCGTAGGAATTAACCATCATGGGTTAACCTAGTACTAAAAGAGGAAACGGTCTAATAAATGGCTAAGAGGTCATAGGTTCAATTCATGATGGTCACCTACCCAGGAATTAACATCTTACGAGTGGAATTAACATTTTACGAGTTTCCTTGACACCCCTCTCTAAGAAAGTTAAGAATCTTTACTTCTTTATTTCTTCTTTCCCTTACATTTCATAATTGGAACCTTTGGGAAACATATTATCTAGATCATTCGTTTTCAGTACAGAGATCAATGAGACACCGACAAGATAATTGATTCTGATGTTTCATGAAGAGATTTTGAAATTACTTACTTTTTTTAGGACAGATCTCGAAAGCAATGGTTCCATACTTGAGTCCGAATGTCTATTGAAAACATGGCATCTTAAGCAATGTTCACTTGCATATATTGTCCTTATTCTCGAGAAAAACAAAAGAACTACTGGTGGGCTTGTACTAACTTTTCCACTATAAACATCTTGTTGAATGAAACAACACTGAGAAGACATCTGTATTTATTTATCTATTGCTTCATTCACGCCATTAAGTAGCTTACTTCGTAGGGATTAACCAGAGTTATCCTTCATGGTTTACAGAGAGAATGCAACTCCAGATAAGATTAAGGAAGCACATAGAAGGGTCATGATCGCGAACCATCCAGATGCTGGTGGCAGTCATTATCTTGCTTCTAAGATCAATGAAGCCAAGGATGTGCTACTCGGAAAATCGAAAAGCAGTGGATCAGCATTTTAA
TGAGGTTTGCTCCTTGAATTTTAGGATCCTTGCAAAAAAGTGGAAAAAACTTTCGTTTCCATATGAACTTCAAAGTTTAAACAATCTGCCAAACGCTATGCTCACCAATCGATTTTGTTTCCTTTAGTCTGAGAAACAAGATACAATTGTTAGACTTGAAGTTGACCTTCATGCTGAAAGTTCCTTTGAACATAAAAATAAGTATGATTGTGAAGTGCATCCTATAATCGGATTCATGGAACAATGTTGTCCTTTCATAAGTATCATTCACTTATCTAACCTCTACCAGGACTAGTCATCTGAATGCATTTTCTCTAGACATTTATCATTTATCTATTTTAAGTCTCCAGTGATATTGAATCTGTACAGCAACCTTTGTCGAGTTT

mRNA sequence

CGGATGCCCATCTCTCTCTCTCTCTCTCTCTGCTCAAGGATCTATTTTGTTCCTTATCGTCTTTCTTGCGCGTTCTCTTTTGAAGCAGATTTTCCACCAGCCAAGAAAGCGAGATAAAGGAGTCATTAAGAAGATTGCAAACCCTAGAACGTGGAAGAAGAAGAAGAACTCTTTTAGTCATCGTATTTTGTTTTCTCTATCTGATTCAAGCCCTAGAATTCATTTGCTTCGCCGAGGATTGTGTGTTTGAGGGTTTAAGGAAGGGTATATTGGTTGCGAGAGTTAGTTTGTGGGGTTATTCAACTCAGTGCTCAAATCTTTTTTGAGCCATCGTTTGTGTGATTATTTGCCTTGCGAACCACAGTTCTGGTCTGTGGGGTGGGTTTTTGGTTCGTGATATTGGAATTGGAATTGGCCGCGGTGGTGGGGGCGAGGTGCTAGGGTTTTTATACTGGTTTTTGTGTGTGTGTTTGTGGGGCATCCGTTGTTCGTTGATGCTCATGCAATTTTCTTGCTTTTCATGCCGCCAGAACCTTTGCCGTGGGACAGGAAAGACCTCTTCAAGGAGAGGAAACACGAGAAGTCGGAGGCCATAGGGTCTGCGGCCAGATGGCGGGACTCTCATCATGGATCTCGCGAGTTCAATCGGTGGGGTTCTGCTGACTTTCGAAGGCCTACTGGTCATGGTAAGCAGGGTAGTTGGCACCAGTTTTCTGAAGAATCTAGTCACGGTTATGGGCCTTCTCGGTCATTCAGTGACAGGGTGTTAGAAGATGAGAGCTTCCGGCCGTCAGTTCCTCGTGGAGATGGAAGATATATTAGAATCGGTAGAGAAATTAGAGGTTCTTTTAGTCATAGAGACTGGAGAAGTCACTCCAGGGAGACCAACAATGGATTTGGGAACCCATCGCGAAGGCCATCATCGGCATCGCAGGATGTGAGTTCTGATCAGAGGTCAGTAGATGATACGGTGACATATTCCTCTCCTCAATCTGTTCATGGGTTAGAAAATGGCCCGAGGGCCGATGTGGAAGTTTCCCTTGGCTCCACTGATTGGAAGCCACTTAAGTGGTCCCGATCTGGGAGTTTGTCTTCCCGGGGATCTGCTTACAGCAGTTCGACAAACTCGAAGAATGAAAAGGCTGATTTACCTCTTAGAGTTGCATCTCCTATAGAAAGCCCTTCTGCCGAAACTACTGCCTGTGTGACATCTTCTCTGCCTTCTGAAGATGCAATTTCTAGGAAGAAGCCAAGGCTTGGATGGGGTGATGGATTAGCCAAATACGAGAAAGAAAAAGTTGAGGTTCCTGATGGAAGCCTGAAAAAAGAAGTGGCTCTTCTTTCAAGCGCCAGTGCTGAATTAACTCATTCCCTTGGTTCAAACTTTGCTGAGAAAAGTCCCAAAACTTTGCCCTTTTCAGATTGTGCATCTCCTGCAACTCCATCCTCTTTTGCCTGCAGTTCATCATCAGGCTTGGAGGATAAACCATTTAGTAAGGGAGCAAGTCTGGATGGCATGATATGTAGTTCACCTGGGTCCAGTTCTCAAAATCTTCAAAAATTATTGTCTAGTATAGAAATGATGGAGATCTGTTCAATTGCTAATTTAGGATCGTCACTTGTCGAACTGTTTCATTCTGATGATCCGAGTACAGTAGAATCATGTTTTGGGAAGTCTACATTGAATAAGCTGCTAGCATATAAAGGTGAAATTTCAAAGAAATTGGAGACTACAGAGTCTGAAATTGATTCTCTTGAAAATGAACTTAAATCTTTGAAATCTGGAAATGGAGGCAATGTTTCTCATAAAAAATCTTGCAGTGTCACACATTTGGTGGAGAATGTGACATATTTCAAAGAACAAGATGGTGTCTCTTGTGTTGCCCCTCGTCCTGCTCCCTTGGTGATTGTTTCGTCTTCTGATGCAACAGTTGAGAAGATGCCAGTCTGCAAGGGTGACATGGGAGTTGAAGATGTTGATACAAAGGCTGATGAAATCG
ATAGTCCTGGAACTGTGACATCAAAATTTAACGAACCATCCCGAGTGGTAAAGACTCTTGCTTCTGATCTTGTGGTAAATGGTCATTGCTCTGAAGTTACAGATGTAATTGTCCCTGACAAGATGGAAGGGAATTTTCCTGTATCTAGGTCGTTTGTGGACGAACATAAAACAATTGGCTCTGACAATGAATGCATTCTTGCTAAGAGTTGTACCAAGGAATCTATTTATGGTGATTTGATGGCCCAAGCTGGCAGTAGATCATCTCTTTGTGATCATATTTTTGTGTGTAATAAAGAATATGCAAGTAGAGCTGCAGAAGTAATTTTTAAGAAATTACCAGTGGAAATGTGCAAGATCAGCAGTAAAAGCACCAAAATTGTGTCCTGCTCAGAGATTGAGAAACTTGTTAAAGAGAAATTTCTAATGAGGAGGCAGTTCTTAAAATTTAAGGAGAGTGCATTAACCCTCAGATTTAAAGCCTTGCAACAATCATGGAAAGAAGGTTTGCTGCATTCTGTGAAGAAATGTCGCTCAAGGCCACAAAAAAAGGAGTTGAGTCTAAGGGTGTCACATTCTGGCCATCAGAAGTACAGGTCTTCAACTCGCTCCCGTTTGGTTCAGCAAGGAGCATGTCAGAACCCTACCCTTAACACAGAAATTGCTGTTCGTTACTCCAGTAAGCTGCTGTTGAATCCTCAAATTAAGCTTTACAGGAATAGTTTAAAGATGCCAGCTATGATTTTGGACAAGAAGGAAAAGATGGCATTAAGGTTCATCTCTCATAATGGGTTGGTTGAAGATCCCTGTGCTGTTGAGAAGGAAAGGAACATGATAAACCCTTGGACTTCAGCCGAGAGAGAGATATTCTGGGAGAAACTATCCTTGTTTGGAAAGGATTTTAAGAAAATTTCTTCATTTCTCGACCTCAAAACCACAGCTGACTGTATCCAGTTCTATTACAAGAACCACAAATCTGATAGTTTTAAGAAGAATAAAAATTTGGAGTTGGGCAAGCAAGTGAAATCTTCTGCCATCACATACTTGGTTACATCAGGGAAAAAATGGAATCCAGACATGAATGCTACTTCCCTCGATATCTTAGGTGTTGCTTCAATAATGGCAGCACAAGCAGACTACGATATTGGAAACCAGCAAAAATGTACTCGCCATTTGGGTATGGGAAGGGATGTTGAGTCAAAAGTATCATTTAGTGCTAGCACTCCTTCAAATAAAAACAATTTGGATGCTCTTCAGACTGAAAAAGAAACGGTTGCTGCTGATGTGCTTGCTGGTATATGTGGTTCAATATCTTCAGAGGCCCTGAGTTCTTGCATTACAAGTGCTATTGATCCCAGTGAGGACCACTGGGAGCGGAAGTGTTATAAAGTGGATTCTGCAGTGAAATTGCCTTCGTTGTCTGACGTCATACAGAAAACTGATAATGAGGAACCTTGTTCAGATGATAGTTCTGAGGATGTAGATTCTTCAAATTGGACAGATGAGGAGAAGTCGATATTCATGCAGGCTGTGTCGTCCTATGGTAAGGATTTTGATATGATCTCTAGATGTATCAGGTCAAAGTCTAGGGACCAGTGCAAGGTTTTCTTCAGCAAAGCTCGGAAATGCCTTGGACTGGATTTGATGCATACTTCTGGAGATGTAGGCGAAACACCTGGGAGTGGTAATGACGCCAGTGGGAGTGGGACTGACACAGAAGATCACTGTATTGTTGAAATCTGTGGAGCCCATGGTAGTGATGAATTTGTCTCCAAGTCAGTCAACGGTGTATCAACATCTGTTAACATAAATCATGAAGAATCTGTTTCTGCTGTGACTGTCAACATGCGGACCAGTAGTGAATTTGAGGAAAATACAGTATTGCAACAGTCGGATGAGAAATGTGCTGAGGCTGTTGGAAACTTGATTTCTGAGATATCGAAGGAAGAGGATTTGCCTAGTCCAGATTCTCATTCTGCCTACAATCTCACAAATGCAGCT
GCTTCTTTGAGCCAGCCCGTGCATGACCACAAAATTGAAGGCTCTTCTGAAAATACCGAAGGTGGAAGCAAGTGCTGTAATGAACCTGACATTCTGAGATCTGAATCGGTCTCCACTGTTGATGAAAATTCAGCTGCTGTGAGCGAGAGCAGAGCTACAGCGAAGCTTGCATTTGGAGGAGAAGAAGAAGGAAGGAACACTAATTTACATGTTCAGAGTATATTGCAGTGCTCTGTTCAGAATTCAACTGGGTTTGATTCCAAACTTGCTTTAGAGGGCAGCTCCTTAGGACTTGATCCACAAATCTTGCATCCAACCGTTCTTAAAGTGGAACATGTAGAGAAGTCTTGTGTTGAGTCTGAGAACTCTCTTGCTGTCGGGAATTCTGAACCTGGTGTCATTGGAAGGGAACAGATGCTTAACCAATATATGTTGTCATCAACAGCTGTCTTGCAGGAGGTTAGTGATGCGCATCAGAAGCCTATGAATAGAGATGACTATGCTGAGCATCAAAATAATTTGTCGCACGATAGTGAATCCAAGTTTCCAAGAAGCTATCCTTTCAACAAACAAATCTTTGAGGACATCAATAGAAATATCAATCGCACATATTTTCCTGTTGTTCAAGGGCTGTCAAAGCCAGACATCAATTGTAGCAGTTCATATGTTTCTGAGGGCCACTATCTTCAGAATTGTAACAGTTCCAAGCCGCACAACCCGGCGGAGCTTCCTTTTTTGCCTCAGAATGTAGACTTGGGTCATGATCGTCAGAAGAAAGCTTTATGCAGTGGCAGTGCTTCAGATTCTGATGTTCCACGCAGGAAAGGTGATGTGAAACTGTTTGGTCAGATATTAAGTCATGCCCCTTCCAAGCAAAATTCGAGTTCTGGTTCGAACGAGGGTGGAGAGGAGAAGGGACTTCACAAATCCAGCAGCAAATCATGCGACATCGGAGAAAATGTTCCGTTAAGGAGTTACGGTTTTTGGGATGGAAGCAGAATACAGACGGGTTTGTCTGCTTTGCCGGATTCTGCCATTTTACAAGCCAAGTATCCTGCTGCATTCAGTGGCTACTCTGCTACGTCTGTTAAAACTGAACAGCAGCCATTGCAGGCACTCACAAATAATGGTGACCGAAGTCTTAATGGACTAGTGTCCGCTTTTCCAACCAAGGATGGAGTTGTAGATTATCATTCGTATAGGAGTCGAGATGGAGTTAAGTTGCGACCTTTCCCAGTTGATATATTTTCTGAGATGCAAAGAAGAAATGGCTTCGATGCTGTGTCCTTGTCAAGTTTACAGCAGCAGGGAAGGGTGCTAGTTGGAATGAATGTTGTTGGAAGGGGAGGGATTCTCATGGGTGGTTCTTGTACTGGTGTTTCAGATCCTGTAGCAGCCATTAAAATGCACTATTCCAAGGCCGAGCAATACGTTGGGCAACCTGGTAGTACATTCACTAGAGAAGATGGGAGTTGGAGAGGAGGTAATGGTGGAGATTTAGGCAGCAGGTAGTAGATACGCATTGGGGGCCTATGCCTGGCCGGCCAGGGAGGCCCTCGCCTGTATCATAATTAGTCTGTTTCAAGTTTTCTTAAAAAAGGAAGTGTAGGGTAGGAGTAATTTGAACCAATGGGTTCTGAAAAATCCATCTTTTTTGGTTGAAGAAAAAGGGAAGAGATTTTAGGGGATTGTAATATTGTAGCAGTCTTTTGTATTTGTTGTATTTGATTGCAACACAGAAATAATCTTCTGAAAATGGGAGCGGGCATAGTGTTACACCTCAGCTCCTCTTACAGGTCGATCCTTCAACTTTGATAACCTAACCTACCTGACTCGTGACATTATCCAGTGGAGATAAAGAGAAAGAAGACATATGAATGAATAAGAAATGGGGAAGAAAGGGAGTTGGTTTTCTGCGGTGAAGAGGGTTCTCAGTCAGCCTTCTGAGAAGAAAGACAAGAAACCAGACAAATCTAAGAAAAAATGGTTTCAAAAGGAGG
AGAGTGTGGATGTGATTTCCATTTTGGAACAATCTCCATTGGACGTTCCTGCACAACCTCCAATAGAAGATGATGTCAAACAAACCGAACCGGAGAGTGAACCAAGCGAGCTTGCGCATTTGGAGGCTGCGGAGTCGGCTGTTGCTGAAGCTCAGCTGGCTGTGGTGGTTGAATATCCACCTAGTCCTATCTCCTGTCGGCCAGAAATGTCGGAGGAAACAGCAGCTAGTGTGATTCAAACTGCATTTCGTGGATATACGGCAAGGAGAGCATTGAGAGCTCTCAAAGCGTTGATGAGATTGAAGACATTGGTACAAGGGCAATCTGTCAAACGGCAAGTGGCCTCTACTTTAAAATGCATGCAAACCCTTACTCATTTACAGTCAGAGATTCGTGCAAGAAGAATAAGAATGTCGGAAGAAAATCAGGCTCTTCAGCGACAACTTCGTAACAAACGTGAGAAAGATCTCGAGAAGTTAAAATTTGCTATGGATGATGACTGGAATGATAGCACCCAATCTAAAGCACAAATTGAAGCTAAACTATTGAACAAGCATGAAGCTGCAATAAGAAGAGAAAGGGCTCTGGCTTATGCTTACTCTCATCAGCAAACATGGAAGAACTCTTCAAAAACTGCAACCCCCACTGTCATGGATCCAAACAATCCCCACTGGGGTTGGAGTTGGTTGGAGAGGTGGATGGCAGCTCGTCCATGGGAATCTCAAAGCACCACTGATCAGCCGGATCACATTTCTGTCACGAGTGTTGCAACTCGTGCATCTGTGGTCGACATTCTCCAAATTTATGCTCGGCGTGATCAGAACCCTTCTACCAAGCTTTCTCCAAAGTCTCCTACAAGCCAAAAGTCAAGCCAATTGCCTAAGTATCATTCACCTTCAACTCCCAAGGCACTATTTTCATCATCTGGCAGAAAGAAAACAAATGCAGCCAACTCAAGGATAGGCAGCTGGGGCAGAGACGAAGACATAAGGACCATCACTAGTGACAAATCAAAGCTCTCCCGGAGGCATACTATTGCAGGATCATCATTCAGGGATGATGAAAGCCTTGCCAGTTTACCTTCAGTTTCAAGTTACACGACGCCATCGAAGGCTGTGAAGAATCGGTCCCGGTTGGCAAGTTCATCTAGAACAGAGAAGAAAGGAACAATGGAAAAAGGATCTGCACGTGCAGGTTCTGCAAAAAAACAACTTTCCTTTTCAGCTTTTCCTGTTAAGCCAAGGAGGCAATCTAGTCCTCCAATTGTGAATACTAGCTAAAGTACATTAAAAGATAAGAAAGTTTGCATCAGAGAGAATGATAGTATCATGGATGAAACAAATTGATCGATCATTGAGTTCGTGGATTTGAGGATCCGTACTTCCATTATCTGCTTTCTTCAACAGCTTCTACTTACCACGTGTGGATTAGGCTACACCATTCCTCGCTGGCCTTGCGGTAGCTGCTGCAGCTCTAGCTGGTCGATATGGAATTCGAGCTTGGCAAGCATTCAAGACACGGCCACCACAAGCCAGATCACGTAAATTCTATGAAGGTGGTTTCTATCCTACGATGACAAGGAGGGAAGCAGCTCTTATTCTTGGTATTAGAGAGAATGCAACTCCAGATAAGATTAAGGAAGCACATAGAAGGGTCATGATCGCGAACCATCCAGATGCTGGTGGCAGTCATTATCTTGCTTCTAAGATCAATGAAGCCAAGGATGTGCTACTCGGAAAATCGAAAAGCAGTGGATCAGCATTTTAATGAGGTTTGCTCCTTGAATTTTAGGATCCTTGCAAAAAAGTGGAAAAAACTTTCGTTTCCATATGAACTTCAAAGTTTAAACAATCTGCCAAACGCTATGCTCACCAATCGATTTTGTTTCCTTTAGTCTGAGAAACAAGATACAATTGTTAGACTTGAAGTTGACCTTCATGCTGAAAGTTCCTTTGAACATAAAAATAAGTATGATTGTGAAGTGCATCCTATAATCGGAT
TCATGGAACAATGTTGTCCTTTCATAAGTATCATTCACTTATCTAACCTCTACCAGGACTAGTCATCTGAATGCATTTTCTCTAGACATTTATCATTTATCTATTTTAAGTCTCCAGTGATATTGAATCTGTACAGCAACCTTTGTCGAGTTT

Coding sequence (CDS)

ATGCCGCCAGAACCTTTGCCGTGGGACAGGAAAGACCTCTTCAAGGAGAGGAAACACGAGAAGTCGGAGGCCATAGGGTCTGCGGCCAGATGGCGGGACTCTCATCATGGATCTCGCGAGTTCAATCGGTGGGGTTCTGCTGACTTTCGAAGGCCTACTGGTCATGGTAAGCAGGGTAGTTGGCACCAGTTTTCTGAAGAATCTAGTCACGGTTATGGGCCTTCTCGGTCATTCAGTGACAGGGTGTTAGAAGATGAGAGCTTCCGGCCGTCAGTTCCTCGTGGAGATGGAAGATATATTAGAATCGGTAGAGAAATTAGAGGTTCTTTTAGTCATAGAGACTGGAGAAGTCACTCCAGGGAGACCAACAATGGATTTGGGAACCCATCGCGAAGGCCATCATCGGCATCGCAGGATGTGAGTTCTGATCAGAGGTCAGTAGATGATACGGTGACATATTCCTCTCCTCAATCTGTTCATGGGTTAGAAAATGGCCCGAGGGCCGATGTGGAAGTTTCCCTTGGCTCCACTGATTGGAAGCCACTTAAGTGGTCCCGATCTGGGAGTTTGTCTTCCCGGGGATCTGCTTACAGCAGTTCGACAAACTCGAAGAATGAAAAGGCTGATTTACCTCTTAGAGTTGCATCTCCTATAGAAAGCCCTTCTGCCGAAACTACTGCCTGTGTGACATCTTCTCTGCCTTCTGAAGATGCAATTTCTAGGAAGAAGCCAAGGCTTGGATGGGGTGATGGATTAGCCAAATACGAGAAAGAAAAAGTTGAGGTTCCTGATGGAAGCCTGAAAAAAGAAGTGGCTCTTCTTTCAAGCGCCAGTGCTGAATTAACTCATTCCCTTGGTTCAAACTTTGCTGAGAAAAGTCCCAAAACTTTGCCCTTTTCAGATTGTGCATCTCCTGCAACTCCATCCTCTTTTGCCTGCAGTTCATCATCAGGCTTGGAGGATAAACCATTTAGTAAGGGAGCAAGTCTGGATGGCATGATATGTAGTTCACCTGGGTCCAGTTCTCAAAATCTTCAAAAATTATTGTCTAGTATAGAAATGATGGAGATCTGTTCAATTGCTAATTTAGGATCGTCACTTGTCGAACTGTTTCATTCTGATGATCCGAGTACAGTAGAATCATGTTTTGGGAAGTCTACATTGAATAAGCTGCTAGCATATAAAGGTGAAATTTCAAAGAAATTGGAGACTACAGAGTCTGAAATTGATTCTCTTGAAAATGAACTTAAATCTTTGAAATCTGGAAATGGAGGCAATGTTTCTCATAAAAAATCTTGCAGTGTCACACATTTGGTGGAGAATGTGACATATTTCAAAGAACAAGATGGTGTCTCTTGTGTTGCCCCTCGTCCTGCTCCCTTGGTGATTGTTTCGTCTTCTGATGCAACAGTTGAGAAGATGCCAGTCTGCAAGGGTGACATGGGAGTTGAAGATGTTGATACAAAGGCTGATGAAATCGATAGTCCTGGAACTGTGACATCAAAATTTAACGAACCATCCCGAGTGGTAAAGACTCTTGCTTCTGATCTTGTGGTAAATGGTCATTGCTCTGAAGTTACAGATGTAATTGTCCCTGACAAGATGGAAGGGAATTTTCCTGTATCTAGGTCGTTTGTGGACGAACATAAAACAATTGGCTCTGACAATGAATGCATTCTTGCTAAGAGTTGTACCAAGGAATCTATTTATGGTGATTTGATGGCCCAAGCTGGCAGTAGATCATCTCTTTGTGATCATATTTTTGTGTGTAATAAAGAATATGCAAGTAGAGCTGCAGAAGTAATTTTTAAGAAATTACCAGTGGAAATGTGCAAGATCAGCAGTAAAAGCACCAAAATTGTGTCCTGCTCAGAGATTGAGAAACTTGTTAAAGAGAAATTTCTAATGAGGAGGCAGTTCTTAAAATTTAAGGAGAGTGCATTAACCCTCAGATTTAAAGCCTTGCAACAATCATGGAAAGAAGGTTTGCTGCATTCTGT
GAAGAAATGTCGCTCAAGGCCACAAAAAAAGGAGTTGAGTCTAAGGGTGTCACATTCTGGCCATCAGAAGTACAGGTCTTCAACTCGCTCCCGTTTGGTTCAGCAAGGAGCATGTCAGAACCCTACCCTTAACACAGAAATTGCTGTTCGTTACTCCAGTAAGCTGCTGTTGAATCCTCAAATTAAGCTTTACAGGAATAGTTTAAAGATGCCAGCTATGATTTTGGACAAGAAGGAAAAGATGGCATTAAGGTTCATCTCTCATAATGGGTTGGTTGAAGATCCCTGTGCTGTTGAGAAGGAAAGGAACATGATAAACCCTTGGACTTCAGCCGAGAGAGAGATATTCTGGGAGAAACTATCCTTGTTTGGAAAGGATTTTAAGAAAATTTCTTCATTTCTCGACCTCAAAACCACAGCTGACTGTATCCAGTTCTATTACAAGAACCACAAATCTGATAGTTTTAAGAAGAATAAAAATTTGGAGTTGGGCAAGCAAGTGAAATCTTCTGCCATCACATACTTGGTTACATCAGGGAAAAAATGGAATCCAGACATGAATGCTACTTCCCTCGATATCTTAGGTGTTGCTTCAATAATGGCAGCACAAGCAGACTACGATATTGGAAACCAGCAAAAATGTACTCGCCATTTGGGTATGGGAAGGGATGTTGAGTCAAAAGTATCATTTAGTGCTAGCACTCCTTCAAATAAAAACAATTTGGATGCTCTTCAGACTGAAAAAGAAACGGTTGCTGCTGATGTGCTTGCTGGTATATGTGGTTCAATATCTTCAGAGGCCCTGAGTTCTTGCATTACAAGTGCTATTGATCCCAGTGAGGACCACTGGGAGCGGAAGTGTTATAAAGTGGATTCTGCAGTGAAATTGCCTTCGTTGTCTGACGTCATACAGAAAACTGATAATGAGGAACCTTGTTCAGATGATAGTTCTGAGGATGTAGATTCTTCAAATTGGACAGATGAGGAGAAGTCGATATTCATGCAGGCTGTGTCGTCCTATGGTAAGGATTTTGATATGATCTCTAGATGTATCAGGTCAAAGTCTAGGGACCAGTGCAAGGTTTTCTTCAGCAAAGCTCGGAAATGCCTTGGACTGGATTTGATGCATACTTCTGGAGATGTAGGCGAAACACCTGGGAGTGGTAATGACGCCAGTGGGAGTGGGACTGACACAGAAGATCACTGTATTGTTGAAATCTGTGGAGCCCATGGTAGTGATGAATTTGTCTCCAAGTCAGTCAACGGTGTATCAACATCTGTTAACATAAATCATGAAGAATCTGTTTCTGCTGTGACTGTCAACATGCGGACCAGTAGTGAATTTGAGGAAAATACAGTATTGCAACAGTCGGATGAGAAATGTGCTGAGGCTGTTGGAAACTTGATTTCTGAGATATCGAAGGAAGAGGATTTGCCTAGTCCAGATTCTCATTCTGCCTACAATCTCACAAATGCAGCTGCTTCTTTGAGCCAGCCCGTGCATGACCACAAAATTGAAGGCTCTTCTGAAAATACCGAAGGTGGAAGCAAGTGCTGTAATGAACCTGACATTCTGAGATCTGAATCGGTCTCCACTGTTGATGAAAATTCAGCTGCTGTGAGCGAGAGCAGAGCTACAGCGAAGCTTGCATTTGGAGGAGAAGAAGAAGGAAGGAACACTAATTTACATGTTCAGAGTATATTGCAGTGCTCTGTTCAGAATTCAACTGGGTTTGATTCCAAACTTGCTTTAGAGGGCAGCTCCTTAGGACTTGATCCACAAATCTTGCATCCAACCGTTCTTAAAGTGGAACATGTAGAGAAGTCTTGTGTTGAGTCTGAGAACTCTCTTGCTGTCGGGAATTCTGAACCTGGTGTCATTGGAAGGGAACAGATGCTTAACCAATATATGTTGTCATCAACAGCTGTCTTGCAGGAGGTTAGTGATGCGCATCAGAAGCCTATGAATAGAGATGACTATGCTGAGCATCAAAATAATT
TGTCGCACGATAGTGAATCCAAGTTTCCAAGAAGCTATCCTTTCAACAAACAAATCTTTGAGGACATCAATAGAAATATCAATCGCACATATTTTCCTGTTGTTCAAGGGCTGTCAAAGCCAGACATCAATTGTAGCAGTTCATATGTTTCTGAGGGCCACTATCTTCAGAATTGTAACAGTTCCAAGCCGCACAACCCGGCGGAGCTTCCTTTTTTGCCTCAGAATGTAGACTTGGGTCATGATCGTCAGAAGAAAGCTTTATGCAGTGGCAGTGCTTCAGATTCTGATGTTCCACGCAGGAAAGGTGATGTGAAACTGTTTGGTCAGATATTAAGTCATGCCCCTTCCAAGCAAAATTCGAGTTCTGGTTCGAACGAGGGTGGAGAGGAGAAGGGACTTCACAAATCCAGCAGCAAATCATGCGACATCGGAGAAAATGTTCCGTTAAGGAGTTACGGTTTTTGGGATGGAAGCAGAATACAGACGGGTTTGTCTGCTTTGCCGGATTCTGCCATTTTACAAGCCAAGTATCCTGCTGCATTCAGTGGCTACTCTGCTACGTCTGTTAAAACTGAACAGCAGCCATTGCAGGCACTCACAAATAATGGTGACCGAAGTCTTAATGGACTAGTGTCCGCTTTTCCAACCAAGGATGGAGTTGTAGATTATCATTCGTATAGGAGTCGAGATGGAGTTAAGTTGCGACCTTTCCCAGTTGATATATTTTCTGAGATGCAAAGAAGAAATGGCTTCGATGCTGTGTCCTTGTCAAGTTTACAGCAGCAGGGAAGGGTGCTAGTTGGAATGAATGTTGTTGGAAGGGGAGGGATTCTCATGGGTGGTTCTTGTACTGGTGTTTCAGATCCTGTAGCAGCCATTAAAATGCACTATTCCAAGGCCGAGCAATACGTTGGGCAACCTGGTAGTACATTCACTAGAGAAGATGGGAGTTGGAGAGGAGGTAATGGTGGAGATTTAGGCAGCAGGTAG

Protein sequence

MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGSWHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSRETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQSVHGLENGPRADVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAETTACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGSSSQNLQKLLSSIEMMEICSIANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEISKKLETTESEIDSLENELKSLKSGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSSDATVEKMPVCKGDMGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVTDVIVPDKMEGNFPVSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCDHIFVCNKEYASRAAEVIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKESALTLRFKALQQSWKEGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRLVQQGACQNPTLNTEIAVRYSSKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAITYLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDSAVKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIRSKSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDASGSGTDTEDHCIVEICGAHGSDEFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENTVLQQSDEKCAEAVGNLISEISKEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVSTVDENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSVQNSTGFDSKLALEGSSLGLDPQILHPTVLKVEHVEKSCVESENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVSDAHQKPMNRDDYAEHQNNLSHDSESKFPRSYPFNKQIFEDINRNINRTYFPVVQGLSKPDINCSSSYVSEGHYLQNCNSSKPHNPAELPFLPQNVDLGHDRQKKALCSGSASDSDVPRRKGDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSCDIGENVPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTNNGDRSLNGLVSAFPTKDGVVDYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGGNGGDLGSR
Homology
BLAST of CcUC05G103030 vs. NCBI nr
Match: XP_038892245.1 (uncharacterized protein LOC120081444 [Benincasa hispida])

HSP 1 Score: 2861.6 bits (7417), Expect = 0.0e+00
Identity = 1494/1674 (89.25%), Positives = 1556/1674 (92.95%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS+HGSREFNRWGSAD RRPTGHGKQG 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSEESSHGYGPSRSFSDRVLEDESFRPSV RGDG+YIRIGRE RGSFSHRDWR HSR
Sbjct: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVARGDGKYIRIGRESRGSFSHRDWRGHSR 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQSVHGLENGPRADVEVSLGSTDWK 180
            ETNNGFGN SRR S  SQDVSSDQRSVDDTVTYSSPQSVHGLENGPR+DVEV LGSTDWK
Sbjct: 121  ETNNGFGNSSRRLS--SQDVSSDQRSVDDTVTYSSPQSVHGLENGPRSDVEVPLGSTDWK 180

Query: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAETTACVTSSLPSEDAIS 240
            PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAE TACVTSSLPSEDAIS
Sbjct: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAIS 240

Query: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEKSPKTLPFS 300
            RKKPRLGWGDGLAKYEKEKVEVPDGSL+KEVAL+SS SAELTHSLGSNFAEKSPKTLPFS
Sbjct: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVALISSGSAELTHSLGSNFAEKSPKTLPFS 300

Query: 301  DCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGSSSQNLQKLLSSIEMMEICSI 360
            DCASPATPSSFACSSSSGLEDKPFSKGAS DGM+CSSPGS SQNLQKLL SIE MEI SI
Sbjct: 301  DCASPATPSSFACSSSSGLEDKPFSKGASADGMMCSSPGSGSQNLQKLLCSIEKMEISSI 360

Query: 361  ANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEISKKLETTESEIDSLENELKSLK 420
            ANLGSSLVELFHS DPSTVESCFGKSTLNKLLAYKGEISK LETTESEIDSLENELKSLK
Sbjct: 361  ANLGSSLVELFHS-DPSTVESCFGKSTLNKLLAYKGEISKTLETTESEIDSLENELKSLK 420

Query: 421  SGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSSDATVEKMPVCKGD 480
            SGNGGNVS KK  S THLVE+ TYFKEQDGVSC+ PRPAPLVIVSSSDATVEKMPVCKGD
Sbjct: 421  SGNGGNVSPKKYSSATHLVESGTYFKEQDGVSCIVPRPAPLVIVSSSDATVEKMPVCKGD 480

Query: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVTDVIVPDKMEGNFP 540
            MGVEDVDTK DEIDSPGTVTSKFNEPS+VVK +ASDLV N HC EVT+ IVPDKMEGNFP
Sbjct: 481  MGVEDVDTKVDEIDSPGTVTSKFNEPSQVVKAVASDLVENSHCYEVTNAIVPDKMEGNFP 540

Query: 541  VSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCDHIFVCNKEYASRAAE 600
            +S   VDEHKTIG  NECILAKSCT ES+YGDLMAQA SRSSLCD IF CNKE AS+AAE
Sbjct: 541  ISGPSVDEHKTIGFGNECILAKSCTSESMYGDLMAQADSRSSLCDFIFACNKECASKAAE 600

Query: 601  VIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKESALTLRFKALQQSWK 660
            VIFKKLP EMCKISSKSTKI+SCSE EKL+KEKF MRR+F KFKESALTLRFKALQQSWK
Sbjct: 601  VIFKKLPAEMCKISSKSTKILSCSETEKLIKEKFAMRRRFFKFKESALTLRFKALQQSWK 660

Query: 661  EGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRLVQQGACQNPTLNTEIAVRYSS 720
            E LLHSVKKCRSRPQKKELSLRV+HSGHQKYRSS RS  VQQGACQN +L+TEIAVR+SS
Sbjct: 661  ESLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSIRSLSVQQGACQNSSLHTEIAVRHSS 720

Query: 721  KLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAER 780
            KLLLNPQIKLYR++LKMPAMILDKK+K ALRFIS+NGLVEDPCAVEKERNMINPWTSAER
Sbjct: 721  KLLLNPQIKLYRSTLKMPAMILDKKDKKALRFISNNGLVEDPCAVEKERNMINPWTSAER 780

Query: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAIT 840
            EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAIT
Sbjct: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAIT 840

Query: 841  YLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSAS 900
            YLVTSGKKWNPD+NATSLDILGVAS+MAAQADYDIGNQQKCTRHLGMG +VESKVS+SAS
Sbjct: 841  YLVTSGKKWNPDVNATSLDILGVASVMAAQADYDIGNQQKCTRHLGMGGEVESKVSWSAS 900

Query: 901  TPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDSA 960
            TPSNKN+LDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDH E KC KVDSA
Sbjct: 901  TPSNKNSLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHMEWKCNKVDSA 960

Query: 961  VKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIRS 1020
             KLPS SDVIQKTDN EPCSDDSSEDVDSSNWTDEEK IFMQAVSSYGKDFD ISRCIRS
Sbjct: 961  AKLPSSSDVIQKTDN-EPCSDDSSEDVDSSNWTDEEKLIFMQAVSSYGKDFDSISRCIRS 1020

Query: 1021 KSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDASGSGTDTEDHCIVEICGAHGSD 1080
            KSRDQCKVFFSKARKCLGLD MHTSGDVGETPGSGND SGSGTD+EDHC+VEICGA GSD
Sbjct: 1021 KSRDQCKVFFSKARKCLGLDSMHTSGDVGETPGSGNDGSGSGTDSEDHCVVEICGARGSD 1080

Query: 1081 EFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENTVLQQSDEKCAEAVGNLISEIS 1140
            EFVS+S+NGV+TSVN+NHEESVSAVTVNMRTSSEFEE+T LQQSDE C +AV NLISE S
Sbjct: 1081 EFVSESINGVATSVNVNHEESVSAVTVNMRTSSEFEESTELQQSDENC-QAVRNLISETS 1140

Query: 1141 KEEDLPSPDSHSAYNLTN-AAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVST 1200
            KEED+PS D+ SAYNLTN AAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVST
Sbjct: 1141 KEEDVPSLDTRSAYNLTNAAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVST 1200

Query: 1201 VDENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSVQNSTGFDSKLALEGSSLGLD 1260
            +DENSAAVSESRA AKL FGGEEEG NTNLH QSILQ SVQ+STGFDS LA EGSS+G D
Sbjct: 1201 LDENSAAVSESRAIAKLVFGGEEEGSNTNLHGQSILQGSVQDSTGFDSSLAPEGSSVGPD 1260

Query: 1261 PQILHPTVLKVEHVE-KSCVES-ENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVSDA 1320
            PQILHP +LKVE  E KSC+ES ENSLAV NS+PGVI RE++LNQ +LSS  VLQEVSDA
Sbjct: 1261 PQILHPNILKVEPAEKKSCIESEENSLAVKNSDPGVIRREEVLNQDILSSPLVLQEVSDA 1320

Query: 1321 HQKPMNRDDYAEHQNNLS-HDSESKFPRSYPFNKQIFEDINRNINRTYFPVVQGLSKPDI 1380
            HQK MN+DD+AEHQNNLS H   SKFPRSYPFNKQ FE +N+NIN TYFPVVQGLSKPDI
Sbjct: 1321 HQKAMNKDDHAEHQNNLSRHSESSKFPRSYPFNKQNFEGMNQNINHTYFPVVQGLSKPDI 1380

Query: 1381 NCSSSYVSEGHYLQNCNSSKPHNPAELPFLPQNVDLGHDRQKKA-------LCSGSASDS 1440
            NC+S+YV+EGHYLQNCNSSKPHNPAELPFLPQN+  GH  QK A        CSGSASDS
Sbjct: 1381 NCNSTYVAEGHYLQNCNSSKPHNPAELPFLPQNIKFGHGHQKNASCSGSASACSGSASDS 1440

Query: 1441 DVPRRKGDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSCDIGENVPLRSYGFW 1500
            DVPRRKGDVKLFGQILSHAPS+QNSSSGS+E GEEKGLHKSSSKSCDIGENVPLRSYGFW
Sbjct: 1441 DVPRRKGDVKLFGQILSHAPSQQNSSSGSSECGEEKGLHKSSSKSCDIGENVPLRSYGFW 1500

Query: 1501 DGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTNNGDRSLNGLVSAFP 1560
            DGSRIQ GLSALPDSAILQAKYPAAFSGYS+TSVK EQQPLQAL NNGDRSLNGL SAFP
Sbjct: 1501 DGSRIQMGLSALPDSAILQAKYPAAFSGYSSTSVKNEQQPLQALANNGDRSLNGLGSAFP 1560

Query: 1561 TKDGVVDYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQQGRVLVGMNVVGRG 1620
             KDGVVDYHSYRSRDGVK+RPFPVDIFSEM RRN FDAVSLSSLQQQGRVLVGMNVVGRG
Sbjct: 1561 AKDGVVDYHSYRSRDGVKMRPFPVDIFSEMHRRNSFDAVSLSSLQQQGRVLVGMNVVGRG 1620

Query: 1621 GILMGGSCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGGNGGDLGSR 1664
            GILMGGSCTGVSDPVAAIKMHY+KAEQYVGQPGSTFTREDGSWRGGNGGDLGSR
Sbjct: 1621 GILMGGSCTGVSDPVAAIKMHYAKAEQYVGQPGSTFTREDGSWRGGNGGDLGSR 1669

BLAST of CcUC05G103030 vs. NCBI nr
Match: KAA0034735.1 (Myb_DNA-binding domain-containing protein [Cucumis melo var. makuwa] >TYK09288.1 Myb_DNA-binding domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 2786.1 bits (7221), Expect = 0.0e+00
Identity = 1448/1668 (86.81%), Positives = 1532/1668 (91.85%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS+HGSREFNRWGSAD RRPTGHGKQG 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSE+SSHGYGPSRSFSDRV+EDESFRPSVPRGDG+YIRIGRE RGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQSVHGLENGPRADVEVSLGSTDWK 180
            +TNNGFGNPSRRPS  SQDVSSDQRSVDDTVTYSSPQS HGLENGPR+DVEVSLGSTDWK
Sbjct: 121  DTNNGFGNPSRRPS--SQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWK 180

Query: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAETTACVTSSLPSEDAIS 240
            PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAE TACVTSSLPSED IS
Sbjct: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDTIS 240

Query: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEKSPKTLPFS 300
            RKKPRLGWGDGLAKYEKEKV+VPDGSL+KEVALLSS S ELTHSLGSNFAEKSPKTLPFS
Sbjct: 241  RKKPRLGWGDGLAKYEKEKVDVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFS 300

Query: 301  DCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGSSSQNLQKLLSSIEMMEICSI 360
            DCASPATPSSFACSSSSGLEDKPFSKGAS DGMICSSPGS SQNLQKLL SIE MEI SI
Sbjct: 301  DCASPATPSSFACSSSSGLEDKPFSKGASADGMICSSPGSGSQNLQKLLCSIEKMEISSI 360

Query: 361  ANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEISKKLETTESEIDSLENELKSLK 420
            ANLGSSLVELFHSDDP+T+ESCFGKSTLNKLLAYKGEISK LE TESEIDSLENELKSLK
Sbjct: 361  ANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLK 420

Query: 421  SGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSSDATVEKMPVCKGD 480
            SGNGGNVS+KKSCS T LVE+ TYFKEQDG+SC+APRPAPLV+VSSSDATVEK+P+CKGD
Sbjct: 421  SGNGGNVSNKKSCSATRLVESSTYFKEQDGISCIAPRPAPLVVVSSSDATVEKVPLCKGD 480

Query: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVTDVIVPDKMEGNFP 540
            MGVEDVDTKADEIDSPGTVTSKFNEPSRVVK   SD+V NGHCS VTD+IVP KMEGNFP
Sbjct: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKENTSDIVDNGHCSVVTDMIVPGKMEGNFP 540

Query: 541  VSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCDHIFVCNKEYASRAAE 600
            +S  FVDE KT GS NECILAKSC+ ES  GDLMAQAGSRSSLCD IF CNKEYASRAAE
Sbjct: 541  ISEPFVDERKTTGSGNECILAKSCSSESFNGDLMAQAGSRSSLCDSIFACNKEYASRAAE 600

Query: 601  VIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKESALTLRFKALQQSWK 660
            VIFK+ PV +CKISSKSTK VSCSE EKL+KEKF+ R++FLKFKESALTLRFKALQQSWK
Sbjct: 601  VIFKRSPVGVCKISSKSTKYVSCSETEKLIKEKFVSRKKFLKFKESALTLRFKALQQSWK 660

Query: 661  EGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRLVQQGACQNPTLNTEIAVRYSS 720
            E LLHSVKKCRSRPQKKELSLRV+HSGHQKYRSS RSRL+QQGACQ+ T NTEIAVR+SS
Sbjct: 661  ECLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSFRSRLIQQGACQSTTFNTEIAVRHSS 720

Query: 721  KLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAER 780
            KLLLNPQIKLYRN+LKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERN+INPWTSAE+
Sbjct: 721  KLLLNPQIKLYRNTLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNLINPWTSAEK 780

Query: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAIT 840
            EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQ+KSSAIT
Sbjct: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAIT 840

Query: 841  YLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSAS 900
            YLVTSGKKWNPD NATSLDILGVAS+MAAQA+YDIGNQQKC+RHLG G+DVESKVS+SAS
Sbjct: 841  YLVTSGKKWNPDANATSLDILGVASVMAAQAEYDIGNQQKCSRHLGTGKDVESKVSWSAS 900

Query: 901  TPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDSA 960
            TP NK+NLD LQTEKETVAADVLAGI GSISSEALSSCITSAIDP E+  E+KCYKVDSA
Sbjct: 901  TP-NKSNLDDLQTEKETVAADVLAGISGSISSEALSSCITSAIDPREELREQKCYKVDSA 960

Query: 961  VKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIRS 1020
             KLPSLSDV+QKTDN EPCSDDSSEDVDSSNWTDEEK IF+QAVSSYGKDFDMISRCIRS
Sbjct: 961  AKLPSLSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKVIFLQAVSSYGKDFDMISRCIRS 1020

Query: 1021 KSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDASGSGTDTEDHCIVEICGAHGSD 1080
            KSRDQCK+FFSKARKCLGLDLMHTSGDVGETPG+GND SGSGTDTEDHC+VEICG  GSD
Sbjct: 1021 KSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNGNDISGSGTDTEDHCVVEICGGRGSD 1080

Query: 1081 EFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENTVLQQSDEKCAEAVGNLISEIS 1140
            E +SKS+NGVSTSVNINHEESVSA TVNMRTS EFE +T LQQ DEK AEAVGN+I E  
Sbjct: 1081 ESISKSINGVSTSVNINHEESVSAATVNMRTSMEFEGSTALQQLDEKGAEAVGNMIFETL 1140

Query: 1141 KEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVSTV 1200
            KEED+P+P               SQP+HD KIEGSSENTEGG K CNEPDILRSESVSTV
Sbjct: 1141 KEEDVPNP---------------SQPMHDQKIEGSSENTEGG-KSCNEPDILRSESVSTV 1200

Query: 1201 DENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSVQNSTGFDSKLALEGSSLGLDP 1260
            DENSAAVSE RAT KLA G EE G + NLH QS +QCS Q+STG+DS +ALEGSS+GLDP
Sbjct: 1201 DENSAAVSECRATVKLAIGEEEVGSDANLHSQSTMQCSGQDSTGYDSNIALEGSSIGLDP 1260

Query: 1261 QILHPTVLKVEHVE-KSCVES-ENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVSDAH 1320
            QILHP +LKVE VE KSC++S EN LAV NS+ GVIGREQMLNQ + SST VLQ+VSDA 
Sbjct: 1261 QILHPNILKVEPVEKKSCIKSEENFLAVRNSDTGVIGREQMLNQDVSSSTLVLQDVSDAD 1320

Query: 1321 QKPMNR--DDYAEHQNNLSHDSES-KFPRSYPFNKQIFEDINRNINRTYFPVVQGLSKPD 1380
            QKPMNR  DD  EH+NNL  +SES KFPRSYPFNKQIFEDINRNIN TYFPVVQGLSKPD
Sbjct: 1321 QKPMNRDKDDDDEHRNNLLRNSESPKFPRSYPFNKQIFEDINRNINHTYFPVVQGLSKPD 1380

Query: 1381 INCSSSYVSEGHYLQNCNSSKPHNPAELPFLPQNVDLGHDRQKKALCSGSASDSDVPRRK 1440
            INC++ YV EG YLQNCNSSKPHNPAELPFL QN++LGH+ QK A  SGSASDSDVPRRK
Sbjct: 1381 INCNNKYVPEGQYLQNCNSSKPHNPAELPFLSQNIELGHNHQKNASGSGSASDSDVPRRK 1440

Query: 1441 GDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSCDIGENVPLRSYGFWDGSRIQ 1500
            GDVKLFGQILSHAPS+QNSSSGSNE GE+KGLH SSSKSCD+GE+VPLRSYGFWDGSRIQ
Sbjct: 1441 GDVKLFGQILSHAPSQQNSSSGSNECGEKKGLHNSSSKSCDMGEHVPLRSYGFWDGSRIQ 1500

Query: 1501 TGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTNNGDRSLNGLVSAFPTKDGVV 1560
            TGLSALPDSAILQ+KYPAAFSGYS TSVKTEQQ LQAL NN D+SLN +VSAFPTKDGVV
Sbjct: 1501 TGLSALPDSAILQSKYPAAFSGYSGTSVKTEQQTLQALANNSDQSLNEVVSAFPTKDGVV 1560

Query: 1561 DYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG 1620
            DYHSYRSRDGVK+RPFPVDIFSEM RRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG
Sbjct: 1561 DYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG 1620

Query: 1621 SCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGGNGGDLGSR 1664
            SCTGVSDPVAAIKMHY+KA+QY GQPGS FTREDGSWRGG GGDLGSR
Sbjct: 1621 SCTGVSDPVAAIKMHYAKADQYAGQPGSMFTREDGSWRGGKGGDLGSR 1648

BLAST of CcUC05G103030 vs. NCBI nr
Match: XP_004142488.1 (uncharacterized protein LOC101222167 [Cucumis sativus] >KGN52286.1 hypothetical protein Csa_008147 [Cucumis sativus])

HSP 1 Score: 2758.8 bits (7150), Expect = 0.0e+00
Identity = 1446/1671 (86.54%), Positives = 1532/1671 (91.68%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS+HGSREFNRWGSAD RRPTGHGKQG 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSE+SSHGYGPSRSFSDRV+EDESFRPSVPRGDG+YIRIGRE RGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQSVHGLENGPRADVEVSLGSTDWK 180
            + NNGFGNPSRR S  SQDVSSDQRSVDDTVTYSSPQS HGLENGPR+DVEVSLGSTDWK
Sbjct: 121  DANNGFGNPSRRTS--SQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWK 180

Query: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAETTACVTSSLPSEDAIS 240
            PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAE TACVTSSLPSEDAIS
Sbjct: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAIS 240

Query: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEKSPKTLPFS 300
            RKKPRLGWGDGLAKYEKEKVEVPDGSL+KEVALLSS S ELTHSLGSNFAEKSPKTLPFS
Sbjct: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFS 300

Query: 301  DCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGSSSQNLQKLLSSIEMMEICSI 360
            DCASPATPSSFACSSSSGLEDKPFSKGA  DGMICSSPGS SQNLQKLL SIE MEI S+
Sbjct: 301  DCASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSV 360

Query: 361  ANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEISKKLETTESEIDSLENELKSLK 420
            ANLGSSLVELFHSDDP+T+ESCFGKSTLNKLLAYKGEISK LE TESEIDSLENELKSLK
Sbjct: 361  ANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLK 420

Query: 421  SGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSSDATVEKMPVCKGD 480
            S NGGNVSHKKSCS T ++E+ TYFKEQDG+SC+A RPAPLV+VSSSDATVEK+P+CKGD
Sbjct: 421  SVNGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGD 480

Query: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVTDVIVPDKMEGNFP 540
            +GVEDVDTKADEIDSPGTVTSKFNEPSRVVK +ASD+V NGHCS VTD IVP KMEG+FP
Sbjct: 481  VGVEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFP 540

Query: 541  VSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCDHIFVCNKEYASRAAE 600
            +S  FVDEH+TIGS NEC LAKSCT ES+YGDLMAQAGSRSSLCD IF CNKEYASRAAE
Sbjct: 541  ISGPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAE 600

Query: 601  VIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKESALTLRFKALQQSWK 660
            VIFK+ PV MCKISSKSTK VSCSE EKL+KEKF+MR++FLKFKESALTLRFK+LQQSWK
Sbjct: 601  VIFKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWK 660

Query: 661  EGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSST-RSRLVQQGACQNPTLNTEIAVRYS 720
            EGLLHSVKKCRSRPQKKELSLRV+HSGHQKYRSS+ RSRLVQQGACQ+ T NTEIAVR+S
Sbjct: 661  EGLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHS 720

Query: 721  SKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAE 780
            SKLLLNPQIKLYRN+LKMPAMILDKKEK+ALRFISHNGLVEDPCAVEKERN+INPWTSAE
Sbjct: 721  SKLLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAE 780

Query: 781  REIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAI 840
            +EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQ+KSSAI
Sbjct: 781  KEIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAI 840

Query: 841  TYLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSA 900
            TYLVTSGKKWNPD NATSLDILGVAS+MAAQADYDI NQQKCTRHLG+GRDVESKVS+SA
Sbjct: 841  TYLVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSA 900

Query: 901  STPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDS 960
            S+P NK+NLD LQTEKETVAADVLAGI GSISSEALSSCITSAIDP E+  ERKCY+VD 
Sbjct: 901  SSP-NKSNLDDLQTEKETVAADVLAGISGSISSEALSSCITSAIDPREELRERKCYRVDF 960

Query: 961  AVKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIR 1020
            A KLPSLSDV+QKTDN EPCSDDSSEDVDSSNWTDEEK +FMQAVSSYGKDFDMISRCIR
Sbjct: 961  AAKLPSLSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIR 1020

Query: 1021 SKSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDA--SGSGTDTEDHCIVEICGAH 1080
            SKSRDQCK+FFSKARKCLGLDLMHTSGDVGETPG+GNDA  SGSGTDTE+HC+VEIC   
Sbjct: 1021 SKSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNGNDASGSGSGTDTEEHCVVEICEGR 1080

Query: 1081 GSDEFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENTVLQQSDEKCAEAVGNLIS 1140
            GSDEF+SKS+NG STSVNINHEE+VSAVT NMRTS EFEE+T LQQSDEK AEAVGNLI 
Sbjct: 1081 GSDEFISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIF 1140

Query: 1141 EISKEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESV 1200
            E  KEED+P+P               SQP HDHKIEGSSENTE G K CNEPDILRSESV
Sbjct: 1141 ETLKEEDVPNP---------------SQPTHDHKIEGSSENTESG-KSCNEPDILRSESV 1200

Query: 1201 STVDENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSVQNSTGFDSKLALEGSSLG 1260
            STVDENSAAVSE RAT KLA  GEE G +TNLH QS + CS Q+STG DS +ALEGSS+G
Sbjct: 1201 STVDENSAAVSEGRATVKLAI-GEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVG 1260

Query: 1261 LDPQILHPTVLKVEHVE-KSCVES-ENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVS 1320
            LDP ILHP +LKVE VE KSC++S EN L+V NS+ GVIGREQMLNQ +LS T VLQE+S
Sbjct: 1261 LDPHILHPNILKVEPVEKKSCIKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEIS 1320

Query: 1321 DAHQKPMNRDDYAEHQNNLSHDSESK-FPRSYPFNKQIFEDINRNINRTYFPVVQGLSKP 1380
            DA+QKPMNRDD AEH NNL  +SES  FPRSYPFNKQIFEDINRNIN  YF  VQGLSKP
Sbjct: 1321 DANQKPMNRDDDAEHPNNLLCNSESSTFPRSYPFNKQIFEDINRNINHAYFR-VQGLSKP 1380

Query: 1381 DINCSSSYVSEGHYLQNCNSSKPHNPAELPFLPQNVDLGHDRQKKALCSGSASDSDVPRR 1440
            DINC+S YVSEG +LQNCNSSKPHN AE PFL QN++LGHD QK A  SGSASDSDVPRR
Sbjct: 1381 DINCNSKYVSEGQFLQNCNSSKPHNLAEPPFLSQNIELGHDHQKNASGSGSASDSDVPRR 1440

Query: 1441 KGDVKLFGQILSHAPSKQNSSSGSNEGGEEKG-LHKSSSKSCDIGENVPLRSYGFWDGSR 1500
            KGDVKLFGQILSHAPS+QNSSSGSNE GE+KG LH SSSKSCD+GEN+PLRSYGFWDGSR
Sbjct: 1441 KGDVKLFGQILSHAPSQQNSSSGSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSR 1500

Query: 1501 IQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTNNGDRSLNGLVSAFPTKDG 1560
            IQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQAL+NNGD+SLN LVSAFPTKDG
Sbjct: 1501 IQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDG 1560

Query: 1561 VVDYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILM 1620
            VVDYHSYRSRDGVK+RPFPVDIFSEM RRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILM
Sbjct: 1561 VVDYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILM 1620

Query: 1621 GGSCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSW-RGGNGGDLGSR 1664
            GGSCTGVSDPVAAIKMHY+KA+QY GQP S FTREDGSW  GGNGGDLGSR
Sbjct: 1621 GGSCTGVSDPVAAIKMHYAKADQYAGQPASMFTREDGSWGGGGNGGDLGSR 1649

BLAST of CcUC05G103030 vs. NCBI nr
Match: XP_008446909.2 (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103489481 [Cucumis melo])

HSP 1 Score: 2751.9 bits (7132), Expect = 0.0e+00
Identity = 1435/1668 (86.03%), Positives = 1521/1668 (91.19%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS+HGSREFNRWGSAD RRPTGHGKQG 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSE+SSHGYGPSRSFSDRV+EDESFRPSVPRGDG+YIRIGRE RGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQSVHGLENGPRADVEVSLGSTDWK 180
            +TNNGFGNPSRRPS  SQDVSSDQRSVDDTVTYSSPQS HGLENGPR+DVEVSLGSTDWK
Sbjct: 121  DTNNGFGNPSRRPS--SQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWK 180

Query: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAETTACVTSSLPSEDAIS 240
            PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAE TACVTSSLPSED IS
Sbjct: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDTIS 240

Query: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEKSPKTLPFS 300
            RKKPRLGWGDGLAKYEKEKV+VPDGSL+KEVALLSS S ELTHSLGSNFAEKSPKTLPFS
Sbjct: 241  RKKPRLGWGDGLAKYEKEKVDVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFS 300

Query: 301  DCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGSSSQNLQKLLSSIEMMEICSI 360
            DCASPATPSSFACSSSSGLEDKPFSKGAS DGMICSSPGS SQNLQKLL SIE MEI SI
Sbjct: 301  DCASPATPSSFACSSSSGLEDKPFSKGASADGMICSSPGSGSQNLQKLLCSIEKMEISSI 360

Query: 361  ANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEISKKLETTESEIDSLENELKSLK 420
            ANLGSSLVELFHSDDP+T+ESCFGKSTLNKLLAYKGEISK LE TESEIDSLENELKSLK
Sbjct: 361  ANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLK 420

Query: 421  SGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSSDATVEKMPVCKGD 480
            SGNGGNVS+KKSCS T LVE+ TYFKEQDG+SC+APRPAPLV+VSSSDATVEK+P+CKGD
Sbjct: 421  SGNGGNVSNKKSCSATRLVESSTYFKEQDGISCIAPRPAPLVVVSSSDATVEKVPLCKGD 480

Query: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVTDVIVPDKMEGNFP 540
            MGVEDVDTKADEIDSPGTVTSKFNEPSRVVK   SD+V NGHCS VTD+IVP KMEGNFP
Sbjct: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKENTSDIVDNGHCSVVTDMIVPGKMEGNFP 540

Query: 541  VSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCDHIFVCNKEYASRAAE 600
            +S  FVDE KT GS NECILAKSC+ ES  GDLMAQAGSRSSLCD IF CNKEYASRAAE
Sbjct: 541  ISEPFVDERKTTGSGNECILAKSCSSESFNGDLMAQAGSRSSLCDSIFACNKEYASRAAE 600

Query: 601  VIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKESALTLRFKALQQSWK 660
            VIFK+ PV +CKISSKSTK VSCSE EKL+KEKF+ R++FLKFKESALTLRFKALQQSWK
Sbjct: 601  VIFKRSPVGVCKISSKSTKYVSCSETEKLIKEKFVSRKKFLKFKESALTLRFKALQQSWK 660

Query: 661  EGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRLVQQGACQNPTLNTEIAVRYSS 720
                   K+   +  KKELSLRV+HSGHQKYRSS RSRL+QQGACQ+ T NTEIAVR+SS
Sbjct: 661  M-FAAFCKEMSLKATKKELSLRVTHSGHQKYRSSFRSRLIQQGACQSTTFNTEIAVRHSS 720

Query: 721  KLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAER 780
            KLLLNPQIKLYRN+LKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERN+INPWTSAE+
Sbjct: 721  KLLLNPQIKLYRNTLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNLINPWTSAEK 780

Query: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAIT 840
            EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQ+KSSAIT
Sbjct: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAIT 840

Query: 841  YLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSAS 900
            YLVTSGKKWNPD NATSLDILGVAS+MAAQA+YDIGNQQKC+RHLG G+DVESKVS+SAS
Sbjct: 841  YLVTSGKKWNPDANATSLDILGVASVMAAQAEYDIGNQQKCSRHLGTGKDVESKVSWSAS 900

Query: 901  TPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDSA 960
            TP NK+NLD LQTEKETVAADVLAGI GSISSEALSSCITSAIDP E+  E+KCYKVDSA
Sbjct: 901  TP-NKSNLDDLQTEKETVAADVLAGISGSISSEALSSCITSAIDPREELREQKCYKVDSA 960

Query: 961  VKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIRS 1020
             KLPSLSDV+QKTDN EPCSDDSSEDVDSSNWTDEEK IF+QAVSSYGKDFDMISRCIRS
Sbjct: 961  AKLPSLSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKVIFLQAVSSYGKDFDMISRCIRS 1020

Query: 1021 KSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDASGSGTDTEDHCIVEICGAHGSD 1080
            KSRDQCK+FFSKARKCLGLDLMHTSGDVGETPG+GND SGSGTDTEDHC+VEICG  GSD
Sbjct: 1021 KSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNGNDISGSGTDTEDHCVVEICGGRGSD 1080

Query: 1081 EFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENTVLQQSDEKCAEAVGNLISEIS 1140
            E +SKS+NGVSTSVNINHEESVSA TVNMRTS EFE +T LQQ DEK AEAVGN+I E  
Sbjct: 1081 ESISKSINGVSTSVNINHEESVSAATVNMRTSMEFEGSTALQQLDEKGAEAVGNMIFETL 1140

Query: 1141 KEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVSTV 1200
            KEED+P+P               SQP+HD KIEGSSENTEGG K CNEPDILRSESVSTV
Sbjct: 1141 KEEDVPNP---------------SQPMHDQKIEGSSENTEGG-KSCNEPDILRSESVSTV 1200

Query: 1201 DENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSVQNSTGFDSKLALEGSSLGLDP 1260
            DENSAAVSE RAT KLA G EE G + NLH QS +QCS Q+STG+DS +ALEGSS+GLDP
Sbjct: 1201 DENSAAVSECRATVKLAIGEEEVGSDANLHSQSTMQCSGQDSTGYDSNIALEGSSIGLDP 1260

Query: 1261 QILHPTVLKVEHVE-KSCVES-ENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVSDAH 1320
            QILHP +LKVE VE KSC++S EN LAV NS+ GVIGREQMLNQ + SST VLQ+VSDA 
Sbjct: 1261 QILHPNILKVEPVEKKSCIKSEENFLAVRNSDTGVIGREQMLNQDVSSSTLVLQDVSDAD 1320

Query: 1321 QKPMNR--DDYAEHQNNLSHDSES-KFPRSYPFNKQIFEDINRNINRTYFPVVQGLSKPD 1380
            QKPMNR  DD  EH+NNL  +SES KFPRSYPFNKQIFEDINRNIN TYFPVVQGLSKPD
Sbjct: 1321 QKPMNRDKDDDDEHRNNLLRNSESPKFPRSYPFNKQIFEDINRNINHTYFPVVQGLSKPD 1380

Query: 1381 INCSSSYVSEGHYLQNCNSSKPHNPAELPFLPQNVDLGHDRQKKALCSGSASDSDVPRRK 1440
            INC++ YV EG YLQNCNSSKPHNPAELPFL QN++LGH+ QK A  SGSASDSDVPRRK
Sbjct: 1381 INCNNKYVPEGQYLQNCNSSKPHNPAELPFLSQNIELGHNHQKNASGSGSASDSDVPRRK 1440

Query: 1441 GDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSCDIGENVPLRSYGFWDGSRIQ 1500
            GDVKLFGQILSHAPS+QNSSSGSNE GE+KGLH SSSKSCD+GE+VPLRSYGFWDGSRIQ
Sbjct: 1441 GDVKLFGQILSHAPSQQNSSSGSNECGEKKGLHNSSSKSCDMGEHVPLRSYGFWDGSRIQ 1500

Query: 1501 TGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTNNGDRSLNGLVSAFPTKDGVV 1560
            TGLSALPDSAILQ+KYPAAFSGYS TSVKTEQQ LQAL NN D+SLN +VSAFPTKDGVV
Sbjct: 1501 TGLSALPDSAILQSKYPAAFSGYSGTSVKTEQQTLQALANNSDQSLNEVVSAFPTKDGVV 1560

Query: 1561 DYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG 1620
            DYHSYRSRDGVK+RPFPVDIFSEM RRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG
Sbjct: 1561 DYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG 1620

Query: 1621 SCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGGNGGDLGSR 1664
            SCTGVSDPVAAIKMHY+KA+QY GQPGS FTREDGSWRGG GGDLGSR
Sbjct: 1621 SCTGVSDPVAAIKMHYAKADQYAGQPGSMFTREDGSWRGGKGGDLGSR 1647

BLAST of CcUC05G103030 vs. NCBI nr
Match: KAG6601151.1 (Nuclear receptor corepressor 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 2641.7 bits (6846), Expect = 0.0e+00
Identity = 1397/1687 (82.81%), Positives = 1490/1687 (88.32%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSA RWRDS+HGSREFNRWGSADFRRPTGHGK G 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSATRWRDSYHGSREFNRWGSADFRRPTGHGKLGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSEE+SHGYGPSRSFSDRVLEDESFRPSVPRGDG+Y RIGRE RGSFS RDWR HS+
Sbjct: 61   WHQFSEETSHGYGPSRSFSDRVLEDESFRPSVPRGDGKYNRIGRESRGSFSQRDWRGHSK 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQS--------------------VH 180
            E +  FGNPSRRPS  SQD SSDQRS+DDTVTYSSPQS                    V+
Sbjct: 121  ENSKEFGNPSRRPS--SQDASSDQRSLDDTVTYSSPQSDFVSVSDKIHSKDRNDKVGGVY 180

Query: 181  GLENGPRADVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIES 240
            GL NGPR+DVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEK DLP RVASP++S
Sbjct: 181  GLGNGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKTDLPRRVASPLQS 240

Query: 241  PSAETTACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAE 300
            PSAE TAC+TSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSL+KEV +LSS+SAE
Sbjct: 241  PSAEATACLTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVTVLSSSSAE 300

Query: 301  LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGS 360
            LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSK AS+DG+ICSSPGS
Sbjct: 301  LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKAASVDGIICSSPGS 360

Query: 361  SSQN-LQKLLSSIEMMEICSIANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEIS 420
            SSQN LQKL SSIE +EI SI NLGSSLVELF+SDDPSTVESCFGKSTLNKLLAYKGEIS
Sbjct: 361  SSQNHLQKLFSSIEKVEISSITNLGSSLVELFNSDDPSTVESCFGKSTLNKLLAYKGEIS 420

Query: 421  KKLETTESEIDSLENELKSLKSGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPA 480
            K LETTESEID LENELKSLKS NGGNVSH KSCS  HLVE+V YFKEQDGVSC+APRPA
Sbjct: 421  KTLETTESEIDFLENELKSLKSENGGNVSHPKSCSAVHLVESVPYFKEQDGVSCIAPRPA 480

Query: 481  PLVIVSSSDATVEKMPVCKGDMGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVV 540
            PL IVSSSDATVEKMPVC GDMG+EDV TKADEIDSPGTVTSKFNEPSRVVK +AS+LV 
Sbjct: 481  PLKIVSSSDATVEKMPVCIGDMGIEDVGTKADEIDSPGTVTSKFNEPSRVVKAVASNLVE 540

Query: 541  NGHCSEVTDVIVPDKMEGNFPVSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGS 600
            N HCSE TD IVPDKME +F  S  FVDEH TIGS NECILAKSCT ESIYGDL   A S
Sbjct: 541  NDHCSEATDSIVPDKMEESFKKSGPFVDEHLTIGSGNECILAKSCTSESIYGDLTTHADS 600

Query: 601  RSSLCDHIFVCNKEYASRAAEVIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQ 660
             SSL   IF CNKEYAS+AAEVIFK+LP EMCKIS++STKIVSC E EKLVKEK  MRRQ
Sbjct: 601  GSSLRYLIFACNKEYASKAAEVIFKELPTEMCKISTQSTKIVSCFETEKLVKEKIAMRRQ 660

Query: 661  FLKFKESALTLRFKALQQSWKEGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRL 720
             LKFKESALTLRFKALQ SWKEGLLHSVKK RSRPQKKELSLRV+HSGHQKYRSS RSR 
Sbjct: 661  ILKFKESALTLRFKALQHSWKEGLLHSVKKSRSRPQKKELSLRVTHSGHQKYRSSIRSRF 720

Query: 721  VQQGACQNPTLNTEIAVRYSSKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLV 780
            VQ G  QNP +N+EIA+RYSS+LLLNPQ+KLYRN+LKMPAMILDK EK+ALRFISHNGLV
Sbjct: 721  VQHGESQNPVVNSEIAIRYSSQLLLNPQVKLYRNTLKMPAMILDKNEKIALRFISHNGLV 780

Query: 781  EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKS 840
            EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDF+KISSFLDLKTTADCIQFYYKNHKS
Sbjct: 781  EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFRKISSFLDLKTTADCIQFYYKNHKS 840

Query: 841  DSFKKNKNLELGKQVKSSAITYLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQ 900
            DSFKKNKNLELGKQVKSSA+TY++TSGKKWNPD+NATSLDILGVAS MAAQAD +IGNQQ
Sbjct: 841  DSFKKNKNLELGKQVKSSAVTYMLTSGKKWNPDVNATSLDILGVASEMAAQADGNIGNQQ 900

Query: 901  KCTRHLGMGRDVESKVSFSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCI 960
             C RHLGMG D+ SKVS+SASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCI
Sbjct: 901  NCNRHLGMGGDIGSKVSWSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCI 960

Query: 961  TSAIDPSEDHWERKCYKVDSAVKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSI 1020
            TSAIDPSEDH ERKC+KVD A K PS SDV+QKTDN EPCSDDSSEDVDSSNWTDEEKSI
Sbjct: 961  TSAIDPSEDHKERKCHKVDFATKFPSTSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKSI 1020

Query: 1021 FMQAVSSYGKDFDMISRCIRSKSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDAS 1080
             MQAVSSYGKDFDMISRC+RSKSRDQCKVFFSKARKCLGLDL+H SGDVG TPGS ND+S
Sbjct: 1021 LMQAVSSYGKDFDMISRCVRSKSRDQCKVFFSKARKCLGLDLIHNSGDVG-TPGSDNDSS 1080

Query: 1081 GSGTDTEDHCIVEICGAHGSDEFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENT 1140
            GSGTDT+DHC+VE CGA  SDEFVSKSVNG+STSV INHEESVSAVT NMR SSEFEE+T
Sbjct: 1081 GSGTDTDDHCVVETCGARSSDEFVSKSVNGLSTSVIINHEESVSAVTANMRNSSEFEEST 1140

Query: 1141 VLQQSDEKCAEAVGNLISEISKEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENT 1200
              +Q D   AEAVGNL+SEISKEED+P+ DSHSA +LTNAAA  SQP HDHKIEG SENT
Sbjct: 1141 AFEQLDVTGAEAVGNLVSEISKEEDVPNLDSHSACSLTNAAAFPSQPAHDHKIEGCSENT 1200

Query: 1201 EGGSKCCNEPDILRSESVSTVDENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSV 1260
            E   K CN+PDILR ESV+TVDENSAAVSESRAT +LAFGGEE+G +TNLH QS+LQ SV
Sbjct: 1201 E-ACKRCNDPDILRPESVATVDENSAAVSESRATTELAFGGEEDGSDTNLHGQSMLQRSV 1260

Query: 1261 QNSTGFDSKLALEGSSLGLDPQILHPTVLKVEHV-EKSCVESENSLAVGNSEPGVIGREQ 1320
            Q+STGF+S LALE  SLG DPQI HP +LKV+ V  KSC++ ENSL V NS PGVIGRE+
Sbjct: 1261 QDSTGFNSNLALE--SLGFDPQISHPKILKVDSVANKSCIKDENSLVVRNSGPGVIGREE 1320

Query: 1321 MLNQYMLSSTAVLQEVSDAHQKPMNRDDYAEHQNNLS-HDSESKFPRSYPFNKQIFEDIN 1380
            MLNQ M  ST VLQ V DAHQKPMNRDD A+HQN LS H   S+FP SYPFNKQI EDIN
Sbjct: 1321 MLNQDMFPSTLVLQGVGDAHQKPMNRDDCADHQNRLSRHIESSEFPSSYPFNKQIVEDIN 1380

Query: 1381 RNINRTYFPVVQGLSKPDINCSSSYVSEGHYLQNCNSSKP--HNPAELPFLPQNVDLGHD 1440
            RNIN T FP  QGLSK  INC+ +YV E  YLQ+CNSSK   H  AELP LPQNVDLGHD
Sbjct: 1381 RNINHTDFPAFQGLSK--INCNGTYVVEDCYLQDCNSSKEPCHRAAELPLLPQNVDLGHD 1440

Query: 1441 RQKKALCSGSASDSDVPRRKGDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSC 1500
             Q  + CSG+ASDSDVPRRKGDVKLFGQILSHAPS QNSSSGSN+ GEEK  HK   KS 
Sbjct: 1441 HQNTS-CSGNASDSDVPRRKGDVKLFGQILSHAPSLQNSSSGSNDCGEEKEFHK-LRKSY 1500

Query: 1501 DIGENVPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTN 1560
            D+GENVPLRSYGFW+GSR+QTGLSALPDSAILQAKYPAAFSGYSATS+KTEQQPL+AL N
Sbjct: 1501 DMGENVPLRSYGFWNGSRMQTGLSALPDSAILQAKYPAAFSGYSATSLKTEQQPLRALAN 1560

Query: 1561 NGDRSLNGLVSAFPTKDGVVDYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQ 1620
            NGDR+LN LVSAFPTKDGVVDY SYRSRDGV +RPFPVD+FSEM RRNG+D +SLSSLQQ
Sbjct: 1561 NGDRNLNELVSAFPTKDGVVDYQSYRSRDGVNMRPFPVDLFSEMHRRNGYDPLSLSSLQQ 1620

Query: 1621 QGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGG 1663
            QGRV+VGMNVVGRGGILMGGSCTGVSDPVAAIKMHY+K++QYVGQPGSTFTREDGSWRGG
Sbjct: 1621 QGRVVVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKSDQYVGQPGSTFTREDGSWRGG 1676

BLAST of CcUC05G103030 vs. ExPASy Swiss-Prot
Match: Q4KKX4 (Nuclear receptor corepressor 1 OS=Xenopus tropicalis OX=8364 GN=ncor1 PE=2 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 2.9e-12
Identity = 60/201 (29.85%), Positives = 99/201 (49.25%), Query Frame = 0

Query: 634 FLMRRQFLKFKESALTLRFKALQQSWKEGLLHSVKKCRSRPQK--KELSLRVSHSGH--- 693
           F  R    K +E  +  R+  L ++W++     V +  + P++  KE   R  +      
Sbjct: 285 FKRRNHARKLREQNICQRYDQLMEAWEK----KVDRIENNPRRKAKESKTREYYEKQFPE 344

Query: 694 ---QKYRSSTRSRLVQQGACQNPTL---NTEIAVRYSSKLLLNPQIKLYRNSLKMPAMIL 753
              Q+ +     R+ Q+GA  + T+     EI+             K  R    +P M+ 
Sbjct: 345 IRKQREQQERFQRVGQRGAGLSATIARSEHEISEIIDGLSEQENNEKQMRQLSVIPPMMF 404

Query: 754 DKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLD 813
           D  E+  ++FI+ NGL+EDP  V K+R  +N WT  E+EIF EK     K+F  I+S+L+
Sbjct: 405 D-AEQRRVKFINMNGLMEDPMKVYKDRQFMNVWTDHEKEIFKEKFVQHPKNFGLIASYLE 464

Query: 814 LKTTADCIQFYYKNHKSDSFK 824
            KT +DC+ +YY   K+++FK
Sbjct: 465 RKTVSDCVLYYYLTKKNENFK 480

BLAST of CcUC05G103030 vs. ExPASy Swiss-Prot
Match: Q8QG78 (Nuclear receptor corepressor 1 OS=Xenopus laevis OX=8355 GN=ncor1 PE=1 SV=1)

HSP 1 Score: 71.6 bits (174), Expect = 9.3e-11
Identity = 57/201 (28.36%), Positives = 96/201 (47.76%), Query Frame = 0

Query: 634 FLMRRQFLKFKESALTLRFKALQQSWKEGLLHSVKKCRSRPQK--KELSLRVSHSGH--- 693
           F  R    K +E  +  R+  L ++W++     V +  + P++  KE   R  +      
Sbjct: 285 FKRRNHARKLREQNICQRYDQLMEAWEK----KVDRIENNPRRKAKESKTREYYEKQFPE 344

Query: 694 ---QKYRSSTRSRLVQQGACQNPTL---NTEIAVRYSSKLLLNPQIKLYRNSLKMPAMIL 753
              Q+ +     R+ Q+G   + T+     EI+             K  R    +P M+ 
Sbjct: 345 IRKQREQQERFQRVGQRGTGMSATIARSEHEISEIIDGLSEQENNEKQMRQLSVIPPMMF 404

Query: 754 DKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLD 813
           D  E+  ++FI+ NGL+EDP  V K+R  +N WT  E+EIF EK     K+F  I+S+L+
Sbjct: 405 D-AEQRRVKFINTNGLMEDPMKVYKDRQFMNVWTDHEKEIFKEKFVRHPKNFGLIASYLE 464

Query: 814 LKTTADCIQFYYKNHKSDSFK 824
            K  +DC+ +YY   K+++ K
Sbjct: 465 RKNVSDCVLYYYLTKKNENLK 480

BLAST of CcUC05G103030 vs. ExPASy Swiss-Prot
Match: Q9WU42 (Nuclear receptor corepressor 2 OS=Mus musculus OX=10090 GN=Ncor2 PE=1 SV=3)

HSP 1 Score: 71.2 bits (173), Expect = 1.2e-10
Identity = 99/467 (21.20%), Positives = 192/467 (41.11%), Query Frame = 0

Query: 593  EYASRAAEVIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLM----RRQFLKFKESAL 652
            E A R  E +  ++ + +    S + +     +I + +++K ++    R    K  E   
Sbjct: 239  EAAHRILEGLGPQVELPLYNQPSDTRQYHENIKINQAMRKKLILYFKRRNHARKQWEQRF 298

Query: 653  TLRFKALQQSWKEGLLHSVKKCRSRPQKKELSLRVSHSGHQKY---------RSSTRSRL 712
              R+  L ++W++     V++  + P+++    +V     +++         +   +SR+
Sbjct: 299  CQRYDQLMEAWEK----KVERIENNPRRRAKESKVREYYEKQFPEIRKQRELQERMQSRV 358

Query: 713  VQQG------ACQNPTLNTEIAVRYSSKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFI 772
             Q+G      A ++    +EI    S +  L  Q+   R    +P M+ D  ++  ++FI
Sbjct: 359  GQRGSGLSMSAARSEHEVSEIIDGLSEQENLEKQM---RQLAVIPPMLYD-ADQQRIKFI 418

Query: 773  SHNGLVEDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLDLKTTADCIQFY 832
            + NGL++DP  V K+R + N W+  ER+ F EK     K+F  I+SFL+ KT A+C+ +Y
Sbjct: 419  NMNGLMDDPMKVYKDRQVTNMWSEQERDTFREKFMQHPKNFGLIASFLERKTVAECVLYY 478

Query: 833  YKNHKSDSFKKNKNLELGKQVKSSAITYLVTSGKKWNPDMNATSLDILGVASIMAAQADY 892
            Y   K++++K        ++ KS                                 Q   
Sbjct: 479  YLTKKNENYKSLVRRSYRRRGKS------------------------------QQQQQQQ 538

Query: 893  DIGNQQKCTRHLGMGRDVESKVSFSASTPSNKNNLDALQTEKETVAADVLAGICGSISSE 952
                QQ+  R     ++ + K          +   DA + EKE ++ +      G  + E
Sbjct: 539  QQQQQQQMARSSQEEKEEKEK---EKEADKEEEKQDA-ENEKEELSKEKTDDTSGEDNDE 598

Query: 953  ALSSCITSAIDPSEDHWERKCYKVDSAVKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWT 1012
               +  +     +     RK     S     +  +      + E  S + +E   SS WT
Sbjct: 599  K-EAVASKGRKTANSQGRRKGRITRSMANEANHEETATPQQSSELASMEMNE---SSRWT 658

Query: 1013 DEEKSIFMQAVSSYGKDFDMISRCIRSKSRDQCKVFFSKARKCLGLD 1041
            +EE     + +  +G+++  I+R + SK+  QCK F+   +K   LD
Sbjct: 659  EEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYKKRQNLD 659

BLAST of CcUC05G103030 vs. ExPASy Swiss-Prot
Match: Q9Y618 (Nuclear receptor corepressor 2 OS=Homo sapiens OX=9606 GN=NCOR2 PE=1 SV=3)

HSP 1 Score: 64.7 bits (156), Expect = 1.1e-08
Identity = 110/554 (19.86%), Positives = 219/554 (39.53%), Query Frame = 0

Query: 593  EYASRAAEVIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLM----RRQFLKFKESAL 652
            E A R  E +  ++ + +    S + +     +I + +++K ++    R    K  E   
Sbjct: 239  EAAHRILEGLGPQVELPLYNQPSDTRQYHENIKINQAMRKKLILYFKRRNHARKQWEQKF 298

Query: 653  TLRFKALQQSWKEGLLHSVKKCRSRPQKKELSLRVSHSGHQKY---------RSSTRSRL 712
              R+  L ++W++     V++  + P+++    +V     +++         +   +SR+
Sbjct: 299  CQRYDQLMEAWEK----KVERIENNPRRRAKESKVREYYEKQFPEIRKQRELQERMQSRV 358

Query: 713  VQQG------ACQNPTLNTEIAVRYSSKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFI 772
             Q+G      A ++    +EI    S +  L  Q+   R    +P M+ D  ++  ++FI
Sbjct: 359  GQRGSGLSMSAARSEHEVSEIIDGLSEQENLEKQM---RQLAVIPPMLYD-ADQQRIKFI 418

Query: 773  SHNGLVEDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLDLKTTADCIQFY 832
            + NGL+ DP  V K+R ++N W+  E+E F EK     K+F  I+SFL+ KT A+C+ +Y
Sbjct: 419  NMNGLMADPMKVYKDRQVMNMWSEQEKETFREKFMQHPKNFGLIASFLERKTVAECVLYY 478

Query: 833  YKNHKSDSFKKNKNLELGKQVKSSAITYLVTSGKKWNPDMNATSLDILGVASIMAAQADY 892
            Y   K++++K        ++ KS                                 Q   
Sbjct: 479  YLTKKNENYKSLVRRSYRRRGKS------------------------------QQQQQQQ 538

Query: 893  DIGNQQKCTRHLGMGRDVESKVSFSASTPSNKNNLDALQTEKETVAADVLAGICGSISSE 952
                QQ+  + +      E            +     ++ +KE +  +      G  + E
Sbjct: 539  QQQQQQQQQQPMPRSSQEEKDEKEKEKEAEKEEEKPEVENDKEDLLKEKTDDTSGEDNDE 598

Query: 953  ALSSCITSAIDPSEDHWERKCYKVDSAVKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWT 1012
               +  +     +     RK     S     +  + I    + E  S + +E   SS WT
Sbjct: 599  K-EAVASKGRKTANSQGRRKGRITRSMANEANSEEAITPQQSAELASMELNE---SSRWT 658

Query: 1013 DEEKSIFMQAVSSYGKDFDMISRCIRSKSRDQCKVFFSKARKCLGLDLM----------- 1072
            +EE     + +  +G+++  I+R + SK+  QCK F+   +K   LD +           
Sbjct: 659  EEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYKKRQNLDEILQQHKLKMEKE 718

Query: 1073 -HTSGDVGETPGSGNDASGSGTDTEDHCIVEICGAHGSDEFVSKSVNGVSTSVN-INHEE 1115
             +      + P + ++ +      ED   +E  G  G++E + +    +  S N +   E
Sbjct: 719  RNARRKKKKAPAAASEEAAFPPVVEDE-EMEASGVSGNEEEMVEEAEALHASGNEVPRGE 749

BLAST of CcUC05G103030 vs. ExPASy Swiss-Prot
Match: O75376 (Nuclear receptor corepressor 1 OS=Homo sapiens OX=9606 GN=NCOR1 PE=1 SV=2)

HSP 1 Score: 63.5 bits (153), Expect = 2.5e-08
Identity = 128/602 (21.26%), Positives = 229/602 (38.04%), Query Frame = 0

Query: 634  FLMRRQFLKFKESALTLRFKALQQSWKEGLLHSVKKCRSRPQK--KELSLRVSHSGH--- 693
            F  R    K +E  +  R+  L ++W++     V +  + P++  KE   R  +      
Sbjct: 293  FKRRNHARKQREQKICQRYDQLMEAWEK----KVDRIENNPRRKAKESKTREYYEKQFPE 352

Query: 694  ---QKYRSSTRSRLVQQGACQNPTL---NTEIAVRYSSKLLLNPQIKLYRNSLKMPAMIL 753
               Q+ +     R+ Q+GA  + T+     EI+             K  R    +P M+ 
Sbjct: 353  IRKQREQQERFQRVGQRGAGLSATIARSEHEISEIIDGLSEQENNEKQMRQLSVIPPMMF 412

Query: 754  DKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLD 813
            D  E+  ++FI+ NGL+EDP  V K+R  +N WT  E+EIF +K     K+F  I+S+L+
Sbjct: 413  D-AEQRRVKFINMNGLMEDPMKVYKDRQFMNVWTDHEKEIFKDKFIQHPKNFGLIASYLE 472

Query: 814  LKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAITYLVTSGKK--WNPDMNATSLDI 873
             K+  DC+ +YY   K++++K       GK+   +      +  +K     +  A   + 
Sbjct: 473  RKSVPDCVLYYYLTKKNENYKALVRRNYGKRRGRNQQIARPSQEEKVEEKEEDKAEKTEK 532

Query: 874  LGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSASTPSNKNNLDALQTEKETVAA 933
                     + D    +++       +    E       +TP  +   ++    K     
Sbjct: 533  KEEEKKDEEEKDEKEDSKENTKEKDKIDGTAEETEEREQATPRGRKTANSQGRRK----- 592

Query: 934  DVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDSAVKLPSLSDVIQKTDNEEPCS 993
                 I  S+++EA ++   +A    E                P L           P  
Sbjct: 593  ---GRITRSMTNEAAAASAAAAAATEEPP--------------PPL---------PPPPE 652

Query: 994  DDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIRSKSRDQCKVFFSKARKCLGLD 1053
              S+E V++S WT+EE  +  + +  +G+++  I++ + +KS  QCK F+   ++   LD
Sbjct: 653  PISTEPVETSRWTEEEMEVAKKGLVEHGRNWAAIAKMVGTKSEAQCKNFYFNYKRRHNLD 712

Query: 1054 --LMHTSGDVGETPGSGNDASGSGTDTEDHCIVEICGAHGSDEFVSKSVNGVSTSVNINH 1113
              L          P    D S      +   +     A   DE +  S          N 
Sbjct: 713  NLLQQHKQKTSRKPREERDVS------QCESVASTVSAQ-EDEDIEAS----------NE 772

Query: 1114 EESVSAVTVN-MRTSSEFEENTVLQQSDEKCAEAVGNLISEISKEEDLPSPDSHSAYN-- 1173
            EE+     V  ++ S +  EN   + + E   E      +  S    L  P +  A +  
Sbjct: 773  EENPEDSEVEAVKPSEDSPENATSRGNTEPAVELEPTTETAPSTSPSLAVPSTKPAEDES 832

Query: 1174 -LTNAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVST---VDENSAAVSESR 1214
              T    S+S    +       E++      C+ P   +++SV     V EN A+  E  
Sbjct: 833  VETQVNDSISAETAEQMDVDQQEHSAEEGSVCDPPPATKADSVDVEVRVPENHASKVEGD 841

BLAST of CcUC05G103030 vs. ExPASy TrEMBL
Match: A0A5A7SZU1 (Myb_DNA-binding domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G002830 PE=4 SV=1)

HSP 1 Score: 2786.1 bits (7221), Expect = 0.0e+00
Identity = 1448/1668 (86.81%), Positives = 1532/1668 (91.85%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS+HGSREFNRWGSAD RRPTGHGKQG 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSE+SSHGYGPSRSFSDRV+EDESFRPSVPRGDG+YIRIGRE RGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQSVHGLENGPRADVEVSLGSTDWK 180
            +TNNGFGNPSRRPS  SQDVSSDQRSVDDTVTYSSPQS HGLENGPR+DVEVSLGSTDWK
Sbjct: 121  DTNNGFGNPSRRPS--SQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWK 180

Query: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAETTACVTSSLPSEDAIS 240
            PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAE TACVTSSLPSED IS
Sbjct: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDTIS 240

Query: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEKSPKTLPFS 300
            RKKPRLGWGDGLAKYEKEKV+VPDGSL+KEVALLSS S ELTHSLGSNFAEKSPKTLPFS
Sbjct: 241  RKKPRLGWGDGLAKYEKEKVDVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFS 300

Query: 301  DCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGSSSQNLQKLLSSIEMMEICSI 360
            DCASPATPSSFACSSSSGLEDKPFSKGAS DGMICSSPGS SQNLQKLL SIE MEI SI
Sbjct: 301  DCASPATPSSFACSSSSGLEDKPFSKGASADGMICSSPGSGSQNLQKLLCSIEKMEISSI 360

Query: 361  ANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEISKKLETTESEIDSLENELKSLK 420
            ANLGSSLVELFHSDDP+T+ESCFGKSTLNKLLAYKGEISK LE TESEIDSLENELKSLK
Sbjct: 361  ANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLK 420

Query: 421  SGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSSDATVEKMPVCKGD 480
            SGNGGNVS+KKSCS T LVE+ TYFKEQDG+SC+APRPAPLV+VSSSDATVEK+P+CKGD
Sbjct: 421  SGNGGNVSNKKSCSATRLVESSTYFKEQDGISCIAPRPAPLVVVSSSDATVEKVPLCKGD 480

Query: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVTDVIVPDKMEGNFP 540
            MGVEDVDTKADEIDSPGTVTSKFNEPSRVVK   SD+V NGHCS VTD+IVP KMEGNFP
Sbjct: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKENTSDIVDNGHCSVVTDMIVPGKMEGNFP 540

Query: 541  VSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCDHIFVCNKEYASRAAE 600
            +S  FVDE KT GS NECILAKSC+ ES  GDLMAQAGSRSSLCD IF CNKEYASRAAE
Sbjct: 541  ISEPFVDERKTTGSGNECILAKSCSSESFNGDLMAQAGSRSSLCDSIFACNKEYASRAAE 600

Query: 601  VIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKESALTLRFKALQQSWK 660
            VIFK+ PV +CKISSKSTK VSCSE EKL+KEKF+ R++FLKFKESALTLRFKALQQSWK
Sbjct: 601  VIFKRSPVGVCKISSKSTKYVSCSETEKLIKEKFVSRKKFLKFKESALTLRFKALQQSWK 660

Query: 661  EGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRLVQQGACQNPTLNTEIAVRYSS 720
            E LLHSVKKCRSRPQKKELSLRV+HSGHQKYRSS RSRL+QQGACQ+ T NTEIAVR+SS
Sbjct: 661  ECLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSFRSRLIQQGACQSTTFNTEIAVRHSS 720

Query: 721  KLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAER 780
            KLLLNPQIKLYRN+LKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERN+INPWTSAE+
Sbjct: 721  KLLLNPQIKLYRNTLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNLINPWTSAEK 780

Query: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAIT 840
            EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQ+KSSAIT
Sbjct: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAIT 840

Query: 841  YLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSAS 900
            YLVTSGKKWNPD NATSLDILGVAS+MAAQA+YDIGNQQKC+RHLG G+DVESKVS+SAS
Sbjct: 841  YLVTSGKKWNPDANATSLDILGVASVMAAQAEYDIGNQQKCSRHLGTGKDVESKVSWSAS 900

Query: 901  TPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDSA 960
            TP NK+NLD LQTEKETVAADVLAGI GSISSEALSSCITSAIDP E+  E+KCYKVDSA
Sbjct: 901  TP-NKSNLDDLQTEKETVAADVLAGISGSISSEALSSCITSAIDPREELREQKCYKVDSA 960

Query: 961  VKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIRS 1020
             KLPSLSDV+QKTDN EPCSDDSSEDVDSSNWTDEEK IF+QAVSSYGKDFDMISRCIRS
Sbjct: 961  AKLPSLSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKVIFLQAVSSYGKDFDMISRCIRS 1020

Query: 1021 KSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDASGSGTDTEDHCIVEICGAHGSD 1080
            KSRDQCK+FFSKARKCLGLDLMHTSGDVGETPG+GND SGSGTDTEDHC+VEICG  GSD
Sbjct: 1021 KSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNGNDISGSGTDTEDHCVVEICGGRGSD 1080

Query: 1081 EFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENTVLQQSDEKCAEAVGNLISEIS 1140
            E +SKS+NGVSTSVNINHEESVSA TVNMRTS EFE +T LQQ DEK AEAVGN+I E  
Sbjct: 1081 ESISKSINGVSTSVNINHEESVSAATVNMRTSMEFEGSTALQQLDEKGAEAVGNMIFETL 1140

Query: 1141 KEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVSTV 1200
            KEED+P+P               SQP+HD KIEGSSENTEGG K CNEPDILRSESVSTV
Sbjct: 1141 KEEDVPNP---------------SQPMHDQKIEGSSENTEGG-KSCNEPDILRSESVSTV 1200

Query: 1201 DENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSVQNSTGFDSKLALEGSSLGLDP 1260
            DENSAAVSE RAT KLA G EE G + NLH QS +QCS Q+STG+DS +ALEGSS+GLDP
Sbjct: 1201 DENSAAVSECRATVKLAIGEEEVGSDANLHSQSTMQCSGQDSTGYDSNIALEGSSIGLDP 1260

Query: 1261 QILHPTVLKVEHVE-KSCVES-ENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVSDAH 1320
            QILHP +LKVE VE KSC++S EN LAV NS+ GVIGREQMLNQ + SST VLQ+VSDA 
Sbjct: 1261 QILHPNILKVEPVEKKSCIKSEENFLAVRNSDTGVIGREQMLNQDVSSSTLVLQDVSDAD 1320

Query: 1321 QKPMNR--DDYAEHQNNLSHDSES-KFPRSYPFNKQIFEDINRNINRTYFPVVQGLSKPD 1380
            QKPMNR  DD  EH+NNL  +SES KFPRSYPFNKQIFEDINRNIN TYFPVVQGLSKPD
Sbjct: 1321 QKPMNRDKDDDDEHRNNLLRNSESPKFPRSYPFNKQIFEDINRNINHTYFPVVQGLSKPD 1380

Query: 1381 INCSSSYVSEGHYLQNCNSSKPHNPAELPFLPQNVDLGHDRQKKALCSGSASDSDVPRRK 1440
            INC++ YV EG YLQNCNSSKPHNPAELPFL QN++LGH+ QK A  SGSASDSDVPRRK
Sbjct: 1381 INCNNKYVPEGQYLQNCNSSKPHNPAELPFLSQNIELGHNHQKNASGSGSASDSDVPRRK 1440

Query: 1441 GDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSCDIGENVPLRSYGFWDGSRIQ 1500
            GDVKLFGQILSHAPS+QNSSSGSNE GE+KGLH SSSKSCD+GE+VPLRSYGFWDGSRIQ
Sbjct: 1441 GDVKLFGQILSHAPSQQNSSSGSNECGEKKGLHNSSSKSCDMGEHVPLRSYGFWDGSRIQ 1500

Query: 1501 TGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTNNGDRSLNGLVSAFPTKDGVV 1560
            TGLSALPDSAILQ+KYPAAFSGYS TSVKTEQQ LQAL NN D+SLN +VSAFPTKDGVV
Sbjct: 1501 TGLSALPDSAILQSKYPAAFSGYSGTSVKTEQQTLQALANNSDQSLNEVVSAFPTKDGVV 1560

Query: 1561 DYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG 1620
            DYHSYRSRDGVK+RPFPVDIFSEM RRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG
Sbjct: 1561 DYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG 1620

Query: 1621 SCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGGNGGDLGSR 1664
            SCTGVSDPVAAIKMHY+KA+QY GQPGS FTREDGSWRGG GGDLGSR
Sbjct: 1621 SCTGVSDPVAAIKMHYAKADQYAGQPGSMFTREDGSWRGGKGGDLGSR 1648

BLAST of CcUC05G103030 vs. ExPASy TrEMBL
Match: A0A0A0KWU7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G623500 PE=4 SV=1)

HSP 1 Score: 2758.8 bits (7150), Expect = 0.0e+00
Identity = 1446/1671 (86.54%), Positives = 1532/1671 (91.68%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS+HGSREFNRWGSAD RRPTGHGKQG 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSE+SSHGYGPSRSFSDRV+EDESFRPSVPRGDG+YIRIGRE RGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQSVHGLENGPRADVEVSLGSTDWK 180
            + NNGFGNPSRR S  SQDVSSDQRSVDDTVTYSSPQS HGLENGPR+DVEVSLGSTDWK
Sbjct: 121  DANNGFGNPSRRTS--SQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWK 180

Query: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAETTACVTSSLPSEDAIS 240
            PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAE TACVTSSLPSEDAIS
Sbjct: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDAIS 240

Query: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEKSPKTLPFS 300
            RKKPRLGWGDGLAKYEKEKVEVPDGSL+KEVALLSS S ELTHSLGSNFAEKSPKTLPFS
Sbjct: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFS 300

Query: 301  DCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGSSSQNLQKLLSSIEMMEICSI 360
            DCASPATPSSFACSSSSGLEDKPFSKGA  DGMICSSPGS SQNLQKLL SIE MEI S+
Sbjct: 301  DCASPATPSSFACSSSSGLEDKPFSKGAGADGMICSSPGSGSQNLQKLLCSIEKMEISSV 360

Query: 361  ANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEISKKLETTESEIDSLENELKSLK 420
            ANLGSSLVELFHSDDP+T+ESCFGKSTLNKLLAYKGEISK LE TESEIDSLENELKSLK
Sbjct: 361  ANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLK 420

Query: 421  SGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSSDATVEKMPVCKGD 480
            S NGGNVSHKKSCS T ++E+ TYFKEQDG+SC+A RPAPLV+VSSSDATVEK+P+CKGD
Sbjct: 421  SVNGGNVSHKKSCSATRVMESSTYFKEQDGISCIATRPAPLVVVSSSDATVEKVPLCKGD 480

Query: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVTDVIVPDKMEGNFP 540
            +GVEDVDTKADEIDSPGTVTSKFNEPSRVVK +ASD+V NGHCS VTD IVP KMEG+FP
Sbjct: 481  VGVEDVDTKADEIDSPGTVTSKFNEPSRVVKAIASDIVDNGHCSVVTDAIVPGKMEGSFP 540

Query: 541  VSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCDHIFVCNKEYASRAAE 600
            +S  FVDEH+TIGS NEC LAKSCT ES+YGDLMAQAGSRSSLCD IF CNKEYASRAAE
Sbjct: 541  ISGPFVDEHETIGSGNECTLAKSCTSESVYGDLMAQAGSRSSLCDSIFACNKEYASRAAE 600

Query: 601  VIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKESALTLRFKALQQSWK 660
            VIFK+ PV MCKISSKSTK VSCSE EKL+KEKF+MR++FLKFKESALTLRFK+LQQSWK
Sbjct: 601  VIFKRSPVGMCKISSKSTKNVSCSETEKLIKEKFVMRKKFLKFKESALTLRFKSLQQSWK 660

Query: 661  EGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSST-RSRLVQQGACQNPTLNTEIAVRYS 720
            EGLLHSVKKCRSRPQKKELSLRV+HSGHQKYRSS+ RSRLVQQGACQ+ T NTEIAVR+S
Sbjct: 661  EGLLHSVKKCRSRPQKKELSLRVTHSGHQKYRSSSIRSRLVQQGACQSSTFNTEIAVRHS 720

Query: 721  SKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAE 780
            SKLLLNPQIKLYRN+LKMPAMILDKKEK+ALRFISHNGLVEDPCAVEKERN+INPWTSAE
Sbjct: 721  SKLLLNPQIKLYRNTLKMPAMILDKKEKIALRFISHNGLVEDPCAVEKERNLINPWTSAE 780

Query: 781  REIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAI 840
            +EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQ+KSSAI
Sbjct: 781  KEIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAI 840

Query: 841  TYLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSA 900
            TYLVTSGKKWNPD NATSLDILGVAS+MAAQADYDI NQQKCTRHLG+GRDVESKVS+SA
Sbjct: 841  TYLVTSGKKWNPDANATSLDILGVASVMAAQADYDIENQQKCTRHLGVGRDVESKVSWSA 900

Query: 901  STPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDS 960
            S+P NK+NLD LQTEKETVAADVLAGI GSISSEALSSCITSAIDP E+  ERKCY+VD 
Sbjct: 901  SSP-NKSNLDDLQTEKETVAADVLAGISGSISSEALSSCITSAIDPREELRERKCYRVDF 960

Query: 961  AVKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIR 1020
            A KLPSLSDV+QKTDN EPCSDDSSEDVDSSNWTDEEK +FMQAVSSYGKDFDMISRCIR
Sbjct: 961  AAKLPSLSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKLVFMQAVSSYGKDFDMISRCIR 1020

Query: 1021 SKSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDA--SGSGTDTEDHCIVEICGAH 1080
            SKSRDQCK+FFSKARKCLGLDLMHTSGDVGETPG+GNDA  SGSGTDTE+HC+VEIC   
Sbjct: 1021 SKSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNGNDASGSGSGTDTEEHCVVEICEGR 1080

Query: 1081 GSDEFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENTVLQQSDEKCAEAVGNLIS 1140
            GSDEF+SKS+NG STSVNINHEE+VSAVT NMRTS EFEE+T LQQSDEK AEAVGNLI 
Sbjct: 1081 GSDEFISKSINGGSTSVNINHEETVSAVTDNMRTSMEFEESTALQQSDEKGAEAVGNLIF 1140

Query: 1141 EISKEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESV 1200
            E  KEED+P+P               SQP HDHKIEGSSENTE G K CNEPDILRSESV
Sbjct: 1141 ETLKEEDVPNP---------------SQPTHDHKIEGSSENTESG-KSCNEPDILRSESV 1200

Query: 1201 STVDENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSVQNSTGFDSKLALEGSSLG 1260
            STVDENSAAVSE RAT KLA  GEE G +TNLH QS + CS Q+STG DS +ALEGSS+G
Sbjct: 1201 STVDENSAAVSEGRATVKLAI-GEEVGSDTNLHGQSTILCSGQDSTGNDSNIALEGSSVG 1260

Query: 1261 LDPQILHPTVLKVEHVE-KSCVES-ENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVS 1320
            LDP ILHP +LKVE VE KSC++S EN L+V NS+ GVIGREQMLNQ +LS T VLQE+S
Sbjct: 1261 LDPHILHPNILKVEPVEKKSCIKSEENFLSVRNSDTGVIGREQMLNQDILSPTLVLQEIS 1320

Query: 1321 DAHQKPMNRDDYAEHQNNLSHDSESK-FPRSYPFNKQIFEDINRNINRTYFPVVQGLSKP 1380
            DA+QKPMNRDD AEH NNL  +SES  FPRSYPFNKQIFEDINRNIN  YF  VQGLSKP
Sbjct: 1321 DANQKPMNRDDDAEHPNNLLCNSESSTFPRSYPFNKQIFEDINRNINHAYFR-VQGLSKP 1380

Query: 1381 DINCSSSYVSEGHYLQNCNSSKPHNPAELPFLPQNVDLGHDRQKKALCSGSASDSDVPRR 1440
            DINC+S YVSEG +LQNCNSSKPHN AE PFL QN++LGHD QK A  SGSASDSDVPRR
Sbjct: 1381 DINCNSKYVSEGQFLQNCNSSKPHNLAEPPFLSQNIELGHDHQKNASGSGSASDSDVPRR 1440

Query: 1441 KGDVKLFGQILSHAPSKQNSSSGSNEGGEEKG-LHKSSSKSCDIGENVPLRSYGFWDGSR 1500
            KGDVKLFGQILSHAPS+QNSSSGSNE GE+KG LH SSSKSCD+GEN+PLRSYGFWDGSR
Sbjct: 1441 KGDVKLFGQILSHAPSQQNSSSGSNECGEKKGPLHNSSSKSCDMGENIPLRSYGFWDGSR 1500

Query: 1501 IQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTNNGDRSLNGLVSAFPTKDG 1560
            IQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQAL+NNGD+SLN LVSAFPTKDG
Sbjct: 1501 IQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALSNNGDQSLNELVSAFPTKDG 1560

Query: 1561 VVDYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILM 1620
            VVDYHSYRSRDGVK+RPFPVDIFSEM RRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILM
Sbjct: 1561 VVDYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILM 1620

Query: 1621 GGSCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSW-RGGNGGDLGSR 1664
            GGSCTGVSDPVAAIKMHY+KA+QY GQP S FTREDGSW  GGNGGDLGSR
Sbjct: 1621 GGSCTGVSDPVAAIKMHYAKADQYAGQPASMFTREDGSWGGGGNGGDLGSR 1649

BLAST of CcUC05G103030 vs. ExPASy TrEMBL
Match: A0A1S3BG74 (LOW QUALITY PROTEIN: uncharacterized protein LOC103489481 OS=Cucumis melo OX=3656 GN=LOC103489481 PE=4 SV=1)

HSP 1 Score: 2751.9 bits (7132), Expect = 0.0e+00
Identity = 1435/1668 (86.03%), Positives = 1521/1668 (91.19%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDS+HGSREFNRWGSAD RRPTGHGKQG 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSYHGSREFNRWGSADLRRPTGHGKQGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSE+SSHGYGPSRSFSDRV+EDESFRPSVPRGDG+YIRIGRE RGSFSHRDWRSHSR
Sbjct: 61   WHQFSEDSSHGYGPSRSFSDRVIEDESFRPSVPRGDGKYIRIGRESRGSFSHRDWRSHSR 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQSVHGLENGPRADVEVSLGSTDWK 180
            +TNNGFGNPSRRPS  SQDVSSDQRSVDDTVTYSSPQS HGLENGPR+DVEVSLGSTDWK
Sbjct: 121  DTNNGFGNPSRRPS--SQDVSSDQRSVDDTVTYSSPQSFHGLENGPRSDVEVSLGSTDWK 180

Query: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAETTACVTSSLPSEDAIS 240
            PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAE TACVTSSLPSED IS
Sbjct: 181  PLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIESPSAEATACVTSSLPSEDTIS 240

Query: 241  RKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEKSPKTLPFS 300
            RKKPRLGWGDGLAKYEKEKV+VPDGSL+KEVALLSS S ELTHSLGSNFAEKSPKTLPFS
Sbjct: 241  RKKPRLGWGDGLAKYEKEKVDVPDGSLRKEVALLSSGSGELTHSLGSNFAEKSPKTLPFS 300

Query: 301  DCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGSSSQNLQKLLSSIEMMEICSI 360
            DCASPATPSSFACSSSSGLEDKPFSKGAS DGMICSSPGS SQNLQKLL SIE MEI SI
Sbjct: 301  DCASPATPSSFACSSSSGLEDKPFSKGASADGMICSSPGSGSQNLQKLLCSIEKMEISSI 360

Query: 361  ANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEISKKLETTESEIDSLENELKSLK 420
            ANLGSSLVELFHSDDP+T+ESCFGKSTLNKLLAYKGEISK LE TESEIDSLENELKSLK
Sbjct: 361  ANLGSSLVELFHSDDPNTIESCFGKSTLNKLLAYKGEISKTLEMTESEIDSLENELKSLK 420

Query: 421  SGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSSDATVEKMPVCKGD 480
            SGNGGNVS+KKSCS T LVE+ TYFKEQDG+SC+APRPAPLV+VSSSDATVEK+P+CKGD
Sbjct: 421  SGNGGNVSNKKSCSATRLVESSTYFKEQDGISCIAPRPAPLVVVSSSDATVEKVPLCKGD 480

Query: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVTDVIVPDKMEGNFP 540
            MGVEDVDTKADEIDSPGTVTSKFNEPSRVVK   SD+V NGHCS VTD+IVP KMEGNFP
Sbjct: 481  MGVEDVDTKADEIDSPGTVTSKFNEPSRVVKENTSDIVDNGHCSVVTDMIVPGKMEGNFP 540

Query: 541  VSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCDHIFVCNKEYASRAAE 600
            +S  FVDE KT GS NECILAKSC+ ES  GDLMAQAGSRSSLCD IF CNKEYASRAAE
Sbjct: 541  ISEPFVDERKTTGSGNECILAKSCSSESFNGDLMAQAGSRSSLCDSIFACNKEYASRAAE 600

Query: 601  VIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKESALTLRFKALQQSWK 660
            VIFK+ PV +CKISSKSTK VSCSE EKL+KEKF+ R++FLKFKESALTLRFKALQQSWK
Sbjct: 601  VIFKRSPVGVCKISSKSTKYVSCSETEKLIKEKFVSRKKFLKFKESALTLRFKALQQSWK 660

Query: 661  EGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRLVQQGACQNPTLNTEIAVRYSS 720
                   K+   +  KKELSLRV+HSGHQKYRSS RSRL+QQGACQ+ T NTEIAVR+SS
Sbjct: 661  M-FAAFCKEMSLKATKKELSLRVTHSGHQKYRSSFRSRLIQQGACQSTTFNTEIAVRHSS 720

Query: 721  KLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNMINPWTSAER 780
            KLLLNPQIKLYRN+LKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERN+INPWTSAE+
Sbjct: 721  KLLLNPQIKLYRNTLKMPAMILDKKEKMALRFISHNGLVEDPCAVEKERNLINPWTSAEK 780

Query: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQVKSSAIT 840
            EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQ+KSSAIT
Sbjct: 781  EIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFKKNKNLELGKQMKSSAIT 840

Query: 841  YLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCTRHLGMGRDVESKVSFSAS 900
            YLVTSGKKWNPD NATSLDILGVAS+MAAQA+YDIGNQQKC+RHLG G+DVESKVS+SAS
Sbjct: 841  YLVTSGKKWNPDANATSLDILGVASVMAAQAEYDIGNQQKCSRHLGTGKDVESKVSWSAS 900

Query: 901  TPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCITSAIDPSEDHWERKCYKVDSA 960
            TP NK+NLD LQTEKETVAADVLAGI GSISSEALSSCITSAIDP E+  E+KCYKVDSA
Sbjct: 901  TP-NKSNLDDLQTEKETVAADVLAGISGSISSEALSSCITSAIDPREELREQKCYKVDSA 960

Query: 961  VKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIRS 1020
             KLPSLSDV+QKTDN EPCSDDSSEDVDSSNWTDEEK IF+QAVSSYGKDFDMISRCIRS
Sbjct: 961  AKLPSLSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKVIFLQAVSSYGKDFDMISRCIRS 1020

Query: 1021 KSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDASGSGTDTEDHCIVEICGAHGSD 1080
            KSRDQCK+FFSKARKCLGLDLMHTSGDVGETPG+GND SGSGTDTEDHC+VEICG  GSD
Sbjct: 1021 KSRDQCKIFFSKARKCLGLDLMHTSGDVGETPGNGNDISGSGTDTEDHCVVEICGGRGSD 1080

Query: 1081 EFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENTVLQQSDEKCAEAVGNLISEIS 1140
            E +SKS+NGVSTSVNINHEESVSA TVNMRTS EFE +T LQQ DEK AEAVGN+I E  
Sbjct: 1081 ESISKSINGVSTSVNINHEESVSAATVNMRTSMEFEGSTALQQLDEKGAEAVGNMIFETL 1140

Query: 1141 KEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENTEGGSKCCNEPDILRSESVSTV 1200
            KEED+P+P               SQP+HD KIEGSSENTEGG K CNEPDILRSESVSTV
Sbjct: 1141 KEEDVPNP---------------SQPMHDQKIEGSSENTEGG-KSCNEPDILRSESVSTV 1200

Query: 1201 DENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSVQNSTGFDSKLALEGSSLGLDP 1260
            DENSAAVSE RAT KLA G EE G + NLH QS +QCS Q+STG+DS +ALEGSS+GLDP
Sbjct: 1201 DENSAAVSECRATVKLAIGEEEVGSDANLHSQSTMQCSGQDSTGYDSNIALEGSSIGLDP 1260

Query: 1261 QILHPTVLKVEHVE-KSCVES-ENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVSDAH 1320
            QILHP +LKVE VE KSC++S EN LAV NS+ GVIGREQMLNQ + SST VLQ+VSDA 
Sbjct: 1261 QILHPNILKVEPVEKKSCIKSEENFLAVRNSDTGVIGREQMLNQDVSSSTLVLQDVSDAD 1320

Query: 1321 QKPMNR--DDYAEHQNNLSHDSES-KFPRSYPFNKQIFEDINRNINRTYFPVVQGLSKPD 1380
            QKPMNR  DD  EH+NNL  +SES KFPRSYPFNKQIFEDINRNIN TYFPVVQGLSKPD
Sbjct: 1321 QKPMNRDKDDDDEHRNNLLRNSESPKFPRSYPFNKQIFEDINRNINHTYFPVVQGLSKPD 1380

Query: 1381 INCSSSYVSEGHYLQNCNSSKPHNPAELPFLPQNVDLGHDRQKKALCSGSASDSDVPRRK 1440
            INC++ YV EG YLQNCNSSKPHNPAELPFL QN++LGH+ QK A  SGSASDSDVPRRK
Sbjct: 1381 INCNNKYVPEGQYLQNCNSSKPHNPAELPFLSQNIELGHNHQKNASGSGSASDSDVPRRK 1440

Query: 1441 GDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSCDIGENVPLRSYGFWDGSRIQ 1500
            GDVKLFGQILSHAPS+QNSSSGSNE GE+KGLH SSSKSCD+GE+VPLRSYGFWDGSRIQ
Sbjct: 1441 GDVKLFGQILSHAPSQQNSSSGSNECGEKKGLHNSSSKSCDMGEHVPLRSYGFWDGSRIQ 1500

Query: 1501 TGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTNNGDRSLNGLVSAFPTKDGVV 1560
            TGLSALPDSAILQ+KYPAAFSGYS TSVKTEQQ LQAL NN D+SLN +VSAFPTKDGVV
Sbjct: 1501 TGLSALPDSAILQSKYPAAFSGYSGTSVKTEQQTLQALANNSDQSLNEVVSAFPTKDGVV 1560

Query: 1561 DYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG 1620
            DYHSYRSRDGVK+RPFPVDIFSEM RRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG
Sbjct: 1561 DYHSYRSRDGVKMRPFPVDIFSEMHRRNGFDAVSLSSLQQQGRVLVGMNVVGRGGILMGG 1620

Query: 1621 SCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGGNGGDLGSR 1664
            SCTGVSDPVAAIKMHY+KA+QY GQPGS FTREDGSWRGG GGDLGSR
Sbjct: 1621 SCTGVSDPVAAIKMHYAKADQYAGQPGSMFTREDGSWRGGKGGDLGSR 1647

BLAST of CcUC05G103030 vs. ExPASy TrEMBL
Match: A0A6J1GWV0 (uncharacterized protein LOC111458252 OS=Cucurbita moschata OX=3662 GN=LOC111458252 PE=4 SV=1)

HSP 1 Score: 2620.5 bits (6791), Expect = 0.0e+00
Identity = 1386/1687 (82.16%), Positives = 1485/1687 (88.03%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSA RWRDS+HGSREFNRWGSADFRRPTGHGK G 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSATRWRDSYHGSREFNRWGSADFRRPTGHGKLGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSEE+SHGYGPSRSFSDRVLEDESFRPSVPRGDG+Y RIGRE RGSFS RDWR HS+
Sbjct: 61   WHQFSEETSHGYGPSRSFSDRVLEDESFRPSVPRGDGKYNRIGRESRGSFSQRDWRGHSK 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQS--------------------VH 180
            E +  FGNPSRRPS  SQD SSDQRS+DDTVTYSSPQS                    V+
Sbjct: 121  ENSKEFGNPSRRPS--SQDASSDQRSLDDTVTYSSPQSDFVSVSDKIHSKDRNDKVGGVY 180

Query: 181  GLENGPRADVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIES 240
            GL NGPR+DVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEK DLP RVASP++S
Sbjct: 181  GLGNGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKTDLPRRVASPLQS 240

Query: 241  PSAETTACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAE 300
            PS E TAC+TSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSL+KEV +LSS+SAE
Sbjct: 241  PSTEATACLTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVTVLSSSSAE 300

Query: 301  LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGS 360
            LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSK AS+DG+ICSSPGS
Sbjct: 301  LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKAASVDGIICSSPGS 360

Query: 361  SSQN-LQKLLSSIEMMEICSIANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEIS 420
            SSQN LQKL SSIE +EI SI NLGSSLVELF+SDDP+TVESCFGKSTLNKLLAYKGEIS
Sbjct: 361  SSQNHLQKLFSSIEKVEISSITNLGSSLVELFNSDDPNTVESCFGKSTLNKLLAYKGEIS 420

Query: 421  KKLETTESEIDSLENELKSLKSGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPA 480
            K LETTESEID LENELKSLKS NGGNVSH KSCS  HLVE+V YFKEQDGVSC+A RPA
Sbjct: 421  KTLETTESEIDFLENELKSLKSENGGNVSHPKSCSAVHLVESVPYFKEQDGVSCIASRPA 480

Query: 481  PLVIVSSSDATVEKMPVCKGDMGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVV 540
            PL IVSSSDATVEKMPVC GD G+EDV TKADEIDSPGTVTSKFNEPSRVVK +AS+LV 
Sbjct: 481  PLKIVSSSDATVEKMPVCIGDKGIEDVGTKADEIDSPGTVTSKFNEPSRVVKAVASNLVE 540

Query: 541  NGHCSEVTDVIVPDKMEGNFPVSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGS 600
            N HCSE TD IVPDKMEG+F  S  FVDEH TIGS NECILAKSCT ESIYGDL  QA  
Sbjct: 541  NDHCSEATDSIVPDKMEGSFKKSGPFVDEHLTIGSGNECILAKSCTSESIYGDLTTQANC 600

Query: 601  RSSLCDHIFVCNKEYASRAAEVIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQ 660
             SS  D IF  NKEYAS+A EVIFK+LP EMCKIS++STKIVSC E EKLVKEK  MRRQ
Sbjct: 601  GSSFRDLIFARNKEYASKATEVIFKELPTEMCKISTQSTKIVSCFETEKLVKEKIAMRRQ 660

Query: 661  FLKFKESALTLRFKALQQSWKEGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRL 720
            FLKFKESALTLRFKALQ SWKEGLLHSVKK RSRPQKKELSLRV+HSGHQKYRSS RSR 
Sbjct: 661  FLKFKESALTLRFKALQHSWKEGLLHSVKKSRSRPQKKELSLRVTHSGHQKYRSSIRSRF 720

Query: 721  VQQGACQNPTLNTEIAVRYSSKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLV 780
            VQ G  QNP +N+EIA+RYSSKLLLNPQ+KLYRN+LKMPAMILDK EKMALRFISHNGLV
Sbjct: 721  VQHGETQNPVINSEIAIRYSSKLLLNPQVKLYRNTLKMPAMILDKNEKMALRFISHNGLV 780

Query: 781  EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKS 840
            EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDF+K+SSFLDLKTTADCIQFYYKNHKS
Sbjct: 781  EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFRKVSSFLDLKTTADCIQFYYKNHKS 840

Query: 841  DSFKKNKNLELGKQVKSSAITYLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQ 900
            DSFKKNKNLELGKQVKSSA+TY++TSGKKWNPD+NAT+LDILGVAS MAAQAD +IGNQQ
Sbjct: 841  DSFKKNKNLELGKQVKSSAVTYMLTSGKKWNPDVNATNLDILGVASEMAAQADGNIGNQQ 900

Query: 901  KCTRHLGMGRDVESKVSFSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCI 960
             C RHLGMG D+ SKVS+SASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCI
Sbjct: 901  NCNRHLGMGGDIGSKVSWSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCI 960

Query: 961  TSAIDPSEDHWERKCYKVDSAVKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSI 1020
            TSAIDPSEDH ERKC+KVDSA K PS SDV+QKTDN EPCSDDSSEDVDSSNWTDEEKSI
Sbjct: 961  TSAIDPSEDHKERKCHKVDSATKFPSTSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKSI 1020

Query: 1021 FMQAVSSYGKDFDMISRCIRSKSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDAS 1080
             MQAVSSYGKDFDMISRC+RSKSRDQCKVFFSKARKCLGLDL+H SGDVG TPGSGND+S
Sbjct: 1021 LMQAVSSYGKDFDMISRCVRSKSRDQCKVFFSKARKCLGLDLIHNSGDVG-TPGSGNDSS 1080

Query: 1081 GSGTDTEDHCIVEICGAHGSDEFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENT 1140
            GSGTDT+DHC+VE CGA  SDEFVSKSVNG+STSV INHEESVSAVT NMR SSEFEE+T
Sbjct: 1081 GSGTDTDDHCVVETCGARSSDEFVSKSVNGLSTSVIINHEESVSAVTANMRNSSEFEEST 1140

Query: 1141 VLQQSDEKCAEAVGNLISEISKEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENT 1200
              +Q D   AEAV NL+SEISKEED+P+ DSHSA +LTNAAA  SQP HDHKIEG SENT
Sbjct: 1141 AFEQLDVTGAEAVVNLVSEISKEEDVPNLDSHSACSLTNAAAFPSQPAHDHKIEGCSENT 1200

Query: 1201 EGGSKCCNEPDILRSESVSTVDENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSV 1260
            E   K CN+PDILR ESV+TVDENSAAVSESRAT +LAFGGEE+G +TNLH QS+LQ SV
Sbjct: 1201 E-ACKRCNDPDILRPESVATVDENSAAVSESRATTELAFGGEEDGSDTNLHGQSMLQRSV 1260

Query: 1261 QNSTGFDSKLALEGSSLGLDPQILHPTVLKVEHV-EKSCVESENSLAVGNSEPGVIGREQ 1320
            Q+STGF+S L LE  SLG DP+I HP +LKV+ V  KSC++ ENSL V NS  GV+GRE+
Sbjct: 1261 QDSTGFNSNLDLE--SLGFDPRISHPKILKVDSVANKSCIKDENSL-VRNSGLGVVGREE 1320

Query: 1321 MLNQYMLSSTAVLQEVSDAHQKPMNRDDYAEHQNNLS-HDSESKFPRSYPFNKQIFEDIN 1380
            MLNQ M  ST VLQ V DAHQKPMNRDD ++HQN LS H   S+FP SYPFNKQI EDIN
Sbjct: 1321 MLNQDMFPSTLVLQGVGDAHQKPMNRDDCSDHQNRLSRHIESSEFPSSYPFNKQIVEDIN 1380

Query: 1381 RNINRTYFPVVQGLSKPDINCSSSYVSEGHYLQNCNSSKP--HNPAELPFLPQNVDLGHD 1440
            RNIN T FP  QGLSK  INC+ +YV E  YLQNCNSSK   H  AELP LPQNV+LGHD
Sbjct: 1381 RNINHTDFPAFQGLSK--INCNGTYVVEDCYLQNCNSSKEPCHRAAELPLLPQNVELGHD 1440

Query: 1441 RQKKALCSGSASDSDVPRRKGDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSC 1500
             Q  + CSG+ASDSDVPR KGDVKLFGQILSHAPS QNSSSGSN+ G+EK  HK   KS 
Sbjct: 1441 HQNTS-CSGNASDSDVPRSKGDVKLFGQILSHAPSLQNSSSGSNDCGDEKEFHK-LRKSY 1500

Query: 1501 DIGENVPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTN 1560
            D+GENVPLRSYGFW+GSR+QTGLSALPDSAILQAKYPAAFSGYS+TS+KTEQQPL+AL N
Sbjct: 1501 DMGENVPLRSYGFWNGSRMQTGLSALPDSAILQAKYPAAFSGYSSTSLKTEQQPLRALAN 1560

Query: 1561 NGDRSLNGLVSAFPTKDGVVDYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQ 1620
            NGDR+LN LVSAFPTKDGVVDY SYRSRDGV +RPFPVD+FSEM RRNG+D +SLSSLQQ
Sbjct: 1561 NGDRNLNELVSAFPTKDGVVDYQSYRSRDGVNMRPFPVDLFSEMHRRNGYDPLSLSSLQQ 1620

Query: 1621 QGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGG 1663
            QGRV+VGMNVVGRGGILMGGSCTGVSDPVAAIKMHY+K++QYVGQPGSTFTREDGSWRGG
Sbjct: 1621 QGRVVVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYAKSDQYVGQPGSTFTREDGSWRGG 1675
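
The percentages in each HSP summary line follow directly from the reported counts over the alignment length. The sketch below is only an illustration, not part of the BLAST report: it parses one summary line and recomputes the two percentages. The names `HSP_STATS`, `parse_hsp_stats`, and `percent` are hypothetical helpers, and the pattern assumes the line layout shown on this page.

```python
import re

# Assumed layout of the HSP summary line as it appears on this page.
# "Posi?tives" also accepts the "Postives" spelling some report renderers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positive counts and the reported percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),  # alignment length, gaps included
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    line = "Identity = 1386/1687 (82.16%), Positives = 1485/1687 (88.03%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    print(percent(stats["identities"], stats["length"]))
    print(percent(stats["positives"], stats["length"]))
```

Both percentages are taken over the alignment length (1687 columns here), which includes gap columns, so they need not match a percentage computed over the query length alone.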

BLAST of CcUC05G103030 vs. ExPASy TrEMBL
Match: A0A6J1JPM1 (uncharacterized protein LOC111486582 OS=Cucurbita maxima OX=3661 GN=LOC111486582 PE=4 SV=1)

HSP 1 Score: 2617.8 bits (6784), Expect = 0.0e+00
Identity = 1390/1687 (82.39%), Positives = 1479/1687 (87.67%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAARWRDSHHGSREFNRWGSADFRRPTGHGKQGS 60
            MPPEPLPWDRKDLFKERKHEKSEAIGSA RWRDS+HGSREFNRWGSADFRRPTGHGK G 
Sbjct: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSATRWRDSYHGSREFNRWGSADFRRPTGHGKLGG 60

Query: 61   WHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRDWRSHSR 120
            WHQFSEE+SHGYGPSRSFSDRVLEDESFRPSVPRGDG+Y RIGRE RGSFS RDWRSHS+
Sbjct: 61   WHQFSEETSHGYGPSRSFSDRVLEDESFRPSVPRGDGKYNRIGRESRGSFSQRDWRSHSK 120

Query: 121  ETNNGFGNPSRRPSSASQDVSSDQRSVDDTVTYSSPQS--------------------VH 180
            E +  FGNPSRRPS  SQD SSDQRS+DDTVTYSSPQS                    V+
Sbjct: 121  ENSKEFGNPSRRPS--SQDASSDQRSLDDTVTYSSPQSDFVSVSDKIHSKDRNDKVGGVY 180

Query: 181  GLENGPRADVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKADLPLRVASPIES 240
            GL NGPR+DVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEK DLP RVASP++S
Sbjct: 181  GLGNGPRSDVEVSLGSTDWKPLKWSRSGSLSSRGSAYSSSTNSKNEKTDLPRRVASPLQS 240

Query: 241  PSAETTACVTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAE 300
            PSA+ TAC+TSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSL+KEV +LSS+SAE
Sbjct: 241  PSADATACLTSSLPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLRKEVTVLSSSSAE 300

Query: 301  LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKGASLDGMICSSPGS 360
            LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSK AS+DGMICSSPGS
Sbjct: 301  LTHSLGSNFAEKSPKTLPFSDCASPATPSSFACSSSSGLEDKPFSKAASVDGMICSSPGS 360

Query: 361  SSQN-LQKLLSSIEMMEICSIANLGSSLVELFHSDDPSTVESCFGKSTLNKLLAYKGEIS 420
            SSQN LQKL SSIE +EI SI NLGSSLVELF+SDDPS+VESCFGKSTLNKLL YKGEIS
Sbjct: 361  SSQNHLQKLFSSIEKVEISSITNLGSSLVELFNSDDPSSVESCFGKSTLNKLLTYKGEIS 420

Query: 421  KKLETTESEIDSLENELKSLKSGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPA 480
            K LETTESEID LENELKSLKS NGGNVSH KSCS  HLVE+V YFKEQDGVSC+APRPA
Sbjct: 421  KTLETTESEIDFLENELKSLKSENGGNVSHPKSCSAVHLVESVPYFKEQDGVSCIAPRPA 480

Query: 481  PLVIVSSSDATVEKMPVCKGDMGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVV 540
            PL IVSSSDATVEKMPVC GDMG+ED  TKADEIDSPGTVTSKFNEPSRVVK +ASDLV 
Sbjct: 481  PLKIVSSSDATVEKMPVCIGDMGIEDGSTKADEIDSPGTVTSKFNEPSRVVKAVASDLVE 540

Query: 541  NGHCSEVTDVIVPDKMEGNFPVSRSFVDEHKTIGSDNECILAKSCTKESIYGDLMAQAGS 600
            N HCSE TD IVP KMEG+   S  FVDEH TIGS NECILAKSCT ESIYGD+  QA S
Sbjct: 541  NDHCSEATDSIVPHKMEGSSKKSGPFVDEHLTIGSGNECILAKSCTSESIYGDMTTQADS 600

Query: 601  RSSLCDHIFVCNKEYASRAAEVIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQ 660
             SSLCD IF  NKEYAS+AAEVIFK+LP EMCKIS++S KIVSC E EKLVKEK  MRRQ
Sbjct: 601  GSSLCDLIFARNKEYASKAAEVIFKELPTEMCKISTQSIKIVSCFETEKLVKEKIAMRRQ 660

Query: 661  FLKFKESALTLRFKALQQSWKEGLLHSVKKCRSRPQKKELSLRVSHSGHQKYRSSTRSRL 720
            FLKFKESALTLRFKALQ SWKEGLLHSVKK RSRPQKKELSLRV+HSGHQKYRSS RSR 
Sbjct: 661  FLKFKESALTLRFKALQHSWKEGLLHSVKKSRSRPQKKELSLRVTHSGHQKYRSSIRSRF 720

Query: 721  VQQGACQNPTLNTEIAVRYSSKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLV 780
            VQ G CQNP +N+EIA+RYSSKLLLNPQ+KLYRN+LKMPAMILDK EKMALRFISHNGLV
Sbjct: 721  VQHGECQNPVVNSEIAIRYSSKLLLNPQVKLYRNTLKMPAMILDKNEKMALRFISHNGLV 780

Query: 781  EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKS 840
            EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDF+KISSFLDLKTTADCIQFYYKNHKS
Sbjct: 781  EDPCAVEKERNMINPWTSAEREIFWEKLSLFGKDFRKISSFLDLKTTADCIQFYYKNHKS 840

Query: 841  DSFKKNKNLELGKQVKSSAITYLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQ 900
            DSFKKNKNLELGKQVKSSA TY++TSGKKWNPD+NATSLDILGVAS MAAQAD DI NQQ
Sbjct: 841  DSFKKNKNLELGKQVKSSAATYMLTSGKKWNPDVNATSLDILGVASEMAAQADVDIENQQ 900

Query: 901  KCTRHLGMGRDVESKVSFSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCI 960
            KC RHLGMGRD+ SKVS+SASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSC+
Sbjct: 901  KCNRHLGMGRDIGSKVSWSASTPSNKNNLDALQTEKETVAADVLAGICGSISSEALSSCL 960

Query: 961  TSAIDPSEDHWERKCYKVDSAVKLPSLSDVIQKTDNEEPCSDDSSEDVDSSNWTDEEKSI 1020
            TSAIDPSEDH ERKC+KVDSA KLPS SDV+QKTDN EPCSDDSSEDVDSSNWTDEEKSI
Sbjct: 961  TSAIDPSEDHKERKCHKVDSATKLPSTSDVMQKTDN-EPCSDDSSEDVDSSNWTDEEKSI 1020

Query: 1021 FMQAVSSYGKDFDMISRCIRSKSRDQCKVFFSKARKCLGLDLMHTSGDVGETPGSGNDAS 1080
             MQAVSSYGKDFDMISRC+RSKSRDQCKVFFSKARKCLGLDL+HTSGDVG TPGSGND+S
Sbjct: 1021 LMQAVSSYGKDFDMISRCVRSKSRDQCKVFFSKARKCLGLDLIHTSGDVG-TPGSGNDSS 1080

Query: 1081 GSGTDTEDHCIVEICGAHGSDEFVSKSVNGVSTSVNINHEESVSAVTVNMRTSSEFEENT 1140
            GSGTDT+DHC+VE CGA  SDEFVSKSVNG+STSV INHEESVSAVT NMR SS+FEE+T
Sbjct: 1081 GSGTDTDDHCVVETCGARSSDEFVSKSVNGLSTSVIINHEESVSAVTANMRNSSQFEEST 1140

Query: 1141 VLQQSDEKCAEAVGNLISEISKEEDLPSPDSHSAYNLTNAAASLSQPVHDHKIEGSSENT 1200
              +Q D   AEAVGNL+SEISKEED P+ DSHSA +LTNAAA  SQP HDHKIEG SENT
Sbjct: 1141 AFEQLDVTGAEAVGNLVSEISKEEDAPNLDSHSACSLTNAAAFPSQPAHDHKIEGCSENT 1200

Query: 1201 EGGSKCCNEPDILRSESVSTVDENSAAVSESRATAKLAFGGEEEGRNTNLHVQSILQCSV 1260
            E   K CNEPDILR ESV+TVDENSAAVSESRAT +LAFGG E+G +TNLH QS+LQ S 
Sbjct: 1201 E-ACKRCNEPDILRPESVATVDENSAAVSESRATTELAFGG-EDGSDTNLHGQSMLQRSF 1260

Query: 1261 QNSTGFDSKLALEGSSLGLDPQILHPTVLKVEHV-EKSCVESENSLAVGNSEPGVIGREQ 1320
            Q+STGF+S LALE  SLG DPQI HP +LKV+ V  KSC++ ENSL V NS PG+IGRE+
Sbjct: 1261 QDSTGFNSNLALE--SLGFDPQISHPKILKVDSVANKSCIKDENSLVVRNSGPGIIGREE 1320

Query: 1321 MLNQYMLSSTAVLQEVSDAHQKPMNRDDYAEHQNNLS-HDSESKFPRSYPFNKQIFEDIN 1380
            MLNQ M  S  VLQ V DAHQKPMNRDD A+HQN LS H   S+FP SYPFNKQI EDIN
Sbjct: 1321 MLNQDMFPSALVLQGVGDAHQKPMNRDDCADHQNRLSRHIESSEFPSSYPFNKQIVEDIN 1380

Query: 1381 RNINRTYFPVVQGLSKPDINCSSSYVSEGHYLQNCNSSKP--HNPAELPFLPQNVDLGHD 1440
            RNIN T FP  QGLSK  INC+ +YV E  Y QNCNSSK   H  AELP LP+NV+LGHD
Sbjct: 1381 RNINHTDFPAFQGLSK--INCNGTYVVEDCYPQNCNSSKEPCHRAAELPLLPKNVELGHD 1440

Query: 1441 RQKKALCSGSASDSDVPRRKGDVKLFGQILSHAPSKQNSSSGSNEGGEEKGLHKSSSKSC 1500
             Q  + CSG+ASDSDVP RKGDVKLFGQILSHAPS QN SSGSN+  EEK  HK  SKS 
Sbjct: 1441 HQNTS-CSGNASDSDVPHRKGDVKLFGQILSHAPSLQNLSSGSNDCREEKEFHKLRSKSY 1500

Query: 1501 DIGENVPLRSYGFWDGSRIQTGLSALPDSAILQAKYPAAFSGYSATSVKTEQQPLQALTN 1560
            D+GENVPLRSY FWDGSRIQTGLS LPDSAILQAKYPAAFSGYSATS+KTEQQPL+A  N
Sbjct: 1501 DMGENVPLRSYCFWDGSRIQTGLSTLPDSAILQAKYPAAFSGYSATSLKTEQQPLRAFAN 1560

Query: 1561 NGDRSLNGLVSAFPTKDGVVDYHSYRSRDGVKLRPFPVDIFSEMQRRNGFDAVSLSSLQQ 1620
            NGDR+LN LVSAFPTKDGVVDY SYR RDGV +RPFPVD+FSEM RRNG+D +SLSSLQQ
Sbjct: 1561 NGDRNLNELVSAFPTKDGVVDYQSYRIRDGVNMRPFPVDLFSEMHRRNGYDPLSLSSLQQ 1620

Query: 1621 QGRVLVGMNVVGRGGILMGGSCTGVSDPVAAIKMHYSKAEQYVGQPGSTFTREDGSWRGG 1663
            QGRV     VVGRGGILMGGSCTGVSDPVAAIKMHY+K++QYV QPGSTFTREDGSWRGG
Sbjct: 1621 QGRV-----VVGRGGILMGGSCTGVSDPVAAIKMHYAKSDQYVRQPGSTFTREDGSWRGG 1670

BLAST of CcUC05G103030 vs. TAIR 10
Match: AT3G52250.1 (Duplicated homeodomain-like superfamily protein )

HSP 1 Score: 518.1 bits (1333), Expect = 2.7e-146
Identity = 533/1715 (31.08%), Positives = 774/1715 (45.13%), Query Frame = 0

Query: 1    MPPEPLPWDRKDLFKERKHEKSEAIGSAA--RWRD---SHHGSREF-NRWGSADFRRPTG 60
            MP +   WDRK+L ++RKH++ E    +   RWRD   SHH  REF +R GS DFRRP+ 
Sbjct: 1    MPQDHASWDRKELLRQRKHDRPEQSFESPPFRWRDSPSSHHVPREFSSRLGSGDFRRPSC 60

Query: 61   HGKQGSWHQFSEESSHGYGPSRSFSDRVLEDESFRPSVPRGDGRYIRIGREIRGSFSHRD 120
            HGKQG  HQF EE+SHGY  SRS S R+   +++RPS  RGD RY R  R+ R S S ++
Sbjct: 61   HGKQGGRHQFVEETSHGYTSSRS-SARMF--DNYRPSASRGDWRYTRNCRDDRVSVSQKE 120

Query: 121  WRSHSRETNNG------------------------------------------------- 180
            W+ ++ E +NG                                                 
Sbjct: 121  WKCNTWEMSNGSSRSFERPFGIRNGRRSVDERPLHASDTHSTVVNSLDPANSAHYLDNEI 180

Query: 181  ------------------------------------------FGNPSRRPSSASQDVSSD 240
                                                      +GN    P+    D+   
Sbjct: 181  STPVRSLKIKNEHKFSDQRLSLPSDPHSECISLFERPSSENNYGNKVCSPAKQCNDLMYG 240

Query: 241  QRSVDDT-----------------VTYSSPQ---SVHG---LENGPRADVEVSLGSTDWK 300
            +R V D                  +    PQ   S+HG   ++   +   E SLG+T   
Sbjct: 241  RRLVSDNSLDAPIPNAELEGTWEQLRLKDPQDNNSLHGINDIDGDRKCAKESSLGATGKL 300

Query: 301  PLKWSRSGSLSSRGSAYSSST--------NSKNEKADLPLRVASPIESPSAETTACVTSS 360
            PL W+ SGS +S+ S +S S+        +S + K ++  ++ +  +S S + TAC T++
Sbjct: 301  PL-WNSSGSFASQSSGFSHSSSLKSLGAVDSSDRKIEVLPKIVTVTQSSSGDATACATTT 360

Query: 361  LPSEDAISRKKPRLGWGDGLAKYEKEKVEVPDGSLKKEVALLSSASAELTHSLGSNFAEK 420
              SE+  SRKK RLGWG+GLAKYEK+KV+V   +  ++   L     E  HSL  N A+K
Sbjct: 361  HLSEEMSSRKKQRLGWGEGLAKYEKKKVDV---NPNEDGTTLMENGLEELHSLNKNIADK 420

Query: 421  SPKTLPFSDCASPATPSSFACSSSSGLEDKPFSK---GASLDGMICSSPGS-SSQNLQKL 480
            SP      D  SP TPSS ACSSS G  DK   K    AS    +C SP   SS +L++ 
Sbjct: 421  SPTAAIVPDYGSPTTPSSVACSSSPGFADKSSPKAAIAASDVSNMCRSPSPVSSIHLERF 480

Query: 481  LSSIEMMEICSIANLGSSLVELFHSDDPSTVESCFGKST-LNKLLAYKGEISKKLETTES 540
              +IE ++  S+   G  L EL  +DD  T +S   + T +N LLA+KGEI K +E TES
Sbjct: 481  PINIEELDNISMERFGCLLNELLGTDDSGTGDSSSVQLTSMNTLLAWKGEILKAVEMTES 540

Query: 541  EIDSLENELKSLKSGNGGNVSHKKSCSVTHLVENVTYFKEQDGVSCVAPRPAPLVIVSSS 600
            EID LEN+ ++LK   G   S     S      +    KEQ   S       P    SS 
Sbjct: 541  EIDLLENKHRTLKL-EGRRHSRVVGPSSYCCDGDANVPKEQASCSL-----DPKATASSV 600

Query: 601  DATVEKMPVCKGDMGVEDVDTKADEIDSPGTVTSKFNEPSRVVKTLASDLVVNGHCSEVT 660
              T+ + PV +   G+  V     E DSPG            VK L+          E  
Sbjct: 601  AKTLVRAPVHQA--GLAKVPADVFE-DSPGE-----------VKPLSQSFAT----VERE 660

Query: 661  DVIVPDKMEGNFPVSRSFVD--EHKTIGSDNECILAKSCTKESIYGDLMAQAGSRSSLCD 720
            + I+P       P  ++ V   E  T    N+  +  S   +S+       A        
Sbjct: 661  EDILP------IPSMKAAVSSKEINTPAFANQETIEVSSADDSM-------ASKEDLFWA 720

Query: 721  HIFVCNKEYASRAAEVIFKKLPVEMCKISSKSTKIVSCSEIEKLVKEKFLMRRQFLKFKE 780
             +   NK+YA  ++ V  + LP +     +     +  ++ +  V+EK   R   L+ +E
Sbjct: 721  KLLSANKKYACESSGVFNQLLPRDFNSSDNSRFPGICQTQFDSHVQEKIADRVGLLRARE 780

Query: 781  SALTLRFKALQQSWKEGLLH-SVKKCRSRPQKK-ELSLRVSHSGHQKYRSSTRSRLVQQG 840
              L L+FKA Q SWK+ L   ++ K +S+  KK EL     + G+ K   S R R     
Sbjct: 781  KILLLQFKAFQLSWKKDLDQLALAKYQSKSSKKTELYPNAKNGGYLKLPQSVRLRFSSSA 840

Query: 841  ACQNPTLNTEIAVRYSSKLLLNPQIKLYRNSLKMPAMILDKKEKMALRFISHNGLVEDPC 900
              ++  + T   V Y  KLL    +K +R+ LKMPAMILD+KE++  RFIS NGL+EDPC
Sbjct: 841  PRRDSVVPTTELVSYMEKLLPGTHLKPFRDILKMPAMILDEKERVMSRFISSNGLIEDPC 900

Query: 901  AVEKERNMINPWTSAEREIFWEKLSLFGKDFKKISSFLDLKTTADCIQFYYKNHKSDSFK 960
             VEKER MINPWTS E+EIF   L++ GKDFKKI+S L  KTTADCI +YYKNHKSD F 
Sbjct: 901  DVEKERTMINPWTSEEKEIFLNLLAMHGKDFKKIASSLTQKTTADCIDYYYKNHKSDCFG 960

Query: 961  K-NKNLELGKQVKSSAITYLVTSGKKWNPDMNATSLDILGVASIMAAQADYDIGNQQKCT 1020
            K  K    GK+ K    TY++   KKW  +M A SLDILG  SI+AA A      +   T
Sbjct: 961  KIKKQRAYGKEGKH---TYMLAPRKKWKREMGAASLDILGDVSIIAANA-----GKVAST 1020

Query: 1021 RHLGMGRDVESKVSFSASTPSNKNNLDALQ-----TEKETVAADVLAGICGSISSEALSS 1080
            R +   +      S + S   + NN +          K T  ADVLA   G +S E ++S
Sbjct: 1021 RPISSKKITLRGCSSANSLQHDGNNSEGCSYSFDFPRKRTAGADVLA--VGPLSPEQINS 1080

Query: 1081 CITSAIDPSE---DHWERKCYKVDSAVKLPSLSDVI-----------QKTDNEEPCSDDS 1140
            C+ +++   E   DH      K +  VK P +S  +              + ++ CS++S
Sbjct: 1081 CLRTSVSSRERCMDH-----LKFNHVVKKPRISHTLHNENSNTLHNENSNEEDDSCSEES 1140

Query: 1141 SEDVDSSNWTDEEKSIFMQAVSSYGKDFDMISRCIRSKSRDQCKVFFSKARKCLGLD-LM 1200
              +    +WTD+E+S F+Q  S +GK+F  ISR + ++S DQCKVFFSK RKCLGL+ + 
Sbjct: 1141 CGETGPIHWTDDERSAFIQGFSLFGKNFASISRYVGTRSPDQCKVFFSKVRKCLGLESIK 1200

Query: 1201 HTSGDVGETPGSGNDASGSGTDTEDHCIVEI-CGAHGSDEFVSKSVNGVSTSVNINHEES 1260
              SG+V  +    N   G G+D ED C +E   G   +       +N  ++  N+N +  
Sbjct: 1201 FGSGNVSTSVSVDNGNEGGGSDLEDPCPMESNSGIVNNGVCAKMGMNSPTSPFNMNQDGV 1260

Query: 1261 VSAVTVNMRTSSEFEENTVLQQSDEK--CAEAVGNLISEISKEEDLPSPDSHSAYNLT-- 1320
              + + N++      E    +++ +K  C +   NL++        PS  S S  +L   
Sbjct: 1261 NQSGSANVKADLSRSE----EENGQKYLCLKDDNNLVNNAYVNGGFPSLVSESCRDLVDI 1320

Query: 1321 NAAASLSQPVHDHKIEG--SSENTEG---GSKCCNEP-----DILRSESVSTVDENSAAV 1380
            N   S SQ     K     S E  EG        +EP      +L +  V T  E S   
Sbjct: 1321 NTVESQSQAAGKSKSNDLMSMEIDEGVLTSVTISSEPLYCGLSVLSNVIVETPTEISRKG 1380

Query: 1381 SESRATAKLAFGGEEEGRNTNLHVQSILQCSVQ-NSTGFDSKLALEGSSLGLDPQILHPT 1440
            S  +      F  + +          ++Q + +  ++G + + A  G      P+ LH  
Sbjct: 1381 SGDQGATMPKFSSKNQ--------DGVMQAANRTRNSGLEPESAPSGFRY---PECLHHV 1440

Query: 1441 VLKVEHVEKSCVESENSLAVGNSEPGVIGREQMLNQYMLSSTAVLQEVSDAHQK--PMNR 1500
             ++V      C E+   ++     P      +       S  +++ +V + H    P N 
Sbjct: 1441 PIEV------CTENPIGVSAPRGNPNCHAESE-------SGNSLVGQVDETHDLGWPKNN 1500

Query: 1501 DDYAEHQNNLSHDSESKFPRSYPFNKQIFEDINRNINRTYFPVVQGLSKPDINCSSSYVS 1531
             +       L H +  +       N +  ++  R++ +    + +  SK D+   +    
Sbjct: 1501 LELDGRLQVLGHVNPEQIGLLKATNTESCQNPQRSVTQDLSRISR--SKSDLIVKTQRTG 1560

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038892245.1 | 0.0e+00 | 89.25 | uncharacterized protein LOC120081444 [Benincasa hispida]
KAA0034735.1 | 0.0e+00 | 86.81 | Myb_DNA-binding domain-containing protein [Cucumis melo var. makuwa] >TYK09288.1...
XP_004142488.1 | 0.0e+00 | 86.54 | uncharacterized protein LOC101222167 [Cucumis sativus] >KGN52286.1 hypothetical ...
XP_008446909.2 | 0.0e+00 | 86.03 | PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103489481 [Cucumis me...
KAG6601151.1 | 0.0e+00 | 82.81 | Nuclear receptor corepressor 1, partial [Cucurbita argyrosperma subsp. sororia]

Match Name | E-value | Identity | Description
Q4KKX4 | 2.9e-12 | 29.85 | Nuclear receptor corepressor 1 OS=Xenopus tropicalis OX=8364 GN=ncor1 PE=2 SV=1
Q8QG78 | 9.3e-11 | 28.36 | Nuclear receptor corepressor 1 OS=Xenopus laevis OX=8355 GN=ncor1 PE=1 SV=1
Q9WU42 | 1.2e-10 | 21.20 | Nuclear receptor corepressor 2 OS=Mus musculus OX=10090 GN=Ncor2 PE=1 SV=3
Q9Y618 | 1.1e-08 | 19.86 | Nuclear receptor corepressor 2 OS=Homo sapiens OX=9606 GN=NCOR2 PE=1 SV=3
O75376 | 2.5e-08 | 21.26 | Nuclear receptor corepressor 1 OS=Homo sapiens OX=9606 GN=NCOR1 PE=1 SV=2

Match Name | E-value | Identity | Description
A0A5A7SZU1 | 0.0e+00 | 86.81 | Myb_DNA-binding domain-containing protein OS=Cucumis melo var. makuwa OX=1194695...
A0A0A0KWU7 | 0.0e+00 | 86.54 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G623500 PE=4 SV=1
A0A1S3BG74 | 0.0e+00 | 86.03 | LOW QUALITY PROTEIN: uncharacterized protein LOC103489481 OS=Cucumis melo OX=365...
A0A6J1GWV0 | 0.0e+00 | 82.16 | uncharacterized protein LOC111458252 OS=Cucurbita moschata OX=3662 GN=LOC1114582...
A0A6J1JPM1 | 0.0e+00 | 82.39 | uncharacterized protein LOC111486582 OS=Cucurbita maxima OX=3661 GN=LOC111486582...

Match Name | E-value | Identity | Description
AT3G52250.1 | 2.7e-146 | 31.08 | Duplicated homeodomain-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 395..422
None | No IPR available | GENE3D | 1.20.58.1880 | | coord: 947..1045; e-value: 1.8E-16; score: 62.6
None | No IPR available | GENE3D | 1.10.10.60 | | coord: 751..823; e-value: 1.0E-24; score: 88.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..236
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 180..207
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 121..161
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 79..120
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 8..46
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1446..1461
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1640..1663
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 58..75
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 219..236
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1446..1474
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 971..991
None | No IPR available | PANTHER | PTHR47340:SF1 | DUPLICATED HOMEODOMAIN-LIKE SUPERFAMILY PROTEIN | coord: 1..1633
None | No IPR available | PANTHER | PTHR47340 | DUPLICATED HOMEODOMAIN-LIKE SUPERFAMILY PROTEIN | coord: 1..1633
IPR001005 | SANT/Myb domain | SMART | SM00717 | sant | coord: 988..1036; e-value: 4.1E-7; score: 39.6
IPR001005 | SANT/Myb domain | SMART | SM00717 | sant | coord: 771..819; e-value: 3.1E-5; score: 33.4
IPR001005 | SANT/Myb domain | PROSITE | PS50090 | MYB_LIKE | coord: 984..1034; score: 5.993406
IPR001005 | SANT/Myb domain | CDD | cd00167 | SANT | coord: 991..1030; e-value: 8.94761E-7; score: 45.259
IPR017930 | Myb domain | PFAM | PF00249 | Myb_DNA-binding | coord: 990..1031; e-value: 2.1E-6; score: 27.8
IPR017884 | SANT domain | PROSITE | PS51293 | SANT | coord: 987..1038; score: 11.647677
IPR017884 | SANT domain | PROSITE | PS51293 | SANT | coord: 770..821; score: 13.764482
IPR009057 | Homeobox-like domain superfamily | SUPERFAMILY | 46689 | Homeodomain-like | coord: 986..1038
IPR009057 | Homeobox-like domain superfamily | SUPERFAMILY | 46689 | Homeodomain-like | coord: 758..820
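
Several member databases report overlapping coordinates for the SANT/Myb-like hits, so it can help to collapse them into distinct regions on the protein. The sketch below is an illustrative calculation, not something the annotation pipeline is known to do: it copies the SANT/Myb-related coordinates from the InterPro table and merges overlapping intervals. The names `HITS`, `merge_regions`, and `region_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
# (source, source term, start, end) copied from the SANT/Myb-related rows
# of the InterPro table; coordinates assumed 1-based and inclusive.
HITS = [
    ("SMART", "SM00717", 988, 1036),
    ("SMART", "SM00717", 771, 819),
    ("PROSITE", "PS50090", 984, 1034),
    ("CDD", "cd00167", 991, 1030),
    ("PFAM", "PF00249", 990, 1031),
    ("PROSITE", "PS51293", 987, 1038),
    ("PROSITE", "PS51293", 770, 821),
    ("SUPERFAMILY", "46689", 986, 1038),
    ("SUPERFAMILY", "46689", 758, 820),
]


def merge_regions(hits):
    """Merge overlapping (start, end) intervals into distinct regions."""
    intervals = sorted((start, end) for _, _, start, end in hits)
    merged = []
    for start, end in intervals:
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def region_length(start, end):
    """Length in residues of an inclusive coordinate range."""
    return end - start + 1


if __name__ == "__main__":
    for start, end in merge_regions(HITS):
        print(start, end, region_length(start, end))
```

Under these assumptions the nine hits collapse into two regions, 758..821 and 984..1038, which agrees with the PTHR47340 "duplicated homeodomain-like" family name.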

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CcUC05G103030.1 | CcUC05G103030.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0050789 | regulation of biological process
molecular_function | GO:0003677 | DNA binding