MELO3C035074 (gene) Melon (DHL92) v4

Overview
Name: MELO3C035074
Type: gene
Organism: Cucumis melo cv. DHL92 (Melon (DHL92) v4)
Description: Reverse transcriptase
Location: chr11: 18934545 .. 18957829 (-)
RNA-Seq Expression: MELO3C035074
Synteny: MELO3C035074
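
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database's own tooling: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'chr11: 18934545 .. 18957829 (-)'
    into (chromosome, start, end, strand)."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("chr11: 18934545 .. 18957829 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 23285 bp on the minus strand of chr11.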
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGGAGATGGAGGAGCTTACGAATAGTCGGATGGTCGATCTTGAGGATTCATCAGTGGTTGACTGTTCTCAGGCGAAAGACGGAAACGTCAGATTTAGCCGGTTGGATGAAGGAGAAGTCGTGCGGATGGCAACTATGTTCATCAAGTGAAGGAGCTGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGAGAAATTCGGAGGAGGGGGCGACGCACGCGGGAGGGGTGATGGGTTAGGGTTTTGAATATATATATATATATATATATTAAATAATAATAATAATATAATATAATATAATTTATATTATAATATTATTATTATATATATTATATTATATTATATCATATTATTAAAATTTATACCCTTCATTTTCTTTTTCTTTTCTCTTCTAAACCAAACAAAATTTAACACCCAAAATTTTAAATTCCCGAAAATTAACATGAAAATGTTAAAACTCACCTCAAAATTTGGGGCGTTACACTGAGATTGGTTTTCATATTGTTTTTCCACTTTCGGGAGCATAATGATTCTCATCTCTGAGGACCGAAAATGGAGGGTTACACTTACAAGAATTACTAAATAGTTAGTATATTCTTGACCGAAGTAACGGTAACTAAATGCTAAGTAAATATGAGATATTTGTGAGTTAGCATTTGGTCAAGATTGTCTTAATTCAAAGGACGAGTAATCGGCTACCCCTCGGTAGAGGTTATTCTGAACCTTTGAAATATCATTGCAAAACATAATAAGGGGTACATTATTAAAAGTTTTGCTAAAAAGATTGGTTTATTAGGATTATAGTTTCTAAACAAAGTATGTTTGTTTTTTCAGCATGAATAGCTCGATAGTTCAATTATTAGCTTTCGAAAAACTTAACGGCGATAATTATGCGACATGGAAATTGAATCTTAACACGATACTAATGGTCGATGATTTAAGATTTGCCTTAACTGAGGAATGTCCTCAAACCCCTACCTTAATTGCAAACCGAACTAGTCAGGAAGCATACGATCGATGGATAAAAGTCGATGAGAAAGTTCGTGTCTACATCCTTGCTAGCATGTTTGATGTTTTAACAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATAATGGATTCATTAAAAGGAATGCTTGGGCAAGTAGAATGGTCCATAAGACAAGAGGCAATTAAATACATTTACACTGAGAGAATGAAAAAGGGGACCTCTGTTAAAGAACATGTCCTGGACATGATGATGCACTTCAATATTGCTGAAGTAAATGGTGGTGCCATCGATGAGGCTAATCAGGTTAGCTTTATCTTAGAGTCTCTTCCAAAGAGCTTCATACCATTTCAAACAAATGCGTCCCTGAACAAGATAGAGTTTAATCTGACAATCCTTCTAAATGAGCTCTAGCGTTTCTAGAATCTTACCATGGGTAAGAGAAAGGAAGTAGAAGCAAATGTTGCTACTAAAAAAGGAAAATTCTCAAGAGGATTGTTCTTTAAATCTAAAGTTGGACCCTCAAAACCTAATCAAAAAATATAAAAGAAGGAAAAGGGGAAGACTCCCAAACAGAACAAAAGAAAGAAGACTATAGGAAAAAGTAAATGTTACCACTGTGGTAAAAATGGGCATTGGTTAAGAAACTGCCCAAAATACCTTGCTCAGTAAAAAGCAGAGAAGGAAGCACAAGGTAAATATGATTTACTTATTTTTGAAACATGTTTAGTGGAAAATGAAAATTATACTTGGATATTAAATTCGAGAGCCACTAATCATATTTGCTTCTCATTTCAGAAAAATAGTTCTTGGAAAAAGCTTTTTGAGGGCGAGATCACTCTCAAAGTTGGAACTGAAGAAATGGTCTCAGCTAAAGCAGTGGGAGACTTGAAGTTGTTTTTTAATGATAGATATATCATGCTCAAGAATGTCTTGTATGCACCTTAAATGAAGAGAAATTTAATATCTATCTCTTGTATGTTATAACAT
ATGTACATAATATCTTTTGAAATTAATGAAGCATTCATTTTTCAAAAAGGTATTCATGTTTGTTTTGCTATACTTGAAGATAACTTATATAAGTTAAGACCAACACGAGCAAATTTTGTCTTAAATACTGAAATGTTTAGAACAGCTGAAACTCGAAATAAAAGACAAAAAGTTTCTTCCAATGCCTTCTTATGGCACTTAAGACTTGGTCACATAAATCTCAATAGGATTGGGAGATTGGTTAAGAGTGGATTTCTAAGTCAGTTAGAAGATAACTCTTTACCTTCATGTGATTCCTATTTTGAAGGAAAAATTGACCAGAAGATCTTTTACTGGAAAAGATCTTAGAGCCAAAACACCTTTAAAGCTCGTACATTTGGACCTTTGTGGACCAATGAATGTCAAACCTCGGGGAGGATATGAATATTTCATTAGTTTTATTGATGATTATTCAATATATGGTTATATTTACCTAATTCAAAACAAGTCGAATTCTTTTGAAAAGTTCAAAGAATATAAGGCTGAAGTTGAAAATGAATCTGGTAAAACTATAAAGACACTTTGATCAGATCGAGGTGGAGAGTATATGGACTTGCGATTCCAAGATTATTTGATAGAACATGAAATCCAATCACAACTCTCTGCACCTAGTACGCCTCAGAAAAACGTTGTATCAGAAAGAAAAAATCGAACTTTGTTAGACATGGTTCGCTCTATGATGAGTTTTGCTTAGTTGCCAGATTATTTTTGGGGATATGCTTTAGAAATAGCTATCTATATTTTAAACAACGTTCCCTCTAAAAGTGTTTCTGTAACACCTTATGAGCTATGGAAAAGGCGTAAAGAATGTCACTTTAGAATTTGGAGATGTCCAGCACACGTATTGGTACAAAATCCTAAAAGTTGGAACGTCGTTCAAAATTATGCCTATTTGTAGGTTATCGAAAAGAATCTAAAGGTGGTTTATTTTATGACCCTCAAGAAAATAAAATATTTGTATCGACGAATGCTATGCTTTTAGAGGAAGACCACATAAGAAATCATCAAACTCGCAGTAAACTAGTATTAGAAGAAATTTCCAAGAATTCTACAAATAGACCTAGTTCATCTACTAAAGTAGTTGATAAAACTAGGAATATTGGTCAAAAACATCTTTCTCAAGAGTTGGGAGAACCTCGACATAGTGGGAGGGTTGTACGACAGCCTGATTGCTATTTGGGTTTAAGTGAAGCTCAAATCGTCATACCCGATAATGGAATAGAGGATCCATTGACCTATAAACAAGCAATGAATGATGTGGACTGTGACCAATGGGTCAAAGCCATGGACCTCGAATTGGAATCTATGTATTTCAATTCTGTCTGGACTCTAGTAGATCAATCAAATGATGCAAAACATATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTGCAGACTTTTAAAGCTCGACTAGTGGCAAAAGGTTATACACAAAAGGATGGAGTAGATTATGAAGAAACTCTCTCCTGTTGCCATGTTGAAGTTGATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTTAAGACAGTCTTTTTAAACGGAAATCTTAAGGAGAGTATCTATATGGTCCAACCAGAGGGGTTTATATAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAAAATCCATATATGAATTAAAACAAGCATCTAGATCATGGAATATAAGGTTTGATACTGCGATCAAATCTTATGGTTTTGAACAAAATATCGATGAACCTTGTGTTTACAAAGGGATCATCAATACCACAGTAGCATTCTTAGTTTTGTATGTAGATGACATTCTACACATTGGGAATGATGTAGGTCATCTAACTGATATTAAGAAATGGCTAGCTACATAATTCCAAATGAAAGATTTGGGAAATGCTCAATACGTTCTTGGTATCCAAATACTTCGGAACCGAAAGAACAAAA
CCCTAGCTATGTCTCAAACATCTTATATAGACAAAATGTTGTCAAGATATAAGATGCAGAATTCCAAAAGGGTCTATTGTCGTATAGATATGGAATTCATTTATCAAAAGAACAATGTCCAAAGACACCTCAAGAAGTTGAGGATATGAGTAACATTCCCTATGCTTCTGTTGTTAGAAGCCTGATGTATGTAATGTTATGTACTAGACCTGACATTTGCTATTCAGTAGGGATGGTTAGTAGATATCAGTCCAATTCTGGACGTGACCATTGGACAGCCGTTAAGAATATTCTAAAATATCTTAGAAGAATAAAAGACTACATGCTTGTGTATGGTTCTAAGGATCTGATCCTTACTGGATACACTGACTCTGATTTTTAATTTGATAAAGATGCTAGAAAGTCTACATCTAGATCAGTTTTCACTCTGAATGAAGGAGCAGTAGTATAGAGAAGCATAAAACAATCATATATTGTCGACTCCACTATCGAAGCTAAATATGTAGCAGTCTGTGAAGCAGCAAAGGAAGTTGTATGGCTTAAAAAGTTCTTAACAAATTTGGAAGTTGTTCCAAATATGCATCTGCCAATCACCTTATACTGTGACAACAGTGGTGCAGTTGCAAATTCACGAGCACCTAAAAGTCATAAACGAGGAAAGCACATTGAACGAAAGTACCATCTTATCAGGGAAATCGTACATCAAGGAGATGTTACAGTAACAAAAATCTCCTTCGAGCAAAACATGGCTGATCCATTTACAAAAGCTCTCACGGCTAAAGTGTTTTAGAGCCACCTACATGGTTTAGGTCTACGTTGTTTGTAAACTAGGACAAGTGGGAGACTTACTGGGAATGTGCCCTAGTTTAATATATATGTTTATTGTACTCATATATATATTCTTCTCTCATTTTTAAAATTCTAACTTTGTTTATCCTACTAGGAGTTTTAGTCCAAGTGGGATTTTGTTGGAATTTATGTCCTAAAACTCGTACTTTGTTATTTGATTCAATAAATTTGTTATTGAATGCAATAATCTTAAAACCAATAAATTAAGATCCCGGGCTATTTTACTAAGTTTGTCAATACACTTGGACTTTATGTAGAGACATAAACATGGATTAACTTCAAGTTAATAGCTCAAATAGTCTATAGTATATGAATAAGGTTGGGTGCCTTATTCTAGAAAAACACTATGGATGCGGCCTGCTCCATAGTTAGTACAAATGATGTAATTCTGAATCGTTCAAGTGAAAACATGAGAGTGGGGCATTCTATATAAATGGTTTGCCTAAGACTGAAACTACGAAATAGTCACTTTTAGTTATAACGCCGTAAACTATATAAACTAACTATTTCAATTATGATGACCTAGGGAACTTGATCTTAATCTTGAGCTAACTATGAAATTCTGTTCATTCGGTAATATCCTTAGATCTGCATAGGTGAGGGTAGCTCATCATCACTGGCCCAATAAGCCTCCCATTTCAGGGATAAGACCGAGTGGATAGCTAAGGACATAGGGTGCAAGATGGAATTCACTTCTACCCACTTCTAGGATAGTAGATAGGTTGTTCCCTTAAGGACTGAATCCAAGTTTTGAACAAGGGACCCCACCTTCTCATTGGCTCGAGAAGGATTCAGGTTTATATGTTGGACCTTAAACCAATTGTTCAATAGTGAATCAATGGGACTTGAGGAGCAAGATGTAATCTCGGGGGTAAAACGGTATTTTGACCTAGCCAAGATTACGAACAATCTGTGAAGGATCAACTTACCCATCATGGTTATATCAGGTGGACAAAAATATATCTATAGTGAGGGGAGTGCAACTATGAGTCTTTAGTGGAATGACTCATTAGTTAACGAATGTTGCTTAAGCTTGGTCTTAAAGAGTTTAGCCAGTTAATCTCGAATTGTTGGAGTCCATGAACTATAGGTCCATTAGGTTCCCCTACTAGCACGTATGGATTCAACTTAGAACAATATGTTGGAATAAC
TCGAATTGTTCGAATTAAGTAAAAAGAGAGAAATTGATGAGTATATATGATATAAGTCGGTAAATATAAACTTTAAGCTTTATGTTTAAGTATGATCTAAATATGAATATGGATTCATATTTAGAACCTTGGAAAGTTTTGAAACGGTTAAAGTTGTAAAAGTCAACATGTTGACTTTTGACTTTGAAAAACAAAACTTTGACCAAATTTATATTCAAATATGATTTGAATTTTAGAAAAATGAATGTGGATTCATACTCAGGAGGTTGAAATTAGTTAAGACGGATAAAATAACAAAAAGTCAAAAAGTTGACTTTTGACTAAAAAAAGTCAAAGTTTTACTTTGACTTAATTGGTCAAATGACCAAATTGTCCTTTGACTAATATATTTATTACTAAATGTTAGTGGAAAATGTGACAGCTTATAATGCAAGTATAAGCCACTAATTCCATTAATAGTTAATGGATTAATTAGGTGTTGAGATTTCATGAAGTAAATTGCATGCATTTTGCATGTAATTTTCTTTATAAACTTCCATTTACAGAATGAGAAAGAGATGACAAAAATTTCATGAAAAAACTCTTAATAATACACCTTCCTCCATCCATCTCTCTAAGTTCTCTTTCATAAAAATAAGTCCCACAACTCGGTTCTTAGTCTTGAGATTAGTAGGTCAACATAGTGGTTGTCCTTTGCTCGTGATCTTCAGGCAAGAAGAAGTTTTGGAACGAAGAAGAAGTTAAGAACTACAAAGGTAAGCTCATCGTTTACCTTCTCTATTCTTCGTTTAGGTCTGTCGTTTAGTTACAGCATGTTAATTTCTAAATCGTTTAGATGCATATAGAGTAAAGCACGATCCTAGTCTTCCACTGTGCATGCTCTCTTTGTTTTTGTTCCATCAATTTGATAGATGATTTTAGATTCATGTCTATTGATCGTTTCATTCCTAAAGAGATTGTGCATAAGGCTCGTACAAATCTTGAAGTTAATATAAGTGACCAAAAAGCTTTAAGGGTAAAAGAACATATGGTACAGATATTGCAAGGTGACACAATTGAATCGTATGCATTGATTTCAAGCATCTTTGATAAATTGGTTGAATCTAACTCAGGTACGATCAATTTTGTATATCAAATTTAAATTCATTTGCTGATAGATGTGGATAGACTAATATCACAGTTATATAAATAGACAGATATTAGTATCTATCACATTCTTTCTGTTCTTTTTTGTTTGTGCTAATAGAAAGGTGTTTATAGTTATTAAAGGTACATGCACTGCTTTAGAAATAGATGATAGTGGTCATTTCAAGTTTTACTTTATGGCTTTCAGTGCACCAATTGAGGGTGGAAATATTGTAGACTTATTATTTCTATCGATGGTACATTTTTTAAGTGTAAGTTTGGTGGCATCCTATTAACAACCTCATCACAAGATGGTAACAATCAAATTTTCCCTCTTGCTTTTGCCATTGTAAATTCTGAAAATGATGTATCATGGACATAGTTTTTTTAGAAGATACGAAGAAGTTTTTCAGAGCGGGCTAATTTAGTCATTGTTTCTAATAGACATCTTAGCATCCCTAAAGGAGTTTTAAAAGTGTTTCCTGACGTTGAGTATTGTGTGAGCACGAAACATCTTTTAAGTAACTTGAAACTCCATTTTAAGGATCCACTACTTGATAAATATTTCTTTAGTTGTGCTTATGCGTACACTGTTGATGAATTTGAGTACCATATGAGATGCATGGGGTCTGTTTGTCCAAATATCAGAGATTATCTTAGTAGTGTTGGTTTTGAGAAGTGGTCTCGGGCATATTCACGTAGATGAAGATATAGAATGATGACATCAATTGTGCAGAAAGTGTAAATTCAATGTTTAAAAATTTTAGAGAACTACTTGTGGCTACGATGCTTAGTTCAATTAGGGATGTATTGCAAAAGTGATTTTATGAACGAAGTAAAGCTGCTTGTATAATGAAGAGTCATTTGACTTCTTG
AGCGTTGAGAACGTTCTACGTTTAGAACATGAAAAATCGAGAAGATTATTGGTAATATTTTTTTATCTCGGACAGTTAGAGATAAACTGCTATCTTTGTCTATACGATAAATAAAGATAGAAATTTATTTGTTCCTATCTCAGATAAGCCTAGATAGACTACTATCTAGATATATCTGATGTATACAGAGATTGAAAGCTATTTGTTATTATTTTACTAATTGTCCTTATTTGTCTTATGTATTAGGCTGATCCAATTAATATGATTGAATACGAAGTAACGGATGGGATGTTGTAGTTGCCGAGTATAGGATGTTGAAAAAATTCCTTGTGTGCATGCTCTTGCTGTTCTACGAATGGTTAATTTAGATACTTATTCTTATGTATCTGTGGTTTATTATAGAGAAACACTATCAGCAACATACAATGGTTGTATTCAACCTATTGGATCTCATTACGATTGAAGAGTTGTAGATTCTGTCATGAAAATATTACCTCCAATCTTTAAACGGTAAGCTGGAAAACCAAGAAAACAAAGAATTTCATCTATTTGTCCAAAATTATACTACATACTGCTAAACAGTTTCATGTTATCCATAATTAGAAAATTATGGTTTGCATTTGTTACAAGGCATTCAACAAATTTGGCACAAAAAATCTCGCAATCTAATGAAGCACCTTTTTGTAAAGTGGTATTCGATCTTACTACCGACTAAGGGCCATACGCGAAACGTTTTCTTTACGCAAGTATTCCATTAGCAATTGCCAGTGAGGGGATGCATCGTGCAGGTATTGCAATAGCTGTATCAACAAGTTTTTGTTCAACATATTTTAGCATAGAGTCAAACACATAGATCTTGCATTTCTTCATGTCAGCTGCAATAGCCAACAAGTGTTCTTTTATGTTGATGCAGCCAATAACATAGTTGACATCTGCCCACCCTACTTTATCTTGGATTGCACCAATAGCTAAAGTAAAATATTAATCTAAATCTCCTTCTGACCATAGTGCTAAGTGTGTTGTTGTGTATGTCCATTTAACCTCCTTATTGTCTACTGTCAGCACACGATTATCAAAAAGTTGTTGCCTAATTACGGGTTCCTTATTAATACCAGTCTTGAAGAACAAATAAAAAAATTTAGTTAAAATATAATTATGCAGTTTAACTTCTATACCAAAAAAAAATTAAGAATAATAACTTACCATAAATGATAGAATCTAGTAATATAAAACTATATTTTCAAAGATGGTTCACATGTAACATCTTATTCTACATGTGGTACAATAAAAAATCCATTGTCTACAAAAAAACCCAAACGATCGTTTAGTGAATACAAGTAATAGTTTATCAAAATTCAAGCCATCATTTAGCAAATACAAGTGATTGTTTACAACATTCTTTTCTAAACTAAGCGATTGTGTACAATGTTTTTTGTTTGCTTATCCAATCAAACGTTCGTGCATTATATATAAACGATCGTTTGGTTAAATAAAGAATATTGTACTCGATCGTGCAGTTTTATGCACGATCGTTTGGTTAAATAAAGAATATTGTACTCGATCGTGCAGTTTTATGTATATAATAAAATTTGGAAAGGGTACTTACGTCTCCTAATATCCATGTGTTTTCTTCAAGCTTTTTGAAAAATTCATTCTTTCTGAATGAGAACACTATTGTCCTCTCTTTATTAGTTGGTTTTCTAGATCTCTGCCAAGATAAGTATGCCTTCCACAATTTTGGATCCACTGGTCACACGGCATCATAACTATCTACATCTCGCAAGTTGACATCTGGGACATGTATTGTAGGTTTTAATATGACCTCAGTAGATGTTGTGGGTGGATTAACTTGTTGAACAGACACTTTTTTATCTTCTATTTTACATAGCTTCCTTCTCTTTCTTGGTTTTTGTTGTTCCTTCATTTTCTAGAATGCACAATGTTAAATAACAAAAAGAAAAAAAACAATTGATGCAAAGTACTTCAAAGTTATTATATAT
ACATACTAATAGAGATTTTAAGTCAATTATAGAAATTTGTTTCTCAGTTACTGGTTTATTTTCAACCGAAGTAACTACTTCATCATTTTTTTTAATCCCAACCTGTTGTTTAGGTCTTTCCTATAATGTATGATATTAGACAAGAAACAAACTCTTAATTTAAAGGACTACACAATGATTGTGTATACGTACACTTGAGATTCTATTTTTTTAGCTGAAGTAGATGGCTTGACGTTATTAATTTTTAAACTTTGTTCCTCAGTCACTGGTTCATTTTCTTTTTGTATCTGAATCGAAGTAACTACTTCATCATTTTTTTTGTATTTCCTACAATGCATAATATTAGATAATAAACAAAATTTTAATTCAAAGGACTACACAGTTGTTCTGTATACATATCAACTGAGATTCTATTTTTTCAGCCAAATTAGATGGTTGCACATTGTCAATTGTAAACTTTGTTCCTCAGTTACTGGTTCATTTTCTTTTTGTATCTCAACCAAAGTAACTGCTTAATCATTTTTTTGTATTTACTACAATGCATAATATTAGATAAGAAACAAACTGTTAATTCAAAGGACTACACAGTTGTTGTGTATACATACCAGTTGACATTCTTCATTTACATCCGAAATTAATGGTGGCACTTCGTGAATTTTTAAAATTTGTTTCAATGAAGCTAATGCAAGATTTTTTAGAGTACTTGATGGTAGTATCTCAAAAAGTGGAAGCGGTTAGATCATAAGTGGAAGAGTAGTAAGCATTTCTGGCATGTCTTTTTCTTTCTTACTTTGAGCATTCATATTATCTGATATATTGCTTATAGTTAATAGCAAGTCTGCATCTCTACCGTCTATGCTAATATCACTGCCTGACTCCTCTTGATTTTCAGGTGATGTTGCTTTTTGTTCCCTAAACAAAATGAGAAAAAAAAAAGAATGTGATAAAGAAAAAATACAATTGTTATTTATTTAAAATGTAATCAATTTTCAACAATATAAAGCAGTAAATAATGAATGAAACGCTAACCTTTGAGTTTCCACGATAAGTTCTTGAGTATATTGTTCTTCAGCATATCGTTGCTCAGTGCTGATTATATCTTCTGTATTCTGATGGTCATTATGTTTTCTACATTGTCATGATCAAAATCATTATGAAGATCATCTTACAAGTGATTAGTATCATCATTCTCTTAATTCTGATTCTGATTCTGATTCTGATTCTAATAAATAAATAAAATTTAAAAAAATTGATTGCTAAATGATCGTGTATACGAAAGCAAATGATCATGTATCATATCTAAACGATCATGTAAATGTGATACGTGATCATTAAAATGTGATACACAATCAGATATTTTAGTTAAACGATCGTTTATTTAAACAACTGATAAACAAAATACTTTCTGAAATCAATGATTGCCTTCTAATTGCGTCATTCATCCTACAAACTATCGAACTTAATGTGGATAAATCCTTACGAACTTCGTTGATGTCATTTTTTAGTTTTTAGTAGGATTTTGAATGACTTCATTCTGGTTTGTCTTTGGATCTGGAACGTTTCTTGTTGGTTATTAGATTTGTTTCAGGCTGGTCAACCACTGTTTGTCTTTCTTTTCTTCATGGTGATGGTTGTATAGCAGAATTAAGTCCACCAACATTTTCATTGGACTGAAGCAATGGAGATGAGGTTGATTGTAAGTTATTCAAGTCATTGTGATCCAATGATTGATTGTTCATAGTATCAGCAGGAGTATTGCTATCACTGCTATTACTATTTGGCATGGATTCATTGTCCTCAATATCCATGTTATCATCCCCTCAAATTCGCTCTCCATGAAAGCTTTCTCTTGGGTTGATAGCTTAACTACTACAAATTTGGAATTTAACGACGCAAGTTTTGCATCACAAAGACTTCCAACGACACAATTTTTGCGTCATTGAAACCACTGTCAAGAAAGGATACATGATCATTGTACCAAAGGCTACTCTTAGGTTGAA
CAACAATCTGCAATGTTATAATCAAAGAATTGTTAATCGTTGTATATCAAACTATGCTAGTAAACGATCATCTTAAAATGATTCAAGTTCGATCACGTTTCAAATATTTTAAGCGATCTTTTGCTTTAAGCGATTGCTGACAACTTGTACTTACAATTTTGTTCTCGAATATGTTTTTCTGCAACATTTTGTATGATGGAGCTTATGTACAATTTCATCTTAGTATTCTAAGGATTGATCTCTTGCAGTTTTTTGTGGCCAGGTGCTCACTTATAGTTGATAGTGTTTCATAAGCCCACACTTAAAATAATCAAACAAATATGTTAATCTTTGATGTTTAGTTTGTATTTAACTAAAGTATATAAAGAAATAAAATCACCTGAAAAGCAAGTATGTATCCTTTGATGTTGTAGTATGCCACTACTTTCGAACTTCTTGCCTTTTTTAACTCGTATGCTTATTTTTATCTTGTAGAATTGTTTTCAAACGTTGAGCAAACGAGTGTAAAGAAAGTTGACCAATCAAAGTTCTTAAAAACTTCTTCATCTACTCTTCTCAAAACATCCAAATCAAACTGCGTTTTTTTATCATTCGCGACCGTCACAGTTTCAATAAAGAGGGCTAAGGCAACCTTCACAACATCATCATCATTCCTAAAGTCAAAATTTTTGAAAATTTTCTCAACTTCCAAGCACGTTATTATCATCTTGTTTTGTGATCCAAATAGAGGCTTTATAGGCGCTTATTGTCATAAGTTTTATCCAATGTTACATTGGTTGGCCACAAACCGGTTATAATGTTGAATTCGTTTTGGGTGAAACGAACATCTTTTTCCAAAACAGAAAAACAGATTCCATTTACATTGGCTTCTTCAGGTATCTGCCTCAGCAAGAAATGAAGGATCAATTGGCCGTTGAAGACCATGCTGACATTTAGTAATGGACCAAAAATTGTTTTATGGAACATCAGCAGTTGTTCCTTGGTTAGTTTATCCTTAATAAGAGTGCAAATCTTGTGCACTTGGCATGATATAGTGGCAGGAAAGTATTTATTGTTAGGTACAACCATCCTAAAAATAATGATCATAATATATTTTCGTTATGTACACAATCGTTTATAAAACACTAAACCCTAAACGCTAAAATTAATATGTAAGCACTAAACATTGTTTAGACTGTATCATTCAGTCCAATGTACACGATCCTCTAGTCTAGGTTAAAGTATCGTTTACGTGACGTAACAGATCGATTAATTCAAAAATAAAGAAATTTGTAATACAATTAATCGGCAAGTAATGATAAATTTAAACGTTTAACAATCTTAAGAATAAAAGTTTAACCAAATCGAAAACGTTTGAGAGTAAGAATTTAACAACAGAACGATCGTGTTCAAGACAGCGTCGGAACGTTTAAAATATAAAGAAGACGAAACGTAACATTATAGTACGACAGTGATAACATAGTTTAGCAACAAAAATTTAGAGTTATACTGAAACATACCTTTTGACGTTGAAGAGAAAAATGGGCGTTTGAAAGGAAAGGAGTAAGGAAATAGGAAATGTTTCAGACAAAAAGCTTAGTGATCAGTGCGTTTGAAATAAAGTGACGAAAAACGAAGAAGGTGGTATGAATATTTTGGTTTTTACCAAATGGGCAATATTATAAATTTGCGTGCTTTTTTAAAATGTCGAATCGCATTCATTAACTGTAACGCCCCAAAATTTGGAATTTAATTTTAATATCTTAAGTGTAGTTAGTGAAATTTTAGATGTTTTGAATTAAGTGATTTATGCATATTTGATTTTTATTAGATATTAAGAAAATATCTAATGGTGATATTTTATTTGGGAAAAAGATTGATGTGATTGGTTGATTTTGAATTTAAAAAGGGAGATTTGGTGGGTTTTAATTGAGGAGATAGAAGATTTGTTTTGTTGGTTTAAAATAAGGAAAAGAAAAGAGAAATTAATATTGGTTTATTAATGGTGTTTAAATAAGCAT
AAGGACCCGTGCACAAAAGGAAAAAAAAAACCCTAATTTCTCCTTTTAAACCCTAGCCGTCGCCACCCCGCCACTCCATCACCGCCGCCGACCCGTAGTCCGCCGTGTGTCACCTCCATCCTTCGTTCTCCAGTCGCCCATTTGCCCCACCACCGTTGATCTCATTCGCAACCCGAGCCGACCCTCGTCGTGGCAGTCATCTGTTTCTGCTCCATCGTAGCTCAACTGTCAGCTGTCGTCGATCCCCTTCGTTCATCGTCCGATTTCGTTACCCAAGCCACGTCGGTCGTCACGCCTCTTTGCCGGATTTCCTTCAACACGAGTTGTTACGCCATTGCCCAAAGCACTGCCGCTTCGTCTGTCGTCGTTTGTTAGAGACCGCCACAGCCCGAGCCACACGCGCACCCGACCCGAACAGCCAAGCCCGCGCCCGCATCCATCAGCGAGCCATTCGTGTGGCAAGACCGTGCCGAGCCGCGCCCGCATCCATCAACAGAGTCGCGCCTGCGCCTGCTCCCAACCGAGCCACGCCTGCCTCCAGCCACGAGCCGCTTCATAACCTTAGCCGAGCCACCCCTGTTTTTCCTAGCCGTTTGAGCCACTTAACTTATTTTTGTAAGTTTTGTTTGGATTTTGATCTTCTCCCGTCAAATAACGAATTTTAGACCTTAAATAATTTAATTATGGAGTTAATGGAATTATTTTCGTGATAGGTCAATTGGGGGATTGAACCGAAGATCTTCCCAAACCAAGGCTAAATCTGAAATATTCTTCAACTTTGGGTAAGTTGAGATAAGTGGAATTGGACCCTAGGCAATTAATAATTAATTTATTAATTTTAGTCATTGGGTTCATTTGTGCTTAGGACTTGACTATTGAGGAGTTTGAGCGTGACTGTTAGGAAAATTCTTAAAAATTCGAGATTTCTCTAGGATCATTTTCTCTTGTCATCTATCGTTGGGATTACTAAGTTAAAAGAAGAAAGAAAACAAAAGAAAGGAAGTGGAATTGGACAACACAAAGCCCACCACCGACAAGCTCATTTTTAGTCTAATTTCATTTCATTCCTCCATCATTTTCCATCCAAACTTTCAGAGCACTTTGAGGAGAGAGTGAAGGAAAAAGAAGAAGAATTTCGTGGTTTGAGGTTGAAGAAGATAGGAAAAAGTTCATGCAAAGCCAGCCATTGCAGAAATAGTTAGACGTTGGGTCAGTTCAAGTATAACTAGCGAAGTAAGGTCCGAGTATTATTGAGAGGGAGTCCAGTCAGGACTAGAATATGGGTATAAGTTTAGTATCATATATCTAAGTTAAGTATCTATAGCATGTCTAATGTGTTGGTATGTTATAGGAGTCATGCCACCACGTACCAGTAGACGACGCAGGCAGAATCAGGACGGGACGCAGGATCCTACCCAAGGTCAATCTGAAAGCGGATCTAGTACCCCGAGAGGTCAGAATGAGGCAGGGAGTGAGCGATTTGCTAGATGTGCACAGGAGATTGGTAGGCCAGAGATAGTAGAGCCTAGAGATCCGAAAAAGATATATGGGATTGACCGGTTTAAGAAATTAGGAGTCACAGTGTTTGAGGGTTCCACGAATCCAGCTGACGTCGAGGTCTGGTTAAATATGCTGGAGAAATGCTTCGACGTAATGAGTTGTCCTCAGGAGCGAAAAGTCAGATTAGCCACATTTCTGTTATAGAAAGAAGCCGAGGGATGGTGGAAATCCATTATAGCCAGGCGCAATGATGCACGTACGTTAGTGGCCGAGTACGAGAGGAAGTACACCAAGCTTTCGCGGTATGCTGAAGTGATTGTGACATCTGAGAGTGATAGGTGTTGCAGGTTTGAGAGAGGGCTACATTTTGAGATACGTACCCCAGTGACCGTTATTGCCAAGTGGACGGATTTTTCCCAGCTAGTAGAGACTGCTTTACGTGTGGAGCTGAGTATTGTAGAGGAAAAGTCGGCAATGGAGCTTAGTCGAAGAGTTTCAAC
AACTAGTGGTATTCGAGGTCGAGAGCAACAGAGGTTCACACCTGGAGTGAATGTTTCAAGTTGTCAAGACTTCAAGCGTCGATCGGGTGGCAAATCATTGAGGCAGATGAGTTCAGGTAGTGCTTATTAGAGGCAGAGTTAGAGAGCTTCCAGTCAGTCTGTGAACTCAGTAGCAAAACCGCGGACGGGTCAGGAGTCTGTTGCTAGTGAATCCAGGAGAACCCCATGTGTAAGTTGTGGCACGAGTCATCGGGGTCAGTGTCTTGTTGGCGCCGGTGTGTGTTACCAGTGTGGACAAACAGGGCATTCCAAGAGGGCTTGTCCACAACTGAGAGTAGGAGTTCAGAGGGACCAGGGAGTTAAGTCCCACACAGTTGAACAGCCAAGAATCTCAGCAGCCACAGGAGAGGGAACTAGTGGTGCAAGACAAAAAAGAGATGAGGGAAGACCTAGGCAGTAGGGAAAAGTCTACGCCATGACTCAACAGAAAGCAGAGGACGCACCAGATGTGATCACTGGTACAATTTTGATTTGTAATGCACCTGCACGTGTTTTATTAGATCCGGGCGCTACGCATTCCTTTGTTTCCAGTATGTTTTTAACCAAGCTGAATAGGATGCTAGAGCCTTTATCGGAGGAGTTAGTCATATGCACACCAGTTGGCGACGTTTTATTAGTCGGTGAAGTGTTGCGTGATTGTGAGGTTGTAATGGAAGGTCTATGTATGTTGGTGCATCTTCTTCCCCTAAAGTTGCAGGCATTGGATGTAATTCTGGGAATGGATTTCTTATTCACTCACTATGCTTCGATGAATTTCCATAGGAAAGAGGTGACTTTTAGGAAACCAGGTTCGACTGAAGTTGTTTTTAGGGATGAGAGAAAGATTATCTCTACGAGTTTGATTTCAGCTCTGAAAGCTGAGAAGTTGTTGAGGAAAGGTTGCACAGCGTTTCTTGGCACGTGGTTGAGGTGCAAGAAGAAAAGTTGAAACCAGAAGATGTTCCTGTAGTGAATGAATATCTAGATGTTTTTCCAGCTGATCTATCGAGTTTTCCGTCTGATAGAGAAGTGGAATTCACTATTGAATTGTTACCAGAAACAGCACCTATTTCACAGGCACCATACAAAATGGCTCCGAGCGAGCTTAAGGAGCTGAAGGTCCAGTTGCAAGAACTAGTTGACAAGGGATACATCAGGCCTAGTGTATCACCTTGGGGAGCTCAAGTGTTATTCATGAAGAAGAAAGATGGTATCCTAAGATTATGCATTGATTACAGGCAGTTAAACAAGGTCACGATACGTAACAAGTATCCTCTACCACGCATCGATGACTTGTTTGACCAGTTAAAGGGAGCAGCAGTGTTCTCTAAGATTGATCTGAGATTAGGATACTACCAGTTGAAGGTTAGGGAATCAGATATTCCTAAGACAGCATTTAGAACGAGGTATGGGCACTATGAGTTTTTAGTAATGCCATTCGGTTTAACGAATGCGCCAGAAAATTTCATGAACCTCATGAACAGGATCTTCCATCAGTATTTAGATCAGTTTGTGATAGTGTTCATCGATGACATACTAGTTTACTCAATGGACAAGAAAGCCCATGAGGAACATCTGAGGATTGTTCTACAAACACTGCGCGATAAACAACTATACGCTAAGTTCAGCAAATGTGAGTTATGGTTGAATCAAGTGGTGTTCTTAGGGCATGTGGTTTCAGCGGACGGAGTTAGTGTTGATCCGCAGAAAGTGGAAGGTGTTGCCAATTGGGAGAGACCAGTTAGTGCAACAGAGGTATGTAGTTTCCTAGGCCTGGCCGGATACTACAGACGTTTTATTGAGGATTTCTCACGGTTAGCATTACCCTTGACAGCTCTGACAAGGAAGAATGCTAAGTTTGAGTGGTCGGATAAATGTGAACAAAGTTTTCAGGAGCTGAAGAAGATATTGGTGACAGCACCTATTCTGACACTTCCTGTAACAGGAAAGGAGTATGTGAT
CTATTGTGATGCTTCGAGGCAAGGATTAGGTTGTGTGCTCATGCAGGAAGGGAAAGTAATAGCTTATCCTTCAAGGCAGTTGAAGAAGCATGAGTGTAATTACCCTACCTATGATCTTGAGCTACCAGCAGTTGTTCTAGCACTGAAGATTTGGAGACATTATTTATTCGGCAAGAAGTGCCACATTTTCACAGATCATAAAAGTTTGAAGTACATCTTTGATCAGAAAGAGCTGAATCTAAGATAGAGACGATGGCTAGAACTAATCAAAGACTATGATTGTACCATAGAATATCATTCTGGTAAGGCTAACGTGGTAGCAGATGCATTAAGTAGGAAGTTGAGACTTCCGAAGAGTGCCTTGTGTGGTATTCGAGCAAGCTTGCTAAGTGAGTTAAGAGGTTTCAAAGCAGTTATGACTGCAGAAAGCTCAGGGAGTCTTTTAGCTCAATTTCAGGTTAGGTCTTCCTTAGTAGCAGAGATTGTAGGAAGACAGCCAGAGGATAGTAATTTACAGAAGATGCTTGCAAAGGCCAAGCAAGGCCCAGAGGCAGAATTTGAGTTGAGAACGGAAGGTGCCATAGTTAAGCATGGAAGACTATGCGTTCCGAATATTAGTGAGCTTAAGGGTGCTATACTAGAAGAAGCTCACAATTCAGCTTACGCTATGCATCCAGGAAGCACCAAGATGTATAGAACTCTAAAGAAGGCTTATTGGTGGCATGGTATGAAGCGAGAGATAGCTAAATATGTCGATAGATGTTTGATCTGTCAACAGGTTAAACTAGTGAGGCAGAGGCCGGGAGGACTCCTTAATCCCCTGCCAGTGCCAAAGTGGAAGTGGGAGCATATTACCATGGATTTTCTGTTTGGATTACCCTGTACATCTAGAAGATATGATGGTATATGGGTAATAGTAGATAGACTCACCAAGACAACGCGGTTTATACCGATTAAAGCAAAGTCTACACTAGATCAGCTAGTTAAGTTATACGTCGACAGGATTGTGAGTCAACATGGAGTGTCAGTGTCCATAGTTTCAAATAGGGACCTGAGGTTTACTTCTAAGTTTTGTCCTAGTGTACAAAAAACAATGGGAACAAAGTCGAAGTTCAGTATCGCGTTCCATCCCCAAACAGATGGTCAGTCAAAGAGGACCATCCAGACCTTAGAAAACATGTTGAGAGCATGTGTCCTTCAGTTTAAGGGAAATTGGGATACCCACTTATCACTTACGGAGTTCGCTTATAATAACAGCTACCAGTCTAGTATCGGCCTGGCACCATTCGAGGCTTTGTATGGCAGACCATGCAGGACTCCTGTGTGCTAGAACGAAGTGCGAGAATGAAAGCTAGTTGGTCCTGAGTTAGTACAAGTTACGTCAGACAATATTAAGCTGGTTAGGGAAAATTTGAAAATAGCTCAGGATCGACAGAAGAGCTATGCAGATAAGCGACGAAGAGACTTAGAGTTCCAGATTGGAGAAAAATTTTTCTTAAAGTTATCTCCATGGAGAGGTGTTCTTCGTTTTGGGAGGAAAGGTAAGTTGAGTCCTAGATATATTGATCTGTACCAGATAATAGAACGAGTTGGACCAGCAGCCTATAGACTTGAATTGCCAGCGGAACTCTCTCGAATACATGATGTTTTTCATGTGTCCATGTTGAGGAAATATATTTCAGATCCATCACATGTGTTGCAAGTACAACCAATTGAACTAAAAAAAGACTTGAGTTATAGAAAGGAAGTAGTTCAGATCCTCGACAAGAAGGAGCAAGTTTTGAGGAACAAAATGATCTGGCTCGTGAAAGTCCTTTGGAGACATCATGGAATAGAGGAGGCAACCTGGGAGTCATAAGATCAAATAAGGAGGAGTTACCCGACACTCTTCACCTAAGGACTTCTAAATTTTGAGGACGAAATTTTATAAGGGGGAGGTAAGTTGTAAACTCGAGATTTTCCTAGGATCATTTTCTCTTGTCATCTATTTTTGGGATTAC
TAAGTTAAAATAAGAAAGAAAAGAAAGGAAAGGAAGTGGAATTGGACAGCACAAAGCCCACCACCGACAAGCTCATTTTTCAGCCCAATTTCATTTCATTCCTCCATCATTTTCCATCCAAACTTTTAGAGCACTTTGAGGAGAGAGTGAAGGAAGAAGAAGAAGAATTTCATGGTTTGAGGTTGAAGAAGATAGGAAAAAGTTCATGCAAATCCAGCCATTGCAGAATCTTGATGCAACTGTCAACCTTTGGTTTGTTTTGTTCGGTTTGAGCTATCCAGAAGGAACTAAGATGTGAGGTTTGATTTTCTTGAAAGTTAGTTCATTTTTCAACAAATTTCATGAAGGAAGCTTGAGCTGAATAGTATGAGAAGTTAATGTTTTTTGAAATGGAAAACAGAGCACAAGATTTTCGAGCAAACCAGGAGGATTTCTGGTGTTCTCTGATGCTTTGAGTATTCTGGAAGTTCACAAGTAGAATCAGAAGCACTATTTGGTATGGTTGGAAAGCTTATTCAATATCCTACAATTTTCATGAAGATTTTGTGAGCCAAAAATGACTTTAAGTGGGGTTAATTGCAGGAGATAGATAAAGAGTGGTAGAAAATCTCGAATTTTGTATTGTCTGTGTTTGGGGCAATTCTGGACGAATTGGTTGAGAATCTCGTTTTCATTTCTTCAGGTAAGTTGTAGAGTATTCGAAAATGAAGAGTTTTATGTCTATTTAATTAATTTTGGTTAAGTTTTGAGGAAGTTATGGGTGTTGGAAGTTAGGATATCGAATCTGGAAAATTTGGAAAGTGGGATGTAATGTAATTCTTAATCTAATTGTTTAGAGATTCATGAGCATGGAAAGACAAAGCTATCTATGATTCTAATGTTCGGTTTTGGTTGTTGACAAGCCAAGAGGAAATAGGCCGAGTCTACAAACCGGGGAACTAGATAAGATTGTGAGTGACAAAACAAATTTCTAAAATGTTATTAAACAGTTGTTCATAATTATATGTTTTACATGTTTTTCTATGAATGATACATGGTTTTTGTGAACGGTTTCAAACATGAGTTTTGAGCTTGGATTTTCTAATGAGAGTTATATTCAAGCACGTGGTAATGGCTTTATGAAAAGTAGTTGAAACGATGTCTTTTAAAGTAAAGCATGTTTAAGCAAATATGGTATAATGATTTAAAGTTTTTCACGAGTAAGCTGAATAAACTTGAGTTTTACAAAAGATAACGATAAACAGGCATGTTTAAATGTTTTCATGTGACTTCATGGAGATTTCCATTGACTACATGACTGAGATATTGAGCCTGAAGCTATATAGTACCGTGTGCACACATGTTATATTTCCGTTGTCGACGTTGAGTGTACTCAGTGACAACGATGCTGTCGTGAGTGCAGGAAGGACCCCACTACGACAAAGACGATGGGAATGTCAGACGGGCCCCACTGCATCATAGGACTAGTAAACGTTGGTTGTACTGGGCGTGTCCTACACCACGTAGATCGGTCATGTTAGTTAAAACGTTTGATGTACTTGTTATGCCTTGCTAGATTTTTTTTTATGACGAACTTGGAAGATATTTTAGAAAGCCTTATTTATTACACATTTATGTTAAATGCTTTGATTGAGCATATTTATACGTCTAAAGGTTTATCACATGTTTTGACGTTCGTATTTAGAGTTTTAAATTTAGTCACTCACTGGGCCTCGTAGCTCATTCTTTTCAAAATGTTTCCCACTTTTCAGGTAGAGATCGAGCTTCCGGTGCCCGATACACTGCCAACGTCTGCTGAAAGTTTCAATCAAACTCCAGTACGTGGTTGGAGTTGTATTTTGAGTTTGTTCATGTTGAAATGTTTTGTAAGGTTTGTATAATATCATCCTAGGGTCGTGGTCGTAAAACTCTAGTTGTATAGGTTATTTTACAAACCAAGTGATGTCAGGACTGCATCTCGTGTATGTTGGAAAGGTTTCCGTGGATTCCGCTGTGTGGTG
TTTCATGTTGAATATTCTATATCTAAGATTGGTATGTTTATCAGATTTAATGTTTCAGTAGGGTCAATAGGGATCATTAGAGGAGACGATGTCTGTTGGCTTCACGCCGTCTTTCGGGCTAAGATAGCAGGTAGTCCGAGAGGGGGTGTGACTACTTGGTATCAGAGCTGTTAGTTCCATGGGAAAAGAGACATGGTAGTTAGTGCTAAGAAAAATCCATAAAGTAAATAGTTAGACGTTAGGTCAGTTCAGGTAGAACTAGTGAAGTAAGGTCCGAGTTTTTTTAGAGGGAGTCCAATCAGGACTAGAATATGGGTATAAGTTTAGTATCATATATTTAAGTTAAGTATCTATAGCATGTCTAATGTGTTGGTATGTTATAGGAGTCATACCACCACATACCAGCAGATGAAGTAGGCAGAATCAGGACGAGACGCAGGATCCTACTCAAGGTCAATCTGAAAGGGGATCTAGTACCCCGAGAGGTCAGAATGAGGCAAGGAGTGAGCGATTTGCTAGATCTGCACAGGAGATCGGTAGGCCAGAGATAGCAGGGCCTAGTGATCCGGAAAAGATGTATGGAATTGAACGGTTGAAGAAATTAGGAGCCACAGTGTTTGAGAGTTCCACGGATCCAGCTGATGTCGAGGTCTGGTTAAATATGCTGGAGAAATGCTTCGACGTAATGAGTTGTCTTTAGGAGCGAAAAGTCAGATTAGCCACATTTCTGTTACAGAAAAAGGTCGAGGGATGGTGGAAATCCATTATAGTCAGGCACAATGATGCACGTACGTTAGATTGGCAGACGTTCAAAGGCATATTTGAGGAAAAGTACTATCCCACCATATATTGTGAGGCAAAGAGAGATGAGTTTCTGGAGCTGAAACAAGGGTCACTTTCAGTGGTCGAGTACGAGAGGAAGTATACCGAGCTTTCGCGGTATGCTGAAGTGATTGTGGCATCTGAGGTTTCCGTGGATTTCGCTGTATGGTGTTTCATGTTCAATATTCTATATCTAAGATTGGTATGTTTATCAGATTTAATGTTTCAGTAGGGTCAACAGAGATCGTTAGAGGAGATGATGTCTGTTGGCTTCACGCCGTCTTTCGGGCTAAGACAGCAGGTGGTCCGGGAAGGGGTGTGACATTATCGTTAGTCTTCATATGTGATGACAGCTCATCAATGCTTGCCCAATAATCTTCTCATTTCATGGGTTAGACTAGATGGATAGCTAGGGACATAGGGTGCAAGATGGACCTCACTCCTACCCGCTTTAGGGTTAGTA

mRNA sequence

ATGTCGGAGATGGAGGAGCTTACGAATAGTCGGATGGTCGATCTTGAGGATTCATCAGTGGTTGACTGTTCTCAGGCGAAAGACGGAAACGTCAGATTTAGCCGGTTGGATGAAGGAGAAGTCGTGCGGATGGCAACTATGTTCATCAACATGAATAGCTCGATAGTTCAATTATTAGCTTTCGAAAAACTTAACGGCGATAATTATGCGACATGGAAATTGAATCTTAACACGATACTAATGGTCGATGATTTAAGATTTGCCTTAACTGAGGAATGTCCTCAAACCCCTACCTTAATTGCAAACCGAACTAGTCAGGAAGCATACGATCGATGGATAAAAGTCGATGAGAAAGTTCGTGTCTACATCCTTGCTAGCATGGACCCCACCTTCTCATTGGCTCGAGAAGGATTCAGGTTTATATGTTGGACCTTAAACCAATTGTTCAATAGTGAATCAATGGGACTTGAGGAGCAAGATGTAATCTCGGGGCCGTCGCCACCCCGCCACTCCATCACCGCCGCCGACCCGTAGTCCGCCGTGTGTCACCTCCATCCTTCGTTCTCCAGTCGCCCATTTGCCCCACCACCGTTGATCTCATTCGCAACCCGAGCCGACCCTCGTCGTGGCAGTCATCTGTTTCTGCTCCATCGTAGCTCAACTGTCAGCTGTCGTCGATCCCCTTCGTTCATCGTCCGATTTCGTTACCCAAGCCACGTCGGTCGTCACGCCTCTTTGCCGGATTTCCTTCAACACGAGTTGTTACGCCATTGCCCAAAGCACTGCCGCTTCGTCTGTCGTCGTTTGTTAGAGACCGCCACAGCCCGAGCCACACGCGCACCCGACCCGAACAGCCAAGCCCGCGCCCGCATCCATCAGCGAGCCATTCGTGTGGCAAGACCGTGCCGAGCCGCGCCCGCATCCATCAACAGAGTCGCGCCTGCGCCTGCTCCCAACCGAGCCACGCCTGCCTCCAGCCACGAGCCGCTTCATAACCTTAGCCGAGCCACCCCTGTTTTTCCTAGCCAGCACTTTGAGGAGAGAGTGAAGGAAAAAGAAGAAGAATTTCGTGGTTTGAGGTTGAAGAAGATAGGAAAAAGTTCATGCAAAGCCAGCCATTGCAGAAATAGAGTCATGCCACCACGTACCAGTAGACGACGCAGGCAGAATCAGGACGGGACGCAGGATCCTACCCAAGGTCAATCTGAAAGCGGATCTAGTACCCCGAGAGGTCAGAATGAGGCAGGGAGTGAGCGATTTGCTAGATGTGCACAGGAGATTGGTAGGCCAGAGATAGTAGAGCCTAGAGATCCGAAAAAGATATATGGGATTGACCGGTTTAAGAAATTAGGAGTCACAGTGTTTGAGGGTTCCACGAATCCAGCTGACGTCGAGGTCTGGTTAAATATGCTGGAGAAATGCTTCGACGTAATGAGTTGTCCTCAGGAGCGAAAAAAAGAAGCCGAGGGATGGTGGAAATCCATTATAGCCAGGCGCAATGATGCACGTACGTTAGTGGCCGAGTACGAGAGGAAGTACACCAAGCTTTCGCGGTATGCTGAAGTGATTGTGACATCTGAGAGTGATAGGTGTTGCAGGTTTGAGAGAGGGCTACATTTTGAGATACGTACCCCAGTGACCGTTATTGCCAAGTGGACGGATTTTTCCCAGCTAGTAGAGACTGCTTTACGTGTGGAGCTGAGTATTGTAGAGGAAAAGTCGGCAATGGAGCTTAGTCGAAGAGTTTCAACAACTAGTGGTATTCGAGGTCGAGAGCAACAGAGGTTCACACCTGGAGTGAATGTTTCAAGTTGTCAAGACTTCAAGCGTCGATCGGGTGGCAAATCATTGAGGCAGATGAGTTCAGTAGCAAAACCGCGGACGGGTCAGGAGTCTGTTGCTAGTGAATCCAGGAGAACCCCATGTGTAAGTTGTGGCACGAGTCATCGGGGTCAGTGTCTTGTTGGCGCCGGTGTGTGTTACCAGTGTGGACAAACAGG
GCATTCCAAGAGGGCTTGTCCACAACTGAGAGTAGGAGTTCAGAGGGACCAGGGAGTTAAGTCCCACACAGTTGAACAGCCAAGAATCTCAGCAGCCACAGGAGAGGGAACTAAAAAGAAAGGAAAGGAAGTGGAATTGGACAGCACAAAGCCCACCACCGACAAGCTCATTTTTCAGCCCAATTTCATTTCATTCCTCCATCATTTTCCATCCAAACTTTTAGAGCACTTTGAGGAGAGAGTGAAGGAAGAAGAAGAAGAATTTCATGGTTTGAGGTTGAAGAAGATAGGAAAAAGTTCATGCAAATCCAGCCATTGCAGAATCTTGATGCAACTGTCAACCTTTGGTTTGTTTTGTTCGAGCACAAGATTTTCGAGCAAACCAGGAGGATTTCTGGTGTTCTCTGATGCTTTGAGTATTCTGGAAGTTCACAATTATGGGTGTTGGAAGTTAGGATATCGAATCTGGAAAATTTGGAAAGTGGGATGTAATATTTCCATTGACTACATGACTGAGATATTGAGCCTGAAGCTATATAGTACCGTGTGCACACATGTTATATTTCCGTTGTCGACGTTGAGTGTACTCAGTGACAACGATGCTGTCGTGAGTGCAGGAAGGACCCCACTACGACAAAGACGATGGGAATGTCAGACGGGCCCCACTGCATCATAGGACTAGTAAACGTTGGTTGTACTGGGCGTGTCCTACACCACGTAGATCGGTCATGTTAGTTAAAACGTTTGATGTAGAGATCGAGCTTCCGGTGCCCGATACACTGCCAACGTCTGCTGAAAGTTTCAATCAAACTCCAGTACGTGGTTGGAGTTGTATTTTGAGTTTGTTCATGTTGAAATGTTTTGTAAGGACTGCATCTCGTGTATGTTGGAAAGGTTTCCGTGGATTCCGCTGTGTGGTGTTTCATGTTGAATATTCTATATCTAAGATTGGGTCAATAGGGATCATTAGAGGAGACGATGTCTGTTGGCTTCACGCCGTCTTTCGGGCTAAGATAGCAGGTAGTCCGAGAGGGGGTAATCAGGACGAGACGCAGGATCCTACTCAAGGTCAATCTGAAAGGGGATCTAGTACCCCGAGAGGTCAGAATGAGGCAAGGAGTGAGCGATTTGCTAGATCTGCACAGGAGATCGGTAGGCCAGAGATAGCAGGGCCTAGTGATCCGGAAAAGATGTATGGAATTGAACGGTTGAAGAAATTAGGAGCCACAGTGTTTGAGAGTTCCACGGATCCAGCTGATGTCGAGGTCTGGTTAAATATGCTGGAGAAATGCTTCGACCGAAAAGTCAGATTAGCCACATTTCTGTTACAGAAAAAGGTCGAGGGATGGTGGAAATCCATTATAGTCAGGCACAATGATGCACGTACGTTAGATTGGCAGACGTTCAAAGGCATATTTGAGGAAAAGTACTATCCCACCATATATTGTGAGGCAAAGAGAGATGAGTTTCTGGAGCTGAAACAAGGGTCACTTTCAGTGGTCGAGTACGAGAGGAAGTATACCGAGCTTTCGCGGTATGCTGAAGTGATTGTGGCATCTGAGGTTTCCGTGGATTTCGCTGTATGGTGTTTCATGTTCAATATTCTATATCTAAGATTGGGTCAACAGAGATCGTTAGAGGAGATGATGTCTGTTGGCTTCACGCCGTCTTTCGGGCTAAGACAGCAGGTGGTCCGGGAAGGGACTAGATGGATAGCTAGGGACATAGGGTGCAAGATGGACCTCACTCCTACCCGCTTTAGGGTTAGTA

Coding sequence (CDS)

ATGCCACCACGTACCAGTAGACGACGCAGGCAGAATCAGGACGGGACGCAGGATCCTACCCAAGGTCAATCTGAAAGCGGATCTAGTACCCCGAGAGGTCAGAATGAGGCAGGGAGTGAGCGATTTGCTAGATGTGCACAGGAGATTGGTAGGCCAGAGATAGTAGAGCCTAGAGATCCGAAAAAGATATATGGGATTGACCGGTTTAAGAAATTAGGAGTCACAGTGTTTGAGGGTTCCACGAATCCAGCTGACGTCGAGGTCTGGTTAAATATGCTGGAGAAATGCTTCGACGTAATGAGTTGTCCTCAGGAGCGAAAAAAAGAAGCCGAGGGATGGTGGAAATCCATTATAGCCAGGCGCAATGATGCACGTACGTTAGTGGCCGAGTACGAGAGGAAGTACACCAAGCTTTCGCGGTATGCTGAAGTGATTGTGACATCTGAGAGTGATAGGTGTTGCAGGTTTGAGAGAGGGCTACATTTTGAGATACGTACCCCAGTGACCGTTATTGCCAAGTGGACGGATTTTTCCCAGCTAGTAGAGACTGCTTTACGTGTGGAGCTGAGTATTGTAGAGGAAAAGTCGGCAATGGAGCTTAGTCGAAGAGTTTCAACAACTAGTGGTATTCGAGGTCGAGAGCAACAGAGGTTCACACCTGGAGTGAATGTTTCAAGTTGTCAAGACTTCAAGCGTCGATCGGGTGGCAAATCATTGAGGCAGATGAGTTCAGTAGCAAAACCGCGGACGGGTCAGGAGTCTGTTGCTAGTGAATCCAGGAGAACCCCATGTGTAAGTTGTGGCACGAGTCATCGGGGTCAGTGTCTTGTTGGCGCCGGTGTGTGTTACCAGTGTGGACAAACAGGGCATTCCAAGAGGGCTTGTCCACAACTGAGAGTAGGAGTTCAGAGGGACCAGGGAGTTAAGTCCCACACAGTTGAACAGCCAAGAATCTCAGCAGCCACAGGAGAGGGAACTAAAAAGAAAGGAAAGGAAGTGGAATTGGACAGCACAAAGCCCACCACCGACAAGCTCATTTTTCAGCCCAATTTCATTTCATTCCTCCATCATTTTCCATCCAAACTTTTAGAGCACTTTGAGGAGAGAGTGAAGGAAGAAGAAGAAGAATTTCATGGTTTGAGGTTGAAGAAGATAGGAAAAAGTTCATGCAAATCCAGCCATTGCAGAATCTTGATGCAACTGTCAACCTTTGGTTTGTTTTGTTCGAGCACAAGATTTTCGAGCAAACCAGGAGGATTTCTGGTGTTCTCTGATGCTTTGAGTATTCTGGAAGTTCACAATTATGGGTGTTGGAAGTTAGGATATCGAATCTGGAAAATTTGGAAAGTGGGATGTAATATTTCCATTGACTACATGACTGAGATATTGAGCCTGAAGCTATATAGTACCGTGTGCACACATGTTATATTTCCGTTGTCGACGTTGAGTGTACTCAGTGACAACGATGCTGTCGTGAGTGCAGGAAGGACCCCACTACGACAAAGACGATGGGAATGTCAGACGGGCCCCACTGCATCATAG

Protein sequence

MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDPKKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQERKKEAEGWWKSIIARRNDARTLVAEYERKYTKLSRYAEVIVTSESDRCCRFERGLHFEIRTPVTVIAKWTDFSQLVETALRVELSIVEEKSAMELSRRVSTTSGIRGREQQRFTPGVNVSSCQDFKRRSGGKSLRQMSSVAKPRTGQESVASESRRTPCVSCGTSHRGQCLVGAGVCYQCGQTGHSKRACPQLRVGVQRDQGVKSHTVEQPRISAATGEGTKKKGKEVELDSTKPTTDKLIFQPNFISFLHHFPSKLLEHFEERVKEEEEEFHGLRLKKIGKSSCKSSHCRILMQLSTFGLFCSSTRFSSKPGGFLVFSDALSILEVHNYGCWKLGYRIWKIWKVGCNISIDYMTEILSLKLYSTVCTHVIFPLSTLSVLSDNDAVVSAGRTPLRQRRWECQTGPTAS
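
The relationship between the coding sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first eleven codons of the CDS are used here as a check, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.
    Trailing bases that do not form a full codon are ignored."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 bases of the CDS listed above.
print(translate("ATGCCACCACGTACCAGTAGACGACGCAGGCAG"))  # MPPRTSRRRRQ
```

The output matches the first eleven residues of the protein sequence listed above.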
Homology
BLAST of MELO3C035074 vs. NCBI nr
Match: XP_016901625.1 (PREDICTED: uncharacterized protein LOC107991320 [Cucumis melo])

HSP 1 Score: 454.9 bits (1169), Expect = 9.1e-124
Identity = 250/356 (70.22%), Positives = 273/356 (76.69%), Query Frame = 0

Query: 1   MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDP 60
           MPPRTS+R RQNQDGTQDPTQGQSE GSSTPRGQNEAGSERF+R AQEIGRPE   P DP
Sbjct: 1   MPPRTSKRHRQNQDGTQDPTQGQSERGSSTPRGQNEAGSERFSRSAQEIGRPEKAGPSDP 60

Query: 61  KKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQERKK----EAEGWWKS 120
           +K+YGI+R KKL  TVF+GST+ AD EVWLNMLEKCFD+MSCPQERK       E ++ +
Sbjct: 61  EKMYGIERLKKLEATVFDGSTDLADAEVWLNMLEKCFDLMSCPQERKTFRGIFEEKYYPT 120

Query: 121 II--ARRNDARTL------VAEYERKYTKLSRYAEVIVTSESDRCCRFERGLHFEIRTPV 180
               A+R++   L      V EYERK      YAE+IV  ESDRCCR ERGL FE RTPV
Sbjct: 121 TYCEAKRDEFLELKQESLSVVEYERK------YAEMIVAFESDRCCRGERGLRFEKRTPV 180

Query: 181 TVIAKWTDFSQLVETALRVELSIVEEKSAMELSRRVSTTSGIRGREQQRFTPGVNVSSCQ 240
           T I KW DFSQLVETALRVE SIVEEKS MELSR VSTTSGIRGREQ+RFTPGVNVS CQ
Sbjct: 181 TAITKWMDFSQLVETALRVEQSIVEEKSVMELSRGVSTTSGIRGREQRRFTPGVNVSCCQ 240

Query: 241 DFKRRSGGKSLRQMS------------------SVAKPRTGQESVASESRRTPCVSCGTS 300
           DFKRRSGGK LRQMS                  SVA+ RTGQ+SVASESRRTPCV+CG  
Sbjct: 241 DFKRRSGGKPLRQMSSGSAYQRQSQRASSQSANSVARSRTGQKSVASESRRTPCVTCGKC 300

Query: 301 HRGQCLVGAGVCYQCGQTGHSKRACPQLRVGVQRDQGVKSHTVEQPRISAATGEGT 327
           HRGQCL+G GV YQCGQTGH KR CPQLRV V+RDQGV+SHT+EQPRIS   GEGT
Sbjct: 301 HRGQCLIGVGVYYQCGQTGHLKRDCPQLRVVVRRDQGVESHTIEQPRISTTVGEGT 350

BLAST of MELO3C035074 vs. NCBI nr
Match: KAA0051980.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] >TYK04577.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 444.1 bits (1141), Expect = 1.6e-120
Identity = 247/357 (69.19%), Positives = 264/357 (73.95%), Query Frame = 0

Query: 1   MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDP 60
           MPPRTSRRRRQNQD  QDPTQGQSE GSSTPRGQNEAGSERF R AQEIGR E   P DP
Sbjct: 151 MPPRTSRRRRQNQDRMQDPTQGQSERGSSTPRGQNEAGSERFVRSAQEIGRSERARPSDP 210

Query: 61  KKIYGIDRFKKLGVTVFEGSTNPADVEVWLN-------------MLEKCFDVMSCPQERK 120
           +K+YGI+R K+LG TVF GST+ AD EVW N               EK +    C  +R 
Sbjct: 211 EKMYGIERLKELGATVFVGSTDLADAEVWHNDARTLDWQTFRGIFEEKYYPTTCCEAKRD 270

Query: 121 KEAEGWWKSIIARRNDARTLVAEYERKYTKLSRYAEVIVTSESDRCCRFERGLHFEIRTP 180
           +  E                VA+Y+RKYT+LS YAEVI+ SESDRC RFERGLHFEIRTP
Sbjct: 271 EFLE---------LKQGSLSVAKYKRKYTELSWYAEVIMASESDRCRRFERGLHFEIRTP 330

Query: 181 VTVIAKWTDFSQLVETALRVELSIVEEKSAMELSRRVSTTSGIRGREQQRFTPGVNVSSC 240
           VT IAKWTDFSQL+ETALRVE SIVEEKSAMELSR VSTTS IRGREQ+R TPGVN+S C
Sbjct: 331 VTAIAKWTDFSQLIETALRVEQSIVEEKSAMELSRGVSTTSRIRGREQRRSTPGVNISGC 390

Query: 241 QDFKRRSGGKSLRQMS------------------SVAKPRTGQESVASESRRTPCVSCGT 300
           QDFKRR GGK LRQMS                  SVA+PRTGQESVASESRRTPCVSC  
Sbjct: 391 QDFKRRLGGKPLRQMSSGSAYQRQSQRASSQFTNSVARPRTGQESVASESRRTPCVSCSK 450

Query: 301 SHRGQCLVGAGVCYQCGQTGHSKRACPQLRVGVQRDQGVKSHTVEQPRISAATGEGT 327
           SHRGQCLVGAGVCYQCGQTGH KR CPQLRVGVQRDQGV+SHTVEQPRI AA  EGT
Sbjct: 451 SHRGQCLVGAGVCYQCGQTGHFKRDCPQLRVGVQRDQGVESHTVEQPRILAAAREGT 498

BLAST of MELO3C035074 vs. NCBI nr
Match: KAA0037581.1 (reverse transcriptase [Cucumis melo var. makuwa])

HSP 1 Score: 433.0 bits (1112), Expect = 3.7e-117
Identity = 236/349 (67.62%), Positives = 258/349 (73.93%), Query Frame = 0

Query: 47  QEIGRPEIVEPRDPKKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQER 106
           +EIGRPE   P D +K+YGI+R KKLG TVFEGST+PAD EVWLNMLEKCFDVMSCPQER
Sbjct: 259 KEIGRPEKAGPSDLEKMYGIERLKKLGATVFEGSTDPADAEVWLNMLEKCFDVMSCPQER 318

Query: 107 K---------KEAEGWWKSIIARRNDARTL------------------------------ 166
           K         KEAEGWWKSIIARRNDARTL                              
Sbjct: 319 KVKLATFLLLKEAEGWWKSIIARRNDARTLDWQTFRGIFEEKYYPTTYCEAKRDEFLELK 378

Query: 167 -----VAEYERKYTKLSRYAEVIVTSESDRCCRFERGLHFEIRTPVTVIAKWTDFSQLVE 226
                VA+YERKYT+LSRYAE+IV SESDRC RFERGL FEIRTPVT IAKW +FSQLVE
Sbjct: 379 QWSLSVAKYERKYTELSRYAEMIVASESDRCHRFERGLRFEIRTPVTAIAKWMNFSQLVE 438

Query: 227 TALRVELSIVEEKSAMELSRRVSTTSGIRGREQQRFTPGVNVSSCQDFKRRSGGKSLRQM 286
           TALRV+ SIVEEKSAMELSR VSTTSGIRGREQ+RFTPGVNVS CQDFKRRSGGK LRQM
Sbjct: 439 TALRVKQSIVEEKSAMELSRGVSTTSGIRGREQRRFTPGVNVSGCQDFKRRSGGKPLRQM 498

Query: 287 S------------------SVAKPRTGQESVASESRRTPCVSCGTSHRGQCLVGAGVCYQ 331
           S                  SVA+ RTGQESVASES+RTPCVSCG SH+G+C++GAGVCYQ
Sbjct: 499 SSGSAYQRQSRRASSQPANSVARSRTGQESVASESKRTPCVSCGKSHQGRCVIGAGVCYQ 558

BLAST of MELO3C035074 vs. NCBI nr
Match: TYK03091.1 (reverse transcriptase [Cucumis melo var. makuwa])

HSP 1 Score: 433.0 bits (1112), Expect = 3.7e-117
Identity = 236/349 (67.62%), Positives = 258/349 (73.93%), Query Frame = 0

Query: 47  QEIGRPEIVEPRDPKKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQER 106
           +EIGRPE   P D +K+YGI+R KKLG TVFEGST+PAD EVWLNMLEKCFDVMSCPQER
Sbjct: 259 KEIGRPEKAGPSDLEKMYGIERLKKLGATVFEGSTDPADAEVWLNMLEKCFDVMSCPQER 318

Query: 107 K---------KEAEGWWKSIIARRNDARTL------------------------------ 166
           K         KEAEGWWKSIIARRNDARTL                              
Sbjct: 319 KVKLATFLLQKEAEGWWKSIIARRNDARTLDWQTFRGIFEEKYYPTTYCEAKRDEFLELK 378

Query: 167 -----VAEYERKYTKLSRYAEVIVTSESDRCCRFERGLHFEIRTPVTVIAKWTDFSQLVE 226
                VA+YERKYT+LSRYAE+IV SESDRC RFERGL FEIRTPVT IAKW +FSQLVE
Sbjct: 379 QWSLSVAKYERKYTELSRYAEMIVASESDRCHRFERGLRFEIRTPVTAIAKWMNFSQLVE 438

Query: 227 TALRVELSIVEEKSAMELSRRVSTTSGIRGREQQRFTPGVNVSSCQDFKRRSGGKSLRQM 286
           TALRV+ SIVEEKSAMELSR VSTTSGIRGREQ+RFTPGVNVS CQDFKRRSGGK LRQM
Sbjct: 439 TALRVKQSIVEEKSAMELSRGVSTTSGIRGREQRRFTPGVNVSGCQDFKRRSGGKPLRQM 498

Query: 287 S------------------SVAKPRTGQESVASESRRTPCVSCGTSHRGQCLVGAGVCYQ 331
           S                  SVA+ RTGQESVASES+RTPCVSCG SH+G+C++GAGVCYQ
Sbjct: 499 SSGSAYQRQSRRASSQPANSVARSRTGQESVASESKRTPCVSCGKSHQGRCVIGAGVCYQ 558

BLAST of MELO3C035074 vs. NCBI nr
Match: KAA0066849.1 (DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa])

HSP 1 Score: 433.0 bits (1112), Expect = 3.7e-117
Identity = 237/388 (61.08%), Positives = 267/388 (68.81%), Query Frame = 0

Query: 1   MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDP 60
           MPPRT RRRRQNQDG Q PTQG S   SST   +  AG+E+FAR  QEIGR +  EP DP
Sbjct: 39  MPPRTGRRRRQNQDGMQGPTQGPSVGESSTLGVRGGAGNEQFARTTQEIGRTDRAEPSDP 98

Query: 61  KKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQERK---------KEAE 120
           +K YGI+R KKLG TVFEGST+PAD E WLNMLEKCFDVM+CP+ERK         KEAE
Sbjct: 99  EKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLATFLLQKEAE 158

Query: 121 GWWKSIIARRNDARTL-----------------------------------VAEYERKYT 180
           GWWKSI+ARR+DAR L                                   VAEYERKYT
Sbjct: 159 GWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSVAEYERKYT 218

Query: 181 KLSRYAEVIVTSESDRCCRFERGLHFEIRTPVTVIAKWTDFSQLVETALRVELSIVEEKS 240
           +LSRYA+VI+ SESDRC RFERGL FEIRTPVT IAKWT+FSQLVETALRVE SI EEKS
Sbjct: 219 ELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVEQSITEEKS 278

Query: 241 AMELSRRVSTTSGIRGREQQRFTPGVNVSSCQDFKRRSGGKSLRQMS------------- 300
           A+ELSR  ST SG RGREQ+RFTPG+N+SS QDFK RSGG++ R +S             
Sbjct: 279 AVELSRGTSTASGFRGREQRRFTPGINISSRQDFKNRSGGQASRNVSYGSVFQRQSQRIP 338

Query: 301 -----SVAKPRTGQESVASESRRTPCVSCGTSHRGQCLVGAGVCYQCGQTGHSKRACPQL 327
                S  + + GQES+AS  RR PC SCG +HRGQCLVGAGVCYQCGQ GH K+ CPQL
Sbjct: 339 SQPIRSTVRSQPGQESIASTVRRIPCTSCGRNHRGQCLVGAGVCYQCGQPGHFKKDCPQL 398

BLAST of MELO3C035074 vs. ExPASy TrEMBL
Match: A0A1S4E0X6 (uncharacterized protein LOC107991320 OS=Cucumis melo OX=3656 GN=LOC107991320 PE=4 SV=1)

HSP 1 Score: 454.9 bits (1169), Expect = 4.4e-124
Identity = 250/356 (70.22%), Positives = 273/356 (76.69%), Query Frame = 0

Query: 1   MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDP 60
           MPPRTS+R RQNQDGTQDPTQGQSE GSSTPRGQNEAGSERF+R AQEIGRPE   P DP
Sbjct: 1   MPPRTSKRHRQNQDGTQDPTQGQSERGSSTPRGQNEAGSERFSRSAQEIGRPEKAGPSDP 60

Query: 61  KKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQERKK----EAEGWWKS 120
           +K+YGI+R KKL  TVF+GST+ AD EVWLNMLEKCFD+MSCPQERK       E ++ +
Sbjct: 61  EKMYGIERLKKLEATVFDGSTDLADAEVWLNMLEKCFDLMSCPQERKTFRGIFEEKYYPT 120

Query: 121 II--ARRNDARTL------VAEYERKYTKLSRYAEVIVTSESDRCCRFERGLHFEIRTPV 180
               A+R++   L      V EYERK      YAE+IV  ESDRCCR ERGL FE RTPV
Sbjct: 121 TYCEAKRDEFLELKQESLSVVEYERK------YAEMIVAFESDRCCRGERGLRFEKRTPV 180

Query: 181 TVIAKWTDFSQLVETALRVELSIVEEKSAMELSRRVSTTSGIRGREQQRFTPGVNVSSCQ 240
           T I KW DFSQLVETALRVE SIVEEKS MELSR VSTTSGIRGREQ+RFTPGVNVS CQ
Sbjct: 181 TAITKWMDFSQLVETALRVEQSIVEEKSVMELSRGVSTTSGIRGREQRRFTPGVNVSCCQ 240

Query: 241 DFKRRSGGKSLRQMS------------------SVAKPRTGQESVASESRRTPCVSCGTS 300
           DFKRRSGGK LRQMS                  SVA+ RTGQ+SVASESRRTPCV+CG  
Sbjct: 241 DFKRRSGGKPLRQMSSGSAYQRQSQRASSQSANSVARSRTGQKSVASESRRTPCVTCGKC 300

Query: 301 HRGQCLVGAGVCYQCGQTGHSKRACPQLRVGVQRDQGVKSHTVEQPRISAATGEGT 327
           HRGQCL+G GV YQCGQTGH KR CPQLRV V+RDQGV+SHT+EQPRIS   GEGT
Sbjct: 301 HRGQCLIGVGVYYQCGQTGHLKRDCPQLRVVVRRDQGVESHTIEQPRISTTVGEGT 350

BLAST of MELO3C035074 vs. ExPASy TrEMBL
Match: A0A5A7U9X4 (DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold409G002130 PE=4 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 7.8e-121
Identity = 247/357 (69.19%), Positives = 264/357 (73.95%), Query Frame = 0

Query: 1   MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDP 60
           MPPRTSRRRRQNQD  QDPTQGQSE GSSTPRGQNEAGSERF R AQEIGR E   P DP
Sbjct: 151 MPPRTSRRRRQNQDRMQDPTQGQSERGSSTPRGQNEAGSERFVRSAQEIGRSERARPSDP 210

Query: 61  KKIYGIDRFKKLGVTVFEGSTNPADVEVWLN-------------MLEKCFDVMSCPQERK 120
           +K+YGI+R K+LG TVF GST+ AD EVW N               EK +    C  +R 
Sbjct: 211 EKMYGIERLKELGATVFVGSTDLADAEVWHNDARTLDWQTFRGIFEEKYYPTTCCEAKRD 270

Query: 121 KEAEGWWKSIIARRNDARTLVAEYERKYTKLSRYAEVIVTSESDRCCRFERGLHFEIRTP 180
           +  E                VA+Y+RKYT+LS YAEVI+ SESDRC RFERGLHFEIRTP
Sbjct: 271 EFLE---------LKQGSLSVAKYKRKYTELSWYAEVIMASESDRCRRFERGLHFEIRTP 330

Query: 181 VTVIAKWTDFSQLVETALRVELSIVEEKSAMELSRRVSTTSGIRGREQQRFTPGVNVSSC 240
           VT IAKWTDFSQL+ETALRVE SIVEEKSAMELSR VSTTS IRGREQ+R TPGVN+S C
Sbjct: 331 VTAIAKWTDFSQLIETALRVEQSIVEEKSAMELSRGVSTTSRIRGREQRRSTPGVNISGC 390

Query: 241 QDFKRRSGGKSLRQMS------------------SVAKPRTGQESVASESRRTPCVSCGT 300
           QDFKRR GGK LRQMS                  SVA+PRTGQESVASESRRTPCVSC  
Sbjct: 391 QDFKRRLGGKPLRQMSSGSAYQRQSQRASSQFTNSVARPRTGQESVASESRRTPCVSCSK 450

Query: 301 SHRGQCLVGAGVCYQCGQTGHSKRACPQLRVGVQRDQGVKSHTVEQPRISAATGEGT 327
           SHRGQCLVGAGVCYQCGQTGH KR CPQLRVGVQRDQGV+SHTVEQPRI AA  EGT
Sbjct: 451 SHRGQCLVGAGVCYQCGQTGHFKRDCPQLRVGVQRDQGVESHTVEQPRILAAAREGT 498

BLAST of MELO3C035074 vs. ExPASy TrEMBL
Match: A0A5A7U2V7 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold374G00630 PE=4 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 1.8e-117
Identity = 237/388 (61.08%), Positives = 267/388 (68.81%), Query Frame = 0

Query: 1   MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDP 60
           MPPRT RRRRQNQDG Q PTQG S   SST   +  AG+E+FAR  QEIGR +  EP DP
Sbjct: 1   MPPRTGRRRRQNQDGMQGPTQGPSVGESSTLGVRGGAGNEQFARTTQEIGRTDRAEPSDP 60

Query: 61  KKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQERK---------KEAE 120
           +K YGI+R KKLG TVFEGST+PAD E WLNMLEKCFDVM+CP+ERK         KEAE
Sbjct: 61  EKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLATFLLQKEAE 120

Query: 121 GWWKSIIARRNDARTL-----------------------------------VAEYERKYT 180
           GWWKSI+ARR+DAR L                                   VAEYERKYT
Sbjct: 121 GWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSVAEYERKYT 180

Query: 181 KLSRYAEVIVTSESDRCCRFERGLHFEIRTPVTVIAKWTDFSQLVETALRVELSIVEEKS 240
           +LSRYA+VI+ SESDRC RFERGL FEIRTPVT IAKWT+FSQLVETALRVE SI EEKS
Sbjct: 181 ELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVEQSITEEKS 240

Query: 241 AMELSRRVSTTSGIRGREQQRFTPGVNVSSCQDFKRRSGGKSLRQMS------------- 300
           A+ELSR  ST SG RGREQ+RFTPG+N+SS QDFK RSGG++ R +S             
Sbjct: 241 AVELSRGTSTASGFRGREQRRFTPGINISSRQDFKNRSGGQASRNVSYGSVFQRQSQRIP 300

Query: 301 -----SVAKPRTGQESVASESRRTPCVSCGTSHRGQCLVGAGVCYQCGQTGHSKRACPQL 327
                S  + + GQES+AS  RR PC SCG +HRGQCLVGAGVCYQCGQ GH K+ CPQL
Sbjct: 301 SQPIRSTVRSQPGQESIASTVRRIPCTSCGRNHRGQCLVGAGVCYQCGQPGHFKKDCPQL 360

BLAST of MELO3C035074 vs. ExPASy TrEMBL
Match: A0A5D3BS67 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold509G00050 PE=4 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 1.8e-117
Identity = 237/388 (61.08%), Positives = 267/388 (68.81%), Query Frame = 0

Query: 1   MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDP 60
           MPPRT RRRRQNQDG Q PTQG S   SST   +  AG+E+FAR  QEIGR +  EP DP
Sbjct: 39  MPPRTGRRRRQNQDGMQGPTQGPSVGESSTLGVRGGAGNEQFARTTQEIGRTDRAEPSDP 98

Query: 61  KKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQERK---------KEAE 120
           +K YGI+R KKLG TVFEGST+PAD E WLNMLEKCFDVM+CP+ERK         KEAE
Sbjct: 99  EKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLATFLLQKEAE 158

Query: 121 GWWKSIIARRNDARTL-----------------------------------VAEYERKYT 180
           GWWKSI+ARR+DAR L                                   VAEYERKYT
Sbjct: 159 GWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSVAEYERKYT 218

Query: 181 KLSRYAEVIVTSESDRCCRFERGLHFEIRTPVTVIAKWTDFSQLVETALRVELSIVEEKS 240
           +LSRYA+VI+ SESDRC RFERGL FEIRTPVT IAKWT+FSQLVETALRVE SI EEKS
Sbjct: 219 ELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVEQSITEEKS 278

Query: 241 AMELSRRVSTTSGIRGREQQRFTPGVNVSSCQDFKRRSGGKSLRQMS------------- 300
           A+ELSR  ST SG RGREQ+RFTPG+N+SS QDFK RSGG++ R +S             
Sbjct: 279 AVELSRGTSTASGFRGREQRRFTPGINISSRQDFKNRSGGQASRNVSYGSVFQRQSQRIP 338

Query: 301 -----SVAKPRTGQESVASESRRTPCVSCGTSHRGQCLVGAGVCYQCGQTGHSKRACPQL 327
                S  + + GQES+AS  RR PC SCG +HRGQCLVGAGVCYQCGQ GH K+ CPQL
Sbjct: 339 SQPIRSTVRSQPGQESIASTVRRIPCTSCGRNHRGQCLVGAGVCYQCGQPGHFKKDCPQL 398

BLAST of MELO3C035074 vs. ExPASy TrEMBL
Match: A0A5D3BHI1 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold115G00450 PE=4 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 1.8e-117
Identity = 237/388 (61.08%), Positives = 267/388 (68.81%), Query Frame = 0

Query: 1   MPPRTSRRRRQNQDGTQDPTQGQSESGSSTPRGQNEAGSERFARCAQEIGRPEIVEPRDP 60
           MPPRT RRRRQNQDG Q PTQG S   SST   +  AG+E+FAR  QEIGR +  EP DP
Sbjct: 39  MPPRTGRRRRQNQDGMQGPTQGPSVGESSTLGVRGGAGNEQFARTTQEIGRTDRAEPSDP 98

Query: 61  KKIYGIDRFKKLGVTVFEGSTNPADVEVWLNMLEKCFDVMSCPQERK---------KEAE 120
           +K YGI+R KKLG TVFEGST+PAD E WLNMLEKCFDVM+CP+ERK         KEAE
Sbjct: 99  EKAYGIERLKKLGATVFEGSTDPADAENWLNMLEKCFDVMNCPEERKVRLATFLLQKEAE 158

Query: 121 GWWKSIIARRNDARTL-----------------------------------VAEYERKYT 180
           GWWKSI+ARR+DAR L                                   VAEYERKYT
Sbjct: 159 GWWKSILARRSDARALDWQTFRGIFEDKYYPSTYCEAKRDEFLGLKQGSLSVAEYERKYT 218

Query: 181 KLSRYAEVIVTSESDRCCRFERGLHFEIRTPVTVIAKWTDFSQLVETALRVELSIVEEKS 240
           +LSRYA+VI+ SESDRC RFERGL FEIRTPVT IAKWT+FSQLVETALRVE SI EEKS
Sbjct: 219 ELSRYADVIIASESDRCRRFERGLRFEIRTPVTAIAKWTNFSQLVETALRVEQSITEEKS 278

Query: 241 AMELSRRVSTTSGIRGREQQRFTPGVNVSSCQDFKRRSGGKSLRQMS------------- 300
           A+ELSR  ST SG RGREQ+RFTPG+N+SS QDFK RSGG++ R +S             
Sbjct: 279 AVELSRGTSTASGFRGREQRRFTPGINISSRQDFKNRSGGQASRNVSYGSVFQRQSQRIP 338

Query: 301 -----SVAKPRTGQESVASESRRTPCVSCGTSHRGQCLVGAGVCYQCGQTGHSKRACPQL 327
                S  + + GQES+AS  RR PC SCG +HRGQCLVGAGVCYQCGQ GH K+ CPQL
Sbjct: 339 SQPIRSTVRSQPGQESIASTVRRIPCTSCGRNHRGQCLVGAGVCYQCGQPGHFKKDCPQL 398

The following BLAST results are available for this feature:
BLAST of MELO3C035074 vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_016901625.1  9.1e-124  70.22         PREDICTED: uncharacterized protein LOC107991320 [Cucumis melo]
KAA0051980.1    1.6e-120  69.19         DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] >TYK04577.1 D...
KAA0037581.1    3.7e-117  67.62         reverse transcriptase [Cucumis melo var. makuwa]
TYK03091.1      3.7e-117  67.62         reverse transcriptase [Cucumis melo var. makuwa]
KAA0066849.1    3.7e-117  61.08         DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]

BLAST of MELO3C035074 vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A1S4E0X6      4.4e-124  70.22         uncharacterized protein LOC107991320 OS=Cucumis melo OX=3656 GN=LOC107991320 PE=...
A0A5A7U9X4      7.8e-121  69.19         DNA/RNA polymerases superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7U2V7      1.8e-117  61.08         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold37...
A0A5D3BS67      1.8e-117  61.08         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold50...
A0A5D3BHI1      1.8e-117  61.08         Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold11...
InterPro
Analysis Name: InterPro Annotations of Melon (DHL92) v4
Date Performed: 2022-08-06
IPR Term   IPR Description                     Source       Source Term    Source Description                                           Alignment
None       No IPR available                    COILS        Coil           Coil                                                         coord: 363..383
None       No IPR available                    GENE3D       4.10.60.10     -                                                            coord: 272..306; e-value: 4.4E-5; score: 25.0
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction                                          coord: 15..36
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction                                          coord: 220..262
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction                                          coord: 1..41
None       No IPR available                    PANTHER      PTHR34482      DNA DAMAGE-INDUCIBLE PROTEIN 1-LIKE                          coord: 84..299
None       No IPR available                    PANTHER      PTHR34482:SF4  POLYMERASES SUPERFAMILY PROTEIN, PUTATIVE ISOFORM 1-RELATED  coord: 84..299
IPR001878  Zinc finger, CCHC-type              SMART        SM00343        c2hcfinal6                                                   coord: 281..297; e-value: 7.3E-4; score: 28.8
IPR001878  Zinc finger, CCHC-type              PFAM         PF00098        zf-CCHC                                                      coord: 281..297; e-value: 7.1E-5; score: 22.7
IPR001878  Zinc finger, CCHC-type              PROSITE      PS50158        ZF_CCHC                                                      coord: 282..297; score: 10.344996
IPR036875  Zinc finger, CCHC-type superfamily  SUPERFAMILY  57756          Retrovirus zinc finger-like domains                          coord: 269..299

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
MELO3C035074.1  MELO3C035074.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding