IVF0022031 (gene) Melon (IVF77) v1

Overview
NameIVF0022031
Typegene
OrganismCucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionBTB/POZ domain-containing protein TNFAIP1 isoform 1
Locationchr11: 109946 .. 131786 (+)
RNA-Seq ExpressionIVF0022031
SyntenyIVF0022031
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCAGACTAGACAATTTTCTAGAAATACAGTCAGCCGAGCCGCTTAACTTCCGAGAAGAGCCAGCATGGCGTTGTGTATCTCTATTCCTTCCTTCTCCGATTCTGTTACCGCTCCTTCCCTATCTCAAACTTATTTCAGTCTCAGTCGCCGGCGGCCTTCCAATTTCATCCATTTTCTTAATTTAAGCCTTCCCAACCGCCCACCATGTCTTTGCCGAGCTTCCAATTCTCAGCCTGGACCGTTCCCCAAACAGTCTGCTTCTGCTTCTTCTAAGAAAAGGAAGAAAAAGGATAAGGGAGATTCCAAGGTTTTTAATCCTAACAACTTCGAAGTTGTGGACGATTTTAGCTTCGATGACGCTGGACCTTCTAGCTCCACTTCCACTTCCACTTATTTGTCGTACCATCCTTCGACCTTGCCCAAACCGCCAGCTGGGTTTGTACTAGACGACCATGGAAAGGTCCTCATGGCTTCAAACAAGCGAATTGCTACCATGGTAACATTTTTTTCTCAGTTCCCTCATGGAGGATTTGTTTAAAATTGGTATCATTTTAGTGCTTCCCTGTTCACCTAACCTCTAGTGCCTTTTGATTTTCATGTATGAACGAGAAGTTCAGAATATGTTAAACCACCAATATTAATAGGATATAATAGGAGAAACAAGGGGAAGTTGGCACAGGATAGGTTAGTTAGAGATGTGGAGGAGAATTCGGAAAGAGGAAACTATAGTGGTGGGCCCAGCAGTTAGTTGGGATTGGAGAGAGATTTAGGCCGGGGAGATAGCAGAAACAATTGGGCAGAATTTTGGTGAAGTCAAGAGGACTATCACACTCCTTGAGAGACAGGAGGAGTAGTACACTGGTTCTGAATTGTCCTTGTTTTCTTGTTACATTGTTCTGTCTTTTATTTTGTTTTAAATTTGATTGCTGTATCTTTGCTTCAGTTAGGGAAGGATTGAGATTCTAGCCATGCTGTTTTTTGCCTTTTGTTTTCATCGAATATTGTAATTACGTTTGGATAGCAATAAAAGTAGAATCACCATACTGGTATTCTATCAATTTGGTATCAGAGCATTAGAATCTGGAAAAGCGTAAGAAGATGGCCAAGAAAATCGAAGAACTGTTGGATTTTGTGGAATAGGAAATTAGGGAAATGCGAACAGAGTTGAAGAAGCTATCCGCAATGGAGGAGAATATGTCCTCGATTTTGAAGAGTATTGAAAATATAAATGCGCAGATGGAGAAACAACAAACACAATAGCAAGCGATCCTGAAATACATCGAAGGAATCATTCGAGAGAAGGCTGTTACAACGGTTGAATTGGAAGGATCTCCGAGTAGGGGAATGGGAAGCGACTTTACATCTAAAGTGTTGACTGGGGAATCGAAGGGAGAACGAAAGAGGGAGGATGATAAGACATTTGATAGGAGCAAGTTTAAGAAGGTAGAAATGCCGATCTTCAATGGAACAGACCCTCACTCATGGTTGTTTCGAGCAGACCGTTATTTCAAGATTCACAACTAACCGAATTAGAGAAGATGTCAGTGGCTATTATTAGCTTTGATGGTCCGGCCCTCGACTGGTATCGATCTCAGGATGAACGTGAATCTTTCAAGAGTTGAGATGATTTGAAGCAAAAGATGTTAACAAGGTTTCGAACGATCAGGGATGGTACGTTGGTAGGCAGTTTTTTGACGATTAAACAAGAGACCACGGTGAATGAGTATCGAAATAGATTTGACAAATATCTAGCTTTGGTAGCATTCTTACAAACGATGGTGCTTGAGGAGATTTTCATGAATGGGCTCAGCCCATGGTTGAAGACCGAAGTTGATGTTTTGGAACCATAAGGGTTAGCCCAAATGATGAAGCTGGCTCTTAAGATTGAGAACAAAGAGAGTGAGGAAGGAATGTGGGTTGATCAGTGTGTATGAGAGTAAGTTCTAGTACAACCTGCCTAAGGCAAAGGAAGGGACAAAGACCAAAGCAATGGCAGCCACGACTAGTGGAAATACTCCAATGAGAACAGTCACATTCAGAGGAGTCACGACGACGGATAACTGAAGAGAAGGACCTTCCAAGCGTTTGACTGATGCTGAATTTTAGGCTAGGAGGGAAAAGAGGTTGTGTTTTAAGTGTGAAGAGAAATATCATGCTGGCCATCGTTGTAAAGCTAAAGAACACAAGGAGTTAAGAATGCTGGTAGTACGGGAGAATGAAGAAGAGTTTGAGATCATTGAGGAAGTCGGAGGAGAAGAAGTGGTGGATGAGAATGTTATTGAGGTAGGAGTAGTGGAGAATTTGAACATAGAGCTATCCATTAATTCGGTGGTGGGGTTGACCAATCCTGAAACTATGAAGGTGAAGGGAAAGCTGAAGGATGAGGATGTGGTGGTGCTGATTGACTGTGGGGCTACCTACAATTTTATATCTGAAAAATTGGTAACCAACCTAAATCTACCGTTGAAAGCTACAACCAATTATGGGGTGATTCTGGGTTCAGGAGAAGCCATTAAAGGAAAAGAAATTTGTGGAAAAGTAGAGGTATTGCTAGACAATCGGAAAGTAGTGGACAGCTTCTTGCCACTTGAGTTGGGTGGTGTTGATGTCATACTTGGTATGCGGTGGTTGCATTCTCTTGAGTGACTGAAGTGGATTGGAAGCACTTAGTCATGTCCTTTCAGCATGGAGGAAGAAAGGTTATAATTCATGGGGATCCAAGCCTCACTAAGAAGGGAGTGAATTTGAGGAGCATGATGAAAACTTGGGAAGGGGAAGACCAAGGGTTTTTGGTGGAATGCCGAGCTATTGAAGGGAAGGTACCAGTGGCAACCTTTTATGAAGAGGAACTTGAAACAACTGTAGATAATTCCATCCCTCCATTGTTAAAGAAATTTTTAGATGTGTTTGAATGGCCAGAAACATTGCCACCCAAGAGAGGGATAAAGCATCACATTCATTTAAAACATGGTACTAACCCAGTCAATGTGAGGGCTTATCGCATAGCTATCAGAGTAGATGGAAAGATCAGTTGACGAGATGCTGGCTTCTGGAATAATTCGACCTAGCACCAGTCCATATTCAAGTCCTGTATTGTTAGTAAGGAAGAAGGATGGAAGCTGGAGGTTTTGTGTTGACTACCGAGCGCTTAATAATGTCACCATACCGGACTAATTTCCAATTCCTATCATTGAAGAGCTGTTTGATGAGTTGAATGGTGTAAAGATGTTTTCCAAGATTGACCTTAAAGCTGACTATCATTAAATACGGATGTACTAAGAGGATGTGGAAAAGATAGCCTTTCGCACTCATGAAAAGTTTTACACCATAAGTACCTCCATATTACACTGAGTGGGATTCGACAATAGGTTTGAGTTAATGATTAGAAAACTTCCTTTTCTCGATCTTGATTCCTAGTCATGCCTTTTGGTTTGACTAATGCACTTTCTACTTTCCAAGCCATATATGAGGAGGTTTGTACTGGTTTTCTTTGATGACATCCTAGTATATAGCAAGGGATTGGAAGAACATATACAACACTTGGAATTGGTGCTGGAAATTCTGAGGGCAAACGAGTTGTATGCTAATCTGCGCAAGTGCAGTTTTGCTAAGGAGAGAGTGAACTGTTTGGGGCATGTTATCTCTGAAAAAGGAGTGGAAGTGAATCCCGAGAAGATTAGGGCTATTAGAGAATGACTTGCTCCAACCAATGTACGTGAGGTAAGAGGATTTTTGGGTTTGATTGGATATTACCATAGATTTGTACAAAATTATGGCAGCATAGCAGGAACCCTTACTCAGCTGTTGAACAATGGGAGTTTCAAGTGGAATGAAGAAGCCGAAACTTCCTTTGAAAAGCTAAAGACAACTATGATGACCTTGCCCGTGTTAGCAATGCCAAATTTCAATTTGCCTTTTGAAATTGAAATGGATGCATCTGGCTATGGAGTGGGGGCTGTGTTAACTCAAGCTAAACGACCTATTGCCTATTTTAGTCGAACTTTAAGCATGAAGGACAAGGCCAAACCGGTATATGAGAGGGAATTGATAGCTGTAGTATTTACAGTGTAAAGATGGAGGCCATATCTTTTGGGCCGCAAGTTCATTGTGAAAACTGATCAGAGGTCACTGAAGTTCTTGTTGGAAGAGCGTGTCATCCAACCACAATACCAGAAATGGATTGCTAAATTGCTTGGATACTCTTTTGGAGTGGTTTATAAACCAGGGTTAGAAAACAAAGCTGCCGATGCACTATCTAGGGTGCCACCTACAATCCATTTGAACCAACTTTCAGCCCCTGCCCTAATTGATTTGGGCAAAATACAAGAAGAGGTAGAGAATGACCCGAAATTGAAGGAGATTAGAAGTATAGTTGAGCAAGATCCAGAGGAGTTTCCAAATTTCACAGTGAACTAAGGAGTCTTGCAATTCAAAGGGAGGTTAGTAATTTCCAAGAATTCTTCTTTTTTACTTATCGTTTTACATACCTATCATGATTCAATATTTGGTGGACATTTTGGATTCTCGAGAACCTATAAAAGAATTGCTGGAGAATTGTATTGGGATGGAATGAAGAAGGATATAAAGAAGTACTGTGAGGAGCCTTTGATTTGTCAAAAGAATAAAACACTAGCTTTGTCACCAGCTGGATTGCTAACTCCCTTAGAAATACCGGTTACCATATGGACCGATATATTTGTGGACTCATAGATGGATTACCTAAGTCTACTAGTTTTGAAGTTATATTTGTAGTGGTAGATAGGATGAGTAAATACGCACACTTCATGGCACTCAAACACCCCTATATGGCTAAATCTGTTGTTGAACTCTTTGAGAAGGAAATTGTTAGATTACATGGATACTCGAGATCTATTGTTTCTGATCGGGACAGGGTGTTTGTTAGTAACTTTCGGAATGAACTATTTAAGTTGGCAGGCACAGAACTCCATAGAAGCCCAGCATATCATCCGCAAAACAATGGTCAAACAGAGGTGGTTAATAGAGGGGTAGAAGCTTATCTCTGGTGCTTTTGTGGGGAAAGACCAAAGGAGTGGACAAATTGGCTGCATCGGGCTGAATATTGGCATAACACCACCTAACAGCTCAATTGGCATTTCACTGTTTCAGGCTGTTTATGGAAGGCTTCCACCTCCTTTGCTCTATTTTGGAGATATGGAAACATCCAATTCTACTCTAGATCAGCAGCTCAAAGATAGGGACATTGCTTTGGGGACTTTGAAGGAACACTTATGCATAGCCCAAGAGAAAATGAAGAAACAGGCAGATTTGAAGAGGAGAGCAGTTGAATTCCAGGTTGATGATATGGTTTTCCTAAAACTCATACCATATAGACAACTATCACTGTGAAGGAAGCGCAATTAAAAACCGTCCCCAAAATATTTTGGTCCTTACAGAGTGTTAGAAAAGATAGGTCCAGTAGCTTATAAGCTGGAATTGCCTAGTACAACAACTTTTCATCCAGTTTTCCACGTTTCCCATCTAAAGAGGGCATTAGGGGATCATACTCAAGTACAACAGCTTGATTCTTACCTAACAGAAAACCATGAATGGATGACGCAATCGGATGAAGTATATGGGTACAAAAAGAATTCTAACACAAAGGACTGGGAGGTGTTGATAAGTTGGAAAGGGTTACCACCCCATGAGGCCACGTGGGAAGACTGTAATGACTTCAAACATCAGTTTTCTGACTTCCACCTTGAGGACAAGGTGGATTTGGAGGAGGAGAGTAATGTTAAACCACCAATATTATTCACATATAATAGGAGAAATAATAGGAGAAATAAGGGGAAGTTGGCACAGGATATGTTAGTTAGAGATGTGGAGGAGACTTCGGAAAGAGGAAACTATAGTGGTGGGCCTAACAGTTAGTTGGGATTGGAGAGGGATTTAGGCCGGGGAGATAGCAGAAACAGTTGGGCAGAATTTTGATGAAGTCAAGAGGACTATCTCACTCCTTGAGAGAGAGGAGGAGTAGTACACTGGTTCTGAATTTTCCTTGTTTTTTTGTTACATTGTTCTGTCTTTTATTTTGTTTTAAATTTGATTACTGTATCTTTGCGTCAGTTAGGGAAGGATTGAGATCCTAGTAGTGCCGTCTTTTTGCTTTTTGTTTTCATCAAATATTGTAATTACGTTTGGATATCAATAAAAGTAGAATCACCGTATTGGTGTTCTATCAGAATACATCTGTAATTTATAGGCACTCAGTTATTGGATCTAAATTAATCAAACTACATCTGTAATTTATAGGCACTCAGTTATTGAATCTAAACTAATCAAACTAAAAGTATGCAGTTAATAAATCGAATTAATTTTTCTTATTTATGTTCATCCCATTTACCCCTTTACAGCTGGTTTTCTCATTATTTATGAATTGTTTTATGTATTTGGGTTTGCTAATCTTTGATCAACTGTACTGCCACAAGTGTATGACTGGTTTTTTCATGTATTCAGTTTATTGCTCTAGCTTTCTTTCATAGATATAATGAAAAGATACTGATTCTCGGAAAAATAGAAGAGACGGTCACCAGTGATTTAATACAGTATATAATCTCCAGTTGGAGTACCGTTTCCTGCTTCTTTTCTTGTAATTATACTATCAGATCAACCTCAATTGGGGAATCGCCTTCCCCTAGCCTTAGGAGTGATTTATTATTATTATGATTATTTTACGAATCCTGCCTGTCCTTATCAAAGAAAGGAAATTATATCATTATTCGTTACTAAATCCTTTACAATTTTTATCATTGTTGTTCTTGTATTTGTGACTTAATTAAATTCCTTACTTTTTTCTTTTTCTTTTTGCTTTTGCATCTTCAAATCTTTTAATTTGACAGGTCGATCCTTTAAATAATTTGCCTCTGGAATGCGTTATAAGAAGAATTTTTAGAAGTTCAAAAGGGGATGATTGCATGCTTCTTTGCCCCGTGGACACGTGAGTGTTGTTTGTTGCTACCAGATCATTATGCTAGAGCCTCGACACTTTGAAAGTTTGACCTTTTTTTTATATTATTTATCTCAAAATTGCTGCAGGCCTGTTCAAATACTAAAGAGCAAGAATATTGATGGATGGTCAGCTGTATGCTTTCTTTTCATATGGTTAACTTCCTCGCTAGTTTTAGTTTGTTCATGAACTTGAGTGCGGATGGTTCTACAATATTAATTCACATCATGAAACTTTTATATTTCGCATTTCTGTATGCTGTATAAGTAAGGAGTTCATGCATGTATTGTTCATTTGTTTGTCCCTTCTACCATTTTACCATTTTGTTCTATTTTTTCTAAAAAGGAAACAAGTCTTTTATTCATTCAAATGTAATACAAAAGAAACTAATGTTCTAAGTACATGAGGGTTATACAAAAAGTGTAAAGTATAAAGCATAACTAGGATCAGTAGGCGCATCCAAACATCTCAACTAGGTTGACATCTCCTTAGTGCCCTCATCATATCCTTAAAAATAACATCAAAAATCAATCGAGATTTATAAATTTGGCCATTACGGATAAGACAAAAAGACTGTTGATCACTAGTCTTACAAAGGATTTAACAAAAAGCATACAGAAGGCTGGAGACAGAAGTACAATAAGAGATTTAACTCAAAACAAAATTAAGGTGGGAAGATAAAAAGCTCTCCAATATGAGTAGATTTCTTGAATGCTTTAGCTTTCAAAAGGCTTGGAGAAGGAACACCACGAGGAGGCATTCTTACAGGTCAAAAAGACTATTGATCACGTATTTTAGTATTTTACTTTACCCTTATTTCCTTTTTTGTATTTGGACTTACTAGAGCCTTAAATTTTTTTTTCTTTTGTAACTATCAGTGTGTATGAAAATAATAAAATAGATTCTCTATCGTAGTTTTTTCTCCTTGTTCTAGAGTTTAAAACGTAAATCTTGTGTTCTTTTTTTTTTTACCGTTTAACATGGTATTAGAGCATGGTGGCGAAACTCTAGCTGCCATTGACGAAAACCAACTGGTGCAAGAAAACCTTAGTTCTGCACCAGTCTCGCTTCAACCACATTAAGGTTGTTTAGGCGATAGGCGAGACGATGGGGGCCATACTGCCTTGCAATTACCCAAGGCGAGGACCTTCAATGAGGCACTTGCTTTTTGCGCCTTTCAGTCATCTTGAACATTTGTTCAAGACGACACAGGCGCTTAACTTTCTTCTTTTTTATAACATGTTTTGTGTTTTTTGTCTTCTTCAAATTTTCTTCTGAAGAACGAAGGTTTAATAAGATGAGTTAAATAAAAACCAAAGAAATGAAGAAAAAGAAAAGCAGAACTAGTCATCTTTTTGCTTAGCCATCCATCGGTTACTTGTCAAAAATGATGGAAAGAAAGAAGAGGGATTGGAAGTCAGAACGGAAGAAGAGATATTATGTTCTCTTTTTATTTTTAAATTTCCTAGAAGAGAATTGAAGGGTTGGCTGCCAGAATTAAAGTAATGTTGGATGCATGCCAACCCAACGGCCAAATGGCAAAATTTCTACCCCATTACTCAAAGAACTAATTTTTACCCATTTAAAAAGCATTTATGTTTCTATCTTTTATTAAATATTCTTTTTCAGTACTATTTTAGTAGACTTTTCTTAATAATTTTGTTAAAAGCAGTAAAGTACAGTAGACACTGAGTTACGTGGAAACTCAAGTACCGGAAGAAAAACCACGATTGACTTGATTATTAATTTCTAATATTAACAAAAGATACAAGAGGGAAGATAATTAAGTTACAAGTTATGATAAAAAAGGAAAGGATGTAGCTATCAATAAGATCGGACTTACTAACACAAAAGTCAAAGTTTTGTTTGAAAAGCCCCTTGGTAAGAACATCAGCAACTTGTTGACTCGAGGGAATGTATGGATTGATATGCTACCATTGTCTAGTTTTTCGTTAATGAAGTGCCGATCAATCTCCACATGTTTAATTCTATCATGTTGGATTAGATTGTTGGCGATGCTAATAGCTGCGTTGTTATCACATAATAATTTCATCAGCACCTCACAATCCCGATGAAAATCAAATAAAACTTTATGGAGACAAATTTCCTCACATATCCCCAAACTCATAGCCCTATACTCTGCTCGACACTGCTTCTAACCTCAACCCCTTGCTCCTTACTCCTCCCAGTTAAAAGATTGCCTCATACAAAGGTACAACAATCGAAGGTGGATTTCCTGTCAACAATAGACCCTGATCAGTCAAAGTGAGTATAGGCCTCAATAGCTTTCCTGTATGTCTTTCTGAACATCAATCCTTTACCAAAAGTTGTTTTTTAAGTTCCTCAGGATTTTGTTAACTGCCTCCGTGTGTTCCTCATAGGAAGCTTGTATGAACTAGCTAACTACACTTACAACATAGGAAATATTTGGTCGAGTATGAGACAAGTAAATCAACTTACCCACAATTTACTGATATTGTTCTTTATCAACAGAAACTTATCACCTGAGAATTTCCTAGTTTACAGTTGAATTCAGCGGGAGTATCAACAGGGCGACATCCTAGCATACCTGTCTCGGTCAACAAATCAAGGGTATACTTCTTATGAGATACAGAGATACCTTCTTTAGATCTAGGTACCTCCATCCCAAGGAAATATTTTAGATTTCCCAAGTTTTTGATTTCAAACTAATCACCCATCCTCTTTTTCAGTTGGATGATTTCAGCAGTGTCATTCCCTGATAAAACGATGTCATCAACATAAACAATCAATACTACAATCTTGTCAGGGTTGCAAACTTTTGTAAAAAAAGTGTGGTCAAAGTGCCCCTGATTGTACCCTTGAGACTTGACAAAGGTAGTAAACCTATCAAACCATGCTCTTGGTGACTGTTTCAGTTCATATAGAGATTTATGAAATTTACAAACCCAATGACCAAACTGGGGGGGGGGGGGGAGGCTCATATAGACTTCCTCTTCCAGGTCTCCATTCAAGAACGCATTCTTAACATCTAGCTGATAGATATCGGTCTTTGTTCACAGCAACAGATAATGAGACTCTAACTGTATTTAGTTTAGCAACTGGGGAAAATGTTTCCGAGTAATCAACACCATAGGTTTGAGTAAACCCTTTTGCAACTCCTTGCCTTGTGTTTGTTAAGGGTTCTATCTGCCTTGTATTTGAGTATGAGCACCCATTTACATCTAACAATTTTATGTCCCTTAGGGAAAACATAGATCTCTCGAGTCTTATTCTTTTCAGGGCCTTAATCTTCTGTAACAAAAGATTTTCACTCTGGACACTCTAAAGCAATGTGAATATTTTTCGGTATCACAATAGAGTCAAGGCTAGCTGTGAAGGCTTTGAACTGTAGAGGTAGACTATCGTAGGAAACATAGTTACAAATGGGGTGTTTTGTGCAAGACCTGTTACCTTTTCTTAACGCAATAGGAATGTTCAAAGAAGGATCATACTCATCACTCATCAGGATTATTTGAATGACCCTGTTCAGCCTTATTACTACTAGTCCTGCTTTAACCTCAGTCTCATCAACACTATCATTTTCTCCCATATCTTCAGGAACAACAGTATCAGACCTGTCATTCTCACTCATTTTACTATTAGTGCACGATTCAATAGGATTTGTCATACCTTGATCTTGAGGAGGTTCAGGGTCTTGGACTGGAGCCCGTTGATCAGTAGGGGACTCAATTTCCTTTATGAGATTTCTTCTATAGTAAATTTTTCAGGGAACTTGGTTTGTGGGTAAGACCATGGGATGAGTATTGGAGTTAGGTAAGGTAATAGGAGTTGACATCCATAATGGCAAATTATCTACAGAAAGGAGGATGGAAACATTTATAGTCCCATTGATGAAAAGGATACCCAACAAACACACACGCCTGAGCTCGAGGGGTAAATTTGGTATGATTAGGGCCAATGCTATGAACATAGGTGTACACCCAAACATACAAAGGGGAACCTCAGAAGTTAGGCGGGTAGAGGGATATGACTTTTTGAGACATCTAAGGGAGTCTGAAGGTGGAGGACACAAGAAGGCATCCTATTGATGCGATAAGCTGCATTAAGAACAACATCTCTCCACTGATATGAAGGAAGGGAGTAGACAACTAAAGGAAACGAGCTACTTCCAAAAGGTGACAGTTTTTTTCGCTTGGTAACCCCATTTTGTTAGAGGTGTAAGCACATGAGCTCTAGTGAACTATCCCCTTGGAGGCTAGGAACTTAACAAGAGTATGGTTTTGAAACTCTCGACTATTATCACTTCGAATAATTACATTTTTTGCATTGAACTGCGTTTCAATGGTGTGATAGAAGTTCTGGAAAATGGAGGAAATCTTGAATTTATCAGTGATAAGAAAGACCCAGGTAAGACGAGTATGATCATCAATAAAAGTCACAAACCAGTGTTTTCCAGAGGAGGTAGTGACCTTGGATGGACCCCAAACGTCATTATGGATAAGGGTGAACAGTTAGGTTGGTTTATATGGTTGTGAGGGGAAGGAGACCCGATGTTGTTTGGCCCGAATACACACATCACAAGTTAAAGAGGAGACATCAACTTTAGAGAAGAGATGAGGAAATAAATACTTCATATATGTAAAGTTTAGGTGGTCCAACCGGAAATGCCACAACATACAATCTTGTTCTAAAATAGTAAAATATGAAGACAATAAATTAGTCCTAGAGATACTACTAGTGGAGGTATCATCATCAAAGAGATAGAGTTCCCTGCTATGTCGGGCAATGCCAATCATCCTCCTCGAGCTCATGTTCTAAAAAGAAACAGATTCAAGTAAGAAAGTAGCTTTGTAGTTCAGCTCACGAGTGATCTTGCTTATAGAAAGCAAATTATAAGAAATTTTAGGCACATGCAAAGCATTCTGAAAGGATAGCCCTAAAAAAAAATGTCCCTTCCCAGCAATTGAAGTTGAAGAACCATCAACTATTATGATCTTTTCATTACTAGCACACATAATATAATAAACAAAATTCTCAGAGGAACCTATCAAGTGGTCTATAGCCCCCGAGTTTAGGATCCAAGGATTCTTCCCATCAACATTGATAAGACTAAGGGACTGAGACATATCTGACTGAGCAATGGCTCCTAAAGTGGAGGGTGGAGTGAGACCGTTCTGGTTTACAGTAGGGCCAGATGGTTGAAAAGTGCTAGCATACTCACTCATATAAGCACACCCTGAATTTTGTTTATCGTTGGAGGAACGTTTCTTACCTTTGGGGGGACAACTATGAAATTTCCAACACTGATTCTTAGTGTGTTACCATTTCTTGCAATGCTTGCAAATAGGGATTGGTTTCTCGTTATTTTTTTCACTGTCATGGGTCGAGGATCTAGCACTGAAGGTAGCAGAGAAGTAGCAGGAGGAACACTCATGGCATTCGTACTGTCTTCTTCAAGACGGACTTCATAACACATTTCTATTAAGAAGGGAGAGGTTTTTGTCCAAGTATAAGACCACAGACAATGTCAATTTGGGGTTGAAACCTGCAAGGAAGTCATAAATCCGGTCAGGTTCTTCAAGTCTAGCAGTGTATAACATCATACGGCGTATCCCATACCGTCTCTCTGCACAGGTCCATGTCTTGCCAAAGTAGAGAAAGTTCACTGAAGTAGGAGGTTACATCTAAAGTTCCTTGCTTGAAATCATGGACTTGTTTTCTTAGTGTATACAAATGAGAAGCTTTCTGACACTTTGAATACAATTTCTGAGTTGTATCTTGCATAGATCCTTCGCAATTGCTGCGTAAAGCAAAAGCTTGCCTATTTATGGTTTTATACTATTAATCAACATGGACTAAATATACGAGTCCTCCCCTCTTCTGAAACATTCTAAGATATCACTGGGTGGGGGACAGAAAGTCTTCCCTATCAGAAAACCGAATCGGTGACGACCCTTGAGAAACATGTTGATCAATTGAGACTAGTAAAAGTAATTCTAGCCATTTAGCTTCTCTCCTGAAAAGGCATCTATAGATGGTGCTAGAGAACTCATTATATAATTTGAGGATAAGTTAGGGAGTGAAGTTTCCAGGTCCTTAGAATACATCGACAAGTCGGTTAGTTTGGATTGTCGAAGATTCAACAACTTCAAACCCTGATTTATTATGGGCTTGATCGATATCGGAACCATGGACGTGCAGTTGTTATAAAAATCAAAGGATAGCAGTGTGTGAAATGATGGCAGCCTATAAATCAAGGGCGGCGACTGCGCATGGAGGTGTGGCTAACCAGAAGGGTATTGCAACTGGACTTCTTAAAACAAGTGGGAGTAGACCGGCAGAGCAGTGGAGGCATTGGGCCATGGCGTGCGGGCGGTTAATTGAAGTGCGCGCGACTGGTTTCTGCCGCTGGACTGCACGACTGACTCCGAGGGTAGGCCCATTGTGAATGCCGACGTCTTCTGAAGCTGGTGGAGCAGCTTGGCCATGGTGGTGGCAATCTCGATGGCGACTGTCTTTGTTTGGGTTTCATTTACACCTGGGTTTAGGGTTTCGTCTTTATCTCGCTTTGATACCATGTTAAAAGTGGTCAAGTAAAGTAGACACGACTTACTTGGAAACCTGAGTACCAAATGAAAAACCACGATTGAATTTCTTATTAATTTCTCGTATTAACAAAAGATACAAGAGGGAAAATAAATCGGTTACAGCTTACATGATAAAAAAAAAAAGGAAAGGATATTAGAATAAATCATTCCTTGGGCCAAGCCCACTAGTTCTAACAAATTTTGTTTATATTTTAACATTTTAAAAATCTAACTATGTTTTTTTTATCTATAATTGTCCAACATGATATTTATTCATTATTTATTAAGTATGCCTTGCTTCAGTTGGGCAATGCTTTTTTGTCATTTCGCACCTTAAGGCGATTAAACAGCTTGTTGACTTAAAGTGCACCTTTCGTTCTGAAAACACTGAGCTATAGTTGTTGCCGCTATTGTCGTCGATTTTTTTGCTGCCATTGTTGTCCAAGTTGTCATTGATAGTTATCTTTAGTCCTTGTGTATCCATCCAATTCTAGCGCTGCCACAACCTTTGGTGGAAACCCTAGCTGAAAAAAACGCCATGAAACCGAGCGGTGAGTCATCCACCTACCACAGGGGTAAGTTTCCTACTGTGGCAACAAGCTTCTCACAGCAACCGAGTCTTTCTCATCATCCGTTGGCTAGTTATGAACATCAAGGTTCTGTTGATTTATCTTTGGTAATGGTACAACAACAGTTAGCTAATCTTTAGACATGTTCTCAGCAACGAATTGTTGCTCCTGGGGCAGCCCTAGGTGCTTCCACAAACATAAATTCCAATACAATGCATTATGACATTCTTTTCGGTCCATCGATGTATCCAAAGAATTCAATATCCTCTATCCCTACTTTAACTACTGCAAAGTGTTTGTCTAGTTTTATGGAAAATTGCACAGGGATAATTGGAGGAGACAAGTGGAATGATCATAATTATTTCTCTTAGTCTCAGTCCATTAAAATGGTCCTTGAGGGGTGTTACAAGTTTGGGTATCTAACTGGTGAGATACCTAGACCTAGACTAGGGGATCCTTAGGTGTGCATTTGGAAAGGAGAAGACTCCCTGCTTCGATCACTGCTGATTAATAGCATGGAACATTAGATAGGGAAACCGTTGTATGCGATGACTCCTCGAGATGTCTAGGATGCAATATAGAAGTTGTATTCTAGAAGACAGAATGGTCTTTACCCTTTGCGTAAACAAGCTCATGAATGCAAACATCGTGTTTTTGATTTTCTTTTCGGCTGTGGCACTCTTTTTTAAACAAGCTGTCCTTAATATGGCAAAAGATGGATTTATGTCGTGAAATCATCTAGGATTGTCCATGTGGAGATGTCCAGTATTCCAAGATTAAGGAAGTTGATTGTGTTTATGATTGCCTAGCTGGTTTGAACTCCAAGTTCGATGTAATGTGAGGCCATATATCGGCTATGAACACATGGCTGTATCGGCTACTAATTATGCTGCTTTTAGTGTGGAATTATCTGATTCTTATAGTGGCAAGCAAAATGGGAAACCGACCTCGGTGTCATGCCCAAGGAAATATTGTCCCCGAGTGCTCCGCTGGCTACGATCCATGGATCTGAATCGACACATGCTTGAGGTACTATTGGCTCTGATAGTATTAACTTGTATGTTGAGGATGGTATTGTTGATTTGGGTAAGAATTACAAGACATATATGTTAGTTTTAGAGTATAACGGGGTTGGAACTAGTGATGAAACTTTGATTAAAACACAAGAGGGTGAGACTCTTCCTCAGGCTAGAACAGGTAATAAGAAATTTATTACAGATGAGGAATTAGACAAATCGGGGGATTATGATATCTCCCTTGACATGCCCATTGCATTAGCAAAAGGCACCAAATCCTGCCTCATATATAGTTTCCTGTCTTACAATAATCTGTCTTTTTAGTTCAGCAACAATACCGTAATACATATGACAGTGGAAATTCGTGAGTGGAATATTGCCGTTATGGAAGAAATGGGAGCTCTTAAAAAGAATAATACATTGGATCTTTGTGCTCTTCCTATAAGGCATAAAACAGTTGGGTGCAAGTGGATGTTCACTCTAAAGTATAAATCAGTTGGAACTCTAGACGGTTACAAGACCAGACTAGTAGTGAAATGGTTTACTCAAACTTATGGGATAGATTATTCTGGGACTTTCTCCCATGTAGCGAAGTTAAACATGGTCTGAATCCTCCTTTCTGTTGCAATTAATAAAGACTAACCTCTCTATCAACTTGATGTGAAAAATACATTTATGAATGGTGAACTGGAAGAAGTTTATATGAGCCCCCGGTTTTGAAGCTCTATTTGATCATTAGGTATGTAAAATCCAAAGTCTTTGTATGGTTTGAAATAGTCCTCGAGAGCATGGTTTGACAAGCTACATTTGTTGAGTCGCATGGTTTCACTTAGAAATCACGTTTACAAAAAGGTCAGTATTTGGGAAAATTGATGTTTCAATTGTGCATAATATTGTCTTGTTGGAGATGATATTGCTGAGATCACCGGGTTGAAAAAGAAAATGGTTGATGAGTTTGAGATCAAAGATCTAAGGAATCTAAAATATTTCCTAGGAATGGAAGTGACAAGATCAAGGGAAGGGATCTTTGTCTCATAACGGAACAACACCTTGGCCTTCTTTTTCTTGGAGAGTCGAATCGGAGCAACGAAGAGTTGAATCTTAGTGAATCATGGTAGTAAGGCCACTCTGCCACTTACAATACCACTTACACCCTAGACCAGGTATGACTGGATATAGACTTGCTGATGCTCCTATTGAATTCAATGCGAAACTGGGAGATTTTTTTGATTAAGTTTCTGTTGATAAGAGAGAGGTATCAACGTCTAGTGGGAAAGTTGATTTACTTATCTTACACTAGACCAATATTTTCTTTGATCTCAGTGTTGTTAGTTAGGTTATGAGACACCATACGGAGAACATGTGGAAGCTGTGAACTAAATTCTGAGATATTTGAAAAGCTAACAAGTAAAGGTCTGATATTTAGGAAAACTAACTGGAAATGCATTGAGGCTTATACTAACTCTGATTGGGTAGGATCAGTTATTAACAAATATACCTATAGTTATTGTACCTTTGTATGGGGTAACTTAGAAACATGGAGAAGTAAGAAGCAAAGTGTTGTTGCTAGAAGCAATGCTGAAGCTGAATACAGAGCCATGAGTTTAGGAAGAAATCTTGGTTTTAACTATGAAGCTCTATTGTGATAATAAAGCGGCTATAAGCATAGCCAATAATCCTATACAACGTGATAGAACAAATCATGAGAAGATTTATAGACACTTTATAAAAGAGAAACTAGACAATGGTAGAATCTGTATTCTGTATATTCCTTCTTGTCAACAATTTGATGTTCTCACCAAAGGTTACTCAAACCAAGTTTTGACTCATTTATTAGCAAGTTGGGTCTCATTGACATCTACGCCCCAACTTGAGGGCGAGTATTGAAAATAAGTGGCTTTGTTATCAGTTTATAGCTCGAGGGTATTTTAGTATTTTACTTGACCCTAAATTTATTTCATTTTTTGTATTTGGGTATGCTAAGCCTATAGAGTTGTCTTCTTTTGTAACTTTTAGTGGGTGTAAAAATAATAAAAGATACCCCCTATCGTGGTTTTTTCTCCCTATTCTAAGGTTTTCCATGTAAATCTTGTGTTCTCATTTCTACCGTTTAATAAAATTAAATTAAAAAAGAAAAGGTTGACCTAGTGATCATTAGCAGCAGATGGAAATAGTAAAGATCCTGACACTTAAAAAAAGGCAATAAAGTACTGCACATTTCATTGCTACCCTTGTCTTTTAGATGAAGTATCACAAACAAAAACAAATTATTAGCTTGAAAATCATTGCCCCACTCTTCTTGCTACTCAAATCATGGGGCAGCCTGGCCAATCGCTCATTCTTGATAAGTGTCACATCAATTACACTATGGGGAAAAAAATTGTTCTTAAAAATCTTCAATGTTTGCCTAGTGTTAGAATCGATATCCTTAATAGTCTTCTTGCTTTGGAGACTATTTGAAGAAGCCTGGTTGAATTCGGTTCATTCTTTGCTCCGAGGGGTAGTAGCCAGAGCTAGCTGGTATATATCACTTTTGGCTCATTGAAAGAACAAGTTTTTCCAATTTTTATATTGTTTGGATAAGGTTATTTTTGAGAAGGCTTTCTCCTCCAAATGTACTCTATTCTGTCTCTGCCTTTTTTTTCTTTTTTCTTTTCTATCTTTCTAGCTCTCTTTTTATTTATTTATTTATTTATTTTATTTTTAAAGATTGAAGTGGGATCTTCCTTTCCTTATGCTTGGCTGACCTTATAGTTATTGTTTGCGGACTTGCTGTTTCTTAATGTGAGGATCAACGTTGCTGTGGTTTATCTAAGGAGAGCATGTAGTCTATGGGTCAATTTGTTAATTTTTTTTAGGATTTTCAACATAACGTGTGATTTTTTGATGGATACTTTGTGGATGTCTATTCGTTCCTGCTGGGAGTTCATTGTTTTGATTGATAGTTTAGGGTGTTTTATTTCATTGTTACGTAATCCAGATTTTTTTTGGTGGGTGTGGGGGGGAGGGGGAGGATGTTTTTATTCTGAACGTCAGTTGTAAACCCTCTGTAAATTTTCAGTTTTCATAGATTCAGTAAAAGTTCTGTTTCCTTTACAAGAAGAAAAAAATGGTCTCCATGTTCTCTGTCTCAGGTGAGCGATGATGAAGTCGAAGCTCTTTTGCCTGCTGCTGCTTATGCTCTTGCCAAAATACACATGCATTTGGTCCACAGCGGGTATGATATATTTTATACCTTCGTTTAAACTTTATGAAAACAATTGGTCTACTATCTTGGTACAGCATATGATTTCTTGATATCTTGTTAGATTTTGCTATACGGCGAGGGGAGCATTTTGCTATTCAGAAGATGACATTTTTGATTTCCGCACAGGTACTGATATTAGGTTCCTAAAATTTAAGTGATCATTTTTAGCTTAAAACTATACAAAATGCCCTCAACTCTAGGGTTGGTTTCACTTACTCTTGAACTTTCAAATTTTGTATTTTACCCTTAAATTTTTTAAATTGCTTCAAATTAACCCTTCAGTCAACCGTTTACTAAAATGAAGACAGAATCTACTAAGTGAGTTCTAAAATTAGAGGTTCACATGGATGTTGGTGGCTAGTTTCAATTATGCCGGTGAAAGGTTGGGATGTCTTTCTACTATGCTTCCATGATTTCAAAAGTTGCGTCTTCACCCTTAACCAAAATTGATTCAAAACAATCATGCAGTTAGTTTTTTTTTTTCTTTGCATTTTTTTAATTAGAAGATTTTTTGGAAGGCTTTTCTATTTAAAATATGTGAAGGCTTGTTTGATTCTATTTTGTAGCTTTGTTGTGTCAAACTAATTAATTACTTATTAGAACTTGAATATTATTGTGACTTGTGGTGTTGGATCATTAGTCATCTTATCTATCGTCTTTTACATGAATTGTATGGTTGTCATTTCAATGTTTTTAAAGGAAATATTTCACATTAAGCATATGAACAGCATATATCTATTTCTATGAAGAAAAAGATATAATGACTCCTTGACGTTTTAATGTTTTACTTTGTTAAAAAATTGATGTGTTGTGTCGGTGTCATGTTGTATTTGTGTTCATGTCTATGCTTGTGCTTCTTAGCCAAGACCTATGCCTAGATAGTTTTGTCAAAGTTTAAAGGATTCAAAATTAGTACAGCAAGAGAATCCAATAAATGAGAATTTTACTTGGATCAATTTTGTGAGACATGATTTGTTCTGGGTTTGAAAGGTCAATGAAGTGGTTTCCTTGGATTTTAACAATTTGTTGGTTGTATCCAAATTATTTTCACACAATAAATCTTTTTTGGAAAGTTCTGCTTGAAAGTAACTAAAGAACATTTCACAATATTAAGCTTTCCTTTGAAGTCTCTATTTAGACACACCAATATACAATTTTAGAATCTCTCTTATTTAATTATTAATAATAGATCTTCAAAACAAAGTACAATTTTAGAATTATCTACTCATTTGCACAGAAATAAATATATTGATACTTAGGTAATAGATTTCATTGCTTCTGACTATTTTGTATGTAAAAATAAGGGATGCCGGACTGCCGAACTACAGTTTACGTTCTATGTTCATTCTTTTGCAGATGATGGACAAGATGTAGATGGCTTGCCCAATGAGGGTGTAGAGATCACTTGCTTTCATATGGTAATGTGTAATTTAAGGATTTAATTATAATAATGGGCTTGGACTTCTCTTGAATGCCTCTTTTTTTGTAGCATTTTTTTTGTTCCTTGTTGGTTTTAGAATCTGTTTAAATTGTCACTGAAACATCAAATTGATGACATTTCAGAAAATAGGTAGTAGTGTCAATAACTTCTTTCTTTCTAATAGATTCGTATTTTCATTCATAGCACTCAATGTTTCATCTTGGTTTTGTATTTTAACCTGGGCTTCCGTTATTCTTCTTCATTTGATGCGGTAGACAGTGGTGGATCTAGAAAATATCTTGATAGGGGCTAAAAGAAAATAAACTACAAGGTAAGATTTGAACCTGGGTTGGAAGGGAGGCTGATATTCAAAGCAACCAGTAAGTTAACTATAAATATTAAATATTAAATAGTTTTCTTTATATATATTATATAAAGACAATAAAGTTGAGGGGGGCTCGAAGCCCCTGGGCCTATGCTAAATCCGCCGGTGGTGGTAGATATTGTGCTATAATTTTCTTAAGAGCAGCACCTCCCAACCCCCCAAAGATAAAGAAACAAGAATCGAAGGTGATACCTGGGATAGGAAGCATTTTGTAACCTTTGCCTCGACGATGTTCTAATCCTTGATTCTTTTTCAAGTATTGGTAAACGAATAATATATATTCCGACAACAGAGCATGCATATGTGGTTGCAGTATGTCCATATTGCTTGTTAGTTGATTGTCTGTTCTACAAAATACATCGTGCAACTTATTTTTTTTTAAATTACATTTCCTTTGCCTTTTGATATGTAAAAATCATGACAACCATTCTATGGATAATAACATACGCTTTATGCTATGATGTGAACATTATTTCTCTAGGTAGCTGAAGTTCCTCAGAGGTCTCCTCTGTTTCAGATGGTTGAAATTTGTGATTTTATTGCCTCATGATTGCTACACTTTTTGTGGGTTGCAGGATGGAGCACATTACATGATTTATACGCCATCTGATCCACTTCTTTTTGTTGCTATCAAGGTACCTATGAGTTAGCTATACACTGTGCGATGTTATACTTCAACTAAAGCACTGTTATTTTGATTAGGATCTCCCTTTAAACACTATAATGTTAAATCTATGGTGCTGGGTATCGTTTTTCCAAATCAAATGTCGTTAGGTGGCTGGTGCTTCTATTGAGCAATGATTAATCAAGTGTATCTACTCATTTGTTTATGCAGGACAAGCTTGGCCAACTAACAATTGCTGATGATGTAAGACATCTCTTACGTTTTATGAACACTTTCTTATGCATTATGTTGTTTGGAAGTTCTTTAAAAATCAAACTAGAAACTACTGCATATGGTCGGTGTGGTTTATACATCTTTATTGAAGAACAATGTTTTTTGTTATTGTATTGGTAAATAGTAGCTTCGAGGTTGGCTGATTTTTATCAAAGAACAATGATTTCTAGTTTAAGAAATGAATAGATTTATAATAATTGAACAGGTTTAAATTTGTAGATCATATTTGGTGAATGAATGCACAAGGGTTGAGGTAAAAGTGGGGAGCAGGGAAAGAGTAAATGACCAATTGACAATTCATTTAACCGACTTTACTGAATATTGTGGAAACAGGAGCTGTTGGAGGACCCAGCAATTATAAGTGCCATCGACGAAGAGACTGAATTTAATGCATTGGTGGTATGTGAAATGTGTGATTTAGTGTAAGAATTCAGATGTTTCTAATGTAATGTAATGGCTTATTTTATTATGTCTGCAGGAGGAGGAGGCTGCTCTTCTTGAATCAGTTTTAGGGAAAGAATAACCCAACACAGGCTGCGCAGCTTTAGTTTGGAAAACAATACACTAGGTAGAGCATTTTGTAATATATATTAAATTGGGTTTAGAAGTGCATGTCCTTCCCTTACTTTGCCTCATTTGAGTTTCTTGCTGTTGGGGGAAACACATCAGTTGTTGAGGACTCTATCTAATCAAAGCTCAGGACTGCCGTTTCAGTTTCATCTTTCTAATACGTAGAAGCTAAAAAAACACACTTCAACTTTTAATTTGGG

mRNA sequence

CCAGACTAGACAATTTTCTAGAAATACAGTCAGCCGAGCCGCTTAACTTCCGAGAAGAGCCAGCATGGCGTTGTGTATCTCTATTCCTTCCTTCTCCGATTCTGTTACCGCTCCTTCCCTATCTCAAACTTATTTCAGTCTCAGTCGCCGGCGGCCTTCCAATTTCATCCATTTTCTTAATTTAAGCCTTCCCAACCGCCCACCATGTCTTTGCCGAGCTTCCAATTCTCAGCCTGGACCGTTCCCCAAACAGTCTGCTTCTGCTTCTTCTAAGAAAAGGAAGAAAAAGGATAAGGGAGATTCCAAGGTTTTTAATCCTAACAACTTCGAAGTTGTGGACGATTTTAGCTTCGATGACGCTGGACCTTCTAGCTCCACTTCCACTTCCACTTATTTGTCGTACCATCCTTCGACCTTGCCCAAACCGCCAGCTGGGTTTGTACTAGACGACCATGGAAAGGTCCTCATGGCTTCAAACAAGCGAATTGCTACCATGGTCGATCCTTTAAATAATTTGCCTCTGGAATGCGTTATAAGAAGAATTTTTAGAAGTTCAAAAGGGGATGATTGCATGCTTCTTTGCCCCGTGGACACGCCTGTTCAAATACTAAAGAGCAAGAATATTGATGGATGGTCAGCTGTGAGCGATGATGAAGTCGAAGCTCTTTTGCCTGCTGCTGCTTATGCTCTTGCCAAAATACACATGCATTTGGTCCACAGCGGATTTTGCTATACGGCGAGGGGAGCATTTTGCTATTCAGAAGATGACATTTTTGATTTCCGCACAGATGATGGACAAGATGTAGATGGCTTGCCCAATGAGGGTGTAGAGATCACTTGCTTTCATATGGATGGAGCACATTACATGATTTATACGCCATCTGATCCACTTCTTTTTGTTGCTATCAAGGACAAGCTTGGCCAACTAACAATTGCTGATGATGAGCTGTTGGAGGACCCAGCAATTATAAGTGCCATCGACGAAGAGACTGAATTTAATGCATTGGTGGAGGAGGAGGCTGCTCTTCTTGAATCAGTTTTAGGGAAAGAATAACCCAACACAGGCTGCGCAGCTTTAGTTTGGAAAACAATACACTAGGTAGAGCATTTTGTAATATATATTAAATTGGGTTTAGAAGTGCATGTCCTTCCCTTACTTTGCCTCATTTGAGTTTCTTGCTGTTGGGGGAAACACATCAGTTGTTGAGGACTCTATCTAATCAAAGCTCAGGACTGCCGTTTCAGTTTCATCTTTCTAATACGTAGAAGCTAAAAAAACACACTTCAACTTTTAATTTGGG

Coding sequence (CDS)

ATGGCGTTGTGTATCTCTATTCCTTCCTTCTCCGATTCTGTTACCGCTCCTTCCCTATCTCAAACTTATTTCAGTCTCAGTCGCCGGCGGCCTTCCAATTTCATCCATTTTCTTAATTTAAGCCTTCCCAACCGCCCACCATGTCTTTGCCGAGCTTCCAATTCTCAGCCTGGACCGTTCCCCAAACAGTCTGCTTCTGCTTCTTCTAAGAAAAGGAAGAAAAAGGATAAGGGAGATTCCAAGGTTTTTAATCCTAACAACTTCGAAGTTGTGGACGATTTTAGCTTCGATGACGCTGGACCTTCTAGCTCCACTTCCACTTCCACTTATTTGTCGTACCATCCTTCGACCTTGCCCAAACCGCCAGCTGGGTTTGTACTAGACGACCATGGAAAGGTCCTCATGGCTTCAAACAAGCGAATTGCTACCATGGTCGATCCTTTAAATAATTTGCCTCTGGAATGCGTTATAAGAAGAATTTTTAGAAGTTCAAAAGGGGATGATTGCATGCTTCTTTGCCCCGTGGACACGCCTGTTCAAATACTAAAGAGCAAGAATATTGATGGATGGTCAGCTGTGAGCGATGATGAAGTCGAAGCTCTTTTGCCTGCTGCTGCTTATGCTCTTGCCAAAATACACATGCATTTGGTCCACAGCGGATTTTGCTATACGGCGAGGGGAGCATTTTGCTATTCAGAAGATGACATTTTTGATTTCCGCACAGATGATGGACAAGATGTAGATGGCTTGCCCAATGAGGGTGTAGAGATCACTTGCTTTCATATGGATGGAGCACATTACATGATTTATACGCCATCTGATCCACTTCTTTTTGTTGCTATCAAGGACAAGCTTGGCCAACTAACAATTGCTGATGATGAGCTGTTGGAGGACCCAGCAATTATAAGTGCCATCGACGAAGAGACTGAATTTAATGCATTGGTGGAGGAGGAGGCTGCTCTTCTTGAATCAGTTTTAGGGAAAGAATAA

Protein sequence

MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPFPKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPKPPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFRTDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPAIISAIDEETEFNALVEEEAALLESVLGKE
Homology
BLAST of IVF0022031 vs. ExPASy TrEMBL
Match: A0A1S4E324 (uncharacterized protein LOC103499586 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499586 PE=4 SV=1)

HSP 1 Score: 656.0 bits (1691), Expect = 8.4e-185
Identity = 329/329 (100.00%), Postives = 329/329 (100.00%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF
Sbjct: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120
           PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK
Sbjct: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120

Query: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180
           PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ
Sbjct: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180

Query: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240
           ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR
Sbjct: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240

Query: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA 300
           TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA
Sbjct: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA 300

Query: 301 IISAIDEETEFNALVEEEAALLESVLGKE 330
           IISAIDEETEFNALVEEEAALLESVLGKE
Sbjct: 301 IISAIDEETEFNALVEEEAALLESVLGKE 329

BLAST of IVF0022031 vs. ExPASy TrEMBL
Match: A0A1S4E3T2 (uncharacterized protein LOC103499586 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499586 PE=4 SV=1)

HSP 1 Score: 594.3 bits (1531), Expect = 3.0e-166
Identity = 293/293 (100.00%), Postives = 293/293 (100.00%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF
Sbjct: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120
           PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK
Sbjct: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120

Query: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180
           PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ
Sbjct: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180

Query: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240
           ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR
Sbjct: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240

Query: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADD 294
           TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADD
Sbjct: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADD 293

BLAST of IVF0022031 vs. ExPASy TrEMBL
Match: A0A6J1EW11 (uncharacterized protein LOC111438664 OS=Cucurbita moschata OX=3662 GN=LOC111438664 PE=4 SV=1)

HSP 1 Score: 578.2 bits (1489), Expect = 2.2e-161
Identity = 295/331 (89.12%), Postives = 308/331 (93.05%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSF DSVTA SLS+TYFSL RRRPSN +HFLNL+LP  P C+CRASNSQPGP 
Sbjct: 1   MALCISIPSFLDSVTASSLSRTYFSL-RRRPSNSVHFLNLNLPTWPLCICRASNSQPGPS 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120
           PKQSAS +SKKRKKK KGD+KVFNP + EV DDFS D+AGPSSSTS S+YLSYHP+TLPK
Sbjct: 61  PKQSASTASKKRKKKGKGDTKVFNPRDIEVGDDFSVDEAGPSSSTSNSSYLSYHPTTLPK 120

Query: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180
           PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGD+CMLLCPVDTPVQ
Sbjct: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDECMLLCPVDTPVQ 180

Query: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240
           ILKSKNIDGWSAVSDDEVE +LPAAAYALAKIHMHLVHSGFCYTARG FCYSE+DIFDF 
Sbjct: 181 ILKSKNIDGWSAVSDDEVETILPAAAYALAKIHMHLVHSGFCYTARGGFCYSEEDIFDFC 240

Query: 241 TDDGQ--DVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLED 300
           TDDGQ  DVDGLPNEGVEITCF MDGAHYMIYTPSDPLLFVA KDKLGQLTIADDELLED
Sbjct: 241 TDDGQDVDVDGLPNEGVEITCFDMDGAHYMIYTPSDPLLFVATKDKLGQLTIADDELLED 300

Query: 301 PAIISAIDEETEFNALVEEEAALLESVLGKE 330
           PA ISAIDEETEFNALVEEEAALLESVLGKE
Sbjct: 301 PATISAIDEETEFNALVEEEAALLESVLGKE 330

BLAST of IVF0022031 vs. ExPASy TrEMBL
Match: A0A6J1I6V6 (uncharacterized protein LOC111471694 OS=Cucurbita maxima OX=3661 GN=LOC111471694 PE=4 SV=1)

HSP 1 Score: 574.7 bits (1480), Expect = 2.5e-160
Identity = 294/331 (88.82%), Postives = 307/331 (92.75%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSF DSVTA SLS+TYFSL RRRPSN +HFLNLSLP  P C+CRASNSQPGP 
Sbjct: 1   MALCISIPSFLDSVTASSLSRTYFSL-RRRPSNSVHFLNLSLPTWPLCICRASNSQPGPS 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120
           PKQSAS +SKKRKKK KGD+KVFN  + EV DDFS D+AGPSSSTS S+YLSYHP+TLPK
Sbjct: 61  PKQSASTASKKRKKKGKGDTKVFNHRDIEVGDDFSVDEAGPSSSTSNSSYLSYHPTTLPK 120

Query: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180
           PPAGFVLDDHGKVLMASNKRIATM+DPLNNLPLECVIRRIFRSSKGD+CMLLCPVDTPVQ
Sbjct: 121 PPAGFVLDDHGKVLMASNKRIATMLDPLNNLPLECVIRRIFRSSKGDECMLLCPVDTPVQ 180

Query: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240
           ILKSKNIDGWSAVSDDEVE +LPAAAYALAKIHMHLVHSGFCYTARG FCYSE+DIFDF 
Sbjct: 181 ILKSKNIDGWSAVSDDEVETILPAAAYALAKIHMHLVHSGFCYTARGGFCYSEEDIFDFC 240

Query: 241 TDDGQ--DVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLED 300
           TDDGQ  DVDGLPNEGVEITCF MDGAHYMIYTPSDPLLFVA KDKLGQLTIADDELLED
Sbjct: 241 TDDGQDVDVDGLPNEGVEITCFDMDGAHYMIYTPSDPLLFVATKDKLGQLTIADDELLED 300

Query: 301 PAIISAIDEETEFNALVEEEAALLESVLGKE 330
           PA ISAIDEETEFNALVEEEAALLESVLGKE
Sbjct: 301 PATISAIDEETEFNALVEEEAALLESVLGKE 330

BLAST of IVF0022031 vs. ExPASy TrEMBL
Match: A0A6J1DXG8 (uncharacterized protein LOC111024390 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111024390 PE=4 SV=1)

HSP 1 Score: 563.5 bits (1451), Expect = 5.7e-157
Identity = 284/328 (86.59%), Postives = 301/328 (91.77%), Query Frame = 0

Query: 3   LCISIPSFSDSVTAPSLSQTYFSLS-RRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPFP 62
           L ISIPS SDS  APSL  T+ SL  R  PSN IHFLN S P   PCLCRA+NSQPGP P
Sbjct: 4   LSISIPSLSDSTAAPSLFHTFSSLKWRPSPSNSIHFLNPSSPTLRPCLCRAANSQPGPLP 63

Query: 63  KQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPKP 122
           KQS+SA+ KKRKKK KGD+KVFNP++ E V+DFS D+AGPSSS+S S+YLSYHP+TLPKP
Sbjct: 64  KQSSSAAPKKRKKKGKGDTKVFNPSDLEAVEDFSVDEAGPSSSSSDSSYLSYHPTTLPKP 123

Query: 123 PAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQI 182
           PAGFVLDDHGKVLMAS KRIATMVDPLNNLPLECVIRR+FRSSKGD+CMLLCP+DTPVQI
Sbjct: 124 PAGFVLDDHGKVLMASTKRIATMVDPLNNLPLECVIRRVFRSSKGDECMLLCPMDTPVQI 183

Query: 183 LKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFRT 242
           LKSKNIDGWSAVSDDEVEA+LPAAAYALAKIHMHLVHSGFCYTARG FCYSEDDIFDFRT
Sbjct: 184 LKSKNIDGWSAVSDDEVEAILPAAAYALAKIHMHLVHSGFCYTARGGFCYSEDDIFDFRT 243

Query: 243 DDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPAI 302
           DDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVA KDKLGQLTIADDELLEDPAI
Sbjct: 244 DDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVATKDKLGQLTIADDELLEDPAI 303

Query: 303 ISAIDEETEFNALVEEEAALLESVLGKE 330
           ISAID+ETEFNALVEEEAALLESVLGKE
Sbjct: 304 ISAIDKETEFNALVEEEAALLESVLGKE 331

BLAST of IVF0022031 vs. NCBI nr
Match: XP_016902634.1 (PREDICTED: uncharacterized protein LOC103499586 isoform X1 [Cucumis melo])

HSP 1 Score: 652 bits (1681), Expect = 7.72e-236
Identity = 329/329 (100.00%), Postives = 329/329 (100.00%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF
Sbjct: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120
           PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK
Sbjct: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120

Query: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180
           PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ
Sbjct: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180

Query: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240
           ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR
Sbjct: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240

Query: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA 300
           TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA
Sbjct: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA 300

Query: 301 IISAIDEETEFNALVEEEAALLESVLGKE 329
           IISAIDEETEFNALVEEEAALLESVLGKE
Sbjct: 301 IISAIDEETEFNALVEEEAALLESVLGKE 329

BLAST of IVF0022031 vs. NCBI nr
Match: XP_011649451.1 (uncharacterized protein LOC101207357 isoform X1 [Cucumis sativus] >KAE8652000.1 hypothetical protein Csa_016960 [Cucumis sativus])

HSP 1 Score: 634 bits (1634), Expect = 1.13e-228
Identity = 321/329 (97.57%), Postives = 323/329 (98.18%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSFSDS TAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPP LCRASNSQPGPF
Sbjct: 1   MALCISIPSFSDSATAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPRLCRASNSQPGPF 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120
           PKQSASASSKKRKKKDKGDSKVFNP + EVVDDFSFDDAGPSSSTS STY SYHPSTLPK
Sbjct: 61  PKQSASASSKKRKKKDKGDSKVFNPTHIEVVDDFSFDDAGPSSSTSNSTYSSYHPSTLPK 120

Query: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180
           PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ
Sbjct: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180

Query: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240
           ILKSKNIDGWSAVSDDEVE+LLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR
Sbjct: 181 ILKSKNIDGWSAVSDDEVESLLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240

Query: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA 300
           TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA
Sbjct: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA 300

Query: 301 IISAIDEETEFNALVEEEAALLESVLGKE 329
           IISAIDEETEFNALVEEEAALLESVLGKE
Sbjct: 301 IISAIDEETEFNALVEEEAALLESVLGKE 329

BLAST of IVF0022031 vs. NCBI nr
Match: XP_038900636.1 (uncharacterized protein LOC120087801 isoform X1 [Benincasa hispida])

HSP 1 Score: 594 bits (1531), Expect = 5.83e-213
Identity = 302/331 (91.24%), Postives = 315/331 (95.17%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSFSDSVTAPSLSQTYFSL+RR PSN IHFLNL LP RPPCLCRASNSQPGP 
Sbjct: 1   MALCISIPSFSDSVTAPSLSQTYFSLTRR-PSNSIHFLNLRLPTRPPCLCRASNSQPGPL 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSST--STSTYLSYHPSTL 120
           PK S S SSKKRKKK KGD+KVFNP + EVVDDF+ D+AGPSSS+  S+S+YLSYHPSTL
Sbjct: 61  PKHSVSTSSKKRKKKGKGDTKVFNPTDVEVVDDFTVDEAGPSSSSASSSSSYLSYHPSTL 120

Query: 121 PKPPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTP 180
           PKPPAGFVLDDHGKVLMASNKRIA MVDPLNNLPLECVIRRIFRSSKGD+CMLLCPVDTP
Sbjct: 121 PKPPAGFVLDDHGKVLMASNKRIAIMVDPLNNLPLECVIRRIFRSSKGDECMLLCPVDTP 180

Query: 181 VQILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFD 240
           VQILKSKN+DGWSAVSDDEVEA+LPAA+YALAKIHMHLVHSGFCYTARG FCYSEDDIFD
Sbjct: 181 VQILKSKNVDGWSAVSDDEVEAILPAASYALAKIHMHLVHSGFCYTARGGFCYSEDDIFD 240

Query: 241 FRTDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLED 300
           FRTDDGQDVDGLPNEGVEITCFHM+GAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLED
Sbjct: 241 FRTDDGQDVDGLPNEGVEITCFHMNGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLED 300

Query: 301 PAIISAIDEETEFNALVEEEAALLESVLGKE 329
           PAIISAIDEETEFNALVEEEAALLESVLGKE
Sbjct: 301 PAIISAIDEETEFNALVEEEAALLESVLGKE 330

BLAST of IVF0022031 vs. NCBI nr
Match: XP_016902635.1 (PREDICTED: uncharacterized protein LOC103499586 isoform X2 [Cucumis melo])

HSP 1 Score: 590 bits (1520), Expect = 7.28e-212
Identity = 293/293 (100.00%), Postives = 293/293 (100.00%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF
Sbjct: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120
           PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK
Sbjct: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120

Query: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180
           PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ
Sbjct: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180

Query: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240
           ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR
Sbjct: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240

Query: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADD 293
           TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADD
Sbjct: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADD 293

BLAST of IVF0022031 vs. NCBI nr
Match: XP_011649452.1 (uncharacterized protein LOC101207357 isoform X2 [Cucumis sativus])

HSP 1 Score: 580 bits (1495), Expect = 8.20e-208
Identity = 301/329 (91.49%), Postives = 303/329 (92.10%), Query Frame = 0

Query: 1   MALCISIPSFSDSVTAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPCLCRASNSQPGPF 60
           MALCISIPSFSDS TAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPP LCRASNSQPGPF
Sbjct: 1   MALCISIPSFSDSATAPSLSQTYFSLSRRRPSNFIHFLNLSLPNRPPRLCRASNSQPGPF 60

Query: 61  PKQSASASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHPSTLPK 120
           PKQSASASSKKRKKKDKGDSKVFNP + EVVDDFSFDDAGPSSSTS STY SYHPSTLPK
Sbjct: 61  PKQSASASSKKRKKKDKGDSKVFNPTHIEVVDDFSFDDAGPSSSTSNSTYSSYHPSTLPK 120

Query: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180
           PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ
Sbjct: 121 PPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPVDTPVQ 180

Query: 181 ILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240
           ILKSKNIDGWSAVSDDEVE+LLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR
Sbjct: 181 ILKSKNIDGWSAVSDDEVESLLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDDIFDFR 240

Query: 241 TDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDELLEDPA 300
           TDDGQDVDGLPNEGVEITCFHMD                    KLGQLTIADDELLEDPA
Sbjct: 241 TDDGQDVDGLPNEGVEITCFHMD--------------------KLGQLTIADDELLEDPA 300

Query: 301 IISAIDEETEFNALVEEEAALLESVLGKE 329
           IISAIDEETEFNALVEEEAALLESVLGKE
Sbjct: 301 IISAIDEETEFNALVEEEAALLESVLGKE 309

BLAST of IVF0022031 vs. TAIR 10
Match: AT4G33480.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF3727 (InterPro:IPR022203); Has 348 Blast hits to 348 proteins in 78 species: Archae - 0; Bacteria - 101; Metazoa - 0; Fungi - 0; Plants - 183; Viruses - 0; Other Eukaryotes - 64 (source: NCBI BLink). )

HSP 1 Score: 390.2 bits (1001), Expect = 1.7e-108
Identity = 211/333 (63.36%), Postives = 254/333 (76.28%), Query Frame = 0

Query: 9   SFSDSVTAPSLSQTYFSLSRRR----PSNFIHFLNLSLPNRPP----CLCRASNSQPGPF 68
           +FS S+T+PS + T    S  +     S+F   L++S P         +  A N Q G  
Sbjct: 2   AFSYSLTSPSPNSTLSYSSTHQFLNPSSSFPVSLSVSSPKNTRHLRLLITSALNPQTGQP 61

Query: 69  PKQSA-----SASSKKRKKKDKGDSKVFNPNNFEVVDDFSFDDAGPSSSTSTSTYLSYHP 128
            K+++     S+++KKRKKK K    V +    +  D F  DD    SS+S+S      P
Sbjct: 62  TKKASTGSDKSSTNKKRKKKGKIAKPVEDWELRDSEDAFEEDDDADYSSSSSSLATFNSP 121

Query: 129 STLPKPPAGFVLDDHGKVLMASNKRIATMVDPLNNLPLECVIRRIFRSSKGDDCMLLCPV 188
            T+PKPPAGFV+++ G+VLMAS KRI T++DP NN PL+CVIRR+F SSKG+DCMLLCPV
Sbjct: 122 PTIPKPPAGFVINETGRVLMASKKRITTVIDPTNNSPLDCVIRRVFTSSKGEDCMLLCPV 181

Query: 189 DTPVQILKSKNIDGWSAVSDDEVEALLPAAAYALAKIHMHLVHSGFCYTARGAFCYSEDD 248
           DTPVQILKS NIDGWSAVSD+EVE+LLPAAAYALAKIHMHLVHSGFCYTARG FCY+ED+
Sbjct: 182 DTPVQILKSTNIDGWSAVSDEEVESLLPAAAYALAKIHMHLVHSGFCYTARGGFCYTEDN 241

Query: 249 IFDFRTDDGQDVDGLPNEGVEITCFHMDGAHYMIYTPSDPLLFVAIKDKLGQLTIADDEL 308
           +FDFRTDDGQDV+GLP EGVEITCFH+DG+HYM+YTPSDPLLFVA KD+ G L IADDEL
Sbjct: 242 VFDFRTDDGQDVEGLPTEGVEITCFHLDGSHYMVYTPSDPLLFVAAKDQNGLLQIADDEL 301

Query: 309 LEDPAIISAIDEETEFNALVEEEAALLESVLGK 329
           L+DPA+ISAIDEETEFNALVEEEAALLES+LG+
Sbjct: 302 LDDPAVISAIDEETEFNALVEEEAALLESLLGE 334

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A1S4E3248.4e-185100.00uncharacterized protein LOC103499586 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S4E3T23.0e-166100.00uncharacterized protein LOC103499586 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1EW112.2e-16189.12uncharacterized protein LOC111438664 OS=Cucurbita moschata OX=3662 GN=LOC1114386... [more]
A0A6J1I6V62.5e-16088.82uncharacterized protein LOC111471694 OS=Cucurbita maxima OX=3661 GN=LOC111471694... [more]
A0A6J1DXG85.7e-15786.59uncharacterized protein LOC111024390 isoform X1 OS=Momordica charantia OX=3673 G... [more]
Match NameE-valueIdentityDescription
XP_016902634.17.72e-236100.00PREDICTED: uncharacterized protein LOC103499586 isoform X1 [Cucumis melo][more]
XP_011649451.11.13e-22897.57uncharacterized protein LOC101207357 isoform X1 [Cucumis sativus] >KAE8652000.1 ... [more]
XP_038900636.15.83e-21391.24uncharacterized protein LOC120087801 isoform X1 [Benincasa hispida][more]
XP_016902635.17.28e-212100.00PREDICTED: uncharacterized protein LOC103499586 isoform X2 [Cucumis melo][more]
XP_011649452.18.20e-20891.49uncharacterized protein LOC101207357 isoform X2 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
AT4G33480.11.7e-10863.36unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR022203Protein of unknown function DUF3727PFAMPF12527DUF3727coord: 213..307
e-value: 1.5E-19
score: 70.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 51..66
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 51..84
NoneNo IPR availablePANTHERPTHR36061:SF3OS04G0692200 PROTEINcoord: 47..328
NoneNo IPR availablePANTHERPTHR36061FAMILY NOT NAMEDcoord: 47..328

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
IVF0022031.2IVF0022031.2mRNA