CaUC07G129440 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC07G129440
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase
Location: Ciama_Chr07: 6006649 .. 6030248 (+)
RNA-Seq Expression: CaUC07G129440
Synteny: CaUC07G129440
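
The Location field above gives a sequence name, a start and end coordinate, and a strand. A minimal Python sketch for turning that string into usable numbers follows. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re

# Assumed layout: "<sequence>: <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into sequence name, coordinates, strand and span."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive span; this is the assumption named in the lead-in.
        "length": end - start + 1,
    }

loc = parse_location("Ciama_Chr07: 6006649 .. 6030248 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 23600 bp on the plus strand of Ciama_Chr07.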
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATAGTATGTATAAACTAACCCAAGTCTAATTATGTGGATTTTCAAAGGGGAAAGGAAGAAATCAATATAACTCTAAACCATGGTGGCAAAGACAAATATGCCCTGGCTCGTGGTCTTTGTTGGAATTGGTTGACTTGGTTAAGTCCTTAGGGTATAGTGAATGGGAAAATTTGTGATATGAAGGTGATGATGGCACTGGGATAAAGCATCAATGGAATTGGCTTATTTAGGTGTATTAAATAGAAATGCATCTATAAGTGTATATGTAGAGCATGTAGGGAATGAAGATGCACCTACATGACGATGAAGTTAGACTTGATCGAGTATCTGATATAGAGGATACTGGGAATTAATTAATTAATCATAGTGAGGAGGAGGACTTGAATTACAAGTTGGAACATGTGAAAGGAAAAGAAAAAGAAGTTATTGAGAAAATTTGTGAGAAAAATGATAGTGAAAGTTATTTCTTAACGGATTTTGAAAGTTCAAAGGTATTTTTGACACAAAATCCAAAGTTTAAAGATATTTTTTAGAATTTAGCCATGATTATTTCTTAAGTGAATTACTTTTTTTTAAGATTAAAAATAAAGCTTAATAATTCACTTGGGTCATTAATTTAATAGTTAGCTATTAATTAAAAGATTAATGAATTATAAATAATAAAGCTATTGTACTTTTCCTAAAATTTGACTATAAGGGAGCTATCTAAGTATTATTTCGACTAGATGGAGGTATTTAAGTTGATAGTAAGAGAACACTTCTACTTGGGAATTAGTCTAGGAGTTGAATTAGTGAAATGTTTATCATGCTAATATGAAATTTCTTTTATTATTGAATATTTAAATTATTCTCAAGTGGATAAATTTGTGAATTTTGGAAATCTATACTGGTTATATATATTTGATTAAATTCTCTAATGACAAAGTTAATTACTCTTCCTAAAGCTAATTTTGATTTAATGCACCTGAAGTTCCATAAATCAATTAGTTCTCATTGTTTTTGCTTTGCAACAGTCGTTGTACATATTTTAAATTAATAAATGCTTATAAACTTATAAGCATATATTATTTTCATATATCAAACATGCTCTCCTCTTTCATATAAAGTAAGAGATGATTATTCTTTTTGACCCACTTGTATATGAGATATTTTGACTTCTTTTGAATCTCTGAAGAGAATAGGAATAATTAATATTTATATTTTGCATATAGTGCATCTATGTATATTATATTTAAAGGTAGACGGTACATTTCCAGAATTCAAAATGCTTGAAGCGAAAAATCAAATATGATATTAAGTCTTGTAGACTTGTAGACTTGATTAAAGATTTTGGAAAAGAAAATATATATTATTTTGCTAGAGGAGCAAGATTTACAATTGCTTTACTTTATTATCTTGTAAATCAAAGAGAAGTTTTTTAAATTTGGAAGTTATACATCGTAATAGATATCATATAAAGAGATTGATAATAAGAATAATATATGGTATTTATATACTATTGTGTTCTACAGAAAATGCATATAGAAAAGCTGTCTACTTTTACTTCTAAATTATATTACACACATGTTTATGTAATTGAAATATATGTAACAATGAACTAGAAGTTCATTAATCCAAATATATTTTTATTTGACCCGATGAGTCATTTTGTGCTCATAGTGAAAATAATTATTGAAAATATACATTGACATCCACTGAAGAACTAGAAAATTCTTTCATCTAATAAATTGTCACGTGTGTCTTGTTCTTAAGAGATTAATTATCATATTCTCACTAGCTAAAATTGAAACTGAACACCTCCTAGTTTTGGAACAGATTCATGGTTGTATATGATCAAGATTTATTAGTCTACCAAGTGTATTTAAATATTTTATGGGTTTTATTGATGTATATAGTTGATGGTCACATATATGTTTATTATCATTTAAAAAGCTTAAATATGAGAGGTTAGTTGCTCAAATATTTAAGATAAAGAGTACAATTTCATGATTTTGAAT
TAAAAGTTTTTTTTTATTTTTTTATTTTTTATAATGCTACTAAGTTTGCATCTCAAACATTTGATAATTAGTATTGTATAGTAATTGGGATATAAGTGTTGAACATCCTGTAGCTTATATTCAAATAGAAAAGCTTCAAAATTTGTATGAGATCATGTTATTCTGTATACTTTAAGCCAATGACTTATGAGTTTATAGCCAGAAAGCAAATATGTTTCATATGCAAGATGTGCAATAAATGTATTTACTGCACAATTTGCTAGTTGTTATTGAGACAATTTTTTCAACGTTAGAGGAGGAATTAAGAAGAATTTGGAAAAAGAAATTGAATAAAATGTATAAGTAGTGTCTCATTTTGATCACTATACAAATCACTTTGAACTAAAAGTTCAAAAGATAATTCAGTTGTAAAATATTGGAAATCAAATGCTAGATATATTTATAGAGCATTCACAGTTGCAAGAAAGTGACTAAGTTATATATATTAATTGCAAATACACCATCTAAAATTGATATCCCAATGCAACAAGTTGTTTGCACCATGCTTGAAACATAATAGACAAATGAGTTCCAAAGATAAAGATCTATAGAAAAGTAGTAGTTAATAATCAAAATGACTTAGTTAAGAATATGGATATTCTAGAAAAAATCATAAATACGAGTACTCATAAGATTATCAAAGAAAGTGAAAATAATAATGAGATCTCGATAAATTATATGTCACGACAGGAAAAGATGGAACCATATTTATAAAAGTTGACAACATTTTTACATACAATATTGCTCAAGATATTATTTATAAAAATGATTACGATGTACTTAGATGTATTGAAAAATATAACATAGAAAAGATTTACTTATGTGGAAAGAAACAATCCAATAGAAATAAACTCATTTTTCAAATTGTGTCAAACATGTGAGATACAAGTGAGTATTTGTAAGAAAAACAAATAGAGTGAAGTCACAAAACAGAAAGCAAGACATATTCCACAAAAATTCTCATAATGGATTGGTATCTTAACATACACCTTAAGGATGTAGTCATGCCACATTTATATGAATATTTTAATAATGATATTTAATTTATAAAAATTCCTTTATTAAAAGTAAAATAATCATATAATTCAAATATTAGAGAATTATATTTATCACAAATATTCTTATAAAAACATTCCCAACGAATGTGATATAATCATCTTATTAAATATTTTTTGAAATAAAGGTATAAAATAATCATATTTGTCCATATGATATTATAAAGAAATCATAGTAAAGATTTACTATATATGTAAATGTTGATGATATAAATATATAATTGAAACTTCTAAGGAAACTTCAAAAGCAATAAAAATTTTTAATAAAAAAAATTGAGATGAAAGTTCTGAAAAACAAAGTTTTGTCTTAGCTCGCAAATAAAGTTTAAAGAAGACAAGATTGCATCAATTTATACACAAAAGATTTTATATGGACTAGTTACATTATTGAACACTCCTATGGTGGTTCATTCTCTAGATGTGAAAAAGATATATCATCCAATCTAAGGATATGATAATGAATAACTTTAAGTCATGAAGAACGATAGTTTTTTAGAAATAAGTGCATTTATATCTTTACTAGTTATATACTACCTATTGTAGTATTTTTAGTAATTTTGTTATACATATATAATACTCCTCCAAAAGAAAGATATTGAAGTAGAGTTGAACATATATTATGTTTTATTTGAGAATTAATTTACATTTATTTATTTTCTTCCTAATGAATCTAATTATGATCTAGTTGGTTGTGAATATACAAATTATTTATCTGAAACATAAAAGCTAAATCTTAACAAGGTCACTTAGTACATGGAGAATTTTTTTTTTTTTTTATTATAACTAGGTCACTTAGTACATGGAGAATAATTTTTATATTATGACAATCGATGAAGCAAATTATAACAATTACTGAAATCTTGACGATTTATGAAGCTAGCCAAGAATGTTATTAAATGAAAATAA
CACCACATATGTATTTCTACATTTATGACCTTGAAGAAAATTGTGACATCATTATGCAACAAATTTCTTTAAAAGATGACTTGCCAGACTTATTATGAAGGTGTTATTCACAACCAACAATTTGAGAAGTTAGTACACAATATTTGAATGCGACGATTGTGAGATGTCAAACGATGTTTCCATGAGGAGGAGTAAACTTACTACACTCTTTTTCCCTTGACTATGGTTTTTCCCACTAGGTTTTCCTAACAAAGTTTCTAATGAGGTAGTTATTAAATTATATAAAATAATGTACTTTTTTCCCTTCACTAAGATTATTTCCCATTGAGTTTTTCCTAGTAAGGTTTTAACGAGACATTTATTTTATATAATGTAGATATCCATAACTCACATAGGTTACAAAAATTTATAACCTCTCATTGAATTCCATCTTGGAACCTACGCCCTTCCATGTCCTATAAGTAGTGTTGTATGGTGCCTTTGTAATACACACTACAATTGAGTTCAATTGTTTTATTTACTTTCTTCTCTATTTTCCTCTTTACTTTGCTTCTTTTCTTCTTGTTTTGTTCTTACATTGTACTTATTTCACAATATCAATAAAAACTCTAATGACAAGAAATTGAAACTTACTTAATAACAAAACGAACAAAAAAAAAAAAAAACTCACTTAATAGCATAACGAACCACATCCAAACACATAGTAAATTAATTTGCACACTCCATTCAATGATGAATTGCCCAATAAATAGGCAGTCAACAAAATACATATGACATTAGTTAACTAATCACATTCATGTTGGAATTTGTACCATAAATCCTCGTAAGTAATTAGTTGATTAATTTAATATGCAATATTAAATTATCCATTACCAGTAAAATTCTAAAGTTATTATATGTAGTTTTGAACAATATGTAGTTGACATATATGGATTATGTCCAAGAAATAATTAAATATCTATAATATATGGATAAAGTTAGCTGCCTTATCTTGGTGAATATGACTTCACTTTATCATAATTACAAATGGTTTGATCCAAACCGTTCAGGTAGGGACATGTGAGTGAAGATATCCTATATAAAGAGCTTATTATATGAACAACAATTAACCCACTACATTTCTTGCATGAATATGGTTAAATAAAATGGGAAACGAAAGAGAAATCAAAACTTTTGATTCCTAATTTTTCCCCATGAATTTCATCATCTTCTTCTCCTTTTTCGTCTTCTTCTAGTTGTTTCTTTGCATTATTTTAGCAGCCAATGGAACTTTACCACATTTTTTCAACAATTCTACTTTCAATTTCTCCAGATTTGGTGAATCATCTTTCACTGATTTTTGCCAAAAAAAAATGTTTCAGGAGAAAAAGTCTCCATTTTATAATTTTTTTTTTTGTTATTTGTTACGTATTTTTCAACTACATACATACTTTTCCAAATGGTCTTGACACTCATTTGATATGTGAATTAAAATATATTTTACAATTTTTTGCATATAAATATTGCACTTAGATAAAATACTATTCTTTATAACATTTGGAGGTGGTTGTCGAAGTTGATTGTCAAAGATGAATTTCATCGGAGTTTGTTGTCGAAGGTGGTTGTCAGAGCTCGAAGTTGGTCGCATGAAGGTGGTATTGGAATTGGTCACTCAAAGGTGGTAATTGAAGTTGATCTTAGAAGATGATTTTTGCTGGAGTTTGTAGTCATAGGTGGTTTTTGGAGCCTGAAGTTGGTTGCATGAAGGTGGTATTGGACTTGGTCGTTGAAGTTTGTCATTGGAGTTGATCGGGGTCGTCAAAGTTGATCATCAAAGGTGATTTTTGTCGGAGTTTGTAGTCGGAGGTGCTTGTTGAGAGCCAGAGCTAGTCACATGAAGGTGGTTGTCAGAGTTGGTCATTGGAGGTGGTTGTTGGAGCCCGAAATTGGTCGTCAAAGTTGGTCGTGCGAAGGTGGTTACAAAGCTCATCATCGAAGGTGATTTTTTGTTGGAGTGTTTA
GTCGGAGGTGGTTGTCGAAGCCTAGAGTTAGTGGGTGTTTGGGATTGCTTTTGAAGAGATAGAAACAATTATAGTCAAATCTTAACCATTTTTTGCATTTGGTAAAACAAATTTAAATGAAATATGTTAAAATCACTCTTTAAAAACACATTAATTTCACGCCTACTTTAGTATTCTATTAGCTTTTAGGTTCTCCTAAAATCGAGTAGAGTTTTTAATTAATAATTAGTTTTTACTTTTATACTTTAAAAATAAGATTTATTCACCTAATCAAATCTATTTTCTATTTTCATGAGTTGAATGCATCTGTTTTTAAAAAAAGTTTGACAAATTAATTTTTTTTACAAAAATAAAACTATGTCCATTAATATTTAATAATTAATTTATAACAAACAAAATTTATCATCCTAAACAATTATTTAAAATTTTAAATATAATAATAATAATAATAATAATAATAAAAAAGTTGATTATAAGTCTATTTTAGTTATTATATCTAAAATAAGATGTTTTTGTATTTATTTATCAAACAATATAACCTATTTTTTCAAGTTTAAGCTAAAATTTACCAAACACTAATTTACTTCTTTTTCACAACTAATTTTAAAAGCGCAACTACCATCAATTATTTTAAAAACTACAACAATCCCAAATGGAGCCTTAATCGCACGAAAGTAGTTGTCAAAATTGGTCATCGGAGTTTGTTGTTAGAGGTGGTTGCTAGAGTAGTCATTGGAAGTAGTTTGCTTGAGCTCGGAGTGTGACAACGATAGTTAGTGATAGTGGTTAATGGGTAGAGCCATTGAAAACAAAGAAAGGTGAGCTGGGAATATGCCAACTCAACTCAACTCCACCAATATTTAAAGTTGGTGGGTCAAACATCCCGTAAATCAAATAAAATAACAAACTAGTTTTTTATTTTTTGGAAAGATTTTTGTAAAAACATATTGAAATATATTATTTTCTAAAAGTTATTATTGAAAGATATATTTAGCATGTTCTTAAAGAAAAACCTAATGTATATTTTTCTAAAAAAATAAACCTATAATTGACTCCAAATCACATGATAACAATTTAGTTTAACTTTTTAAAATTAACCTATAAACACTTCTCCCATTTATTGGCTTCAATATTTTATTGTCTGCACTTTACTTTTAAGAGTACTATCAAATTCAAACCAAAATTTGTAAAACAAATAATTTTAAAAATTTGTTTTGTAAATGAAATTTGGTTAAGAATTCAAATACTGCATAGAAAACATAAAAACCATGATTACAAATTTGTGAGAAAACAAGAATACTTTTTTTTAAAAAAAAAAAATTCAAAATTAAAAACCAAAAATCATCAAACGGAACCTTAGAACTGTATTGACACCAATTTTAAAACTGCTCAAAAGTTTATCCCAAGTGTTTCATACAATGTGAACTTGCAAATGAGCATGTTAGTGTTCCTGCCATTATTTGCCTACTTTATTGGCATCGTTTGATAGTACAAAAATACATTAACCTTTTCTCCTAAATTGAAGAAATTCAATACATATTTTTCTGAACCAGAAACGACAACCTTTTTGATAGATAAAGAGAAATACAACATTCATGGATACAAACCCCAATCAGAAGTGAAGAATAGAACATACAGCCAAACACAGAGCCAAATAAAACCAAAAAGGAGAAAAACAAGCTGTCCAAGTACAAAAAGAACCCAACCAAACAAAGACACAAGAAAGAAAGAAGGTAAAAAACAAACCAGCCAAAGACCAAAAAGGAAAGAAGCTACAAAGCAACAAGATCCTTCAGAGAACACCCACCAAATTGAAGAGCCATCAACTAGGAGGGCAATGATACATATATACGCATAACTGAAGAGTTATGAAATCTGGTAATTGTTTAGTCATACAACCAACGGTGTAAGGAATTAATAAGACACAAAATTAATGACTTTTAGTTGTTAGCCCACAAACTTCCTGACCATCAGGCACTCAAATAGAGAAACGGAGCATC
CTCCAACCTCTGGACACTCCCATCCACTCCACAAACTTGGATTATAAATTTGAAACCTGAGGTGCTGGAAGGAACTGGAAGTGTAGAGACATAAAAAGCTTTTACATGTGCCATCCCAAAATACTTTGGAACATTTTGTAATTTGCGAAGGAGATTTTTGCCTTCCTCAGCTACTTCCATGACATACACATTGTAGCGTTCAAATACCTTATCACTACTGCTATCTTTCAACTTCCAAATGATGTGAATGTCAAGGGTCTTTGTTCCACCTGGACTTGTAGTCCTTCTAACATATGGACTTTCAACTAGCCATGAACTAGAAGGAGGAAGATCTGGTTCCTCAACCGATCGAACAATGATATTACCGAACACTGCAAAGTATTCGGACAATGAACTGCAATCTAGTGTATTACTTTTCCCAACCACTCCAGATTTGTGTATCGGTGTACCAGTTTCAGGGCTTGATCTGTAGCAGACAACATTTATATTTGTCAGTTTGTATCCATTCATTTGAATTCTGCCCACATGAACAAACCAATCAGTATGAAGTCCAGGCACTTTAAGCGGAGTTGTCTCAATCACTTCGGAGTAATCACTCGAGAATTGCTTTTCAGTGCTAGAGGCAAGTAAAACTTTTCTCTTATTTGTGCTAGAAAAGAGTTCCAATGAAAGGCCTAGTTGAGAATCCCCGTTCGACATTGACTGCAAAAATTAAGTCGTGGACCGAGGTTTAATTAAATGTATAAAGAATCCTGTTATTTCTAGACCTTCAATACATACAGAGTTCACTTACAGAATACATGACTACTAAAGGAACATCCCCTAAAACAAGCTCTCCTTGAAAGAGTCTTATTCTAATATAGCGATTTTGCTCGAGAGTCCCCTTGAATGCAATGCTTCCTCCTCCATTGTAGACTATCTCAAAACTAGCAAATCCATGAATTCAAGAGATTATAATTAAACAAAAGAAATGATTACAAACAAATGAAGGGGTTGAAGAAAAGAATCTTGGGGGCCAACAGATGTGGTAATGTACTATCAATGCTCCACATTGAAAAGAGAATAGATATGAGCATTTCTACTTACTCAGAGTAAGCTTGAACAGAACGTGAAGTTGATGCATCGGTTACCTCAAGTATGGGCTGCAAGAAGTTACTAATATTGAGAAAGGAGAATTTCAAAAATGGTAACTAACAGGTCCCCTTCAATGCCTTCAGGGGCAGAAACTTCGATACAAGCCACCAATAAAATTATTTAATGTATGTATGTAGAGTAGGATTACAGAAAAGAAAACCTTTCAATATCTTAATTCCTAGAGATATATTGTTCACCTGGAAGCTTTGACTGGACAAATTGTTCCAAGAAGCATTTGATATTTTTACTCCATTTATGGAAACATGATAACCATGACCCTGGAAACGTGGAAAGAGATAGTCTTTGACTGAACAATAGTATTTATGAGAGAACAATTTTCATGCAAGAGACCATTTGTAGCTGTCAGATTAATGACATTTGAACAAGATGCCAATGAAGTTGATCCAAGCAATAAGACACAATTTCTTTTGCCAGGTATTATACACATTCAAAAGTCAAAACAATATAAAATCTACTTGATATGAAATTAGAAAGTAGGTGGATTACTTGGATGTCAACAAGGCAACTATGTGATAAGACAATACAATCAAAATTCCCGATGCTCATATGAGATTTAATAAACAAAACATATATCAGTGGTGTTTTGGAAGATACCAAAGAAATTATCATGGAGAATTTGAAATATATTTATTAGGATTTGTTCCTTTCAGTTAAAGTTTGACAATGAAAGCATGATAACGGATTCATAACCACTACCAAGACATCCAACCTTTAGGCCAATGCAATTCCACTTCCACTCACTCTATACTTGGACAAATTTAAGAACCATTAGAGTGCTCAAGCACTGGTTAAGGAACACGTGATTGGCAGAATTCTAACCTGATCAAAATTTGAGTAGAATGGTA
CTAGTTTTGGGTAACTTCGCACTATTTCCCATGATTGCTTCACAAGACCCCACCATCTGTTAGTCCAAAGTAGATAAAATTGAACCTCATTAAATATTTAGGGAGATGATCAATGGTGACTCTCCTAACAGTTCATCGTACAAAAGACATTCTCTTTAGAAAAAAAAGGACAAGGAACTTTTTAAAAGAAAATGAAAAAAAGACTAATGCAGAAAATAGAGGAACAAGACAATAAGACAAAAGTTCAAGAGAGTTCTAAAAAGGCCAAAAACATAACAAAACAAACCAAAAGGCCTTGATAAGAACAAAAAGATAACAATTAAAAAATGAGGAGTATCCCAGCTTTGAGAAATATCTTAAATTGAAGAATCGCTGAAGAACTTTGATGATCATATAATTAGCACATATCAACATTCAGAATAACTGCAAGGCTGCGAGACTTCGTCTTGGAGACAATAAATTTCAAGCATCCAAAAAATGATGGGAAAACTATAATTAATAGCACCAATTAAATGTTAAAAAACAGACTAAATATCAAAAAGTCAAGTATAATTTAATGAAACATTAACCAGCAAAGCATATTTTCCAGTATAAAACACATAGGAACTATATCCTTCAAAGTAACCTCAGAAAATAGTTGAGAATGTCCACTGATATTCAGCTCTTTTTTTAGTTCAACAATGTGGAGTGCGGGATTCGAACAGACATCTCTTATACAATTGGAATATGCTCAAGTTCACTATGATATTCAGAAGTTGTTCTTATAAAACATGGCGAAATCAATTAATGTAGGAAATTTCAAAAGGGTGGATAATGGTAGAATACTACGGCATCGACATGATACAATGAAAAATTAATTTCATATCTAACTTACTTATTTTGAGCAGTCTGAAAATCTGTTTCCTGCTCGTGCTCATACACCCATCCAGGAGCAAATATGGCAGCCGATACATCATCCCTTTTCAGAACATCCAGTGCAACATCCGTCTGATGTGTAAGCATATTTTTATTCAAAAATTAATTCTACCAACTCTCAGCATGGGATAATAATTGAAATCAATATATACACACACACATACATACATACAAAACTTTTCATTGATATAATGAAAAGAGCCTAATGCTTGAATTATAATGAGACATAATAGTCCATACAATATATATATATATATATATATATATATATATATATATATATTGTATGGACTATTATGTCTCATTATAATTCAAGCATTAGGCTCTTTTTATTATATCAATGAAAAGTTTTGTATGTATGTATGTGTGTGTGTATATATTGATTTCAATTATTATCCCATGCTGAGAGTTGGTAGAATTAATTTTTGAATAAAAATATGCTTACACATCAGACGGATGTTGCACTGGATGTTCTGAAAAGGGATGATGTATCGGCTGCCATATTTGCTCCTGGATGGGTGTATGAGCACGAGCAGGAAACAGATTTTCAGACTGCTCAAAATAAGTAAGTTAGATATGAAATTAATTTTTCATTGTATCATGTCGATGCCGTAGTATTCTACCATTATCCACCCTTTTGAAATTTCCTACATTAATTGATTTCGCCATGTTTTATAAGAACAACTTCTGAATATCATAGTGAACTTGAGCATATTCCAATTGTATAAGAGATGTCTGTTCGAATCCCGCACTCCACATTGTTGAACTAAAAAAAGAGCTGAATATCAGTGGACATTCTCAACTATTTTCTGAGGTTACTTTGAAGGATATAGTTCCTATGTGTTTTATACTGGAAAATATGCTTTGCTGGTTAATGTTTCATTAAATTATACTTGACTTTTTGATATTTAGTCTGTTTTTTAACATTTAATTGGTGCTATTAATTATAGTTTTCCCATCATTTTTTGGATGCTTGAAATTTATTGTCTCCAAGACGAAGTCTCGCAGCCTTGCAGTTATTCTGAATGTTGATATGTGCTAATTATATGATCATCAAAGTTCTTCAGCGATTCTTCAATTTAAGA
TATTTCTCAAAGCTGGGATACTCCTCATTTTTTAATTGTTATCTTTTTGTTCTTATCAAGGCCTTTTGGTTTGTTTTGTTATGTTTTTGGCCTTTTTAGAACTCTCTTGAACTTTTGTCTTATTGTCTTGTTCCTCTATTTTCTGCATTAGTCTTTTTTTCATTTTCTTTTAAAAAGTTCCTTGTCCTTTTTTTTCTAAAGAGAATGTCTTTTGTACGATGAACTGTTAGGAGAGTCACCATTGATCATCTCCCTAAATATTTAATGAGGTTCAATTTTATCTACTTTGGACTAACAGATGGTGGGGTCTTGTGAAGCAATCATGGGAAATAGTGCGAAGTTACCCAAAACTAGTACCATTCTACTCAAATTTTGATCAGGTTAGAATTCTGCCAATCACGTGTTCCTTAACCAGTGCTTGAGCACTCTAATGGTTCTTAAATTTGTCCAAGTATAGAGTGAGTGGAAGTGGAATTGCATTGGCCTAAAGGTTGGATGTCTTGGTAGTGGTTATGAATCCGTTATCATGCTTTCATTGTCAAACTTTAACTGAAAGGAACAAATCCTAATAAATATATTTCAAATTCTCCATGATAATTTCTTTGGTATCTTCCAAAACACCACTGATATATGTTTTGTTTATTAAATCTCATATGAGCATCGGGAATTTTGATTGTATTGTCTTATCACATAGTTGCCTTGTTGACATCCAAGTAATCCACCTACTTTCTAATTTCATATCAAGTAGATTTTATATTGTTTTGACTTTTGAATNATATATATATATATATATATATATATATATATATATATATTGTATGGACTATTATGTCTCATTATAATTCAAGCATTAGGCTCTTTTTATTATATCAATGAAAACTTTTCATTAATATATATATATATATATATATCTTACCTCGATAACATAATTATTTAACCAGTTACGATATCATATATTCAACTATAAAGGTGTCCAAATTATGCTAACTATGATGCTTGTTGCCTAGGGAAAACACATGTCAATACTCAATACACCTCATAGGGTTAGCAGGCTTACATTCCATCCTCCACCACCAAAGGTACCCCTCCCAAAAACATCAATGCCCATGTACACATCATGCTTTCTGTCTCCAGCAACAGCAGAAGATTTTTTAGGAGTATCCTCCTACGATGACAGATGAAAGTTTTTGTCTAATCACTGATGTGATCTTTTCAAAAGGATATAAAAATGAGATGATAACTAACATTTCTCATTGGTTATGAAAGAAATTTATTTCTTACATTCCAACCGTAGTTTACAAATATTCCATCCGAGATATCAAAGAAAAGTTTATTTTTCACATTCAATTCATTTTGCCAGTAAAGGTAACCATCCACGGTTACACTGTCATACCTGCCCATAAGTAAACAGTTTTTTTTTTTTTTTGGATAAAAGAGTCCATAAGTAAACATAATAGACTCCGTAAGCTCAAAACACAAATAATTGAGAATGTAACATCTCTTAGGACTTACCATATAACTAAGGACCCAGGTAGCTTGCAATGCATAGACTGAGTTAAATGGCTCACAAATTCTTTCAAATGAGTAACTTGCTGAGAACTCATAGAGATTTCCATATTGATCTGCAAGAACAAACTTTGATTGAGAAGAAACAACTATAAAAATTCAAGGATCAGTTCACATCGTTCACACTATTTAATTTATTTATTTATCATAAGAATTAATTTTGTTTCCTATAATTAAGGTTCAACTATTTAAATTTCCTTGTAAAATCTTATAGATCGACAGCTAAAAAAATATCATAACTCCTATACCCGACATTGATTAGCATATGTATATCGCTTTCAAGCTCATCCAGAATTCAAATCTTGATTGTGCACAAACACACGCAAGCTTGTAAAATTCAGAAAAGGAGAACAGAAGAAAAAAATATGTTGTAAAGAAGTATTTACACCTAATAAGCTATAGTATAGGATTCATATGAATATTTTGATGAAGTT
ACAAATAAATGCAGTCTAGGTAGGAAAACTTTATAAGCAATACCAGCCATCCATCAAATCCCAACGCAATAGCAAGCTCAGTTAAGCGTTCCGCATACATCTCAACAGAATCCTTTGATGAAAGCAAGGTATCCCGAATATCTGTTCCCCCTCCTTCCAAAATGAATGTCCCTAGTACCTATAGATTCAAGACCACTATTAATTTGTAAATGCTAAGAAAGACTACCATCGCAGCCCCACTTAGTAACTCACCCTTGTTCACAAATAGCTATATATATCTATAAATTACCGACTCAATATGATGCAAAATAACAAGTTTATTATTAGTTTTTTTGGTCTAAGTTGATTTCTTGTGATAACATCAAGTCATAAAGCTTTCTAAGAAAATCTGGAAAACATCCAGAATTTAAAATATATGTTCTTTTAATTACATTAACATTTAAAATATTTTAATTAACACGTCATTATTACAGGTAAAAAAACGTGGACAAATCTCTACTAAGTCAATCATTTTCAAATTTCCATCAATGTTTTATTTGCTCTTCATTTGATTTATGTCAAAAATGATGTCAAATGATTGTTCAGGATCAATAAGGCATGATTTAATGAAAGATACAGATAACCAATGTGCACTATATTTAAAGAAGAAGTGGGGCATGTTTGGAATTGATTTTGAAATAGCTAAAATCACTTTCCAAAACATGTTTTGATCATCCAAAATCAATTTTGATTGTATAAAAATCACGTTTAAAATGCAAAATCAAACATTAAATTTTTCTTTTTTTGAACGATCAAGTTCATACTTGGGAGTGATTTCAAACATGAAAAATGTAATCTTAATCATTTCAAATCATTCCCATATGTGTTCGTAATATGCAGCATTCTTTAAAATGTATGGCCACATCAAATGAGCCGATAACTCTTGTCATAATATTATGAAAAAATCTACAAGCAAACCACGGTAGGCTAGTTCAGACACAATCCAACTTTTCAACTAATAATAATAGTAAGGAAACAAAGAAGTATGGACCAATAAAACCAATCAGTGAACTGAGTAGGTACCTTAACACCGTGCCTGTGAGCTGTATTTGTCCAACATGGAGGTGGAAGGGTGACCAAATCATGGGAGAAGTAAACAAAAATATCGATCAAATACCAATGCCATATTGCATAAGCATCTGGGTTAGCTCCACCCTGAACCCATTTATCATCTTTATATCCACCAGCCATATCATGGCAAACAAGAATCCTACGCCTGTCGGGCAATGGCGCAGGTTGAAGAGCAACCGTGGATATATTAAATGGGTAGTGAAATGAATTGAAGTAGGCCCTGGACTCCAGGTCCTTGAGTGTTTTGATAGGGTAAGCAATAGGAATAGACGGTTCAGTTGGATCAAATGGAGGAGGAGGAGGAGGAGTTTCTAACCTTGGTGGGACAGAGCTCTGCTGGGTCATTGTGAAGAAAAAAGACGGAAGTATTCTGCAACTATGAAAGCGAATGGTTCTAACAAGGTTTCGGAATGCCGGTTTATGTAGGGGCAAAAAAGCATCAAAAGCGACAAGTATTGATGTAAAAAGAGACGAAGATTTGACTCTCAGTTTCAATAAAAAAATCACGAACACAATACAGCATGAGATACAGCCTACGACTTCAATTGATGATAAACGAACAGTAGAAATCTCCATATAAAGCAGATTCGATGAGTATCTAACCTTTCTCCACTGCTCTTATTTGCAGAAATCGGGAAGGGTTCCTGAGTTAAAGGATGAGAAGGAATCAATCTGAGGAAGGAGAGAGTGAGACAGATTATGGAAATGAAGAGTCGTTGATTCCTTTTTCCTTTTACTAATTCCGTTGTGTAATAGGAAGAAGATCCTGCGGAATGGGACTGCGTTTTGATGGTGACTTGTATCTTCGGTTGCTGCCATTGTTGATGCTGATGGACCTTCGATCGAAATCGAATGGGAAGAAGAAACTAGAAAGTGATGGGGGGAAATTA
AAATGTTAGGGAAGGAACTGCTCTGCCAGACAGATTTTTCTTTTTTGTACTTTTTTTTCTTTCTAAAATAAGATGTGCAGGGAGATTAAAAAAAACACACAGAAAAAAAAATAATAATAATAACAATAATAAAATAAAATAACCATACTTCTTTTGGGTGAATTTGCAACCACCCAATAATGCTCCACCAATATGATCACAAAAATCCATTGAAGCTCACATTTGCCTCTATTGGTGGATCATGCTTCATCAATATTCGACTTTGATGGATCTTTTGCTGGAGTCGATTAAGAGGTAGGAAGCGATAATCTAGCGTCGACTAACGGCATCTACTAACACCAAACAAACTAGCGTAAGTGGTAGATATTTTAAGTATTATATTTACTGGTTTAACCACTTATCAAACTTATGTATAAATAGGCTTCCAACCCCTCATTTTGAATATCAATACAAGACTTACTCTTTTTCTTCGTGTAACTTTGCTTCTATTGGTATCAAAGCGTTTGATCCAAACGCGCATTCTTAAATTTGGATTAACAACTCATGGACGCATAAGCTTCCTCTTCAATTAAAGCTTAGGAGAACCGATTGACTTCAGTAGGAACTGCCATGATCGAGACGAAAAATGAAGTGGGTGACTTGTGAGTAATGTTAGCTCAAATTCTTCAACATTTGGATATCCAAAAGCCAAAAGAGAATGAGGGACAACAAGTGCAAGGCAATCAGAAAAACAAAGGAATTCTTCAAATCAATGACAACCTTAAAAAAGGAGGGTCAATTAATGATAAAAATGGGAGGAAGAATTCAAGAAAGATTTCAAGAACCCTTCAAGAATCAGATGAAAACCAAAGTAATTGGACACAATCTGCTTGGACCTCAAAATTTGCCAACAAAGCTAGATATATGGTGCAAGATTTCGGATTCTTTAGATGAAAATGTTCACGACTACTTTAGAATTCAAGAAAGAAGGGAGAATAGACAGGGGAGATATCAATATCATCAAGAACCACTAGAATACAAGATGAAGGTAGATCTTCCAAGTTTTGATGATCGAATGGAAGTAGAAGCTTTCCTCGACTGGGTTAAGAAGGTCGAAAACTTCTTCGAGTATGCCAGTATTCTAGAGGAATAGAAGGTGAAGTTGGTGGCTTTTAAATTTCAAAAGGGAGCTTCAACATGGTGGGATCAACTAGAAACAAACCGCCAGATCTATCTATGGGAAACCACCTATCAGAAATTGGCCCAAGCTACTAAAGCTGATGAAGAAAATATTCCTTCCTATGAACTATCAGAAGATTCTATACAATCAATACCAACTTTGCAAACAGGGAGACCGATCAATTTGTGATTACACTGAAAAATTTCAACGGTTAGGTGCCCGCAAAAATCTGCCCGAGACAGAGCAACAGTCAGTATCAAGATTCATTCTTGGTCTTCGTGATAAAATCAAAGAAGTTGTCAAGCTACATCATGTTGTCTATTTGGTAGAGGCAACTACCTTGGCTTCTACAATCGAATAAAAAGAAGGGGTTAAGCAAACAAAAACCTACCAAGAAAAAACACATGGGAGAGACAACAACCTACTTCAAGAAAAACACCTATTGAAACTTCAAGAACCACCATCCCAAAAATAGGTTGAACCTGTAGGGACTTGAATCATGCCGGAAGTATGTGATCGAGTTCTTTGATTTATTGAAAGAATTAAAAATAGAAAAAGTAAAGAAAAAAGTTAAGCATCATAACTCTTTCTTTTTCAGGCTGTGACCTTGGCAAGCATCCGGTTTGATCTCGCATCCGGGCTGTGACCTTGGTTTGATCTCGCATCGGTTTGTGGGAACCGCTTCTTTGAAAGAATGGGAAGAAGATCTAAAGTAATTGTAGAGGAAGATTTTTAAAGTAATTGTAGAGGAAGCTTTTTAAAGTAATTGTAGAGGAAGATTTTTAAAGTAATTGTAGAGGAAGATTTTTAAAGTAATCGTAGAGATTATTCGTGTAGAGAG
TGTATTCTATGAAGTCAAATATTTAATAATATCCAATATTAATAAATATCTACCCATATAATATCTCATATTAAATGATTTAAAATCTTTAAATCTTATTTAAATATTTATTTTCTACATAATTTTTAATTTTTGGATCTCATCCAAATTAAATAATTAATTATTTGAATCCAATTCAAATTAATTAATTAATTCTCTCTCTATTTATTTATATGGATTTGATCCAAATAAATAATTAATTATTTGAATCATATTCAAATAAATTAGTTCTCCATATTATAAAGTTATAATTTGAATCACATTCATATAAAATTTATCATGCAATTAATTGTATCCAATACAATTAATATTTTCCCTCATAAATTTGAACATTTTAAATTTTTCTTTCAACATAGTTGATCATTGTTGGTCCGTTATGAGCTAGCAAGGGGACCTTGTGGACCTACAAATCAGAAGCTCCAACGATATGAGATTAATCGACTAAACTCATTAATCACATTAATCAATATTCGTTAACTATTGGTATACTGCTAAAGACCCATAGCTGCACTCTTCTCGCTACAGATATATTTATGTGTCTATGGATATAGACCAATAATAGCAAATTAGTCCTTCACAAGTGTTCGTAACACCAACTGGGTCAAATTACCGTTTTACCCCTGGGTTACTTCTAAATCCTTAAGTACTAGTACTCCTCTAATGAATAACCTGTTTATGGTCCAACCATTAAACAGAACCCCTCTCGGGCAAGTGAGAGGGTAGGGCCATTTGTTCAAGTCCTGGAGACACCACTTAAGGGAGCACTTATTTACTTACCCTAAAAATGAGAATGAGTGAATTCCATCTTGTATAATTATGTCCCCAGCTCCCCACTCGGTCTTGTCCCCAAGATGGTAGGCATATTGAGTCGGCAAACGGCTACTCTCACCCATACAAATCAAAGGACAATCCCTTGTGGACAGGAGTTCATAATATACTTAGGTTTAAGACTTAGTTGCATAGGTCATCCTATTGAAATAGAAACCTAACTAGTCAACGGAGTTACATCTAGTGATTACTATTTCGTGGTCCGGTCTTATGCAAACTCATTGCATAGGATACCCCCACTCGCATGTTGCTTACATGAACATGTTAGATCATTGTGCTTGTATCCAATACAAAGTGGGCCGTATCCATAGTGTCATCAGGATAAAGTATCCAGTCTTATCCATATACTATAGACCCTTTAGACTGTATCTTGAACATTGATCTCCATATGTCCCCACATATAGTTCAAGACTCATAAACAGCCTAGAATGTTAGTTTATTGGATTTAGGGTTATTAAGACAAAAAAAAAAAAAAAAACAATTAATAACGCTTATTGAAGTAACACATCAATAACTCTTTATTAATGATTTAATGTCGGTTAAATAATTACATTTACAATCTACGAGTTTTAGGACATAAAACCCAACAAAACCAACAATCCAAGAAAGATGATCCAATTCTGAAACAACTACCTATTTGAAGCAATGATACTAGCACTGCCGGAAAAAACACTAATCCCTACAAACGACCAACATTTGGGAAGTGCTTTTGTTGTGGTCAGCAAGGCCACCTTTCTAATGAATGTCCCCAACACAGGATGTTGGCTATATAAGAAGAAGAAGAGGATGGAGAAGATGGTTGCAATTCAGAGGACAATGAAGAATTGGCTTACTTACAAGTTGAAAATGGTGACCAATTGTCTTGTGTGCTTCGATAAGTACTCATCACACCTAAATTTGAAAGCCACCCTCAGAGACACTCCTTTTTTTTGGATAAGATGTACTATAAATGGTTGGGTATGTCAAATAATTATAGGTAGTGGTAGTAGTGAGAACATTGTGTCCATAAAGCTGGTTACTGCACTCAATTTGAAGGCTAAAGCACACCCTAATCCGTATAGGGTCAGTTGGATCAAAAAAAGGGGAGAAATCATGGTTAAAGAGATATATACAATCCCTCTCTCGATTGGAAA
CATTTACCAAGACCAAATTACGTGCGATGATGTGGATTATTCAAGAATCAAAGTCGAGTTGCTTTGTTTTGGAAATGGATGTGTGTCACCTACTCTTTGGAAGGCCATGGCAATACAACAACAATACAATCCATAATGGACAAGACAACACCAATGAATTCAGATGGATGGGCAAGAAGGTGGTTCTCATTCCCCTCGATAAGAATGACAACAGGCTTAAGGTACCTTCTACAACAAAAAAACAACTTTTCCTTTTGTCCCCAGGTAATGATCTACTATTAGAAAAAGACAATACACTTTTAGCATTTATTTTAAAACAAGACTCTTTTGATAGTCCAAATTTTGACATCTTAGATCCTCACATATCCAACCTATTAAATGAATTTCCTACTCTGATAGAAGAACCCAATTCTCTGCAACCACTTAGGAACATTCAACATCATATATATCTAATACTGGGGAGCTCCTTACCTCATTTACCACATTTTCTCATGAGTCCAAAAGAATATGAAGCTTTGCACAATGAAATCCATAAGGACATATACAACGAAGCCTTAGTCCATCTACCATCCCTGCTCTATTTGCACCAAAAAAATATGGATCTTGGAGGCTTTGTGTAGATAGTAGAGCCATTAACAAAATAACTATAAAATATAGGTTCCCAATACCTCGACTCTCTGATTTATTAAACCAACTTGGAGGAGCCTCCATTTTTTCTAAAATTGACCTTAAAATTGGATATCACCAAATTAGGATCCAATCGAGGGATGAGCGGAAAACAACTTTTAAAACAAATGAAGACCTTTTTAAATGGCGTGTGATGCCCTTTGGTTTGTCAAATGCTCCAAGTATATTTATGCGTTTAATGAACCAAATCTTGTTACCATTTCTAAATAAATTCATCGTAGTTTATTTTGATGACATCCTAATTTATTGCTCTTTGTATGACAAACACTTAAAACATTTACATTCAGTCTTTTATGTTTTAAAGGAAGATGCATTATATATAAATTTCAAAAATGTTATTTTCTTCAATCTGAACTTTATTTTTTGGGATTTCTAATTAGTTCAAAGGGAATTAGTGTAGACCCTAGAAAAATAGAGGCAATAGTTGAATAGGTAGAACCAAAAACAGCCAAAAACGTTTAAACCTTTCTTGGTTTAGCTTCATTTTATAGAAAATTTATCCCACATTTTAGTACCATTGCAGCCTCGATAACAGCTTGTTTAGAGGCCAATATTTCATTTGAATGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGTAATAACAAAGTTTTTCTGAATTAAAGAGAATGCTTAGCTCAACTCTAGTCCTAGGATTTCCAAACTTTAATAAACCGTTTGAGCTCTTAGTAGATGCTTCTAGAATCAGTATTGGAGCAGTACTAAGTCAAGACAAGCACCCTATAGAATTCTTTAGTGAAAAATTGAGCCTCTCCAGACAAAAATGATCAACTTATGAACATACTCTACTCTCTTGTTTGGTCTTTAAAATAGTGAGAACGTTACTTGCTTGGGA
AAGAATTTATGTTGCTAACAAATCAGTTTTCTTTAAAAATTTTACAATCTCAAAAGGAAATTAGCTGTATGCATGCTAGGTGGATCCAATTCATTCAAAGGTTTGACTTCGTCATTAAACATACCCTAGGTAAAGCTAATAAAGTTGCAGACACATTAAGTAGAAAAGGTACATTTCTAACTTCAGTTCATGGAGAAATTATAACTTTTGGTCATCTACTTGAATTGTATGCTAATGATCCAGATTTCAAAAATATTTGGCAAGCATGCTCAAACAATATTCCAACCAAAGATTGTCATATTTTTTATGGGTTTTTATTTTAAAATGACACCTTATGTATCCCCCAAACATCTCTTTGTGAATTATTGTTACAAGAAGCCCATTCGAGAGGTCTTGCTAGCCACTTTGGTAAAGATAAAACTTTGGCACTACTTTCCTCTAACTTTTCCTAGCCACAACTTAAAAAAGATGTTGCTAGTTTTGTTAACGTTGCTATATTTGTCAATCTTCCAAGGGTTCTAGTTCAAACCAAGGTCTGTATTTCCCGTTACCAATACCAAACAATATTTGGGAAGATCTCTCTATGGATTTCGTCCTTGGTTTACCTAGGACCCAAAGAGGCTTTGATTCCCTTCTAGTGGTTGTTGACCACTTTAGCAAAATGACACATTTTTTAGCTTGCAAAAAAACTAATGATGCTTTAAATGTTACTGATCTCTTTTTAGGGAAATTGTTCGTCTCCATAGTGTACTTAAAAGCATAGTATCAGATAGGGATGTCAAATTTATGAGTTATTTTTGGCGATCCTTATGGAAATTTTTTTGGAACAAACCTCCTATATAGCACAACAAACCATCCTCAACAAACGGAGGTTACAAATTGCACACTCGACAATCTTTTAAAATGTTTAAGTGGAGATAAACTCGGCTTGAGCAAGAGAAATGTTCCCATTGGCGAGTGATTAATATGGTTAATCACTTGACGGGGAAGTGCCCATTTGAAGTTATATATACTCATGTTCCTAGATTAACATTAGATCTTGCTAAACTACCAATTTCTGTTGATCTTAGTGTTGAGGCTACCATTATGGCTGATCGAATAAAAATAATTCCTGAAGAATTTCAACAACACCTATAAGCAGCTAACTCCTCTTCCAAATCCAAAGCAGATCGACACAAACGAGTTGTGGAATTCCATATTGGAGATTTAGTAATGGTCCACTTAAAGAAAAGTAGAATACCTACTCACCAACACTCTAAACTCACAAATAAGAAGATAGGACCATTTCCTATTCTAGAAAGACTTGACCCTAATGCACAAGATTGATATTCCTCCAACAATGAAGATCAACAATACTTTCAATATCTCAAACATATTCCTCTATCATGCTTAGGACTAGTTCACTTTATCACAATCAAACTCGGGACGAGTTTATTTTTTGGAACGGGAAGAACTGATATAGATATTTTAAGTATTATATTATAGAACCAATTATTTAAGCTGGAAACTATTTTTTTATATGTTTTATTATTTACTATTACTGGCTACATAGGTTAATAAACCATTTTACTAGTTTAACTAGTTACCAACTTATGTATAA

mRNA sequence

ATGATAACGGATGTTGCACTGGATGTTCTGAAAAGGGATGATGTATCGGCTGCCATATTTGCTCCTGGATGGGTGTATGAGCACGAGCAGGAAACAGATTTTCAGACTGCTCAAAATAAATGGTGGGGTCTTGTGAAGCAATCATGGGAAATAGTGCGAAGTTACCCAAAACTAGTACCATTCTACTCAAATTTTGATCAGTTACCAACTTATGTATAA

Coding sequence (CDS)

ATGATAACGGATGTTGCACTGGATGTTCTGAAAAGGGATGATGTATCGGCTGCCATATTTGCTCCTGGATGGGTGTATGAGCACGAGCAGGAAACAGATTTTCAGACTGCTCAAAATAAATGGTGGGGTCTTGTGAAGCAATCATGGGAAATAGTGCGAAGTTACCCAAAACTAGTACCATTCTACTCAAATTTTGATCAGTTACCAACTTATGTATAA

Protein sequence

MITDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFYSNFDQLPTYV
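
The coding sequence and protein sequence listed above can be checked against each other by translating the CDS. The sketch below is one way to do that in plain Python. It assumes the standard nuclear genetic code, which the page does not state explicitly, and `translate` is an illustrative helper rather than a function from any particular library.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position (assumed).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

CDS = (
    "ATGATAACGGATGTTGCACTGGATGTTCTGAAAAGGGATGATGTATCGGCTGCCATATTTGCTCCTGGA"
    "TGGGTGTATGAGCACGAGCAGGAAACAGATTTTCAGACTGCTCAAAATAAATGGTGGGGTCTTGTGAAG"
    "CAATCATGGGAAATAGTGCGAAGTTACCCAAAACTAGTACCATTCTACTCAAATTTTGATCAGTTACCA"
    "ACTTATGTATAA"
)
PROTEIN = "MITDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFYSNFDQLPTYV"

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)

print(len(CDS), len(translate(CDS)), translate(CDS) == PROTEIN)
```

The 219-nucleotide CDS yields 72 residues plus a terminal stop codon, and the translation agrees with the protein sequence shown on the page.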
Homology
BLAST of CaUC07G129440 vs. NCBI nr
Match: XP_004148186.3 (cytosolic endo-beta-N-acetylglucosaminidase 1 [Cucumis sativus] >KGN50312.2 hypothetical protein Csa_005893 [Cucumis sativus])

HSP 1 Score: 136.3 bits (342), Expect = 1.0e-28
Identity = 61/65 (93.85%), Positives = 65/65 (100.00%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VAL+VLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKL+PF+
Sbjct: 360 TNVALEVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLIPFH 419

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 420 SNFDQ 424
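
The Identity figure reported for a hit is the number of alignment columns where query and subject carry the same residue, divided by the alignment length. The sketch below recomputes it for the Cucumis sativus hit above. It assumes an ungapped alignment with both rows already joined into single strings, and `percent_identity` is a hypothetical helper name.

```python
# Aligned rows from the first hit, with the two display blocks joined together.
QUERY = "TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY" + "SNFDQ"
SBJCT = "TNVALEVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLIPFH" + "SNFDQ"

def percent_identity(query, subject):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # A column counts as identical only when both rows hold the same residue.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return matches, len(query), round(100.0 * matches / len(query), 2)

print(percent_identity(QUERY, SBJCT))
```

Four columns differ (D/N, D/E, V/I and Y/H), giving 61 identical positions out of 65, in line with the reported 93.85%.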

BLAST of CaUC07G129440 vs. NCBI nr
Match: XP_038877781.1 (LOW QUALITY PROTEIN: cytosolic endo-beta-N-acetylglucosaminidase 1-like [Benincasa hispida])

HSP 1 Score: 134.8 bits (338), Expect = 2.9e-28
Identity = 61/65 (93.85%), Positives = 64/65 (98.46%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VALDVLKRDDVSAAIFAPGWV+EHEQETDFQTAQNKWWGLVKQSWEIVRSYPK +PFY
Sbjct: 368 TNVALDVLKRDDVSAAIFAPGWVHEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKQLPFY 427

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 428 SNFDQ 432

BLAST of CaUC07G129440 vs. NCBI nr
Match: XP_008454832.2 (PREDICTED: cytosolic endo-beta-N-acetylglucosaminidase 1-like [Cucumis melo] >KAA0035914.1 cytosolic endo-beta-N-acetylglucosaminidase 1-like [Cucumis melo var. makuwa] >TYK19066.1 cytosolic endo-beta-N-acetylglucosaminidase 1-like [Cucumis melo var. makuwa])

HSP 1 Score: 134.8 bits (338), Expect = 2.9e-28
Identity = 61/65 (93.85%), Positives = 64/65 (98.46%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VAL+VLKRDDVSAAIFAPGWVYE EQETDFQTAQNKWWGLVKQSWEIVRSYPKL+PFY
Sbjct: 287 TNVALEVLKRDDVSAAIFAPGWVYESEQETDFQTAQNKWWGLVKQSWEIVRSYPKLIPFY 346

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 347 SNFDQ 351

BLAST of CaUC07G129440 vs. NCBI nr
Match: XP_022977055.1 (cytosolic endo-beta-N-acetylglucosaminidase 1-like [Cucurbita maxima])

HSP 1 Score: 129.0 bits (323), Expect = 1.6e-26
Identity = 57/65 (87.69%), Positives = 62/65 (95.38%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VALDVL++DDVS AIFAPGWVYEH QETDFQTAQNKWW LVK+SWEIVRSYPKL+PFY
Sbjct: 291 TNVALDVLRKDDVSVAIFAPGWVYEHPQETDFQTAQNKWWNLVKKSWEIVRSYPKLLPFY 350

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 351 SNFDQ 355

BLAST of CaUC07G129440 vs. NCBI nr
Match: XP_023536396.1 (cytosolic endo-beta-N-acetylglucosaminidase 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 129.0 bits (323), Expect = 1.6e-26
Identity = 57/65 (87.69%), Positives = 62/65 (95.38%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VALDVL++DDVS AIFAPGWVYEH QETDFQTAQNKWW LVK+SWEIVRSYPKL+PFY
Sbjct: 287 TNVALDVLRKDDVSVAIFAPGWVYEHPQETDFQTAQNKWWNLVKKSWEIVRSYPKLLPFY 346

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 347 SNFDQ 351

BLAST of CaUC07G129440 vs. ExPASy Swiss-Prot
Match: F4JZC2 (Cytosolic endo-beta-N-acetylglucosaminidase 1 OS=Arabidopsis thaliana OX=3702 GN=ENGASE1 PE=1 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 6.2e-21
Identity = 44/64 (68.75%), Positives = 56/64 (87.50%), Query Frame = 0

Query: 4   DVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFYS 63
           +VALD+LK  +VSAAIFAPGWVYE EQ  DF TAQNKWW LV++SW IV++YP+++PFYS
Sbjct: 284 NVALDLLKSSNVSAAIFAPGWVYETEQPPDFYTAQNKWWSLVEKSWGIVQTYPQVLPFYS 343

Query: 64  NFDQ 68
           +F+Q
Sbjct: 344 DFNQ 347

BLAST of CaUC07G129440 vs. ExPASy Swiss-Prot
Match: Q9SRL4 (Cytosolic endo-beta-N-acetylglucosaminidase 2 OS=Arabidopsis thaliana OX=3702 GN=ENGASE2 PE=1 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 4.0e-20
Identity = 42/64 (65.62%), Positives = 56/64 (87.50%), Query Frame = 0

Query: 4   DVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFYS 63
           + ALD+LKR++VSAAIFAPGWVYE  Q  +F TAQNKWW LV++SW IV++YP+++PFYS
Sbjct: 289 NAALDLLKRNNVSAAIFAPGWVYETAQPPNFHTAQNKWWSLVEKSWGIVQTYPQVLPFYS 348

Query: 64  NFDQ 68
           +F+Q
Sbjct: 349 DFNQ 352

BLAST of CaUC07G129440 vs. ExPASy Swiss-Prot
Match: A1L251 (Cytosolic endo-beta-N-acetylglucosaminidase OS=Danio rerio OX=7955 GN=engase PE=2 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 3.0e-07
Identity = 28/65 (43.08%), Positives = 42/65 (64.62%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+ AL+++++ D+S AIFAP WVYE  ++ DF+  Q+K+W L+     I R    L PF 
Sbjct: 323 TNKALELIRKYDLSTAIFAPDWVYECHEKADFRQNQDKFWSLLSDFLYIHRPSSNL-PFV 382

Query: 63  SNFDQ 68
           S+F Q
Sbjct: 383 SSFCQ 386

BLAST of CaUC07G129440 vs. ExPASy Swiss-Prot
Match: P0C7A1 (Cytosolic endo-beta-N-acetylglucosaminidase OS=Gallus gallus OX=9031 GN=ENGASE PE=1 SV=1)

HSP 1 Score: 50.1 bits (118), Expect = 1.2e-05
Identity = 20/44 (45.45%), Positives = 34/44 (77.27%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVK 47
           T+ +L ++++  +SAAIFAPGWVY+H  E +F   ++K+WGL++
Sbjct: 342 TNKSLSLIRKHGLSAAIFAPGWVYKHLGEENFLLNEDKFWGLLE 385

BLAST of CaUC07G129440 vs. ExPASy Swiss-Prot
Match: Q8BX80 (Cytosolic endo-beta-N-acetylglucosaminidase OS=Mus musculus OX=10090 GN=Engase PE=1 SV=1)

HSP 1 Score: 50.1 bits (118), Expect = 1.2e-05
Identity = 20/45 (44.44%), Positives = 35/45 (77.78%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQ 48
           TD +L+++++   SAA+FAPGWVYE  +++DF   Q+K+W L+++
Sbjct: 336 TDKSLELIRKHGFSAALFAPGWVYECLEKSDFFQNQDKFWSLLER 380

BLAST of CaUC07G129440 vs. ExPASy TrEMBL
Match: A0A5A7SX49 (Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold529G00270 PE=3 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 1.4e-28
Identity = 61/65 (93.85%), Positives = 64/65 (98.46%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VAL+VLKRDDVSAAIFAPGWVYE EQETDFQTAQNKWWGLVKQSWEIVRSYPKL+PFY
Sbjct: 287 TNVALEVLKRDDVSAAIFAPGWVYESEQETDFQTAQNKWWGLVKQSWEIVRSYPKLIPFY 346

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 347 SNFDQ 351

BLAST of CaUC07G129440 vs. ExPASy TrEMBL
Match: A0A1S3BZ17 (Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucumis melo OX=3656 GN=LOC103495146 PE=3 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 1.4e-28
Identity = 61/65 (93.85%), Positives = 64/65 (98.46%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VAL+VLKRDDVSAAIFAPGWVYE EQETDFQTAQNKWWGLVKQSWEIVRSYPKL+PFY
Sbjct: 287 TNVALEVLKRDDVSAAIFAPGWVYESEQETDFQTAQNKWWGLVKQSWEIVRSYPKLIPFY 346

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 347 SNFDQ 351

BLAST of CaUC07G129440 vs. ExPASy TrEMBL
Match: A0A6J1IL76 (Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucurbita maxima OX=3661 GN=LOC111477237 PE=3 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 7.8e-27
Identity = 57/65 (87.69%), Positives = 62/65 (95.38%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VALDVL++DDVS AIFAPGWVYEH QETDFQTAQNKWW LVK+SWEIVRSYPKL+PFY
Sbjct: 291 TNVALDVLRKDDVSVAIFAPGWVYEHPQETDFQTAQNKWWNLVKKSWEIVRSYPKLLPFY 350

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 351 SNFDQ 355

BLAST of CaUC07G129440 vs. ExPASy TrEMBL
Match: A0A6J1GY36 (Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucurbita moschata OX=3662 GN=LOC111458520 PE=3 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 3.9e-26
Identity = 56/65 (86.15%), Positives = 61/65 (93.85%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VALDVL++DDVS AIFAPGWVYEH QETDFQ AQNKWW LVK+SWEIVRSYPKL+PFY
Sbjct: 274 TNVALDVLRKDDVSVAIFAPGWVYEHPQETDFQIAQNKWWNLVKKSWEIVRSYPKLLPFY 333

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 334 SNFDQ 338

BLAST of CaUC07G129440 vs. ExPASy TrEMBL
Match: A0A5D3D685 (Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold529G00320 PE=3 SV=1)

HSP 1 Score: 126.3 bits (316), Expect = 5.1e-26
Identity = 55/65 (84.62%), Positives = 63/65 (96.92%), Query Frame = 0

Query: 3   TDVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFY 62
           T+VALDVL++DDVSAAIFAPGWVYEH+QETDFQTAQNKWW LVK+SW +V+SYPKL+PFY
Sbjct: 283 TNVALDVLRKDDVSAAIFAPGWVYEHKQETDFQTAQNKWWNLVKKSWGLVQSYPKLLPFY 342

Query: 63  SNFDQ 68
           SNFDQ
Sbjct: 343 SNFDQ 347

BLAST of CaUC07G129440 vs. TAIR 10
Match: AT5G05460.1 (Glycosyl hydrolase family 85 )

HSP 1 Score: 100.9 bits (250), Expect = 4.4e-22
Identity = 44/64 (68.75%), Positives = 56/64 (87.50%), Query Frame = 0

Query: 4   DVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFYS 63
           +VALD+LK  +VSAAIFAPGWVYE EQ  DF TAQNKWW LV++SW IV++YP+++PFYS
Sbjct: 284 NVALDLLKSSNVSAAIFAPGWVYETEQPPDFYTAQNKWWSLVEKSWGIVQTYPQVLPFYS 343

Query: 64  NFDQ 68
           +F+Q
Sbjct: 344 DFNQ 347

BLAST of CaUC07G129440 vs. TAIR 10
Match: AT3G11040.1 (Glycosyl hydrolase family 85 )

HSP 1 Score: 98.2 bits (243), Expect = 2.8e-21
Identity = 42/64 (65.62%), Positives = 56/64 (87.50%), Query Frame = 0

Query: 4   DVALDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPKLVPFYS 63
           + ALD+LKR++VSAAIFAPGWVYE  Q  +F TAQNKWW LV++SW IV++YP+++PFYS
Sbjct: 289 NAALDLLKRNNVSAAIFAPGWVYETAQPPNFHTAQNKWWSLVEKSWGIVQTYPQVLPFYS 348

Query: 64  NFDQ 68
           +F+Q
Sbjct: 349 DFNQ 352

BLAST of CaUC07G129440 vs. TAIR 10
Match: AT3G61010.1 (Ferritin/ribonucleotide reductase-like family protein )

HSP 1 Score: 66.2 bits (160), Expect = 1.2e-11
Identity = 30/51 (58.82%), Positives = 41/51 (80.39%), Query Frame = 0

Query: 7  LDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPK 58
          L +LKR++VSAA+FAPGWVYE  Q+ +F +AQNKWW LV++S  IV++  K
Sbjct: 3  LYLLKRNNVSAAMFAPGWVYETAQQPNFNSAQNKWWSLVEKSCGIVQTIHK 53

BLAST of CaUC07G129440 vs. TAIR 10
Match: AT3G61010.2 (Ferritin/ribonucleotide reductase-like family protein )

HSP 1 Score: 66.2 bits (160), Expect = 1.2e-11
Identity = 30/51 (58.82%), Positives = 41/51 (80.39%), Query Frame = 0

Query: 7  LDVLKRDDVSAAIFAPGWVYEHEQETDFQTAQNKWWGLVKQSWEIVRSYPK 58
          L +LKR++VSAA+FAPGWVYE  Q+ +F +AQNKWW LV++S  IV++  K
Sbjct: 3  LYLLKRNNVSAAMFAPGWVYETAQQPNFNSAQNKWWSLVEKSCGIVQTIHK 53

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_004148186.3 | 1.0e-28 | 93.85 | cytosolic endo-beta-N-acetylglucosaminidase 1 [Cucumis sativus] >KGN50312.2 hypo...
XP_038877781.1 | 2.9e-28 | 93.85 | LOW QUALITY PROTEIN: cytosolic endo-beta-N-acetylglucosaminidase 1-like [Beninca...
XP_008454832.2 | 2.9e-28 | 93.85 | PREDICTED: cytosolic endo-beta-N-acetylglucosaminidase 1-like [Cucumis melo] >KA...
XP_022977055.1 | 1.6e-26 | 87.69 | cytosolic endo-beta-N-acetylglucosaminidase 1-like [Cucurbita maxima]
XP_023536396.1 | 1.6e-26 | 87.69 | cytosolic endo-beta-N-acetylglucosaminidase 1-like [Cucurbita pepo subsp. pepo]

Match Name | E-value | Identity (%) | Description
F4JZC2 | 6.2e-21 | 68.75 | Cytosolic endo-beta-N-acetylglucosaminidase 1 OS=Arabidopsis thaliana OX=3702 GN...
Q9SRL4 | 4.0e-20 | 65.63 | Cytosolic endo-beta-N-acetylglucosaminidase 2 OS=Arabidopsis thaliana OX=3702 GN...
A1L251 | 3.0e-07 | 43.08 | Cytosolic endo-beta-N-acetylglucosaminidase OS=Danio rerio OX=7955 GN=engase PE=...
P0C7A1 | 1.2e-05 | 45.45 | Cytosolic endo-beta-N-acetylglucosaminidase OS=Gallus gallus OX=9031 GN=ENGASE P...
Q8BX80 | 1.2e-05 | 44.44 | Cytosolic endo-beta-N-acetylglucosaminidase OS=Mus musculus OX=10090 GN=Engase P...

Match Name | E-value | Identity (%) | Description
A0A5A7SX49 | 1.4e-28 | 93.85 | Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucumis melo var. mak...
A0A1S3BZ17 | 1.4e-28 | 93.85 | Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucumis melo OX=3656 ...
A0A6J1IL76 | 7.8e-27 | 87.69 | Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucurbita maxima OX=3...
A0A6J1GY36 | 3.9e-26 | 86.15 | Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucurbita moschata OX...
A0A5D3D685 | 5.1e-26 | 84.62 | Mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase OS=Cucumis melo var. mak...

Match Name | E-value | Identity (%) | Description
AT5G05460.1 | 4.4e-22 | 68.75 | Glycosyl hydrolase family 85
AT3G11040.1 | 2.8e-21 | 65.63 | Glycosyl hydrolase family 85
AT3G61010.1 | 1.2e-11 | 58.82 | Ferritin/ribonucleotide reductase-like family protein
AT3G61010.2 | 1.2e-11 | 58.82 | Ferritin/ribonucleotide reductase-like family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | GENE3D | 3.20.20.80 | Glycosidases | coord: 2..57; e-value: 1.5E-8; score: 36.1
None | No IPR available | PANTHER | PTHR13246:SF4 | BETA N-ACETYLGLUCOSAMINIDASE, PUTATIVE-RELATED | coord: 3..69
IPR005201 | Glycoside hydrolase, family 85 | PFAM | PF03644 | Glyco_hydro_85 | coord: 3..65; e-value: 5.2E-9; score: 36.0
IPR032979 | Cytosolic endo-beta-N-acetylglucosaminidase | PANTHER | PTHR13246 | ENDO BETA N-ACETYLGLUCOSAMINIDASE | coord: 3..69

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CaUC07G129440.1 | CaUC07G129440.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008152 | metabolic process
cellular_component | GO:0005829 | cytosol
cellular_component | GO:0005737 | cytoplasm
molecular_function | GO:0033925 | mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase activity