HG10018325 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10018325
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCCHC-type domain-containing protein
LocationChr04: 3079143 .. 3096933 (+)
RNA-Seq ExpressionHG10018325
SyntenyHG10018325
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTAGACGTCGTTCACTGGTGGAAAGTAAATCGGCACATAATTTCTCTCTACTTTCTCGCCCAGAGAAATCTTGTTTCAAGAAGAACAGAATCATCAGATAGTGAAGGAGAGAGCTCAGAGCGAAGAACTTCAGAAATGGGAGAAGAACAAGAGGATTCACAGAGGTTGAAGAGAGTAGCGGCGGCAGCATATGACTACGAGAACGATCCCAGATGGGCTGATTACTGGTCCAACATTCTGATCCCTCCTCACATGGCTTCTCGACCCGATGTTGTTGACCATTACAAGCGCAAGTTCTACCAGCGATACATCGTGGGTTCTCTCCCTTTTGATTCTTTTCGATTTGGGTATCGCTGTTCTCTTCAATTTAGAATCCGTACTCAGTAATCGCTTATTGGGGTTGTTTTCTTTCTATTGGATTATTACGCTGTCTGGCGTTATGTTCTGGATTTTGGGATGTGACTTAACTCTTTTTGTTGTTTTCGCATTATATCAGTTGGTTCTCGCTAATTTCTGCTTTGTGGAGTGTTACTGTTAGTGCAATTTTTTTGGGGGGAATTTTATCTTTTAAATTTTGAGATTTCTTGTGGATTTTGGGTTCTTGGAGTTAAGAGAAGATGTTCAAGTTCAATTACAAGGCAATCAGCTAAAATCTTAGTGCAATTTTTGTTGTTTTCGCATTATATCATTTGGTTCTTGCTAATTTCTGCTTTCTGGACTGTTACTGTTAGTGCATTTTTATTGGGATTTTTCTCTTTTAGATTTTGAGATTTCTTGTGGATTTAGGGTTCTTGGAGTTGAAAGAAGATGTTCAAGTTCAATTACAAGGCAATCAGTTAAAATCTTGGACTGGTTTTGTGTTAAGCTGCGAAATGGGTTTTTGTTTAACTGTCCTCTTTTATAGAATTTGTGCTCTTTGGTGGTTGTTGAGAATTCAAATGTGTTCATGCATTTGTTTTAAATTCTTACTCTATGCTTCTTTTCCTTACTCGGTTTTGAAAGACAGGATCCCGAACTTGTGGTAGAGGCCATGTCTTCAAGTAGTTCAACTCAGTCATCTAGACCTTCAGCTACATCTTCCGCAGCACCCCCTCCTACTAATGATCGAAGTCGACCACAAAGTGCAGGTAGAAGTTCTGTTTTTCCCCCTCTTAATGTTAGACATCAAGATCTAGATATTTCTTTAATGTTCATGTTGAATTTACCTGATTGCATTTCTATAACGCATTGCTCATGATGAAGCAGGATCAACAACGACTAGGACTTCAGGTACATCTGCAAGTGCAGATGCTAATCCGACTCCATTGCGCTGGGATCGGCAAACAATTCAGTTTTCTGTCAATGCATGGGTAGGTAGCTTAGTTCATACTTGCTTATATGATGTTGCAAAGTTGGCATTACTACAGAATGTTTGAGGGCTGCACAATGTTATGTTATTTATGTATTTTGTTTTTTTTCTTAAATTTGAGGTGCTGGTTTTCTGAAAGTTTGTACGAACTTTTAATTACAAAGGTTCCATCATTTCTGATAGATGAAATGTAATTCTCATAGGTTTTGTTGAATATTTTTTGTGCATACCGCCATACACTATTTGTATCTGAAATGTTTTTTCTTCCTTCTCTTTGTGGAGCTATCCCTTTCATTACTGAAACCGTCCTCTGTTTGTGTGTGCTTGGGTACGCTTGTTATGCACTTCAATTCCTAGATATAAAGAGTAAGGTGTACATCATCTCTTCTACTCTGTAAGTTCTTATTTGACAGTGCCCTGATCTTTCTCCAAGATTTACTACCAGGTTTTCTCTATTAATTCAAGTTCTGTTCGTTTTACCTCGATTGAGATTTTCAACTGCCCCTTTTTGGGTAAACTGCAAATAAAAGACATTCTATTGGACTTTGTGATATCCTGGAGTATTAGTTTATTTATGGAAAGCTCATTCATAGCAGATTTTAGTCATCAAATTTTTAGAGGTTATATGTGAGCATTGTCCATGGGCTTCTTTTCCTTCATGGAAACTTGGAACCTCCAAGAAAAATCTTTATTAGAAAAGTTATGAACATTATTTTAAATTTGAGCATGAAAGCTATATACTAAGGGTATTTTCATAAAAAATTATTGGTTGTCTTCAGTATTGGCTCATGGTTATGGATAGTTCTGTTCTGCTTCCATCTTTTTTAATATATTTTTCAATTTTGGGATCTATTCTGAAAAGTTAATTTTCTTTTCTTCTGCAGGTGTTTATTGTGGCTGTACTGGCAATTTTCCCCCTAATACCCAAAAATCTTTCGCAGAGGGCATACAGGCTATCTTTTATGGGCACAACTTGTTCTTCTTTATATTCTTTGTACTCGTTGTATGGAGTAAGTTCTCCATACTAATTTCACAGCGTCATTTCCAATTCTATGAAAACGTTTGGTTTCTTCCAATCTAAAGAAAAGAAAAGAAAGAAAATGTTTGATTTACTATCATCAAAAGGAACAAAAAAGAAAAATAATAATAATAATAAAAGAAGAAGAAGAAGAAGAAAAATAAAACAGCTCCTTTATGTTCATCTAGTTTTATGTATTATCTTTCTATCTTGAATTATTAATTTTTTAATGTTTGCAGAAGCCCAGGGCGTGGAATTTGCAAGCATTGCAAGCTTATTTCCAGTCCATAATTGCGACAAAAGATTTCATTTACTTCACTTACTGTATCACCTTTGTGACTTCAAATATTTGTCTTAAATGTAAGCGTAGCTTTGCTGATCAAGTTTTGTTTACTGATCTCTTTAAGAGATCACCGTATTCATTCATTGAAACATGACTTTGTTTATATGATGTTTTTCTACAATTTCATACTTGTGCGGCAAGATTTTTAAACTTCTTTTTTTAATATTGTTAATGCAGTTGCTTTAATTCCTATCCTATGTCGGGCTCTTGAACATGTTGCAAAGTTTCTTAGGCGTAATTTTGCACGTTCGTCTTTATACAGGTAGCGTGTTGTCAATTTTTAATTCAGTTTATTTGTTGTATGAAATATATGAATATATGTCTACTTGGATTTTGAACCCTCCCCCCACCAAAAAAAAATATTAAATCCTTAGTCTCTGCTTACATTTAAGATTGCATGTGTTCTTTATTTTATATCCTCCATCTTTGCTTTTATGGAAGGGGAAGGTGCTCACAGTTTCTTCCTTGTTGTGCCTAGAGAAGGATATACTTCTTAGGAGAAAATCAAGTGGTGTACGAACAGAAAAGAGAGAATAGATCTGATTAAAGTTCCTTTAATTTCAAATTTGGAAGGTGAATATTATTAATAAGAGACGCATTATGTGCTCTTGAGGGAATTAATGGGTGTTCAAGGATATAGGAGAAATCAAGTGTGTGAGCTCTCCTGTCTACATGCAGGAAGAATAGATGAATGTGTATTGCCTGACTCTTACTTTTCTCGTGTGCTTAAAATCTGCTGGTTGACTTCTTTTATCGCTCAAGTTTATAACAATATGTCTTTAATTTGTTGCAGGAAATATTTGGAAGAGCCTTGCGTATGGGTGGAGTCAAATTCAACTACTCTCAGCATCCTATCTTCACAGGCTGAGATTGGAATTGGCTTCATTCTAATCATCTCTTTGCTCTCGTCAGTTTTTACTCCCTTTTGGCTCTTGTCAACTTCTTCTAAATCCACCCCCCCCCCCCCCATAACACACTCACTTTTTTAATGGTCATTTCTCTTCAATGATCTCCTCAAACTTATGTTCTACAGGTGGCAACGCAACTTCTTACATACATTCATGTACTGGCAGGTGCGCTTTCTTGTAATCGTTACATAATAGCATATTCTTTATTATTATTATTTTTTCTTTTTTAGTACAACATAATAGATATTCAGGCACACAAACAATATTATAAAACATAGAAGATGAGTTTACTATCCTAAATGTGGGGGGTGGGGTGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGCTATATTGAGCTTAGTTTCTTGAACCATTAACTCGAGAATAATTCTCTTGGATTTTTAACCAAACAATTGGTATGTTGTTTGTAATATGATCTCATTTTGGTGTTATAACTCATGAGTAGACTGGTGACCATCTTTAGGGTGTTTCTCTTGTTGAAACTATATATGTTGTTTTCATAATAAAAAAAAAAAAAGAGTTGGTGACTCACTTTTTTTTGTTCTGGACAATTGCAAAACAGCTGCTAAAGCTCATGTATCATGCTCCTGTCACTTCTGGATATCATCGTAGCGCCTGGTCCAATATTGGGAGGGTTGTTTCCCCGCTGATCTACCGTTATGCCCCGTTCCTCAATACTCCTCTTTCAATGGCGCAAAGATGGTGGTTCAGGTAGAATCTAATAGAGGAAGAAGATGCAACTCTGATGTATGAAAAGTTTTAGACTAATCCCAGTAGTGACTCTCCTTCCACTTTTTCAAGCTATAACTTGGATGACATTCTCACTTAGGTCTGTGGAACAAGTTTGGCTTCCAAATGCAACTTCTTATATATTTATTATTGATATACATTATACATCTTAGGAGATTGTTATGTTTGAAATGTGGGTTCTCTATCCTGCACTCTGTTCATGGTGATGGAGGGAAGGTTCTTCATTATTTCTTTCATTTTGTAGACTTCATGTCTGACATTTTAATATCTAATGAGGTTCTGAAAACAGATCTTTGAATCACAATTAAAATGAGGTATAAATCTCCAATAAATAATTGCACTATTATTGACTTTGTGAGATAAATTTCACTTGTTCTTATATATATGCACATATATATGTATATATATATATATTATGTTTTTGTTGATCAGATGAAATTGAAACTTTTAAAACAGTTGAGATATTCGCTGATCCTTTTTCGATCATTTCTCATCCAAAATTTTGTCGTTTTGTTTCGATATTCTTCTCTATATACACTTTTTTTCTGTATTTTTTGATTCGATTATTGTTATTTGTAAATATGGTTCGTTCAGATAATATATTTGAGGCCAACGTTATCCCATCACGTCAATGGTGGATAAAATGATCAACTTTCTTGAAAAAACAATAAACGAACTGTTGCAACTCATTATCAATTTCAAAATTAAACTCATAATTGAACATATATTATCATTTACTATCATTGTTTTATCTAAAAAATATACACATCCATCGGTAGTTGATCTAGGTAGAAAGATGAAGATTAATATTGTAAATAGCAAAAGAAATTGGGCTTAACAACATATTAAAGAAATTTGAACGCTATATAATTGATAGAGGGTGAGAATTTAAATTTATGTGATCTGTGTCTGGTGTGTGAGTTTATGGTCGAGGCATAAATTGGACTAAGAGAAATTACACTCTAATCGCATATATTTCAAGTGTAAATTTGCTGAATTGAGAGATAGAAGGAACGTTATCATTTGTGATATAGATCAGGATTATTGAGTTATCATTAATAAACTTTATTTGTTTTTAATTAATTTCCAGCTGTTAAATTTAAATTTTTATTTTTAATTTTAAGGGTTACAAAACTTGGTAATTATGGAGATGTCTTTTGTTTCTAGTTTTGGATATTTGAAATGTTGGAAAGATGGAGGTTGATTTCTCCGATGAAGATGCACTAGTGTTTTTACCACGGGCTGTATTTTTTTTTAATTATTTGGATAATAAATTATTCATATAAATGAATTCATTAATTATTTTCATTTAATAATATTATTAATTTGGTAACATTTTATATTTGATGGGGCTGTGAATATAATTTTGAAGTAACTAATAATTATTTTTTAAAATTTACAATTTTCATATAACATTATAATTATGAGAGTGAAAAGTTAATTATCAACTCATAAAAAGGGATCTTAATTATTAGGTGTTTTGATTTTAATCCATATAAAGGGATCTACTATTTTATTTTATTTTATTGAAAAAAGGGATCTACTAATTATTTTCATGTAATAACCAAATATTTAGGATAATATTAGCATACTTGTTTCTCACATGCAATATTTATTCTAAATAGAAGTTTGAATATGAATCTTTTCTAATATGATTTATTTTTTTAAAAAATTATATTGATCATTTAATAGAATACAACGTAAAACTATATGAAGATTGATAAAGAATACTGCTTCAAAAAATGAAAAGGCAAGCAGACTGCTGTAATTAATAATTAAGAAATTAAACTAAAAAATAAAATAAATATAAATAGACCACAAAAAAGAAAAAGTAAAAAAAGTGAAGAGGATGAAAAGAAGAAAAACTTCACTTGTTGTAAGAAATGAGTGAGAAATAGTCATTTTATAGATGACTAAAATGGAGATGTTACCAAATAATTTGAGATTCTATCACTTTTGATATTAAATAATGAAATATAATTAAATGATAATGAAAGTGGAGAGTTACCAAACTTTTTTAATCTAATTTTTTATTATATTAATGAAATATAATCTTTGAAGTAGTAGCACGCCTGTGACACATAGTAAAGTGTGTTACTATTGACATTAACACATTAATAATAATCATCATAATAAAATAATAATAATAATAATAATAAGGATAAAATACCCTTTTGATCCTCGAAGTTTAAGATTTGAGATTCATGTCTATTTGGTACTAAGTTTGGAAACAGACAGTTTAGTCGCTCGCGTTTGTTAAATGCTTCTAAATAGTTTCTAACTTAACTTTTAAGTTAATATAATGCTTACCCGACCGGTATGTGATGATGTAGACGTTAATTTATTTGATGACATGTCAATTATCTTTGATTAGCTAGAAAATTAATTTAAATTTAAATTTAAAAATAAAGAAGAGTTGTTTAAAATATTTAAAAACTTAAAATAATACTCTCCATATGCCCTAGCCCCTAATTGCAGTAGGAAAAGCTTCAAGAATGTTAACAATAAGTGGTCGTCATCTTCGTTAGCAATAGCCCCTCATTCGTTTTTCCCCCCCCCCCCCCCCCCCCACCAAAAAAAAAAAAAAAAAAATCCTTAAAAACTCCCTCTCTGTCAAAATAAAATTTGATGACCTATTTTAACAACTTTTCTTTATTTTTAAGCTAAATTTTAATGAATTTCCCACTAATCCAAATATAATTGACACATCATAAAAAACAATTAATATCCACGCCAGGACATAATGGTCTAGTAAACATTTTGTTAACTTAAAATTAAGTCAGGGACCATTTAGAAGTATTTAGCAAACGTGAGTGACTAAAGTGTCTGTTTTCAAACTTCAAGGGTTAAAAGTGCATTTTGCCCTAATAATAATGAATTTAATATTGAGATTTGTGCCCTAAAACCTCGTAATTAATTAGTTGTTTAATTAATTTAATATGCAATATTAAATCTATTGCAATGAAAATCCAAAGTTATTTTATGTAATCTTGAACAGTATATGGTTGAAATACAGGCGGATCATATCCCATTAATAAGCTAAATGGTTTATAGTATATGGATAAGATTAGGTGTCTTATCCTGGCGACACTATGGATACGACACACTTTGTCATTGTTACAAAAGGTTTGATCAAAATCGTTCATATGGAGACATGTGAGTGAGAGTATCTTATACAAATAATTTGTATAAGACTTAATTGCGAAATATTTAATATCTCTTTGTAAATCCGTTAACTGAAGAGATTAATATTTCACAGGATGATCATATGTGATTCGATTTTAATCATGAGTGAGTTATGAACTCTTGCTTGTGAGGACTCGTCCTTTGATTTGCATAGGTGAGAGTGGTCTGAGTCTCCGGCTCAATATGCCTACCATTTTGAGACTTGACCAAATAGGAAATTGGGAATATAAATTTGCAAGATGGAATTCACTTCTTCCCATTTAGGAAGAGTAGATAAGTAGTTCCCTTAAATGCTGACTTCAAGACTTGAACAATAGGGTCCCATTCTCTCATTGGTCCGAGAGGGACTTGGTTTATGGTTGGACTATAAACAAATTATTCAATAGATGATCAGTGATACTTAAGAAGATAGAGGTAATTACAGAGGTAAAATGGACTTTTGACCCAGCTGTATTTACAAACAACTCGTGAAGGATCGACTTACTTGTAATTGTTATGTCCATGGTTGCAACTTGTCCTACAATGCATAAGAGGGCAACTATGCGTCTATAATGGTTTGCCCTATAGTTAATGAATTAAGGTTAGTTAATTAAAGCGTTTAGTTAATTAATCTTTTATCGTTGGAGTTTATGATCTGTAGGTCAATATGGTTTCCCTGCTATCTCACAACGGACCAACAAGGATGAAAGGTATTGAAAGAAAATTTTGAAATGTTCAAATTTTTGTGTCAAATGAAGGAAAATATATTTATATAATTAATATAAGGGGTCTTTTCACAAATAAAAAAAAAAAACAAAAGTAATTACACATAGAAAAAAAAGTGGGAAATACAAAAGTACGGGGTACACTTTTTTCTATATGTGTAATTACTTTTGTTTTTTTTCTATACACAACAACTTCTCTTAATAATATAAGCATATTGATTTATTAAGATACATTATTATAAAGTTAATTTGAATATGATTCAAATTAAAACTATGTAATAGGAAGAATTATTGTATTTTAATATGATTCAAATAATTAATGATTTTATTTACTAAAAAAAAAAGTAATCATCTTAATATGAAATATTAAAATAGATAATTATCATTATCATTAAAATTATCTTAACTATCTAAATCATTTTAATATGAAGATCATTATAATTATCATTTGAACTAACCTAATTATTTAAATCATTTTAATATGGATATTAAAATGGATAATTATAATTATCATTAAGATTAACCTAATTATCAAAATCATTTTAATATGAGAATAATTATATTTAATTAAGTTTTATCTTAATTAGATTCTATAAATAAGACCTTATAGAGAACATTAAAGGAGTTTTAAAGTTTACGAATCTTTTCTAAAATTCTCCTAAAATACATAGAAAGAGTTCTTTAATTTTCTTTTCTTTCAAAAGGCTCACACAAGTTAGACCCTTGCCAGAGTCATAGCGAGGAGGACTTCGTAAAAGAGAACAACAATTGGAGAGAATTTTTGCTGTAGACCATAAAAGTCTATCAATGATAGATGTCTATCACAGATAAACTTTTTTATATTTGCAAATTATTAAAAATGTTGTTATATATGCTAATACTTTAAATCTGAATGTTGTCCCTTTTTGATTTAGTATTGATATATTACAATCTCTAAGTAGGAGTTTGTGAATTTCACTATTTGTTTGACTTAACCGTCGGTGGGTTTCTAAGAAATAATAAAAAAATAAAAAGAAATCAATAATGCACCACCCGTCAAGATTAAATTTAAAATTTATTATAATACAGACTAAAATCAGAAATTAAATAGACTATTTTTTAAAAAAAAAATCTCAAATTAACATTAATCAAAATTATATTTCAACCCTAAAAAATAAAAAAGGATGTTTAAAAAAGAAATTAAAAGAAAAAGGAAGAAGAAGAAAAAGGAAAAGAATCTGAATCTGAATCGTACGGCCAGCACTCATCCATCTACTGCGCCGCCCGGATCATTTGACCTCACTCGCTCGTCGTCATTGCTCTGCCTCATCTTTTCCCAATTCCGGCTCCGACCGATGTCTCCTGCACTCACTTTCCCTTCTTCCGCTTCCTTTTTTCCCAATTCAGGCTCCCAATTTTACTGACGGTACCTTCTCCACAACCCTAAATTTCTCTTTTTCCTTCATTCAATATGCTTTTTTTCATTCCTTATTTGGTTAATTACAACCTACGCTCAACTAAACTAGAATGTATTCGAATTAGGAAATTTTCAAGTTTATTGATCTTTGTTGTTTTTTATTTCCCTCGAAATTTTAGAATTTGAAGAGTGCTTGGTGCTTTAAAACGATTTCCTGGTAACTTTAGTAACTGGGTTTCCTTCAAACTGAATGCTTGAGTTGTTTTCCCCCCCATTTATATACCATGCTGTAAACAGGTATCTGTGCATTGTTATTTCTGGAAGTTCAAATGTTTGCTGCCAAGTTTTCAGATTCCAATCTGCATCCGATTATACCATTATATTTACCATGCTGTAAACAGGTATCTGTGCTTGAGTAATAATTTGTGATAAAGAAATTTGGAAAGAAGTGCTGACTACAGGGAATTAAAGAAGAAGAATTAACTTGAACTTGATGATGCTGTTTTTAGTTTTAACACGACTTAGATATGTATTTTTCTTCATATTCAAAATTTTGGATTAGTTCTAAAGTTCAAATGTTTGCTGCCAAGTTTTCAGATTCCAATCTGCATCCGATTATACCATTATTTACCATGCTGTAAACAGGTATCTGTGCTTGAGTAATAATTGTGATAAAGAAATTGGAAAGAAGTGCTGACTACAGGGAATTAAAGAAGAAGAATTAACTTGAACTTGATGATGCTGTTTTTAGTTTTAACACGACTTAGATATGTATTTTTCTTCATATTCAAAATTTTGGATTAGTTCTAACCCTACAATGTTATGTAATGAAGACACCTGTTGATGGTTAATATCCTAACTTGGGGATCCTGGGGAATTTGAAACTAGGTCTTTGATGTGCTTTTGGTTTCTATATTTATGGCGTGAGGTTATTTGAAAGTTTCAGATTCTGAAAGTGTCATAAGTCAGATAGATATATTATTAATTACTTTTTGGATTCTTTATAGTTTTATTGACTTGTACAATTAATTTTTATATTGTTAATGAAGTTCCTAGTTTACTTTTACCTGAAGTGTTCTTAGTGTTGTTATGGTGTTAACATGTGCCATTGCCACTTTTTATTGATAGGTGATTTCTACCTGGTGTGCTGTTTATTCCTGCAGATAAAATTTACAGTTACTAAGTGGACTTTTTGATACAGAATTGTCCATTTTATCTCCATTTTATGGGGACCGAGGATTTCATTGCATTGCCAGGTTCTGGTGATTCTGGAAATGAAACCGAGAGTAATGAATCTCTTAGTTTTAATGAAACAAGAGAAGCTTATTCTCAATCAAGTGTTTTGAAGTGCAAGGACGATGATGCAAGCATAGAGAAGGTTGAGCTTGCAGATGATGTACACTTGGAAGATATGCCTTGCATACCTCAATCTGACCTTACTGATGAAACACAACGTTCTGATTCAGATATGGAAATTGAGGATTTGAATAACCTCCCAGATTTTAGTAAGACTAGAAGTAGAAGTGAGAATAATAAAATACTGAGTGAAACCGATTATCTGCCAGTCAACTCGGCAGATGAGAATATACTACCAAGCAGTGAGCCCTTGCAGCAGAATGAGCTTCATACGAGGTATGAAGATGTTTGTCATATTGAAAGTAGAAATTTTCAGGAGGATTGGGTTGATAATTCATCCTTCTTGAAAACTGGTGGTCAATTGACGGTAATGAACGGAGTTTCAATTGAGTTCAACGGATTAAACTCTGGAGTTCCCATTGAGTATGGTTCGGCTGCATCCCATCATCATGGAGGCCCCAGTAAGATTCACAAGAGTGATGGTAGTTGGATTCCCCTTCACCCCCCCCCCCCCCCCCCCCCCGATTTATTATTTACTTTAAAAGACTAATCCGTAGTTTTTTAAACCTTTTTTTTTTAAAAGGAATATCCCCTTTTAATTGTGATAAGTTAACTTTTCTTTTTCTACAATTTATAAGGTTCTAATTGTGATACTAAAAATTATTTTTCTATGTAACCAGCAATATCAGGTGTCAAAAGACCAAGGATGGCCATGGATGAGCAACAACCTTCAGTGCACATCGTATATACTTCTTTAACAAGGTTGGTTGTGCAACTGTTTTTGCAACAACGAGCCTATTTGTTCAATTTAATAAAAATAATAGTTTGTTATAATTGAGAAACATGTATCGTTGCCGAAAAGTGTAGAACCTGGTGATACGATTGTATTAGGGTTTTTAAGGGATTAGAGTGCTACCAGTGGTACTCATTGGTGAGACGAAGCAATTTTCTGAAATTGATTTGTCACTGAAGCCCATATAAATTTAGAAATGTCAACTGTAATTCCACGTTTTTCTTTTATTTGGCATCCCCTTATAAATGTTTCTTCCTTTCCTTCAAAATGTCCACGTGTAATACCGTGTTTGTTTTTCAATTTTTAAATTTATATATCCTATTGGAAGCTGATTTTATTATTAAAAAAAAAATCAAATGTAATGTTGTGTGAACAACTTATAGAGAACTATGTTCAACAAACAGATTTTGTAAGTTCCTAACTAGATTCAGTTTCTTATTTCCAGAGATAGTAAACAAAAGCTTGATGAATTATTAAAGCAGTGGTCTGAGTGGCATGCTCTACGAGGTTCTCTATCACATGTAAGCCTCATTCATTAGTGCTGCTTGGTTTGCTGCTATATTCTATGCCTTATTGTTCAAGATGTTGCTTTATACAATCTTGTAGAATTTAGAATTAACTCTTTTTCATATGATTTTGTTGAATTGTTAAACTGAGGTGTCCCCATCTGTACACAAGTGTTTTGCTGTTAATGGTTTTTCTTCACAAATGATATTTTCTAAATTTAAGGTTCACTATCTTGAGTTATTTTGGACATTGTTTTGTTCCTGCCTCAAGTTTTTCAAATGAGGTTATTTTATTATCTTTGACTTTGAATGAAGGAGCGAGCCACCAATGACTGGCCAATGGTCAAAGAGCTAAGTTGTAGTCTTAAGAAGCACGGACACTGACACGCGACACGAATATGACATTTTTTTTTTTTTTTTTACAAAAATAAATATATGTATGCATGATTTGACATATTGTGAAATATGGGCACATCTATTTAACAAAAAGTAAGAGAAACCATAATGGAAAAAAAAGAAAGTAAAGATAAAATCAGAAGGAAAAAACAGTAAGAGATGTCACATAAGTTGTCGGCAGTTGTGGGTCTTGAAGTTGAAGTCACCGTTGTCGATGAAGTTACGAAAGAAGAATCAAGTAGCATAGTGGGTTTTTTATTTTTTTTGAAAAAAAGTTGTGGGTTAAGTTTTGAGGCTGAAGTGTGTTGGTATTGAGAAAGTGTGTAAAAACTCTAGAATTTTGAAAAAGTTTGGTTGAACGATTAATTTTAATTAGCTTAAAATTCCCCTTAAATGGGCTGCCGATAGTGGTTCTTGTTTCATTATTTTTTTAAAAAGTTTTTTTTTTGCATATGGAGTGTCCTATGTGTGTCTTGGCGTGTCCTAACTTGTCGGAAAGCAAACAAATAGTAATAAAACAATGGACATGTTAGCTGGCGTGTCGGACATGTGTCGGAGGCGTGTTGGTGTCTGACACCGATACTTCGCCTGTTTAGAAGTGTCGTGTCCGACACCGATACTTCGCCGGTTTAGAAGTGTCGGTGCTTTATAGCTGTATCCTGTAGGTTGATGAGGTCGTTATCCTCTAATTTCATTATTCCTTGTGGTGTGTCTTATCACTACGGAACTTGAATTATGTGTATGAGTTTGTCTGCACCAGTTCATGTGAAGATCAATGAGGTGGTAGTGAGTATAACGCTTGATCTCTGGGCATCCATAATTCTAGTAAACTTATAAGTTTATCTTTGCTTGGCTGTCCTCATCCAAGTGATCAAGTGATGGAAGCAAAACTCTTACGGAGTTCGTGCAATACAGATTGTGCATATTACGTTTTTCTAAGGATTCCAAATTGTGGTTGAGTCTAGGAAGGTGAAGGACTGGTTTCTCGATTGCCTGGATGCTTGTAAAATAAACTATCATGTTAATTTGAATGGAAGTGATACTATGAGTGTGTTATGGGATATTTGGGGGGAGAGGAACAATCTAGTGTTCAGAGGTAGAGATAGGGACCCCTGTGAGGTTTGGTCTTTAGTGAGGTTTCATGTATCGCTTTGGGCTTCAGTATCAAAGTTTTTTTTGTAATTATTCTCTAGGAAACATCTTACTTAGTTGGAATCCCTTTCTTTAGAAGGGGTGTTTTGGTGGGCTAGTTTTTTTGTATGTCCTTGTATTCTTTCATTTTTTCTCAATGTAAGTTGTTGATTCTAAAAAAGAAAGTGATATTATGAGTGTGTTTACTTTGGTCAATGAATTCTTGATTGGGTGTGATGTGTGAAGACCTATATTTGTGTTGTTTGTACAATTAACTTAAACATCAGGGGTGAATTAGATTTTACAATGGTGATCATTTGTATTACTGGACCTTATTGTCTAGTGTGTAATCTTCTCTTCTCAAGTCCCTTCTTTCTTTTTAACATTTATTGAAAATTCCGTCTAAAGTCCTTGGAGTCGGTAAAAAATGTTTTCTTTTTGTTCATTGGATGAGTATTTTGTCTTTCTCTGTGTGTGTTGGGAGGAAGGGAGGGGGGTAGAGGTGAAGTAGCTAATTGAACTTGCTTGTTTCTCTGTTCAGGATGATAAAGACTCTGAAAACCTAGAATCTGGAGAAGAGACGTTCTTCCCTGCTCTCTGTGTCGGCACAAAGAAGACTTCAGCAGTGGTGACTTTTCTGTCTTTTTGTCCATAAACGTTTATATTTGAGATTGTTGACCTATGTTTATCTCAATTTGAGTGGCGTTTCTATTTTTGGGTTTAGTTAGCAGCAATTGTTATTGGCGTTGGATTATTGTTTCTTGTTCTTATACTGACCTTCTACCTATATATTTTGTGGTAAATCTTAAGTTAAGTGGAAAAGTTACACTTCGGCCTTCCCTTTTTCATTCTTTTTATCCATCCTGTGGGAAAAAATAAAGTTGAAGTTCTTCTTGGAGCTCAACACTTGAAACTTTCTTGGTTTGTGTTTGCATGACCACAGACCATCGTGTCCAAAAATTTGGATTGCTGGTAGCAATTAAAAGCAGGTGTATGTTTGGCAAACTATGCCAGTTGGTCTGCTTTTAGTTAACATGTGAGCAGTATGTGGAATGGAAGCAAAGAAACTGATAACTTTGTGGACCAATTGGTGTATAACACATATAAAGCTAATGCAAATAACAACATGAGGATGTTTTGATTCACAATATGAGAACATGTTGATAAACTTTCTTTTTGTTTACCTTATTATCCTTTATTCAGTTTGATGCTGGAGGCAAAAACTAAACCTATTTTCTCTTTATGGTTCTTTCTGTGTATGGTATTTTGTGGTAAAGAATTTGTGAAGCCTTTATTAGTTTTTTTTTTCCTCTCTTGTTTTCATAGCTACTTTCTGTATTTTCCTTCTTAATTATTGTTTAACTGCAGACTTTCTGGATTGACAACCAAAAAAGTGAACAGCAGCAAAATTTTGTTCCTATAGATGATAATTCTGTGCCACTGTATGATCGGGGATTCACTTTGGGACTGACTTCAGCCAATGATTCGAGTAAAGTGGAAGGGTAAGTACTCTTTATTCACCACTGTTGAATGAAATATTTTAGTGGGTTACCCTTCCTAGTTTTGAACTATTACGTGTAAATTCAGGATAGACAGTGGTAGAAAATTAAATTCAGGATAGTCAGTGGTAGAAAATTTGTAAAGTGTTGAGGATTCCACATTTGAAAGACGTGGAGACTTCACAATCTTTAAAAAATACATGGGCTATTTTTTCCATTGCTAATTGGTTTGAGATGGAATCTCATCTTTATGAGTTATCTAATATGGTTTCAAAATCCATGAAGCCTAACTAAGTATTTGGTCTAAAGAAAAAGAATTAGTATCAAAACCAAGAATGGTGAACCCAAAGAGGCCATCTTGAAAGGGGACATATTGAAGATCTCATATACATGGAGACCTCACAGTTGTTATCTTTTTTAGTAAGTTGCATTAATCTTGCATTAATCTTGCAGCGAATTTATGGCTGCTTTACAATACTTGTTATCTTTTTTAGTAAGTTGGAGGCTAATTTATACTTTTAAATTGAATTGATAATTCGAGTTAACAGCAGGGGCCAAGCAAATCTAAAGGAGGAGGCAGCTTTAAATTTAAAATGAAATCCAGTCCTTGATTTCCTTGTTTTTCATGGCAATGGAAATGAAAGGATTCTTTATAATAGTTCTTATTTTTTTAGTTTGGCAAAGAACCTAAATATTAATAAGGGCATATACTTGGTTTTGCTAGTTTATGTGCCTTTCCATCAGTGCTTTGTTACTACAACACTTTGAACTTGTTTTTTATTCTTAATTTTCTTTGGAAACTTTCTAGAGGCCTGAAGATAATTGATGATGCTAGCCGTTGTTTCAATTGTGGTTCTTACAATCATTCCTTAAAAGATTGTCGAAAACCTCGAGATAATGCTGCTGTTAATAATGCTCGCAATAAGTATAAAAAATATCATAGTTCTGGCTCCCGCAATTCAACTCGATATTATCAGAATTCACGTGGGGGGAAGTATGATGATTTGAGGCCAGGAGCTCTTGATGCTGAAACACGGCAACTGTTAGGTCTCAAGGTATTATAGATATCTTGTTCTTAAGAATTATTCCATTCATGGCTTGTCTATAAGAAAGCAAATAAAAATTTAAAACTGAATGCAGCATTACAATTATTAACTTCGACTTCAATTTGGTACATGTTTTTGTCTCCCAAGAGCATATCCAAGACAGCATGCATTTTTCTCAGCTCATGGTTTCCATAGAAATTCGTTGTGAGTGCAAGGGAATGATCCTATAGAAAAAGGGGGAACGTGCATAGCCATTTAAAACTGGCAATTTAATATAATTTGAAGGTCGTCAAATGATGTAGATTTACACCTTTCTGAGTTTCTATTCTCTTTTGCCTTGACTGTCATCTGCCATAAATGAACTGTCATTTTAGGTCAAATAGATTCAAACAAAAGTATAGAAATCAATGAAGATTGGCCTATGGTTTGAAAATCAGATGCATAATTCAGTCATTATTTTACAAATAGGAAATTCTGATTTACTTACTTGTGTACTTCATGAGAATTCATATGTGGTTAGAGTTTTGATATTCTTTTCGCCATTCAGGAGCTTGATCCACCCCCATGGCTTAACAGAATGCGAGAGTTGGGATATCCACCGGGATATTTAGGTAATTACATTACGTTATTTGTGTTGCTTTCTCTTTCATTTAAACAGTTTCAAAATCACTTCCATACATGCCCTGAATCTAGTTTTTCAAATTTTTTATGCTAAACAGAATTGCCTGCTAGAAAAATTATTACTTTTAACCCTTATAAAATAACAAAAACAGATAGAAAGAGATTTATTATTATCTGCAGTGTCTGTTCACTGTTTAATAGCAGTAAAAAACGTGTTTATCTTTCAGATCTGGAAGATGAGGATCAGCCATCCGGGATTACAATATATGCTGATGAGAAGACCGACGAACAGGAAGATGGGGAAATTACTGAGGCAGAGTACCGTAAACCACGAAAGAAAATGAGTGTTGAATTTCCTGGCATAAATGCTCCCATCCCAGAAAACGCAGATGAAAGACTTTGGGCTGCTGAACCTTCGAGTTCGGGTCTCCCTAGAAATCGGTCGAACCAGCGCTTGAACCATTACACAGAATATGATGGGAAGGGAAATGATCACCATCAACGATGGTCCCGGGATTACAGAGACGACAGACCTCCAGGTGTTGACTCTGTAAAAAGTCCTCCCATGTTTACTCCTAGGTATGGTGGTCATGATTTTAGTTCTGACTCTCAAACTCCAAGAGGTAATTTTTCATCGTCGAGGAGCCCGAATTTGGGGAGGCCCCACTCAGATAGAGGTAGAAGAAGTCCACTGCTTGACGATGATTACTCAAGATATGGCTCCTCTTACAGTTCTTCCCTGTTTTCACCACCAAGAAGACGCTGA

mRNA sequence

ATGGTAGACGTCGTTCACTGGTGGAAAGTAAATCGGCACATAATTTCTCTCTACTTTCTCGCCCAGAGAAATCTTGTTTCAAGAAGAACAGAATCATCAGATAGTGAAGGAGAGAGCTCAGAGCGAAGAACTTCAGAAATGGGAGAAGAACAAGAGGATTCACAGAGGTTGAAGAGAGTAGCGGCGGCAGCATATGACTACGAGAACGATCCCAGATGGGCTGATTACTGGTCCAACATTCTGATCCCTCCTCACATGGCTTCTCGACCCGATGTTGTTGACCATTACAAGCGCAAGTTCTACCAGCGATACATCGATCCCGAACTTGTGGTAGAGGCCATGTCTTCAAGTAGTTCAACTCAGTCATCTAGACCTTCAGCTACATCTTCCGCAGCACCCCCTCCTACTAATGATCGAAGTCGACCACAAAGTGCAGGATCAACAACGACTAGGACTTCAGGTACATCTGCAAGTGCAGATGCTAATCCGACTCCATTGCGCTGGGATCGGCAAACAATTCAGTTTTCTGTCAATGCATGGGTGTTTATTGTGGCTGTACTGGCAATTTTCCCCCTAATACCCAAAAATCTTTCGCAGAGGGCATACAGGCTATCTTTTATGGGCACAACTTGTTCTTCTTTATATTCTTTGTACTCGTTGTATGGAAAGCCCAGGGCGTGGAATTTGCAAGCATTGCAAGCTTATTTCCAGTCCATAATTGCGACAAAAGATTTCATTTACTTCACTTACTGTATCACCTTTGTGACTTCAAATATTTGTCTTAAATTTGCTTTAATTCCTATCCTATGTCGGGCTCTTGAACATGTTGCAAAGTTTCTTAGGCGTAATTTTGCACGTTCGTCTTTATACAGGAAATATTTGGAAGAGCCTTGCGTATGGGTGGAGTCAAATTCAACTACTCTCAGCATCCTATCTTCACAGGCTGAGATTGGAATTGGCTTCATTCTAATCATCTCTTTGCTCTCGTGGCAACGCAACTTCTTACATACATTCATGTACTGGCAGCGCCTGGTCCAATATTGGGAGGGTTGTTTCCCCGCTGATCTACCGTTATGCCCCGTTCCTCAATACTCCTCTTTCAATGGCGCAAAGATGGTGGTTCAGAATTGTCCATTTTATCTCCATTTTATGGGGACCGAGGATTTCATTGCATTGCCAGGTTCTGGTGATTCTGGAAATGAAACCGAGAGTAATGAATCTCTTAGTTTTAATGAAACAAGAGAAGCTTATTCTCAATCAAGTGTTTTGAAGTGCAAGGACGATGATGCAAGCATAGAGAAGGTTGAGCTTGCAGATGATGTACACTTGGAAGATATGCCTTGCATACCTCAATCTGACCTTACTGATGAAACACAACGTTCTGATTCAGATATGGAAATTGAGGATTTGAATAACCTCCCAGATTTTAGTAAGACTAGAAGTAGAAGTGAGAATAATAAAATACTGAGTGAAACCGATTATCTGCCAGTCAACTCGGCAGATGAGAATATACTACCAAGCAGTGAGCCCTTGCAGCAGAATGAGCTTCATACGAGGTATGAAGATGAGGATTGGGTTGATAATTCATCCTTCTTGAAAACTGGTGGTCAATTGACGGTAATGAACGGAGTTTCAATTGAGTTCAACGGATTAAACTCTGGAGTTCCCATTGAGTATGGTTCGGCTGCATCCCATCATCATGGAGGCCCCACAATATCAGGTGTCAAAAGACCAAGGATGGCCATGGATGAGCAACAACCTTCAGTGCACATCGTATATACTTCTTTAACAAGAGATAGTAAACAAAAGCTTGATGAATTATTAAAGCAGTGGTCTGAGTGGCATGCTCTACGAGGTTCTCTATCACATGATGATAAAGACTCTGAAAACCTAGAATCTGGAGAAGAGACGTTCTTCCCTGCTCTCTGTGTCGGCACAAAGAAGACTTCAGCAGTGACTTTCTGGATTGACAACCAAAAAAGTGAACAGCAGCAAAATTTTGTTCCTATAGATGATAATTCTGTGCCACTGTATGATCGGGGATTCACTTTGGGACTGACTTCAGCCAATGATTCGAGTAAAGTGGAAGGAGGCCTGAAGATAATTGATGATGCTAGCCGTTGTTTCAATTGTGGTTCTTACAATCATTCCTTAAAAGATTGTCGAAAACCTCGAGATAATGCTGCTGTTAATAATGCTCGCAATAAGTATAAAAAATATCATAGTTCTGGCTCCCGCAATTCAACTCGATATTATCAGAATTCACGTGGGGGGAAGTATGATGATTTGAGGCCAGGAGCTCTTGATGCTGAAACACGGCAACTGTTAGGTCTCAAGGAGCTTGATCCACCCCCATGGCTTAACAGAATGCGAGAGTTGGGATATCCACCGGGATATTTAGATCTGGAAGATGAGGATCAGCCATCCGGGATTACAATATATGCTGATGAGAAGACCGACGAACAGGAAGATGGGGAAATTACTGAGGCAGAGTACCGTAAACCACGAAAGAAAATGAGTGTTGAATTTCCTGGCATAAATGCTCCCATCCCAGAAAACGCAGATGAAAGACTTTGGGCTGCTGAACCTTCGAGTTCGGGTCTCCCTAGAAATCGGTCGAACCAGCGCTTGAACCATTACACAGAATATGATGGGAAGGGAAATGATCACCATCAACGATGGTCCCGGGATTACAGAGACGACAGACCTCCAGGTGTTGACTCTGTAAAAAGTCCTCCCATGTTTACTCCTAGGTATGGTGGTCATGATTTTAGTTCTGACTCTCAAACTCCAAGAGGTAATTTTTCATCGTCGAGGAGCCCGAATTTGGGGAGGCCCCACTCAGATAGAGGTAGAAGAAGTCCACTGCTTGACGATGATTACTCAAGATATGGCTCCTCTTACAGTTCTTCCCTGTTTTCACCACCAAGAAGACGCTGA

Coding sequence (CDS)

ATGGTAGACGTCGTTCACTGGTGGAAAGTAAATCGGCACATAATTTCTCTCTACTTTCTCGCCCAGAGAAATCTTGTTTCAAGAAGAACAGAATCATCAGATAGTGAAGGAGAGAGCTCAGAGCGAAGAACTTCAGAAATGGGAGAAGAACAAGAGGATTCACAGAGGTTGAAGAGAGTAGCGGCGGCAGCATATGACTACGAGAACGATCCCAGATGGGCTGATTACTGGTCCAACATTCTGATCCCTCCTCACATGGCTTCTCGACCCGATGTTGTTGACCATTACAAGCGCAAGTTCTACCAGCGATACATCGATCCCGAACTTGTGGTAGAGGCCATGTCTTCAAGTAGTTCAACTCAGTCATCTAGACCTTCAGCTACATCTTCCGCAGCACCCCCTCCTACTAATGATCGAAGTCGACCACAAAGTGCAGGATCAACAACGACTAGGACTTCAGGTACATCTGCAAGTGCAGATGCTAATCCGACTCCATTGCGCTGGGATCGGCAAACAATTCAGTTTTCTGTCAATGCATGGGTGTTTATTGTGGCTGTACTGGCAATTTTCCCCCTAATACCCAAAAATCTTTCGCAGAGGGCATACAGGCTATCTTTTATGGGCACAACTTGTTCTTCTTTATATTCTTTGTACTCGTTGTATGGAAAGCCCAGGGCGTGGAATTTGCAAGCATTGCAAGCTTATTTCCAGTCCATAATTGCGACAAAAGATTTCATTTACTTCACTTACTGTATCACCTTTGTGACTTCAAATATTTGTCTTAAATTTGCTTTAATTCCTATCCTATGTCGGGCTCTTGAACATGTTGCAAAGTTTCTTAGGCGTAATTTTGCACGTTCGTCTTTATACAGGAAATATTTGGAAGAGCCTTGCGTATGGGTGGAGTCAAATTCAACTACTCTCAGCATCCTATCTTCACAGGCTGAGATTGGAATTGGCTTCATTCTAATCATCTCTTTGCTCTCGTGGCAACGCAACTTCTTACATACATTCATGTACTGGCAGCGCCTGGTCCAATATTGGGAGGGTTGTTTCCCCGCTGATCTACCGTTATGCCCCGTTCCTCAATACTCCTCTTTCAATGGCGCAAAGATGGTGGTTCAGAATTGTCCATTTTATCTCCATTTTATGGGGACCGAGGATTTCATTGCATTGCCAGGTTCTGGTGATTCTGGAAATGAAACCGAGAGTAATGAATCTCTTAGTTTTAATGAAACAAGAGAAGCTTATTCTCAATCAAGTGTTTTGAAGTGCAAGGACGATGATGCAAGCATAGAGAAGGTTGAGCTTGCAGATGATGTACACTTGGAAGATATGCCTTGCATACCTCAATCTGACCTTACTGATGAAACACAACGTTCTGATTCAGATATGGAAATTGAGGATTTGAATAACCTCCCAGATTTTAGTAAGACTAGAAGTAGAAGTGAGAATAATAAAATACTGAGTGAAACCGATTATCTGCCAGTCAACTCGGCAGATGAGAATATACTACCAAGCAGTGAGCCCTTGCAGCAGAATGAGCTTCATACGAGGTATGAAGATGAGGATTGGGTTGATAATTCATCCTTCTTGAAAACTGGTGGTCAATTGACGGTAATGAACGGAGTTTCAATTGAGTTCAACGGATTAAACTCTGGAGTTCCCATTGAGTATGGTTCGGCTGCATCCCATCATCATGGAGGCCCCACAATATCAGGTGTCAAAAGACCAAGGATGGCCATGGATGAGCAACAACCTTCAGTGCACATCGTATATACTTCTTTAACAAGAGATAGTAAACAAAAGCTTGATGAATTATTAAAGCAGTGGTCTGAGTGGCATGCTCTACGAGGTTCTCTATCACATGATGATAAAGACTCTGAAAACCTAGAATCTGGAGAAGAGACGTTCTTCCCTGCTCTCTGTGTCGGCACAAAGAAGACTTCAGCAGTGACTTTCTGGATTGACAACCAAAAAAGTGAACAGCAGCAAAATTTTGTTCCTATAGATGATAATTCTGTGCCACTGTATGATCGGGGATTCACTTTGGGACTGACTTCAGCCAATGATTCGAGTAAAGTGGAAGGAGGCCTGAAGATAATTGATGATGCTAGCCGTTGTTTCAATTGTGGTTCTTACAATCATTCCTTAAAAGATTGTCGAAAACCTCGAGATAATGCTGCTGTTAATAATGCTCGCAATAAGTATAAAAAATATCATAGTTCTGGCTCCCGCAATTCAACTCGATATTATCAGAATTCACGTGGGGGGAAGTATGATGATTTGAGGCCAGGAGCTCTTGATGCTGAAACACGGCAACTGTTAGGTCTCAAGGAGCTTGATCCACCCCCATGGCTTAACAGAATGCGAGAGTTGGGATATCCACCGGGATATTTAGATCTGGAAGATGAGGATCAGCCATCCGGGATTACAATATATGCTGATGAGAAGACCGACGAACAGGAAGATGGGGAAATTACTGAGGCAGAGTACCGTAAACCACGAAAGAAAATGAGTGTTGAATTTCCTGGCATAAATGCTCCCATCCCAGAAAACGCAGATGAAAGACTTTGGGCTGCTGAACCTTCGAGTTCGGGTCTCCCTAGAAATCGGTCGAACCAGCGCTTGAACCATTACACAGAATATGATGGGAAGGGAAATGATCACCATCAACGATGGTCCCGGGATTACAGAGACGACAGACCTCCAGGTGTTGACTCTGTAAAAAGTCCTCCCATGTTTACTCCTAGGTATGGTGGTCATGATTTTAGTTCTGACTCTCAAACTCCAAGAGGTAATTTTTCATCGTCGAGGAGCCCGAATTTGGGGAGGCCCCACTCAGATAGAGGTAGAAGAAGTCCACTGCTTGACGATGATTACTCAAGATATGGCTCCTCTTACAGTTCTTCCCTGTTTTCACCACCAAGAAGACGCTGA

Protein sequence

MVDVVHWWKVNRHIISLYFLAQRNLVSRRTESSDSEGESSERRTSEMGEEQEDSQRLKRVAAAAYDYENDPRWADYWSNILIPPHMASRPDVVDHYKRKFYQRYIDPELVVEAMSSSSSTQSSRPSATSSAAPPPTNDRSRPQSAGSTTTRTSGTSASADANPTPLRWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRAWNLQALQAYFQSIIATKDFIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFARSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGIGFILIISLLSWQRNFLHTFMYWQRLVQYWEGCFPADLPLCPVPQYSSFNGAKMVVQNCPFYLHFMGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHLEDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADENILPSSEPLQQNELHTRYEDEDWVDNSSFLKTGGQLTVMNGVSIEFNGLNSGVPIEYGSAASHHHGGPTISGVKRPRMAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQQQNFVPIDDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEFPGINAPIPENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHHQRWSRDYRDDRPPGVDSVKSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYSRYGSSYSSSLFSPPRRR
Homology
BLAST of HG10018325 vs. NCBI nr
Match: KAG7016075.1 (Zinc finger CCHC domain-containing protein 8 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1369.4 bits (3543), Expect = 0.0e+00
Identity = 753/1014 (74.26%), Postives = 795/1014 (78.40%), Query Frame = 0

Query: 47  MGEEQEDSQRLKRVAAAAYDYENDPRWADYWSNILIPPHMASRPDVVDHYKRKFYQRYID 106
           M EE+ED QRLKR AAAAYDYENDP+WADYWSNILIPPHMASRPDVVDHYKRKFYQRYID
Sbjct: 1   MAEEREDPQRLKRAAAAAYDYENDPKWADYWSNILIPPHMASRPDVVDHYKRKFYQRYID 60

Query: 107 PELVVEAMSSSSSTQSSRPSATSSAAPPPTNDRSRPQSAGSTTTRTSGTSASADANPTPL 166
           P+LVVEAMSSSSSTQSSRPSATSSAAPPPTNDRSRP+S+GS TTRTSGTSASADANP+PL
Sbjct: 61  PDLVVEAMSSSSSTQSSRPSATSSAAPPPTNDRSRPRSSGS-TTRTSGTSASADANPSPL 120

Query: 167 RWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGKPRA 226
           RWDRQTIQFSVNAWV IVAVLAIFPLIPKNLSQRAYRLSFMG TCSSLYSLYSLYGKPRA
Sbjct: 121 RWDRQTIQFSVNAWVLIVAVLAIFPLIPKNLSQRAYRLSFMGITCSSLYSLYSLYGKPRA 180

Query: 227 WNLQALQAYFQSIIATKDFIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRNFAR 286
           WNLQALQAYFQSIIATKDFIYF YCITF+TSNICLKFALIPILCRALEHVAKFLRRNFAR
Sbjct: 181 WNLQALQAYFQSIIATKDFIYFIYCITFLTSNICLKFALIPILCRALEHVAKFLRRNFAR 240

Query: 287 SSLYRKYLEEPCVWVESNSTTLSILSSQAEIGIGFILIISLLS----------------- 346
           SSLYRKYLEEPCVWVESNSTTLSILSSQAEIG+GF+LIISLLS                 
Sbjct: 241 SSLYRKYLEEPCVWVESNSTTLSILSSQAEIGLGFLLIISLLSSLLKLMYHAPVTSGYHR 300

Query: 347 --WQR--NFLHTFMYWQ--------RLVQYW-------------EGCFP----------- 406
             W     F+   +Y           + Q W                 P           
Sbjct: 301 SAWSNIGRFVSPLIYRYAPFLNTPLSMAQRWWFSERVKVNLNRTASTHPSAALRRPEHLT 360

Query: 407 ----------ADLPLCPVPQYSSFNGAKMVVQNCPFYLHFMGTEDFIALPGSGDSGNETE 466
                     AD+   P P  +  + ++       ++ HFMGTEDFIALP SGD G+E E
Sbjct: 361 IPHLLVVFGVADIFSIPAP--TDLSSSRFAFFG--YFFHFMGTEDFIALPASGDFGHENE 420

Query: 467 SNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHLEDMPCIPQSDLTDETQRSDS 526
           SNESLSFNETREA SQSSVL+CKD+ ASIEK ELADDV LEDM CIPQSDL DETQ S+S
Sbjct: 421 SNESLSFNETREASSQSSVLECKDNPASIEKFELADDVQLEDMRCIPQSDLNDETQCSES 480

Query: 527 DMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADENILPSSEPLQQNELHTRYED- 586
           DMEIEDLNNLPD SKTRS SEN KI SE +YLPVNS DENILPS EPLQQNELH R ED 
Sbjct: 481 DMEIEDLNNLPDLSKTRSTSENYKIRSEAEYLPVNSVDENILPSIEPLQQNELHLRSEDV 540

Query: 587 ---------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNSGVPIEYGSAASHHHGGP--- 646
                    +D VDNSSF KT G LT+  GVS           IE GSA SHHHGGP   
Sbjct: 541 SHAESKKIQKDLVDNSSFSKTSGLLTLATGVS-----------IENGSAPSHHHGGPRKI 600

Query: 647 ----TISGVKRPRMAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHALRGSLSHDDK 706
                I GVK+PRMAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEW+A RGS S    
Sbjct: 601 HKSDAILGVKKPRMAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWNAQRGSPSQ--- 660

Query: 707 DSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQQQNFVPIDDNSVPLYDRGFTLGL 766
                                     TFWIDNQ SEQ QNFVPIDDNSVPLYDRGFTLGL
Sbjct: 661 --------------------------TFWIDNQTSEQPQNFVPIDDNSVPLYDRGFTLGL 720

Query: 767 TSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYKKYHSSGSR 826
           TSANDSS  EGG KIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARN YKK++SSGSR
Sbjct: 721 TSANDSSNAEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNMYKKHNSSGSR 780

Query: 827 NSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDED 886
           NSTRYYQ SRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLD EDED
Sbjct: 781 NSTRYYQTSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDPEDED 840

Query: 887 QPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEFPGINAPIPENADERLWAAEPSSS 946
           QPSGITIYADEK  EQEDGEITE EYRKP KKMSVEFPGINAPIPE+ADERLW+AEP SS
Sbjct: 841 QPSGITIYADEKGVEQEDGEITEPEYRKPGKKMSVEFPGINAPIPESADERLWSAEPLSS 900

Query: 947 GLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDRPPGVDSVKSPPMFTPRYGGHDFS 980
            LPR RS+QRLNH+ E+DG+GNDHH QRWSRDYRD RPPGVDSVKSP +FTPRYGGH+ S
Sbjct: 901 DLPRKRSSQRLNHHIEHDGRGNDHHQQRWSRDYRDYRPPGVDSVKSPTIFTPRYGGHESS 960

BLAST of HG10018325 vs. NCBI nr
Match: XP_038890370.1 (uncharacterized protein LOC120079961 [Benincasa hispida] >XP_038890371.1 uncharacterized protein LOC120079961 [Benincasa hispida] >XP_038890372.1 uncharacterized protein LOC120079961 [Benincasa hispida])

HSP 1 Score: 1079.3 bits (2790), Expect = 0.0e+00
Identity = 555/614 (90.39%), Postives = 566/614 (92.18%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESLSF+ETR+  SQSSVLKCKDDDAS EKVELADDV  
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLSFHETRD--SQSSVLKCKDDDASTEKVELADDVPS 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           EDM  IPQSDL DETQRSDSDMEIEDLNNLPDF+KTRSRSENNKILSE +YLPVNSADEN
Sbjct: 61  EDMHSIPQSDLIDETQRSDSDMEIEDLNNLPDFNKTRSRSENNKILSEAEYLPVNSADEN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPSSEPLQQNELHTRYED          +D VDNSSFLKTG QLTV NGVSIEFN LNS
Sbjct: 121 ILPSSEPLQQNELHTRYEDVCHVESRNFQKDLVDNSSFLKTGCQLTVTNGVSIEFNRLNS 180

Query: 564 GVPIEYGSAASHHHGGPT-------ISGVKRPRMAMDEQQPSVHIVYTSLTRDSKQKLDE 623
           GVPIE G A+SH HGGP+       ISGVKRPRMAMDEQQPSVHIVYTSLTRDSKQKLDE
Sbjct: 181 GVPIENGLASSHQHGGPSKIHKSDAISGVKRPRMAMDEQQPSVHIVYTSLTRDSKQKLDE 240

Query: 624 LLKQWSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQQQN 683
           LLKQWSEWHA RGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQ+SEQQQN
Sbjct: 241 LLKQWSEWHAQRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQRSEQQQN 300

Query: 684 FVPIDDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRD 743
           FVPIDDNSVPLYDRGFTLGLTSA+DSS VEGG KIIDDASRCFNCGSYNHSLKDCRKPRD
Sbjct: 301 FVPIDDNSVPLYDRGFTLGLTSASDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRD 360

Query: 744 NAAVNNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPW 803
           NAAVNNARNKYKK HSSGSRNSTRYYQNSRGGKYDDLRPGALD ETRQLLGLKELDPPPW
Sbjct: 361 NAAVNNARNKYKKQHSSGSRNSTRYYQNSRGGKYDDLRPGALDPETRQLLGLKELDPPPW 420

Query: 804 LNRMRELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEFPGI 863
           LNRMRELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSV FPGI
Sbjct: 421 LNRMRELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVGFPGI 480

Query: 864 NAPIPENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDRPPG 923
           NAPIPENADERLWA EPSS GLPRNRSNQRLNHYTEYD +GNDHH QRWSRDYRDDRPPG
Sbjct: 481 NAPIPENADERLWAPEPSSPGLPRNRSNQRLNHYTEYDARGNDHHQQRWSRDYRDDRPPG 540

Query: 924 VDSVKSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYSRYG 980
           VDSVKSPP FTPRYG HDFS DSQTPRGNFS+SRSPNLGRPHSDRGRRSPLLDDDYSRYG
Sbjct: 541 VDSVKSPPTFTPRYGSHDFSYDSQTPRGNFSTSRSPNLGRPHSDRGRRSPLLDDDYSRYG 600

BLAST of HG10018325 vs. NCBI nr
Match: XP_008459423.1 (PREDICTED: uncharacterized protein LOC103498564 isoform X2 [Cucumis melo] >ADN34281.1 nucleic acid binding protein [Cucumis melo subsp. melo] >KAA0039389.1 zinc finger CCHC domain-containing protein 8 isoform X2 [Cucumis melo var. makuwa] >TYK00577.1 zinc finger CCHC domain-containing protein 8 isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 1073.9 bits (2776), Expect = 7.5e-310
Identity = 544/610 (89.18%), Postives = 563/610 (92.30%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESL+FNETREAYSQSSVLKCKDDDASIEK EL DDV L
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLTFNETREAYSQSSVLKCKDDDASIEKAELVDDVQL 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           EDM C+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRSEN++ILS+ + LPVNSAD N
Sbjct: 61  EDMHCVPQSDLVDETQRSDSDMEIEDLNNLPDFSKTRSRSENSEILSKAEDLPVNSADGN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPS+EPLQQNELHTRYED          +D VDNSSF KTGGQLTVMNGVSI+FN LNS
Sbjct: 121 ILPSNEPLQQNELHTRYEDVCHVESQNFQKDLVDNSSFSKTGGQLTVMNGVSIDFNELNS 180

Query: 564 GVPIEYGSAASHHHGGPTISGVKRPRM---AMDEQQPSVHIVYTSLTRDSKQKLDELLKQ 623
           G P+E GSA SHHHGGP ISGVKRPRM   AMDEQQPSVHIVYTSLTRDSKQKLDELLKQ
Sbjct: 181 GAPMENGSATSHHHGGPRISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQ 240

Query: 624 WSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQQQNFVPI 683
           WSEWHA +GSLS DDKD+ENLESGEETFFPALCVGTKKTSAVTFW+DNQKSEQQQ FVPI
Sbjct: 241 WSEWHAQQGSLSRDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQTFVPI 300

Query: 684 DDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRDNAAV 743
           DDNSVPLYDRGFTLGLTSANDSS VEGG KIIDDASRCFNCGSYNHSLKDCRKPRDNAAV
Sbjct: 301 DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAV 360

Query: 744 NNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRM 803
           NNARNKYKK H+S SRNSTRYYQNSRGGKYDDLRPG LDAETRQLLGLKELDPPPWLNRM
Sbjct: 361 NNARNKYKKQHNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRM 420

Query: 804 RELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEFPGINAPI 863
           RELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKP+KKMSVEFPGINAPI
Sbjct: 421 RELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPQKKMSVEFPGINAPI 480

Query: 864 PENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDRPPGVDSV 923
           PENADERLWA EPSSSGLPRNRSNQRLNHY EYD +GNDHH QRWSRDYRDDRPPGVDS+
Sbjct: 481 PENADERLWAPEPSSSGLPRNRSNQRLNHYPEYDTRGNDHHQQRWSRDYRDDRPPGVDSI 540

Query: 924 KSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYSRYGSSYS 980
           KSPP FTPRYGGHDFS DSQTPRG+FS+SRSPNLGRPHSDRGRRSP  DDDYSRY SSYS
Sbjct: 541 KSPPSFTPRYGGHDFSYDSQTPRGSFSTSRSPNLGRPHSDRGRRSPHRDDDYSRYSSSYS 600

BLAST of HG10018325 vs. NCBI nr
Match: XP_008459422.1 (PREDICTED: uncharacterized protein LOC103498564 isoform X1 [Cucumis melo])

HSP 1 Score: 1067.8 bits (2760), Expect = 5.7e-308
Identity = 544/617 (88.17%), Postives = 564/617 (91.41%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESL+FNETREAYSQSSVLKCKDDDASIEK EL DDV L
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLTFNETREAYSQSSVLKCKDDDASIEKAELVDDVQL 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           EDM C+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRSEN++ILS+ + LPVNSAD N
Sbjct: 61  EDMHCVPQSDLVDETQRSDSDMEIEDLNNLPDFSKTRSRSENSEILSKAEDLPVNSADGN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPS+EPLQQNELHTRYED          +D VDNSSF KTGGQLTVMNGVSI+FN LNS
Sbjct: 121 ILPSNEPLQQNELHTRYEDVCHVESQNFQKDLVDNSSFSKTGGQLTVMNGVSIDFNELNS 180

Query: 564 GVPIEYGSAASHHHGGPT-------ISGVKRPRM---AMDEQQPSVHIVYTSLTRDSKQK 623
           G P+E GSA SHHHGGP+       ISGVKRPRM   AMDEQQPSVHIVYTSLTRDSKQK
Sbjct: 181 GAPMENGSATSHHHGGPSKIQKSDGISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQK 240

Query: 624 LDELLKQWSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQ 683
           LDELLKQWSEWHA +GSLS DDKD+ENLESGEETFFPALCVGTKKTSAVTFW+DNQKSEQ
Sbjct: 241 LDELLKQWSEWHAQQGSLSRDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQ 300

Query: 684 QQNFVPIDDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRK 743
           QQ FVPIDDNSVPLYDRGFTLGLTSANDSS VEGG KIIDDASRCFNCGSYNHSLKDCRK
Sbjct: 301 QQTFVPIDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRK 360

Query: 744 PRDNAAVNNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDP 803
           PRDNAAVNNARNKYKK H+S SRNSTRYYQNSRGGKYDDLRPG LDAETRQLLGLKELDP
Sbjct: 361 PRDNAAVNNARNKYKKQHNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDP 420

Query: 804 PPWLNRMRELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEF 863
           PPWLNRMRELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKP+KKMSVEF
Sbjct: 421 PPWLNRMRELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPQKKMSVEF 480

Query: 864 PGINAPIPENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDR 923
           PGINAPIPENADERLWA EPSSSGLPRNRSNQRLNHY EYD +GNDHH QRWSRDYRDDR
Sbjct: 481 PGINAPIPENADERLWAPEPSSSGLPRNRSNQRLNHYPEYDTRGNDHHQQRWSRDYRDDR 540

Query: 924 PPGVDSVKSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYS 980
           PPGVDS+KSPP FTPRYGGHDFS DSQTPRG+FS+SRSPNLGRPHSDRGRRSP  DDDYS
Sbjct: 541 PPGVDSIKSPPSFTPRYGGHDFSYDSQTPRGSFSTSRSPNLGRPHSDRGRRSPHRDDDYS 600

BLAST of HG10018325 vs. NCBI nr
Match: XP_011656031.1 (uncharacterized protein LOC101212144 [Cucumis sativus] >KGN52565.1 hypothetical protein Csa_009343 [Cucumis sativus])

HSP 1 Score: 1053.5 bits (2723), Expect = 1.1e-303
Identity = 542/617 (87.84%), Postives = 559/617 (90.60%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESL+FNETREAYSQSSVLK KDDDASIEK EL DDV L
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLTFNETREAYSQSSVLKRKDDDASIEKAELVDDVQL 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           E M CIPQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRSEN++ILS+   LPVNSAD N
Sbjct: 61  EAMHCIPQSDLVDETQRSDSDMEIEDLNNLPDFSKTRSRSENSEILSKAADLPVNSADGN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPSSE LQQNELHTRYED          +D VDNSSFLKTGGQLTVMNGVSI+FN LNS
Sbjct: 121 ILPSSELLQQNELHTRYEDVCHVESKKFQKDLVDNSSFLKTGGQLTVMNGVSIDFNELNS 180

Query: 564 GVPIEYGSAASHHHGGPT-------ISGVKRPRM---AMDEQQPSVHIVYTSLTRDSKQK 623
           G P+E GSA SHHHGGP+       ISGVKRPRM   AMDEQQPSVHIVYTSLTRDSKQK
Sbjct: 181 GAPMENGSATSHHHGGPSKIQKSDGISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQK 240

Query: 624 LDELLKQWSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQ 683
           LDELLKQWSEWHA +GSLS DDKD+ENLESGEETFFPALCVGTKKTSAVTFW+DNQKSEQ
Sbjct: 241 LDELLKQWSEWHAQQGSLSCDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQ 300

Query: 684 QQNFVPIDDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRK 743
           QQNFVPIDDNSVPLYDRGFTLGLTSANDSS  EGG KIIDDASRCFNCGSYNHSLKDCRK
Sbjct: 301 QQNFVPIDDNSVPLYDRGFTLGLTSANDSSNAEGGQKIIDDASRCFNCGSYNHSLKDCRK 360

Query: 744 PRDNAAVNNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDP 803
           PRDNAAVNNARNKYKK H+S SRNSTRYYQNSRGGKYDDLRPG LDAETRQLLGLKELDP
Sbjct: 361 PRDNAAVNNARNKYKKQHNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDP 420

Query: 804 PPWLNRMRELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEF 863
           PPWLNRMRELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKK SVEF
Sbjct: 421 PPWLNRMRELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKKSVEF 480

Query: 864 PGINAPIPENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDR 923
           PGINAPIPENADERLWA EPS+SGL RNRSNQRLNHY EYD +GNDHH QRWSRDYRDDR
Sbjct: 481 PGINAPIPENADERLWAPEPSNSGLSRNRSNQRLNHYPEYDTRGNDHHQQRWSRDYRDDR 540

Query: 924 PPGVDSVKSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYS 980
           PPGVDS+KSPP FTPRYGGHDFS DSQTPRG+FS+SRSPNLGRPHSDRGRRSP  DDDYS
Sbjct: 541 PPGVDSIKSPPSFTPRYGGHDFSYDSQTPRGSFSTSRSPNLGRPHSDRGRRSPQRDDDYS 600

BLAST of HG10018325 vs. ExPASy Swiss-Prot
Match: Q6DD45 (Zinc finger CCHC domain-containing protein 8 OS=Xenopus laevis OX=8355 GN=zcchc8 PE=2 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 1.3e-17
Identity = 54/151 (35.76%), Postives = 88/151 (58.28%), Query Frame = 0

Query: 708 CFNCGSYNHSLKDCRKPRDNAAVNNARNKY-KKYHSSGSRNSTRYYQNSRGGKYDDLRPG 767
           CFNCGS  H ++DC KPRD A +N  R ++      +G++N  RY+      ++   +PG
Sbjct: 215 CFNCGSEEHQMRDCPKPRDQAHINMKRKEFLDACGEAGNQNQQRYHAEEVEERFGKYKPG 274

Query: 768 ALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGE 827
            +  E ++ LG+ + + PP++ RMRELGYPPG+L  E E + SG+++Y  ++  +  DGE
Sbjct: 275 VISEELQEALGIMDKNLPPFIYRMRELGYPPGWLK-EAELENSGLSLYDGKERLDASDGE 334

Query: 828 ITEAEYRKPRKKMS------VEFPGINAPIP 852
           I + +  + +K +S      V +PG N   P
Sbjct: 335 IEDRD-TEAKKHVSYDVSKLVNYPGFNISAP 363

BLAST of HG10018325 vs. ExPASy Swiss-Prot
Match: Q5F3D1 (Zinc finger CCHC domain-containing protein 8 OS=Gallus gallus OX=9031 GN=ZCCHC8 PE=2 SV=1)

HSP 1 Score: 86.3 bits (212), Expect = 2.1e-15
Identity = 48/148 (32.43%), Postives = 78/148 (52.70%), Query Frame = 0

Query: 706 SRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYKKYHSSGSRNS--TRYYQNSRGGKYDDL 765
           S CFNCGS  H +KDC KPR+ A ++  R ++ + +   S  +   RY+      ++   
Sbjct: 126 SHCFNCGSEEHQIKDCPKPRNAARISEKRKEFMEAYGEASNQNFQQRYHAEEVEERFGKF 185

Query: 766 RPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDEDQPSGITIYADEKTDEQE 825
           +PG +  E +  LG+     PP++ RMR+LGYPPG+L  E E + SG+ +Y  +  +E E
Sbjct: 186 KPGVISGELQDALGVTAKSLPPFIYRMRQLGYPPGWLK-EAEMEHSGLALYDGKDDNETE 245

Query: 826 DGEITEAEYRKPRKKMSVEFPGINAPIP 852
           D    + ++        + +PG N   P
Sbjct: 246 DEGYLQPKHVTYDVSKLINYPGFNISTP 272

BLAST of HG10018325 vs. ExPASy Swiss-Prot
Match: Q9CYA6 (Zinc finger CCHC domain-containing protein 8 OS=Mus musculus OX=10090 GN=Zcchc8 PE=1 SV=3)

HSP 1 Score: 83.2 bits (204), Expect = 1.8e-14
Identity = 52/148 (35.14%), Postives = 77/148 (52.03%), Query Frame = 0

Query: 708 CFNCGSYNHSLKDCRKPRDNAAVNNARNKYKKY--HSSGSRNSTRYYQNSRGGKYDDLRP 767
           CFNCGS  H +K+C  PR+ A ++  R +Y      +SG     RY+      ++   +P
Sbjct: 232 CFNCGSEEHQMKECPMPRNAARISEKRKEYMDACGEASGQSFQQRYHAEEVEERFGRFKP 291

Query: 768 GALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDEDQPSGITIY--ADEKTDEQE 827
           G +  E +  LG+ +   PP++ RMR+LGYPPG+L  E E + SG+ +Y   D+   E E
Sbjct: 292 GVISEELQDALGVTDKSLPPFIYRMRQLGYPPGWLK-EAELENSGLALYDGNDDADGETE 351

Query: 828 DGEITEAEYRKPRKKMSVEFPGINAPIP 852
            GEI          K+ V +PG N   P
Sbjct: 352 TGEIQNKNVTYDLSKL-VNYPGFNISTP 377

BLAST of HG10018325 vs. ExPASy Swiss-Prot
Match: Q5R789 (Zinc finger CCHC domain-containing protein 8 OS=Pongo abelii OX=9601 GN=ZCCHC8 PE=2 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 2.4e-11
Identity = 64/236 (27.12%), Postives = 105/236 (44.49%), Query Frame = 0

Query: 708 CFNCGSYNHSLKDCRKPRDNAAVNNARNKYKKY--HSSGSRNSTRYYQNSRGGKYDDLRP 767
           CFNCGS  H +KDC  PR+ A ++  R +Y      ++      RY+      ++   +P
Sbjct: 227 CFNCGSEEHQMKDCPMPRNAARISEKRKEYMDACGEANNQNFQQRYHAEEVEERFGRFKP 286

Query: 768 GALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDEDQPSGITIY--ADEKTDEQE 827
           G +  E +  LG+ +   PP++ RMR+LGYPPG+L  E E + SG+ +Y   D    E E
Sbjct: 287 GVISEELQDALGVTDKSLPPFIYRMRQLGYPPGWLK-EAELENSGLALYDGKDGTDGETE 346

Query: 828 DGEITEAEYRKPRKKMSVEFPGINAPIPENADE--RLWAAEPSSSGLPRNRSNQRLNHYT 887
            GEI + +         V +PG N   P    +  R++ + P  +   ++     L   +
Sbjct: 347 VGEIQQNKSVTYDLSKLVNYPGFNISTPRGIPDEWRIFGSIPMQACQQKDVFANYLT--S 406

Query: 888 EYDGKGNDHHQRWSRDYRDDRPPGVDSVKSPPMFTPRYGGHDFSSDSQTPRGNFSS 938
            +   G     + S  +     P     +S    +P     +  SD + P G+ SS
Sbjct: 407 NFQAPGVKSGNKRSSSHSSPGSPKKQKKESNSAGSP--ADMELDSDMEVPHGSQSS 457

BLAST of HG10018325 vs. ExPASy Swiss-Prot
Match: Q6NZY4 (Zinc finger CCHC domain-containing protein 8 OS=Homo sapiens OX=9606 GN=ZCCHC8 PE=1 SV=2)

HSP 1 Score: 72.4 bits (176), Expect = 3.2e-11
Identity = 64/236 (27.12%), Postives = 105/236 (44.49%), Query Frame = 0

Query: 708 CFNCGSYNHSLKDCRKPRDNAAVNNARNKYKKY--HSSGSRNSTRYYQNSRGGKYDDLRP 767
           CFNCGS  H +KDC  PR+ A ++  R +Y      ++      RY+      ++   +P
Sbjct: 229 CFNCGSEEHQMKDCPMPRNAARISEKRKEYMDACGEANNQNFQQRYHAEEVEERFGRFKP 288

Query: 768 GALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDEDQPSGITIY--ADEKTDEQE 827
           G +  E +  LG+ +   PP++ RMR+LGYPPG+L  E E + SG+ +Y   D    E E
Sbjct: 289 GVISEELQDALGVTDKSLPPFIYRMRQLGYPPGWLK-EAELENSGLALYDGKDGTDGETE 348

Query: 828 DGEITEAEYRKPRKKMSVEFPGINAPIPENADE--RLWAAEPSSSGLPRNRSNQRLNHYT 887
            GEI + +         V +PG N   P    +  R++ + P  +   ++     L   +
Sbjct: 349 VGEIQQNKSVTYDLSKLVNYPGFNISTPRGIPDEWRIFGSIPMQACQQKDVFANYLT--S 408

Query: 888 EYDGKGNDHHQRWSRDYRDDRPPGVDSVKSPPMFTPRYGGHDFSSDSQTPRGNFSS 938
            +   G     + S  +     P     +S    +P     +  SD + P G+ SS
Sbjct: 409 NFQAPGVKSGNKRSSSHSSPGSPKKQKNESNSAGSP--ADMELDSDMEVPHGSQSS 459

BLAST of HG10018325 vs. ExPASy TrEMBL
Match: A0A5D3BMZ1 (Zinc finger CCHC domain-containing protein 8 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G001930 PE=3 SV=1)

HSP 1 Score: 1073.9 bits (2776), Expect = 3.6e-310
Identity = 544/610 (89.18%), Postives = 563/610 (92.30%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESL+FNETREAYSQSSVLKCKDDDASIEK EL DDV L
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLTFNETREAYSQSSVLKCKDDDASIEKAELVDDVQL 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           EDM C+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRSEN++ILS+ + LPVNSAD N
Sbjct: 61  EDMHCVPQSDLVDETQRSDSDMEIEDLNNLPDFSKTRSRSENSEILSKAEDLPVNSADGN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPS+EPLQQNELHTRYED          +D VDNSSF KTGGQLTVMNGVSI+FN LNS
Sbjct: 121 ILPSNEPLQQNELHTRYEDVCHVESQNFQKDLVDNSSFSKTGGQLTVMNGVSIDFNELNS 180

Query: 564 GVPIEYGSAASHHHGGPTISGVKRPRM---AMDEQQPSVHIVYTSLTRDSKQKLDELLKQ 623
           G P+E GSA SHHHGGP ISGVKRPRM   AMDEQQPSVHIVYTSLTRDSKQKLDELLKQ
Sbjct: 181 GAPMENGSATSHHHGGPRISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQ 240

Query: 624 WSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQQQNFVPI 683
           WSEWHA +GSLS DDKD+ENLESGEETFFPALCVGTKKTSAVTFW+DNQKSEQQQ FVPI
Sbjct: 241 WSEWHAQQGSLSRDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQTFVPI 300

Query: 684 DDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRDNAAV 743
           DDNSVPLYDRGFTLGLTSANDSS VEGG KIIDDASRCFNCGSYNHSLKDCRKPRDNAAV
Sbjct: 301 DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAV 360

Query: 744 NNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRM 803
           NNARNKYKK H+S SRNSTRYYQNSRGGKYDDLRPG LDAETRQLLGLKELDPPPWLNRM
Sbjct: 361 NNARNKYKKQHNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRM 420

Query: 804 RELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEFPGINAPI 863
           RELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKP+KKMSVEFPGINAPI
Sbjct: 421 RELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPQKKMSVEFPGINAPI 480

Query: 864 PENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDRPPGVDSV 923
           PENADERLWA EPSSSGLPRNRSNQRLNHY EYD +GNDHH QRWSRDYRDDRPPGVDS+
Sbjct: 481 PENADERLWAPEPSSSGLPRNRSNQRLNHYPEYDTRGNDHHQQRWSRDYRDDRPPGVDSI 540

Query: 924 KSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYSRYGSSYS 980
           KSPP FTPRYGGHDFS DSQTPRG+FS+SRSPNLGRPHSDRGRRSP  DDDYSRY SSYS
Sbjct: 541 KSPPSFTPRYGGHDFSYDSQTPRGSFSTSRSPNLGRPHSDRGRRSPHRDDDYSRYSSSYS 600

BLAST of HG10018325 vs. ExPASy TrEMBL
Match: E5GCT2 (Nucleic acid binding protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1)

HSP 1 Score: 1073.9 bits (2776), Expect = 3.6e-310
Identity = 544/610 (89.18%), Postives = 563/610 (92.30%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESL+FNETREAYSQSSVLKCKDDDASIEK EL DDV L
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLTFNETREAYSQSSVLKCKDDDASIEKAELVDDVQL 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           EDM C+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRSEN++ILS+ + LPVNSAD N
Sbjct: 61  EDMHCVPQSDLVDETQRSDSDMEIEDLNNLPDFSKTRSRSENSEILSKAEDLPVNSADGN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPS+EPLQQNELHTRYED          +D VDNSSF KTGGQLTVMNGVSI+FN LNS
Sbjct: 121 ILPSNEPLQQNELHTRYEDVCHVESQNFQKDLVDNSSFSKTGGQLTVMNGVSIDFNELNS 180

Query: 564 GVPIEYGSAASHHHGGPTISGVKRPRM---AMDEQQPSVHIVYTSLTRDSKQKLDELLKQ 623
           G P+E GSA SHHHGGP ISGVKRPRM   AMDEQQPSVHIVYTSLTRDSKQKLDELLKQ
Sbjct: 181 GAPMENGSATSHHHGGPRISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQ 240

Query: 624 WSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQQQNFVPI 683
           WSEWHA +GSLS DDKD+ENLESGEETFFPALCVGTKKTSAVTFW+DNQKSEQQQ FVPI
Sbjct: 241 WSEWHAQQGSLSRDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQTFVPI 300

Query: 684 DDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRDNAAV 743
           DDNSVPLYDRGFTLGLTSANDSS VEGG KIIDDASRCFNCGSYNHSLKDCRKPRDNAAV
Sbjct: 301 DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAV 360

Query: 744 NNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRM 803
           NNARNKYKK H+S SRNSTRYYQNSRGGKYDDLRPG LDAETRQLLGLKELDPPPWLNRM
Sbjct: 361 NNARNKYKKQHNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRM 420

Query: 804 RELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEFPGINAPI 863
           RELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKP+KKMSVEFPGINAPI
Sbjct: 421 RELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPQKKMSVEFPGINAPI 480

Query: 864 PENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDRPPGVDSV 923
           PENADERLWA EPSSSGLPRNRSNQRLNHY EYD +GNDHH QRWSRDYRDDRPPGVDS+
Sbjct: 481 PENADERLWAPEPSSSGLPRNRSNQRLNHYPEYDTRGNDHHQQRWSRDYRDDRPPGVDSI 540

Query: 924 KSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYSRYGSSYS 980
           KSPP FTPRYGGHDFS DSQTPRG+FS+SRSPNLGRPHSDRGRRSP  DDDYSRY SSYS
Sbjct: 541 KSPPSFTPRYGGHDFSYDSQTPRGSFSTSRSPNLGRPHSDRGRRSPHRDDDYSRYSSSYS 600

BLAST of HG10018325 vs. ExPASy TrEMBL
Match: A0A1S3CBD3 (uncharacterized protein LOC103498564 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498564 PE=3 SV=1)

HSP 1 Score: 1073.9 bits (2776), Expect = 3.6e-310
Identity = 544/610 (89.18%), Postives = 563/610 (92.30%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESL+FNETREAYSQSSVLKCKDDDASIEK EL DDV L
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLTFNETREAYSQSSVLKCKDDDASIEKAELVDDVQL 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           EDM C+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRSEN++ILS+ + LPVNSAD N
Sbjct: 61  EDMHCVPQSDLVDETQRSDSDMEIEDLNNLPDFSKTRSRSENSEILSKAEDLPVNSADGN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPS+EPLQQNELHTRYED          +D VDNSSF KTGGQLTVMNGVSI+FN LNS
Sbjct: 121 ILPSNEPLQQNELHTRYEDVCHVESQNFQKDLVDNSSFSKTGGQLTVMNGVSIDFNELNS 180

Query: 564 GVPIEYGSAASHHHGGPTISGVKRPRM---AMDEQQPSVHIVYTSLTRDSKQKLDELLKQ 623
           G P+E GSA SHHHGGP ISGVKRPRM   AMDEQQPSVHIVYTSLTRDSKQKLDELLKQ
Sbjct: 181 GAPMENGSATSHHHGGPRISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQKLDELLKQ 240

Query: 624 WSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQQQNFVPI 683
           WSEWHA +GSLS DDKD+ENLESGEETFFPALCVGTKKTSAVTFW+DNQKSEQQQ FVPI
Sbjct: 241 WSEWHAQQGSLSRDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQQQTFVPI 300

Query: 684 DDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRDNAAV 743
           DDNSVPLYDRGFTLGLTSANDSS VEGG KIIDDASRCFNCGSYNHSLKDCRKPRDNAAV
Sbjct: 301 DDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRKPRDNAAV 360

Query: 744 NNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRM 803
           NNARNKYKK H+S SRNSTRYYQNSRGGKYDDLRPG LDAETRQLLGLKELDPPPWLNRM
Sbjct: 361 NNARNKYKKQHNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDPPPWLNRM 420

Query: 804 RELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEFPGINAPI 863
           RELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKP+KKMSVEFPGINAPI
Sbjct: 421 RELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPQKKMSVEFPGINAPI 480

Query: 864 PENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDRPPGVDSV 923
           PENADERLWA EPSSSGLPRNRSNQRLNHY EYD +GNDHH QRWSRDYRDDRPPGVDS+
Sbjct: 481 PENADERLWAPEPSSSGLPRNRSNQRLNHYPEYDTRGNDHHQQRWSRDYRDDRPPGVDSI 540

Query: 924 KSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYSRYGSSYS 980
           KSPP FTPRYGGHDFS DSQTPRG+FS+SRSPNLGRPHSDRGRRSP  DDDYSRY SSYS
Sbjct: 541 KSPPSFTPRYGGHDFSYDSQTPRGSFSTSRSPNLGRPHSDRGRRSPHRDDDYSRYSSSYS 600

BLAST of HG10018325 vs. ExPASy TrEMBL
Match: A0A1S3CAN3 (uncharacterized protein LOC103498564 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498564 PE=3 SV=1)

HSP 1 Score: 1067.8 bits (2760), Expect = 2.8e-308
Identity = 544/617 (88.17%), Postives = 564/617 (91.41%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESL+FNETREAYSQSSVLKCKDDDASIEK EL DDV L
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLTFNETREAYSQSSVLKCKDDDASIEKAELVDDVQL 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           EDM C+PQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRSEN++ILS+ + LPVNSAD N
Sbjct: 61  EDMHCVPQSDLVDETQRSDSDMEIEDLNNLPDFSKTRSRSENSEILSKAEDLPVNSADGN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPS+EPLQQNELHTRYED          +D VDNSSF KTGGQLTVMNGVSI+FN LNS
Sbjct: 121 ILPSNEPLQQNELHTRYEDVCHVESQNFQKDLVDNSSFSKTGGQLTVMNGVSIDFNELNS 180

Query: 564 GVPIEYGSAASHHHGGPT-------ISGVKRPRM---AMDEQQPSVHIVYTSLTRDSKQK 623
           G P+E GSA SHHHGGP+       ISGVKRPRM   AMDEQQPSVHIVYTSLTRDSKQK
Sbjct: 181 GAPMENGSATSHHHGGPSKIQKSDGISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQK 240

Query: 624 LDELLKQWSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQ 683
           LDELLKQWSEWHA +GSLS DDKD+ENLESGEETFFPALCVGTKKTSAVTFW+DNQKSEQ
Sbjct: 241 LDELLKQWSEWHAQQGSLSRDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQ 300

Query: 684 QQNFVPIDDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRK 743
           QQ FVPIDDNSVPLYDRGFTLGLTSANDSS VEGG KIIDDASRCFNCGSYNHSLKDCRK
Sbjct: 301 QQTFVPIDDNSVPLYDRGFTLGLTSANDSSNVEGGQKIIDDASRCFNCGSYNHSLKDCRK 360

Query: 744 PRDNAAVNNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDP 803
           PRDNAAVNNARNKYKK H+S SRNSTRYYQNSRGGKYDDLRPG LDAETRQLLGLKELDP
Sbjct: 361 PRDNAAVNNARNKYKKQHNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDP 420

Query: 804 PPWLNRMRELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEF 863
           PPWLNRMRELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKP+KKMSVEF
Sbjct: 421 PPWLNRMRELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPQKKMSVEF 480

Query: 864 PGINAPIPENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDR 923
           PGINAPIPENADERLWA EPSSSGLPRNRSNQRLNHY EYD +GNDHH QRWSRDYRDDR
Sbjct: 481 PGINAPIPENADERLWAPEPSSSGLPRNRSNQRLNHYPEYDTRGNDHHQQRWSRDYRDDR 540

Query: 924 PPGVDSVKSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYS 980
           PPGVDS+KSPP FTPRYGGHDFS DSQTPRG+FS+SRSPNLGRPHSDRGRRSP  DDDYS
Sbjct: 541 PPGVDSIKSPPSFTPRYGGHDFSYDSQTPRGSFSTSRSPNLGRPHSDRGRRSPHRDDDYS 600

BLAST of HG10018325 vs. ExPASy TrEMBL
Match: A0A0A0KSK4 (CCHC-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G643930 PE=3 SV=1)

HSP 1 Score: 1053.5 bits (2723), Expect = 5.4e-304
Identity = 542/617 (87.84%), Postives = 559/617 (90.60%), Query Frame = 0

Query: 384 MGTEDFIALPGSGDSGNETESNESLSFNETREAYSQSSVLKCKDDDASIEKVELADDVHL 443
           MGTEDFIALP SGDSGNETESNESL+FNETREAYSQSSVLK KDDDASIEK EL DDV L
Sbjct: 1   MGTEDFIALPASGDSGNETESNESLTFNETREAYSQSSVLKRKDDDASIEKAELVDDVQL 60

Query: 444 EDMPCIPQSDLTDETQRSDSDMEIEDLNNLPDFSKTRSRSENNKILSETDYLPVNSADEN 503
           E M CIPQSDL DETQRSDSDMEIEDLNNLPDFSKTRSRSEN++ILS+   LPVNSAD N
Sbjct: 61  EAMHCIPQSDLVDETQRSDSDMEIEDLNNLPDFSKTRSRSENSEILSKAADLPVNSADGN 120

Query: 504 ILPSSEPLQQNELHTRYED----------EDWVDNSSFLKTGGQLTVMNGVSIEFNGLNS 563
           ILPSSE LQQNELHTRYED          +D VDNSSFLKTGGQLTVMNGVSI+FN LNS
Sbjct: 121 ILPSSELLQQNELHTRYEDVCHVESKKFQKDLVDNSSFLKTGGQLTVMNGVSIDFNELNS 180

Query: 564 GVPIEYGSAASHHHGGPT-------ISGVKRPRM---AMDEQQPSVHIVYTSLTRDSKQK 623
           G P+E GSA SHHHGGP+       ISGVKRPRM   AMDEQQPSVHIVYTSLTRDSKQK
Sbjct: 181 GAPMENGSATSHHHGGPSKIQKSDGISGVKRPRMAMEAMDEQQPSVHIVYTSLTRDSKQK 240

Query: 624 LDELLKQWSEWHALRGSLSHDDKDSENLESGEETFFPALCVGTKKTSAVTFWIDNQKSEQ 683
           LDELLKQWSEWHA +GSLS DDKD+ENLESGEETFFPALCVGTKKTSAVTFW+DNQKSEQ
Sbjct: 241 LDELLKQWSEWHAQQGSLSCDDKDTENLESGEETFFPALCVGTKKTSAVTFWMDNQKSEQ 300

Query: 684 QQNFVPIDDNSVPLYDRGFTLGLTSANDSSKVEGGLKIIDDASRCFNCGSYNHSLKDCRK 743
           QQNFVPIDDNSVPLYDRGFTLGLTSANDSS  EGG KIIDDASRCFNCGSYNHSLKDCRK
Sbjct: 301 QQNFVPIDDNSVPLYDRGFTLGLTSANDSSNAEGGQKIIDDASRCFNCGSYNHSLKDCRK 360

Query: 744 PRDNAAVNNARNKYKKYHSSGSRNSTRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDP 803
           PRDNAAVNNARNKYKK H+S SRNSTRYYQNSRGGKYDDLRPG LDAETRQLLGLKELDP
Sbjct: 361 PRDNAAVNNARNKYKKQHNSASRNSTRYYQNSRGGKYDDLRPGTLDAETRQLLGLKELDP 420

Query: 804 PPWLNRMRELGYPPGYLDLEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKMSVEF 863
           PPWLNRMRELGYPPGYLD EDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKK SVEF
Sbjct: 421 PPWLNRMRELGYPPGYLDPEDEDQPSGITIYADEKTDEQEDGEITEAEYRKPRKKKSVEF 480

Query: 864 PGINAPIPENADERLWAAEPSSSGLPRNRSNQRLNHYTEYDGKGNDHH-QRWSRDYRDDR 923
           PGINAPIPENADERLWA EPS+SGL RNRSNQRLNHY EYD +GNDHH QRWSRDYRDDR
Sbjct: 481 PGINAPIPENADERLWAPEPSNSGLSRNRSNQRLNHYPEYDTRGNDHHQQRWSRDYRDDR 540

Query: 924 PPGVDSVKSPPMFTPRYGGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYS 980
           PPGVDS+KSPP FTPRYGGHDFS DSQTPRG+FS+SRSPNLGRPHSDRGRRSP  DDDYS
Sbjct: 541 PPGVDSIKSPPSFTPRYGGHDFSYDSQTPRGSFSTSRSPNLGRPHSDRGRRSPQRDDDYS 600

BLAST of HG10018325 vs. TAIR 10
Match: AT3G02420.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: membrane; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0121 (InterPro:IPR005344); Has 72 Blast hits to 71 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 2; Plants - 60; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 452.6 bits (1163), Expect = 8.1e-127
Identity = 231/301 (76.74%), Postives = 264/301 (87.71%), Query Frame = 0

Query: 47  MGEEQEDSQRLKRVAAAAYDYENDPRWADYWSNILIPPHMASRPDVVDHYKRKFYQRYID 106
           M E  EDSQRLK++AAAA+DYEND RWADYWSNILIPPHMASRP+VVDH+KRKFYQRYID
Sbjct: 1   MAEGGEDSQRLKKIAAAAFDYENDARWADYWSNILIPPHMASRPEVVDHFKRKFYQRYID 60

Query: 107 PELVVEAMS-SSSSTQSSRPSAT--SSAAPPPTNDRSRPQSAGSTTTRTSGTSASADANP 166
           P+LVVE MS SSSS+QS+RP+AT  SS A    N++ R +++GS   RTSG SA+  A P
Sbjct: 61  PDLVVEPMSTSSSSSQSARPTATSASSTASSNANEQVRSRNSGS-VPRTSGPSATTGATP 120

Query: 167 TPLRWDRQTIQFSVNAWVFIVAVLAIFPLIPKNLSQRAYRLSFMGTTCSSLYSLYSLYGK 226
           + +RWD QTIQFSVNAWVF++AVLA+ PLIPKNLS RAYRLSFMGT CSSLYSLYSLYG+
Sbjct: 121 SSMRWDEQTIQFSVNAWVFVIAVLAVLPLIPKNLSNRAYRLSFMGTACSSLYSLYSLYGR 180

Query: 227 PRAWNLQALQAYFQSIIATKDFIYFTYCITFVTSNICLKFALIPILCRALEHVAKFLRRN 286
           PRAWN+Q LQ YFQSI+A KDFIYF YC+TFVTS++CLKFALIPILCRALE VAKFLRRN
Sbjct: 181 PRAWNMQGLQVYFQSIVAAKDFIYFIYCLTFVTSHLCLKFALIPILCRALEQVAKFLRRN 240

Query: 287 FARSSLYRKYLEEPCVWVESNSTTLSILSSQAEIGIGFILIISLLSWQRNFLHTFMYWQR 345
           F RS++YRKYLE+PCVWVESN+TTL+ILSSQAEI IGF+LIISLLSWQRN + TFMYWQ 
Sbjct: 241 FGRSTIYRKYLEDPCVWVESNTTTLNILSSQAEIAIGFLLIISLLSWQRNIIQTFMYWQL 300

BLAST of HG10018325 vs. TAIR 10
Match: AT5G38600.1 (Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein )

HSP 1 Score: 347.8 bits (891), Expect = 2.8e-95
Identity = 209/409 (51.10%), Postives = 265/409 (64.79%), Query Frame = 0

Query: 573 SGVKRPRMAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHALRGSLSHDDKDSENLE 632
           +GVKRPR + DEQQP+VH+ Y  LTR SKQKL+ LL++WSEW A   SL+ D +  +  E
Sbjct: 108 AGVKRPRTSYDEQQPTVHVTYKHLTRASKQKLESLLQKWSEWEAENTSLAQDQE--QLFE 167

Query: 633 SGEETFFPALCVGTKKTSAVTFWIDNQKSEQQ-QNFVPIDDNSVPLYDRGFTLGLTSAND 692
           SGEET FPA+ VG +KTS+V+FWIDNQ   +  ++FV ++ ++ PLYDR F +GL SA+ 
Sbjct: 168 SGEETCFPAIRVGLQKTSSVSFWIDNQTGHKPLEDFVLVESSTTPLYDRKFAIGLNSADG 227

Query: 693 SSKVEGGLKII-DDASRCFNCGSYNHSLKDCRKPRDNAAVNNARNKYK---KYHSSGSRN 752
           S  VEGGL+II DD  RCFNCG Y+HSL++C +P D +AVN+AR   K     +SSG R 
Sbjct: 228 SRNVEGGLEIIDDDPPRCFNCGGYSHSLRECPRPFDRSAVNSARKLQKSKRNQNSSGPRL 287

Query: 753 STRYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDEDQ 812
            +RYYQ ++ GKYD L+PG LDAETRQLL L ELDPPPWLNRMRE+GYPPGYL  ED D 
Sbjct: 288 PSRYYQKTQTGKYDGLKPGTLDAETRQLLNLGELDPPPWLNRMREIGYPPGYLAPED-DH 347

Query: 813 PSGITIYADE----KTDEQEDGEITEAEYR--KPRKKMSVEFPGINAPIPENADERLWAA 872
            SGITI+ +E    +  E EDGEI E      +P+ K +VEFPGINAP PENADE LW A
Sbjct: 348 LSGITIFGEEVETREEIESEDGEILEKANHPPEPQMKKTVEFPGINAPFPENADEWLWEA 407

Query: 873 EPSSSGLPRNRSNQRLNHYTEYDGKGNDHHQRWSR--DYRDDRPPGVDSVKSPPMFTPRY 932
            PS      +R++ R          G    Q+ SR  DYRDD P GV+    PP +  RY
Sbjct: 408 APS------HRNSSR---------SGRWQQQKTSRGHDYRDDGPLGVEPSSYPPRYGSRY 467

Query: 933 GGHDFSSDSQTPRGNFSSSRSPNLGRPHSDRGRRSPLLDDDYSRYGSSY 969
             + + S+    R     SRSP + R  S+R +R      DYS Y + +
Sbjct: 468 -DYGYGSNEYGSR-----SRSPGIDRSLSERSKR------DYSSYDADF 486

BLAST of HG10018325 vs. TAIR 10
Match: AT1G67210.2 (Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein )

HSP 1 Score: 334.7 bits (857), Expect = 2.5e-91
Identity = 177/304 (58.22%), Postives = 228/304 (75.00%), Query Frame = 0

Query: 573 SGVKRPRMAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHALRGSLSHDDKDSENLE 632
           SGVKR R    EQQPSVH+ Y  LTRDSKQKL+ LL+QWSEW A + SLS D +  + LE
Sbjct: 66  SGVKRARTISLEQQPSVHVTYKHLTRDSKQKLESLLQQWSEWEAEQNSLSEDQE--QVLE 125

Query: 633 SGEETFFPALCVGTKKTSAVTFWIDNQKS-EQQQNFVPIDDNSVPLYDRGFTLGLTSAND 692
           +G+ET+FPAL VG +KTS+V+FW D Q      +  VP++ ++ PLY+RGFT+GL S   
Sbjct: 126 AGDETYFPALRVGLQKTSSVSFWFDYQTGHSSSKKSVPVESSTTPLYNRGFTIGLDSG-- 185

Query: 693 SSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNAR--NKYKKYHSSGSRNST 752
           S+ VEGGL+IIDD  RCFNCG+Y+HS+++C +P D +AV+NAR  +K K+  + GSR  +
Sbjct: 186 SNNVEGGLEIIDDPPRCFNCGAYSHSIRECPRPFDRSAVSNARRQHKRKRNQTPGSRLPS 245

Query: 753 RYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDLEDEDQPS 812
           RYYQ+ + GKYD L+PG+LDAETR+LLGLKELDPPPWLNRMRE+GYPPGY + ED+D  S
Sbjct: 246 RYYQSLQRGKYDGLKPGSLDAETRKLLGLKELDPPPWLNRMREIGYPPGYFE-EDDDDHS 305

Query: 813 GITIYADEKTDEQ-----EDGEITE-AEYRKPRKKMSVEFPGINAPIPENADERLWAAEP 868
            ITI+ +E+T E+     E+GEI E A  ++PRK M+V FPGINAPIPENAD  LW    
Sbjct: 306 RITIFGEEETKEEEEVKTEEGEILEKASPQEPRKIMTVGFPGINAPIPENADSWLWEQRN 364

BLAST of HG10018325 vs. TAIR 10
Match: AT1G67210.1 (Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-type) family protein )

HSP 1 Score: 334.0 bits (855), Expect = 4.2e-91
Identity = 177/305 (58.03%), Postives = 228/305 (74.75%), Query Frame = 0

Query: 573 SGVKRPRMAMDEQQPSVHIVYTSLTRDSKQKLDELLKQWSEWHALRGSLSHDDKDSENLE 632
           SGVKR R    EQQPSVH+ Y  LTRDSKQKL+ LL+QWSEW A + SLS D +  + LE
Sbjct: 66  SGVKRARTISLEQQPSVHVTYKHLTRDSKQKLESLLQQWSEWEAEQNSLSEDQE--QVLE 125

Query: 633 SGEETFFPALCVGTKKTSAVTFWIDNQKS-EQQQNFVPIDDNSVPLYDRGFTLGLTSAND 692
           +G+ET+FPAL VG +KTS+V+FW D Q      +  VP++ ++ PLY+RGFT+GL S   
Sbjct: 126 AGDETYFPALRVGLQKTSSVSFWFDYQTGHSSSKKSVPVESSTTPLYNRGFTIGLDSG-- 185

Query: 693 SSKVEGGLKIIDDASRCFNCGSYNHSLKDCRKPRDNAAVNNAR--NKYKKYHSSGSRNST 752
           S+ VEGGL+IIDD  RCFNCG+Y+HS+++C +P D +AV+NAR  +K K+  + GSR  +
Sbjct: 186 SNNVEGGLEIIDDPPRCFNCGAYSHSIRECPRPFDRSAVSNARRQHKRKRNQTPGSRLPS 245

Query: 753 RYYQNSRGGKYDDLRPGALDAETRQLLGLKELDPPPWLNRMRELGYPPGYLDL-EDEDQP 812
           RYYQ+ + GKYD L+PG+LDAETR+LLGLKELDPPPWLNRMRE+GYPPGY  + ED+D  
Sbjct: 246 RYYQSLQRGKYDGLKPGSLDAETRKLLGLKELDPPPWLNRMREIGYPPGYFAVEEDDDDH 305

Query: 813 SGITIYADEKTDEQ-----EDGEITE-AEYRKPRKKMSVEFPGINAPIPENADERLWAAE 868
           S ITI+ +E+T E+     E+GEI E A  ++PRK M+V FPGINAPIPENAD  LW   
Sbjct: 306 SRITIFGEEETKEEEEVKTEEGEILEKASPQEPRKIMTVGFPGINAPIPENADSWLWEQR 365

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7016075.10.0e+0074.26Zinc finger CCHC domain-containing protein 8 [Cucurbita argyrosperma subsp. argy... [more]
XP_038890370.10.0e+0090.39uncharacterized protein LOC120079961 [Benincasa hispida] >XP_038890371.1 unchara... [more]
XP_008459423.17.5e-31089.18PREDICTED: uncharacterized protein LOC103498564 isoform X2 [Cucumis melo] >ADN34... [more]
XP_008459422.15.7e-30888.17PREDICTED: uncharacterized protein LOC103498564 isoform X1 [Cucumis melo][more]
XP_011656031.11.1e-30387.84uncharacterized protein LOC101212144 [Cucumis sativus] >KGN52565.1 hypothetical ... [more]
Match NameE-valueIdentityDescription
Q6DD451.3e-1735.76Zinc finger CCHC domain-containing protein 8 OS=Xenopus laevis OX=8355 GN=zcchc8... [more]
Q5F3D12.1e-1532.43Zinc finger CCHC domain-containing protein 8 OS=Gallus gallus OX=9031 GN=ZCCHC8 ... [more]
Q9CYA61.8e-1435.14Zinc finger CCHC domain-containing protein 8 OS=Mus musculus OX=10090 GN=Zcchc8 ... [more]
Q5R7892.4e-1127.12Zinc finger CCHC domain-containing protein 8 OS=Pongo abelii OX=9601 GN=ZCCHC8 P... [more]
Q6NZY43.2e-1127.12Zinc finger CCHC domain-containing protein 8 OS=Homo sapiens OX=9606 GN=ZCCHC8 P... [more]
Match NameE-valueIdentityDescription
A0A5D3BMZ13.6e-31089.18Zinc finger CCHC domain-containing protein 8 isoform X2 OS=Cucumis melo var. mak... [more]
E5GCT23.6e-31089.18Nucleic acid binding protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1[more]
A0A1S3CBD33.6e-31089.18uncharacterized protein LOC103498564 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S3CAN32.8e-30888.17uncharacterized protein LOC103498564 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A0A0KSK45.4e-30487.84CCHC-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G643930 P... [more]
Match NameE-valueIdentityDescription
AT3G02420.18.1e-12776.74unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G38600.12.8e-9551.10Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-ty... [more]
AT1G67210.22.5e-9158.22Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-ty... [more]
AT1G67210.14.2e-9158.03Proline-rich spliceosome-associated (PSP) family protein / zinc knuckle (CCHC-ty... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006568PSP, proline-richSMARTSM00581testneucoord: 760..813
e-value: 7.7E-14
score: 62.0
IPR006568PSP, proline-richPFAMPF04046PSPcoord: 764..803
e-value: 2.5E-12
score: 46.4
IPR005344TMEM33/Pom33 familyPFAMPF03661TMEM33_Pom33coord: 171..344
e-value: 3.1E-14
score: 52.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 963..979
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 29..57
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 394..414
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 397..414
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 457..484
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 115..164
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 862..979
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 945..959
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 736..755
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 922..944
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 877..906
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 446..484
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 732..765
NoneNo IPR availablePANTHERPTHR13316:SF0ZINC FINGER CCHC DOMAIN-CONTAINING PROTEIN 8coord: 502..977
NoneNo IPR availablePANTHERPTHR13316ZINC FINGER, CCHC DOMAIN CONTAINING 8coord: 502..977
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 707..723
score: 8.861545

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10018325.1HG10018325.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0034470 ncRNA processing
cellular_component GO:0071013 catalytic step 2 spliceosome
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005654 nucleoplasm
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding