HG10002345 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10002345
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: general transcription factor 3C polypeptide 3
Location: Chr11: 5823615 .. 5842501 (+)
RNA-Seq Expression: HG10002345
Synteny: HG10002345
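
The Location field can be read programmatically. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are made up for this example, and it assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state.

```python
import re

def parse_location(text):
    """Split a string like 'Chr11: 5823615 .. 5842501 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr11: 5823615 .. 5842501 (+)")
print(chrom, strand, span_length(start, end))
```

If the database instead used 0-based half-open coordinates, the `+ 1` would have to be dropped.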
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAAGGAAGGGAGTAAAATTTCTGACAATGAAGAGGTTCCTGGTGGTGTTATGCGTGTTTTAGGAGCAGAAAAAGAGGTTGTAGAAACAGGAGTGGAGGCTAGAGAGGAGGAGGAAGAGGAAGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGTTACATATTCAAATTTAAGGCCGGAGAAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGTCAGAGGTAAATTTTATTCTATACTTTCTGTTCTTTCACCCCCTTCTTATTCTTAATTTTCTCTCTCGCCCTTTTTTGGGAGGTGGAGGAAATGGTTGCCTTCCATTGACAGGATTTACTGAAAAAATACTGGAAGTATGGCATCTACCTGCAAAGGTTTTATTGAAAAATATTGAAAGCATGATCCTACCTGACACCTGCAAAGTTTATAATAACAATCTTTTAAGTTACTGTGAAATTGGTGATGAGATGATGCGATGTTGCGATGCAGTGTCAGTGCAGAATTTATTTTGGTTTCTGTATTTTATTGTGGTTTATTAATTCAAGCTAGTTGTATCAATTTTGAAAATTTCAAATTTAGAGTGGTGCATTTTGGTATTAAAAAGATAGTTTTTCATTCATCCAAATGCATGAGCTAAAACAAAATCTATGCTCGTGCCTAACTCCCTACTAACAAACTCACAGTGGCAATGAATAATAATTAACTGATTTCCAACTACACTTGAGGTTTGGGCCGTTAAGTGAGCTATCAAACCTTTTGGCTAGTTTATAAAAAAAAATGCTATATTGTTTACGGACTAAGAACCCTGCAAACCAACAAACAGTAGTGTATTTGGCATGCATCTTGATCTTGGAGAAAGATTCAATTCTGGTTGGATTTATATATATATTAGCTCAAACTTAACCTTCTTGAATGTTCTAGTTGCGTTCTTGGTGATTCTAAATTACTTGAATGGGCTAATATAAGAATATTATGGTGTGAAATTCATATAATTTCCTTCTTTGTCCCTGTAAAAACAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGATATTCCTGGTGCAAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAAAGAGGTACATCCATGCTAATGTTATTTTAATTAAGCAGAGGAAGCGTTGGGCTGTTGAAAAGTCATTTTGTCATCCCCTTCACTCTGCCATATTATCAATATTCTGCATTATATTGTGCCTTCTTTGATTTTTTATATTTTCTGTGATCTATTATCTATTCTAAGATGCAAGCATTTCTGTGTTTTTGTAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACGAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGAAGGTACAAAATCTTAGAGATGATCTCCTTCCCTTTGAATAGTTCTTGTTAAGTCTTCACAACACAATGCCTACAAAAAGAACAAGTGAAACAACCTTTTTCTTCTTATATCACTGAATTTACTCCTTGAAACATTATGGTTGTAGGCTATATCTATATTGCGTCAAGTTGTTCTGCAAGCCCCAGACTTACCTGATTCGTACCATACGCTTGGACTTGTATACAATGCAATTGGTGATGATGTAAAAGCCATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATGTGAGGTCCTCTTCCATCTATGAATATATTTCTTAGAAGGTGGTATAATGCTTTTGCCAAATTAAGATCAATTTTGTTTATAATTGAATTCAACGAGGCATCAAATAGGCCAAGTTCATTAGAACGTTCTTGAGCAACCAT
CTTTATACATTCATGTGTTCCGTCTTTACATTATCGATATAAGGAAGTAAATGACAGACACAAAAGTCTGAGGCCAGGACATTGACGTTGAACAATTGCATCTTAAAGACCCTTCATCCTTGATTTGAATTAAAGGAAGTATCTAGTGTATATTGTATGTGACATACACCTCAGAAGTATAGGCTAATTTAAGAATTACGGTCATGCCCATGTTCCTGTGAAGTCATTACTTTTAGTTTTAGTGTATGAACTGAAGAAATATGATTTACAGCTTGTAGATATTTGACACGTGCAGTGATCGAGGTGATATTGATCAAGCAAGCTACTGTCTTTCTAAAGCAATAAAAGCAGAGCCTGATGACATTAATTTATTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACATATGATCAGATTCATCAACAATGCCTTGGAAATGTTGAAGCACTCATGACAGGAGCAAAGGTTTGATGTTTTATCTCCTAATTTTCTACTGGGTTTGACTTCCTTAGCCTTGAACACTTGCGATTCCAAGATTATTATTGCATTAATTTATACACGTTTTTCTTAGTTTGAAAAGCTACCACTTGTTTTCTGCATGGTTTGTATTTGGCCAATCAAATTTCTTAAACTATTCTCTCTTCTCCGGTTAGAATGTCATTAGTTATTTTTCTCTTCTCAAGTTGGATTTTCCAAGCCTTGTAAGTTACAAATATATATCTTTAAAAACCATTGCAATCAAGACTTTGTAGTTCCAATCTTCTCATTGGCTACAATCCAATAGGTTTTACTTCTTGCCGAATTACTTTTATTTAAATTAATTAAGTTATCCATGTTATTCAATCCACGACTTGCAGTTAATAATTTTAATGAAACACAACCCCAAGCAGTCTTGATGTAAAATTCCTACTTCCTAATTAAATTTTACTTACAAGTTTTGAAATAATGAAGCTTATTATTGTTGGAATTTATAAAGTCTGAGGATGTGTTTCGCTGTTGTCTATTGTTTATTTCATCAGAATAGCTACCTCTAATTGTATTTGGATTTAAGTGACTAAATCCCCCCCCCCCCCCCCCCCCCCCCTTTTGTGACTCTGTCCTGCTTTCCCTTGCTAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAAGAGCATCCAACTGAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGCAAAGAATTCAGCAAAGCTCTTGAGCGCATCGAGCATGCAGATGAGGTTTACTGTGCAGGAAATGAGCTACCTTTAAACTTGACAACTAAAGCAGGAATTTGCCACGTTCACCTTGGAAATATGGAGAAAGCAGAGGTCAGTATGGTCTTCAATGCATAACTACCAATAAGTAAATGTTGTTAACGTTTATTTATCCATAGCGAGCCAACTTAGTATGATTATCCATTCATTTGAAGTTAAGCTATCATTTTCTGTATTTAGTTTGACTTTGTTATTTTATGTCAGAAGTCTTCACATAACACTTTTTTTCCCTTTATATATATATATACATACATATATATATAAATAAATTTATTAATTTTTAATATAAAAAAAACACAAAAACAAAACAACATCTTAGCACACTGTTTAGAAGTATAACAAACAATAATTTCTTCATTACAAGATGTTTTTTGTGGTATGTTTGATTTATGTCTGCTTGGATACTGATGATGGTCAGATTTAGATTAGATTAATAACTGGGGGACATAGTTAGCCTTCTTTATTGGTGGGAAGTTGAAATAATTCAGTTAAAGGCTGTCAGAGTTTTTGCTTTTAGGTTTGTACTTATTTCTGGTTGTGAAAAAAATGATGTTACTTTTTTTCCCCCCTATATCTAATTCATGGTACACTTAGAAATTACGGCCAATAAAGTAATGCCCTCAAGGGCTTAATGTGTTTTCTAAGTTTTTTATG
GAGATGCAACATTTGTATGTATGCCATGTTCCACCATAGGTGGGGCCTCTTCTATTATGGTAGGTTGTGGTTTTTATGTCAAATGATAATTTCTATAGCTTGCTTTGTATACTGTCATCTCTTGGGTGGGAGTGATAATATGTCTGCATCTGTCTTGAGTTTGAAAATATTGTACTTACCAGTTTATATTACCACTAGTGCCTCTTTGCTAATTTGGGACGGGAAACTGCCAATGATCACTCAAATTTGATGATTGAAGCTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTAATGTCCGAAGAAGTAAATGCTGGAGGGAACATGGTAAGCTGATTATGGTTTTCAGTTTTTTCCTTTTAACTGTGGTGTTCGAAGGGAGGGGTGGTGTGTTGACTTTCAAGTTGAACTGCTTGCATTTTTAGCGTAATGTTAGATGACATTTTACATAATTCTATTAGATCTTCTGGTAAGTTTATGATTGCTATATCCACTTAGCATGCATGTCTTAGGCTATGTGATAAGAAATTCTCAAATTACTGTTGGTCGTATTATGTATGCTGCATTTTCTTCAATCGATTATTTTTTTCAGCAGTAATCTTTCCTTTCCCCCCTGCATAATGGGCTCAACCTAATTTTCATACCCACTTTTCCATGTTTCTACATTACATTTCTTACAACTAAACTAATAATTATAATTTATCATCAATATACAACCAGTGATTCTTGTTTCCTTCTCTTGATTTTGCTTTATTTTTTAATGATAAAATTGAGGATTGAACCTCTGTAATTTTGCAGCGTCAATTGCTTAATTCAATTAATTTTATCTAGTTATATTTTTTAATCTTTAAAAGGGTTAGGGTATTAAGATTTATTTTTTTAATTTATTTGAATATTACTCCTTGAGTTTTGTTTTGTACTTTACCCTGATTAACAAGGCCTTACTTTTGTAACAGGGGATTTTATACCTAAAAATTGCCCAGTGTTACTTATCAACCGATGAAAGAACACAGGCAATTGTTTTCTTTTATAAAGGTGAGGCAAAGTTCTCAGGAAGATTTGATTAATGGCAGCTAAAGGGATTTTTTTTTTGGTTGTTGTTGTTGATGATGTTGTCATCGTCGTCGTCGTCGTCGTCAAATTCATATCTCTGCCTGTAGTTTTTTCTTTTATCCATGCAATTCTCGGTTACTTATAAAGATGAGGATATACTTTACTGAATTCACTTGTCTTTTTCTTTTCTTTTCTTGTTTTTCTGAAACAAAAATACACGTTGAAAGTTACCATTAAAGTCTAGAATTGAACCAAGTAAAACGTGAAGTTATATCACAAAGAACCTTGAATCAAACTTACAAAAGCCAAGCCCCTGACTTGTGTTAATTATGATTGATTTATAATTGCAAAATGATAGATCTCCGGGCATCAAAGTGTACACCCGGAGATATAGGAAAAAGAATATGCAATAAGTAGTTAGTAGTGGGCCAGGTTGTTAGGATTTGTTAGTAGTTAGTGGGCCCGGTTGTTAGGATTTGTTAGTAGTTAGTGGGCCCGGGTTGTTAGGATTTGTTAGGGAGAGATTTTGCCTTTATATAAGGGTAAAGAGAGAGGGTTAGGGGATCTTGTGAGGGAAGTAATTTGTAATTGTTTTCTGAGGAAAACTTGAGGAGAGGAAGGGGAAGCTCTTGAATTCTTCCCCGTTCTTTGATAAATAAAGTGTTAAGGCTAAGGCCTATCATTTTGGTATCAGAGCGGTTGTTCCGGGATGGTTGGAAAAATGGAAGCAAGAGTAGTAGAGCTGGAGGAGAAGCTTACGGGGATCCATTGTCAAGTAGGTGAGTTAGAAGAGAAACTCGGCGAGTTGGAAGAGAAACTTGACTCGCGTTTGACAGAAGCAGAGGTGAATCGTGCAAAGGCGGAATTGAGGATAGAGAAGTCGCTAGAGATGATCATGCGACGAATGGAGGCGTGGGGCGGCGAAC
AGCCAGGCGAAGCTTCGTCTGAAAGAACTGTTACGGATAAGGGAAAGAAACTGAGGGAGGAAGATACGAGCCAGGAAGGCCAGGGTACGCGCCTGAAAAAGGGAGAGGGAATCAGTGCGCCGACTGCTCCGAATAATTCGGAAGTACCGTTGTTCGATATGAGGTTGCGTAAGCTCGAGGTTCCAGTATTCAAAGGTGAGGTTGGGGAGAATGCAGATGGCTGGTTACACCGAGTTGAACGCTATTTTGTGGTGAATCGGCTTACGGAAAGGGACAAGTTGGATGCGGCGATACTGTGTTTAGAGGGAGAAGCCCTCGACTGGTATCAATGGCAAGATGGCCGGTCAAAAATAGAGAATTGGGCAGAGTTCCGGCAATTATTGTGGAGGCGATTTAGGCCATCCGATCAAAGCGATAAGCATGCTCGGTTAATGAAGTTGCAGCAAGACACAACCGTCAGAGAATATCGTCGGCGCTTCGAGCAGTATTCGGCGGGATTAAAGGACATGAGTGATGCAGCTCTGGAGAGTAAGTTCGTGTGTGGGCTCAAGGAAGATATTCAAAGCGAGATGCGAAAATTAAATCCAGGGGGCCTCGCGGCTAAGATGGAAATGGCCCAAGTCATCGAGGACGAATTGGCGGTGGAATTAAAAAGGATCCAGGGCGCGGCGAGTATCCAGACCTCAAATAAAAATGCGGCGAGCTCCCTTCATGCGACCACCGGTAGCGGCTCATCTGGAATTGGGTACGCGGCGAGTTCTACCGCAGGAGGGCAAGCGCGAACGATTACCATCGGTCCTCATCGCGCGTCGACTACAGCGTCGACCTCCGTCCGAGCGTGGGGTGGGAAACCGGCATCGTCAACGGCGCCGTATCGGAGATTGACGGACAGTGAGATGCAATTAAAGCGGGAAAAGGGGCTATGTTACAGATGCGATGGGAAATTCAGCGCTGGCCACCGTTGTCCCAAGAAGGAGTTGAATATTATCGCTGTCCAGGAATGTGGGGATGTCAGCGAGAATCTGAAAGAGAAGGATGTGGTAGCAGAGGAGGAGGAGAGTATGGAGCCGTCAGAAGCGGAGATTGCAACCTTGTCCTTGAATTCGCTCGCGGGGGTGGACTCTCCCAAAACTGTCAAAATTAAGGGGGAGATCCAAGGGAAGGAAGTGGTGGTATTAATTGATGGGGGAGCAACGCACAATTTTATATCTGAGGATGTGGTTGAGCAACTGAAATTACCGGTGTCGCCTTCTGAAGGGTATGGTGTAATGCTGGGTACCGGGGGAACCGTGCGCGCGGCTGGGATTTGTCGGGGGGTGCTTTTAACCATCTCTGAATTAGCTATTATGCAGGATTTTCTGCCGCTGCCGCTGGGCAGTGCTGACGTTATATTGGGGGTAACATGGTTGGAAACGTTGGGTACGATCCAGTTTGATTATCGCCTGTCAGAAATGGATTTTTGGATTGGAGGATGGCAGGTGCATCTATGCGGCGACCGGAGCTTAGTCAAATCCCAAATTTCGCTCAAATCCATGATGAAATCGTGGGCGCAGGAGGACCAAGGGATGTTGATAGAGTTGAGTGCGGTGGAGCGTTTGGTGATGGACGCCTCTGGGGATTGCCGCGAGGAGGCGTTGACATCGACACCACCGCAAGTGGCCAAGTTGTTAAGAAAGTTTGCGGTAGTCTTTGAACCGTTGCACGCATTACCCCCTGAACGCCAACATGATCATGCGATAGAACTCCATCCTGGGGCAGGTCCGGTTAATGTGCGGCCATATCGGTATCCCCAGTTTCAAAAGGATGAGATTGAACGCTTGGTGACTGAAATGTTGGCCGCAGGGATCATTCAGCCTAGCCGCAGCCCGTTTTCAAGTCCAGTCTTATTAGTCAAGAAAAAAGATGGTAGTTGGCGCTTTTGTGTGGATTATTGCGCACTCAATGATGCGACCGTTTCTGATAAATATCCAATCCCCATGGTTGACGAGTTATTGGACGAG
TTGTGGGGGGCTGTGGTATTTTCTAAGATCGATTTGAAATCTGGGTATCACCAGATACGAGTTCGCACAACTGATGTGCACAAAACAGCATTTCGCACCCATGAAGGACATTATGAATTTGTGGTGATGCCCTTTGGCCTGAAAAATGCGCCGGCGACTTTCCAATCGATCATGAACGATATCTTGCGGCCACATTTGCGAAGGTTTGCGTTGGTCTTCTTCGATGACATCCTGATTTATAGTCCATTGCTGGAGGAGCACATCACTCATTTGGAAATAGTGTTGAGGTTGTTGCGGGAGCATCAGTTGGTGGCCAACTTTAAAAAATGTCAGTTTGCGGTGGATAGGATAGAGTACTTGGGCCATATTATTTCAGCGAACGGTGTCGCCGCGGATCCGGTTAAGGTCGAGGCCATGGTGAACTGGCCGCCCCCTAAGAATGTTAAGGAGTTAAGAGGCTTTTTGGGGCTCACGGGATACTATCGCAAGTTTGTGGCCAACTATGGCTCCATCGCATTGCCCTTAACGCAACAGCTGAAGAAGGGGGGATTTGCGTGGAATACAGAAGCGCAAGAAGCTTTTCAACGCTTAAAGTCAGCGATGGTTGACATTCCGACCCTGAGTATTCCGGATTTCTCTCAGCCATTTGTTGTCGAAACGGATGCGTCAGGGATAGGTGTGGGAGCCGTGCTGATGCAAAACCAAAGACCTCTTGCCTACTTTAGTCGTGCCTTGCCCCCAACTCATCGCTTCAAGGCCGTCTATGAACGCGAATTGATGGCAATTGTGTTTGCGGTACAAAAATGGCGTGCGTATTTATTGGGGCGCCGCTTTGTGGTCCGCACTGATCAACGCAGTCTCAAATTTTTGTTGGAACAACGCGTGATCGCAGGGGAGTACCAGCGCTGGATAGCGAAATTGATGGGCTATGACTTCTCGATTGAATATAAAAAAGGCCGAGAGAATTCAGCAGCCGATGCATTATCTCGGATGCCGCCTGCCATGGAATTCGGGTTTTTGAGTGTGATTGGTGGCATCAATACTTCCGTGTTCGAGGAACAGGTTCGAGAGGATGCTGGCCTTCACGCGATTGTGTTAGCTCTTCGACGCAAACAGTCGGCGCCAGCGGGGTATGCGTTACGCGGGGAGTTGTTGACTTTTCAAGGGCGTTTGGTCCTCCCAGCTACTTTGCCAACAATTCCCCCCCTTTTAGCTGAGTTCCATAATAGTCCGGTGGGGGGCCATCAGGGTAGCCTGAAGACATATCAACGCTTGGCGCGTGAGGTGTATTGGACGGGAATGAAGGCTCGCGTTCGTAGCTTTGTGGCCGCATGCACCGTGTGTCAGCAGGCAAAGTACTTAACGTTGTCTCCCGCAGGCCTATTGCAAGCTCTCCCCATTCCCGACCAAATTTGGGACGATATTAGCATGGATTTCATCGAAGGGTTACCTAAGTCTGATGGTTGGAATACTATTTTGGTAGTAGTTGATCGCCTGTCCAAGTATGCCCACTTTATACCTTTGAAACATCCATTCTCCGCTAAGTCCGTGGCAGCAATTTTTGTGAAGGAAGTGGTGCGCCTACATGGTTGTCCTCGCAGCATTGTATCCGATCGGGATAAAATTTTCACTAGTTTGTTTTGGGAGGAACTCATGCGTTTACTAGGGACGCAATTACGGCGCAGCACCGCATACCATCCTCAAACAGATGGTCAAACGGAAGTGGTTAACAGAGGCGTTGAGACATATTTGCGTTGCTTTACCATGGGTACACCCAAACAGTGGTCCAATTGGTTACCTTGGGCGGAATTCAACTACAATACGGCTGTCCATTCCTCGTCGCAAATTACGCCGTTTGAAGCAGTTTATGGTCGCCCGCCGCCTTCCTTGCTTCCCTATGACAAAGGTGATAGCGTGGTACATGAGGTTGATTGCCTCTTGCACGAACGCGACGAGATGTTAAAAAAAATGAAGGCGTCGTTGCAGCGGGCACAACAGCG
CATGATTAAGGCCGCGAACGCCAAGCGGCGAGATGTGCAGTTCGATGTCAATGAGTGGGTTTATCTGAAACTCCGCCCATATCGCCAGTCTTCCATCCATCGCCACGCGCATCCCAAATTGGCTCCACGATTTGTGGGTCCGTTTCAGGTCATCGCAAGGGTTGGTCCAGTGGCCTACCGCTTGGCCTTGCCCCCTAATTCGAGTATACATCCGGTGTTTCATGTCTCTGTCCTACGCAAGGCTGTGGGCGCCTCCTTACCAGTTTTTTCCCTTCCACTCAAGCTAGCGCCTGACTTGTCGGTGGCAATTTCGCCCGAGGCAGTTCTCGGGATGCGACATTCCTCCTCTGCCGATCAAGACATGGAAGTTCTTATCCAGTGGGCGCATACTCTTCCGGAGGATGCGACGTGGGAGCAGGCTGACTGGATCCGCACGCAATTTCCGGACTTCCACCTTGAGGACAAGGTGGCTCTTTGGCGGGCGGGTAATGATAGATCTCCGGGCATCAAAGTGTACACCCGGAGATATAGGAAAAAGAATATGCAATAAGTAGTTAGTAGTGGGCCAGGTTGTTAGGATTTGTTAGTAGTTAGTGGGCCCGGTTGTTAGGATTTGTTAGTAGTTAGTGGGCCCGGGTTGTTAGGATTTGTTAGGGAGAGATTTTGCCTTTATATAAGGGTAAAGAGAGAGGGTTAGGGGATCTTGTGAGGGAAGTAATTTGTAATTGTTTTCTGAGGAAAACTTGAGGAGAGGAAGGGGAAGCTCTTGAATTCTTCCCCGTTCTTTGATAAATAAAGTGTCAAGGCTAAGGCCTATCACAAAACCTTCAGCTCCTATCAATAAAAACTGGAGAAGAGGCATGAAACTTTGCTAAATCAAGCACCTCCCACCAATCTTTTGCCTTTATGTTAAAAATTATGAAAAGCTCCTTTCCAGCCAATGGAAGATTTGACGACATTGGCCCATAAAAGAACTTTTACCTCTATCTCTTTGTAGTTTCTTCTTCTTCTTTTTTTCCTCCTTTATTTAATTATCTATTTTTATTTTGTTGCTTTATAAGGGATGACCACAGAGAAAATAATCCACATTTCCCTCCAAAGAAGAGTTTTAACTCCAAGATAAACTCAAAATTTATGGCAGTTGGTACCAAATTTTAAGTGAGTTTGTAAAGTACTTGACAAATGTTCCTTTGCAGTGCTTCAACATCTTGAAGATAACATTAATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAGTATGTGTTCATTAAATAACTACCGTGTGCTTTATCACTTGCAAGTTAAAAGCTGGATGGAGCTAGCTGGTATTTCTCACAATGCTCATTAAACGTTTTTGGGTACTTTTGACTTTATGCAGACCCAACTAGCTCATCTTCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCAAATATACAAAACTAGAGGAATGCTTGAGAACTTCGTTGAGGTGATCTTTCCTTTGGTCCGAGAGTCCTTATATATTGAGACTCTTCAAGAAAAGGTAATTAAAATTACGTGGCAGAAAACCTCTTCCTTTAGTGTGGTCCTTAACTCTTACCTAGTTAGAACGAATTTTTAGTCTAGGCTTTGAAAATATGAATACATAAGGGTAATATTGATGTTCACCTATTTGCCTATATTCATTATGATGCCTGCTTGGATCTCTAAACTCTAGGTCCTTTCTCATGTCCTAAGGACAAGTTTTCTGAAGCAATAAATTGCGATCAATCTCTTAGTTTTTATTGATCAGACTAATCATGAGTTGTATTGTATGGAAAGATGTTGAAAATACGTGGCCAAGAAGAGGCAATACACAATTACCGTTCATGTTGATTTAAATTTATGTCAGTTAAATTTGTTTTTTCTCAATAAAAAAATTGTTTTTGTAGTTTAGGGTATAGC
TCATCTGGCATACAACATGTGTCATTGACAAAGAGGTCAAAATTTTGAATCCTCCTCCTAACATATTCTTTAACTCAAAATTAGTTGTTTTTGGTAGTTTAAGTGTCTTTTCATGTTTATCTGCAATATGGACACTTGGTTTTAGAAATCAATTATTGGCTCTGAACTTTTAGGATCACCGTTGGATTGACAATACAAGAGACCTCTATTTTTTATTTATTTTTTATTTTTAAAATTTTTTAAAAGTCATTGAGAATGAAAGAAGAAAATAAAGATCCAGGGGAAAAAGAGTTGTTTGGGGTGGGAAGAGATTGAATTTTATTGGTTATCCCTTGGCGTTACATTACCATAATCCATCAGCTTGCGCTTTTTGGTCAATCGGTTATTTGATATGGTATCAAAGCAGGAGGTCCTGTGTTTGAACTCCTAGAATGTCATTTCTTCTCCAATTAATATTGATTTCTACTTGTTGGGTCTTATGCAAACTTTCAAGACCACAGGTGCGGGAGAATGTCAAAGTTTTGATATAAACTTTTGTGTCAATTGGTGATTTGTCATTATCTTCTTCTTATAGAAAGAAATGCGACGTGAATTTGCAATAATTCACCAAGGCCCACCTAAAATTTTCTTATTATCAAAAGAAATGTGAGATGGGAATTTGCATTAATTCACCAAGGCGCGGGGACACAAACATGTCACGGATTCAACAACATGTCAATCTCTTAAAAGCATGTTGGAGACATCTTTTCTTTAAAATTTAAAAAATTATATAGAATATATGTGTATGAATTGATATATCAATACTTTATAGACAAACCATGTAAATCTAACATAAAAAATAAATTGTGTTTCACGAATAACATATAAAATTAAGTCCATTAATGATATTTGAGAACTTAAGATTTAGTTGGGTATGAATAGAGGAAGAAGATGTAGAAGGATTTAGTTGGGTATGAATAGAGGAAGGAGATGTTATCCTAGAACTTTAGGAATGATCCATATATAAGAATTAGTGTCCATAAGAACTTCTCTATTTTTTTCCTCCTACTTTGTTAGTTAAATTTTTCCAGAGCTTTGTTTAGTCGTGTCAAGCTGTATCCAAGCGTGTCCAACACGTGTCTGGACGTATCCATGCCATGCTTCCTAGATCAAGGCTAAATATTTGAACGTTCTTTATTATTTTGACGTTGATCCTTGTCCTTGCTCTTTGTCAGATTCAGAATATATTAGAAAATCAATTTATAGTGAAGGCCTATTGCCTTGTTTTTGATGCTCTTGAAAAGTATATACATTACAAGCAACTGACAGCGAAGGCCGTATATTCTGCGGGTCCTATTTCTTCCGGTGACTTGTTTCACTGTCGGTTTTTGTTTCTTGGATATGCCCACTTCAATGAGCAATTTTCCTTATTAGCATGTTTATAGTTAGGGCTATTATTTAAAAATTAAGATATCTGTTATCTGTAACCATTAAAAAATCTTTAGATTCTCTATCTCTCAAATTTCTTCAATTTATTTATTTGTTTTAACTTTTTCCAATTCTAGATCCCATGTCATTTCTTTAACAACAACTTGTCTGGGTTTTAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAAGAGTCAAAGTATTAGATGGACGTGAGACTGGTAACCTATTTCGTGGATTCAAACCTGTGGCTCCTAAATCAGATTTGTAAGTACTGAAGGTCTTGGCACTATGTTGAAGGAATATAATTTTTGACACAATTTTCTTACTTTCCTCAATAAGTCTGAATTATGAACAATGTCTCTTGAATGGGAATATGACTGCCTATAGTGAAGTAACATGATTGTTTTGGACATTAGGGTAATTTTAACTTTGTAAATCACCTGGAATTCTGGATGAGAATGTTGCAATGTAAATTGTAAGACGTATTTGCAAGTGATTCTAGAATAGTTAAAATCACTTTTGTTATTTTCAAAATCACTTTGAAACATGTGTTTAG
TCATTCAAAACTAATTTTGATGATACAAAAATTGCATTTAAAAGTGCAAAATTAAATATTAAATTAATTTTTTGAGTGATTACAGGCATGTTTCGAAGTGAGTTTGATCATTAAAAAAGTCATTTAAACAATTTCAAAGTTACTCCCAAACATTCACGTAGTAATATTACTACTCTCTGGTTAGTATTGTCAGGCTCTTCACCCGTTGGTTAAAAATAGGTCTTGATGCTGATCGTTGACTTGAGCTTTTATTTCCTTTGTGGTCACCAGTTCTAGACTTCTAGTTCCTGCCCTTTCTCCATTTTGCCAAGAAAATGAAATCTAATCGGTCATATTCCGCAATGCTAAAATGGTATCTTCTCTTAATTTGTAACTAGCATCTCCAATTTCACATCTGAAATACTTCATTAATTCTAAATGTATCTAGTAATAAGCCTAAAAGGTTTCTCAACATTCGGTCCTGACTAAAAAACAGCTCCTATTTAAATTGACCAAAAAAAAAAAAAAATCCTATTTATGGCCTCAAAAGGCAACCATTTAAGTTCTGAATTTCCATAACCTTTGCTTCCTGTTGCTTGTCTTTTTATTTTTTATTTTTGAAACAGAAACAAGACCTTTCATTGAATTAGTGAAATGAGTCTAATGCTAAAAGTACAATGAAACAAACAGAAAAAAGAGAAAATACAACAAAGATAATGGATCAACAAGAACAAAATGAAAGATAAAGGAGACCAATCCTTAATACAACACCCAATAGATAAAAAATTCCAATCTGCCAATCTTCTAGCAAAGTACAAAGCTGGAGACTTGAATGCCATAAAGCTTTGAAAGAGCGGGAACTTGAAAAGGAGTTCAAAAGCAGACTGACTTGCCTACTGCCAACCCTAGAGTTCTAGCCAGCCACACATAAAACACCAAAACCACCCCACCAATAATAAGACCACAAATGGAGGAGAGCACTAGAATCCGAAAAATAAAATAATGTGTTGAAGACTTGCACCACAAACTAAAATACTATAAAAACTCCACAGCTGAAACTGACCAAAATAAAATTTCTAATCATAACACTATATAATCAACCACAACCCAAATAAAACAACCTCTTTTTATAAATCTTGTTTCTTATCTTTATATACCATTTCAACATCTTTGCTGTGAAGGAGCATATTAAAACTTCGAACTGTGCTTGAATTAGATCAAAAGCGTCCAGAGCAAAGAGATTGCTTCAAAAGAGGGAAAGAATCAAGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCGGGAGTCGATGTGAACTATGATGATTTAGATGATGAGCCAGCGGTAACTTTTTTGGTTGTTTGCTAATTTTTATTGCGCATGAGCTATGCATATGCTAGCAGAAATTCAACTAAGAGTGCTTCTCTTTTCATCTGTGATGTCTCTAGCTACGGATGCACCGAGAATCCCCCCTGCCTAATCTTCTGAAGGAAGAAGAATATCATAATCTTATTGTTGATGTTAGTACCTCTTACATTTTTCTCAATCTTTATACTTTCCTCTTTGTTTATCATTCCATCATGCCAAATAAACATTCATCATGTATCCATCTTTATCCCATTTGTTTGATTACCTAACCCCCATTTAATCAAACACCTAATGCTTTCCTTGTGATTTTCATAGTTGTGCAAGGCATTGGCTTCCTTGGGAAGATGTTCTGAAGCTTTAGAGATTATAAGTCTAACTTTAAAGTTGGCTTTTAACTCGTTATCCATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCTCGTAAGTATTTTTTGCCAGTTGTATCACTTGTTACATCTATGTTTCCTTTTTTCCACTTTGGTGCTAGATTTGTTGCTTGCTTCAAAAGGAGATGGCCGCCTAAATTCTCTTGGGTGAAGGCCTTGAATTATATTCTCTCATCATACCTCGTGAATTCAATTGTTTGATTTTCTGAACTTGAGGCATATATCTGCCTCTTTCTCATTGTTAACTTC
ATGTTCACAGAATTAGCATTCAGCTCAACTGATACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTATAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCAAGGTATGTCAGTCTTTGATATTTTAGTTGGTTTGAAACAATTATCCATCTGCATTTGTTGTACTTAAAAATATCTACCTGCATTTGAGATATATTTTATTCCAAAACACAAAGGATAGAAAAAATTGCAGGGCAATAAGAGGGATCTCCTCCCGAACACGGAGGATACCTTTTCGTCCCCTCAATGAAAGCTACTTTCTCATAATTCCCCTCCCCAATTAGGAGGAGGCAGTGTGTTTTCCTCACAGAGAACACCCAAGTTTCTACTAAGATTTTTAGTGCTTGTTTGGTAAGTAATCTGAAAATAGAAACTTAAAAAATAAGGGTTCAATGAAAACGAACTTGTATTTCATGTTTTCATATATGTGTTTGGTAGCATTTTCATAAATTAGATCCCTATTTAAACAATTGTTCAAAATATGTTTTAATATTTATAGACCATAAAACTAGTCGTAGTTACCCACCAAATATTAAGTTGAATATAAACTGTTATTATTTAATTTATGAACATAGTTTATGTTAAAAAATTTGTATAATTATTCTTTTACATATCATATAGTTCATAATAATATAATATAATATATTATATATATTTTATTTTCAAAAAAATAAATGTATATATTGAAGTTTTTATCCTTAATACAATTCTGCATTCTTAAATTTTCTAATTCAAGAGATGTGGATAAAAAAACATGAAAATGTTGTTTCCAAAGTTTTTACTATTTGGATCACGATATCCAAAAAAACAGTTTTTAAAAAAACACATCTATCAAATGCATATTCGTTGAACTCTATGAATCAGAAAATATAAAACCGAAGGTTCCAATAAGTTGCAGCAGATTTGGAAACTATATAAGGGCGGTGTATTGTTTCTTCCTTGCATACACAATTTGGATGTACTTGGGGGAAGGAGAAAATAGTGTGGGTTCCTCTGGGTTCTGTCTTCAGTATTAGTACTCAGGTAGCAGAGAACAATAATAAAATTTCAAATTTTTTGTGGACCTTGGTATTTCAAACCTAATTTGATACTGGAGAAACAAAGGACGGCCAATTTACACATGAAAATTCCTGGGGGGTCTCTACTCTCTATACTTTGCCGTTCTCATTTTCGTTGGGGGGCAGTATTTTCTAACTTATTCACAAGAGTCAGCCCTCGAATTCCATTTCTCGCAAATTTCACTCTACCCGAATTTCCTATCAAAGTTGCTTTAATTCCAGCACTTTCAACCATTTTTTTCCTTCCATTTTATTAGAATATATATCACAAATACGAAACCTATGACCCTGAGGAAGTTAGAGATCCTTTCTTTCTAATAAATTAAAATCAGCGAAACCATTTTATCCTTTGGTGAGAGTTTTGACCTACAAGTTTGGAAAAAGAGATTTATTTACTCCACATCTTTTCTTTCTAATAAATTAGTACTTCCATAGGAGCATTATGTTTGTTTGTTTGTTTGCTTTTTTTTTTTCCTTTCTTTTTCTTATCCACAAATTACTACACTTTCCTGTATTCTAGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTATATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTACAAAATCATGCCGGATAGTCCCCTGATTAACTTGTGTGTTGGTATGGCTCCTGTTCTTCATGTCTCTTTATAATTAGAACTACATTTATAAACTCTTAATTTTAGATACTCATTGCTTTCACGCTTGCTATAATTTTCTTCCTTGGCCTAAATAGGAGCATCCTTAATCAACTTGGCTCTTGGATTCCGTCTTCAAAATAAGCATCAGTGTG
TTGCACAAGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAACAGCCAGGTATTCTATTATTATTTTGTAATTACATGAGTCGTAATGTTAATTAAAAAGTTCGACATGAACAAGCTACCCAGGAAAATGAAATTGTTTGTCATTTAAGCTACAATTTTTCCCTTCTATTTTTCTAAAGTAAGAAACCGAAAGAATATACAAGGGCATTAAACAAGTTATGGTCATAAAAGTTGTGATGCAAAAAAATATGTAGTATTTCCTATGGAAGGAGAGCTAGATGTTGCAATAGCCTAAAGAAAGGGTCATTAGTTGGTTGGTGGTAGGTGGGCAGGTAAATGGATGCGAAGACCGAGCGGATAGTATAGTTTTGGGGAAAATTAGAGGGGAGGGGCTAAGCAAATGCCTATAACTTGCATTGACATTACTGTAACTTTGCTACAGCCACCAACAAGTGGTTAGCTCATTTGGCCATTAGCAAGAAATCTGGTTGGTTTTGGTATACATATGTTTAACCTAGGCGAACTGAGTTTATAATTTCCAGTAGATTGATCCAGAACCTGATTTTCCAACCATTCTTGTGAACATGATGTGTTTTGCCCTGTTAATTATCTGTAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATCGGACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCCCATTCCAGAACTTTTTGGTGAGAATCAAAACATTAAGCATCAGAAATCTGTCTATTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTGATATATAAAGAGAGTGGAGCTCTTGATCTTGCCAGGCAAGTCCTAAAAGATCATTGCACATTTTAA

mRNA sequence

ATGGAAAAGGAAGGGAGTAAAATTTCTGACAATGAAGAGGTTCCTGGTGGTGTTATGCGTGTTTTAGGAGCAGAAAAAGAGGTTGTAGAAACAGGAGTGGAGGCTAGAGAGGAGGAGGAAGAGGAAGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGTTACATATTCAAATTTAAGGCCGGAGAAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGTCAGAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGATATTCCTGGTGCAAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAAAGAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACGAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGAAGGCTATATCTATATTGCGTCAAGTTGTTCTGCAAGCCCCAGACTTACCTGATTCGTACCATACGCTTGGACTTGTATACAATGCAATTGGTGATGATGTAAAAGCCATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGATCGAGGTGATATTGATCAAGCAAGCTACTGTCTTTCTAAAGCAATAAAAGCAGAGCCTGATGACATTAATTTATTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACATATGATCAGATTCATCAACAATGCCTTGGAAATGTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAAGAGCATCCAACTGAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGCAAAGAATTCAGCAAAGCTCTTGAGCGCATCGAGCATGCAGATGAGGTTTACTGTGCAGGAAATGAGCTACCTTTAAACTTGACAACTAAAGCAGGAATTTGCCACGTTCACCTTGGAAATATGGAGAAAGCAGAGTGCCTCTTTGCTAATTTGGGACGGGAAACTGCCAATGATCACTCAAATTTGATGATTGAAGCTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTAATGTCCGAAGAAGTAAATGCTGGAGGGAACATGGGGATTTTATACCTAAAAATTGCCCAGTGTTACTTATCAACCGATGAAAGAACACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATCTTGAAGATAACATTAATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTAGCTCATCTTCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCAAATATACAAAACTAGAGGAATGCTTGAGAACTTCGTTGAGGTGATCTTTCCTTTGGTCCGAGAGTCCTTATATATTGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAAGAGTCAAAGTATTAGATGGACGTGAGACTGGTAACCTATTTCGTGGATTCAAACCTGTGGCTCCTAAATCAGATTTATCAAAAGCGTCCAGAGCAAAGAGATTGCTTCAAAAGAGGGAAAGAATCAAGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCGGGAGTCGATGTGAACTATGATGATTTAGATGATGAGCCAGCGTTGTGCAAGGCATTGGCTTCCTTGGGAAGATGTTCTGAAGCTTTAGAGATTATAAG
TCTAACTTTAAAGTTGGCTTTTAACTCGTTATCCATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCTCAATTAGCATTCAGCTCAACTGATACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTATAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCAAGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTATATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTACAAAATCATGCCGGATAGTCCCCTGATTAACTTGTGTGTTGGAGCATCCTTAATCAACTTGGCTCTTGGATTCCGTCTTCAAAATAAGCATCAGTGTGTTGCACAAGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATCGGACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCCCATTCCAGAACTTTTTGGTGAGAATCAAAACATTAAGCATCAGAAATCTGTCTATTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTGATATATAAAGAGAGTGGAGCTCTTGATCTTGCCAGGCAAGTCCTAAAAGATCATTGCACATTTTAA

Coding sequence (CDS)

ATGGAAAAGGAAGGGAGTAAAATTTCTGACAATGAAGAGGTTCCTGGTGGTGTTATGCGTGTTTTAGGAGCAGAAAAAGAGGTTGTAGAAACAGGAGTGGAGGCTAGAGAGGAGGAGGAAGAGGAAGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGTTACATATTCAAATTTAAGGCCGGAGAAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGTCAGAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGATATTCCTGGTGCAAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAAAGAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACGAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGAAGGCTATATCTATATTGCGTCAAGTTGTTCTGCAAGCCCCAGACTTACCTGATTCGTACCATACGCTTGGACTTGTATACAATGCAATTGGTGATGATGTAAAAGCCATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGATCGAGGTGATATTGATCAAGCAAGCTACTGTCTTTCTAAAGCAATAAAAGCAGAGCCTGATGACATTAATTTATTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACATATGATCAGATTCATCAACAATGCCTTGGAAATGTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAAGAGCATCCAACTGAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGCAAAGAATTCAGCAAAGCTCTTGAGCGCATCGAGCATGCAGATGAGGTTTACTGTGCAGGAAATGAGCTACCTTTAAACTTGACAACTAAAGCAGGAATTTGCCACGTTCACCTTGGAAATATGGAGAAAGCAGAGTGCCTCTTTGCTAATTTGGGACGGGAAACTGCCAATGATCACTCAAATTTGATGATTGAAGCTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTAATGTCCGAAGAAGTAAATGCTGGAGGGAACATGGGGATTTTATACCTAAAAATTGCCCAGTGTTACTTATCAACCGATGAAAGAACACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATCTTGAAGATAACATTAATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTAGCTCATCTTCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCAAATATACAAAACTAGAGGAATGCTTGAGAACTTCGTTGAGGTGATCTTTCCTTTGGTCCGAGAGTCCTTATATATTGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAAGAGTCAAAGTATTAGATGGACGTGAGACTGGTAACCTATTTCGTGGATTCAAACCTGTGGCTCCTAAATCAGATTTATCAAAAGCGTCCAGAGCAAAGAGATTGCTTCAAAAGAGGGAAAGAATCAAGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCGGGAGTCGATGTGAACTATGATGATTTAGATGATGAGCCAGCGTTGTGCAAGGCATTGGCTTCCTTGGGAAGATGTTCTGAAGCTTTAGAGATTATAAG
TCTAACTTTAAAGTTGGCTTTTAACTCGTTATCCATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCTCAATTAGCATTCAGCTCAACTGATACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTATAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCAAGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTATATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTACAAAATCATGCCGGATAGTCCCCTGATTAACTTGTGTGTTGGAGCATCCTTAATCAACTTGGCTCTTGGATTCCGTCTTCAAAATAAGCATCAGTGTGTTGCACAAGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATCGGACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCCCATTCCAGAACTTTTTGGTGAGAATCAAAACATTAAGCATCAGAAATCTGTCTATTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTGATATATAAAGAGAGTGGAGCTCTTGATCTTGCCAGGCAAGTCCTAAAAGATCATTGCACATTTTAA

Protein sequence

MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
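
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; `translate` and `CODON_TABLE` are names chosen for this example, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, stop_at_stop=True):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First and last codons of the CDS listed above.
print(translate("ATGGAAAAGGAAGGGAGTAAAATTTCTGAC"))
print(translate("GATCATTGCACATTTTAA"))
```

Passing the full CDS string (with its line breaks removed) to `translate` should give a sequence that can be compared against the protein listed above.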
Homology
BLAST of HG10002345 vs. NCBI nr
Match: XP_038879313.1 (general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879314.1 general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879315.1 general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879316.1 general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879317.1 general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879318.1 general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879319.1 general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879320.1 general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879321.1 general transcription factor 3C polypeptide 3 [Benincasa hispida])

HSP 1 Score: 1707.6 bits (4421), Expect = 0.0e+00
Identity = 882/941 (93.73%), Positives = 897/941 (95.32%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDI 60
           MEKEG+ +SDNEEVPGG M VLGA KEVVETGVE R  EEEEEEEEEGEEEVEDEGEDDI
Sbjct: 1   MEKEGNTVSDNEEVPGGNMSVLGAGKEVVETGVEDR--EEEEEEEEEGEEEVEDEGEDDI 60

Query: 61  EEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKR 120
           EEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKR
Sbjct: 61  EEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKR 120

Query: 121 GRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG 180
           GRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG
Sbjct: 121 GRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG 180

Query: 181 QHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL 240
           QHEKA+SILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL
Sbjct: 181 QHEKAVSILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL 240

Query: 241 FSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG 300
           FSWSIDRGDIDQASYCLSKAIKAEPDDI+LLFHRASLYLERGDCEKAAETYDQIHQQCLG
Sbjct: 241 FSWSIDRGDIDQASYCLSKAIKAEPDDIDLLFHRASLYLERGDCEKAAETYDQIHQQCLG 300

Query: 301 NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALER 360
           NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMG+KEF KALER
Sbjct: 301 NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGNKEFRKALER 360

Query: 361 IEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADS 420
           IEHADEVYCAG+ELPL LTTK GICH HLGNMEKAECLFANLG ETA+DHSNLMIE ADS
Sbjct: 361 IEHADEVYCAGSELPLKLTTKEGICHAHLGNMEKAECLFANLGWETADDHSNLMIEVADS 420

Query: 421 LLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDN 480
           LLSLKHYNLALKYYLM EEVNAGGNMGILYLKIAQCYLST+ER QAIVFFYKVLQHLEDN
Sbjct: 421 LLSLKHYNLALKYYLMFEEVNAGGNMGILYLKIAQCYLSTNERAQAIVFFYKVLQHLEDN 480

Query: 481 INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTR 540
           INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWW NEKVKLKLC IYKTR
Sbjct: 481 INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWFNEKVKLKLCHIYKTR 540

Query: 541 GMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP 600
           GMLE+FVEVIFPLVRESLYIETLQEKIKVNKKKLP+RVLLERVKVLDGRE+GNLFRGF+P
Sbjct: 541 GMLESFVEVIFPLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRESGNLFRGFRP 600

Query: 601 VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------ 660
           VAPKSDLSKASRAKRLLQKRERIKEEKKA+ALAAGVDVNYDDLDDEPA            
Sbjct: 601 VAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVDVNYDDLDDEPALRMHRESPLPNL 660

Query: 661 ------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSS 720
                       LCKALASLGRCSEALEIISLTLKLA NSLSIERKEELQLLGAQLAFSS
Sbjct: 661 LKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLALNSLSIERKEELQLLGAQLAFSS 720

Query: 721 TDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYI 780
           TDT HGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYI
Sbjct: 721 TDTTHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYI 780

Query: 781 IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGL 840
           IAGHQFT ISHHQDAARKYLEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGL
Sbjct: 781 IAGHQFTNISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGL 840

Query: 841 AFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI 900
           AFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Sbjct: 841 AFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNI 900

Query: 901 KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918
           KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Sbjct: 901 KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 939
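
The percentages in an HSP summary line are simple ratios over the alignment length (the number of aligned columns, gaps included). The sketch below recomputes them from the fractions; `hsp_fractions` is a hypothetical helper written for this example.

```python
import re

def hsp_fractions(header):
    """Return {label: (count, aligned_length, percent)} for each 'Label = a/b' field."""
    out = {}
    for label, a, b in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", header):
        a, b = int(a), int(b)
        out[label] = (a, b, round(100.0 * a / b, 2))
    return out

stats = hsp_fractions("Identity = 882/941 (93.73%), Positives = 897/941 (95.32%)")
for label, (count, length, pct) in stats.items():
    print(f"{label}: {count}/{length} = {pct}%")
```

The same function applies to any other hit's summary line, since it only relies on the `a/b` pattern.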

BLAST of HG10002345 vs. NCBI nr
Match: XP_011652346.1 (general transcription factor 3C polypeptide 3 [Cucumis sativus] >KGN59936.2 hypothetical protein Csa_001841 [Cucumis sativus])

HSP 1 Score: 1686.4 bits (4366), Expect = 0.0e+00
Identity = 870/942 (92.36%), Positives = 896/942 (95.12%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDD 60
           MEKEGS+ISD EEVPG VM VLG EKEVVETGVE RE EEEEEEEEEEGEEEVEDEGEDD
Sbjct: 1   MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDD 60

Query: 61  IEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAK 120
           IEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALAEKKRKALANGQSERAAK
Sbjct: 61  IEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAK 120

Query: 121 RGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ 180
           RGRVEDI GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ
Sbjct: 121 RGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ 180

Query: 181 GQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKL 240
           G+HEKAIS+LRQVVL+APDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKL
Sbjct: 181 GEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKL 240

Query: 241 LFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL 300
           LFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Sbjct: 241 LFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL 300

Query: 301 GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALE 360
           GNVEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLASLYMGSKEFSKALE
Sbjct: 301 GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALE 360

Query: 361 RIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAAD 420
           RIEHAD VYCAGNELPLNLTTKAGICH HLG++EKAECLFANL RET  DHSNLMIE AD
Sbjct: 361 RIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYDHSNLMIEVAD 420

Query: 421 SLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLED 480
           SL+SLKHY+ ALKYYLMSEEVNAG NMGILYLKIA+CYLST+ER QAIVFFYKVLQH+ED
Sbjct: 421 SLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVED 480

Query: 481 NINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKT 540
           NINARLTLASLLLEEARD+EAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLC IY+T
Sbjct: 481 NINARLTLASLLLEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRT 540

Query: 541 RGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK 600
           RG+LENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
Sbjct: 541 RGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK 600

Query: 601 PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA----------- 660
           PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGV+++YDDLDDEPA           
Sbjct: 601 PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPN 660

Query: 661 -------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFS 720
                        LCKALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQLLGAQLAFS
Sbjct: 661 LLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQLLGAQLAFS 720

Query: 721 STDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPY 780
           ST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPY
Sbjct: 721 STGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPY 780

Query: 781 IIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQG 840
           IIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQG
Sbjct: 781 IIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQG 840

Query: 841 LAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN 900
           LAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+N
Sbjct: 841 LAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN 900

Query: 901 IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918
           IKHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Sbjct: 901 IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 942

BLAST of HG10002345 vs. NCBI nr
Match: XP_008447634.1 (PREDICTED: general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo] >XP_008447635.1 PREDICTED: general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo])

HSP 1 Score: 1670.6 bits (4325), Expect = 0.0e+00
Identity = 861/941 (91.50%), Positives = 887/941 (94.26%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDI 60
           MEKEG++ISD+EEVPG VM VLG EKE VETGV  R   EEEEEEEEGEEEVEDEGEDDI
Sbjct: 1   MEKEGNRISDSEEVPGDVMHVLGVEKE-VETGVVDR---EEEEEEEEGEEEVEDEGEDDI 60

Query: 61  EEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKR 120
           EEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALAEKKRKALANGQSERAAKR
Sbjct: 61  EEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAKR 120

Query: 121 GRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG 180
           GRVED+ GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG
Sbjct: 121 GRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG 180

Query: 181 QHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL 240
           QHEKAIS+LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL
Sbjct: 181 QHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL 240

Query: 241 FSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG 300
           FSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
Sbjct: 241 FSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG 300

Query: 301 NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALER 360
           NVEALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLASLYMGSKEFSKALE 
Sbjct: 301 NVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEH 360

Query: 361 IEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADS 420
           IEHAD VYCAGNELPLNLT KAGICH HLGN+EKAECLFANL RET  DHSNLMIE ADS
Sbjct: 361 IEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVADS 420

Query: 421 LLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDN 480
           LLSLKHY+ ALKYYLMSEEVNAG NMGILY K+A+CYLST+E+ QAIVFFYKVLQH+EDN
Sbjct: 421 LLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDN 480

Query: 481 INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTR 540
           INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLC IY+TR
Sbjct: 481 INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTR 540

Query: 541 GMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP 600
           G+LENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+P
Sbjct: 541 GLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRP 600

Query: 601 VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------ 660
           VAPKSDL+KASRAKRLLQKR+RIKEEKKA+ LAAGV+V+YDDLDDEPA            
Sbjct: 601 VAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGVNVSYDDLDDEPALRMHRESPLPNL 660

Query: 661 ------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSS 720
                       LCKALASLGRCSEALEIISLTLKLAFNSLS ERKEELQLLGAQLAFSS
Sbjct: 661 LKEEEHHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSTERKEELQLLGAQLAFSS 720

Query: 721 TDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYI 780
           T TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYI
Sbjct: 721 TGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYI 780

Query: 781 IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGL 840
           IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGL
Sbjct: 781 IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGL 840

Query: 841 AFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI 900
           AFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Sbjct: 841 AFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNI 900

Query: 901 KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918
           KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Sbjct: 901 KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 937

BLAST of HG10002345 vs. NCBI nr
Match: XP_022152621.1 (general transcription factor 3C polypeptide 3 isoform X3 [Momordica charantia])

HSP 1 Score: 1617.8 bits (4188), Expect = 0.0e+00
Identity = 827/927 (89.21%), Positives = 882/927 (95.15%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEE 60
           MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEE
Sbjct: 1   MEKEGNEISDNKEVPGCAVGVEGEVKGLKETEVENREEEEEEEEEDEEEEEEEEEEEEEE 60

Query: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120
           VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN
Sbjct: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120

Query: 121 GQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLG 180
            QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLG
Sbjct: 121 SQSERPVKRGRLEDIPGASFNEIMEAMNYGSRRKLKEPKRRGRRKGSKKKINREITKLLG 180

Query: 181 DATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240
           DATLCYAQGQ+EKAIS+LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP
Sbjct: 181 DATLCYAQGQYEKAISVLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240

Query: 241 KDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY 300
           +DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Sbjct: 241 RDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY 300

Query: 301 DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGS 360
           DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGS
Sbjct: 301 DQIHQKCIGNVEALMTGAKLYQKCGHHERAICILEDYIKGHPTEADLDVVDLLASLYMGS 360

Query: 361 KEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHS 420
           KEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN+EKAE LFANLGR+TA+DHS
Sbjct: 361 KEFSKALEHIEHADKVYCAGNEMPLNLATKAGICHVHLGNIEKAESLFANLGRKTADDHS 420

Query: 421 NLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFF 480
           + +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFF
Sbjct: 421 DFIIEAADSLLSLKHHNLALKYYLMSEEVNAGGKMQGIVYLKIAQCYLSTNERAEAIVFF 480

Query: 481 YKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVK 540
           YKVLQ LEDNINARLTLASLLLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVK
Sbjct: 481 YKVLQTLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSSSSSSSKFKPWWLNEKVK 540

Query: 541 LKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE 600
           LKLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRE
Sbjct: 541 LKLCNIYRTKGMLENFVETIFSLVRESLYIETLKEKIKVNKKKLPRRVLLERVKVLDGRE 600

Query: 601 TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALC 660
           TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPALC
Sbjct: 601 TGSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVNVSYDDVDDEPALC 660

Query: 661 KALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVV 720
           KALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVV
Sbjct: 661 KALASLGRCSEALEIISLTLKLAFNSLSMERKEELQLLGAQLAFSSTDTKHGFNFAKHVV 720

Query: 721 KQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD 780
           KQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISHHQ+
Sbjct: 721 KQYPYSNSAWNCYYKVSSRMTHRDSRHCKLLNSIQAKYKDCAPPFIIAGHQFNAISHHQE 780

Query: 781 AARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQ 840
           AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQ
Sbjct: 781 AAKKYLEAYKLLPDSPLINLCVGAALINLALGLRLQNKHQCVAQGLAFLYNNLKLCDNSQ 840

Query: 841 EALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA 900
           EALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP++FGEN+NIKH+KSVYCDLRREA
Sbjct: 841 EALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPDIFGENRNIKHEKSVYCDLRREA 900

Query: 901 AYNLHLIYKESGALDLARQVLKDHCTF 918
           AYNLHLIYK+SGALDLARQVLKDHCTF
Sbjct: 901 AYNLHLIYKKSGALDLARQVLKDHCTF 927

BLAST of HG10002345 vs. NCBI nr
Match: XP_022152620.1 (general transcription factor 3C polypeptide 3 isoform X2 [Momordica charantia])

HSP 1 Score: 1609.0 bits (4165), Expect = 0.0e+00
Identity = 827/950 (87.05%), Positives = 882/950 (92.84%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEE 60
           MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEE
Sbjct: 1   MEKEGNEISDNKEVPGCAVGVEGEVKGLKETEVENREEEEEEEEEDEEEEEEEEEEEEEE 60

Query: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120
           VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN
Sbjct: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120

Query: 121 GQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLG 180
            QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLG
Sbjct: 121 SQSERPVKRGRLEDIPGASFNEIMEAMNYGSRRKLKEPKRRGRRKGSKKKINREITKLLG 180

Query: 181 DATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240
           DATLCYAQGQ+EKAIS+LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP
Sbjct: 181 DATLCYAQGQYEKAISVLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240

Query: 241 KDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY 300
           +DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Sbjct: 241 RDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY 300

Query: 301 DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGS 360
           DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGS
Sbjct: 301 DQIHQKCIGNVEALMTGAKLYQKCGHHERAICILEDYIKGHPTEADLDVVDLLASLYMGS 360

Query: 361 KEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHS 420
           KEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN+EKAE LFANLGR+TA+DHS
Sbjct: 361 KEFSKALEHIEHADKVYCAGNEMPLNLATKAGICHVHLGNIEKAESLFANLGRKTADDHS 420

Query: 421 NLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFY 480
           + +IEAADSLLSLKH+NLALKYYLMSEEVNAGG MGI+YLKIAQCYLST+ER +AIVFFY
Sbjct: 421 DFIIEAADSLLSLKHHNLALKYYLMSEEVNAGGKMGIVYLKIAQCYLSTNERAEAIVFFY 480

Query: 481 KVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKL 540
           KVLQ LEDNINARLTLASLLLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKL
Sbjct: 481 KVLQTLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSSSSSSSKFKPWWLNEKVKL 540

Query: 541 KLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET 600
           KLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRET
Sbjct: 541 KLCNIYRTKGMLENFVETIFSLVRESLYIETLKEKIKVNKKKLPRRVLLERVKVLDGRET 600

Query: 601 GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA--- 660
           G+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA   
Sbjct: 601 GSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVNVSYDDVDDEPALRV 660

Query: 661 ---------------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQL 720
                                LCKALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQL
Sbjct: 661 HRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQL 720

Query: 721 LGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAK 780
           LGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAK
Sbjct: 721 LGAQLAFSSTDTKHGFNFAKHVVKQYPYSNSAWNCYYKVSSRMTHRDSRHCKLLNSIQAK 780

Query: 781 YKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQN 840
           YKDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQN
Sbjct: 781 YKDCAPPFIIAGHQFNAISHHQEAAKKYLEAYKLLPDSPLINLCVGAALINLALGLRLQN 840

Query: 841 KHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP 900
           KHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Sbjct: 841 KHQCVAQGLAFLYNNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP 900

Query: 901 ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918
           ++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Sbjct: 901 DIFGENRNIKHEKSVYCDLRREAAYNLHLIYKKSGALDLARQVLKDHCTF 950

BLAST of HG10002345 vs. ExPASy Swiss-Prot
Match: Q9Y5Q9 (General transcription factor 3C polypeptide 3 OS=Homo sapiens OX=9606 GN=GTF3C3 PE=1 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 1.5e-58
Identity = 224/920 (24.35%), Positives = 419/920 (45.54%), Query Frame = 0

Query: 27  EVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFS 86
           E  E   E R+  E++  +E+G+   E+  +D        I   K+ +   D  EG + S
Sbjct: 19  EEFERRREERKTREKKSLQEKGKLSAEENPDDSEVPSSSGINSTKSQDK--DVNEG-ETS 78

Query: 87  IQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKL 146
               K   ++    L E +       + E   +     + P A    +LE +        
Sbjct: 79  DGVRKSVHKVFASMLGENEDDEEEEEEEEEEEEEEETPEQPTAGDVFVLEMV------LN 138

Query: 147 KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTL 206
           +E KK  + K  + KL R +  L+G+A + +A+G+ E+AI +  +++ QAP   + + TL
Sbjct: 139 RETKKMMKEKRPRSKLPRALRGLMGEANIRFARGEREEAILMCMEIIRQAPLAYEPFSTL 198

Query: 207 GLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPD 266
            ++Y   GD  K++ F ++AAHL P D+  W  L   S+++ +I QA +C +KA+K EP 
Sbjct: 199 AMIYEDQGDMEKSLQFELIAAHLNPSDTEEWVRLAEMSLEQDNIKQAIFCYTKALKYEPT 258

Query: 267 DINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERA 326
           ++  L+ R+SLY + GD + A + Y +I      +     ++     AK Y +   +  A
Sbjct: 259 NVRYLWERSSLYEQMGDHKMAMDGYRRILNLLSPSDGERFMQLARDMAKSYYEANDVTSA 318

Query: 327 ICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERI-------------------- 386
           I I+++   +H     ++ V++ A LY+ +K++ KALE I                    
Sbjct: 319 INIIDEAFSKHQGLVSMEDVNIAAELYISNKQYDKALEIITDFSGIVLEKKTSEEGTSEE 378

Query: 387 -EHADEVYCA-GNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAAD 446
            +  + V C   + +P+++T K  +C VHL  +E    L   L  +   D  +L ++ A+
Sbjct: 379 NKAPENVTCTIPDGVPIDITVKLMVCLVHLNILEPLNPLLTTLVEQNPEDMGDLYLDVAE 438

Query: 447 SLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLED 506
           + L +  YN AL   L +   +   N+ +++L+ A+C  +     +A   + KV+     
Sbjct: 439 AFLDVGEYNSALP-LLSALVCSERYNLAVVWLRHAECLKALGYMERAAESYGKVVDLAPL 498

Query: 507 NINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKT 566
           +++AR++L++L  +  + E+A+  L P  D +  +  ++  +      K+ L    +  +
Sbjct: 499 HLDARISLSTLQQQLGQPEKALEALEPMYDPDTLAQDANAAQQ---ELKLLLHRSTLLFS 558

Query: 567 RGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK 626
           +G +  +V+ +  ++   L                  +V + R +V        L    K
Sbjct: 559 QGKMYGYVDTLLTMLAMLL------------------KVAMNRAQVC-------LISSSK 618

Query: 627 PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKA---LASL 686
                  L K SR K +    ++      A+A+ A +       D    L KA   L  L
Sbjct: 619 SGERHLYLIKVSRDK-ISDSNDQESANCDAKAIFAVLTSVLTKDDWWNLLLKAIYSLCDL 678

Query: 687 GRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYS 746
            R  EA  ++  +L+        ++++EL+  G   A    +    +N+ + +V +    
Sbjct: 679 SRFQEAELLVDSSLEYYSFYDDRQKRKELEYFGLSAAILDKNFRKAYNYIRIMVMENVNK 738

Query: 747 ISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL 806
              WN + +V  ++ ++D RH +    +  K  +     ++ GH        + A  +Y+
Sbjct: 739 PQLWNIFNQV--TMHSQDVRHHRFCLRLMLKNPENHALCVLNGHNAFVSGSFKHALGQYV 798

Query: 807 EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNI 866
           +A++  PD PL + C+G + I++A    +  +H  + QG +FL + L L    QE+ YN+
Sbjct: 799 QAFRTHPDEPLYSFCIGLTFIHMASQKYVLRRHALIVQGFSFLNRYLSLRGPCQESFYNL 858

Query: 867 ARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHL 917
            R  H +GL+ LA+ YY+K L        +P L  E   +        DLRR+ AYNL L
Sbjct: 859 GRGLHQLGLIHLAIHYYQKAL-------ELPPLVVEGIELDQ-----LDLRRDIAYNLSL 885

BLAST of HG10002345 vs. ExPASy Swiss-Prot
Match: O74458 (Transcription factor tau subunit sfc4 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=sfc4 PE=1 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 3.3e-26
Identity = 211/935 (22.57%), Positives = 363/935 (38.82%), Query Frame = 0

Query: 124  EDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQH 183
            +DI    ++E L+A+  G R+  K  K RGR   +    + +V ++L  A   +AQ G  
Sbjct: 91   DDIANEEWEENLKAV-AGFRKVRKGHKGRGRVSRADMLPSVEVQQMLSLANHLFAQEGNF 150

Query: 184  EKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDD----VKAMGFYMLAAHLMPKDSSLWK 243
            ++A  +  ++V    ++  ++  LG  +   G+      K +  +M AAHL PKD  LW 
Sbjct: 151  DEAQKLAEEIVRIDNNVIAAWKMLGECHRQRGNGRVNIEKCLIAWMAAAHLKPKDHELWF 210

Query: 244  LLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQI 303
                 S      DQA YC ++A+ A+P + +     +++R+ L  E G  +KAAE +  +
Sbjct: 211  TCAKLSESLEFWDQADYCYNRAVSAKPPNKSELKKYIWNRSVLNKEHGSLKKAAEGFKFL 270

Query: 304  HQQCLGNVEALMTGAKLYQKCGHLERAIC----ILEDYIKEHPTEA------DLDVVDLL 363
             Q    N   L   A++Y K  H  R I     I   Y  ++P         DL  ++L 
Sbjct: 271  LQSSPYNASILKNLAEIYIKI-HAPREILKQFEIAWKYFYQYPAPPIGNDIFDLPTLNLY 330

Query: 364  ASLYMGSKEFSKALERI----------------------------EHADEVYCAGNE--- 423
            A L +   ++S  +  I                            E   E   A  E   
Sbjct: 331  AELLLLDHQWSNLIRLINRGVRWFRGRKSESFWDEFDDDREWDVDERRREFPNASEEHTN 390

Query: 424  -----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYN 483
                 LP    TK GI  +  G + +AE  F+ +     +    ++ + A + + ++  +
Sbjct: 391  KEAYLLPHLFRTKLGIARLKTGELPEAELHFSVIKNLPPDYAWGMLYDIAKAYMDIERLD 450

Query: 484  LALKYYLM------SEEVNAGGNMGILYLKI------AQC----YLSTDERTQAIVFFYK 543
            LAL+Y+++      ++ +    NMG+ YL++       QC     +  +  T A++   +
Sbjct: 451  LALEYFVLICNHEPAQNIGLWYNMGVCYLELKEYEHAQQCMEAILIVDNSNTNALIKLAE 510

Query: 544  VLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLK 603
            +   L+DN +A L + + + E+ R+   I+ L   +  N     +   + +  N+KV   
Sbjct: 511  I-NELQDNRDAALEIVTNIFEQRRN---INELEREQSQNEDHEKNVGSQLFVGNQKVPQ- 570

Query: 604  LCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE-- 663
                ++ R  +    E      R+    +T + + + +K  + R+ L +   V +     
Sbjct: 571  --DKWEKRARISRSKEE----ARQFTIWKTEETQRRFHKLDILRQSLKKEENVSESLNEW 630

Query: 664  ---TGNLFRGFKPVAPKSDLSKASRAKR-LLQKRER---IKEEKKA---RALAAGVDVNY 723
                  L   F  +       K +RA+  LL +R R   + ++  +   R   +     Y
Sbjct: 631  LAIASELIDEFVSIKAFFPSEKKARARAGLLTRRTRYASLNDQLTSMINRLNDSLTRTKY 690

Query: 724  DDLDDEPAL--------------------CKALASLGRCSEALEIISLTLKLAFNSLSIE 783
             DLD +  L                       L  +G   +A ++++  +          
Sbjct: 691  GDLDLDTILRTGYFRNVSIDAWYQLFVEFSLRLTKVGSVQQAYDVLTTAMGAILFDQDTI 750

Query: 784  RKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVAS-----SLTNRDS 843
            +++ L+      +  + D        + V   + +    +  +  V S     S    DS
Sbjct: 751  KRQNLRWCMLACSMYARDPQGALTPLRWVFTTFQFRQDTYRLFSAVLSQGYECSRAFVDS 810

Query: 844  RHCKLL------------NSM-------------QAKYKDCAPPYIIA--GHQFTTISHH 903
             + K L            NS+              A       P ++   GH        
Sbjct: 811  ANQKFLLRLIKLMDQLMSNSLVSGAATLVKNDDGLATVPTSYDPVLVLLYGHIMARNRSW 870

Query: 904  QDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN 918
              A   Y  A+ I PD P+ NL +G + ++ A+     N+H  + QG  FLY+   L  N
Sbjct: 871  IPAINYYSRAFAINPDCPITNLSLGLAYLHRAMQRLSDNRHYQILQGFTFLYRYYDLRVN 930

BLAST of HG10002345 vs. ExPASy Swiss-Prot
Match: P33339 (Transcription factor tau 131 kDa subunit OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TFC4 PE=1 SV=1)

HSP 1 Score: 87.8 bits (216), Expect = 6.9e-16
Identity = 209/1058 (19.75%), Positives = 397/1058 (37.52%), Query Frame = 0

Query: 40   EEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE 99
            ++E++ +  E E  D G+ + E+E+         ++     E  D  +    +    EY 
Sbjct: 7    KKEQQNQSAERESADTGKVNDEDEEHLYGNIDDYKHLIQDEEYDDEDVPHDLQLSEDEYN 66

Query: 100  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPK-KRGRRKGS 159
              +E+    LA             ED   A    I EA N+  ++K K  K K   R+  
Sbjct: 67   --SERDSSLLAEFSDYGEISEDDEEDFMNA----IREASNFKVKKKKKNDKGKSYGRQRK 126

Query: 160  KKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVK 219
            ++ L+ +V +LL  A   + +   + A  +  +V+ +      +Y TLG +Y   G    
Sbjct: 127  ERVLDPEVAQLLSQANEAFVRNDLQVAERLFNEVIKKDARNFAAYETLGDIYQLQGRLND 186

Query: 220  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLY 279
                + LAAHL   D   WK++   S D   + QA YC S+ I   P +   ++ R+ LY
Sbjct: 187  CCNSWFLAAHLNASDWEFWKIVAILSADLDHVRQAIYCFSRVISLNPMEWESIYRRSMLY 246

Query: 280  LERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC 339
             + G   +A + + +++                  + + +    +LY K    ++ER   
Sbjct: 247  KKTGQLARALDGFQRLYMYNPYDANILRELAILYVDYDRIEDSIELYMKVFNANVERREA 306

Query: 340  IL------------------EDYIKEHPTEADLD------------------------VV 399
            IL                  ED  ++ P E D D                         +
Sbjct: 307  ILAALENALDSSDEESAAEGEDADEKEPLEQDEDRQMFPDINWKKIDAKYKCIPFDWSSL 366

Query: 400  DLLASLYM--------GSKEFSKALERIEHA-------------------------DEVY 459
            ++LA L++        G K   K    I+                           D + 
Sbjct: 367  NILAELFLKLAVSEVDGIKTIKKCARWIQRRESQTFWDHVPDDSEFDNRRFKNSTFDSLL 426

Query: 460  CAGNE----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSL 519
             A  E    +P+++  + G+  ++  N+ +A   F  L  ET +D ++L  EAA +L   
Sbjct: 427  AAEKEKSYNIPIDIRVRLGLLRLNTDNLVEALNHFQCLYDETFSDVADLYFEAATALTRA 486

Query: 520  KHYNLALKYY---LMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNI 579
            + Y  A+ ++   L  EE         ++  +A+CY   +    A  F+   ++   D++
Sbjct: 487  EKYKEAIDFFTPLLSLEEWRTTD----VFKPLARCYKEIESYETAKEFYELAIKSEPDDL 546

Query: 580  NARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSSK--LK 639
            + R++LA +                   + + + +E +  +S  K SN TS  SSK  L+
Sbjct: 547  DIRVSLAEVYYRLNDPETFKHMLVDVVEMRKHQVDETLHRISNEKSSNDTSDISSKPLLE 606

Query: 640  PWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPR---RV 699
                    K K       R  +E    +   +V +   ++  +    +N+ K        
Sbjct: 607  DSKFRTFRKKKRTPYDAERERIERERRITAKVVDKYEKMKKFELNSGLNEAKQASIWINT 666

Query: 700  LLERVKVLD----------GRETGNLFRGFKPVAPKSD-----LSKASRAKR-----LLQ 759
            + E V +             R+   + R  K    + D     LSK +         L++
Sbjct: 667  VSELVDIFSSVKNFFMKSRSRKFVGILRRTKKFNTELDFQIERLSKLAEGDSVFEGPLME 726

Query: 760  KRERIKEEKKARALA-----------AGVDVNYDDLDDEPALCKALASLGRCSEALEIIS 819
            +R  +    + R L+           + V   Y  ++D  ++ +    +    +  E + 
Sbjct: 727  ERVTLTSATELRGLSYEQWFELFMELSLVIAKYQSVEDGLSVVETAQEVNVFFQDPERVK 786

Query: 820  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVA 879
            + +K    ++ ++  +E      +LA +    ++ F F + V++ + YS+        + 
Sbjct: 787  M-MKFVKLAIVLQMDDE-----EELAENLRGLLNQFQFNRKVLQVFMYSLCRGPSSLNIL 846

Query: 880  SSLTNRD------------------SRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ 915
            SS   +                   +    + N         + PY+   +     S   
Sbjct: 847  SSTIQQKFFLRQLKAFDSCRYNTEVNGQASITNKEVYNPNKKSSPYLYYIYAVLLYS--- 906

BLAST of HG10002345 vs. ExPASy Swiss-Prot
Match: Q61371 (Intraflagellar transport protein 88 homolog OS=Mus musculus OX=10090 GN=Ift88 PE=1 SV=2)

HSP 1 Score: 48.5 bits (114), Expect = 4.6e-04
Identity = 51/214 (23.83%), Positives = 88/214 (41.12%), Query Frame = 0

Query: 94  ERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILE--AMNYGSRRKLKEPKK 153
           +R    AL  K     ANG  E+AA+  +      +S  E L    + Y    +L E   
Sbjct: 480 DRYNPSALTNKGNTVFANGDYEKAAEFYKEALRNDSSCTEALYNIGLTYKKLNRLDEALD 539

Query: 154 RGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYN 213
              +  +   L      L   A +        +AI  L Q++   P    +   LG +Y+
Sbjct: 540 SFLKLHA--ILRNSAQVLCQIANIYELMEDPNQAIEWLMQLISVVPTDSQALSKLGELYD 599

Query: 214 AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLL 273
           + GD  +A  +Y  +    P +  + + L ++ ID    ++A     +A   +P  +   
Sbjct: 600 SEGDKSQAFQYYYESYRYFPSNIEVIEWLGAYYIDTQFCEKAIQYFERASLIQPTQVKWQ 659

Query: 274 FHRASLYLERGDCEKAAETYDQIHQQCLGNVEAL 306
              AS +   G+ +KA +TY +IH++   NVE L
Sbjct: 660 LMVASCFRRSGNYQKALDTYKEIHRKFPENVECL 691

BLAST of HG10002345 vs. ExPASy TrEMBL
Match: A0A1S3BHB9 (general transcription factor 3C polypeptide 3 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490044 PE=4 SV=1)

HSP 1 Score: 1670.6 bits (4325), Expect = 0.0e+00
Identity = 861/941 (91.50%), Positives = 887/941 (94.26%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDI 60
           MEKEG++ISD+EEVPG VM VLG EKE VETGV  R   EEEEEEEEGEEEVEDEGEDDI
Sbjct: 1   MEKEGNRISDSEEVPGDVMHVLGVEKE-VETGVVDR---EEEEEEEEGEEEVEDEGEDDI 60

Query: 61  EEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKR 120
           EEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALAEKKRKALANGQSERAAKR
Sbjct: 61  EEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAKR 120

Query: 121 GRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG 180
           GRVED+ GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG
Sbjct: 121 GRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG 180

Query: 181 QHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL 240
           QHEKAIS+LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL
Sbjct: 181 QHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLL 240

Query: 241 FSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG 300
           FSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
Sbjct: 241 FSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG 300

Query: 301 NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALER 360
           NVEALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLASLYMGSKEFSKALE 
Sbjct: 301 NVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEH 360

Query: 361 IEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADS 420
           IEHAD VYCAGNELPLNLT KAGICH HLGN+EKAECLFANL RET  DHSNLMIE ADS
Sbjct: 361 IEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVADS 420

Query: 421 LLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDN 480
           LLSLKHY+ ALKYYLMSEEVNAG NMGILY K+A+CYLST+E+ QAIVFFYKVLQH+EDN
Sbjct: 421 LLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDN 480

Query: 481 INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTR 540
           INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLC IY+TR
Sbjct: 481 INARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTR 540

Query: 541 GMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP 600
           G+LENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+P
Sbjct: 541 GLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRP 600

Query: 601 VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------ 660
           VAPKSDL+KASRAKRLLQKR+RIKEEKKA+ LAAGV+V+YDDLDDEPA            
Sbjct: 601 VAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGVNVSYDDLDDEPALRMHRESPLPNL 660

Query: 661 ------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSS 720
                       LCKALASLGRCSEALEIISLTLKLAFNSLS ERKEELQLLGAQLAFSS
Sbjct: 661 LKEEEHHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSTERKEELQLLGAQLAFSS 720

Query: 721 TDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYI 780
           T TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYI
Sbjct: 721 TGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYI 780

Query: 781 IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGL 840
           IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGL
Sbjct: 781 IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGL 840

Query: 841 AFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI 900
           AFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Sbjct: 841 AFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNI 900

Query: 901 KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918
           KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Sbjct: 901 KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 937

BLAST of HG10002345 vs. ExPASy TrEMBL
Match: A0A0A0LGB1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G848220 PE=4 SV=1)

HSP 1 Score: 1630.9 bits (4222), Expect = 0.0e+00
Identity = 846/942 (89.81%), Positives = 872/942 (92.57%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDD 60
           MEKEGS+ISD EEVPG VM VLG EKEVVETGVE RE EEEEEEEEEEGEEEVEDEGEDD
Sbjct: 1   MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDD 60

Query: 61  IEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAK 120
           IEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALAEKKRKALANGQSERAAK
Sbjct: 61  IEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAK 120

Query: 121 RGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ 180
           RGRVEDI GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ
Sbjct: 121 RGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ 180

Query: 181 GQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKL 240
           G+HEKAIS+LRQVVL+APDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKL
Sbjct: 181 GEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKL 240

Query: 241 LFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL 300
           LFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Sbjct: 241 LFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL 300

Query: 301 GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALE 360
           GNVEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLASLYMGSKEFSKALE
Sbjct: 301 GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALE 360

Query: 361 RIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAAD 420
           RIEHAD VYCAGNELPLNLTTKAGICH HLG++EKAECLFANL RET  DHSNLMIE AD
Sbjct: 361 RIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYDHSNLMIEVAD 420

Query: 421 SLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLED 480
           SL+SLKHY+ ALKYYLMSEEVNAG NMGILYLKIA+CYLST+ER QAIVFFYKVLQH+ED
Sbjct: 421 SLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVED 480

Query: 481 NINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKT 540
           NINARLTLASLLLEEARD+EAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLC IY+T
Sbjct: 481 NINARLTLASLLLEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRT 540

Query: 541 RGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK 600
           RG+LENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
Sbjct: 541 RGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK 600

Query: 601 PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA----------- 660
           PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGV+++YDDLDDEPA           
Sbjct: 601 PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPN 660

Query: 661 -------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFS 720
                        LCKALASLGRCSEALEIISLTL                        +
Sbjct: 661 LLKEEEYHILIVDLCKALASLGRCSEALEIISLTL------------------------N 720

Query: 721 STDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPY 780
           ST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPY
Sbjct: 721 STGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPY 780

Query: 781 IIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQG 840
           IIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQG
Sbjct: 781 IIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQG 840

Query: 841 LAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN 900
           LAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+N
Sbjct: 841 LAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN 900

Query: 901 IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918
           IKHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Sbjct: 901 IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918

BLAST of HG10002345 vs. ExPASy TrEMBL
Match: A0A6J1DIA5 (general transcription factor 3C polypeptide 3 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111020298 PE=4 SV=1)

HSP 1 Score: 1617.8 bits (4188), Expect = 0.0e+00
Identity = 827/927 (89.21%), Positives = 882/927 (95.15%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEE 60
           MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEE
Sbjct: 1   MEKEGNEISDNKEVPGCAVGVEGEVKGLKETEVENREEEEEEEEEDEEEEEEEEEEEEEE 60

Query: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120
           VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN
Sbjct: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120

Query: 121 GQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLG 180
            QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLG
Sbjct: 121 SQSERPVKRGRLEDIPGASFNEIMEAMNYGSRRKLKEPKRRGRRKGSKKKINREITKLLG 180

Query: 181 DATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240
           DATLCYAQGQ+EKAIS+LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP
Sbjct: 181 DATLCYAQGQYEKAISVLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240

Query: 241 KDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY 300
           +DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Sbjct: 241 RDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY 300

Query: 301 DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGS 360
           DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGS
Sbjct: 301 DQIHQKCIGNVEALMTGAKLYQKCGHHERAICILEDYIKGHPTEADLDVVDLLASLYMGS 360

Query: 361 KEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHS 420
           KEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN+EKAE LFANLGR+TA+DHS
Sbjct: 361 KEFSKALEHIEHADKVYCAGNEMPLNLATKAGICHVHLGNIEKAESLFANLGRKTADDHS 420

Query: 421 NLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFF 480
           + +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFF
Sbjct: 421 DFIIEAADSLLSLKHHNLALKYYLMSEEVNAGGKMQGIVYLKIAQCYLSTNERAEAIVFF 480

Query: 481 YKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVK 540
           YKVLQ LEDNINARLTLASLLLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVK
Sbjct: 481 YKVLQTLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSSSSSSSKFKPWWLNEKVK 540

Query: 541 LKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE 600
           LKLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRE
Sbjct: 541 LKLCNIYRTKGMLENFVETIFSLVRESLYIETLKEKIKVNKKKLPRRVLLERVKVLDGRE 600

Query: 601 TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALC 660
           TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPALC
Sbjct: 601 TGSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVNVSYDDVDDEPALC 660

Query: 661 KALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVV 720
           KALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVV
Sbjct: 661 KALASLGRCSEALEIISLTLKLAFNSLSMERKEELQLLGAQLAFSSTDTKHGFNFAKHVV 720

Query: 721 KQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD 780
           KQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISHHQ+
Sbjct: 721 KQYPYSNSAWNCYYKVSSRMTHRDSRHCKLLNSIQAKYKDCAPPFIIAGHQFNAISHHQE 780

Query: 781 AARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQ 840
           AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQ
Sbjct: 781 AAKKYLEAYKLLPDSPLINLCVGAALINLALGLRLQNKHQCVAQGLAFLYNNLKLCDNSQ 840

Query: 841 EALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA 900
           EALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP++FGEN+NIKH+KSVYCDLRREA
Sbjct: 841 EALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPDIFGENRNIKHEKSVYCDLRREA 900

Query: 901 AYNLHLIYKESGALDLARQVLKDHCTF 918
           AYNLHLIYK+SGALDLARQVLKDHCTF
Sbjct: 901 AYNLHLIYKKSGALDLARQVLKDHCTF 927

BLAST of HG10002345 vs. ExPASy TrEMBL
Match: A0A6J1DEF9 (general transcription factor 3C polypeptide 3 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111020298 PE=4 SV=1)

HSP 1 Score: 1609.0 bits (4165), Expect = 0.0e+00
Identity = 827/950 (87.05%), Positives = 882/950 (92.84%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEE 60
           MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEE
Sbjct: 1   MEKEGNEISDNKEVPGCAVGVEGEVKGLKETEVENREEEEEEEEEDEEEEEEEEEEEEEE 60

Query: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120
           VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN
Sbjct: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120

Query: 121 GQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLG 180
            QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLG
Sbjct: 121 SQSERPVKRGRLEDIPGASFNEIMEAMNYGSRRKLKEPKRRGRRKGSKKKINREITKLLG 180

Query: 181 DATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240
           DATLCYAQGQ+EKAIS+LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP
Sbjct: 181 DATLCYAQGQYEKAISVLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240

Query: 241 KDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY 300
           +DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Sbjct: 241 RDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY 300

Query: 301 DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGS 360
           DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGS
Sbjct: 301 DQIHQKCIGNVEALMTGAKLYQKCGHHERAICILEDYIKGHPTEADLDVVDLLASLYMGS 360

Query: 361 KEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHS 420
           KEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN+EKAE LFANLGR+TA+DHS
Sbjct: 361 KEFSKALEHIEHADKVYCAGNEMPLNLATKAGICHVHLGNIEKAESLFANLGRKTADDHS 420

Query: 421 NLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFY 480
           + +IEAADSLLSLKH+NLALKYYLMSEEVNAGG MGI+YLKIAQCYLST+ER +AIVFFY
Sbjct: 421 DFIIEAADSLLSLKHHNLALKYYLMSEEVNAGGKMGIVYLKIAQCYLSTNERAEAIVFFY 480

Query: 481 KVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKL 540
           KVLQ LEDNINARLTLASLLLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKL
Sbjct: 481 KVLQTLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSSSSSSSKFKPWWLNEKVKL 540

Query: 541 KLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET 600
           KLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRET
Sbjct: 541 KLCNIYRTKGMLENFVETIFSLVRESLYIETLKEKIKVNKKKLPRRVLLERVKVLDGRET 600

Query: 601 GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA--- 660
           G+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA   
Sbjct: 601 GSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVNVSYDDVDDEPALRV 660

Query: 661 ---------------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQL 720
                                LCKALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQL
Sbjct: 661 HRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQL 720

Query: 721 LGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAK 780
           LGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAK
Sbjct: 721 LGAQLAFSSTDTKHGFNFAKHVVKQYPYSNSAWNCYYKVSSRMTHRDSRHCKLLNSIQAK 780

Query: 781 YKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQN 840
           YKDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQN
Sbjct: 781 YKDCAPPFIIAGHQFNAISHHQEAAKKYLEAYKLLPDSPLINLCVGAALINLALGLRLQN 840

Query: 841 KHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP 900
           KHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Sbjct: 841 KHQCVAQGLAFLYNNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP 900

Query: 901 ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918
           ++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Sbjct: 901 DIFGENRNIKHEKSVYCDLRREAAYNLHLIYKKSGALDLARQVLKDHCTF 950

BLAST of HG10002345 vs. ExPASy TrEMBL
Match: A0A6J1DGK2 (general transcription factor 3C polypeptide 3 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020298 PE=4 SV=1)

HSP 1 Score: 1604.3 bits (4153), Expect = 0.0e+00
Identity = 827/951 (86.96%), Positives = 882/951 (92.74%), Query Frame = 0

Query: 1   MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEE 60
           MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEE
Sbjct: 1   MEKEGNEISDNKEVPGCAVGVEGEVKGLKETEVENREEEEEEEEEDEEEEEEEEEEEEEE 60

Query: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120
           VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN
Sbjct: 61  VEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALAN 120

Query: 121 GQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLG 180
            QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLG
Sbjct: 121 SQSERPVKRGRLEDIPGASFNEIMEAMNYGSRRKLKEPKRRGRRKGSKKKINREITKLLG 180

Query: 181 DATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240
           DATLCYAQGQ+EKAIS+LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP
Sbjct: 181 DATLCYAQGQYEKAISVLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP 240

Query: 241 KDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY 300
           +DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Sbjct: 241 RDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY 300

Query: 301 DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGS 360
           DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGS
Sbjct: 301 DQIHQKCIGNVEALMTGAKLYQKCGHHERAICILEDYIKGHPTEADLDVVDLLASLYMGS 360

Query: 361 KEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHS 420
           KEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN+EKAE LFANLGR+TA+DHS
Sbjct: 361 KEFSKALEHIEHADKVYCAGNEMPLNLATKAGICHVHLGNIEKAESLFANLGRKTADDHS 420

Query: 421 NLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFF 480
           + +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFF
Sbjct: 421 DFIIEAADSLLSLKHHNLALKYYLMSEEVNAGGKMQGIVYLKIAQCYLSTNERAEAIVFF 480

Query: 481 YKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVK 540
           YKVLQ LEDNINARLTLASLLLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVK
Sbjct: 481 YKVLQTLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSSSSSSSKFKPWWLNEKVK 540

Query: 541 LKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE 600
           LKLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRE
Sbjct: 541 LKLCNIYRTKGMLENFVETIFSLVRESLYIETLKEKIKVNKKKLPRRVLLERVKVLDGRE 600

Query: 601 TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA-- 660
           TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA  
Sbjct: 601 TGSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVNVSYDDVDDEPALR 660

Query: 661 ----------------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQ 720
                                 LCKALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQ
Sbjct: 661 VHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQ 720

Query: 721 LLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQA 780
           LLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QA
Sbjct: 721 LLGAQLAFSSTDTKHGFNFAKHVVKQYPYSNSAWNCYYKVSSRMTHRDSRHCKLLNSIQA 780

Query: 781 KYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQ 840
           KYKDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQ
Sbjct: 781 KYKDCAPPFIIAGHQFNAISHHQEAAKKYLEAYKLLPDSPLINLCVGAALINLALGLRLQ 840

Query: 841 NKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI 900
           NKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
Sbjct: 841 NKHQCVAQGLAFLYNNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI 900

Query: 901 PELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF 918
           P++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Sbjct: 901 PDIFGENRNIKHEKSVYCDLRREAAYNLHLIYKKSGALDLARQVLKDHCTF 951

BLAST of HG10002345 vs. TAIR 10
Match: AT1G17680.1 (tetratricopeptide repeat (TPR)-containing protein)

HSP 1 Score: 657.1 bits (1694), Expect = 2.0e-188
Identity = 400/929 (43.06%), Positives = 573/929 (61.68%), Query Frame = 0

Query: 4   EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDI 63
           EG+ IS+ EE P  +    +VLG +    +  + + EE   ++++++ ++  +DEG++  
Sbjct: 13  EGNLISELEEGPSNMECDKQVLGGDTNYDDKDLNSDEEGLVDDDDDDSDD--DDEGDESE 72

Query: 64  EEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKR 123
           EE+D     F+AG  P                FER EYEALAE+KRKALA+ Q   +   
Sbjct: 73  EEDD-----FEAGSVP--------------NTFERPEYEALAERKRKALADSQRNPSNIS 132

Query: 124 GRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG 183
                + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G
Sbjct: 133 NSTSGVEG-----FMEFMSSGRRRKSRKYKKKGRRPGSKKEVAPDILKRFREALFLHAHG 192

Query: 184 QHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKL 243
           +  +A+ IL +V+ QAP    +Y+ L  V   +G  +  +     +AA++    S  WKL
Sbjct: 193 RDIEALPILVEVIKQAPAFDIAYYYLSRVSEQLGKTESSSTEALKIAANIKGSKSPFWKL 252

Query: 244 LFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL 303
           L+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++C 
Sbjct: 253 LYERFKEQENISVARSYASKAIQADPDDIPLKYEYADICLNTGKYREAAETFEQIFRRCP 312

Query: 304 GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALE 363
             +EAL  G + + K G  ERA  ILED+IK H +E   DV+DLLAS++M      +AL+
Sbjct: 313 ERIEALKWGVQYFLKSGEGERAASILEDHIKSHSSEVGHDVLDLLASVFMKINAHDRALK 372

Query: 364 RIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAAD 423
            I    ++Y  G EL  +L  +  ICHVHL  ME+AE + + L +E  ++H  L+   AD
Sbjct: 373 YIHDVRQIYNVGKELSSSLKIRQAICHVHLEEMEQAESVLSILPQEAVSEHPELITNLAD 432

Query: 424 SLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHL 483
            L ++ +++ ALKYY+  +SE VN     G L++KIA+CY+S +ER QAIVF+YK L  L
Sbjct: 433 ELTNIGNFHSALKYYIEAISEPVN-----GNLFVKIARCYMSLEERKQAIVFYYKALNEL 492

Query: 484 EDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIY 543
            D ++ R+TLASLLLE+ + +EA+ +LSPP++ +P    ++KLK WW N K+++ LCQIY
Sbjct: 493 SDTVDVRITLASLLLEDGKRDEAVLVLSPPENPDP---DTAKLKAWWKNRKIRMNLCQIY 552

Query: 544 KTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG 603
            + GMLE+F      LV + ++  T++ K K       R VL E  +    R   +    
Sbjct: 553 HSEGMLEDFANTALQLVLKWVWRRTVKGKRK-------RLVLSEHQRNKKRRRPRDAQAS 612

Query: 604 FKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------AL 663
                PK    K  + +  L +  RI+E    +A    V  +   + + DE        L
Sbjct: 613 QLRGGPK----KWRKIRATLNETRRIRERAAIKAHNEDVCSESEEEVIKDEEYHRLFVDL 672

Query: 664 CKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHV 723
           CKALASL R  EALEI++L  +L    L +E K+ELQ LGA+++  + D    F+  + V
Sbjct: 673 CKALASLQRYWEALEIVNLARRLDAKMLPVETKKELQSLGAKISCDTMDPKQWFDCVRSV 732

Query: 724 VKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ 783
           ++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +IAGH FT  S HQ
Sbjct: 733 IQQHPYRLNAWNCYYSVISRLGKRASTEAKFMHHLRSKYRDCVPPILIAGHHFTVTSRHQ 792

Query: 784 DAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNS 843
           DAAR+YLEAYK+MP+SPLINLCVGA+LINLALGFRL+N+H+C+AQG AFLY NL++C NS
Sbjct: 793 DAAREYLEAYKLMPESPLINLCVGAALINLALGFRLKNRHECLAQGFAFLYNNLRICSNS 852

Query: 844 QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRR 903
           QEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+  + + +K V CDLR+
Sbjct: 853 QEALYNVARAYQHVGLVTLAASYYEKVLAIYEKDYTMPKLPNEDPIVAEERKPVNCDLRK 896

Query: 904 EAAYNLHLIYKESGALDLARQVLKDHCTF 918
           EAA+NLHLIYK SGA DLARQVLKDHCTF
Sbjct: 913 EAAHNLHLIYKHSGAFDLARQVLKDHCTF 896

BLAST of HG10002345 vs. TAIR 10
Match: AT1G17680.2 (tetratricopeptide repeat (TPR)-containing protein)

HSP 1 Score: 657.1 bits (1694), Expect = 2.0e-188
Identity = 400/929 (43.06%), Positives = 573/929 (61.68%), Query Frame = 0

Query: 4   EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDI 63
           EG+ IS+ EE P  +    +VLG +    +  + + EE   ++++++ ++  +DEG++  
Sbjct: 13  EGNLISELEEGPSNMECDKQVLGGDTNYDDKDLNSDEEGLVDDDDDDSDD--DDEGDESE 72

Query: 64  EEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKR 123
           EE+D     F+AG  P                FER EYEALAE+KRKALA+ Q   +   
Sbjct: 73  EEDD-----FEAGSVP--------------NTFERPEYEALAERKRKALADSQRNPSNIS 132

Query: 124 GRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG 183
                + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G
Sbjct: 133 NSTSGVEG-----FMEFMSSGRRRKSRKYKKKGRRPGSKKEVAPDILKRFREALFLHAHG 192

Query: 184 QHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKL 243
           +  +A+ IL +V+ QAP    +Y+ L  V   +G  +  +     +AA++    S  WKL
Sbjct: 193 RDIEALPILVEVIKQAPAFDIAYYYLSRVSEQLGKTESSSTEALKIAANIKGSKSPFWKL 252

Query: 244 LFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL 303
           L+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++C 
Sbjct: 253 LYERFKEQENISVARSYASKAIQADPDDIPLKYEYADICLNTGKYREAAETFEQIFRRCP 312

Query: 304 GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALE 363
             +EAL  G + + K G  ERA  ILED+IK H +E   DV+DLLAS++M      +AL+
Sbjct: 313 ERIEALKWGVQYFLKSGEGERAASILEDHIKSHSSEVGHDVLDLLASVFMKINAHDRALK 372

Query: 364 RIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAAD 423
            I    ++Y  G EL  +L  +  ICHVHL  ME+AE + + L +E  ++H  L+   AD
Sbjct: 373 YIHDVRQIYNVGKELSSSLKIRQAICHVHLEEMEQAESVLSILPQEAVSEHPELITNLAD 432

Query: 424 SLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHL 483
            L ++ +++ ALKYY+  +SE VN     G L++KIA+CY+S +ER QAIVF+YK L  L
Sbjct: 433 ELTNIGNFHSALKYYIEAISEPVN-----GNLFVKIARCYMSLEERKQAIVFYYKALNEL 492

Query: 484 EDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIY 543
            D ++ R+TLASLLLE+ + +EA+ +LSPP++ +P    ++KLK WW N K+++ LCQIY
Sbjct: 493 SDTVDVRITLASLLLEDGKRDEAVLVLSPPENPDP---DTAKLKAWWKNRKIRMNLCQIY 552

Query: 544 KTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG 603
            + GMLE+F      LV + ++  T++ K K       R VL E  +    R   +    
Sbjct: 553 HSEGMLEDFANTALQLVLKWVWRRTVKGKRK-------RLVLSEHQRNKKRRRPRDAQAS 612

Query: 604 FKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------AL 663
                PK    K  + +  L +  RI+E    +A    V  +   + + DE        L
Sbjct: 613 QLRGGPK----KWRKIRATLNETRRIRERAAIKAHNEDVCSESEEEVIKDEEYHRLFVDL 672

Query: 664 CKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHV 723
           CKALASL R  EALEI++L  +L    L +E K+ELQ LGA+++  + D    F+  + V
Sbjct: 673 CKALASLQRYWEALEIVNLARRLDAKMLPVETKKELQSLGAKISCDTMDPKQWFDCVRSV 732

Query: 724 VKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ 783
           ++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +IAGH FT  S HQ
Sbjct: 733 IQQHPYRLNAWNCYYSVISRLGKRASTEAKFMHHLRSKYRDCVPPILIAGHHFTVTSRHQ 792

Query: 784 DAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNS 843
           DAAR+YLEAYK+MP+SPLINLCVGA+LINLALGFRL+N+H+C+AQG AFLY NL++C NS
Sbjct: 793 DAAREYLEAYKLMPESPLINLCVGAALINLALGFRLKNRHECLAQGFAFLYNNLRICSNS 852

Query: 844 QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRR 903
           QEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+  + + +K V CDLR+
Sbjct: 853 QEALYNVARAYQHVGLVTLAASYYEKVLAIYEKDYTMPKLPNEDPIVAEERKPVNCDLRK 896

Query: 904 EAAYNLHLIYKESGALDLARQVLKDHCTF 918
           EAA+NLHLIYK SGA DLARQVLKDHCTF
Sbjct: 913 EAAHNLHLIYKHSGAFDLARQVLKDHCTF 896

BLAST of HG10002345 vs. TAIR 10
Match: AT3G04240.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 50.1 bits (118), Expect = 1.1e-05
Identity = 32/136 (23.53%), Positives = 59/136 (43.38%), Query Frame = 0

Query: 173 ATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPK 232
           A L    G   +A+   ++ V   P  PD+Y  LG VY A+G   +A+  Y  A  + P 
Sbjct: 230 AGLFMESGDLNRALQYYKEAVKLKPAFPDAYLNLGNVYKALGRPTEAIMCYQHALQMRPN 289

Query: 233 DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYD 292
            +  +  + S   ++G +D A     +A+  +P  +    +  +   + G  ++A   Y+
Sbjct: 290 SAMAFGNIASIYYEQGQLDLAIRHYKQALSRDPRFLEAYNNLGNALKDIGRVDEAVRCYN 349

Query: 293 QI------HQQCLGNV 303
           Q       H Q + N+
Sbjct: 350 QCLALQPNHPQAMANL 365

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038879313.1 | 0.0e+00 | 93.73 | general transcription factor 3C polypeptide 3 [Benincasa hispida] >XP_038879314....
XP_011652346.1 | 0.0e+00 | 92.36 | general transcription factor 3C polypeptide 3 [Cucumis sativus] >KGN59936.2 hypo...
XP_008447634.1 | 0.0e+00 | 91.50 | PREDICTED: general transcription factor 3C polypeptide 3 isoform X1 [Cucumis mel...
XP_022152621.1 | 0.0e+00 | 89.21 | general transcription factor 3C polypeptide 3 isoform X3 [Momordica charantia]
XP_022152620.1 | 0.0e+00 | 87.05 | general transcription factor 3C polypeptide 3 isoform X2 [Momordica charantia]

Match Name | E-value | Identity | Description
Q9Y5Q9 | 1.5e-58 | 24.35 | General transcription factor 3C polypeptide 3 OS=Homo sapiens OX=9606 GN=GTF3C3 ...
O74458 | 3.3e-26 | 22.57 | Transcription factor tau subunit sfc4 OS=Schizosaccharomyces pombe (strain 972 /...
P33339 | 6.9e-16 | 19.75 | Transcription factor tau 131 kDa subunit OS=Saccharomyces cerevisiae (strain ATC...
Q61371 | 4.6e-04 | 23.83 | Intraflagellar transport protein 88 homolog OS=Mus musculus OX=10090 GN=Ift88 PE...

Match Name | E-value | Identity | Description
A0A1S3BHB9 | 0.0e+00 | 91.50 | general transcription factor 3C polypeptide 3 isoform X1 OS=Cucumis melo OX=3656...
A0A0A0LGB1 | 0.0e+00 | 89.81 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G848220 PE=4 SV=1
A0A6J1DIA5 | 0.0e+00 | 89.21 | general transcription factor 3C polypeptide 3 isoform X3 OS=Momordica charantia ...
A0A6J1DEF9 | 0.0e+00 | 87.05 | general transcription factor 3C polypeptide 3 isoform X2 OS=Momordica charantia ...
A0A6J1DGK2 | 0.0e+00 | 86.96 | general transcription factor 3C polypeptide 3 isoform X1 OS=Momordica charantia ...

Match Name | E-value | Identity | Description
AT1G17680.1 | 2.0e-188 | 43.06 | tetratricopeptide repeat (TPR)-containing protein
AT1G17680.2 | 2.0e-188 | 43.06 | tetratricopeptide repeat (TPR)-containing protein
AT3G04240.1 | 1.1e-05 | 23.53 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 33..58
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 34..62
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..62
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 166..199; e-value: 28.0; score: 11.0
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 268..301; e-value: 27.0; score: 11.1
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 200..233; e-value: 36.0; score: 10.1
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 447..480; e-value: 0.42; score: 19.7
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 830..863; e-value: 2.9; score: 16.9
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 234..267; e-value: 250.0; score: 2.6
IPR019734 | Tetratricopeptide repeat | PFAM | PF13174 | TPR_6 | coord: 831..860; e-value: 0.0073; score: 16.9
IPR019734 | Tetratricopeptide repeat | PROSITE | PS50005 | TPR | coord: 200..233; score: 8.3784
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 402..516; e-value: 4.2E-7; score: 31.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 99..401; e-value: 2.8E-32; score: 114.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 605..914; e-value: 1.4E-12; score: 49.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 744..865
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 169..304
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 251..500
IPR039340 | Transcription factor Tfc4/TFIIIC-102/Sfc4 | PANTHER | PTHR23082 | TRANSCRIPTION INITIATION FACTOR IIIC TFIIIC, POLYPEPTIDE 3-RELATED | coord: 41..917
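
The alignment column above gives each domain hit as `coord: start..end` on the 918-residue polypeptide. As a rough illustration, the sketch below pulls those coordinates out of the text and computes the length of each hit. It assumes the coordinates are 1-based and inclusive, which this record does not state explicitly. `parse_coords` and `span_length` are hypothetical helper names used only for this example.

```python
import re

# Matches entries such as "coord: 166..199" anywhere in a block of text.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in the text, in order of appearance."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def span_length(start, end):
    """Length of a hit, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # A few SMART SM00028 hits copied from the table above.
    smart_hits = "coord: 166..199 coord: 200..233 coord: 234..267 coord: 268..301"
    for start, end in parse_coords(smart_hits):
        print(start, end, span_length(start, end))
```

Under that assumption, each SMART SM00028 hit listed for this protein spans 34 residues, which matches the canonical length of a tetratricopeptide repeat.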

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10002345.1 | HG10002345.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006383 | transcription by RNA polymerase III
molecular_function | GO:0005515 | protein binding