Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, polypeptide, CDS, three_prime_UTR
GACAAATCAGTTCACATCCGACGAACAGAGGGTCGGTGGAGGGTATACAGAGTCCACTCTTCGCACACTAAAATTCTCTCTCTACAACCTCACTCCGAATCAACATTTTCTCTCTCTACCTCAACGGCGGAGCTCCGGTGACTCGACGGTTGTTTTCCGCCGGAATGTCGCTTCCCGGCCATTTTCCCTGCTCCAGTTGGGTCTATCTCTCTCTCTCTCTCTCTCTCTCTAATTTACAATGCAGCTTCTTGTTCAAAATGTCAGAATCTTGATTCTTTTACTTAGAAAAGCTGCGTGAAAATTGGCTCCATTTACAGTTTCGTACTGTATTTTTCTTTTGCTTTCTCCTTGATCTCGCTCTTTCAAAGCTCGTTCTCCAATTTTCATTTGTCTTTCTTTGTAGCTACTTGAATCTGACTTCACTCTCACTCTCTCTCTTTCTTTCTTACCTTTTTCTTTCCATTTTTTTCTTTTGTTCAGCTAACTTTTGAAGCTAGATTCTTCCCTTTTTATCTCTTTCTTTCTTTTGCCCCTTTCTCCTTGTCTCTGCGTAGACCCATTTTGAATCTGTAAAAAGTTGCTTTATTGGCTTTTGGTCACCTATTGGGCTCCTCTTGTTCTTAAGATTAGGGATTCAGTTCATAATTTTTAATTATATTTTTGGGTTTTCTTTTAGAATGTTTATCTTGTGGTTACCAGGTAAATTCAATGTACGTCACCCGAGACTCTGATTTTATGGCCTTTTTTTGCTCTTTGAACTCTCTATCCACCATGAAATTCCATAGCTGTTTACCTCATTGAAGGTGATCTTAAACCTAACTATTCTGGGTCAACTTCTCAGAAGCAGATCCTTGATTTTTTGCAGCCTAAATATTCAGGTTGGGAGACTTCAACGTCTTGGTTATTGTTCTTTATTTCTCTGTCTGTGTATGATTAGTTTAGCACTTTTACTTACTCTTAGATGTTCTGTTTCTTGTTCCATCGAAAGTTTCAGCATCACTGCAACCAAGGGATTAAAGGTATTGAGATGAATGGGATTCAGAGAAGAAAAGTTAGTAATAATGAAAAGCCTTTCCCTGGCTGTCTGGGAAGAATGGTGAACCTCTTTGATCTGAGTACGGGTGTCTCTAGGAACAAGCTTCTTACTGATGCACCACATCGTGAAGGTGACCTCTATTTCCCTCTAGATACCATATTTTGTTCAAATTACTTTAGTTTATCTTATTAAACTTTCATTGTTGTACAAGTATTGCTAGAATTTAGTGGGTTCTTCTCTTAGACTAAGCCTAATTTCCAATCTCGTCTCATCTATTCTGTCTCCTTTCCCCCATGATTTCGATTTGTCTTTGGCAGGGTCAAAGTACTCCATGATTCAGGTGCAATATCAGTATGTAAATATAGAACCTTTTTCCCGTCTGGATTTTTTTTTGGTGGGGTGGCGGAGTTACAAAATTTTGATTAGTTTCTTGGTGACTTGACCTGCTATCAATTGAAACTATCAGCAACGTGATTGGGCTATCAGTCTTACCGTCTGTTTGTGGTCTTGAGTAGCTGTAGCCACTAGCCAGCAGAAGGGACGAGAGAAGAGAAGGTGGATTGAAGAGAGGAGAAAAATGAAGGAGGAGATATTATTATTATTATTTTGGGGATTGGGGGGGGGGGGGGGTGCTCAAAATTAACGCCTCTTTTTTCTGGATTATTTGAATTAAAACCATAATTATTAACTAAGAAAATTAACTGACTGGACTCATTTTTTCACCTAGGACCTGTAGACTATTCTTTTTCTCTCTATTACTAGATTCATCCCAAACTAGTTTCCTCCTCACATAGGACATTGGCCGCTTTGGGTAAATAATTTATTCCTTTTCCATAATTATTCATGTATTTGTTCAATGGAAACTCCTTTTTAGTGTAAGTTATCTTTTTTTTGGTGGGACTCACACATACAGACACTTCTTCTGGTGCTTATATTTTGGTTGTGCTTAGGCTTTTTGTTCGAT
TGTGAACGAGCTGACAGACATGTGGCTATTTGGCAGGTTCTACTCCAAGGAATCAGGCTGACATGGCAAGGATGTTTAATCACTCCACAAATCAGACTGAAGATAATCGGGTAAGTATTGTGCTCCACTTGTCCTTATAAAATATTACAAAATCTAGTCATACGGAGCTGGAGGTTCAGAACCACGTATCAAAATTGTTCCGATTTAGTTATGTTATTCACAACGCATTATCTTGATACTCTGCATTTGTGTGGATTGATCAGAGTCGGACAATGCCAGAATTGCAGAGAGCATCAAATAAGAGAGCAAATGGAACACCTGTGAAGATGCTTATAGATCAAGAGATGTCCGAAATGGAGTGTACACAAAATCCACCCAATGTGGTTGCAAAGCTAATGGGGCTTGAAACTCTCCCTCACCAGCTTCCTGGTTCATCTGTTCAAAGAAATAATGTTAGAAGTTATCCGAAGAGTAAAATTGAAAACCATGGAAAGCCATTAGGATGCACGGAACAAAGTGATTTATTAGAAGAGGGAATGAAATGCCAAGTCAATGAATGTTCAGAGCAGAAAGAATGTAAAGATGTTTATGAAATATGGCAGCGATCTCCACAGGCAAATTATATTAGAGAAAAACGGCCAAAGGGAATAGAAAGTGAAGTTGTGAATGATAGAAAGATGGCTCTTGTTCGCCAGAAGTTTGTAGAAGCAAAACGTCTGGCTACAGACGAGAAACTGCGGCAATCCAAAGAATTTCAAGATGCACTAGAAGTTTTAAGTTCCAATAAAGATTTATTTGTCAAGTTTCTGCAGGAGCCAAATTATTTGTTTACTCAGCATCTGAATGAACTCCAGTCCATCCCTCCATCTCCTGAGACAAAGCGCATCACTGTTCTTAGACCTTCAAAGGTGTCTAGGGATGAAAGATTTACTGAATTTGAGAAACAAAGCTATAGACAAGCCAGGCTGCCAGTTCAGAGGGGTCAGTCAGCTACTTTGGATAAAAGTGATTCAAAACTTTCTCCTACTCCAGCTATTAATAGGACTAATGAATATGCAGTAGCTGTTCAACCAACAAGAATAGTGGTTTTAAAGCCAAGTCCTGGGAGGAATCATGATAATAAGCCTATAGTCTCATCGCCTGGTTCATTGCCTAGAGTGGTGCAAGATGGAAGTTTCAATGAAGGATATGAAGATGTTGATGTGAAGGAATCAAGAACATTTGCAAGGAATGTTACTCAGAAAATGTGTGACAATCTCTTGGGTCATCGAAGGGATGAAACTTTGCTATCTTCCGTGTTTTCAAATGGTTATACTGGTGATGAAAGTTCATTTGAAAAATCCGAGAATGACTATGCAGTAGAAAATCTGAGTGACTTGGAAGTCATGTCTTCCTCTTCTCGTCATTCTTGGGAATATATCAACAGATATAGTAGCCCATATTCTTCCTCCTCATTCAGCCGAATATCATGTTCTCCAGAGTCATCAGTGTGCAGAGAAGCTAAGAAGCGACTGTCGGAAAGGTGGTCCATGATGACGACACATGGGAACTATCAAGAGCGTAGGCATGTTAGGAGAAACTCAAGTACATTGGGTGAGATGCTTGCACTGTCAGATGCAAAGAAATCAACAGTAACGGATAATGTAGTTAATGAACATGAACCAAGTGAGTTAGATCACTGTTTTAATAGTGATGAAAACATAGAATGTCTGGATGATTCTCCCACAACACTTATGAAGTCAAAATCTGTTCTGGGATCTTCTGCATTGTTTGGTGTGCTTAATCTTGAAGCATCAGATCTTGAGACCATCAAAACTGATGATCCAAAGTTGCTAGCAAAGTCAAAGGGTGTTAAGTCATCATTTAATGAGAAAGTTTCAAGTTTATTTTTCTCTAGGAATAAGAAAACAAGTAAAGGAAAATATAGTGGATCTCAAACAAAAGATGAACCCCAATCTTGTAGTGCCGGAACACTCTCATCCTCAGCTTTCATCCATCAC
TCTAGAGGTTTGAGTAATGCTGCATCTCATTCTAACGATGGTGAAGGGTGCTCATCAGGCACTTCTTTCCTACATTTAACTAATGTGGTTGCAAGAGGAGGTGCAGTTCATCATGAGGTAAATCTTTCCTTCTGCGGTTGTAATATTTGCATGCTAATCTATTTGATGTTATATATCTCAATATCTATACATTATACATAGCACAGTCATAGAAATGTAGGGTTTTTCTTTTCTAAGTTATATTCTTGTTTGTTATTGTGATTAAATACGCAATTATATGTTGGTATGGAATTTTAAGCGATATATCCCAGCAAGTTCTTGCACCTTTTGGTCATTTTCAAATCTCATTTGCATATCATTGGTAACTTGTTTATATGTTAAGCTACTTGAGTTCTCCAATTTTTGGCCTAAGTTGTGACGAAATTTGGTTTTCTAATGTTTAGTATTGATGAATTTTCCTATTAAGTTTTGCATGTTTTTCTTTTGCATGCATCCAATTGGAGTTGCATTAAATGCCTGGCAGAGTAACTCAAATGCAGTTGACTTGTGTCTGCCTCAATAGTTCCATGAACATTTTTTCCCCATATAAATCAAACCATTTGATTGTGACTAGTAATTGAATTGGATCTGGATCGTTTCTTGTGATTTGCAGGTGGGGTTGTCTGTAAAAAGGCCCTTCGTATCTGGAAATGTTGGTGAAAATCAGGAGCAGCCAAGTCCAATATCCGTTCTGGAACCACCGTTTTTTGAAGATGACAACGCACATCTAGAATTGTCCAGCTATTTGAAGCCAAGGAATCAGGGTAATCATCCCCCTCCCCCCTCCTTCCAACTTAGTAATGCACACACTTAGTGAATACGTAAGCTAGAACTAAAATCGTTTCTACTTATCTATATCTGTCTGCATCTCATATAAATCTTGAACTGAACAGAGTTCTGTATGCCATTCAAGAATAGCCTCATCGACAAATCACCACCTATAGAATCAATTGCTCGTAGTATATTTTGGGATGGTTCTTATTCAGATTCATCTGCTCCTTGTGCACTCAAATCTTCACCCGTTTCCACTTGTCTGGAGGAAGAACAAAACTGGCATTGCCTTGTCAAAGCCCTTCTTACAATGTCTGGTCTCAGTAGTGAAGCACAACAATGTGGCTTATTGTTTACTAGATGGCATTCGCATGTCAATCCACTAGATCCATCGCTTAGAAACAAATATGCCAATCTAAGCAGCAAAGAGCCGATGCTTGAGGCCAAGCAAAGGCAGGTGCGATCAAGTAGGAAGCTTGTGTTCGATTGTGTCAACGCTGCCTTGATTGACATAACAAGTCAGGAACTAGATCACAGGCAAACCAAAATATCTAGCAGGGCCCATGACAGCAATTTTGCAGAGGACACATCACTAACATTATTGGACTGTGTTATGGTCAAACTGAAGGATTGGGTTTGTGGTGAACCTAGATGTGTTACAGGAGACATTGGGGACAGCAACAGCCTGGTGGTGGAAAGAGTAGTCAGAAAAGAAGTTGGTGGAAGAAACTGGGATGAGCATTTCAAGATGGAAATGGATAATTTGGGGAAGGAAGTAGAGAGGAGATTACTGGAAGAGCTTTTGGAAGAAGCTGTTGTTGAATTGACAGGTAAAGTTTAATTAGAATTTGCTTTTCTGTTGTGTTCCAAATCAATTCATAACCTCTCACCTTTGTATCTGATATTCAATAATGATGGTCCTCTGATTTATTTACTGTATGTTTAGCTAAGTCTACCTACTTACCTCTAATATTCATTTTGTGAAAATCCATCAAATATAATAGAAAGCAAAAGCTTTATTTTGATCATTGAATTTTCTATTTGATTCAGGCAAAGTTCTTAGAGGATTAAACTAGAATAAGAAAAAGAAAAGGGGACATTATCTTGTGATAGCCTGTGAAAGATTCATATTTGGATACCAGGGCATTAAGAAATAGGGGTTCAACTTTATTCATGGGGTCCATTGATTG
ACAGTGGATTCATTCTTCACACACTGAATCTAGTGGGAGTCTCTATCCATTTTGGAAGCCACACTGGTTCAAGGTAGACATGTGATTGCCTAAATTATTAAACTTGAATATTTACTATTCGGTGATAATAATATTTAATAAATAGGTTAAATTATAAAAATATTAAATTGAAAATTTGGTTTCTAGGTTAGAATTTAGTCCAATATGGTCTAAAATTAAATTTTAGTCCTTATAATTGTCTATTTACGAAGATTTTATCAAATTTTATTAAATCACAAAGGCTACTTATAAGGTTTTTATTAAACCATAGAGATTAAGGGGATTAATTTTGTTATTTAACCTTATTAAAGTATCTTCAACTTTTTCCTTAAAAAATACCTCAATTTGTCAAAAATTGCAGTATTACTCACCATTTCATCAATGTCTTAACACTATGTTCATAGGGAAGGCTTTTAAGTATAAAAAAGGTTGAGAGTAATATTACAATTTTTAAATGAATAGTTCATATTTTTTTAATAATTTAACCTAATAATTAAAGGAGTCTTGTTTTTATATTGCAGCTTTTGACTCATTAGTTTGTATTGATCAGCTGCAGCCTGCAACAGCCATACAAAATGTCCCTTTTATGTCGAATTATACATGCCTAGTAGTAACATGTGTATTTATGTATGGATATCGAAGGAGCCGTCATAGTTATAACCCTTAAAAGAAGGTTTCTACAAATAATCCATAAGATTAGATCTCACTCATTTAGTTAACATCAACCATCAACTTTACAAGATTATAATGGAAAAATAACTCATTCAAATTCCTCAACTTCGATGATATTCACATACTCATCATACACACCAGAACATTCCATTCTTTCACAAAGCTGATTGCAACCTATCCTATCAGACCAACCAAAATCAAAGCACACGACGTATAGTCGAACAGAAACTTACAGACACCAATGACCAGTTCACCTTGCAACTTTTGATTGAAGTCCAAGGTTAAATAACAACGTAGCAAAGAGAGAATCTGGTAGTAACGTCGTGGTTCCTTTGATCTGCGCAAGCAAGGCTTTCTGTTTCTTTGACTGCCCGAGTTTTCGATTTGATTACGAACTTTGGTTTGATGAATTGTATCTGTCACTGTTGTTTAATCCATACTGGAAAAGGATGGAAATGAAACTACATTTAGAAACAGATGTTGTGGAATTTACCCACCTCCCCTTTCTTAAAAGAGAACAGTTCATGGCAGAACACAATACCTTCTCTCTCATGCGGGCACCCCAAAGGTCAAAATGGGCACCAGAAGCTTGGTTTGCTGGTGGATTAAGCAACGGCTCCCGTGTTCTATCTTGAGCACCAACCTCCTCTTCTGTGTCGTACTCAGTTTTACGGGTTGAAACCATGGATCTCAGGATAATTGCTAGCAGCAGAG
mRNA sequence
GACAAATCAGTTCACATCCGACGAACAGAGGGTCGGTGGAGGGTATACAGAGTCCACTCTTCGCACACTAAAATTCTCTCTCTACAACCTCACTCCGAATCAACATTTTCTCTCTCTACCTCAACGGCGGAGCTCCGGTGACTCGACGGTTGTTTTCCGCCGGAATGTCGCTTCCCGGCCATTTTCCCTGCTCCAGTTGGGTCTATCTCTCTCTCTCTCTCTCTCTCTCTAATTTACAATGCAGCTTCTTGTTCAAAATGTCAGAATCTTGATTCTTTTACTTAGAAAAGCTGCGTGAAAATTGGCTCCATTTACAGTTTCGTACTGTATTTTTCTTTTGCTTTCTCCTTGATCTCGCTCTTTCAAAGCTCGTTCTCCAATTTTCATTTGTCTTTCTTTGTAGCTACTTGAATCTGACTTCACTCTCACTCTCTCTCTTTCTTTCTTACCTTTTTCTTTCCATTTTTTTCTTTTGTTCAGCTAACTTTTGAAGCTAGATTCTTCCCTTTTTATCTCTTTCTTTCTTTTGCCCCTTTCTCCTTGTCTCTGCGTAGACCCATTTTGAATCTGTAAAAAGTTGCTTTATTGGCTTTTGGTCACCTATTGGGCTCCTCTTGTTCTTAAGATTAGGGATTCAGTTCATAATTTTTAATTATATTTTTGGGTTTTCTTTTAGAATGTTTATCTTGTGGTTACCAGGTAAATTCAATGTACGTCACCCGAGACTCTGATTTTATGGCCTTTTTTTGCTCTTTGAACTCTCTATCCACCATGAAATTCCATAGCTGTTTACCTCATTGAAGGTGATCTTAAACCTAACTATTCTGGGTCAACTTCTCAGAAGCAGATCCTTGATTTTTTGCAGCCTAAATATTCAGTTTCAGCATCACTGCAACCAAGGGATTAAAGGTATTGAGATGAATGGGATTCAGAGAAGAAAAGTTAGTAATAATGAAAAGCCTTTCCCTGGCTGTCTGGGAAGAATGGTGAACCTCTTTGATCTGAGTACGGGTGTCTCTAGGAACAAGCTTCTTACTGATGCACCACATCGTGAAGGTTCTACTCCAAGGAATCAGGCTGACATGGCAAGGATGTTTAATCACTCCACAAATCAGACTGAAGATAATCGGAGTCGGACAATGCCAGAATTGCAGAGAGCATCAAATAAGAGAGCAAATGGAACACCTGTGAAGATGCTTATAGATCAAGAGATGTCCGAAATGGAGTGTACACAAAATCCACCCAATGTGGTTGCAAAGCTAATGGGGCTTGAAACTCTCCCTCACCAGCTTCCTGGTTCATCTGTTCAAAGAAATAATGTTAGAAGTTATCCGAAGAGTAAAATTGAAAACCATGGAAAGCCATTAGGATGCACGGAACAAAGTGATTTATTAGAAGAGGGAATGAAATGCCAAGTCAATGAATGTTCAGAGCAGAAAGAATGTAAAGATGTTTATGAAATATGGCAGCGATCTCCACAGGCAAATTATATTAGAGAAAAACGGCCAAAGGGAATAGAAAGTGAAGTTGTGAATGATAGAAAGATGGCTCTTGTTCGCCAGAAGTTTGTAGAAGCAAAACGTCTGGCTACAGACGAGAAACTGCGGCAATCCAAAGAATTTCAAGATGCACTAGAAGTTTTAAGTTCCAATAAAGATTTATTTGTCAAGTTTCTGCAGGAGCCAAATTATTTGTTTACTCAGCATCTGAATGAACTCCAGTCCATCCCTCCATCTCCTGAGACAAAGCGCATCACTGTTCTTAGACCTTCAAAGGTGTCTAGGGATGAAAGATTTACTGAATTTGAGAAACAAAGCTATAGACAAGCCAGGCTGCCAGTTCAGAGGGGTCAGTCAGCTACTTTGGATAAAAGTGATTCAAAACTTTCTCCTACTCCAGCTATTAATAGGACTAATGAATATGCAGTAGCTGTTCAACCAACAAGAATAGTGGTTTTAAAGCCAAGTCCTGGGAGGAATCATGATAATAAGCCTATAGTC
TCATCGCCTGGTTCATTGCCTAGAGTGGTGCAAGATGGAAGTTTCAATGAAGGATATGAAGATGTTGATGTGAAGGAATCAAGAACATTTGCAAGGAATGTTACTCAGAAAATGTGTGACAATCTCTTGGGTCATCGAAGGGATGAAACTTTGCTATCTTCCGTGTTTTCAAATGGTTATACTGGTGATGAAAGTTCATTTGAAAAATCCGAGAATGACTATGCAGTAGAAAATCTGAGTGACTTGGAAGTCATGTCTTCCTCTTCTCGTCATTCTTGGGAATATATCAACAGATATAGTAGCCCATATTCTTCCTCCTCATTCAGCCGAATATCATGTTCTCCAGAGTCATCAGTGTGCAGAGAAGCTAAGAAGCGACTGTCGGAAAGGTGGTCCATGATGACGACACATGGGAACTATCAAGAGCGTAGGCATGTTAGGAGAAACTCAAGTACATTGGGTGAGATGCTTGCACTGTCAGATGCAAAGAAATCAACAGTAACGGATAATGTAGTTAATGAACATGAACCAAGTGAGTTAGATCACTGTTTTAATAGTGATGAAAACATAGAATGTCTGGATGATTCTCCCACAACACTTATGAAGTCAAAATCTGTTCTGGGATCTTCTGCATTGTTTGGTGTGCTTAATCTTGAAGCATCAGATCTTGAGACCATCAAAACTGATGATCCAAAGTTGCTAGCAAAGTCAAAGGGTGTTAAGTCATCATTTAATGAGAAAGTTTCAAGTTTATTTTTCTCTAGGAATAAGAAAACAAGTAAAGGAAAATATAGTGGATCTCAAACAAAAGATGAACCCCAATCTTGTAGTGCCGGAACACTCTCATCCTCAGCTTTCATCCATCACTCTAGAGGTTTGAGTAATGCTGCATCTCATTCTAACGATGGTGAAGGGTGCTCATCAGGCACTTCTTTCCTACATTTAACTAATGTGGTTGCAAGAGGAGGTGCAGTTCATCATGAGGTGGGGTTGTCTGTAAAAAGGCCCTTCGTATCTGGAAATGTTGGTGAAAATCAGGAGCAGCCAAGTCCAATATCCGTTCTGGAACCACCGTTTTTTGAAGATGACAACGCACATCTAGAATTGTCCAGCTATTTGAAGCCAAGGAATCAGGAGTTCTGTATGCCATTCAAGAATAGCCTCATCGACAAATCACCACCTATAGAATCAATTGCTCGTAGTATATTTTGGGATGGTTCTTATTCAGATTCATCTGCTCCTTGTGCACTCAAATCTTCACCCGTTTCCACTTGTCTGGAGGAAGAACAAAACTGGCATTGCCTTGTCAAAGCCCTTCTTACAATGTCTGGTCTCAGTAGTGAAGCACAACAATGTGGCTTATTGTTTACTAGATGGCATTCGCATGTCAATCCACTAGATCCATCGCTTAGAAACAAATATGCCAATCTAAGCAGCAAAGAGCCGATGCTTGAGGCCAAGCAAAGGCAGGTGCGATCAAGTAGGAAGCTTGTGTTCGATTGTGTCAACGCTGCCTTGATTGACATAACAAGTCAGGAACTAGATCACAGGCAAACCAAAATATCTAGCAGGGCCCATGACAGCAATTTTGCAGAGGACACATCACTAACATTATTGGACTGTGTTATGGTCAAACTGAAGGATTGGGTTTGTGGTGAACCTAGATGTGTTACAGGAGACATTGGGGACAGCAACAGCCTGGTGGTGGAAAGAGTAGTCAGAAAAGAAGTTGGTGGAAGAAACTGGGATGAGCATTTCAAGATGGAAATGGATAATTTGGGGAAGGAAGTAGAGAGGAGATTACTGGAAGAGCTTTTGGAAGAAGCTGTTGTTGAATTGACAGGCAAAGTTCTTAGAGGATTAAACTAGAATAAGAAAAAGAAAAGGGGACATTATCTTGTGATAGCCTGTGAAAGATTCATATTTGGATACCAGGGCATTAAGAAATAGGGGTTCAACTTTATTCATGGGGTCCATTGATTGACAGTGGATTCATTCTT
CACACACTGAATCTAGTGGGAGTCTCTATCCATTTTGGAAGCCACACTGGTTCAAGCTTTTGACTCATTAGTTTGTATTGATCAGCTGCAGCCTGCAACAGCCATACAAAATGTCCCTTTTATGTCGAATTATACATGCCTAGTAGTAACATGTGTATTTATGTATGGATATCGAAGGAGCCGTCATAGTTATAACCCTTAAAAGAAGGTTTCTACAAATAATCCATAAGATTAGATCTCACTCATTTAGTTAACATCAACCATCAACTTTACAAGATTATAATGGAAAAATAACTCATTCAAATTCCTCAACTTCGATGATATTCACATACTCATCATACACACCAGAACATTCCATTCTTTCACAAAGCTGATTGCAACCTATCCTATCAGACCAACCAAAATCAAAGCACACGACGTATAGTCGAACAGAAACTTACAGACACCAATGACCAGTTCACCTTGCAACTTTTGATTGAAGTCCAAGGTTAAATAACAACGTAGCAAAGAGAGAATCTGGTAGTAACGTCGTGGTTCCTTTGATCTGCGCAAGCAAGGCTTTCTGTTTCTTTGACTGCCCGAGTTTTCGATTTGATTACGAACTTTGGTTTGATGAATTGTATCTGTCACTGTTGTTTAATCCATACTGGAAAAGGATGGAAATGAAACTACATTTAGAAACAGATGTTGTGGAATTTACCCACCTCCCCTTTCTTAAAAGAGAACAGTTCATGGCAGAACACAATACCTTCTCTCTCATGCGGGCACCCCAAAGGTCAAAATGGGCACCAGAAGCTTGGTTTGCTGGTGGATTAAGCAACGGCTCCCGTGTTCTATCTTGAGCACCAACCTCCTCTTCTGTGTCGTACTCAGTTTTACGGGTTGAAACCATGGATCTCAGGATAATTGCTAGCAGCAGAG
Coding sequence (CDS)
ATGAATGGGATTCAGAGAAGAAAAGTTAGTAATAATGAAAAGCCTTTCCCTGGCTGTCTGGGAAGAATGGTGAACCTCTTTGATCTGAGTACGGGTGTCTCTAGGAACAAGCTTCTTACTGATGCACCACATCGTGAAGGTTCTACTCCAAGGAATCAGGCTGACATGGCAAGGATGTTTAATCACTCCACAAATCAGACTGAAGATAATCGGAGTCGGACAATGCCAGAATTGCAGAGAGCATCAAATAAGAGAGCAAATGGAACACCTGTGAAGATGCTTATAGATCAAGAGATGTCCGAAATGGAGTGTACACAAAATCCACCCAATGTGGTTGCAAAGCTAATGGGGCTTGAAACTCTCCCTCACCAGCTTCCTGGTTCATCTGTTCAAAGAAATAATGTTAGAAGTTATCCGAAGAGTAAAATTGAAAACCATGGAAAGCCATTAGGATGCACGGAACAAAGTGATTTATTAGAAGAGGGAATGAAATGCCAAGTCAATGAATGTTCAGAGCAGAAAGAATGTAAAGATGTTTATGAAATATGGCAGCGATCTCCACAGGCAAATTATATTAGAGAAAAACGGCCAAAGGGAATAGAAAGTGAAGTTGTGAATGATAGAAAGATGGCTCTTGTTCGCCAGAAGTTTGTAGAAGCAAAACGTCTGGCTACAGACGAGAAACTGCGGCAATCCAAAGAATTTCAAGATGCACTAGAAGTTTTAAGTTCCAATAAAGATTTATTTGTCAAGTTTCTGCAGGAGCCAAATTATTTGTTTACTCAGCATCTGAATGAACTCCAGTCCATCCCTCCATCTCCTGAGACAAAGCGCATCACTGTTCTTAGACCTTCAAAGGTGTCTAGGGATGAAAGATTTACTGAATTTGAGAAACAAAGCTATAGACAAGCCAGGCTGCCAGTTCAGAGGGGTCAGTCAGCTACTTTGGATAAAAGTGATTCAAAACTTTCTCCTACTCCAGCTATTAATAGGACTAATGAATATGCAGTAGCTGTTCAACCAACAAGAATAGTGGTTTTAAAGCCAAGTCCTGGGAGGAATCATGATAATAAGCCTATAGTCTCATCGCCTGGTTCATTGCCTAGAGTGGTGCAAGATGGAAGTTTCAATGAAGGATATGAAGATGTTGATGTGAAGGAATCAAGAACATTTGCAAGGAATGTTACTCAGAAAATGTGTGACAATCTCTTGGGTCATCGAAGGGATGAAACTTTGCTATCTTCCGTGTTTTCAAATGGTTATACTGGTGATGAAAGTTCATTTGAAAAATCCGAGAATGACTATGCAGTAGAAAATCTGAGTGACTTGGAAGTCATGTCTTCCTCTTCTCGTCATTCTTGGGAATATATCAACAGATATAGTAGCCCATATTCTTCCTCCTCATTCAGCCGAATATCATGTTCTCCAGAGTCATCAGTGTGCAGAGAAGCTAAGAAGCGACTGTCGGAAAGGTGGTCCATGATGACGACACATGGGAACTATCAAGAGCGTAGGCATGTTAGGAGAAACTCAAGTACATTGGGTGAGATGCTTGCACTGTCAGATGCAAAGAAATCAACAGTAACGGATAATGTAGTTAATGAACATGAACCAAGTGAGTTAGATCACTGTTTTAATAGTGATGAAAACATAGAATGTCTGGATGATTCTCCCACAACACTTATGAAGTCAAAATCTGTTCTGGGATCTTCTGCATTGTTTGGTGTGCTTAATCTTGAAGCATCAGATCTTGAGACCATCAAAACTGATGATCCAAAGTTGCTAGCAAAGTCAAAGGGTGTTAAGTCATCATTTAATGAGAAAGTTTCAAGTTTATTTTTCTCTAGGAATAAGAAAACAAGTAAAGGAAAATATAGTGGATCTCAAACAAAAGATGAACCCCAATCTTGTAGTGCCGGAACACTCTCATCCTCAGCTTTCATCCATCACTCTAGAGGTTTGAGTAATGCTGCATCTCATTCTAACGATGGTGAAGGGTG
CTCATCAGGCACTTCTTTCCTACATTTAACTAATGTGGTTGCAAGAGGAGGTGCAGTTCATCATGAGGTGGGGTTGTCTGTAAAAAGGCCCTTCGTATCTGGAAATGTTGGTGAAAATCAGGAGCAGCCAAGTCCAATATCCGTTCTGGAACCACCGTTTTTTGAAGATGACAACGCACATCTAGAATTGTCCAGCTATTTGAAGCCAAGGAATCAGGAGTTCTGTATGCCATTCAAGAATAGCCTCATCGACAAATCACCACCTATAGAATCAATTGCTCGTAGTATATTTTGGGATGGTTCTTATTCAGATTCATCTGCTCCTTGTGCACTCAAATCTTCACCCGTTTCCACTTGTCTGGAGGAAGAACAAAACTGGCATTGCCTTGTCAAAGCCCTTCTTACAATGTCTGGTCTCAGTAGTGAAGCACAACAATGTGGCTTATTGTTTACTAGATGGCATTCGCATGTCAATCCACTAGATCCATCGCTTAGAAACAAATATGCCAATCTAAGCAGCAAAGAGCCGATGCTTGAGGCCAAGCAAAGGCAGGTGCGATCAAGTAGGAAGCTTGTGTTCGATTGTGTCAACGCTGCCTTGATTGACATAACAAGTCAGGAACTAGATCACAGGCAAACCAAAATATCTAGCAGGGCCCATGACAGCAATTTTGCAGAGGACACATCACTAACATTATTGGACTGTGTTATGGTCAAACTGAAGGATTGGGTTTGTGGTGAACCTAGATGTGTTACAGGAGACATTGGGGACAGCAACAGCCTGGTGGTGGAAAGAGTAGTCAGAAAAGAAGTTGGTGGAAGAAACTGGGATGAGCATTTCAAGATGGAAATGGATAATTTGGGGAAGGAAGTAGAGAGGAGATTACTGGAAGAGCTTTTGGAAGAAGCTGTTGTTGAATTGACAGGCAAAGTTCTTAGAGGATTAAACTAG
Protein sequence
MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGSTPRNQADMARMFNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMSEMECTQNPPNVVAKLMGLETLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKDVYEIWQRSPQANYIREKRPKGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDALEVLSSNKDLFVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFEKQSYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHDNKPIVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSVFSNGYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPESSVCREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHEPSELDHCFNSDENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKLLAKSKGVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGLSNAASHSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGENQEQPSPISVLEPPFFEDDNAHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFWDGSYSDSSAPCALKSSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQQCGLLFTRWHSHVNPLDPSLRNKYANLSSKEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHDSNFAEDTSLTLLDCVMVKLKDWVCGEPRCVTGDIGDSNSLVVERVVRKEVGGRNWDEHFKMEMDNLGKEVERRLLEELLEEAVVELTGKVLRGLN
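The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The following is a minimal Python sketch, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. It reads the CDS three bases at a time and stops at the first stop codon.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGAATGGGATTCAGAGAAGAAAA"))     # MNGIQRRK
# Last nine codons of the CDS, including the terminal TAG stop.
print(translate("GGCAAAGTTCTTAGAGGATTAAACTAG"))  # GKVLRGLN
```

Both fragments agree with the start (MNGIQRRK) and end (GKVLRGLN) of the protein sequence listed above.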
Homology
BLAST of Bhi05G000096 vs. TAIR 10
Match:
AT4G28760.1 (Protein of unknown function (DUF3741))
HSP 1 Score: 590.9 bits (1522), Expect = 1.9e-168
Identity = 419/988 (42.41%), Positives = 573/988 (58.00%), Query Frame = 0
Query: 1 MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGST-PRNQADMARM 60
MN ++ RK E P PGCLG+MVNLFDL V+ NKLLTD PH +GS+ R+++D+ RM
Sbjct: 1 MNELRGRKAQKIESPVPGCLGKMVNLFDLGIAVNGNKLLTDKPHLDGSSLSRSRSDVTRM 60
Query: 61 FNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMS-EMECTQNPPNVVAKLMGL 120
S + M +L+R+++ + +GTP+K LI +EMS E+E Q+P NVVAKLMGL
Sbjct: 61 PGPS-YKGHSEAELIMSDLRRSASSKLSGTPMKKLIAREMSKEVEHKQSPTNVVAKLMGL 120
Query: 121 ETLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKD 180
ETLP ++ QR+ RS S + NH + E K Q +E KD
Sbjct: 121 ETLPQTHQETATQRSKSRSNSHSSL-NH-------SMTSTDNEVQKYQ----DFSREFKD 180
Query: 181 VYEIWQRSPQANYIREKRP-KGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQD 240
VYE WQ + + R+ P KG E +++MALVRQKF EAKRL TD+ L QSKEFQD
Sbjct: 181 VYETWQSPQKVSRSRDCSPRKGRYDESTTEKQMALVRQKFSEAKRLVTDDSLHQSKEFQD 240
Query: 241 ALEVLSSNKDLFVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFE 300
ALEVLSSNKDLFV+FLQE N Q+L++ +PP E KRITVLRPSK E++ +
Sbjct: 241 ALEVLSSNKDLFVQFLQESNSFSQQNLSDFHHVPPHSEAKRITVLRPSKAGETEKYV-VQ 300
Query: 301 KQSYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHDN 360
+ +Q + Q D P+P +NR E VQPTRIVVLKPS G++ D
Sbjct: 301 GRRNKQVKKLASSSQETGWGNRDLGY-PSPYVNRGTE-EHTVQPTRIVVLKPSLGKSLDI 360
Query: 361 KPIVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSVF 420
K + SS S PR + + + EDV+ KE A+ +T+++ +NL+GH R+ET SSV
Sbjct: 361 KAVSSSQSS-PRGLHSRGYFDEPEDVETKE---VAKEITRQVRENLMGHHRNETQSSSVL 420
Query: 421 SNGYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPE 480
SNGY GD+SSF KS+N+ V NLSD E+MS +SRHSW+ NR+ S +S SSFSR S SPE
Sbjct: 421 SNGYIGDDSSFNKSDNEDLVGNLSDSEIMSPASRHSWDCPNRFDSLFSPSSFSRASFSPE 480
Query: 481 SSVCREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHE 540
SSVCREAKKRLSERW++M+ G Q +HV R SSTLGEMLAL++ K +T + E
Sbjct: 481 SSVCREAKKRLSERWALMSVSGRTQPLKHVSRTSSTLGEMLALTETKVTTESGEGSYEIV 540
Query: 541 PSE--LDHCFNSD-ENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKL 600
P+ C SD +E DS L +SKSV LN E S L + K P+
Sbjct: 541 PATRVSTSCITSDLSQVEMASDSLNILARSKSVSDVR-----LNGETSVLGSSKVQAPRE 600
Query: 601 LAKSKGVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGL 660
L K+ +KSS+ KVS+LFF +N K SK K SQ Q + ++ +
Sbjct: 601 LTKTGSLKSSW--KVSNLFFFKNNKASKEKRDASQCSSMSQLAAPSPVTLT--------- 660
Query: 661 SNAASHSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGENQEQPSPIS 720
E C L + + + E ++ +P +GN ENQ+QPSPIS
Sbjct: 661 ------GKTSEDCVFPIDCLPPVS-SEQQSIILGEEEVTTPKPLATGNTSENQDQPSPIS 720
Query: 721 VLEPPFFEDDNAHLELSSYLKP-RNQEFCMPFKNSLIDKSPPIESIARSIFW-DGSYSDS 780
VL PPF E+ + E S K +Q M K++LIDKSPPI SIAR + W D S +D+
Sbjct: 721 VLFPPFEEECASIPECSGSTKHWSSQGDEMSLKSNLIDKSPPIGSIARLLSWDDDSCTDN 780
Query: 781 SAPCALKSSPVSTCLEEEQNWHCLVKALLTMSGLSSEA-QQCGLLFTRWHSHVNPLDPSL 840
A A+ + EE++WH ++ +LT +G SS + +RWH +PLDPSL
Sbjct: 781 IAKPAMG-------VHEEEDWHLFIEMILTAAGFSSGCIVSHDPIMSRWHMPNSPLDPSL 840
Query: 841 RNKYANLSS---KEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHD 900
R+KY N + KE + E K+RQ RS+RKL+FD +N+ + + T ++R +
Sbjct: 841 RDKYTNPDNNNIKEFIHEGKRRQQRSTRKLIFDRINSIVSETT-----------TTRTGN 900
Query: 901 SNFAEDTSLTLLDCVMVKLKDWVCGEP-RCVTGDIGDSNSLVVERVVRKEVGGRNWDEHF 960
+ D L++ V +LKDWV EP + +G+ D+NSL E +V+ E+ GR W
Sbjct: 901 GSLHFD----LVEHVWAQLKDWVSDEPSKRDSGEDMDANSLAAESLVKDEIVGRTWTHSL 923
Query: 961 KMEMDNLGKEVERRLLEELLEEAVVELT 976
++E+D+ G E+E+RLL+EL+EEAV++LT
Sbjct: 961 QVEIDDFGIEIEKRLLQELVEEAVIDLT 923
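The percentages in the HSP summary line are the match counts divided by the alignment length (gaps included). A minimal Python sketch of that arithmetic, using the counts reported for this hit; the helper name `percent` is illustrative only:

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format a match count as a percentage of the alignment length."""
    return f"{100 * matches / alignment_length:.2f}%"


# Counts from the AT4G28760.1 hit: 419 identical and 573 positive
# positions over an alignment of 988 columns.
print("Identity  =", percent(419, 988))  # 42.41%
print("Positives =", percent(573, 988))  # 58.00%
```

"Positives" counts identical residues plus substitutions that score above zero in the scoring matrix, so it is always at least as large as the identity count.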
BLAST of Bhi05G000096 vs. TAIR 10
Match:
AT4G28760.2 (Protein of unknown function (DUF3741))
HSP 1 Score: 590.9 bits (1522), Expect = 1.9e-168
Identity = 419/988 (42.41%), Positives = 573/988 (58.00%), Query Frame = 0
Query: 1 MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGST-PRNQADMARM 60
MN ++ RK E P PGCLG+MVNLFDL V+ NKLLTD PH +GS+ R+++D+ RM
Sbjct: 1 MNELRGRKAQKIESPVPGCLGKMVNLFDLGIAVNGNKLLTDKPHLDGSSLSRSRSDVTRM 60
Query: 61 FNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMS-EMECTQNPPNVVAKLMGL 120
S + M +L+R+++ + +GTP+K LI +EMS E+E Q+P NVVAKLMGL
Sbjct: 61 PGPS-YKGHSEAELIMSDLRRSASSKLSGTPMKKLIAREMSKEVEHKQSPTNVVAKLMGL 120
Query: 121 ETLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKD 180
ETLP ++ QR+ RS S + NH + E K Q +E KD
Sbjct: 121 ETLPQTHQETATQRSKSRSNSHSSL-NH-------SMTSTDNEVQKYQ----DFSREFKD 180
Query: 181 VYEIWQRSPQANYIREKRP-KGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQD 240
VYE WQ + + R+ P KG E +++MALVRQKF EAKRL TD+ L QSKEFQD
Sbjct: 181 VYETWQSPQKVSRSRDCSPRKGRYDESTTEKQMALVRQKFSEAKRLVTDDSLHQSKEFQD 240
Query: 241 ALEVLSSNKDLFVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFE 300
ALEVLSSNKDLFV+FLQE N Q+L++ +PP E KRITVLRPSK E++ +
Sbjct: 241 ALEVLSSNKDLFVQFLQESNSFSQQNLSDFHHVPPHSEAKRITVLRPSKAGETEKYV-VQ 300
Query: 301 KQSYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHDN 360
+ +Q + Q D P+P +NR E VQPTRIVVLKPS G++ D
Sbjct: 301 GRRNKQVKKLASSSQETGWGNRDLGY-PSPYVNRGTE-EHTVQPTRIVVLKPSLGKSLDI 360
Query: 361 KPIVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSVF 420
K + SS S PR + + + EDV+ KE A+ +T+++ +NL+GH R+ET SSV
Sbjct: 361 KAVSSSQSS-PRGLHSRGYFDEPEDVETKE---VAKEITRQVRENLMGHHRNETQSSSVL 420
Query: 421 SNGYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPE 480
SNGY GD+SSF KS+N+ V NLSD E+MS +SRHSW+ NR+ S +S SSFSR S SPE
Sbjct: 421 SNGYIGDDSSFNKSDNEDLVGNLSDSEIMSPASRHSWDCPNRFDSLFSPSSFSRASFSPE 480
Query: 481 SSVCREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHE 540
SSVCREAKKRLSERW++M+ G Q +HV R SSTLGEMLAL++ K +T + E
Sbjct: 481 SSVCREAKKRLSERWALMSVSGRTQPLKHVSRTSSTLGEMLALTETKVTTESGEGSYEIV 540
Query: 541 PSE--LDHCFNSD-ENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKL 600
P+ C SD +E DS L +SKSV LN E S L + K P+
Sbjct: 541 PATRVSTSCITSDLSQVEMASDSLNILARSKSVSDVR-----LNGETSVLGSSKVQAPRE 600
Query: 601 LAKSKGVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGL 660
L K+ +KSS+ KVS+LFF +N K SK K SQ Q + ++ +
Sbjct: 601 LTKTGSLKSSW--KVSNLFFFKNNKASKEKRDASQCSSMSQLAAPSPVTLT--------- 660
Query: 661 SNAASHSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGENQEQPSPIS 720
E C L + + + E ++ +P +GN ENQ+QPSPIS
Sbjct: 661 ------GKTSEDCVFPIDCLPPVS-SEQQSIILGEEEVTTPKPLATGNTSENQDQPSPIS 720
Query: 721 VLEPPFFEDDNAHLELSSYLKP-RNQEFCMPFKNSLIDKSPPIESIARSIFW-DGSYSDS 780
VL PPF E+ + E S K +Q M K++LIDKSPPI SIAR + W D S +D+
Sbjct: 721 VLFPPFEEECASIPECSGSTKHWSSQGDEMSLKSNLIDKSPPIGSIARLLSWDDDSCTDN 780
Query: 781 SAPCALKSSPVSTCLEEEQNWHCLVKALLTMSGLSSEA-QQCGLLFTRWHSHVNPLDPSL 840
A A+ + EE++WH ++ +LT +G SS + +RWH +PLDPSL
Sbjct: 781 IAKPAMG-------VHEEEDWHLFIEMILTAAGFSSGCIVSHDPIMSRWHMPNSPLDPSL 840
Query: 841 RNKYANLSS---KEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHD 900
R+KY N + KE + E K+RQ RS+RKL+FD +N+ + + T ++R +
Sbjct: 841 RDKYTNPDNNNIKEFIHEGKRRQQRSTRKLIFDRINSIVSETT-----------TTRTGN 900
Query: 901 SNFAEDTSLTLLDCVMVKLKDWVCGEP-RCVTGDIGDSNSLVVERVVRKEVGGRNWDEHF 960
+ D L++ V +LKDWV EP + +G+ D+NSL E +V+ E+ GR W
Sbjct: 901 GSLHFD----LVEHVWAQLKDWVSDEPSKRDSGEDMDANSLAAESLVKDEIVGRTWTHSL 923
Query: 961 KMEMDNLGKEVERRLLEELLEEAVVELT 976
++E+D+ G E+E+RLL+EL+EEAV++LT
Sbjct: 961 QVEIDDFGIEIEKRLLQELVEEAVIDLT 923
BLAST of Bhi05G000096 vs. TAIR 10
Match:
AT2G20240.1 (Protein of unknown function (DUF3741))
HSP 1 Score: 406.4 bits (1043), Expect = 6.7e-113
Identity = 334/915 (36.50%), Positives = 473/915 (51.69%), Query Frame = 0
Query: 75 MPELQR---ASNKRANGTPVKMLIDQEMSE--MECTQNPPNVVAKLMGLETLPHQLPGSS 134
MP+L+ + +K + +K LI +EMS+ +E Q+ NVVAKLMGLET
Sbjct: 1 MPDLRPGVFSKSKETSTESMKKLIAREMSKDVVEDRQSSNNVVAKLMGLET--------- 60
Query: 135 VQRNNVRSYPKSKIENHGK-PLGCTEQSDLLEEGMKCQVNECSEQKECKDVYEIWQRSPQ 194
S P+S+ ++ + L C E G + +E +QK
Sbjct: 61 -------SAPRSRSKSSSRCSLTCVGSK---EAGKHHREDETWDQK-------------- 120
Query: 195 ANYIREKRPKGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDALEVLSSNKDL 254
A+ + K ++D++M LVR+KF+EAK L TD++L +S E Q+AL+VLSSNKDL
Sbjct: 121 ASNLSSKAS-------MSDKQMDLVRRKFMEAKHLVTDDRLHRSSELQEALQVLSSNKDL 180
Query: 255 FVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFEKQSYRQARLPV 314
FVKFLQE N LF QHL++ Q +PP P+ KRITVLRPSK
Sbjct: 181 FVKFLQESNSLFPQHLSDFQPVPPHPDAKRITVLRPSK---------------------- 240
Query: 315 QRGQSATLDKSDSKLSPTPA-INRTNEYAVAVQPTRIVVLKPSPGRNHDNKPIVSSPGSL 374
+ + K ++ S PA +N+ + AVQPTRIVVLKPSPG++ D K I SSP
Sbjct: 241 ----AVGVQKCLAEDSKKPASLNQETGWIDAVQPTRIVVLKPSPGKSLDIKAIASSP--- 300
Query: 375 PRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETL---LSSVFSNGYTGD 434
+++ E+R A+ +T+++ + + GH R+ETL SSV SNGY GD
Sbjct: 301 ----------PYFDEAGDAETREVAKEITRQIRETVEGHCRNETLSSSSSSVLSNGYMGD 360
Query: 435 ESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPESSVCREA 494
+ S +S +Y V N+++ E+MS SSRHSW+ N++ SP+SSSS SR+S SP+SSV REA
Sbjct: 361 DCSLNRSNYEYLVGNITNSEIMSPSSRHSWDCANKFESPFSSSSLSRVSFSPDSSVYREA 420
Query: 495 KKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVN--EHEPSELD 554
KKRLSERW+MM+ +G+ Q+ ++ + S+ LGE+LALS+ K T + N + E
Sbjct: 421 KKRLSERWAMMSLNGDTQQPKNFPKVSTALGEVLALSETKVPTGSSEETNKVKQETRRSI 480
Query: 555 HCFNSD-ENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKLLAKSKGV 614
C S + +E DS L +S+SV + L T K P+ L +S+ +
Sbjct: 481 SCIGSGLDQVESTSDSLNILERSRSV-------PEIRLNGG---TSKAQAPQELTESRSL 540
Query: 615 KSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGLSNAASHS 674
KSS+ KVSSLFF RNKK++K K T + S H +
Sbjct: 541 KSSW--KVSSLFFFRNKKSNKDK----------------TFAPSQLAIHRDAFQEQRIFT 600
Query: 675 NDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGENQEQPSPISVLEPPFF 734
++G+ ENQ+QPSP+SVL+P F
Sbjct: 601 SEGD------------------------------------VENENQDQPSPVSVLQPAFE 660
Query: 735 EDDNAHLELSSYLKPR-NQEFCMPFKNSLIDKSPPIESIARSIFW-DGSYSDSSAPCALK 794
E E S +KP+ Q M K++LIDKSPPI +IAR + W D SY+D+S P
Sbjct: 661 E------ECSGSVKPKTTQGEEMSLKSNLIDKSPPIGTIARILAWEDESYTDTSKP---- 712
Query: 795 SSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQQCGLLFTRWHSHVNPLDPSLRNKYANLS 854
+ +EE+++W+ +K LLT SG S L TRWHS +PLDPSLR+K+AN
Sbjct: 721 ----AMGIEEDEDWYGFIKTLLTASGFSGSDS----LMTRWHSLESPLDPSLRDKFAN-- 712
Query: 855 SKEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHDSNFAEDTSLTL 914
KE + K+R+ RS+RKLVFDCVNA + + TS TK +
Sbjct: 781 -KELI---KRRKQRSNRKLVFDCVNAIITETTSTLAHTGLTK--------------GFNM 712
Query: 915 LDCVMVKLKDWVCGEPRCVTGDIGDSNSLVVERVVRKEVGGRNWDEHFKMEMDNLGKEVE 974
L+ V +L++W V EV G+ W ++EM+NLG E+E
Sbjct: 841 LEHVWTELQEW----------------------AVNDEVAGKMWSYGLQVEMNNLGIEIE 712
BLAST of Bhi05G000096 vs. TAIR 10
Match:
AT5G43880.1 (Protein of unknown function (DUF3741))
HSP 1 Score: 402.9 bits (1034), Expect = 7.4e-112
Identity = 353/993 (35.55%), Positives = 509/993 (51.26%), Query Frame = 0
Query: 1 MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHRE-GSTPRNQADMARM 60
MN +RR V + GCL RMVNLFD T + KLLT+ PH + GS NQ D
Sbjct: 1 MNKQRRRNVQAH-----GCLARMVNLFDFGTVGNGKKLLTEKPHFDHGSIKGNQFD---- 60
Query: 61 FNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMS-EMECTQNPPNVVAKLMGL 120
Q ED N NGTP+KML++QEMS EME + N+VAKLMGL
Sbjct: 61 ------QIEDKVD--------VRNGGVNGTPMKMLLEQEMSKEMEVKLSSTNLVAKLMGL 120
Query: 121 ETLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKD 180
++ P +S P+S K ++ E K+
Sbjct: 121 DSFP-----------QTQSAPRS-------------------YSSKPRLKRSLSHGEYKN 180
Query: 181 VYEIWQRSPQANYIREKRPKGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDA 240
VYEIWQ+ E G+E ++ +KM +VR+KF+EAKRL TD++LR SKEFQ+A
Sbjct: 181 VYEIWQKE------GELSSNGVEG--LSKKKMDIVREKFLEAKRLVTDDELRHSKEFQEA 240
Query: 241 LEVLSSNKDLFVKFLQEPNYLFTQHLNELQSI--PPSPETKRITVLRPSKVSRDERFTEF 300
+EVLSSNK+LF++FLQE N F+ HL+ QS P S ++KRIT+L+PSK DE+F
Sbjct: 241 MEVLSSNKELFLEFLQESNNFFSHHLHSFQSTDPPTSEKSKRITILKPSKTVADEKFG-- 300
Query: 301 EKQSYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHD 360
+ + +R + G+ K + EY Q TRIVVLKP+
Sbjct: 301 NEPAIESSRDGSKSGKGLDFFKWPVE----------EEYPTK-QSTRIVVLKPN------ 360
Query: 361 NKPIVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSV 420
G + + + G+E +ESR AR V ++ ++ETL SSV
Sbjct: 361 --------GQVTKASSCPTSPRGFEG---RESRDVARRVKSQIL-------KEETLQSSV 420
Query: 421 FSNGYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSP 480
FSNGY D+SS NDYA D E+MS SRHSW+YIN+Y SP+SSS FSR S SP
Sbjct: 421 FSNGYICDDSSL----NDYA-----DSEIMSPVSRHSWDYINKYDSPFSSSPFSRASGSP 480
Query: 481 E-SSVCREAKKRLSERWSMM-TTHGNYQERRHVRRNSS--TLGEMLALSDAKKSTVTDNV 540
E SSVCREAKKRLSERW++M + N QE + + + S +LG+MLAL D ++ +T+
Sbjct: 481 ESSSVCREAKKRLSERWALMAAANENLQEAKVIEKKGSNISLGDMLALPDLREDLITEEE 540
Query: 541 V----NEHE-PSELDHCFNSD-ENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLET 600
NE E P CF+ + E P L +SKS+ SS G +L++S+ ++
Sbjct: 541 ETSNGNEQEGPKVSASCFDGNFSREEGKLKPPKGLTRSKSLPESSTSLGHKSLDSSN-KS 600
Query: 601 IKTDDPKLLAKSKGVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSA 660
+ P+ L KSK +K S KVS+ FSR+KK SK + ++ P+ L S
Sbjct: 601 KSSRVPEELTKSKSLKWSLKGKVSNFLFSRSKKASKER----SYEESPE-----ILDSRC 660
Query: 661 FIHHSRGLSNAASHSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGEN 720
+ +S S +G GLS+ +P + GN E
Sbjct: 661 NNEYDASVSARIMTSREG--------------------------GLSITKPTIFGNSSEW 720
Query: 721 QEQPSPISVLEPPFFEDDNAHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFWD 780
+++PSPISVLE F E+D SS L + K++L+ KSPPI SI R++ +D
Sbjct: 721 RDEPSPISVLETSFDEEDGIFFN-SSILNRSSSSLEREMKSNLLGKSPPIGSIGRTLSFD 780
Query: 781 GSYSDSSAPCALKSSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQQCGLLFTRWHSHVNP 840
S + A C ++ +EE++ L+ LL+ + L + + L ++WHS +P
Sbjct: 781 DS---TVARCYSSKRSTTSARDEEEDLRLLINTLLSAADLDAISDN---LLSKWHSSESP 825
Query: 841 LDPSLRNKYANLSSKEPMLEAKQRQVRSS-RKLVFDCVNAALIDITSQELDHRQTKISSR 900
LDPSLRN YA+ +Q+++ S+ + LVFD VN L+++T L R + +
Sbjct: 841 LDPSLRNSYAD--------STEQKRLGSNVKNLVFDLVNTLLLELTPSYLGPRSSPMIL- 825
Query: 901 AHDSNFAEDTSLTLLDCVMVKLKDWVCGEPRCVT---GDIGDSNSLVVERVVRKEVGGRN 960
+ L V+ ++++ + G R + GD +SL V +VVR EV
Sbjct: 901 ---------SGKPLGVYVINRMQECLTGNGRVEDRWWDEDGDLSSLAVNKVVRIEVAEIG 825
Query: 961 WDEHFKMEMDNLGKEVERRLLEELLEEAVVELT 976
E ++EMD++G+E+E +LLEEL+EEA+++L+
Sbjct: 961 SQESLRLEMDSMGEELELKLLEELVEEALMDLS 825
BLAST of Bhi05G000096 vs. TAIR 10
Match:
AT3G53540.1 (unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF3741 (InterPro:IPR022212); BEST Arabidopsis thaliana protein match is: Protein of unknown function (DUF3741) (TAIR:AT4G28760.2); Has 1710 Blast hits to 868 proteins in 206 species: Archae - 2; Bacteria - 409; Metazoa - 304; Fungi - 204; Plants - 304; Viruses - 2; Other Eukaryotes - 485 (source: NCBI BLink). )
HSP 1 Score: 149.4 bits (376), Expect = 1.5e-35
Identity = 255/1009 (25.27%), Positives = 411/1009 (40.73%), Query Frame = 0
Query: 24 VNLFDLSTGVSRNKLLTDAPHREGSTPRNQADMARMFNHSTNQTEDNRSRTMPELQRASN 83
+N F LS SR++L + P ++ + + ++ + + + S L
Sbjct: 1 MNRFRLSDLSSRDRLASTLP----TSHQGKKQKSQKLKSPRSSSPEFNSCHCEALSENKQ 60
Query: 84 KRANGTPVKMLIDQEMS-EMECTQNPPNVVAKLMGLETLPHQLPGSSVQRNNVRSYPKSK 143
G P+K L+ QEMS + E + P+++A+LMGL+ LP Q Q KS
Sbjct: 61 DFPTGVPMKSLLAQEMSKQKESKKRSPSIIARLMGLDVLPSQSSSHKQQ--------KSM 120
Query: 144 IENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKDVYEIWQRSPQANYIREKRPKGIES 203
G+ G T L G + + EQK KDV+E+ + A R +G +
Sbjct: 121 ENQQGRSGGGTSYKSL---GKRSK----GEQK-FKDVFEVLD-AKMAESNRNLYHQGRVN 180
Query: 204 EVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDALEVLSSNKDLFVKFLQEPNYLFTQ 263
+ +MA +RQKF+EAKRL+TD+KLR SKEF DALE L SNKDL +KFLQ P+ LFT+
Sbjct: 181 ANLTQAEMAFIRQKFMEAKRLSTDDKLRHSKEFNDALEALDSNKDLLLKFLQHPDSLFTK 240
Query: 264 HLNELQSIPPSPETKRITVLRPSKVSR---DERFTEFEKQSYRQARLPVQR---GQSATL 323
HL++LQS P P+ + L+ R + + ++ R++ R G S
Sbjct: 241 HLHDLQSTPHKPQYSQAPSLKSPNSQRHVDSLKTQKVDRDLLRKSHRSPHRNGGGGSGCP 300
Query: 324 DKSDSKLSPTPAINRTNE---YAVAVQPTRIVVLKPSPGRNHDNKPIVSSPGSLP----- 383
+S ++ + I+ NE +QPT+IVVLKP+ G +SP S
Sbjct: 301 SRSHTRHASYDTIDLPNEELRKRSELQPTKIVVLKPNLGEPRYAARTFASPSSSSDEFRA 360
Query: 384 --RVVQDGSFNEGYEDVDVKESRTFARN------VTQKMCDNLLGHRRDETLLSSVFSNG 443
R+ + + DV+ SR +R+ + + G+ R + +S F G
Sbjct: 361 DRRLPCTTTHGRQKSNEDVRLSRQNSRDCGEMAKIMSRQRKVSCGNGRAMSFETSGF-RG 420
Query: 444 YTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPESSV 503
Y GDESS S +D A E S+L ++S +R ++ N + S S S+ SSV
Sbjct: 421 YAGDESS---SGSDSASE--SELVPVTSGTRTAFNRRNYHRSLPSKST--------TSSV 480
Query: 504 CREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHE--P 563
REAK+RLSERW + TH ++ + R S TL EMLA SD + + N ++ +
Sbjct: 481 SREAKRRLSERWKL--TH-KFEHEIEISR-SGTLAEMLATSDREARPASFNGLSFEDGIS 540
Query: 564 SELDHCFNSDENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPK-LLAK 623
++ E E + S K S ++N E++ TI PK L+ +
Sbjct: 541 KRFENNIQWPELPEPVGISSRDGWKGSCSRSFSKSRTIMNQESAGGYTIVL--PKGLINR 600
Query: 624 SKGVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGLSN- 683
V+ + S S+++ S +S + E T S S F++ + G+ +
Sbjct: 601 DALVQGDSSHHGESFLSSKSRPGSNKSHSSYNSSPEVSI----TPSLSKFVYMNDGIPSK 660
Query: 684 -------AASHSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGEN--- 743
+S S D + +S A+ SV P +S E+
Sbjct: 661 SASPFKARSSFSGDANSDTEDSSASDDIKTAMSSEALDLSTVTSVTDPDISRRTTEDVNH 720
Query: 744 ---------------QEQPSPISVLEPPFFEDDNAHLELSSYLKPRNQEFCMPFKNSLID 803
+QPSP+SVLE F +D ++ E + + M + ++
Sbjct: 721 SSVPDPPQPRESSKEGDQPSPVSVLEASFDDDVSSGSECFESVSADLRGLRMQLQLLKLE 780
Query: 804 KSPPIESIARSIFWDGSYSDSSAPCALKSSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQ 863
+ E + D + + ++ L EE + LL S S
Sbjct: 781 SATYKEG-GMLVSSDEDTDQEESSTITDEAMITKELREEDWKSSYLVDLLANSSFSDSDH 840
Query: 864 QCGLLFTRWHSHVNPLDPSLRNKYANLSSKEPMLEAKQRQVRSSRKLVFDCVNAALIDIT 923
+ T P++PSL E + + R RKL+FD ++ ++ +
Sbjct: 841 NIVMATT-------PVEPSL------FEDLEKKYSSVKTSTRLERKLLFDQISREVLHML 900
Query: 924 SQELDHRQTKISSRAHDSNFAEDTSLTLLDCVMVKLKDWVCGEPRCVTGDIGDSNSLVVE 973
Q D WV C D + +
Sbjct: 901 KQLSDPH------------------------------PWVKSTKVCPKWDANKIQETLRD 920
BLAST of Bhi05G000096 vs. ExPASy TrEMBL
Match:
A0A1S3BGN6 (uncharacterized protein LOC103489819 OS=Cucumis melo OX=3656 GN=LOC103489819 PE=4 SV=1)
HSP 1 Score: 1679.5 bits (4348), Expect = 0.0e+00
Identity = 866/979 (88.46%), Positives = 907/979 (92.65%), Query Frame = 0
Query: 1 MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGST-PRNQADMARM 60
MNGIQRRKV N+EKPFPGCLGRMVNLFDLSTG+SRNKLLTDAPHREG T RNQAD+ARM
Sbjct: 1 MNGIQRRKVGNDEKPFPGCLGRMVNLFDLSTGISRNKLLTDAPHREGPTLSRNQADVARM 60
Query: 61 FNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMSEMECTQNPPNVVAKLMGLE 120
FNHSTNQ+EDN S+T+PELQRASNKRA+GTPVKMLIDQEMSEME T NPPNVVAKLMGLE
Sbjct: 61 FNHSTNQSEDNLSQTVPELQRASNKRASGTPVKMLIDQEMSEMESTHNPPNVVAKLMGLE 120
Query: 121 TLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKDV 180
TLPHQ GSSVQRNNVR+ PKS+IENHG LGC E SD LEEGMK QV+ECSEQKE KDV
Sbjct: 121 TLPHQFSGSSVQRNNVRTCPKSRIENHGVLLGCREHSDFLEEGMKYQVDECSEQKEYKDV 180
Query: 181 YEIWQRSPQANYIREKRPKGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDAL 240
YEIWQRSPQ NYI+EK PKG+ESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQ+AL
Sbjct: 181 YEIWQRSPQTNYIKEKLPKGMESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQEAL 240
Query: 241 EVLSSNKDLFVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFEKQ 300
EVLSSNKDLFVKFLQEPN LFTQHLNE QSIPPSPETKRITVLRPSKVSR+E+FT+ EK+
Sbjct: 241 EVLSSNKDLFVKFLQEPNSLFTQHLNEFQSIPPSPETKRITVLRPSKVSRNEKFTDLEKK 300
Query: 301 SYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHDNKP 360
+YRQ+RLP QRGQSATLDKSDS+LSPTPA NRTNEYAV VQPTRIVVLKPSPGRN DNKP
Sbjct: 301 TYRQSRLPAQRGQSATLDKSDSRLSPTPATNRTNEYAVGVQPTRIVVLKPSPGRNLDNKP 360
Query: 361 IVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSVFSN 420
I SSPG PRVVQDGSFNEG+ED DVKESR FARN+TQKMCDNLLGHRRDETL+SSVFSN
Sbjct: 361 IASSPGPFPRVVQDGSFNEGFEDDDVKESRKFARNITQKMCDNLLGHRRDETLISSVFSN 420
Query: 421 GYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPESS 480
GYTGDESSFEKSENDYAVENLSDLEV+SSSSRHSWEY+NRYSSPYSSSSFSRISCSPESS
Sbjct: 421 GYTGDESSFEKSENDYAVENLSDLEVISSSSRHSWEYVNRYSSPYSSSSFSRISCSPESS 480
Query: 481 VCREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHEPS 540
VCREAKKRLSERW+MMTTHGNYQERR VRRNSSTLGEMLALSDAKKSTVTDN VNEHE S
Sbjct: 481 VCREAKKRLSERWAMMTTHGNYQERRQVRRNSSTLGEMLALSDAKKSTVTDNEVNEHEQS 540
Query: 541 ELDHCFNSDENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKLLAKSK 600
+LD C NSDENIECLDDSPTTL SKSV GSSALFGVLNLEASDL+ +KTDDPK L K K
Sbjct: 541 DLDPCLNSDENIECLDDSPTTLKMSKSVSGSSALFGVLNLEASDLDIVKTDDPKWLGKPK 600
Query: 601 GVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGLSNAAS 660
GVKSSFNEKVSSLFFSRNKKT K KYSGSQTKDEPQSCSA TLSSSAFIHHSRGLSNAA
Sbjct: 601 GVKSSFNEKVSSLFFSRNKKTVKEKYSGSQTKDEPQSCSAETLSSSAFIHHSRGLSNAAF 660
Query: 661 HSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGENQEQPSPISVLEPP 720
HSNDGEGCSSGTSFLHLTNVV RGGAVHHE GLSVKRPFV+GNVGENQEQPSPISVLEPP
Sbjct: 661 HSNDGEGCSSGTSFLHLTNVVGRGGAVHHEAGLSVKRPFVAGNVGENQEQPSPISVLEPP 720
Query: 721 FFEDDNAHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFWDGSYSDSSAPCALK 780
F EDDN HLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIF DGSYS SSAPCALK
Sbjct: 721 FSEDDNTHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFRDGSYSGSSAPCALK 780
Query: 781 SSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQQCGLLFTRWHSHVNPLDPSLRNKYANLS 840
S PVSTCL+EEQNWHCLV+ALLTMSGLS+E QQC LLFT+WHS NPLDPSLRNKYANLS
Sbjct: 781 SPPVSTCLKEEQNWHCLVQALLTMSGLSNEIQQCSLLFTKWHSLANPLDPSLRNKYANLS 840
Query: 841 SKEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHDSNFAEDTSLTL 900
SKEPMLEA++RQ+RSSRKLVFDCVNAALI+ITSQELDHRQTKI A DTSLTL
Sbjct: 841 SKEPMLEAERRQLRSSRKLVFDCVNAALINITSQELDHRQTKI--------LAHDTSLTL 900
Query: 901 LDCVMVKLKDWVCGEPRCVTGDIGDSNSLVVERVVRKEVGGRNWDEHFKMEMDNLGKEVE 960
LD VMVKLKDW+CGE RC+TGDIGDSNSLVVERVVRKEVGG+NWDEH MEMDNLGKEVE
Sbjct: 901 LDYVMVKLKDWICGESRCLTGDIGDSNSLVVERVVRKEVGGKNWDEHLLMEMDNLGKEVE 960
Query: 961 RRLLEELLEEAVVELTGKV 979
RRLLEELLEEAVVELTGKV
Sbjct: 961 RRLLEELLEEAVVELTGKV 971
BLAST of Bhi05G000096 vs. ExPASy TrEMBL
Match:
A0A5D3BDV5 (DUF3741 domain-containing protein/DUF4378 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold487G00600 PE=4 SV=1)
HSP 1 Score: 1679.5 bits (4348), Expect = 0.0e+00
Identity = 866/979 (88.46%), Positives = 907/979 (92.65%), Query Frame = 0
Query: 1 MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGST-PRNQADMARM 60
MNGIQRRKV N+EKPFPGCLGRMVNLFDLSTG+SRNKLLTDAPHREG T RNQAD+ARM
Sbjct: 1 MNGIQRRKVGNDEKPFPGCLGRMVNLFDLSTGISRNKLLTDAPHREGPTLSRNQADVARM 60
Query: 61 FNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMSEMECTQNPPNVVAKLMGLE 120
FNHSTNQ+EDN S+T+PELQRASNKRA+GTPVKMLIDQEMSEME T NPPNVVAKLMGLE
Sbjct: 61 FNHSTNQSEDNLSQTVPELQRASNKRASGTPVKMLIDQEMSEMESTHNPPNVVAKLMGLE 120
Query: 121 TLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKDV 180
TLPHQ GSSVQRNNVR+ PKS+IENHG LGC E SD LEEGMK QV+ECSEQKE KDV
Sbjct: 121 TLPHQFSGSSVQRNNVRTCPKSRIENHGVLLGCREHSDFLEEGMKYQVDECSEQKEYKDV 180
Query: 181 YEIWQRSPQANYIREKRPKGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDAL 240
YEIWQRSPQ NYI+EK PKG+ESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQ+AL
Sbjct: 181 YEIWQRSPQTNYIKEKLPKGMESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQEAL 240
Query: 241 EVLSSNKDLFVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFEKQ 300
EVLSSNKDLFVKFLQEPN LFTQHLNE QSIPPSPETKRITVLRPSKVSR+E+FT+ EK+
Sbjct: 241 EVLSSNKDLFVKFLQEPNSLFTQHLNEFQSIPPSPETKRITVLRPSKVSRNEKFTDLEKK 300
Query: 301 SYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHDNKP 360
+YRQ+RLP QRGQSATLDKSDS+LSPTPA NRTNEYAV VQPTRIVVLKPSPGRN DNKP
Sbjct: 301 TYRQSRLPAQRGQSATLDKSDSRLSPTPATNRTNEYAVGVQPTRIVVLKPSPGRNLDNKP 360
Query: 361 IVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSVFSN 420
I SSPG PRVVQDGSFNEG+ED DVKESR FARN+TQKMCDNLLGHRRDETL+SSVFSN
Sbjct: 361 IASSPGPFPRVVQDGSFNEGFEDDDVKESRKFARNITQKMCDNLLGHRRDETLISSVFSN 420
Query: 421 GYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPESS 480
GYTGDESSFEKSENDYAVENLSDLEV+SSSSRHSWEY+NRYSSPYSSSSFSRISCSPESS
Sbjct: 421 GYTGDESSFEKSENDYAVENLSDLEVISSSSRHSWEYVNRYSSPYSSSSFSRISCSPESS 480
Query: 481 VCREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHEPS 540
VCREAKKRLSERW+MMTTHGNYQERR VRRNSSTLGEMLALSDAKKSTVTDN VNEHE S
Sbjct: 481 VCREAKKRLSERWAMMTTHGNYQERRQVRRNSSTLGEMLALSDAKKSTVTDNEVNEHEQS 540
Query: 541 ELDHCFNSDENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKLLAKSK 600
+LD C NSDENIECLDDSPTTL SKSV GSSALFGVLNLEASDL+ +KTDDPK L K K
Sbjct: 541 DLDPCLNSDENIECLDDSPTTLKMSKSVSGSSALFGVLNLEASDLDIVKTDDPKWLGKPK 600
Query: 601 GVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGLSNAAS 660
GVKSSFNEKVSSLFFSRNKKT K KYSGSQTKDEPQSCSA TLSSSAFIHHSRGLSNAA
Sbjct: 601 GVKSSFNEKVSSLFFSRNKKTVKEKYSGSQTKDEPQSCSAETLSSSAFIHHSRGLSNAAF 660
Query: 661 HSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGENQEQPSPISVLEPP 720
HSNDGEGCSSGTSFLHLTNVV RGGAVHHE GLSVKRPFV+GNVGENQEQPSPISVLEPP
Sbjct: 661 HSNDGEGCSSGTSFLHLTNVVGRGGAVHHEAGLSVKRPFVAGNVGENQEQPSPISVLEPP 720
Query: 721 FFEDDNAHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFWDGSYSDSSAPCALK 780
F EDDN HLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIF DGSYS SSAPCALK
Sbjct: 721 FSEDDNTHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFRDGSYSGSSAPCALK 780
Query: 781 SSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQQCGLLFTRWHSHVNPLDPSLRNKYANLS 840
S PVSTCL+EEQNWHCLV+ALLTMSGLS+E QQC LLFT+WHS NPLDPSLRNKYANLS
Sbjct: 781 SPPVSTCLKEEQNWHCLVQALLTMSGLSNEIQQCSLLFTKWHSLANPLDPSLRNKYANLS 840
Query: 841 SKEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHDSNFAEDTSLTL 900
SKEPMLEA++RQ+RSSRKLVFDCVNAALI+ITSQELDHRQTKI A DTSLTL
Sbjct: 841 SKEPMLEAERRQLRSSRKLVFDCVNAALINITSQELDHRQTKI--------LAHDTSLTL 900
Query: 901 LDCVMVKLKDWVCGEPRCVTGDIGDSNSLVVERVVRKEVGGRNWDEHFKMEMDNLGKEVE 960
LD VMVKLKDW+CGE RC+TGDIGDSNSLVVERVVRKEVGG+NWDEH MEMDNLGKEVE
Sbjct: 901 LDYVMVKLKDWICGESRCLTGDIGDSNSLVVERVVRKEVGGKNWDEHLLMEMDNLGKEVE 960
Query: 961 RRLLEELLEEAVVELTGKV 979
RRLLEELLEEAVVELTGKV
Sbjct: 961 RRLLEELLEEAVVELTGKV 971
BLAST of Bhi05G000096 vs. ExPASy TrEMBL
Match:
A0A0A0LBJ2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G622430 PE=4 SV=1)
HSP 1 Score: 1639.8 bits (4245), Expect = 0.0e+00
Identity = 849/980 (86.63%), Positives = 897/980 (91.53%), Query Frame = 0
Query: 1 MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGS-TPRNQADMARM 60
MNGIQRRKV NNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREG RNQAD+ARM
Sbjct: 1 MNGIQRRKVGNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGPILSRNQADVARM 60
Query: 61 FNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMSEMECTQNPPNVVAKLMGLE 120
FNHS NQ+EDN S+T+PEL+RASNKRA+GTPVKMLIDQEMSEME TQ+PPNVVAKLMGLE
Sbjct: 61 FNHSINQSEDNLSQTVPELRRASNKRASGTPVKMLIDQEMSEMESTQSPPNVVAKLMGLE 120
Query: 121 TLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKDV 180
TLPHQ GSSVQRNNVR+ PKS+I+NH SD LEEGMK QV+ECSEQKE KDV
Sbjct: 121 TLPHQFSGSSVQRNNVRTCPKSRIQNH---------SDFLEEGMKYQVDECSEQKEYKDV 180
Query: 181 YEIWQRSPQANYIREKRPKGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDAL 240
YEIWQRSPQ NYI+EK PKG+ESEVVNDRKMALVRQKFVEAKRLA DEK+RQSKEFQ+AL
Sbjct: 181 YEIWQRSPQTNYIKEKLPKGMESEVVNDRKMALVRQKFVEAKRLAPDEKMRQSKEFQEAL 240
Query: 241 EVLSSNKDLFVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFEKQ 300
EVLSSNKDL VKFLQEPN LFTQHLNE QSIPPSPETKRITVLRPSKVSRDERFTE EK+
Sbjct: 241 EVLSSNKDLLVKFLQEPNSLFTQHLNEFQSIPPSPETKRITVLRPSKVSRDERFTELEKK 300
Query: 301 SYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHDNKP 360
+YRQ+RLP QRGQSA+LD+SDS+LSPTPA NRTNEYAV VQPTRIVVLKPSPGRN DNKP
Sbjct: 301 NYRQSRLPAQRGQSASLDRSDSRLSPTPATNRTNEYAVGVQPTRIVVLKPSPGRNLDNKP 360
Query: 361 IVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSVFSN 420
I SSP LPR VQDGSFN G+ED DVK+SR FARN+TQKMCDNLLGHRRDETL+SSVFSN
Sbjct: 361 IASSPSPLPRAVQDGSFNGGFEDDDVKKSRKFARNITQKMCDNLLGHRRDETLISSVFSN 420
Query: 421 GYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPESS 480
GYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEY+NRYSSPYSSSSFSRISCSPESS
Sbjct: 421 GYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYVNRYSSPYSSSSFSRISCSPESS 480
Query: 481 VCREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHEPS 540
VC+EAKKRLSERW+MMTTHGNYQERR+VRRNSSTLGEMLALSDAKKSTVTDN VNEHE S
Sbjct: 481 VCKEAKKRLSERWAMMTTHGNYQERRYVRRNSSTLGEMLALSDAKKSTVTDNEVNEHEQS 540
Query: 541 ELDHCFNSDENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKLLAKSK 600
+LD CFN DENIECLDDSPTT SKSV GSSALFGVLNLEASDL+ +K +D KLL K K
Sbjct: 541 DLDPCFNRDENIECLDDSPTTFEMSKSVSGSSALFGVLNLEASDLDIVKIEDSKLLGKPK 600
Query: 601 GVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGLSNAAS 660
GVKSSFNEKVSSLFFSRNKKT K KYSGSQTKDEPQSCSA TLSSSAFIHHSRG SNAAS
Sbjct: 601 GVKSSFNEKVSSLFFSRNKKTIKEKYSGSQTKDEPQSCSAETLSSSAFIHHSRGFSNAAS 660
Query: 661 HSNDGEGCSSGTSFLHLTNVVARGGAV-HHEVGLSVKRPFVSGNVGENQEQPSPISVLEP 720
HSNDGEGCSSGTSFLHLTNV RGGAV HHE GLSVKRPFV+GNVGENQEQPSPISVLEP
Sbjct: 661 HSNDGEGCSSGTSFLHLTNVAGRGGAVLHHEAGLSVKRPFVAGNVGENQEQPSPISVLEP 720
Query: 721 PFFEDDNAHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFWDGSYSDSSAPCAL 780
PFFEDDN HLELSSYLKPRNQEFCMP+KNSLIDKSPPIESIARSIFWDGSYSDSSAPCAL
Sbjct: 721 PFFEDDNTHLELSSYLKPRNQEFCMPYKNSLIDKSPPIESIARSIFWDGSYSDSSAPCAL 780
Query: 781 KSSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQQCGLLFTRWHSHVNPLDPSLRNKYANL 840
KS+PVSTCLEEEQNWH LV+ALLTMSGLS+E QQC LLF +WHS NPLD SLRNKYANL
Sbjct: 781 KSAPVSTCLEEEQNWHSLVQALLTMSGLSNEVQQCSLLFAKWHSLANPLDLSLRNKYANL 840
Query: 841 SSKEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHDSNFAEDTSLT 900
+SKEPMLEA++RQ+RSSRKLVFDCVNAALIDITSQELDHR+T+I A+DTSLT
Sbjct: 841 NSKEPMLEAERRQLRSSRKLVFDCVNAALIDITSQELDHRRTEI--------LAQDTSLT 900
Query: 901 LLDCVMVKLKDWVCGEPRCVTGDIGDSNSLVVERVVRKEVGGRNWDEHFKMEMDNLGKEV 960
LLDCVMVK+KDWVC E RCVTGDIGD NSLVVERVVRKEVGGRNWDEH +MEMDNLGKEV
Sbjct: 901 LLDCVMVKVKDWVCVESRCVTGDIGDINSLVVERVVRKEVGGRNWDEHLRMEMDNLGKEV 960
Query: 961 ERRLLEELLEEAVVELTGKV 979
ERRLLEELLEEAVVELTGKV
Sbjct: 961 ERRLLEELLEEAVVELTGKV 963
BLAST of Bhi05G000096 vs. ExPASy TrEMBL
Match:
A0A6J1HG99 (uncharacterized protein LOC111463809 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463809 PE=4 SV=1)
HSP 1 Score: 1577.0 bits (4082), Expect = 0.0e+00
Identity = 826/982 (84.11%), Positives = 882/982 (89.82%), Query Frame = 0
Query: 1 MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGST-PRNQADMARM 60
MNGIQR+KV +NEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGST PRNQAD+ARM
Sbjct: 1 MNGIQRKKVGSNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGSTLPRNQADVARM 60
Query: 61 FNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMSEMECTQNPPNVVAKLMGLE 120
FNHSTNQTEDN T+PE QRAS KRANGTPVKMLIDQ+MSEMECT+NPPNVVAKLMGLE
Sbjct: 61 FNHSTNQTEDN--LTVPEFQRASKKRANGTPVKMLIDQDMSEMECTKNPPNVVAKLMGLE 120
Query: 121 TLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKDV 180
TLPH+LPGSSVQRNNV SYPK + HG P+ C E+SD LEEGMKCQVNECSEQKE KDV
Sbjct: 121 TLPHKLPGSSVQRNNVLSYPKGRSAKHGMPIECRERSDFLEEGMKCQVNECSEQKEYKDV 180
Query: 181 YEIWQRSPQANYIREKRPKG-IESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDA 240
YEIWQRSPQ N IREK PK +ESE+++DRKMALVRQKFVEAK LATDEKLRQSKEFQDA
Sbjct: 181 YEIWQRSPQTNDIREKLPKKVVESEILDDRKMALVRQKFVEAKCLATDEKLRQSKEFQDA 240
Query: 241 LEVLSSNKDLFVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFEK 300
+E+LSSNKDL VKFLQEPN LFTQHLNEL SIPPSPETKRITVLRPSKVSRDERFTEFEK
Sbjct: 241 VEILSSNKDLLVKFLQEPNSLFTQHLNELPSIPPSPETKRITVLRPSKVSRDERFTEFEK 300
Query: 301 QSYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHDNK 360
+ RQ+RLPVQRGQSA LDKSDS+LSPTP INRTNEYAVAVQPTRIVVLKPSPGRNHDNK
Sbjct: 301 KGCRQSRLPVQRGQSAILDKSDSRLSPTPGINRTNEYAVAVQPTRIVVLKPSPGRNHDNK 360
Query: 361 PIVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSVFS 420
PIVSSPGSLP F+EG+ED DVKESR FARN+T+KMCDNLLG RRDETLLSSVFS
Sbjct: 361 PIVSSPGSLP-------FDEGFEDDDVKESRKFARNITEKMCDNLLGRRRDETLLSSVFS 420
Query: 421 NGYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRY-SSPYSSSSFSRISCSPE 480
NGYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEY+NRY SSPYSSSSFSR+SCS E
Sbjct: 421 NGYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYVNRYSSSPYSSSSFSRMSCSLE 480
Query: 481 SSVCREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHE 540
SSVCREAKKRLSERW+MMT+HGNYQERR VRRNSSTLGEMLALSDAKKSTVTDN NEHE
Sbjct: 481 SSVCREAKKRLSERWAMMTSHGNYQERRRVRRNSSTLGEMLALSDAKKSTVTDNEANEHE 540
Query: 541 -PSELDHCFNSDENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKLLA 600
SEL+ CFNSDENIECLDDSPT L +SKSV GSS LFG+LNLEASDLETIKTDD K+LA
Sbjct: 541 TTSELEPCFNSDENIECLDDSPTMLARSKSVPGSSPLFGMLNLEASDLETIKTDDSKMLA 600
Query: 601 KSKGVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGLSN 660
K KGVKSS NE+VSS FF+RNKKT+ K SG Q KDEP+S SA TL S AF+HHSRG SN
Sbjct: 601 KQKGVKSSLNEEVSSSFFTRNKKTNGEKCSGYQPKDEPKSWSAETLPSLAFVHHSRGFSN 660
Query: 661 AASHSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGENQEQPSPISVL 720
AASHSNDGEGCSS TSFLHLTNVVARG VHHE GLSVKRPF++GNVGENQEQPSPISVL
Sbjct: 661 AASHSNDGEGCSSSTSFLHLTNVVARGAEVHHEEGLSVKRPFMTGNVGENQEQPSPISVL 720
Query: 721 EPPFFEDDNAHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFWDGSYSDSSAPC 780
E PFFEDDN HLE SSYLKPRNQEF MPFKN+LIDKSPPIESIARS++W G
Sbjct: 721 ETPFFEDDNTHLEFSSYLKPRNQEFYMPFKNNLIDKSPPIESIARSVYWVG--------- 780
Query: 781 ALKSSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQQCGLLFTRWHSHVNPLDPSLRNKYA 840
SSPVST LEEEQNWHCLV+ALLT+SGLS+E QQCGLLFTRWHS VNPLDPSLR+KYA
Sbjct: 781 ---SSPVSTFLEEEQNWHCLVEALLTLSGLSNEVQQCGLLFTRWHSLVNPLDPSLRDKYA 840
Query: 841 NLSSKEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHDSNFAEDTS 900
NLSS+E MLEAK+RQ+RSSRKLVFDCVNAAL+DIT +ELDHR+ K+SSRAHDS+FAE TS
Sbjct: 841 NLSSQELMLEAKRRQLRSSRKLVFDCVNAALMDITCEELDHRRAKLSSRAHDSSFAEGTS 900
Query: 901 LTLLDCVMVKLKDWVCGEPRCVTGDIGDSNSLVVERVVRKEVGGRNWDEHFKMEMDNLGK 960
LTLLDCVMVKLKDWVCGE RCVTGDIGD + LVVER VRKEVGGR+WDE +MEMDNLGK
Sbjct: 901 LTLLDCVMVKLKDWVCGEFRCVTGDIGDCDGLVVERAVRKEVGGRHWDEQLRMEMDNLGK 960
Query: 961 EVERRLLEELLEEAVVELTGKV 979
EVERRLLEELLEEAVVELTGKV
Sbjct: 961 EVERRLLEELLEEAVVELTGKV 961
BLAST of Bhi05G000096 vs. ExPASy TrEMBL
Match:
A0A6J1HQ44 (uncharacterized protein LOC111466798 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466798 PE=4 SV=1)
HSP 1 Score: 1574.3 bits (4075), Expect = 0.0e+00
Identity = 826/979 (84.37%), Positives = 884/979 (90.30%), Query Frame = 0
Query: 1 MNGIQRRKVSNNEKPFPGCLGRMVNLFDLSTGVSRNKLLTDAPHREGST-PRNQADMARM 60
MNGIQR+KV +NEKPFPGCLGRMVNLFDLST VSRNKLLTDAPHREGST PRNQAD+ARM
Sbjct: 1 MNGIQRKKVGSNEKPFPGCLGRMVNLFDLSTVVSRNKLLTDAPHREGSTLPRNQADVARM 60
Query: 61 FNHSTNQTEDNRSRTMPELQRASNKRANGTPVKMLIDQEMSEMECTQNPPNVVAKLMGLE 120
FNHSTNQTEDN T+PE QRAS KRANGTPVKMLIDQ+MSE ECT+NPPNVVAKLMGLE
Sbjct: 61 FNHSTNQTEDN--LTVPEFQRASKKRANGTPVKMLIDQDMSE-ECTKNPPNVVAKLMGLE 120
Query: 121 TLPHQLPGSSVQRNNVRSYPKSKIENHGKPLGCTEQSDLLEEGMKCQVNECSEQKECKDV 180
TLP QLP S +QRNNV SYPK +I HG P+ C E SDLLEEGMKCQVNECSEQKE KDV
Sbjct: 121 TLPRQLPCSPIQRNNVISYPKGRIAKHGMPIECRELSDLLEEGMKCQVNECSEQKEYKDV 180
Query: 181 YEIWQRSPQANYIREKRP-KGIESEVVNDRKMALVRQKFVEAKRLATDEKLRQSKEFQDA 240
YEIWQRSPQ N IREK+P KGIESE+++DRKMALVRQKFVEAK LATDEKLRQSKEFQDA
Sbjct: 181 YEIWQRSPQTNDIREKQPKKGIESEILDDRKMALVRQKFVEAKCLATDEKLRQSKEFQDA 240
Query: 241 LEVLSSNKDLFVKFLQEPNYLFTQHLNELQSIPPSPETKRITVLRPSKVSRDERFTEFEK 300
+E+LSS+KDL VKFLQEPN LFTQHLNEL SIPPSPETKRITVLRPSKVSRDERFTEFEK
Sbjct: 241 VEILSSSKDLLVKFLQEPNSLFTQHLNELPSIPPSPETKRITVLRPSKVSRDERFTEFEK 300
Query: 301 QSYRQARLPVQRGQSATLDKSDSKLSPTPAINRTNEYAVAVQPTRIVVLKPSPGRNHDNK 360
+ RQ+RLP QRGQSA LDKSDS+LSPTP INRTNEYAVAVQPTRIVVLKPSPGRNHDNK
Sbjct: 301 KGCRQSRLPAQRGQSAILDKSDSRLSPTPGINRTNEYAVAVQPTRIVVLKPSPGRNHDNK 360
Query: 361 PIVSSPGSLPRVVQDGSFNEGYEDVDVKESRTFARNVTQKMCDNLLGHRRDETLLSSVFS 420
PIVSSPGSLP F+EG+ED DVKESR FARN+TQKMCDNLLG RRDETLLSSVFS
Sbjct: 361 PIVSSPGSLP-------FDEGFEDDDVKESRKFARNITQKMCDNLLGRRRDETLLSSVFS 420
Query: 421 NGYTGDESSFEKSENDYAVENLSDLEVMSSSSRHSWEYINRYSSPYSSSSFSRISCSPES 480
NGYTGDESSFE SENDYAVENLSDLEVMSSSSRHSWEY+NRYSSPYSSSSFSR+SCSPES
Sbjct: 421 NGYTGDESSFEISENDYAVENLSDLEVMSSSSRHSWEYVNRYSSPYSSSSFSRMSCSPES 480
Query: 481 SVCREAKKRLSERWSMMTTHGNYQERRHVRRNSSTLGEMLALSDAKKSTVTDNVVNEHE- 540
SVCREAKKRLSERW+MMT+HGNYQERR VRRNSSTLGEMLALSDAKK +VTDN NEHE
Sbjct: 481 SVCREAKKRLSERWAMMTSHGNYQERRRVRRNSSTLGEMLALSDAKKLSVTDNEANEHET 540
Query: 541 PSELDHCFNSDENIECLDDSPTTLMKSKSVLGSSALFGVLNLEASDLETIKTDDPKLLAK 600
SEL+ CFNSDENI+CLDDSPT L +SKSV G+S LFG+LNLEASDLETIKTDD K LAK
Sbjct: 541 TSELEPCFNSDENIDCLDDSPTMLARSKSVPGASPLFGMLNLEASDLETIKTDDSKSLAK 600
Query: 601 SKGVKSSFNEKVSSLFFSRNKKTSKGKYSGSQTKDEPQSCSAGTLSSSAFIHHSRGLSNA 660
KGVKSS NE+VSS FF+RNKKT+K K SG Q KDEP+SCSA TLSS AF+HHSRGLSNA
Sbjct: 601 QKGVKSSPNEEVSSSFFTRNKKTNKEKCSGYQPKDEPKSCSAETLSSLAFVHHSRGLSNA 660
Query: 661 ASHSNDGEGCSSGTSFLHLTNVVARGGAVHHEVGLSVKRPFVSGNVGENQEQPSPISVLE 720
ASHSNDGEGCSS TSFLHLTNVVARGG VH E GLSVKRPF++GNVGENQEQPSPISVLE
Sbjct: 661 ASHSNDGEGCSSSTSFLHLTNVVARGGEVHREGGLSVKRPFMTGNVGENQEQPSPISVLE 720
Query: 721 PPFFEDDNAHLELSSYLKPRNQEFCMPFKNSLIDKSPPIESIARSIFWDGSYSDSSAPCA 780
PFFEDDN HLE S YLKP NQEFCMPFKN+LI+KSPPIESIARS++WDGS SDSSA A
Sbjct: 721 TPFFEDDNTHLEFSRYLKPSNQEFCMPFKNNLINKSPPIESIARSVYWDGSSSDSSAR-A 780
Query: 781 LKSSPVSTCLEEEQNWHCLVKALLTMSGLSSEAQQCGLLFTRWHSHVNPLDPSLRNKYAN 840
LKSSPVST LEEEQNWHC V+ALLT+SGLSSE QQCGLLFTRWHS VNPLDPSLR+KYAN
Sbjct: 781 LKSSPVSTFLEEEQNWHCHVEALLTLSGLSSEVQQCGLLFTRWHSLVNPLDPSLRDKYAN 840
Query: 841 LSSKEPMLEAKQRQVRSSRKLVFDCVNAALIDITSQELDHRQTKISSRAHDSNFAEDTSL 900
LSS+E MLEAK+RQ+RSSRKLVFDCVNA L+DIT +EL+H + K+SSRAHDS+FAE TSL
Sbjct: 841 LSSQELMLEAKRRQLRSSRKLVFDCVNAVLMDITCEELNHWRAKLSSRAHDSSFAEGTSL 900
Query: 901 TLLDCVMVKLKDWVCGEPRCVTGDIGDSNSLVVERVVRKEVGGRNWDEHFKMEMDNLGKE 960
TLLDCVMVKLKDWVCGE RCVTGDIGD + LVVER VRKEVGG +WDE +MEMDNLGKE
Sbjct: 901 TLLDCVMVKLKDWVCGEFRCVTGDIGDCDGLVVERAVRKEVGGGHWDEQLRMEMDNLGKE 960
Query: 961 VERRLLEELLEEAVVELTG 977
VERRLLEELLEEAVVELTG
Sbjct: 961 VERRLLEELLEEAVVELTG 968
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
AT4G28760.1 | 1.9e-168 | 42.41 | Protein of unknown function (DUF3741)
AT4G28760.2 | 1.9e-168 | 42.41 | Protein of unknown function (DUF3741)
AT2G20240.1 | 6.7e-113 | 36.50 | Protein of unknown function (DUF3741)
AT5G43880.1 | 7.4e-112 | 35.55 | Protein of unknown function (DUF3741)
AT3G53540.1 | 1.5e-35 | 25.27 | unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 24 plant structures;...
Match Name | E-value | Identity (%) | Description
A0A1S3BGN6 | 0.0e+00 | 88.46 | uncharacterized protein LOC103489819 OS=Cucumis melo OX=3656 GN=LOC103489819 PE=...
A0A5D3BDV5 | 0.0e+00 | 88.46 | DUF3741 domain-containing protein/DUF4378 domain-containing protein OS=Cucumis m...
A0A0A0LBJ2 | 0.0e+00 | 86.63 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G622430 PE=4 SV=1
A0A6J1HG99 | 0.0e+00 | 84.11 | uncharacterized protein LOC111463809 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HQ44 | 0.0e+00 | 84.37 | uncharacterized protein LOC111466798 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
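The identity and positives percentages on the HSP lines of this report are the match counts divided by the alignment length. As a minimal sketch, the reported values can be reproduced in Python. The function name `percent` and the `hsps` dictionary are hypothetical helpers for illustration, not part of BLAST or of this database; the counts are copied from the HSP lines above.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# (identical residues, positive residues, alignment length) per match,
# copied from the HSP lines in this report.
hsps = {
    "AT3G53540.1": (255, 411, 1009),
    "A0A1S3BGN6": (866, 907, 979),
    "A0A0A0LBJ2": (849, 897, 980),
}

for name, (identical, positive, length) in hsps.items():
    print(name, percent(identical, length), percent(positive, length))
```

The identity values computed this way should agree with the Identity column of the summary tables.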