Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCATGTCCAAGCAATAGCTTCCGCTTCAATGGCAGCCTCTGTGCTTGCCCACCTGGCCAACTTCTCAATCGTACAGAAAACAGCTGTGTTCTCTTCAGCAGCACTTCGGCCATCACTTCAGGCCGGCTCGAGAACTATGCTGTTAGCTTCCCTGAGACCATAGTCTCCTTTGATTCCATCAAGAAGATAACGCAGTCTCAGGCTGTGTTTCTTCAGGCCACTCTCGTCTTGCTGTTTTCTTGGCTCGCCTTCTGCATATTCCTTAGATTCATGAAGCTTGGGGATGGCAGAAATATCTGGTTCAGGATGAGATGGTGGGTTAGCAGACTGGACGTTTGCTTTGCCACAAGACATTGGCTGGTTAGCTCCTACCTCTGCCTCTTAGAATCTCACACTTATTGCTTTAATTATTGGCTTATTGGCGGTTTTGTTTAGACTTCTTTATTTAATCCTTTTCCACTGTGCATTACGCGAGATGTTCCCTTTCATTGACCGTTTCAATTTTTTAGATAAGTGATGAGAAGTTAAAGTTTCAACGATGCAATACTGGTTCATACCTTTATGGTTTGCGGTTTGCAAAGTCTAAACTGTACGTTTCTTGGGTTTACTGACATTTATGTAAATTATTAAATTCGATGAAGTTAGCTGTTGATCTGGTTTCGGGACTTGTTATCGCCCAACAAGAGCTAATTTTCAGGATAGGAAATTCAAAAGTTCACTGGATAAAGCTTCAGTAATCTATATTTGAACAGCTGAGCTAAAAAGAGACGAAAATTGAACACCATGGATTTGGATGATCAAGACCTGTATTCATGATAGAAGCTAAAACTGTAGAATATTTTGTTGGTAGGGTATCTAATACCTGTGTTGTTCTTGTTTTTTTTTTTTTTTCTCTTCTTCTATAAAAACATTTTTCATTTAAGGTTATTTAAAGTACAAAAAGGGAGAAGGAGCTCCTCAAAAACATAGGGAGTTCACGAAAAAAGCAATTCAATTGGTATGAATAAAAAGGGGGAAATTAGAAAGACCTATTGGATAATACCAGTTTGAAGCCACAAAAATAGTTAGATCTAGAAAATCATCAGAAATCTTTAGAGGTATCTCTGGAAAGCCTCTTATTTACTTTTTCCAGAATGGCTATGATAGCATTTTTTTCCAATAAGATTTTGGTTTCTTATTTGAACTGAGAGCACGAAAGATTAGAGTAAACTTTTAAGTGTTTGGGGGCATTGAAGTTGAATCTGTTGAACAAATTATTTTACAGCCTTGTAGATAACAAACAATTTAGGAAAAGATGGTTCTGCTTTTCCACATGATTTCTGCACGAAATGTACCAACCTGGAGAAAGCTGGGAACTTAACAACTTTCTCTGTACCTTAACATTTGTGTTAAATTTCCTTAATTCAAAGGCCATGTTTTAAAAAGCTTTTGAGGCTTACGCCTTGAGGCTCGCCTCGAGGCAAGGCTCTAACTTTTAGCCTCGAGACTTACGCCTTCTTTGAACCATAAGAGGCTTACGCCTCTCATAGAAGAGGCTTACGCCTCTTTGAAGACACACTGAGGCTTAAAGTCTTTACGCCTCGTGTAATGATTAGAATAATTTTAGTAATTTGCCCAATTCTTACAAAATATAAAGGTTTCTTTGCATATGGAATGCACTATAGGTGTGGCGATGATTCAATCTATCGCTACTTTTACTAATTGGACTTTAAAAACATTGGAGACAAATTTTATAGAATAAGTCAATCATATTAGTGATAATAGTACTAGAGATAATAGTAGCGATATTAGTAGTGATATTGAGGATCTTCTTTTAGGATTTTTATTTAAGTGGTAACTAACAGGTGATTTTTTTATACTTTTATATAACTATTATGTTTAGCAATACTTGATCATACAAATTTTAGATTTGTATAATTTTTTAATTAAAATTGAGGCTTACGCCTCGCCTCTTCAAGAGGAAAAGCCTCGAGGCAGCCTTTAGCCTTTTAAAACATTGTTCAAAGGTTCTAAAGAAAGTCTTTCACATTCTTTGGAAATTCTCTTTCAAATTCTCAACTTAAAGTTGTAGGCAGGCTGAAAGAATCTGCAGTAAGGTGTTCAAAAAGGGATTTGACAGAAATTTCTCCTCCTTTTTCTTTTCCACTTCCTTTTTTGTTTCAAGATTTTAAGTCCATCTGTCATTTTGCAGAGTTTAGAAAGGAACGTCAAACCAGCGAACTCTACATCTCTTCTCTTGTTAGATTATTTTTGAGATGAAGATTCCAAACACTCCTCTCCTCATCAGTCATCTCAGGATTTGGAAATTGGGTTGTTCTTCAGCTGGGTAACACTGTGTTATAGAGTTGGGAAAACTTAGGTCAATAGAAGCCTCTAGAGGTTTCCCAAAATCTGATAAGTCTCCCATTACCAGCATTGAAATGTGCAAGCAAGTCATAAGGTCTCGTTTCTACATATGTAAGTCCACAGGTGACAGGCCCCTAGACCAAATATTTTTTTCCTTCTTTTGATAGGAAACGTTAGATCTATATTGAATAAAGGAAAAAGGAACATCCAATAGGCCGGGGGGATGAGAAAGCCCCTGCCTAACAAAAAGCTAGTGAAGGACAGCTTTCCAATCATTTATTGGCATCATAACATTGTTATTACAAAAAAAGGCCTTGTGAGTTAGACACCATGTTGAGGCCATAATCTGTTTATAGCTACAAAAAGAATCAAAAGAACATACTTTGTTGTTAAAAATCCAACCAAATTTTGATTTACAAATTGACCTGCTTTTCCTTTTACTTTTATCATTCCATGTGAAGATTTTGAAGAAGTTTTCAAGTCGAGAAAGATGACCTCCTTGACCTTTTACTTTTATCATCCCATGAGAAGATTTTGAATAATTTTTCAATTTCGAGAAGATTTAAAAGCAGCTTTCAATGTTTTTTTCTTTAAAAAAAAAAAAAAGATTTTGAAGTTGAGAAGATCTTGAGGCAGTTTTCAAAGTTTCCTACTTTATAAGGCATTTTTAAGAGGTAGGTAGGTGTGTTGGAAAGAACATGTAGTATCAAAGTAAGTTTACCAAATTAGGATTCTTCTTCTTCTTCTTCTTTTATCTTCTCTCTCTCTCTCTCTTTTTAATTTTAAAATAAAATTCGATTCTAAAACTTTTTTTCAAAGAAGGATATTCTTCCCCTTGGTGAGCTTCTAATGAACTTTCTCCATTATTGGGAAATTTTACTGAGTTTGGATTTTCCTCTAAAGGCATACCAAATAGGACATAGATTATCTACTTCTGCAGAAGTAAGGCAAATTGATATCAACTATAATGTATTTAGAGAGATTTATTTTCAGCCCCAACATATTTTTAACAAGTCTCAGGATATCAACCAAAATATCCAGCTTTTGTCCCTTCATAGTCGCTAATAATGAGGGTACCGTCAGTGGATTAGAGGCAGACAATTGACATGGACTTATCTTTACCTATAAACTTATAGTAAAGCCTCGATAATGAGGTCCTCTGTAAATGTTTCGAGAATTTGGATACGCTCATCCACGATTGGAAACAGAATAAAAGGAGACAAAGGTTCTGCCTGTCTTAATTCTACTTTTATTTTTCCATGGGGTCTTTTTCTTCCATTTAGAAAGTTAGAGAAAGTCGTTGTACTGATGCAATTGAAGATCCACTCATCCAAACTTGACCAGAGCACTTTTTAAGCACCATGTCAAGAAAACCCCAATAAACATTAGGTGGAATATTTGTGCTTTTAGTTTCTCTTTTTAACTGGTGATGTAATAGCTGTTCCCAAAGAAATTTTAAAAAATGGAGGTCATTCTGATAGAAAAAATAGAAGTCATTACGGGATTGTGATTATATTTTATAAAGCTTAGCTGGAAAGAACTTGATATATTATGGAACAGAATTAGAGATGCTAATGCCACTAAGGGTTTCCTTGAGCATCGACTCTATTTCTTTTCCCTTTTATTTGTGATATTAACATCTAGATGGAATCTGTTATCTCAGATTTCTCCTTGGATAGTTTGGTATGCATTTTCATGCATACTTTTTATCAGTTTGGGGTGGATTTTGGATCACCTCTAGTCAATGAAAATTATCATTTCTGACCTTCCTCTCTCAGGCTGTTTTTTGTAGGCCCTCTTTTGTACTTTTTGTACCTCTTTTCATTTTCCTCAATGCAAGTGTTATTCAAAAAAGAAAAGTTTTATGAGGAAAAAAAAACCATTTCTGATTCTAGGAAAATTTCAATACATGAATGTTCTTTTTGTTATTCTTCCTTATTAATTAGCTTGCTATCTTATCTGTGTTAAAGGATGAGCGAAAGGTAGTTACAAAACGTAAAACTGAACTTGGTGGAGCATTCTCAATGGGAAGTTGGATACTTTTTATTGGCTTGTTTGCTGCGTAAGTACTAACATTCTTTTTCTCTCTTCTGTCATATACTAAAAACAATTCTCTCACGTTGGTTTCGTATTGACTTGGTCTTTCAGTGTTCTCTCAGGTTGCTTTACCAAATCATATCAAAGAGGAGCATCGAAGTGCATAATGTGAAAGCAGCAAATGGGCCAGACTTAGCTTCCTTTGTGAATGACATGGAATTTAATATAACTGCAGTTTCTACTATGAACTGTGCAAATGTTCGTGGTCTTGATACTGTAGTGTTTGGAAATCCTGGTTTTCTGGAGCAGAAAGTAATGCCTCTGTCAAATTTTGCAAACTACTCCTGTCAAAACAGGAGTGAAGGGCCAACTATTAGTGTTCGGTGTGAAAGATGTCGTTTCATTCAGGACGATATTTATATATCATGGCAGTTTGTTGATCTTCCAAATAGTCCTGCAAGTGCTGTTGGATTTCAGTTTAGCTTCTCTGCAAAGGATCATGCTAAAAAAGATCGAGAAAGTTTTGTTAGTGGAACATTAAAAAATGGAAGCAATTTTGATGATACACCAGTTACATTCAGAGGGAAGAATGCAAATATAGTGCAATTTAATCTATTTCCAAGAATATACCGGAACCAACAGGATTCCAAGCTCATGCAGCCTTTATTTCATGAGTTTCTCCCAGGTTCATCCTTTCAAAAGACTAGTGAGCTCCAATCATCCCTTCAAAATGTTAATGATGGACTACTCAACATCACCTTGTACATCAATCTTCTCTCTTCCTACATTGTTGAGGTAGAGAGTCAAAGTATTATGGGCCCTGGTAAGTCTTTCGCTTCATTGTCATGTACACGCACTCATTTACACACACTTATTTCACACAAACTGAATTGTTTGGATATTGAGCTAAGGATGGATGTTCATTCAGATTTTGTATATACCTCTTATAGGATGCAATCCTAGTCATAGTTATATAAATATCATGCTATTACCCTAATATTCCTTTCGGGATTAGCATATGTCGTTGTCTTGAATGAAACTTCCTCAATAATTACATGAACAGCCTGATTGCCACCTCTTCATTTATTAAAGTTTTCCATTTAGCTCTTAATGTGTTACCAACATTATCCATATTCCTTACTACGCTACCAATAAACTGTCCAGGATCGCTTAATCTCCACCATCATATTAGCCGTACTATTCCACTGATATATTTGGTTATGTATGTCATTGGACACATTATATATCACTCAAATAATGTTGAAAAGTTACTGTATTCTAGTAACTTGGATGCGCAGCCCAGTACACTAACTCTTGACCTAAGTCGTTCTTGGCAGGGATAATTGTTTCTCACCTAACATTTCCTTTGAGGTTATTTTTGTTTTGAATTGTTAACTATCCAAATGAATTTGATGCTTTCAGATCCAGTTACCCTAATATGGTAAATCTTTAGAAAAAAGTAGTAATTGTTAATATTACTGGTTACAGCTAGCTCTCTTTTCCAGCTAAAATCTCTGGCACTTTTTTTATTGTTTAAAGAGTCAGATGACTTGTCAATTTCCTATATTTTCCCCTTATCTCTCATCTCAGTGATCCAGATTAGTCAAACGGTATGATAAGTGGTTATGAAAGTGGTCCTTGAGAAATCTATTTCTTTTTTTTAACAAGATACTAACTTCTCATTGAAGAAATTAAAAGGAACAAAACTCTCAAGAGAGTGAAAGAAAGAAATAACAAAAATTAAAAAGGTATAATAAAAGCTCCCCAATTAGAAAGCTAGGGGTGTAACTTTTAAAAATAGAAGATAGGGAACACCATTGAGAAACTTTAAACTTAGCTAAATTGAAACAAATCAAGCTAGGGGTGTCCTTGAGAAATCCATTTCTTGCTGTTTTACTTCTGTCTTAGGTCACGGGTCTCTACATCTTCATCGAACTCTTGAATAAGATTGTTTTTCCTCCTTTCCCTCTGATTATGAAATTTATTTTATGAATGAATATCTTTAGATTCTCTACTTGATTTATATAAAAGTTAATGGGTAGCTTCTTTGAAGATTGTTTTTGTTATGCATTCAATTGATCTTATTTTGTAACTAATGTTTATAATGGTATTTGGCTAACATTGTGTTGGTCCTACAATTTCTTTGTTATGCAGTTAGCTTTCTTGCAGATCTTGGTGGCCTATATTGCATTAGTGTTGGCATTTTCTTCTACCTTCTGGTGCAGGTATGGCTTTTCCCCAAAAAAAATGATTTTGTTTGATAAGAAACATCAGGATACATTAACAAAAAGGGCCAAAGAAGTGGGGGCCGGGGTAGAGGAATCCCTCGTCCAAAAGAAAACTACACAATGAAAGCTTTCCAATCATTAATAATCATGAGGACGCTATAGTTGCACAAGAATTGCTTGTGGTTTGAGCTATAGTTACAAGAGAAGCTTTCCCAAGAGTTATTACCTTTTGTATTCTCAACTCTTTATGATACAACCATGATGTTCCTCTTCCAGTTTCTCATTTTGGAAGAACTAGCGTACACAGCACTGGTTTATATATATATATTTTAATAAAAATGCAACACTCATTATACGTTGTCATCATGGTGTCAGACATTGTTCCATAAGTGAGCTTGTCATTGATATTGATTTAGTTTGAAATAAAAAGAAAATTATATGAGAAATGACCGATGTTATGGGAGGTGGAGCTTTAGTCCCTTCCAGTTTCTGTGTCTCATTAGGCATATACTATATCACTGGAACCAATTCTCGCTTCTTTCTTTGAGATCGAGTAATTGTTTCACTTCCCTAACTTGGTTTGTTTTGTAGTGCGAGTACAGAATTAAAAGGCTCCGCAACGAAGACAGTGTTTTGCGTAAAATTAGAAATAGAAGAAAAGCACAAGAACATTGGAATAAGGTGTTTCTTCTTCATCTTTTTCTCTCCTCAAGTTATTTGATATACTAGTCAGTGTCTTCCTGTTCTGTTTTACTTTAAGTGAGGGAAATTATTTTAGTACCAAAGGAAAGACTCACCGAGTGCTAGTTTTTTAAGCCCAAAAAAAAGAGAGAGAGAGAGAGAGAAAAAAAAAGAAGAAAAAAACCGTATTTATCCCAACTCTGTGTGAGGTGGTATTGTTGGGCGTAGGGTAATCTTGGGGCCTAGGCATGGCTAGCAATAATTAGAAATTAGTGGGTTAAGAGAGATATAGTTTTTTTATGATCTAGAATCGGAGCTTTGCTCTCCTATATTAGGGTACCCACGTTCGTCTCAAGGCCTAGATTCACAAGCCCTTGCCAACGAGGCTGTCTTGGGTGGAGAGTTATAGTTGCTTTTTGAAATTAAAAAATTCTTGAATTATGATGGGTACCAAGCTGACTTGTATTACTTTCTAGTTGAGGAAATATGTAACGTATACATGGGGCTGTAGTACACTGGAGGATTATTATGATCCGTCAACAACAGGTTGCGGCAACTGCATGGTTCAATCAAGTAAGAGTGGATCATCACGCAACCGAAGGTTAAGGAATAGTAGCAGTACTGCTCTCAGTTTTAAGAGAGAAGTAAATGGATCTACGAAGAAGGTAATGTTCTGTTATCTTACATCTGCACGAGCTTATTTATTTTCTGAAATTGTCCCTTCAAGTCTCTCCCTCTCTCTCGCCTCTTGGCTCTTTCAAAATTGATTAGACTTATCATATGCAGAACGCTAATCAAGATATGAAATCTCCGGAGGCAAGAGCTACCGACCAGGAAATGAGAATGATAGCAACCAAACAAGAGCTTGTGAGTTTTCTTGCCAATTAGTATGAAAATTTTAACTATTTTTTTAAAACAAGATACATACAGTGAAAAAGATGTTCAAAGGAGACGGACCCAAAAATGAGTGAAAAAAAAAGAAAAATGCGAAAGGCAAAGCTCAAATGAGGCTCTATAAGAGCATCCCAATTCATACAAATATTGCTCGAAGGAAATAATGGCTAAACAAATTGTGAAGCTTTGAACATGGCTAGCTTTGTCTTCAAATATGCTTTCATTCATTTCAAACCAAATTTAAAAAAAAATGAATTTGACCGCATTAATCTATAAAAGCTTTGCTCCAGAGGGTAATACTGGTCCAATCAGCAACTGAGAGCATTTCTACTTACATTATTGTCAAAAATCCACTGCACTTTGAACATCTTAATAACAGAAACCTAGCAACCAGTACCATAAAAATACGAGAAAAAGAGATGATCTTCTTACCATCTCCAAAACATAATAAACACACTGAAGGCTGTAAAATTGAGGAGAGCTTCCTTTGAAGAACCTTCACCGTATTTAAGCCTGGATGAAGTATAATCCTGGATAGAACATTAACTTTCTTAGGGCTCTTAGTTTTCCAAATAGCCGAGTACTCCTCTTTAACTAAATGGTAGGAAAAGGATAAATGCCTTGTCAAGAACTTAGAATCTTCTTCATTTGATAAAGAGGATACTGCCTAACTTGCCAACAACATGCTTAATTCCTCAATTTCATCCTCTTTCAAATTGCTTCAACTTTCCAGACTTGTGAATATCATCCCAATAATAAAAACCGAACCCTCAGAGGATGATATTTCTTTTCTTCAATTTATATTCGTTGAACTTTTGGCGCTACATGTTAAATCACTATTTTAATCTGAAAACTCAAGCTGTTATGTGAGTAAATATGATATCTTAGTTAATTCCTTATTATATATTTTGCAATGAAAACAAATTCACTTTATTTCTCATCTTCTAAACATTGCGACAACCCTAGGATATGGAAACAAGTATCAATAGTAACAAAAGGCTAAGACAAAGGAGGAAAAAATCTTACAAGGAAACAAGAAGATACAATATTTACATATGGGAGATTAAGGAAAAAGAAAAATGAACTTGGTCCACGAGGTAGCCTCCTTGAATGGTTTCTTGGACGGACCCTTTCTCAACACTTCTTCATACTCTCTTTCACGTGTGGGCTTGGTTACTAGTTTTCTTTGAAAAAATAACAAAAGAAGGCAGGAAAGGGAAAAAAACCCATCACAATGATTATGGGAGTGATAAGAAATTGCAAAAAGGAGCTCCAACTATTGGTGATAAAAAAGGATGACCAGAAGTTTAAACTCATAGAAGAGGTTAAAATGTGCCCAACCTCCAACCTCATCCGACTATCAACACTTGAGAAAATGGTGGAGTGCAACGCCTCCCTTCAAAAGTCAATAGATTGTTTCTTGATTTGCAAATCTTAAGGTGCAGCTAAGAATGGAGCAATTAAAACCTGTCCCTGTTTAACATTGCAGCAAGTGGATGGAAGAAAATCTCTTTGGAGAGCTAGTTCTCTTTCCTAATTTGTAGATAATTTTGGCTTGAGATGAATACGGGGGCTCTAGTGAATGTTATTGAGTGGGAGGTGGTTTATGGTTTAGACCATGTCAACAAACCTCTTTAGATTTCTTACTACAAATTGATACCATTATTATTCGTTTTACTGAGTAATTTGAGTCTACTTGCTAGTTTGCGCGTGGCTTTTATCTTTTTTTCTTTTGCCCCCTCATAATTCTTTAAGAAATTCTTGTTTCTTATTTATAAAATTAGCTACTCTGCCAGATATATGGTTACTGACATTCATATGGTAAATTTCTAGATACAGAACAAAATTTATGATCTATGAATTATGAGGTCATTTTGAAAAATTTAGATTGGTCGATCAAATAAATGGATATCTTAAGTAGCTTCAGAATAAATTCATGAATATTGAGAATATACTTAGTTCCCACCCTGATTTGCATATCCTACATGCCACTTAATATAACCACAATCCATGTTGCTGGGGTAAAACTTGCTAAATGTATTGAAGATACAAATCGAATCACTTATAGTTCTGATGTCCATGTTTTTCTTCCAGCCTCTGAAACATCATGTACTTGGTTCTACCTACGAAGGGAAGCAAAGCTTGACCGTCTCATACGAGGGAGATTCTTCACAACTTGGAGACTTTTCTCATCTTGAAGATTTTATTCCCCCACCGCCCTCAATAGGTAACTACCATTTTTTTCAAGAATGATAACTGACGTTCCTGCTCTTCCCCTCCATATCTTTCATTTTTTTGTCAAGAGACATAATGTTATTACTGTTTTGACAAGAGGAGAGGAAGGGTTGGCATTTTAATTAGTCTTGAAAAATGTTATTCATTGCAATACTTGAATGATGAAAAAGTACTCATTCATCCTTTGGATTTGTGACATTGTTTCTGCTTTGAATGACCATGCTCAGAGTCATTTTTAAATTTTCTGTTGCAGAGTTTAGCGATAGTTCTGATATCGACATGTTTGATATCTTGAAGAACATGAAAAGTTTGTACGAGTATAACGTAATTCTTAGAGAAAAGCTATTGTCCACTCAATCTGAGGTTCGTGCTATAGCAACCAACTCCACGCCA
mRNA sequence
ATGTCATGTCCAAGCAATAGCTTCCGCTTCAATGGCAGCCTCTGTGCTTGCCCACCTGGCCAACTTCTCAATCGTACAGAAAACAGCTGTGTTCTCTTCAGCAGCACTTCGGCCATCACTTCAGGCCGGCTCGAGAACTATGCTGTTAGCTTCCCTGAGACCATAGTCTCCTTTGATTCCATCAAGAAGATAACGCAGTCTCAGGCTGTGTTTCTTCAGGCCACTCTCGTCTTGCTGTTTTCTTGGCTCGCCTTCTGCATATTCCTTAGATTCATGAAGCTTGGGGATGGCAGAAATATCTGGTTCAGGATGAGATGGTGGGTTAGCAGACTGGACGTTTGCTTTGCCACAAGACATTGGCTGGATGAGCGAAAGGTAGTTACAAAACGTAAAACTGAACTTGGTGGAGCATTCTCAATGGGAAGTTGGATACTTTTTATTGGCTTGTTTGCTGCGTTGCTTTACCAAATCATATCAAAGAGGAGCATCGAAGTGCATAATGTGAAAGCAGCAAATGGGCCAGACTTAGCTTCCTTTGTGAATGACATGGAATTTAATATAACTGCAGTTTCTACTATGAACTGTGCAAATGTTCGTGGTCTTGATACTGTAGTGTTTGGAAATCCTGGTTTTCTGGAGCAGAAAGTAATGCCTCTGTCAAATTTTGCAAACTACTCCTGTCAAAACAGGAGTGAAGGGCCAACTATTAGTGTTCGGTGTGAAAGATGTCGTTTCATTCAGGACGATATTTATATATCATGGCAGTTTGTTGATCTTCCAAATAGTCCTGCAAGTGCTGTTGGATTTCAGTTTAGCTTCTCTGCAAAGGATCATGCTAAAAAAGATCGAGAAAGTTTTGTTAGTGGAACATTAAAAAATGGAAGCAATTTTGATGATACACCAGTTACATTCAGAGGGAAGAATGCAAATATAGTGCAATTTAATCTATTTCCAAGAATATACCGGAACCAACAGGATTCCAAGCTCATGCAGCCTTTATTTCATGAGTTTCTCCCAGGTTCATCCTTTCAAAAGACTAGTGAGCTCCAATCATCCCTTCAAAATGTTAATGATGGACTACTCAACATCACCTTGTACATCAATCTTCTCTCTTCCTACATTGTTGAGGTAGAGAGTCAAAGTATTATGGGCCCTGTTAGCTTTCTTGCAGATCTTGGTGGCCTATATTGCATTAGTGTTGGCATTTTCTTCTACCTTCTGGTGCAGTGCGAGTACAGAATTAAAAGGCTCCGCAACGAAGACAGTGTTTTGCGTAAAATTAGAAATAGAAGAAAAGCACAAGAACATTGGAATAAGTTGAGGAAATATGTAACGTATACATGGGGCTGTAGTACACTGGAGGATTATTATGATCCGTCAACAACAGGTTGCGGCAACTGCATGGTTCAATCAAGTAAGAGTGGATCATCACGCAACCGAAGGTTAAGGAATAGTAGCAGTACTGCTCTCAGTTTTAAGAGAGAAGTAAATGGATCTACGAAGAAGAACGCTAATCAAGATATGAAATCTCCGGAGGCAAGAGCTACCGACCAGGAAATGAGAATGATAGCAACCAAACAAGAGCTTCCTCTGAAACATCATGTACTTGGTTCTACCTACGAAGGGAAGCAAAGCTTGACCGTCTCATACGAGGGAGATTCTTCACAACTTGGAGACTTTTCTCATCTTGAAGATTTTATTCCCCCACCGCCCTCAATAGAGTTTAGCGATAGTTCTGATATCGACATGTTTGATATCTTGAAGAACATGAAAAGTTTGTACGAGTATAACGTAATTCTTAGAGAAAAGCTATTGTCCACTCAATCTGAGGTTCGTGCTATAGCAACCAACTCCACGCCA
Coding sequence (CDS)
ATGTCATGTCCAAGCAATAGCTTCCGCTTCAATGGCAGCCTCTGTGCTTGCCCACCTGGCCAACTTCTCAATCGTACAGAAAACAGCTGTGTTCTCTTCAGCAGCACTTCGGCCATCACTTCAGGCCGGCTCGAGAACTATGCTGTTAGCTTCCCTGAGACCATAGTCTCCTTTGATTCCATCAAGAAGATAACGCAGTCTCAGGCTGTGTTTCTTCAGGCCACTCTCGTCTTGCTGTTTTCTTGGCTCGCCTTCTGCATATTCCTTAGATTCATGAAGCTTGGGGATGGCAGAAATATCTGGTTCAGGATGAGATGGTGGGTTAGCAGACTGGACGTTTGCTTTGCCACAAGACATTGGCTGGATGAGCGAAAGGTAGTTACAAAACGTAAAACTGAACTTGGTGGAGCATTCTCAATGGGAAGTTGGATACTTTTTATTGGCTTGTTTGCTGCGTTGCTTTACCAAATCATATCAAAGAGGAGCATCGAAGTGCATAATGTGAAAGCAGCAAATGGGCCAGACTTAGCTTCCTTTGTGAATGACATGGAATTTAATATAACTGCAGTTTCTACTATGAACTGTGCAAATGTTCGTGGTCTTGATACTGTAGTGTTTGGAAATCCTGGTTTTCTGGAGCAGAAAGTAATGCCTCTGTCAAATTTTGCAAACTACTCCTGTCAAAACAGGAGTGAAGGGCCAACTATTAGTGTTCGGTGTGAAAGATGTCGTTTCATTCAGGACGATATTTATATATCATGGCAGTTTGTTGATCTTCCAAATAGTCCTGCAAGTGCTGTTGGATTTCAGTTTAGCTTCTCTGCAAAGGATCATGCTAAAAAAGATCGAGAAAGTTTTGTTAGTGGAACATTAAAAAATGGAAGCAATTTTGATGATACACCAGTTACATTCAGAGGGAAGAATGCAAATATAGTGCAATTTAATCTATTTCCAAGAATATACCGGAACCAACAGGATTCCAAGCTCATGCAGCCTTTATTTCATGAGTTTCTCCCAGGTTCATCCTTTCAAAAGACTAGTGAGCTCCAATCATCCCTTCAAAATGTTAATGATGGACTACTCAACATCACCTTGTACATCAATCTTCTCTCTTCCTACATTGTTGAGGTAGAGAGTCAAAGTATTATGGGCCCTGTTAGCTTTCTTGCAGATCTTGGTGGCCTATATTGCATTAGTGTTGGCATTTTCTTCTACCTTCTGGTGCAGTGCGAGTACAGAATTAAAAGGCTCCGCAACGAAGACAGTGTTTTGCGTAAAATTAGAAATAGAAGAAAAGCACAAGAACATTGGAATAAGTTGAGGAAATATGTAACGTATACATGGGGCTGTAGTACACTGGAGGATTATTATGATCCGTCAACAACAGGTTGCGGCAACTGCATGGTTCAATCAAGTAAGAGTGGATCATCACGCAACCGAAGGTTAAGGAATAGTAGCAGTACTGCTCTCAGTTTTAAGAGAGAAGTAAATGGATCTACGAAGAAGAACGCTAATCAAGATATGAAATCTCCGGAGGCAAGAGCTACCGACCAGGAAATGAGAATGATAGCAACCAAACAAGAGCTTCCTCTGAAACATCATGTACTTGGTTCTACCTACGAAGGGAAGCAAAGCTTGACCGTCTCATACGAGGGAGATTCTTCACAACTTGGAGACTTTTCTCATCTTGAAGATTTTATTCCCCCACCGCCCTCAATAGAGTTTAGCGATAGTTCTGATATCGACATGTTTGATATCTTGAAGAACATGAAAAGTTTGTACGAGTATAACGTAATTCTTAGAGAAAAGCTATTGTCCACTCAATCTGAGGTTCGTGCTATAGCAACCAACTCCACGCCA
Protein sequence
MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDSIKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHWLDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFVNDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRCERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDTPVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGLLNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNEDSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYYDPSTTGCGNCMVQSSKSGSSRNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLGSTYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYNVILREKLLSTQSEVRAIATNSTP
Homology
BLAST of MS021997 vs. NCBI nr
Match:
XP_022149823.1 (uncharacterized protein LOC111018164 [Momordica charantia])
HSP 1 Score: 1228.8 bits (3178), Expect = 0.0e+00
Identity = 620/620 (100.00%), Postives = 620/620 (100.00%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS
Sbjct: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV
Sbjct: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC
Sbjct: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT
Sbjct: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE
Sbjct: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYYDPSTTGCGNCMVQSSKSGSSRNR 480
DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYYDPSTTGCGNCMVQSSKSGSSRNR
Sbjct: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYYDPSTTGCGNCMVQSSKSGSSRNR 480
Query: 481 RLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLGSTY 540
RLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLGSTY
Sbjct: 481 RLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLGSTY 540
Query: 541 EGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYNVIL 600
EGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYNVIL
Sbjct: 541 EGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYNVIL 600
Query: 601 REKLLSTQSEVRAIATNSTP 621
REKLLSTQSEVRAIATNSTP
Sbjct: 601 REKLLSTQSEVRAIATNSTP 620
BLAST of MS021997 vs. NCBI nr
Match:
XP_038889680.1 (uncharacterized protein LOC120079539 isoform X1 [Benincasa hispida] >XP_038889681.1 uncharacterized protein LOC120079539 isoform X1 [Benincasa hispida])
HSP 1 Score: 1027.3 bits (2655), Expect = 5.4e-296
Identity = 522/621 (84.06%), Postives = 568/621 (91.47%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPSNSFR+NGSLCACPPGQLLNRT NSCVLFS TSAIT+GRL+ YAVSFPETI SFDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLNRTNNSCVLFSRTSAITTGRLQTYAVSFPETIFSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
I+KITQSQAVFL+ATLV+L SWL FCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFA+RHW
Sbjct: 61 IRKITQSQAVFLEATLVMLLSWLFFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFASRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LD++KVV KRKTELGG FS+ SWILFIGLFAALLYQIISKRSIEVHNVKAAN PDL SFV
Sbjct: 121 LDDQKVVRKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLISFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
ND+EFNIT VSTM+CAN+RGLDT+VFGNPGFLEQKVM LSNFAN+SCQNRSEGPTISV+C
Sbjct: 181 NDIEFNITTVSTMSCANIRGLDTIVFGNPGFLEQKVMSLSNFANFSCQNRSEGPTISVKC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDDIYISWQFVDLPN+PASAVGF+F+ SAKDH +K++ESFVSGTLKN SNFDDT
Sbjct: 241 ERCRFIQDDIYISWQFVDLPNNPASAVGFEFNISAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGKNANIVQFNLFPRIY N+QDSKLMQPLFHEF+ GSSFQ T+ELQ SL+N NDGL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYSNKQDSKLMQPLFHEFVSGSSFQNTNELQLSLENPNDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
LNITLYINLLSSYIVEV+SQ+I+GPVSFLADLGGLYCISVGIFFYLLVQ EYRIK+LRNE
Sbjct: 361 LNITLYINLLSSYIVEVQSQNILGPVSFLADLGGLYCISVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYY-DPS-TTGCGNCMVQSS-KSGSS 480
DSV+RKIRNRRKAQEHWNKLRKYV YTWGCS L+D Y DPS T+ C NC+ QSS K+GSS
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWGCSALDDVYNDPSKTSSCPNCIGQSSHKNGSS 480
Query: 481 RNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLG 540
R RRL + SSTA+SF +VNGSTKK ANQDMKSP+A A DQEMRMIATKQELPL H VLG
Sbjct: 481 RKRRL-SGSSTAISFNVDVNGSTKKTANQDMKSPKATAADQEMRMIATKQELPLHHQVLG 540
Query: 541 STYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYN 600
STYE +Q T+ ++GDSSQ DFSH ED IPPPPSI+F DSSDIDM DI+KNMKSLYEYN
Sbjct: 541 STYEKQQ--TIPFKGDSSQPLDFSHPEDIIPPPPSIDFKDSSDIDMSDIMKNMKSLYEYN 600
Query: 601 VILREKLLSTQSEVRAIATNS 619
V LREKLLSTQSEVRA+AT S
Sbjct: 601 VFLREKLLSTQSEVRALATKS 618
BLAST of MS021997 vs. NCBI nr
Match:
XP_004152836.1 (uncharacterized protein LOC101211303 [Cucumis sativus] >XP_011648990.1 uncharacterized protein LOC101211303 [Cucumis sativus] >KGN61244.1 hypothetical protein Csa_006652 [Cucumis sativus])
HSP 1 Score: 1008.4 bits (2606), Expect = 2.6e-290
Identity = 512/622 (82.32%), Postives = 563/622 (90.51%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPSNSFR+NGSLCACPPGQLLNR NSCVLFS TSAIT+GRL+NYAVSFPETI SFDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLNRANNSCVLFSRTSAITTGRLQNYAVSFPETIFSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
I+KITQSQAVFL+ATLV+L SWL FCIFLRFMKLGDGRNIWFR+RWWVSRLDVCFATRHW
Sbjct: 61 IRKITQSQAVFLEATLVMLLSWLFFCIFLRFMKLGDGRNIWFRIRWWVSRLDVCFATRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LD++++VTKRKTELGG FS+ SWILFIGLFAALLYQIISKRSIEVHNVKAAN PDL SFV
Sbjct: 121 LDDQRIVTKRKTELGGMFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
ND+EFNIT VSTM+CAN+RGLDTVVFGNPGFLEQKVMPLS+FAN+SCQNRSEGPTIS++C
Sbjct: 181 NDIEFNITTVSTMSCANIRGLDTVVFGNPGFLEQKVMPLSSFANFSCQNRSEGPTISLKC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDD+YISWQFVDLPN+PASAVGF+F+ SAKD ++ +ESFVSGTLKN SNFDDT
Sbjct: 241 ERCRFIQDDVYISWQFVDLPNNPASAVGFEFNISAKDQVQRSQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGK+ANIVQFNLFPRIY N+QDSKLMQPLFHEF+ GSSFQ T++LQ SL+N NDGL
Sbjct: 301 PVTFRGKSANIVQFNLFPRIYSNKQDSKLMQPLFHEFVSGSSFQNTNDLQLSLENTNDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
LNITLYINLLSSYIVEVESQ+I+GPVSFLADLGGLYCIS GIFFYLLVQ EYRIKRLRNE
Sbjct: 361 LNITLYINLLSSYIVEVESQNILGPVSFLADLGGLYCISFGIFFYLLVQFEYRIKRLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTL--EDYYDPS-TTGCGNCMVQ-SSKSGS 480
DSV+RKIRNRRKAQEHWNKLRKYV YTWGCS L DY DPS T+ C NC+ Q S K+GS
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWGCSALLDGDYNDPSKTSSCPNCIGQPSHKNGS 480
Query: 481 SRNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVL 540
SR RRL++ SSTA+SF +VNG+T + NQDMKSP+A ATDQEMRMIATKQE PL H VL
Sbjct: 481 SRKRRLKSGSSTAISFNIDVNGATNRTVNQDMKSPKATATDQEMRMIATKQEQPLHHQVL 540
Query: 541 GSTYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEY 600
GSTYE KQ TV ++GDSSQ DFS ED IPPPP I+F+DSSDIDM +ILKNMKSLYEY
Sbjct: 541 GSTYEEKQR-TVPFKGDSSQPVDFSRSED-IPPPPLIDFNDSSDIDMSNILKNMKSLYEY 600
Query: 601 NVILREKLLSTQSEVRAIATNS 619
NV LREKLLSTQSEVRA+AT S
Sbjct: 601 NVFLREKLLSTQSEVRALATKS 620
BLAST of MS021997 vs. NCBI nr
Match:
XP_008441901.1 (PREDICTED: uncharacterized protein LOC103485904 [Cucumis melo])
HSP 1 Score: 997.3 bits (2577), Expect = 6.0e-287
Identity = 501/613 (81.73%), Postives = 557/613 (90.86%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPSNSFR+NGSLCACPPGQLL+RT NSC+LFS TSAIT+GRL+NYAVSFPETI SFDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLSRTNNSCILFSRTSAITTGRLQNYAVSFPETIFSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
I+KITQSQAVFL+ATLV+L SWL FCIFLRFMKLGDGRNIWFR+RWWVSRLDVCFATRHW
Sbjct: 61 IRKITQSQAVFLEATLVMLLSWLFFCIFLRFMKLGDGRNIWFRIRWWVSRLDVCFATRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LD+++ VTKRKTELGG FS+ SWILFIGLFAALLYQIISKRSIEVHNVKAAN PDL SFV
Sbjct: 121 LDDQRTVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
ND+EFNIT VSTM+CAN+RGLDT+VFGNPGFLEQKVMPLS+FAN+SCQNRSEGPTIS++C
Sbjct: 181 NDIEFNITTVSTMSCANIRGLDTIVFGNPGFLEQKVMPLSSFANFSCQNRSEGPTISLKC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDD+YISWQFVDLPN+PASAVGF+F+ SAKDH +K++ESFVSGTLKN SNFDDT
Sbjct: 241 ERCRFIQDDVYISWQFVDLPNNPASAVGFEFNISAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGK+ANIVQFNLFPRIY N+QDSKLMQPLFHEF+ GSSFQ T++LQ SL+N NDGL
Sbjct: 301 PVTFRGKSANIVQFNLFPRIYSNKQDSKLMQPLFHEFVSGSSFQNTNDLQLSLENANDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
LNITLYINLLSSYI+EVESQ+I+GPVSFLADLGGLYCISVGIFFYLLVQ EYRIK+LRNE
Sbjct: 361 LNITLYINLLSSYIIEVESQNILGPVSFLADLGGLYCISVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTL-EDYYDPS-TTGCGNCMVQ-SSKSGSS 480
DSV+RKIRNRRKAQEHWNKLRKYV YTWGCS L DY D S T+ C NC+ Q S K+GSS
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWGCSALVGDYNDQSETSSCPNCIGQPSHKNGSS 480
Query: 481 RNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLG 540
R R LR+ SSTA++F +VNG+TK+ ANQDMK+P+A ATDQEMRMIATKQE PL H VLG
Sbjct: 481 RKRHLRSGSSTAINFNIDVNGATKRTANQDMKTPKATATDQEMRMIATKQEQPLHHQVLG 540
Query: 541 STYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYN 600
STYE KQ TV ++GDSSQ DFS ED IP PP I+F+D SD+DM +ILKNMKSLYEYN
Sbjct: 541 STYEEKQR-TVPFKGDSSQPVDFSRPEDIIPLPPLIDFNDCSDVDMSNILKNMKSLYEYN 600
Query: 601 VILREKLLSTQSE 611
V LREKLLSTQSE
Sbjct: 601 VFLREKLLSTQSE 612
BLAST of MS021997 vs. NCBI nr
Match:
XP_022937645.1 (uncharacterized protein LOC111443988 [Cucurbita moschata] >XP_022937646.1 uncharacterized protein LOC111443988 [Cucurbita moschata] >XP_022937647.1 uncharacterized protein LOC111443988 [Cucurbita moschata])
HSP 1 Score: 993.0 bits (2566), Expect = 1.1e-285
Identity = 501/623 (80.42%), Postives = 558/623 (89.57%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPS+SFR+NGS CACPPGQLLNR+ NSCV+F+S S IT+GR E+YAVSFPETI SFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
I+K TQSQAVFL+ATL LL SWL FC+FLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LD++KVVTKRKTELGG FS+ SWILFIGLFAALLYQIISKRSIEVHNVKAAN PDL SFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
NDMEFNIT VSTM+CAN+RGL T VFGNPGFLEQ VMPLS FANYSCQN SEGPTISV+C
Sbjct: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDD+Y+SWQFVDLPN+PASAVGFQF+FSAKDH +K++ESFVSGTLKN SNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGKNANI+QFNLFPRIYR+++DSKLMQPLFHEF+ GSSFQ T+ELQ SL+N NDGL
Sbjct: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
+NITLYINLLSSYIVEVE+Q+I+GPVSFLADLGGLYC+SVGIFFYLLVQ EYRIK+LRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCVSVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLED-YYDPS-TTGCGNCMVQ-SSKSGSS 480
D+V+RKIRNRRKAQEHWNKLRKYV YTW CSTL D DPS T+ C NC+ Q + K S
Sbjct: 421 DTVMRKIRNRRKAQEHWNKLRKYVMYTWDCSTLYDNCNDPSKTSNCANCIGQPARKDESL 480
Query: 481 RNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLG 540
R RRL+N SSTA+SFK +VNGS KK +++D KSP+ARATDQEM MIATKQE P +HHVLG
Sbjct: 481 RKRRLKNGSSTAISFKLDVNGSAKK-SSKDEKSPKARATDQEMGMIATKQE-PPQHHVLG 540
Query: 541 STYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYN 600
ST+E KQS TV +EGDSSQ G+FS ED IPPPP I+F SSDIDMFD+LKN+KSLYEYN
Sbjct: 541 STHETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMFDVLKNIKSLYEYN 600
Query: 601 VILREKLLSTQSEVRAIATNSTP 621
V LREKLLSTQSEVRA++ S P
Sbjct: 601 VFLREKLLSTQSEVRALSAKSAP 621
BLAST of MS021997 vs. ExPASy TrEMBL
Match:
A0A6J1D7T0 (uncharacterized protein LOC111018164 OS=Momordica charantia OX=3673 GN=LOC111018164 PE=4 SV=1)
HSP 1 Score: 1228.8 bits (3178), Expect = 0.0e+00
Identity = 620/620 (100.00%), Postives = 620/620 (100.00%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS
Sbjct: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV
Sbjct: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC
Sbjct: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT
Sbjct: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE
Sbjct: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYYDPSTTGCGNCMVQSSKSGSSRNR 480
DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYYDPSTTGCGNCMVQSSKSGSSRNR
Sbjct: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYYDPSTTGCGNCMVQSSKSGSSRNR 480
Query: 481 RLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLGSTY 540
RLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLGSTY
Sbjct: 481 RLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLGSTY 540
Query: 541 EGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYNVIL 600
EGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYNVIL
Sbjct: 541 EGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYNVIL 600
Query: 601 REKLLSTQSEVRAIATNSTP 621
REKLLSTQSEVRAIATNSTP
Sbjct: 601 REKLLSTQSEVRAIATNSTP 620
BLAST of MS021997 vs. ExPASy TrEMBL
Match:
A0A0A0LJH8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074060 PE=4 SV=1)
HSP 1 Score: 1008.4 bits (2606), Expect = 1.3e-290
Identity = 512/622 (82.32%), Postives = 563/622 (90.51%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPSNSFR+NGSLCACPPGQLLNR NSCVLFS TSAIT+GRL+NYAVSFPETI SFDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLNRANNSCVLFSRTSAITTGRLQNYAVSFPETIFSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
I+KITQSQAVFL+ATLV+L SWL FCIFLRFMKLGDGRNIWFR+RWWVSRLDVCFATRHW
Sbjct: 61 IRKITQSQAVFLEATLVMLLSWLFFCIFLRFMKLGDGRNIWFRIRWWVSRLDVCFATRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LD++++VTKRKTELGG FS+ SWILFIGLFAALLYQIISKRSIEVHNVKAAN PDL SFV
Sbjct: 121 LDDQRIVTKRKTELGGMFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
ND+EFNIT VSTM+CAN+RGLDTVVFGNPGFLEQKVMPLS+FAN+SCQNRSEGPTIS++C
Sbjct: 181 NDIEFNITTVSTMSCANIRGLDTVVFGNPGFLEQKVMPLSSFANFSCQNRSEGPTISLKC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDD+YISWQFVDLPN+PASAVGF+F+ SAKD ++ +ESFVSGTLKN SNFDDT
Sbjct: 241 ERCRFIQDDVYISWQFVDLPNNPASAVGFEFNISAKDQVQRSQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGK+ANIVQFNLFPRIY N+QDSKLMQPLFHEF+ GSSFQ T++LQ SL+N NDGL
Sbjct: 301 PVTFRGKSANIVQFNLFPRIYSNKQDSKLMQPLFHEFVSGSSFQNTNDLQLSLENTNDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
LNITLYINLLSSYIVEVESQ+I+GPVSFLADLGGLYCIS GIFFYLLVQ EYRIKRLRNE
Sbjct: 361 LNITLYINLLSSYIVEVESQNILGPVSFLADLGGLYCISFGIFFYLLVQFEYRIKRLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTL--EDYYDPS-TTGCGNCMVQ-SSKSGS 480
DSV+RKIRNRRKAQEHWNKLRKYV YTWGCS L DY DPS T+ C NC+ Q S K+GS
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWGCSALLDGDYNDPSKTSSCPNCIGQPSHKNGS 480
Query: 481 SRNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVL 540
SR RRL++ SSTA+SF +VNG+T + NQDMKSP+A ATDQEMRMIATKQE PL H VL
Sbjct: 481 SRKRRLKSGSSTAISFNIDVNGATNRTVNQDMKSPKATATDQEMRMIATKQEQPLHHQVL 540
Query: 541 GSTYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEY 600
GSTYE KQ TV ++GDSSQ DFS ED IPPPP I+F+DSSDIDM +ILKNMKSLYEY
Sbjct: 541 GSTYEEKQR-TVPFKGDSSQPVDFSRSED-IPPPPLIDFNDSSDIDMSNILKNMKSLYEY 600
Query: 601 NVILREKLLSTQSEVRAIATNS 619
NV LREKLLSTQSEVRA+AT S
Sbjct: 601 NVFLREKLLSTQSEVRALATKS 620
BLAST of MS021997 vs. ExPASy TrEMBL
Match:
A0A1S3B545 (uncharacterized protein LOC103485904 OS=Cucumis melo OX=3656 GN=LOC103485904 PE=4 SV=1)
HSP 1 Score: 997.3 bits (2577), Expect = 2.9e-287
Identity = 501/613 (81.73%), Postives = 557/613 (90.86%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPSNSFR+NGSLCACPPGQLL+RT NSC+LFS TSAIT+GRL+NYAVSFPETI SFDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLSRTNNSCILFSRTSAITTGRLQNYAVSFPETIFSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
I+KITQSQAVFL+ATLV+L SWL FCIFLRFMKLGDGRNIWFR+RWWVSRLDVCFATRHW
Sbjct: 61 IRKITQSQAVFLEATLVMLLSWLFFCIFLRFMKLGDGRNIWFRIRWWVSRLDVCFATRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LD+++ VTKRKTELGG FS+ SWILFIGLFAALLYQIISKRSIEVHNVKAAN PDL SFV
Sbjct: 121 LDDQRTVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
ND+EFNIT VSTM+CAN+RGLDT+VFGNPGFLEQKVMPLS+FAN+SCQNRSEGPTIS++C
Sbjct: 181 NDIEFNITTVSTMSCANIRGLDTIVFGNPGFLEQKVMPLSSFANFSCQNRSEGPTISLKC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDD+YISWQFVDLPN+PASAVGF+F+ SAKDH +K++ESFVSGTLKN SNFDDT
Sbjct: 241 ERCRFIQDDVYISWQFVDLPNNPASAVGFEFNISAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGK+ANIVQFNLFPRIY N+QDSKLMQPLFHEF+ GSSFQ T++LQ SL+N NDGL
Sbjct: 301 PVTFRGKSANIVQFNLFPRIYSNKQDSKLMQPLFHEFVSGSSFQNTNDLQLSLENANDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
LNITLYINLLSSYI+EVESQ+I+GPVSFLADLGGLYCISVGIFFYLLVQ EYRIK+LRNE
Sbjct: 361 LNITLYINLLSSYIIEVESQNILGPVSFLADLGGLYCISVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTL-EDYYDPS-TTGCGNCMVQ-SSKSGSS 480
DSV+RKIRNRRKAQEHWNKLRKYV YTWGCS L DY D S T+ C NC+ Q S K+GSS
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWGCSALVGDYNDQSETSSCPNCIGQPSHKNGSS 480
Query: 481 RNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLG 540
R R LR+ SSTA++F +VNG+TK+ ANQDMK+P+A ATDQEMRMIATKQE PL H VLG
Sbjct: 481 RKRHLRSGSSTAINFNIDVNGATKRTANQDMKTPKATATDQEMRMIATKQEQPLHHQVLG 540
Query: 541 STYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYN 600
STYE KQ TV ++GDSSQ DFS ED IP PP I+F+D SD+DM +ILKNMKSLYEYN
Sbjct: 541 STYEEKQR-TVPFKGDSSQPVDFSRPEDIIPLPPLIDFNDCSDVDMSNILKNMKSLYEYN 600
Query: 601 VILREKLLSTQSE 611
V LREKLLSTQSE
Sbjct: 601 VFLREKLLSTQSE 612
BLAST of MS021997 vs. ExPASy TrEMBL
Match:
A0A6J1FBT5 (uncharacterized protein LOC111443988 OS=Cucurbita moschata OX=3662 GN=LOC111443988 PE=4 SV=1)
HSP 1 Score: 993.0 bits (2566), Expect = 5.5e-286
Identity = 501/623 (80.42%), Postives = 558/623 (89.57%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPS+SFR+NGS CACPPGQLLNR+ NSCV+F+S S IT+GR E+YAVSFPETI SFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
I+K TQSQAVFL+ATL LL SWL FC+FLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LD++KVVTKRKTELGG FS+ SWILFIGLFAALLYQIISKRSIEVHNVKAAN PDL SFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
NDMEFNIT VSTM+CAN+RGL T VFGNPGFLEQ VMPLS FANYSCQN SEGPTISV+C
Sbjct: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
ERCRFIQDD+Y+SWQFVDLPN+PASAVGFQF+FSAKDH +K++ESFVSGTLKN SNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGKNANI+QFNLFPRIYR+++DSKLMQPLFHEF+ GSSFQ T+ELQ SL+N NDGL
Sbjct: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
+NITLYINLLSSYIVEVE+Q+I+GPVSFLADLGGLYC+SVGIFFYLLVQ EYRIK+LRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCVSVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLED-YYDPS-TTGCGNCMVQ-SSKSGSS 480
D+V+RKIRNRRKAQEHWNKLRKYV YTW CSTL D DPS T+ C NC+ Q + K S
Sbjct: 421 DTVMRKIRNRRKAQEHWNKLRKYVMYTWDCSTLYDNCNDPSKTSNCANCIGQPARKDESL 480
Query: 481 RNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLG 540
R RRL+N SSTA+SFK +VNGS KK +++D KSP+ARATDQEM MIATKQE P +HHVLG
Sbjct: 481 RKRRLKNGSSTAISFKLDVNGSAKK-SSKDEKSPKARATDQEMGMIATKQE-PPQHHVLG 540
Query: 541 STYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYN 600
ST+E KQS TV +EGDSSQ G+FS ED IPPPP I+F SSDIDMFD+LKN+KSLYEYN
Sbjct: 541 STHETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMFDVLKNIKSLYEYN 600
Query: 601 VILREKLLSTQSEVRAIATNSTP 621
V LREKLLSTQSEVRA++ S P
Sbjct: 601 VFLREKLLSTQSEVRALSAKSAP 621
BLAST of MS021997 vs. ExPASy TrEMBL
Match:
A0A6J1ENC3 (uncharacterized protein LOC111436176 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436176 PE=4 SV=1)
HSP 1 Score: 984.6 bits (2544), Expect = 1.9e-283
Identity = 498/623 (79.94%), Postives = 552/623 (88.60%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAVSFPETIVSFDS 60
MSCPSNSFR+NGSLCACPPGQLLNRT NSCV+FS TSAIT+GRLEN AVSFPETI +FDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLNRTSNSCVVFSGTSAITTGRLENSAVSFPETIFAFDS 60
Query: 61 IKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
I+KITQSQAVFL+ATLV+L WL FCIFLRFMKL DGRNIWFRMRWWVSRLDVCF+TRHW
Sbjct: 61 IRKITQSQAVFLKATLVMLLCWLFFCIFLRFMKLEDGRNIWFRMRWWVSRLDVCFSTRHW 120
Query: 121 LDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASFV 180
LD++KVVTKRKTELGG FSM SWI+F GLFAALLYQIISKRSIEVHN+KAAN PDL SFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSMASWIVFSGLFAALLYQIISKRSIEVHNMKAANAPDLVSFV 180
Query: 181 NDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVRC 240
NDMEFNIT VSTM+C+N+RGLDT+VFGNPGFL QKVMPLSNFAN+SCQNRSEGPTISV+C
Sbjct: 181 NDMEFNITTVSTMSCSNIRGLDTIVFGNPGFLAQKVMPLSNFANFSCQNRSEGPTISVKC 240
Query: 241 ERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDDT 300
E+CRFIQDDIYISWQF+DLPN+PASAVGFQF+FS+KDH +K++ESFVSGTLKN SN DDT
Sbjct: 241 EKCRFIQDDIYISWQFIDLPNNPASAVGFQFNFSSKDHVQKNQESFVSGTLKNRSNLDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDGL 360
PVTFRGKNANIVQFNLFPRI+RN QDSKLMQPLFHEF+ GSSFQ T+ELQ SL+N N+GL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIFRNNQDSKLMQPLFHEFVSGSSFQNTNELQLSLENANEGL 360
Query: 361 LNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRNE 420
LNITLYINLLSSYIVEVE Q+I GPVSFLADLGGLYCI+ IFFYLLVQ EYR+K+LRNE
Sbjct: 361 LNITLYINLLSSYIVEVERQNIFGPVSFLADLGGLYCITFSIFFYLLVQLEYRVKKLRNE 420
Query: 421 DSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLE-DYYDPST-TGCGNCMVQSS-KSGSS 480
DSV+ K+RNRRKAQEHWNKLRKYV YTWG S L+ DY DPS + C NC+ SS K+GSS
Sbjct: 421 DSVMLKVRNRRKAQEHWNKLRKYVMYTWGYSALDNDYSDPSKGSSCTNCIGPSSRKNGSS 480
Query: 481 RNRRLRNSSSTALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLG 540
R R LR+ SSTA+SF +VNG TK+ A DM SP+A ATD+EMR IATKQE PL H VLG
Sbjct: 481 RPRGLRSGSSTAISFHVDVNGYTKETAKHDMISPKATATDREMRTIATKQERPLHHQVLG 540
Query: 541 STYEGKQSLTVSYEGDSSQLGDFSHLEDFIPPPPSIEFSDSSDIDMFDILKNMKSLYEYN 600
ST+EGKQ LTV ++ GDFSH ED IPPPPSI+F DSSDI M DIL++MKSLYEYN
Sbjct: 541 STHEGKQRLTVPFK------GDFSHPEDIIPPPPSIDFKDSSDIGMSDILRSMKSLYEYN 600
Query: 601 VILREKLLSTQSEVRAIATNSTP 621
V LREKLLSTQSEVRA+AT STP
Sbjct: 601 VFLREKLLSTQSEVRALATKSTP 617
BLAST of MS021997 vs. TAIR 10
Match:
AT5G16520.1 (unknown protein; Has 25 Blast hits to 25 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 25; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 660.6 bits (1703), Expect = 1.2e-189
Identity = 344/622 (55.31%), Postives = 441/622 (70.90%), Query Frame = 0
Query: 1 MSCPSNSFRFNGSLCACPPGQLLNRTENSCVLFSSTSAITSGRLENYAV-SFPETIVSFD 60
M+CP NS +N + CAC GQLLNR+ SC +F S I++ + NY+V SF ET+ +FD
Sbjct: 1 MACPRNSITYNATRCACGIGQLLNRSSGSCEIFGWPSTISTDKDVNYSVISFAETLFAFD 60
Query: 61 SIKKITQSQAVFLQATLVLLFSWLAFCIFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRH 120
I+K TQSQA+FL+ATLV+L SWL FC FLRF KLGDGRN+WF +RWW++RLDV F+TRH
Sbjct: 61 RIRKFTQSQAIFLEATLVMLLSWLVFCFFLRFTKLGDGRNVWFNLRWWITRLDVFFSTRH 120
Query: 121 WLDERKVVTKRKTELGGAFSMGSWILFIGLFAALLYQIISKRSIEVHNVKAANGPDLASF 180
WLD++++V KRKTELGG FS+ SWI+FIGLFAALLYQII+KR+IEVHNV+A PDL SF
Sbjct: 121 WLDDQQIVKKRKTELGGTFSVASWIVFIGLFAALLYQIITKRTIEVHNVRATGSPDLISF 180
Query: 181 VNDMEFNITAVSTMNCANVRGLDTVVFGNPGFLEQKVMPLSNFANYSCQNRSEGPTISVR 240
ND+EFNITAVS M+C+N+RG+ VV GNPGF E KV LS+ +Y+C+N + GPT++ +
Sbjct: 181 ENDLEFNITAVSDMSCSNLRGIGNVVMGNPGFSEFKVAALSSLGSYTCKNTTSGPTVNFK 240
Query: 241 CERCRFIQDDIYISWQFVDLPNSPASAVGFQFSFSAKDHAKKDRESFVSGTLKNGSNFDD 300
C +CR D IYISW FVDLP+SPA+AVGFQF+F++K+ + SFVSGTL+NGS D+
Sbjct: 241 CTKCRLTNDYIYISWHFVDLPDSPAAAVGFQFNFTSKNGPNEKHMSFVSGTLRNGSILDE 300
Query: 301 TPVTFRGKNANIVQFNLFPRIYRNQQDSKLMQPLFHEFLPGSSFQKTSELQSSLQNVNDG 360
+PVTFRG NI++FNLFPRIY + D KL+QPLFHEF+PGS ++ T++LQ+S+ DG
Sbjct: 301 SPVTFRGTEGNILKFNLFPRIYHHLHDLKLIQPLFHEFIPGSVYRDTTQLQASMGRSTDG 360
Query: 361 LLNITLYINLLSSYIVEVESQSIMGPVSFLADLGGLYCISVGIFFYLLVQCEYRIKRLRN 420
+LN TL+IN LS+YIVE++ ++I+GPVSFLADLGGLYCIS+GIFFYLLVQCEYRIK+LRN
Sbjct: 361 ILNTTLFINYLSAYIVEIDHENILGPVSFLADLGGLYCISIGIFFYLLVQCEYRIKKLRN 420
Query: 421 EDSVLRKIRNRRKAQEHWNKLRKYVTYTWGCSTLEDYYDPSTTGCGNCMVQSSKSGSSRN 480
ED+V RKIRNRRKA +HW+KLR+YV YTW CS L D +T G C G +R
Sbjct: 421 EDTVFRKIRNRRKALDHWDKLRRYVAYTWDCSILVDDAIKTTKVSGMC-------GLTRP 480
Query: 481 RRLRNSS--STALSFKREVNGSTKKNANQDMKSPEARATDQEMRMIATKQELPLKHHVLG 540
NSS ++ ++ N +KN S E + D L H G
Sbjct: 481 PTSSNSSEHGESIMANKKPNLGIEKNVISQPASLELSSFDSAS---------SLAH---G 540
Query: 541 STYEGKQSLTVSYEGDSSQLGDFSHLEDF-IPPPPSIEF---SDSSDIDMFDILKNMKSL 600
+ K+S+T SH ED IPPPP +EF S S++D DI + L
Sbjct: 541 DNFSNKKSIT----------HPISHSEDVSIPPPPPMEFIDGSSGSEVDAMDIKNKFQLL 593
Query: 601 YEYNVILREKLLSTQSEVRAIA 616
Y+YNV+LREKLL TQS + +A
Sbjct: 601 YDYNVLLREKLLETQSLLNTLA 593
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022149823.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111018164 [Momordica charantia] | [more] |
XP_038889680.1 | 5.4e-296 | 84.06 | uncharacterized protein LOC120079539 isoform X1 [Benincasa hispida] >XP_03888968... | [more] |
XP_004152836.1 | 2.6e-290 | 82.32 | uncharacterized protein LOC101211303 [Cucumis sativus] >XP_011648990.1 uncharact... | [more] |
XP_008441901.1 | 6.0e-287 | 81.73 | PREDICTED: uncharacterized protein LOC103485904 [Cucumis melo] | [more] |
XP_022937645.1 | 1.1e-285 | 80.42 | uncharacterized protein LOC111443988 [Cucurbita moschata] >XP_022937646.1 unchar... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D7T0 | 0.0e+00 | 100.00 | uncharacterized protein LOC111018164 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A0A0LJH8 | 1.3e-290 | 82.32 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074060 PE=4 SV=1 | [more] |
A0A1S3B545 | 2.9e-287 | 81.73 | uncharacterized protein LOC103485904 OS=Cucumis melo OX=3656 GN=LOC103485904 PE=... | [more] |
A0A6J1FBT5 | 5.5e-286 | 80.42 | uncharacterized protein LOC111443988 OS=Cucurbita moschata OX=3662 GN=LOC1114439... | [more] |
A0A6J1ENC3 | 1.9e-283 | 79.94 | uncharacterized protein LOC111436176 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT5G16520.1 | 1.2e-189 | 55.31 | unknown protein; Has 25 Blast hits to 25 proteins in 9 species: Archae - 0; Bact... | [more] |