Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
TTTTGACAGAATTAACTATTCAGGTTATTTTGACAAATCTAATTTTCAAAGCGACATTTAGCCAAAGTTTATCATCTTCAATTACAAGAGAAGTGTACTGTTCGATCAAATTCTCATTCGAGGCAATTTCAGCGGCGCAAGCTTTGAACGGTAAGCGATCGTGTTTATAGTCAGTGTGCGATTTGTTCATCGTATTTAATTTGAATATCATCATTTTCATTTTGTTGAATGTTGCTGCATTTGTTGATTTTTATGGTTGGTTGTATCTCAGTGCGTGTTCTTGATTTTTTTTGTTGAATGTGTGGGTTCTTGATTTGTAGCTTACTAGCCTCACAATCTTAATTTAGGATTGAAAATTGATCATTGTGCAAGATGTTTGGTTTAATTGACTTTTCGAGATTTTTTTAATATTTCGAACAGTCAACAGATGACTGTAGAAGACTGTCGGACCATGGAAGATTGATCTGGTATTTGAATTCGAGGGTAAATCTTATAAGTTCAAGTTCAACAATATGTAAAGTTCAACTGTTTGACGTTATAATGTCTTAATCAGTTGAACCAGGTGATAAATCGTCATGAAAAAACAAATGGACTAGACTTCATATTGTTGTCTATGCTGGGAATGGGTCGGTGGTGGCCCTCGTCCAAACCGGTGCTGCATGTTCGAAAGCCACGTGGTGGATTTTTTTTTTTTTTTGTTGGAAACCAAGAAATCGGAGCTTTACTCTACTCTACTCGGGGTAACTATGCCCGTCCCTAGGCCAAACACTAGGAGACATGGGAAGTGTCTACAAATTAATGTCACAGGTGAATCTTGAAGTTGGGAACTTGAAGTAAGCATACCTTCAAAGTCCAAACCTTCTACCACCGCGTCACCCCTTGTGGACACACTAGAGATGTCCGCAGGGCAAAGCCGGAGAATGGCTTCCCCGTTCTCGTCCCCGCCACCAAAATCAATCCCTATCCCTGCGAAATTTCGGAGATCGGGGAGAGGAATCCCCGATCAAGGACCTTGACATCCAAAGCGGGCTCCCCCTTTCATTGAAAACCACTTCTGCATTTTTGCGATATTTTGAACTTATTGTTTCAAATCGGCACAAGACGTGTTTTAAATAAGAAAGATGGGTTCAACAAAGCGATAGGAACTAAAAAAGAGAGATGGGTTGAAGATCTTTGAACGGCAGGATGTGTTTGGTTCCTAAAGATCTTTGTGGACCATTGGTTCTGTTATGAAAACGACAATCAATCGTAAAACGAGATGTAAAAAATAGATGACGATAAACGTGAGAGAGAAAATAGGAAACAACAGAGAATTTACGGGCTCGTTTGATAACGTTTCTGTTTCATGTTTCTTGTTTCTTAGTTCTTGTTTCTAGTTTCTTGTTTCTTAAAAGATATAAAAAACAGAAACAAACTTGTTTGATAACTATTTCTTGTGTCTTGTTTCTTATTTCTTGCTTTTTGTTTATATATATACATACATACACACACACACATATATATATGTATGTATGTATGTATGTATGTATGTATGTATGTATGTATGTATGTATGTATATGTACATATTTAGGCTCCTAACAAAATATAACATGCATTAAAATATAGTCATCGAAAGTTTACATTTTCTAGTATAATTAAAAAAAAAATCTTAATATAGAACTAACCATAAATTTAATATGCTATCTCTCACATAAATTATTTGTCACAAATGACGTGTTTAAAAAATGTTGTGAGTGTAAAATATTAAGGTAGTGTCAGTTGTATATTTTAGTGAACAAATTTTTCATGATACCAAATTTTTTACGGATTCCGAGTGTATTATTAACTTAAATAATAAATAAATACACATATACAAAATTTTCAAATCAACCCATAAATTAAAAATACATAATTCCATTAAAAATATAAACAATAAAAGATTACAAAATTTAAAACTAAAAAATAAAATGGTAAAAAAACATACAAAATTAAAAGAATTAAAAATTGCAAAAATGATTTG
AAAAAAAAATGCAAAGTTGGGATTTGAACCGTGGACTTGACAAATTCTGGAAAAAAAGTTGACAACACAACCATCACACCAGAAGTTGCATATTGTGATTTTTTTGGAACGCATAATATAAAACTAAAACAATACTGGGTTTCAGAAGAAAAAAAAAAAGAACGGAACAAGAGAAACAAGTTTTGAGAGTTTTTCATTTTTTCACATAATTTGAGAAACGTTTCTTGTATTTTTGGGAACAAGAAACGGAAACAGTTATCAAACAAGACTGTTTCTCAATAAATAAGAAACAAGAACAAGAAACAAGAAACGAGAACATTATCAAACGAGCCCTACGTGGTTCACCATCGTTATGTTGGCTAGTCCACGGACAGAGGAGGAGAGCAATATCATTGAGAGAGATGGAAATTTACAGATACAAATATTATGTCATAATACTGCTAGGGTTTAATTCGAGTTTATATAGGAACTCTCATTAAACCCTATTATAGAGAGCGTAAACATAAATTAGAAAGAGGATGTGTAAACGCGATGCCCAAATAGAATGCTCTACGAAGTGCGGTTAGCGGATGCGGAATGCAGAATGCGTTCGCAGGGCATTAATGCGTTCAAGGCATAACACAACAGGTTCCTTAAGTCTGGGACCAGTGGCTCTCTGGGCCTTGTGGGAGAATTTGGAAGCAAAGAATCTGAAGAGTCTTCCTTAATGTTTTGTATATAATGGGATGCCTTAATTTTCAAGGCTTCAAGTTGAAATGGTTTTATAAAGACTTATTGGCTGCTTTGCTAATGTCTTTGAAGTCTCCATTGGCCTTTTCAAATTACTTAGCTCACTTATCTTATAAGTTTTTACCTGAAAATTAAATCAGTTGCTATTGTGTTACCAGGTGTGATGGAGAGGCAGGATAAAGATGACATCCAAGACTGCACAATTAAGCTGGTTCTTATTTCCATTCATAATTCTTTTATTTATTTATTTTTTTTTTTGTTGTAAACGAGTTGATGAAATAATTGTTAGATTGGTGTCTGATGACAGAGAGTAAATCCTCAAAAGCAGAGAGACAAGGTGTACATTGGCTGTGGAGCTGGATTTGGAGGCGATAGGCCAACTGCAGCTCTGAAGCTACTTCAGAGGGTCAACGACCTAAACTATCTTGTACTTGAATGCCTAGCCGAACGCACTCTTGCAGATCGCTTTCAAGTTATGTTGTCTGGTGGCGATGGTTATGATTCTAGGAGTATGTCCATGCTTAAAAATCATATTTAGCTTACAATTTTTTTTTTTGACATCTCTTTCCTTGAAAGTTCATAATATTGTTTGGATGGTATTTTTTCTTTAGAAGCTGGCAAATGGTTATTTACATTGTGCAAATCCTTATTTGTTCTTTCACAATCCTGATGACTTTTTCTCATAGAAGAAACTCACCATCTTAGAATCTCTTGATGTCGCCATAAAATTTTACTCACCCCTTCAACATCAGTGATTCTCGTGTTACAAATTATCTTCTTCATTTTCTATGGATGTTAGGGAGTGCCAAGAGCTGTTAGATTTGGCTTGTCATTTGCCTTAACTATGCTGACATTCATTCAAACTTCTATACCACGTCTATTTCCAATTTTTGAGAAGGAATGATCAAAAGAATGCGAGTATTTTCTTTCTAGACAGCAAAAACCATGAATTATTTATATATCATAGCTCAATGTTCTGTTGGGTGTTAATCTCCGTTTTTATAGTTCATACCATATTTCAAAAATATGATTCTTTACTTTTGTTGGTCACATATTGATATTACTAAAAGCATTGAAGCTCCTTTATGTATATTGACGTCTGTAGAAAATGGTTTTCACTTTTATGCAATAACATTGTTACTCCGTGCTGCATTCATATGGCAATCTGACACCTCATTTTGGCCTTGCACCGTGTTTAGAGTTTGCTTTCAATCCAAGTATTATGAAATTGTGCACAGACCAGTTACCTTGTTGTGATGACATTTCATTCA
CTGATTTCATGACTTCTCACATTCAATTGATGGATTTGGTGTTGTAATGAACCACCAAGAAGTTAGTTGGTCTTGTTATTGGCTACTTATTGGGTTCTTTTTGTTCGTTAAGCTAATTGCCCAGGAAGTTATTTTGTAACAAATTCTACTAACTATAATGTGTTGAGCAAAATGGAAGTATTGAGTTGAAATACATGATAGTTTGCCCGGGTTTTCTTCTTAATCAGTTATTGAACTTCAATCTAAGTGTACATTTTCTTAAACCGTATCTCCATTAATAACTTTGATGTCTAGTCAGCTTTCGCGAGCTTCTCAGCGTTTTGCTCTTTACTATTATTTGAATTACTCTGTAGTTTCTGAGTGGATGAAATTGCTTCTTCCTTTGGCCGTGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAAGTAAGTTATTGAAGTCTGAAAGAGTTTCAAATTCGTGTTCTGTTTATTTTTTAGTGGTGGTGGTTAATTACTTATCTATAATCTCTGATTCTCTCATTTTCTTTTCTGAGTAGTGGATCCCCCTGGGGCTCAGCAAAATGTTATAGAAATAGCTAGCAGTCTGGGGTTGAATGTTTCGATCGCAGTTGCTTATGAGGTCTCAGTTCAAGAACAAGGTAATTGTTCTTTATTATTTTTAGTGGTTTGAACGGCTTATCTGTTCAGTAAGGGTTGATTTCTAAGAACCGTTACCTTGTCTTCCCAAATGGTGGCCCAAATAGACAAGTAAGTTCAATATTCCCCATCAGTACTCTTACATGCCAAATTTTGAACACTTCAGGTCATGAAAAAACACCGTTGTTTGTAATTTGTATACTCCGGTTTAGGATCCTCAACTGAAGTTCATTTCTCTTTAGCTTTATTGTGAAAATAATAATACTCCTCTCCAGCAACTATTATCAGTTGATCTAGTGGTTAAAAGGATCTTAAGGAATCAGAAAGTTATGGTTTCAATCAATGGTGGTCATCTACCTAGAAACTAATTTTCTATGGGTTTTCTCGACTACTAAATGTTTTAAGGTCAGATGGTTTATTCCGTGAGAATTTTCAAGTGCGCGTAAGCTGGCCTGGATACTCTCACACACACATATGCAAAATTCCCCTCCAGCATATTACCAAAAAACTTAATTTTTTTCTATGGGTTTTCTTGACAATCAAATGTTGTATAGTTAAATGGTTTGTTCCGTGAGAATAGTCAAGGTGCACGTAAGCTGGCCCTGTCACTCTCACACATACATGGCATATCACCAACAAACCTAAATTCTTTTTCCCGGTCGAAATTGTAACATGACTTCACAAAATATAAATTACAAACTGAAGCGGGTAGATTGCATATATGGCAGCCACTCAAGACACATCTATTTTATTTTCCTCCTTTTTGCTCCTTAATTGTATTTTACTTCTAAAGGTGGAGCTTATCGTTGGTTTGATCAATTAAAGGCATTAGCACATATCTGGGAGCAGCTCCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCGCGAGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTAGGAAGCGGTTTCTGCATGATGTTCACACTTTTGGAAGTGATTTGCCTTTATGTTTCTTTGTTCAACATACTTTTGTTACCTGTTTAAAGGTAGAATAGCAAGCACTTAAGAGAAGTTGCTTAGTTGGGAGACGGAAATGGATTTTAGGGGTCTCGGCTTCTTTTCTTTGGGCTTTAACCAAGGAGTTAAAGAAAATGTGTCCATTTGCAATGAAAGAAATGTTGTAAGAACTGGTTAAACAACATCAAAGATTTATATGTAGTTGGAACAAAGACAAAATATTAGAGAGATGGATTGTATTCACTGTTTTAGAGGAAATAGTGGAAGTTGGCGATGATCATATTGGTAGTCTGCTTCTATACTCCATACTATCATGTAATTACGATTTATCACAGCAATTTTTTTTTAACTCTTTACT
TTAGGGGATGTTTTGAAGCATTTATCAGACAATGACAACCTGACTTATATTTAGGTCTATGAACTTGGATGGAACTGGGACGATTTTCCACTGCTAGCACAGGGAATAATGGCTGGCCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGTCCACTTCTTCATACCCACCAAAATCCATTTTTTGTCTATGGAGCATGGAACTCGTAGTTCGGCTAAATATGATTCGAGTGATCATTAAATGTTATCTTCTGTAGGTGACAAGTATAGAAACATGCCTTTCCAACAGCTTTTGGATATATCATTGCCTTATGCTGAAGTTCAGTGTGATGGAAAAGTCTATGTAGCCAAGGCAGAAGAGACTGGAGGTCTTTTGAATTTCAGTACATGTGCTGAACAACTTTTGTATGAGATTGGTGATCCTTCGGCCTATATCACCCCTGATATGGTAAGATTATATGTTTTGTATCACTACTTTTGATGAATTGAATATTAAGAAAAAATACGAACTTCGACTTAGCTGGTTGTAAATGTAGGTGGTTGACTTCAGCAATGTTTCGTTTTGCTCTTTATCCAGCTCCAAGGTTTTTTGTTCCGGAGCAAAACCATCTATTCTAGGAGTGCCCGAAAAACTCTTGCAGTTGGCTCCAACGGTATTTATTTTTGTCAGGAATATCTGACATTTTAACTAATGGATGAATGAGTTTCGTTTCTTGTTTTAAAAATATGGATGAATGAGTTCATTTGCTTTGGAGTCTATGAGTGACAAATACCCAGAAGATCAAAAGAACCTACAAATATTTTAAAGTAAAAAAATTCCAACTTGGAAAACTGCGTATTTGGTAGGCAATCTAGATTTGTTTTATGTTTTCAAATTCACTGAGTTTTGTGAATGTCTTTGACGAAAGAGTTTTCATAAATTATTTCTGCCTCTTTGAATTATGTGATTGAGATGGTAGAAATTCTGAAACCAACATTCTTATGTTTTCATTATATCCACATTTGGAATTTCAAATTTTAAAATGTAGTATTATATTAAAAAAAGATAAAACCTCGGGTAAATATAGTATTTTAGGTGATAAAATCAGCTTATTTTGTATAAAATCAGCTTATTATGACAAAAATATTTAAATTAATTTATGATGGTTTTTATTCAACCTAATATTTACTAAAGCACATTTTGATCCTTTTATCCAATTTCTAAATCAAACACATCTATAAGCACGAAATGCAACTTTGTTTTCATTAATTGTTTTTTATTCCGGAGTGTCCGGACAAGTTTATACACTCCTTGACTATTCTTATGACACAAATTGTCCGACCTTATGATATTTGGTTGCTAAGAAAGTCCGTATGAAGTTTATTCTTAGATAGATGGTCACCATAGATAGAACCTAAAACAATTTTTTTTCCTTTTTCTTTTTTGAAACACCTAGAACCTCTTAGAACATGCTCAACTTCTTAGGTAATTCCAAAATGTTGTTTAGCTGGATAATATTTTAGACCTCAAATTCTTTTAACGGAAACCTAGCAGAAATAATTTTTATCCACTAATCTTTACTCATGAAAGAGCGGCATTTAAAGCCAGGATTGGAGGCAACCTTGCCTATTCTTCTTTCCGCTGCATTTATCAATGCAAAAGAGATATTCTCCTGGAGAAATTTCTTGTTTCTATTTTCTATATTGGCAAGCTAATTGTGGATGTTACATATCAGGACTGTGGATGGAAAGGGTATGGAGAGATATCCTATGGGGGACGTGAATGTGTTCTGCGTGCTAAAGCTGCAGAATATCTGGTAATTCACTTGGTTGCTTTCCCATTATATCACTGAAAAAAATTCTTTGACATTGTTTTTTTTTTTTTATATATAATTTTGATTTATTTGGGTTTCGTTAACCATTGTGGGTTGGTTTAGTTACCAAGAGGGTCTTGGAAACTAAGAAGGGATGAGTTCAATCTATAGTGACCACCTCCTACCTAG
AAATTGATTTTCTACTGGAGACATCGAAGGAGTTATGTCACATGTGAGTCTCGAATCCAGAATTTTGAGGGGGGCATATCCTGAAAGCCCAAGTCTTCAACCTCTGTGCCACCCCTTGGGGACCGATTGTCTAAACTGTTATTGGTGCTATGTTTAGTTATAAGAGTCTAAAATTAATTTGATTGTTTCCCATTTTCTCGTTTTTCCTTTCATAGGTTAGGTCATGGATGGAAGAAGGATTGCGTGGTGTTGATCAGCGTATAGTTTCTTATATAATTGGACTTGACAGCCTTAAAGCATCCAGCAATGGTAGCTATAATGTTGAAGATATTAGGTTACGCATGGATGGACTATTCGAGCAAAAGGAGGACGCACTCCTGTTTGTTAGGGAATTTACAGCTTTATACACAAATGGACCAGCTGGTGGTGGCGGTATCAGGTTGATACACTCTTATAATTGCCCTTTCTTTAACTGCGGCAGTCTGAAATTTGTTTATATTAATTATAGACCTCGCTCTTCTATTTGTTGAATAATGAATAATCATCATCGATTGGCCTAGAGATCAAAGAGGACTTGAAAATCTAAGAGATTACGAGTTCAATCCATTATAATTATCTACCTAATAAATAATTTCTTACGAGTTTAGGGTCAAATAGTATAGTCGAGGTGTTGATTGGGTCAGATAGATATGAAGCATAGAAACTTGTGGATTTGAGATATGAAGCATGCAACTTTCAGAAACTTGTGGCTTTGAGATATTATTATTGTGGTCCATATTTGTGCTAGAACCGCATTTTAGTGACAATGACGTGATTTTTGTCCTTTGCAGCACTGGGTACAAGAAGGAAATCGTGCTTGAAAAACAATTGGTACATCTCTCTTTCTCTGTCATTCTCTACATCTGGATTGACGGTCTTAGTTTTACACCTCATTTTTAACTAAGAGATTATTGGTTCATTCTATAGTAGCCTCTTGGTTTCTTTGACAACAAAATATTGTAGGGTTAAGAGGTTGTCCTAGAAAAATTGTCGATATGCATGTAAGTTTGGTCAACGTGTGTGTATATGTATATGCATATATAAATCTTCAGTTTTGATCCATGGATATTGCCTTGGGTAAATTATAAATTTGATGAAAGTTAGAATTCAGTCTTTGTGGTTGTAAAAATTAGAATTTAGTTCCTATGATTTGATAAAACATAAAAATAGTTTTTGTCATTTGTAATCTTGGTAAATTAACGTGTGAAAGTTAGTTATTCATAAGTCAATAACGATGGTGTGAGTCAATTGATTGATTGAACAAAAAAATAACATTTTTTACATGTGTACTTAATTCGTCAACAATTACGAGCGATAAATATTATTTTTAGTTTTTATAGAATCATGGGAACTAAATTTTAGTTTTAAAATATAGGGACTAGGTTTTACTTCTCCCGACTATAAAGATTAAATTCTAGATTTTTAAAGCATAGGGACTAAATGGATAATTTTTACTCTATTTATTTCTTCTTCTCTGGTGACAAAAGTGAATTTATTTGAACTTTCATCTCCATAATGAGATCTTCCCATTCTTCGTTGTTCAATTTCTTTTCCAATTGATTGCTTTTAAATTAGGTTTGGCGCGAACATGTTTTCTGGCGAACCGGTGTGAAGTACACTAAAGCAATAAAATTAGACAGTCAACCAACATATCTTCGAAAGGATTCGGAGGAGGCGGCATGTTCTTCGACAGTGGTAACATTGCCATGTCCGGTATCCGAGTATACAGACGAGCCTTGTACATTCTCCTCCACACCAGAAACTGCCCACTCCCCTATTCCATCTGGCCAGAATATTCCTCTTTACAATGTGGCACATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTCTCATTCCTCATTATCCTTCTGATCTCAAGCGGTTGAAGATGATCGTCACGCCCGAATGGGTAAAGCGAGTTCTCTCTTCGCTGCAAAACTCAACAACATTTCT
CGATTTGAATGCTGATGAGAAGAGGGACGATTGGATGAATGAACATGTGAAGGTCGAAATATACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTCGATGGTGGCGTAAATTGCTCGCGGAGAATTGATCGCCATGGAAAGACCATATCGGATCTCGTGTTGAACCAGCAAGTTGTTTTGCCACCATAGTTTGGTGGTTCATACAGAAGAGAGGAAGGTTCTTATAGACAATGAAGTATATGCCTTCTGCTTCTAAGGTTTTCTTCTTATGCAATTGAGTGTGGGTTTTCGTTGACTTCCAAAAGTTGGATTAAGTTGGATTAATCATTCAATGTATTATCAAAGTTCACTTTATTTAAACGTTTATCAGTT
mRNA sequence
TTTTGACAGAATTAACTATTCAGGTTATTTTGACAAATCTAATTTTCAAAGCGACATTTAGCCAAAGTTTATCATCTTCAATTACAAGAGAAGTGTACTGTTCGATCAAATTCTCATTCGAGGCAATTTCAGCGGCGCAAGCTTTGAACGGTGTGATGGAGAGGCAGGATAAAGATGACATCCAAGACTGCACAATTAAGCTGAGAGTAAATCCTCAAAAGCAGAGAGACAAGGTGTACATTGGCTGTGGAGCTGGATTTGGAGGCGATAGGCCAACTGCAGCTCTGAAGCTACTTCAGAGGGTCAACGACCTAAACTATCTTGTACTTGAATGCCTAGCCGAACGCACTCTTGCAGATCGCTTTCAAGTTATGTTGTCTGGTGGCGATGGTTATGATTCTAGGATTTCTGAGTGGATGAAATTGCTTCTTCCTTTGGCCGTGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGATCCCCCTGGGGCTCAGCAAAATGTTATAGAAATAGCTAGCAGTCTGGGGTTGAATGTTTCGATCGCAGTTGCTTATGAGGTCTCAGTTCAAGAACAAGGCATTAGCACATATCTGGGAGCAGCTCCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCGCGAGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTCTATGAACTTGGATGGAACTGGGACGATTTTCCACTGCTAGCACAGGGAATAATGGCTGGCCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGTGACAAGTATAGAAACATGCCTTTCCAACAGCTTTTGGATATATCATTGCCTTATGCTGAAGTTCAGTGTGATGGAAAAGTCTATGTAGCCAAGGCAGAAGAGACTGGAGGTCTTTTGAATTTCAGTACATGTGCTGAACAACTTTTGTATGAGATTGGTGATCCTTCGGCCTATATCACCCCTGATATGGTGGTTGACTTCAGCAATGTTTCGTTTTGCTCTTTATCCAGCTCCAAGGTTTTTTGTTCCGGAGCAAAACCATCTATTCTAGGAGTGCCCGAAAAACTCTTGCAGTTGGCTCCAACGGACTGTGGATGGAAAGGGTATGGAGAGATATCCTATGGGGGACGTGAATGTGTTCTGCGTGCTAAAGCTGCAGAATATCTGGTTAGGTCATGGATGGAAGAAGGATTGCGTGGTGTTGATCAGCGTATAGTTTCTTATATAATTGGACTTGACAGCCTTAAAGCATCCAGCAATGGTAGCTATAATGTTGAAGATATTAGGTTACGCATGGATGGACTATTCGAGCAAAAGGAGGACGCACTCCTGTTTGTTAGGGAATTTACAGCTTTATACACAAATGGACCAGCTGGTGGTGGCGGTATCAGCACTGGGTACAAGAAGGAAATCGTGCTTGAAAAACAATTGGTTTGGCGCGAACATGTTTTCTGGCGAACCGGTGTGAAGTACACTAAAGCAATAAAATTAGACAGTCAACCAACATATCTTCGAAAGGATTCGGAGGAGGCGGCATGTTCTTCGACAGTGGTAACATTGCCATGTCCGGTATCCGAGTATACAGACGAGCCTTGTACATTCTCCTCCACACCAGAAACTGCCCACTCCCCTATTCCATCTGGCCAGAATATTCCTCTTTACAATGTGGCACATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTCTCATTCCTCATTATCCTTCTGATCTCAAGCGGTTGAAGATGATCGTCACGCCCGAATGGGTAAAGCGAGTTCTCTCTTCGCTGCAAAACTCAACAACATTTCTCGATTTGAATGCTGATGAGAAGAGGGACGATTGGATGAATGAACATGTGAAGGTCGAAATATACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTCGATGGTGGCGTAAATTGCTCGCGGAGAATTGATCGCCAT
GGAAAGACCATATCGGATCTCGTGTTGAACCAGCAAGTTGTTTTGCCACCATAGTTTGGTGGTTCATACAGAAGAGAGGAAGGTTCTTATAGACAATGAAGTATATGCCTTCTGCTTCTAAGGTTTTCTTCTTATGCAATTGAGTGTGGGTTTTCGTTGACTTCCAAAAGTTGGATTAAGTTGGATTAATCATTCAATGTATTATCAAAGTTCACTTTATTTAAACGTTTATCAGTT
Coding sequence (CDS)
ATGGAGAGGCAGGATAAAGATGACATCCAAGACTGCACAATTAAGCTGAGAGTAAATCCTCAAAAGCAGAGAGACAAGGTGTACATTGGCTGTGGAGCTGGATTTGGAGGCGATAGGCCAACTGCAGCTCTGAAGCTACTTCAGAGGGTCAACGACCTAAACTATCTTGTACTTGAATGCCTAGCCGAACGCACTCTTGCAGATCGCTTTCAAGTTATGTTGTCTGGTGGCGATGGTTATGATTCTAGGATTTCTGAGTGGATGAAATTGCTTCTTCCTTTGGCCGTGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGATCCCCCTGGGGCTCAGCAAAATGTTATAGAAATAGCTAGCAGTCTGGGGTTGAATGTTTCGATCGCAGTTGCTTATGAGGTCTCAGTTCAAGAACAAGGCATTAGCACATATCTGGGAGCAGCTCCTATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCGCGAGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTCTATGAACTTGGATGGAACTGGGACGATTTTCCACTGCTAGCACAGGGAATAATGGCTGGCCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGTGACAAGTATAGAAACATGCCTTTCCAACAGCTTTTGGATATATCATTGCCTTATGCTGAAGTTCAGTGTGATGGAAAAGTCTATGTAGCCAAGGCAGAAGAGACTGGAGGTCTTTTGAATTTCAGTACATGTGCTGAACAACTTTTGTATGAGATTGGTGATCCTTCGGCCTATATCACCCCTGATATGGTGGTTGACTTCAGCAATGTTTCGTTTTGCTCTTTATCCAGCTCCAAGGTTTTTTGTTCCGGAGCAAAACCATCTATTCTAGGAGTGCCCGAAAAACTCTTGCAGTTGGCTCCAACGGACTGTGGATGGAAAGGGTATGGAGAGATATCCTATGGGGGACGTGAATGTGTTCTGCGTGCTAAAGCTGCAGAATATCTGGTTAGGTCATGGATGGAAGAAGGATTGCGTGGTGTTGATCAGCGTATAGTTTCTTATATAATTGGACTTGACAGCCTTAAAGCATCCAGCAATGGTAGCTATAATGTTGAAGATATTAGGTTACGCATGGATGGACTATTCGAGCAAAAGGAGGACGCACTCCTGTTTGTTAGGGAATTTACAGCTTTATACACAAATGGACCAGCTGGTGGTGGCGGTATCAGCACTGGGTACAAGAAGGAAATCGTGCTTGAAAAACAATTGGTTTGGCGCGAACATGTTTTCTGGCGAACCGGTGTGAAGTACACTAAAGCAATAAAATTAGACAGTCAACCAACATATCTTCGAAAGGATTCGGAGGAGGCGGCATGTTCTTCGACAGTGGTAACATTGCCATGTCCGGTATCCGAGTATACAGACGAGCCTTGTACATTCTCCTCCACACCAGAAACTGCCCACTCCCCTATTCCATCTGGCCAGAATATTCCTCTTTACAATGTGGCACATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTCTCATTCCTCATTATCCTTCTGATCTCAAGCGGTTGAAGATGATCGTCACGCCCGAATGGGTAAAGCGAGTTCTCTCTTCGCTGCAAAACTCAACAACATTTCTCGATTTGAATGCTGATGAGAAGAGGGACGATTGGATGAATGAACATGTGAAGGTCGAAATATACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTCGATGGTGGCGTAAATTGCTCGCGGAGAATTGATCGCCATGGAAAGACCATATCGGATCTCGTGTTGAACCAGCAAGTTGTTTTGCCACCATAG
Protein sequence
MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLECLAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIEIASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPMVYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAEVQCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKVFCSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRGVDQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGGGGISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTLPCPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDLKRLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
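The protein sequence above can be checked against the coding sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
# Standard genetic code, written in the compact TCAG ordering:
# the first base varies slowest and the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGAGAGGCAGGATAAAGATGAC"))  # MERQDKDD
# Last five codons of the CDS, including the TAG stop codon.
print(translate("GTTTTGCCACCATAG"))  # VLPP
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, provided the standard-code assumption holds.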
Homology
BLAST of Sed0028010 vs. NCBI nr
Match:
XP_022975428.1 (uncharacterized protein LOC111474742 isoform X3 [Cucurbita maxima] >XP_022975429.1 uncharacterized protein LOC111474742 isoform X3 [Cucurbita maxima])
HSP 1 Score: 1126.3 bits (2912), Expect = 0.0e+00
Identity = 548/632 (86.71%), Positives = 589/632 (93.20%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 3 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 62
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 120
LAERTLADR Q M SGGDGYDSRI++WMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE
Sbjct: 63 LAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 122
Query: 121 IASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 180
IASSLGL+VS+AVAYEVSV+E GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+APM
Sbjct: 123 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMAPM 182
Query: 181 VYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAEV 240
VYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPYAE+
Sbjct: 183 VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPYAEI 242
Query: 241 QCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKVF 300
CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSSKVF
Sbjct: 243 DCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVF 302
Query: 301 CSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRGV 360
CSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L GV
Sbjct: 303 CSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGV 362
Query: 361 DQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGGG 420
+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPAGGG
Sbjct: 363 NQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGG 422
Query: 421 GISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTLP 480
GISTGYKKEIVLEKQLV REHVFWR GVK TKA++LDS+PT LR+D +A +S VTLP
Sbjct: 423 GISTGYKKEIVLEKQLVGREHVFWRMGVKCTKAVELDSRPTDLREDPAKAR-TSPRVTLP 482
Query: 481 CPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDLK 540
C + Y D PC SSTPET HSPIPSGQ + LYNVAHSRAGDKGND+NFS++PHYPSD++
Sbjct: 483 CSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVVPHYPSDIE 542
Query: 541 RLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVRN 600
RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W+NEHVKVEIYEVKGIHSLNVVVRN
Sbjct: 543 RLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHVKVEIYEVKGIHSLNVVVRN 602
Query: 601 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 603 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 632
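The percentages in an HSP summary line are the reported counts divided by the alignment length. The sketch below recomputes them from such a line as a sanity check; the function name `hsp_fractions` and the regular expression are assumptions made for illustration, not part of any BLAST tool. The parser keys on the `N/M (P%)` pattern only and ignores the field labels.

```python
import re

# Matches "548/632 (86.71%)": matched count, alignment length, reported percent.
FRACTION = re.compile(r"(\d+)/(\d+) \(([\d.]+)%\)")


def hsp_fractions(line):
    """Return (count, length, reported_pct, recomputed_pct) per fraction found."""
    results = []
    for count, length, reported in FRACTION.findall(line):
        recomputed = round(100 * int(count) / int(length), 2)
        results.append((int(count), int(length), float(reported), recomputed))
    return results


summary = "Identity = 548/632 (86.71%), Positives = 589/632 (93.20%), Query Frame = 0"
for count, length, reported, recomputed in hsp_fractions(summary):
    print(f"{count}/{length}: reported {reported:.2f}%, recomputed {recomputed:.2f}%")
```

For the first hit, both recomputed values agree with the reported ones to two decimal places.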
BLAST of Sed0028010 vs. NCBI nr
Match:
XP_022975425.1 (uncharacterized protein LOC111474742 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1126.3 bits (2912), Expect = 0.0e+00
Identity = 548/632 (86.71%), Positives = 589/632 (93.20%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 21 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 80
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 120
LAERTLADR Q M SGGDGYDSRI++WMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE
Sbjct: 81 LAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 140
Query: 121 IASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 180
IASSLGL+VS+AVAYEVSV+E GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+APM
Sbjct: 141 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMAPM 200
Query: 181 VYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAEV 240
VYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPYAE+
Sbjct: 201 VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPYAEI 260
Query: 241 QCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKVF 300
CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSSKVF
Sbjct: 261 DCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVF 320
Query: 301 CSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRGV 360
CSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L GV
Sbjct: 321 CSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGV 380
Query: 361 DQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGGG 420
+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPAGGG
Sbjct: 381 NQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGG 440
Query: 421 GISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTLP 480
GISTGYKKEIVLEKQLV REHVFWR GVK TKA++LDS+PT LR+D +A +S VTLP
Sbjct: 441 GISTGYKKEIVLEKQLVGREHVFWRMGVKCTKAVELDSRPTDLREDPAKAR-TSPRVTLP 500
Query: 481 CPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDLK 540
C + Y D PC SSTPET HSPIPSGQ + LYNVAHSRAGDKGND+NFS++PHYPSD++
Sbjct: 501 CSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVVPHYPSDIE 560
Query: 541 RLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVRN 600
RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W+NEHVKVEIYEVKGIHSLNVVVRN
Sbjct: 561 RLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHVKVEIYEVKGIHSLNVVVRN 620
Query: 601 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 621 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 650
BLAST of Sed0028010 vs. NCBI nr
Match:
XP_022975426.1 (uncharacterized protein LOC111474742 isoform X2 [Cucurbita maxima])
HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 547/632 (86.55%), Positives = 586/632 (92.72%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 21 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 80
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 120
LAERTLADR Q M SGGDGYDSR WMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE
Sbjct: 81 LAERTLADRHQAMSSGGDGYDSR--NWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 140
Query: 121 IASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 180
IASSLGL+VS+AVAYEVSV+E GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+APM
Sbjct: 141 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMAPM 200
Query: 181 VYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAEV 240
VYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPYAE+
Sbjct: 201 VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPYAEI 260
Query: 241 QCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKVF 300
CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSSKVF
Sbjct: 261 DCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVF 320
Query: 301 CSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRGV 360
CSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L GV
Sbjct: 321 CSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGV 380
Query: 361 DQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGGG 420
+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPAGGG
Sbjct: 381 NQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGG 440
Query: 421 GISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTLP 480
GISTGYKKEIVLEKQLV REHVFWR GVK TKA++LDS+PT LR+D +A +S VTLP
Sbjct: 441 GISTGYKKEIVLEKQLVGREHVFWRMGVKCTKAVELDSRPTDLREDPAKAR-TSPRVTLP 500
Query: 481 CPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDLK 540
C + Y D PC SSTPET HSPIPSGQ + LYNVAHSRAGDKGND+NFS++PHYPSD++
Sbjct: 501 CSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVVPHYPSDIE 560
Query: 541 RLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVRN 600
RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W+NEHVKVEIYEVKGIHSLNVVVRN
Sbjct: 561 RLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHVKVEIYEVKGIHSLNVVVRN 620
Query: 601 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 621 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 648
BLAST of Sed0028010 vs. NCBI nr
Match:
XP_022975249.1 (uncharacterized protein LOC111474369 [Cucurbita maxima])
HSP 1 Score: 1118.2 bits (2891), Expect = 0.0e+00
Identity = 547/635 (86.14%), Positives = 588/635 (92.60%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 21 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 80
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAM---DPPGAQQN 120
LAERTLADR Q M SGGDGYDSRI++WMKLLLPLAVKRNICIITNMGAM PPGAQQN
Sbjct: 81 LAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMFRLHPPGAQQN 140
Query: 121 VIEIASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFL 180
VIEIASSLGL+VS+AVAYEVSV+E GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+
Sbjct: 141 VIEIASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFM 200
Query: 181 APMVYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPY 240
APMVYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPY
Sbjct: 201 APMVYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPY 260
Query: 241 AEVQCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSS 300
AE+ CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSS
Sbjct: 261 AEIDCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSS 320
Query: 301 KVFCSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGL 360
KVFCSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L
Sbjct: 321 KVFCSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVL 380
Query: 361 RGVDQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPA 420
GV+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPA
Sbjct: 381 YGVNQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPA 440
Query: 421 GGGGISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVV 480
GGGGISTGYKKEIVLEKQLV REHVFWR GVK TKA++LDS+PT LR+D +A +S V
Sbjct: 441 GGGGISTGYKKEIVLEKQLVGREHVFWRMGVKCTKAVELDSRPTDLREDPAKAR-TSPRV 500
Query: 481 TLPCPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPS 540
TLPC + Y D PC SSTPET HSPIPSGQ + LYNVAHSRAGDKGND+NFS++PHYPS
Sbjct: 501 TLPCSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVVPHYPS 560
Query: 541 DLKRLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVV 600
D++RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W+NEHVKVEIYEVKGIHSLNVV
Sbjct: 561 DIERLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHVKVEIYEVKGIHSLNVV 620
Query: 601 VRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
VRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 621 VRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 653
BLAST of Sed0028010 vs. NCBI nr
Match:
XP_023539933.1 (uncharacterized protein LOC111800461 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023539934.1 uncharacterized protein LOC111800461 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1116.7 bits (2887), Expect = 0.0e+00
Identity = 545/633 (86.10%), Positives = 588/633 (92.89%), Query Frame = 0
Query: 1 MERQ-DKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLE 60
MERQ + DD+ DCTIKLR+NP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLE
Sbjct: 3 MERQGEDDDVHDCTIKLRLNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLE 62
Query: 61 CLAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVI 120
CLAERTLADR Q M SGGDGYDSRI++WMKLLLPLAV+RNICIITNMGAMDPPGAQQNVI
Sbjct: 63 CLAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVERNICIITNMGAMDPPGAQQNVI 122
Query: 121 EIASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAP 180
EIASSLGL+VS+AVAYEVSV+E GISTYLGAAPIVECLEKYHPNVIITSRVADAALF+AP
Sbjct: 123 EIASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVECLEKYHPNVIITSRVADAALFMAP 182
Query: 181 MVYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAE 240
MVYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+M FQQLLDISLPYAE
Sbjct: 183 MVYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMHFQQLLDISLPYAE 242
Query: 241 VQCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKV 300
+ CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSSKV
Sbjct: 243 IDCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKV 302
Query: 301 FCSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRG 360
FCSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L G
Sbjct: 303 FCSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLNG 362
Query: 361 VDQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGG 420
V+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE LLFVREFTALYTNGPAGG
Sbjct: 363 VNQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHTLLFVREFTALYTNGPAGG 422
Query: 421 GGISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTL 480
GGISTGYKKEI+LEKQLV REHVFWRTGVK TKA++LDS+PT LR+D +A +S VTL
Sbjct: 423 GGISTGYKKEILLEKQLVGREHVFWRTGVKCTKAVELDSRPTDLREDPAKAR-TSPRVTL 482
Query: 481 PCPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDL 540
PCP+ Y D PC SS PET HSPIPSGQ + LYNVAHSRAGDKGND+NFS+IPHYPSD+
Sbjct: 483 PCPIFAYADNPCAGSSPPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVIPHYPSDI 542
Query: 541 KRLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVR 600
+RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W++EHVKVEIYEVKGIHSLNVVVR
Sbjct: 543 ERLKMIITPEWVKRVLSSLQNSSTFPDLDADKKRDEWIDEHVKVEIYEVKGIHSLNVVVR 602
Query: 601 NILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
NILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 603 NILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
BLAST of Sed0028010 vs. ExPASy TrEMBL
Match:
A0A6J1IJ63 (uncharacterized protein LOC111474742 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111474742 PE=4 SV=1)
HSP 1 Score: 1126.3 bits (2912), Expect = 0.0e+00
Identity = 548/632 (86.71%), Positives = 589/632 (93.20%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 3 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 62
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 120
LAERTLADR Q M SGGDGYDSRI++WMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE
Sbjct: 63 LAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 122
Query: 121 IASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 180
IASSLGL+VS+AVAYEVSV+E GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+APM
Sbjct: 123 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMAPM 182
Query: 181 VYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAEV 240
VYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPYAE+
Sbjct: 183 VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPYAEI 242
Query: 241 QCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKVF 300
CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSSKVF
Sbjct: 243 DCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVF 302
Query: 301 CSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRGV 360
CSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L GV
Sbjct: 303 CSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGV 362
Query: 361 DQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGGG 420
+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPAGGG
Sbjct: 363 NQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGG 422
Query: 421 GISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTLP 480
GISTGYKKEIVLEKQLV REHVFWR GVK TKA++LDS+PT LR+D +A +S VTLP
Sbjct: 423 GISTGYKKEIVLEKQLVGREHVFWRMGVKCTKAVELDSRPTDLREDPAKAR-TSPRVTLP 482
Query: 481 CPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDLK 540
C + Y D PC SSTPET HSPIPSGQ + LYNVAHSRAGDKGND+NFS++PHYPSD++
Sbjct: 483 CSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVVPHYPSDIE 542
Query: 541 RLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVRN 600
RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W+NEHVKVEIYEVKGIHSLNVVVRN
Sbjct: 543 RLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHVKVEIYEVKGIHSLNVVVRN 602
Query: 601 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 603 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 632
BLAST of Sed0028010 vs. ExPASy TrEMBL
Match:
A0A6J1IGN9 (uncharacterized protein LOC111474742 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111474742 PE=4 SV=1)
HSP 1 Score: 1126.3 bits (2912), Expect = 0.0e+00
Identity = 548/632 (86.71%), Positives = 589/632 (93.20%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 21 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 80
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 120
LAERTLADR Q M SGGDGYDSRI++WMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE
Sbjct: 81 LAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 140
Query: 121 IASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 180
IASSLGL+VS+AVAYEVSV+E GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+APM
Sbjct: 141 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMAPM 200
Query: 181 VYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAEV 240
VYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPYAE+
Sbjct: 201 VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPYAEI 260
Query: 241 QCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKVF 300
CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSSKVF
Sbjct: 261 DCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVF 320
Query: 301 CSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRGV 360
CSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L GV
Sbjct: 321 CSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGV 380
Query: 361 DQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGGG 420
+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPAGGG
Sbjct: 381 NQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGG 440
Query: 421 GISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTLP 480
GISTGYKKEIVLEKQLV REHVFWR GVK TKA++LDS+PT LR+D +A +S VTLP
Sbjct: 441 GISTGYKKEIVLEKQLVGREHVFWRMGVKCTKAVELDSRPTDLREDPAKAR-TSPRVTLP 500
Query: 481 CPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDLK 540
C + Y D PC SSTPET HSPIPSGQ + LYNVAHSRAGDKGND+NFS++PHYPSD++
Sbjct: 501 CSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVVPHYPSDIE 560
Query: 541 RLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVRN 600
RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W+NEHVKVEIYEVKGIHSLNVVVRN
Sbjct: 561 RLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHVKVEIYEVKGIHSLNVVVRN 620
Query: 601 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 621 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 650
BLAST of Sed0028010 vs. ExPASy TrEMBL
Match:
A0A6J1IE50 (uncharacterized protein LOC111474742 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111474742 PE=4 SV=1)
HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 547/632 (86.55%), Positives = 586/632 (92.72%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 21 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 80
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 120
LAERTLADR Q M SGGDGYDSR WMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE
Sbjct: 81 LAERTLADRHQAMSSGGDGYDSR--NWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 140
Query: 121 IASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 180
IASSLGL+VS+AVAYEVSV+E GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+APM
Sbjct: 141 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMAPM 200
Query: 181 VYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAEV 240
VYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPYAE+
Sbjct: 201 VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPYAEI 260
Query: 241 QCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKVF 300
CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSSKVF
Sbjct: 261 DCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVF 320
Query: 301 CSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRGV 360
CSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L GV
Sbjct: 321 CSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGV 380
Query: 361 DQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGGG 420
+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPAGGG
Sbjct: 381 NQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGG 440
Query: 421 GISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTLP 480
GISTGYKKEIVLEKQLV REHVFWR GVK TKA++LDS+PT LR+D +A +S VTLP
Sbjct: 441 GISTGYKKEIVLEKQLVGREHVFWRMGVKCTKAVELDSRPTDLREDPAKAR-TSPRVTLP 500
Query: 481 CPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDLK 540
C + Y D PC SSTPET HSPIPSGQ + LYNVAHSRAGDKGND+NFS++PHYPSD++
Sbjct: 501 CSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVVPHYPSDIE 560
Query: 541 RLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVRN 600
RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W+NEHVKVEIYEVKGIHSLNVVVRN
Sbjct: 561 RLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHVKVEIYEVKGIHSLNVVVRN 620
Query: 601 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 621 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 648
BLAST of Sed0028010 vs. ExPASy TrEMBL
Match:
A0A6J1IJX5 (uncharacterized protein LOC111474369 OS=Cucurbita maxima OX=3661 GN=LOC111474369 PE=4 SV=1)
HSP 1 Score: 1118.2 bits (2891), Expect = 0.0e+00
Identity = 547/635 (86.14%), Positives = 588/635 (92.60%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 21 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 80
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAM---DPPGAQQN 120
LAERTLADR Q M SGGDGYDSRI++WMKLLLPLAVKRNICIITNMGAM PPGAQQN
Sbjct: 81 LAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMFRLHPPGAQQN 140
Query: 121 VIEIASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFL 180
VIEIASSLGL+VS+AVAYEVSV+E GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+
Sbjct: 141 VIEIASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFM 200
Query: 181 APMVYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPY 240
APMVYELGWNWDDFP L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPY
Sbjct: 201 APMVYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPY 260
Query: 241 AEVQCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSS 300
AE+ CDGKVYVAKAEETGGLLNFSTCAEQLLYE+GDPSAYITPD+VVD SNVSFCS+SSS
Sbjct: 261 AEIDCDGKVYVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDLVVDLSNVSFCSISSS 320
Query: 301 KVFCSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGL 360
KVFCSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L
Sbjct: 321 KVFCSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVL 380
Query: 361 RGVDQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPA 420
GV+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPA
Sbjct: 381 YGVNQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPA 440
Query: 421 GGGGISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVV 480
GGGGISTGYKKEIVLEKQLV REHVFWR GVK TKA++LDS+PT LR+D +A +S V
Sbjct: 441 GGGGISTGYKKEIVLEKQLVGREHVFWRMGVKCTKAVELDSRPTDLREDPAKAR-TSPRV 500
Query: 481 TLPCPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPS 540
TLPC + Y D PC SSTPET HSPIPSGQ + LYNVAHSRAGDKGND+NFS++PHYPS
Sbjct: 501 TLPCSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVVPHYPS 560
Query: 541 DLKRLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVV 600
D++RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W+NEHVKVEIYEVKGIHSLNVV
Sbjct: 561 DIERLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHVKVEIYEVKGIHSLNVV 620
Query: 601 VRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
VRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 621 VRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 653
BLAST of Sed0028010 vs. ExPASy TrEMBL
Match:
A0A6J1FPM0 (uncharacterized protein LOC111447321 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111447321 PE=4 SV=1)
HSP 1 Score: 1110.5 bits (2871), Expect = 0.0e+00
Identity = 544/632 (86.08%), Positives = 582/632 (92.09%), Query Frame = 0
Query: 1 MERQDKDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLEC 60
MERQ +DD+ DCTIKLRVNP+K+RDKVYIGCGAGFGGDRPTAALKLLQRV DLNYLVLEC
Sbjct: 3 MERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVLEC 62
Query: 61 LAERTLADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 120
LAERTLADR Q M SGGDGYDSRI++WMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE
Sbjct: 63 LAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 122
Query: 121 IASSLGLNVSIAVAYEVSVQEQGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 180
IASSLGL+VS+AVAYEVSV+E GISTYLGAAPIVECLEKYHPNVIITSRVADAALF+APM
Sbjct: 123 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVECLEKYHPNVIITSRVADAALFMAPM 182
Query: 181 VYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLDISLPYAEV 240
VYELGWNWDDF L+QG +AGHLLECGCQLTGGYFMHPGDK+R+MPFQQLLDISLPYAE+
Sbjct: 183 VYELGWNWDDFLRLSQGTLAGHLLECGCQLTGGYFMHPGDKHRSMPFQQLLDISLPYAEI 242
Query: 241 QCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFCSLSSSKVF 300
CDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPD+VVD SNVSFCS+SSSKVF
Sbjct: 243 DCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDLVVDLSNVSFCSISSSKVF 302
Query: 301 CSGAKPSILGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRSWMEEGLRGV 360
CSGAKPSI VPEKLLQLAP DCGWKG+GEISYGGRECVLRAKAAEYLVRSWMEE L GV
Sbjct: 303 CSGAKPSIQVVPEKLLQLAPKDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLNGV 362
Query: 361 DQRIVSYIIGLDSLKASSNGSYNVEDIRLRMDGLFEQKEDALLFVREFTALYTNGPAGGG 420
+Q IVSYIIGLDSLKAS N S +VEDIRLRMDGLFE KE ALLFVREFTALYTNGPAGGG
Sbjct: 363 NQHIVSYIIGLDSLKASINSS-SVEDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGG 422
Query: 421 GISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSEEAACSSTVVTLP 480
GISTGYKKEIVLEKQLV REHVFW+TGVK TKA++LDS+PT LR+D +A S V
Sbjct: 423 GISTGYKKEIVLEKQLVGREHVFWQTGVKCTKAVELDSRPTDLREDPAKAQTSPRVFA-- 482
Query: 481 CPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLNFSLIPHYPSDLK 540
Y D PC SS PET HSPIPSGQ + LYNVAHSRAGDKGND+NFS+IPHYPSD++
Sbjct: 483 -----YADNPCADSSPPETGHSPIPSGQKVALYNVAHSRAGDKGNDMNFSVIPHYPSDIE 542
Query: 541 RLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEVKGIHSLNVVVRN 600
RLKMI+TPEWVKRVLSSLQNS+TF DL+AD+KRD+W++EHVKVEIYEVKGIHSLNVVVRN
Sbjct: 543 RLKMIITPEWVKRVLSSLQNSSTFSDLDADKKRDEWIDEHVKVEIYEVKGIHSLNVVVRN 602
Query: 601 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 633
ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP
Sbjct: 603 ILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 626
BLAST of Sed0028010 vs. TAIR 10
Match:
AT1G01770.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1446 (InterPro:IPR010839); Has 1597 Blast hits to 1509 proteins in 306 species: Archae - 4; Bacteria - 843; Metazoa - 22; Fungi - 131; Plants - 31; Viruses - 0; Other Eukaryotes - 566 (source: NCBI BLink). )
HSP 1 Score: 821.6 bits (2121), Expect = 4.3e-238
Identity = 417/642 (64.95%), Positives = 496/642 (77.26%), Query Frame = 0
Query: 6 KDDIQDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVNDLNYLVLECLAERT 65
K+ + DC I LR NP+++R+ VY+GCGAGFGGDRP AALKLLQRV +LNYLVLECLAERT
Sbjct: 7 KEILCDCVINLRENPKRRRETVYVGCGAGFGGDRPLAALKLLQRVEELNYLVLECLAERT 66
Query: 66 LADRFQVMLSGGDGYDSRISEWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIEIASSL 125
LADR+ M SGG GYD R+SEWM+LLLPLAV+R CIITNMGA+DP GAQ+ V+E+A L
Sbjct: 67 LADRWLSMASGGLGYDPRVSEWMQLLLPLAVERGTCIITNMGAIDPSGAQKKVLEVAGEL 126
Query: 126 GLNVSIAVAYEVSVQ-------------EQGISTYLGAAPIVECLEKYHPNVIITSRVAD 185
GL +S+AVA+EV + G STYLGAAPIVECLEKY PNVIITSRVAD
Sbjct: 127 GLTISVAVAHEVHFETGSGSSFGGQYCSAGGTSTYLGAAPIVECLEKYQPNVIITSRVAD 186
Query: 186 AALFLAPMVYELGWNWDDFPLLAQGIMAGHLLECGCQLTGGYFMHPGDKYRNMPFQQLLD 245
AALFLAPMVYELGWNW+D LLAQG +AGHLLECGCQLTGGYFMHPGD+YR+M F L D
Sbjct: 187 AALFLAPMVYELGWNWNDLELLAQGTLAGHLLECGCQLTGGYFMHPGDQYRDMAFPLLQD 246
Query: 246 ISLPYAEVQCDGKVYVAKAEETGGLLNFSTCAEQLLYEIGDPSAYITPDMVVDFSNVSFC 305
+SLPYAE+ DGKV V+K E +GG+LN STCAEQLLYEI DPSAYITPD+V+D VSF
Sbjct: 247 LSLPYAEIGYDGKVCVSKVEGSGGILNTSTCAEQLLYEIADPSAYITPDVVIDIRGVSFL 306
Query: 306 SLSSSKVFCSGAKPSI-LGVPEKLLQLAPTDCGWKGYGEISYGGRECVLRAKAAEYLVRS 365
LS KV CSGAKPS VPEKLL+L P +CGWKG+GEISYGG + RAKA+E+LVRS
Sbjct: 307 PLSDCKVQCSGAKPSSNTSVPEKLLRLIPKECGWKGWGEISYGGNGSIQRAKASEFLVRS 366
Query: 366 WMEEGLRGVDQRIVSYIIGLDSLKASSNGSYNVE---DIRLRMDGLFEQKEDALLFVREF 425
WMEE + GV+ I+SY+IG+DSLKA+SNG+ + + DIRLRMDGLF+ KE A+ +EF
Sbjct: 367 WMEETIPGVNHCILSYVIGVDSLKATSNGTESWQSCGDIRLRMDGLFKLKEHAVQLTKEF 426
Query: 426 TALYTNGPAGGGGISTGYKKEIVLEKQLVWREHVFWRTGVKYTKAIKLDSQPTYLRKDSE 485
TALYTNGPAGGGGISTG+K EIVLEK+LV RE V W+TG+++T S+P S
Sbjct: 427 TALYTNGPAGGGGISTGHKMEIVLEKRLVSRESVMWKTGLQHTNT----SEPETSEHHSP 486
Query: 486 EAACSSTVVTLPCPVSEYTDEPCTFSSTPETAHSPIPSGQNIPLYNVAHSRAGDKGNDLN 545
E +P E HSP PSGQ IPLY+VAHSRAGDKGND+N
Sbjct: 487 E--------KMPKLPKENPKNLTMRGYQSGFHHSPAPSGQKIPLYSVAHSRAGDKGNDIN 546
Query: 546 FSLIPHYPSDLKRLKMIVTPEWVKRVLSSLQNSTTFLDLNADEKRDDWMNEHVKVEIYEV 605
FS+IPHY D++RLK+I+TP+WVK V+S L ++++FL+L+A M+E+V VEIY+V
Sbjct: 547 FSIIPHYSPDVERLKLIITPQWVKHVMSVLLSTSSFLELDAKP-----MDENVSVEIYDV 606
Query: 606 KGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLVLNQQVVL 631
+GIH++NVVVRNILDGGVNCSRRIDRHGKTISDL+L QQVVL
Sbjct: 607 EGIHAMNVVVRNILDGGVNCSRRIDRHGKTISDLILCQQVVL 631
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022975428.1 | 0.0e+00 | 86.71 | uncharacterized protein LOC111474742 isoform X3 [Cucurbita maxima] >XP_022975429... | [more] |
XP_022975425.1 | 0.0e+00 | 86.71 | uncharacterized protein LOC111474742 isoform X1 [Cucurbita maxima] | [more] |
XP_022975426.1 | 0.0e+00 | 86.55 | uncharacterized protein LOC111474742 isoform X2 [Cucurbita maxima] | [more] |
XP_022975249.1 | 0.0e+00 | 86.14 | uncharacterized protein LOC111474369 [Cucurbita maxima] | [more] |
XP_023539933.1 | 0.0e+00 | 86.10 | uncharacterized protein LOC111800461 isoform X1 [Cucurbita pepo subsp. pepo] >XP... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1IJ63 | 0.0e+00 | 86.71 | uncharacterized protein LOC111474742 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1IGN9 | 0.0e+00 | 86.71 | uncharacterized protein LOC111474742 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1IE50 | 0.0e+00 | 86.55 | uncharacterized protein LOC111474742 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1IJX5 | 0.0e+00 | 86.14 | uncharacterized protein LOC111474369 OS=Cucurbita maxima OX=3661 GN=LOC111474369... | [more] |
A0A6J1FPM0 | 0.0e+00 | 86.08 | uncharacterized protein LOC111447321 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT1G01770.1 | 4.3e-238 | 64.95 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1446... | [more] |
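For downstream filtering, the pipe-delimited summary rows above can be read into records. The sketch below is one possible approach, not a format guaranteed by the database. `parse_hit_row` and its field names are hypothetical. It reads only the first four fields, so any trailing columns on a row are ignored, and it assumes the description itself contains no `|` character.

```python
def parse_hit_row(row):
    """Parse one 'Match Name | E-value | Identity | Description' row.

    Returns None for header rows and for lines with fewer than four fields.
    Only the first four pipe-separated fields are used.
    """
    fields = [f.strip() for f in row.split("|")]
    if len(fields) < 4 or fields[0] == "Match Name":
        return None
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),      # e.g. "4.3e-238" or "0.0e+00"
        "identity": float(identity),  # percent identity as printed
        "description": description,   # may be truncated with "..." in the source
    }
```

A header row such as `Match Name | E-value | Identity | Description` yields `None`, so empty tables produce no records.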