Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
AAGGGAAATTCTTGGACTTCAAAACAGAAATTTTCCCGGTTACCAGGCGGGTATAAAAGCGAAACAGAACGGCGAAAAAACGAACTAGAAAAGAAAACAAGAAACGAATCACTTTCCCTCCACTTGGTGCTCTACTCTCGCTCTCTTTTCCCTCTAAAAATCAAGCTTCAACTTCCTACTCTCCTCATCTCTAACCAGTATAGGTTCCCGCCATTTTTATCGAGAAGAAGATTTCGTACAATATTCTTTATGGTGCCGGACTCATCTCCACCGGTTGTTGACGACGGCGCTTGTGATCTCGGTTTCTTATCGTCCAAAGAACGCTCTCTTTCGAGGCGCAATCTCAAGCAGCATCAGGAGCAAGACAATGTGTCCTCGGATCGCTCTGTCTGCCGTTTTCGATCAAACCTCGACCGGCGCGATCGCTACGGGTGGTTTCCGTTCAGAAGGAGATCGTTCATCGTTTTGGCGTTCTTCGTTTTGTTCACGATGTTCATGTTTCAGTTGTTTCTGGAGAGTTCGATGACTTCGGTGTTCTTGAAAAGGAGCAAGAAAGCTTGGCCGCGTGAGGCAGAGTTGAAGCCCGGGAGGACACTTAAGTTCGTGCCGCAGAGGATTCCTCGGAAGTTTATTGAAGGTAATGAGGTTGATCGATTGCACTCGGAGGATCATGTTGGTTTCCGGAAACCGAGGCTTGCTCTGGTGAGTGGTTAAGTTTCTATTCGTTCTTACTTTCTCGAGTTATGGTTTAATTAGCCACTCGAACGAACTTATCTGACAATGAGTTAAGTTACGTCATCAAACATCCATTTCAATGCCTTTCTGCCTTAGAAGATGAACTCAAACCATTTTCACTGGTGCGCAATAATAGATTGCGTTGATTAGTTTTTTCGCTTTCTGAAATAAATTGTTTATACGAATTAGTGCAACACCTGCTTTCTCTCTAGAGGTCAGCAACCAGAAGTTGAACGTTCACACGACTAATGTACAGTGCACACTGTTTTAAAACTTCCAAGCAACTTTAGATTGAGCTCCATGGCATTACGGATGGTCTTGATTTTTGAAGTGACTAAGCCCGGAGTTACGACCACCTAAAGAAAAATAACGTGTATCTTTTCGAGGTAGTGATTATGCGTCACTGTAAAACTACAGACGAGTATACACTGTATTTTTACTGATAAAAATATTATATCACGTTTTTTTTGGCAGGAAAGTTGTGTAAAAAATGAAATCATGTTTTATGAATCTATAGTCATTACTTTAGACCTTTTAGGGATGAAAAAACTTTAGGGGAAGAACTTCGAATGGCAAGATAGATACAGTTGATGGAAATGAACAGTGGATTCCGATATGAGCCTACTCCACTATACCTACTGTGATCATATGCCAATAATATCATCAATTTGGTATTAGCATGTTTTGGCGTCTAAGAAATATTGTAAGCTGACGGGATATCATAGTGTATTGCTCTTGAATAATTTTTCACTGTCGACTTCATGCTGATGGTTAATTGCTTTCCCTACAGCGATCATTATTAAGTTTTCACGATCCGTAAGATTCTGTCATAGTGACCGAATATTCTTGGCTTTAGAGCTTAGTAGCCACTACCAATTTTCCTGTCACTCTGAGTTCTAAAGTTAAACTCCATTCTTTTCGATGCAGAATCTTGGTTACTTTTGAGAATTTGTTGTATTAATACTCAGCAACTCTTATTACTAGCTTTTACATGTTTATATATGCCGTGATGCACATAACAATGTGTTAGTGAATGTTGTATCATCATGTGTAGACTCGATAATGTACGAGAGTCCTAAAATGCACGGCTATTTGAGCAGCCACATCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCNTTTTATAAACGAACAAACTATTCCTAATTTTTTTGTTGCCTAAATTATCAGATATTGAGAAACATGGAGAAAGATTCACTATCCTTGTTCTTAATTACTGTAATGAAGAACATGAGGGAGCTTGGA
TATGTGTTTGAGGTGAGGTCTTTGACATTGCTTTTGTTACTCGTATTGCATAATTGATTTTTATTTGCTGAAATAGCCGTGTAAGAAGTCACACGATGCATGGTTTTCTAGGCATATTCTGTGAGATCGGGCATTGATGTGTCGTTGAAACCTTCATGCTTGAAGGGTAATATTATTGGTTAGTTTTTAAATGATTATATTCTTATGGAAGAGGCTCAGATTTTTGCAGTTGGCAATGGAGAAGCACGTCAAATGTGGCTGAAACTTGGTCGGGTTGTCCTTTTAAGCCCAAAGCAGTTTGGCCAGATCAATTGGTTACTGTAAGTGCATATATATCTATACACACGCATACATCGGTTGTGCATTCGTACATAAAAGTAGACTAATGCCGTTCTTGTTTCCTATCTGTATGAGCACAAAATGATCTCTGTTAGTAGTCAGAACTGCATGAATGGTTAACTTTTGTCATTGCAGTTTTGAAGGCATTATCGTCGATTCTTTTGAAGGGAAGGAGGCTATTACAAGGTTGGTTTATTGATACTATATCTTTTAAAAAATGTTAGTGATGGAGTTTTCTGGAGTCTGAGGCCACGGTAAGTCATATGTTTTATCTGCTTGTAGGTGAAAATAATCACTGCTCAACTTCAGAAAACTTTTTAAATAATGTGGAATAATTATATTTACTCTATTTTTCACTTTTTTAACAAAATATTATATTAATATGTAAAGATAATTACGAAATAGGAGAATAAGGAATCCTCCAATTTAAAATACAGTGAAGTAAAAGAAATACAGTTAACCTTGCACGTTATGAACTTCTTACTATTAATGTTTACGAACACAGTTATCTTGTTAATATTTTTTTCTTTTATTTCAAATTGTAAAAACTGATGCAATGAGAATTGCATTTTTCTGGAAGATGTTGAAATTAGACTACTATAGTATTAATGATAGCAAATTTGTTTGGAGAATGTTGATACAGAATTATGAATATCTAAATGCATTTTCCTGGTGAATGATTGGAGGGACCTTTCCTTTAGTACGGTTGATGTGAAGAGATTTGAGACTTTTTTCCCCTTTTCATGGTATATAACTAATGAGTTTTCATGTATGCAGCATTATGCAGGAACCTTTTTGTTCAATACCACTTATATGGATCATTCAGGATGATATCCTAGCCAAGCGTCTTAAAATGTACAAGGACAAGGGCTGGGAGAATCTTGTTTCTCATTGGAGAAGTACTTTTAGCAGAGCTAGTGTTATTGTGTTTCCCAATTTTGCTCTTCCTGTAAGTTCGTGAACTTGGCTGTTTATATTATGTGGCTTTTAGACGCATGGTTAGCCTCAACTTTGACCTTTTTCTTTCCATAAAAAAAGTAAAAAAGACATTGACGTTTCTCTTAGAAACTGCTCATCTTTTATTTCCTTTTATATCAGATGCTATATAGTGCGCTTGACACTGGAAACTTTCATGTGATCCACGGATCACCAGTGGACGTTTGGACTGCTGAAATTTATAAGAGCTCTCACTTCAAGTTTAAATTAGGAGAGAAACTTGGATTTGGTATAGAAGATTTCGTAGTTCTTGTGGTTGGAAATTCCTTCTATAATGAGCTATCACCGGAATATGCTGCGGCATTGTATCGCATGGGACCTCTACTAACAGAATTTGCAAGGAGGAAGAATCCTAGAGGGTCGTTTAAATTTGTTTTCTTGTATGGTAATTCCTCCGACGGATGCAATGATGCTCTGCAGGTAGTCTTTTGTTATGCTTGTCACATTTTTAGCTTTATTTGCGTGCCAATTTGCCTCGTAATGCTACTTTCTCCTTCTCAAATTATATAGTGTGTAATTATTGGATATGTGAAATTGAGAAAATATTGATTGAATTGAAGTAGGTTTTATTTTCCTAAAATTTTAGTATAACTAGCAAATGGGATTTTATCTTCTTCTGTTCCAACTCTTGCTGGGAAATTTAAATCAAAGTTCTTCACCTCTGT
ACGATTTTTGTACTAAGCTCGACAGAGAAATCCTATGAATAATTGAATCCACAAATTTTCTGTTTTAGTTGTTGCAATATTTCTTAAGTGAAAATTTATAAGGTGAATGTAACTTGAAGCGGAAGAATTAGATTACCTTAAGAAATTATGATTATAATGGTAATGGACTGATAGTTCCAAATAGCATATGGGATGAGCATTTGGAGAAAACCAACTTCAAATTTATATTCATGATGGGGATTGAGTTAGTTCTGGCTTGGCAAGTGTCTATGTGGATATATGCCGTATGTATGTTTTTTGCCTCCTCAATTATCTCAAATGGCTGAAAGAAAGGATGTTTATCCTGCCCTTGCTTAGGTGGTTGGGAGTTGTGACGTCCAATGGATGGGTCCGTCACTTAACCGAAGAATCTTGCATCTTCATAGCATAATAGTTCGTTTTTTTAAATTTAGAATTGAAGAGGAAGGAGGGTTTGACAATAGCTTTTATTAATTCTTGTTGTGGAGTTTGATCCTTTCCTTTGGCCTTGTTTGGTTTAGATGTATCATGTAAAGTTTCTTATACTATAGCAAGACAACGCTTATATCTAGTGAGAATAAATGCTATTGAGATAGGAGGATACAGAAGATGGTCTTAAAATAAAATACTACTACCCATGTAATTAGCAGCTGATACTTGCATGAAATATCTGAGACTGAGTAAGTTTGAGAACAACATGCTAGCTTTAGAAAAAGGCAGTAAACTTTTGGATCACTTCTTCTGTATCGTATGGCCACTACAACTTTCTTGTAATTTGATTTCACAGTGACTGGAAATGAACGACACATCAAACAAGGTGAAGTCATAAAATATTATTTTTGTGTTTAATATGATTAGAAAGTAGTGGAAATGGGAAGGTTGATTTTTGGAGTTCGGAGTTTCTGTGGATTCCTGGTTCCATTCTCCATGTATGTCACTTATATGCTTTAATGATAGGGATCTAAGAATGGTACAAGGATCTGCCTTAACTATCGGGGTTTTAAATTCCTTGTCTATAGGGGCTTGAACATTCTAATAATGGTATAGGTGTCACTTGTAATCTTTGGTTTTCCAAACTTTGCTAATGAAGAAAGTAAGTGGACAGGACGTAGGCTAGGTACAAAAGAATGGCGTTGTCTTGTTATATGCAATGAATACATGAATACCTTTAAGATAAGACTGCTGTGTATGAGTTGGAACGGTCGGTGGATGCACCACCTTGTTTCTTTTCTTGATTTATTTTCTGCATACTTTTAAACGAGACCAAGTGTTGCACTTGTCTGACTCTAGATAAGGATGAATGGCGAATGTTTCTATTTGTACTTTTTCTTATGCATATGGTCTGACTCTAGATAAGTACTTTTTCTGCAGTATTTCTTTGGATTAATGATTAATCGTTCAACTTATGTAGGAAACTGCTTCACGTTTAAGACTTCCTCGTGGTTATTTAAGCCATTATAGCTTTGATCAAGACGTAAATGGTATTTTGTACGTGGCCGATATTGTTCTTTATGAATCTTCCCAAAATGTACAAGATTTTCCTCCCTTGCTCATTCGGGCGATGACCTTTGGAGTCCCAATAGTGGCACCTGATATGCCCATTATTAACCAATATGTGAGTTCATTCTACTTCTCTCCCCTCTTCCTCCACTAGAAAGGAAAAAATATAATAATAGTGTCTTGAAACATGCAGGTTGTTGGGGGGGTCCATGGATTACTTGTTACTAAATTTAGTTCAGATGCTTTGATAAGAGCTCTCTCTAATCTTTGTTTTGATGGAAGGCTCGCTAGAATTGCTAACAATCTTGCTTCATCTGGAAAATTACTTGCCAAAAATCTTCTTGCTTTAGAGTGCATTACTGGATATGCAAATCTGTTGGAGGAAGTCCTCAATTTCCCATCAGACGTTATACTGCCAGGTTCCATTACCCAGCTTCCAGAAGCAGCGTGGGAATGGGATCTCTTTTGGAAGGAAATAATAC
AGGGATCTTCCAATGAGCAACGCGATAAGAATGTTAAAAAGAAATCTAGTGTGGTGATTAAACTCGAAGAGGAGTTCTCTGACCTTGTTAGTCCCTTGAACATCTCCAGTCCTAGAAAGGAGATTTTGGTGCATGATATCCCAACTCAACAAGATTGGGATATTATCGGGGAAATAGATCGTACTGAAGAATATGACAGAGTGGAAATGGAGGAGGTATGTTGTTATTTTTCCATTATGGCTTGTCTGCAATTTTTATGTCACTCACTTTCCTGAGAGCGAGGCCATATGTGATTTGTTATGTAGCTTCAAGAAAGAACAGAAAGAATATTAGGTTCATGGGAAAAAATATATCGTAGCGCACGGAAGTCCGAAAAGATGAAGCTTGAAAATGAGAATGACGAGGAAGATCTCGAAAGGGCAGGGCAAGCAGTATGCATTTATGAGATATACAGCGGACCTGGAGCTTGGTCATTTTTGCATCATGGTTCTATGTTTCGTGGACTTAGTCTTGTGAGCTTCTTCCATCCAAAACTATCTGCTGATATTTTTCTGTTTGTTTTATTTTGTCTCACTTTTCTTTGTTATTATCTTCTGTGTCATTCATTCGAAAATGCAAGTAAGATTTTAGTCCATCAATCGTGCACTCTTGCTAGATCCTCAGTCTTGAATGTTAAACATGTATAAGAGAAGATAGTTTTGCTACATATCCAGCCATCGCTTCCTTAAAAAAACATCAACTAGATTGTTGAATTTAAAACTTGTGAATTTTATGGTGTCAAACATTAAACTCGATCTTGGCCCATTTTTTAACTTTTTGAACTTGTTATTCTTAAATGTGGATGTATTCCTTGCTTCCAAGATACTCATTTTACATTCTTTATTGCTTGTAATTTACATTGTACCCTACAGTCTTCGAGAGCACTGAGGTTGGAATCAGATGATGTCAATGCTCCCAAGCGTCTTCCTCTTTTGGAAGACAGATTCTATCAGGACATTCTTTGTGAGATGGGAGGAATGTTTGCTGTTGCAAATGAGATTGATACAATTCACAGAAGACCTTGGATTGGTTTCCAATCGTGGCAAGCTGACGGTAGGAAGGTAATCCGTACTTGCTAACATTCTTTCTTGCTTAGTTGTACTATTAGGCTGTAAGGCTTCAAGAGAAATGTAAATGGTGGATAGAAATTTTTGGAGGTTCTAGTTTAGACAAGGAATACCTTTACCCCCTTTTCTTAACGGCCGCCCAATCTATTTGGTTTGCAAATGAAAGATTTTCATAACCATCTGGAATCTGGTAGTCAAAGATTTTGAGACTATATCTTCTTATGTAACAATCTTAGGCTTTATTAAACCACTGGCCTCCAGGTGTAATGAACGGGTTTTAAGCTCTATGCTATAATATATCTATGCTGCTAAGTAGATTTTTTTTAATGTATTTATTTATTTACTTCTATTCCTCTGATTCGTAATCAGCCTCTTCTGTTAAAAAGCAGGAGTCATTATCTAAAAAGGCTGGAAAGGTCTTGGAAGAAGCAATTCAGAATAATACTAGAGGGGAAGTTATTTACTTTTGGGCGTACATGGACGTGGATTCTGAAGTCACGGACAGCGCTGATGGTCCTTTTTGGCACACATGTGACATCTTCAATCGGGGACATTGCAGGTATATCAGTCATTCAACATATTCTTGTATGATAAGTTAGTTGTTCCATGTTCATAGGTCAAGTCTTCCATTGTTTTTTTAAAAACAAATAATTCTCATTAAAGGGTATGAAAGGTTTAAAAAGAAGAATTCCAAAGAAATCAGAGGAGCTTACAACAAAAGCATCATTCCAATTGGCATAAATGGATAATCTATTGACATTAACGAATGCATCCCTTTAGTCTTTTGATATTAATTGAAAGTTGACCCAGTACTTCGTATTTTATGATTGGAAATATAGAGAGAGTTTAATTGATTCTGCATCAACTTCCAAGTCGTGCGTGTTAACCAACTTT
ATCATCACTAATTAACATCAAATACGAAATGAAGAACTAAAGTATAATACATTAGAATTTTACATTAAACAGATGCAATTTGAACATGTTCTCACTTAAAATTACATCCATCCAGTTCTACGTTTAAAGATGCCTTTAGGCAGATGTATGGACTACATCCATCACATTCGGAAGCTCTTCCTCCAATGCCTAATGATGGCGGTCTCTGGTCTTATCTGCATAGCTGGGTGATGCCAACCCCTACATTTGTGGAGTTCATAATGTTTTCCCGGTAAGCACATTATATATATATCAGAAGTAGAACCCCATTTATTTCGCACATTCCCTCTATTTTTTTGTTCTTCTTCAAAATTTCCTCCCTCACCACTGTGGATGTAATTTTGGCTTGAATAGTGACTGAAATTTTCGTATGTGAGCTGCAGGATGTTTGTTGATTCCGTAGATGCCGTGAACAGAAAGCTTGACAATAGCAGCAAGTGTTTGCTGGCTTCCACTGGACTGGAGGTAAATATCCTACTTCTCTCCCTTGAGGAGATTACTTCTATGGTGAAATTAATTTCTTTATAATGTTCGAGTTAACGCAATTCTTCCCTAAATACTTCAAAATTTTTGGGAAAAACACTAATTAAATGGTCCCAAAACCACTCCCAATATGTGTTTGAAATGCCTTCTAAAAATATATTTTAAATAAAACACTTATTACATCAGCACCGCAAAAGAAAATTTATTAAGTGTTTCTCCTAAATGCACTTTAAGATTTTTTATTAGGTTCTTAAAATTTTTTAGTCTTCTGTTTTTGATGGGCAGAGAAGGCAGTGTTATTGCCGGCTGTTGGATATCCTGATAAACGTGTGGGCGTACCACAGTGGGCGGAGAATGGTTTATTTAACCCCACGTTCAGGCTCGCTAGTGGAGCAGCATCCCCTTGAAGAACGTCAGGACTTCATGTGGTCCAAATTCTTCAACATCACATTATTGAAAGCCATGGATGCAGACTTGGCCGAAGCTGCCGATGATGGCGATCACCCGAGAACCAAATGGTTATGGCCATTAACAGGAGACGTATTCTGGGAAGGGATGTATGCAAGGAAAAGCAAAGAAAGGCACAGGCACAGGCACAAAGTTGAAAAGAGGACAAAACCCCGACATAAAAAATCAGGCAACCGCCGTAATCATGAACACAAGCAAAAACCACTTGGAAAATAGCTGACAACAAACTAATAGTCTATTTGCAGCAAATGGTAAGTTTAACTCATTTATTTATTTTTGTCTTCTCATCGAAAATTGGGATTTTCTTATTGACTAGATATTTGGTTCTCTTCTTTCAAACGGTTAGCAGCAGATTGTAGATAAGATAGATCGAAATGGTGATGATGCTTTACGAGTACAGAAGACTATCGTCAATTCAAGTACGTCACTTTCTTTCAACCTCTTTTCCATAATACGTGGAGTCTAGCAATTAGCAATTAGATGATGTATATAATATCTGTAATCAACATTTTTTGCATTCAATTTAGAACTTAGCGAAATTGACGAAATGA
mRNA sequence
AAGGGAAATTCTTGGACTTCAAAACAGAAATTTTCCCGGTTACCAGGCGGGTATAAAAGCGAAACAGAACGGCGAAAAAACGAACTAGAAAAGAAAACAAGAAACGAATCACTTTCCCTCCACTTGGTGCTCTACTCTCGCTCTCTTTTCCCTCTAAAAATCAAGCTTCAACTTCCTACTCTCCTCATCTCTAACCAGTATAGGTTCCCGCCATTTTTATCGAGAAGAAGATTTCGTACAATATTCTTTATGGTGCCGGACTCATCTCCACCGGTTGTTGACGACGGCGCTTGTGATCTCGGTTTCTTATCGTCCAAAGAACGCTCTCTTTCGAGGCGCAATCTCAAGCAGCATCAGGAGCAAGACAATGTGTCCTCGGATCGCTCTGTCTGCCGTTTTCGATCAAACCTCGACCGGCGCGATCGCTACGGGTGGTTTCCGTTCAGAAGGAGATCGTTCATCGTTTTGGCGTTCTTCGTTTTGTTCACGATGTTCATGTTTCAGTTGTTTCTGGAGAGTTCGATGACTTCGGTGTTCTTGAAAAGGAGCAAGAAAGCTTGGCCGCGTGAGGCAGAGTTGAAGCCCGGGAGGACACTTAAGTTCGTGCCGCAGAGGATTCCTCGGAAGTTTATTGAAGGTAATGAGGTTGATCGATTGCACTCGGAGGATCATGTTGGTTTCCGGAAACCGAGGCTTGCTCTGATATTGAGAAACATGGAGAAAGATTCACTATCCTTGTTCTTAATTACTGTAATGAAGAACATGAGGGAGCTTGGATATGTGTTTGAGATTTTTGCAGTTGGCAATGGAGAAGCACGTCAAATGTGGCTGAAACTTGGTCGGGTTGTCCTTTTAAGCCCAAAGCAGTTTGGCCAGATCAATTGGTTACTTTTTGAAGGCATTATCGTCGATTCTTTTGAAGGGAAGGAGGCTATTACAAGCATTATGCAGGAACCTTTTTGTTCAATACCACTTATATGGATCATTCAGGATGATATCCTAGCCAAGCGTCTTAAAATGTACAAGGACAAGGGCTGGGAGAATCTTGTTTCTCATTGGAGAAGTACTTTTAGCAGAGCTAGTGTTATTGTGTTTCCCAATTTTGCTCTTCCTATGCTATATAGTGCGCTTGACACTGGAAACTTTCATGTGATCCACGGATCACCAGTGGACGTTTGGACTGCTGAAATTTATAAGAGCTCTCACTTCAAGTTTAAATTAGGAGAGAAACTTGGATTTGGTATAGAAGATTTCGTAGTTCTTGTGGTTGGAAATTCCTTCTATAATGAGCTATCACCGGAATATGCTGCGGCATTGTATCGCATGGGACCTCTACTAACAGAATTTGCAAGGAGGAAGAATCCTAGAGGGTCGTTTAAATTTGTTTTCTTGTATGGTAATTCCTCCGACGGATGCAATGATGCTCTGCAGAACTTAGCGAAATTGACGAAATGA
Coding sequence (CDS)
AAGGGAAATTCTTGGACTTCAAAACAGAAATTTTCCCGGTTACCAGGCGGGTATAAAAGCGAAACAGAACGGCGAAAAAACGAACTAGAAAAGAAAACAAGAAACGAATCACTTTCCCTCCACTTGGTGCTCTACTCTCGCTCTCTTTTCCCTCTAAAAATCAAGCTTCAACTTCCTACTCTCCTCATCTCTAACCAGTATAGGTTCCCGCCATTTTTATCGAGAAGAAGATTTCGTACAATATTCTTTATGGTGCCGGACTCATCTCCACCGGTTGTTGACGACGGCGCTTGTGATCTCGGTTTCTTATCGTCCAAAGAACGCTCTCTTTCGAGGCGCAATCTCAAGCAGCATCAGGAGCAAGACAATGTGTCCTCGGATCGCTCTGTCTGCCGTTTTCGATCAAACCTCGACCGGCGCGATCGCTACGGGTGGTTTCCGTTCAGAAGGAGATCGTTCATCGTTTTGGCGTTCTTCGTTTTGTTCACGATGTTCATGTTTCAGTTGTTTCTGGAGAGTTCGATGACTTCGGTGTTCTTGAAAAGGAGCAAGAAAGCTTGGCCGCGTGAGGCAGAGTTGAAGCCCGGGAGGACACTTAAGTTCGTGCCGCAGAGGATTCCTCGGAAGTTTATTGAAGGTAATGAGGTTGATCGATTGCACTCGGAGGATCATGTTGGTTTCCGGAAACCGAGGCTTGCTCTGATATTGAGAAACATGGAGAAAGATTCACTATCCTTGTTCTTAATTACTGTAATGAAGAACATGAGGGAGCTTGGATATGTGTTTGAGATTTTTGCAGTTGGCAATGGAGAAGCACGTCAAATGTGGCTGAAACTTGGTCGGGTTGTCCTTTTAAGCCCAAAGCAGTTTGGCCAGATCAATTGGTTACTTTTTGAAGGCATTATCGTCGATTCTTTTGAAGGGAAGGAGGCTATTACAAGCATTATGCAGGAACCTTTTTGTTCAATACCACTTATATGGATCATTCAGGATGATATCCTAGCCAAGCGTCTTAAAATGTACAAGGACAAGGGCTGGGAGAATCTTGTTTCTCATTGGAGAAGTACTTTTAGCAGAGCTAGTGTTATTGTGTTTCCCAATTTTGCTCTTCCTATGCTATATAGTGCGCTTGACACTGGAAACTTTCATGTGATCCACGGATCACCAGTGGACGTTTGGACTGCTGAAATTTATAAGAGCTCTCACTTCAAGTTTAAATTAGGAGAGAAACTTGGATTTGGTATAGAAGATTTCGTAGTTCTTGTGGTTGGAAATTCCTTCTATAATGAGCTATCACCGGAATATGCTGCGGCATTGTATCGCATGGGACCTCTACTAACAGAATTTGCAAGGAGGAAGAATCCTAGAGGGTCGTTTAAATTTGTTTTCTTGTATGGTAATTCCTCCGACGGATGCAATGATGCTCTGCAGAACTTAGCGAAATTGACGAAATGA
Protein sequence
KGNSWTSKQKFSRLPGGYKSETERRKNELEKKTRNESLSLHLVLYSRSLFPLKIKLQLPTLLISNQYRFPPFLSRRRFRTIFFMVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRYGWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFEIFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLAKLTK
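The protein sequence above can be cross-checked against the listed CDS by translating it in frame 0 with the standard genetic code. The Python sketch below is illustrative only: the function name `translate` and the compact codon table are assumptions of this example, not part of the database's own pipeline. The listed CDS begins with AAG (K) rather than ATG, so the sketch translates from the first base and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0 up to the first stop codon.

    Codons containing ambiguous bases (for example N) are rendered as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last eight codons of the CDS listed above.
print(translate("AAGGGAAATTCTTGGACTTCA"))     # KGNSWTS
print(translate("AACTTAGCGAAATTGACGAAATGA"))  # NLAKLTK
```

Both fragments agree with the start (KGNSWTS...) and end (...NLAKLTK) of the protein sequence shown above.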
Homology
BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match:
XP_023551126.1 (uncharacterized protein LOC111809035 [Cucurbita pepo subsp. pepo] >XP_023551134.1 uncharacterized protein LOC111809035 [Cucurbita pepo subsp. pepo] >XP_023551142.1 uncharacterized protein LOC111809035 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 798 bits (2060), Expect = 4.46e-280
Identity = 395/397 (99.50%), Positives = 395/397 (99.50%), Query Frame = 0
Query: 84 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY
Sbjct: 1 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 60
Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP
Sbjct: 61 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 120
Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE
Sbjct: 121 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 180
Query: 264 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 323
IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI
Sbjct: 181 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 240
Query: 324 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 383
PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH
Sbjct: 241 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 300
Query: 384 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 443
VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG
Sbjct: 301 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 360
Query: 444 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQ A
Sbjct: 361 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQETA 397
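The percentages on each HSP summary line are the identical (or positive-scoring) positions divided by the alignment length. As a minimal sketch, assuming the summary lines keep the layout shown in this report, they can be parsed and re-derived as follows; `parse_hsp` and the regular expression are hypothetical helpers written for this example, not part of BLAST or of the database.

```python
import re

# Matches e.g. "Identity = 395/397 (99.50%), Positives = 395/397 (99.50%)".
# "Pos\w*" tolerates spelling variants of the second label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute the percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, length, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned": int(length),
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


hsp = parse_hsp("Identity = 395/397 (99.50%), Positives = 395/397 (99.50%), Query Frame = 0")
print(hsp["identical"], hsp["aligned"], hsp["identity_pct"])  # 395 397 99.5
```

Here the recomputed value 100 × 395 / 397 rounds to 99.50, which agrees with the percentage reported for the first hit.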
BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match:
KAG7031994.1 (hypothetical protein SDJN02_06036, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 776 bits (2003), Expect = 1.75e-271
Identity = 385/397 (96.98%), Positives = 391/397 (98.49%), Query Frame = 0
Query: 84 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
MVPDSSPPV DDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSV RFRSNLDRRDR+
Sbjct: 1 MVPDSSPPVDDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVSRFRSNLDRRDRH 60
Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
GWFPFRRRSFIVLAFFVLFT+FMFQLFLESSMTSVFLKRSKKAWPREAELK GRTLKFVP
Sbjct: 61 GWFPFRRRSFIVLAFFVLFTLFMFQLFLESSMTSVFLKRSKKAWPREAELKSGRTLKFVP 120
Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNM+ELGYVFE
Sbjct: 121 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMKELGYVFE 180
Query: 264 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 323
IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI
Sbjct: 181 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 240
Query: 324 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 383
PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH
Sbjct: 241 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 300
Query: 384 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 443
VIHGSPVDVWTAEIYKSSHFKFKLG+KLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG
Sbjct: 301 VIHGSPVDVWTAEIYKSSHFKFKLGQKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 360
Query: 444 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
PLLT+FARRKNPRGSFKFVFL GNSS+GCNDALQ A
Sbjct: 361 PLLTKFARRKNPRGSFKFVFLCGNSSNGCNDALQETA 397
BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match:
XP_022956546.1 (uncharacterized protein LOC111458257 [Cucurbita moschata] >XP_022956547.1 uncharacterized protein LOC111458257 [Cucurbita moschata])
HSP 1 Score: 775 bits (2000), Expect = 4.99e-271
Identity = 385/397 (96.98%), Positives = 390/397 (98.24%), Query Frame = 0
Query: 84 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
MVPDSSP V DDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSV RFRSNLDRRDR+
Sbjct: 1 MVPDSSPHVDDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVSRFRSNLDRRDRH 60
Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
GWFPFRRRSFIVLAFFVLFT+FMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP
Sbjct: 61 GWFPFRRRSFIVLAFFVLFTLFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 120
Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNM+ELGYVFE
Sbjct: 121 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMKELGYVFE 180
Query: 264 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 323
IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI
Sbjct: 181 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 240
Query: 324 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 383
PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH
Sbjct: 241 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 300
Query: 384 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 443
VIHGSPVDVWTAEIYKSSHFKFKLG+KLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG
Sbjct: 301 VIHGSPVDVWTAEIYKSSHFKFKLGQKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 360
Query: 444 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
PLLT+FARRKNPRGSFKFVFL GNSS GCNDALQ A
Sbjct: 361 PLLTKFARRKNPRGSFKFVFLCGNSSHGCNDALQETA 397
BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match:
KAG6601199.1 (hypothetical protein SDJN03_06432, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 761 bits (1966), Expect = 1.11e-265
Identity = 382/412 (92.72%), Positives = 390/412 (94.66%), Query Frame = 0
Query: 84 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
MVPDSSPPV DDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSV RFRSNLDRRDR+
Sbjct: 1 MVPDSSPPVDDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVSRFRSNLDRRDRH 60
Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
GWFPFRRRSFIVLAFFVLFT+FMFQLFLESSMTSVFLKRSKKAWPREAELK GRTLKFVP
Sbjct: 61 GWFPFRRRSFIVLAFFVLFTLFMFQLFLESSMTSVFLKRSKKAWPREAELKSGRTLKFVP 120
Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNM+ELGYVFE
Sbjct: 121 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMKELGYVFE 180
Query: 264 IFAV---------------GNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEG 323
++V GNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEG
Sbjct: 181 AYSVRSSIDVSLKPSCLKVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEG 240
Query: 324 KEAITSIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNF 383
KEAITSIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNF
Sbjct: 241 KEAITSIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNF 300
Query: 384 ALPMLYSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFY 443
ALPMLYSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLG+KLGFGIEDFVVLVVGNSFY
Sbjct: 301 ALPMLYSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGQKLGFGIEDFVVLVVGNSFY 360
Query: 444 NELSPEYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
NELSPEYAAALYRMGPLLT+FARRKNPRGSFKFVFL GNSS+GCNDALQ A
Sbjct: 361 NELSPEYAAALYRMGPLLTKFARRKNPRGSFKFVFLCGNSSNGCNDALQETA 412
BLAST of Cp4.1LG01g14540 vs. NCBI nr
Match:
XP_022993256.1 (uncharacterized protein LOC111489326 [Cucurbita maxima] >XP_022993265.1 uncharacterized protein LOC111489326 [Cucurbita maxima])
HSP 1 Score: 751 bits (1938), Expect = 1.18e-261
Identity = 375/397 (94.46%), Positives = 383/397 (96.47%), Query Frame = 0
Query: 84 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
MVPDSSPPV DDGACDLGFLSSKERSLSRRNLKQHQEQ+NVSSDRSV R RSNLDRRDR+
Sbjct: 1 MVPDSSPPVDDDGACDLGFLSSKERSLSRRNLKQHQEQENVSSDRSVSRLRSNLDRRDRH 60
Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
GWF FRRRSF +LAFFVLFT+FM QLFLESSMTSVFLKRSKKA REAELKPGRTLKFVP
Sbjct: 61 GWFSFRRRSFFILAFFVLFTLFMVQLFLESSMTSVFLKRSKKASSREAELKPGRTLKFVP 120
Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
QRIPRKFIEGNEVDRLH EDHVGFRKPRLALILRNMEKDSLSLFLITVMKNM+ELGYVFE
Sbjct: 121 QRIPRKFIEGNEVDRLHLEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMKELGYVFE 180
Query: 264 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 323
IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI
Sbjct: 181 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 240
Query: 324 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 383
PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH
Sbjct: 241 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 300
Query: 384 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 443
VIHGSPVDVWTAEIYKSSHFK KLGEKLGFGIEDFVVLVVGNSFYNELSP+YAAALYRMG
Sbjct: 301 VIHGSPVDVWTAEIYKSSHFKLKLGEKLGFGIEDFVVLVVGNSFYNELSPDYAAALYRMG 360
Query: 444 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
PLLT+FARRKN RGSFKFVFL GNSS+GCNDALQ A
Sbjct: 361 PLLTKFARRKNRRGSFKFVFLCGNSSNGCNDALQETA 397
BLAST of Cp4.1LG01g14540 vs. ExPASy TrEMBL
Match:
A0A6J1GWM9 (uncharacterized protein LOC111458257 OS=Cucurbita moschata OX=3662 GN=LOC111458257 PE=4 SV=1)
HSP 1 Score: 775 bits (2000), Expect = 2.41e-271
Identity = 385/397 (96.98%), Positives = 390/397 (98.24%), Query Frame = 0
Query: 84 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
MVPDSSP V DDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSV RFRSNLDRRDR+
Sbjct: 1 MVPDSSPHVDDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVSRFRSNLDRRDRH 60
Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
GWFPFRRRSFIVLAFFVLFT+FMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP
Sbjct: 61 GWFPFRRRSFIVLAFFVLFTLFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 120
Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNM+ELGYVFE
Sbjct: 121 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMKELGYVFE 180
Query: 264 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 323
IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI
Sbjct: 181 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 240
Query: 324 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 383
PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH
Sbjct: 241 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 300
Query: 384 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 443
VIHGSPVDVWTAEIYKSSHFKFKLG+KLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG
Sbjct: 301 VIHGSPVDVWTAEIYKSSHFKFKLGQKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 360
Query: 444 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
PLLT+FARRKNPRGSFKFVFL GNSS GCNDALQ A
Sbjct: 361 PLLTKFARRKNPRGSFKFVFLCGNSSHGCNDALQETA 397
BLAST of Cp4.1LG01g14540 vs. ExPASy TrEMBL
Match:
A0A6J1JVU1 (uncharacterized protein LOC111489326 OS=Cucurbita maxima OX=3661 GN=LOC111489326 PE=4 SV=1)
HSP 1 Score: 751 bits (1938), Expect = 5.71e-262
Identity = 375/397 (94.46%), Positives = 383/397 (96.47%), Query Frame = 0
Query: 84 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
MVPDSSPPV DDGACDLGFLSSKERSLSRRNLKQHQEQ+NVSSDRSV R RSNLDRRDR+
Sbjct: 1 MVPDSSPPVDDDGACDLGFLSSKERSLSRRNLKQHQEQENVSSDRSVSRLRSNLDRRDRH 60
Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
GWF FRRRSF +LAFFVLFT+FM QLFLESSMTSVFLKRSKKA REAELKPGRTLKFVP
Sbjct: 61 GWFSFRRRSFFILAFFVLFTLFMVQLFLESSMTSVFLKRSKKASSREAELKPGRTLKFVP 120
Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
QRIPRKFIEGNEVDRLH EDHVGFRKPRLALILRNMEKDSLSLFLITVMKNM+ELGYVFE
Sbjct: 121 QRIPRKFIEGNEVDRLHLEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMKELGYVFE 180
Query: 264 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 323
IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI
Sbjct: 181 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 240
Query: 324 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 383
PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH
Sbjct: 241 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 300
Query: 384 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 443
VIHGSPVDVWTAEIYKSSHFK KLGEKLGFGIEDFVVLVVGNSFYNELSP+YAAALYRMG
Sbjct: 301 VIHGSPVDVWTAEIYKSSHFKLKLGEKLGFGIEDFVVLVVGNSFYNELSPDYAAALYRMG 360
Query: 444 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
PLLT+FARRKN RGSFKFVFL GNSS+GCNDALQ A
Sbjct: 361 PLLTKFARRKNRRGSFKFVFLCGNSSNGCNDALQETA 397
BLAST of Cp4.1LG01g14540 vs. ExPASy TrEMBL
Match:
A0A5D3CBN1 (UDP-glycosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G002250 PE=4 SV=1)
HSP 1 Score: 612 bits (1578), Expect = 1.08e-207
Identity = 307/397 (77.33%), Positives = 339/397 (85.39%), Query Frame = 0
Query: 84 MVPDSSPPVVDDGACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRY 143
M+ +S PP DDG +GFLS +ERSLS+RNLKQHQEQDNVSSDR V R RSNL R D
Sbjct: 1 MMQESFPPSDDDGDGGIGFLSYRERSLSKRNLKQHQEQDNVSSDRPVTRSRSNLGRSDTR 60
Query: 144 GWFPFRRRSFIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVP 203
WF F RRS A F L +F+ +LES MTSVFLKRS+KAW R+AELK G TLKF P
Sbjct: 61 RWFAFSRRSIFAFAGFSLLLLFVVTFYLESLMTSVFLKRSEKAWSRDAELKLGMTLKFAP 120
Query: 204 QRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFE 263
QRIPRKFIEGNEVDRLHS++ GFRKPRLALILR+MEKDS SLFLITVMKNM+ELGY FE
Sbjct: 121 QRIPRKFIEGNEVDRLHSDNRFGFRKPRLALILRSMEKDSQSLFLITVMKNMKELGYAFE 180
Query: 264 IFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSI 323
IFAV NGEARQMW +LGR+VLLSPKQFGQI+WLLFEGIIVDSFEGKEAITSIM EPFCS+
Sbjct: 181 IFAVANGEARQMWQELGRLVLLSPKQFGQIDWLLFEGIIVDSFEGKEAITSIMVEPFCSV 240
Query: 324 PLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFH 383
PLIWIIQDDIL+KRL MYKD+GWENLVSHWRSTFSRASV+VFPNFALPM YSALDTGNFH
Sbjct: 241 PLIWIIQDDILSKRLNMYKDRGWENLVSHWRSTFSRASVVVFPNFALPMFYSALDTGNFH 300
Query: 384 VIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAAALYRMG 443
VI GSPVDVW+AEIYK +HFK++LG+KLGF +ED VVLVVG+SFYNELS EYA AL RMG
Sbjct: 301 VIQGSPVDVWSAEIYKKTHFKYELGKKLGFDVEDIVVLVVGSSFYNELSSEYAVALNRMG 360
Query: 444 PLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
P+LT+ R KNP SFKFVFL GNS++GCNDALQ A
Sbjct: 361 PVLTKLPR-KNPEVSFKFVFLCGNSTNGCNDALQETA 396
BLAST of Cp4.1LG01g14540 vs. ExPASy TrEMBL
Match:
A0A1S4DWD8 (uncharacterized protein LOC103489564 OS=Cucumis melo OX=3656 GN=LOC103489564 PE=4 SV=1)
HSP 1 Score: 488 bits (1255), Expect = 1.28e-160
Identity = 236/283 (83.39%), Positives = 258/283 (91.17%), Query Frame = 0
Query: 198 TLKFVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRE 257
TLKF PQRIPRKFIEGNEVDRLHS++ GFRKPRLALILR+MEKDS SLFLITVMKNM+E
Sbjct: 2 TLKFAPQRIPRKFIEGNEVDRLHSDNRFGFRKPRLALILRSMEKDSQSLFLITVMKNMKE 61
Query: 258 LGYVFEIFAVGNGEARQMWLKLGRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQ 317
LGY FEIFAV NGEARQMW +LGR+VLLSPKQFGQI+WLLFEGIIVDSFEGKEAITSIM
Sbjct: 62 LGYAFEIFAVANGEARQMWQELGRLVLLSPKQFGQIDWLLFEGIIVDSFEGKEAITSIMV 121
Query: 318 EPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSAL 377
EPFCS+PLIWIIQDDIL+KRL MYKD+GWENLVSHWRSTFSRASV+VFPNFALPM YSAL
Sbjct: 122 EPFCSVPLIWIIQDDILSKRLNMYKDRGWENLVSHWRSTFSRASVVVFPNFALPMFYSAL 181
Query: 378 DTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSFYNELSPEYAA 437
DTGNFHVI GSPVDVW+AEIYK +HFK++LG+KLGF +ED VVLVVG+SFYNELS EYA
Sbjct: 182 DTGNFHVIQGSPVDVWSAEIYKKTHFKYELGKKLGFDVEDIVVLVVGSSFYNELSSEYAV 241
Query: 438 ALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
AL RMGP+LT+ R KNP SFKFVFL GNS++GCNDALQ A
Sbjct: 242 ALNRMGPVLTKLPR-KNPEVSFKFVFLCGNSTNGCNDALQETA 283
BLAST of Cp4.1LG01g14540 vs. ExPASy TrEMBL
Match:
A0A6P3ZSX7 (uncharacterized protein LOC107413250 OS=Ziziphus jujuba OX=326968 GN=LOC107413250 PE=4 SV=1)
HSP 1 Score: 393 bits (1010), Expect = 5.97e-123
Identity = 206/402 (51.24%), Positives = 287/402 (71.39%), Query Frame = 0
Query: 88 SSPP-VVDDGACDLGFLSSKERSLSRRNLK--QHQEQDNVSSDRSVCRFRSNLDRRDRYG 147
SSPP ++DD DLGF S ++R RRN Q++ + + DR R+RS+ R +R G
Sbjct: 8 SSPPGILDDNGNDLGFHSIRDRFRFRRNSNPSQNRGRGRIFPDRLSSRYRSHHGRFNRKG 67
Query: 148 W---FPFRRRSFIVLAFFVLFTMF-MFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLK 207
+ FPF+ + + L + +F M + L+SS+T VF + S++ LK G TL+
Sbjct: 68 FLLLFPFKGKLALYLVIMLALVLFAMASMVLQSSITLVFRQGSERGRLFRYGLKFGSTLR 127
Query: 208 FVPQRIPRKFIEGNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGY 267
FVP RI R+ +EG VDR ++ +G R PRLALIL +M KD+ SL L+TV+KN+++LGY
Sbjct: 128 FVPGRISRRIMEGGGVDRFRNQARIGVRPPRLALILGHMTKDAQSLMLVTVIKNIKKLGY 187
Query: 268 VFEIFAVGNGEARQMWLKLG-RVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEP 327
V +IFAV NG A MW ++G ++ +L P+ FG I+W +F+GI+VDSFE K A++S+MQEP
Sbjct: 188 VLKIFAVQNGNAHSMWEQVGGQISILDPEHFGHIDWTIFDGIVVDSFEAKAALSSLMQEP 247
Query: 328 FCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDT 387
F SIPLIWIIQ+D LAKRL +Y++ GW++L+SHW++ RA++IVFP+F LPMLYS LDT
Sbjct: 248 FSSIPLIWIIQEDTLAKRLPVYEEMGWKHLISHWKNALGRANLIVFPDFTLPMLYSVLDT 307
Query: 388 GNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAA 447
GNF V+ GSPVD+W AE Y +H K +L GF ED +VLVVG+S F++ELS +YA A
Sbjct: 308 GNFFVVPGSPVDIWAAESYSKTHSKIQLRNDSGFSEEDLLVLVVGSSLFFDELSWDYAVA 367
Query: 448 LYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLA 480
++ +GPLLT++A+RK+P GSFKFVFL GNS+DG +DALQ +A
Sbjct: 368 MHAIGPLLTKYAKRKDPGGSFKFVFLCGNSTDGHDDALQEVA 409
BLAST of Cp4.1LG01g14540 vs. TAIR 10
Match:
AT5G04480.1 (UDP-Glycosyltransferase superfamily protein)
HSP 1 Score: 332.0 bits (850), Expect = 7.9e-91
Identity = 184/390 (47.18%), Positives = 251/390 (64.36%), Query Frame = 0
Query: 96 GACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRR--DRYGWFP-FRRRS 155
G D F S ++R +RN +++ + DR R R + R +R G + R
Sbjct: 29 GNGDTSFHSIRDRLRLKRNSSDRRDRSHSGLDRPSLRTRPHHIGRSLNRKGLLSLLKPRG 88
Query: 156 FIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVPQRIPRKFIE 215
+L F V FT+ F + S+ + + K +++ G TLK+VP I R IE
Sbjct: 89 TCLLYFLVAFTVCAFVMSSLLLQNSITWQGNVKGGQVRSQIGLGSTLKYVPGGIARTLIE 148
Query: 216 GNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFEIFAVGNGEA 275
G +D L S +G R PRLAL+L NM+KD +L L+TVMKN+++LGYVF++FAV NGEA
Sbjct: 149 GKGLDPLRSAVRIGVRPPRLALVLGNMKKDPRTLMLVTVMKNLQKLGYVFKVFAVENGEA 208
Query: 276 RQMWLKL-GRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSIPLIWIIQD 335
R +W +L G V +L +Q G +W +FEG+I DS E KEAI+S+MQEPF S+PLIWI+ +
Sbjct: 209 RSLWEQLAGHVKVLVSEQLGHADWTIFEGVIADSLEAKEAISSLMQEPFRSVPLIWIVHE 268
Query: 336 DILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFHVIHGSPVD 395
DILA RL +Y+ G +L+SHWRS F+RA V+VFP F LPML+S LD GNF VI S VD
Sbjct: 269 DILANRLPVYQRMGQNSLISHWRSAFARADVVVFPQFTLPMLHSVLDDGNFVVIPESVVD 328
Query: 396 VWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAALYRMGPLLTEFA 455
VW AE Y +H K L E FG +D ++LV+G+S FY+E S + A A++ +GPLLT +
Sbjct: 329 VWAAESYSETHTKQNLREINEFGEDDVIILVLGSSFFYDEFSWDNAVAMHMLGPLLTRYG 388
Query: 456 RRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
RRK+ GSFKFVFLYGNS+ G +DA+Q +A
Sbjct: 389 RRKDTSGSFKFVFLYGNSTKGQSDAVQEVA 418
BLAST of Cp4.1LG01g14540 vs. TAIR 10
Match:
AT5G04480.2 (UDP-Glycosyltransferase superfamily protein)
HSP 1 Score: 298.9 bits (764), Expect = 7.4e-81
Identity = 174/390 (44.62%), Positives = 236/390 (60.51%), Query Frame = 0
Query: 96 GACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRR--DRYGWFP-FRRRS 155
G D F S ++R +RN +++ + DR R R + R +R G + R
Sbjct: 29 GNGDTSFHSIRDRLRLKRNSSDRRDRSHSGLDRPSLRTRPHHIGRSLNRKGLLSLLKPRG 88
Query: 156 FIVLAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKPGRTLKFVPQRIPRKFIE 215
+L F V FT+ F + S+ + + K +++ G TLK+VP I R IE
Sbjct: 89 TCLLYFLVAFTVCAFVMSSLLLQNSITWQGNVKGGQVRSQIGLGSTLKYVPGGIARTLIE 148
Query: 216 GNEVDRLHSEDHVGFRKPRLALILRNMEKDSLSLFLITVMKNMRELGYVFEIFAVGNGEA 275
G +D L S +G R PRLAL+L NM+KD +L L +FAV NGEA
Sbjct: 149 GKGLDPLRSAVRIGVRPPRLALVLGNMKKDPRTLML---------------VFAVENGEA 208
Query: 276 RQMWLKL-GRVVLLSPKQFGQINWLLFEGIIVDSFEGKEAITSIMQEPFCSIPLIWIIQD 335
R +W +L G V +L +Q G +W +FEG+I DS E KEAI+S+MQEPF S+PLIWI+ +
Sbjct: 209 RSLWEQLAGHVKVLVSEQLGHADWTIFEGVIADSLEAKEAISSLMQEPFRSVPLIWIVHE 268
Query: 336 DILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPMLYSALDTGNFHVIHGSPVD 395
DILA RL +Y+ G +L+SHWRS F+RA V+VFP F LPML+S LD GNF VI S VD
Sbjct: 269 DILANRLPVYQRMGQNSLISHWRSAFARADVVVFPQFTLPMLHSVLDDGNFVVIPESVVD 328
Query: 396 VWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNS-FYNELSPEYAAALYRMGPLLTEFA 455
VW AE Y +H K L E FG +D ++LV+G+S FY+E S + A A++ +GPLLT +
Sbjct: 329 VWAAESYSETHTKQNLREINEFGEDDVIILVLGSSFFYDEFSWDNAVAMHMLGPLLTRYG 388
Query: 456 RRKNPRGSFKFVFLYGNSSDGCNDALQNLA 481
RRK+ GSFKFVFLYGNS+ G +DA+Q +A
Sbjct: 389 RRKDTSGSFKFVFLYGNSTKGQSDAVQEVA 403
BLAST of Cp4.1LG01g14540 vs. TAIR 10
Match:
AT4G01210.1 (glycosyl transferase family 1 protein)
HSP 1 Score: 166.8 bits (421), Expect = 4.4e-41
Identity = 119/409 (29.10%), Positives = 200/409 (48.90%), Query Frame = 0
Query: 96 GACDLGFLSSKERSLSRRNLKQHQEQDNVSSDRSVCRFRSNLDRRDRYGWFPFRRRSFIV 155
G+ + G + ++ R +Q Q+Q + R RS L R F + I+
Sbjct: 2 GSLESGIPTKRDNGGVRGGRQQQQQQQ--QQQFFLQRNRSRLSRFFLLKSFNYLLWISII 61
Query: 156 LAFFVLFTMFMFQLFLESSMTSVFLKRSKKAWPREAELKP-------------GRTLKFV 215
FF F +FQ+FL + + +S K W + L P G ++
Sbjct: 62 CVFF--FFAVLFQMFL----PGLVIDKSDKPWISKEILPPDLVGFREKGFLDFGDDVRIE 121
Query: 216 PQRIPRKFIEGNEVDRLHSE------DHVGFRKPRLALILRNMEKDSLSLFLITVMKNMR 275
P ++ KF S GFRKP+LAL+ ++ D + ++++ K ++
Sbjct: 122 PTKLLMKFQRDAHGFNFTSSSLNTTLQRFGFRKPKLALVFGDLLADPEQVLMVSLSKALQ 181
Query: 276 ELGYVFEIFAVGNGEARQMWLKLG-RVVLLSPKQFGQ--INWLLFEGIIVDSFEGKEAIT 335
E+GY E++++ +G +W K+G V +L P Q I+WL ++GIIV+S + T
Sbjct: 182 EVGYAIEVYSLEDGPVNSIWQKMGVPVTILKPNQESSCVIDWLSYDGIIVNSLRARSMFT 241
Query: 336 SIMQEPFCSIPLIWIIQDDILAKRLKMYKDKGWENLVSHWRSTFSRASVIVFPNFALPML 395
MQEPF S+PLIW+I ++ LA R + Y G L++ W+ FSRASV+VF N+ LP+L
Sbjct: 242 CFMQEPFKSLPLIWVINEETLAVRSRQYNSTGQTELLTDWKKIFSRASVVVFHNYLLPIL 301
Query: 396 YSALDTGNFHVIHGSPVDVWTAEIYKSSHFKFKLGEKLGFGIEDFVVLVVGNSF-YNELS 455
Y+ D GNF+VI GSP E+ K+ + +F + +D V+ +VG+ F Y
Sbjct: 302 YTEFDAGNFYVIPGSP-----EEVCKAKNLEFPPQK------DDVVISIVGSQFLYKGQW 361
Query: 456 PEYAAALYRMGPLLTEFARRKNPRGSFKFVFLYGNSSDGCNDALQNLAK 482
E+A L + PL + ++ K + L G ++ + A++ +++
Sbjct: 362 LEHALLLQALRPLFSG-NYLESDNSHLKIIVLGGETASNYSVAIETISQ 390
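The two statistics lines that open each HSP above (bit score, raw score, Expect, then the identity and positives counts) follow a fixed pattern, so they can be read programmatically. The following is a minimal sketch, assuming the line layout shown in this report; the function names `parse_hsp_header` and `percent` are hypothetical and do not belong to any BLAST toolkit.

```python
import re

# Patterns for the two statistics lines that open each HSP in the report.
# Assumption: the layout matches the lines shown above; other BLAST output
# formats (tabular, XML, JSON) need a different parser.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# "Posi?tives" also tolerates a dropped "i" in the label.
COUNT_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_header(score_line, stats_line):
    """Return the HSP statistics from its two header lines as a dict."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line: %r" % score_line)
    hsp = {
        "bits": float(m.group(1)),       # normalised bit score
        "raw_score": int(m.group(2)),    # raw alignment score
        "expect": float(m.group(3)),     # E-value
    }
    for label, num, den, pct in COUNT_RE.findall(stats_line):
        key = "identity" if label == "Identity" else "positives"
        # (matching columns, alignment length, reported percentage)
        hsp[key] = (int(num), int(den), float(pct))
    return hsp


def percent(num, den):
    """Recompute a percentage from its counts, rounded as in the report."""
    return round(100.0 * num / den, 2)


if __name__ == "__main__":
    hsp = parse_hsp_header(
        "HSP 1 Score: 332.0 bits (850), Expect = 7.9e-91",
        "Identity = 184/390 (47.18%), Positives = 251/390 (64.36%), Query Frame = 0",
    )
    print(hsp)
    print(percent(*hsp["identity"][:2]))
```

Recomputing the percentage from the counts is a quick consistency check: for the first HSP, 184 identical columns over an alignment length of 390 gives 47.18%, which agrees with the reported value.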
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_023551126.1 | 4.46e-280 | 99.50 | uncharacterized protein LOC111809035 [Cucurbita pepo subsp. pepo] >XP_023551134....
KAG7031994.1 | 1.75e-271 | 96.98 | hypothetical protein SDJN02_06036, partial [Cucurbita argyrosperma subsp. argyro...
XP_022956546.1 | 4.99e-271 | 96.98 | uncharacterized protein LOC111458257 [Cucurbita moschata] >XP_022956547.1 unchar...
KAG6601199.1 | 1.11e-265 | 92.72 | hypothetical protein SDJN03_06432, partial [Cucurbita argyrosperma subsp. sorori...
XP_022993256.1 | 1.18e-261 | 94.46 | uncharacterized protein LOC111489326 [Cucurbita maxima] >XP_022993265.1 uncharac...
Match Name | E-value | Identity | Description
A0A6J1GWM9 | 2.41e-271 | 96.98 | uncharacterized protein LOC111458257 OS=Cucurbita moschata OX=3662 GN=LOC1114582...
A0A6J1JVU1 | 5.71e-262 | 94.46 | uncharacterized protein LOC111489326 OS=Cucurbita maxima OX=3661 GN=LOC111489326...
A0A5D3CBN1 | 1.08e-207 | 77.33 | UDP-glycosyltransferase family protein OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A1S4DWD8 | 1.28e-160 | 83.39 | uncharacterized protein LOC103489564 OS=Cucumis melo OX=3656 GN=LOC103489564 PE=...
A0A6P3ZSX7 | 5.97e-123 | 51.24 | uncharacterized protein LOC107413250 OS=Ziziphus jujuba OX=326968 GN=LOC10741325...