Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCGAGAACCTCTTCGAAGGCCTTCCACCGCCGATTTCAGCGACAAACCCTTCGCCGGAGACCCAATTACCAGATGCAGCCACCAAGCAGAGTCGGCCCTCTGATCCTTCTCCCAATCCCGCCGCTTCTTCTTCTTCTTCTTCTCCGGCGCCGGTCATCAAGAGCGCCCTCAAGCGCCCCAAGACTTCGCAAGAATCGAACCCAGCAGGTCGTCTGTTCTCCTCCTAAATCGCTTCGATTTGTTATTGTTCATTCTTGGTTTTGAGTTTTTTGAGGGCTTGTTTGATTTTGTAGTTTTTTTTTTTTTTTAAACTCTTTTTCTTTTTCTTTTTCTTTTTCTTTTTCTCGGTGTTGTGCTCTTTTTGGGCCCCTTTTGAGCTTCAATTTGCCTTTACTAGTCAATTTATTTGAACTCGATTCTTCTTTTGGGATGTTTGATTCTCGGATCTCGCTTTTAATTATTTGAGGTACAAGAATGAAGGATGAAAGGTGTGTTGCAAGGATTTCTTAGTTCATTTGGTCTTGAACATCGTTCCTTTGTTTAAATTATAGGTTAAATAAAATTTTAGCCTGCAGGTTATATCTATTAAGTCCATAAAGTTTAAAAAGTTCTACTTTGTTGAATTTTGAATTCTGATTCTGTATACCCCCTGTGATTAACACCTGGATAATTATTAATAAGTATTAATATTGATGCGATATGACTATTCTATTGGACACCTTTAGGTTGATTTGATAGGTAGAAGAATGACAAAAGAGAGTAGAGAACTAAAACAACGTAGAGTGGTATAGTGTTAGGTACCTAAGCCGTTGCTTATACTCAGAAGGACCCAACAACATGGAAATCGGGGGAGAGTACTCTCCCTGTGCTATGGATAATCAACTTCTCAAACAGAGCAAGAGTTTATCAAACCCTAATGAAAGAAATAGAGCGAACGACAATAAACAAAATGAACAAGAACGACCCTGCACTAAAACCCTCATAAGCCCTGCACTAGTCCCCTCCTCCCCTCCATACATAACCAACTTTTATGTGCTGCTACCCAACTGTTAAGGGTGACGTTTGTTTCTTATGCCTCCTCTCATATAAATGTTTAATAGGAGGTCTTACAATACCCTCGAGGTTGAAACTCACCTTGTCTGTACCTAGTTGGTTCTTCATGAATTCCACCTATTCCCATGTGGCCCCACTATCCTCATGGAGTTTTCATTTGACTAGACACTCCTCTTTCTTTTTCTCTTCATTCCACCTTATTCCCTGTAATCCTTCATATGTTGTTGTTAGCTTGAAACCTGTAAAAGATGGGGGATGAGATTGAACTATTGCCCTGTGTCCAATGGCTTTCTTTAGCTAATAATAGTCCCCCAGTTACTAAGGTGTGCACCAGAAGCAACAAGAGAGAGAAAAGGGGTGAGGAGAATGATGTAATATGTGAGGAGTGAAAGTTGTGCTTACATGGGTGTAAATGTATGATTATGTGGTAATGGGAATTCGATTAGTCGGAGAAAAGCCTATTTAGAGGTAACGAGAGGGAGGGGAGTTGGTTAACTTTTCATCTTTTTAGCTTATTGTTTTGTAGTTCTTTTGGGTGTCGAGGAGAAGGGGGAGATCTTGAACTCCCTTGGCTTTGCCTAAACAGTGTGTTAGAAATTTGGTGTTGGCAATATTTTTTAAGGTTGTCAACATTTGTAAGGGTGTAATGGGTTGGGTTGGGTTGGGTTGAGGATGTTTTTTGGACTAACCTGAAAATTAAGGTTGGTTGGGTTGGCAACCCGAAAGACCCGAATAAAGTCTACAACCCAACCCAACCTAACCCTTAAAATTCGAGTTGGGTTGGGTTGGGTTGTTGGGTTATTACTTATTTTTTCTTTTTAATTAAAAATATGTAAATTTATCTACAACACATGACTAATAACTAAAATCTCATAAAATTAAAATGTTAAATACCAAATATTTATGATATTTATTGTAAACTTTAAACAAAAATAAGTATCATAACAATTTTAAAAGAAAATATTAAAAATTCACAAATTAGTCAATATAAAGGAATCAAACTTTAAACAAAGAGGAATATATACACACACACACATATTATTAATATATATAGAATTTGGGTTGGGTTAGGTTGACCCAAATTGTTTTTTAGCCAACCCGCGATCCAACCTAACCCAACCCTTACATTTTGGGTTGGGTAGTCTGGGTTATTCGGGTTGTCGGGTCATTTGAACACCCCTAAACATTATTTGTCCCTTGAGATTTCTTGTCAGTTGTTCCTTGAGATGTCTTGCAAGTTGTCCCATGGATTGCTGATTCTGTTGGATATATTTTCCTTATTTGGAAGAATCTACATATTATTCTTGAAGATATTCTCCTTCTTTGGGCTTTGCAGTTTTTTTGTTTTTGTATATATATATATATATATATTTTTTTTGGGATAGAAACAACTACTTTCATTGAGGGGAAAAAAAAGAAAGAATCCAAAGGCATACAAAAAAATCAAGCCCACAAAAACCCCACTACAGAAAGGGTTCCAACTAAGTAAGATATTGCCTAGGGAATAATCACAGAAAGTCTTCAAAACCGAAGCTTAAAAGACATGAAACCTCACCAAATACCAAACCTCACTAAGGTCCCTGTCCACACCATGAACACTCTATTGTTTCTCTTCCCCCAAAGATCCCATAATAGTGCACACACCCCCGTAAACCATAGAAAACGACCTTTTCCTTTGAAGGACGGATGGAGGAGGAACTCCCCATCCCTATGCCGAGCAAGCATAATATCATAACTCCTGAAAGAAACCTCCATCTCACAAACTGTCACTCCTAAGTCCTAGAAAAGGTGATCCAGGTTTTCCTCCACCTTTCGATAAAGGATAAAACAAAAAGGACTCATCAATGAAGGCATCTTTCTCAATAGCCTATCCATCGCATTCACAAGTCCAAGCAGAACTTACCAAGAAAAGAACTTTCTTTGGGATCTTAATCCTCCATAAAACATCAAAGACCGACTCAACAGCCAAGGAGGGATCCAATAAAATCCTAAAGAATGACTTACTAGAAAAGCCCTCCGAAGGGTTAGGACTCTAAACACGAACATCCCTTCTCCCAATCCTAAAGTCAAAGCCCTCAACCAAGAAAAGAAGAGAGGTCACTCTCGTCGATTCCCTATTGGACAACACCCAATGGAACCTCAAGGAGAAGCAACCAAACCAAAAAGTCTGACACGAAGTGATTTTTCTGGGAGGACAAATGATATAACCAAGGAAACGCAAAGGAGAGGAGTTGATCCCCCATCCAATGGTCTTCCTAGAAATACATTACCTTGCCATGCCCACGACACAAAAAAACAAGATGGGAAGAGGAGGGAAGCTCTAACCCCCCTAATTCACAGGTTTCCCCACAGCTTCCCAGCTAACCAAATGTAACCCTTTCCCTTCCTCAACCCCTTCTCAGAGGAAGTCTCTCATGAGTTTCCCAATGCTCTTGTAAATCTAGACAAAAGCACGGAAAAAGAGAAAAGTAATAAATGGGGATACCACTCAGCACCGATTTGATTGGTGTCAATCTCCCTACTTTGGAGAAAAGCTTTTCTTCCCTGATGCTAACCTCTTCCTTATCTTCTCCATGGGAGCATCTAAAAAAGAGAGAGCTCTCTGATTACCTCCCAATGGGAGGCCAAGATAGGAAGAGGGAAAGGAGCCAACCTTGCAACCAACCCGATTCACCCATCTAATAAGCTTGTCCTACTCACAATTTATATTTGTTGTGCATATTTAAGAGATTGTTGTATTCTTTTGAGGGTTGGTTGCTTTGAAGAATTATTGTGTTGTGTGTCTTGTGTGTTTGTCGTGAACCACAAGTGTGTCAATGTTAGTGCTTGTGAAGCAAGAAGCATTGTAGATCTTGAGGTGTTTCTCTAGATCATCTTGAAGCTTTATTCGTTACTTGATAAGAGATTCAAACTTAGGGGGAACCTACATTCATCAAGTGAACGTTCATCAAGTGGAGTGATCTTTATTTAAAGTTGGAAAAATAGAAGTTGAAGGGGGAGTCTAAGTGATTATTCACTAATATTCTCCTACTTAGGGGAGCCTAAGTTTCATCGAGAGAGTCTTATACTTAGGGGGAGTGTAAGTGATCATTCACCAAGTTAGGATATTCGTATGTTACTATAATTGCTGTTTGTTTATAGATAAGTACAATGTTGTAATTGCTTGACTTTATATTAGTGAAAATATCTTTCCATGAGCACATTGTCCCCCGGACGTAGGTGGTATTGCGCCGAATTGGGTTACCAAACTCTGTGTTTTGTTCTTGCTTTATTCTCTGTGTTTATGTTTGATGTTCTTGGAGTTGTTGTAATATTGTGTGTCAAATAACCCTTCTTTCACAAACTAATCCTGGGTAATGGTAAAAAAGGAGACATAGCTCAATAAATGGCTGAGAGGTCATTGGTTCAATCCATGGTGGCCACCTACCTAGGAACTAATATCTTACGAGTTTCCTTGACACCCAAATGTTGTAAGGTAAAGCGAGTTGTCCCGTGAGATTAGTCGAGGTGTGCATAAGCTGGCTCAAACACTCACAGATAATAAAAAAGAGGTCTTTCTTTCACAGTGAATATATGTACCATTAGATATATGAAAGTCATTGCAGTAGCTTTTGATGGTAATTCCACCCTATATGCTATTTTCCCAATCCTTTTCAACACTAAATAAAGCCCACAAAACTTAGGAGAATGTTTTTTCACCCCCTTTTTAGCTAGGGAACACTTGTGATAGGGGGGGTGTAACTTGAGGAAGAACACTTGGAGGACTGAGATAAGCTCCTATAAGAGAATCAAGTTAACCCTCGACTGGTTCAGTTTCGGTTTAGCATCATATGTCGAAGGATCCCCAATGGAAAACTTGATGTCTATTCATTCTTGGTCAGCAAACTTCTTCATTCGATTTTGAGCTAAATTGCTCTTTCAAATCTAGCAAGGCCACATCTCGATCAATCAATTGTTGTTCTAGGGAGCTATTAGTTGTTTTCTCCATAAGACAGTAGGTGAGGGCTCGCTTGACCACACACAACATTAAAAGGAGTGGTATTAATGAAGATATGAAATGTGGTATTGTACCAGTACTCCTCTGCTCATGAAAGCTACTTTCCCCATTGTTTCAACTGCTCATTACAAAAAGACCGCAAGTAAGTCTCGAGACATCTGTTGACCCTCTCCATTTGTCCATCGGTTTGGAGATGAAATGTTATGCTTTGCTTGAGTAACATCCCTTTAGTGGCAAATAGCTATGTCCAGTAATGACTCACAAAAAGATTGTCTCTATTTGAAACTATTGACCTCGGAAAATCGTGTAACCTCACCACCTCCTTGATAAATATCACTGCAACTTAGTAAAGTATGGATGGCAGATCAAATATGAAATGGTTATTTTTGCTCAGGTGATCAACAACTACCATATTGGAGTCATTACCGTGTGATTTTGGCAGTCCTTCCTTGAAATCCATAGATACATCCTCTTATATACAATTAGGAATACGAAGAGGCTGCAATAACTCAATTGGTAATATTGATTCAATTTTGTTTGATTGTCATACCTCACACTGTTCCACATAGGAGTTCACATCATCTTTCATCCCTTTTCAGTAGAATTCTCCAGTCATTCATTTATACCTACGAAGGAGATGATTTAGATGATACCAACCTATCTCTATAGAGGAAACACCTCTGTTGCAATGAGAATTTTAGCTCTCTAGAGTCTGGTCTAGTGGATCCTCCTCTGTTAACTTTGTGATTATTCTTTGAAGCTCTTAATTTGATGGATCTCCTTGTGGATTTGTGACATATATAACAAGGCACGAGCAAATATAGCATTTGGTTCACTCACTGGAGTCACGCTTGACAAGGCATTGGCTACCTTATTCTACAATTCGGGTTGGTAAAGGATGTCAAAGTCATATGCCAACAATTTAGTCAAGCACCTCTGGTATTCGAGTTGTAATTCACGCTGTTCTGAGAAGAATGTTAATGCTCGTTGGTCAGATATCCCACCACCACCCCATAATTTCAGGTTTCTGTCAACAATAACCATAACTAGCCCAACCGCTTTTGACGAATGTAGTTGTGGGTCGCTCCACTGTCAATAAGAACAATCACTTCCTTATCTTGCACCATCCCCTTCAATTTCATGGTACTTGGAGCAGAAAATCCCAAAACCGATTGTAATGTCAACTCTGTACTGTTTGCCGCTTCAAGTGAGGTACTTCTACATTTTCCTCAACTTGTTCTTCTAGTAGTTATTCAATTTTGTTTCTGGCATTTGCTATCAAGAGCATCAACTCTCATTTCTCCTCATAGGCCCTTCTCCCTTCTTGCCTTGTATTTTGCATAGGTAAGTGTCCTGCATCATGCTTCGAGTTTACTGAAGAACAATTGTTACTTGGGGTGTTCCCATGGGTCGGCCGCTGAGGGGGTTATTTTCTAGGGCCAAGTACATGGTGAAGTTCCAATTGTGAACAAGTTGGGCCTCCTTCAAGATCTCATCAAGGCCTATGAGATGTTTGTTGATTACCTCCACCTCCAGCGTTGGAGCCGTTGAGGAACATACTCCTTAACACATCTTCTGCACTCAAAAAGGCCCAACGAGAGGGAATATAGGGGAAAATACTCTCCCAATGCTATTGATAATCAACCTCTCAAATAGAGTGAGAGTTTAACAAAATTAATGAAAGAAATAGCAAAAAATGAAAATGAAATGAACAAGAACGATATCTTTCGATCAGTTCGAGAGCATCAGTTTCCTTCCTCCCTGTCAAGAAAGCTTCCACAGCCAAAATATCCACACAAATCCAGCATCCCTCTGCCACTAAACCCTTCATAAACCTTGCACTGGTCCCCTTCCCCTCCATACGTAAACAACTTTTACATGCCACCACCCAATTGTCATGGGTGAAGTTCCTTTCTTACCCTTCCTCTTGTATGTATGTTCAATAGGAGGTCTTACATATAGTTAATAAGTGGAATTTGTACCACTTATTTTGTCTGGGTGTCCATATCTTGTCCGCTATTTACCACATTGGATTAAATTAGTAAAAGTTTGTCCAATAGGCAAGCATGCCGGCATATATTTTCACGGTTTTGTACCGCTTATTTTGTCTGGGTGTCCATATCTTGTCCGCTATTTACCACATTGGATTAAATTAGTAAAAGTTTGTCCAATAGGCAAGCATGCCGGCATATATTTCACGGTAAGGAATACTTAGAAGTTAGAATCACAGATTCACAATTCTAATGTTAATGCAAGAACATGATTGAAATGTTTTAAGGTTCGGAACTAAATAGACACAATTTATATTTAACCTAAATTCTATATAAATTTATGATAACTCTGGTAGAAGATGAAATATGGAATTTGGTTTTCAGCTGCAGCACCGGCACCGGCACCGGCACCGGGAAAACGTTTAAGGTTCAAAACCATGACAGATGCTTCAGAGACCCAAGTTTTGGAGGCCATGCAGAAGATAGCTTCACATATTAAGAACCCCACTAAGTTTGGTAAGGCTGCAAAACTTGCCATACAGCTCATTCAGGCAGGAAGCGTGAAGCCAACTACCAGTGATCGTTTCTTTACCATACTTGAAGCTGCCATGTCCGTGTCTTCATCCACTCCTTGTACTGATCCTTCAGTACGAGGAGATTATCATGCATTGTTTTCTGCTGCACAATCTACAGTGGAAGTATGATGATCATATTTTATGTTTTACAGTTGATATGATGATATTTTCTGTCTGTTTCTCTAAATGGGATTTCAATTTTCTAATAATGGTTTCTTATTTTAATTTTCAGTGCCTTAACAGAAAGCAGAAGAACCAGTTAACAACCTGGACAATTCAGACTGTGTTGGCAAATGATTTGTTGACAGATGACAGTTTTGTGGTAGGGATTATTTCCTATATTTGTGTTTAATCGAAACCATGATGCTATATAAGGATATAAAACCTACATTTATTTTATTTATTTTCTTTGTTTAGTTTTATCACTTTGAAAAACATGCCTGAAATCTTAAGTAATTTCAAAAGGTTTTTATAAAATGACTTTTTTGCTTTTTTAGATTTCAAAACTTGGCTATGAAAATGGTTTTAAAAATGGTTTTAAAAATGGTTTTTGAAATAGATTAGAAAAGGAAGAAAAAGATCAGTAAATAAGCATAACTTTTAAAAAGTAGAAACGTCATCAAATGGGTCGTTTGTCATTCTTGTTGGTTATTTTGACCATTTTACTCTCTTTACTCGTGGAACATCAATTTTCCATTCTTTCATGTTTGGCTTTTTGAACCATGTAGTTTTCAAAGACAGCTGGACAAATAAAAGAAGCTATCTCTAATCTTCCAGTTGCAACTAAGGAGGATGACATTGAGGAAGCTGAAGTACTTAGAGGCCATGAAGAGAGCACTGATGATGAACATCAGCAAAAGAAGAATGCTGCTCCAACTGAAGAGAAAAACCAGGAAGAATCCGATCCATTTGGGCTTGATGCTTTTCTGCCTGGTTCGTTGAAGAAAAGTGAGAGGGAAAAGGTAAAGAATGATGTGGCATCCATGACTAGAAATGATGAAGAAATGGAGACTAAGAGTTTTCTCAAAGCACAAAGAAATGCCCTAATTAGCTGTTTAGAAATTGCTGCTCATCGGTATAAAATCCCATGGTACAAATCTGCACCGAGTAGTTAGCAACAGTTCCTTACATTGTTTTTAAAACTGGAATGTGAAACTGACTTACAATTTCTCTCGTTAGGTGTCAAACCGTCATTGATATCTTAGTGAAGCATGCCTTTGATAATGTTACGAGGTTCACATCGCAGCAACAGGATGCGATTGGGAAATTGTGGGCTTCAGTAAGGGAACAACAAAATCGTAGGAAACAAGGAAAATCAGTCTCGGGTAAACTTGATGTAAATGGATTTGAATGGCTGCAACAGAAATATGCCAATGAGAAGATCAGCATTCGGCATTCTGTTGGGGCTAGTGGCGATCGAAAAGCACAACAGTGGCTTGGTTGA
mRNA sequence
ATGGCCGAGAACCTCTTCGAAGGCCTTCCACCGCCGATTTCAGCGACAAACCCTTCGCCGGAGACCCAATTACCAGATGCAGCCACCAAGCAGAGTCGGCCCTCTGATCCTTCTCCCAATCCCGCCGCTTCTTCTTCTTCTTCTTCTCCGGCGCCGGTCATCAAGAGCGCCCTCAAGCGCCCCAAGACTTCGCAAGAATCGAACCCAGCAGCTGCAGCACCGGCACCGGCACCGGCACCGGGAAAACGTTTAAGGTTCAAAACCATGACAGATGCTTCAGAGACCCAAGTTTTGGAGGCCATGCAGAAGATAGCTTCACATATTAAGAACCCCACTAAGTTTGGTAAGGCTGCAAAACTTGCCATACAGCTCATTCAGGCAGGAAGCGTGAAGCCAACTACCAGTGATCGTTTCTTTACCATACTTGAAGCTGCCATGTCCGTGTCTTCATCCACTCCTTGTACTGATCCTTCAGTACGAGGAGATTATCATGCATTGTTTTCTGCTGCACAATCTACAGTGGAATGCCTTAACAGAAAGCAGAAGAACCAGTTAACAACCTGGACAATTCAGACTGTGTTGGCAAATGATTTGTTGACAGATGACAGTTTTGTGTTTTCAAAGACAGCTGGACAAATAAAAGAAGCTATCTCTAATCTTCCAGTTGCAACTAAGGAGGATGACATTGAGGAAGCTGAAGTACTTAGAGGCCATGAAGAGAGCACTGATGATGAACATCAGCAAAAGAAGAATGCTGCTCCAACTGAAGAGAAAAACCAGGAAGAATCCGATCCATTTGGGCTTGATGCTTTTCTGCCTGGTTCGTTGAAGAAAAGTGAGAGGGAAAAGGTAAAGAATGATGTGGCATCCATGACTAGAAATGATGAAGAAATGGAGACTAAGAGTTTTCTCAAAGCACAAAGAAATGCCCTAATTAGCTGTTTAGAAATTGCTGCTCATCGGTATAAAATCCCATGGTGTCAAACCGTCATTGATATCTTAGTGAAGCATGCCTTTGATAATGTTACGAGGTTCACATCGCAGCAACAGGATGCGATTGGGAAATTGTGGGCTTCAGTAAGGGAACAACAAAATCGTAGGAAACAAGGAAAATCAGTCTCGGGTAAACTTGATGTAAATGGATTTGAATGGCTGCAACAGAAATATGCCAATGAGAAGATCAGCATTCGGCATTCTGTTGGGGCTAGTGGCGATCGAAAAGCACAACAGTGGCTTGGTTGA
Coding sequence (CDS)
ATGGCCGAGAACCTCTTCGAAGGCCTTCCACCGCCGATTTCAGCGACAAACCCTTCGCCGGAGACCCAATTACCAGATGCAGCCACCAAGCAGAGTCGGCCCTCTGATCCTTCTCCCAATCCCGCCGCTTCTTCTTCTTCTTCTTCTCCGGCGCCGGTCATCAAGAGCGCCCTCAAGCGCCCCAAGACTTCGCAAGAATCGAACCCAGCAGCTGCAGCACCGGCACCGGCACCGGCACCGGGAAAACGTTTAAGGTTCAAAACCATGACAGATGCTTCAGAGACCCAAGTTTTGGAGGCCATGCAGAAGATAGCTTCACATATTAAGAACCCCACTAAGTTTGGTAAGGCTGCAAAACTTGCCATACAGCTCATTCAGGCAGGAAGCGTGAAGCCAACTACCAGTGATCGTTTCTTTACCATACTTGAAGCTGCCATGTCCGTGTCTTCATCCACTCCTTGTACTGATCCTTCAGTACGAGGAGATTATCATGCATTGTTTTCTGCTGCACAATCTACAGTGGAATGCCTTAACAGAAAGCAGAAGAACCAGTTAACAACCTGGACAATTCAGACTGTGTTGGCAAATGATTTGTTGACAGATGACAGTTTTGTGTTTTCAAAGACAGCTGGACAAATAAAAGAAGCTATCTCTAATCTTCCAGTTGCAACTAAGGAGGATGACATTGAGGAAGCTGAAGTACTTAGAGGCCATGAAGAGAGCACTGATGATGAACATCAGCAAAAGAAGAATGCTGCTCCAACTGAAGAGAAAAACCAGGAAGAATCCGATCCATTTGGGCTTGATGCTTTTCTGCCTGGTTCGTTGAAGAAAAGTGAGAGGGAAAAGGTAAAGAATGATGTGGCATCCATGACTAGAAATGATGAAGAAATGGAGACTAAGAGTTTTCTCAAAGCACAAAGAAATGCCCTAATTAGCTGTTTAGAAATTGCTGCTCATCGGTATAAAATCCCATGGTGTCAAACCGTCATTGATATCTTAGTGAAGCATGCCTTTGATAATGTTACGAGGTTCACATCGCAGCAACAGGATGCGATTGGGAAATTGTGGGCTTCAGTAAGGGAACAACAAAATCGTAGGAAACAAGGAAAATCAGTCTCGGGTAAACTTGATGTAAATGGATTTGAATGGCTGCAACAGAAATATGCCAATGAGAAGATCAGCATTCGGCATTCTGTTGGGGCTAGTGGCGATCGAAAAGCACAACAGTGGCTTGGTTGA
Protein sequence
MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKRPKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKLAIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRKQKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEESTDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMETKSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASVREQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG
Homology
BLAST of Clc06G10220 vs. NCBI nr
Match:
XP_038879152.1 (uncharacterized protein LOC120071141 isoform X1 [Benincasa hispida])
HSP 1 Score: 710.3 bits (1832), Expect = 9.7e-201
Identity = 378/413 (91.53%), Postives = 387/413 (93.70%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MAENLFEGLPPPIS TNPSPE QLPDAAT Q+RPSDPS NPAA SSSSPAPVIKSALKR
Sbjct: 1 MAENLFEGLPPPISTTNPSPEAQLPDAATNQNRPSDPSTNPAA--SSSSPAPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+QE NPAA APAPGKRLRFKT TDASETQVLEAMQKIASHIKNP KFGKAAKL
Sbjct: 61 PKTAQEPNPAAT----APAPGKRLRFKTTTDASETQVLEAMQKIASHIKNPNKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVKP TSDRFFTILEAAMS+SSSTPCTDPSVRGDYHALF AAQSTVECLNRK
Sbjct: 121 AIQLIQAGSVKPATSDRFFTILEAAMSMSSSTPCTDPSVRGDYHALFLAAQSTVECLNRK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVAT EDDIEEAE L+GHEE
Sbjct: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATTEDDIEEAEALKGHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
STDDEHQ+KKN EEKNQEESDPFGLDAFLPGSLKK ER +VKNDVAS TRNDEE+ET
Sbjct: 241 STDDEHQKKKNVDLAEEKNQEESDPFGLDAFLPGSLKKGERARVKNDVASKTRNDEEVET 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
K FLKAQR+ALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQ+DAIGKLWASV
Sbjct: 301 KRFLKAQRDALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 407
BLAST of Clc06G10220 vs. NCBI nr
Match:
XP_038879153.1 (uncharacterized protein LOC120071141 isoform X2 [Benincasa hispida])
HSP 1 Score: 708.4 bits (1827), Expect = 3.7e-200
Identity = 377/413 (91.28%), Postives = 386/413 (93.46%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MAENLFEGLPPPIS TNPSPE QLPDAAT Q+RPSDPS NPAA SSSSPAPVIKSALKR
Sbjct: 1 MAENLFEGLPPPISTTNPSPEAQLPDAATNQNRPSDPSTNPAA--SSSSPAPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+QE NPA APAPGKRLRFKT TDASETQVLEAMQKIASHIKNP KFGKAAKL
Sbjct: 61 PKTAQEPNPA------APAPGKRLRFKTTTDASETQVLEAMQKIASHIKNPNKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVKP TSDRFFTILEAAMS+SSSTPCTDPSVRGDYHALF AAQSTVECLNRK
Sbjct: 121 AIQLIQAGSVKPATSDRFFTILEAAMSMSSSTPCTDPSVRGDYHALFLAAQSTVECLNRK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVAT EDDIEEAE L+GHEE
Sbjct: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATTEDDIEEAEALKGHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
STDDEHQ+KKN EEKNQEESDPFGLDAFLPGSLKK ER +VKNDVAS TRNDEE+ET
Sbjct: 241 STDDEHQKKKNVDLAEEKNQEESDPFGLDAFLPGSLKKGERARVKNDVASKTRNDEEVET 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
K FLKAQR+ALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQ+DAIGKLWASV
Sbjct: 301 KRFLKAQRDALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 405
BLAST of Clc06G10220 vs. NCBI nr
Match:
XP_008462709.1 (PREDICTED: uncharacterized protein LOC103501010 [Cucumis melo] >XP_016902919.1 PREDICTED: uncharacterized protein LOC103501010 [Cucumis melo])
HSP 1 Score: 700.7 bits (1807), Expect = 7.7e-198
Identity = 376/413 (91.04%), Postives = 384/413 (92.98%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MAENLFEGLPPPISATN PET LPDA T Q RPSDPS NPA SSSSPAPVIKSALKR
Sbjct: 1 MAENLFEGLPPPISATNLLPETLLPDAPTNQIRPSDPSINPA--PSSSSPAPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+QE N AA APAPGKRLRFKT TDASETQVLEAMQKIASHIKNPTKFGKAAKL
Sbjct: 61 PKTAQEPNSAAT----APAPGKRLRFKTTTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVKP TSDRFFTILEAAMS+SSSTPCTDPSVRGDYHALFSAAQST+ECLNRK
Sbjct: 121 AIQLIQAGSVKPATSDRFFTILEAAMSMSSSTPCTDPSVRGDYHALFSAAQSTMECLNRK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDD EEAE L+GHEE
Sbjct: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDSEEAEALKGHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
STDDEHQ+KK+AAP E KNQEESDPFGLDAFLPGSLKKSER KVKNDV S TRNDEE+ET
Sbjct: 241 STDDEHQKKKDAAPAEGKNQEESDPFGLDAFLPGSLKKSERAKVKNDVVSKTRNDEEVET 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
KSFLKAQR ALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFT QQ+DAIGKLWASV
Sbjct: 301 KSFLKAQRGALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTLQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVG SGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGGSGDRKAQQWLG 407
BLAST of Clc06G10220 vs. NCBI nr
Match:
XP_004142528.1 (uncharacterized protein LOC101212234 [Cucumis sativus] >KGN66787.1 hypothetical protein Csa_007096 [Cucumis sativus])
HSP 1 Score: 683.3 bits (1762), Expect = 1.3e-192
Identity = 365/413 (88.38%), Postives = 376/413 (91.04%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MAENLFEGLPPPISATN SPETQLPDA T Q RPSDPS N PAPVIKSALKR
Sbjct: 1 MAENLFEGLPPPISATNLSPETQLPDAPTNQIRPSDPSTN---------PAPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+QE N AA APAPGKRLRFKT TDASETQVLEAMQKIASHIKNPTKFGKAAKL
Sbjct: 61 PKTAQEPNSAAT----APAPGKRLRFKTTTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVKP TSD FFTILEAAMS+SSSTPCTD SVRGDYHALFSAAQST+ECLNRK
Sbjct: 121 AIQLIQAGSVKPATSDCFFTILEAAMSMSSSTPCTDASVRGDYHALFSAAQSTMECLNRK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAIS+LPVATKEDD EEAE L+GHEE
Sbjct: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISDLPVATKEDDSEEAEALKGHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
STDDEH +KKNAAP E+KNQEESDPFGLDAFLPGSLKK ER KVKNDV S TRNDEE+E
Sbjct: 241 STDDEHLKKKNAAPAEKKNQEESDPFGLDAFLPGSLKKGERAKVKNDVVSKTRNDEEVEA 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
K+FLKAQR ALISCLEIAAHRY+IPWCQTVIDILVKHAFDNVTRFT QQ+DAIGKLWASV
Sbjct: 301 KNFLKAQRGALISCLEIAAHRYRIPWCQTVIDILVKHAFDNVTRFTLQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVG SGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGGSGDRKAQQWLG 400
BLAST of Clc06G10220 vs. NCBI nr
Match:
XP_023533719.1 (uncharacterized protein LOC111795494 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 672.9 bits (1735), Expect = 1.7e-189
Identity = 358/413 (86.68%), Postives = 379/413 (91.77%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MA+NLFEGLPPPISAT+PS ETQ DA T Q+RPSDPSPNP A SSSS P PVIKSALKR
Sbjct: 1 MADNLFEGLPPPISATDPSSETQPQDANTTQNRPSDPSPNPPA-SSSSCPPPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+ E NP APAPAPAPGKRLRFKT TDASETQV+EAMQKIASHIKNPTKFGKAAKL
Sbjct: 61 PKTALEPNP--TAPAPAPAPGKRLRFKTTTDASETQVMEAMQKIASHIKNPTKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVK TSD FF ILEAAMS+SSSTPCTDPSVRGDYHALFSAAQST+ECLN+K
Sbjct: 121 AIQLIQAGSVKAATSDHFFAILEAAMSMSSSTPCTDPSVRGDYHALFSAAQSTMECLNKK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQL+TWTI+ V+ANDLLTDDSFVFSKTA QIKEAISNLPVATKEDD+EEAE L+ HEE
Sbjct: 181 QKNQLSTWTIRAVVANDLLTDDSFVFSKTATQIKEAISNLPVATKEDDVEEAEALKVHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
+TDDEHQ+K+NAAP E+K+QEESDPFGL+AFLPGSLKK ER K KNDV S R DEE+E
Sbjct: 241 NTDDEHQKKENAAPAEKKSQEESDPFGLEAFLPGSLKKGERTKGKNDVESKIRKDEEVEA 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
KSFLKAQR ALISCLEIAAHRYKIPWCQTVIDILVKHAFDNV RFTSQQ+DAIGKLWASV
Sbjct: 301 KSFLKAQREALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVMRFTSQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 410
BLAST of Clc06G10220 vs. ExPASy TrEMBL
Match:
A0A1S3CHK0 (uncharacterized protein LOC103501010 OS=Cucumis melo OX=3656 GN=LOC103501010 PE=4 SV=1)
HSP 1 Score: 700.7 bits (1807), Expect = 3.7e-198
Identity = 376/413 (91.04%), Postives = 384/413 (92.98%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MAENLFEGLPPPISATN PET LPDA T Q RPSDPS NPA SSSSPAPVIKSALKR
Sbjct: 1 MAENLFEGLPPPISATNLLPETLLPDAPTNQIRPSDPSINPA--PSSSSPAPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+QE N AA APAPGKRLRFKT TDASETQVLEAMQKIASHIKNPTKFGKAAKL
Sbjct: 61 PKTAQEPNSAAT----APAPGKRLRFKTTTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVKP TSDRFFTILEAAMS+SSSTPCTDPSVRGDYHALFSAAQST+ECLNRK
Sbjct: 121 AIQLIQAGSVKPATSDRFFTILEAAMSMSSSTPCTDPSVRGDYHALFSAAQSTMECLNRK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDD EEAE L+GHEE
Sbjct: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDSEEAEALKGHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
STDDEHQ+KK+AAP E KNQEESDPFGLDAFLPGSLKKSER KVKNDV S TRNDEE+ET
Sbjct: 241 STDDEHQKKKDAAPAEGKNQEESDPFGLDAFLPGSLKKSERAKVKNDVVSKTRNDEEVET 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
KSFLKAQR ALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFT QQ+DAIGKLWASV
Sbjct: 301 KSFLKAQRGALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTLQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVG SGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGGSGDRKAQQWLG 407
BLAST of Clc06G10220 vs. ExPASy TrEMBL
Match:
A0A0A0M0Y5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G690250 PE=4 SV=1)
HSP 1 Score: 683.3 bits (1762), Expect = 6.2e-193
Identity = 365/413 (88.38%), Postives = 376/413 (91.04%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MAENLFEGLPPPISATN SPETQLPDA T Q RPSDPS N PAPVIKSALKR
Sbjct: 1 MAENLFEGLPPPISATNLSPETQLPDAPTNQIRPSDPSTN---------PAPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+QE N AA APAPGKRLRFKT TDASETQVLEAMQKIASHIKNPTKFGKAAKL
Sbjct: 61 PKTAQEPNSAAT----APAPGKRLRFKTTTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVKP TSD FFTILEAAMS+SSSTPCTD SVRGDYHALFSAAQST+ECLNRK
Sbjct: 121 AIQLIQAGSVKPATSDCFFTILEAAMSMSSSTPCTDASVRGDYHALFSAAQSTMECLNRK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAIS+LPVATKEDD EEAE L+GHEE
Sbjct: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISDLPVATKEDDSEEAEALKGHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
STDDEH +KKNAAP E+KNQEESDPFGLDAFLPGSLKK ER KVKNDV S TRNDEE+E
Sbjct: 241 STDDEHLKKKNAAPAEKKNQEESDPFGLDAFLPGSLKKGERAKVKNDVVSKTRNDEEVEA 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
K+FLKAQR ALISCLEIAAHRY+IPWCQTVIDILVKHAFDNVTRFT QQ+DAIGKLWASV
Sbjct: 301 KNFLKAQRGALISCLEIAAHRYRIPWCQTVIDILVKHAFDNVTRFTLQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVG SGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGGSGDRKAQQWLG 400
BLAST of Clc06G10220 vs. ExPASy TrEMBL
Match:
A0A6J1HCB8 (uncharacterized protein LOC111461540 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111461540 PE=4 SV=1)
HSP 1 Score: 669.1 bits (1725), Expect = 1.2e-188
Identity = 356/413 (86.20%), Postives = 377/413 (91.28%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MA+NLFEGLPPPISA +P PETQL DA T Q+RPSDPSPNP A SSSS P PVIKSALKR
Sbjct: 1 MADNLFEGLPPPISAIDPLPETQLQDANTTQNRPSDPSPNPPA-SSSSCPPPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+ E NP PA APAPGKRLRFKT TDASETQV+EAMQKIASHIKNPTKFGKAAKL
Sbjct: 61 PKTALEPNP--TGPATAPAPGKRLRFKTTTDASETQVMEAMQKIASHIKNPTKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVK TSD FF ILEAAMS+SSSTPCTDPSVRGDYHALFSAAQST+ECLN+K
Sbjct: 121 AIQLIQAGSVKAATSDHFFAILEAAMSMSSSTPCTDPSVRGDYHALFSAAQSTMECLNKK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQL+TWTI+ V+ANDLLTDDSFVFSKTA QIKEAISNLPVATKEDD+EEAE L+ HEE
Sbjct: 181 QKNQLSTWTIRAVVANDLLTDDSFVFSKTATQIKEAISNLPVATKEDDVEEAEALKVHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
+TDDEHQ+K++AAP EEK+QEESDPFGL+AFLPGSLKK ER K KNDV S R DEE+E
Sbjct: 241 NTDDEHQKKEDAAPAEEKSQEESDPFGLEAFLPGSLKKGERAKGKNDVESKIRKDEEVEA 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
KSFLKAQR ALISCLEIAAHRYKIPWCQTVIDILVKHAFDNV RFTSQQ+DAIGKLWASV
Sbjct: 301 KSFLKAQREALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVMRFTSQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 410
BLAST of Clc06G10220 vs. ExPASy TrEMBL
Match:
A0A6J1JIQ9 (uncharacterized protein LOC111485467 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485467 PE=4 SV=1)
HSP 1 Score: 668.7 bits (1724), Expect = 1.6e-188
Identity = 356/413 (86.20%), Postives = 376/413 (91.04%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MA+NLFEGLPPPISA +PSPETQL DA T Q+RPSDPSPNP A SSSS P PVIKSALKR
Sbjct: 1 MADNLFEGLPPPISAIDPSPETQLYDANTTQNRPSDPSPNPPA-SSSSCPPPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+ E NP PA APAPGKRLRFKT TDASE QV+EAMQKIASHIKNPTKFGKAAKL
Sbjct: 61 PKTALEPNP--TGPATAPAPGKRLRFKTTTDASEAQVMEAMQKIASHIKNPTKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVK TSD FF ILEAAMS+SSSTPCTDPSVRGDYHALFSAAQST+ECLN+K
Sbjct: 121 AIQLIQAGSVKAATSDHFFAILEAAMSMSSSTPCTDPSVRGDYHALFSAAQSTMECLNKK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQL+TWTIQ V+ANDLLTDDSFVFSKTA QIKEAISNLPVATKEDD+EEAE L+ HEE
Sbjct: 181 QKNQLSTWTIQAVVANDLLTDDSFVFSKTATQIKEAISNLPVATKEDDVEEAEALKVHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
+TDDEHQ+K+NAAP +EK+QEESDPFGL+AFLPGSLKK ER K KNDV S R DEE+E
Sbjct: 241 NTDDEHQKKENAAPAKEKSQEESDPFGLEAFLPGSLKKGERAKGKNDVESKIRQDEEVEA 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
KSFLKAQR AL SCLEIAAHRYKIPWCQTVIDILVKHAFDNV RFTSQQ+DAIGKLWASV
Sbjct: 301 KSFLKAQREALTSCLEIAAHRYKIPWCQTVIDILVKHAFDNVMRFTSQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 410
BLAST of Clc06G10220 vs. ExPASy TrEMBL
Match:
A0A6J1H8S6 (uncharacterized protein LOC111461540 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111461540 PE=4 SV=1)
HSP 1 Score: 666.8 bits (1719), Expect = 6.0e-188
Identity = 355/413 (85.96%), Postives = 376/413 (91.04%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPSPETQLPDAATKQSRPSDPSPNPAASSSSSSPAPVIKSALKR 60
MA+NLFEGLPPPISA +P PETQL DA T Q+RPSDPSPNP A SSSS P PVIKSALKR
Sbjct: 1 MADNLFEGLPPPISAIDPLPETQLQDANTTQNRPSDPSPNPPA-SSSSCPPPVIKSALKR 60
Query: 61 PKTSQESNPAAAAPAPAPAPGKRLRFKTMTDASETQVLEAMQKIASHIKNPTKFGKAAKL 120
PKT+ E NP A APAPGKRLRFKT TDASETQV+EAMQKIASHIKNPTKFGKAAKL
Sbjct: 61 PKTALEPNPTAT----APAPGKRLRFKTTTDASETQVMEAMQKIASHIKNPTKFGKAAKL 120
Query: 121 AIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGDYHALFSAAQSTVECLNRK 180
AIQLIQAGSVK TSD FF ILEAAMS+SSSTPCTDPSVRGDYHALFSAAQST+ECLN+K
Sbjct: 121 AIQLIQAGSVKAATSDHFFAILEAAMSMSSSTPCTDPSVRGDYHALFSAAQSTMECLNKK 180
Query: 181 QKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPVATKEDDIEEAEVLRGHEE 240
QKNQL+TWTI+ V+ANDLLTDDSFVFSKTA QIKEAISNLPVATKEDD+EEAE L+ HEE
Sbjct: 181 QKNQLSTWTIRAVVANDLLTDDSFVFSKTATQIKEAISNLPVATKEDDVEEAEALKVHEE 240
Query: 241 STDDEHQQKKNAAPTEEKNQEESDPFGLDAFLPGSLKKSEREKVKNDVASMTRNDEEMET 300
+TDDEHQ+K++AAP EEK+QEESDPFGL+AFLPGSLKK ER K KNDV S R DEE+E
Sbjct: 241 NTDDEHQKKEDAAPAEEKSQEESDPFGLEAFLPGSLKKGERAKGKNDVESKIRKDEEVEA 300
Query: 301 KSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVTRFTSQQQDAIGKLWASV 360
KSFLKAQR ALISCLEIAAHRYKIPWCQTVIDILVKHAFDNV RFTSQQ+DAIGKLWASV
Sbjct: 301 KSFLKAQREALISCLEIAAHRYKIPWCQTVIDILVKHAFDNVMRFTSQQRDAIGKLWASV 360
Query: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 414
REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG
Sbjct: 361 REQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHSVGASGDRKAQQWLG 408
BLAST of Clc06G10220 vs. TAIR 10
Match:
AT3G04560.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 16 growth stages; Has 227 Blast hits to 225 proteins in 83 species: Archae - 0; Bacteria - 17; Metazoa - 98; Fungi - 29; Plants - 51; Viruses - 1; Other Eukaryotes - 31 (source: NCBI BLink). )
HSP 1 Score: 437.2 bits (1123), Expect = 1.5e-122
Identity = 260/434 (59.91%), Postives = 310/434 (71.43%), Query Frame = 0
Query: 1 MAENLFEGLPPPISATNPS-PETQLPD----------------AATKQSRPSDPSPNPAA 60
MAENLF GLPPP S+ P + +PD +A K+S+P + +PN +A
Sbjct: 1 MAENLFSGLPPPSSSQQQELPISPIPDESKIETSSPAPILVLKSALKRSKPEESAPNLSA 60
Query: 61 SSSSSSPAPVIKSALKRPKTSQESNPAAAAPAPAP-APGKRLRFKTMTDASETQVLEAMQ 120
PV+KSALKR K S ES P P P P AP KRL+FKT TDASE QV+EAMQ
Sbjct: 61 -------PPVLKSALKRSKPS-ESTP---EPVPEPEAPKKRLQFKTSTDASEEQVIEAMQ 120
Query: 121 KIASHIKNPTKFGKAAKLAIQLIQAGSVKPTTSDRFFTILEAAMSVSSSTPCTDPSVRGD 180
KI SHIKNP+KF KA+KLAI+LIQAGSVKP TS F ILEAAM SS TPCTD SVR D
Sbjct: 121 KITSHIKNPSKFSKASKLAIRLIQAGSVKPETSSYFIAILEAAM--SSKTPCTDRSVRAD 180
Query: 181 YHALFSAAQSTVECLNRKQKNQLTTWTIQTVLANDLLTDDSFVFSKTAGQIKEAISNLPV 240
YHALFSAAQ ECL++ QKN LT WT + V+ANDL TDDSF+FSKTA +IKEAIS+LPV
Sbjct: 181 YHALFSAAQDVAECLDKSQKNLLTIWTFKAVVANDLFTDDSFMFSKTATRIKEAISDLPV 240
Query: 241 ATKEDDIEEAEVLRGHEESTDDEHQQKKN---AAPTEEKNQEESDPFGLDAFLPGSLKKS 300
+T+EDD+EEA L + + Q ++ AA + ESDPFGLDA++P S KK+
Sbjct: 241 STEEDDVEEAAALEEAAVKDNGDGQTTQDVAEAASAGDNEAVESDPFGLDAWIPSSGKKN 300
Query: 301 EREKVKNDVASMTRNDEEMETKSFLKAQRNALISCLEIAAHRYKIPWCQTVIDILVKHAF 360
+ K+K + + E K FL+++R ALI+CLEIAA RYK+PWCQTVIDILVKHAF
Sbjct: 301 GKTKIKR----TNEDPDAEENKRFLRSKREALITCLEIAARRYKVPWCQTVIDILVKHAF 360
Query: 361 DNVTRFTSQQQDAIGKLWASVREQQNRRKQGKSVSGKLDVNGFEWLQQKYANEKISIRHS 414
+NV+RFTSQQ+ A+ KLWASVREQ RRKQGKSV+GKLDV FE LQ KYANEK+SIR S
Sbjct: 361 ENVSRFTSQQRQAVEKLWASVREQHLRRKQGKSVTGKLDVTAFESLQDKYANEKMSIRSS 417
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038879152.1 | 9.7e-201 | 91.53 | uncharacterized protein LOC120071141 isoform X1 [Benincasa hispida] | [more] |
XP_038879153.1 | 3.7e-200 | 91.28 | uncharacterized protein LOC120071141 isoform X2 [Benincasa hispida] | [more] |
XP_008462709.1 | 7.7e-198 | 91.04 | PREDICTED: uncharacterized protein LOC103501010 [Cucumis melo] >XP_016902919.1 P... | [more] |
XP_004142528.1 | 1.3e-192 | 88.38 | uncharacterized protein LOC101212234 [Cucumis sativus] >KGN66787.1 hypothetical ... | [more] |
XP_023533719.1 | 1.7e-189 | 86.68 | uncharacterized protein LOC111795494 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3CHK0 | 3.7e-198 | 91.04 | uncharacterized protein LOC103501010 OS=Cucumis melo OX=3656 GN=LOC103501010 PE=... | [more] |
A0A0A0M0Y5 | 6.2e-193 | 88.38 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G690250 PE=4 SV=1 | [more] |
A0A6J1HCB8 | 1.2e-188 | 86.20 | uncharacterized protein LOC111461540 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JIQ9 | 1.6e-188 | 86.20 | uncharacterized protein LOC111485467 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1H8S6 | 6.0e-188 | 85.96 | uncharacterized protein LOC111461540 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT3G04560.1 | 1.5e-122 | 59.91 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |