Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
GAGGGAACCAAAAGCCTCATCTGTTGATAGGTATTGGATATCGTTGGTTTCTCTTACCCTTGTATAAAAAACAAGAAAAATCGAACCACAGATCTCTGAGTTCTCAAAACTCAATCTCTGAACGAACCCCACGGCGGCGCTCGGCAGCCGGCCACCGGCGGACAAACCCCCCTTCCCAAAAGGCTGTTCTGTAACTGAAACCTTAAAAAAAATTCGTTGGAGTCCTTCATGGACTACTCTGATCTCTCTGTTTTTAGGTCCCTCCACTTCTTACTCTATAAAGTCCAAGCCGACGGCGCATAAATTTCCAATTTTCAGTCTCTTATTCTCTCTCTTTTTCTCTCTCTTTCTTATTGTCTTTGTGTCAGCACCAAGGGTCATGGCTGCAGATCCAGAAGGAAGAGAGTTCAAGTTTTACTCTCAGTTTCAGACTGAGCATGGAGATAAACAAAGCCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGCTCTGATTCATTTCTCAGCTTCAGCTCGCCAGTGGAGTCTGAGATTGGTTCCTATGAAATTGAGAGCGAAAAAGACGAAGGCGACGATGGTTATATAGCGGAATTGAGTCACCAGATGGCTCAATACATGCTTCAAGATGATGATAACTCCTCCACTGCAAGCTTTCAATCTGAGATTCAGAACAAGGTATTGAATTTGTTCAGGCGTGGAAGGTTTCATCAAGGTGTTTGTGAAATTCTTTGTTGCTCCCTTTTTTTGATATTGTGGGTGTGTTCTGTTTCTTGTATATAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACCCTGTGGTCACCTCTAGGCTCGAGCAATGGAAGCAGCCATGGAAGCCCAGAAGGGCCTTCGAAGGAGCCATCGCCTCCATCGACACCGGTGGTTGCAGAGCGTGGGGGGCTGGACATATCACAGAACGTTTTCAGCCAGTTGGAGAAGATGAAAAAAGTGAGCACAAATGGTAAATCAATCCAAACAAGCCCCCAAGATGGAGAAACTGATTCCTCTTCTTCTTTCAAAGATCAAGCTAGAGCTCTTAAAGTGAGATGAGATTTAAATCCCACAAATCCCATAAATCTCTCTGTTTCTTACTTTTTTTTACCTTCACAAATCTCTTTCTCCCCTTTTTGCCGATTCAAAACTCGACCCTCTTTGCAGAATCAGAAGCGGAAGCAGAACCAGCAGTTTATGAAGCAAAAAGGCTCAGCCACCATACAAGCCAAGCAAGCTCAAGGAAGCTCATCACAAGGCAATTCAGGGGCAAAACCAGGAGGATCATCAGGGACTGGCGTGTTCTTGCCCCGCCATGTGAACTATAACCGTCCAGCCCCATCTCCTCAGCCACCACAGCCGTCCAAGAAAAAGGGTAGATTCCACTTTCATTCCATAGCTAGCTAATTTTTACATCGTTTTCGTAACGGGCTAACTAGCTAACCCCATTAGTTGGGATTTTTCCATCAATGAATTGCCATTTCAACCATTCCTCGTGATGCATCTTGGACTTTTGCTCATACCCCTCTGTTATATTTTTCTTTTTTTTTTCTCTTTCTCTTGATTCATAGAGATTCCGTGTCTTTCCAAATACCCTCTGAATCTATCAGACTTTCAACCATTCACGGTTGGCCCAACCAGATTTGTCCGCTTCTTTGATGGAATCTCCATCCATCTCTTTAAAACCTGGCTGTTCTGTGGTCAAAAGGAGGTGAATGAAAACGACACCCAATTGAATCTTTAAACCACTTGCATAGCCCACTTCAGCTTAATATTTTAATGGGCCGATAGATGCGTTTGTCTTCCTCTACAAGGAAAAAACACTAACGAATTTACCCTTCCCCCCTTTCCCCATGTCAGGCTGCTCCACTGTTCTAATACCCGTGAGAGTCCTACAAGCCTTACAACTTCACTACGACAGAATGGACGACGAGACAAGACAAAAAATCACTGGCTTCACAGCACTTAGAGGTAATTTTACAAT
TTCCATCACAAAATTCTCTGTGGCATTTCCACAAACAGTTGGTGGTCATTCCACAAAAATTGCATAACTTACTCCCTTCATGTCTCTATTACCCCCTTCTTGTCTCTATTCGATCTGGGTGATTCTATTTTCATCTAACAAAAGATTCCTATTTGCTTTTACGAACAGAAGCAGCAGCTGGTGTGAGAACAGCTACACATACCGTTGAGAAAAGTCATCCGGGAGCGTCGGCGGCGGCGGCGACAAGTCAAACCGACGTGGGTCTTCCTCAAGAATGGACGTATTAATGGCTTTCGACAACTGGCGAGGCGCCATTTCGTCTCTCGATCACAAATCTGAAGCTTTTTGGCGGTTTTCTGAGAGAACATGGAGGTGATGTTAATGGAATGTGAATAAAAGCCGCAAGTGGGCGGAGAAGAAGACGAAATAAAATTAGGGGTTGTGAAAACGACGTCGTCTATTTTTACTGTCTATTAGTGGCAAAAACATAAATAATGTTTGGGAAATTAATATTGTTAGGGAGGAAAATGGGATTTGTTTAGATGAAATCCTTTTGTAATAATTTTCAAACAAGGTTGTACCTCTTCAATAATTGTTTAAATAATTTAATGATTTTTTATGGGTCTACAATTTAGTTTGTTAATTCAAAACCGACTAATAAATTACCGATGGAATTTTTTTTCTTTTCAAAA
mRNA sequence
GAGGGAACCAAAAGCCTCATCTGTTGATAGGTATTGGATATCGTTGGTTTCTCTTACCCTTGTATAAAAAACAAGAAAAATCGAACCACAGATCTCTGAGTTCTCAAAACTCAATCTCTGAACGAACCCCACGGCGGCGCTCGGCAGCCGGCCACCGGCGGACAAACCCCCCTTCCCAAAAGGCTGTTCTGTAACTGAAACCTTAAAAAAAATTCGTTGGAGTCCTTCATGGACTACTCTGATCTCTCTGTTTTTAGGTCCCTCCACTTCTTACTCTATAAAGTCCAAGCCGACGGCGCATAAATTTCCAATTTTCAGTCTCTTATTCTCTCTCTTTTTCTCTCTCTTTCTTATTGTCTTTGTGTCAGCACCAAGGGTCATGGCTGCAGATCCAGAAGGAAGAGAGTTCAAGTTTTACTCTCAGTTTCAGACTGAGCATGGAGATAAACAAAGCCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGCTCTGATTCATTTCTCAGCTTCAGCTCGCCAGTGGAGTCTGAGATTGGTTCCTATGAAATTGAGAGCGAAAAAGACGAAGGCGACGATGGTTATATAGCGGAATTGAGTCACCAGATGGCTCAATACATGCTTCAAGATGATGATAACTCCTCCACTGCAAGCTTTCAATCTGAGATTCAGAACAAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACCCTGTGGTCACCTCTAGGCTCGAGCAATGGAAGCAGCCATGGAAGCCCAGAAGGGCCTTCGAAGGAGCCATCGCCTCCATCGACACCGGTGGTTGCAGAGCGTGGGGGGCTGGACATATCACAGAACGTTTTCAGCCAGTTGGAGAAGATGAAAAAAGTGAGCACAAATGGTAAATCAATCCAAACAAGCCCCCAAGATGGAGAAACTGATTCCTCTTCTTCTTTCAAAGATCAAGCTAGAGCTCTTAAAAATCAGAAGCGGAAGCAGAACCAGCAGTTTATGAAGCAAAAAGGCTCAGCCACCATACAAGCCAAGCAAGCTCAAGGAAGCTCATCACAAGGCAATTCAGGGGCAAAACCAGGAGGATCATCAGGGACTGGCGTGTTCTTGCCCCGCCATGTGAACTATAACCGTCCAGCCCCATCTCCTCAGCCACCACAGCCGTCCAAGAAAAAGGGTAGATTCCACTTTCATTCCATAGCTAGCTAATTTTTACATCGTTTTCGTAACGGGCTAACTAGCTAACCCCATTAGTTGGGATTTTTCCATCAATGAATTGCCATTTCAACCATTCCTCGTGATGCATCTTGGACTTTTGCTCATACCCCTCTGTTATATTTTTCTTTTTTTTTTCTCTTTCTCTTGATTCATAGAGATTCCGTGTCTTTCCAAATACCCTCTGAATCTATCAGACTTTCAACCATTCACGGTTGGCCCAACCAGATTTGTCCGCTTCTTTGATGGAATCTCCATCCATCTCTTTAAAACCTGGCTGTTCTGTGGTCAAAAGGAGGTGAATGAAAACGACACCCAATTGAATCTTTAAACCACTTGCATAGCCCACTTCAGCTTAATATTTTAATGGGCCGATAGATGCGTTTGTCTTCCTCTACAAGGAAAAAACACTAACGAATTTACCCTTCCCCCCTTTCCCCATGTCAGGCTGCTCCACTGTTCTAATACCCGTGAGAGTCCTACAAGCCTTACAACTTCACTACGACAGAATGGACGACGAGACAAGACAAAAAATCACTGGCTTCACAGCACTTAGAGAAGCAGCAGCTGGTGTGAGAACAGCTACACATACCGTTGAGAAAAGTCATCCGGGAGCGTCGGCGGCGGCGGCGACAAGTCAAACCGACGTGGGTCTTCCTCAAGAATGGACGTATTAATGGCTTTCGACAACTGGCGAGGCGCCATTTCGTCTCTCGATCACAAATCTGAAGCTTTTTGGCGGTTTTCTGAGAGAACATGGAGGTGATGTTAATGGAATGTGAATAAAAGCCGC
AAGTGGGCGGAGAAGAAGACGAAATAAAATTAGGGGTTGTGAAAACGACGTCGTCTATTTTTACTGTCTATTAGTGGCAAAAACATAAATAATGTTTGGGAAATTAATATTGTTAGGGAGGAAAATGGGATTTGTTTAGATGAAATCCTTTTGTAATAATTTTCAAACAAGGTTGTACCTCTTCAATAATTGTTTAAATAATTTAATGATTTTTTATGGGTCTACAATTTAGTTTGTTAATTCAAAACCGACTAATAAATTACCGATGGAATTTTTTTTCTTTTCAAAA
Coding sequence (CDS)
ATGGCTGCAGATCCAGAAGGAAGAGAGTTCAAGTTTTACTCTCAGTTTCAGACTGAGCATGGAGATAAACAAAGCCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGCTCTGATTCATTTCTCAGCTTCAGCTCGCCAGTGGAGTCTGAGATTGGTTCCTATGAAATTGAGAGCGAAAAAGACGAAGGCGACGATGGTTATATAGCGGAATTGAGTCACCAGATGGCTCAATACATGCTTCAAGATGATGATAACTCCTCCACTGCAAGCTTTCAATCTGAGATTCAGAACAAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACCCTGTGGTCACCTCTAGGCTCGAGCAATGGAAGCAGCCATGGAAGCCCAGAAGGGCCTTCGAAGGAGCCATCGCCTCCATCGACACCGGTGGTTGCAGAGCGTGGGGGGCTGGACATATCACAGAACGTTTTCAGCCAGTTGGAGAAGATGAAAAAAGTGAGCACAAATGGTAAATCAATCCAAACAAGCCCCCAAGATGGAGAAACTGATTCCTCTTCTTCTTTCAAAGATCAAGCTAGAGCTCTTAAAAATCAGAAGCGGAAGCAGAACCAGCAGTTTATGAAGCAAAAAGGCTCAGCCACCATACAAGCCAAGCAAGCTCAAGGAAGCTCATCACAAGGCAATTCAGGGGCAAAACCAGGAGGATCATCAGGGACTGGCGTGTTCTTGCCCCGCCATGTGAACTATAACCGTCCAGCCCCATCTCCTCAGCCACCACAGCCGTCCAAGAAAAAGGGTAGATTCCACTTTCATTCCATAGCTAGCTAA
Protein sequence
MAADPEGREFKFYSQFQTEHGDKQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIESEKDEGDDGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPLGSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQDGETDSSSSFKDQARALKNQKRKQNQQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSGTGVFLPRHVNYNRPAPSPQPPQPSKKKGRFHFHSIAS
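The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first nine and last four codons of the CDS are used here to keep the example short.

```python
# Illustrative sketch: translate a CDS using the standard genetic code.
# Codon order follows the conventional T, C, A, G layout of the code table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # "X" marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last four codons of the CDS listed above.
cds_start = "ATGGCTGCAGATCCAGAAGGAAGAGAG"
cds_end = "ATAGCTAGCTAA"

print(translate(cds_start))  # MAADPEGRE
print(translate(cds_end))    # IAS
```

Both fragments agree with the start (`MAADPEGRE`) and end (`IAS`) of the protein sequence; passing the full CDS string to `translate` would be the complete check.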
Homology
BLAST of Tan0018024 vs. NCBI nr
Match:
XP_038895137.1 (uncharacterized protein LOC120083444 isoform X3 [Benincasa hispida])
HSP 1 Score: 412.9 bits (1060), Expect = 2.1e-111
Identity = 228/271 (84.13%), Positives = 245/271 (90.41%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAAD EG+E KF SQFQT+HGDK QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE
Sbjct: 1 MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
Query: 61 SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S++D+G++G Y AELS +MAQYM QDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENGSDDYTAELSRRMAQYMFQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSS GSSHGSPEGPSKEPSPPSTPVVAERGGLDIS+NVF++LEKMKKVSTNGKSIQTSPQ
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISRNVFNKLEKMKKVSTNGKSIQTSPQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
GET+SSSS K+Q+R KNQ+R+QN QQF+KQKGSA IQAKQAQGSS Q NSGAK GG
Sbjct: 181 IGETESSSS-KNQSRTSKNQERRQNQQQQQFIKQKGSAAIQAKQAQGSSLQANSGAKSGG 240
Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
SSGTGVFLPRHVNYNRPAP QPPQP KKKG
Sbjct: 241 SSGTGVFLPRHVNYNRPAPCSQPPQPPKKKG 270
BLAST of Tan0018024 vs. NCBI nr
Match:
XP_038895135.1 (uncharacterized protein LOC120083444 isoform X1 [Benincasa hispida])
HSP 1 Score: 412.9 bits (1060), Expect = 2.1e-111
Identity = 228/271 (84.13%), Positives = 245/271 (90.41%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAAD EG+E KF SQFQT+HGDK QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE
Sbjct: 1 MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
Query: 61 SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S++D+G++G Y AELS +MAQYM QDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENGSDDYTAELSRRMAQYMFQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSS GSSHGSPEGPSKEPSPPSTPVVAERGGLDIS+NVF++LEKMKKVSTNGKSIQTSPQ
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISRNVFNKLEKMKKVSTNGKSIQTSPQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
GET+SSSS K+Q+R KNQ+R+QN QQF+KQKGSA IQAKQAQGSS Q NSGAK GG
Sbjct: 181 IGETESSSS-KNQSRTSKNQERRQNQQQQQFIKQKGSAAIQAKQAQGSSLQANSGAKSGG 240
Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
SSGTGVFLPRHVNYNRPAP QPPQP KKKG
Sbjct: 241 SSGTGVFLPRHVNYNRPAPCSQPPQPPKKKG 270
BLAST of Tan0018024 vs. NCBI nr
Match:
XP_038895136.1 (uncharacterized protein LOC120083444 isoform X2 [Benincasa hispida])
HSP 1 Score: 412.9 bits (1060), Expect = 2.1e-111
Identity = 228/271 (84.13%), Positives = 245/271 (90.41%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAAD EG+E KF SQFQT+HGDK QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE
Sbjct: 1 MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
Query: 61 SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S++D+G++G Y AELS +MAQYM QDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENGSDDYTAELSRRMAQYMFQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSS GSSHGSPEGPSKEPSPPSTPVVAERGGLDIS+NVF++LEKMKKVSTNGKSIQTSPQ
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISRNVFNKLEKMKKVSTNGKSIQTSPQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
GET+SSSS K+Q+R KNQ+R+QN QQF+KQKGSA IQAKQAQGSS Q NSGAK GG
Sbjct: 181 IGETESSSS-KNQSRTSKNQERRQNQQQQQFIKQKGSAAIQAKQAQGSSLQANSGAKSGG 240
Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
SSGTGVFLPRHVNYNRPAP QPPQP KKKG
Sbjct: 241 SSGTGVFLPRHVNYNRPAPCSQPPQPPKKKG 270
BLAST of Tan0018024 vs. NCBI nr
Match:
KAG7036954.1 (hypothetical protein SDJN02_00574 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 400.2 bits (1027), Expect = 1.4e-107
Identity = 220/268 (82.09%), Positives = 233/268 (86.94%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGD-KQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MA DPE EFKFY+QFQTEHGD KQSPFEGDNWSSYF SPVESEIGS+EIE
Sbjct: 1 MAVDPEAGEFKFYTQFQTEHGDNKQSPFEGDNWSSYF----------SPVESEIGSFEIE 60
Query: 61 SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S+KD+GD G Y AELS +MAQ+MLQDDDNSSTA FQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDKDDGDGGDEDYTAELSRRMAQFMLQDDDNSSTARFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSSNGSS+GSPEGPSKEPSPP+TP+VAER GLDISQNVF++LEKMKKVSTNGKSIQT PQ
Sbjct: 121 GSSNGSSYGSPEGPSKEPSPPTTPMVAERRGLDISQNVFTKLEKMKKVSTNGKSIQTRPQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQNQQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSG 240
D E SSSS KDQ RALKNQ+RKQNQQF+KQKGSA I AKQAQGSSSQGNSG KPGG SG
Sbjct: 181 DEENGSSSSSKDQTRALKNQRRKQNQQFIKQKGSAAIMAKQAQGSSSQGNSGGKPGGLSG 240
Query: 241 TGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
TGVFLPRHVNYNR APSPQPPQP KK G
Sbjct: 241 TGVFLPRHVNYNRSAPSPQPPQPPKKTG 258
BLAST of Tan0018024 vs. NCBI nr
Match:
XP_022948629.1 (uncharacterized protein LOC111452251 [Cucurbita moschata] >KAG6607276.1 hypothetical protein SDJN03_00618, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 400.2 bits (1027), Expect = 1.4e-107
Identity = 220/268 (82.09%), Positives = 233/268 (86.94%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGD-KQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MA DPE EFKFY+QFQTEHGD KQSPFEGDNWSSYF SPVESEIGS+EIE
Sbjct: 1 MAVDPEAGEFKFYTQFQTEHGDNKQSPFEGDNWSSYF----------SPVESEIGSFEIE 60
Query: 61 SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S+KD+GD G Y AELS +MAQ+MLQDDDNSSTA FQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDKDDGDGGDEDYTAELSRRMAQFMLQDDDNSSTARFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSSNGSS+GSPEGPSKEPSPP+TP+VAER GLDISQNVF++LEKMKKVSTNGKSIQT PQ
Sbjct: 121 GSSNGSSYGSPEGPSKEPSPPTTPMVAERRGLDISQNVFTKLEKMKKVSTNGKSIQTRPQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQNQQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSG 240
D E SSSS KDQ RALKNQ+RKQNQQF+KQKGSA I AKQAQGSSSQGNSG KPGG SG
Sbjct: 181 DEENGSSSSSKDQTRALKNQRRKQNQQFIKQKGSAAIMAKQAQGSSSQGNSGGKPGGLSG 240
Query: 241 TGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
TGVFLPRHVNYNR APSPQPPQP KK G
Sbjct: 241 TGVFLPRHVNYNRSAPSPQPPQPPKKTG 258
BLAST of Tan0018024 vs. ExPASy TrEMBL
Match:
A0A6J1GAG3 (uncharacterized protein LOC111452251 OS=Cucurbita moschata OX=3662 GN=LOC111452251 PE=4 SV=1)
HSP 1 Score: 400.2 bits (1027), Expect = 6.9e-108
Identity = 220/268 (82.09%), Positives = 233/268 (86.94%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGD-KQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MA DPE EFKFY+QFQTEHGD KQSPFEGDNWSSYF SPVESEIGS+EIE
Sbjct: 1 MAVDPEAGEFKFYTQFQTEHGDNKQSPFEGDNWSSYF----------SPVESEIGSFEIE 60
Query: 61 SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S+KD+GD G Y AELS +MAQ+MLQDDDNSSTA FQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDKDDGDGGDEDYTAELSRRMAQFMLQDDDNSSTARFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSSNGSS+GSPEGPSKEPSPP+TP+VAER GLDISQNVF++LEKMKKVSTNGKSIQT PQ
Sbjct: 121 GSSNGSSYGSPEGPSKEPSPPTTPMVAERRGLDISQNVFTKLEKMKKVSTNGKSIQTRPQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQNQQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSG 240
D E SSSS KDQ RALKNQ+RKQNQQF+KQKGSA I AKQAQGSSSQGNSG KPGG SG
Sbjct: 181 DEENGSSSSSKDQTRALKNQRRKQNQQFIKQKGSAAIMAKQAQGSSSQGNSGGKPGGLSG 240
Query: 241 TGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
TGVFLPRHVNYNR APSPQPPQP KK G
Sbjct: 241 TGVFLPRHVNYNRSAPSPQPPQPPKKTG 258
BLAST of Tan0018024 vs. ExPASy TrEMBL
Match:
A0A5D3BEB2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001750 PE=4 SV=1)
HSP 1 Score: 397.5 bits (1020), Expect = 4.5e-107
Identity = 223/271 (82.29%), Positives = 236/271 (87.08%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAAD EGRE KF S+FQ EHGDK Q+PFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SEKDEGD---DGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S++D+G+ D Y AELS +MAQYMLQDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSS GSSHGSPEGPSKEPSPPSTPVV ERGGLDIS NVFS+LEKMKKVS +GKSIQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
GET SSSS KDQ+R KNQKR+QN QQFMKQKGS IQ KQAQGSS Q NSGAK GG
Sbjct: 181 IGETGSSSS-KDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGG 240
Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
SGTGVFLPRHVNYNRPAP PQPPQP KKKG
Sbjct: 241 PSGTGVFLPRHVNYNRPAPCPQPPQPPKKKG 270
BLAST of Tan0018024 vs. ExPASy TrEMBL
Match:
A0A1S3C665 (uncharacterized protein LOC103497120 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497120 PE=4 SV=1)
HSP 1 Score: 397.5 bits (1020), Expect = 4.5e-107
Identity = 223/271 (82.29%), Positives = 236/271 (87.08%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAAD EGRE KF S+FQ EHGDK Q+PFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SEKDEGD---DGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S++D+G+ D Y AELS +MAQYMLQDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSS GSSHGSPEGPSKEPSPPSTPVV ERGGLDIS NVFS+LEKMKKVS +GKSIQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
GET SSSS KDQ+R KNQKR+QN QQFMKQKGS IQ KQAQGSS Q NSGAK GG
Sbjct: 181 IGETGSSSS-KDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGG 240
Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
SGTGVFLPRHVNYNRPAP PQPPQP KKKG
Sbjct: 241 PSGTGVFLPRHVNYNRPAPCPQPPQPPKKKG 270
BLAST of Tan0018024 vs. ExPASy TrEMBL
Match:
A0A5A7SMB3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00070 PE=4 SV=1)
HSP 1 Score: 397.1 bits (1019), Expect = 5.8e-107
Identity = 223/271 (82.29%), Positives = 235/271 (86.72%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAAD EGRE KF S+FQ EHGDK Q+PFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SEKDEGD---DGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S++D+G+ D Y AELS +MAQYMLQDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSS GSSHGSPEGPSKEPSPPSTPVV ERGGLDIS NVFS+LEKMKKVS N KSIQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSINSKSIQTSTQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
GET SSSS KDQ+R KNQKR+QN QQFMKQKGS IQ KQAQGSS Q NSGAK GG
Sbjct: 181 IGETGSSSS-KDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGG 240
Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
SGTGVFLPRHVNYNRPAP PQPPQP KKKG
Sbjct: 241 PSGTGVFLPRHVNYNRPAPCPQPPQPPKKKG 270
BLAST of Tan0018024 vs. ExPASy TrEMBL
Match:
A0A1S3C6T0 (uncharacterized protein LOC103497120 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497120 PE=4 SV=1)
HSP 1 Score: 395.2 bits (1014), Expect = 2.2e-106
Identity = 222/270 (82.22%), Positives = 235/270 (87.04%), Query Frame = 0
Query: 1 MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
MAAD EGRE KF S+FQ EHGDK Q+PFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1 MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60
Query: 61 SEKDEGD---DGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
S++D+G+ D Y AELS +MAQYMLQDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61 SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120
Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
GSS GSSHGSPEGPSKEPSPPSTPVV ERGGLDIS NVFS+LEKMKKVS +GKSIQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180
Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
GET SSSS KDQ+R KNQKR+QN QQFMKQKGS IQ KQAQGSS Q NSGAK GG
Sbjct: 181 IGETGSSSS-KDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGG 240
Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKK 264
SGTGVFLPRHVNYNRPAP PQPPQP KKK
Sbjct: 241 PSGTGVFLPRHVNYNRPAPCPQPPQPPKKK 269
BLAST of Tan0018024 vs. TAIR 10
Match:
AT5G59050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G54000.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 62.4 bits (150), Expect = 6.6e-10
Identity = 74/222 (33.33%), Positives = 98/222 (44.14%), Query Frame = 0
Query: 39 SDSFLSFSSP---------VESEIGSYEIESEK---DEGDDGYIAELSHQMAQYMLQDDD 98
S+ F SFS P + + S E +S K ++ +D YI EL+ QM YMLQDD
Sbjct: 13 SNPFTSFSEPTFFTPTTSSLRPDFVSDEPDSPKAKNEDEEDEYITELTRQMTNYMLQDD- 72
Query: 99 NSSTASFQSEIQNKSWGL-SGSPISTLWSPLGSSNGSSHGSPEGPSKEPSPPSTPVVAE- 158
E KS G SGSP STLWSP S SP GPS+EPSPP TP
Sbjct: 73 ---------EKHQKSCGSGSGSPQSTLWSPF----ASGLSSPIGPSREPSPPLTPATVPV 132
Query: 159 ---RGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQDGETDSSSSFKDQARALKNQKRKQN 218
+D K + +SIQ + Q + + + A L ++ R +
Sbjct: 133 EKIMTKIDTKPVTIPFQSKQALIDDQIRSIQANFQKIKKEKEKERQRNADVLGHKARNYH 192
Query: 219 QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSGTGVFLPR 244
Q+ + ++A GS S+ GS GTGVFLPR
Sbjct: 193 HLHQNQRPRSGVKAVFVDGSGSR-------TGSGGTGVFLPR 213
BLAST of Tan0018024 vs. TAIR 10
Match:
AT5G59050.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 58.5 bits (140), Expect = 9.5e-09
Identity = 50/115 (43.48%), Positives = 59/115 (51.30%), Query Frame = 0
Query: 39 SDSFLSFSSP---------VESEIGSYEIESEK---DEGDDGYIAELSHQMAQYMLQDDD 98
S+ F SFS P + + S E +S K ++ +D YI EL+ QM YMLQDD
Sbjct: 13 SNPFTSFSEPTFFTPTTSSLRPDFVSDEPDSPKAKNEDEEDEYITELTRQMTNYMLQDD- 72
Query: 99 NSSTASFQSEIQNKSWGL-SGSPISTLWSPLGSSNGSSHGSPEGPSKEPSPPSTP 141
E KS G SGSP STLWSP S SP GPS+EPSPP TP
Sbjct: 73 ---------EKHQKSCGSGSGSPQSTLWSPF----ASGLSSPIGPSREPSPPLTP 113
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038895137.1 | 2.1e-111 | 84.13 | uncharacterized protein LOC120083444 isoform X3 [Benincasa hispida]
XP_038895135.1 | 2.1e-111 | 84.13 | uncharacterized protein LOC120083444 isoform X1 [Benincasa hispida]
XP_038895136.1 | 2.1e-111 | 84.13 | uncharacterized protein LOC120083444 isoform X2 [Benincasa hispida]
KAG7036954.1 | 1.4e-107 | 82.09 | hypothetical protein SDJN02_00574 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022948629.1 | 1.4e-107 | 82.09 | uncharacterized protein LOC111452251 [Cucurbita moschata] >KAG6607276.1 hypothet...
Match Name | E-value | Identity | Description
A0A6J1GAG3 | 6.9e-108 | 82.09 | uncharacterized protein LOC111452251 OS=Cucurbita moschata OX=3662 GN=LOC1114522...
A0A5D3BEB2 | 4.5e-107 | 82.29 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3C665 | 4.5e-107 | 82.29 | uncharacterized protein LOC103497120 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7SMB3 | 5.8e-107 | 82.29 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3C6T0 | 2.2e-106 | 82.22 | uncharacterized protein LOC103497120 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
Match Name | E-value | Identity | Description
AT5G59050.1 | 6.6e-10 | 33.33 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G59050.2 | 9.5e-09 | 43.48 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...