Tan0018024 (gene) Snake gourd v1

Overview
Name: Tan0018024
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Adenine nucleotide alpha hydrolases-like superfamily protein
Location: LG01: 29196212 .. 29198903 (-)
RNA-Seq Expression: Tan0018024
Synteny: Tan0018024
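
The Location field packs the linkage group, start, end, and strand into one string. A minimal Python sketch for pulling those parts out is shown here. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'SEQID: START .. END (STRAND)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("LG01: 29196212 .. 29198903 (-)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 2,692 positions on the minus strand of LG01.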
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAGGGAACCAAAAGCCTCATCTGTTGATAGGTATTGGATATCGTTGGTTTCTCTTACCCTTGTATAAAAAACAAGAAAAATCGAACCACAGATCTCTGAGTTCTCAAAACTCAATCTCTGAACGAACCCCACGGCGGCGCTCGGCAGCCGGCCACCGGCGGACAAACCCCCCTTCCCAAAAGGCTGTTCTGTAACTGAAACCTTAAAAAAAATTCGTTGGAGTCCTTCATGGACTACTCTGATCTCTCTGTTTTTAGGTCCCTCCACTTCTTACTCTATAAAGTCCAAGCCGACGGCGCATAAATTTCCAATTTTCAGTCTCTTATTCTCTCTCTTTTTCTCTCTCTTTCTTATTGTCTTTGTGTCAGCACCAAGGGTCATGGCTGCAGATCCAGAAGGAAGAGAGTTCAAGTTTTACTCTCAGTTTCAGACTGAGCATGGAGATAAACAAAGCCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGCTCTGATTCATTTCTCAGCTTCAGCTCGCCAGTGGAGTCTGAGATTGGTTCCTATGAAATTGAGAGCGAAAAAGACGAAGGCGACGATGGTTATATAGCGGAATTGAGTCACCAGATGGCTCAATACATGCTTCAAGATGATGATAACTCCTCCACTGCAAGCTTTCAATCTGAGATTCAGAACAAGGTATTGAATTTGTTCAGGCGTGGAAGGTTTCATCAAGGTGTTTGTGAAATTCTTTGTTGCTCCCTTTTTTTGATATTGTGGGTGTGTTCTGTTTCTTGTATATAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACCCTGTGGTCACCTCTAGGCTCGAGCAATGGAAGCAGCCATGGAAGCCCAGAAGGGCCTTCGAAGGAGCCATCGCCTCCATCGACACCGGTGGTTGCAGAGCGTGGGGGGCTGGACATATCACAGAACGTTTTCAGCCAGTTGGAGAAGATGAAAAAAGTGAGCACAAATGGTAAATCAATCCAAACAAGCCCCCAAGATGGAGAAACTGATTCCTCTTCTTCTTTCAAAGATCAAGCTAGAGCTCTTAAAGTGAGATGAGATTTAAATCCCACAAATCCCATAAATCTCTCTGTTTCTTACTTTTTTTTACCTTCACAAATCTCTTTCTCCCCTTTTTGCCGATTCAAAACTCGACCCTCTTTGCAGAATCAGAAGCGGAAGCAGAACCAGCAGTTTATGAAGCAAAAAGGCTCAGCCACCATACAAGCCAAGCAAGCTCAAGGAAGCTCATCACAAGGCAATTCAGGGGCAAAACCAGGAGGATCATCAGGGACTGGCGTGTTCTTGCCCCGCCATGTGAACTATAACCGTCCAGCCCCATCTCCTCAGCCACCACAGCCGTCCAAGAAAAAGGGTAGATTCCACTTTCATTCCATAGCTAGCTAATTTTTACATCGTTTTCGTAACGGGCTAACTAGCTAACCCCATTAGTTGGGATTTTTCCATCAATGAATTGCCATTTCAACCATTCCTCGTGATGCATCTTGGACTTTTGCTCATACCCCTCTGTTATATTTTTCTTTTTTTTTTCTCTTTCTCTTGATTCATAGAGATTCCGTGTCTTTCCAAATACCCTCTGAATCTATCAGACTTTCAACCATTCACGGTTGGCCCAACCAGATTTGTCCGCTTCTTTGATGGAATCTCCATCCATCTCTTTAAAACCTGGCTGTTCTGTGGTCAAAAGGAGGTGAATGAAAACGACACCCAATTGAATCTTTAAACCACTTGCATAGCCCACTTCAGCTTAATATTTTAATGGGCCGATAGATGCGTTTGTCTTCCTCTACAAGGAAAAAACACTAACGAATTTACCCTTCCCCCCTTTCCCCATGTCAGGCTGCTCCACTGTTCTAATACCCGTGAGAGTCCTACAAGCCTTACAACTTCACTACGACAGAATGGACGACGAGACAAGACAAAAAATCACTGGCTTCACAGCACTTAGAGGTAATTTTACAAT
TTCCATCACAAAATTCTCTGTGGCATTTCCACAAACAGTTGGTGGTCATTCCACAAAAATTGCATAACTTACTCCCTTCATGTCTCTATTACCCCCTTCTTGTCTCTATTCGATCTGGGTGATTCTATTTTCATCTAACAAAAGATTCCTATTTGCTTTTACGAACAGAAGCAGCAGCTGGTGTGAGAACAGCTACACATACCGTTGAGAAAAGTCATCCGGGAGCGTCGGCGGCGGCGGCGACAAGTCAAACCGACGTGGGTCTTCCTCAAGAATGGACGTATTAATGGCTTTCGACAACTGGCGAGGCGCCATTTCGTCTCTCGATCACAAATCTGAAGCTTTTTGGCGGTTTTCTGAGAGAACATGGAGGTGATGTTAATGGAATGTGAATAAAAGCCGCAAGTGGGCGGAGAAGAAGACGAAATAAAATTAGGGGTTGTGAAAACGACGTCGTCTATTTTTACTGTCTATTAGTGGCAAAAACATAAATAATGTTTGGGAAATTAATATTGTTAGGGAGGAAAATGGGATTTGTTTAGATGAAATCCTTTTGTAATAATTTTCAAACAAGGTTGTACCTCTTCAATAATTGTTTAAATAATTTAATGATTTTTTATGGGTCTACAATTTAGTTTGTTAATTCAAAACCGACTAATAAATTACCGATGGAATTTTTTTTCTTTTCAAAA

mRNA sequence

GAGGGAACCAAAAGCCTCATCTGTTGATAGGTATTGGATATCGTTGGTTTCTCTTACCCTTGTATAAAAAACAAGAAAAATCGAACCACAGATCTCTGAGTTCTCAAAACTCAATCTCTGAACGAACCCCACGGCGGCGCTCGGCAGCCGGCCACCGGCGGACAAACCCCCCTTCCCAAAAGGCTGTTCTGTAACTGAAACCTTAAAAAAAATTCGTTGGAGTCCTTCATGGACTACTCTGATCTCTCTGTTTTTAGGTCCCTCCACTTCTTACTCTATAAAGTCCAAGCCGACGGCGCATAAATTTCCAATTTTCAGTCTCTTATTCTCTCTCTTTTTCTCTCTCTTTCTTATTGTCTTTGTGTCAGCACCAAGGGTCATGGCTGCAGATCCAGAAGGAAGAGAGTTCAAGTTTTACTCTCAGTTTCAGACTGAGCATGGAGATAAACAAAGCCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGCTCTGATTCATTTCTCAGCTTCAGCTCGCCAGTGGAGTCTGAGATTGGTTCCTATGAAATTGAGAGCGAAAAAGACGAAGGCGACGATGGTTATATAGCGGAATTGAGTCACCAGATGGCTCAATACATGCTTCAAGATGATGATAACTCCTCCACTGCAAGCTTTCAATCTGAGATTCAGAACAAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACCCTGTGGTCACCTCTAGGCTCGAGCAATGGAAGCAGCCATGGAAGCCCAGAAGGGCCTTCGAAGGAGCCATCGCCTCCATCGACACCGGTGGTTGCAGAGCGTGGGGGGCTGGACATATCACAGAACGTTTTCAGCCAGTTGGAGAAGATGAAAAAAGTGAGCACAAATGGTAAATCAATCCAAACAAGCCCCCAAGATGGAGAAACTGATTCCTCTTCTTCTTTCAAAGATCAAGCTAGAGCTCTTAAAAATCAGAAGCGGAAGCAGAACCAGCAGTTTATGAAGCAAAAAGGCTCAGCCACCATACAAGCCAAGCAAGCTCAAGGAAGCTCATCACAAGGCAATTCAGGGGCAAAACCAGGAGGATCATCAGGGACTGGCGTGTTCTTGCCCCGCCATGTGAACTATAACCGTCCAGCCCCATCTCCTCAGCCACCACAGCCGTCCAAGAAAAAGGGTAGATTCCACTTTCATTCCATAGCTAGCTAATTTTTACATCGTTTTCGTAACGGGCTAACTAGCTAACCCCATTAGTTGGGATTTTTCCATCAATGAATTGCCATTTCAACCATTCCTCGTGATGCATCTTGGACTTTTGCTCATACCCCTCTGTTATATTTTTCTTTTTTTTTTCTCTTTCTCTTGATTCATAGAGATTCCGTGTCTTTCCAAATACCCTCTGAATCTATCAGACTTTCAACCATTCACGGTTGGCCCAACCAGATTTGTCCGCTTCTTTGATGGAATCTCCATCCATCTCTTTAAAACCTGGCTGTTCTGTGGTCAAAAGGAGGTGAATGAAAACGACACCCAATTGAATCTTTAAACCACTTGCATAGCCCACTTCAGCTTAATATTTTAATGGGCCGATAGATGCGTTTGTCTTCCTCTACAAGGAAAAAACACTAACGAATTTACCCTTCCCCCCTTTCCCCATGTCAGGCTGCTCCACTGTTCTAATACCCGTGAGAGTCCTACAAGCCTTACAACTTCACTACGACAGAATGGACGACGAGACAAGACAAAAAATCACTGGCTTCACAGCACTTAGAGAAGCAGCAGCTGGTGTGAGAACAGCTACACATACCGTTGAGAAAAGTCATCCGGGAGCGTCGGCGGCGGCGGCGACAAGTCAAACCGACGTGGGTCTTCCTCAAGAATGGACGTATTAATGGCTTTCGACAACTGGCGAGGCGCCATTTCGTCTCTCGATCACAAATCTGAAGCTTTTTGGCGGTTTTCTGAGAGAACATGGAGGTGATGTTAATGGAATGTGAATAAAAGCCGC
AAGTGGGCGGAGAAGAAGACGAAATAAAATTAGGGGTTGTGAAAACGACGTCGTCTATTTTTACTGTCTATTAGTGGCAAAAACATAAATAATGTTTGGGAAATTAATATTGTTAGGGAGGAAAATGGGATTTGTTTAGATGAAATCCTTTTGTAATAATTTTCAAACAAGGTTGTACCTCTTCAATAATTGTTTAAATAATTTAATGATTTTTTATGGGTCTACAATTTAGTTTGTTAATTCAAAACCGACTAATAAATTACCGATGGAATTTTTTTTCTTTTCAAAA

Coding sequence (CDS)

ATGGCTGCAGATCCAGAAGGAAGAGAGTTCAAGTTTTACTCTCAGTTTCAGACTGAGCATGGAGATAAACAAAGCCCCTTTGAAGGAGACAATTGGTCGAGCTATTTTGGACGCTCTGATTCATTTCTCAGCTTCAGCTCGCCAGTGGAGTCTGAGATTGGTTCCTATGAAATTGAGAGCGAAAAAGACGAAGGCGACGATGGTTATATAGCGGAATTGAGTCACCAGATGGCTCAATACATGCTTCAAGATGATGATAACTCCTCCACTGCAAGCTTTCAATCTGAGATTCAGAACAAGTCATGGGGTTTGTCTGGTTCGCCAATTTCAACCCTGTGGTCACCTCTAGGCTCGAGCAATGGAAGCAGCCATGGAAGCCCAGAAGGGCCTTCGAAGGAGCCATCGCCTCCATCGACACCGGTGGTTGCAGAGCGTGGGGGGCTGGACATATCACAGAACGTTTTCAGCCAGTTGGAGAAGATGAAAAAAGTGAGCACAAATGGTAAATCAATCCAAACAAGCCCCCAAGATGGAGAAACTGATTCCTCTTCTTCTTTCAAAGATCAAGCTAGAGCTCTTAAAAATCAGAAGCGGAAGCAGAACCAGCAGTTTATGAAGCAAAAAGGCTCAGCCACCATACAAGCCAAGCAAGCTCAAGGAAGCTCATCACAAGGCAATTCAGGGGCAAAACCAGGAGGATCATCAGGGACTGGCGTGTTCTTGCCCCGCCATGTGAACTATAACCGTCCAGCCCCATCTCCTCAGCCACCACAGCCGTCCAAGAAAAAGGGTAGATTCCACTTTCATTCCATAGCTAGCTAA

Protein sequence

MAADPEGREFKFYSQFQTEHGDKQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIESEKDEGDDGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPLGSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQDGETDSSSSFKDQARALKNQKRKQNQQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSGTGVFLPRHVNYNRPAPSPQPPQPSKKKGRFHFHSIAS
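
The protein sequence should be the translation of the coding sequence above it. The following Python sketch shows one way to check that. It assumes the standard genetic code (the page does not name a translation table), and the names `CODON_TABLE` and `translate` are introduced here for illustration only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGGCTGCAGATCCAGAAGGAAGAGAG"))  # MAADPEGRE
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.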
Homology
BLAST of Tan0018024 vs. NCBI nr
Match: XP_038895137.1 (uncharacterized protein LOC120083444 isoform X3 [Benincasa hispida])

HSP 1 Score: 412.9 bits (1060), Expect = 2.1e-111
Identity = 228/271 (84.13%), Positives = 245/271 (90.41%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MAAD EG+E KF SQFQT+HGDK QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE
Sbjct: 1   MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60

Query: 61  SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S++D+G++G   Y AELS +MAQYM QDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDRDDGENGSDDYTAELSRRMAQYMFQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSS GSSHGSPEGPSKEPSPPSTPVVAERGGLDIS+NVF++LEKMKKVSTNGKSIQTSPQ
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISRNVFNKLEKMKKVSTNGKSIQTSPQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
            GET+SSSS K+Q+R  KNQ+R+QN   QQF+KQKGSA IQAKQAQGSS Q NSGAK GG
Sbjct: 181 IGETESSSS-KNQSRTSKNQERRQNQQQQQFIKQKGSAAIQAKQAQGSSLQANSGAKSGG 240

Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
           SSGTGVFLPRHVNYNRPAP  QPPQP KKKG
Sbjct: 241 SSGTGVFLPRHVNYNRPAPCSQPPQPPKKKG 270
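
The percentages in an HSP summary line are the match counts divided by the alignment length, including gap columns. The Python sketch below recomputes them from the summary line of the hit above. The helper name `check_hsp_line` and the regular expression are assumptions made for this illustration and are not part of any BLAST tool.

```python
import re

# Matches fields of the form "Label = matches/length (percent%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line):
    """Return {label: (recomputed_percent, reported_percent)} for each field."""
    results = {}
    for label, matches, length, reported in FIELD.findall(line):
        computed = round(100.0 * int(matches) / int(length), 2)
        results[label] = (computed, float(reported))
    return results

line = "Identity = 228/271 (84.13%), Positives = 245/271 (90.41%), Query Frame = 0"
for label, (computed, reported) in check_hsp_line(line).items():
    print(label, computed, reported)
```

The denominator, 271, is longer than the aligned query span (residues 1 to 265) because gap columns in the alignment count toward the alignment length.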

BLAST of Tan0018024 vs. NCBI nr
Match: XP_038895135.1 (uncharacterized protein LOC120083444 isoform X1 [Benincasa hispida])

HSP 1 Score: 412.9 bits (1060), Expect = 2.1e-111
Identity = 228/271 (84.13%), Positives = 245/271 (90.41%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MAAD EG+E KF SQFQT+HGDK QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE
Sbjct: 1   MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60

Query: 61  SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S++D+G++G   Y AELS +MAQYM QDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDRDDGENGSDDYTAELSRRMAQYMFQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSS GSSHGSPEGPSKEPSPPSTPVVAERGGLDIS+NVF++LEKMKKVSTNGKSIQTSPQ
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISRNVFNKLEKMKKVSTNGKSIQTSPQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
            GET+SSSS K+Q+R  KNQ+R+QN   QQF+KQKGSA IQAKQAQGSS Q NSGAK GG
Sbjct: 181 IGETESSSS-KNQSRTSKNQERRQNQQQQQFIKQKGSAAIQAKQAQGSSLQANSGAKSGG 240

Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
           SSGTGVFLPRHVNYNRPAP  QPPQP KKKG
Sbjct: 241 SSGTGVFLPRHVNYNRPAPCSQPPQPPKKKG 270

BLAST of Tan0018024 vs. NCBI nr
Match: XP_038895136.1 (uncharacterized protein LOC120083444 isoform X2 [Benincasa hispida])

HSP 1 Score: 412.9 bits (1060), Expect = 2.1e-111
Identity = 228/271 (84.13%), Positives = 245/271 (90.41%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MAAD EG+E KF SQFQT+HGDK QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE
Sbjct: 1   MAADQEGKELKFNSQFQTDHGDKQQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60

Query: 61  SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S++D+G++G   Y AELS +MAQYM QDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDRDDGENGSDDYTAELSRRMAQYMFQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSS GSSHGSPEGPSKEPSPPSTPVVAERGGLDIS+NVF++LEKMKKVSTNGKSIQTSPQ
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVAERGGLDISRNVFNKLEKMKKVSTNGKSIQTSPQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
            GET+SSSS K+Q+R  KNQ+R+QN   QQF+KQKGSA IQAKQAQGSS Q NSGAK GG
Sbjct: 181 IGETESSSS-KNQSRTSKNQERRQNQQQQQFIKQKGSAAIQAKQAQGSSLQANSGAKSGG 240

Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
           SSGTGVFLPRHVNYNRPAP  QPPQP KKKG
Sbjct: 241 SSGTGVFLPRHVNYNRPAPCSQPPQPPKKKG 270

BLAST of Tan0018024 vs. NCBI nr
Match: KAG7036954.1 (hypothetical protein SDJN02_00574 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 400.2 bits (1027), Expect = 1.4e-107
Identity = 220/268 (82.09%), Positives = 233/268 (86.94%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGD-KQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MA DPE  EFKFY+QFQTEHGD KQSPFEGDNWSSYF          SPVESEIGS+EIE
Sbjct: 1   MAVDPEAGEFKFYTQFQTEHGDNKQSPFEGDNWSSYF----------SPVESEIGSFEIE 60

Query: 61  SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S+KD+GD G   Y AELS +MAQ+MLQDDDNSSTA FQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDKDDGDGGDEDYTAELSRRMAQFMLQDDDNSSTARFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSSNGSS+GSPEGPSKEPSPP+TP+VAER GLDISQNVF++LEKMKKVSTNGKSIQT PQ
Sbjct: 121 GSSNGSSYGSPEGPSKEPSPPTTPMVAERRGLDISQNVFTKLEKMKKVSTNGKSIQTRPQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQNQQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSG 240
           D E  SSSS KDQ RALKNQ+RKQNQQF+KQKGSA I AKQAQGSSSQGNSG KPGG SG
Sbjct: 181 DEENGSSSSSKDQTRALKNQRRKQNQQFIKQKGSAAIMAKQAQGSSSQGNSGGKPGGLSG 240

Query: 241 TGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
           TGVFLPRHVNYNR APSPQPPQP KK G
Sbjct: 241 TGVFLPRHVNYNRSAPSPQPPQPPKKTG 258

BLAST of Tan0018024 vs. NCBI nr
Match: XP_022948629.1 (uncharacterized protein LOC111452251 [Cucurbita moschata] >KAG6607276.1 hypothetical protein SDJN03_00618, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 400.2 bits (1027), Expect = 1.4e-107
Identity = 220/268 (82.09%), Positives = 233/268 (86.94%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGD-KQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MA DPE  EFKFY+QFQTEHGD KQSPFEGDNWSSYF          SPVESEIGS+EIE
Sbjct: 1   MAVDPEAGEFKFYTQFQTEHGDNKQSPFEGDNWSSYF----------SPVESEIGSFEIE 60

Query: 61  SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S+KD+GD G   Y AELS +MAQ+MLQDDDNSSTA FQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDKDDGDGGDEDYTAELSRRMAQFMLQDDDNSSTARFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSSNGSS+GSPEGPSKEPSPP+TP+VAER GLDISQNVF++LEKMKKVSTNGKSIQT PQ
Sbjct: 121 GSSNGSSYGSPEGPSKEPSPPTTPMVAERRGLDISQNVFTKLEKMKKVSTNGKSIQTRPQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQNQQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSG 240
           D E  SSSS KDQ RALKNQ+RKQNQQF+KQKGSA I AKQAQGSSSQGNSG KPGG SG
Sbjct: 181 DEENGSSSSSKDQTRALKNQRRKQNQQFIKQKGSAAIMAKQAQGSSSQGNSGGKPGGLSG 240

Query: 241 TGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
           TGVFLPRHVNYNR APSPQPPQP KK G
Sbjct: 241 TGVFLPRHVNYNRSAPSPQPPQPPKKTG 258

BLAST of Tan0018024 vs. ExPASy TrEMBL
Match: A0A6J1GAG3 (uncharacterized protein LOC111452251 OS=Cucurbita moschata OX=3662 GN=LOC111452251 PE=4 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 6.9e-108
Identity = 220/268 (82.09%), Positives = 233/268 (86.94%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGD-KQSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MA DPE  EFKFY+QFQTEHGD KQSPFEGDNWSSYF          SPVESEIGS+EIE
Sbjct: 1   MAVDPEAGEFKFYTQFQTEHGDNKQSPFEGDNWSSYF----------SPVESEIGSFEIE 60

Query: 61  SEKDEGDDG---YIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S+KD+GD G   Y AELS +MAQ+MLQDDDNSSTA FQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDKDDGDGGDEDYTAELSRRMAQFMLQDDDNSSTARFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSSNGSS+GSPEGPSKEPSPP+TP+VAER GLDISQNVF++LEKMKKVSTNGKSIQT PQ
Sbjct: 121 GSSNGSSYGSPEGPSKEPSPPTTPMVAERRGLDISQNVFTKLEKMKKVSTNGKSIQTRPQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQNQQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSG 240
           D E  SSSS KDQ RALKNQ+RKQNQQF+KQKGSA I AKQAQGSSSQGNSG KPGG SG
Sbjct: 181 DEENGSSSSSKDQTRALKNQRRKQNQQFIKQKGSAAIMAKQAQGSSSQGNSGGKPGGLSG 240

Query: 241 TGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
           TGVFLPRHVNYNR APSPQPPQP KK G
Sbjct: 241 TGVFLPRHVNYNRSAPSPQPPQPPKKTG 258

BLAST of Tan0018024 vs. ExPASy TrEMBL
Match: A0A5D3BEB2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001750 PE=4 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 4.5e-107
Identity = 223/271 (82.29%), Positives = 236/271 (87.08%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MAAD EGRE KF S+FQ EHGDK Q+PFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1   MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60

Query: 61  SEKDEGD---DGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S++D+G+   D Y AELS +MAQYMLQDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSS GSSHGSPEGPSKEPSPPSTPVV ERGGLDIS NVFS+LEKMKKVS +GKSIQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
            GET SSSS KDQ+R  KNQKR+QN   QQFMKQKGS  IQ KQAQGSS Q NSGAK GG
Sbjct: 181 IGETGSSSS-KDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGG 240

Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
            SGTGVFLPRHVNYNRPAP PQPPQP KKKG
Sbjct: 241 PSGTGVFLPRHVNYNRPAPCPQPPQPPKKKG 270

BLAST of Tan0018024 vs. ExPASy TrEMBL
Match: A0A1S3C665 (uncharacterized protein LOC103497120 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497120 PE=4 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 4.5e-107
Identity = 223/271 (82.29%), Positives = 236/271 (87.08%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MAAD EGRE KF S+FQ EHGDK Q+PFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1   MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60

Query: 61  SEKDEGD---DGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S++D+G+   D Y AELS +MAQYMLQDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSS GSSHGSPEGPSKEPSPPSTPVV ERGGLDIS NVFS+LEKMKKVS +GKSIQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
            GET SSSS KDQ+R  KNQKR+QN   QQFMKQKGS  IQ KQAQGSS Q NSGAK GG
Sbjct: 181 IGETGSSSS-KDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGG 240

Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
            SGTGVFLPRHVNYNRPAP PQPPQP KKKG
Sbjct: 241 PSGTGVFLPRHVNYNRPAPCPQPPQPPKKKG 270

BLAST of Tan0018024 vs. ExPASy TrEMBL
Match: A0A5A7SMB3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00070 PE=4 SV=1)

HSP 1 Score: 397.1 bits (1019), Expect = 5.8e-107
Identity = 223/271 (82.29%), Positives = 235/271 (86.72%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MAAD EGRE KF S+FQ EHGDK Q+PFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1   MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60

Query: 61  SEKDEGD---DGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S++D+G+   D Y AELS +MAQYMLQDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSS GSSHGSPEGPSKEPSPPSTPVV ERGGLDIS NVFS+LEKMKKVS N KSIQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSINSKSIQTSTQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
            GET SSSS KDQ+R  KNQKR+QN   QQFMKQKGS  IQ KQAQGSS Q NSGAK GG
Sbjct: 181 IGETGSSSS-KDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGG 240

Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKKG 265
            SGTGVFLPRHVNYNRPAP PQPPQP KKKG
Sbjct: 241 PSGTGVFLPRHVNYNRPAPCPQPPQPPKKKG 270

BLAST of Tan0018024 vs. ExPASy TrEMBL
Match: A0A1S3C6T0 (uncharacterized protein LOC103497120 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497120 PE=4 SV=1)

HSP 1 Score: 395.2 bits (1014), Expect = 2.2e-106
Identity = 222/270 (82.22%), Positives = 235/270 (87.04%), Query Frame = 0

Query: 1   MAADPEGREFKFYSQFQTEHGDK-QSPFEGDNWSSYFGRSDSFLSFSSPVESEIGSYEIE 60
           MAAD EGRE KF S+FQ EHGDK Q+PFEGD+WSSYFGRSDSFLSF+SPVESEIGS EIE
Sbjct: 1   MAADQEGRELKFNSKFQIEHGDKQQNPFEGDSWSSYFGRSDSFLSFNSPVESEIGSNEIE 60

Query: 61  SEKDEGD---DGYIAELSHQMAQYMLQDDDNSSTASFQSEIQNKSWGLSGSPISTLWSPL 120
           S++D+G+   D Y AELS +MAQYMLQDDDNSST SFQSEIQNKSWGLSGSPISTLWSPL
Sbjct: 61  SDRDDGENDGDDYTAELSRRMAQYMLQDDDNSSTTSFQSEIQNKSWGLSGSPISTLWSPL 120

Query: 121 GSSNGSSHGSPEGPSKEPSPPSTPVVAERGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQ 180
           GSS GSSHGSPEGPSKEPSPPSTPVV ERGGLDIS NVFS+LEKMKKVS +GKSIQTS Q
Sbjct: 121 GSSTGSSHGSPEGPSKEPSPPSTPVVEERGGLDISHNVFSKLEKMKKVSIHGKSIQTSTQ 180

Query: 181 DGETDSSSSFKDQARALKNQKRKQN---QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGG 240
            GET SSSS KDQ+R  KNQKR+QN   QQFMKQKGS  IQ KQAQGSS Q NSGAK GG
Sbjct: 181 IGETGSSSS-KDQSRTPKNQKRRQNQQQQQFMKQKGSGAIQVKQAQGSSLQANSGAKSGG 240

Query: 241 SSGTGVFLPRHVNYNRPAPSPQPPQPSKKK 264
            SGTGVFLPRHVNYNRPAP PQPPQP KKK
Sbjct: 241 PSGTGVFLPRHVNYNRPAPCPQPPQPPKKK 269

BLAST of Tan0018024 vs. TAIR 10
Match: AT5G59050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G54000.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 62.4 bits (150), Expect = 6.6e-10
Identity = 74/222 (33.33%), Positives = 98/222 (44.14%), Query Frame = 0

Query: 39  SDSFLSFSSP---------VESEIGSYEIESEK---DEGDDGYIAELSHQMAQYMLQDDD 98
           S+ F SFS P         +  +  S E +S K   ++ +D YI EL+ QM  YMLQDD 
Sbjct: 13  SNPFTSFSEPTFFTPTTSSLRPDFVSDEPDSPKAKNEDEEDEYITELTRQMTNYMLQDD- 72

Query: 99  NSSTASFQSEIQNKSWGL-SGSPISTLWSPLGSSNGSSHGSPEGPSKEPSPPSTPVVAE- 158
                    E   KS G  SGSP STLWSP      S   SP GPS+EPSPP TP     
Sbjct: 73  ---------EKHQKSCGSGSGSPQSTLWSPF----ASGLSSPIGPSREPSPPLTPATVPV 132

Query: 159 ---RGGLDISQNVFSQLEKMKKVSTNGKSIQTSPQDGETDSSSSFKDQARALKNQKRKQN 218
                 +D          K   +    +SIQ + Q  + +     +  A  L ++ R  +
Sbjct: 133 EKIMTKIDTKPVTIPFQSKQALIDDQIRSIQANFQKIKKEKEKERQRNADVLGHKARNYH 192

Query: 219 QQFMKQKGSATIQAKQAQGSSSQGNSGAKPGGSSGTGVFLPR 244
                Q+  + ++A    GS S+        GS GTGVFLPR
Sbjct: 193 HLHQNQRPRSGVKAVFVDGSGSR-------TGSGGTGVFLPR 213

BLAST of Tan0018024 vs. TAIR 10
Match: AT5G59050.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 58.5 bits (140), Expect = 9.5e-09
Identity = 50/115 (43.48%), Positives = 59/115 (51.30%), Query Frame = 0

Query: 39  SDSFLSFSSP---------VESEIGSYEIESEK---DEGDDGYIAELSHQMAQYMLQDDD 98
           S+ F SFS P         +  +  S E +S K   ++ +D YI EL+ QM  YMLQDD 
Sbjct: 13  SNPFTSFSEPTFFTPTTSSLRPDFVSDEPDSPKAKNEDEEDEYITELTRQMTNYMLQDD- 72

Query: 99  NSSTASFQSEIQNKSWGL-SGSPISTLWSPLGSSNGSSHGSPEGPSKEPSPPSTP 141
                    E   KS G  SGSP STLWSP      S   SP GPS+EPSPP TP
Sbjct: 73  ---------EKHQKSCGSGSGSPQSTLWSPF----ASGLSSPIGPSREPSPPLTP 113

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038895137.1  2.1e-111  84.13         uncharacterized protein LOC120083444 isoform X3 [Benincasa hispida]
XP_038895135.1  2.1e-111  84.13         uncharacterized protein LOC120083444 isoform X1 [Benincasa hispida]
XP_038895136.1  2.1e-111  84.13         uncharacterized protein LOC120083444 isoform X2 [Benincasa hispida]
KAG7036954.1    1.4e-107  82.09         hypothetical protein SDJN02_00574 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022948629.1  1.4e-107  82.09         uncharacterized protein LOC111452251 [Cucurbita moschata] >KAG6607276.1 hypothet...

ExPASy TrEMBL
Match Name  E-value   Identity (%)  Description
A0A6J1GAG3  6.9e-108  82.09         uncharacterized protein LOC111452251 OS=Cucurbita moschata OX=3662 GN=LOC1114522...
A0A5D3BEB2  4.5e-107  82.29         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3C665  4.5e-107  82.29         uncharacterized protein LOC103497120 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7SMB3  5.8e-107  82.29         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3C6T0  2.2e-106  82.22         uncharacterized protein LOC103497120 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...

TAIR 10
Match Name   E-value  Identity (%)  Description
AT5G59050.1  6.6e-10  33.33         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G59050.2  9.5e-09  43.48         unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25

IPR Term  IPR Description   Source       Source Term     Source Description      Alignment
None      No IPR available  COILS        Coil            Coil                    coord: 183..207
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction     coord: 311..339
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction     coord: 80..265
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction     coord: 321..339
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction     coord: 150..233
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction     coord: 80..129
None      No IPR available  PANTHER      PTHR33356:SF16  G PATCH DOMAIN PROTEIN  coord: 10..339
None      No IPR available  PANTHER      PTHR33356       TIP41-LIKE PROTEIN      coord: 10..339
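
Several of the MOBIDB_LITE rows overlap, so it can help to collapse them into distinct regions before reading them. The Python sketch below is one way to do that. It assumes the coordinates are inclusive residue positions, and the function name `merge_intervals` is hypothetical.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # With inclusive coordinates, an interval starting at end + 1 is adjacent.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(iv) for iv in merged]

# disorder_prediction coordinates taken from the MOBIDB_LITE rows above
disorder = [(311, 339), (80, 265), (321, 339), (150, 233), (80, 129)]
print(merge_intervals(disorder))  # [(80, 265), (311, 339)]
```

Merged this way, the five predictions reduce to two regions, 80..265 and 311..339.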

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0018024.1  Tan0018024.1  mRNA
Tan0018024.2  Tan0018024.2  mRNA