Tan0010764 (gene) Snake gourd v1

Overview
Name: Tan0010764
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Location: LG06: 1597170 .. 1602846 (+)
RNA-Seq Expression: Tan0010764
Synteny: Tan0010764
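The Location field packs the linkage group, start, end, and strand into one string. A minimal sketch of how it could be parsed is shown here; the `Location` type and the helper names are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string shaped like 'LG06: 1597170 .. 1602846 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("LG06: 1597170 .. 1602846 (+)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the span is 5677 bases; if the coordinates were half-open, the result would differ by one.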
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CATCAACAGCCTAAGATTGTAGAACATGGCGGAGAGCGACTCCGTACCTACTGGCGAAGAAGCAAAATCAGCCATCAAATACAAAAGACTGAGAAAAAGGGAACTTACAGTCATCGGAGATGAGCGAAGAAGAACCTCCTAAACTCTACGCTAACAAACCCAAGAAAGGTACCATTCCCAAACTCCCAATTCCTCATTTCTTCCAAGTTTAGACACAAAATGGACACTCCAAGAACCGATCCGCTTCAATTATTCATTACAATTATTGGGGCTTGGTATCTGATTGATGAGCAATCGGGGTTTTTCTTTTTTCCATCTGTAAACAGCCCAGGTTAAACAATTTCAAGAACAGCACAAAGTCAGAGACGCTTCTTCTTCGCCGGCGCCACCTGCATCGTCGAACATGGGATCTGCGTCTTCGCCGGCACCGCCACAGCCTCCGAAGGAATCATTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGACTGTTAACCTTGCTGTTGGAGGTAATTTTCCTCTTCTCTCTTCTCCTCTCCCATGACATGTGCAAAAGTTTCTGGTTGTTGTGTTAACGCGTTTGATCGAGTCCATTAAATACCCAAATGGGGAATCGAGGTTTGTCGTTCGGTTGTTAATGGAGTCTTTGCATTTAGGATTGGTACTCTCTAGTGACATTTGTGGCGGACCAGTCTGTTTAGATTGCGAAATCCTGTTCTCTTCTTTCTGGTAGACGTAAGTTTGGGTCCCTTAGTCATTTTTCTGCTGGGAGACAAATCGTTTTAATGAATTTGACTGCAAGATGAGGATATGCTCGGTATACATACGCGTACTTATATTTATGTTTGAATGGAACGATCACAGAAAGGAAAATGGGGAACTGAATTTCTTACGACTTTTCATGTTTCTTCTGTTAGTTTTTTGACAAGGAAACAAAAGTTTTCATTGAAAGAATGAAAAGAACCTAATGCTCAAAATACAAAGGGAGAACAAAATAAAAACAGAAATATGCACAAATCAATTTAATGACAAGCATTAAGGCAATTTTAGATAAAAAAACATGAAATCGACTTTAAAATATCCATCGAAAAAGAAAATTCCTCAAAATTTCCACCAACTCATATTCCTTCATCGACTCTCAAAGGCACCTAATAGTCCACACGGGGCCTTAGTAGCCACCCCCAACCTGCGAGATCAACCAGCTGCGACACTCAGGAGTTCACGAACATGCCAACCCACACCTTTGGTCATTCTACTTTGAGGCTACCAGACAAGGACCCATCTTATGAAATTAAACTCATCTTTCATGCACCATGGTCCTTCAAAAAGGAAGACAAAAATTTCCATTCTTCCCATAAAGAAAATCCTTTCCCCCCCTATCTTCCAAAGACTTCTACATGAACCCAACTAATGTCTCACGAATTCCCAATGAATTAGAACAACATTCTATCAACAAAAGCAAGCTTGGCTCCAGAATGAGCACGATATCAGGGTTGCAAAAACAGCTGCAAACCATCTCCAAAACCTCTTGAATTCCAAGAAAGAAAAATCATTATATGGAGGATGAAAACCCACCAACCGACACTTCCTTAAACTGCAAACCACTTGCCTTAATCAAGGAGCTAAAACGGCTTGGTTCAAGACAAATAACCGAGGTCTCTTTATCGATCAGAGGCGAACTCGTTCCTGAAGATTCGAAGAGACTTACAAAATTAATAATAGGGGACTCTACCACCAAATCCTTATGCCTAGACTTACCATGTAGAATAAAATCTCCCACCCTAACTATGTAGATGTCTTCGTTCAGGGGAGTTCTATATGCTTTGTTGTTTTGTCATGAATATTGGTAGAGTTGGGAATGATCAGTTTCTTGTGTGATTCGAGAATACGTGAGCATGAAATAATTGGAATTCGAGATTGATCAGAAACTAAAACTGAGTGTGTGCAATGACTTTTTTTAATTTTAATTTTTTAATTTATGATTTTCTAGAATACTC
TCTTCAATTGAAGGATGGTACATCTAAACTGTGTCGTGGAAAGAAAATTGTTTTAACTAGGGGCTTAAGTGGGTTAATTCAAGTTGTTTTTTTGGTCAAACTCGACATATTTTGGTGAGTGGTCGAGAGTTTGGAATGGGAGTTGCCTCCAATCCAACAATCTTTCTATCTCCTAGTAGGGCAAAGCCTGAGGCTGAATTAGACCCACACTATGATTAGACTGGGTGACTTTTGCACTTTTTATCTTTATTTATCTTTATATTTGTTGTCAATGTTCTTATCTAGCTTTTGGATAGAGGGGTAAGAACCAAATGCAGGGTTTTTATATGGGGAGGGATGATGTGATCTCATCTACTTTATGTTGCCAATGAAGATCTTTTATTTCCCCAGGAAGGGAAGACTTCTTTATTAACATTCATTGGGCCTTAGGATTCTAAGAGGTTGTTTAGGGCACTGAGTGGGTTATAATAACATGAATTATTATAGTTTGTAGGTTATAGCCCCCAAGGGGCAACCCCATTGGTAAGAACTTAGGGTCTCTTGGTCATACCGGTTTAGAGGTCTTAGGGTTGAGCCCTCGGGGATCTTAATAAATAAAAACCTTTGATGTTTCCTAGGTCTGGGCCTTGGGGTGGGCGCGGGTACCCTCGTGTATAGGGGAGCAAAGCTCCGAGCCTCCAACTCTCGGTTGTAAAAAAAAAATTAGTCTGTAGAGTTATGTAATTTGTGTTTAGGATGCAAAATTATTTAATTTGAGTTATTATAGTATGTGTTTGGGGTGCCTATTATAACTCACGCCCCAAACAACTCTTAAGAAATAAGAGCTCTGGTGGGCTATTATGAGATGTCGTGCTATTTCTTCCCTCTCCACCCGATATATAGGCATGATATTAGACATAATTTTTAGAGATCTATTAGACTCCTTTTGAAATTGAGGGACTTGTTAGACACAAAATTGTAAGTTTAAGAACATATTAGACATTTTAAAATTCAAGGACCTATTAGACGCAAATATGAAAGTTTAGGGACTAAACTTATAATTTAATGCACTAAAAAAAAACTTGTAATTTAACCTTAAATTATTTATTAGTAATGAAAACCATATTATTCTCCAAAAAAAAAAAAAAAAAGAAAACCAGATTTTTCATTGAGCAAATGAAAGGATATACACCTCGGAAATACAAGAAAAACAATCCAAACTAAGTGGAGCCTGAATAGAAGGTATAGCCGAGATCCTTTCAAACAGTAGAACAAATTTTACTCCAACCTCCTCACGATCACAATCACAATTCACAACCTTTTAGTCTTTTTATAGTTTGTATTTTTCATTTCCAATTGGAAAATGTTCTTGTAATTCACTATTAAGGTGTCCGGAGTTTTCTCCTTATTTCATTCATCAATGAAATTGTTTCTTCTATAAAAAAAAAAACCAGCTCTTTAATCTTCAACAAATCTAATAACACAATAACTTCCTTGCTTCCCCATCCCTCAAACTTCTTGTTGTGATATTGATATTCCAACTCCTACAAGTTGAAGCCCACATTGTATTTACAAAATTCAAATAGGACTCCACCACCCCGTACAAATTTTGATACAGTTCAGCAAGAGAGTAATCATCCCAAATCTAGTCAATCTATCATTGCCCACTTTAATCTTTGAATTACTTAATACAATGTTTTTTATTTCTAAAATATTCCTCCATAGTCTTCTCCATCTCCCCTACTCCTTTGAAAAGTCAATCAATGTTTCTCATTCTTTCCATAGATACATACTGCGAATCACCTTGTGTCAAATCTGTTCAGACTCATACATAAATATGCAAAGACATTGTGATTAATAAAAGCTTTATTTATCTGCTTCAAATTGTCCATGCCTGAACCTCCTTTTTCATAAGCCAAAGAACAAAGAAGCCACCCTCCAATGACCGAGTGTCTCAATTTATTTTCATTATTTCCTTCCCAAAGAAAAATTCCTCGTGACTTTCCAAAATATCTCATAAA
CATTTCCAAAGAAACCTCTATTGCTTTATGTTGCACTTTTTGGTCTTTTGGTAGATGGATTTTATTTTTAGAAAAGATCTTAAAAAAAGGGACCCGTTTGATCTCAATCCCTTTCTCGAATCCATCAAGGGGGTTTTCAAGGCAGAGTAACGTGGAAAAGAGGGCATTGGGCTTCAATCATCACATTTTTGTTTAATTATATCTTTTATTGTCTTAGGTTTATTTTGTTCCTCAAGACAGACTCTTACTTGTATATATATTAAAAAGTTTGGGGTATTTGAACTATATCATTTAGTTCTCTCTCTACTTATGCTTATACAACTATGGTATTAGACAACATGAAAATTGAATAATGGTGCTCCTTGGACATTAATTAGCGTCTTGAGAATATGGTATGCATGTATTAATTCTTAATAGCCATGATTTGCCAAAGTGAGCATAGCTCGGTAATTGGCATGTACCTCCGACCGAATCTCCCACCCTGGAATGTTGTTGAAGTAAAAAATTCTTAATAGCCATGATTTATTCTCGTTTCCCACAAGAGATGACACTTGTCATTTCAAAAGCCTATGGGTTGACTCCCTGACACTTTCATGCTGCTCCACTATGCTTTTTCTCCTCCACCAGCTGTGTAGGTTGAGTGCCCTTTTCAAGGAGGCAGGGAAATTTACCATCTTGCAATTTTGTAAATTCAGTAGTTATCATTGTTTTTAGGAAATCTTGCTTTTGCATGAACTGAGATAAGTCTTTTAATTGTGTAGCTTATCTGTTTATGAGGACAAAGAAGCAAGATGAACATGTAGCTGAAGAAGAGGCTACCCCGGATTCAGCTAAAACTGCCAAGATTGCTGCTCCTGTTGTTGAGGAATCATTGGCCAGACCAGCCATTGTGGAGCCCGTGAAGGTAAGGGAGCCAATTCCGGTGGATCAGCAGCGTGAACTTTTCAAGTGGATTTTGGAAGAAAAGCGCAAGATAAAGCCAAAAGATCGCGAAGAGCAGAAACGCATTGACGAAGAGAAAGCAATTCTCAAAGAGTTCATCCGAGCAAAATCTTTTCCAAATGTTTAAACTTTCCAAATGGTAGCAAGCAAACATATTGACTGTTTTACATTGCTGATGGAATCTGAATGTTATCTGAATGTTGTTGATTTCTGTGGTCCTAAAATGGTTTTGTACGTTTCAAATCCTCCCACCACCCCGACCCCAAACGCTAGTTGACACTATGAACACGGTTACGTCCAGAAAACCAGCAATGAAATTTGTTAGGCCTCTGTCGTTGTATAGAATTTCTGATACCAAACAGAACCTTCACTATTGAGGAAAAGACTAGCTAGCCTTTTTTGTGGTTGTTTGTCTTTAGGTTATGTGGTTTTGTACGAGGAGGAAACATCCATTTTATTAGCTTCCAACTTGAGAGAAAATCCAAGGGAACAACTTCTGCAACTTGAAGAGTTCACGTGGGCAGGATGAAATGATGCACCATTTCCCAAATGGATGTGGACATATCCAATGTGTATGGTGAGATTTCTCCATCTGCTGCATGCATAAGCCTCCAAAAACCAAACTAAATGCAACTTCCAATGGGTCAAGTTTTTGTTGCTCTTTTTGGAATATTTTAGTCTTTCTAGATAATGTGTCATAGCTCCTTCCGGTGTGTACCTGTTGTAGGCTCCAGG

mRNA sequence

CATCAACAGCCTAAGATTGTAGAACATGGCGGAGAGCGACTCCGTACCTACTGGCGAAGAAGCAAAATCAGCCATCAAATACAAAAGACTGAGAAAAAGGGAACTTACAGTCATCGGAGATGAGCGAAGAAGAACCTCCTAAACTCTACGCTAACAAACCCAAGAAAGCCCAGGTTAAACAATTTCAAGAACAGCACAAAGTCAGAGACGCTTCTTCTTCGCCGGCGCCACCTGCATCGTCGAACATGGGATCTGCGTCTTCGCCGGCACCGCCACAGCCTCCGAAGGAATCATTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGACTGTTAACCTTGCTGTTGGAGCTTATCTGTTTATGAGGACAAAGAAGCAAGATGAACATGTAGCTGAAGAAGAGGCTACCCCGGATTCAGCTAAAACTGCCAAGATTGCTGCTCCTGTTGTTGAGGAATCATTGGCCAGACCAGCCATTGTGGAGCCCGTGAAGGTAAGGGAGCCAATTCCGGTGGATCAGCAGCGTGAACTTTTCAAGTGGATTTTGGAAGAAAAGCGCAAGATAAAGCCAAAAGATCGCGAAGAGCAGAAACGCATTGACGAAGAGAAAGCAATTCTCAAAGAGTTCATCCGAGCAAAATCTTTTCCAAATGTTTAAACTTTCCAAATGGTAGCAAGCAAACATATTGACTGTTTTACATTGCTGATGGAATCTGAATGTTATCTGAATGTTGTTGATTTCTGTGGTCCTAAAATGGTTTTGTACGTTTCAAATCCTCCCACCACCCCGACCCCAAACGCTAGTTGACACTATGAACACGGTTACGTCCAGAAAACCAGCAATGAAATTTGTTAGGCCTCTGTCGTTGTATAGAATTTCTGATACCAAACAGAACCTTCACTATTGAGGAAAAGACTAGCTAGCCTTTTTTGTGGTTGTTTGTCTTTAGGTTATGTGGTTTTGTACGAGGAGGAAACATCCATTTTATTAGCTTCCAACTTGAGAGAAAATCCAAGGGAACAACTTCTGCAACTTGAAGAGTTCACGTGGGCAGGATGAAATGATGCACCATTTCCCAAATGGATGTGGACATATCCAATGTGTATGGTGAGATTTCTCCATCTGCTGCATGCATAAGCCTCCAAAAACCAAACTAAATGCAACTTCCAATGGGTCAAGTTTTTGTTGCTCTTTTTGGAATATTTTAGTCTTTCTAGATAATGTGTCATAGCTCCTTCCGGTGTGTACCTGTTGTAGGCTCCAGG

Coding sequence (CDS)

ATGAGCGAAGAAGAACCTCCTAAACTCTACGCTAACAAACCCAAGAAAGCCCAGGTTAAACAATTTCAAGAACAGCACAAAGTCAGAGACGCTTCTTCTTCGCCGGCGCCACCTGCATCGTCGAACATGGGATCTGCGTCTTCGCCGGCACCGCCACAGCCTCCGAAGGAATCATTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGACTGTTAACCTTGCTGTTGGAGCTTATCTGTTTATGAGGACAAAGAAGCAAGATGAACATGTAGCTGAAGAAGAGGCTACCCCGGATTCAGCTAAAACTGCCAAGATTGCTGCTCCTGTTGTTGAGGAATCATTGGCCAGACCAGCCATTGTGGAGCCCGTGAAGGTAAGGGAGCCAATTCCGGTGGATCAGCAGCGTGAACTTTTCAAGTGGATTTTGGAAGAAAAGCGCAAGATAAAGCCAAAAGATCGCGAAGAGCAGAAACGCATTGACGAAGAGAAAGCAATTCTCAAAGAGTTCATCCGAGCAAAATCTTTTCCAAATGTTTAA

Protein sequence

MSEEEPPKLYANKPKKAQVKQFQEQHKVRDASSSPAPPASSNMGSASSPAPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEESLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAKSFPNV
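The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates that relationship with the standard genetic code (NCBI translation table 1); it is an illustration under that assumption, not the pipeline that produced this record, and `translate` is a hypothetical helper name. Only short fragments copied from the start and end of the CDS are used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError on codons with ambiguous bases (e.g. 'N').
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGAGCGAAGAAGAACCTCCTAAA"))  # MSEEEPPK
# Last four codons of the CDS, ending in the TAA stop codon.
print(translate("CCAAATGTTTAA"))  # PNV
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would be a simple consistency check on the record.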
Homology
BLAST of Tan0010764 vs. NCBI nr
Match: XP_038898399.1 (uncharacterized protein LOC120086050 [Benincasa hispida])

HSP 1 Score: 297.4 bits (760), Expect = 8.6e-77
Identity = 163/187 (87.17%), Positives = 170/187 (90.91%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDASSSPAPPASSNMGSASS-------PAPPQ 60
           MSEEEPPKLYANKPKKAQVKQFQEQHKVRDASSSPAPPASSNM SAS+       P+PPQ
Sbjct: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDASSSPAPPASSNMASASASASASSYPSPPQ 60

Query: 61  PPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVE 120
           PPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVA+E+A PDSA   KIA PVVE
Sbjct: 61  PPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVADEDAAPDSA--TKIAPPVVE 120

Query: 121 ESLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIR 180
           ES   PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREE+KRIDEEKAILK+FIR
Sbjct: 121 ESFTGPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKQFIR 180

BLAST of Tan0010764 vs. NCBI nr
Match: KAG6575480.1 (hypothetical protein SDJN03_26119, partial [Cucurbita argyrosperma subsp. sororia] >KAG7014023.1 hypothetical protein SDJN02_24194 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 293.1 bits (749), Expect = 1.6e-75
Identity = 162/182 (89.01%), Positives = 166/182 (91.21%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDA--SSSPAPPASSNMGSASSPAPPQPPKES 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV  A  SSSPAPPASSN  S SS + PQPPKES
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSSPAPPASSNTASTSSSSLPQPPKES 60

Query: 61  FARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEESLAR 120
           FARRYKFLWPMLLTVNLAVGAYL MRTKKQDE V EEEA PDSAKTAKIAAPVVEES A+
Sbjct: 61  FARRYKFLWPMLLTVNLAVGAYLLMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEESFAK 120

Query: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAKSFP 180
           PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREE+KRIDEEKAILKEFIRAKS P
Sbjct: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 180

BLAST of Tan0010764 vs. NCBI nr
Match: XP_022954191.1 (uncharacterized protein LOC111456527 isoform X1 [Cucurbita moschata])

HSP 1 Score: 290.8 bits (743), Expect = 8.0e-75
Identity = 163/185 (88.11%), Positives = 167/185 (90.27%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDA--SSSPAPPASSNMGSASSPAP---PQPP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV  A  SSSPAPPASSN  S SS +    PQPP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSSPAPPASSNTASTSSSSSSSLPQPP 60

Query: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEES 120
           KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAKTAKIAAPVVEES
Sbjct: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEES 120

Query: 121 LARPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAK 180
            A+PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREE+KRIDEEKAILKEFIRAK
Sbjct: 121 FAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK 180

BLAST of Tan0010764 vs. NCBI nr
Match: XP_023548106.1 (uncharacterized protein LOC111806841 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 290.4 bits (742), Expect = 1.0e-74
Identity = 163/185 (88.11%), Positives = 168/185 (90.81%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDA--SSSPAPPASSN---MGSASSPAPPQPP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV  A  SSSPAPPASSN     S+SS + PQPP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSSPAPPASSNTAATSSSSSSSLPQPP 60

Query: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEES 120
           KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAKTAKIAAPVVEES
Sbjct: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEES 120

Query: 121 LARPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAK 180
            A+PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREE+KRIDEEKAILKEFIRAK
Sbjct: 121 FAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK 180

BLAST of Tan0010764 vs. NCBI nr
Match: XP_022992414.1 (uncharacterized protein LOC111488728 isoform X1 [Cucurbita maxima])

HSP 1 Score: 288.5 bits (737), Expect = 4.0e-74
Identity = 162/186 (87.10%), Positives = 166/186 (89.25%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDA-SSSPAPPASSNMGSASSPAP-----PQP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV  A SSSPAPPASSN  S SS +      PQP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSPAPPASSNTASTSSSSSSSSSLPQP 60

Query: 61  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEE 120
           PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAKTAKIAAPVVEE
Sbjct: 61  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEE 120

Query: 121 SLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRA 180
           S A+PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKD EE+KRIDEEKAILKEFIRA
Sbjct: 121 SFAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDHEEKKRIDEEKAILKEFIRA 180

BLAST of Tan0010764 vs. ExPASy TrEMBL
Match: A0A6J1GQ86 (uncharacterized protein LOC111456527 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456527 PE=4 SV=1)

HSP 1 Score: 290.8 bits (743), Expect = 3.9e-75
Identity = 163/185 (88.11%), Positives = 167/185 (90.27%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDA--SSSPAPPASSNMGSASSPAP---PQPP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV  A  SSSPAPPASSN  S SS +    PQPP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSSPAPPASSNTASTSSSSSSSLPQPP 60

Query: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEES 120
           KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAKTAKIAAPVVEES
Sbjct: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEES 120

Query: 121 LARPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAK 180
            A+PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREE+KRIDEEKAILKEFIRAK
Sbjct: 121 FAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK 180

BLAST of Tan0010764 vs. ExPASy TrEMBL
Match: A0A6J1JXH4 (uncharacterized protein LOC111488728 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488728 PE=4 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 1.9e-74
Identity = 162/186 (87.10%), Positives = 166/186 (89.25%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDA-SSSPAPPASSNMGSASSPAP-----PQP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV  A SSSPAPPASSN  S SS +      PQP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSPAPPASSNTASTSSSSSSSSSLPQP 60

Query: 61  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEE 120
           PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAKTAKIAAPVVEE
Sbjct: 61  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEE 120

Query: 121 SLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRA 180
           S A+PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKD EE+KRIDEEKAILKEFIRA
Sbjct: 121 SFAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDHEEKKRIDEEKAILKEFIRA 180

BLAST of Tan0010764 vs. ExPASy TrEMBL
Match: E5GCA6 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 2.5e-74
Identity = 160/182 (87.91%), Positives = 166/182 (91.21%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDASSSPAPPASSNMGSA--SSPAPPQPPKES 60
           MSEE  PKLYANKP KAQ+KQFQEQHK  DASSS    ASS+M SA  SSP PPQPPKES
Sbjct: 1   MSEEGLPKLYANKPTKAQIKQFQEQHKAGDASSS----ASSSMASASSSSPPPPQPPKES 60

Query: 61  FARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEESLAR 120
           FARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA PDSAKT KIAAPVVEESLA+
Sbjct: 61  FARRYKFLWPMLLTVNLAVGAYVFMRTKKQDEHVAEEEAAPDSAKTTKIAAPVVEESLAK 120

Query: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAKSFP 180
           PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREE+KRIDEEKAILKEFIRAKS P
Sbjct: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 178

BLAST of Tan0010764 vs. ExPASy TrEMBL
Match: A0A1S3CGT4 (uncharacterized protein LOC103500733 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103500733 PE=4 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 2.5e-74
Identity = 160/182 (87.91%), Positives = 166/182 (91.21%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDASSSPAPPASSNMGSA--SSPAPPQPPKES 60
           MSEE  PKLYANKP KAQ+KQFQEQHK  DASSS    ASS+M SA  SSP PPQPPKES
Sbjct: 1   MSEEGLPKLYANKPTKAQIKQFQEQHKAGDASSS----ASSSMASASSSSPPPPQPPKES 60

Query: 61  FARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEESLAR 120
           FARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA PDSAKT KIAAPVVEESLA+
Sbjct: 61  FARRYKFLWPMLLTVNLAVGAYVFMRTKKQDEHVAEEEAAPDSAKTTKIAAPVVEESLAK 120

Query: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAKSFP 180
           PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREE+KRIDEEKAILKEFIRAKS P
Sbjct: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 178

BLAST of Tan0010764 vs. ExPASy TrEMBL
Match: A0A0A0KAZ6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G451380 PE=4 SV=1)

HSP 1 Score: 286.2 bits (731), Expect = 9.6e-74
Identity = 158/181 (87.29%), Positives = 165/181 (91.16%), Query Frame = 0

Query: 1   MSEEEPPKLYANKPKKAQVKQFQEQHKVRDASSSPAPPASSNMGSA-SSPAPPQPPKESF 60
           MSEE  PKLYANKP KAQ+KQFQE+HK  DASSS    ASSNM SA SSP PPQPPKESF
Sbjct: 1   MSEEGLPKLYANKPTKAQIKQFQERHKAGDASSS----ASSNMASASSSPPPPQPPKESF 60

Query: 61  ARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEATPDSAKTAKIAAPVVEESLARP 120
           ARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA PDSAKT KIAAPVVEESLARP
Sbjct: 61  ARRYKFLWPMLLTVNLAVGAYVFMRTKKQDEHVAEEEAAPDSAKTTKIAAPVVEESLARP 120

Query: 121 AIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAKSFPN 180
            +VEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREE+KRIDEEKAILKEFIRAKS P+
Sbjct: 121 VVVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPS 177

BLAST of Tan0010764 vs. TAIR 10
Match: AT1G55160.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion, plastid; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G19530.1); Has 63 Blast hits to 63 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 164.5 bits (415), Expect = 8.1e-41
Identity = 109/189 (57.67%), Positives = 133/189 (70.37%), Query Frame = 0

Query: 4   EEPPKLYANKPKK----AQVKQFQE--QHKVRDASSSPAP----PASSNMGSASSPAPPQ 63
           EE PKL+ NKPKK    AQ+K  +    +     SS P+P     AS  MG  S P PP 
Sbjct: 3   EETPKLFTNKPKKKAIIAQLKHVEANFNNPTVPPSSKPSPAAAAAASYTMGGGSVP-PPP 62

Query: 64  PPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD-EHVAEEEATPDSAKTAKIAAPV- 123
           PPKESFARRYK++WP+LLTVNLAVG YLF RTKK+D + V EE A    AK++ +AAPV 
Sbjct: 63  PPKESFARRYKYVWPLLLTVNLAVGGYLFFRTKKKDLDPVVEETA----AKSSSVAAPVT 122

Query: 124 VEESLARPAIVEPV--KVREPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILK 179
           VE++L+   + EPV  K REPIP  QQRELFKW+LEEKRK+ PK+ EE+KR DEEKAILK
Sbjct: 123 VEKTLSSTVVAEPVVIKAREPIPEKQQRELFKWMLEEKRKVNPKNAEEKKRNDEEKAILK 182

BLAST of Tan0010764 vs. TAIR 10
Match: AT1G55160.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion, plastid; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G19530.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 152.9 bits (385), Expect = 2.4e-37
Identity = 90/140 (64.29%), Positives = 109/140 (77.86%), Query Frame = 0

Query: 43  MGSASSPAPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD-EHVAEEEATPDS 102
           MG  S P PP PPKESFARRYK++WP+LLTVNLAVG YLF RTKK+D + V EE A    
Sbjct: 1   MGGGSVP-PPPPPKESFARRYKYVWPLLLTVNLAVGGYLFFRTKKKDLDPVVEETA---- 60

Query: 103 AKTAKIAAPV-VEESLARPAIVEPV--KVREPIPVDQQRELFKWILEEKRKIKPKDREEQ 162
           AK++ +AAPV VE++L+   + EPV  K REPIP  QQRELFKW+LEEKRK+ PK+ EE+
Sbjct: 61  AKSSSVAAPVTVEKTLSSTVVAEPVVIKAREPIPEKQQRELFKWMLEEKRKVNPKNAEEK 120

Query: 163 KRIDEEKAILKEFIRAKSFP 179
           KR DEEKAILK+FI +K+ P
Sbjct: 121 KRNDEEKAILKQFIGSKTIP 135

BLAST of Tan0010764 vs. TAIR 10
Match: AT1G55160.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion, plastid; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G19530.1); Has 63 Blast hits to 63 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 151.0 bits (380), Expect = 9.2e-37
Identity = 109/214 (50.93%), Positives = 134/214 (62.62%), Query Frame = 0

Query: 4   EEPPKLYANKPKK----AQVKQFQE--QHKVRDASSSPAP----PASSNMGSASSPAPPQ 63
           EE PKL+ NKPKK    AQ+K  +    +     SS P+P     AS  MG  S P PP 
Sbjct: 3   EETPKLFTNKPKKKAIIAQLKHVEANFNNPTVPPSSKPSPAAAAAASYTMGGGSVP-PPP 62

Query: 64  PPKESFARRYKFLWPMLLTVNLAVG-------------------------AYLFMRTKKQ 123
           PPKESFARRYK++WP+LLTVNLAVG                         +YLF RTKK+
Sbjct: 63  PPKESFARRYKYVWPLLLTVNLAVGGFCSSLDENRIVFSFIFMMLRVIYDSYLFFRTKKK 122

Query: 124 D-EHVAEEEATPDSAKTAKIAAPV-VEESLARPAIVEPV--KVREPIPVDQQRELFKWIL 179
           D + V EE A    AK++ +AAPV VE++L+   + EPV  K REPIP  QQRELFKW+L
Sbjct: 123 DLDPVVEETA----AKSSSVAAPVTVEKTLSSTVVAEPVVIKAREPIPEKQQRELFKWML 182

BLAST of Tan0010764 vs. TAIR 10
Match: AT2G19530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G55160.2); Has 461 Blast hits to 346 proteins in 80 species: Archae - 0; Bacteria - 16; Metazoa - 89; Fungi - 28; Plants - 57; Viruses - 0; Other Eukaryotes - 271 (source: NCBI BLink). )

HSP 1 Score: 87.8 bits (216), Expect = 9.6e-18
Identity = 66/181 (36.46%), Positives = 92/181 (50.83%), Query Frame = 0

Query: 47  SSPAPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAT-------- 106
           SSP+  +PP++  ++  K  W   +  NL   AY+F   +++D    E++          
Sbjct: 3   SSPSGSEPPQKVVSKLQKVGWRATMIFNLGFAAYIFAIKREKDIDADEKKKVKKGSEARH 62

Query: 107 -------------------PDSAKTAKIAAPVVEE-------------------SLARPA 166
                               D AK A+ A P  EE                   S+ +  
Sbjct: 63  KGVKKGAVNTEIEKKGAEETDKAKEAETAIPEKEETKLIPELDPLFEFTDATDQSMFQTV 122

Query: 167 IVEPVKV-REPIPVDQQRELFKWILEEKRKIKPKDREEQKRIDEEKAILKEFIRAKSFPN 181
             E VKV R+PIP D+Q+ELFKWILEEKRKI+PKDR+E+K+IDEEKAILK+FIRA+  P 
Sbjct: 123 ATEHVKVARKPIPEDEQKELFKWILEEKRKIEPKDRKEKKQIDEEKAILKQFIRAERIPK 182

The following BLAST results are available for this feature:
BLAST of Tan0010764 vs. NCBI nr
Match Name      E-value  Identity  Description
XP_038898399.1  8.6e-77  87.17     uncharacterized protein LOC120086050 [Benincasa hispida]
KAG6575480.1    1.6e-75  89.01     hypothetical protein SDJN03_26119, partial [Cucurbita argyrosperma subsp. sorori...
XP_022954191.1  8.0e-75  88.11     uncharacterized protein LOC111456527 isoform X1 [Cucurbita moschata]
XP_023548106.1  1.0e-74  88.11     uncharacterized protein LOC111806841 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022992414.1  4.0e-74  87.10     uncharacterized protein LOC111488728 isoform X1 [Cucurbita maxima]

BLAST of Tan0010764 vs. ExPASy TrEMBL
Match Name      E-value  Identity  Description
A0A6J1GQ86      3.9e-75  88.11     uncharacterized protein LOC111456527 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JXH4      1.9e-74  87.10     uncharacterized protein LOC111488728 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
E5GCA6          2.5e-74  87.91     Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
A0A1S3CGT4      2.5e-74  87.91     uncharacterized protein LOC103500733 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0KAZ6      9.6e-74  87.29     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G451380 PE=4 SV=1

BLAST of Tan0010764 vs. TAIR 10
Match Name      E-value  Identity  Description
AT1G55160.1     8.1e-41  57.67     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G55160.2     2.4e-37  64.29     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G55160.3     9.2e-37  50.93     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G19530.1     9.6e-18  36.46     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term     Source Description                   Alignment
None      No IPR available  PANTHER  PTHR34364:SF12  SUBFAMILY NOT NAMED                  coord: 2..137
None      No IPR available  PANTHER  PTHR34364       WAS/WASL-INTERACTING FAMILY PROTEIN  coord: 2..137

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0010764.1  Tan0010764.1  mRNA
Tan0010764.2  Tan0010764.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane