HG10023482 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023482
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr05: 34594782 .. 34598276 (+)
RNA-Seq Expression: HG10023482
Synteny: HG10023482
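The Location field can be turned into a gene span with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (common for GFF-style annotation, but not stated on this page), and the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(location: str):
    """Split a 'Chr: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr05: 34594782 .. 34598276 (+)")

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 3495 bp on the plus strand of Chr05.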
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGCAAAGAAGAACCTCCCAAGCTCTACGCCAACAAACCCAAGAAAGGTACCATTCCAAAACTCACAACCCATTTCCCAATTCTTCATTCCTTCGAAGTTTAGGCACAAAATGGTTGCTCCAAGAACCGATCCGATTCAATAATTCATTTCAATTATTGGGGTTCTATTATATAATGGATAATCAATCGGCTTTTCGCTTTTTTTTCCGTCTGCAAACAGCCCAGGTCAAACAATTTCAAGAACAGCACAAAGTCAGCGACGCTTCTTCTTCTTCACCGGCGCCACCATCATCGAGCATGTCATCCGCGTCTTCTTCTCCTTCACCGCCGCAGCCTCCGAAGGAATCATTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGACTGTCAACCTTGCTGTTGGAGGTAATTTTCCTCTCCGATGATGTGTGTATCAGTTTCTGGTTGTTGTGTTAATGTGTTTGATCGAGAGCGTTAAATGCTAATACGGGTAATCCGGTTCTTCGTTTTTATTTCTGTTACATGGAGTCTTTGTATTTAGGATTGGTACTCTCTACTGAAATTTGTGGTGTACCAGTTTCTTTAGATTGTTAAATCCCGTTATCTTCTTTCTGGTACTTGTAAACTTGGGTCTCCCACCTTAGCTATCTAGATATCTCTGTTCGGAGGAATTCTGTATGGTTTGTTGTTTTGCCGCTACAGTCAGCATAGAGTTGGGAATGTTTAGTTTCCTGTGTGATTGAAGAATATATGCAAGCATGAAATAACTGTAATTCGAGATCGATCAGAAACTAAAACCAAGTGTGTGCAATAATTTTTTTTTGATGATTTTGTTGAATACTCTCTTCAATTGAAGGATGGTCTGCTATGACAAATTAGCAGTAGCACAATAGTACTTTTCAGGCGTATTGGTTAATCTAAACCGTGTCATAAGTGGGTTAATTAGAGTTGTTTTTTTGGTTGAACTTGACATATATTGGTGAGCGTAGAGAGGTTAGTATTGGGGTTGTGATCCAACAATCTTTTTATCTTCCAGTAGGACAAAGCCGGAGGCTAAATTAAAGCCACATTTGTTGTCAATGTCCTTATCTAGCTTTTGGACAGAGGGAAAGGCAATAACCAATGACACTATCTTTAGATGGGGAGGGATGATTTGATTTCATCGACTTATGTTGCCAATGACACTATCTTTTCTTTCCCCAGGAAGGGAAGGCGTCCTTGTCGACCTTTTGAAAAGAAAACCCTTATAGAACCATCCTTATAGTTTTCTATCTAACATTTTTCCTTGGGATTCTAAGAGATAAGAGCTCTATCCATGGTGTTAATATCTTTATAGTAAACACCCCAGGTGGGCTATTATGATTTGTTTTCATCCCTCTCCACCTAATATATAACTTGGCATTAGATTAAGAATTAAATGTTTAGACATCTATTAGACTTTATTTAAAGTTTAGGAATACATTATACTCAAAATTGATAGTTTATTAGACATTTTTAAAATTCAGGGACCTATTAGATGCAAATTGCATCTATGAAAACTTAGGAACTAAACTTGTAATTTAACCTTAAATCATTGATTAATAATTAAAACCATATTCTTCTGAAGAAAAAAAAAAAACAAAAACCAATTTTTTTATTAAGAAAATGAAAGGATAGACTCCAGGGCAATACAAGAAAAACAATCCAAACCAAGTGGAGCCAAAATAAAAGGCATAGATGAGATCCTTTCAAACAGCAAAACAAAAAATTTCCTCCAACCTCTTCATAATTAGATAATTTTCTTACCAAAGGATGGGCAGTAAACTGCCTAAAAAGCTCTAGTACCCGGAGTTTTTTATGTCCCTCTTCCTCCAGCTCTTTTTTCAACAAATCTAGTAGCACAATAGATTCCCTTCTTCCATCTTTAAATTTCTTGTCGTGATATGCCAACTCCTACAAAAAGAAGCCGATATTGTTCATTTATATCCCATCATCCCATACAAACTTCGGTTCAT
TTCAGCTAGAGGAAGTTCATCCCTTGCCCAAACATCCTCCCAAAAATTGAATCAATCTACCATTACCCATTTCAATCTTTGAATTACTCAGTAAGTGTTCTTCATTTTTTCGCAATATGGCAAAAGAATGCTGGGCATAGCAGTATATTTGTTTGACTTTGAGATTAGTTGGTGTTTCCTCTTTGTCATGAGGAGATGGCGATTCAACTATTACAGAGTCCTAGCCTAAGAAGTTACGAGTATTTTGTGGAACGACGTAGTGCACACAATCCTTTGGAGGTTGTGGCTTGAAAGGAACCAAAGGACATTCCAAGGGACGGAGTTGAGCATGGATAACCTATGTGCCAGCATTAAGTTTTATTCTTCCTTTTGGTGGATCCATAACAATTTTTTTTTGTAATTACGATCTTTTGCGAATTATTGCCAATTGGGAAGCCTTTTTGTAATCCCCTGGCTTTCTTGGAGATATCTCATCTCCCCTCTTTTGTGTATGCCCTTTTGATCTAATGCAATTCTCAGTTTAATTTTAAAAAATATAAGGAAAAATAAAGGAAAAAAGAAATAATATTTTTCATTTCTAAAATATTCCTCCATAGGCTTGTCCATTTTCCGACTCATTTGAGAAGTGAACCAATATTCCTGATTCTTTCCCCATATATACTGCAAAAGACCCTGTGCCAAAACTGATCTGACTCATCTTTTTATAATTATATCTTTTATTGTCTTAGGTTTCTTGTCTCTCAAGATGGTCTTGTCCTTTCATACATACATATTATGATATTTGAACTAACATCATATCATGAGAATTGAATAATGGTCCTCCTTGGACATAATCTAGTGTCTTTAGTTAGATATTATGATATCCACGTATTAATTCTTGATAGTCATGTTTTTCTTCTCATTTCCCTTTGAGGATGACAAACACCTGTTATTACAAAAGCCCTAACCTAAAAGAGCCAAGCTCCACTCTTTTCTCCGCCACCAGCTGTAGGTTGAGTACCCTTTTCAAGGAAATATAGAATAGCAGAAACATTCTATTAGGTTTTTTATTCACCTTTTTTGTGTTTTGGAGGGTTGGGGGATGGAAATTTATTATCTTGCAATTGTTTTGATTCAGTAGTTGTCATTGTTCTTAGGAACATTTACTTGCTTTTGCATGAACTGAGATAAGCCTTTTAACTGTGTAGCTTATCTGTTTATGAGAACAAAAAAGCAAGATGAACATGTAGCTGAAGAAGAGGCTGGCCCGGATTCAGCCAAAATCACCAAGATTGCTGCTCCTGTTGTTGAGGAATCATTGGCCAGACCAACCATTGTGGAGCCTGTGAAGGTAAGAGAACCAATTCCGGTGGACCAGCAGCGTGAACTGTTCAAGTGGATTTTGGAAGAGAAGCGCAAGATAAAGCCAAAGGACCGTGAAGAGAAAAAACGCATTGATGAAGAGAAAGCAATTCTCAAAGAGTTCATCCGAGCAAAATCTATTCCTAATGTTTAA

mRNA sequence

ATGAGCAAAGAAGAACCTCCCAAGCTCTACGCCAACAAACCCAAGAAAGCCCAGGTCAAACAATTTCAAGAACAGCACAAAGTCAGCGACGCTTCTTCTTCTTCACCGGCGCCACCATCATCGAGCATGTCATCCGCGTCTTCTTCTCCTTCACCGCCGCAGCCTCCGAAGGAATCATTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGACTGTCAACCTTGCTGTTGGAGCTTATCTGTTTATGAGAACAAAAAAGCAAGATGAACATGTAGCTGAAGAAGAGGCTGGCCCGGATTCAGCCAAAATCACCAAGATTGCTGCTCCTGTTGTTGAGGAATCATTGGCCAGACCAACCATTGTGGAGCCTGTGAAGGTAAGAGAACCAATTCCGGTGGACCAGCAGCGTGAACTGTTCAAGTGGATTTTGGAAGAGAAGCGCAAGATAAAGCCAAAGGACCGTGAAGAGAAAAAACGCATTGATGAAGAGAAAGCAATTCTCAAAGAGTTCATCCGAGCAAAATCTATTCCTAATGTTTAA

Coding sequence (CDS)

ATGAGCAAAGAAGAACCTCCCAAGCTCTACGCCAACAAACCCAAGAAAGCCCAGGTCAAACAATTTCAAGAACAGCACAAAGTCAGCGACGCTTCTTCTTCTTCACCGGCGCCACCATCATCGAGCATGTCATCCGCGTCTTCTTCTCCTTCACCGCCGCAGCCTCCGAAGGAATCATTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGACTGTCAACCTTGCTGTTGGAGCTTATCTGTTTATGAGAACAAAAAAGCAAGATGAACATGTAGCTGAAGAAGAGGCTGGCCCGGATTCAGCCAAAATCACCAAGATTGCTGCTCCTGTTGTTGAGGAATCATTGGCCAGACCAACCATTGTGGAGCCTGTGAAGGTAAGAGAACCAATTCCGGTGGACCAGCAGCGTGAACTGTTCAAGTGGATTTTGGAAGAGAAGCGCAAGATAAAGCCAAAGGACCGTGAAGAGAAAAAACGCATTGATGAAGAGAAAGCAATTCTCAAAGAGTTCATCCGAGCAAAATCTATTCCTAATGTTTAA

Protein sequence

MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
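As a quick consistency check, the coding sequence should translate to the protein sequence listed above. The following is a minimal Python sketch, assuming the standard nuclear genetic code; the function name `translate` is illustrative, and the two short fragments are copied from the start and end of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAGCAAAGAAGAACCTCCCAAG"  # first 24 nt of the CDS
cds_end = "ATTCCTAATGTTTAA"            # last 15 nt of the CDS, including the stop

print(translate(cds_start))  # MSKEEPPK
print(translate(cds_end))    # IPNV
```

Both fragments agree with the corresponding ends of the protein sequence (MSKEEPPK... and ...IPNV). Passing the complete CDS to the same function would check the whole record.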
Homology
BLAST of HG10023482 vs. NCBI nr
Match: XP_022992414.1 (uncharacterized protein LOC111488728 isoform X1 [Cucurbita maxima])

HSP 1 Score: 288.1 bits (736), Expect = 5.2e-74
Identity = 163/186 (87.63%), Positives = 167/186 (89.78%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSS-----MSSASSSPSPPQP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV  ASSSSPAPP+SS      SS+SSS S PQP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSPAPPASSNTASTSSSSSSSSSLPQP 60

Query: 61  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEE 120
           PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAK  KIAAPVVEE
Sbjct: 61  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEE 120

Query: 121 SLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRA 180
           S A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKD EEKKRIDEEKAILKEFIRA
Sbjct: 121 SFAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDHEEKKRIDEEKAILKEFIRA 180

Query: 181 KSIPNV 182
           KSIPN+
Sbjct: 181 KSIPNL 186
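The percentages in each HSP header are the match counts divided by the alignment length, which includes gap columns. A small Python sketch reproducing the figures for this first hit follows; the helper name `percent` is hypothetical, and the counts are taken from the header above.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

# Counts from the HSP header of the XP_022992414.1 hit.
identical = 163
positive = 167
alignment_length = 186  # aligned columns, gaps included

print(percent(identical, alignment_length))  # 87.63
print(percent(positive, alignment_length))   # 89.78
```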

BLAST of HG10023482 vs. NCBI nr
Match: XP_004141680.1 (uncharacterized protein LOC101218777 isoform X2 [Cucumis sativus] >KGN45532.1 hypothetical protein Csa_015973 [Cucumis sativus])

HSP 1 Score: 287.0 bits (733), Expect = 1.2e-73
Identity = 158/181 (87.29%), Positives = 168/181 (92.82%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSASSSPSPPQPPKESF 60
           MS+E  PKLYANKP KAQ+KQFQE+HK  DASSS+    SS+M+SASSSP PPQPPKESF
Sbjct: 1   MSEEGLPKLYANKPTKAQIKQFQERHKAGDASSSA----SSNMASASSSPPPPQPPKESF 60

Query: 61  ARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEESLARP 120
           ARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA PDSAK TKIAAPVVEESLARP
Sbjct: 61  ARRYKFLWPMLLTVNLAVGAYVFMRTKKQDEHVAEEEAAPDSAKTTKIAAPVVEESLARP 120

Query: 121 TIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN 180
            +VEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP+
Sbjct: 121 VVVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPS 177

Query: 181 V 182
           +
Sbjct: 181 I 177

BLAST of HG10023482 vs. NCBI nr
Match: XP_008462359.1 (PREDICTED: uncharacterized protein LOC103500733 isoform X2 [Cucumis melo] >ADN34105.1 hypothetical protein [Cucumis melo subsp. melo])

HSP 1 Score: 286.2 bits (731), Expect = 2.0e-73
Identity = 161/182 (88.46%), Positives = 168/182 (92.31%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKES 60
           MS+E  PKLYANKP KAQ+KQFQEQHK  DASSS+    SSSM+SA SSSP PPQPPKES
Sbjct: 1   MSEEGLPKLYANKPTKAQIKQFQEQHKAGDASSSA----SSSMASASSSSPPPPQPPKES 60

Query: 61  FARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEESLAR 120
           FARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA PDSAK TKIAAPVVEESLA+
Sbjct: 61  FARRYKFLWPMLLTVNLAVGAYVFMRTKKQDEHVAEEEAAPDSAKTTKIAAPVVEESLAK 120

Query: 121 PTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 180
           P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP
Sbjct: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 178

Query: 181 NV 182
           N+
Sbjct: 181 NI 178

BLAST of HG10023482 vs. NCBI nr
Match: KAG6575480.1 (hypothetical protein SDJN03_26119, partial [Cucurbita argyrosperma subsp. sororia] >KAG7014023.1 hypothetical protein SDJN02_24194 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 285.4 bits (729), Expect = 3.4e-73
Identity = 160/182 (87.91%), Positives = 168/182 (92.31%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSSMSSASSSPSPPQPPKES 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV S +SSSSPAPP+SS ++++SS S PQPPKES
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSSPAPPASSNTASTSSSSLPQPPKES 60

Query: 61  FARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEESLAR 120
           FARRYKFLWPMLLTVNLAVGAYL MRTKKQDE V EEEA PDSAK  KIAAPVVEES A+
Sbjct: 61  FARRYKFLWPMLLTVNLAVGAYLLMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEESFAK 120

Query: 121 PTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 180
           P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP
Sbjct: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 180

Query: 181 NV 182
           N+
Sbjct: 181 NL 182

BLAST of HG10023482 vs. NCBI nr
Match: XP_022954191.1 (uncharacterized protein LOC111456527 isoform X1 [Cucurbita moschata])

HSP 1 Score: 285.0 bits (728), Expect = 4.4e-73
Identity = 163/185 (88.11%), Positives = 169/185 (91.35%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSS---MSSASSSPSPPQPP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV S +SSSSPAPP+SS    +S+SSS S PQPP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSSPAPPASSNTASTSSSSSSSLPQPP 60

Query: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEES 120
           KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAK  KIAAPVVEES
Sbjct: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEES 120

Query: 121 LARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK 180
            A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK
Sbjct: 121 FAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK 180

Query: 181 SIPNV 182
           SIPN+
Sbjct: 181 SIPNL 185

BLAST of HG10023482 vs. ExPASy TrEMBL
Match: A0A6J1JXH4 (uncharacterized protein LOC111488728 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488728 PE=4 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 2.5e-74
Identity = 163/186 (87.63%), Positives = 167/186 (89.78%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSS-----MSSASSSPSPPQP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV  ASSSSPAPP+SS      SS+SSS S PQP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSPAPPASSNTASTSSSSSSSSSLPQP 60

Query: 61  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEE 120
           PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAK  KIAAPVVEE
Sbjct: 61  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEE 120

Query: 121 SLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRA 180
           S A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKD EEKKRIDEEKAILKEFIRA
Sbjct: 121 SFAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDHEEKKRIDEEKAILKEFIRA 180

Query: 181 KSIPNV 182
           KSIPN+
Sbjct: 181 KSIPNL 186

BLAST of HG10023482 vs. ExPASy TrEMBL
Match: A0A0A0KAZ6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G451380 PE=4 SV=1)

HSP 1 Score: 287.0 bits (733), Expect = 5.6e-74
Identity = 158/181 (87.29%), Positives = 168/181 (92.82%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSASSSPSPPQPPKESF 60
           MS+E  PKLYANKP KAQ+KQFQE+HK  DASSS+    SS+M+SASSSP PPQPPKESF
Sbjct: 1   MSEEGLPKLYANKPTKAQIKQFQERHKAGDASSSA----SSNMASASSSPPPPQPPKESF 60

Query: 61  ARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEESLARP 120
           ARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA PDSAK TKIAAPVVEESLARP
Sbjct: 61  ARRYKFLWPMLLTVNLAVGAYVFMRTKKQDEHVAEEEAAPDSAKTTKIAAPVVEESLARP 120

Query: 121 TIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN 180
            +VEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP+
Sbjct: 121 VVVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPS 177

Query: 181 V 182
           +
Sbjct: 181 I 177

BLAST of HG10023482 vs. ExPASy TrEMBL
Match: E5GCA6 (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 286.2 bits (731), Expect = 9.6e-74
Identity = 161/182 (88.46%), Positives = 168/182 (92.31%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKES 60
           MS+E  PKLYANKP KAQ+KQFQEQHK  DASSS+    SSSM+SA SSSP PPQPPKES
Sbjct: 1   MSEEGLPKLYANKPTKAQIKQFQEQHKAGDASSSA----SSSMASASSSSPPPPQPPKES 60

Query: 61  FARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEESLAR 120
           FARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA PDSAK TKIAAPVVEESLA+
Sbjct: 61  FARRYKFLWPMLLTVNLAVGAYVFMRTKKQDEHVAEEEAAPDSAKTTKIAAPVVEESLAK 120

Query: 121 PTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 180
           P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP
Sbjct: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 178

Query: 181 NV 182
           N+
Sbjct: 181 NI 178

BLAST of HG10023482 vs. ExPASy TrEMBL
Match: A0A1S3CGT4 (uncharacterized protein LOC103500733 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103500733 PE=4 SV=1)

HSP 1 Score: 286.2 bits (731), Expect = 9.6e-74
Identity = 161/182 (88.46%), Positives = 168/182 (92.31%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKES 60
           MS+E  PKLYANKP KAQ+KQFQEQHK  DASSS+    SSSM+SA SSSP PPQPPKES
Sbjct: 1   MSEEGLPKLYANKPTKAQIKQFQEQHKAGDASSSA----SSSMASASSSSPPPPQPPKES 60

Query: 61  FARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEESLAR 120
           FARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA PDSAK TKIAAPVVEESLA+
Sbjct: 61  FARRYKFLWPMLLTVNLAVGAYVFMRTKKQDEHVAEEEAAPDSAKTTKIAAPVVEESLAK 120

Query: 121 PTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 180
           P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP
Sbjct: 121 PAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP 178

Query: 181 NV 182
           N+
Sbjct: 181 NI 178

BLAST of HG10023482 vs. ExPASy TrEMBL
Match: A0A6J1GQ86 (uncharacterized protein LOC111456527 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456527 PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 2.1e-73
Identity = 163/185 (88.11%), Positives = 169/185 (91.35%), Query Frame = 0

Query: 1   MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSS---MSSASSSPSPPQPP 60
           MS EEPPKLYANKPKKAQVKQFQEQHKV S +SSSSPAPP+SS    +S+SSS S PQPP
Sbjct: 1   MSGEEPPKLYANKPKKAQVKQFQEQHKVMSASSSSSPAPPASSNTASTSSSSSSSLPQPP 60

Query: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAAPVVEES 120
           KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEEA PDSAK  KIAAPVVEES
Sbjct: 61  KESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEQVTEEEAAPDSAKTAKIAAPVVEES 120

Query: 121 LARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK 180
            A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK
Sbjct: 121 FAKPAIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAK 180

Query: 181 SIPNV 182
           SIPN+
Sbjct: 181 SIPNL 185

BLAST of HG10023482 vs. TAIR 10
Match: AT1G55160.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion, plastid; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G19530.1); Has 63 Blast hits to 63 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 160.2 bits (404), Expect = 1.5e-39
Identity = 107/188 (56.91%), Positives = 134/188 (71.28%), Query Frame = 0

Query: 4   EEPPKLYANKPKK----AQVKQFQEQ-HKVSDASSSSPAPPSSSMSS---ASSSPSPPQP 63
           EE PKL+ NKPKK    AQ+K  +   +  +   SS P+P +++ +S      S  PP P
Sbjct: 3   EETPKLFTNKPKKKAIIAQLKHVEANFNNPTVPPSSKPSPAAAAAASYTMGGGSVPPPPP 62

Query: 64  PKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD-EHVAEEEAGPDSAKITKIAAPV-V 123
           PKESFARRYK++WP+LLTVNLAVG YLF RTKK+D + V EE A    AK + +AAPV V
Sbjct: 63  PKESFARRYKYVWPLLLTVNLAVGGYLFFRTKKKDLDPVVEETA----AKSSSVAAPVTV 122

Query: 124 EESLARPTIVEPV--KVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKE 180
           E++L+   + EPV  K REPIP  QQRELFKW+LEEKRK+ PK+ EEKKR DEEKAILK+
Sbjct: 123 EKTLSSTVVAEPVVIKAREPIPEKQQRELFKWMLEEKRKVNPKNAEEKKRNDEEKAILKQ 182

BLAST of HG10023482 vs. TAIR 10
Match: AT1G55160.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion, plastid; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G19530.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 152.1 bits (383), Expect = 4.2e-37
Identity = 89/135 (65.93%), Positives = 106/135 (78.52%), Query Frame = 0

Query: 49  SPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD-EHVAEEEAGPDSAKITK 108
           S  PP PPKESFARRYK++WP+LLTVNLAVG YLF RTKK+D + V EE A    AK + 
Sbjct: 5   SVPPPPPPKESFARRYKYVWPLLLTVNLAVGGYLFFRTKKKDLDPVVEETA----AKSSS 64

Query: 109 IAAPV-VEESLARPTIVEPV--KVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDE 168
           +AAPV VE++L+   + EPV  K REPIP  QQRELFKW+LEEKRK+ PK+ EEKKR DE
Sbjct: 65  VAAPVTVEKTLSSTVVAEPVVIKAREPIPEKQQRELFKWMLEEKRKVNPKNAEEKKRNDE 124

Query: 169 EKAILKEFIRAKSIP 180
           EKAILK+FI +K+IP
Sbjct: 125 EKAILKQFIGSKTIP 135

BLAST of HG10023482 vs. TAIR 10
Match: AT1G55160.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion, plastid; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G19530.1); Has 63 Blast hits to 63 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 63; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 146.7 bits (369), Expect = 1.8e-35
Identity = 107/213 (50.23%), Positives = 135/213 (63.38%), Query Frame = 0

Query: 4   EEPPKLYANKPKK----AQVKQFQEQ-HKVSDASSSSPAPPSSSMSS---ASSSPSPPQP 63
           EE PKL+ NKPKK    AQ+K  +   +  +   SS P+P +++ +S      S  PP P
Sbjct: 3   EETPKLFTNKPKKKAIIAQLKHVEANFNNPTVPPSSKPSPAAAAAASYTMGGGSVPPPPP 62

Query: 64  PKESFARRYKFLWPMLLTVNLAVG-------------------------AYLFMRTKKQD 123
           PKESFARRYK++WP+LLTVNLAVG                         +YLF RTKK+D
Sbjct: 63  PKESFARRYKYVWPLLLTVNLAVGGFCSSLDENRIVFSFIFMMLRVIYDSYLFFRTKKKD 122

Query: 124 -EHVAEEEAGPDSAKITKIAAPV-VEESLARPTIVEPV--KVREPIPVDQQRELFKWILE 180
            + V EE A    AK + +AAPV VE++L+   + EPV  K REPIP  QQRELFKW+LE
Sbjct: 123 LDPVVEETA----AKSSSVAAPVTVEKTLSSTVVAEPVVIKAREPIPEKQQRELFKWMLE 182

BLAST of HG10023482 vs. TAIR 10
Match: AT2G19530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G55160.2); Has 461 Blast hits to 346 proteins in 80 species: Archae - 0; Bacteria - 16; Metazoa - 89; Fungi - 28; Plants - 57; Viruses - 0; Other Eukaryotes - 271 (source: NCBI BLink). )

HSP 1 Score: 89.0 bits (219), Expect = 4.3e-18
Identity = 70/181 (38.67%), Positives = 93/181 (51.38%), Query Frame = 0

Query: 48  SSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAE------------ 107
           SSPS  +PP++  ++  K  W   +  NL   AY+F   +++D    E            
Sbjct: 3   SSPSGSEPPQKVVSKLQKVGWRATMIFNLGFAAYIFAIKREKDIDADEKKKVKKGSEARH 62

Query: 108 ------------EEAG---PDSAKITKIAAPVVEE-------------------SLARPT 167
                       E+ G    D AK  + A P  EE                   S+ +  
Sbjct: 63  KGVKKGAVNTEIEKKGAEETDKAKEAETAIPEKEETKLIPELDPLFEFTDATDQSMFQTV 122

Query: 168 IVEPVKV-REPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN 182
             E VKV R+PIP D+Q+ELFKWILEEKRKI+PKDR+EKK+IDEEKAILK+FIRA+ IP 
Sbjct: 123 ATEHVKVARKPIPEDEQKELFKWILEEKRKIEPKDRKEKKQIDEEKAILKQFIRAERIPK 182

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022992414.1  5.2e-74  87.63         uncharacterized protein LOC111488728 isoform X1 [Cucurbita maxima]
XP_004141680.1  1.2e-73  87.29         uncharacterized protein LOC101218777 isoform X2 [Cucumis sativus] >KGN45532.1 hy...
XP_008462359.1  2.0e-73  88.46         PREDICTED: uncharacterized protein LOC103500733 isoform X2 [Cucumis melo] >ADN34...
KAG6575480.1    3.4e-73  87.91         hypothetical protein SDJN03_26119, partial [Cucurbita argyrosperma subsp. sorori...
XP_022954191.1  4.4e-73  88.11         uncharacterized protein LOC111456527 isoform X1 [Cucurbita moschata]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1JXH4      2.5e-74  87.63         uncharacterized protein LOC111488728 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0KAZ6      5.6e-74  87.29         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G451380 PE=4 SV=1
E5GCA6          9.6e-74  88.46         Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
A0A1S3CGT4      9.6e-74  88.46         uncharacterized protein LOC103500733 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1GQ86      2.1e-73  88.11         uncharacterized protein LOC111456527 isoform X1 OS=Cucurbita moschata OX=3662 GN...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT1G55160.1     1.5e-39  56.91         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G55160.2     4.2e-37  65.93         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G55160.3     1.8e-35  50.23         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G19530.1     4.3e-18  38.67         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term     Source Description                                  Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                 coord: 1..59
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                 coord: 24..48
None      No IPR available  PANTHER      PTHR34364:SF11  WAS/WASL-INTERACTING PROTEIN FAMILY MEMBER 3-LIKE   coord: 5..179
None      No IPR available  PANTHER      PTHR34364       WAS/WASL-INTERACTING FAMILY PROTEIN                 coord: 5..179

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10023482.1  HG10023482.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane