HG10001715 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10001715
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: ATP synthase protein I -related
Location: Chr09: 19760563 .. 19765853 (+)
RNA-Seq Expression: HG10001715
Synteny: HG10001715
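
As a quick sanity check on the Location field, the following sketch parses a location string in the form shown in the Overview and computes the span of the gene. It assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page. The function name `parse_location` is hypothetical and chosen only for illustration.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr09: 19760563 .. 19765853 (+)'.

    Returns (chromosome, start, end, strand). Hypothetical helper;
    the input format is taken from the Overview block of this record.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr09: 19760563 .. 19765853 (+)")

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span = end - start + 1
print(chrom, strand, span)  # Chr09 + 5291
```

Under that assumption the gene, including introns, spans 5291 bp on the plus strand of Chr09.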
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGTTCTTAACTACATCTCAGCTACCTCCACTCCCATCTCCCAAGATTCTTCAATTTCACCTCCAATACCAGACCCAAGGCAAACCAAGGTCATTCTCCCCAAAAAGAAGCCGGAAAAATGGTCCACCGGAGTTTCCCCGGGCGAGTACGGCGGCCCCCCTACCACAACCAAGCTCCGTAAGTACTGGGGTGGTGAAAAAGATGACCCTTTAACTTCCGACGATTATATCTGGAATAGAGAATTCATGGCCCGGATGAAAAAGTTTGTCAAGGACCAACCTGCTGATGATTTATCTCTTACGGTCAATAAAGCGAAGGTTTCCCTTCTAATTCCTTCTCCTTCACGCATTTCCCTCTTTCTATTCTATTCCATTCCACCTTACTATTGCCATTATCAGTTATTGTTCAATTATTCGTCTCTCTGCTTAATTCCTCATGGAATATCATTTGGTGTGGTTGGACATTGGTTCTTTCTTTGCCTGATTAAAGTTAAATGCCCATTTTAAGCTATGTTTACTTTGGTATAGTCTTAAAGTGTTGAACTAAGTGTTACTCATTAGCTATATGAACTTTGTGAAATATAATCTTCAATTCTTCCTGTTTGAGAACCAAGCACTTGCTAGATTATTCGGTTATCTTGGAAAATGGAAAGTGTTTTATGATGCCATCTGTAATGTGTTGGTTAATTGATAATCTGACTGCTGATGACATGATCTTATTGTATCAGGACGAGCCTTCTGGATTTCTTAGCTTGAATAGAGTCATGGCCCTTGATAGGTTGGTTCTCACTGCTTACTTATATGTTAACTGCAATCACGTTTTTTATTATTCTGTGTTTCAGCATTATAATGTTTCAACTTTCAAATAATGCAAGTACAGTTTGGAAGTTGATTTGAGCAGGGAACTATTGGCTCCTCCAATGCCTCGGTCAGAAGACGTAGTTGAGAAAAATATTCCGGTACGATGTCATTATTTATCCCACCCCTGAAATCTTTCATAAAATGTGGAAGATTCTACTTCCCTGACTAATTATGTAAACGACTATGGAGGGTATTTGTAGAATATGAGCAACTTCAAGTATGCTGAGATCTCAGAATATCTTTAATTTGTTATATTTCTTTTACTAAAATGGCAATGAAATGATGGGTCAGATTGATAACCGCAAGTCACCCAGATGGAAGTTAGCACCGACAAGGCGTGAGCAAGAGAAGTGGGATAGGGCATATAAGGCAGCTACTGGAGGCAGTGTAAGTCCTTTGACCCCCCTTGGTTTATCTCTAAAGTCTCATCTAAAGAGTCCAGCTTTTCTTATAGTGCTTTGAAATTGAAATAGTGTTAATTTTATGGTTTGATTTGAATTTCTTTGGAGCTTATGAATCTATTGGCAACTTATAATATTTAGAATGAAATATTTCTTTAAGATGTTTACTTGTTTTGAAAAGAAGCTTAATGATGAATTGTAGGATGTGATGTTTCGAGAATTGAGACGGCCTCAAGGGGATCCAGAAGTATTGGCTGCCTTATCCAGGGAACAGTATTTTAAGGTGCTTTGTTGTAAAATGGCTTCATGATATTTCTGTTTACAGTTACACTCCTTTGCTTCATATATTGAATGTTGTTTCCTTCCCTTTTGTTTTCTTTTGTCTTCTCAGTTAAAGAAGAAGATGCAAATTTTAACACTGGCAATAGGAGGTGTTGGTTTGTTCTCAGCTTATGTTTCATATTCTCCGGAAGTTGCTGCTAGGTTCGTTTGATGTTAATTTCTTTAAACAATTTTGCTGGGATTCTAGTAATGATCTGGGTGGTCTCCTTTTGAAATATTGACTGCTGCAAATGCATTTAGTCTTAGTAATCAAATTCCTATTATTATAAGTTGATCTAATAAATTGTTTGGAAAAATTACAAATTTAGTCCCTTAACTTTGAGGTTTGTGTCTATTGGTCCTTGAACTTTCAAAGTTACAACTTTAGTCTCTAAATGTTTGAAATTTTAAAAT
TCACGGATCTACTAGATTCTTTAGAAATGACAAATGGTATCTTTTAAAAATTCAAATTGTTATTTAAAAATTTTAAATTTGTCAGAGACCCATTAGACACGAAGTTGAAAGTTTAGGGGCCTAATAGACACTTTTGAAAGTCTAGGGACCAAATAAACACAAACCTCAAAGTTTAGGGATAAGACTAAGCTTGTAATTTAACCTAAATTGTTTTTATGGTCATACAAGTGGTTTGTGCTAAAGGTGTTCTAATGCACGTAAGTGAAAGAATACCTCAAAACTTTATTAAAATGATACTTGAGTACACAAATGGCTGCCCTATTTATAAGTAAGGCAGCAACTAACAATAGGTTGTGGGCTTTTCTTGAGCTCTCCGACCAACTAACAATCGGTTTTGGCAAAACCAACTCCGATCAAAGCTATCAAAAAATCGATTGACCGATCGATTCGGTCAGTTTTCAACCTAACTGTATTCACCCCCAAATAAATTGGTTCAAAATTGAAACAAGATTATTTTCGAGCAATCTATTGTCAAATAAATCAGTTTTTTATGCTCTTGCGGCCGTTGACAATCTCAATCAACTTGTTTGAATACTCTTCAATGGACTCAGAATCCTTCATTTGCATATGAATTCTCTCATCAAGTTCAACACTTCCATGCCTGTCATTCTTTCATTTTCTTCATACTCACCTTTGAGGAACTCCCAAATCTACTTTGCTGACTTCATGGCCATAATTTTGTTGAATATGGTAGGAGACACAGTTGCATATTGGCAAACTCTAGCCTTCGCCTTCCTAGTGACCTCCTTATAAGCCTTGATCTGATTTATTGCTGGGTTATCATAAAGTGGAGCAACATCATAGTATTGCTCAATTGCTTCCCCTGATCATAACCCTCCATGTAGGCTTGCATTCTGACAGCCCATACTTCGTAGTTTTCACCATAGGTGGAGCTTGTGAAGAGAGGCGGCTAGATTATGATATGACTCTATTTCACTCAATTCTCACAGCTGTTCACTCGTTTCTATTTAGAGATATCTCGCTCACAGGCCCCTTAAGAATTTAATGCTCTAATACCATTGAAGATCGAGAGAGAGACATGGGGAAGTTATACATTTTCATTCAATGGCTACTCGAAGTAAATCCAGACTTTAATGCTAAAGAAAGCTTATCTAACTCAAGCTTATCAAACTCCACAAAAGTAGTAACTTGGTACTTGGTATTGATGTATGAAGAAAGTGGGGAAGACCTTATCCACTTCTTTCTTAGATGTAAATTTATTTTTTAGCTTGCTTTTTATGGCTTGGTATGCCCGGAGAGACTTTAGGAAGCTAGTGGAACAATGTATAGGTACCCTTTTGCAAGAAGAAAGGGAAAGTTCTTTCAAGAAATGCTTGTTGTGGGCTTGTGTTGTTTATTTCATTGGATAGACATGCTAGTATCTTTGAAGGAACTGAAATGGATTTTAATTTCTTCTGGAGCTCTTATTAGGAGTGTTCAAAATACCCAATGACTTGAAAAATCTGATCAACCCAACCCAACCCATACGGTTTGAGTTGGGTTATCAACTCATTTGGGTTGGCTGGATTCAAATAAATGAAAATTTTATGGGTTGGGTTGGTTCATGGTTAACCTAAAATAACTCAAACTAACCCGAACCAATCCGAACCAACCCGAATATATTATTAATTTTAAAAAATATGTTTTTTTTTACTTATGATACATTATATATACATATATTAATTTTAATTCTATTTAATTTCATTATTTTTTTGATAATTTATTATCCAACAACTCTTAAAAACAATGCTTTGGGGTGAGGGGTTAACAAGTCTCTGTTGTCTATTAGAAAAATTAAAGAAGTAAAAGGAAAAAGTGAAATGATTTGGTCTTTCGTAACCACTTGATCATGGTGTGTGATTAATAAATGTTGAGTAACTTGAGTTTCTAATTATTATGATTTTAATTTCTTTTTTTAAATCCTGCCAAGCTAACAACAA
GAATTGCCATTTATTGTGAATTATTAACAATTTGTCGAGGACATGGGATTATGTGGCATGATACTTTGTTGGAGACACATTCGTAGATTATTTGACACTTCCTTCCCAATTTCAGCTTTGGTGCTGGGCTAATTGGTTCTCTTGTGTACATACGAATGCTGGGGAGTAGTGTGGATTCTCTGGCAGATGGAGCAAAAGGACTTGTTAAGTATGTATTACACAACTGTTTTTGTTACAGTAGAGTATTGATGATTTTCACTAAATCGTGTTGTATTCTATTTCTTTCTCATTAGATTATGGTATAAGCCATGGTGAATTACTGTGTTGTCCTTCTTATGATTTTAAGTATGGACCGTACAGTGCATCTTGTATTTTCTTTCATTTTTTCTTAATAAAAGTTTGGTTTTCTATTTTTATTTTTTTTAAAAAAAAAACAGAAGATAAAAAGAATAAAAGACAAGTTAAGTCTGGTGCATCCCCAAACATCAAGATCTTTTAGGACAAATCTATATAAGATAATTCACAATTCATCTTGTTGATACAGGTGGTGCTTTGAGAGTTTATTTATATAACATGAAGTTCCATGATTCGAAGTCATGAACTATACATTCTATGTTGATAATTTTTTGTACCACTATAATGGAATACCGATTTAATTTTTTTCTTGAAGAACAATATTGAACTTGTTGAATTTCATGTCATCTTAATGAAATCGTTGCAATTTTGTTGCAGGGGAGCCGTTGCACAACCACGGTTATTAGTTCCAGTCATATTGGTGATGGTATATAACCGTTGGAACGGGTAAGTATGCTCACTGCCTTGTTAGTTTTGGCAATTTTCCCCCCTCTATATGGCCACCATAATCATTCGCACCCATTACTAGAACTTTCATTTTCTCTCTTGTCGTCTAGTGTTATTTGAAACTGAAATTTTTCTTAGGAAATGAAAACGTACATACGAAGGGATGTGAATCCTGTAGGTCCCCATTACTGATTGTGTACTTTTTTTTCTTTCTTTTGGAATAAAGCCTGGTCTTACTCTTGACTGACACCACAGACTTTGAATTTAGAATGTGAAATCAATAAAACATTTTATCAGGCACAGCTGGGGTCCTAATTAGCCAAATTTTCTACCCCCTTTTTTCTTTTGTAGTATTCTTGTTGAAGATTATGGAGTTATGCATTTACAGTTGATACCAATGCTAGTTGGGTTCTTCACATACAAAGTTGCTACTTTTGTTCAAGCTTTAGAGGAGGCACTTACCGTCGTTAAGAACGAGCCGCAAGCCTAG

mRNA sequence

ATGGCTGTTCTTAACTACATCTCAGCTACCTCCACTCCCATCTCCCAAGATTCTTCAATTTCACCTCCAATACCAGACCCAAGGCAAACCAAGGTCATTCTCCCCAAAAAGAAGCCGGAAAAATGGTCCACCGGAGTTTCCCCGGGCGAGTACGGCGGCCCCCCTACCACAACCAAGCTCCGTAAGTACTGGGGTGGTGAAAAAGATGACCCTTTAACTTCCGACGATTATATCTGGAATAGAGAATTCATGGCCCGGATGAAAAAGTTTGTCAAGGACCAACCTGCTGATGATTTATCTCTTACGGTCAATAAAGCGAAGGACGAGCCTTCTGGATTTCTTAGCTTGAATAGAGTCATGGCCCTTGATAGGGAACTATTGGCTCCTCCAATGCCTCGGTCAGAAGACGTAGTTGAGAAAAATATTCCGATTGATAACCGCAAGTCACCCAGATGGAAGTTAGCACCGACAAGGCGTGAGCAAGAGAAGTGGGATAGGGCATATAAGGCAGCTACTGGAGGCAGTGATGTGATGTTTCGAGAATTGAGACGGCCTCAAGGGGATCCAGAAGTATTGGCTGCCTTATCCAGGGAACAGTATTTTAAGTTAAAGAAGAAGATGCAAATTTTAACACTGGCAATAGGAGGTGTTGGTTTGTTCTCAGCTTATGTTTCATATTCTCCGGAAGTTGCTGCTAGCTTTGGTGCTGGGCTAATTGGTTCTCTTGTGTACATACGAATGCTGGGGAGTAGTGTGGATTCTCTGGCAGATGGAGCAAAAGGACTTGTTAAGGGAGCCGTTGCACAACCACGGTTATTAGTTCCAGTCATATTGGTGATGGTATATAACCGTTGGAACGGTATTCTTGTTGAAGATTATGGAGTTATGCATTTACAGTTGATACCAATGCTAGTTGGGTTCTTCACATACAAAGTTGCTACTTTTGTTCAAGCTTTAGAGGAGGCACTTACCGTCGTTAAGAACGAGCCGCAAGCCTAG

Coding sequence (CDS)

ATGGCTGTTCTTAACTACATCTCAGCTACCTCCACTCCCATCTCCCAAGATTCTTCAATTTCACCTCCAATACCAGACCCAAGGCAAACCAAGGTCATTCTCCCCAAAAAGAAGCCGGAAAAATGGTCCACCGGAGTTTCCCCGGGCGAGTACGGCGGCCCCCCTACCACAACCAAGCTCCGTAAGTACTGGGGTGGTGAAAAAGATGACCCTTTAACTTCCGACGATTATATCTGGAATAGAGAATTCATGGCCCGGATGAAAAAGTTTGTCAAGGACCAACCTGCTGATGATTTATCTCTTACGGTCAATAAAGCGAAGGACGAGCCTTCTGGATTTCTTAGCTTGAATAGAGTCATGGCCCTTGATAGGGAACTATTGGCTCCTCCAATGCCTCGGTCAGAAGACGTAGTTGAGAAAAATATTCCGATTGATAACCGCAAGTCACCCAGATGGAAGTTAGCACCGACAAGGCGTGAGCAAGAGAAGTGGGATAGGGCATATAAGGCAGCTACTGGAGGCAGTGATGTGATGTTTCGAGAATTGAGACGGCCTCAAGGGGATCCAGAAGTATTGGCTGCCTTATCCAGGGAACAGTATTTTAAGTTAAAGAAGAAGATGCAAATTTTAACACTGGCAATAGGAGGTGTTGGTTTGTTCTCAGCTTATGTTTCATATTCTCCGGAAGTTGCTGCTAGCTTTGGTGCTGGGCTAATTGGTTCTCTTGTGTACATACGAATGCTGGGGAGTAGTGTGGATTCTCTGGCAGATGGAGCAAAAGGACTTGTTAAGGGAGCCGTTGCACAACCACGGTTATTAGTTCCAGTCATATTGGTGATGGTATATAACCGTTGGAACGGTATTCTTGTTGAAGATTATGGAGTTATGCATTTACAGTTGATACCAATGCTAGTTGGGTTCTTCACATACAAAGTTGCTACTTTTGTTCAAGCTTTAGAGGAGGCACTTACCGTCGTTAAGAACGAGCCGCAAGCCTAG

Protein sequence

MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKLRKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNRVMALDRELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAATGGSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDYGVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA
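
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First six and last four codons of the CDS listed above.
print(translate("ATGGCTGTTCTTAACTAC"))  # MAVLNY
print(translate("CCGCAAGCCTAG"))        # PQA
```

Both fragments agree with the start (MAVLNY) and end (PQA) of the protein sequence shown above. Passing the full CDS string to `translate` gives the whole polypeptide for comparison.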
Homology
BLAST of HG10001715 vs. NCBI nr
Match: XP_038901394.1 (protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic [Benincasa hispida])

HSP 1 Score: 621.7 bits (1602), Expect = 3.7e-174
Identity = 320/339 (94.40%), Positives = 328/339 (96.76%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MAVLNYISATSTPISQDSS+SPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL
Sbjct: 1   MAVLNYISATSTPISQDSSVSPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNRVM 120
           RK WGGEKDDPLTSDDYIWNREFMAR+KKFVKDQP DDLSLTVNKAKDEPSGFLSLNRVM
Sbjct: 61  RKVWGGEKDDPLTSDDYIWNREFMARVKKFVKDQP-DDLSLTVNKAKDEPSGFLSLNRVM 120

Query: 121 ALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAATG 180
           ALD       +EL APP+PRSED+VEKNIPID+RKSPRWKLAPTRREQEKWDRAYKAATG
Sbjct: 121 ALDSLEVDLSKELSAPPVPRSEDLVEKNIPIDSRKSPRWKLAPTRREQEKWDRAYKAATG 180

Query: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAAS 240
           GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAAS
Sbjct: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAAS 240

Query: 241 FGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300
           FGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVM+YNRWNGILVEDY
Sbjct: 241 FGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMLYNRWNGILVEDY 300

Query: 301 GVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           GVM LQLIPMLVGFFTYKVATFVQALEEALTVVKN+PQA
Sbjct: 301 GVMQLQLIPMLVGFFTYKVATFVQALEEALTVVKNKPQA 338
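
The percentage on each HSP summary line is the ratio of the two counts printed beside it. The sketch below recomputes it from the "Identity = matches/length" field as a consistency check. The regular expression and the name `identity_percent` are assumptions made for illustration, based only on the line layout shown in this report.

```python
import re

# Matches the identity field of an HSP summary line, e.g.
# "Identity = 320/339 (94.40%)".
IDENTITY_FIELD = re.compile(r"Identity = (\d+)/(\d+) \((\d+\.\d+)%\)")

def identity_percent(line):
    """Return (recomputed_percent, reported_percent) for one HSP line."""
    m = IDENTITY_FIELD.search(line)
    if m is None:
        raise ValueError("no identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    reported = float(m.group(3))
    return round(100 * matches / length, 2), reported

computed, reported = identity_percent("Identity = 320/339 (94.40%)")
print(computed, reported)  # 94.4 94.4
```

The denominator is the alignment length reported by BLAST for that HSP.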

BLAST of HG10001715 vs. NCBI nr
Match: KAG6604770.1 (Protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 595.9 bits (1535), Expect = 2.1e-166
Identity = 303/339 (89.38%), Positives = 318/339 (93.81%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MA+LNYISATSTPISQDSSI+PP+PDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL
Sbjct: 1   MAILNYISATSTPISQDSSIAPPLPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNRVM 120
           RKYWGGEKDDPLTSDDYIWNREFM RMKK +KD+P DD S+ VNKAKDEPSGFLSLNRVM
Sbjct: 61  RKYWGGEKDDPLTSDDYIWNREFMGRMKKLMKDKP-DDSSVRVNKAKDEPSGFLSLNRVM 120

Query: 121 ALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAATG 180
            LD       +EL+APPMP  E+VVE+ I +DNRKSPRW+LAPTRREQEKWDRAYKAATG
Sbjct: 121 TLDSLEVDLSKELMAPPMPLKENVVEEKIQVDNRKSPRWRLAPTRREQEKWDRAYKAATG 180

Query: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAAS 240
           GSDVMFRELRRPQGDPEVLAALSREQYFKLKKK+Q LTLAIGGVGL SAYVSYSPEVAAS
Sbjct: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKLQTLTLAIGGVGLVSAYVSYSPEVAAS 240

Query: 241 FGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300
           FGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY
Sbjct: 241 FGAGLIGSLVYVRMLGSSVDSLADGARGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300

Query: 301 GVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           GVM LQLIPMLVGFFTYKVATFVQALEEALTV KNEPQA
Sbjct: 301 GVMQLQLIPMLVGFFTYKVATFVQALEEALTVTKNEPQA 338

BLAST of HG10001715 vs. NCBI nr
Match: XP_022970893.1 (uncharacterized protein LOC111469729 [Cucurbita maxima])

HSP 1 Score: 595.5 bits (1534), Expect = 2.8e-166
Identity = 303/339 (89.38%), Positives = 317/339 (93.51%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MA+LNYISATSTPISQDSSI PP+PDPRQTKVILPKKKPEKWSTGVSPG+YGGPPTTTKL
Sbjct: 1   MAILNYISATSTPISQDSSIPPPLPDPRQTKVILPKKKPEKWSTGVSPGDYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNRVM 120
           RKYWGGEKDDPLTSDDYIWNREFM RMKK +KDQP DD S+ VNKAKDEPSGFLSLNRVM
Sbjct: 61  RKYWGGEKDDPLTSDDYIWNREFMGRMKKLMKDQP-DDSSVRVNKAKDEPSGFLSLNRVM 120

Query: 121 ALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAATG 180
            LD       +EL+APPMP  E+VVE+ I +DNRKSPRW+LAPTRREQEKWDRAYKAATG
Sbjct: 121 TLDSLEVDLSKELMAPPMPLKENVVEEKIQVDNRKSPRWRLAPTRREQEKWDRAYKAATG 180

Query: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAAS 240
           GSDVMFRELRRPQGDPEVLAALSREQYFKLKKK+Q LTLAIGGVGL SAYVSYSPEVAAS
Sbjct: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKLQTLTLAIGGVGLVSAYVSYSPEVAAS 240

Query: 241 FGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300
           FGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY
Sbjct: 241 FGAGLIGSLVYVRMLGSSVDSLADGARGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300

Query: 301 GVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           GVM LQLIPMLVGFFTYKVATFVQALEEALTV KNEPQA
Sbjct: 301 GVMQLQLIPMLVGFFTYKVATFVQALEEALTVTKNEPQA 338

BLAST of HG10001715 vs. NCBI nr
Match: XP_008448233.1 (PREDICTED: uncharacterized protein LOC103490489 isoform X3 [Cucumis melo])

HSP 1 Score: 593.6 bits (1529), Expect = 1.1e-165
Identity = 306/340 (90.00%), Positives = 318/340 (93.53%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTG++PG+YGGPPTTTKL
Sbjct: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGIAPGDYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDE-PSGFLSLNRV 120
           RKYWGGEKDDPLTSDDYIWNREFM RMKKFVKDQP DDLS TV K KD+ PSGFLSLNRV
Sbjct: 61  RKYWGGEKDDPLTSDDYIWNREFMGRMKKFVKDQP-DDLSHTVYKPKDDKPSGFLSLNRV 120

Query: 121 MALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAAT 180
           M LD       +EL  PPMPRSED+VEKNIPI +RKSPRWKLAPTR EQEKWDRAYKAAT
Sbjct: 121 MTLDSLDVDLSKELSPPPMPRSEDLVEKNIPIGHRKSPRWKLAPTRHEQEKWDRAYKAAT 180

Query: 181 GGSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAA 240
           GGSDVMF+ELRRPQGDPE LAALS EQYFKLKKKMQILTLAIGGVGL SAYVSYSPEVAA
Sbjct: 181 GGSDVMFQELRRPQGDPEALAALSMEQYFKLKKKMQILTLAIGGVGLISAYVSYSPEVAA 240

Query: 241 SFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVED 300
           SFGAGLIGSLVYIRMLG+SVDSLADGAKGLVKGAVAQPRLLVPVILVM+YNRWNGILVED
Sbjct: 241 SFGAGLIGSLVYIRMLGNSVDSLADGAKGLVKGAVAQPRLLVPVILVMIYNRWNGILVED 300

Query: 301 YGVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           YGVM LQLIPMLVGFFTYKVATFVQA+EEALTVVKN+PQA
Sbjct: 301 YGVMQLQLIPMLVGFFTYKVATFVQAIEEALTVVKNKPQA 339

BLAST of HG10001715 vs. NCBI nr
Match: XP_022947232.1 (uncharacterized protein LOC111451157 [Cucurbita moschata] >XP_022947233.1 uncharacterized protein LOC111451157 [Cucurbita moschata])

HSP 1 Score: 589.7 bits (1519), Expect = 1.5e-164
Identity = 300/339 (88.50%), Positives = 315/339 (92.92%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MA+L+YISATSTPISQDSSI PP+PDPRQTKVILPKKKPEKWSTGVSPG+YGGPPTTTKL
Sbjct: 1   MAILHYISATSTPISQDSSIPPPLPDPRQTKVILPKKKPEKWSTGVSPGDYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNRVM 120
           RKYWGGEKDDPLTSDDYIWNREFM RMKK +KDQP DD S+ VN AKDEPSGFLSLNRVM
Sbjct: 61  RKYWGGEKDDPLTSDDYIWNREFMGRMKKLMKDQP-DDSSVRVNNAKDEPSGFLSLNRVM 120

Query: 121 ALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAATG 180
            LD       +EL+APPMP  E+VVE+ I +DNRKSPRW+LAPTRREQEKWDRAYKAATG
Sbjct: 121 TLDSLEVDLSKELMAPPMPLKENVVEEKIQVDNRKSPRWRLAPTRREQEKWDRAYKAATG 180

Query: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAAS 240
           GSDVMFRELRRPQGDPEVLAALSREQYFKLKKK+Q LTLAIGGVGL SAYVSYSPEVAAS
Sbjct: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKLQTLTLAIGGVGLVSAYVSYSPEVAAS 240

Query: 241 FGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300
           FGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY
Sbjct: 241 FGAGLIGSLVYVRMLGSSVDSLADGARGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300

Query: 301 GVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           GVM LQLIPMLVGFFTYKVATFVQALEEALTV KNEP A
Sbjct: 301 GVMQLQLIPMLVGFFTYKVATFVQALEEALTVTKNEPHA 338

BLAST of HG10001715 vs. ExPASy Swiss-Prot
Match: O82279 (Protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CGL160 PE=1 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 5.4e-120
Identity = 223/347 (64.27%), Positives = 272/347 (78.39%), Query Frame = 0

Query: 1   MAVLNYISATST--PISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTT 60
           MA+L+YISATST  PI QD S +  +P    TK+ILP KKPEKWSTGV+PGEYGGPPTTT
Sbjct: 1   MAILSYISATSTTPPIPQDQSPNSRLP----TKIILPNKKPEKWSTGVAPGEYGGPPTTT 60

Query: 61  KLRKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNR 120
           KLRKYWGGEK+DP+TS D IWNR+FM +MKK   D   +D SL  + +K++ SGFLS +R
Sbjct: 61  KLRKYWGGEKEDPITSTDLIWNRDFMDQMKKLFDD--PNDSSLDPSPSKEKSSGFLSFSR 120

Query: 121 VMALDR---ELLAPPMPRSEDVVEKNIPIDNRK----------SPRWKLAPTRREQEKWD 180
           VM+LD    +L       S+ VV+  +     +          SP+WKLAPTRREQEKWD
Sbjct: 121 VMSLDSMDVDLSKELASSSKSVVKNRLDTSKSEAKKQMSKAIVSPKWKLAPTRREQEKWD 180

Query: 181 RAYKAATGGSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVS 240
           RA KAATGGSDVMFRELRRP+GDPEV AA  REQYFKLK K+Q+LTL IGGVGL SAY+S
Sbjct: 181 RATKAATGGSDVMFRELRRPRGDPEVQAAKDREQYFKLKNKIQVLTLGIGGVGLVSAYIS 240

Query: 241 YSPEVAASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRW 300
           Y+PE+A SFGAGL+GSL Y+RMLG+SVD++ADGA+G+ KGA  QPRLLVPV+LVM++NRW
Sbjct: 241 YTPEIALSFGAGLLGSLAYMRMLGNSVDAMADGARGVAKGAANQPRLLVPVVLVMIFNRW 300

Query: 301 NGILVEDYGVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           N ILV +YG MHL+LIPMLVGFFTYK+ATF QA+EEA+++   +P++
Sbjct: 301 NAILVPEYGFMHLELIPMLVGFFTYKIATFFQAIEEAISITTQKPES 341

BLAST of HG10001715 vs. ExPASy Swiss-Prot
Match: P12403 (ATP synthase protein I OS=Nostoc sp. (strain PCC 7120 / SAG 25.82 / UTEX 2576) OX=103690 GN=atpI PE=3 SV=2)

HSP 1 Score: 60.5 bits (145), Expect = 4.3e-08
Identity = 34/125 (27.20%), Positives = 69/125 (55.20%), Query Frame = 0

Query: 198 EQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAASFGAGLIGSLVYIRMLGSSVDSLAD 257
           +++++L +++ ++TL + GV   S ++ YS  +A ++  G    +VY+RML   V+ L  
Sbjct: 2   QEFYQLYQELVLITLVLTGVVFISVWIFYSLNIALNYLLGACTGVVYLRMLAKDVERL-- 61

Query: 258 GAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDYGVMHLQLIPMLVGFFTYKVATFVQ 317
              G  K ++++ RL + + L+++ +RWN           LQ++P+ +GF TYK    + 
Sbjct: 62  ---GREKQSLSKTRLALLMALILLASRWN----------QLQIMPIFLGFLTYKATLIIY 111

Query: 318 ALEEA 323
            +  A
Sbjct: 122 VVRVA 111

BLAST of HG10001715 vs. ExPASy Swiss-Prot
Match: P08443 (ATP synthase protein I OS=Synechococcus sp. (strain ATCC 27144 / PCC 6301 / SAUG 1402/1) OX=269084 GN=atpI PE=3 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 8.0e-07
Identity = 37/130 (28.46%), Positives = 68/130 (52.31%), Query Frame = 0

Query: 199 QYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAASFGAGLIGSLVYIRMLGSSVDSLADG 258
           +Y+ L++++  +TL    V   + + +YS   AAS+  G +G L+Y+RMLG +V+ + + 
Sbjct: 3   EYYALQRQLLQVTLICTVVIFGAVWWAYSLNTAASYLLGAMGGLLYLRMLGKAVERIGER 62

Query: 259 AKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDYGVMHLQLIPMLVGFFTYKVATFVQA 318
            +   K      RL + V+L+++  RW           +L+L+P+ +GF TYK A     
Sbjct: 63  RRQFGKS-----RLALFVVLIVLAARW----------QYLELMPVFLGFLTYKAALIWYT 117

Query: 319 LEEALTVVKN 329
           L   +   +N
Sbjct: 123 LRAVIPTAEN 117

BLAST of HG10001715 vs. ExPASy Swiss-Prot
Match: Q05376 (ATP synthase protein I OS=Synechococcus sp. (strain PCC 6716) OX=32048 GN=atpI PE=3 SV=2)

HSP 1 Score: 50.1 bits (118), Expect = 5.8e-05
Identity = 34/125 (27.20%), Positives = 62/125 (49.60%), Query Frame = 0

Query: 199 QYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAASFGAGLIGSLVYIRMLGSSVDSLADG 258
           ++++L +++   +L +  +   + +V Y    A ++  G   SL+Y+R+L  +V+ L   
Sbjct: 3   EFYQLCRELFTTSLVLMAIAFGTVWVIYDLNTALNYLLGASASLIYLRLLARNVERLGHD 62

Query: 259 AKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDYGVMHLQLIPMLVGFFTYKVATFVQA 318
            K L K      +LLV V ++++  RW+           L +IP+ +GF TYK A  V  
Sbjct: 63  QKKLGK-----TQLLVVVAVIILAARWH----------ELHIIPVFLGFLTYKAAILVYM 112

Query: 319 LEEAL 324
           L   L
Sbjct: 123 LRTVL 112

BLAST of HG10001715 vs. ExPASy TrEMBL
Match: A0A6J1I585 (uncharacterized protein LOC111469729 OS=Cucurbita maxima OX=3661 GN=LOC111469729 PE=4 SV=1)

HSP 1 Score: 595.5 bits (1534), Expect = 1.4e-166
Identity = 303/339 (89.38%), Positives = 317/339 (93.51%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MA+LNYISATSTPISQDSSI PP+PDPRQTKVILPKKKPEKWSTGVSPG+YGGPPTTTKL
Sbjct: 1   MAILNYISATSTPISQDSSIPPPLPDPRQTKVILPKKKPEKWSTGVSPGDYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNRVM 120
           RKYWGGEKDDPLTSDDYIWNREFM RMKK +KDQP DD S+ VNKAKDEPSGFLSLNRVM
Sbjct: 61  RKYWGGEKDDPLTSDDYIWNREFMGRMKKLMKDQP-DDSSVRVNKAKDEPSGFLSLNRVM 120

Query: 121 ALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAATG 180
            LD       +EL+APPMP  E+VVE+ I +DNRKSPRW+LAPTRREQEKWDRAYKAATG
Sbjct: 121 TLDSLEVDLSKELMAPPMPLKENVVEEKIQVDNRKSPRWRLAPTRREQEKWDRAYKAATG 180

Query: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAAS 240
           GSDVMFRELRRPQGDPEVLAALSREQYFKLKKK+Q LTLAIGGVGL SAYVSYSPEVAAS
Sbjct: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKLQTLTLAIGGVGLVSAYVSYSPEVAAS 240

Query: 241 FGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300
           FGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY
Sbjct: 241 FGAGLIGSLVYVRMLGSSVDSLADGARGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300

Query: 301 GVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           GVM LQLIPMLVGFFTYKVATFVQALEEALTV KNEPQA
Sbjct: 301 GVMQLQLIPMLVGFFTYKVATFVQALEEALTVTKNEPQA 338

BLAST of HG10001715 vs. ExPASy TrEMBL
Match: A0A1S3BK31 (uncharacterized protein LOC103490489 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103490489 PE=4 SV=1)

HSP 1 Score: 593.6 bits (1529), Expect = 5.2e-166
Identity = 306/340 (90.00%), Positives = 318/340 (93.53%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTG++PG+YGGPPTTTKL
Sbjct: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGIAPGDYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDE-PSGFLSLNRV 120
           RKYWGGEKDDPLTSDDYIWNREFM RMKKFVKDQP DDLS TV K KD+ PSGFLSLNRV
Sbjct: 61  RKYWGGEKDDPLTSDDYIWNREFMGRMKKFVKDQP-DDLSHTVYKPKDDKPSGFLSLNRV 120

Query: 121 MALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAAT 180
           M LD       +EL  PPMPRSED+VEKNIPI +RKSPRWKLAPTR EQEKWDRAYKAAT
Sbjct: 121 MTLDSLDVDLSKELSPPPMPRSEDLVEKNIPIGHRKSPRWKLAPTRHEQEKWDRAYKAAT 180

Query: 181 GGSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAA 240
           GGSDVMF+ELRRPQGDPE LAALS EQYFKLKKKMQILTLAIGGVGL SAYVSYSPEVAA
Sbjct: 181 GGSDVMFQELRRPQGDPEALAALSMEQYFKLKKKMQILTLAIGGVGLISAYVSYSPEVAA 240

Query: 241 SFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVED 300
           SFGAGLIGSLVYIRMLG+SVDSLADGAKGLVKGAVAQPRLLVPVILVM+YNRWNGILVED
Sbjct: 241 SFGAGLIGSLVYIRMLGNSVDSLADGAKGLVKGAVAQPRLLVPVILVMIYNRWNGILVED 300

Query: 301 YGVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           YGVM LQLIPMLVGFFTYKVATFVQA+EEALTVVKN+PQA
Sbjct: 301 YGVMQLQLIPMLVGFFTYKVATFVQAIEEALTVVKNKPQA 339

BLAST of HG10001715 vs. ExPASy TrEMBL
Match: A0A6J1G5W4 (uncharacterized protein LOC111451157 OS=Cucurbita moschata OX=3662 GN=LOC111451157 PE=4 SV=1)

HSP 1 Score: 589.7 bits (1519), Expect = 7.5e-165
Identity = 300/339 (88.50%), Positives = 315/339 (92.92%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MA+L+YISATSTPISQDSSI PP+PDPRQTKVILPKKKPEKWSTGVSPG+YGGPPTTTKL
Sbjct: 1   MAILHYISATSTPISQDSSIPPPLPDPRQTKVILPKKKPEKWSTGVSPGDYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNRVM 120
           RKYWGGEKDDPLTSDDYIWNREFM RMKK +KDQP DD S+ VN AKDEPSGFLSLNRVM
Sbjct: 61  RKYWGGEKDDPLTSDDYIWNREFMGRMKKLMKDQP-DDSSVRVNNAKDEPSGFLSLNRVM 120

Query: 121 ALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAATG 180
            LD       +EL+APPMP  E+VVE+ I +DNRKSPRW+LAPTRREQEKWDRAYKAATG
Sbjct: 121 TLDSLEVDLSKELMAPPMPLKENVVEEKIQVDNRKSPRWRLAPTRREQEKWDRAYKAATG 180

Query: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAAS 240
           GSDVMFRELRRPQGDPEVLAALSREQYFKLKKK+Q LTLAIGGVGL SAYVSYSPEVAAS
Sbjct: 181 GSDVMFRELRRPQGDPEVLAALSREQYFKLKKKLQTLTLAIGGVGLVSAYVSYSPEVAAS 240

Query: 241 FGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300
           FGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY
Sbjct: 241 FGAGLIGSLVYVRMLGSSVDSLADGARGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDY 300

Query: 301 GVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           GVM LQLIPMLVGFFTYKVATFVQALEEALTV KNEP A
Sbjct: 301 GVMQLQLIPMLVGFFTYKVATFVQALEEALTVTKNEPHA 338

BLAST of HG10001715 vs. ExPASy TrEMBL
Match: A0A0A0KC26 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G242200 PE=4 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 1.3e-164
Identity = 306/340 (90.00%), Positives = 321/340 (94.41%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MAVLNYISA S+PISQDSSISPPIPDPRQTKVILPKKKPEKWSTG++PG+YGGPPTTTKL
Sbjct: 1   MAVLNYISAASSPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGIAPGDYGGPPTTTKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDE-PSGFLSLNRV 120
           RKYWGGEKDDPLTSDDYIWNREFMARMKKFVK QP DDLSLTVNK KD+ PSGFLSLNRV
Sbjct: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKYQP-DDLSLTVNKPKDDKPSGFLSLNRV 120

Query: 121 MALD-------RELLAPPMPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAAT 180
           M LD       +EL APPMPRSED+VEKNIPID+RKSPRWKLAPTRREQEKWDRAY+AAT
Sbjct: 121 MTLDSLDVDLSKELSAPPMPRSEDLVEKNIPIDHRKSPRWKLAPTRREQEKWDRAYEAAT 180

Query: 181 GGSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAA 240
           GGSDVMFRELRRPQG+PEVLAALS EQY KLKKKMQILTLAIGGVGL SAYVSYSPEV+A
Sbjct: 181 GGSDVMFRELRRPQGNPEVLAALSMEQYVKLKKKMQILTLAIGGVGLISAYVSYSPEVSA 240

Query: 241 SFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVED 300
           SFGAGLIGSLVYIRMLG+SVDSLADGAKGLVKGAVAQPRLLVPVILVM+YNRWNGILVED
Sbjct: 241 SFGAGLIGSLVYIRMLGNSVDSLADGAKGLVKGAVAQPRLLVPVILVMIYNRWNGILVED 300

Query: 301 YGVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           YGV+ LQLIPMLVGFFTYKVATFVQA+EEALTVVK EPQA
Sbjct: 301 YGVVQLQLIPMLVGFFTYKVATFVQAIEEALTVVK-EPQA 338

BLAST of HG10001715 vs. ExPASy TrEMBL
Match: A0A6J1D6M3 (uncharacterized protein LOC111018112 OS=Momordica charantia OX=3673 GN=LOC111018112 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 1.5e-160
Identity = 296/340 (87.06%), Positives = 313/340 (92.06%), Query Frame = 0

Query: 1   MAVLNYISATSTPISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTTKL 60
           MAVLNYISATSTPI QDSSI+PPIP PRQTK+ILPKKKPEKWSTGVSPGEYGGPPT TKL
Sbjct: 1   MAVLNYISATSTPIPQDSSITPPIPGPRQTKIILPKKKPEKWSTGVSPGEYGGPPTATKL 60

Query: 61  RKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNRVM 120
           RKYWGGEK+DPLTSDDYIWNREFM RMKK ++DQP+D  S+  NKAKDEPSGFLSLNRVM
Sbjct: 61  RKYWGGEKEDPLTSDDYIWNREFMGRMKKLIQDQPSDS-SVQPNKAKDEPSGFLSLNRVM 120

Query: 121 ALD-------RELLAPP-MPRSEDVVEKNIPIDNRKSPRWKLAPTRREQEKWDRAYKAAT 180
            LD       +EL+APP MPRSE +VE+NI ID  KSPRWKLAPTRREQEKWDRA KAAT
Sbjct: 121 TLDSLEVDLSKELMAPPSMPRSEKLVEENIQIDKHKSPRWKLAPTRREQEKWDRANKAAT 180

Query: 181 GGSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVSYSPEVAA 240
           GGSDVMFRELRRP+GDPEVLA+L REQYFKLK KM+ILTLAIGGVGLFSAYVSYSPEVAA
Sbjct: 181 GGSDVMFRELRRPRGDPEVLASLYREQYFKLKNKMEILTLAIGGVGLFSAYVSYSPEVAA 240

Query: 241 SFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVED 300
           SFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGA+AQPRLLVPVILVMVYNRWNGILVED
Sbjct: 241 SFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAIAQPRLLVPVILVMVYNRWNGILVED 300

Query: 301 YGVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           YGVM LQLIPMLVGFFTYKVATFVQALEEALTV K+EPQ+
Sbjct: 301 YGVMQLQLIPMLVGFFTYKVATFVQALEEALTVTKDEPQS 339

BLAST of HG10001715 vs. TAIR 10
Match: AT2G31040.1 (ATP synthase protein I -related )

HSP 1 Score: 432.2 bits (1110), Expect = 3.8e-121
Identity = 223/347 (64.27%), Positives = 272/347 (78.39%), Query Frame = 0

Query: 1   MAVLNYISATST--PISQDSSISPPIPDPRQTKVILPKKKPEKWSTGVSPGEYGGPPTTT 60
           MA+L+YISATST  PI QD S +  +P    TK+ILP KKPEKWSTGV+PGEYGGPPTTT
Sbjct: 1   MAILSYISATSTTPPIPQDQSPNSRLP----TKIILPNKKPEKWSTGVAPGEYGGPPTTT 60

Query: 61  KLRKYWGGEKDDPLTSDDYIWNREFMARMKKFVKDQPADDLSLTVNKAKDEPSGFLSLNR 120
           KLRKYWGGEK+DP+TS D IWNR+FM +MKK   D   +D SL  + +K++ SGFLS +R
Sbjct: 61  KLRKYWGGEKEDPITSTDLIWNRDFMDQMKKLFDD--PNDSSLDPSPSKEKSSGFLSFSR 120

Query: 121 VMALDR---ELLAPPMPRSEDVVEKNIPIDNRK----------SPRWKLAPTRREQEKWD 180
           VM+LD    +L       S+ VV+  +     +          SP+WKLAPTRREQEKWD
Sbjct: 121 VMSLDSMDVDLSKELASSSKSVVKNRLDTSKSEAKKQMSKAIVSPKWKLAPTRREQEKWD 180

Query: 181 RAYKAATGGSDVMFRELRRPQGDPEVLAALSREQYFKLKKKMQILTLAIGGVGLFSAYVS 240
           RA KAATGGSDVMFRELRRP+GDPEV AA  REQYFKLK K+Q+LTL IGGVGL SAY+S
Sbjct: 181 RATKAATGGSDVMFRELRRPRGDPEVQAAKDREQYFKLKNKIQVLTLGIGGVGLVSAYIS 240

Query: 241 YSPEVAASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRW 300
           Y+PE+A SFGAGL+GSL Y+RMLG+SVD++ADGA+G+ KGA  QPRLLVPV+LVM++NRW
Sbjct: 241 YTPEIALSFGAGLLGSLAYMRMLGNSVDAMADGARGVAKGAANQPRLLVPVVLVMIFNRW 300

Query: 301 NGILVEDYGVMHLQLIPMLVGFFTYKVATFVQALEEALTVVKNEPQA 333
           N ILV +YG MHL+LIPMLVGFFTYK+ATF QA+EEA+++   +P++
Sbjct: 301 NAILVPEYGFMHLELIPMLVGFFTYKIATFFQAIEEAISITTQKPES 341

BLAST of HG10001715 vs. TAIR 10
Match: AT5G22340.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 58 Blast hits to 58 proteins in 20 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 57; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 42.7 bits (99), Expect = 6.5e-04
Identity = 51/207 (24.64%), Positives = 88/207 (42.51%), Query Frame = 0

Query: 145 DNRKSPRWKLAPTRREQEKWDRAYKAATGGSDVMFRELRRPQGDPEVLAALSREQYFKLK 204
           D+ +S   KL PT +E   W+    +A      + + LR    D E     +  +Y  LK
Sbjct: 116 DDDESGFLKLKPT-QEWIGWES--DSAPMNKKALAKALR---DDSERRKKFNFLKYEALK 175

Query: 205 KKMQILTLAIGGVGLFSAYVSYSPEVAASFGAGLIGSLVYIRML---------------- 264
           +++  L++ IG        ++ S + A S+  G++ S +Y+++L                
Sbjct: 176 RELMYLSIVIGTGCSGYCLLALSVQAAVSYAVGVLFSCLYLQLLYGYADGLSREAVPDIF 235

Query: 265 --------GSSVDSLADGAKGLVKG---AVAQPRLLVPVILVMVYNRWNGILVEDY---G 322
                   G   + L D     ++G   A++ PRL++P     +Y  W  IL   Y    
Sbjct: 236 LKKKSKKIGIRSEDLEDFVVRTIRGSGMALSSPRLVIP---AAIYGLW--ILSHKYFQND 295

The following BLAST results are available for this feature:
Match Name       E-value    Identity (%)  Description
XP_038901394.1   3.7e-174   94.40         protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic [Benincasa hispid...
KAG6604770.1     2.1e-166   89.38         Protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic, partial [Cucurbi...
XP_022970893.1   2.8e-166   89.38         uncharacterized protein LOC111469729 [Cucurbita maxima]
XP_008448233.1   1.1e-165   90.00         PREDICTED: uncharacterized protein LOC103490489 isoform X3 [Cucumis melo]
XP_022947232.1   1.5e-164   88.50         uncharacterized protein LOC111451157 [Cucurbita moschata] >XP_022947233.1 unchar...

Match Name       E-value    Identity (%)  Description
O82279           5.4e-120   64.27         Protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic OS=Arabidopsis th...
P12403           4.3e-08    27.20         ATP synthase protein I OS=Nostoc sp. (strain PCC 7120 / SAG 25.82 / UTEX 2576) O...
P08443           8.0e-07    28.46         ATP synthase protein I OS=Synechococcus sp. (strain ATCC 27144 / PCC 6301 / SAUG...
Q05376           5.8e-05    27.20         ATP synthase protein I OS=Synechococcus sp. (strain PCC 6716) OX=32048 GN=atpI P...

Match Name       E-value    Identity (%)  Description
A0A6J1I585       1.4e-166   89.38         uncharacterized protein LOC111469729 OS=Cucurbita maxima OX=3661 GN=LOC111469729...
A0A1S3BK31       5.2e-166   90.00         uncharacterized protein LOC103490489 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1G5W4       7.5e-165   88.50         uncharacterized protein LOC111451157 OS=Cucurbita moschata OX=3662 GN=LOC1114511...
A0A0A0KC26       1.3e-164   90.00         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G242200 PE=4 SV=1
A0A6J1D6M3       1.5e-160   87.06         uncharacterized protein LOC111018112 OS=Momordica charantia OX=3673 GN=LOC111018...

Match Name       E-value    Identity (%)  Description
AT2G31040.1      3.8e-121   64.27         ATP synthase protein I -related
AT5G22340.1      6.5e-04    24.64         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term    Source Description                         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 10..25
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 10..54
None      No IPR available  PANTHER      PTHR34118      NF-KAPPA-B INHIBITOR-LIKE PROTEIN-RELATED  coord: 5..328
None      No IPR available  PANTHER      PTHR34118:SF4  SUBFAMILY NOT NAMED                        coord: 5..328

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10001715.1  HG10001715.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane