Cp4.1LG10g02140 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG10g02140
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: N-acetyltransferase domain-containing protein
Location: Cp4.1LG10: 2803720 .. 2805977 (+)
RNA-Seq Expression: Cp4.1LG10g02140
Synteny: Cp4.1LG10g02140
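
The Location field packs chromosome, start, end, and strand into one string. A minimal sketch of splitting it apart follows. The helper name `parse_location` is hypothetical, the field layout is inferred from this one record, and the span calculation assumes 1-based inclusive coordinates.

```python
import re

# Location string copied from the Overview; the layout
# "<chromosome>: <start> .. <end> (<strand>)" is assumed from this record only.
LOCATION = "Cp4.1LG10: 2803720 .. 2805977 (+)"

def parse_location(text):
    """Split a location string into chromosome, start, end and strand."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location(LOCATION)
# Assuming 1-based inclusive coordinates, the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under that coordinate assumption the gene spans 2258 bases on the plus strand.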
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATAAGAAACAGAGATTCCATAGTGATTAGAGAGTTCGATGCGAGTAAAGATAGTAAGAGCGTCGAAGATGTGGAGCGACGGTGCGAAGTCGGACCTAGCGGCAAGCTTTGTCTCTTCACCGACCTTTTAGGTGACCCAATTTGCAGGGTCCGCAACTCCCCTGCTTTCCTCATGCTGGTATGGTATTCTTTTTGTTTCTTTTCTATTCTGTTCCTTTGGTATTTGTATTCTGTTCTGTTTCTGTGTGTGTGTGTGTGTGTGTGTGTAGGTTATAACAAAATCATTAAATTCTAAAGTTGTTGGTATTTGGTGTTGGTTATGGTTTTGTGTCTCACACTCATTATGAGCGTTTGTTGGCGTTTCAACAGGTGGCGGCAACGGGCGAAGAGAACGAGATAGTGGGGATGATTAGAGGGTGCATCAAAACCGTTACATGCGGTCAAAAACTCTCCCGCGCTCCTCCTAATCCCAAAACCCACCACCATCATCACTCTACCAACCACCACCTCCCTGTTTTCACTAAACTCGCCTATATCCTTGGCCTTCGCGTCTCTCCTGCTCACCGGTCTGTATTTATTTATTTATTTATTTATTTATTTATTATTATTATTTTTTTTGCTACATAATAATAATTATTATAATACGTGACATGTTTGATTTCAGACGGATGGGAATTGGGATGAAGCTGGTGAAGAAAATGGAGGAATGGTTCAGAGAGAATGGCGCTGAGTATTCCTACATAGCCACCGAAAAGGACAACGTAGCGTCGGTGAATCTGTTCACTGAAAAATGCGACTACTCTAAATTCCGTACACCGGCCATCCTTGTCAATCCCGTCTTTGCTCATCCACTCCCCGTCTCCCAACGGGTCACTCTCCTCCCACTTTCACGCGCCGACGCTGAGATTCTCTATCGCCGCCGTTTCTCAACCACTGAGTTTTTCCCCCGAGACATTGACGCTATCCTCAACAACGTACTCACTCTTGGCACCTTCCTCGCCTTTCCACGTGGCATTTACACGCCTCAGACTTGGCCCGGGTCGGACCGGTTCTTGCTCGACCCACCCGAGTCCTGGGCGGTTCTCAGCGTCTGGAACTGCAATGACGTGTTTAAGCTTCAAGTACGTGGAGCGTCACGCTTAAAGCGGAGTCTAGCCAGGACGACACGGGTGTTGGATAGAGCATTTCCGTGGCTCAGGCTGCCGTCGGTGCCGGAGCTTTTCAGACCGTTCGGGTTGCACTTCTTGTACGGGTTGGGCGGTGAAGGATCCGAGGCTGGGAGGATGGTGAAGGCGCTGTGTGGGTACGCACATAACCTGGCGAAGGAGAGAGGTTGTGGGGTGGTANTACATAATAATAATTATTATAATACGTGACATGTTTGATTTCAGACGGATGGGAATTGGGATGAAGCTGGTGAAGAAAATGGAGGAATGGTTCAGAGAGAATGGCGCTGAGTATTCCTACATAGCCACCGAAAAGGACAACGTAGCGTCGGTGAATCTGTTCACTGAAAAATGCGACTACTCTAAATTCCGTACACCGGCCATCCTTGTCAATCCCGTCTTTGCTCATCCACTCCCCGTCTCCCAACGGGTCACTCTCCTCCCACTTTCACGCGCCGACGCTGAGATTCTCTATCGCCGCCGTTTCTCAACCACTGAGTTTTTCCCCCGAGACATTGACGCTATCCTCAACAACGTACTCACTCTTGGCACCTTCCTCGCCTTTCCACGTGGCATTTACACGCCTCAGACTTGGCCCGGGTCGGACCGGTTCTTGCTCGACCCACCCGAGTCCTGGGCGGTTCTCAGCGTCTGGAACTGCAATGACGTGTTTAAGCTTCAAGTACGTGGAGCGTCACGCTTAAAGCGGAGTCTAGCCAGGACGACACGGGTGTTGGATAGAGCATTTCCGTGGCTCAGGCTGCCGTCGGTGCCGGAGCTTTTCAGACCGTTCGGGTTGCACTTCTTGTACGGGCTGGGCGGAGAAGGACCC
GAGGCTGGGAGGATGGTGAAGGCGCTGTGTGGGTACGCACATAACCTGGCGAAGGAGAGAGGTTGTGGGGTTGTCGCGACGGAGGTCTCGGCTAGAGAGCGGCTCAGAGGCGCCATTCCACACTGGAAAATGCTGTCGTGCGAGGAGGATCTTTGGTGCATCAAGCGCCTCGCCGAAGATTTCAGCGACGGCTCCGTGGGTGACTGGACCAAATCACCACCTGGCTTGTCCATTTTCGTCGACCCCAGAGAATTCTAA

mRNA sequence

ATGATAAGAAACAGAGATTCCATAGTGATTAGAGAGTTCGATGCGAGTAAAGATAGTAAGAGCGTCGAAGATGTGGAGCGACGGTGCGAAGTCGGACCTAGCGGCAAGCTTTGTCTCTTCACCGACCTTTTAGGTGACCCAATTTGCAGGGTCCGCAACTCCCCTGCTTTCCTCATGCTGGTGGCGGCAACGGGCGAAGAGAACGAGATAGTGGGGATGATTAGAGGGTGCATCAAAACCGTTACATGCGGTCAAAAACTCTCCCGCGCTCCTCCTAATCCCAAAACCCACCACCATCATCACTCTACCAACCACCACCTCCCTGTTTTCACTAAACTCGCCTATATCCTTGGCCTTCGCGTCTCTCCTGCTCACCGACGGATGGGAATTGGGATGAAGCTGGTGAAGAAAATGGAGGAATGGTTCAGAGAGAATGGCGCTGAGTATTCCTACATAGCCACCGAAAAGGACAACGTAGCGTCGGTGAATCTGTTCACTGAAAAATGCGACTACTCTAAATTCCGTACACCGGCCATCCTTGTCAATCCCGTCTTTGCTCATCCACTCCCCGTCTCCCAACGGGTCACTCTCCTCCCACTTTCACGCGCCGACGCTGAGATTCTCTATCGCCGCCGTTTCTCAACCACTGAGTTTTTCCCCCGAGACATTGACGCTATCCTCAACAACGTACTCACTCTTGGCACCTTCCTCGCCTTTCCACGTGGCATTTACACGCCTCAGACTTGGCCCGGGTCGGACCGGTTCTTGCTCGACCCACCCGAGTCCTGGGCGGTTCTCAGCGTCTGGAACTGCAATGACGTGTTTAAGCTTCAAGTACGTGGAGCGTCACGCTTAAAGCGGAGTCTAGCCAGGACGACACGGGTGTTGGATAGAGCATTTCCGTGGCTCAGGCTGCCGTCGGTGCCGGAGCTTTTCAGACCGTTCGGGTTGCACTTCTTGTACGGGTTGGGCGGTGAAGGATCCGAGGCTGGGAGGATGGTGAAGGCGCTGTGTGGACGGATGGGAATTGGGATGAAGCTGGTGAAGAAAATGGAGGAATGGTTCAGAGAGAATGGCGCTGAGTATTCCTACATAGCCACCGAAAAGGACAACGTAGCGTCGGTGAATCTGTTCACTGAAAAATGCGACTACTCTAAATTCCGTACACCGGCCATCCTTGTCAATCCCGTCTTTGCTCATCCACTCCCCGTCTCCCAACGGGTCACTCTCCTCCCACTTTCACGCGCCGACGCTGAGATTCTCTATCGCCGCCGTTTCTCAACCACTGAGTTTTTCCCCCGAGACATTGACGCTATCCTCAACAACGTACTCACTCTTGGCACCTTCCTCGCCTTTCCACGTGGCATTTACACGCCTCAGACTTGGCCCGGGTCGGACCGGTTCTTGCTCGACCCACCCGAGTCCTGGGCGGTTCTCAGCGTCTGGAACTGCAATGACGTGTTTAAGCTTCAAGTACGTGGAGCGTCACGCTTAAAGCGGAGTCTAGCCAGGACGACACGGGTGTTGGATAGAGCATTTCCGTGGCTCAGGCTGCCGTCGGTGCCGGAGCTTTTCAGACCGTTCGGGTTGCACTTCTTGTACGGGCTGGGCGGAGAAGGACCCGAGGCTGGGAGGATGGTGAAGGCGCTGTGTGGGTACGCACATAACCTGGCGAAGGAGAGAGGTTGTGGGGTTGTCGCGACGGAGGTCTCGGCTAGAGAGCGGCTCAGAGGCGCCATTCCACACTGGAAAATGCTGTCGTGCGAGGAGGATCTTTGGTGCATCAAGCGCCTCGCCGAAGATTTCAGCGACGGCTCCGTGGGTGACTGGACCAAATCACCACCTGGCTTGTCCATTTTCGTCGACCCCAGAGAATTCTAA

Coding sequence (CDS)

ATGATAAGAAACAGAGATTCCATAGTGATTAGAGAGTTCGATGCGAGTAAAGATAGTAAGAGCGTCGAAGATGTGGAGCGACGGTGCGAAGTCGGACCTAGCGGCAAGCTTTGTCTCTTCACCGACCTTTTAGGTGACCCAATTTGCAGGGTCCGCAACTCCCCTGCTTTCCTCATGCTGGTGGCGGCAACGGGCGAAGAGAACGAGATAGTGGGGATGATTAGAGGGTGCATCAAAACCGTTACATGCGGTCAAAAACTCTCCCGCGCTCCTCCTAATCCCAAAACCCACCACCATCATCACTCTACCAACCACCACCTCCCTGTTTTCACTAAACTCGCCTATATCCTTGGCCTTCGCGTCTCTCCTGCTCACCGACGGATGGGAATTGGGATGAAGCTGGTGAAGAAAATGGAGGAATGGTTCAGAGAGAATGGCGCTGAGTATTCCTACATAGCCACCGAAAAGGACAACGTAGCGTCGGTGAATCTGTTCACTGAAAAATGCGACTACTCTAAATTCCGTACACCGGCCATCCTTGTCAATCCCGTCTTTGCTCATCCACTCCCCGTCTCCCAACGGGTCACTCTCCTCCCACTTTCACGCGCCGACGCTGAGATTCTCTATCGCCGCCGTTTCTCAACCACTGAGTTTTTCCCCCGAGACATTGACGCTATCCTCAACAACGTACTCACTCTTGGCACCTTCCTCGCCTTTCCACGTGGCATTTACACGCCTCAGACTTGGCCCGGGTCGGACCGGTTCTTGCTCGACCCACCCGAGTCCTGGGCGGTTCTCAGCGTCTGGAACTGCAATGACGTGTTTAAGCTTCAAGTACGTGGAGCGTCACGCTTAAAGCGGAGTCTAGCCAGGACGACACGGGTGTTGGATAGAGCATTTCCGTGGCTCAGGCTGCCGTCGGTGCCGGAGCTTTTCAGACCGTTCGGGTTGCACTTCTTGTACGGGTTGGGCGGTGAAGGATCCGAGGCTGGGAGGATGGTGAAGGCGCTGTGTGGACGGATGGGAATTGGGATGAAGCTGGTGAAGAAAATGGAGGAATGGTTCAGAGAGAATGGCGCTGAGTATTCCTACATAGCCACCGAAAAGGACAACGTAGCGTCGGTGAATCTGTTCACTGAAAAATGCGACTACTCTAAATTCCGTACACCGGCCATCCTTGTCAATCCCGTCTTTGCTCATCCACTCCCCGTCTCCCAACGGGTCACTCTCCTCCCACTTTCACGCGCCGACGCTGAGATTCTCTATCGCCGCCGTTTCTCAACCACTGAGTTTTTCCCCCGAGACATTGACGCTATCCTCAACAACGTACTCACTCTTGGCACCTTCCTCGCCTTTCCACGTGGCATTTACACGCCTCAGACTTGGCCCGGGTCGGACCGGTTCTTGCTCGACCCACCCGAGTCCTGGGCGGTTCTCAGCGTCTGGAACTGCAATGACGTGTTTAAGCTTCAAGTACGTGGAGCGTCACGCTTAAAGCGGAGTCTAGCCAGGACGACACGGGTGTTGGATAGAGCATTTCCGTGGCTCAGGCTGCCGTCGGTGCCGGAGCTTTTCAGACCGTTCGGGTTGCACTTCTTGTACGGGCTGGGCGGAGAAGGACCCGAGGCTGGGAGGATGGTGAAGGCGCTGTGTGGGTACGCACATAACCTGGCGAAGGAGAGAGGTTGTGGGGTTGTCGCGACGGAGGTCTCGGCTAGAGAGCGGCTCAGAGGCGCCATTCCACACTGGAAAATGCTGTCGTGCGAGGAGGATCTTTGGTGCATCAAGCGCCTCGCCGAAGATTTCAGCGACGGCTCCGTGGGTGACTGGACCAAATCACCACCTGGCTTGTCCATTTTCGTCGACCCCAGAGAATTCTAA

Protein sequence

MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLRVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGPEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAEDFSDGSVGDWTKSPPGLSIFVDPREF
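
The protein sequence should follow from the CDS by in-frame translation. The sketch below checks that on the first 30 nucleotides of the CDS only. It assumes the standard genetic code (NCBI translation table 1); the function name `translate` and the use of `X` for codons with ambiguous bases are illustrative choices, not part of this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X: ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above (a prefix, not the full CDS).
cds_prefix = "ATGATAAGAAACAGAGATTCCATAGTGATT"
print(translate(cds_prefix))  # MIRNRDSIVI
```

The ten residues produced match the start of the protein sequence above. Passing the full CDS to the same function would extend the comparison to the whole record.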
Homology
BLAST of Cp4.1LG10g02140 vs. ExPASy Swiss-Prot
Match: Q42381 (Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 SV=1)

HSP 1 Score: 513.8 bits (1322), Expect = 2.7e-144
Identity = 291/620 (46.94%), Positives = 341/620 (55.00%), Query Frame = 0

Query: 9   VIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATG-EE 68
           V+RE+D ++D   VEDVERRCEVGPSGKL LFTDLLGDPICR+R+SP++LMLVA  G E+
Sbjct: 3   VVREYDPTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGTEK 62

Query: 69  NEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHL-PVFTKLAYILGLRVSPAHR 128
            EIVGMIRGCIKTVTCGQKL           +H S N  + P++TKLAY+LGLRVSP HR
Sbjct: 63  KEIVGMIRGCIKTVTCGQKLDL---------NHKSQNDVVKPLYTKLAYVLGLRVSPFHR 122

Query: 129 RMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFA 188
           R GIG KLVK MEEWFR+NGAEYSYIATE DN ASVNLFT KC YS+FRTP+ILVNPV+A
Sbjct: 123 RQGIGFKLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYA 182

Query: 189 HPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTP 248
           H + VS+RVT++ L   DAE LYR RFSTTEFFPRDID++LNN L+LGTF+A PRG    
Sbjct: 183 HRVNVSRRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRG---- 242

Query: 249 QTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLP 308
                                  +C                                   
Sbjct: 243 -----------------------SC----------------------------------- 302

Query: 309 SVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGAEYSYIA 368
                         YG  G GS                                      
Sbjct: 303 --------------YG-SGSGS-------------------------------------- 362

Query: 369 TEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRF 428
                                                                       
Sbjct: 363 ------------------------------------------------------------ 403

Query: 429 STTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCND 488
                                              WPGS +FL  PPESWAVLSVWNC D
Sbjct: 423 -----------------------------------WPGSAKFLEYPPESWAVLSVWNCKD 403

Query: 489 VFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGPEAGRM 548
            F L+VRGASRL+R +A+TTRV+D+  P+L+LPS+P +F PFGLHF+YG+GGEGP A +M
Sbjct: 483 SFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIPSVFEPFGLHFMYGIGGEGPRAVKM 403

Query: 549 VKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAEDFSDGS 608
           VK+LC +AHNLAK  GCGVVA EV+  + LR  IPHWK+LSC+EDLWCIKRL +D+SDG 
Sbjct: 543 VKSLCAHAHNLAKAGGCGVVAAEVAGEDPLRRGIPHWKVLSCDEDLWCIKRLGDDYSDGV 403

Query: 609 VGDWTKSPPGLSIFVDPREF 627
           VGDWTKSPPG+SIFVDPREF
Sbjct: 603 VGDWTKSPPGVSIFVDPREF 403
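
Each HSP summary reports identities and positives as a count over a length, followed by a percentage. As a small consistency check, the sketch below recomputes those percentages from the fractions reported for the Q42381 hit. The function name `check_percentages` is hypothetical, and treating the denominator as the alignment length is an assumption.

```python
import re

# Summary values copied from the Q42381 hit above.
STATS = "Identity = 291/620 (46.94%), Positives = 341/620 (55.00%)"

def check_percentages(line):
    """Return (label, count, length, reported %, recomputed %) per field."""
    rows = []
    for label, hits, length, reported in re.findall(
        r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line
    ):
        recomputed = round(100 * int(hits) / int(length), 2)
        rows.append((label, int(hits), int(length), float(reported), recomputed))
    return rows

for row in check_percentages(STATS):
    print(row)
```

For this hit the recomputed values agree with the reported 46.94% and 55.00%.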

BLAST of Cp4.1LG10g02140 vs. ExPASy Swiss-Prot
Match: O64815 (Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23060 PE=2 SV=1)

HSP 1 Score: 506.5 bits (1303), Expect = 4.2e-142
Identity = 284/624 (45.51%), Positives = 341/624 (54.65%), Query Frame = 0

Query: 8   IVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATG-- 67
           + +RE+D SKD  +VEDVERRCEVGP+GKL LFTDLLGDPICRVR+SP++LMLVA  G  
Sbjct: 5   VEVREYDPSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPK 64

Query: 68  EENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHL---PVFTKLAYILGLRVS 127
           E+ E+VGMIRGCIKTVTCG    R         H+ S N  +   P++TKLAYILGLRVS
Sbjct: 65  EKKELVGMIRGCIKTVTCGITTKRLDLT-----HNKSQNDVVITKPLYTKLAYILGLRVS 124

Query: 128 PAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVN 187
           P HRR GIG KLVK ME+WF +NGAEYSY ATE DN ASVNLFT KC Y++FRTP+ILVN
Sbjct: 125 PTHRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVN 184

Query: 188 PVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRG 247
           PV+AH + +S+RVT++ L  +DAE+LYR RFSTTEFFPRDID++LNN L+LGTF+A PRG
Sbjct: 185 PVYAHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRG 244

Query: 248 IYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPW 307
                                      +C                               
Sbjct: 245 ---------------------------SC------------------------------- 304

Query: 308 LRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGAEY 367
                             YG G                                      
Sbjct: 305 ------------------YGSGS------------------------------------- 364

Query: 368 SYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILY 427
                                                                       
Sbjct: 365 ------------------------------------------------------------ 413

Query: 428 RRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVW 487
                                                ++WPGS +FL  PP+SWAVLSVW
Sbjct: 425 -------------------------------------RSWPGSAKFLEYPPDSWAVLSVW 413

Query: 488 NCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGPE 547
           NC D F+L+VRGASRL+R +++ TR++D+  P+L++PS+P +FRPFGLHF+YG+GGEGP 
Sbjct: 485 NCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFLKIPSIPAVFRPFGLHFMYGIGGEGPR 413

Query: 548 AGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAEDF 607
           A +MVKALC +AHNLAKE GCGVVA EV+  E LR  IPHWK+LSC EDLWCIKRL ED+
Sbjct: 545 AEKMVKALCDHAHNLAKEGGCGVVAAEVAGEEPLRRGIPHWKVLSCAEDLWCIKRLGEDY 413

Query: 608 SDGSVGDWTKSPPGLSIFVDPREF 627
           SDGSVGDWTKSPPG SIFVDPREF
Sbjct: 605 SDGSVGDWTKSPPGDSIFVDPREF 413

BLAST of Cp4.1LG10g02140 vs. NCBI nr
Match: XP_023544685.1 (probable N-acetyltransferase HLS1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 768 bits (1983), Expect = 1.97e-275
Identity = 413/626 (65.97%), Positives = 413/626 (65.97%), Query Frame = 0

Query: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60
           MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML
Sbjct: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60

Query: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR 120
           VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR
Sbjct: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR 120

Query: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180
           VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL
Sbjct: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180

Query: 181 VNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP 240
           VNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP
Sbjct: 181 VNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP 240

Query: 241 RGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF 300
           RGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF
Sbjct: 241 RGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF 300

Query: 301 PWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGA 360
           PWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMV                          
Sbjct: 301 PWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMV-------------------------- 360

Query: 361 EYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEI 420
                                                                       
Sbjct: 361 ------------------------------------------------------------ 413

Query: 421 LYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLS 480
                                                                       
Sbjct: 421 ------------------------------------------------------------ 413

Query: 481 VWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEG 540
                                                                       
Sbjct: 481 ------------------------------------------------------------ 413

Query: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE 600
                  KALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE
Sbjct: 541 -------KALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE 413

Query: 601 DFSDGSVGDWTKSPPGLSIFVDPREF 626
           DFSDGSVGDWTKSPPGLSIFVDPREF
Sbjct: 601 DFSDGSVGDWTKSPPGLSIFVDPREF 413

BLAST of Cp4.1LG10g02140 vs. NCBI nr
Match: KAG6604134.1 (putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 762 bits (1967), Expect = 5.34e-273
Identity = 408/626 (65.18%), Positives = 410/626 (65.50%), Query Frame = 0

Query: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60
           MIRNRDS+VIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML
Sbjct: 1   MIRNRDSMVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60

Query: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR 120
           VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR
Sbjct: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR 120

Query: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180
           VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL
Sbjct: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180

Query: 181 VNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP 240
           VNPVFAHPLPVSQRVT+LPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP
Sbjct: 181 VNPVFAHPLPVSQRVTILPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP 240

Query: 241 RGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF 300
           RG Y PQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF
Sbjct: 241 RGTYLPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF 300

Query: 301 PWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGA 360
           PWLRLPSVPELFRPFGLHFLYGL                                     
Sbjct: 301 PWLRLPSVPELFRPFGLHFLYGL------------------------------------- 360

Query: 361 EYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEI 420
                                                                       
Sbjct: 361 ------------------------------------------------------------ 413

Query: 421 LYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLS 480
                                                                       
Sbjct: 421 ------------------------------------------------------------ 413

Query: 481 VWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEG 540
                                                                   GGEG
Sbjct: 481 --------------------------------------------------------GGEG 413

Query: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE 600
           PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRL E
Sbjct: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLTE 413

Query: 601 DFSDGSVGDWTKSPPGLSIFVDPREF 626
           DFSDGSVGDWTKSPPGLSIFVDPREF
Sbjct: 601 DFSDGSVGDWTKSPPGLSIFVDPREF 413

BLAST of Cp4.1LG10g02140 vs. NCBI nr
Match: XP_022950471.1 (probable N-acetyltransferase HLS1 [Cucurbita moschata])

HSP 1 Score: 757 bits (1955), Expect = 3.71e-271
Identity = 409/627 (65.23%), Positives = 410/627 (65.39%), Query Frame = 0

Query: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60
           MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML
Sbjct: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60

Query: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHH-STNHHLPVFTKLAYILGL 120
           VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHH STNHHLPVFTKLAYILGL
Sbjct: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHHSTNHHLPVFTKLAYILGL 120

Query: 121 RVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAI 180
           RVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAI
Sbjct: 121 RVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAI 180

Query: 181 LVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAF 240
           LVNPVFAHPLPVSQRVT+LPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAF
Sbjct: 181 LVNPVFAHPLPVSQRVTILPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAF 240

Query: 241 PRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRA 300
           PRG Y PQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRA
Sbjct: 241 PRGTYLPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRA 300

Query: 301 FPWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENG 360
           FPWLRLPSVPELFRPFGLHFLYGL                                    
Sbjct: 301 FPWLRLPSVPELFRPFGLHFLYGL------------------------------------ 360

Query: 361 AEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAE 420
                                                                       
Sbjct: 361 ------------------------------------------------------------ 414

Query: 421 ILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVL 480
                                                                       
Sbjct: 421 ------------------------------------------------------------ 414

Query: 481 SVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGE 540
                                                                    GGE
Sbjct: 481 ---------------------------------------------------------GGE 414

Query: 541 GPEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLA 600
           GPEAG MVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLA
Sbjct: 541 GPEAGGMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLA 414

Query: 601 EDFSDGSVGDWTKSPPGLSIFVDPREF 626
           EDFSDGSVGDWTKSPPGLSIFVDPREF
Sbjct: 601 EDFSDGSVGDWTKSPPGLSIFVDPREF 414

BLAST of Cp4.1LG10g02140 vs. NCBI nr
Match: XP_022977747.1 (probable N-acetyltransferase HLS1 isoform X1 [Cucurbita maxima])

HSP 1 Score: 730 bits (1885), Expect = 1.21e-260
Identity = 398/626 (63.58%), Positives = 401/626 (64.06%), Query Frame = 0

Query: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60
           MIRNRDSIVIREFDASKDSK VEDVERRC VGPSGKLCLFTDLLGDPICRVRNSPAFLML
Sbjct: 1   MIRNRDSIVIREFDASKDSKGVEDVERRCAVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60

Query: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR 120
           VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHH       LPVFTKLAYILGLR
Sbjct: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHH-------LPVFTKLAYILGLR 120

Query: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180
           VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL
Sbjct: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180

Query: 181 VNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP 240
           VNPVFA PLPVSQRVT+LPLSRADAEILYRRRFSTTEFFPRDIDAIL+NVLTLGTFLAFP
Sbjct: 181 VNPVFARPLPVSQRVTILPLSRADAEILYRRRFSTTEFFPRDIDAILDNVLTLGTFLAFP 240

Query: 241 RGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF 300
           RG YTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLD+AF
Sbjct: 241 RGTYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDKAF 300

Query: 301 PWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGA 360
           PWLRLPSVPELFRPFGLHFLYGL                                     
Sbjct: 301 PWLRLPSVPELFRPFGLHFLYGL------------------------------------- 360

Query: 361 EYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEI 420
                                                                       
Sbjct: 361 ------------------------------------------------------------ 406

Query: 421 LYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLS 480
                                                                       
Sbjct: 421 ------------------------------------------------------------ 406

Query: 481 VWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEG 540
                                                                   GGEG
Sbjct: 481 --------------------------------------------------------GGEG 406

Query: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE 600
           PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE
Sbjct: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE 406

Query: 601 DFSDGSVGDWTKSPPGLSIFVDPREF 626
           DFSDGSVGDWTKS PGLSIFVDPREF
Sbjct: 601 DFSDGSVGDWTKSAPGLSIFVDPREF 406

BLAST of Cp4.1LG10g02140 vs. NCBI nr
Match: KAG7034295.1 (putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 722 bits (1863), Expect = 4.21e-257
Identity = 394/626 (62.94%), Positives = 403/626 (64.38%), Query Frame = 0

Query: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60
           MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML
Sbjct: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60

Query: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR 120
           VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR
Sbjct: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR 120

Query: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180
           VSPAHRRMGIGMKLVKKMEEWFRENGA     A   +N  ++            RTP + 
Sbjct: 121 VSPAHRRMGIGMKLVKKMEEWFRENGA-----ADHPNNQDAI------------RTPIV- 180

Query: 181 VNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP 240
                                                   RD+  + N+  T        
Sbjct: 181 ----------------------------------------RDVILVSNSQAT-------- 240

Query: 241 RGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF 300
                                                                       
Sbjct: 241 ------------------------------------------------------------ 300

Query: 301 PWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGA 360
                                                                       
Sbjct: 301 ------------------------------------------------------------ 360

Query: 361 EYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEI 420
                                 +YSKFRTPAILVNPVFAHPLPVSQRVT+LPLSRADAEI
Sbjct: 361 ----------------------NYSKFRTPAILVNPVFAHPLPVSQRVTILPLSRADAEI 418

Query: 421 LYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLS 480
           LYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRG Y PQTW GSDRFLLDPPESWAVLS
Sbjct: 421 LYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGTYLPQTWAGSDRFLLDPPESWAVLS 418

Query: 481 VWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEG 540
           VWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEG
Sbjct: 481 VWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEG 418

Query: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE 600
           PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRL E
Sbjct: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLTE 418

Query: 601 DFSDGSVGDWTKSPPGLSIFVDPREF 626
           DFSDGSVGDWTKSPPGLSIFVDPREF
Sbjct: 601 DFSDGSVGDWTKSPPGLSIFVDPREF 418

BLAST of Cp4.1LG10g02140 vs. ExPASy TrEMBL
Match: A0A6J1GEX3 (probable N-acetyltransferase HLS1 OS=Cucurbita moschata OX=3662 GN=LOC111453566 PE=4 SV=1)

HSP 1 Score: 757 bits (1955), Expect = 1.80e-271
Identity = 409/627 (65.23%), Positives = 410/627 (65.39%), Query Frame = 0

Query: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60
           MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML
Sbjct: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60

Query: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHH-STNHHLPVFTKLAYILGL 120
           VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHH STNHHLPVFTKLAYILGL
Sbjct: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHHSTNHHLPVFTKLAYILGL 120

Query: 121 RVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAI 180
           RVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAI
Sbjct: 121 RVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAI 180

Query: 181 LVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAF 240
           LVNPVFAHPLPVSQRVT+LPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAF
Sbjct: 181 LVNPVFAHPLPVSQRVTILPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAF 240

Query: 241 PRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRA 300
           PRG Y PQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRA
Sbjct: 241 PRGTYLPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRA 300

Query: 301 FPWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENG 360
           FPWLRLPSVPELFRPFGLHFLYGL                                    
Sbjct: 301 FPWLRLPSVPELFRPFGLHFLYGL------------------------------------ 360

Query: 361 AEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAE 420
                                                                       
Sbjct: 361 ------------------------------------------------------------ 414

Query: 421 ILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVL 480
                                                                       
Sbjct: 421 ------------------------------------------------------------ 414

Query: 481 SVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGE 540
                                                                    GGE
Sbjct: 481 ---------------------------------------------------------GGE 414

Query: 541 GPEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLA 600
           GPEAG MVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLA
Sbjct: 541 GPEAGGMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLA 414

Query: 601 EDFSDGSVGDWTKSPPGLSIFVDPREF 626
           EDFSDGSVGDWTKSPPGLSIFVDPREF
Sbjct: 601 EDFSDGSVGDWTKSPPGLSIFVDPREF 414

BLAST of Cp4.1LG10g02140 vs. ExPASy TrEMBL
Match: A0A6J1IJC0 (probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111477959 PE=4 SV=1)

HSP 1 Score: 730 bits (1885), Expect = 5.88e-261
Identity = 398/626 (63.58%), Positives = 401/626 (64.06%), Query Frame = 0

Query: 1   MIRNRDSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60
           MIRNRDSIVIREFDASKDSK VEDVERRC VGPSGKLCLFTDLLGDPICRVRNSPAFLML
Sbjct: 1   MIRNRDSIVIREFDASKDSKGVEDVERRCAVGPSGKLCLFTDLLGDPICRVRNSPAFLML 60

Query: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLR 120
           VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHH       LPVFTKLAYILGLR
Sbjct: 61  VAATGEENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHH-------LPVFTKLAYILGLR 120

Query: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180
           VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL
Sbjct: 121 VSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAIL 180

Query: 181 VNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFP 240
           VNPVFA PLPVSQRVT+LPLSRADAEILYRRRFSTTEFFPRDIDAIL+NVLTLGTFLAFP
Sbjct: 181 VNPVFARPLPVSQRVTILPLSRADAEILYRRRFSTTEFFPRDIDAILDNVLTLGTFLAFP 240

Query: 241 RGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAF 300
           RG YTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLD+AF
Sbjct: 241 RGTYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDKAF 300

Query: 301 PWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGA 360
           PWLRLPSVPELFRPFGLHFLYGL                                     
Sbjct: 301 PWLRLPSVPELFRPFGLHFLYGL------------------------------------- 360

Query: 361 EYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEI 420
                                                                       
Sbjct: 361 ------------------------------------------------------------ 406

Query: 421 LYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLS 480
                                                                       
Sbjct: 421 ------------------------------------------------------------ 406

Query: 481 VWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEG 540
                                                                   GGEG
Sbjct: 481 --------------------------------------------------------GGEG 406

Query: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE 600
           PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE
Sbjct: 541 PEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAE 406

Query: 601 DFSDGSVGDWTKSPPGLSIFVDPREF 626
           DFSDGSVGDWTKS PGLSIFVDPREF
Sbjct: 601 DFSDGSVGDWTKSAPGLSIFVDPREF 406

BLAST of Cp4.1LG10g02140 vs. ExPASy TrEMBL
Match: A0A0A0KLH9 (N-acetyltransferase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G495060 PE=4 SV=1)

HSP 1 Score: 654 bits (1688), Expect = 4.57e-231
Identity = 357/620 (57.58%), Positives = 379/620 (61.13%), Query Frame = 0

Query: 8   IVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATGEE 67
           IVIREFD SKD  +VEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAAT ++
Sbjct: 2   IVIREFDPSKDCIAVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATADQ 61

Query: 68  NEIVGMIRGCIKTVTCGQKLSR-APPNPKTHHHHHSTNHHLPVFTKLAYILGLRVSPAHR 127
           NEIVGMIRGCIKTVTCGQKLSR A PN       H    HLPV+TKLAYILGLRVSPAHR
Sbjct: 62  NEIVGMIRGCIKTVTCGQKLSRSAIPNSD-----HQPPKHLPVYTKLAYILGLRVSPAHR 121

Query: 128 RMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFA 187
           RMGIG+KLVKKMEEWFRE+GAEYSYIATEKDNVASVNLFTEKC+YSKFRTPAILVNPVFA
Sbjct: 122 RMGIGIKLVKKMEEWFRESGAEYSYIATEKDNVASVNLFTEKCEYSKFRTPAILVNPVFA 181

Query: 188 HPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTP 247
           HP+P+S+RVT+LPLSR+DAEILYRRRFSTTEFFPRDIDA+LNN LTLGTFLA PRG YTP
Sbjct: 182 HPVPLSKRVTILPLSRSDAEILYRRRFSTTEFFPRDIDAVLNNPLTLGTFLAIPRGTYTP 241

Query: 248 QTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLP 307
            TWPGSDRFL+DPP+SWAVLSVWNCNDVF+LQVRG SRLKRS ARTTRVLD+AFPWLRLP
Sbjct: 242 HTWPGSDRFLVDPPQSWAVLSVWNCNDVFRLQVRGVSRLKRSFARTTRVLDKAFPWLRLP 301

Query: 308 SVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGAEYSYIA 367
           SVPELF PFGLHF+YGL                                           
Sbjct: 302 SVPELFSPFGLHFMYGL------------------------------------------- 361

Query: 368 TEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRF 427
                                                                       
Sbjct: 362 ------------------------------------------------------------ 403

Query: 428 STTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCND 487
                                                                       
Sbjct: 422 ------------------------------------------------------------ 403

Query: 488 VFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGPEAGRM 547
                                                             GGEGP+A RM
Sbjct: 482 --------------------------------------------------GGEGPDAERM 403

Query: 548 VKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAEDFSDGS 607
           +KALCGYAHNLAKE+GCGVVATEVSA ERLR AIPHWKMLSCEEDLWCIKRL EDFSDGS
Sbjct: 542 LKALCGYAHNLAKEKGCGVVATEVSAGERLRTAIPHWKMLSCEEDLWCIKRLGEDFSDGS 403

Query: 608 VGDWTKSPPGLSIFVDPREF 626
           VGDWTKSPPG+SIFVDPREF
Sbjct: 602 VGDWTKSPPGMSIFVDPREF 403
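
The percentages on each HSP statistics line are the match counts divided by the alignment length (for the hit above, 357/620 identical and 379/620 positive positions). The following is a minimal Python sketch of that arithmetic, assuming the line layout used in this report. The function name `parse_hsp_stats` and the regular expression are illustrative only and are not part of any BLAST tool; the pattern tolerates either spelling of the "Positives" label.

```python
import re

# Illustrative pattern for an HSP statistics line as laid out in this report.
# "Posi?tives" accepts both "Positives" and the variant spelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identities, length, positives, pos_length = (int(g) for g in m.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 357/620 (57.58%), Positives = 379/620 (61.13%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Recomputing the percentages from the counts, rather than reading the bracketed values, gives a quick consistency check on each reported line.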

BLAST of Cp4.1LG10g02140 vs. ExPASy TrEMBL
Match: A0A1S3B0J5 (probable N-acetyltransferase HLS1 OS=Cucumis melo OX=3656 GN=LOC103484843 PE=4 SV=1)

HSP 1 Score: 650 bits (1678), Expect = 1.96e-229
Identity = 361/627 (57.58%), Positives = 380/627 (60.61%), Query Frame = 0

Query: 1   MIRNRDS-IVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLM 60
           MI  RDS IVIREFD SKD  +VEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLM
Sbjct: 1   MITKRDSMIVIREFDPSKDCIAVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLM 60

Query: 61  LVAATGEENEIVGMIRGCIKTVTCGQKLSR-APPNPKTHHHHHSTNHHLPVFTKLAYILG 120
           LVAAT + NEIVGMIRGCIKTVTCGQKLSR A PN       H    HLPV+TKLAYILG
Sbjct: 61  LVAATADHNEIVGMIRGCIKTVTCGQKLSRSAIPNSD-----HQPPKHLPVYTKLAYILG 120

Query: 121 LRVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPA 180
           LRVSPAHRRMGIG+KLVKKMEEWFRE+GAEYSYIATEKDNVASVNLFTEKC+YSKFRTPA
Sbjct: 121 LRVSPAHRRMGIGIKLVKKMEEWFRESGAEYSYIATEKDNVASVNLFTEKCEYSKFRTPA 180

Query: 181 ILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLA 240
           ILVNPVFAHP+P+S+RVT+LPLSR+DAEILYRRRFSTTEFFPRDIDA+LNN LTLGTFLA
Sbjct: 181 ILVNPVFAHPVPLSKRVTILPLSRSDAEILYRRRFSTTEFFPRDIDAVLNNPLTLGTFLA 240

Query: 241 FPRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDR 300
            PRG YTP TWPGSDRFL DPP+SWAVLSVWNCNDVF+LQVRG SRLKRS ARTTRVLD+
Sbjct: 241 IPRGTYTPHTWPGSDRFLCDPPQSWAVLSVWNCNDVFRLQVRGVSRLKRSFARTTRVLDK 300

Query: 301 AFPWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFREN 360
           AFP LRLPSVPELF PFGLHF+YGL                                   
Sbjct: 301 AFPCLRLPSVPELFSPFGLHFMYGL----------------------------------- 360

Query: 361 GAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADA 420
                                                                       
Sbjct: 361 ------------------------------------------------------------ 409

Query: 421 EILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAV 480
                                                                       
Sbjct: 421 ------------------------------------------------------------ 409

Query: 481 LSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGG 540
                                                                     GG
Sbjct: 481 ----------------------------------------------------------GG 409

Query: 541 EGPEAGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRL 600
           EGP+A RMVKALCGYAHNLAKE+GCGVVATEVSA ERLR AIPHWKMLSCEEDLWCIKRL
Sbjct: 541 EGPDAERMVKALCGYAHNLAKEKGCGVVATEVSAGERLRPAIPHWKMLSCEEDLWCIKRL 409

Query: 601 AEDFSDGSVGDWTKSPPGLSIFVDPRE 625
            EDFSDGSVGDWTKSPPG+SIFVDPRE
Sbjct: 601 GEDFSDGSVGDWTKSPPGMSIFVDPRE 409

BLAST of Cp4.1LG10g02140 vs. ExPASy TrEMBL
Match: A0A5A7SYR2 (Putative N-acetyltransferase HLS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold18G002300 PE=4 SV=1)

HSP 1 Score: 649 bits (1674), Expect = 6.11e-229
Identity = 357/620 (57.58%), Positives = 376/620 (60.65%), Query Frame = 0

Query: 8   IVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATGEE 67
           IVIREFD SKD  +VEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAAT + 
Sbjct: 2   IVIREFDPSKDCIAVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATADH 61

Query: 68  NEIVGMIRGCIKTVTCGQKLSR-APPNPKTHHHHHSTNHHLPVFTKLAYILGLRVSPAHR 127
           NEIVGMIRGCIKTVTCGQKLSR A PN       H    HLPV+TKLAYILGLRVSPAHR
Sbjct: 62  NEIVGMIRGCIKTVTCGQKLSRSAIPNSD-----HQPPKHLPVYTKLAYILGLRVSPAHR 121

Query: 128 RMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFA 187
           RMGIG+KLVKKMEEWFRE+GAEYSYIATEKDNVASVNLFTEKC+YSKFRTPAILVNPVFA
Sbjct: 122 RMGIGIKLVKKMEEWFRESGAEYSYIATEKDNVASVNLFTEKCEYSKFRTPAILVNPVFA 181

Query: 188 HPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTP 247
           HP+P+S+RVT+LPLSR+DAEILYRRRFSTTEFFPRDIDA+LNN LTLGTFLA PRG YTP
Sbjct: 182 HPVPLSKRVTILPLSRSDAEILYRRRFSTTEFFPRDIDAVLNNPLTLGTFLAIPRGTYTP 241

Query: 248 QTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLP 307
            TWPGSDRFL DPP+SWAVLSVWNCNDVF+LQVRG SRLKRS ARTTRVLD+AFP LRLP
Sbjct: 242 HTWPGSDRFLCDPPQSWAVLSVWNCNDVFRLQVRGVSRLKRSFARTTRVLDKAFPCLRLP 301

Query: 308 SVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGAEYSYIA 367
           SVPELF PFGLHF+YGL                                           
Sbjct: 302 SVPELFSPFGLHFMYGL------------------------------------------- 361

Query: 368 TEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRF 427
                                                                       
Sbjct: 362 ------------------------------------------------------------ 403

Query: 428 STTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCND 487
                                                                       
Sbjct: 422 ------------------------------------------------------------ 403

Query: 488 VFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGPEAGRM 547
                                                             GGEGP+A RM
Sbjct: 482 --------------------------------------------------GGEGPDAERM 403

Query: 548 VKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAEDFSDGS 607
           VKALCGYAHNLAKE+GCGVVATEVSA ERLR AIPHWKMLSCEEDLWCIKRL EDFSDGS
Sbjct: 542 VKALCGYAHNLAKEKGCGVVATEVSAGERLRPAIPHWKMLSCEEDLWCIKRLGEDFSDGS 403

Query: 608 VGDWTKSPPGLSIFVDPREF 626
           VGDWTKSPPG+SIFVDPREF
Sbjct: 602 VGDWTKSPPGMSIFVDPREF 403

BLAST of Cp4.1LG10g02140 vs. TAIR 10
Match: AT4G37580.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 513.8 bits (1322), Expect = 1.9e-145
Identity = 291/620 (46.94%), Positives = 341/620 (55.00%), Query Frame = 0

Query: 9   VIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATG-EE 68
           V+RE+D ++D   VEDVERRCEVGPSGKL LFTDLLGDPICR+R+SP++LMLVA  G E+
Sbjct: 3   VVREYDPTRDLVGVEDVERRCEVGPSGKLSLFTDLLGDPICRIRHSPSYLMLVAEMGTEK 62

Query: 69  NEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHL-PVFTKLAYILGLRVSPAHR 128
            EIVGMIRGCIKTVTCGQKL           +H S N  + P++TKLAY+LGLRVSP HR
Sbjct: 63  KEIVGMIRGCIKTVTCGQKLDL---------NHKSQNDVVKPLYTKLAYVLGLRVSPFHR 122

Query: 129 RMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFA 188
           R GIG KLVK MEEWFR+NGAEYSYIATE DN ASVNLFT KC YS+FRTP+ILVNPV+A
Sbjct: 123 RQGIGFKLVKMMEEWFRQNGAEYSYIATENDNQASVNLFTGKCGYSEFRTPSILVNPVYA 182

Query: 189 HPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTP 248
           H + VS+RVT++ L   DAE LYR RFSTTEFFPRDID++LNN L+LGTF+A PRG    
Sbjct: 183 HRVNVSRRVTVIKLEPVDAETLYRIRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRG---- 242

Query: 249 QTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLP 308
                                  +C                                   
Sbjct: 243 -----------------------SC----------------------------------- 302

Query: 309 SVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGAEYSYIA 368
                         YG  G GS                                      
Sbjct: 303 --------------YG-SGSGS-------------------------------------- 362

Query: 369 TEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRF 428
                                                                       
Sbjct: 363 ------------------------------------------------------------ 403

Query: 429 STTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCND 488
                                              WPGS +FL  PPESWAVLSVWNC D
Sbjct: 423 -----------------------------------WPGSAKFLEYPPESWAVLSVWNCKD 403

Query: 489 VFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGPEAGRM 548
            F L+VRGASRL+R +A+TTRV+D+  P+L+LPS+P +F PFGLHF+YG+GGEGP A +M
Sbjct: 483 SFLLEVRGASRLRRVVAKTTRVVDKTLPFLKLPSIPSVFEPFGLHFMYGIGGEGPRAVKM 403

Query: 549 VKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAEDFSDGS 608
           VK+LC +AHNLAK  GCGVVA EV+  + LR  IPHWK+LSC+EDLWCIKRL +D+SDG 
Sbjct: 543 VKSLCAHAHNLAKAGGCGVVAAEVAGEDPLRRGIPHWKVLSCDEDLWCIKRLGDDYSDGV 403

Query: 609 VGDWTKSPPGLSIFVDPREF 627
           VGDWTKSPPG+SIFVDPREF
Sbjct: 603 VGDWTKSPPGVSIFVDPREF 403

BLAST of Cp4.1LG10g02140 vs. TAIR 10
Match: AT2G23060.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 506.5 bits (1303), Expect = 3.0e-143
Identity = 284/624 (45.51%), Positives = 341/624 (54.65%), Query Frame = 0

Query: 8   IVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATG-- 67
           + +RE+D SKD  +VEDVERRCEVGP+GKL LFTDLLGDPICRVR+SP++LMLVA  G  
Sbjct: 5   VEVREYDPSKDLATVEDVERRCEVGPAGKLSLFTDLLGDPICRVRHSPSYLMLVAEIGPK 64

Query: 68  EENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHL---PVFTKLAYILGLRVS 127
           E+ E+VGMIRGCIKTVTCG    R         H+ S N  +   P++TKLAYILGLRVS
Sbjct: 65  EKKELVGMIRGCIKTVTCGITTKRLDLT-----HNKSQNDVVITKPLYTKLAYILGLRVS 124

Query: 128 PAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVN 187
           P HRR GIG KLVK ME+WF +NGAEYSY ATE DN ASVNLFT KC Y++FRTP+ILVN
Sbjct: 125 PTHRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVN 184

Query: 188 PVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRG 247
           PV+AH + +S+RVT++ L  +DAE+LYR RFSTTEFFPRDID++LNN L+LGTF+A PRG
Sbjct: 185 PVYAHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRG 244

Query: 248 IYTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPW 307
                                      +C                               
Sbjct: 245 ---------------------------SC------------------------------- 304

Query: 308 LRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGAEY 367
                             YG G                                      
Sbjct: 305 ------------------YGSGS------------------------------------- 364

Query: 368 SYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILY 427
                                                                       
Sbjct: 365 ------------------------------------------------------------ 413

Query: 428 RRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVW 487
                                                ++WPGS +FL  PP+SWAVLSVW
Sbjct: 425 -------------------------------------RSWPGSAKFLEYPPDSWAVLSVW 413

Query: 488 NCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGPE 547
           NC D F+L+VRGASRL+R +++ TR++D+  P+L++PS+P +FRPFGLHF+YG+GGEGP 
Sbjct: 485 NCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFLKIPSIPAVFRPFGLHFMYGIGGEGPR 413

Query: 548 AGRMVKALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAEDF 607
           A +MVKALC +AHNLAKE GCGVVA EV+  E LR  IPHWK+LSC EDLWCIKRL ED+
Sbjct: 545 AEKMVKALCDHAHNLAKEGGCGVVAAEVAGEEPLRRGIPHWKVLSCAEDLWCIKRLGEDY 413

Query: 608 SDGSVGDWTKSPPGLSIFVDPREF 627
           SDGSVGDWTKSPPG SIFVDPREF
Sbjct: 605 SDGSVGDWTKSPPGDSIFVDPREF 413

BLAST of Cp4.1LG10g02140 vs. TAIR 10
Match: AT2G23060.2 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 437.2 bits (1123), Expect = 2.2e-122
Identity = 201/290 (69.31%), Positives = 243/290 (83.79%), Query Frame = 0

Query: 340 RMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFA 399
           R GIG KLVK ME+WF +NGAEYSY ATE DN ASVNLFT KC Y++FRTP+ILVNPV+A
Sbjct: 69  RQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAEFRTPSILVNPVYA 128

Query: 400 HPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGI--- 459
           H + +S+RVT++ L  +DAE+LYR RFSTTEFFPRDID++LNN L+LGTF+A PRG    
Sbjct: 129 HRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSLGTFVAVPRGSCYG 188

Query: 460 YTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWL 519
              ++WPGS +FL  PP+SWAVLSVWNC D F+L+VRGASRL+R +++ TR++D+  P+L
Sbjct: 189 SGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVSKATRMVDKTLPFL 248

Query: 520 RLPSVPELFRPFGLHFLYGLGGEGPEAGRMVKALCGYAHNLAKERGCGVVATEVSARERL 579
           ++PS+P +FRPFGLHF+YG+GGEGP A +MVKALC +AHNLAKE GCGVVA EV+  E L
Sbjct: 249 KIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALCDHAHNLAKEGGCGVVAAEVAGEEPL 308

Query: 580 RGAIPHWKMLSCEEDLWCIKRLAEDFSDGSVGDWTKSPPGLSIFVDPREF 627
           R  IPHWK+LSC EDLWCIKRL ED+SDGSVGDWTKSPPG SIFVDPREF
Sbjct: 309 RRGIPHWKVLSCAEDLWCIKRLGEDYSDGSVGDWTKSPPGDSIFVDPREF 358


HSP 2 Score: 365.9 bits (938), Expect = 6.4e-101
Identity = 181/288 (62.85%), Positives = 225/288 (78.12%), Query Frame = 0

Query: 59  MLVAATG--EENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHL---PVFTKL 118
           MLVA  G  E+ E+VGMIRGCIKTVTCG    R         H+ S N  +   P++TKL
Sbjct: 1   MLVAEIGPKEKKELVGMIRGCIKTVTCGITTKRLDLT-----HNKSQNDVVITKPLYTKL 60

Query: 119 AYILGLRVSPAHRRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSK 178
           AYILGLRVSP HRR GIG KLVK ME+WF +NGAEYSY ATE DN ASVNLFT KC Y++
Sbjct: 61  AYILGLRVSPTHRRQGIGFKLVKAMEDWFSQNGAEYSYFATENDNHASVNLFTGKCGYAE 120

Query: 179 FRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTL 238
           FRTP+ILVNPV+AH + +S+RVT++ L  +DAE+LYR RFSTTEFFPRDID++LNN L+L
Sbjct: 121 FRTPSILVNPVYAHRVNISRRVTVIKLEPSDAELLYRLRFSTTEFFPRDIDSVLNNKLSL 180

Query: 239 GTFLAFPRGI---YTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLA 298
           GTF+A PRG       ++WPGS +FL  PP+SWAVLSVWNC D F+L+VRGASRL+R ++
Sbjct: 181 GTFVAVPRGSCYGSGSRSWPGSAKFLEYPPDSWAVLSVWNCKDSFRLEVRGASRLRRVVS 240

Query: 299 RTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALC 339
           + TR++D+  P+L++PS+P +FRPFGLHF+YG+GGEG  A +MVKALC
Sbjct: 241 KATRMVDKTLPFLKIPSIPAVFRPFGLHFMYGIGGEGPRAEKMVKALC 283

BLAST of Cp4.1LG10g02140 vs. TAIR 10
Match: AT5G67430.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 371.7 bits (953), Expect = 1.2e-102
Identity = 239/618 (38.67%), Positives = 293/618 (47.41%), Query Frame = 0

Query: 8   IVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATGEE 67
           +V+RE+D  +D  SVE++E  CEVG      L  DL+GDP+ R+R SP+F MLVA  G  
Sbjct: 8   VVVREYDPKRDLTSVEELEESCEVG-----SLLVDLMGDPLARIRQSPSFHMLVAEIG-- 67

Query: 68  NEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLRVSPAHRR 127
           NEIVGMIRG IK VT G    R   +     +           TKLA++ GLRVSP +RR
Sbjct: 68  NEIVGMIRGTIKMVTRGVNALRQADDVSPEIN----------TTKLAFVSGLRVSPFYRR 127

Query: 128 MGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVFAH 187
           MGIG+KLV+++EEWF  N A YSY+ TE DN+ASV LFTEK  YSKFRTP  LVNPVF H
Sbjct: 128 MGIGLKLVQRLEEWFLRNDAVYSYVQTENDNIASVKLFTEKSGYSKFRTPTFLVNPVFNH 187

Query: 188 PLPVSQRVTLLPLSRADAEILYRRRFSTTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQ 247
            + VS+RV ++ L+ +DAE LYR RFSTTEFFP DI++IL N L+LGT+LA PRG     
Sbjct: 188 RVTVSRRVKIIKLAPSDAESLYRNRFSTTEFFPSDINSILTNKLSLGTYLAVPRG----- 247

Query: 248 TWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLRLPS 307
                                                                       
Sbjct: 248 ------------------------------------------------------------ 307

Query: 308 VPELFRPFGLHFLYGLGGEGSEAGRMVKALCGRMGIGMKLVKKMEEWFRENGAEYSYIAT 367
                                                                       
Sbjct: 308 ------------------------------------------------------------ 367

Query: 368 EKDNVASVNLFTEKCDYSKFRTPAILVNPVFAHPLPVSQRVTLLPLSRADAEILYRRRFS 427
             DNV+                                                      
Sbjct: 368 -GDNVS------------------------------------------------------ 385

Query: 428 TTEFFPRDIDAILNNVLTLGTFLAFPRGIYTPQTWPGSDRFLLDPPESWAVLSVWNCNDV 487
                                               GS   L D   SWAV+S+WN  DV
Sbjct: 428 ------------------------------------GS---LPDQTGSWAVISIWNSKDV 385

Query: 488 FKLQVRGASRLKRSLARTTRVLDRAFPWLRLPSVPELFRPFGLHFLYGLGGEGPEAGRMV 547
           ++LQV+GASRLKR LA++TRV D AFP+L++PS P LF+ F +HF+YG+GGEGP A  MV
Sbjct: 488 YRLQVKGASRLKRMLAKSTRVFDGAFPFLKIPSFPNLFKSFAMHFMYGIGGEGPRAAEMV 385

Query: 548 KALCGYAHNLAKERGCGVVATEVSARERLRGAIPHWKMLSCEEDLWCIKRLAEDFSDGSV 607
           +ALC +AHNLA++ GC VVA EV++ E LR  IPHWK+LS  EDLWC+KRL  D  DG  
Sbjct: 548 EALCSHAHNLARKSGCAVVAAEVASCEPLRVGIPHWKVLS-PEDLWCLKRLRYD-DDGV- 385

Query: 608 GDWTKSPPGLSIFVDPRE 626
            DWTKSPPGLSIFVDPRE
Sbjct: 608 -DWTKSPPGLSIFVDPRE 385

BLAST of Cp4.1LG10g02140 vs. TAIR 10
Match: AT2G30090.1 (Acyl-CoA N-acyltransferases (NAT) superfamily protein )

HSP 1 Score: 242.3 bits (617), Expect = 1.1e-63
Identity = 138/335 (41.19%), Positives = 192/335 (57.31%), Query Frame = 0

Query: 6   DSIVIREFDASKDSKSVEDVERRCEVGPSGKLCLFTDLLGDPICRVRNSPAFLMLVAATG 65
           + +VIR +D  +D   +  +E+ CE+G   +  LFTD LGDPICR+RNSP F+MLVA  G
Sbjct: 11  EEVVIRCYDDRRDRIQMGRMEKSCEIGHDHQTLLFTDTLGDPICRIRNSPFFIMLVAGVG 70

Query: 66  EENEIVGMIRGCIKTVTCGQKLSRAPPNPKTHHHHHSTNHHLPVFTKLAYILGLRVSPAH 125
             N++VG I+G +K V    K  R                       + Y+LGLRV P++
Sbjct: 71  --NKLVGSIQGSVKPVEFHDKSVR-----------------------VGYVLGLRVVPSY 130

Query: 126 RRMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVF 185
           RR GIG  LV+K+EEWF  + A+Y+Y+ATEKDN AS  LF  +  Y  FR PAILVNPV 
Sbjct: 131 RRRGIGSILVRKLEEWFESHNADYAYMATEKDNEASHGLFIGRLGYVVFRNPAILVNPVN 190

Query: 186 -AHPLPVSQRVTLLPLSRADAEILYRRRF-STTEFFPRDIDAILNNVLTLGTFLAFPRGI 245
               L +   + +  L   +AE LYRR   +TTEFFP DI+ IL N L++GT++A+   +
Sbjct: 191 PGRGLKLPSDIGIRKLKVKEAESLYRRNVAATTEFFPDDINKILRNKLSIGTWVAYYNNV 250

Query: 246 YTPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWL 305
                         D   SWA+LSVW+ + VFKL++  A      L + +++       L
Sbjct: 251 --------------DNTRSWAMLSVWDSSKVFKLRIERAPLSYLLLTKVSKLFGNFLSLL 306

Query: 306 RLPSVPELFRPFGLHFLYGLGGEGSEAGRMVKALC 339
            L  +P+LF PFG +FLYG+  EG   G++V+ALC
Sbjct: 311 GLTVLPDLFTPFGFYFLYGVHSEGPHCGKLVRALC 306


HSP 2 Score: 222.2 bits (565), Expect = 1.1e-57
Identity = 125/295 (42.37%), Positives = 174/295 (58.98%), Query Frame = 0

Query: 340 RMGIGMKLVKKMEEWFRENGAEYSYIATEKDNVASVNLFTEKCDYSKFRTPAILVNPVF- 399
           R GIG  LV+K+EEWF  + A+Y+Y+ATEKDN AS  LF  +  Y  FR PAILVNPV  
Sbjct: 107 RRGIGSILVRKLEEWFESHNADYAYMATEKDNEASHGLFIGRLGYVVFRNPAILVNPVNP 166

Query: 400 AHPLPVSQRVTLLPLSRADAEILYRRRF-STTEFFPRDIDAILNNVLTLGTFLAFPRGIY 459
              L +   + +  L   +AE LYRR   +TTEFFP DI+ IL N L++GT++A+   + 
Sbjct: 167 GRGLKLPSDIGIRKLKVKEAESLYRRNVAATTEFFPDDINKILRNKLSIGTWVAYYNNV- 226

Query: 460 TPQTWPGSDRFLLDPPESWAVLSVWNCNDVFKLQVRGASRLKRSLARTTRVLDRAFPWLR 519
                        D   SWA+LSVW+ + VFKL++  A      L + +++       L 
Sbjct: 227 -------------DNTRSWAMLSVWDSSKVFKLRIERAPLSYLLLTKVSKLFGNFLSLLG 286

Query: 520 LPSVPELFRPFGLHFLYGLGGEGPEAGRMVKALCGYAHNLAKER---GCGVVATEV---- 579
           L  +P+LF PFG +FLYG+  EGP  G++V+ALC + HN+A       C VV  EV    
Sbjct: 287 LTVLPDLFTPFGFYFLYGVHSEGPHCGKLVRALCEHVHNMAALNDGCACKVVVVEVDKGS 346

Query: 580 SARERLRGAIPHWKMLSCEEDLWCIKRLAEDFSDGSVGDWTKSPPGLSIFVDPRE 626
           +  + L+  IPHWKMLSC++D+WCIK L  + +   + + +KS    S+FVDPRE
Sbjct: 347 NGDDSLQRCIPHWKMLSCDDDMWCIKPLKCEKNKFDLSERSKSRS--SLFVDPRE 385

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
Q42381          2.7e-144   46.94     Probable N-acetyltransferase HLS1 OS=Arabidopsis thaliana OX=3702 GN=HLS1 PE=1 S...
O64815          4.2e-142   45.51     Probable N-acetyltransferase HLS1-like OS=Arabidopsis thaliana OX=3702 GN=At2g23...

Match Name      E-value    Identity  Description
XP_023544685.1  1.97e-275  65.97     probable N-acetyltransferase HLS1 [Cucurbita pepo subsp. pepo]
KAG6604134.1    5.34e-273  65.18     putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma ...
XP_022950471.1  3.71e-271  65.23     probable N-acetyltransferase HLS1 [Cucurbita moschata]
XP_022977747.1  1.21e-260  63.58     probable N-acetyltransferase HLS1 isoform X1 [Cucurbita maxima]
KAG7034295.1    4.21e-257  62.94     putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma ...

Match Name      E-value    Identity  Description
A0A6J1GEX3      1.80e-271  65.23     probable N-acetyltransferase HLS1 OS=Cucurbita moschata OX=3662 GN=LOC111453566 ...
A0A6J1IJC0      5.88e-261  63.58     probable N-acetyltransferase HLS1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1...
A0A0A0KLH9      4.57e-231  57.58     N-acetyltransferase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_...
A0A1S3B0J5      1.96e-229  57.58     probable N-acetyltransferase HLS1 OS=Cucumis melo OX=3656 GN=LOC103484843 PE=4 S...
A0A5A7SYR2      6.11e-229  57.58     Putative N-acetyltransferase HLS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...

Match Name      E-value    Identity  Description
AT4G37580.1     1.9e-145   46.94     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.1     3.0e-143   45.51     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G23060.2     2.2e-122   69.31     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT5G67430.1     1.2e-102   38.67     Acyl-CoA N-acyltransferases (NAT) superfamily protein
AT2G30090.1     1.1e-63    41.19     Acyl-CoA N-acyltransferases (NAT) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description             Source       Source Term     Source Description                     Alignment
None       No IPR available            GENE3D       3.40.630.30     -                                      coord: 42..177, e-value: 5.7E-15, score: 57.5
None       No IPR available            GENE3D       3.40.630.30     -                                      coord: 316..393, e-value: 1.9E-6, score: 29.7
None       No IPR available            PANTHER      PTHR43072       N-ACETYLTRANSFERASE                    coord: 4..339; coord: 340..626
None       No IPR available            PANTHER      PTHR43072:SF42  N-ACETYLTRANSFERASE HLS1-LIKE-RELATED  coord: 4..339; coord: 340..626
None       No IPR available            CDD          cd04301         NAT_SF                                 coord: 114..149, e-value: 1.98873E-7, score: 46.5001
IPR000182  GNAT domain                 PFAM         PF00583         Acetyltransf_1                         coord: 61..171, e-value: 3.9E-17, score: 62.6; coord: 333..384, e-value: 1.4E-7, score: 31.8
IPR000182  GNAT domain                 PROSITE      PS51186         GNAT                                   coord: 8..196, score: 13.387731
IPR016181  Acyl-CoA N-acyltransferase  SUPERFAMILY  55729           Acyl-CoA N-acyltransferases (Nat)      coord: 6..173
IPR016181  Acyl-CoA N-acyltransferase  SUPERFAMILY  55729           Acyl-CoA N-acyltransferases (Nat)      coord: 331..388

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g02140.1  Cp4.1LG10g02140.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0008080      N-acetyltransferase activity