Cmc06g0159821 (gene) Melon (Charmono) v1.1

Overview
NameCmc06g0159821
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
LocationCMiso1.1chr06: 8064003 .. 8065730 (-)
RNA-Seq ExpressionCmc06g0159821
SyntenyCmc06g0159821
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGAAATCATAAACCACGAAACAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGAGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCAACGCAGTGGGAGGATTGTATCACAACCTAACCGTTATTTGGGTTTAACTGAAACTCAGGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTTAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGTTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAAACCTTCAAAGCTAGACTTGTGGCAAAAGGGTATACTCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCTTGTTGCTATGTTAAGGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGAAATATGGAAAATGGATGTCAAGACTGCTTTGTTAAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCGCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATACTGCGATCAAATGCTACGGTTTTGACCAGAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTAGACGATATCCTCCTCATTGGGAATGATATGGGATACCTTACTGACGTTAAAGCTTGGTTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAATCCTAGCACTATCTCAAGTAACCTATATCGACAAAATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGGAACAGTGTTCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGTTATGCTCTGCACTAGGCCAGAAATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCACTGGACGGCGGTTAAAATTATTCTTAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTTTGATTTCCAAACCGATAAGGATTCTAGAAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGTTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTGTACAATGGAGGCTGAATACGTAGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTAGAAGTTGTTCCAAACATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTGTAAAGAACCTCGCAGCCATAAACGAGGGAAAGATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGA

mRNA sequence

ATGAGAAATCATAAACCACGAAACAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGAGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCAACGCAGTGGGAGGATTGTATCACAACCTAACCGTTATTTGGGTTTAACTGAAACTCAGGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTTAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGTTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAAACCTTCAAAGCTAGACTTGTGGCAAAAGGGTATACTCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCTTGTTGCTATGTTAAGGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGAAATATGGAAAATGGATGTCAAGACTGCTTTGTTAAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCGCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATACTGCGATCAAATGCTACGGTTTTGACCAGAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTAGACGATATCCTCCTCATTGGGAATGATATGGGATACCTTACTGACGTTAAAGCTTGGTTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAATCCTAGCACTATCTCAAGTAACCTATATCGACAAAATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGGAACAGTGTTCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGTTATGCTCTGCACTAGGCCAGAAATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCACTGGACGGCGGTTAAAATTATTCTTAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTTTGATTTCCAAACCGATAAGGATTCTAGAAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGTTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTGTACAATGGAGGCTGAATACGTAGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTAGAAGTTGTTCCAAACATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTGTAAAGAACCTCGCAGCCATAAACGAGGGAAAGATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGA

Coding sequence (CDS)

ATGAGAAATCATAAACCACGAAACAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGAGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCAACGCAGTGGGAGGATTGTATCACAACCTAACCGTTATTTGGGTTTAACTGAAACTCAGGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTTAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGTTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAAACCTTCAAAGCTAGACTTGTGGCAAAAGGGTATACTCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCTTGTTGCTATGTTAAGGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGAAATATGGAAAATGGATGTCAAGACTGCTTTGTTAAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCGCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATACTGCGATCAAATGCTACGGTTTTGACCAGAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTAGACGATATCCTCCTCATTGGGAATGATATGGGATACCTTACTGACGTTAAAGCTTGGTTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAATCCTAGCACTATCTCAAGTAACCTATATCGACAAAATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGGGTTCACTTGTCTAAGGAACAGTGTTCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGTTATGCTCTGCACTAGGCCAGAAATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCACTGGACGGCGGTTAAAATTATTCTTAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTTTGATTTCCAAACCGATAAGGATTCTAGAAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGTTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTGTACAATGGAGGCTGAATACGTAGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTAGAAGTTGTTCCAAACATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTGTAAAGAACCTCGCAGCCATAAACGAGGGAAAGATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGA

Protein sequence

MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSIATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIKQGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPRSHKRGKDREEVSSDTGDCATRGCDRHQDRFGAQHC
Homology
BLAST of Cmc06g0159821 vs. NCBI nr
Match: KAA0050437.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1164.1 bits (3010), Expect = 0.0e+00
Identity = 575/575 (100.00%), Postives = 575/575 (100.00%), Query Frame = 0

Query: 1   MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
           MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP
Sbjct: 319 MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 378

Query: 61  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
           NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE
Sbjct: 379 NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 438

Query: 121 GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
           GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI
Sbjct: 439 GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 498

Query: 181 ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
           ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 499 ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 558

Query: 241 NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
           NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA
Sbjct: 559 NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 618

Query: 301 QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
           QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL
Sbjct: 619 QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 678

Query: 361 SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
           SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV
Sbjct: 679 SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 738

Query: 421 KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
           KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK
Sbjct: 739 KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 798

Query: 481 QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
           QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR
Sbjct: 799 QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 858

Query: 541 SHKRGKDREEVSSDTGDCATRGCDRHQDRFGAQHC 576
           SHKRGKDREEVSSDTGDCATRGCDRHQDRFGAQHC
Sbjct: 859 SHKRGKDREEVSSDTGDCATRGCDRHQDRFGAQHC 893

BLAST of Cmc06g0159821 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1055.0 bits (2727), Expect = 2.2e-304
Identity = 524/549 (95.45%), Postives = 533/549 (97.09%), Query Frame = 0

Query: 1    MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
            MRNHKPR+KLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMP+RSGR+VSQP
Sbjct: 633  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQP 692

Query: 61   NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
            NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN VWELVDLPE
Sbjct: 693  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPE 752

Query: 121  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
            GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFS VAML+SIRILLSI
Sbjct: 753  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI 812

Query: 181  ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
            ATFYDYEIW+MDVKTA LNGNLEESIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 813  ATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSW 872

Query: 241  NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
            NIRFDTAIK YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGND+GYLTDVKAWLAA
Sbjct: 873  NIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA 932

Query: 301  QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
            QFQMKDLGEAQYVLGIQIIRDRKNK LALSQ TYIDK+LVRYSMQNSKKGLLPFRHGVHL
Sbjct: 933  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHL 992

Query: 361  SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
            SKEQ  KTPQEVEDMRRIPYASAVGSLMY MLCTRP+ICYAVGIVSRYQSNPGLDHWTAV
Sbjct: 993  SKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAV 1052

Query: 421  KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
            KI+LKYLRRTRDYMLVYGAKDLILTGYTD DFQTDKDSRKSTSGSVFTLNGG VVWRSIK
Sbjct: 1053 KIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK 1112

Query: 481  QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
            QGCIAD TMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN KEPR
Sbjct: 1113 QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPR 1172

Query: 541  SHKRGKDRE 550
            SHKRGK  E
Sbjct: 1173 SHKRGKHIE 1181

BLAST of Cmc06g0159821 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1055.0 bits (2727), Expect = 2.2e-304
Identity = 524/549 (95.45%), Postives = 533/549 (97.09%), Query Frame = 0

Query: 1    MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
            MRNHKPR+KLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMP+RSGR+VSQP
Sbjct: 507  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQP 566

Query: 61   NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
            NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN VWELVDLPE
Sbjct: 567  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPE 626

Query: 121  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
            GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFS VAML+SIRILLSI
Sbjct: 627  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI 686

Query: 181  ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
            ATFYDYEIW+MDVKTA LNGNLEESIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 687  ATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSW 746

Query: 241  NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
            NIRFDTAIK YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGND+GYLTDVKAWLAA
Sbjct: 747  NIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA 806

Query: 301  QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
            QFQMKDLGEAQYVLGIQIIRDRKNK LALSQ TYIDK+LVRYSMQNSKKGLLPFRHGVHL
Sbjct: 807  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHL 866

Query: 361  SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
            SKEQ  KTPQEVEDMRRIPYASAVGSLMY MLCTRP+ICYAVGIVSRYQSNPGLDHWTAV
Sbjct: 867  SKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAV 926

Query: 421  KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
            KI+LKYLRRTRDYMLVYGAKDLILTGYTD DFQTDKDSRKSTSGSVFTLNGG VVWRSIK
Sbjct: 927  KIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK 986

Query: 481  QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
            QGCIAD TMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN KEPR
Sbjct: 987  QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPR 1046

Query: 541  SHKRGKDRE 550
            SHKRGK  E
Sbjct: 1047 SHKRGKHIE 1055

BLAST of Cmc06g0159821 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1043.1 bits (2696), Expect = 8.8e-301
Identity = 518/549 (94.35%), Postives = 530/549 (96.54%), Query Frame = 0

Query: 1    MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
            MRNHKPR+KLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMP+RSGR+VSQP
Sbjct: 633  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQP 692

Query: 61   NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
            NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN VWELVDLPE
Sbjct: 693  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPE 752

Query: 121  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
            GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS VAML+SIRILLSI
Sbjct: 753  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRILLSI 812

Query: 181  ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
            A FYDYEIW+MDVKTA LNGNLEESIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 813  AKFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSW 872

Query: 241  NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
            NIRFDTAIK YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGND+GYLTDVKAWLAA
Sbjct: 873  NIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA 932

Query: 301  QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
            QFQMKDLGE QYVLGIQIIRDRKNK LALSQ TYIDK+LVRYSMQNSKKGLLPFRHGVHL
Sbjct: 933  QFQMKDLGEGQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHL 992

Query: 361  SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
            SKEQ  KTPQEVEDMRRIPYASAVGSLMY MLCTRP+ICYAVGIVSRYQSNPGLDHWTAV
Sbjct: 993  SKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAV 1052

Query: 421  KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
            KIILKYLRRTRDYMLVYGAKDLILTGYT+ DFQTDKDSRKSTS SVFTLNGG VVWRSIK
Sbjct: 1053 KIILKYLRRTRDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIK 1112

Query: 481  QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
            QGCIAD TMEAEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVAN KEPR
Sbjct: 1113 QGCIADSTMEAEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPR 1172

Query: 541  SHKRGKDRE 550
            SHKRGK  E
Sbjct: 1173 SHKRGKHIE 1181

BLAST of Cmc06g0159821 vs. NCBI nr
Match: KAA0033121.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK17112.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1030.0 bits (2662), Expect = 7.7e-297
Identity = 514/549 (93.62%), Postives = 526/549 (95.81%), Query Frame = 0

Query: 1   MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
           MR+HKP+NKLVL+EA DESTRVVDEVGPSSRV+ETTTSGQSHPSQSLRMP+RSGRIVSQP
Sbjct: 243 MRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQP 302

Query: 61  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
           NRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFNL+WELVDLPE
Sbjct: 303 NRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE 362

Query: 121 GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
           GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFS VAML+SIRILLSI
Sbjct: 363 GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI 422

Query: 181 ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
           ATFYDYEIWKMDV TA LNGNLEESIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 423 ATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSW 482

Query: 241 NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
           NIRFDTAIK YGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGND+GYLTDVKAWLAA
Sbjct: 483 NIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAA 542

Query: 301 QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
           QFQMKDLGEAQYVLGIQIIRDRKNK LALSQ TYIDKMLVRYSMQNSKKGLLPFRHGVHL
Sbjct: 543 QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHL 602

Query: 361 SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
           SKEQC KTPQEVEDMRRIPYASAVGSLMYV+ CTR EICYAV IVSRYQSN GLDHWTAV
Sbjct: 603 SKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV 662

Query: 421 KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
           KIILKYLRRTRDYMLVYGAKDLILTGYTD DFQT+KDSRKSTS SVFTLNGG +VWRSIK
Sbjct: 663 KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIK 722

Query: 481 QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
           QGCIAD TMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN KEPR
Sbjct: 723 QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPR 782

Query: 541 SHKRGKDRE 550
           SHKR K  E
Sbjct: 783 SHKREKHIE 791

BLAST of Cmc06g0159821 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 439.9 bits (1130), Expect = 4.5e-122
Identity = 235/546 (43.04%), Postives = 343/546 (62.82%), Query Frame = 0

Query: 8    NKLVLSEATDESTRVVDEVGPSSRVDETTTSG---QSHPSQSLRMPQ---RSGRIVSQPN 67
            N       TDE +   ++ G      E    G     HP+Q     Q   RS R   +  
Sbjct: 736  NPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESR 795

Query: 68   RYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPEG 127
            RY   +   V+I DD   +P S K+ ++  +K+Q +KAM  EMES+  N  ++LV+LP+G
Sbjct: 796  RY--PSTEYVLISDD--REPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKG 855

Query: 128  VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSIA 187
             +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FS V  + SIR +LS+A
Sbjct: 856  KRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLA 915

Query: 188  TFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSWN 247
               D E+ ++DVKTA L+G+LEE I+M Q EGF   G++  VCKLN+S+YGLKQA R W 
Sbjct: 916  ASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWY 975

Query: 248  IRFDTAIKCYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 307
            ++FD+ +K   + +   +PCVY K+ ++     L+LYVDD+L++G D G +  +K  L+ 
Sbjct: 976  MKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSK 1035

Query: 308  QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 367
             F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K    P    + L
Sbjct: 1036 SFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKL 1095

Query: 368  SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 427
            SK+ C  T +E  +M ++PY+SAVGSLMY M+CTRP+I +AVG+VSR+  NPG +HW AV
Sbjct: 1096 SKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAV 1155

Query: 428  KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 487
            K IL+YLR T    L +G  D IL GYTD D   D D+RKS++G +FT +GG + W+S  
Sbjct: 1156 KWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKL 1215

Query: 488  QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 547
            Q C+A  T EAEY+AA E  KE +WL++FL +L +         +YCD+  A+   K   
Sbjct: 1216 QKCVALSTTEAEYIAATETGKEMIWLKRFLQELGL---HQKEYVVYCDSQSAIDLSKNSM 1274

BLAST of Cmc06g0159821 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 291.2 bits (744), Expect = 2.6e-77
Identity = 165/476 (34.66%), Postives = 266/476 (55.88%), Query Frame = 0

Query: 81   PLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPEGVKPIGCKWIYKRKRDSAGK 140
            P S+ +     DK  W +A++ E+ +   N  W +   PE    +  +W++  K +  G 
Sbjct: 891  PNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGN 950

Query: 141  VQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSIATFYDYEIWKMDVKTALLNG 200
               +KARLVA+G+TQ+  +DYEETF+ VA + S R +LS+   Y+ ++ +MDVKTA LNG
Sbjct: 951  PIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNG 1010

Query: 201  NLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKCYGFDQNVDEP 260
             L+E I+M   +G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + 
Sbjct: 1011 TLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDR 1070

Query: 261  CVY--KKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQI 320
            C+Y   K N  +  +++LYVDD+++   DM  + + K +L  +F+M DL E ++ +GI+I
Sbjct: 1071 CIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI 1130

Query: 321  IRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL----SKEQCSKTPQEVED 380
              + +   + LSQ  Y+ K+L +++M+N      P    ++     S E C+        
Sbjct: 1131 --EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDCN-------- 1190

Query: 381  MRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYM 440
                P  S +G LMY+MLCTRP++  AV I+SRY S    + W  +K +L+YL+ T D  
Sbjct: 1191 ---TPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMK 1250

Query: 441  LVYG---AKDLILTGYTDFDFQTDKDSRKSTSGSVFTL-NGGVVVWRSIKQGCIADCTME 500
            L++    A +  + GY D D+   +  RKST+G +F + +  ++ W + +Q  +A  + E
Sbjct: 1251 LIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTE 1310

Query: 501  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPRSHKRGK 547
            AEY+A  EA +EA+WL+  L  + +   +  PI +Y DN G ++    P  HKR K
Sbjct: 1311 AEYMALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIANNPSCHKRAK 1349

BLAST of Cmc06g0159821 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 2.9e-73
Identity = 167/469 (35.61%), Postives = 248/469 (52.88%), Query Frame = 0

Query: 80   DPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELV-DLPEGVKPIGCKWIYKRKRDSA 139
            +P +  QAM D   D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +S 
Sbjct: 938  EPRTAIQAMKD---DRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSD 997

Query: 140  GKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSIATFYDYEIWKMDVKTALL 199
            G +  +KARLVAKGY QR G+DY ETFS V    SIRI+L +A    + I ++DV  A L
Sbjct: 998  GSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFL 1057

Query: 200  NGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKCYGFDQNVD 259
             G L + ++MSQ  GF+ + +   VC+L ++IYGLKQA R+W +   T +   GF  ++ 
Sbjct: 1058 QGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSIS 1117

Query: 260  EPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQI 319
            +  ++       + ++++YVDDIL+ GND   L      L+ +F +K+  +  Y LGI+ 
Sbjct: 1118 DTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEA 1177

Query: 320  IRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCSKTPQEVEDMRRI 379
             R  +   L LSQ  Y   +L R +M  +K    P      L+    +K P   E     
Sbjct: 1178 KRVPQG--LHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTE----- 1237

Query: 380  PYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVY 439
             Y   VGSL Y+   TRP++ YAV  +S+Y   P  DHW A+K +L+YL  T D+ + + 
Sbjct: 1238 -YRGIVGSLQYLAF-TRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLK 1297

Query: 440  GAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIKQGCIADCTMEAEYVAAC 499
                L L  Y+D D+  D D   ST+G +  L    + W S KQ  +   + EAEY +  
Sbjct: 1298 KGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVA 1357

Query: 500  EAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPRSHKRGK 547
              + E  W+   L +L +   ++ P  +YCDN GA   C  P  H R K
Sbjct: 1358 NTSSELQWICSLLTELGI--QLSHPPVIYCDNVGATYLCANPVFHSRMK 1392

BLAST of Cmc06g0159821 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 268.9 bits (686), Expect = 1.4e-70
Identity = 168/469 (35.82%), Postives = 245/469 (52.24%), Query Frame = 0

Query: 80   DPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPEG-VKPIGCKWIYKRKRDSA 139
            +P +  QA+ D   ++W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +S 
Sbjct: 955  EPRTAIQALKD---ERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSD 1014

Query: 140  GKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSIATFYDYEIWKMDVKTALL 199
            G +  +KARLVAKGY QR G+DY ETFS V    SIRI+L +A    + I ++DV  A L
Sbjct: 1015 GSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFL 1074

Query: 200  NGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKCYGFDQNVD 259
             G L + ++MSQ  GFI + +   VCKL +++YGLKQA R+W +     +   GF  +V 
Sbjct: 1075 QGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVS 1134

Query: 260  EPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQI 319
            +  ++       + ++++YVDDIL+ GND   L +    L+ +F +KD  E  Y LGI+ 
Sbjct: 1135 DTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEA 1194

Query: 320  IRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCSKTPQEVEDMRRI 379
             R      L LSQ  YI  +L R +M  +K    P      LS    +K     E     
Sbjct: 1195 KRVPTG--LHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTE----- 1254

Query: 380  PYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVY 439
             Y   VGSL Y+   TRP+I YAV  +S++   P  +H  A+K IL+YL  T ++ + + 
Sbjct: 1255 -YRGIVGSLQYLAF-TRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLK 1314

Query: 440  GAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIKQGCIADCTMEAEYVAAC 499
                L L  Y+D D+  DKD   ST+G +  L    + W S KQ  +   + EAEY +  
Sbjct: 1315 KGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVA 1374

Query: 500  EAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPRSHKRGK 547
              + E  W+   L +L +   +  P  +YCDN GA   C  P  H R K
Sbjct: 1375 NTSSEMQWICSLLTELGI--RLTRPPVIYCDNVGATYLCANPVFHSRMK 1409

BLAST of Cmc06g0159821 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 142.9 bits (359), Expect = 1.1e-32
Identity = 102/314 (32.48%), Postives = 143/314 (45.54%), Query Frame = 0

Query: 191 MDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKC 250
           MDV TA LN  ++E I++ Q  GF+ +     V +L   +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 251 YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAAQFQMKDLGEA 310
            GF ++  E  +Y +       ++ +YVDD+L+          VK  L   + MKDLG+ 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 311 QYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCSKTPQ 370
              LG+  I    N  + LS   YI K      +   K    P  +    SK     T  
Sbjct: 121 DKFLGLN-IHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCN----SKPLFETTSP 180

Query: 371 EVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRT 430
            ++D+   PY S VG L++     RP+I Y V ++SR+   P   H  + + +L+YL  T
Sbjct: 181 HLKDI--TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 240

Query: 431 RDYMLVY-GAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK-QGCIADCT 490
           R   L Y     L LT Y D       D   ST G V  L G  V W S K +G I   +
Sbjct: 241 RSMCLKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPS 300

Query: 491 MEAEYVAACEAAKE 503
            EAEY+ A E   E
Sbjct: 301 TEAEYITASETVME 307

BLAST of Cmc06g0159821 vs. ExPASy TrEMBL
Match: A0A5A7U7T0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold175G00310 PE=4 SV=1)

HSP 1 Score: 1164.1 bits (3010), Expect = 0.0e+00
Identity = 575/575 (100.00%), Postives = 575/575 (100.00%), Query Frame = 0

Query: 1   MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
           MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP
Sbjct: 319 MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 378

Query: 61  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
           NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE
Sbjct: 379 NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 438

Query: 121 GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
           GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI
Sbjct: 439 GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 498

Query: 181 ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
           ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 499 ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 558

Query: 241 NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
           NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA
Sbjct: 559 NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 618

Query: 301 QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
           QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL
Sbjct: 619 QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 678

Query: 361 SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
           SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV
Sbjct: 679 SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 738

Query: 421 KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
           KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK
Sbjct: 739 KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 798

Query: 481 QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
           QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR
Sbjct: 799 QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 858

Query: 541 SHKRGKDREEVSSDTGDCATRGCDRHQDRFGAQHC 576
           SHKRGKDREEVSSDTGDCATRGCDRHQDRFGAQHC
Sbjct: 859 SHKRGKDREEVSSDTGDCATRGCDRHQDRFGAQHC 893

BLAST of Cmc06g0159821 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 1055.0 bits (2727), Expect = 1.1e-304
Identity = 524/549 (95.45%), Postives = 533/549 (97.09%), Query Frame = 0

Query: 1    MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
            MRNHKPR+KLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMP+RSGR+VSQP
Sbjct: 633  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQP 692

Query: 61   NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
            NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN VWELVDLPE
Sbjct: 693  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPE 752

Query: 121  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
            GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFS VAML+SIRILLSI
Sbjct: 753  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI 812

Query: 181  ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
            ATFYDYEIW+MDVKTA LNGNLEESIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 813  ATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSW 872

Query: 241  NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
            NIRFDTAIK YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGND+GYLTDVKAWLAA
Sbjct: 873  NIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA 932

Query: 301  QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
            QFQMKDLGEAQYVLGIQIIRDRKNK LALSQ TYIDK+LVRYSMQNSKKGLLPFRHGVHL
Sbjct: 933  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHL 992

Query: 361  SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
            SKEQ  KTPQEVEDMRRIPYASAVGSLMY MLCTRP+ICYAVGIVSRYQSNPGLDHWTAV
Sbjct: 993  SKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAV 1052

Query: 421  KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
            KI+LKYLRRTRDYMLVYGAKDLILTGYTD DFQTDKDSRKSTSGSVFTLNGG VVWRSIK
Sbjct: 1053 KIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK 1112

Query: 481  QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
            QGCIAD TMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN KEPR
Sbjct: 1113 QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPR 1172

Query: 541  SHKRGKDRE 550
            SHKRGK  E
Sbjct: 1173 SHKRGKHIE 1181

BLAST of Cmc06g0159821 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 1055.0 bits (2727), Expect = 1.1e-304
Identity = 524/549 (95.45%), Postives = 533/549 (97.09%), Query Frame = 0

Query: 1    MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
            MRNHKPR+KLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMP+RSGR+VSQP
Sbjct: 507  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQP 566

Query: 61   NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
            NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN VWELVDLPE
Sbjct: 567  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPE 626

Query: 121  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
            GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFS VAML+SIRILLSI
Sbjct: 627  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI 686

Query: 181  ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
            ATFYDYEIW+MDVKTA LNGNLEESIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 687  ATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSW 746

Query: 241  NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
            NIRFDTAIK YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGND+GYLTDVKAWLAA
Sbjct: 747  NIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA 806

Query: 301  QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
            QFQMKDLGEAQYVLGIQIIRDRKNK LALSQ TYIDK+LVRYSMQNSKKGLLPFRHGVHL
Sbjct: 807  QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHL 866

Query: 361  SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
            SKEQ  KTPQEVEDMRRIPYASAVGSLMY MLCTRP+ICYAVGIVSRYQSNPGLDHWTAV
Sbjct: 867  SKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAV 926

Query: 421  KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
            KI+LKYLRRTRDYMLVYGAKDLILTGYTD DFQTDKDSRKSTSGSVFTLNGG VVWRSIK
Sbjct: 927  KIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK 986

Query: 481  QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
            QGCIAD TMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN KEPR
Sbjct: 987  QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPR 1046

Query: 541  SHKRGKDRE 550
            SHKRGK  E
Sbjct: 1047 SHKRGKHIE 1055

BLAST of Cmc06g0159821 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 1043.1 bits (2696), Expect = 4.3e-301
Identity = 518/549 (94.35%), Postives = 530/549 (96.54%), Query Frame = 0

Query: 1    MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
            MRNHKPR+KLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMP+RSGR+VSQP
Sbjct: 633  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQP 692

Query: 61   NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
            NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN VWELVDLPE
Sbjct: 693  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPE 752

Query: 121  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
            GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS VAML+SIRILLSI
Sbjct: 753  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRILLSI 812

Query: 181  ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
            A FYDYEIW+MDVKTA LNGNLEESIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 813  AKFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSW 872

Query: 241  NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
            NIRFDTAIK YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGND+GYLTDVKAWLAA
Sbjct: 873  NIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAA 932

Query: 301  QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
            QFQMKDLGE QYVLGIQIIRDRKNK LALSQ TYIDK+LVRYSMQNSKKGLLPFRHGVHL
Sbjct: 933  QFQMKDLGEGQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHL 992

Query: 361  SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
            SKEQ  KTPQEVEDMRRIPYASAVGSLMY MLCTRP+ICYAVGIVSRYQSNPGLDHWTAV
Sbjct: 993  SKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAV 1052

Query: 421  KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
            KIILKYLRRTRDYMLVYGAKDLILTGYT+ DFQTDKDSRKSTS SVFTLNGG VVWRSIK
Sbjct: 1053 KIILKYLRRTRDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIK 1112

Query: 481  QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
            QGCIAD TMEAEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVAN KEPR
Sbjct: 1113 QGCIADSTMEAEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPR 1172

Query: 541  SHKRGKDRE 550
            SHKRGK  E
Sbjct: 1173 SHKRGKHIE 1181

BLAST of Cmc06g0159821 vs. ExPASy TrEMBL
Match: A0A5D3CZY3 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1032G00460 PE=4 SV=1)

HSP 1 Score: 1030.0 bits (2662), Expect = 3.7e-297
Identity = 514/549 (93.62%), Postives = 526/549 (95.81%), Query Frame = 0

Query: 1   MRNHKPRNKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPQRSGRIVSQP 60
           MR+HKP+NKLVL+EA DESTRVVDEVGPSSRV+ETTTSGQSHPSQSLRMP+RSGRIVSQP
Sbjct: 243 MRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQP 302

Query: 61  NRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPE 120
           NRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFNL+WELVDLPE
Sbjct: 303 NRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE 362

Query: 121 GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSI 180
           GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFS VAML+SIRILLSI
Sbjct: 363 GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI 422

Query: 181 ATFYDYEIWKMDVKTALLNGNLEESIFMSQREGFITQGQEQKVCKLNRSIYGLKQASRSW 240
           ATFYDYEIWKMDV TA LNGNLEESIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSW
Sbjct: 423 ATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSW 482

Query: 241 NIRFDTAIKCYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAA 300
           NIRFDTAIK YGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGND+GYLTDVKAWLAA
Sbjct: 483 NIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAA 542

Query: 301 QFQMKDLGEAQYVLGIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHL 360
           QFQMKDLGEAQYVLGIQIIRDRKNK LALSQ TYIDKMLVRYSMQNSKKGLLPFRHGVHL
Sbjct: 543 QFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHL 602

Query: 361 SKEQCSKTPQEVEDMRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAV 420
           SKEQC KTPQEVEDMRRIPYASAVGSLMYV+ CTR EICYAV IVSRYQSN GLDHWTAV
Sbjct: 603 SKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV 662

Query: 421 KIILKYLRRTRDYMLVYGAKDLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIK 480
           KIILKYLRRTRDYMLVYGAKDLILTGYTD DFQT+KDSRKSTS SVFTLNGG +VWRSIK
Sbjct: 663 KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIK 722

Query: 481 QGCIADCTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPR 540
           QGCIAD TMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN KEPR
Sbjct: 723 QGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPR 782

Query: 541 SHKRGKDRE 550
           SHKR K  E
Sbjct: 783 SHKREKHIE 791

BLAST of Cmc06g0159821 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 277.7 bits (709), Expect = 2.1e-74
Identity = 164/476 (34.45%), Postives = 267/476 (56.09%), Query Frame = 0

Query: 79  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNLVWELVDLPEGVKPIGCKWIYKRKRDSA 138
           ++P +Y +A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S 
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 139 GKVQTFKARLVAKGYTQREGVDYEETFSLVAMLRSIRILLSIATFYDYEIWKMDVKTALL 198
           G ++ +KARLVAKGYTQ+EG+D+ ETFS V  L S++++L+I+  Y++ + ++D+  A L
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 199 NGNLEESIFMSQREGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKCYGFD 258
           NG+L+E I+M    G+   QG       VC L +SIYGLKQASR W ++F   +  +GF 
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 259 QNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDMGYLTDVKAWLAAQFQMKDLGEAQYVL 318
           Q+  +   + KI       +++YVDDI++  N+   + ++K+ L + F+++DLG  +Y L
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

Query: 319 GIQIIRDRKNKILALSQVTYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCSKTPQEVED 378
           G++I R      + + Q  Y   +L    +   K   +P    V  S    + +  +  D
Sbjct: 324 GLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFS----AHSGGDFVD 383

Query: 379 MRRIPYASAVGSLMYVMLCTRPEICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYM 438
            +   Y   +G LMY+ + TR +I +AV  +S++   P L H  AV  IL Y++ T    
Sbjct: 384 AK--AYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQG 443

Query: 439 LVYGAK-DLILTGYTDFDFQTDKDSRKSTSGSVFTLNGGVVVWRSIKQGCIADCTMEAEY 498
           L Y ++ ++ L  ++D  FQ+ KD+R+ST+G    L   ++ W+S KQ  ++  + EAEY
Sbjct: 444 LFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEY 503

Query: 499 VAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANCKEPRSHKRGKDRE 550
            A   A  E +WL +F  +L++   ++ P  L+CDN+ A+        H+R K  E
Sbjct: 504 RALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFHERTKHIE 545

BLAST of Cmc06g0159821 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 99.4 bits (246), Expect = 1.0e-20
Identity = 74/234 (31.62%), Postives = 116/234 (49.57%), Query Frame = 0

Query: 273 FLVLYVDDILLIGNDMGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKILALSQV 332
           +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61

Query: 333 TYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCSKTPQEVEDMRRIPYASAVGSLMYVML 392
            Y +++L    M + K    P    ++ S    +K P   +      + S VG+L Y+ L
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPLPLKLN-SSVSTAKYPDPSD------FRSIVGALQYLTL 121

Query: 393 CTRPEICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDFD 452
            TRP+I YAV IV +    P L  +  +K +L+Y++ T  + + ++    L +  + D D
Sbjct: 122 -TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSD 181

Query: 453 FQTDKDSRKSTSGSVFTLNGGVVVWRSIKQGCIADCTMEAEYVAACEAAKEAVW 506
           +     +R+ST+G    L   ++ W + +Q  ++  + E EY A    A E  W
Sbjct: 182 WAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of Cmc06g0159821 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 76.3 bits (186), Expect = 9.2e-14
Identity = 47/133 (35.34%), Postives = 71/133 (53.38%), Query Frame = 0

Query: 49  MPQRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMY 108
           M  RS   +++ N    LT T  +      ++P S   A+ D     W +AM  E++++ 
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTI-----KKEPKSVIFALKD---PGWCQAMQEELDALS 60

Query: 109 FNLVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSLV 168
            N  W LV  P     +GCKW++K K  S G +   KARLVAKG+ Q EG+ + ET+S V
Sbjct: 61  RNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPV 120

Query: 169 AMLRSIRILLSIA 182
               +IR +L++A
Sbjct: 121 VRTATIRTILNVA 125

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0050437.10.0e+00100.00gag/pol protein [Cucumis melo var. makuwa][more]
KAA0025945.12.2e-30495.45gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi... [more]
KAA0059226.12.2e-30495.45gag/pol protein [Cucumis melo var. makuwa][more]
KAA0035907.18.8e-30194.35gag/pol protein [Cucumis melo var. makuwa][more]
KAA0033121.17.7e-29793.62gag/pol protein [Cucumis melo var. makuwa] >TYK17112.1 gag/pol protein [Cucumis ... [more]
Match NameE-valueIdentityDescription
P109784.5e-12243.04Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041462.6e-7734.66Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT942.9e-7335.61Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW21.4e-7035.82Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P256001.1e-3232.48Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
A0A5A7U7T00.0e+00100.00Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold175G0031... [more]
A0A5A7TZD01.1e-30495.45Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000... [more]
A0A5A7UYE81.1e-30495.45Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015... [more]
A0A5A7T2V94.3e-30194.35Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760... [more]
A0A5D3CZY33.7e-29793.62Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1032G004... [more]
Match NameE-valueIdentityDescription
AT4G23160.12.1e-7434.45cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.11.0e-2031.62DNA/RNA polymerases superfamily protein [more]
ATMG00820.19.2e-1435.34Reverse transcriptase (RNA-dependent DNA polymerase) [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 110..353
e-value: 6.1E-70
score: 235.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 30..53
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..53
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 7..21
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 538..557
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 79..442
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 445..546
e-value: 6.43861E-42
score: 145.689
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 110..544

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc06g0159821.1Cmc06g0159821.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding