Homology
BLAST of Cmc01g0005171 vs. NCBI nr
Match:
KAA0037867.1 (putative polyprotein (retrotrasposon protein) [Cucumis melo var. makuwa])
HSP 1 Score: 2975.7 bits (7713), Expect = 0.0e+00
Identity = 1458/1468 (99.32%), Positives = 1462/1468 (99.59%), Query Frame = 0
Query: 1 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 60
MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS
Sbjct: 33 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 92
Query: 61 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 120
NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL
Sbjct: 93 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 152
Query: 121 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 180
EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL
Sbjct: 153 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 212
Query: 181 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAG 240
EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNS AMIVQGAKQVAG
Sbjct: 213 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAG 272
Query: 241 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 300
VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG
Sbjct: 273 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 332
Query: 301 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND 360
KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND
Sbjct: 333 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND 392
Query: 361 NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 420
NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL
Sbjct: 393 NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 452
Query: 421 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 480
VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR
Sbjct: 453 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 512
Query: 481 ASLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDTCIKAKSHRVSY 540
A+LVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLP LFKNNNLTVFNCDTCIKAKSHRVSY
Sbjct: 513 ANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSY 572
Query: 541 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 600
APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK
Sbjct: 573 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 632
Query: 601 NFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 660
FYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR
Sbjct: 633 KFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 692
Query: 661 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 720
HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL
Sbjct: 693 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 752
Query: 721 TPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFME 780
TPKIFGCVAYVHVPKTQRSKLSP AVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTF+E
Sbjct: 753 TPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVE 812
Query: 781 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGNGTD 840
HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTD+SDGNGNGTD
Sbjct: 813 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDESDGNGNGTD 872
Query: 841 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 900
ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT
Sbjct: 873 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 932
Query: 901 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 960
SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN
Sbjct: 933 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 992
Query: 961 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1020
VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA
Sbjct: 993 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1052
Query: 1021 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1080
RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV
Sbjct: 1053 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1112
Query: 1081 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1140
YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR
Sbjct: 1113 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1172
Query: 1141 NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIV 1200
NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFL IEVMRS+QGIV
Sbjct: 1173 NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIV 1232
Query: 1201 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1260
LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR
Sbjct: 1233 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1292
Query: 1261 PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1320
PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG
Sbjct: 1293 PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1352
Query: 1321 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1380
SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG
Sbjct: 1353 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1412
Query: 1381 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1440
FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA
Sbjct: 1413 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1472
Query: 1441 DILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
DILTKAVMSK FTSIIDKLGMQDAYIPT
Sbjct: 1473 DILTKAVMSKMFTSIIDKLGMQDAYIPT 1500
BLAST of Cmc01g0005171 vs. NCBI nr
Match:
TYK09814.1 (putative polyprotein (retrotrasposon protein) [Cucumis melo var. makuwa])
HSP 1 Score: 2970.6 bits (7700), Expect = 0.0e+00
Identity = 1455/1468 (99.11%), Positives = 1460/1468 (99.46%), Query Frame = 0
Query: 1 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 60
MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS
Sbjct: 33 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 92
Query: 61 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 120
NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL
Sbjct: 93 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 152
Query: 121 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 180
EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL
Sbjct: 153 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 212
Query: 181 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAG 240
EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNS AMIVQGAKQVAG
Sbjct: 213 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAG 272
Query: 241 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 300
VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG
Sbjct: 273 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 332
Query: 301 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND 360
KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEG IFLSSNLSND
Sbjct: 333 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGTIFLSSNLSND 392
Query: 361 NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 420
STWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL
Sbjct: 393 YSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 452
Query: 421 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 480
VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR
Sbjct: 453 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 512
Query: 481 ASLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDTCIKAKSHRVSY 540
A+LVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLP LFKNNNLTVFNCDTCIKAKSHRVSY
Sbjct: 513 ANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSY 572
Query: 541 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 600
APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK
Sbjct: 573 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 632
Query: 601 NFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 660
FYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR
Sbjct: 633 KFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 692
Query: 661 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 720
HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL
Sbjct: 693 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 752
Query: 721 TPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFME 780
TPKIFGCVAYVHVPKTQRSKLSP AVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTF+E
Sbjct: 753 TPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVE 812
Query: 781 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGNGTD 840
HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTD+SDGNGNGTD
Sbjct: 813 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDESDGNGNGTD 872
Query: 841 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 900
ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT
Sbjct: 873 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 932
Query: 901 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 960
SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN
Sbjct: 933 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 992
Query: 961 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1020
VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA
Sbjct: 993 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1052
Query: 1021 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1080
RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV
Sbjct: 1053 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1112
Query: 1081 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1140
YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR
Sbjct: 1113 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1172
Query: 1141 NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIV 1200
NQEKLTALIIYVDDMIV GNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRS+QGIV
Sbjct: 1173 NQEKLTALIIYVDDMIVMGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSKQGIV 1232
Query: 1201 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1260
LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR
Sbjct: 1233 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1292
Query: 1261 PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1320
PDIAYGVSVVSQFMHNPSEDHMDAVMRI+RYLKGCPGKGITFKKNGHLDVSGFTDADWAG
Sbjct: 1293 PDIAYGVSVVSQFMHNPSEDHMDAVMRILRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1352
Query: 1321 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1380
SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG
Sbjct: 1353 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1412
Query: 1381 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1440
FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA
Sbjct: 1413 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1472
Query: 1441 DILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
DILTKAVMSK FTSIIDKLGMQDAYIPT
Sbjct: 1473 DILTKAVMSKMFTSIIDKLGMQDAYIPT 1500
BLAST of Cmc01g0005171 vs. NCBI nr
Match:
TYJ98005.1 (gag-pol polyprotein [Cucumis melo var. makuwa])
HSP 1 Score: 2360.1 bits (6115), Expect = 0.0e+00
Identity = 1210/1468 (82.43%), Positives = 1214/1468 (82.70%), Query Frame = 0
Query: 1 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 60
MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS
Sbjct: 33 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 92
Query: 61 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 120
NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL
Sbjct: 93 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 152
Query: 121 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 180
EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL
Sbjct: 153 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 212
Query: 181 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAG 240
EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNS AMIVQGAKQVAG
Sbjct: 213 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAG 272
Query: 241 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 300
VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG
Sbjct: 273 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 332
Query: 301 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND 360
KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEG IFLSSNLSND
Sbjct: 333 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGTIFLSSNLSND 392
Query: 361 NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 420
STWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL
Sbjct: 393 YSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 452
Query: 421 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 480
VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR
Sbjct: 453 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 512
Query: 481 ASLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDTCIKAKSHRVSY 540
A+LVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLP LFKNNNLTVFNCDTCIKAKSHRVSY
Sbjct: 513 ANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSY 572
Query: 541 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 600
APSSNKSNSPFDLIHSDVWGPAP
Sbjct: 573 APSSNKSNSPFDLIHSDVWGPAP------------------------------------- 632
Query: 601 NFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 660
Sbjct: 633 ------------------------------------------------------------ 692
Query: 661 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 720
Sbjct: 693 ------------------------------------------------------------ 752
Query: 721 TPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFME 780
NQKGYKCFDVDSRKWYVTMDVTFME
Sbjct: 753 -----------------------------------NQKGYKCFDVDSRKWYVTMDVTFME 812
Query: 781 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGNGTD 840
HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTD+SDGNGNGTD
Sbjct: 813 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDESDGNGNGTD 872
Query: 841 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 900
ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT
Sbjct: 873 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 932
Query: 901 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 960
SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN
Sbjct: 933 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 992
Query: 961 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1020
VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA
Sbjct: 993 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1052
Query: 1021 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1080
RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV
Sbjct: 1053 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1112
Query: 1081 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1140
YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR
Sbjct: 1113 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1172
Query: 1141 NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIV 1200
NQEKLTALIIYVDDMIV GNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRS+QGIV
Sbjct: 1173 NQEKLTALIIYVDDMIVMGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSKQGIV 1232
Query: 1201 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1260
LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR
Sbjct: 1233 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1251
Query: 1261 PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1320
PDIAYGVSVVSQFMHNPSEDHMDAVMRI+RYLKGCPGK
Sbjct: 1293 PDIAYGVSVVSQFMHNPSEDHMDAVMRILRYLKGCPGK---------------------- 1251
Query: 1321 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1380
GYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG
Sbjct: 1353 ---------GYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1251
Query: 1381 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1440
FAPTQAMDLYCDSRPAIDISHNP MVHVRSGEQLA
Sbjct: 1413 FAPTQAMDLYCDSRPAIDISHNP--------------------------MVHVRSGEQLA 1251
Query: 1441 DILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
DILTKAVMSKTFTSIIDKLGMQDAYIPT
Sbjct: 1473 DILTKAVMSKTFTSIIDKLGMQDAYIPT 1251
BLAST of Cmc01g0005171 vs. NCBI nr
Match:
RVW36328.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])
HSP 1 Score: 1620.5 bits (4195), Expect = 0.0e+00
Identity = 813/1477 (55.04%), Positives = 1048/1477 (70.95%), Query Frame = 0
Query: 21 PMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRWMSDNSMVKGWIL 80
P+ I G NY +WSQ+VE+ ++ KDKLG+ING+ QP D ++RW ++N++VKGW++
Sbjct: 48 PIGIKLEGSNYALWSQVVEMYISGKDKLGYINGDSPQPPETDPSFRRWRTENAIVKGWLI 107
Query: 81 SSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQT-GTLEDYFYT 140
+S+DP+LI NFIRF TAK+VWDS +T+FDG D +Q+YDL +V +KQ G++E Y+
Sbjct: 108 NSMDPSLIANFIRFPTAKQVWDSAAITYFDGTDTSQVYDLRRRVTRMKQAGGSIEKYYND 167
Query: 141 LQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPT 200
LQGLW+EI+FR+PNPM C IDI+KYN + Q+ +VY FL GL+DR D R+++L++ P PT
Sbjct: 168 LQGLWREIDFRRPNPMECAIDIQKYNSILQEDQVYTFLDGLDDRLDKTRSDVLQIKPFPT 227
Query: 201 VEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAGVPHISTNSTHGNSNNSVPG 260
VE+ Y VRRE+VRQ+VM+ D A M +G K S++ +P
Sbjct: 228 VEQAYAFVRREEVRQTVMI-SGADTLPGAVMASKGIK---------------GSHHQMPP 287
Query: 261 DP--LSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGGKGHSKDTCFKIHGYPDW 320
P LSLSS GK +SS F T+ +KCTHCG H++DTCFK+HGYPDW
Sbjct: 288 KPGALSLSS---GKSNSS------FKTK--PPSDGMKCTHCGNTKHTRDTCFKLHGYPDW 347
Query: 321 YKELKKKQAEAKR------------------GKVSIAVSGKNGDDSPSEGIIFLSSNLSN 380
+ +L QA KR +S+ ++ DS + +F S
Sbjct: 348 WNDL---QARKKREIIVNDNHTGRAAVVTCDASLSLIPQAESSHDSGTSSKVFHIST-HK 407
Query: 381 DNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNV 440
D+ WI+DSGA+DHMTF D + +++ + NANG++YPV AG++ ++ L+LSN
Sbjct: 408 DDEDWILDSGATDHMTFDSKDFSNTTQPRRSCVANANGVTYPVTGAGTVTLSPSLSLSNT 467
Query: 441 LVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAG 500
L+VPSL+ RLMSVS++T D NCVV MYS + ++QDILTKEIIGRG +R GLY+++ +G
Sbjct: 468 LLVPSLSNRLMSVSQVTSDLNCVVLMYSTFCLLQDILTKEIIGRGTKRGGLYYVDAFSSG 527
Query: 501 RASLVVDQT-EVQNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDTCIKAKSHRV 560
RA+ + + + +IW WH RLGHPS GY+K LLP LF F CDTCI AKSHR
Sbjct: 528 RANHMHHKVGNKERQIWLWHHRLGHPSFGYLKHLLPGLFSKATHLDFKCDTCILAKSHRA 587
Query: 561 SYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSV 620
SY S NKS PFDLIHSDVWGP+ V++ G++WFV+F+DDCTRMTW+YLLK KDEV S+
Sbjct: 588 SYPMSMNKSMIPFDLIHSDVWGPSLVTTSSGHRWFVIFVDDCTRMTWLYLLKHKDEVFSI 647
Query: 621 FKNFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERK 680
F++F+ M++TQF IK RSDNGGE++ + + +F GI+HETSC TPQQNG+AERK
Sbjct: 648 FQSFHAMVQTQFSARIKILRSDNGGEYVNQQFQTYFNNHGILHETSCSQTPQQNGIAERK 707
Query: 681 NRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVL 740
NRHILE +RALL HVP ++W ++ AVY+LNR+PTK+ FQTPLK L H S+P+VL
Sbjct: 708 NRHILETARALLINAHVPNRYWSDAVTTAVYLLNRMPTKVLQFQTPLKVLSYHVSLPTVL 767
Query: 741 SLTPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTF 800
+ P+IFGCVA+VH+ K QR+KL P AVRC+FLG+G +KGY+C+D +++ Y+TMDVTF
Sbjct: 768 MIPPRIFGCVAFVHLHKNQRTKLDPCAVRCLFLGYGVQKKGYRCYDPIAKRSYITMDVTF 827
Query: 801 MEHEAFFKPTIQSD-QGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGN 860
+E E FF P S QGE E W + ++L V + N
Sbjct: 828 LESEFFFSPISNSPLQGEIYGEERNWSDVEVLE---------------VGDNPTHPNDDN 887
Query: 861 GTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPM----SPEDQ 920
E D +P+ S + VF + PN+ P + SP
Sbjct: 888 DLVEHDPVPEPLRTEAEPVPESSEDAESDVF------PHSLVPNDPPTENIPEVSSPTTP 947
Query: 921 LQVSL--PTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNEN 980
LQ + ++ Y LP R NRG PP RY PD + ++SKYPIAN V T+ LS P++ F
Sbjct: 948 LQTNAIDTSAGYVLPFRHNRGKPPNRYSPDIE-ERRSKYPIANHVSTQRLSEPLRAFAHT 1007
Query: 981 LLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNA 1040
L SC+IP V+EA SDP W+QA++ E+ AL N TW L LP+G+ V C+W+FSIKY A
Sbjct: 1008 LSSCQIPSRVEEAFSDPKWAQAIKEELEALQKNNTWVLSVLPEGRKTVRCKWIFSIKYKA 1067
Query: 1041 NGEIDRYKARLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAF 1100
+G IDRYKARLVAKGYTQ GID+QETFSPVAKL T+RVLLSLAANLDWPLHQ DVKNAF
Sbjct: 1068 DGSIDRYKARLVAKGYTQKHGIDYQETFSPVAKLKTVRVLLSLAANLDWPLHQLDVKNAF 1127
Query: 1101 LHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCD 1160
LHG+L+EE+YM+ PPGY +T+ + C+L +ALYGLKQSPRAWFGR AM+ YGF+Q +
Sbjct: 1128 LHGDLEEEIYMDIPPGYTATSEAKIACRLQRALYGLKQSPRAWFGRLSSAMRKYGFQQSN 1187
Query: 1161 SDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIE 1220
SDHTLFLK K+TALI+YVDDMI+TG+D +EI+ L+ +LS EFEMKNLGGLKYFLGIE
Sbjct: 1188 SDHTLFLKHRLGKITALIVYVDDMIITGDDVEEISKLQDQLSTEFEMKNLGGLKYFLGIE 1247
Query: 1221 VMRSRQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVG 1280
V RSRQGI LSQRKYILDLLAE+G+L+C+PAD P+VQ KLGE+ DQVPA+K+RYQRLVG
Sbjct: 1248 VARSRQGIFLSQRKYILDLLAEVGLLECKPADIPIVQNHKLGEYVDQVPADKQRYQRLVG 1307
Query: 1281 KLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVS 1340
KLIYL+HTRPDIAY VSVVSQFMH PSEDHMDAVMRI+RYLK PGKG+ F KNGHL V+
Sbjct: 1308 KLIYLSHTRPDIAYAVSVVSQFMHWPSEDHMDAVMRILRYLKSSPGKGLMFSKNGHLKVA 1367
Query: 1341 GFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLW 1400
G+TDADWAG+++DR+STAGYFTFVGGNLVTWRSKKQ VVALSSAEAEFRG+ KGICEL+W
Sbjct: 1368 GYTDADWAGNITDRKSTAGYFTFVGGNLVTWRSKKQKVVALSSAEAEFRGMVKGICELIW 1427
Query: 1401 LRRLLGELGFAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMV 1460
L++LL E+G AP+ M+L+CD+ AI ISHNPVQHDRTKHVE+DR+FIK+ LE +IQ+
Sbjct: 1428 LKKLLAEIGVAPSSEMNLFCDNTAAIAISHNPVQHDRTKHVEVDRNFIKQNLEEKIIQLP 1471
Query: 1461 HVRSGEQLADILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
V+S +QLAD+LTKAV ++ F + +DKLG++D Y PT
Sbjct: 1488 FVKSEDQLADVLTKAVSARNFYNSLDKLGIKDIYAPT 1471
BLAST of Cmc01g0005171 vs. NCBI nr
Match:
RVX17869.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])
HSP 1 Score: 1485.7 bits (3845), Expect = 0.0e+00
Identity = 754/1486 (50.74%), Positives = 994/1486 (66.89%), Query Frame = 0
Query: 6 QPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGY 65
QPP T P L + I +G NY +WSQ+VE+ ++ KDKLG+ING+ QP S D +
Sbjct: 278 QPPT-LTTEPSTAL--IGIKLDGTNYALWSQVVEMYISGKDKLGYINGDIPQPPSTDPTF 337
Query: 66 KRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVY 125
++W +DN++VKGW+++S+DP LIGNFIRF TAK VWDSI T+FDG+D +Q+YDL +V
Sbjct: 338 RKWRTDNAIVKGWLINSMDPFLIGNFIRFPTAKLVWDSIATTYFDGSDTSQVYDLRRRVT 397
Query: 126 GLKQT-GTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRF 185
LKQ G+LE + LQGLW+EI+FR+PNPM C +DI YN + Q+ +VY+FL GL+DR
Sbjct: 398 QLKQAGGSLEKCYNDLQGLWREIDFRRPNPMECAVDIHNYNLLLQEDRVYVFLDGLDDRL 457
Query: 186 DGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAGVPHI 245
D IR ++L++ P PTVE+ Y HVRRE +RQSVM+ + D + A + +G K +
Sbjct: 458 DKIRGDVLQLRPFPTVEQAYAHVRREALRQSVMITGNADAVSGAVLATKGLK-------L 517
Query: 246 STNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGGKGHS 305
++ +N +P K +S++G LKC+HCG H+
Sbjct: 518 GSSIQPPTVHNGMP------------KSRTSSEG--------------LKCSHCGNSKHT 577
Query: 306 KDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGII------------F 365
DTCFK+HGY DW+ +L+ K+ K + + P I +
Sbjct: 578 CDTCFKLHGYSDWWNDLRAKKGRDAGTKDEDSATAVVATAEPQLSFIPQMTMPNSGNCGY 637
Query: 366 LSSNLSND--NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKV 425
+ND W++DSGA+DHMTFT D T + ++T NANG+ PV AG++ +
Sbjct: 638 ACYTSTNDGYRGAWLLDSGATDHMTFTAMDFTMTSLPRRTNTANANGVISPVTGAGTVTL 697
Query: 426 TSQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGL 485
+ +L L N L VPSL+ +L+SVS++T D NC+V +Y ++QDILTKEIIGRG +R GL
Sbjct: 698 SPKLQLHNTLFVPSLSHKLLSVSQVTSDLNCIVLIYPTLCLLQDILTKEIIGRGTKRGGL 757
Query: 486 YHLEDLKAGRASLVVDQTEV-QNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDT 545
Y++EDL GRA +V +N++W WHRRLGHPS YMK L P LF F C+T
Sbjct: 758 YYMEDLSVGRAHHTQHTLDVKENELWLWHRRLGHPSFTYMKHLFPDLFSQLKNFDFQCET 817
Query: 546 CIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLL 605
CI AKS+R S+ NK ++PF LIHSDVWG +P+++++G KWFVLF+DDCTRMTW+YLL
Sbjct: 818 CILAKSYRASFPLHLNKKDTPFALIHSDVWGLSPITTVNGFKWFVLFVDDCTRMTWLYLL 877
Query: 606 KSKDEVPSVFKNFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTP 665
K KDEV VFK+F+ M++TQF ++ RSDNGGE++ +++F Q GI+HETSC TP
Sbjct: 878 KHKDEVLGVFKSFHAMVQTQFSAKVQVLRSDNGGEYVNHQFREYFQQHGIIHETSCPQTP 937
Query: 666 QQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLE 725
QQNG+ ERKNRH+LE +RALL H PT+FW ++ AV++LNR+ +K+ +FQTPL+ L
Sbjct: 938 QQNGIVERKNRHVLETARALLVGAHAPTRFWADAVTTAVHLLNRMLSKVLDFQTPLQALS 997
Query: 726 KHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRK 785
+ ++P++L L P +FGCVAYVH+ K Q++KL P A C+FLG+ +QKGY+C+D+ S +
Sbjct: 998 GYTAVPAILMLPPHVFGCVAYVHLHKNQQTKLDPCARCCLFLGYAFHQKGYRCYDLTSGR 1057
Query: 786 WYVTMDVTFMEHEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGT 845
Y+TMDVTFME E FF P QGE E W + W E +
Sbjct: 1058 MYITMDVTFMETETFFPPN-SPLQGETRQEEQNWTELN------WPSVSEIH-------- 1117
Query: 846 DKSDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMS 905
+ +P + ++ + V P+ P+ P +
Sbjct: 1118 --------------------VEPRQPEHVSLATEHHE----DDHEAHVTSPSTIPENP-T 1177
Query: 906 PEDQLQVS-------LPTSKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLS 965
PE+ +VS P Y LP R NRG P RY PD + ++S+YPIAN+V TK L+
Sbjct: 1178 PENDPEVSSFNTNILAPPIGYVLPNRHNRGKTPSRYSPDIE-GRRSRYPIANYVPTKKLN 1237
Query: 966 GPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCR 1025
P+K F N+ C +P V+EA DP W+QA++ EM L NKTW LV L +GK VGC+
Sbjct: 1238 EPLKTFVHNISGCHVPTRVEEALGDPKWTQAIKDEMETLMKNKTWNLVPLSEGKKTVGCK 1297
Query: 1026 WVFSIKYNANGEIDRYKARLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPL 1085
WVFSIK+ A+G I+RYKARLVAKGYTQT GID+Q+TFSPVAKLNT+RVL+SLAANL+WPL
Sbjct: 1298 WVFSIKHKADGSIERYKARLVAKGYTQTYGIDYQDTFSPVAKLNTVRVLISLAANLNWPL 1357
Query: 1086 HQFDVKNAFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAM 1145
HQFDVKNAFLHG L+EEVYM+ PPGY T
Sbjct: 1358 HQFDVKNAFLHGGLEEEVYMDIPPGYSVTT------------------------------ 1417
Query: 1146 QSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLG 1205
G + ++DHTLFLK+ Q K+TALI+YVDDM++TG+D +EI+ L+ +L+ EFEMKNLG
Sbjct: 1418 ---GTNESNADHTLFLKKQQGKVTALIVYVDDMVITGDDIEEISRLQGQLASEFEMKNLG 1477
Query: 1206 GLKYFLGIEVMRSRQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPAN 1265
GLKYFLGI+V RS QGI LSQRKY+LDLL+E+G+L+C+P DTP+VQ KLG +P+Q P +
Sbjct: 1478 GLKYFLGIKVARSTQGIFLSQRKYVLDLLSEVGLLECKPVDTPIVQNHKLGIYPNQKPID 1537
Query: 1266 KERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITF 1325
K RYQRLV KLIYL+HTRPDIAY VSVVSQFMH PSE+HM+AV+RI+RYLK PGKG+ F
Sbjct: 1538 KGRYQRLVSKLIYLSHTRPDIAYAVSVVSQFMHCPSEEHMEAVIRILRYLKSSPGKGLMF 1597
Query: 1326 KKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGV 1385
KN H+ V G+TDADWAG++SDR+ST+GYFTFVGGNLVTWRSKKQ VVALSSAEAEFRG+
Sbjct: 1598 SKNDHVRVDGYTDADWAGNISDRKSTSGYFTFVGGNLVTWRSKKQKVVALSSAEAEFRGM 1653
Query: 1386 AKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEK 1445
AKG+CELLWL+RLL E+GFAP M+L+CD++ IDISHNPVQHDRTKHVE+DRHFIK
Sbjct: 1658 AKGLCELLWLKRLLTEIGFAPKSEMNLFCDNKATIDISHNPVQHDRTKHVEVDRHFIKYN 1653
Query: 1446 LESNVIQMVHVRSGEQLADILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
LE+N I+ V+S +QLADILTK V SK F + + KL M+D+Y T
Sbjct: 1718 LETNTIRFPFVKSKDQLADILTKVVSSKDFHNSLIKLRMKDSYAST 1653
BLAST of Cmc01g0005171 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 694.5 bits (1791), Expect = 2.6e-198
Identity = 466/1502 (31.03%), Positives = 753/1502 (50.13%), Query Frame = 0
Query: 30 NYGVWSQMVEVLLASKDKLGHINGERTQPASN---------DTGYKRWMSDNSMVKGWIL 89
NY +WS+ V L + G ++G T P + + Y RW + ++ +L
Sbjct: 30 NYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIYSAVL 89
Query: 90 SSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKV-YGLKQTGTLEDYFYT 149
++ ++ R +TA ++W++++ + + + + L ++ K T T++DY
Sbjct: 90 GAISMSVQPAVSRATTAAQIWETLRKIYANPSYG-HVTQLRTQLKQWTKGTKTIDDY--- 149
Query: 150 LQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPT 209
+QGL F + + P+D D +V L L + + + +I PT
Sbjct: 150 MQGL--VTRFDQLALLGKPMD--------HDEQVERVLENLPEEYKPVIDQIAAKDTPPT 209
Query: 210 VEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAGVPHISTNSTHGNSNNSVPG 269
+ E + + ++ +S L+ S+A ++ V H +T +T+ N+N
Sbjct: 210 LTE---------IHERLLNHESKILAVSSATVIPITAN--AVSHRNTTTTNNNNNG---- 269
Query: 270 DPLSLSSQVQGKESSSN-----QGAVHFTTRYNKTEAVL-KCTHCGGKGHSKDTCFKIHG 329
+ +++ + +++N Q + +F N+++ L KC CG +GHS C ++
Sbjct: 270 ---NRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQH 329
Query: 330 YPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPS-------EGIIFLSSNLSNDNSTWIV 389
+ +S N PS + L S S++N W++
Sbjct: 330 F----------------------LSSVNSQQPPSPFTPWQPRANLALGSPYSSNN--WLL 389
Query: 390 DSGASDHMTFTKTDLTA-ECVTKKTEILNANGISYPVKCAGSIKVTSQ---LNLSNVLVV 449
DSGA+ H+T +L+ + T +++ A+G + P+ GS ++++ LNL N+L V
Sbjct: 390 DSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYV 449
Query: 450 PSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRAS 509
P++ L+SV +L V+ + F ++D+ T + +G +D LY + S
Sbjct: 450 PNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVS 509
Query: 510 LVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFN-------CDTCIKAKS 569
L + +WH RLGHP+ +L + N +L+V N C C+ KS
Sbjct: 510 LFASPSSKATHS-SWHARLGHPA----PSILNSVISNYSLSVLNPSHKFLSCSDCLINKS 569
Query: 570 HRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEV 629
++V ++ S+ S P + I+SDVW +P+ S D +++V+F+D TR TW+Y LK K +V
Sbjct: 570 NKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQV 629
Query: 630 PSVFKNFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVA 689
F F +++ +F I F SDNGGEF+ L ++F Q GI H TS TP+ NG++
Sbjct: 630 KETFITFKNLLENRFQTRIGTFYSDNGGEFV--ALWEYFSQHGISHLTSPPHTPEHNGLS 689
Query: 690 ERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIP 749
ERK+RHI+E LL +P +W + +AVY++NRLPT + ++P + L + P
Sbjct: 690 ERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKL--FGTSP 749
Query: 750 SVLSLTPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMD 809
+ L ++FGC Y + + KL + +CVFLG+ Q Y C + + + Y++
Sbjct: 750 NYDKL--RVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRH 809
Query: 810 VTFMEH---EAFFKPTIQSDQGENSSESDVWKNKDLLSFQG---WTPGCEQ--------- 869
V F E+ + + T+ Q + S VW L + P C
Sbjct: 810 VRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPS 869
Query: 870 -----YGQDGVSGTDKSDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENE 929
+ VS ++ + S +G +P ++ + + S+N
Sbjct: 870 SPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQT---HSSQNT 929
Query: 930 EVIQP-NEEPQ---GPMSPEDQLQVSLPTSKYTLPVRCNRGIPPK--RYEPD---EDRHK 989
P NE P +S Q S P+ + PP + P + +
Sbjct: 930 SQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNN 989
Query: 990 KSKYPI-ANFVDTKSLSGPVKRFNENLLSCKI-----PENVDEAKSDPNWSQAMEAEMSA 1049
++ P+ + + T++ +G +K + L+ + P +A D W AM +E++A
Sbjct: 990 NNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQALKDERWRNAMGSEINA 1049
Query: 1050 LYNNKTWTLVELPQGKIP-VGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQGIDFQETF 1109
N TW LV P + VGCRW+F+ KYN++G ++RYKARLVAKGY Q G+D+ ETF
Sbjct: 1050 QIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETF 1109
Query: 1110 SPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEVYMEQPPGYKSTNAKPMVCK 1169
SPV K +IR++L +A + WP+ Q DV NAFL G L ++VYM QPPG+ + VCK
Sbjct: 1110 SPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCK 1169
Query: 1170 LNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTG 1229
L KALYGLKQ+PRAW+ + + GF SD +LF+ + + + +++YVDD+++TG
Sbjct: 1170 LRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITG 1229
Query: 1230 NDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIVLSQRKYILDLLAEIGMLDC 1289
ND + LS F +K+ L YFLGIE R G+ LSQR+YILDLLA M+
Sbjct: 1230 NDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQRRYILDLLARTNMITA 1289
Query: 1290 RPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSE 1349
+P TP+ KL + + Y+ +VG L YLA TRPDI+Y V+ +SQFMH P+E
Sbjct: 1290 KPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTE 1349
Query: 1350 DHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNL 1409
+H+ A+ RI+RYL G P GI KK L + ++DADWAG D ST GY ++G +
Sbjct: 1350 EHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHP 1409
Query: 1410 VTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDI 1462
++W SKKQ V SS EAE+R VA E+ W+ LL ELG T+ +YCD+ A +
Sbjct: 1410 ISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYCDNVGATYL 1460
BLAST of Cmc01g0005171 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 679.9 bits (1753), Expect = 6.6e-194
Identity = 458/1489 (30.76%), Positives = 728/1489 (48.89%), Query Frame = 0
Query: 30 NYGVWSQMVEVLLASKDKLGHINGERTQPASN---------DTGYKRWMSDNSMVKGWIL 89
NY +WS+ V L + G ++G P + + Y RW + ++ IL
Sbjct: 30 NYLMWSRQVHALFDGYELAGFLDGSTPMPPATIGTDAVPRVNPDYTRWRRQDKLIYSAIL 89
Query: 90 SSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQTGTLEDYFYTL 149
++ ++ R +TA ++W++++ K+Y G + +
Sbjct: 90 GAISMSVQPAVSRATTAAQIWETLR-----------------KIYANPSYGHVTQLRFIT 149
Query: 150 QGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPTV 209
+ F + + P+D D +V L L D + + +I P++
Sbjct: 150 R-------FDQLALLGKPMD--------HDEQVERVLENLPDDYKPVIDQIAAKDTPPSL 209
Query: 210 EETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAGVPHISTNSTHGNSNNSVPGD 269
E + + ++ +S L+ ++A +V V V H +TN T+ N NN GD
Sbjct: 210 TE---------IHERLINRESKLLALNSAEVVPITANV--VTHRNTN-TNRNQNNR--GD 269
Query: 270 PLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGGKGHSKDTCFKIHGYPDWYKE 329
+ ++ S + + + +C C +GHS C ++H + +
Sbjct: 270 NRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQ 329
Query: 330 LKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSNDNSTWIVDSGASDHMTFTKTD 389
+ +P + L+ N + + W++DSGA+ H+T +
Sbjct: 330 QQ-----------------STSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNN 389
Query: 390 LT-AECVTKKTEILNANGISYPVKCAGSIKV---TSQLNLSNVLVVPSLATRLMSVSKLT 449
L+ + T +++ A+G + P+ GS + + L+L+ VL VP++ L+SV +L
Sbjct: 390 LSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLC 449
Query: 450 KDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRASLVVDQTEVQNKIWT 509
V+ + F ++D+ T + +G +D LY + S+ +
Sbjct: 450 NTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFASPCSKATHS-S 509
Query: 510 WHRRLGHPSLGYMKKLLPHLFKNNNLTVFN-------CDTCIKAKSHRVSYAPSSNKSNS 569
WH RLGHPSL +L + N++L V N C C KSH+V ++ S+ S+
Sbjct: 510 WHSRLGHPSLA----ILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSK 569
Query: 570 PFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKNFYTMIKTQ 629
P + I+SDVW +P+ SID +++V+F+D TR TW+Y LK K +V F F ++++ +
Sbjct: 570 PLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENR 629
Query: 630 FGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSRAL 689
F I SDNGGEF+ VL+D+ Q GI H TS TP+ NG++ERK+RHI+EM L
Sbjct: 630 FQTRIGTLYSDNGGEFV--VLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTL 689
Query: 690 LFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGCVA 749
L VP +W + +AVY++NRLPT + Q+P + L P+ L K+FGC
Sbjct: 690 LSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKL--FGQPPNYEKL--KVFGCAC 749
Query: 750 YVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFMEHEAFFKPT- 809
Y + R KL + +C F+G+ Q Y C + + + Y + V F E F T
Sbjct: 750 YPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTN 809
Query: 810 --IQSDQGENSSESDVWKNKDLLSFQGW---TPGCEQYGQD---------------GVSG 869
+ + Q + S + W + L P C D VS
Sbjct: 810 FGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSS 869
Query: 870 TD---KSDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENE-EVIQPNEEP 929
++ S + + ++ + NG +P + + N N + + N PN+
Sbjct: 870 SNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNS 929
Query: 930 QGPMSPEDQLQVSLPTSKYTLPVRCNRG------IPPKRYEPDEDRHKKSKYPIANFVDT 989
P SP + P++ + P + +PP P + + + T
Sbjct: 930 PLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMAT 989
Query: 990 KSLSG---PVKRFN--ENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELP 1049
++ G P ++++ +L + P +A D W QAM +E++A N TW LV P
Sbjct: 990 RAKDGIRKPNQKYSYATSLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPP 1049
Query: 1050 QGKIP-VGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQGIDFQETFSPVAKLNTIRVLL 1109
+ VGCRW+F+ K+N++G ++RYKARLVAKGY Q G+D+ ETFSPV K +IR++L
Sbjct: 1050 PPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVL 1109
Query: 1110 SLAANLDWPLHQFDVKNAFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPR 1169
+A + WP+ Q DV NAFL G L +EVYM QPPG+ + VC+L KA+YGLKQ+PR
Sbjct: 1110 GVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPR 1169
Query: 1170 AWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKL 1229
AW+ + + GF SD +LF+ + + +++YVDD+++TGNDT + L
Sbjct: 1170 AWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDAL 1229
Query: 1230 SGEFEMKNLGGLKYFLGIEVMRSRQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKL 1289
S F +K L YFLGIE R QG+ LSQR+Y LDLLA ML +P TP+ KL
Sbjct: 1230 SQRFSVKEHEDLHYFLGIEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKL 1289
Query: 1290 GEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYL 1349
+ Y+ +VG L YLA TRPD++Y V+ +SQ+MH P++DH +A+ R++RYL
Sbjct: 1290 TLHSGTKLPDPTEYRGIVGSLQYLAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYL 1349
Query: 1350 KGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVAL 1409
G P GI KK L + ++DADWAG D ST GY ++G + ++W SKKQ V
Sbjct: 1350 AGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQKGVVR 1409
Query: 1410 SSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNPVQHDRTKHV 1462
SS EAE+R VA EL W+ LL ELG + +YCD+ A + NPV H R KH+
Sbjct: 1410 SSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPVIYCDNVGATYLCANPVFHSRMKHI 1443
BLAST of Cmc01g0005171 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 636.0 bits (1639), Expect = 1.1e-180
Identity = 426/1228 (34.69%), Positives = 639/1228 (52.04%), Query Frame = 0
Query: 271 KESSSNQG--AVHFTTRYNKTEAVLKCTHCGGKGHSKDTCFKIHGYPDWYKELKKKQAEA 330
+ SS+N G ++ V C +C GH K C P+ K K +
Sbjct: 206 QRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDC------PNPRK--GKGETSG 265
Query: 331 KRGKVSIAVSGKNGDDSPSEGIIFLSS-----NLSNDNSTWIVDSGASDHMTFTKTDLTA 390
++ + A +N D+ ++F++ +LS S W+VD+ AS H T + DL
Sbjct: 266 QKNDDNTAAMVQNNDNV----VLFINEEEECMHLSGPESEWVVDTAASHHATPVR-DLFC 325
Query: 391 ECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLV------VPSLATRLMSVSKLTK 450
V + SY K AG + + N+ LV VP L L+S L +
Sbjct: 326 RYVAGDFGTVKMGNTSYS-KIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDR 385
Query: 451 DRNCVVKMYSDYFIIQD-ILTKE--IIGRGIERDGLYHLE-DLKAGRASLVVDQTEVQNK 510
D Y YF Q LTK +I +G+ R LY ++ G + D+ V
Sbjct: 386 DG------YESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNAAQDEISVD-- 445
Query: 511 IWTWHRRLGHPSLGYMKKLLPHLFKNN------NLTVFNCDTCIKAKSHRVSYAPSSNKS 570
WH+R+GH S +K L L K + TV CD C+ K HRVS+ SS +
Sbjct: 446 --LWHKRMGHMS----EKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERK 505
Query: 571 NSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKNFYTMIK 630
+ DL++SDV GP + S+ GNK+FV FIDD +R WVY+LK+KD+V VF+ F+ +++
Sbjct: 506 LNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVE 565
Query: 631 TQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHILEMSR 690
+ G+ +K RSDNGGE+ + +++ GI HE + GTPQ NGVAER NR I+E R
Sbjct: 566 RETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVR 625
Query: 691 ALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTPKIFGC 750
++L +P FW +++ A Y++NR P+ F+ P + S L K+FGC
Sbjct: 626 SMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHL----KVFGC 685
Query: 751 VAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFMEHEAFFKP 810
A+ HVPK QR+KL ++ C+F+G+G + GY+ +D +K + DV F E E
Sbjct: 686 RAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEV---- 745
Query: 811 TIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGNGTDESDGDGN 870
D S+ N
Sbjct: 746 ------------------------------------------------RTAADMSEKVKN 805
Query: 871 GATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPV 930
G + S N A ES +EV + E+P + +QL + ++ P
Sbjct: 806 GIIPNFVTIPSTSNNPTSA----ESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEH--PT 865
Query: 931 RCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKSD 990
+ P R + R + +YP +V + + PE++ E S
Sbjct: 866 QGEEQHQPLR-RSERPRVESRRYPSTEYV--------------LISDDREPESLKEVLSH 925
Query: 991 PNWSQ---AMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVA 1050
P +Q AM+ EM +L N T+ LVELP+GK P+ C+WVF +K + + ++ RYKARLV
Sbjct: 926 PEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVV 985
Query: 1051 KGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEVYMEQ 1110
KG+ Q +GIDF E FSPV K+ +IR +LSLAA+LD + Q DVK AFLHG+L+EE+YMEQ
Sbjct: 986 KGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQ 1045
Query: 1111 PPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR-NQE 1170
P G++ K MVCKLNK+LYGLKQ+PR W+ +F M+S + + SD ++ KR ++
Sbjct: 1046 PEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSEN 1105
Query: 1171 KLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSR--QGIVL 1230
L++YVDDM++ G D IA L+ LS F+MK+LG + LG++++R R + + L
Sbjct: 1106 NFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWL 1165
Query: 1231 SQRKYILDLLAEIGMLDCRPADTPVVQGVKLGE--FPDQVPAN----KERYQRLVGKLIY 1290
SQ KYI +L M + +P TP+ +KL + P V K Y VG L+Y
Sbjct: 1166 SQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMY 1225
Query: 1291 -LAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFT 1350
+ TRPDIA+ V VVS+F+ NP ++H +AV I+RYL+G G + F + + + G+T
Sbjct: 1226 AMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPI-LKGYT 1285
Query: 1351 DADWAGSVSDRRSTAGY-FTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLR 1410
DAD AG + +R+S+ GY FTF GG ++W+SK Q VALS+ EAE+ + E++WL+
Sbjct: 1286 DADMAGDIDNRKSSTGYLFTFSGG-AISWQSKLQKCVALSTTEAEYIAATETGKEMIWLK 1325
Query: 1411 RLLGELGFAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHV 1462
R L ELG + + +YCDS+ AID+S N + H RTKH+++ H+I+E ++ ++++ +
Sbjct: 1346 RFLQELGLHQKEYV-VYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKI 1325
BLAST of Cmc01g0005171 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 508.1 bits (1307), Expect = 3.4e-142
Identity = 352/1210 (29.09%), Positives = 603/1210 (49.83%), Query Frame = 0
Query: 293 LKCTHCGGKGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIF 352
+KC HCG +GH K CF YK + + + +V A S GI F
Sbjct: 230 VKCHHCGREGHIKKDCFH-------YKRILNNKNKENEKQVQTAT---------SHGIAF 289
Query: 353 LSSNLSN----DNSTWIVDSGASDHMTFTK---TDLTAECVTKKTEILNANGISYPVKCA 412
+ ++N DN +++DSGASDH+ + TD K + Y K
Sbjct: 290 MVKEVNNTSVMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATK-R 349
Query: 413 GSIKVTS--QLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGR 472
G +++ + ++ L +VL A LMSV +L + + S I ++ L
Sbjct: 350 GIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMV----- 409
Query: 473 GIERDGLYHLEDLKAGRASLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLL-------PHL 532
++ G+ L ++ + +N WH R GH S G + ++ L
Sbjct: 410 -VKNSGM--LNNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSL 469
Query: 533 FKNNNLTVFNCDTCIKAKSHRVSYAPSSNKSN--SPFDLIHSDVWGPAPVSSIDGNKWFV 592
N L+ C+ C+ K R+ + +K++ P ++HSDV GP ++D +FV
Sbjct: 470 LNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFV 529
Query: 593 LFIDDCTRMTWVYLLKSKDEVPSVFKNFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFF 652
+F+D T YL+K K +V S+F++F + F + + DNG E++ ++ F
Sbjct: 530 IFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFC 589
Query: 653 VQMGIVHETSCVGTPQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRL 712
V+ GI + + TPQ NGV+ER R I E +R ++ + FW +++L A Y++NR+
Sbjct: 590 VKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRI 649
Query: 713 PTK--INNFQTPLKTLEKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLG 772
P++ +++ +TP + H+ P + L ++FG YVH+ K ++ K + + +F+G
Sbjct: 650 PSRALVDSSKTPYEMW--HNKKPYLKHL--RVFGATVYVHI-KNKQGKFDDKSFKSIFVG 709
Query: 773 FGQNQKGYKCFDVDSRKWYVTMDVTFME------HEAFFKPTIQSDQGENSSESDVWKNK 832
+ N G+K +D + K+ V DV E F+ D E+ +++ ++
Sbjct: 710 YEPN--GFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSR 769
Query: 833 DLL--SFQGWTPGCE--QYGQDGVSGTDKSDGNGNGTDESDGDGNGATDGNKPLEMYSRN 892
++ F + C+ Q+ +D +K+ N + N + + + +
Sbjct: 770 KIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSK 829
Query: 893 KNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPTSKYTLPVRCNRGIPPKRYEPDE 952
++ F ES+ + E +G +P + + T+++ + + E
Sbjct: 830 ESNKYFLNESKKRKRDDHLNESKGSGNPNESRESE--TAEHLKEIGIDNPTKNDGIEIIN 889
Query: 953 DRHK--KSKYPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAK---SDPNWSQAMEAE 1012
R + K+K I+ + SL+ V N + + +P + DE + +W +A+ E
Sbjct: 890 RRSERLKTKPQISYNEEDNSLNKVV--LNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTE 949
Query: 1013 MSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQGIDFQE 1072
++A N TWT+ + P+ K V RWVFS+KYN G RYKARLVA+G+TQ ID++E
Sbjct: 950 LNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEE 1009
Query: 1073 TFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEVYMEQPPGYKSTNAKPMV 1132
TF+PVA++++ R +LSL + +HQ DVK AFL+G LKEE+YM P G + V
Sbjct: 1010 TFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDN--V 1069
Query: 1133 CKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFL--KRNQEKLTALIIYVDDM 1192
CKLNKA+YGLKQ+ R WF F +A++ F D +++ K N + +++YVDD+
Sbjct: 1070 CKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDV 1129
Query: 1193 IVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIVLSQRKYILDLLAEIG 1252
++ D + ++ L +F M +L +K+F+GI + I LSQ Y+ +L++
Sbjct: 1130 VIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFN 1189
Query: 1253 MLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIY-LAHTRPDIAYGVSVVSQFM 1312
M +C TP+ + N + L+G L+Y + TRPD+ V+++S++
Sbjct: 1190 MENCNAVSTPLPSKINYELLNSDEDCNTP-CRSLIGCLMYIMLCTRPDLTTAVNILSRYS 1249
Query: 1313 HNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLD--VSGFTDADWAGSVSDRRSTAGY- 1372
+ + + R++RYLKG + FKKN + + G+ D+DWAGS DR+ST GY
Sbjct: 1250 SKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYL 1309
Query: 1373 FTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYC 1432
F NL+ W +K+Q+ VA SS EAE+ + + + E LWL+ LL + + +Y
Sbjct: 1310 FKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYE 1369
Query: 1433 DSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLADILTKAVMSKT 1462
D++ I I++NP H R KH++I HF +E++++NVI + ++ + QLADI TK + +
Sbjct: 1370 DNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAAR 1400
BLAST of Cmc01g0005171 vs. ExPASy Swiss-Prot
Match:
P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)
HSP 1 Score: 188.7 bits (478), Expect = 4.6e-46
Identity = 96/228 (42.11%), Positives = 140/228 (61.40%), Query Frame = 0
Query: 1148 LIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIVLSQRKYI 1207
L++YVDD+++TG+ + L +LS F MK+LG + YFLGI++ G+ LSQ KY
Sbjct: 3 LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62
Query: 1208 LDLLAEIGMLDCRPADTPVV----QGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDI 1267
+L GMLDC+P TP+ V ++PD P++ ++ +VG L YL TRPDI
Sbjct: 63 EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPD--PSD---FRSIVGALQYLTLTRPDI 122
Query: 1268 AYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVS 1327
+Y V++V Q MH P+ D + R++RY+KG G+ KN L+V F D+DWAG S
Sbjct: 123 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 182
Query: 1328 DRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLW 1372
RRST G+ TF+G N+++W +K+Q V+ SS E E+R +A EL W
Sbjct: 183 TRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of Cmc01g0005171 vs. ExPASy TrEMBL
Match:
A0A5A7T8G9 (Putative polyprotein (Retrotrasposon protein) OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold36G00310 PE=4 SV=1)
HSP 1 Score: 2975.7 bits (7713), Expect = 0.0e+00
Identity = 1458/1468 (99.32%), Positives = 1462/1468 (99.59%), Query Frame = 0
Query: 1 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 60
MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS
Sbjct: 33 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 92
Query: 61 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 120
NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL
Sbjct: 93 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 152
Query: 121 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 180
EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL
Sbjct: 153 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 212
Query: 181 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAG 240
EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNS AMIVQGAKQVAG
Sbjct: 213 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAG 272
Query: 241 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 300
VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG
Sbjct: 273 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 332
Query: 301 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND 360
KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND
Sbjct: 333 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND 392
Query: 361 NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 420
NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL
Sbjct: 393 NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 452
Query: 421 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 480
VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR
Sbjct: 453 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 512
Query: 481 ASLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDTCIKAKSHRVSY 540
A+LVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLP LFKNNNLTVFNCDTCIKAKSHRVSY
Sbjct: 513 ANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSY 572
Query: 541 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 600
APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK
Sbjct: 573 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 632
Query: 601 NFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 660
FYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR
Sbjct: 633 KFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 692
Query: 661 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 720
HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL
Sbjct: 693 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 752
Query: 721 TPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFME 780
TPKIFGCVAYVHVPKTQRSKLSP AVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTF+E
Sbjct: 753 TPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVE 812
Query: 781 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGNGTD 840
HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTD+SDGNGNGTD
Sbjct: 813 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDESDGNGNGTD 872
Query: 841 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 900
ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT
Sbjct: 873 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 932
Query: 901 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 960
SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN
Sbjct: 933 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 992
Query: 961 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1020
VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA
Sbjct: 993 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1052
Query: 1021 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1080
RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV
Sbjct: 1053 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1112
Query: 1081 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1140
YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR
Sbjct: 1113 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1172
Query: 1141 NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIV 1200
NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFL IEVMRS+QGIV
Sbjct: 1173 NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLRIEVMRSKQGIV 1232
Query: 1201 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1260
LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR
Sbjct: 1233 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1292
Query: 1261 PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1320
PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG
Sbjct: 1293 PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1352
Query: 1321 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1380
SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG
Sbjct: 1353 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1412
Query: 1381 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1440
FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA
Sbjct: 1413 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1472
Query: 1441 DILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
DILTKAVMSK FTSIIDKLGMQDAYIPT
Sbjct: 1473 DILTKAVMSKMFTSIIDKLGMQDAYIPT 1500
BLAST of Cmc01g0005171 vs. ExPASy TrEMBL
Match:
A0A5D3CF38 (Putative polyprotein (Retrotrasposon protein) OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold127G001130 PE=4 SV=1)
HSP 1 Score: 2970.6 bits (7700), Expect = 0.0e+00
Identity = 1455/1468 (99.11%), Positives = 1460/1468 (99.46%), Query Frame = 0
Query: 1 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 60
MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS
Sbjct: 33 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 92
Query: 61 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 120
NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL
Sbjct: 93 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 152
Query: 121 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 180
EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL
Sbjct: 153 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 212
Query: 181 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAG 240
EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNS AMIVQGAKQVAG
Sbjct: 213 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAG 272
Query: 241 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 300
VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG
Sbjct: 273 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 332
Query: 301 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND 360
KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEG IFLSSNLSND
Sbjct: 333 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGTIFLSSNLSND 392
Query: 361 NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 420
STWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL
Sbjct: 393 YSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 452
Query: 421 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 480
VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR
Sbjct: 453 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 512
Query: 481 ASLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDTCIKAKSHRVSY 540
A+LVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLP LFKNNNLTVFNCDTCIKAKSHRVSY
Sbjct: 513 ANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSY 572
Query: 541 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 600
APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK
Sbjct: 573 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 632
Query: 601 NFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 660
FYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR
Sbjct: 633 KFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 692
Query: 661 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 720
HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL
Sbjct: 693 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 752
Query: 721 TPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFME 780
TPKIFGCVAYVHVPKTQRSKLSP AVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTF+E
Sbjct: 753 TPKIFGCVAYVHVPKTQRSKLSPCAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFVE 812
Query: 781 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGNGTD 840
HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTD+SDGNGNGTD
Sbjct: 813 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDESDGNGNGTD 872
Query: 841 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 900
ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT
Sbjct: 873 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 932
Query: 901 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 960
SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN
Sbjct: 933 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 992
Query: 961 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1020
VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA
Sbjct: 993 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1052
Query: 1021 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1080
RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV
Sbjct: 1053 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1112
Query: 1081 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1140
YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR
Sbjct: 1113 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1172
Query: 1141 NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIV 1200
NQEKLTALIIYVDDMIV GNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRS+QGIV
Sbjct: 1173 NQEKLTALIIYVDDMIVMGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSKQGIV 1232
Query: 1201 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1260
LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR
Sbjct: 1233 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1292
Query: 1261 PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1320
PDIAYGVSVVSQFMHNPSEDHMDAVMRI+RYLKGCPGKGITFKKNGHLDVSGFTDADWAG
Sbjct: 1293 PDIAYGVSVVSQFMHNPSEDHMDAVMRILRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1352
Query: 1321 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1380
SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG
Sbjct: 1353 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1412
Query: 1381 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1440
FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA
Sbjct: 1413 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1472
Query: 1441 DILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
DILTKAVMSK FTSIIDKLGMQDAYIPT
Sbjct: 1473 DILTKAVMSKMFTSIIDKLGMQDAYIPT 1500
BLAST of Cmc01g0005171 vs. ExPASy TrEMBL
Match:
A0A5D3BHP1 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold487G00280 PE=4 SV=1)
HSP 1 Score: 2360.1 bits (6115), Expect = 0.0e+00
Identity = 1210/1468 (82.43%), Positives = 1214/1468 (82.70%), Query Frame = 0
Query: 1 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 60
MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS
Sbjct: 33 MFRNFQPPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPAS 92
Query: 61 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 120
NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL
Sbjct: 93 NDTGYKRWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDL 152
Query: 121 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 180
EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL
Sbjct: 153 EVKVYGLKQTGTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGL 212
Query: 181 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAG 240
EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNS AMIVQGAKQVAG
Sbjct: 213 EDRFDGIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSVAMIVQGAKQVAG 272
Query: 241 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 300
VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG
Sbjct: 273 VPHISTNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGG 332
Query: 301 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGIIFLSSNLSND 360
KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEG IFLSSNLSND
Sbjct: 333 KGHSKDTCFKIHGYPDWYKELKKKQAEAKRGKVSIAVSGKNGDDSPSEGTIFLSSNLSND 392
Query: 361 NSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 420
STWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL
Sbjct: 393 YSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVL 452
Query: 421 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 480
VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR
Sbjct: 453 VVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGR 512
Query: 481 ASLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDTCIKAKSHRVSY 540
A+LVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLP LFKNNNLTVFNCDTCIKAKSHRVSY
Sbjct: 513 ANLVVDQTEVQNKIWTWHRRLGHPSLGYMKKLLPRLFKNNNLTVFNCDTCIKAKSHRVSY 572
Query: 541 APSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFK 600
APSSNKSNSPFDLIHSDVWGPAP
Sbjct: 573 APSSNKSNSPFDLIHSDVWGPAP------------------------------------- 632
Query: 601 NFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNR 660
Sbjct: 633 ------------------------------------------------------------ 692
Query: 661 HILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSL 720
Sbjct: 693 ------------------------------------------------------------ 752
Query: 721 TPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFME 780
NQKGYKCFDVDSRKWYVTMDVTFME
Sbjct: 753 -----------------------------------NQKGYKCFDVDSRKWYVTMDVTFME 812
Query: 781 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGNGTD 840
HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTD+SDGNGNGTD
Sbjct: 813 HEAFFKPTIQSDQGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDESDGNGNGTD 872
Query: 841 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 900
ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT
Sbjct: 873 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 932
Query: 901 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 960
SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN
Sbjct: 933 SKYTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFNENLLSCKIPEN 992
Query: 961 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1020
VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA
Sbjct: 993 VDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEIDRYKA 1052
Query: 1021 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1080
RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV
Sbjct: 1053 RLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKNAFLHGELKEEV 1112
Query: 1081 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1140
YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR
Sbjct: 1113 YMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKR 1172
Query: 1141 NQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIV 1200
NQEKLTALIIYVDDMIV GNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRS+QGIV
Sbjct: 1173 NQEKLTALIIYVDDMIVMGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSKQGIV 1232
Query: 1201 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1260
LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR
Sbjct: 1233 LSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTR 1251
Query: 1261 PDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAG 1320
PDIAYGVSVVSQFMHNPSEDHMDAVMRI+RYLKGCPGK
Sbjct: 1293 PDIAYGVSVVSQFMHNPSEDHMDAVMRILRYLKGCPGK---------------------- 1251
Query: 1321 SVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1380
GYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG
Sbjct: 1353 ---------GYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLWLRRLLGELG 1251
Query: 1381 FAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQMVHVRSGEQLA 1440
FAPTQAMDLYCDSRPAIDISHNP MVHVRSGEQLA
Sbjct: 1413 FAPTQAMDLYCDSRPAIDISHNP--------------------------MVHVRSGEQLA 1251
Query: 1441 DILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
DILTKAVMSKTFTSIIDKLGMQDAYIPT
Sbjct: 1473 DILTKAVMSKTFTSIIDKLGMQDAYIPT 1251
BLAST of Cmc01g0005171 vs. ExPASy TrEMBL
Match:
A0A2N9IDK7 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29099 PE=4 SV=1)
HSP 1 Score: 1635.5 bits (4234), Expect = 0.0e+00
Identity = 825/1493 (55.26%), Positives = 1054/1493 (70.60%), Query Frame = 0
Query: 7 PPLGPATGPEGGLPPMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYK 66
PP +T P G I +G NY +WSQ+VE+ ++ KDKLG+ING+ QP D ++
Sbjct: 40 PPADSSTAPIG------IKLDGSNYALWSQVVEMYISGKDKLGYINGDSPQPPETDPSFR 99
Query: 67 RWMSDNSMVKGWILSSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYG 126
RW ++N++VKGW+++S+D +LI NFIRF TAK+VWDS T+FDG D +Q+YDL +V
Sbjct: 100 RWRTENAIVKGWLINSMDSSLIANFIRFPTAKQVWDSAATTYFDGTDTSQVYDLRRRVTR 159
Query: 127 LKQT-GTLEDYFYTLQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFD 186
KQ G++E Y+ LQGLW+EI+FR+PNPM C DI+KYN + Q+ +VYIFL GL+DR D
Sbjct: 160 TKQAGGSIEKYYNDLQGLWREIDFRRPNPMECANDIQKYNSILQEDRVYIFLDGLDDRLD 219
Query: 187 GIRAEILRMTPLPTVEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAGVPHIS 246
R+++L++ P PTVE+ Y HVRREDVRQ VM GA GV S
Sbjct: 220 KTRSDVLQLKPFPTVEQAYAHVRREDVRQMVM--------------TSGANTAPGVVMAS 279
Query: 247 TNSTHGNSNNSVPGDPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGGKGHSK 306
G+ + LSLSS S S +KCTHCG H++
Sbjct: 280 KGIKAGHYHTPPKTGVLSLSSGKSNPPSKS-----------KAPSDGMKCTHCGNAKHTR 339
Query: 307 DTCFKIHGYPDWYKELK-KKQAEA-----KRGKVSIA--------VSGKNGDDSPSEGII 366
+TCFK+HGYPDW+ +L+ +K+ EA G+V++ S +P
Sbjct: 340 ETCFKLHGYPDWWHDLQARKKHEAPVIDDSTGRVAMVTGEPSLSLTSQVESSHNPGNCSN 399
Query: 367 FLSSNLSNDNSTWIVDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVT 426
L S+ ND+ WI+DSGA+DHMTF D + +++ + NANG++YPV AG + ++
Sbjct: 400 ALHSSTHNDDDNWILDSGATDHMTFDSNDFSHITPPRRSHVANANGVTYPVTGAGIVTLS 459
Query: 427 SQLNLSNVLVVPSLATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLY 486
L+LS+ L+VPSL+ +LMSVS++T D NCVV MYS + ++QDILTKEIIGRG +R GLY
Sbjct: 460 PSLSLSHTLLVPSLSNKLMSVSQVTADLNCVVLMYSTFCLLQDILTKEIIGRGTKRGGLY 519
Query: 487 HLEDLKAGRASLVVDQTEVQNK---IWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCD 546
+++D RA+ + V NK IW WH RLGHPS GY+K L P LF N + F C+
Sbjct: 520 YVDDFSPSRANHM--HHTVNNKERQIWLWHHRLGHPSFGYLKHLFPDLFSNTMHSNFKCN 579
Query: 547 TCIKAKSHRVSYAPSSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYL 606
TCI AKSHRVSY S NKS PF LIHSDVWGP+PV++ G++WFV+F+DDCTRMTW+YL
Sbjct: 580 TCILAKSHRVSYPVSMNKSAIPFALIHSDVWGPSPVTTSSGHRWFVIFVDDCTRMTWLYL 639
Query: 607 LKSKDEVPSVFKNFYTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGT 666
LK KDEV VFK+F+ M++TQF I+ RSDNGGE++ + + +F G+ HETSC T
Sbjct: 640 LKHKDEVFDVFKSFHIMVQTQFSAKIQILRSDNGGEYVNQPFQAYFQSHGLFHETSCSQT 699
Query: 667 PQQNGVAERKNRHILEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTL 726
PQQNG+AERKNRHILE +RALL HVP+++WD ++ AV++LNR+PTK+ FQTPLK L
Sbjct: 700 PQQNGIAERKNRHILETARALLIGAHVPSRYWDDAVATAVHLLNRMPTKVLTFQTPLKVL 759
Query: 727 EKHHSIPSVLSLTPKIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSR 786
H +P+VL + P+IFGCVA+VH+ K QR+KL P AVRC+FLG+G ++KGY+CFD ++
Sbjct: 760 SNHVPLPTVLMIPPRIFGCVAFVHLHKNQRTKLDPCAVRCLFLGYGLHKKGYRCFDPTTK 819
Query: 787 KWYVTMDVTFMEHEAFF-KPTIQSD-QGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGV 846
+ Y+TMDVTF+E + FF P S QGE E W +G + +
Sbjct: 820 RTYITMDVTFLESDTFFPSPASNSTLQGELRDEEQNW-----------------WGSEEL 879
Query: 847 SGTDKSDGNGNGTDESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQG 906
D +G D + D ++MY R + ++ ESE+E +P
Sbjct: 880 HVEDNPAHMNDGNDMIEPDVQTFVG----VDMYPRAEPVSLANAESEDESPHSSVPDPND 939
Query: 907 PMSPEDQLQVSLPTSK-----------YTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANF 966
P S E+ +VS PT+ Y LP R NRG PP RY PD + ++SKYPIAN+
Sbjct: 940 PPS-ENIPEVSSPTTPLHTNAMDTSTGYVLPFRHNRGKPPNRYSPDIE-ERRSKYPIANY 999
Query: 967 VDTKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQG 1026
V T+ LS P+K F L SC IP +V+EA SDP W+QA++ E+ AL NKTW LV LP+G
Sbjct: 1000 VSTQRLSEPLKAFAHTLSSCNIPSSVEEALSDPKWAQAIKEELEALQKNKTWALVVLPEG 1059
Query: 1027 KIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLA 1086
K VGC+W+FSIKY A+G IDR KARLVAKGYTQT GID+ ETFSPVAKLNT+RVLLSLA
Sbjct: 1060 KKTVGCKWIFSIKYKADGSIDRCKARLVAKGYTQTYGIDYHETFSPVAKLNTVRVLLSLA 1119
Query: 1087 ANLDWPLHQFDVKNAFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWF 1146
ANLDWPLHQ DVKNAFLHG+L+EEVYM+ PPGY +++ + CKL +ALYGLKQSPRAWF
Sbjct: 1120 ANLDWPLHQLDVKNAFLHGDLEEEVYMDIPPGYTASSKAKIACKLQRALYGLKQSPRAWF 1179
Query: 1147 GRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGE 1206
GRF AM+ YGF+Q +SDHTLFLK K+TALI+YVDDMI+TG+D +EI+ L+++LS E
Sbjct: 1180 GRFSSAMRKYGFQQSNSDHTLFLKHRLGKVTALIVYVDDMIITGDDAEEISRLQEQLSTE 1239
Query: 1207 FEMKNLGGLKYFLGIEVMRSRQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEF 1266
FEMKNLGGLKYFLGIEV RSRQGI LSQRKY+LDLL+E+G+L+C+PADTP+V KLGE+
Sbjct: 1240 FEMKNLGGLKYFLGIEVARSRQGIFLSQRKYVLDLLSEVGLLECKPADTPIVPNHKLGEY 1299
Query: 1267 PDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGC 1326
DQVPA+KERYQRLVGKLIYL+HTRPDIAY VSVVSQFMH PSEDHMDAV+RI+RYLK
Sbjct: 1300 TDQVPADKERYQRLVGKLIYLSHTRPDIAYAVSVVSQFMHCPSEDHMDAVIRILRYLKSS 1359
Query: 1327 PGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSA 1386
PGKG+ F KN HL+V G+TDADWAG++SDR+ST+GYFTFVGGNLVTWRSKKQ VVALSSA
Sbjct: 1360 PGKGLMFSKNNHLNVDGYTDADWAGNISDRKSTSGYFTFVGGNLVTWRSKKQKVVALSSA 1419
Query: 1387 EAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEID 1446
EAEFRG+AKG+CELLWLRRLL E+GFAP+ M+L+CD++ AIDISHNPVQHDRTKHVE+D
Sbjct: 1420 EAEFRGMAKGLCELLWLRRLLAEIGFAPSSEMNLFCDNKAAIDISHNPVQHDRTKHVEVD 1476
Query: 1447 RHFIKEKLESNVIQMVHVRSGEQLADILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
RHFIK LE +I+ V+S +QLADILTKAV ++ F +DKLG++D Y PT
Sbjct: 1480 RHFIKHNLEEKIIRFPFVKSEDQLADILTKAVSTRNFYDSLDKLGIRDIYAPT 1476
BLAST of Cmc01g0005171 vs. ExPASy TrEMBL
Match:
A0A2N9IYR6 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS25009 PE=4 SV=1)
HSP 1 Score: 1634.8 bits (4232), Expect = 0.0e+00
Identity = 821/1479 (55.51%), Positives = 1050/1479 (70.99%), Query Frame = 0
Query: 21 PMTIVFNGRNYGVWSQMVEVLLASKDKLGHINGERTQPASNDTGYKRWMSDNSMVKGWIL 80
P+ I +G NY +WSQ+VE+ ++ KDKLG+ING+ QP D ++RW ++N++VKGW++
Sbjct: 48 PIGIKLDGSNYALWSQVVEMYISGKDKLGYINGDSPQPPETDPSFRRWRTENAIVKGWLI 107
Query: 81 SSLDPNLIGNFIRFSTAKEVWDSIKLTFFDGNDNTQIYDLEVKVYGLKQT-GTLEDYFYT 140
+S+D +LI NFIRF TAK+VWDS T+FDG D +Q+YDL +V KQ G++E Y+
Sbjct: 108 NSMDSSLIANFIRFPTAKQVWDSAATTYFDGTDTSQVYDLRRRVTRTKQAGGSIEKYYND 167
Query: 141 LQGLWKEIEFRKPNPMTCPIDIEKYNRVEQDRKVYIFLGGLEDRFDGIRAEILRMTPLPT 200
LQGLW+EI+FR+PNPM C DI+KYN + Q+ +VYIFL GL+DR D R+++L++ P PT
Sbjct: 168 LQGLWREIDFRRPNPMECANDIQKYNSILQEDRVYIFLDGLDDRLDKTRSDVLQLKPFPT 227
Query: 201 VEETYGHVRREDVRQSVMVGKSTDLSNSAAMIVQGAKQVAGVPHISTNSTHGNSNNSVPG 260
VE+ Y HVRREDVRQ VM GA GV S G+ +
Sbjct: 228 VEQAYAHVRREDVRQMVM--------------TSGANTAPGVVMASKGIKAGHYHTPPKT 287
Query: 261 DPLSLSSQVQGKESSSNQGAVHFTTRYNKTEAVLKCTHCGGKGHSKDTCFKIHGYPDWYK 320
LSLSS S S +KCTHCG H+++TCFK+HGYPDW+
Sbjct: 288 GVLSLSSGKSNPPSKS-----------KAPSDGMKCTHCGNAKHTRETCFKLHGYPDWWH 347
Query: 321 ELK-KKQAEA-----KRGKVSIA--------VSGKNGDDSPSEGIIFLSSNLSNDNSTWI 380
+L+ +K+ EA G+V++ S +P L S+ ND+ WI
Sbjct: 348 DLQARKKHEAPVIDDSTGRVAMVTGEPSLSLTSQVESSHNPGNCSNALHSSTHNDDDNWI 407
Query: 381 VDSGASDHMTFTKTDLTAECVTKKTEILNANGISYPVKCAGSIKVTSQLNLSNVLVVPSL 440
+DSGA+DHMTF D + +++ + NANG++YPV AG + ++ L+LS+ L+VPSL
Sbjct: 408 LDSGATDHMTFDSNDFSHITPPRRSHVANANGVTYPVTGAGIVTLSPSLSLSHTLLVPSL 467
Query: 441 ATRLMSVSKLTKDRNCVVKMYSDYFIIQDILTKEIIGRGIERDGLYHLEDLKAGRASLVV 500
+ +LMSVS++T D NCVV MYS + ++QDILTKEIIGRG +R GLY+++D RA+ +
Sbjct: 468 SNKLMSVSQVTADLNCVVLMYSTFCLLQDILTKEIIGRGTKRGGLYYVDDFSPSRANHM- 527
Query: 501 DQTEVQNK---IWTWHRRLGHPSLGYMKKLLPHLFKNNNLTVFNCDTCIKAKSHRVSYAP 560
V NK IW WH RLGHPS GY+K L P LF N + F C+TCI AKSHRVSY
Sbjct: 528 -HHTVNNKERQIWLWHHRLGHPSFGYLKHLFPDLFSNTMHSNFKCNTCILAKSHRVSYPV 587
Query: 561 SSNKSNSPFDLIHSDVWGPAPVSSIDGNKWFVLFIDDCTRMTWVYLLKSKDEVPSVFKNF 620
S NKS PF LIHSDVWGP+PV++ G++WFV+F+DDCTRMTW+YLLK KDEV VFK+F
Sbjct: 588 SMNKSAIPFALIHSDVWGPSPVTTSSGHRWFVIFVDDCTRMTWLYLLKHKDEVFDVFKSF 647
Query: 621 YTMIKTQFGKGIKFFRSDNGGEFIGKVLKDFFVQMGIVHETSCVGTPQQNGVAERKNRHI 680
+ M++TQF I+ RSDNGGE++ + + +F G+ HETSC TPQQNG+AERKNRHI
Sbjct: 648 HIMVQTQFSAKIQILRSDNGGEYVNQPFQAYFQSHGLFHETSCSQTPQQNGIAERKNRHI 707
Query: 681 LEMSRALLFEYHVPTKFWDKSILMAVYVLNRLPTKINNFQTPLKTLEKHHSIPSVLSLTP 740
LE +RALL HVP+++WD ++ AV++LNR+PTK+ FQTPLK L H +P+VL + P
Sbjct: 708 LETARALLIGAHVPSRYWDDAVATAVHLLNRMPTKVLTFQTPLKVLSNHVPLPTVLMIPP 767
Query: 741 KIFGCVAYVHVPKTQRSKLSPYAVRCVFLGFGQNQKGYKCFDVDSRKWYVTMDVTFMEHE 800
+IFGCVA+VH+ K QR+KL P AVRC+FLG+G ++KGY+CFD +++ Y+TMDVTF+E +
Sbjct: 768 RIFGCVAFVHLHKNQRTKLDPCAVRCLFLGYGLHKKGYRCFDPTTKRTYITMDVTFLESD 827
Query: 801 AFF-KPTIQSD-QGENSSESDVWKNKDLLSFQGWTPGCEQYGQDGVSGTDKSDGNGNGTD 860
FF P S QGE E W +G + + D +G D
Sbjct: 828 TFFPSPASNSTLQGELRDEEQNW-----------------WGSEELHVEDNPAHMNDGND 887
Query: 861 ESDGDGNGATDGNKPLEMYSRNKNRAVFGMESENEEVIQPNEEPQGPMSPEDQLQVSLPT 920
+ D ++MY R + ++ ESE+E +P P S E+ +VS PT
Sbjct: 888 MIEPDVQTFVG----VDMYPRAEPVSLANAESEDESPHSSVPDPNDPPS-ENIPEVSSPT 947
Query: 921 SK-----------YTLPVRCNRGIPPKRYEPDEDRHKKSKYPIANFVDTKSLSGPVKRFN 980
+ Y LP R NRG PP RY PD + ++SKYPIAN+V T+ LS P+K F
Sbjct: 948 TPLHTNAMDTSTGYVLPFRHNRGKPPNRYSPDIE-ERRSKYPIANYVSTQRLSEPLKAFA 1007
Query: 981 ENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKY 1040
L SC IP +V+EA SDP W+QA++ E+ AL NKTW LV LP+GK VGC+W+FSIKY
Sbjct: 1008 HTLSSCNIPSSVEEALSDPKWAQAIKEELEALQKNKTWALVVLPEGKKTVGCKWIFSIKY 1067
Query: 1041 NANGEIDRYKARLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLDWPLHQFDVKN 1100
A+G IDR KARLVAKGYTQT GID+ ETFSPVAKLNT+RVLLSLAANLDWPLHQ DVKN
Sbjct: 1068 KADGSIDRCKARLVAKGYTQTYGIDYHETFSPVAKLNTVRVLLSLAANLDWPLHQLDVKN 1127
Query: 1101 AFLHGELKEEVYMEQPPGYKSTNAKPMVCKLNKALYGLKQSPRAWFGRFCRAMQSYGFKQ 1160
AFLHG+L+EEVYM+ PPGY +++ + CKL +ALYGLKQSPRAWFGRF AM+ YGF+Q
Sbjct: 1128 AFLHGDLEEEVYMDIPPGYTASSKAKIACKLQRALYGLKQSPRAWFGRFSSAMRKYGFQQ 1187
Query: 1161 CDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLG 1220
+SDHTLFLK K+TALI+YVDDMI+TG+D +EI+ L+++LS EFEMKNLGGLKYFLG
Sbjct: 1188 SNSDHTLFLKHRLGKVTALIVYVDDMIITGDDAEEISRLQEQLSTEFEMKNLGGLKYFLG 1247
Query: 1221 IEVMRSRQGIVLSQRKYILDLLAEIGMLDCRPADTPVVQGVKLGEFPDQVPANKERYQRL 1280
IEV RSRQGI LSQRKY+LDLL+E+G+L+C+PADTP+V KLGE+ DQVPA+KERYQRL
Sbjct: 1248 IEVARSRQGIFLSQRKYVLDLLSEVGLLECKPADTPIVPNHKLGEYTDQVPADKERYQRL 1307
Query: 1281 VGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLD 1340
VGKLIYL+HTRPDIAY VSVVSQFMH PSEDHMDAV+RI+RYLK PGKG+ F KN HL+
Sbjct: 1308 VGKLIYLSHTRPDIAYAVSVVSQFMHCPSEDHMDAVIRILRYLKSSPGKGLMFSKNNHLN 1367
Query: 1341 VSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICEL 1400
V G+TDADWAG++SDR+ST+GYFTFVGGNLVTWRSKKQ VVALSSAEAEFRG+AKG+CEL
Sbjct: 1368 VDGYTDADWAGNISDRKSTSGYFTFVGGNLVTWRSKKQKVVALSSAEAEFRGMAKGLCEL 1427
Query: 1401 LWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNPVQHDRTKHVEIDRHFIKEKLESNVIQ 1460
LWLRRLL E+GFAP+ M+L+CD++ AIDISHNPVQHDRTKHVE+DRHFIK LE +I+
Sbjct: 1428 LWLRRLLAEIGFAPSSEMNLFCDNKAAIDISHNPVQHDRTKHVEVDRHFIKHNLEEKIIR 1476
Query: 1461 MVHVRSGEQLADILTKAVMSKTFTSIIDKLGMQDAYIPT 1469
V+S +QLADILTKAV ++ F +DKLG++D Y PT
Sbjct: 1488 FPFVKSEDQLADILTKAVSTRNFYDSLDKLGIRDIYAPT 1476
BLAST of Cmc01g0005171 vs. TAIR 10
Match:
AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )
HSP 1 Score: 428.7 bits (1101), Expect = 1.9e-119
Identity = 209/497 (42.05%), Positives = 312/497 (62.78%), Query Frame = 0
Query: 930 YPIANFVDTKSLSGPVKRFNENLLSCKIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTL 989
+ I+ F+ + +S F + K P +EAK W AM+ E+ A+ TW +
Sbjct: 58 HDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEI 117
Query: 990 VELPQGKIPVGCRWVFSIKYNANGEIDRYKARLVAKGYTQTQGIDFQETFSPVAKLNTIR 1049
LP K P+GC+WV+ IKYN++G I+RYKARLVAKGYTQ +GIDF ETFSPV KL +++
Sbjct: 118 CTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVK 177
Query: 1050 VLLSLAANLDWPLHQFDVKNAFLHGELKEEVYMEQPPGYKSTNAKPM----VCKLNKALY 1109
++L+++A ++ LHQ D+ NAFL+G+L EE+YM+ PPGY + + VC L K++Y
Sbjct: 178 LILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIY 237
Query: 1110 GLKQSPRAWFGRFCRAMQSYGFKQCDSDHTLFLKRNQEKLTALIIYVDDMIVTGNDTQEI 1169
GLKQ+ R WF +F + +GF Q SDHT FLK +++YVDD+I+ N+ +
Sbjct: 238 GLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAV 297
Query: 1170 ATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIVLSQRKYILDLLAEIGMLDCRPADTP 1229
L+ +L F++++LG LKYFLG+E+ RS GI + QRKY LDLL E G+L C+P+ P
Sbjct: 298 DELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVP 357
Query: 1230 VVQGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAV 1289
+ V + + Y+RL+G+L+YL TR DI++ V+ +SQF P H AV
Sbjct: 358 MDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAV 417
Query: 1290 MRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVSDRRSTAGYFTFVGGNLVTWRSK 1349
M+I+ Y+KG G+G+ + + + F+DA + RRST GY F+G +L++W+SK
Sbjct: 418 MKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSK 477
Query: 1350 KQSVVALSSAEAEFRGVAKGICELLWLRRLLGELGFAPTQAMDLYCDSRPAIDISHNPVQ 1409
KQ VV+ SSAEAE+R ++ E++WL + EL ++ L+CD+ AI I+ N V
Sbjct: 478 KQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVF 537
Query: 1410 HDRTKHVEIDRHFIKEK 1423
H+RTKH+E D H ++E+
Sbjct: 538 HERTKHIESDCHSVRER 554
BLAST of Cmc01g0005171 vs. TAIR 10
Match:
ATMG00810.1 (DNA/RNA polymerases superfamily protein )
HSP 1 Score: 188.7 bits (478), Expect = 3.3e-47
Identity = 96/228 (42.11%), Positives = 140/228 (61.40%), Query Frame = 0
Query: 1148 LIIYVDDMIVTGNDTQEIATLEKKLSGEFEMKNLGGLKYFLGIEVMRSRQGIVLSQRKYI 1207
L++YVDD+++TG+ + L +LS F MK+LG + YFLGI++ G+ LSQ KY
Sbjct: 3 LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62
Query: 1208 LDLLAEIGMLDCRPADTPVV----QGVKLGEFPDQVPANKERYQRLVGKLIYLAHTRPDI 1267
+L GMLDC+P TP+ V ++PD P++ ++ +VG L YL TRPDI
Sbjct: 63 EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPD--PSD---FRSIVGALQYLTLTRPDI 122
Query: 1268 AYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGFTDADWAGSVS 1327
+Y V++V Q MH P+ D + R++RY+KG G+ KN L+V F D+DWAG S
Sbjct: 123 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 182
Query: 1328 DRRSTAGYFTFVGGNLVTWRSKKQSVVALSSAEAEFRGVAKGICELLW 1372
RRST G+ TF+G N+++W +K+Q V+ SS E E+R +A EL W
Sbjct: 183 TRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of Cmc01g0005171 vs. TAIR 10
Match:
ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )
HSP 1 Score: 109.8 bits (273), Expect = 1.9e-23
Identity = 52/104 (50.00%), Positives = 71/104 (68.27%), Query Frame = 0
Query: 956 KIPENVDEAKSDPNWSQAMEAEMSALYNNKTWTLVELPQGKIPVGCRWVFSIKYNANGEI 1015
K P++V A DP W QAM+ E+ AL NKTW LV P + +GC+WVF K +++G +
Sbjct: 26 KEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTL 85
Query: 1016 DRYKARLVAKGYTQTQGIDFQETFSPVAKLNTIRVLLSLAANLD 1060
DR KARLVAKG+ Q +GI F ET+SPV + TIR +L++A L+
Sbjct: 86 DRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVAQQLE 129
BLAST of Cmc01g0005171 vs. TAIR 10
Match:
ATMG00240.1 (Gag-Pol-related retrotransposon family protein )
HSP 1 Score: 66.6 bits (161), Expect = 1.9e-10
Identity = 29/82 (35.37%), Positives = 45/82 (54.88%), Query Frame = 0
Query: 1254 IYLAHTRPDIAYGVSVVSQFMHNPSEDHMDAVMRIIRYLKGCPGKGITFKKNGHLDVSGF 1313
+YL TRPD+ + V+ +SQF M AV +++ Y+KG G+G+ + L + F
Sbjct: 1 MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60
Query: 1314 TDADWAGSVSDRRSTAGYFTFV 1336
D+DWA RRS G+ + V
Sbjct: 61 ADSDWASCPDTRRSVTGFCSLV 82
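The percentages in each HSP statistics line are simply the residue counts divided by the alignment length. The sketch below, which is not part of the database output, re-derives them from one such line. The names `HSP_STATS` and `check_hsp_stats` are hypothetical, and the pattern accepts both the "Positives" and "Postives" spellings, on the assumption that either may appear in exported text.

```python
import re

# Hypothetical pattern for an HSP statistics line as printed in this report.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line: str) -> dict:
    """Recompute identity and positive percentages from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "reported_identity": float(ident_pct),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        "reported_positives": float(pos_pct),
    }


# The ATMG00240.1 HSP from the alignment above.
stats = check_hsp_stats(
    "Identity = 29/82 (35.37%), Postives = 45/82 (54.88%), Query Frame = 0"
)
```

For this HSP the recomputed values agree with the reported ones: 29 of 82 aligned positions are identical and 45 of 82 are scored as positive.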
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
KAA0037867.1 | 0.0e+00 | 99.32 | putative polyprotein (retrotrasposon protein) [Cucumis melo var. makuwa]
TYK09814.1 | 0.0e+00 | 99.11 | putative polyprotein (retrotrasposon protein) [Cucumis melo var. makuwa]
TYJ98005.1 | 0.0e+00 | 82.43 | gag-pol polyprotein [Cucumis melo var. makuwa]
RVW36328.1 | 0.0e+00 | 55.04 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
RVX17869.1 | 0.0e+00 | 50.74 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
Match Name | E-value | Identity (%) | Description
Q94HW2 | 2.6e-198 | 31.03 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 6.6e-194 | 30.76 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978 | 1.1e-180 | 34.69 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 3.4e-142 | 29.09 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92519 | 4.6e-46 | 42.11 | Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...
Match Name | E-value | Identity (%) | Description
A0A5A7T8G9 | 0.0e+00 | 99.32 | Putative polyprotein (Retrotrasposon protein) OS=Cucumis melo var. makuwa OX=119...
A0A5D3CF38 | 0.0e+00 | 99.11 | Putative polyprotein (Retrotrasposon protein) OS=Cucumis melo var. makuwa OX=119...
A0A5D3BHP1 | 0.0e+00 | 82.43 | Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold487G...
A0A2N9IDK7 | 0.0e+00 | 55.26 | Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB...
A0A2N9IYR6 | 0.0e+00 | 55.51 | Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB...
Match Name | E-value | Identity (%) | Description
AT4G23160.1 | 1.9e-119 | 42.05 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1 | 3.3e-47 | 42.11 | DNA/RNA polymerases superfamily protein
ATMG00820.1 | 1.9e-23 | 50.00 | Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00240.1 | 1.9e-10 | 35.37 | Gag-Pol-related retrotransposon family protein
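The summary rows above are pipe-delimited, so they can be loaded into records for sorting or filtering. The following is a minimal sketch under that assumption. `Hit` and `parse_hit_row` are hypothetical names, not part of any database tooling. Only the first four fields are read, so a trailing "[more]" column, if present in an export, is ignored.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity as printed in the table
    description: str


def parse_hit_row(row: str) -> Hit:
    """Parse one 'name | e-value | identity | description' summary row."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return Hit(name, float(evalue), float(identity), description)


# The four TAIR 10 rows from the table above.
rows = [
    "AT4G23160.1 | 1.9e-119 | 42.05 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8",
    "ATMG00810.1 | 3.3e-47 | 42.11 | DNA/RNA polymerases superfamily protein",
    "ATMG00820.1 | 1.9e-23 | 50.00 | Reverse transcriptase (RNA-dependent DNA polymerase)",
    "ATMG00240.1 | 1.9e-10 | 35.37 | Gag-Pol-related retrotransposon family protein",
]
hits = [parse_hit_row(row) for row in rows]

# The lowest E-value marks the most significant match.
best = min(hits, key=lambda hit: hit.evalue)
```

Ranking by E-value rather than percent identity matters here: ATMG00820.1 has the highest identity of the four, but over a much shorter alignment, so AT4G23160.1 remains the most significant hit.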