CSPI01G12910 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G12910
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Reverse transcriptase
Location: Chr1: 8411815 .. 8415195 (+)
RNA-Seq Expression: CSPI01G12910
Synteny: CSPI01G12910
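
The Location field can be split into a sequence ID, start, end and strand. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a string like 'Chr1: 8411815 .. 8415195 (+)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Chr1: 8411815 .. 8415195 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 3381 bp on the plus strand of Chr1.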
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGGTCGAAGAGGTAAAAACCCTGCCGCGGGGGAAAACCGTACGCAAGAAGTAGCGGAGGAAATCACCGCCCTCTCTCCAAGGACAACAACAGTTCGCTTGCTGGCTGTTGAGGAATCCTTGGGAGATCTCCGTAACATATTTGATAGATTGATAGAAAGCGTCGAATTGTTAAGCCGAAGGAAAGAATACCCACAACCACCACCACGGAACGAAATCAACTTCCAAAACAACCAACGTTTTGGTGAAGCAAGAGGCCGGCGAGCAAGGGAAAACTTCAGAAACGTGAACAACCCACGAGGTTTTCAAAGAAGGAGACCCGGGTACGCCATACCACAACAGTTTGACGAAGATTTTCAAGAAGACCAAGAAGTATGGCAAGAAATCCAAGAAGATGATTCTTCAAGTGGGGATGAACAAGGAAACATGTGGAACTTCAATGATGACTTGCGAGCAGGAAGAAATAACCAAAGAAATGAAGTCAGAAGAGGAGAGTACCACGACTATAAGATGAAGATTGACCTTCCCGTGTATGATGGCAAACAAAATATAGAAGCATTCCTAGACTGGATAAAGAGCACTGAGAATTTCTTCAACTACATGGATATACCCGAACGCAAGAAAGTCCATCTAGTAGCCTTAAAGTTAAGAGCCGGTGCATCAACTTGGTGGGATCAATTGGAAATTAACAGACAAAGATGTGGGAAACAGTCGATCCGCTCGTGGGAAAAGATGAAGAAGTTGCTGAAAGCAAGATTCCTACCCCCAAACTATGAACAAACACTCTACAATCAGTACCAAAACTGTCGCCAAGGTGTCCGTTCAGTAGTTGATTACATTGAAGAATTCCACTGCCTGAGTGCAAGAACGAACCTGAGCGAAAATGAACAACACCAGATTGCAAGATTTGTGGGAGGTCTCCGACTCGACATCAAGGAAAAAGTCAAACTACAACCATTCCGTTTCTTGTCTGAAGCAATATCCTTTGCAGAAACAGTGGAAGAAATGATTGCGGTTCGATCCAAAAACCTAAAGAGAAGACCAGCATGGGAGACAACTTCAACAAGAATGAACAATTATGCGGACAAAACAAACGACCAACCCTCAACCTCAACAAAAGGAAAAGGGAAGGAAGTTGAAAATCAAGAAGTAGCCGTTGAAAGAAAGAATGAACAAACATTCAAAACCAGTAGTCAGAACAACTACTCCCGCCCTTTATTAGGAAAATTCTTCCGATGTGGCCAAACTGAACACCTCTCCAACAACTGCCCGCAAAGAAAAACCATAGCAATAGCCGAAGAAGGAAGGCAGATGAGTGAAGATAGTAAAGAAGCAGAAGACGAAACTGAACTGATTGAAGCAGATGACGAGGAAAGGGTCTCTTGTGTCATCCAACGGGTACTCATCACACCAAAGAAGAAAAGAACCAGCAACGCCACTGTCTTTTCAAGGCAAGATGCACCATAAACGGAAGGGTATGTGATGTAATCATAGACAACGACAGTAGCAAAAACTTCGTAGCAAAGAAACTAGTAACAGTCTTGAACCTAAAGGCTGAAGCACATCCAACCCCCTACAAGATAGGTTGGGTAAGAAAAGGAGGAGAAGTCACGGTTAGCGAAATCTGCACAGTCCCTCTCTCCATTGAAAACGCCTACAAAGACCAAATTGTTTGTGACGTCATTGAGATGGACGTATGCCATCTCCTATTAGGAAGACCTTGGCAGTATGATACCCAATCCTTACACAAAGGAAGAGAAAATACGTATGAATTACAATGGATGGGGAGAAAGGTAGTTCTACTCCCAATAACAAGAAAGAATAAGGAAGGATTAAGAGGTGAGAAACAACTATTCACCACCGTTAGTGGAAAGAATATGCTTAAAGAAAGGGAACAGGACCTCATAGGACTAGTTGTTATTGAAAAAACTAAGGAATGACAAGTCGAAGACATAGAACCCGAATTACAGCAGCTCCTTTATGAGTTCCCACGCATA
AAGGAAGAACCAGAGGGACTCCCACCTCTTCGAGACATACAGCACCACATAGACTTGATCTCGGGAGCATCATTACCAAACTTGGCTCACTATAGGATGAGTCCCCAGGAGTACAAAACACTTCATGACCATATTGAGGAACTATTAAAGAAAGGGCACATCCAACCGAGCCTCAGCCCTTGTGCAGTACCAGCCCTTCTCACACCAAAGAAATATGGGAGTTGGAGAATGTGTGTTGACAGCAGAGCCATCAATCGTATCACGGTAAAGTATAGATTTTCCATCCCAAGGATTAGTGACCTGCTTGATCAACTCGGCAAAGCCAGCATTTTTTCGAAGATTGATTTAAAAAGTGGCTACCACCAAATACGTATAAGACCTGGCGATGAATGGAAAACAACTTTCAAGACAAACGAAGGCTTATTTGAATGGATGGTCATGCCATTTGGCCTTTCTAATGCACCCAACACCTTCATGAGATTGATGAACCAGATACTTCACCCATTTCTCAACAAATTCATAGTCGTCTACTTCGATGACATACTCGTTTACAGCACAAACAACGAGGAGCATTTACTACATCTAAGAAAAATGTTCCAGGCTTGACAGAGACAGAACTCTACATCAACACTAAGAAAAGCATGTTTATGAAAAGAGAAATTGCATTCCTCGATTTTGTAATCAAACAAGGAAGCATAAGCATGGAACCAAAGAAGATTGAAGCCATCCACACATGGCCGATTCCTGCCTCCATTAAAGAAATACAAGCCTTCCTTGGCCTGGCTTCATTTTACAGAAAATTCATCAGGAACTTCAACTCTTTAGCCGCACCCCTCACCGACTGTCTAAAGAAAGGAAACTTTAAATGGACCCCATTGTAACAAGAGAGCTTTGAAGATATCAAAAAGAAATTGACATCCAACCTCATCCTTAAATTACCAGACTTCTCTTCACCTTTTGAAGTAGAAGTCGACGCATGCTGCACAGGGATTGGAGTTGTCCTAGCTCAGCAAGGACACCCTATCGAATACTTCAGTGAAAAGCTCAACCCCTCAAGACAGTCATGGAGCACATATGAACAAGAGTTGTATGCCCTTGTGCGAGCACTAAAACAATGGGAGCACTACCTACTCTCCAAAGAATTCGTACTCCTAACTGATCACTTCTCACTAAAGTACCTTCAAGCTCAAAAAAACATCAGCAGGATGCACACATGCTGGATATCCTTCCTCCAAAGGTTTGATTTTGTGATCAAACACCAATCAGGCAAAGACAACAAGGTGGCCGATGCCCTAAGCAGAAAAGGCTTCCTACTCACATTGTTGTCTTCGAAAATCATAGCATTCAAGCATTTACCCGACCTATACGAAGAAGATATTGA

mRNA sequence

ATGGCTGGTCGAAGAGGTAAAAACCCTGCCGCGGGGGAAAACCGTACGCAAGAAGTAGCGGAGGAAATCACCGCCCTCTCTCCAAGGACAACAACAGTTCGCTTGCTGGCTGTTGAGGAATCCTTGGGAGATCTCCGTAACATATTTGATAGATTGATAGAAAGCGTCGAATTGTTAAGCCGAAGGAAAGAATACCCACAACCACCACCACGGAACGAAATCAACTTCCAAAACAACCAACGTTTTGGTGAAGCAAGAGGCCGGCGAGCAAGGGAAAACTTCAGAAACGTGAACAACCCACGAGGTTTTCAAAGAAGGAGACCCGGGTACGCCATACCACAACAGTTTGACGAAGATTTTCAAGAAGACCAAGAAGTATGGCAAGAAATCCAAGAAGATGATTCTTCAAGTGGGGATGAACAAGGAAACATGTGGAACTTCAATGATGACTTGCGAGCAGGAAGAAATAACCAAAGAAATGAAGTCAGAAGAGGAGAGTACCACGACTATAAGATGAAGATTGACCTTCCCGTGTATGATGGCAAACAAAATATAGAAGCATTCCTAGACTGGATAAAGAGCACTGAGAATTTCTTCAACTACATGGATATACCCGAACGCAAGAAAGTCCATCTAGTAGCCTTAAAGTTAAGAGCCGGTGCATCAACTTGGTGGGATCAATTGGAAATTAACAGACAAAGATGTGGGAAACAGTCGATCCGCTCGTGGGAAAAGATGAAGAAGTTGCTGAAAGCAAGATTCCTACCCCCAAACTATGAACAAACACTCTACAATCAGTACCAAAACTGTCGCCAAGGTGTCCGTTCAGTAGTTGATTACATTGAAGAATTCCACTGCCTGAGTGCAAGAACGAACCTGAGCGAAAATGAACAACACCAGATTGCAAGATTTGTGGGAGGTCTCCGACTCGACATCAAGGAAAAAGTCAAACTACAACCATTCCGTTTCTTGTCTGAAGCAATATCCTTTGCAGAAACAGTGGAAGAAATGATTGCGGTTCGATCCAAAAACCTAAAGAGAAGACCAGCATGGGAGACAACTTCAACAAGAATGAACAATTATGCGGACAAAACAAACGACCAACCCTCAACCTCAACAAAAGGAAAAGGGAAGGAAGTTGAAAATCAAGAAGTAGCCGTTGAAAGAAAGAATGAACAAACATTCAAAACCAGTAGTCAGAACAACTACTCCCGCCCTTTATTAGGAAAATTCTTCCGATGTGGCCAAACTGAACACCTCTCCAACAACTGCCCGCAAAGAAAAACCATAGCAATAGCCGAAGAAGGAAGGCAGATGAGTGAAGATAGTAAAGAAGCAGAAGACGAAACTGAACTGATTGAAGCAGATGACGAGGAAAGGGTCTCTTGTGTCATCCAACGGGCAAGATGCACCATAAACGGAAGGGTATGTGATGTAATCATAGACAACGACAGTAGCAAAAACTTCGTAGCAAAGAAACTAGTAACAGTCTTGAACCTAAAGGCTGAAGCACATCCAACCCCCTACAAGATAGGTTGGGTAAGAAAAGGAGGAGAAGTCACGGTTAGCGAAATCTGCACAGTCCCTCTCTCCATTGAAAACGCCTACAAAGACCAAATTGTTTGTGACGTCATTGAGATGGACGTATGCCATCTCCTATTAGGAAGACCTTGGCAGTATGATACCCAATCCTTACACAAAGGAAGAGAAAATACGTATGAATTACAATGGATGGGGAGAAAGGTAGTTCTACTCCCAATAACAAGAAAGAATAAGGAAGGATTAAGAGGTGAGAAACAACTATTCACCACCCAGCTCCTTTATGAGTTCCCACGCATAAAGGAAGAACCAGAGGGACTCCCACCTCTTCGAGACATACAGCACCACATAGACTTGATCTCGGGAGCATCATTACCAAACTTGGCTCACTATAGGATGAGTCCCCAGGAGTACAAAACACTTCATGACCATATTGAGGAACTATTAAAGAAAGGGCACAT
CCAACCGAGCCTCAGCCCTTGTGCAGTACCAGCCCTTCTCACACCAAAGAAATATGGGAGTTGGAGAATGTGTGTTGACAGCAGAGCCATCAATCGTATCACGGTAAAGTATAGATTTTCCATCCCAAGGATTAGTGACCTGCTTGATCAACTCGGCAAAGCCAGCATTTTTTCGAAGATTGATTTAAAAAGTGGCTACCACCAAATACGTATAAGACCTGGCGATGAATGGAAAACAACTTTCAAGACAAACGAAGGCTTATTTGAATGGATGCACAAACAACGAGGAGCATTTACTACATCTAAGAAAAATGTTCCAGGCTTGACAGAGACAGAACTCTACATCAACACTAAGAAAAGCATGTTTATGAAAAGAGAAATTGCATTCCTCGATTTTGTAATCAAACAAGGAAGCATAAGCATGGAACCAAAGAAGATTGAAGCCATCCACACATGGCCGATTCCTGCCTCCATTAAAGAAATACAAGCCTTCCTTGGCCTGGCTTCATTTTACAGAAAATTCATCAGGAACTTCAACTCTTTAGCCGCACCCCTCACCGACTACTTCTCTTCACCTTTTGAAGTAGAAGTCGACGCATGCTGCACAGGGATTGGAGTTGTCCTAGCTCAGCAAGGACACCCTATCGAATACTTCAGTGAAAAGCTCAACCCCTCAAGACAGTCATGGAGCACATATGAACAAGAGTTGTATGCCCTTGTGCGAGCACTAAAACAATGGGAGCACTACCTACTCTCCAAAGAATTCGTACTCCTAACTGATCACTTCTCACTAAAGCAAAGACAACAAGGTGGCCGATGCCCTAAGCAGAAAAGGCTTCCTACTCACATTGTTGTCTTCGAAAATCATAGCATTCAAGCATTTACCCGACCTATACGAAGAAGATATTGA

Coding sequence (CDS)

ATGGCTGGTCGAAGAGGTAAAAACCCTGCCGCGGGGGAAAACCGTACGCAAGAAGTAGCGGAGGAAATCACCGCCCTCTCTCCAAGGACAACAACAGTTCGCTTGCTGGCTGTTGAGGAATCCTTGGGAGATCTCCGTAACATATTTGATAGATTGATAGAAAGCGTCGAATTGTTAAGCCGAAGGAAAGAATACCCACAACCACCACCACGGAACGAAATCAACTTCCAAAACAACCAACGTTTTGGTGAAGCAAGAGGCCGGCGAGCAAGGGAAAACTTCAGAAACGTGAACAACCCACGAGGTTTTCAAAGAAGGAGACCCGGGTACGCCATACCACAACAGTTTGACGAAGATTTTCAAGAAGACCAAGAAGTATGGCAAGAAATCCAAGAAGATGATTCTTCAAGTGGGGATGAACAAGGAAACATGTGGAACTTCAATGATGACTTGCGAGCAGGAAGAAATAACCAAAGAAATGAAGTCAGAAGAGGAGAGTACCACGACTATAAGATGAAGATTGACCTTCCCGTGTATGATGGCAAACAAAATATAGAAGCATTCCTAGACTGGATAAAGAGCACTGAGAATTTCTTCAACTACATGGATATACCCGAACGCAAGAAAGTCCATCTAGTAGCCTTAAAGTTAAGAGCCGGTGCATCAACTTGGTGGGATCAATTGGAAATTAACAGACAAAGATGTGGGAAACAGTCGATCCGCTCGTGGGAAAAGATGAAGAAGTTGCTGAAAGCAAGATTCCTACCCCCAAACTATGAACAAACACTCTACAATCAGTACCAAAACTGTCGCCAAGGTGTCCGTTCAGTAGTTGATTACATTGAAGAATTCCACTGCCTGAGTGCAAGAACGAACCTGAGCGAAAATGAACAACACCAGATTGCAAGATTTGTGGGAGGTCTCCGACTCGACATCAAGGAAAAAGTCAAACTACAACCATTCCGTTTCTTGTCTGAAGCAATATCCTTTGCAGAAACAGTGGAAGAAATGATTGCGGTTCGATCCAAAAACCTAAAGAGAAGACCAGCATGGGAGACAACTTCAACAAGAATGAACAATTATGCGGACAAAACAAACGACCAACCCTCAACCTCAACAAAAGGAAAAGGGAAGGAAGTTGAAAATCAAGAAGTAGCCGTTGAAAGAAAGAATGAACAAACATTCAAAACCAGTAGTCAGAACAACTACTCCCGCCCTTTATTAGGAAAATTCTTCCGATGTGGCCAAACTGAACACCTCTCCAACAACTGCCCGCAAAGAAAAACCATAGCAATAGCCGAAGAAGGAAGGCAGATGAGTGAAGATAGTAAAGAAGCAGAAGACGAAACTGAACTGATTGAAGCAGATGACGAGGAAAGGGTCTCTTGTGTCATCCAACGGGCAAGATGCACCATAAACGGAAGGGTATGTGATGTAATCATAGACAACGACAGTAGCAAAAACTTCGTAGCAAAGAAACTAGTAACAGTCTTGAACCTAAAGGCTGAAGCACATCCAACCCCCTACAAGATAGGTTGGGTAAGAAAAGGAGGAGAAGTCACGGTTAGCGAAATCTGCACAGTCCCTCTCTCCATTGAAAACGCCTACAAAGACCAAATTGTTTGTGACGTCATTGAGATGGACGTATGCCATCTCCTATTAGGAAGACCTTGGCAGTATGATACCCAATCCTTACACAAAGGAAGAGAAAATACGTATGAATTACAATGGATGGGGAGAAAGGTAGTTCTACTCCCAATAACAAGAAAGAATAAGGAAGGATTAAGAGGTGAGAAACAACTATTCACCACCCAGCTCCTTTATGAGTTCCCACGCATAAAGGAAGAACCAGAGGGACTCCCACCTCTTCGAGACATACAGCACCACATAGACTTGATCTCGGGAGCATCATTACCAAACTTGGCTCACTATAGGATGAGTCCCCAGGAGTACAAAACACTTCATGACCATATTGAGGAACTATTAAAGAAAGGGCACAT
CCAACCGAGCCTCAGCCCTTGTGCAGTACCAGCCCTTCTCACACCAAAGAAATATGGGAGTTGGAGAATGTGTGTTGACAGCAGAGCCATCAATCGTATCACGGTAAAGTATAGATTTTCCATCCCAAGGATTAGTGACCTGCTTGATCAACTCGGCAAAGCCAGCATTTTTTCGAAGATTGATTTAAAAAGTGGCTACCACCAAATACGTATAAGACCTGGCGATGAATGGAAAACAACTTTCAAGACAAACGAAGGCTTATTTGAATGGATGCACAAACAACGAGGAGCATTTACTACATCTAAGAAAAATGTTCCAGGCTTGACAGAGACAGAACTCTACATCAACACTAAGAAAAGCATGTTTATGAAAAGAGAAATTGCATTCCTCGATTTTGTAATCAAACAAGGAAGCATAAGCATGGAACCAAAGAAGATTGAAGCCATCCACACATGGCCGATTCCTGCCTCCATTAAAGAAATACAAGCCTTCCTTGGCCTGGCTTCATTTTACAGAAAATTCATCAGGAACTTCAACTCTTTAGCCGCACCCCTCACCGACTACTTCTCTTCACCTTTTGAAGTAGAAGTCGACGCATGCTGCACAGGGATTGGAGTTGTCCTAGCTCAGCAAGGACACCCTATCGAATACTTCAGTGAAAAGCTCAACCCCTCAAGACAGTCATGGAGCACATATGAACAAGAGTTGTATGCCCTTGTGCGAGCACTAAAACAATGGGAGCACTACCTACTCTCCAAAGAATTCGTACTCCTAACTGATCACTTCTCACTAAAGCAAAGACAACAAGGTGGCCGATGCCCTAAGCAGAAAAGGCTTCCTACTCACATTGTTGTCTTCGAAAATCATAGCATTCAAGCATTTACCCGACCTATACGAAGAAGATATTGA

Protein sequence

MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDSKEAEDETELIEADDEERVSCVIQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKQRQQGGRCPKQKRLPTHIVVFENHSIQAFTRPIRRRY*
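
The protein sequence above can be checked against the coding sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` and the use of "X" for unrecognized codons are choices made for this example, not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        for i in range(0, usable, 3)
    )

# First six codons of the CDS listed in this record.
print(translate("ATGGCTGGTCGAAGAGGT"))
```

The first six codons of the CDS give MAGRRG, matching the start of the protein sequence.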
Homology
BLAST of CSPI01G12910 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 2.0e-42
Identity = 128/418 (30.62%), Positives = 190/418 (45.45%), Query Frame = 0

Query: 593 EGLRGEKQLFTTQLLYEFPRIK-EEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEY 652
           E L  E++     LL ++  I+  E + L      +H I+  +  +LP  + Y   PQ Y
Sbjct: 163 EHLNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTIN--TKHNLPLYSKYSY-PQAY 222

Query: 653 -KTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGS-----WRMCVDSRAINRITVKYR 712
            + +   I+++L +G I+ S SP   P  + PKK  +     +R+ +D R +N ITV  R
Sbjct: 223 EQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDR 282

Query: 713 FSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG-- 772
             IP + ++L +LG+ + F+ IDL  G+HQI + P    KT F T  G +E++    G  
Sbjct: 283 HPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLK 342

Query: 773 ------------------------------AFTTSKKN--------VPGLTETELYINTK 832
                                          F+TS              L +  L +   
Sbjct: 343 NAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLD 402

Query: 833 KSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNF 892
           K  F+K+E  FL  V+    I   P+KIEAI  +PIP   KEI+AFLGL  +YRKFI NF
Sbjct: 403 KCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNF 462

Query: 893 NSLAAPLT--------------DY---------------------FSSPFEVEVDACCTG 929
             +A P+T              +Y                     F+  F +  DA    
Sbjct: 463 ADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVA 522
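
The two HSP summary lines in each hit (bit score with E-value, then identity and positives counts) follow a fixed pattern and can be parsed to recompute the percentages. This is a rough sketch: `parse_hsp` and the dictionary keys are hypothetical names, and the regular expressions are fitted only to the line format shown in this record.

```python
import re

HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# 'Posi?tives' also accepts the misspelling 'Postives' seen in some reports.
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp(score_line, identity_line):
    """Extract score, E-value and recomputed percentages from one HSP header."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP summary")
    ident, length, pos, pos_len = map(int, i.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "aligned_length": length,
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 176.0 bits (445), Expect = 2.0e-42",
    "Identity = 128/418 (30.62%), Positives = 190/418 (45.45%), Query Frame = 0",
)
print(hsp)
```

Recomputing the percentages from the raw counts is a quick consistency check on a scraped report.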

BLAST of CSPI01G12910 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 175.6 bits (444), Expect = 2.7e-42
Identity = 113/386 (29.27%), Positives = 179/386 (46.37%), Query Frame = 0

Query: 626 IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKY 685
           ++H I++  GA LP L  Y ++ +  + ++  +++LL    I PS SPC+ P +L PKK 
Sbjct: 584 VKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKD 643

Query: 686 GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWK 745
           G++R+CVD R +N+ T+   F +PRI +LL ++G A IF+ +DL SGYHQI + P D +K
Sbjct: 644 GTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYK 703

Query: 746 TTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN---- 805
           T F T  G +E+     G                               F+ S +     
Sbjct: 704 TAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHWKH 763

Query: 806 ----VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKE 865
               +  L    L +  KK  F   E  FL + I    I+    K  AI  +P P ++K+
Sbjct: 764 LDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQ 823

Query: 866 IQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------ 925
            Q FLG+ ++YR+FI N + +A P+  +                       +SP      
Sbjct: 824 AQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQWTEKQDKAIDKLKDALCNSPVLVPFN 883

Query: 926 ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRA 936
               + +  DA   GIG VL +  +       + YFS+ L  +++++   E EL  +++A
Sbjct: 884 NKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKA 943

BLAST of CSPI01G12910 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 175.6 bits (444), Expect = 2.7e-42
Identity = 113/386 (29.27%), Positives = 179/386 (46.37%), Query Frame = 0

Query: 626 IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKY 685
           ++H I++  GA LP L  Y ++ +  + ++  +++LL    I PS SPC+ P +L PKK 
Sbjct: 610 VKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPCSSPVVLVPKKD 669

Query: 686 GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWK 745
           G++R+CVD R +N+ T+   F +PRI +LL ++G A IF+ +DL SGYHQI + P D +K
Sbjct: 670 GTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYK 729

Query: 746 TTFKTNEGLFEWMHKQRG------------------------------AFTTSKKN---- 805
           T F T  G +E+     G                               F+ S +     
Sbjct: 730 TAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHWKH 789

Query: 806 ----VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKE 865
               +  L    L +  KK  F   E  FL + I    I+    K  AI  +P P ++K+
Sbjct: 790 LDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQ 849

Query: 866 IQAFLGLASFYRKFIRNFNSLAAPLTDYF----------------------SSP------ 925
            Q FLG+ ++YR+FI N + +A P+  +                       +SP      
Sbjct: 850 AQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQWTEKQDKAIEKLKAALCNSPVLVPFN 909

Query: 926 ----FEVEVDACCTGIGVVLAQQGHP------IEYFSEKLNPSRQSWSTYEQELYALVRA 936
               + +  DA   GIG VL +  +       + YFS+ L  +++++   E EL  +++A
Sbjct: 910 NKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKA 969

BLAST of CSPI01G12910 vs. ExPASy Swiss-Prot
Match: P20825 (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 169.1 bits (427), Expect = 2.5e-40
Identity = 122/447 (27.29%), Positives = 192/447 (42.95%), Query Frame = 0

Query: 566 HKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRD 625
           H  +E T++L+ +  K        +N E   GEK  FT  + +              + +
Sbjct: 162 HLNQEETFKLKGLLNKF-------RNLEYKEGEKLTFTNTIKH--------------VLN 221

Query: 626 IQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKK- 685
             H+  + S         Y ++      + + ++E+L +G I+ S SP   P  + PKK 
Sbjct: 222 TTHNSPIYS-------KQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKP 281

Query: 686 ----YGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRP 745
                  +R+ +D R +N IT+  R+ IP + ++L +LGK   F+ IDL  G+HQI +  
Sbjct: 282 DASGANKYRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDE 341

Query: 746 GDEWKTTFKTNEGLFEWMHKQRG--------------------------------AFTTS 805
               KT F T  G +E++    G                                 F+TS
Sbjct: 342 ESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTS 401

Query: 806 KKN--------VPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWP 865
                         L +  L +   K  F+K+E  FL  ++    I   P K++AI ++P
Sbjct: 402 LTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYP 461

Query: 866 IPASIKEIQAFLGLASFYRKFIRNFNSLAAPLT--------------DY----------- 925
           IP   KEI+AFLGL  +YRKFI N+  +A P+T              +Y           
Sbjct: 462 IPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALI 521

Query: 926 ----------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYA 933
                     F   F +  DA    +G VL+Q GHPI + S  LN    ++S  E+EL A
Sbjct: 522 IRDPILQLPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLA 580

BLAST of CSPI01G12910 vs. ExPASy Swiss-Prot
Match: P10401 (Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 1.0e-38
Identity = 113/413 (27.36%), Positives = 196/413 (47.46%), Query Frame = 0

Query: 654 LHDHIEELLKKGHIQPSLSPCAVPALLTPKK----YG--SWRMCVDSRAINRITVKYRFS 713
           +++ +++LLK G I+PS SP   P  +  KK    +G  + R+ +D R +N  T+  R+ 
Sbjct: 197 VNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYP 256

Query: 714 IPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEW---------- 773
           +P I  +L  LGKA  F+ +DLKSGYHQI +   D  KT+F  N G +E+          
Sbjct: 257 MPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNA 316

Query: 774 -----------MHKQRG-----------AFTTSKKN--------VPGLTETELYINTKKS 833
                      + +Q G            F+ ++ +        +  L +  + ++ +K+
Sbjct: 317 SSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKT 376

Query: 834 MFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNS 893
            F K  + +L F++ +     +P+K++AI  +P P  + ++++FLGLAS+YR FI++F +
Sbjct: 377 RFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAA 436

Query: 894 LAAPLTDY----------------------------------------------FSSPFE 953
           +A P+TD                                               F  PF+
Sbjct: 437 IARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFD 496

Query: 954 VEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLL-SK 970
           +  DA  +GIG VL+Q+G PI   S  L    Q+++T E+EL A+V AL + +++L  S+
Sbjct: 497 LTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYGSR 556

BLAST of CSPI01G12910 vs. ExPASy TrEMBL
Match: A0A5D3E417 (Transposon Ty3-I Gag-Pol polyprotein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G00630 PE=4 SV=1)

HSP 1 Score: 1011.9 bits (2615), Expect = 1.8e-291
Identity = 536/899 (59.62%), Positives = 636/899 (70.75%), Query Frame = 0

Query: 69  PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQ 128
           P R E   +N++  G   GRRAR N++N  N    QRRRP     Q  D++ QE+ E WQ
Sbjct: 32  PARIEAYARNDENRG---GRRARRNYKNFPN----QRRRPTDIPLQYADDNSQEEYEHWQ 91

Query: 129 EIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAF 188
             Q+ DSS GDEQGN+WN + + R  +  +  E RR  YHDYKMKIDLP Y+GK++IE+F
Sbjct: 92  NTQDHDSSIGDEQGNIWNDDGEFRMAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESF 151

Query: 189 LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKK 248
           LDWIK+TENFF YM  P+RKKVHLVALKL+ GAS W                        
Sbjct: 152 LDWIKNTENFFKYMVPPDRKKVHLVALKLKGGASAW------------------------ 211

Query: 249 LLKARFLPPNYEQTL---YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFV 308
                  P +Y Q +   Y+QYQNCRQG + V +YIEEFH L AR NLSENEQHQIARF+
Sbjct: 212 -------PVSYPQIMNRHYSQYQNCRQGSQLVAEYIEEFHRLGARINLSENEQHQIARFI 271

Query: 309 GGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT 368
           GGLR DIKEKVKL  FR LSEAIS AETVEEM+ VR KN  RR AWET  ++  +Y  KT
Sbjct: 272 GGLRFDIKEKVKLHSFRVLSEAISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKT 331

Query: 369 NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCP 428
           ++QPSTS   KGK ++ QE    +K E   +  +QNNY+RP LGK FRCG+  HLSNNC 
Sbjct: 332 DEQPSTSMVDKGKAIDIQE--TNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCS 391

Query: 429 QRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCVIQRARCTINGRVCDVIIDND 488
           QRKTIA+AE E   MS   +E E+ETELIEADD +R+SC++QR   T          +  
Sbjct: 392 QRKTIALAEDEDTYMSGTDEEEEEETELIEADDGDRISCIVQRVLITPK--------EET 451

Query: 489 SSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDV 548
           + ++    K    +N K   HP PYKIGWV+KGGE  ++EICT+PLSI N+YKDQIVCDV
Sbjct: 452 NPQHHSLFKTRCTINGKVYPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDV 511

Query: 549 IEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLF 608
           IEMDVCHLLLGRPWQ+DTQ+LH+GRENTYE QWMG+KV+LLP+ +KN E +R   ++QLF
Sbjct: 512 IEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLF 571

Query: 609 TT------------------------------------QLLYEFPRIKEEPEGLPPLRDI 668
            T                                    +L  EFP +K+EP+GLPPLRDI
Sbjct: 572 ITVSGKNLLKEREQDLLGLLVTDKSQGGNSEIVEPRLKELFAEFPHLKKEPQGLPPLRDI 631

Query: 669 QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYG 728
           QH IDL+  ASLPNL HYRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K G
Sbjct: 632 QHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDG 691

Query: 729 SWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKT 788
           SWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA IFSKIDL++GYHQI+IRPGDEWKT
Sbjct: 692 SWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKAMIFSKIDLRNGYHQIQIRPGDEWKT 751

Query: 789 TFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSI 848
            FKTNEGLFE   ++       +K    LTE ELYIN KK  ++ +EI FL F+IK+G I
Sbjct: 752 AFKTNEGLFECSSRE-DHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKI 811

Query: 849 SMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDA 908
            MEPKKIEAI + P P SIKE+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+A
Sbjct: 812 RMEPKKIEAIQSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDYFASPFEVAVNA 871

Query: 909 CCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL 926
           C TGIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEHYLLS +F ++
Sbjct: 872 CGTGIGAVLSQQGHPIEYFSEKLSTSRQSWSTYEQELYALVRALKQWEHYLLSGDFHIM 881

BLAST of CSPI01G12910 vs. ExPASy TrEMBL
Match: A0A5D3DGR0 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00870 PE=4 SV=1)

HSP 1 Score: 954.1 bits (2465), Expect = 4.4e-274
Identity = 542/1075 (50.42%), Positives = 681/1075 (63.35%), Query Frame = 0

Query: 1    MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ES 60
            M  +RG+ PAA E   ++ A E   LSPRT++  L +VE S+ ++R + + ++     E+
Sbjct: 1    MINQRGRAPAAKE---RQEAGETPILSPRTSSRCLRSVEASIEEIRQLLNGVVHRLDEEN 60

Query: 61   VELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRR---RPGYAI 120
             +L  R  E P          QN  R    RGRR  E FR     R FQ R      + +
Sbjct: 61   AQLNDRDVEPP--------TLQNWGR----RGRRGLEYFR---PQRNFQERIIPEDQWLL 120

Query: 121  PQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYH 180
            PQ      + D+ + WQ  +E+ ++SS  E+ +    NDD+   R ++  +NE ++ E  
Sbjct: 121  PQG-----RRDRRIEWQAREEEIENSSSSEESD----NDDINEFRRHRYVQNERQQRENS 180

Query: 181  DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQL 240
            +YKMKIDLP YDGK+NIE FLDW+K+TENFF YM   + KKVHLVALKL+ GAS WWDQ+
Sbjct: 181  EYKMKIDLPSYDGKRNIENFLDWLKNTENFFAYMGTTKNKKVHLVALKLKGGASAWWDQI 240

Query: 241  EINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS 300
             +NRQ+ GK  IRSWEKMKKL+K RF+PPNYEQTLY QYQNCRQG+R   +YIEEFH L 
Sbjct: 241  TVNRQKQGKHPIRSWEKMKKLMKQRFVPPNYEQTLYTQYQNCRQGMRKTAEYIEEFHRLG 300

Query: 301  ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRR 360
             RTNL E E+H I+ FVGGLR D+KEKVKLQPF+ LSEAI++AETVEEMI  R+K+ ++R
Sbjct: 301  GRTNLMEGEKHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKR 360

Query: 361  PAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLL 420
            P WE ++++      KT    S       ++   QE +  +K     +   +N Y RP  
Sbjct: 361  P-WEPSASK------KTTAGNSKLKNATSEKPVEQEESSGKKEVPEGEKKGKNPYQRPFS 420

Query: 421  GKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR 480
            G  +RCGQ  H SN CPQRKTIA+A++    S  S  E ++ETE+IEAD+ + +SC++QR
Sbjct: 421  GNCYRCGQMGHPSNQCPQRKTIAVAKDNDDGSNRSLGEFDEETEVIEADEGDSLSCILQR 480

Query: 481  ------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPY 540
                               RCTI G+VC+VIID+ SS+NFV+KKLVT LNLK + H  PY
Sbjct: 481  VLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPY 540

Query: 541  KIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR 600
            KIGW++KGGE  +SEIC VPLSI N+YKDQ+VCDVIEMDVCH+LLGRPWQ+D QS+H+GR
Sbjct: 541  KIGWIKKGGETLISEICYVPLSIGNSYKDQMVCDVIEMDVCHILLGRPWQFDVQSMHRGR 600

Query: 601  ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT---------------------- 660
            ENTYE  WM +KV+LLP+ ++  + +   ++   LF T                      
Sbjct: 601  ENTYEFMWMNKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGT 660

Query: 661  --------------QLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQE 720
                          +L  ++P+I +EP  LPPLRDI H+I+L+SGAS P+L HY MSP E
Sbjct: 661  EDTTRDEQIPEAIKELFKKYPKISKEPTCLPPLRDIHHNIELLSGASFPHLPHYHMSPNE 720

Query: 721  YKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPR 780
            YK LHD IEELLKKGHI+PS S C VPALLTPKK G+WRMCVDSRAIN+ITVKYRF IPR
Sbjct: 721  YKILHDAIEELLKKGHIKPSFSLCVVPALLTPKKDGTWRMCVDSRAINKITVKYRFPIPR 780

Query: 781  ISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEW------------- 840
            +SDLLDQLG A IFSKIDL+S YHQIRIRPGDEWKT FKTNEGLFEW             
Sbjct: 781  VSDLLDQLGGACIFSKIDLRSDYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFDLSNAPST 840

Query: 841  ----MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM 900
                M+K    F                           +    L   ELY+N KK +F 
Sbjct: 841  FMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFC 900

Query: 901  KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAA 933
              EIAFL F+I++  + M+ KK+EAI  W  P ++ ++QAFLGLASFYRKFI+N +S+AA
Sbjct: 901  SNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVIQVQAFLGLASFYRKFIQNCSSIAA 960

BLAST of CSPI01G12910 vs. ExPASy TrEMBL
Match: A0A5A7T256 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold285G003810 PE=4 SV=1)

HSP 1 Score: 876.7 bits (2264), Expect = 8.9e-251
Identity = 449/702 (63.96%), Positives = 527/702 (75.07%), Query Frame = 0

Query: 264 YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRF 323
           Y+QYQNCRQG + V +YIEEFH LSAR NLSENEQHQIARF+GGLR DIKEKVKL  FR 
Sbjct: 5   YSQYQNCRQGSQFVAEYIEEFHRLSARINLSENEQHQIARFIGGLRFDIKEKVKLHSFRV 64

Query: 324 LSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQ 383
           LSEAIS AETVEEM+ VR KN  RR AWET  ++  +Y  KT++QPSTS   KGK ++ Q
Sbjct: 65  LSEAISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKTDEQPSTSMVDKGKAIDIQ 124

Query: 384 EVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAE-EGRQMS-E 443
           E    +K E   +  +QNNY+RP LGK FRCG+  HLSNNC QRKTIA+AE E   MS  
Sbjct: 125 E--TNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCSQRKTIALAEDEDTYMSGT 184

Query: 444 DSKEAEDETELIEADDEERVSCVIQRARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLK 503
           D +E E+ETELIEADD +R+SC++QR   T          +  + ++    K    +N K
Sbjct: 185 DREEEEEETELIEADDGDRISCIVQRVLITPK--------EETNPQHHSLFKTRCTINGK 244

Query: 504 AEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYD 563
              HP PYKIGWV+KGGE  ++EICT+PLSI N+YKDQIVCDVIEMDVCHLLLGRPWQ+D
Sbjct: 245 VYPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDVIEMDVCHLLLGRPWQHD 304

Query: 564 TQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLFTT--------------- 623
           TQ+LH+GRENTYE QWMG+KV+LLP+ +KN E +R   ++QLF T               
Sbjct: 305 TQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLFITVSGKNLLKEREQDLL 364

Query: 624 ---------------------QLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAH 683
                                +L  EFP +K+EP+GLPPLRDIQH IDL+  ASLPNL H
Sbjct: 365 GLLVTDKSQGGNSEIVEPRLKELFAEFPHLKKEPQGLPPLRDIQHQIDLVPRASLPNLPH 424

Query: 684 YRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVK 743
           YRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K GSWRMCVDSRAINR+T K
Sbjct: 425 YRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDGSWRMCVDSRAINRVTGK 484

Query: 744 YRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG 803
           YRF IPRI DLLDQLGKA IFSKIDL++GYHQI+IRPGDEWKT FKTNEGLFE   ++  
Sbjct: 485 YRFPIPRIGDLLDQLGKAMIFSKIDLRNGYHQIQIRPGDEWKTAFKTNEGLFECSSRE-D 544

Query: 804 AFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEPKKIEAIHTWPIPA 863
                +K    LTE ELYIN KK  ++ +EI FL F+IK+G I MEPKKIEAIH+ P P 
Sbjct: 545 HLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKIRMEPKKIEAIHSRPTPT 604

Query: 864 SIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDACCTGIGVVLAQQGHPIE 923
           SIKE+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+AC TGIG VL+QQGHPIE
Sbjct: 605 SIKEVQAFLGLASFYRRFIRNFSLIVAPLTDYFASPFEVAVNACGTGIGAVLSQQGHPIE 664

Query: 924 YFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL 926
           YFSEKL+ SRQSWSTYEQELYALVRALKQWEH+LLS +F ++
Sbjct: 665 YFSEKLSTSRQSWSTYEQELYALVRALKQWEHHLLSGDFHIM 695

BLAST of CSPI01G12910 vs. ExPASy TrEMBL
Match: A0A5A7V4G7 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold468G00370 PE=4 SV=1)

HSP 1 Score: 783.1 bits (2021), Expect = 1.3e-222
Identity = 468/998 (46.89%), Positives = 559/998 (56.01%), Query Frame = 0

Query: 7   KNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLSRR-KEY 66
           K P+ G    QE  EEI  LS RT+TVRLLAVE+SLGDL    DR+++ ++ L+RR  E 
Sbjct: 34  KQPSRGGYDEQEEVEEIATLSLRTSTVRLLAVEDSLGDLHGKIDRMMDYLDALTRRMNEL 93

Query: 67  PQPPPRNEINFQNNQRFGEARGRR-ARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQ 126
           P P        +   R    RG R  R N+RN  N R  QRRRP     Q  D++ QE+ 
Sbjct: 94  PAP-----ARIKAYARIDGNRGSRWVRRNYRNFPNQRNNQRRRPTDIPLQYADDNSQEEY 153

Query: 127 EVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQN 186
           E WQ I                                                      
Sbjct: 154 EHWQNI------------------------------------------------------ 213

Query: 187 IEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWE 246
                                  KKVHLVALKL+ GAS WWDQLE+NRQ+          
Sbjct: 214 -----------------------KKVHLVALKLKGGASAWWDQLEVNRQK---------- 273

Query: 247 KMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARF 306
                                                                       
Sbjct: 274 ------------------------------------------------------------ 333

Query: 307 VGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADK 366
                                       + EEM+ VR KN  +R  WET  ++  +   K
Sbjct: 334 ----------------------------SEEEMMIVRLKNSNKRATWETNPSKKQSSGKK 393

Query: 367 TNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNC 426
           T++QPSTS   KGK ++ QE     K E   +  +QNNY+RP LGK FRCG+ +HLSNNC
Sbjct: 394 TDEQPSTSVVDKGKAIDIQET---NKKESVVRGKTQNNYTRPSLGKCFRCGEPDHLSNNC 453

Query: 427 PQRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCVIQR---------------- 486
           PQRKTIA+AE E   MSE  KE ++E ELIEAD+ +R+SC++QR                
Sbjct: 454 PQRKTIALAEDEDTYMSEADKEEKEEIELIEADNGDRISCIVQRVLITLKEERNPQRHSL 513

Query: 487 --ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEI 546
              RCTI+G+VCDVIID+ SS+NFVA+KLV  LNLK + HP PYKIGWV+K GE  ++EI
Sbjct: 514 FKTRCTISGKVCDVIIDSGSSENFVARKLVASLNLKIDPHPDPYKIGWVKKEGETLINEI 573

Query: 547 CTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLL 606
           CT+PLSI N+YKDQIVCDVIEMDVCHLLL RPW+ D                    ++ L
Sbjct: 574 CTIPLSIVNSYKDQIVCDVIEMDVCHLLLDRPWEQD--------------------LLGL 633

Query: 607 PITRKNKEGLRGEKQLFTTQLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYR 666
            +  K++ G     +    +L  EFP +K+EP+GLPPL DIQH IDL+ GASLP+L HYR
Sbjct: 634 VVAEKSQGGNSEIVEPRLKELFAEFPHLKKEPQGLPPLHDIQHQIDLVPGASLPDLPHYR 693

Query: 667 MSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYR 726
           MSP+EY+ LHD+IE LLKKGHI+PSLSPC VPALLTPKK  SWRMCVDSRAINRITVKY 
Sbjct: 694 MSPEEYQVLHDYIENLLKKGHIKPSLSPCVVPALLTPKKDESWRMCVDSRAINRITVKYW 753

Query: 727 FSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWMHKQRG-- 786
           F IP++ DLLDQLGKA++FSKIDL+S YHQIRIRP DEWKTTFK NEGLFEW+    G  
Sbjct: 754 FPIPQVGDLLDQLGKAAVFSKIDLRSDYHQIRIRPEDEWKTTFKINEGLFEWLAMPFGLS 813

Query: 787 ----AFTTS---------KKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSISMEP 846
                FT+          +K    L E ELYIN KK  F+ +EI FL F+IK+G I MEP
Sbjct: 814 NAPSTFTSRSREDHLQHLRKLFQVLIEIELYINPKKCTFLIKEIVFLGFLIKEGKIGMEP 828

Query: 847 KKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDY--------------- 906
           KK+EAI +WP P SIKE+QAFLGLASFY++FIRNF+S+  PLTDY               
Sbjct: 874 KKVEAIQSWPAPTSIKEVQAFLGLASFYKRFIRNFSSIVTPLTDYLKKENFKWEHMQQQS 828

Query: 907 ------------------FSSPFEVEVDACCTGIGVVLAQQGHPIEYFSEKLNPSRQSWS 936
                             F+SPFEV VDAC  GIG VL+QQGHPIEYFSEKL+ SRQSWS
Sbjct: 934 FEEIKRRLTSSPILQLPDFTSPFEVVVDACGIGIGTVLSQQGHPIEYFSEKLSASRQSWS 828

BLAST of CSPI01G12910 vs. ExPASy TrEMBL
Match: A0A5B7BER3 (Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_036800 PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 1.9e-213
Identity = 417/946 (44.08%), Positives = 558/946 (58.99%), Query Frame = 0

Query: 135  SSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKS 194
            +++ D      +F  D   GR     + R     +Y+MKIDLP ++G  +IE+FLDWI  
Sbjct: 88   ANNSDSDEEFADFPADRGYGR-----DPRGQNTQEYRMKIDLPSFNGHLHIESFLDWISE 147

Query: 195  TENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARF 254
             E FF+ M+I + K+V LVA KL+ GAS WWDQ++ NR+R GKQ +R+W+KM++LL+ RF
Sbjct: 148  VETFFDCMEISDDKQVKLVAYKLKGGASAWWDQVQQNRRRQGKQPVRTWQKMRRLLRERF 207

Query: 255  LPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKE 314
            LP +YEQ LY QYQNCRQG RSV +Y +EF+ LS+R NL+E E  Q+AR+VGGLR  I++
Sbjct: 208  LPVDYEQVLYQQYQNCRQGGRSVSEYSQEFNTLSSRNNLTETENQQVARYVGGLRATIQD 267

Query: 315  KVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTK 374
            ++ L+    L+EA S A  VE           R+P     S R  +Y D + +Q +   +
Sbjct: 268  QLNLRTIWNLNEATSLALKVE-------AQQSRQPLRSQNSAR--SYPDSSRNQQNRDKQ 327

Query: 375  GKG-----KEVENQEVAVERKNEQTFKTSSQ---NNYSRPLLGKFFRCGQTEHLSNNCPQ 434
             +G     +++  ++ A   KN+ T    SQ   N Y+RP+ GK FRC Q  H SN CP 
Sbjct: 328  IEGVVPQPQKITPRDQASSSKNQNTPIAPSQKSTNPYARPIPGKCFRCQQPGHRSNECPN 387

Query: 435  RK---TIAIAEEGRQMSEDSKEAE--DE---TELIEADDEERVSCVIQ------------ 494
            R+    + + E+     E+ +EAE  DE    E+ E D+ E VSCV+Q            
Sbjct: 388  RRQVNMVGVTEDNSPDFENEEEAEYQDEYGGAEITEGDEGEHVSCVVQRLLLVPKQEVDP 447

Query: 495  ------RARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEV 554
                  R RCTIN +VCDVIID+ SS+N V+K LV  L LK E HP PYKIGW++KG E 
Sbjct: 448  QRHNIFRTRCTINQKVCDVIIDSGSSENIVSKALVKALQLKTEKHPNPYKIGWIKKGAET 507

Query: 555  TVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGR 614
             V+EIC VP SI   YKD++ CD+++MD CH+LLGRPWQ+D  + HKG++NTY   W  +
Sbjct: 508  KVTEICRVPFSIGKVYKDEVACDIVDMDACHVLLGRPWQFDVDATHKGKDNTYLFWWHDK 567

Query: 615  KVVLLP------------------ITRKNKEGLRGEKQL--------------------- 674
            KVVL+P                  +T    + +   K+                      
Sbjct: 568  KVVLVPNEKGSNLPKTSKVEGRSLLTVAGSQFMEDAKEAGQIIVMIVKGKTGPEPPDVPE 627

Query: 675  FTTQLLYEFPRI--KEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIE 734
                LL EF  I   E P+ LPP+RDIQHHIDL+ GASLPNL HYRMSP+E + L   +E
Sbjct: 628  ILQPLLAEFQDITPSELPDHLPPMRDIQHHIDLVPGASLPNLPHYRMSPKENEILQQQVE 687

Query: 735  ELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLG 794
            +L+ KG IQ S+SPCAVPALLTPKK GSWRMCVDSRAIN+ITVKYRF IPR++D+LD L 
Sbjct: 688  DLINKGFIQESMSPCAVPALLTPKKDGSWRMCVDSRAINKITVKYRFPIPRLNDMLDMLE 747

Query: 795  KASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM--------------------- 854
             + IFSKIDL+SGYHQIRIRPGDEWKT FKT EGL+EW+                     
Sbjct: 748  GSKIFSKIDLRSGYHQIRIRPGDEWKTAFKTKEGLYEWLVMPFGLSNAPSTFMRIMNQVL 807

Query: 855  -------------------HKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDF 914
                                 +R      ++ +  L E++LYIN KK  F+   + FL F
Sbjct: 808  KPFIGKFVVVYFDDILIYSKSEREHLEHVREVLLALRESKLYINMKKCCFLTTRLLFLGF 867

Query: 915  VIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTD----- 933
            +I    I ++ +K+ AI  WP P ++ +I++F GLA+FYR+FIRNF+S+ AP+TD     
Sbjct: 868  IIGSEGIQVDEEKVRAIRDWPTPKTVHDIRSFHGLATFYRRFIRNFSSIVAPITDCMKKG 927

BLAST of CSPI01G12910 vs. NCBI nr
Match: XP_031744062.1 (uncharacterized protein LOC116404773 [Cucumis sativus])

HSP 1 Score: 1040.0 bits (2688), Expect = 1.3e-299
Identity = 533/669 (79.67%), Positives = 549/669 (82.06%), Query Frame = 0

Query: 144 MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMD 203
           MWN NDDLRAGRNN+R EVRRGEYHDYKMKIDLP+YDGK+NIEAFLDWIKSTENFFNYMD
Sbjct: 1   MWNLNDDLRAGRNNRRIEVRRGEYHDYKMKIDLPMYDGKRNIEAFLDWIKSTENFFNYMD 60

Query: 204 IPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL 263
            PERKKVHLVALKLRAGAS WWDQLEINRQRCGKQ IRSWEKMKKLLKARFLPPNYEQTL
Sbjct: 61  TPERKKVHLVALKLRAGASAWWDQLEINRQRCGKQPIRSWEKMKKLLKARFLPPNYEQTL 120

Query: 264 YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRF 323
           YNQYQNCRQGVRSV DYIEEFH LSARTNLSENEQHQ+ARFVG                 
Sbjct: 121 YNQYQNCRQGVRSVADYIEEFHRLSARTNLSENEQHQVARFVG----------------- 180

Query: 324 LSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQ 383
                   ETVEEMIA+RSKNL RR AWETTST+      KTNDQPSTSTKGKGKEV+NQ
Sbjct: 181 --------ETVEEMIAIRSKNLNRRSAWETTSTK-----SKTNDQPSTSTKGKGKEVDNQ 240

Query: 384 EVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS 443
           EVAVERK EQTFK S QN+YSRP LGK FRCGQT HLSNNCPQRKTIAIAEEG Q SEDS
Sbjct: 241 EVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHLSNNCPQRKTIAIAEEGGQTSEDS 300

Query: 444 KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDS 503
            EAE+ETELIEADD ERVSC IQR                   RCTINGRVCDVIID+ S
Sbjct: 301 IEAEEETELIEADDGERVSCFIQRVLIMPKEEKNLQRHCLFKTRCTINGRVCDVIIDSGS 360

Query: 504 SKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVI 563
           S+NFVAKKLV VLNLKAEAHPTPYKIGWVRKGGE TVSEICTVPLSI NAYKDQIVCDVI
Sbjct: 361 SENFVAKKLVIVLNLKAEAHPTPYKIGWVRKGGEATVSEICTVPLSIGNAYKDQIVCDVI 420

Query: 564 EMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEKQLFTT- 623
           EMDVCHLLLGRPWQYDTQSLHKGRENTYE QWMGRKVVLLPIT+K  EGLRGEKQLF T 
Sbjct: 421 EMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKVVLLPITKKINEGLRGEKQLFITV 480

Query: 624 -----------------------------------QLLYEFPRIKEEPEGLPPLRDIQHH 683
                                              QLL+EFP IKEEP+GLPPLRDIQHH
Sbjct: 481 SGKKMLKEREQYILGLVVIEKTKEKQVEDIEPKLQQLLHEFPHIKEEPKGLPPLRDIQHH 540

Query: 684 IDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWR 743
           IDLI GASLPNLAHYRMSPQEYK LHDHIEELLKKGHI+PSLSPCAVPALLTPKK GSWR
Sbjct: 541 IDLIPGASLPNLAHYRMSPQEYKILHDHIEELLKKGHIKPSLSPCAVPALLTPKKDGSWR 600

Query: 744 MCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFK 759
           MCVDSRAINRITVKYRF IPRISDLLDQLGKASIFSKIDLKSGYHQIR+RPGDEWKT FK
Sbjct: 601 MCVDSRAINRITVKYRFPIPRISDLLDQLGKASIFSKIDLKSGYHQIRVRPGDEWKTAFK 639

BLAST of CSPI01G12910 vs. NCBI nr
Match: XP_031741035.1 (uncharacterized protein LOC116403692 [Cucumis sativus])

HSP 1 Score: 1027.3 bits (2655), Expect = 8.4e-296
Identity = 524/622 (84.24%), Positives = 550/622 (88.42%), Query Frame = 0

Query: 1   MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLIESVELLS 60
           MAGRR  NPA GENR QE AEEIT LSP+T+TVRLLAVEESLGDL N FDRL+ESVELL+
Sbjct: 1   MAGRRVNNPATGENRVQEAAEEITVLSPKTSTVRLLAVEESLGDLHNKFDRLMESVELLN 60

Query: 61  RRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDF 120
           RR+E+PQPPPRNEINFQN+QRFGE RGRRAR   RN+NNPRG QRRRPGYAI QQ DEDF
Sbjct: 61  RREEFPQPPPRNEINFQNDQRFGETRGRRARGYVRNMNNPRGLQRRRPGYAIQQQLDEDF 120

Query: 121 QEDQEVWQEIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYD 180
           QEDQE WQE QEDDSSSGDEQGNMWNFND+ RAGRNNQR E RRGEYHDYKMKIDLP+YD
Sbjct: 121 QEDQEAWQETQEDDSSSGDEQGNMWNFNDEARAGRNNQRIEARRGEYHDYKMKIDLPMYD 180

Query: 181 GKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSI 240
           GK+NIEAFLDWIKSTENFFNYMD PERKKVHLVALKLRAGAS WWDQLEINRQRCGKQ +
Sbjct: 181 GKRNIEAFLDWIKSTENFFNYMDTPERKKVHLVALKLRAGASAWWDQLEINRQRCGKQPV 240

Query: 241 RSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQ 300
           RSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSV +YIEEFH LSARTNLSENEQHQ
Sbjct: 241 RSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVAEYIEEFHRLSARTNLSENEQHQ 300

Query: 301 IARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNN 360
           +ARFVGGLR DIKEKV+LQPFRFLSEAISFAETVEEMIA+RSKNL RR AWET ST+   
Sbjct: 301 VARFVGGLRFDIKEKVRLQPFRFLSEAISFAETVEEMIAIRSKNLNRRSAWETNSTK--- 360

Query: 361 YADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHL 420
              KTNDQPSTSTK KGKE++NQEVAVERK EQTFK S QN+YSRP LGK FRCGQT HL
Sbjct: 361 --SKTNDQPSTSTKAKGKEIDNQEVAVERKKEQTFKPSGQNSYSRPSLGKCFRCGQTGHL 420

Query: 421 SNNCPQRKTIAIAEEGRQMSEDSKEAEDETELIEADDEERVSCVIQR------------- 480
           S+NCPQRKTIAIAEEG Q+SEDS EAE+ETELIEADD ERVSCVIQR             
Sbjct: 421 SDNCPQRKTIAIAEEGGQISEDSIEAEEETELIEADDGERVSCVIQRLLITPKEEKNLQR 480

Query: 481 -----ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTV 540
                 RCTINGRVCDVIID+ SS+NFVAKKLVTVLNLKAEAHP PYKIGWVRKGGE TV
Sbjct: 481 HCLFKTRCTINGRVCDVIIDSGSSENFVAKKLVTVLNLKAEAHPNPYKIGWVRKGGEATV 540

Query: 541 SEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKV 600
           SEICTVPLSI NAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYE QWMGRKV
Sbjct: 541 SEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGRENTYEFQWMGRKV 600

Query: 601 VLLPITRKNKEGLRGEKQLFTT 605
           VLLPIT+K  EGLRGEKQLF T
Sbjct: 601 VLLPITKKINEGLRGEKQLFIT 617

BLAST of CSPI01G12910 vs. NCBI nr
Match: TYK30863.1 (transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 1011.9 bits (2615), Expect = 3.7e-291
Identity = 536/899 (59.62%), Positives = 636/899 (70.75%), Query Frame = 0

Query: 69  PPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRRRPGYAIPQQFDEDFQEDQEVWQ 128
           P R E   +N++  G   GRRAR N++N  N    QRRRP     Q  D++ QE+ E WQ
Sbjct: 32  PARIEAYARNDENRG---GRRARRNYKNFPN----QRRRPTDIPLQYADDNSQEEYEHWQ 91

Query: 129 EIQEDDSSSGDEQGNMWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAF 188
             Q+ DSS GDEQGN+WN + + R  +  +  E RR  YHDYKMKIDLP Y+GK++IE+F
Sbjct: 92  NTQDHDSSIGDEQGNIWNDDGEFRMAQGYRGQEARRETYHDYKMKIDLPTYNGKRDIESF 151

Query: 189 LDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKK 248
           LDWIK+TENFF YM  P+RKKVHLVALKL+ GAS W                        
Sbjct: 152 LDWIKNTENFFKYMVPPDRKKVHLVALKLKGGASAW------------------------ 211

Query: 249 LLKARFLPPNYEQTL---YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFV 308
                  P +Y Q +   Y+QYQNCRQG + V +YIEEFH L AR NLSENEQHQIARF+
Sbjct: 212 -------PVSYPQIMNRHYSQYQNCRQGSQLVAEYIEEFHRLGARINLSENEQHQIARFI 271

Query: 309 GGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKT 368
           GGLR DIKEKVKL  FR LSEAIS AETVEEM+ VR KN  RR AWET  ++  +Y  KT
Sbjct: 272 GGLRFDIKEKVKLHSFRVLSEAISLAETVEEMMTVRLKNSNRRTAWETNPSKKQSYGKKT 331

Query: 369 NDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCP 428
           ++QPSTS   KGK ++ QE    +K E   +  +QNNY+RP LGK FRCG+  HLSNNC 
Sbjct: 332 DEQPSTSMVDKGKAIDIQE--TNKKKESLVRGKTQNNYTRPSLGKCFRCGEPGHLSNNCS 391

Query: 429 QRKTIAIAE-EGRQMSEDSKEAEDETELIEADDEERVSCVIQRARCTINGRVCDVIIDND 488
           QRKTIA+AE E   MS   +E E+ETELIEADD +R+SC++QR   T          +  
Sbjct: 392 QRKTIALAEDEDTYMSGTDEEEEEETELIEADDGDRISCIVQRVLITPK--------EET 451

Query: 489 SSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDV 548
           + ++    K    +N K   HP PYKIGWV+KGGE  ++EICT+PLSI N+YKDQIVCDV
Sbjct: 452 NPQHHSLFKTRCTINGKVYPHPDPYKIGWVKKGGETLINEICTIPLSIGNSYKDQIVCDV 511

Query: 549 IEMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLR--GEKQLF 608
           IEMDVCHLLLGRPWQ+DTQ+LH+GRENTYE QWMG+KV+LLP+ +KN E +R   ++QLF
Sbjct: 512 IEMDVCHLLLGRPWQHDTQTLHRGRENTYEFQWMGKKVILLPLAKKNTESIRQKNKRQLF 571

Query: 609 TT------------------------------------QLLYEFPRIKEEPEGLPPLRDI 668
            T                                    +L  EFP +K+EP+GLPPLRDI
Sbjct: 572 ITVSGKNLLKEREQDLLGLLVTDKSQGGNSEIVEPRLKELFAEFPHLKKEPQGLPPLRDI 631

Query: 669 QHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYG 728
           QH IDL+  ASLPNL HYRMSP+EY+ LHDHIE+LLKKGHI+PSLSPCAVPALLTP K G
Sbjct: 632 QHQIDLVPRASLPNLPHYRMSPEEYQVLHDHIEDLLKKGHIKPSLSPCAVPALLTPNKDG 691

Query: 729 SWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKT 788
           SWRMCVDSRAINR+T KYRF IPRI DLLDQLGKA IFSKIDL++GYHQI+IRPGDEWKT
Sbjct: 692 SWRMCVDSRAINRVTGKYRFPIPRIGDLLDQLGKAMIFSKIDLRNGYHQIQIRPGDEWKT 751

Query: 789 TFKTNEGLFEWMHKQRGAFTTSKKNVPGLTETELYINTKKSMFMKREIAFLDFVIKQGSI 848
            FKTNEGLFE   ++       +K    LTE ELYIN KK  ++ +EI FL F+IK+G I
Sbjct: 752 AFKTNEGLFECSSRE-DHLQYLRKLFRVLTEIELYINPKKCTYLTKEIVFLGFLIKEGKI 811

Query: 849 SMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAAPLTDYFSSPFEVEVDA 908
            MEPKKIEAI + P P SIKE+QAFLGLASFYR+FIRNF+ + APLTDYF+SPFEV V+A
Sbjct: 812 RMEPKKIEAIQSRPTPTSIKEVQAFLGLASFYRRFIRNFSLIVAPLTDYFASPFEVAVNA 871

Query: 909 CCTGIGVVLAQQGHPIEYFSEKLNPSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLL 926
           C TGIG VL+QQGHPIEYFSEKL+ SRQSWSTYEQELYALVRALKQWEHYLLS +F ++
Sbjct: 872 CGTGIGAVLSQQGHPIEYFSEKLSTSRQSWSTYEQELYALVRALKQWEHYLLSGDFHIM 881

BLAST of CSPI01G12910 vs. NCBI nr
Match: KAA0054966.1 (transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] >TYK22755.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 954.1 bits (2465), Expect = 9.1e-274
Identity = 542/1075 (50.42%), Positives = 681/1075 (63.35%), Query Frame = 0

Query: 1    MAGRRGKNPAAGENRTQEVAEEITALSPRTTTVRLLAVEESLGDLRNIFDRLI-----ES 60
            M  +RG+ PAA E   ++ A E   LSPRT++  L +VE S+ ++R + + ++     E+
Sbjct: 1    MINQRGRAPAAKE---RQEAGETPILSPRTSSRCLRSVEASIEEIRQLLNGVVHRLDEEN 60

Query: 61   VELLSRRKEYPQPPPRNEINFQNNQRFGEARGRRARENFRNVNNPRGFQRR---RPGYAI 120
             +L  R  E P          QN  R    RGRR  E FR     R FQ R      + +
Sbjct: 61   AQLNDRDVEPP--------TLQNWGR----RGRRGLEYFR---PQRNFQERIIPEDQWLL 120

Query: 121  PQQFDEDFQEDQEV-WQEIQED-DSSSGDEQGNMWNFNDDLRAGRNNQ--RNEVRRGEYH 180
            PQ      + D+ + WQ  +E+ ++SS  E+ +    NDD+   R ++  +NE ++ E  
Sbjct: 121  PQG-----RRDRRIEWQAREEEIENSSSSEESD----NDDINEFRRHRYVQNERQQRENS 180

Query: 181  DYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQL 240
            +YKMKIDLP YDGK+NIE FLDW+K+TENFF YM   + KKVHLVALKL+ GAS WWDQ+
Sbjct: 181  EYKMKIDLPSYDGKRNIENFLDWLKNTENFFAYMGTTKNKKVHLVALKLKGGASAWWDQI 240

Query: 241  EINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTLYNQYQNCRQGVRSVVDYIEEFHCLS 300
             +NRQ+ GK  IRSWEKMKKL+K RF+PPNYEQTLY QYQNCRQG+R   +YIEEFH L 
Sbjct: 241  TVNRQKQGKHPIRSWEKMKKLMKQRFVPPNYEQTLYTQYQNCRQGMRKTAEYIEEFHRLG 300

Query: 301  ARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRFLSEAISFAETVEEMIAVRSKNLKRR 360
             RTNL E E+H I+ FVGGLR D+KEKVKLQPF+ LSEAI++AETVEEMI  R+K+ ++R
Sbjct: 301  GRTNLMEGEKHLISWFVGGLRFDLKEKVKLQPFQHLSEAITYAETVEEMIENRAKSTRKR 360

Query: 361  PAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVAVERKNEQTFKTSSQNNYSRPLL 420
            P WE ++++      KT    S       ++   QE +  +K     +   +N Y RP  
Sbjct: 361  P-WEPSASK------KTTAGNSKLKNATSEKPVEQEESSGKKEVPEGEKKGKNPYQRPFS 420

Query: 421  GKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS-KEAEDETELIEADDEERVSCVIQR 480
            G  +RCGQ  H SN CPQRKTIA+A++    S  S  E ++ETE+IEAD+ + +SC++QR
Sbjct: 421  GNCYRCGQMGHPSNQCPQRKTIAVAKDNDDGSNRSLGEFDEETEVIEADEGDSLSCILQR 480

Query: 481  ------------------ARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPY 540
                               RCTI G+VC+VIID+ SS+NFV+KKLVT LNLK + H  PY
Sbjct: 481  VLISPKEENQLQRHSLFKTRCTIQGKVCNVIIDSGSSENFVSKKLVTALNLKTQPHEKPY 540

Query: 541  KIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQYDTQSLHKGR 600
            KIGW++KGGE  +SEIC VPLSI N+YKDQ+VCDVIEMDVCH+LLGRPWQ+D QS+H+GR
Sbjct: 541  KIGWIKKGGETLISEICYVPLSIGNSYKDQMVCDVIEMDVCHILLGRPWQFDVQSMHRGR 600

Query: 601  ENTYELQWMGRKVVLLPITRKNKEGLRGEKQ---LFTT---------------------- 660
            ENTYE  WM +KV+LLP+ ++  + +   ++   LF T                      
Sbjct: 601  ENTYEFMWMNKKVILLPLQKRKDDNIEKNQKKGSLFVTISGKKFLRERENEILGIVMSGT 660

Query: 661  --------------QLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQE 720
                          +L  ++P+I +EP  LPPLRDI H+I+L+SGAS P+L HY MSP E
Sbjct: 661  EDTTRDEQIPEAIKELFKKYPKISKEPTCLPPLRDIHHNIELLSGASFPHLPHYHMSPNE 720

Query: 721  YKTLHDHIEELLKKGHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPR 780
            YK LHD IEELLKKGHI+PS S C VPALLTPKK G+WRMCVDSRAIN+ITVKYRF IPR
Sbjct: 721  YKILHDAIEELLKKGHIKPSFSLCVVPALLTPKKDGTWRMCVDSRAINKITVKYRFPIPR 780

Query: 781  ISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEW------------- 840
            +SDLLDQLG A IFSKIDL+S YHQIRIRPGDEWKT FKTNEGLFEW             
Sbjct: 781  VSDLLDQLGGACIFSKIDLRSDYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFDLSNAPST 840

Query: 841  ----MHKQRGAFTTS-----------------------KKNVPGLTETELYINTKKSMFM 900
                M+K    F                           +    L   ELY+N KK +F 
Sbjct: 841  FMRLMNKVLHPFLNKFIIVYFDDILVFSKTYDQHLQHIDQLFQVLNHNELYVNLKKCIFC 900

Query: 901  KREIAFLDFVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLASFYRKFIRNFNSLAA 933
              EIAFL F+I++  + M+ KK+EAI  W  P ++ ++QAFLGLASFYRKFI+N +S+AA
Sbjct: 901  SNEIAFLGFIIRKDHVLMDEKKVEAIKNWSTPTTVIQVQAFLGLASFYRKFIQNCSSIAA 960

BLAST of CSPI01G12910 vs. NCBI nr
Match: XP_011648447.2 (uncharacterized protein LOC105434464 [Cucumis sativus])

HSP 1 Score: 896.7 bits (2316), Expect = 1.7e-256
Identity = 478/634 (75.39%), Positives = 481/634 (75.87%), Query Frame = 0

Query: 144 MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLPVYDGKQNIEAFLDWIKSTENFFNYMD 203
           MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDL VYDGKQNIEAFLDWIKSTENFFNYMD
Sbjct: 1   MWNFNDDLRAGRNNQRNEVRRGEYHDYKMKIDLSVYDGKQNIEAFLDWIKSTENFFNYMD 60

Query: 204 IPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMKKLLKARFLPPNYEQTL 263
            PE KKVHLVALKLR                                             
Sbjct: 61  TPECKKVHLVALKLR--------------------------------------------- 120

Query: 264 YNQYQNCRQGVRSVVDYIEEFHCLSARTNLSENEQHQIARFVGGLRLDIKEKVKLQPFRF 323
                                                                       
Sbjct: 121 ------------------------------------------------------------ 180

Query: 324 LSEAISFAETVEEMIAVRSKNLKRRPAWETTSTRMNNYADKTNDQPSTSTKGKGKEVENQ 383
                  AETVEEMIAVRSKNLKRRPAW+TTSTRMNNYADKTNDQPSTSTKGKGKEVENQ
Sbjct: 181 -------AETVEEMIAVRSKNLKRRPAWKTTSTRMNNYADKTNDQPSTSTKGKGKEVENQ 240

Query: 384 EVAVERKNEQTFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS 443
           EV VERKNEQ FKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS
Sbjct: 241 EVVVERKNEQAFKTSSQNNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDS 300

Query: 444 KEAEDETELIEADDEERVSCVIQR------------------ARCTINGRVCDVIIDNDS 503
           K AEDE ELIEADD ERVSCVIQR                  ARCTINGRVCDVIIDNDS
Sbjct: 301 KGAEDEIELIEADDGERVSCVIQRVLITPKEEKKQQRHCLFKARCTINGRVCDVIIDNDS 360

Query: 504 SKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEICTVPLSIENAYKDQIVCDVI 563
           SKNFVAKKLVTVLNLKAEAHPT YKIGWVRK GE TVSEICTVPLSIENAYKDQIVCDVI
Sbjct: 361 SKNFVAKKLVTVLNLKAEAHPTSYKIGWVRKEGEATVSEICTVPLSIENAYKDQIVCDVI 420

Query: 564 EMDVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVLLPITRKNKEGLRGEK-QLFTT 623
           EMDVCHLLLGRPWQYDTQSLHKGRENTYELQ MGRKVVLLPITRKNKEGLR E  +    
Sbjct: 421 EMDVCHLLLGRPWQYDTQSLHKGRENTYELQLMGRKVVLLPITRKNKEGLRVEDIEPELQ 480

Query: 624 QLLYEFPRIKEEPEGLPPLRDIQHHIDLISGASLPNLAHYRMSPQEYKTLHDHIEELLKK 683
           QLLYEFPRIKEEPEGLPPLRDIQHHIDLI GASLPNLAHYRMSPQEYKTLHDHIEELLKK
Sbjct: 481 QLLYEFPRIKEEPEGLPPLRDIQHHIDLIPGASLPNLAHYRMSPQEYKTLHDHIEELLKK 522

Query: 684 GHIQPSLSPCAVPALLTPKKYGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF 743
           GHI+PSLSPCAVPALLT KK GSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF
Sbjct: 541 GHIKPSLSPCAVPALLTLKKDGSWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIF 522

Query: 744 SKIDLKSGYHQIRIRPGDEWKTTFKTNEGLFEWM 759
           SKIDLKSGYHQIRIRPGDEWKTTFKT EGLFEWM
Sbjct: 601 SKIDLKSGYHQIRIRPGDEWKTTFKTKEGLFEWM 522

BLAST of CSPI01G12910 vs. TAIR 10
Match: AT4G13320.1 (unknown protein; Has 68 Blast hits to 67 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 68; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).)

HSP 1 Score: 68.6 bits (166), Expect = 3.2e-11
Identity = 38/120 (31.67%), Positives = 61/120 (50.83%), Query Frame = 0

Query: 467 RARCTINGRVCDVIIDNDSSKNFVAKKLVTVLNLKAEAHPTPYKIGWVRKGGEVTVSEIC 526
           R +C IN   C +++      N ++K LV  L LK        ++   R+  +V   E C
Sbjct: 100 RTQCVINDEACRLVL--YGGNNIISKGLVKQLKLKTLKKYPSVRVMATRREDKV-AEETC 159

Query: 527 TVPLSIENAYKDQIVCDVIEM--DVCHLLLGRPWQYDTQSLHKGRENTYELQWMGRKVVL 585
            VP+SI + YKD++ C V+ M  +   LL G PW Y  Q+ H GR+++  + W    ++L
Sbjct: 160 RVPVSIGDFYKDKVTCYVVNMEEEEDQLLFGGPWLYRVQATHNGRDDSCMIIWNHNMILL 216

BLAST of CSPI01G12910 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 65.5 bits (158), Expect = 2.7e-10
Identity = 29/78 (37.18%), Positives = 48/78 (61.54%), Query Frame = 0

Query: 779 ELYINTKKSMFMKREIAFLD--FVIKQGSISMEPKKIEAIHTWPIPASIKEIQAFLGLAS 838
           + Y N KK  F + +IA+L    +I    +S +P K+EA+  WP P +  E++ FLGL  
Sbjct: 15  QFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKNTTELRGFLGLTG 74

Query: 839 FYRKFIRNFNSLAAPLTD 855
           +YR+F++N+  +  PLT+
Sbjct: 75  YYRRFVKNYGKIVRPLTE 92

BLAST of CSPI01G12910 vs. TAIR 10
Match: AT2G15180.1 (Zinc knuckle (CCHC-type) family protein)

HSP 1 Score: 46.2 bits (108), Expect = 1.7e-04
Identity = 20/69 (28.99%), Positives = 36/69 (52.17%), Query Frame = 0

Query: 188 FLDWIKSTENFFNYMDIPERKKVHLVALKLRAGASTWWDQLEINRQRCGKQSIRSWEKMK 247
           +L W  +   +F +    +  K+ +   +L+  A  WWDQ E NR    +  IR+WE++K
Sbjct: 119 YLQWESNMNYYFEFHSTAQEDKLSIALGQLKGSALWWWDQDEYNRWYERRAPIRTWERLK 178

Query: 248 KLLKARFLP 257
             + A++ P
Sbjct: 179 WNMCAKYSP 187

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
P04323 | 2.0e-42 | 30.62 | Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
Q99315 | 2.7e-42 | 29.27 | Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Q7LHG5 | 2.7e-42 | 29.27 | Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P20825 | 2.5e-40 | 27.29 | Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
P10401 | 1.0e-38 | 27.36 | Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogas...
Match Name | E-value | Identity | Description
A0A5D3E417 | 1.8e-291 | 59.62 | Transposon Ty3-I Gag-Pol polyprotein isoform X1 OS=Cucumis melo var. makuwa OX=1...
A0A5D3DGR0 | 4.4e-274 | 50.42 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold11...
A0A5A7T256 | 8.9e-251 | 63.96 | Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold28...
A0A5A7V4G7 | 1.3e-222 | 46.89 | Retrovirus-related Pol polyprotein from transposon 17.6 OS=Cucumis melo var. mak...
A0A5B7BER3 | 1.9e-213 | 44.08 | Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_036800 PE=4 SV=1
Match Name | E-value | Identity | Description
XP_031744062.1 | 1.3e-299 | 79.67 | uncharacterized protein LOC116404773 [Cucumis sativus]
XP_031741035.1 | 8.4e-296 | 84.24 | uncharacterized protein LOC116403692 [Cucumis sativus]
TYK30863.1 | 3.7e-291 | 59.62 | transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]
KAA0054966.1 | 9.1e-274 | 50.42 | transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa] >TYK2...
XP_011648447.2 | 1.7e-256 | 75.39 | uncharacterized protein LOC105434464 [Cucumis sativus]
Match Name | E-value | Identity | Description
AT4G13320.1 | 3.2e-11 | 31.67 | unknown protein; Has 68 Blast hits to 67 proteins in 12 species: Archae - 0; Bac...
ATMG00860.1 | 2.7e-10 | 37.18 | DNA/RNA polymerases superfamily protein
AT2G15180.1 | 1.7e-04 | 28.99 | Zinc knuckle (CCHC-type) family protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR041373 | Reverse transcriptase, RNase H-like domain | PFAM | PF17917 | RT_RNaseH | coord: 856..935; e-value: 3.4E-22; score: 78.7
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 |  | coord: 811..927; e-value: 1.1E-14; score: 56.2
IPR043128 | Reverse transcriptase/Diguanylate cyclase domain | GENE3D | 3.30.70.270 |  | coord: 706..733; e-value: 9.1E-49; score: 167.6
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 467..584; e-value: 1.1E-7; score: 33.7
IPR000477 | Reverse transcriptase domain | PFAM | PF00078 | RVT_1 | coord: 682..760; e-value: 1.6E-9; score: 37.6
IPR005162 | Retrotransposon gag domain | PFAM | PF03732 | Retrotrans_gag | coord: 212..309; e-value: 4.8E-16; score: 58.8
None | No IPR available | GENE3D | 3.10.10.10 | HIV Type 1 Reverse Transcriptase, subunit A, domain 1 | coord: 626..759; e-value: 9.1E-49; score: 167.6
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 351..376
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 87..109
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 377..391
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 123..152
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 351..400
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..23
None | No IPR available | PANTHER | PTHR24559 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN | coord: 615..925
None | No IPR available | PANTHER | PTHR24559 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN | coord: 213..318; coord: 465..588
None | No IPR available | PANTHER | PTHR24559:SF327 | REVERSE TRANSCRIPTASE DOMAIN-CONTAINING PROTEIN-RELATED | coord: 213..318; coord: 465..588; coord: 615..925
None | No IPR available | CDD | cd00303 | retropepsin_like | coord: 470..558; e-value: 1.94162E-13; score: 65.0504
None | No IPR available | CDD | cd09274 | RNase_HI_RT_Ty3 | coord: 861..945; e-value: 5.02305E-26; score: 101.8
None | No IPR available | CDD | cd01647 | RT_LTR | coord: 665..801; e-value: 6.81281E-38; score: 137.727
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 609..936

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI01G12910.1 | CSPI01G12910.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006259 | DNA metabolic process
molecular_function | GO:0005488 | binding
molecular_function | GO:0016779 | nucleotidyltransferase activity