CSPI04G15470 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI04G15470
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: RNA-directed DNA polymerase
Location: Chr4: 12853638 .. 12855104 (+)
RNA-Seq Expression: CSPI04G15470
Synteny: CSPI04G15470
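
The genomic span of the feature can be derived from the Location field. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly); the function name `span_length` is hypothetical and chosen only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Return the length in bp of a closed interval [start, end].

    Assumes 1-based, inclusive coordinates (GFF-style); this is an
    assumption, not something the record itself confirms.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: Chr4: 12853638 .. 12855104 (+)
gene_span = span_length(12853638, 12855104)
print(gene_span)  # 1467
```

Under that assumption the gene, intron included, covers 1467 bp on the plus strand of Chr4.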
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGGAGCCCTTATGAGCAAGAAATGTATGCCCTCATTCGAGCATTAAGGCAGTGGGAAGATTACCTACTATCTAAAGAGTTTCTCCTATTAACAAATCATTTCTCCATAAAATACCTCCAAGCTCAGAAATCTACCAACAAGATACCCGCTAGATGGATCTCCTTTTTACAGAGATTTGACTTTGTTATCAAGCATAAAAGCGGAAAAGAAAACAAAGTAGCTGATGCACTAAGTAGAAAGCATTCCTTACTTTCTATATCATCATCCGAGGTGATAGCATTCAAACACTTACCATACTTATATGAAGATGACACTGACTTCAACAAGATGTGGTACAAATGCATTCATCACCTCGAAACAAGAGAATTTCATATTGTTGATGGATTTCTATTCAAAGAAGAAAAATTGTGCATACCCCATACTTCCCCAAGAGAAGCTCTACTAAAGGAGGCACACTCCGAAGGACTAGCTGGTCACTTTGGGCAAGACAAAACAATTGAAATACTTTCCTCCAAATACTATTTGCCACAATTGCGAAAAGACACAAATAACTTTGTAAAGCGATGCCCTATATGCCAAACAACTAAAGGGACCAGTACTAATGCTGGATTATACAACCCCTTACCGATTCCTACAGCTATATGGGAGGATTTATCAGTAGACTTCGTATTGGAATTACCTAAGGCACAAAGACAGCATAATACTGTAATGGTAGTTGTAGACAGATTCAGCAAAAGGACCCACTTTTTACCCTGCAAGAAGACTAATGATGCTGTCTATATTGCTAATCTCTTCTTTCGAGAGATAATTTGACTACATGGCATACCAAGCCACTTTTGGCGAATTCTGTGGAAGAAACTAGATACCACACTTAAATTCAGTACTACAGCACACTCACAAACAGATGGTCAAACAGAAGTCACTAACAGAACCTTGGGGAATCTAATTTGTTGCCTAAGTGGCTCAAAGCATAGACAGTGGGATCTAGCGTTAGCTCAATCTGAATTCGCGTTCAATAATATGAAAAATAGATCAACCGGGAGACGTCCCTTTGAAATTGTCTATACTAAAAGTCCTAGACTTGCACTTGATCTTGCCAATTTGCTATCAAATGCAGATATCAACAATGAAGCAGAAAATATGATTGAAAGAATACAAAATCTGCACAAACAGGTACATGAACACCTACAGAAAACAACTTTATCTTATAAACAAGATAAAGACAAGAAAAGAAGAGAAGTCAAATTCAAAGAAGGAGATCTGGTGATGATACATCTGAAGAAAAACCGGTTCCCAACAGGGACATACAATAAACTAAAAGACAGACAACTTGGGCCTTGCAAAATACTAGCAAAGTATGGTGATAATGCCTATAAAATTGAACTGCCAGACGACCTACACATCAGCCCTGTTTTCAATGTAGCAGACCTGAAGACCTACCATGCCCCAGATGAATTCAGTTAG

mRNA sequence

ATGTGGAGCCCTTATGAGCAAGAAATGTATGCCCTCATTCGAGCATTAAGGCAGTGGGAAGATTACCTACTATCTAAAGAGTTTCTCCTATTAACAAATCATTTCTCCATAAAATACCTCCAAGCTCAGAAATCTACCAACAAGATACCCGCTAGATGGATCTCCTTTTTACAGAGATTTGACTTTGTTATCAAGCATAAAAGCGGAAAAGAAAACAAAGTAGCTGATGCACTAAGTAGAAAGCATTCCTTACTTTCTATATCATCATCCGAGGTGATAGCATTCAAACACTTACCATACTTATATGAAGATGACACTGACTTCAACAAGATGTGGTACAAATGCATTCATCACCTCGAAACAAGAGAATTTCATATTGTTGATGGATTTCTATTCAAAGAAGAAAAATTGTGCATACCCCATACTTCCCCAAGAGAAGCTCTACTAAAGGAGGCACACTCCGAAGGACTAGCTGGTCACTTTGGGCAAGACAAAACAATTGAAATACTTTCCTCCAAATACTATTTGCCACAATTGCGAAAAGACACAAATAACTTTGTAAAGCGATGCCCTATATGCCAAACAACTAAAGGGACCAGTACTAATGCTGGATTATACAACCCCTTACCGATTCCTACAGCTATATGGGAGGATTTATCAGTAGACTTCGTATTGGAATTACCTAAGGCACAAAGACAGCATAATACTGTAATGGTAGTTGTAGACAGATTCAGCAAAAGGACCCACTTTTTACCCTGCAAGAAGACTAATGATGCTAAACTAGATACCACACTTAAATTCAGTACTACAGCACACTCACAAACAGATGGTCAAACAGAAGTCACTAACAGAACCTTGGGGAATCTAATTTGTTGCCTAAGTGGCTCAAAGCATAGACAGTGGGATCTAGCGTTAGCTCAATCTGAATTCGCGTTCAATAATATGAAAAATAGATCAACCGGGAGACGTCCCTTTGAAATTGTCTATACTAAAAGTCCTAGACTTGCACTTGATCTTGCCAATTTGCTATCAAATGCAGATATCAACAATGAAGCAGAAAATATGATTGAAAGAATACAAAATCTGCACAAACAGGTACATGAACACCTACAGAAAACAACTTTATCTTATAAACAAGATAAAGACAAGAAAAGAAGAGAAGTCAAATTCAAAGAAGGAGATCTGGTGATGATACATCTGAAGAAAAACCGGTTCCCAACAGGGACATACAATAAACTAAAAGACAGACAACTTGGGCCTTGCAAAATACTAGCAAAGTATGGTGATAATGCCTATAAAATTGAACTGCCAGACGACCTACACATCAGCCCTGTTTTCAATGTAGCAGACCTGAAGACCTACCATGCCCCAGATGAATTCAGTTAG

Coding sequence (CDS)

ATGTGGAGCCCTTATGAGCAAGAAATGTATGCCCTCATTCGAGCATTAAGGCAGTGGGAAGATTACCTACTATCTAAAGAGTTTCTCCTATTAACAAATCATTTCTCCATAAAATACCTCCAAGCTCAGAAATCTACCAACAAGATACCCGCTAGATGGATCTCCTTTTTACAGAGATTTGACTTTGTTATCAAGCATAAAAGCGGAAAAGAAAACAAAGTAGCTGATGCACTAAGTAGAAAGCATTCCTTACTTTCTATATCATCATCCGAGGTGATAGCATTCAAACACTTACCATACTTATATGAAGATGACACTGACTTCAACAAGATGTGGTACAAATGCATTCATCACCTCGAAACAAGAGAATTTCATATTGTTGATGGATTTCTATTCAAAGAAGAAAAATTGTGCATACCCCATACTTCCCCAAGAGAAGCTCTACTAAAGGAGGCACACTCCGAAGGACTAGCTGGTCACTTTGGGCAAGACAAAACAATTGAAATACTTTCCTCCAAATACTATTTGCCACAATTGCGAAAAGACACAAATAACTTTGTAAAGCGATGCCCTATATGCCAAACAACTAAAGGGACCAGTACTAATGCTGGATTATACAACCCCTTACCGATTCCTACAGCTATATGGGAGGATTTATCAGTAGACTTCGTATTGGAATTACCTAAGGCACAAAGACAGCATAATACTGTAATGGTAGTTGTAGACAGATTCAGCAAAAGGACCCACTTTTTACCCTGCAAGAAGACTAATGATGCTAAACTAGATACCACACTTAAATTCAGTACTACAGCACACTCACAAACAGATGGTCAAACAGAAGTCACTAACAGAACCTTGGGGAATCTAATTTGTTGCCTAAGTGGCTCAAAGCATAGACAGTGGGATCTAGCGTTAGCTCAATCTGAATTCGCGTTCAATAATATGAAAAATAGATCAACCGGGAGACGTCCCTTTGAAATTGTCTATACTAAAAGTCCTAGACTTGCACTTGATCTTGCCAATTTGCTATCAAATGCAGATATCAACAATGAAGCAGAAAATATGATTGAAAGAATACAAAATCTGCACAAACAGGTACATGAACACCTACAGAAAACAACTTTATCTTATAAACAAGATAAAGACAAGAAAAGAAGAGAAGTCAAATTCAAAGAAGGAGATCTGGTGATGATACATCTGAAGAAAAACCGGTTCCCAACAGGGACATACAATAAACTAAAAGACAGACAACTTGGGCCTTGCAAAATACTAGCAAAGTATGGTGATAATGCCTATAAAATTGAACTGCCAGACGACCTACACATCAGCCCTGTTTTCAATGTAGCAGACCTGAAGACCTACCATGCCCCAGATGAATTCAGTTAG

Protein sequence

MWSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLETREFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDAKLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHISPVFNVADLKTYHAPDEFS*
Homology
BLAST of CSPI04G15470 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 191.4 bits (485), Expect = 2.2e-47
Identity = 143/524 (27.29%), Positives = 239/524 (45.61%), Query Frame = 0

Query: 6    EQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIK 65
            E E+  +I+AL  +   L  K F L T+H S+  LQ +    +   RW+  L  +DF ++
Sbjct: 934  ELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLE 993

Query: 66   HKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDD-------------TDFN--- 125
            + +G +N VADA+SR    ++  +S  I  +     Y+ D             T  N   
Sbjct: 994  YLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTP 1053

Query: 126  ------KMWYKCIHHLET--REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGL-AGH 185
                  + + K +   ET  + + + D  ++ +++L +P    + A+++  H   L  GH
Sbjct: 1054 EDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP-IKQQNAVMRLYHDHTLFGGH 1113

Query: 186  FGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTSTNA-GLYNPLPIPTAIWEDL 245
            FG   T+  +S  YY P+L+     +++ C  CQ  K       GL  PLPI    W D+
Sbjct: 1114 FGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDI 1173

Query: 246  SVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA-------------------- 305
            S+DFV  LP      N ++VVVDRFSKR HF+  +KT DA                    
Sbjct: 1174 SMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRT 1233

Query: 306  -------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQ 365
                               +L      S+  H QTDGQ+E T +TL  L+   + +  + 
Sbjct: 1234 ITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYASTNIQN 1293

Query: 366  WDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQ 425
            W + L Q EF +N+   R+ G+ PFEI     P    +   + S+ ++N  +   +E  +
Sbjct: 1294 WHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLP----NTPAIKSDDEVNARSFTAVELAK 1353

Query: 426  NLHK---QVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQ 461
            +L     Q  E L+   +  + + +++R+ +    GD V++H +   F  G Y K++   
Sbjct: 1354 HLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVH-RDAYFKKGAYMKVQQIY 1413
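
The identity and positives percentages in an HSP header can be reproduced from the reported counts. This is a minimal sketch, assuming each percentage is the count divided by the alignment length (gap columns included) and rounded to two decimals; the helper name `hsp_percent` is hypothetical.

```python
def hsp_percent(matches: int, aligned_length: int) -> float:
    """Percentage of alignment columns that match, rounded to 2 decimals.

    Assumes aligned_length is the full alignment length, gaps included.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts from the Q99315 (Ty3-G) HSP header: 143/524 identical, 239/524 positive.
print(hsp_percent(143, 524))  # 27.29
print(hsp_percent(239, 524))  # 45.61
```

The same check applies to every HSP header in this section, which makes it a quick way to spot a mistyped count.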

BLAST of CSPI04G15470 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 188.7 bits (478), Expect = 1.4e-46
Identity = 141/518 (27.22%), Positives = 235/518 (45.37%), Query Frame = 0

Query: 6    EQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFDFVIK 65
            E E+  +I+AL  +   L  K F L T+H S+  LQ +    +   RW+  L  +DF ++
Sbjct: 960  ELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLE 1019

Query: 66   HKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDD-------------TDFN--- 125
            + +G +N VADA+SR    ++  +S  I  +     Y+ D             T  N   
Sbjct: 1020 YLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTP 1079

Query: 126  ------KMWYKCIHHLET--REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGL-AGH 185
                  + + K +   ET  + + + D  ++ +++L +P    + A+++  H   L  GH
Sbjct: 1080 EDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP-IKQQNAVMRLYHDHTLFGGH 1139

Query: 186  FGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTSTNA-GLYNPLPIPTAIWEDL 245
            FG   T+  +S  YY P+L+     +++ C  CQ  K       GL  PLPI    W D+
Sbjct: 1140 FGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDI 1199

Query: 246  SVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA-------------------- 305
            S+DFV  LP      N ++VVVDRFSKR HF+  +KT DA                    
Sbjct: 1200 SMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRT 1259

Query: 306  -------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQ 365
                               +L      S+  H QTDGQ+E T +TL  L+     +  + 
Sbjct: 1260 ITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYVSTNIQN 1319

Query: 366  WDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQ 425
            W + L Q EF +N+   R+ G+ PFEI     P    +   + S+ ++N  +   +E  +
Sbjct: 1320 WHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLP----NTPAIKSDDEVNARSFTAVELAK 1379

Query: 426  NLHK---QVHEHLQKTTLSYKQDKDKKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQ 456
            +L     Q  E L+   +  + + +++R+ +    GD V++H +   F  G Y K++   
Sbjct: 1380 HLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVH-RDAYFKKGAYMKVQQIY 1439

BLAST of CSPI04G15470 vs. ExPASy Swiss-Prot
Match: Q9UR07 (Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-11 PE=3 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 4.6e-45
Identity = 159/527 (30.17%), Positives = 244/527 (46.30%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFL 61
            +S  ++EM A+I++L+ W  YL S  + F +LT+H ++  +     +  NK  ARW  FL
Sbjct: 749  YSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFL 808

Query: 62   QRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVIAF-----------KHLPYLY 121
            Q F+F I ++ G  N +ADALSR       +   S    I F             +   Y
Sbjct: 809  QDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEY 868

Query: 122  EDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGH 181
             +DT    +       +E     + DG L   ++++ +P+ T     ++K+ H EG   H
Sbjct: 869  TNDTKLLNLLNNEDKRVE-ENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIH 928

Query: 182  FGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWE 241
             G +    I+  ++    +RK    +V+ C  CQ  K  S N   Y PL PIP +   WE
Sbjct: 929  PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINK--SRNHKPYGPLQPIPPSERPWE 988

Query: 242  DLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------ 301
             LS+DF+  LP++   +N + VVVDRFSK    +PC K+  A                  
Sbjct: 989  SLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1048

Query: 302  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKH 361
                                 K +  +KFS     QTDGQTE TN+T+  L+ C+  +  
Sbjct: 1049 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1108

Query: 362  RQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-LDLANLLSNADINNEAENMIE 421
              W   ++  + ++NN  + +T   PFEIV+  SP L+ L+L +     D     EN  E
Sbjct: 1109 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTD-----ENSQE 1168

Query: 422  RIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDR 462
             IQ + + V EHL    +  K+  D K +E+ +F+ GDLVM+   K  F     NKL   
Sbjct: 1169 TIQ-VFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGF-LHKSNKLAPS 1228

BLAST of CSPI04G15470 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 4.6e-45
Identity = 159/527 (30.17%), Positives = 244/527 (46.30%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFL 61
            +S  ++EM A+I++L+ W  YL S  + F +LT+H ++  +     +  NK  ARW  FL
Sbjct: 749  YSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFL 808

Query: 62   QRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVIAF-----------KHLPYLY 121
            Q F+F I ++ G  N +ADALSR       +   S    I F             +   Y
Sbjct: 809  QDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEY 868

Query: 122  EDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGH 181
             +DT    +       +E     + DG L   ++++ +P+ T     ++K+ H EG   H
Sbjct: 869  TNDTKLLNLLNNEDKRVE-ENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIH 928

Query: 182  FGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWE 241
             G +    I+  ++    +RK    +V+ C  CQ  K  S N   Y PL PIP +   WE
Sbjct: 929  PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINK--SRNHKPYGPLQPIPPSERPWE 988

Query: 242  DLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------ 301
             LS+DF+  LP++   +N + VVVDRFSK    +PC K+  A                  
Sbjct: 989  SLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1048

Query: 302  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKH 361
                                 K +  +KFS     QTDGQTE TN+T+  L+ C+  +  
Sbjct: 1049 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1108

Query: 362  RQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-LDLANLLSNADINNEAENMIE 421
              W   ++  + ++NN  + +T   PFEIV+  SP L+ L+L +     D     EN  E
Sbjct: 1109 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTD-----ENSQE 1168

Query: 422  RIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDR 462
             IQ + + V EHL    +  K+  D K +E+ +F+ GDLVM+   K  F     NKL   
Sbjct: 1169 TIQ-VFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGF-LHKSNKLAPS 1228

BLAST of CSPI04G15470 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 4.6e-45
Identity = 159/527 (30.17%), Positives = 244/527 (46.30%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLS--KEFLLLTNHFSI--KYLQAQKSTNKIPARWISFL 61
            +S  ++EM A+I++L+ W  YL S  + F +LT+H ++  +     +  NK  ARW  FL
Sbjct: 749  YSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKRLARWQLFL 808

Query: 62   QRFDFVIKHKSGKENKVADALSR----KHSLLSISSSEVIAF-----------KHLPYLY 121
            Q F+F I ++ G  N +ADALSR       +   S    I F             +   Y
Sbjct: 809  QDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFVNQISITDDFKNQVVTEY 868

Query: 122  EDDTDFNKMWYKCIHHLETREFHIVDGFLF-KEEKLCIPH-TSPREALLKEAHSEGLAGH 181
             +DT    +       +E     + DG L   ++++ +P+ T     ++K+ H EG   H
Sbjct: 869  TNDTKLLNLLNNEDKRVE-ENIQLKDGLLINSKDQILLPNDTQLTRTIIKKYHEEGKLIH 928

Query: 182  FGQDKTIEILSSKYYLPQLRKDTNNFVKRCPICQTTKGTSTNAGLYNPL-PIPTA--IWE 241
             G +    I+  ++    +RK    +V+ C  CQ  K  S N   Y PL PIP +   WE
Sbjct: 929  PGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINK--SRNHKPYGPLQPIPPSERPWE 988

Query: 242  DLSVDFVLELPKAQRQHNTVMVVVDRFSKRTHFLPCKKTNDA------------------ 301
             LS+DF+  LP++   +N + VVVDRFSK    +PC K+  A                  
Sbjct: 989  SLSMDFITALPESS-GYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1048

Query: 302  ---------------------KLDTTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKH 361
                                 K +  +KFS     QTDGQTE TN+T+  L+ C+  +  
Sbjct: 1049 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1108

Query: 362  RQWDLALAQSEFAFNNMKNRSTGRRPFEIVYTKSPRLA-LDLANLLSNADINNEAENMIE 421
              W   ++  + ++NN  + +T   PFEIV+  SP L+ L+L +     D     EN  E
Sbjct: 1109 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSFSDKTD-----ENSQE 1168

Query: 422  RIQNLHKQVHEHLQKTTLSYKQDKDKKRREV-KFKEGDLVMIHLKKNRFPTGTYNKLKDR 462
             IQ + + V EHL    +  K+  D K +E+ +F+ GDLVM+   K  F     NKL   
Sbjct: 1169 TIQ-VFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGF-LHKSNKLAPS 1228

BLAST of CSPI04G15470 vs. ExPASy TrEMBL
Match: A0A5B7BER3 (Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_036800 PE=4 SV=1)

HSP 1 Score: 521.9 bits (1343), Expect = 2.7e-144
Identity = 255/497 (51.31%), Positives = 339/497 (68.21%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            W+ YE E++A++RAL+ WE YL+ +EF++ ++H ++K++  Q S +++  RWI+FLQRF 
Sbjct: 983  WTTYELELHAVVRALKHWEHYLIHQEFVIYSDHEALKFINTQNSLSRMHGRWIAFLQRFT 1042

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            FV+KHK+G++NKVADALSR+ +LL++ SSE+ +F+ L  LY++D DF + W KC     +
Sbjct: 1043 FVLKHKAGQQNKVADALSRRAALLAVVSSEITSFESLKELYQEDEDFQQWWAKCELKQAS 1102

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
             EFHI DG+LFK  +LCIP TS RE +L++ HS GL GH G+DKTI ++  +YY PQL++
Sbjct: 1103 AEFHIQDGYLFKGNQLCIPRTSLREQILRDLHSGGLGGHLGRDKTIALVEERYYWPQLKR 1162

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D   FV++CPICQT KG + N GLY PLP+P  IWEDL++DF+L LP+ QR  ++V VVV
Sbjct: 1163 DVGKFVQKCPICQTAKGQAQNTGLYTPLPVPEDIWEDLTMDFILGLPRTQRGMDSVFVVV 1222

Query: 242  DRFSKRTHFLPCKKTNDA---------------------------------------KLD 301
            DRFSK  HF+PCKKT+DA                                       K D
Sbjct: 1223 DRFSKMAHFIPCKKTSDASHVANLFFREIVRLHGVPKSITSDRDVKFLSHFWRTLWRKFD 1282

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            T+L++S+TAH QTDGQTEVTNRTLGNLI C SG + +QWD+ L Q EFA+N M NRST +
Sbjct: 1283 TSLQYSSTAHPQTDGQTEVTNRTLGNLIRCTSGDRPKQWDVGLPQMEFAYNCMTNRSTKK 1342

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PFEIVYTK P+ ALDLA L      +  AEN  +R   + ++V ++L+K    YK   D
Sbjct: 1343 TPFEIVYTKPPKQALDLAPLPKLPGSSIAAENFADRYYTIQEEVKQNLEKANNLYKAAAD 1402

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 460
            K RR   F EGDLVM+ L+KNRFP GTYNKLK+R+ GP ++  K  DNAY +ELPDD+ I
Sbjct: 1403 KHRRPKVFTEGDLVMVFLRKNRFPVGTYNKLKNRKYGPFRVKRKINDNAYVVELPDDMAI 1462

BLAST of CSPI04G15470 vs. ExPASy TrEMBL
Match: A0A6N2LVR1 (Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS287486 PE=4 SV=1)

HSP 1 Score: 500.0 bits (1286), Expect = 1.1e-137
Identity = 247/497 (49.70%), Positives = 330/497 (66.40%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            WS YE E+YA+ RA++ WE YL+ +EF+L ++H ++K++  Q + N++ ARW++F+QRF+
Sbjct: 982  WSTYELELYAVFRAMKVWEHYLVQREFILFSDHQALKFINNQTNVNRMHARWVAFIQRFN 1041

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            F +KHKSG+ NKVADALSRK SLL+   +EVI F+ +  LY  D DF   W KC   L  
Sbjct: 1042 FTLKHKSGQLNKVADALSRKVSLLTTLQAEVIGFECIKDLYAGDEDFGNTWDKCQQGLSH 1101

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
               H  DG+LF+  +LCIP +S RE ++ E H  GL GH G+DKT+ +   +YY PQL++
Sbjct: 1102 EGMHTHDGYLFRGNQLCIPRSSLREQIIHELHGGGLGGHLGRDKTVALAEERYYWPQLKR 1161

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D  N VKRCP CQ +KG + N GLY PLPIP   WEDLS+DF+L LP+ QR  ++V VVV
Sbjct: 1162 DIGNHVKRCPTCQASKGQTQNTGLYLPLPIPAGPWEDLSMDFILGLPRTQRGVDSVFVVV 1221

Query: 242  DRFSKRTHFLPCKKTNDA---------------------------------------KLD 301
            DRFSK  HF+ CKKT+DA                                       + D
Sbjct: 1222 DRFSKMAHFIACKKTSDAVHVANLFFKEVVRLHGVPKSITSDRDTKFLSHFWRTLWRRFD 1281

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            TTL FS+T+H QTDGQTEV NRTLGNLI CLSG + +QWDL LAQ+EFA+N+M NRSTG+
Sbjct: 1282 TTLNFSSTSHPQTDGQTEVVNRTLGNLIRCLSGERPKQWDLTLAQAEFAYNSMLNRSTGK 1341

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PF++VY + P+ ALDL  L     +N  AE+M +R++ + ++V ++L+ +   YK   D
Sbjct: 1342 TPFQVVYCQPPKHALDLVPLPKLPGMNIAAEHMADRVRAIQEEVRKNLEASNEKYKAAAD 1401

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 460
            KKRR   FKEGDLVM++L+K R P GT +KL D++ GP +IL K  DNAY+++LP D+ I
Sbjct: 1402 KKRRLKLFKEGDLVMVYLRKGRVPGGTLHKLSDKKHGPYQILQKINDNAYRVDLPADMTI 1461

BLAST of CSPI04G15470 vs. ExPASy TrEMBL
Match: A0A6N2KHU6 (Reverse transcriptase OS=Salix viminalis OX=40686 GN=SVIM_LOCUS85055 PE=4 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 5.2e-132
Identity = 242/494 (48.99%), Positives = 323/494 (65.38%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            WS Y+QE YA++RAL+ WE YL+ ++F+L T+H ++KYL +QK+ + + ARW ++LQ+F 
Sbjct: 941  WSTYDQEFYAVVRALKHWEHYLIQRDFILYTDHQALKYLNSQKNLSNMHARWSTYLQKFP 1000

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            F++KHKS   NKVADALSR+ +LL     EV+ F+ L  LYE D DF  +W KC      
Sbjct: 1001 FILKHKSVALNKVADALSRRANLLVTMQQEVVGFEFLKELYEGDEDFAGVWEKCRLQHTV 1060

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
             EFHIVD +LF+  +LC+P +S RE L++E H  GL+GH G+DKTI  ++ +YY PQL++
Sbjct: 1061 GEFHIVDDYLFRGNQLCVPRSSLREKLVRELHGRGLSGHLGRDKTISSVAERYYWPQLKR 1120

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D  N V++C +CQT+KG + N GLY PLP+P AIWEDLS+DFVL LP+ QR  ++V VVV
Sbjct: 1121 DVGNLVRKCYVCQTSKGQAQNTGLYMPLPVPDAIWEDLSMDFVLGLPRTQRGVDSVFVVV 1180

Query: 242  DRFSKRTHFLPCKKTNDA---------------------------------------KLD 301
            DRFSK  HF+PC+KT+DA                                       ++D
Sbjct: 1181 DRFSKMGHFIPCRKTSDASHVAHLFFREVVRLHWVPQSITSDRDSKFLSHFWITLWRRMD 1240

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            TTLKFS+TAH QTDGQTE  NRTLGNLI  + G K +QWD++LAQ+EFA+NN  + +TGR
Sbjct: 1241 TTLKFSSTAHPQTDGQTENLNRTLGNLIRSICGDKPKQWDVSLAQAEFAYNNAIHTATGR 1300

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PF +VY KSP+ ALDLA L      +  AEN+ E  +++  +V    ++    YK   D
Sbjct: 1301 SPFSLVYLKSPKHALDLARLPKMTSSSVAAENLAEHARSVQAEVKARFEEKNAKYKAATD 1360

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 457
             KRRE  F EGD VM+ L+K RFP  TYNKLK R+ GP  I+ K  DNAY ++L  D+HI
Sbjct: 1361 VKRREKLFAEGDEVMVFLRKERFPLVTYNKLKPRKYGPYNIIKKINDNAYVVDLLADMHI 1420

BLAST of CSPI04G15470 vs. ExPASy TrEMBL
Match: A0A6D2HLB5 (Reverse transcriptase OS=Microthlaspi erraticum OX=1685480 GN=MERR_LOCUS2198 PE=4 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 6.8e-132
Identity = 246/494 (49.80%), Positives = 319/494 (64.57%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F 
Sbjct: 925  WSTYDQEFYAVFRALRQWEHYLIQREFILFTDHQALKFLHSQKVINKMHARWVSFLQKFP 984

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            F+I+HKSG  NKVADALSR+ SLL   + E++ F+ L  LYE D +F ++W KC     +
Sbjct: 985  FIIQHKSGTLNKVADALSRRASLLITLAHEIVGFELLKELYESDAEFKELWDKCNGKHPS 1044

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
             +FHI DGFLFK ++LCIP +S RE L+++ H  GL+GH G+DKTI  L  +YY P LR+
Sbjct: 1045 ADFHIRDGFLFKGDRLCIPCSSLREKLIRDLHGGGLSGHLGRDKTIASLEERYYWPHLRR 1104

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D    VKRC ICQT+KG S N GLY PLP+P  IW+DLS+DFVL LP+ QR  ++V VVV
Sbjct: 1105 DAGAIVKRCYICQTSKGQSQNTGLYMPLPVPDDIWQDLSMDFVLGLPRTQRGVDSVFVVV 1164

Query: 242  DRFSKRTHFLPCKKTNDAK---------------------------------------LD 301
            DRFSK THF+ CKKT DA                                          
Sbjct: 1165 DRFSKMTHFIACKKTADASNIAKLFFREVVRLHGVPKTIISDRDTKFLSHFWITLWRMFG 1224

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            TTLK S+TAH QTDGQTEVTNRTLGN+I  + G + +QWDLAL Q EFA+N+  + +TG+
Sbjct: 1225 TTLKRSSTAHPQTDGQTEVTNRTLGNMIRSVCGDRPKQWDLALPQVEFAYNSAMHSATGK 1284

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PF +VYT  P+  +DL  L     ++  AE M E I    + V   L+ T    K   D
Sbjct: 1285 SPFSLVYTSVPKHVVDLVKLPKCPGVSVSAETMAEEIMATKEAVKAKLEATGQKNKVAAD 1344

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 457
            K+RR   FKEGD VM+ L+K RFP GTY KL+  + GP K+L K  DNAY ++LP++++I
Sbjct: 1345 KRRRVKVFKEGDEVMVFLRKERFPVGTYRKLQPHKYGPFKVLRKINDNAYVVDLPENMNI 1404

BLAST of CSPI04G15470 vs. ExPASy TrEMBL
Match: A0A6D2IKM3 (Reverse transcriptase OS=Microthlaspi erraticum OX=1685480 GN=MERR_LOCUS15430 PE=4 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 6.8e-132
Identity = 246/494 (49.80%), Positives = 319/494 (64.57%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F 
Sbjct: 712  WSTYDQEFYAVFRALRQWEHYLIQREFILFTDHQALKFLHSQKVINKMHARWVSFLQKFP 771

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            F+I+HKSG  NKVADALSR+ SLL   + E++ F+ L  LYE D +F ++W KC     +
Sbjct: 772  FIIQHKSGTLNKVADALSRRASLLITLAHEIVGFELLKELYESDAEFKELWDKCNGKHPS 831

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
             +FHI DGFLFK ++LCIP +S RE L+++ H  GL+GH G+DKTI  L  +YY P LR+
Sbjct: 832  ADFHIRDGFLFKGDRLCIPCSSLREKLIRDLHGGGLSGHLGRDKTIASLEERYYWPHLRR 891

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D    VKRC ICQT+KG S N GLY PLP+P  IW+DLS+DFVL LP+ QR  ++V VVV
Sbjct: 892  DAGAIVKRCYICQTSKGQSQNTGLYMPLPVPDDIWQDLSMDFVLGLPRTQRGVDSVFVVV 951

Query: 242  DRFSKRTHFLPCKKTNDAK---------------------------------------LD 301
            DRFSK THF+ CKKT DA                                          
Sbjct: 952  DRFSKMTHFIACKKTADASNIAKLFFREVVRLHGVPKTIISDRDTKFLSHFWITLWRMFG 1011

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            TTLK S+TAH QTDGQTEVTNRTLGN+I  + G + +QWDLAL Q EFA+N+  + +TG+
Sbjct: 1012 TTLKRSSTAHPQTDGQTEVTNRTLGNMIRSVCGDRPKQWDLALPQVEFAYNSAMHSATGK 1071

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PF +VYT  P+  +DL  L     ++  AE M E I    + V   L+ T    K   D
Sbjct: 1072 SPFSLVYTSVPKHVVDLVKLPKCPGVSVSAETMAEEIMATKEAVKAKLEATGQKNKVAAD 1131

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 457
            K+RR   FKEGD VM+ L+K RFP GTY KL+  + GP K+L K  DNAY ++LP++++I
Sbjct: 1132 KRRRVKVFKEGDEVMVFLRKERFPVGTYRKLQPHKYGPFKVLRKINDNAYVVDLPENMNI 1191

BLAST of CSPI04G15470 vs. NCBI nr
Match: CAA7028195.1 (unnamed protein product [Microthlaspi erraticum])

HSP 1 Score: 480.7 bits (1236), Expect = 1.4e-131
Identity = 246/494 (49.80%), Positives = 319/494 (64.57%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F 
Sbjct: 712  WSTYDQEFYAVFRALRQWEHYLIQREFILFTDHQALKFLHSQKVINKMHARWVSFLQKFP 771

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            F+I+HKSG  NKVADALSR+ SLL   + E++ F+ L  LYE D +F ++W KC     +
Sbjct: 772  FIIQHKSGTLNKVADALSRRASLLITLAHEIVGFELLKELYESDAEFKELWDKCNGKHPS 831

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
             +FHI DGFLFK ++LCIP +S RE L+++ H  GL+GH G+DKTI  L  +YY P LR+
Sbjct: 832  ADFHIRDGFLFKGDRLCIPCSSLREKLIRDLHGGGLSGHLGRDKTIASLEERYYWPHLRR 891

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D    VKRC ICQT+KG S N GLY PLP+P  IW+DLS+DFVL LP+ QR  ++V VVV
Sbjct: 892  DAGAIVKRCYICQTSKGQSQNTGLYMPLPVPDDIWQDLSMDFVLGLPRTQRGVDSVFVVV 951

Query: 242  DRFSKRTHFLPCKKTNDAK---------------------------------------LD 301
            DRFSK THF+ CKKT DA                                          
Sbjct: 952  DRFSKMTHFIACKKTADASNIAKLFFREVVRLHGVPKTIISDRDTKFLSHFWITLWRMFG 1011

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            TTLK S+TAH QTDGQTEVTNRTLGN+I  + G + +QWDLAL Q EFA+N+  + +TG+
Sbjct: 1012 TTLKRSSTAHPQTDGQTEVTNRTLGNMIRSVCGDRPKQWDLALPQVEFAYNSAMHSATGK 1071

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PF +VYT  P+  +DL  L     ++  AE M E I    + V   L+ T    K   D
Sbjct: 1072 SPFSLVYTSVPKHVVDLVKLPKCPGVSVSAETMAEEIMATKEAVKAKLEATGQKNKVAAD 1131

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 457
            K+RR   FKEGD VM+ L+K RFP GTY KL+  + GP K+L K  DNAY ++LP++++I
Sbjct: 1132 KRRRVKVFKEGDEVMVFLRKERFPVGTYRKLQPHKYGPFKVLRKINDNAYVVDLPENMNI 1191

BLAST of CSPI04G15470 vs. NCBI nr
Match: CAA7014963.1 (unnamed protein product [Microthlaspi erraticum])

HSP 1 Score: 480.7 bits (1236), Expect = 1.4e-131
Identity = 246/494 (49.80%), Positives = 319/494 (64.57%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F 
Sbjct: 925  WSTYDQEFYAVFRALRQWEHYLIQREFILFTDHQALKFLHSQKVINKMHARWVSFLQKFP 984

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            F+I+HKSG  NKVADALSR+ SLL   + E++ F+ L  LYE D +F ++W KC     +
Sbjct: 985  FIIQHKSGTLNKVADALSRRASLLITLAHEIVGFELLKELYESDAEFKELWDKCNGKHPS 1044

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
             +FHI DGFLFK ++LCIP +S RE L+++ H  GL+GH G+DKTI  L  +YY P LR+
Sbjct: 1045 ADFHIRDGFLFKGDRLCIPCSSLREKLIRDLHGGGLSGHLGRDKTIASLEERYYWPHLRR 1104

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D    VKRC ICQT+KG S N GLY PLP+P  IW+DLS+DFVL LP+ QR  ++V VVV
Sbjct: 1105 DAGAIVKRCYICQTSKGQSQNTGLYMPLPVPDDIWQDLSMDFVLGLPRTQRGVDSVFVVV 1164

Query: 242  DRFSKRTHFLPCKKTNDAK---------------------------------------LD 301
            DRFSK THF+ CKKT DA                                          
Sbjct: 1165 DRFSKMTHFIACKKTADASNIAKLFFREVVRLHGVPKTIISDRDTKFLSHFWITLWRMFG 1224

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            TTLK S+TAH QTDGQTEVTNRTLGN+I  + G + +QWDLAL Q EFA+N+  + +TG+
Sbjct: 1225 TTLKRSSTAHPQTDGQTEVTNRTLGNMIRSVCGDRPKQWDLALPQVEFAYNSAMHSATGK 1284

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PF +VYT  P+  +DL  L     ++  AE M E I    + V   L+ T    K   D
Sbjct: 1285 SPFSLVYTSVPKHVVDLVKLPKCPGVSVSAETMAEEIMATKEAVKAKLEATGQKNKVAAD 1344

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 457
            K+RR   FKEGD VM+ L+K RFP GTY KL+  + GP K+L K  DNAY ++LP++++I
Sbjct: 1345 KRRRVKVFKEGDEVMVFLRKERFPVGTYRKLQPHKYGPFKVLRKINDNAYVVDLPENMNI 1404

BLAST of CSPI04G15470 vs. NCBI nr
Match: TXG62763.1 (hypothetical protein EZV62_009757 [Acer yangbiense])

HSP 1 Score: 480.3 bits (1235), Expect = 1.8e-131
Identity = 240/489 (49.08%), Positives = 325/489 (66.46%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            WS Y+QE YA+IRAL+ WE YL+ +EF+L T+H ++KYL +Q+S + + ARW ++LQ+F 
Sbjct: 1000 WSTYDQEFYAIIRALKNWEHYLIQREFILYTDHQALKYLNSQRSLSNMHARWATYLQKFP 1059

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            FV+KHKSG  NKVADALSR+ SLL     E+I F+ L  LY DD DF ++W  C+     
Sbjct: 1060 FVLKHKSGVLNKVADALSRRASLLVTMQQEIIGFEFLKELYSDDEDFGEVWEMCVLKQAD 1119

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
             EFH+ +G+LF   +LCIP +S RE L++E H  GL GH G+DKTI  ++ +YY PQL++
Sbjct: 1120 GEFHMNEGYLFCGNQLCIPRSSLREKLIRELHGGGLGGHLGRDKTISGVAERYYWPQLKR 1179

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D  NFV++C + QT+KG + N GLY PLP+P AIWEDL++DFVL LP+ QR  ++V VVV
Sbjct: 1180 DVGNFVRKCYVYQTSKGQAQNTGLYMPLPVPNAIWEDLAMDFVLGLPRTQRGVDSVFVVV 1239

Query: 242  DRFSKRTHFLPCKKTNDA---------------------------------------KLD 301
            DRFSK  HF+PC+KT+DA                                       ++D
Sbjct: 1240 DRFSKMGHFIPCRKTSDASHVAQLFFREVVRLHGVPQSITSDRDSKFLSHFWVTLWRRMD 1299

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            T LKFS+TAH QTDGQTE  NRTLGNLI  + G K +QWD+ALAQ+EFA+NN  + +TG+
Sbjct: 1300 TALKFSSTAHPQTDGQTENLNRTLGNLIRSICGDKPKQWDVALAQAEFAYNNAVHTATGK 1359

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PF +VY + P+ ALDLA L     ++  AE M E+++++  +V   L++    YK   D
Sbjct: 1360 SPFALVYRQPPKHALDLARLPKVTGMSVAAETMAEQVRDVQAEVKARLEEKNAKYKAAAD 1419

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 452
             KRRE  F EGD VM+ L+K RFP G+YNKLK R+ GP K++ K  +NAY I+LP +++I
Sbjct: 1420 VKRREKLFAEGDQVMVFLRKERFPVGSYNKLKPRKYGPYKVIKKINNNAYVIDLPANMNI 1479

BLAST of CSPI04G15470 vs. NCBI nr
Match: CAA7021913.1 (unnamed protein product [Microthlaspi erraticum])

HSP 1 Score: 474.9 bits (1221), Expect = 7.7e-130
Identity = 246/504 (48.81%), Positives = 323/504 (64.09%), Query Frame = 0

Query: 2   WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
           WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW++FLQ+F 
Sbjct: 280 WSTYDQEFYAVFRALRQWEHYLVQREFILFTDHQALKFLHSQKVINKMHARWVNFLQKFP 339

Query: 62  FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
           F+I+HKSG  NKVADALSR+ SLL+  ++E++ F+ L  LYE D +F ++W KC  +  +
Sbjct: 340 FIIQHKSGTLNKVADALSRRASLLTTLANEIVGFEFLRELYESDEEFKELWRKCCTNHPS 399

Query: 122 REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
            +FH+ DG+LFK ++LCIP +S  E L++E H  GL+GH G+DKTI  L  +YY P LRK
Sbjct: 400 ADFHVRDGYLFKGDRLCIPCSSLHEKLIRELHGGGLSGHLGRDKTIASLEERYYWPHLRK 459

Query: 182 DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
           D    V+RC ICQ +KG S N GLY PLPIP  IW+DLS+DFVL LP+ QR  ++V VVV
Sbjct: 460 DAGAIVRRCYICQVSKGQSQNTGLYMPLPIPDDIWQDLSMDFVLGLPRTQRGVDSVFVVV 519

Query: 242 DRFSKRTHFLPCKKTNDAK---------------------------------------LD 301
           DRFSK THF+ C+KT DA                                          
Sbjct: 520 DRFSKMTHFIACRKTADASNIAKLFFREIVRLHGVPKSIVSDRDTKFLSHFWITLWRMFG 579

Query: 302 TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
           T+LK S+TAH Q+DGQ EVTNRTLGN+I  + G K +QWDLAL Q EFA+N   + +TG+
Sbjct: 580 TSLKRSSTAHPQSDGQPEVTNRTLGNMIRSVCGDKPKQWDLALPQVEFAYNMAVHSATGK 639

Query: 362 RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
            PF +VYT  P+  +DL  L     ++  AE M + I    + V   L+ T    K+  D
Sbjct: 640 SPFSLVYTSVPKHVVDLVPLPKAPGVSASAEAMAKDILETKEAVRARLEATGQKNKRAAD 699

Query: 422 KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 462
           K+RR   F EGD VM+ L+K RFP GTY KL+ R+ GP KIL K  DNAY ++LPDD++I
Sbjct: 700 KRRRLKVFTEGDEVMVFLRKERFPVGTYRKLQPRKYGPFKILQKLNDNAYVVDLPDDMNI 759

BLAST of CSPI04G15470 vs. NCBI nr
Match: KAG7588770.1 (Integrase catalytic core [Arabidopsis suecica])

HSP 1 Score: 471.9 bits (1213), Expect = 6.5e-129
Identity = 242/494 (48.99%), Positives = 320/494 (64.78%), Query Frame = 0

Query: 2    WSPYEQEMYALIRALRQWEDYLLSKEFLLLTNHFSIKYLQAQKSTNKIPARWISFLQRFD 61
            WS Y+QE YA+ RALRQWE YL+ +EF+L T+H ++K+L +QK  NK+ ARW+SFLQ+F 
Sbjct: 952  WSTYDQEFYAVFRALRQWEHYLVQREFILFTDHQALKFLHSQKVINKMHARWVSFLQKFP 1011

Query: 62   FVIKHKSGKENKVADALSRKHSLLSISSSEVIAFKHLPYLYEDDTDFNKMWYKCIHHLET 121
            F+I+HKSG  NKVADALSR+ SLL+  + E++ F+ L  LYE D +F ++W KC     +
Sbjct: 1012 FIIQHKSGALNKVADALSRRASLLTTLAHEIVGFEFLKELYETDAEFKELWDKCNGKHPS 1071

Query: 122  REFHIVDGFLFKEEKLCIPHTSPREALLKEAHSEGLAGHFGQDKTIEILSSKYYLPQLRK 181
             +FHI +G+LFK ++LCIP +S RE L++E H  GL+GH G+DKTI  L  +YY P LRK
Sbjct: 1072 TDFHIREGYLFKGDRLCIPCSSLREKLIRELHGGGLSGHLGRDKTIASLEERYYWPHLRK 1131

Query: 182  DTNNFVKRCPICQTTKGTSTNAGLYNPLPIPTAIWEDLSVDFVLELPKAQRQHNTVMVVV 241
            D    V+RC +CQ +KG S N GLY PL +P  IW+DLS+DFVL LP+ QR  ++V VVV
Sbjct: 1132 DAGAIVRRCFVCQVSKGQSQNTGLYMPLSVPDDIWQDLSMDFVLGLPRTQRGVDSVFVVV 1191

Query: 242  DRFSKRTHFLPCKKTNDA---------------------------------------KLD 301
            D+FSK THF+ C+KT DA                                          
Sbjct: 1192 DKFSKMTHFIACRKTADATNIAKLFFREVVRLHGVPKSIVSDRDTKFLSHFWITLWRMFG 1251

Query: 302  TTLKFSTTAHSQTDGQTEVTNRTLGNLICCLSGSKHRQWDLALAQSEFAFNNMKNRSTGR 361
            T+LK S+TAH Q+DGQTEVTNRTLGN+I  + G K +QWDLAL Q EFA+N+  + +TG+
Sbjct: 1252 TSLKRSSTAHPQSDGQTEVTNRTLGNMIRSVCGDKPKQWDLALPQIEFAYNSAVHSATGK 1311

Query: 362  RPFEIVYTKSPRLALDLANLLSNADINNEAENMIERIQNLHKQVHEHLQKTTLSYKQDKD 421
             PF +VYT  P+  +DL  L     ++  A+ M + I +  + V   L+ T    K+  D
Sbjct: 1312 SPFTLVYTSVPKHVVDLVPLPQAPGVSASAKAMAKDILDTKEAVRARLEATGQKNKRAAD 1371

Query: 422  KKRREVKFKEGDLVMIHLKKNRFPTGTYNKLKDRQLGPCKILAKYGDNAYKIELPDDLHI 457
            KK+R   FKEGD VM+ LKK RFP GTY KL+ R+ GP KIL K  DNAY ++LPDD+ I
Sbjct: 1372 KKQRLKVFKEGDEVMVFLKKERFPVGTYRKLQPRKYGPFKILQKLNDNAYVVDLPDDMSI 1431
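
The percentages in each HSP summary line follow directly from the counts printed beside them (matches divided by alignment length). The following is a minimal sketch, not part of the original analysis pipeline, that parses such a line and recomputes both percentages; the function name `check_hsp_percentages` and the returned dictionary keys are hypothetical, and the regular expression assumes the line layout shown in the listings above.

```python
import re

# Matches lines such as:
#   Identity = 240/489 (49.08%), Positives = 325/489 (66.46%), Query Frame = 0
# "Pos\w+" tolerates spelling variants of the second label.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_percentages(line):
    """Parse one HSP summary line and recompute its percentages.

    Returns a dict with the reported and recomputed values
    (hypothetical helper, for illustration only).
    """
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP identity line: %r" % line)

    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_recomputed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_recomputed": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    example = "Identity = 240/489 (49.08%), Positives = 325/489 (66.46%), Query Frame = 0"
    print(check_hsp_percentages(example))
```

Applied to the three NCBI nr hits listed above, the recomputed values agree with the reported ones to two decimal places.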

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
Q99315        2.2e-47    27.29     Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Q7LHG5        1.4e-46    27.22     Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Q9UR07        4.6e-45    30.17     Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT41        4.6e-45    30.17     Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT34        4.6e-45    30.17     Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name    E-value    Identity  Description
A0A5B7BER3    2.7e-144   51.31     Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_036800 PE=4 SV=1
A0A6N2LVR1    1.1e-137   49.70     Uncharacterized protein OS=Salix viminalis OX=40686 GN=SVIM_LOCUS287486 PE=4 SV=...
A0A6N2KHU6    5.2e-132   48.99     Reverse transcriptase OS=Salix viminalis OX=40686 GN=SVIM_LOCUS85055 PE=4 SV=1
A0A6D2HLB5    6.8e-132   49.80     Reverse transcriptase OS=Microthlaspi erraticum OX=1685480 GN=MERR_LOCUS2198 PE=...
A0A6D2IKM3    6.8e-132   49.80     Reverse transcriptase OS=Microthlaspi erraticum OX=1685480 GN=MERR_LOCUS15430 PE...

Match Name    E-value    Identity  Description
CAA7028195.1  1.4e-131   49.80     unnamed protein product [Microthlaspi erraticum]
CAA7014963.1  1.4e-131   49.80     unnamed protein product [Microthlaspi erraticum]
TXG62763.1    1.8e-131   49.08     hypothetical protein EZV62_009757 [Acer yangbiense]
CAA7021913.1  7.7e-130   48.81     unnamed protein product [Microthlaspi erraticum]
KAG7588770.1  6.5e-129   48.99     Integrase catalytic core [Arabidopsis suecica]

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                             Source       Source Term      Source Description                    Alignment
None       No IPR available                            COILS        Coil             Coil                                  coord: 345..372
None       No IPR available                            GENE3D       1.10.340.70                                            coord: 112..197; e-value: 3.7E-18; score: 67.6
None       No IPR available                            PANTHER      PTHR24559:SF367  SUBFAMILY NOT NAMED                   coord: 260..459
None       No IPR available                            PANTHER      PTHR24559        TRANSPOSON TY3-I GAG-POL POLYPROTEIN  coord: 260..459
None       No IPR available                            CDD          cd09274          RNase_HI_RT_Ty3                       coord: 2..82; e-value: 5.49926E-31; score: 113.742
IPR041373  Reverse transcriptase, RNase H-like domain  PFAM         PF17917          RT_RNaseH                             coord: 2..60; e-value: 2.7E-11; score: 43.7
IPR036397  Ribonuclease H superfamily                  GENE3D       3.30.420.10                                            coord: 260..374; e-value: 7.9E-18; score: 66.4
                                                                                                                           coord: 207..259; e-value: 9.4E-6; score: 27.0
IPR041588  Integrase zinc-binding domain               PFAM         PF17921          Integrase_H2C2                        coord: 145..197; e-value: 5.0E-17; score: 61.7
IPR043502  DNA/RNA polymerase superfamily              SUPERFAMILY  56672            DNA/RNA polymerases                   coord: 2..66
IPR012337  Ribonuclease H-like superfamily             SUPERFAMILY  53098            Ribonuclease H-like                   coord: 207..331

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G15470.1  CSPI04G15470.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0043170      macromolecule metabolic process
biological_process  GO:0006807      nitrogen compound metabolic process
biological_process  GO:0044238      primary metabolic process
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008233      peptidase activity