CSPI07G11750 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI07G11750
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr7: 9954033 .. 9960114 (+)
RNA-Seq ExpressionCSPI07G11750
SyntenyCSPI07G11750
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGTAGGCCTCCTAAAAAACTACGTGTGTAAAAATAAACACCTTAAATTTTGGGGTGTTACACACGTATCCGAAAACTAAACAAAGTTATATTCATAAAAACACAAATAACAAAAAAGTGAAAATTTCTAGACCTCCTTATTTTGATGGTTCAAATTTTGGATGTGAACTTGAGAATTTAATTTGTTTGGGTGGTGTGGGTTATGCTAAACTGTGGCTTGTACTGGTAAACTAAGGGGTGTTAGGTCTTTGTAGTTTGATCGACTGGTGTATTAAAACGTTACCAACGGTCTTTGGACTAAAGTGAGATTGGTTGACAGCTAGAATGTAAGTGAGAAAACTATATATGTTGAAGTAATGGAATGGGGTTATAATGACTGTTAAGATGTTTATAACAGACTGAGGGGGAAATTGTTAGCTTTTATTGGGATGTGTGCCTTGTAGGTGTCCCATGGGATCACCAATTATTATGCATCCATCGGGAGCATTAGACTGATATGTGTTTATGCAGAACACTTGACTGAACTGTACGTCCCTCAAGGCGTTAGATTGATACGTATATTCTATGGGATCACAAGACTGACTATGCAGGGTATCTCAATAGGACGTTAACAATTTATTTTCCTCTGACGGAACCAGTAGTGGGTTTCTTACTGAGTATTTTTATACTCACCCCTTTATGTTTAATTTTTTCATGCAAAGGTAACAAAGATGGCAAAGTGGCGAGGGGCAGGAAGGAAGCGTGATTGCCAGGGGAACATGTTATTTTACTTCCGCTTTATGTTTTTTTGATGTTTTAAACGTTGTGTTGAAACTTAGCTTACGAACTTTTTTTTTAAACAGTACTTTTTAAGTTTTGAATATTTGAAACCAGGTCTGACTTAAAATTATTTTTCCTCGATTGTATGTTTTTCAAATGCCTTACATTTCCTTTTGTATCAACCGAGGTCTTTTGTCAAAAGTTAATGACCTCGACTTAGGTAGAAAAGTTGGGTCGTTACAGGTTTAGGCTATATTGATGAATCATCTACTCCTTCAAGTTCTAAAACTACATTTGTTCAAGCATCACTTATTGTGCCTAAGCTTAACATGCCTAATGATGTGTCTAATCATGTTAAATCTAGTTTTGTACCCATATGTCATAATTGTGGTGTTGAAGGTCACATTAGACCTAAATGCTTTAAATTGAAGTATGCTCAAAATACTTATTCAAGAAGAAATTTTTCACAAAGAGCGAAGTTTTACAATGCTCCAAGGAAAAAATTTTCGAAGAAAAGTAGAGTACATAAATTTGTTATGAAAAATAAATCTTTGCATAATGTTGTTTGTTTTTCTTGTGGAAAGTATGGACATAAAGCTTATTCTTGTTACCTATCTAGCTCTAGTGCTTGTAATGTCTATAAAAAAATGAAATGGATCCCTAAATACGTAAATGCTAACATTCTAGGACCCAAACAAGTTTGGGTACCAAAGGATCAAACTTGAACTTAGTTGTTTTTAGGTTTTTTTGAAAGCCTCCAAAAAAAATGAATGGTACTTGGATAGTGGTTGCTCAAGACACATGACGAGAGACCGATCCAAGTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGTAAAATAATTGGTAAGGGTAATATAGGAAATGATTCATCTACTTTGATTGAAAATATTCATTTGGTTGATAGTTTAAAGCATGATTTGCTTAGTATTAGTCAATTGTGTGATAAAGGATTTAGAGTAATATTTAAGAAAAAAAATTGCATAATTGAAAATGTTAGTGATAGAAAAGTTTTGTTTGTTGGAAATAGGGATGAAAATGTGTACACTCTTGATTTGAATGATTATTCTATTATTGATAAATGTCTTGCGATTTTGCATGATGATTCTTGGTTATGGCATAGAAGACTAGGCCATGCTAGCATGCACTTAATTTCAAATATTTCTAAAAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTATGGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCTAGATTTACTTGGATTTTGATGATAAAACATAAGGATGATGCTTTGAAAAAGTTTTATTAGTTTTACAAAAAGAGTACAAAATGAAAAAAAGGATTTTTTATTTCCAAAATTAGGAGTGATCACGAAGGAGAATTTGATAATTAATGATGCTTTTAAAGATTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTTCTTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAATATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTTGAATAATAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATGTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGATCTATTCATGTTGTATTTGATGAATCTTGGAATGATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAATATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAATGAACAAAGTTCGGAAATTAGTCCCTAGGCCGTATAATGCATCTATAATTGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTTTTGTCAAGAAGAAGATATATATTATGAAGAGACTTTTGCAACGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAACATTCATTTTCTATCAAATGGATGTAAAATGTGCTTTTTTAAACGGTTATATTGTGGAGGAAGTTTACGTAGAACAACCTCTGGGCTTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTGTAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTTTTCAAATCAAACAACTCAAGGATGACATCTTCATAAGTCAAGAAAAATACACAAGGAATTTGCTTAAGAAATTCAAATTAAATAAAGGTCAAGTTGCAAAAACTCGTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAGGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCAAATCTTTACTTTATTTGACCGCTAGTAAACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGATAGGATATTCCGATGCGAATTTTGCCGGTAGTTTACTTGACCATAAAAGTACTAGTAGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCTTTATCCACTACCGAAGCGGAATATATTGCGATTGCTAGTTGTTGTGCAAAAATTATTTGGATGAAACAAATTCTTTGTGATTTTGGATTAAAATTTGATAATGTGCCTATATTTTGTGATAATATTAGTGCCATAAATTTGACTAAGAATCCGATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTTTATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTGTAAGCTCCAATAATCAATTAGCGGATATATTTACCAAGCCTTTGAATGAAGAAAGTTTTTGTAAAAATAGGCTTGAGCTTGGAATTATTCGTTGTGATGCATCTTGATTTTATACCTTAATTGATATTGTTACCCGAATGCATATTGATAGGGGGAGAATTAGACATTTTGTATGTGTTCATATTCTTTTTCGAGAGCTAACTTCCTTGGTGTTGTTTTTGATGATTCCAAAGAGGGAGAAGTATAAGAAAAATATGTTTTTATGAATTTGTTGCATAATTTTTCACCTCAAAAACATATTCTTAAATAAGAAGGGTTTTACACTTCAAAATCAATGATTATGTGGATATGAAAAGAAATTGACCAAACATTAATTCCAAAATCATCACAATTACAAATACGTTGAATGTTCAATTTGACAAGAAGATATCCTAAACACGGCAGAAAAGTTAAAACAAATAAACATGCAAGAATATATTATAATTCTAAAGGAAAGGGAAGATACTTGTCAGCTTCTCCTTTCACTTTCTTTGAGAACTCATTGAATACACTGAATCTGTGATTGATGGAATAACCACTTGGTAATAGCAATTGTGATCTCGTGCAGGAGCTCTACCATACCAAAATGTGTCCATAAATCAAACAAACTAGATGAATAAATTTACACCACAGAAATCCATCCTATGAAGACTACAACGGAGAAAGATGGGATAAGAATAAAGCATAGGAAGAACAAATACCGATAAAAAATACTGTCTTGAGAATAATATATCCCAAACCAATTTTCTGCCGGTCATTAAGATTTACCTACTGCAACTCTGCACACAAATTTTTTCTTATAAACAATAATTAACACAATCTAATTGGTATCTTATGGATATAAAAGACAGCATGGATCCATATTTTCGTGAATGAAGTAAGAATTTAACACATGATTCAAGTTTTGAGGAATGTTGTGATGTTAAAAGAAAGTAAAGGTATTACTACAAGACCTTAAATAAAAAAAATAAAAAAATCTCCATGAAACCACAAATATTCGATTTAAATGTAGTTATATAGGTGAGGTAAAGCTACTTCAAATCTTTAAACTATAATGCTATTGTAAATAACCGACAGAAAGTTGAATATAGTGAATTAATAATCAAAGAACAATTTTGGATTTTGTGGTAAAATTCTACCTTTTAGAACCATAAAATTGGAAGGATTTCAAATTAGTCTATGAACAACGAATTTGGAAGCATTTGTGATCCACTAACGAGCTCTCCTGGGGCATCCTTCGATACTTATTTGTGATCCACAAGAGACTGGGAGTAGTTGTGATCTCTGTCCATAGTAGTCATCCAGCCAGGAAGGGAAAGACCACGACACGCAGTGGAATACGAAAAAGAGCATCGAGCACAGCTTTCGTGTCACCAGTAATCCTTTCTCCTTTATTGGTAGACTGAAAAAGGTTTTGTCAAGCAGGCAGGCATGATTTTCATGTCGATCGACAGTTGAGAAATCTGATGTTGTTAGTGCTAGGGGAAGAAGATGGCGGTGAATTGGCGATTTTGAGGAAGAAAAAGAAGAAAAGAGAGTGTGAGAGAAGGGGAGAGGAGGGAGAAAGTAGAGAGAGAGGCAGTAGTTAG

mRNA sequence

ATGGCGGTGTCCCATGGGATCACCAATTATTATGCATCCATCGGGAGCATTAGACTGATATGTGTTTATGCAGAACACTTGACTGAACTGTACGTCCCTCAAGGCGTTAGATTGATACGTATATTCTATGGGATCACAAGACTGACTATGCAGGGTTTAGGCTATATTGATGAATCATCTACTCCTTCAAGTTCTAAAACTACATTTGTTCAAGCATCACTTATTGTGCCTAAGCTTAACATGCCTAATGATGTGTCTAATCATGTTAAATCTAGTTTTTGGTTGCTCAAGACACATGACGAGAGACCGATCCAAGTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGTGATCACGAAGGAGAATTTGATAATTAATGATGCTTTTAAAGATTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTTCTTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAATATTGGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAATATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAATGAACAAAGTTCGGAAATTAGTCCCTAGGCCGTATAATGCATCTATAATTGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTTTTGTCAAGAAGAAGATATATATTATGAAGAGACTTTTGCAACGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAACATTCATTTTCTATCAAATGGATGTAAAATGTGCTTTTTTAAACGGTTATATTGTGGAGGAAGTTTACGTAGAACAACCTCTGGGCTTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTTTTCAAATCAAACAACTCAAGGATGACATCTTCATAAGTCAAGAAAAATACACAAGGAATTTGCTTAAGAAATTCAAATTAAATAAAGGTCAAGTTGCAAAAACTCGTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAGGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCAAATCTTTACTTTATTTGACCGCTAGTAAACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGATAGGATATTCCGATGCGAATTTTGCCGGTAGTTTACTTGACCATAAAAGTACTAGTAGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCTTTATCCACTACCGAAGCGGAATATATTGCGATTGCTAGTTGTTGTGCAAAAATTATTTGGATGAAACAAATTCTTTGTGATTTTGGATTAAAATTTGATAATGTGCCTATATTTTGTGATAATATTAGTGCCATAAATTTGACTAAGAATCCGATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTTTATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTCTCTCCTGGGGCATCCTTCGATACTTATTTGTGATCCACAAGAGACTGGGAGTAGTTGTGATCTCTGTCCATAGTAGTCATCCAGCCAGGAAGGGAAAGACCACGACACGCAGTGGAATACGAAAAAGAGCATCGAGCACAGCTTTCGTGTCACCATTGAGAAATCTGATGTTGTTAGTGCTAGGGGAAGAAGATGGCGGTGAATTGGCGATTTTGAGGAAGAAAAAGAAGAAAAGAGAGTGTGAGAGAAGGGGAGAGGAGGGAGAAAGTAGAGAGAGAGGCAGTAGTTAG

Coding sequence (CDS)

ATGGCGGTGTCCCATGGGATCACCAATTATTATGCATCCATCGGGAGCATTAGACTGATATGTGTTTATGCAGAACACTTGACTGAACTGTACGTCCCTCAAGGCGTTAGATTGATACGTATATTCTATGGGATCACAAGACTGACTATGCAGGGTTTAGGCTATATTGATGAATCATCTACTCCTTCAAGTTCTAAAACTACATTTGTTCAAGCATCACTTATTGTGCCTAAGCTTAACATGCCTAATGATGTGTCTAATCATGTTAAATCTAGTTTTTGGTTGCTCAAGACACATGACGAGAGACCGATCCAAGTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGTGATCACGAAGGAGAATTTGATAATTAATGATGCTTTTAAAGATTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTTCTTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAATATTGGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAATATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAATGAACAAAGTTCGGAAATTAGTCCCTAGGCCGTATAATGCATCTATAATTGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTTTTGTCAAGAAGAAGATATATATTATGAAGAGACTTTTGCAACGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAACATTCATTTTCTATCAAATGGATGTAAAATGTGCTTTTTTAAACGGTTATATTGTGGAGGAAGTTTACGTAGAACAACCTCTGGGCTTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTTTTCAAATCAAACAACTCAAGGATGACATCTTCATAAGTCAAGAAAAATACACAAGGAATTTGCTTAAGAAATTCAAATTAAATAAAGGTCAAGTTGCAAAAACTCGTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAGGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCAAATCTTTACTTTATTTGACCGCTAGTAAACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGATAGGATATTCCGATGCGAATTTTGCCGGTAGTTTACTTGACCATAAAAGTACTAGTAGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCTTTATCCACTACCGAAGCGGAATATATTGCGATTGCTAGTTGTTGTGCAAAAATTATTTGGATGAAACAAATTCTTTGTGATTTTGGATTAAAATTTGATAATGTGCCTATATTTTGTGATAATATTAGTGCCATAAATTTGACTAAGAATCCGATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTTTATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTCTCTCCTGGGGCATCCTTCGATACTTATTTGTGATCCACAAGAGACTGGGAGTAGTTGTGATCTCTGTCCATAGTAGTCATCCAGCCAGGAAGGGAAAGACCACGACACGCAGTGGAATACGAAAAAGAGCATCGAGCACAGCTTTCGTGTCACCATTGAGAAATCTGATGTTGTTAGTGCTAGGGGAAGAAGATGGCGGTGAATTGGCGATTTTGAGGAAGAAAAAGAAGAAAAGAGAGTGTGAGAGAAGGGGAGAGGAGGGAGAAAGTAGAGAGAGAGGCAGTAGTTAG

Protein sequence

MAVSHGITNYYASIGSIRLICVYAEHLTELYVPQGVRLIRIFYGITRLTMQGLGYIDESSTPSSSKTTFVQASLIVPKLNMPNDVSNHVKSSFWLLKTHDERPIQVISFSKKNGGMVTFGDNKKGVITKENLIINDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSLWLKTSSKSLCMHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLSWGILRYLFVIHKRLGVVVISVHSSHPARKGKTTTRSGIRKRASSTAFVSPLRNLMLLVLGEEDGGELAILRKKKKKRECERRGEEGESRERGSS*
Homology
BLAST of CSPI07G11750 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 263.5 bits (672), Expect = 8.1e-69
Identity = 220/819 (26.86%), Postives = 348/819 (42.49%), Query Frame = 0

Query: 134  INDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVN 193
            +++  + FC + G S++ + P TPQ NGV ER  RT+ E AR+M++   L K FW EAV 
Sbjct: 556  LSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVL 615

Query: 194  TACYVSNRVLVRPSLD--KTPYELWHGKIPNI------------------GDDLEKDFGD 253
            TA Y+ NR+  R  +D  KTPYE+WH K P +                  G   +K F  
Sbjct: 616  TATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNKQGKFDDKSFKS 675

Query: 254  LL-------------VNDK---GKEIV-----------PSMQDVNIIEKKEEGSSSLPKE 313
            +              VN+K    +++V              + V + + KE  + + P +
Sbjct: 676  IFVGYEPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPND 735

Query: 314  WRYAL------------------------------------------------------- 373
             R  +                                                       
Sbjct: 736  SRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKD 795

Query: 374  ---------------------------------------SHPKDLILGNP---------- 433
                                                    H K++ + NP          
Sbjct: 796  SKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIIN 855

Query: 434  --EQGVKTR---------SSLN-LFSNLAFVSQIEPRSFKDAECDE---FWILAMQEELN 493
               + +KT+         +SLN +  N   +    P SF + +  +    W  A+  ELN
Sbjct: 856  RRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELN 915

Query: 494  QFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETF 553
              ++N    +  RP N +I+ ++WVF  K +E GN IR KARLVA+GF Q+  I YEETF
Sbjct: 916  AHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETF 975

Query: 554  ATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGS------- 613
            A VAR+ + R +L+         +QMDVK AFLNG + EE+Y+  P G    S       
Sbjct: 976  APVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDNVCKLN 1035

Query: 614  -----------LWLK------------TSSKSLCMH------------------------ 673
                        W +             SS   C++                        
Sbjct: 1036 KAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIAT 1095

Query: 674  --------------NEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKG 713
                           +F M+ + E+  F+G +I+  +D I++SQ  Y + +L KF +   
Sbjct: 1096 GDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENC 1155

BLAST of CSPI07G11750 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 262.3 bits (669), Expect = 1.8e-68
Identity = 219/736 (29.76%), Postives = 330/736 (44.84%), Query Frame = 0

Query: 138  FKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACY 197
            F+++C  +G  H  + P TPQ NGV ER NRT+ E  RSML    LPK FW EAV TACY
Sbjct: 560  FEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACY 619

Query: 198  VSNRVLVRPSLDKTPYELWHGK------------------------------IPNI---- 257
            + NR    P   + P  +W  K                              IP I    
Sbjct: 620  LINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGY 679

Query: 258  GDDLEKDFGDLLVNDKGKEIVPSMQDV----------NIIEKKEEG-------------- 317
            GD+   +FG  L +   K+++ S   V          ++ EK + G              
Sbjct: 680  GDE---EFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPNFVTIPSTSNN 739

Query: 318  ---SSSLPKEWRYALSHPKDLI------------LGNPEQGVKTRSSL----------NL 377
               + S   E       P ++I            + +P QG +    L            
Sbjct: 740  PTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRR 799

Query: 378  FSNLAFV---SQIEPRSFKDA----ECDEFWILAMQEELNQFEMNKVRKLVPRPYNASII 437
            + +  +V      EP S K+     E ++  + AMQEE+   + N   KLV  P     +
Sbjct: 800  YPSTEYVLISDDREPESLKEVLSHPEKNQL-MKAMQEEMESLQKNGTYKLVELPKGKRPL 859

Query: 438  GTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKT 497
              KWVF+ K D +  ++R KARLV +GF Q++ I ++E F+ V ++ +IR +L+ A+   
Sbjct: 860  KCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLD 919

Query: 498  FIFYQMDVKCAFLNGYIVEEVYVEQPLGFE------------------------------ 557
                Q+DVK AFL+G + EE+Y+EQP GFE                              
Sbjct: 920  LEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFD 979

Query: 558  ---KGSLWLKTSS------------------------------KSLC------MHNEFEM 617
               K   +LKT S                              K L       +   F+M
Sbjct: 980  SFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDM 1039

Query: 618  SMMGELSFFLGFQI--KQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDK-- 677
              +G     LG +I  ++    +++SQEKY   +L++F +   +   T ++   KL K  
Sbjct: 1040 KDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKM 1099

Query: 678  -----DEKGKCVDIKTYRGMIKSLLY-LTASKPDIMFSVCLCARFQSCPKESHFHAVKRI 705
                 +EKG    +  Y   + SL+Y +  ++PDI  +V + +RF   P + H+ AVK I
Sbjct: 1100 CPTTVEEKGNMAKV-PYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWI 1159

BLAST of CSPI07G11750 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 238.8 bits (608), Expect = 2.1e-61
Identity = 149/479 (31.11%), Postives = 244/479 (50.94%), Query Frame = 0

Query: 305  EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLV-PRPYNASIIGTKWVFRNKMDENGNI 364
            EPR+   A  D+ W  AM  E+N    N    LV P P + +I+G +W+F  K + +G++
Sbjct: 938  EPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSL 997

Query: 365  IRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGY 424
             R KARLVA+G+ Q   + Y ETF+ V +  +IR++L  A  +++   Q+DV  AFL G 
Sbjct: 998  NRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGT 1057

Query: 425  IVEEVYVEQPLGF-------------------------------------------EKGS 484
            + +EVY+ QP GF                                              S
Sbjct: 1058 LTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTS 1117

Query: 485  LWLKTSSKSL-------------------------CMHNEFEMSMMGELSFFLGFQIKQL 544
            L++    +S+                          +   F +    +L +FLG + K++
Sbjct: 1118 LFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRV 1177

Query: 545  KDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLY 604
               + +SQ +YT +LL +  +   +   T M+T+ KL      K  D   YRG++ SL Y
Sbjct: 1178 PQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTEYRGIVGSLQY 1237

Query: 605  LTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSD 664
            L  ++PD+ ++V   +++   P + H++A+KR+L+YL GT D G++  +    +L  YSD
Sbjct: 1238 LAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSD 1297

Query: 665  ANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQI 714
            A++AG   D+ ST+    +LG   +SW SKKQ  V  S+TEAEY ++A+  +++ W+  +
Sbjct: 1298 ADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSL 1357


HSP 2 Score: 62.8 bits (151), Expect = 2.1e-08
Identity = 31/85 (36.47%), Postives = 51/85 (60.00%), Query Frame = 0

Query: 139 KDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYV 198
           +D+  ++G SH  S P TP+ NG+ ERK+R + E   ++L+   +PK +W  A + A Y+
Sbjct: 580 RDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYL 639

Query: 199 SNRVLVRPSLD-KTPYELWHGKIPN 223
            NR L  P L  ++P++   G+ PN
Sbjct: 640 INR-LPTPLLQLQSPFQKLFGQPPN 663

BLAST of CSPI07G11750 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 238.4 bits (607), Expect = 2.8e-61
Identity = 155/486 (31.89%), Postives = 245/486 (50.41%), Query Frame = 0

Query: 298  LAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLV-PRPYNASIIGTKWVFRNK 357
            ++  ++ EPR+   A  DE W  AM  E+N    N    LV P P + +I+G +W+F  K
Sbjct: 948  VSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKK 1007

Query: 358  MDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVK 417
             + +G++ R KARLVA+G+ Q   + Y ETF+ V +  +IR++L  A  +++   Q+DV 
Sbjct: 1008 YNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVN 1067

Query: 418  CAFLNGYIVEEVYVEQPLGF---------------------------------------- 477
             AFL G + ++VY+ QP GF                                        
Sbjct: 1068 NAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFV 1127

Query: 478  ---EKGSLWLKTSSKSL------------------CMHN-------EFEMSMMGELSFFL 537
                  SL++    KS+                   +HN        F +    EL +FL
Sbjct: 1128 NSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFL 1187

Query: 538  GFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRG 597
            G + K++   + +SQ +Y  +LL +  +   +   T M+ + KL      K  D   YRG
Sbjct: 1188 GIEAKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRG 1247

Query: 598  MIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEF 657
            ++ SL YL  ++PDI ++V   ++F   P E H  A+KRIL+YL GT + G++  +    
Sbjct: 1248 IVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTL 1307

Query: 658  NLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAK 714
            +L  YSDA++AG   D+ ST+    +LG   +SW SKKQ  V  S+TEAEY ++A+  ++
Sbjct: 1308 SLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSE 1367


HSP 2 Score: 58.9 bits (141), Expect = 3.0e-07
Identity = 31/87 (35.63%), Postives = 49/87 (56.32%), Query Frame = 0

Query: 137 AFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTAC 196
           A  ++  ++G SH  S P TP+ NG+ ERK+R + E   ++L+   +PK +W  A   A 
Sbjct: 599 ALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAV 658

Query: 197 YVSNRVLVRPSLD-KTPYELWHGKIPN 223
           Y+ NR L  P L  ++P++   G  PN
Sbjct: 659 YLINR-LPTPLLQLESPFQKLFGTSPN 684

BLAST of CSPI07G11750 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 1.9e-25
Identity = 68/203 (33.50%), Postives = 114/203 (56.16%), Query Frame = 0

Query: 452 MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTK 511
           + + F M  +G + +FLG QIK     +F+SQ KY   +L     N G +    MST   
Sbjct: 27  LSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQILN----NAGMLDCKPMSTPLP 86

Query: 512 LDKDEK---GKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRI 571
           L  +      K  D   +R ++ +L YLT ++PDI ++V +  +    P  + F  +KR+
Sbjct: 87  LKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVNIVCQRMHEPTLADFDLLKRV 146

Query: 572 LKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQN 631
           L+Y+ GTI  GL+  +N + N+  + D+++AG     +ST+  C FLG +++SW +K+Q 
Sbjct: 147 LRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQP 206

Query: 632 SVALSTTEAEYIAIASCCAKIIW 652
           +V+ S+TE EY A+A   A++ W
Sbjct: 207 TVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI07G11750 vs. ExPASy TrEMBL
Match: A0A438GI90 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2030 PE=4 SV=1)

HSP 1 Score: 723.8 bits (1867), Expect = 8.1e-205
Identity = 392/731 (53.63%), Postives = 473/731 (64.71%), Query Frame = 0

Query: 135  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
            N  F+++C ++G +HNFS+PRTPQQNGVVERKNRTLQE AR+MLNE  LPKYFW EAVNT
Sbjct: 310  NFDFEEYCNKHGINHNFSAPRTPQQNGVVERKNRTLQEMARTMLNENNLPKYFWAEAVNT 369

Query: 195  ACYVSNRVLVRPSLDKTPYELWHGKIPNIG------------------------------ 254
            +CYV NR+L+RP L KTPYELW  K PNI                               
Sbjct: 370  SCYVLNRILLRPILKKTPYELWKNKKPNISYFKVFGCKCFILNTKDNLGKFDAKSDVGIF 429

Query: 255  ------------------------------------------DD--LEKDFGDLLVNDKG 314
                                                      DD  LE   G L + DK 
Sbjct: 430  LGYSTSSKAFRVFNKRTMVVEESIHVIFDESNNSLQERESVDDDLGLETSMGKLQIEDKR 489

Query: 315  KEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSS 374
            ++      P  +D  +      + + E S  LPK+W++ ++HP+D I+GNP  GV+TRSS
Sbjct: 490  QQEESGEDPKKEDSPLALPPPQQVQGESSQDLPKDWKFVINHPQDQIIGNPSSGVRTRSS 549

Query: 375  L-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTK 434
            L N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFE ++V +LVPRP N S+IGTK
Sbjct: 550  LRNICNNLAFISQIEPKNIKDAIVDENWMIAMQEELNQFERSEVWELVPRPSNQSVIGTK 609

Query: 435  WVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIF 494
            WVFRNKMDENG I+RNKARLVAQG+ QEE I YEETFA VARLEAIRMLLAFA +K FI 
Sbjct: 610  WVFRNKMDENGIIVRNKARLVAQGYNQEEGIDYEETFAPVARLEAIRMLLAFACFKDFIL 669

Query: 495  YQMDVKCAFLNGYIVEEVYVEQPLGFE--------------------------------- 554
            YQMDVK AFLNG+I EEVYVEQP GF+                                 
Sbjct: 670  YQMDVKSAFLNGFINEEVYVEQPPGFQSFNFPNHVFKLKKALYGLKQAPRAWYERLSKFL 729

Query: 555  --KG--------SLWLKTSSK-------------------------SLCMHNEFEMSMMG 614
              KG        +L++KT  K                         S CMH+EFEMSMMG
Sbjct: 730  LKKGFKMGKIDTTLFIKTKEKDMLLVQIYVDDIIFGATNDSLCEDFSKCMHSEFEMSMMG 789

Query: 615  ELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVD 674
            EL++FLG QIKQLK+  FI+Q KY ++LLK+F + + +V KT MS++ KLD DEKGK +D
Sbjct: 790  ELNYFLGLQIKQLKEGTFINQAKYIKDLLKRFNMEEAKVMKTPMSSSIKLDMDEKGKSID 849

Query: 675  IKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWY 714
               YRGMI SLLYLTAS+PDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+++GLWY
Sbjct: 850  STMYRGMIGSLLYLTASRPDIMYSVCLCARFQSCPKESHLSAVKRILRYLKGTMNIGLWY 909

BLAST of CSPI07G11750 vs. ExPASy TrEMBL
Match: A0A151QU14 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_045365 PE=4 SV=1)

HSP 1 Score: 662.5 bits (1708), Expect = 2.2e-186
Identity = 370/719 (51.46%), Postives = 450/719 (62.59%), Query Frame = 0

Query: 135 NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
           N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNT
Sbjct: 257 NKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKLPKYFWAEAVNT 316

Query: 195 ACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------------------DLEKDFGD 254
           ACY  NR L+RP L KTPYEL++G+ PNI                       D + D G 
Sbjct: 317 ACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNLGKFDAKSDEGI 376

Query: 255 LL---VNDKG--------------------------------KEIVPSMQDVNIIEKK-- 314
            L   +N K                                  EIV S +D +I E+   
Sbjct: 377 FLGYSLNSKSFRIYNKRTMTIEESIHVVFDETNLVCPRRDIIDEIVESFEDTHINEQTHK 436

Query: 315 ------------EEGSSSL--PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFV 374
                       +EG +++   +EWR + +HP + I+G+  +GV TR+SL    +N++FV
Sbjct: 437 DDKDKEKEDSTIQEGQTNINPQREWRISRNHPLENIIGDITKGVITRNSLKEACNNMSFV 496

Query: 375 SQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG 434
           S+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Sbjct: 497 SEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGTKWIFRNKLDEHG 556

Query: 435 NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLN 494
            +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLN
Sbjct: 557 LVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVKSAFLN 616

Query: 495 GYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLC---------- 554
           G+I EEVYVEQP GFE                        W +  SK L           
Sbjct: 617 GFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFTRGKVD 676

Query: 555 --------------------------------------MHNEFEMSMMGELSFFLGFQIK 614
                                                 M +EFEMSMMGEL+FFLG QI+
Sbjct: 677 TTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFLGLQIR 736

Query: 615 QLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSL 674
           Q K+ IFI+Q KY + LLK+F +   +   T MSTT  LDKDE GK +D+K YRGMI SL
Sbjct: 737 QTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRGMIGSL 796

Query: 675 LYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY 713
           LYL+AS+PDIMFSVC CAR+QS PKESH  AVKRI++YLL T ++GLWYP+N+ FNL+GY
Sbjct: 797 LYLSASRPDIMFSVCFCARYQSNPKESHLSAVKRIMRYLLRTTNLGLWYPKNMSFNLVGY 856

BLAST of CSPI07G11750 vs. ExPASy TrEMBL
Match: A0A151TIF5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_013123 PE=4 SV=1)

HSP 1 Score: 662.5 bits (1708), Expect = 2.2e-186
Identity = 370/719 (51.46%), Postives = 451/719 (62.73%), Query Frame = 0

Query: 135 NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
           N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNT
Sbjct: 146 NKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKLPKYFWAEAVNT 205

Query: 195 ACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------------------DLEKDFGD 254
           ACY  NR L+RP L KTPYEL++G+ PNI                       D + D G 
Sbjct: 206 ACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNLGKFDAKSDEGI 265

Query: 255 LL---VNDKG--------------------------------KEIVPSMQDVNIIEKK-- 314
            L   +N K                                  EIV S +D ++ E+   
Sbjct: 266 FLGYSLNSKSFRIYNKRTMTIEESVHVVFDETNLVCPRRDVFDEIVESFEDTHLNEQTHK 325

Query: 315 ------------EEGSSSL--PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFV 374
                       +EG +++   +EWR + +HP + I+G+  +GV TR+SL    +N++FV
Sbjct: 326 DDKDKEKEDSTIQEGQTNINSEREWRISRNHPLENIIGDITKGVITRNSLKEACNNMSFV 385

Query: 375 SQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG 434
           S+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Sbjct: 386 SEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGTKWIFRNKLDEHG 445

Query: 435 NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLN 494
            +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLN
Sbjct: 446 LVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVKSAFLN 505

Query: 495 GYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLC---------- 554
           G+I EEVYVEQP GFE                        W +  SK L           
Sbjct: 506 GFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFTRGKVD 565

Query: 555 --------------------------------------MHNEFEMSMMGELSFFLGFQIK 614
                                                 M +EFEMSMMGEL+FFLG QI+
Sbjct: 566 TTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFLGLQIR 625

Query: 615 QLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSL 674
           Q K+ IFI+Q KY + LLK+F +   +   T MSTT  LDKDE GK +D+K YRGMI SL
Sbjct: 626 QTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRGMIGSL 685

Query: 675 LYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY 713
           LYL+AS+PDIMFSVCLCAR+QS PKESH  AVKRI++ LLGT ++GLWYP+N+ FNL+GY
Sbjct: 686 LYLSASRPDIMFSVCLCARYQSNPKESHLSAVKRIMRCLLGTTNLGLWYPKNMPFNLVGY 745

BLAST of CSPI07G11750 vs. ExPASy TrEMBL
Match: A5C8K0 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_001808 PE=4 SV=1)

HSP 1 Score: 653.7 bits (1685), Expect = 1.0e-183
Identity = 355/670 (52.99%), Postives = 439/670 (65.52%), Query Frame = 0

Query: 135  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
            N  F+++C + G +HNF +PRT QQNGVVERKNRTLQE AR+MLNE  LPKYFW EA+NT
Sbjct: 740  NFDFEEYCNKYGINHNFLAPRTSQQNGVVERKNRTLQEMARTMLNENNLPKYFWAEAINT 799

Query: 195  ACYVSNRVLVRPSLDKTPYELWHGKIPNIG--------------------DDLEKDFGDL 254
            +CYV NR+L+RP L KTPYELW  K PNI                      D + D G  
Sbjct: 800  SCYVLNRILLRPILKKTPYELWKNKKPNISYFKVFGCKCFILNTKDNLGKFDAKSDVGIF 859

Query: 255  L-----------VNDKGKEIVPSMQDVNI--------IEKKEEGSSSLPKE----WRYAL 314
            L            N +   +  S+ D  +         +  ++    +P++    W Y L
Sbjct: 860  LGYSTSSKAFRVFNKRTMVVEESIHDWRLPWENCKLRTKDNKKKVERIPRKKNHLWHYLL 919

Query: 315  --------SHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAM 374
                    +HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AM
Sbjct: 920  LNKCKFVINHPQDQIIGNPLSGVRTRSSLRNICNNLAFISQIEPKNIKDAIVDENWMIAM 979

Query: 375  QEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIY 434
            Q+ELNQFE ++V +LVPRP N S+IGTKWVFRNKMDENG I+RNKARLVAQG+ QEE I 
Sbjct: 980  QKELNQFERSEVWELVPRPSNQSVIGTKWVFRNKMDENGIIVRNKARLVAQGYNQEEGID 1039

Query: 435  YEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL- 494
            YEETFA VARLEAIRMLLAFA +K FI YQMDVK AFLNG+I EE+YVEQP GF+  +  
Sbjct: 1040 YEETFAPVARLEAIRMLLAFACFKDFILYQMDVKSAFLNGFINEEIYVEQPPGFQSFNFP 1099

Query: 495  -------------------WLKTSSKSLCMHNEFEMSMMGELSFFLGFQIKQLKDDIFIS 554
                               W +  SK L +   F+M  +    F    +   L   I++ 
Sbjct: 1100 NHVFKLKKALYGLKQAPRAWYERLSKFL-LKKSFKMGKIDTTLFIKTKENDMLLVQIYVD 1159

Query: 555  -------------------QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDI 614
                                 KY ++LLK+F + + +V KT MS++ KLD DEKGK +D 
Sbjct: 1160 DITFGATNDSLCEDFSKCMHTKYIKDLLKRFNMGEAKVMKTPMSSSIKLDMDEKGKSIDS 1219

Query: 615  KTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYP 674
              YRGMI SLLYLTAS+PDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+ +GLWYP
Sbjct: 1220 TMYRGMIGSLLYLTASRPDIMYSVCLCARFQSCPKESHLSAVKRILRYLKGTMSIGLWYP 1279

Query: 675  RNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIA 714
            +   F LIG+SDA+FAG  ++ KSTS TC  LG SLVSW SKKQNS+ALST EAEY A +
Sbjct: 1280 KGDNFELIGFSDADFAGCRVERKSTSGTCHSLGHSLVSWHSKKQNSIALSTAEAEYTAAS 1339

BLAST of CSPI07G11750 vs. ExPASy TrEMBL
Match: A0A151TAG4 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_018591 PE=4 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 2.5e-182
Identity = 367/719 (51.04%), Postives = 446/719 (62.03%), Query Frame = 0

Query: 135 NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
           N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNT
Sbjct: 267 NKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKLPKYFWAEAVNT 326

Query: 195 ACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------------------DLEKDFGD 254
           ACY  NR L+RP L KTPYEL++G+ PNI                       D + D G 
Sbjct: 327 ACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNLGKFDAKSDEGI 386

Query: 255 LL---VNDKG--------------------------------KEIVPSMQDVNIIEKK-- 314
            L   +N K                                  EIV S +D +I E+   
Sbjct: 387 FLGYSLNSKSFRIYNKRTMTIEESIHVVFDETNLVCPRRDIIDEIVESFEDTHINEQTHK 446

Query: 315 ------------EEGSSSL--PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFV 374
                       +EG +++   +EWR + +HP + I+G+  +GV TR+SL    +N++FV
Sbjct: 447 DDKDKEKEDSTIQEGQTNINSQREWRISRNHPLENIIGDITKGVITRNSLKEACNNMSFV 506

Query: 375 SQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG 434
           S+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Sbjct: 507 SEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGTKWIFRNKLDEHG 566

Query: 435 NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLN 494
            +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLN
Sbjct: 567 LVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVKSAFLN 626

Query: 495 GYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLC---------- 554
           G+I EEVYVEQP GFE                        W +  SK L           
Sbjct: 627 GFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFTRGKVD 686

Query: 555 --------------------------------------MHNEFEMSMMGELSFFLGFQIK 614
                                                 M +EFEMSMMGEL+FFLG QI+
Sbjct: 687 TTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFLGLQIR 746

Query: 615 QLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSL 674
           Q K+ IFI+Q KY + LLK+F +   +   T MSTT  LDKDE GK +D+K YRGMI SL
Sbjct: 747 QTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRGMIGSL 806

Query: 675 LYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY 713
           LYL+ S+P+IMFSVCLC R+QS PKESH  AVKRI++YLLGT ++GLWY +N+ FNL+GY
Sbjct: 807 LYLSTSRPNIMFSVCLCTRYQSNPKESHLSAVKRIMRYLLGTTNLGLWYSKNMPFNLVGY 866

BLAST of CSPI07G11750 vs. NCBI nr
Match: RVW71911.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 723.8 bits (1867), Expect = 1.7e-204
Identity = 392/731 (53.63%), Postives = 473/731 (64.71%), Query Frame = 0

Query: 135  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
            N  F+++C ++G +HNFS+PRTPQQNGVVERKNRTLQE AR+MLNE  LPKYFW EAVNT
Sbjct: 310  NFDFEEYCNKHGINHNFSAPRTPQQNGVVERKNRTLQEMARTMLNENNLPKYFWAEAVNT 369

Query: 195  ACYVSNRVLVRPSLDKTPYELWHGKIPNIG------------------------------ 254
            +CYV NR+L+RP L KTPYELW  K PNI                               
Sbjct: 370  SCYVLNRILLRPILKKTPYELWKNKKPNISYFKVFGCKCFILNTKDNLGKFDAKSDVGIF 429

Query: 255  ------------------------------------------DD--LEKDFGDLLVNDKG 314
                                                      DD  LE   G L + DK 
Sbjct: 430  LGYSTSSKAFRVFNKRTMVVEESIHVIFDESNNSLQERESVDDDLGLETSMGKLQIEDKR 489

Query: 315  KEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSS 374
            ++      P  +D  +      + + E S  LPK+W++ ++HP+D I+GNP  GV+TRSS
Sbjct: 490  QQEESGEDPKKEDSPLALPPPQQVQGESSQDLPKDWKFVINHPQDQIIGNPSSGVRTRSS 549

Query: 375  L-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTK 434
            L N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFE ++V +LVPRP N S+IGTK
Sbjct: 550  LRNICNNLAFISQIEPKNIKDAIVDENWMIAMQEELNQFERSEVWELVPRPSNQSVIGTK 609

Query: 435  WVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIF 494
            WVFRNKMDENG I+RNKARLVAQG+ QEE I YEETFA VARLEAIRMLLAFA +K FI 
Sbjct: 610  WVFRNKMDENGIIVRNKARLVAQGYNQEEGIDYEETFAPVARLEAIRMLLAFACFKDFIL 669

Query: 495  YQMDVKCAFLNGYIVEEVYVEQPLGFE--------------------------------- 554
            YQMDVK AFLNG+I EEVYVEQP GF+                                 
Sbjct: 670  YQMDVKSAFLNGFINEEVYVEQPPGFQSFNFPNHVFKLKKALYGLKQAPRAWYERLSKFL 729

Query: 555  --KG--------SLWLKTSSK-------------------------SLCMHNEFEMSMMG 614
              KG        +L++KT  K                         S CMH+EFEMSMMG
Sbjct: 730  LKKGFKMGKIDTTLFIKTKEKDMLLVQIYVDDIIFGATNDSLCEDFSKCMHSEFEMSMMG 789

Query: 615  ELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVD 674
            EL++FLG QIKQLK+  FI+Q KY ++LLK+F + + +V KT MS++ KLD DEKGK +D
Sbjct: 790  ELNYFLGLQIKQLKEGTFINQAKYIKDLLKRFNMEEAKVMKTPMSSSIKLDMDEKGKSID 849

Query: 675  IKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWY 714
               YRGMI SLLYLTAS+PDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+++GLWY
Sbjct: 850  STMYRGMIGSLLYLTASRPDIMYSVCLCARFQSCPKESHLSAVKRILRYLKGTMNIGLWY 909

BLAST of CSPI07G11750 vs. NCBI nr
Match: KYP66812.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 662.5 bits (1708), Expect = 4.5e-186
Identity = 370/719 (51.46%), Postives = 451/719 (62.73%), Query Frame = 0

Query: 135 NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
           N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNT
Sbjct: 146 NKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKLPKYFWAEAVNT 205

Query: 195 ACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------------------DLEKDFGD 254
           ACY  NR L+RP L KTPYEL++G+ PNI                       D + D G 
Sbjct: 206 ACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNLGKFDAKSDEGI 265

Query: 255 LL---VNDKG--------------------------------KEIVPSMQDVNIIEKK-- 314
            L   +N K                                  EIV S +D ++ E+   
Sbjct: 266 FLGYSLNSKSFRIYNKRTMTIEESVHVVFDETNLVCPRRDVFDEIVESFEDTHLNEQTHK 325

Query: 315 ------------EEGSSSL--PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFV 374
                       +EG +++   +EWR + +HP + I+G+  +GV TR+SL    +N++FV
Sbjct: 326 DDKDKEKEDSTIQEGQTNINSEREWRISRNHPLENIIGDITKGVITRNSLKEACNNMSFV 385

Query: 375 SQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG 434
           S+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Sbjct: 386 SEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGTKWIFRNKLDEHG 445

Query: 435 NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLN 494
            +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLN
Sbjct: 446 LVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVKSAFLN 505

Query: 495 GYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLC---------- 554
           G+I EEVYVEQP GFE                        W +  SK L           
Sbjct: 506 GFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFTRGKVD 565

Query: 555 --------------------------------------MHNEFEMSMMGELSFFLGFQIK 614
                                                 M +EFEMSMMGEL+FFLG QI+
Sbjct: 566 TTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFLGLQIR 625

Query: 615 QLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSL 674
           Q K+ IFI+Q KY + LLK+F +   +   T MSTT  LDKDE GK +D+K YRGMI SL
Sbjct: 626 QTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRGMIGSL 685

Query: 675 LYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY 713
           LYL+AS+PDIMFSVCLCAR+QS PKESH  AVKRI++ LLGT ++GLWYP+N+ FNL+GY
Sbjct: 686 LYLSASRPDIMFSVCLCARYQSNPKESHLSAVKRIMRCLLGTTNLGLWYPKNMPFNLVGY 745

BLAST of CSPI07G11750 vs. NCBI nr
Match: KYP33754.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 662.5 bits (1708), Expect = 4.5e-186
Identity = 370/719 (51.46%), Postives = 450/719 (62.59%), Query Frame = 0

Query: 135 NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
           N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNT
Sbjct: 257 NKDFDLFCEENGIEHNFSAPRTPQQNGVVERKNRSLEELARTMLNDSKLPKYFWAEAVNT 316

Query: 195 ACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------------------DLEKDFGD 254
           ACY  NR L+RP L KTPYEL++G+ PNI                       D + D G 
Sbjct: 317 ACYTMNRALIRPILKKTPYELFNGRKPNISHLHIFGCKCFVLNNGKDNLGKFDAKSDEGI 376

Query: 255 LL---VNDKG--------------------------------KEIVPSMQDVNIIEKK-- 314
            L   +N K                                  EIV S +D +I E+   
Sbjct: 377 FLGYSLNSKSFRIYNKRTMTIEESIHVVFDETNLVCPRRDIIDEIVESFEDTHINEQTHK 436

Query: 315 ------------EEGSSSL--PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFV 374
                       +EG +++   +EWR + +HP + I+G+  +GV TR+SL    +N++FV
Sbjct: 437 DDKDKEKEDSTIQEGQTNINPQREWRISRNHPLENIIGDITKGVITRNSLKEACNNMSFV 496

Query: 375 SQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG 434
           S+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Sbjct: 497 SEIEVKNIDEALNDEHWINAMQEELNQFERNQVWDLVNRPTNHPIIGTKWIFRNKLDEHG 556

Query: 435 NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLN 494
            +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLN
Sbjct: 557 LVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVKSAFLN 616

Query: 495 GYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLC---------- 554
           G+I EEVYVEQP GFE                        W +  SK L           
Sbjct: 617 GFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFTRGKVD 676

Query: 555 --------------------------------------MHNEFEMSMMGELSFFLGFQIK 614
                                                 M +EFEMSMMGEL+FFLG QI+
Sbjct: 677 TTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFLGLQIR 736

Query: 615 QLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSL 674
           Q K+ IFI+Q KY + LLK+F +   +   T MSTT  LDKDE GK +D+K YRGMI SL
Sbjct: 737 QTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRGMIGSL 796

Query: 675 LYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY 713
           LYL+AS+PDIMFSVC CAR+QS PKESH  AVKRI++YLL T ++GLWYP+N+ FNL+GY
Sbjct: 797 LYLSASRPDIMFSVCFCARYQSNPKESHLSAVKRIMRYLLRTTNLGLWYPKNMSFNLVGY 856

BLAST of CSPI07G11750 vs. NCBI nr
Match: CAN64335.1 (hypothetical protein VITISV_001808 [Vitis vinifera])

HSP 1 Score: 653.7 bits (1685), Expect = 2.1e-183
Identity = 355/670 (52.99%), Postives = 439/670 (65.52%), Query Frame = 0

Query: 135  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNT 194
            N  F+++C + G +HNF +PRT QQNGVVERKNRTLQE AR+MLNE  LPKYFW EA+NT
Sbjct: 740  NFDFEEYCNKYGINHNFLAPRTSQQNGVVERKNRTLQEMARTMLNENNLPKYFWAEAINT 799

Query: 195  ACYVSNRVLVRPSLDKTPYELWHGKIPNIG--------------------DDLEKDFGDL 254
            +CYV NR+L+RP L KTPYELW  K PNI                      D + D G  
Sbjct: 800  SCYVLNRILLRPILKKTPYELWKNKKPNISYFKVFGCKCFILNTKDNLGKFDAKSDVGIF 859

Query: 255  L-----------VNDKGKEIVPSMQDVNI--------IEKKEEGSSSLPKE----WRYAL 314
            L            N +   +  S+ D  +         +  ++    +P++    W Y L
Sbjct: 860  LGYSTSSKAFRVFNKRTMVVEESIHDWRLPWENCKLRTKDNKKKVERIPRKKNHLWHYLL 919

Query: 315  --------SHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAM 374
                    +HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AM
Sbjct: 920  LNKCKFVINHPQDQIIGNPLSGVRTRSSLRNICNNLAFISQIEPKNIKDAIVDENWMIAM 979

Query: 375  QEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIY 434
            Q+ELNQFE ++V +LVPRP N S+IGTKWVFRNKMDENG I+RNKARLVAQG+ QEE I 
Sbjct: 980  QKELNQFERSEVWELVPRPSNQSVIGTKWVFRNKMDENGIIVRNKARLVAQGYNQEEGID 1039

Query: 435  YEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL- 494
            YEETFA VARLEAIRMLLAFA +K FI YQMDVK AFLNG+I EE+YVEQP GF+  +  
Sbjct: 1040 YEETFAPVARLEAIRMLLAFACFKDFILYQMDVKSAFLNGFINEEIYVEQPPGFQSFNFP 1099

Query: 495  -------------------WLKTSSKSLCMHNEFEMSMMGELSFFLGFQIKQLKDDIFIS 554
                               W +  SK L +   F+M  +    F    +   L   I++ 
Sbjct: 1100 NHVFKLKKALYGLKQAPRAWYERLSKFL-LKKSFKMGKIDTTLFIKTKENDMLLVQIYVD 1159

Query: 555  -------------------QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDI 614
                                 KY ++LLK+F + + +V KT MS++ KLD DEKGK +D 
Sbjct: 1160 DITFGATNDSLCEDFSKCMHTKYIKDLLKRFNMGEAKVMKTPMSSSIKLDMDEKGKSIDS 1219

Query: 615  KTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYP 674
              YRGMI SLLYLTAS+PDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+ +GLWYP
Sbjct: 1220 TMYRGMIGSLLYLTASRPDIMYSVCLCARFQSCPKESHLSAVKRILRYLKGTMSIGLWYP 1279

Query: 675  RNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIA 714
            +   F LIG+SDA+FAG  ++ KSTS TC  LG SLVSW SKKQNS+ALST EAEY A +
Sbjct: 1280 KGDNFELIGFSDADFAGCRVERKSTSGTCHSLGHSLVSWHSKKQNSIALSTAEAEYTAAS 1339

BLAST of CSPI07G11750 vs. NCBI nr
Match: XP_042980087.1 (uncharacterized protein LOC122310269, partial [Carya illinoinensis])

HSP 1 Score: 650.2 bits (1676), Expect = 2.3e-182
Identity = 362/649 (55.78%), Postives = 435/649 (67.03%), Query Frame = 0

Query: 134  INDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVN 193
            +N   + FC+ENGF HNFS+PRTPQQNGVVERKNR+LQE AR+MLNE  LP YFW EAV+
Sbjct: 476  VNKNIETFCDENGFIHNFSAPRTPQQNGVVERKNRSLQEMARTMLNENNLPSYFWAEAVS 535

Query: 194  TACYVSNRVLVRPSLDKTPYELWHGKIPNIGDDLEKDFGDLLVNDK---GKEIVPSMQDV 253
            TACYV NRV++R  LDKTPYELW+ K PNIG          ++ND+   GK    S + +
Sbjct: 536  TACYVINRVMLRSKLDKTPYELWNEKKPNIGYFHVFGCKCFILNDRDNLGKFDAKSDEGI 595

Query: 254  NIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFK 313
             +      G S+  K +R  + + K L             ++    ++ F  QIEP++  
Sbjct: 596  FL------GYSTNSKAYR--VFNKKTL-------------TVQESMHVVFDEQIEPKNID 655

Query: 314  DAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARL 373
            DA  DE WILAMQEELNQFE N V  LVPRP N +IIGTKWVFRNK DE+G I RNKARL
Sbjct: 656  DALLDESWILAMQEELNQFERNDVWTLVPRPKNYTIIGTKWVFRNKKDESGVITRNKARL 715

Query: 374  VAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYV 433
            VAQGF QEE I Y+ET+A VARLEAIRMLLA+A YK F  +QMDVK AFLNG+I EEVYV
Sbjct: 716  VAQGFNQEEGIDYDETYAPVARLEAIRMLLAYACYKDFKLFQMDVKSAFLNGFINEEVYV 775

Query: 434  EQPLGF-----------------------------------EKG--------SLWLK--- 493
            EQP GF                                   EKG        +L++K   
Sbjct: 776  EQPPGFENHISPNHVFKLTKALYGLKQAPRAWYERLSGFLIEKGFSREKIDTTLFIKYEN 835

Query: 494  ----------------TSSKSLC------MHNEFEMSMMGELSFFLGFQIKQLKDDIFIS 553
                             +++++C      M  EFEMSMMGEL+FFLG QIKQ K   FI+
Sbjct: 836  DDILLIQIYVDDIIFGATNENMCQVFAKTMQEEFEMSMMGELTFFLGLQIKQAKSGTFIN 895

Query: 554  QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPD 613
            Q KY + LLKKF +   +   T MS +TKLDKDE GK VD K YRGMI SLLYLTAS+PD
Sbjct: 896  QSKYIKELLKKFGMENAKEIGTPMSPSTKLDKDESGKPVDSKIYRGMIGSLLYLTASRPD 955

Query: 614  IMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSL 673
            IMFSVCLCARFQS PKESH  AVKRIL+YL GTI++GLWYP++  F+LI Y+DA++AG  
Sbjct: 956  IMFSVCLCARFQSSPKESHLIAVKRILRYLSGTINLGLWYPKHTSFDLISYTDADYAGCK 1015

Query: 674  LDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLK 712
            +D KSTS  C FLG +LVSWFSKKQNSVALSTTEAEY+A  SCCA++++MKQ L DF L 
Sbjct: 1016 IDRKSTSGACHFLGHALVSWFSKKQNSVALSTTEAEYVAAGSCCAQVLYMKQQLEDFKLM 1075

BLAST of CSPI07G11750 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 223.4 bits (568), Expect = 6.6e-58
Identity = 151/508 (29.72%), Postives = 245/508 (48.23%), Query Frame = 0

Query: 305 EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNII 364
           EP ++ +A+    W  AM +E+   E     ++   P N   IG KWV++ K + +G I 
Sbjct: 85  EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 144

Query: 365 RNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYI 424
           R KARLVA+G+ Q+E I + ETF+ V +L +++++LA ++   F  +Q+D+  AFLNG +
Sbjct: 145 RYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDL 204

Query: 425 VEEVYVEQPLGF----------------EKGSLWLKTSSKS------------------- 484
            EE+Y++ P G+                +K    LK +S+                    
Sbjct: 205 DEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHS 264

Query: 485 ------------------------LCMHNE-------------FEMSMMGELSFFLGFQI 544
                                   +C +N+             F++  +G L +FLG +I
Sbjct: 265 DHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEI 324

Query: 545 KQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKS 604
            +    I I Q KY  +LL +  L   + +   M  +        G  VD K YR +I  
Sbjct: 325 ARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGR 384

Query: 605 LLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIG 664
           L+YL  ++ DI F+V   ++F   P+ +H  AV +IL Y+ GT+  GL+Y    E  L  
Sbjct: 385 LMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQV 444

Query: 665 YSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWM 724
           +SDA+F       +ST+  C FLG+SL+SW SKKQ  V+ S+ EAEY A++    +++W+
Sbjct: 445 FSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWL 504

Query: 725 KQILCDFGLKFDN-VPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREH-VQNGHITLE 735
            Q   +  L       +FCDN +AI++  N + H RTKHI+   H +RE  V    ++  
Sbjct: 505 AQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSVYQATLSYS 564

BLAST of CSPI07G11750 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 119.4 bits (298), Expect = 1.3e-26
Identity = 68/203 (33.50%), Postives = 114/203 (56.16%), Query Frame = 0

Query: 452 MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTK 511
           + + F M  +G + +FLG QIK     +F+SQ KY   +L     N G +    MST   
Sbjct: 27  LSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQILN----NAGMLDCKPMSTPLP 86

Query: 512 LDKDEK---GKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRI 571
           L  +      K  D   +R ++ +L YLT ++PDI ++V +  +    P  + F  +KR+
Sbjct: 87  LKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVNIVCQRMHEPTLADFDLLKRV 146

Query: 572 LKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQN 631
           L+Y+ GTI  GL+  +N + N+  + D+++AG     +ST+  C FLG +++SW +K+Q 
Sbjct: 147 LRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQP 206

Query: 632 SVALSTTEAEYIAIASCCAKIIW 652
           +V+ S+TE EY A+A   A++ W
Sbjct: 207 TVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI07G11750 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 91.7 bits (226), Expect = 3.0e-18
Identity = 49/99 (49.49%), Postives = 62/99 (62.63%), Query Frame = 0

Query: 305 EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNII 364
           EP+S   A  D  W  AMQEEL+    NK   LVP P N +I+G KWVF+ K+  +G + 
Sbjct: 27  EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 365 RNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFA 404
           R KARLVA+GF QEE IY+ ET++ V R   IR +L  A
Sbjct: 87  RLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI07G11750 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 45.4 bits (106), Expect = 2.5e-04
Identity = 23/55 (41.82%), Postives = 30/55 (54.55%), Query Frame = 0

Query: 167 NRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIP 222
           NRT+ E  RSML E GLPK F  +A NTA ++ N+          P E+W   +P
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVP 56

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P041468.1e-6926.86Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P109781.8e-6829.76Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q9ZT942.1e-6131.11Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW22.8e-6131.89Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P925191.9e-2533.50Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A438GI908.1e-20553.63Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A151QU142.2e-18651.46Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A0A151TIF52.2e-18651.46Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A5C8K01.0e-18352.99Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_001808 PE=4 SV=1[more]
A0A151TAG42.5e-18251.04Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Cajanu... [more]
Match NameE-valueIdentityDescription
RVW71911.11.7e-20453.63Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
KYP66812.14.5e-18651.46Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
KYP33754.14.5e-18651.46Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
CAN64335.12.1e-18352.99hypothetical protein VITISV_001808 [Vitis vinifera][more]
XP_042980087.12.3e-18255.78uncharacterized protein LOC122310269, partial [Carya illinoinensis][more]
Match NameE-valueIdentityDescription
AT4G23160.16.6e-5829.72cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.11.3e-2633.50DNA/RNA polymerases superfamily protein [more]
ATMG00820.13.0e-1849.49Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00710.12.5e-0441.82Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 121..238
e-value: 1.6E-23
score: 85.0
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 332..438
e-value: 9.3E-30
score: 104.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 787..807
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 135..216
coord: 448..575
coord: 305..437
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 591..713
e-value: 7.90233E-64
score: 208.091
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 121..220
score: 14.923328
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 135..221
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 340..694

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G11750.1CSPI07G11750.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding