CmoCh10G002580 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh10G002580
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Gag/pol
Location: Cmo_Chr10: 1142655 .. 1144844 (+)
RNA-Seq Expression: CmoCh10G002580
Synteny: CmoCh10G002580
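
The Location field above can be turned into a feature length with simple arithmetic. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive (as in GFF-style annotations), and the helper name `parse_location` is hypothetical rather than part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into typed parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cmo_Chr10: 1142655 .. 1144844 (+)")

# Assumption: both endpoints are part of the feature (inclusive coordinates).
span = end - start + 1
print(chrom, strand, span)        # Cmo_Chr10 + 2190
print(span % 3 == 0, span // 3)   # True 730
```

Under that assumption the feature spans 2190 nt, a multiple of three. If the whole span is coding, that corresponds to 730 codons including the stop codon.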
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTAGGGTGTTTGGCTGTGCTTGCTGGCCTAATTTAAGGCCTTACAACAACAAGAAACTGAGTTTCAGAACTACTAGATGTATATTCTTGGGTTATAGTTCTTCTCATAAGGGATATAAATGCTTAAATAGAAGTACAGGACGTATTTACATCTCTAGAGACGTGGTTTTCGATGAAAATATTTTTCCTTTTGAAGAATCTAAGCCACCAAACAAAACCACAAATCCACATCATCCTGTTCTACTTCCAGCCTTAGCCAAACTTGCTAGTTTTTACACTGAAAATGCTCTTACAGATATTGAACCAGTTGTTAGTAATTCCCATATGAATGATGGTCAAACTGATAATATTGCTAGTGACAACTTGTCTGGTGTCAGCTTATCTTCTGCAGATAATACAAGAAGTTCAGAGGAAATTGCAGAATATGAAGCTGAGAGCAGTTCGATCAATGCTCAAAACCAAACTCATGAACATGTGTCTGATCAACCAACTGAAGCAGCTAGTCAACATCCAATGCGAACAAGGTTGAGAAATAACATTGTACAAGCTAAACAATTCACTGATGGAACTATCAGATATTCAGAAACCTCAAGAAAATTCGCAAGCGCTGTAACTATCACAACTCCGATCATAGAGACTGCTACTGAACCTCGAAACCTGCAGGAAGCCATGCAACATCCAAGATGGAGAGGAGCAATGAATGATGAGCTCTCAGCGCTAAAACGAAATGCCACTTGGGATCTAGTTCCACCCAAACCTGGAATAAATCTCATTGATAGTAAATGGGTGTATAAAGTGAAAAGAAAAGCAGATGGGTCAGTTGAAAGATTAAAAGCAAGATTAGTTGCCAAAGGATTCAAGCAAAGATTTGGTGTTGATTACACTGATACTTTTAGCCCTGTGATCAAACCGTCAACAATCAGGGTCATTCTTTCGCTAGCAGTAACCAAGGGCTGGAATATGAGACAAGTTGATATCCAAAATGCATTTTTGCATGGAATTCTGAAAGAGGAAGTGTACATGCGACAACCACCAGGATTTCAAGACTCAGCCAAACCAAAGAATTACATATGCAAGCTCAAGAAAGCCCTTTATGGCCTGAAACAAGCCCCAAAAGCTTGGCATTCAAGGTTGACTGGAAAACTTATTGAGTTAGGCTTCAAGGCTTCAGTAGCTGATTCATCTCTTTTTATTCTCAAAAACAGAGAGATAACTATCTATATGCTCATCTATGTTGATGATATAATTATTGTGAGCTCCTCTGATCAAGCAACCGAAAGGTTGATTCAGAAATTGAAAATAGATTTTGCAGTAAAAGATTTGGGTGGTCTTGAGTATTTTCTGGGTATTGAAGTCAAGAAAACACGAGATGGTATCATACTGTCACAGAGACGATATGCCTTAGATTTGTTGAAAAGAGTAAACATGGAAAAATGCAAACCTATGTCTACACCAATGGGTTCTGCTGAAAAATTATTCAGAGAACAAGGAATACCCTTATCAGCTGAAGAACAATTCAAATACAGAAGTACAGTGGGAGCACTACAATATTTGACAATGACTAGGCCTGATTTGGCATTTGCTGTCAATAAAGTGTGTCAATATCTTCATACACCTACTGATGCTCATTGGGGTGCTGTGAAGAGAATTCTTCGTTATGTTAAAGGCACACTAGCATTAGGAGTGAAAATTCAGAAATCAACCATGATGTTGTCGGGGTTTTCTGATGCTGATTGGGCTGGTTGTCCCGATGATCGACGTTCAACTAGCGGCTTTGCTGTATTTCTTGGAGCAAATCTAATCTCATGGAGTTCCAGAAAACAGGCTACAGTGTCAAGATCAAGCACCGAAGCAGAATACAAGGCCATTGCGAATCTTACTGCAGAAATGATTTGGATCAAGTCATTACTGAAGGAACTGGGCGTGTATCAATCAAAGGCTCCTCGCCTCTGGTGTGACAACCTCGGAGCTACATATTTAACTTCAAATCCAGTATT
TCATGCTAGAACGAAACATATTGAAGTTGATTTTCATTTTGTTCGAGAACAAGTAGCACGTAAAGCAATGGAAGTTCGGTTCATTTCATCAAGTGATCAAGTAGCTGATATCCTGACAAAACCACTGTCTAAAACTCCTTTTACTACACATTGTAACAATCTCAACATGTACAAGACTTGTTGGGATTGA

mRNA sequence

ATGCTTAGGGTGTTTGGCTGTGCTTGCTGGCCTAATTTAAGGCCTTACAACAACAAGAAACTGAGTTTCAGAACTACTAGATGTATATTCTTGGGTTATAGTTCTTCTCATAAGGGATATAAATGCTTAAATAGAAGTACAGGACGTATTTACATCTCTAGAGACGTGGTTTTCGATGAAAATATTTTTCCTTTTGAAGAATCTAAGCCACCAAACAAAACCACAAATCCACATCATCCTGTTCTACTTCCAGCCTTAGCCAAACTTGCTAGTTTTTACACTGAAAATGCTCTTACAGATATTGAACCAGTTGTTAGTAATTCCCATATGAATGATGGTCAAACTGATAATATTGCTAGTGACAACTTGTCTGGTGTCAGCTTATCTTCTGCAGATAATACAAGAAGTTCAGAGGAAATTGCAGAATATGAAGCTGAGAGCAGTTCGATCAATGCTCAAAACCAAACTCATGAACATGTGTCTGATCAACCAACTGAAGCAGCTAGTCAACATCCAATGCGAACAAGGTTGAGAAATAACATTGTACAAGCTAAACAATTCACTGATGGAACTATCAGATATTCAGAAACCTCAAGAAAATTCGCAAGCGCTGTAACTATCACAACTCCGATCATAGAGACTGCTACTGAACCTCGAAACCTGCAGGAAGCCATGCAACATCCAAGATGGAGAGGAGCAATGAATGATGAGCTCTCAGCGCTAAAACGAAATGCCACTTGGGATCTAGTTCCACCCAAACCTGGAATAAATCTCATTGATAGTAAATGGGTGTATAAAGTGAAAAGAAAAGCAGATGGGTCAGTTGAAAGATTAAAAGCAAGATTAGTTGCCAAAGGATTCAAGCAAAGATTTGGTGTTGATTACACTGATACTTTTAGCCCTGTGATCAAACCGTCAACAATCAGGGTCATTCTTTCGCTAGCAGTAACCAAGGGCTGGAATATGAGACAAGTTGATATCCAAAATGCATTTTTGCATGGAATTCTGAAAGAGGAAGTGTACATGCGACAACCACCAGGATTTCAAGACTCAGCCAAACCAAAGAATTACATATGCAAGCTCAAGAAAGCCCTTTATGGCCTGAAACAAGCCCCAAAAGCTTGGCATTCAAGGTTGACTGGAAAACTTATTGAGTTAGGCTTCAAGGCTTCAGTAGCTGATTCATCTCTTTTTATTCTCAAAAACAGAGAGATAACTATCTATATGCTCATCTATGTTGATGATATAATTATTGTGAGCTCCTCTGATCAAGCAACCGAAAGGTTGATTCAGAAATTGAAAATAGATTTTGCAGTAAAAGATTTGGGTGGTCTTGAGTATTTTCTGGGTATTGAAGTCAAGAAAACACGAGATGGTATCATACTGTCACAGAGACGATATGCCTTAGATTTGTTGAAAAGAGTAAACATGGAAAAATGCAAACCTATGTCTACACCAATGGGTTCTGCTGAAAAATTATTCAGAGAACAAGGAATACCCTTATCAGCTGAAGAACAATTCAAATACAGAAGTACAGTGGGAGCACTACAATATTTGACAATGACTAGGCCTGATTTGGCATTTGCTGTCAATAAAGTGTGTCAATATCTTCATACACCTACTGATGCTCATTGGGGTGCTGTGAAGAGAATTCTTCGTTATGTTAAAGGCACACTAGCATTAGGAGTGAAAATTCAGAAATCAACCATGATGTTGTCGGGGTTTTCTGATGCTGATTGGGCTGGTTGTCCCGATGATCGACGTTCAACTAGCGGCTTTGCTGTATTTCTTGGAGCAAATCTAATCTCATGGAGTTCCAGAAAACAGGCTACAGTGTCAAGATCAAGCACCGAAGCAGAATACAAGGCCATTGCGAATCTTACTGCAGAAATGATTTGGATCAAGTCATTACTGAAGGAACTGGGCGTGTATCAATCAAAGGCTCCTCGCCTCTGGTGTGACAACCTCGGAGCTACATATTTAACTTCAAATCCAGTATT
TCATGCTAGAACGAAACATATTGAAGTTGATTTTCATTTTGTTCGAGAACAAGTAGCACGTAAAGCAATGGAAGTTCGGTTCATTTCATCAAGTGATCAAGTAGCTGATATCCTGACAAAACCACTGTCTAAAACTCCTTTTACTACACATTGTAACAATCTCAACATGTACAAGACTTGTTGGGATTGA

Coding sequence (CDS)

ATGCTTAGGGTGTTTGGCTGTGCTTGCTGGCCTAATTTAAGGCCTTACAACAACAAGAAACTGAGTTTCAGAACTACTAGATGTATATTCTTGGGTTATAGTTCTTCTCATAAGGGATATAAATGCTTAAATAGAAGTACAGGACGTATTTACATCTCTAGAGACGTGGTTTTCGATGAAAATATTTTTCCTTTTGAAGAATCTAAGCCACCAAACAAAACCACAAATCCACATCATCCTGTTCTACTTCCAGCCTTAGCCAAACTTGCTAGTTTTTACACTGAAAATGCTCTTACAGATATTGAACCAGTTGTTAGTAATTCCCATATGAATGATGGTCAAACTGATAATATTGCTAGTGACAACTTGTCTGGTGTCAGCTTATCTTCTGCAGATAATACAAGAAGTTCAGAGGAAATTGCAGAATATGAAGCTGAGAGCAGTTCGATCAATGCTCAAAACCAAACTCATGAACATGTGTCTGATCAACCAACTGAAGCAGCTAGTCAACATCCAATGCGAACAAGGTTGAGAAATAACATTGTACAAGCTAAACAATTCACTGATGGAACTATCAGATATTCAGAAACCTCAAGAAAATTCGCAAGCGCTGTAACTATCACAACTCCGATCATAGAGACTGCTACTGAACCTCGAAACCTGCAGGAAGCCATGCAACATCCAAGATGGAGAGGAGCAATGAATGATGAGCTCTCAGCGCTAAAACGAAATGCCACTTGGGATCTAGTTCCACCCAAACCTGGAATAAATCTCATTGATAGTAAATGGGTGTATAAAGTGAAAAGAAAAGCAGATGGGTCAGTTGAAAGATTAAAAGCAAGATTAGTTGCCAAAGGATTCAAGCAAAGATTTGGTGTTGATTACACTGATACTTTTAGCCCTGTGATCAAACCGTCAACAATCAGGGTCATTCTTTCGCTAGCAGTAACCAAGGGCTGGAATATGAGACAAGTTGATATCCAAAATGCATTTTTGCATGGAATTCTGAAAGAGGAAGTGTACATGCGACAACCACCAGGATTTCAAGACTCAGCCAAACCAAAGAATTACATATGCAAGCTCAAGAAAGCCCTTTATGGCCTGAAACAAGCCCCAAAAGCTTGGCATTCAAGGTTGACTGGAAAACTTATTGAGTTAGGCTTCAAGGCTTCAGTAGCTGATTCATCTCTTTTTATTCTCAAAAACAGAGAGATAACTATCTATATGCTCATCTATGTTGATGATATAATTATTGTGAGCTCCTCTGATCAAGCAACCGAAAGGTTGATTCAGAAATTGAAAATAGATTTTGCAGTAAAAGATTTGGGTGGTCTTGAGTATTTTCTGGGTATTGAAGTCAAGAAAACACGAGATGGTATCATACTGTCACAGAGACGATATGCCTTAGATTTGTTGAAAAGAGTAAACATGGAAAAATGCAAACCTATGTCTACACCAATGGGTTCTGCTGAAAAATTATTCAGAGAACAAGGAATACCCTTATCAGCTGAAGAACAATTCAAATACAGAAGTACAGTGGGAGCACTACAATATTTGACAATGACTAGGCCTGATTTGGCATTTGCTGTCAATAAAGTGTGTCAATATCTTCATACACCTACTGATGCTCATTGGGGTGCTGTGAAGAGAATTCTTCGTTATGTTAAAGGCACACTAGCATTAGGAGTGAAAATTCAGAAATCAACCATGATGTTGTCGGGGTTTTCTGATGCTGATTGGGCTGGTTGTCCCGATGATCGACGTTCAACTAGCGGCTTTGCTGTATTTCTTGGAGCAAATCTAATCTCATGGAGTTCCAGAAAACAGGCTACAGTGTCAAGATCAAGCACCGAAGCAGAATACAAGGCCATTGCGAATCTTACTGCAGAAATGATTTGGATCAAGTCATTACTGAAGGAACTGGGCGTGTATCAATCAAAGGCTCCTCGCCTCTGGTGTGACAACCTCGGAGCTACATATTTAACTTCAAATCCAGTATT
TCATGCTAGAACGAAACATATTGAAGTTGATTTTCATTTTGTTCGAGAACAAGTAGCACGTAAAGCAATGGAAGTTCGGTTCATTTCATCAAGTGATCAAGTAGCTGATATCCTGACAAAACCACTGTCTAAAACTCCTTTTACTACACATTGTAACAATCTCAACATGTACAAGACTTGTTGGGATTGA

Protein sequence

MLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNMYKTCWD
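
The relationship between the coding sequence and the protein sequence can be spot-checked by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code applies; `translate` is a hypothetical helper, and only the first and last few codons of the CDS listed above are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGCTTAGGGTGTTTGGCTGTGCTTGCTGG"))  # MLRVFGCACW
# Last four codons of the CDS, ending in the TGA stop codon.
print(translate("TGTTGGGATTGA"))                    # CWD
```

Both results agree with the start (`MLRVFGCACW`) and end (`CWD`) of the protein sequence listed above.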
Homology
BLAST of CmoCh10G002580 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 585.9 bits (1509), Expect = 6.4e-166
Identity = 332/785 (42.29%), Positives = 470/785 (59.87%), Query Frame = 0

Query: 2    LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
            LRVFGCAC+P LRPYN  KL  ++ +C+FLGYS +   Y CL+  T R+YISR V FDEN
Sbjct: 688  LRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDEN 747

Query: 62   IFPF--------------EESK----------------PPNKTTNPHHPVLLP------- 121
             FPF               ES                 P    ++PHH    P       
Sbjct: 748  CFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPF 807

Query: 122  -----ALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEE 181
                 + + L S ++ +  +  EP     +     T    +   +  S +++ N  ++E 
Sbjct: 808  RNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNES 867

Query: 182  IAEYEAESSSINAQNQTHEHVSDQPTEAASQH--------------PMRTRLRNNIVQA- 241
             ++  A+S S  AQ+ +    S  PT +AS                P   ++ NN  QA 
Sbjct: 868  PSQL-AQSLSTPAQSSSS---SPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAP 927

Query: 242  ----KQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSA 301
                   T       + + K++ AV++        +EPR   +A++  RWR AM  E++A
Sbjct: 928  LNTHSMGTRAKAGIIKPNPKYSLAVSLA-----AESEPRTAIQALKDERWRNAMGSEINA 987

Query: 302  LKRNATWDLVPPKPG-INLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTF 361
               N TWDLVPP P  + ++  +W++  K  +DGS+ R KARLVAKG+ QR G+DY +TF
Sbjct: 988  QIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETF 1047

Query: 362  SPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYIC 421
            SPVIK ++IR++L +AV + W +RQ+D+ NAFL G L ++VYM QPPGF D  +P NY+C
Sbjct: 1048 SPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRP-NYVC 1107

Query: 422  KLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIV 481
            KL+KALYGLKQAP+AW+  L   L+ +GF  SV+D+SLF+L+  +  +YML+YVDDI+I 
Sbjct: 1108 KLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILIT 1167

Query: 482  SSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEK 541
             +        +  L   F+VKD   L YFLGIE K+   G+ LSQRRY LDLL R NM  
Sbjct: 1168 GNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQRRYILDLLARTNMIT 1227

Query: 542  CKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHT 601
             KP++TPM  + KL    G  L+  +  +YR  VG+LQYL  TRPD+++AVN++ Q++H 
Sbjct: 1228 AKPVTTPMAPSPKLSLYSGTKLT--DPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHM 1287

Query: 602  PTDAHWGAVKRILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPDDRRSTSGFAVFLG 661
            PT+ H  A+KRILRY+ GT   G+ ++K +T+ L  +SDADWAG  DD  ST+G+ V+LG
Sbjct: 1288 PTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLG 1347

Query: 662  ANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGA 721
             + ISWSS+KQ  V RSSTEAEY+++AN ++EM WI SLL ELG+  ++ P ++CDN+GA
Sbjct: 1348 HHPISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYCDNVGA 1407

Query: 722  TYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHC 724
            TYL +NPVFH+R KHI +D+HF+R QV   A+ V  +S+ DQ+AD LTKPLS+T F    
Sbjct: 1408 TYLCANPVFHSRMKHIAIDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRTAFQNFA 1460
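
The percentages in each HSP summary line are counts divided by the alignment length, gaps included: identities are aligned positions with the same residue, and positives also include substitutions that score above zero in the scoring matrix. The sketch below recomputes them for the Q94HW2 hit. The function name `hsp_percent` is hypothetical, and the parser assumes the `label = n/m` layout used in the summary lines of this page.

```python
import re

SUMMARY = "Identity = 332/785 (42.29%), Positives = 470/785 (59.87%), Query Frame = 0"

def hsp_percent(label_pattern, line):
    """Return (count, alignment_length, percent) for one statistic."""
    m = re.search(rf"{label_pattern}\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"no {label_pattern!r} field in: {line!r}")
    count, length = int(m.group(1)), int(m.group(2))
    return count, length, round(100 * count / length, 2)

print(hsp_percent("Identity", SUMMARY))     # (332, 785, 42.29)
# 'Posi?tives' also tolerates the misspelling 'Postives'.
print(hsp_percent("Posi?tives", SUMMARY))   # (470, 785, 59.87)
```

The same function can be applied to the summary line of any other hit on this page.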

BLAST of CmoCh10G002580 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 580.1 bits (1494), Expect = 3.5e-164
Identity = 334/792 (42.17%), Positives = 463/792 (58.46%), Query Frame = 0

Query: 2    LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
            L+VFGCAC+P LRPYN  KL  ++ +C F+GYS +   Y CL+  TGR+Y SR V FDE 
Sbjct: 667  LKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDER 726

Query: 62   IFPF--------------EESKP--PNKTTNPHHPVLLPALAKLASFYTENALTDIEPVV 121
             FPF               +S P  P+ TT P  P++LPA   L         T   P  
Sbjct: 727  CFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLD----TSPRPPS 786

Query: 122  SNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQP- 181
            S S +    T  ++S NL   S+SS     SSE  A            +QT    S+ P 
Sbjct: 787  SPSPL---CTTQVSSSNLPSSSISSPS---SSEPTAPSHNGPQPTAQPHQTQNSNSNSPI 846

Query: 182  -------TEAASQHPMRTRLRNNIVQAKQFTDGTIRYSETSRKFASA--------VTITT 241
                   + + +     + L  + + +      +   SE +   +S+        V    
Sbjct: 847  LNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAP 906

Query: 242  PIIE----------------------------------TATEPRNLQEAMQHPRWRGAMN 301
            PII+                                    +EPR   +AM+  RWR AM 
Sbjct: 907  PIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPRTAIQAMKDDRWRQAMG 966

Query: 302  DELSALKRNATWDLV-PPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVD 361
             E++A   N TWDLV PP P + ++  +W++  K  +DGS+ R KARLVAKG+ QR G+D
Sbjct: 967  SEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPGLD 1026

Query: 362  YTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKP 421
            Y +TFSPVIK ++IR++L +AV + W +RQ+D+ NAFL G L +EVYM QPPGF D  +P
Sbjct: 1027 YAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKDRP 1086

Query: 422  KNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVD 481
             +Y+C+L+KA+YGLKQAP+AW+  L   L+ +GF  S++D+SLF+L+     IYML+YVD
Sbjct: 1087 -DYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVD 1146

Query: 482  DIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKR 541
            DI+I  +     +  +  L   F+VK+   L YFLGIE K+   G+ LSQRRY LDLL R
Sbjct: 1147 DILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRVPQGLHLSQRRYTLDLLAR 1206

Query: 542  VNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVC 601
             NM   KP++TPM ++ KL    G  L   +  +YR  VG+LQYL  TRPDL++AVN++ 
Sbjct: 1207 TNMLTAKPVATPMATSPKLTLHSGTKL--PDPTEYRGIVGSLQYLAFTRPDLSYAVNRLS 1266

Query: 602  QYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPDDRRSTSGF 661
            QY+H PTD HW A+KR+LRY+ GT   G+ ++K +T+ L  +SDADWAG  DD  ST+G+
Sbjct: 1267 QYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGY 1326

Query: 662  AVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWC 721
             V+LG + ISWSS+KQ  V RSSTEAEY+++AN ++E+ WI SLL ELG+  S  P ++C
Sbjct: 1327 IVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPVIYC 1386

Query: 722  DNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTP 726
            DN+GATYL +NPVFH+R KHI +D+HF+R QV   A+ V  +S+ DQ+AD LTKPLS+  
Sbjct: 1387 DNVGATYLCANPVFHSRMKHIALDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRVA 1445

BLAST of CmoCh10G002580 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 382.1 bits (980), Expect = 1.4e-104
Identity = 245/726 (33.75%), Positives = 389/726 (53.58%), Query Frame = 0

Query: 2    LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
            L+VFGC  + ++      KL  ++  CIF+GY     GY+  +    ++  SRDVVF E+
Sbjct: 648  LKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRES 707

Query: 62   IFPFEESKPPNKTTNPHHPVLLPALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIASD 121
                                                  D+   V N  + +  T    S+
Sbjct: 708  --------------------------------EVRTAADMSEKVKNGIIPNFVTIPSTSN 767

Query: 122  NLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQH-PMRTRLRNN 181
            N +    ++ + +   E+  E   +   ++   +  EH    PT+   QH P+R   R  
Sbjct: 768  NPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEH----PTQGEEQHQPLRRSERPR 827

Query: 182  IVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHP---RWRGAMNDE 241
            +                SR++ S   +   +I    EP +L+E + HP   +   AM +E
Sbjct: 828  V---------------ESRRYPSTEYV---LISDDREPESLKEVLSHPEKNQLMKAMQEE 887

Query: 242  LSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTD 301
            + +L++N T+ LV    G   +  KWV+K+K+  D  + R KARLV KGF+Q+ G+D+ +
Sbjct: 888  MESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDE 947

Query: 302  TFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNY 361
             FSPV+K ++IR ILSLA +    + Q+D++ AFLHG L+EE+YM QP GF+ + K K+ 
Sbjct: 948  IFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGK-KHM 1007

Query: 362  ICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREIT-IYMLIYVDDI 421
            +CKL K+LYGLKQAP+ W+ +    +    +  + +D  ++  +  E   I +L+YVDD+
Sbjct: 1008 VCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDM 1067

Query: 422  IIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEV--KKTRDGIILSQRRYALDLLKR 481
            +IV        +L   L   F +KDLG  +  LG+++  ++T   + LSQ +Y   +L+R
Sbjct: 1068 LIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLER 1127

Query: 482  VNMEKCKPMSTPMGSAEKLFREQGIPLSAEE-----QFKYRSTVGALQY-LTMTRPDLAF 541
             NM+  KP+STP+    KL ++   P + EE     +  Y S VG+L Y +  TRPD+A 
Sbjct: 1128 FNMKNAKPVSTPLAGHLKLSKKM-CPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAH 1187

Query: 542  AVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMMLSGFSDADWAGCPDDRR 601
            AV  V ++L  P   HW AVK ILRY++GT    +    S  +L G++DAD AG  D+R+
Sbjct: 1188 AVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRK 1247

Query: 602  STSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKA 661
            S++G+        ISW S+ Q  V+ S+TEAEY A      EMIW+K  L+ELG++Q K 
Sbjct: 1248 SSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQ-KE 1307

Query: 662  PRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKP 715
              ++CD+  A  L+ N ++HARTKHI+V +H++RE V  ++++V  IS+++  AD+LTK 
Sbjct: 1308 YVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKV 1316

BLAST of CmoCh10G002580 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 340.9 bits (873), Expect = 3.6e-92
Identity = 189/494 (38.26%), Positives = 285/494 (57.69%), Query Frame = 0

Query: 230  WRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQ 289
            W  A+N EL+A K N TW +       N++DS+WV+ VK    G+  R KARLVA+GF Q
Sbjct: 906  WEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQ 965

Query: 290  RFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQ 349
            ++ +DY +TF+PV + S+ R ILSL +     + Q+D++ AFL+G LKEE+YMR P G  
Sbjct: 966  KYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGIS 1025

Query: 350  DSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREI--TI 409
             ++   + +CKL KA+YGLKQA + W       L E  F  S  D  ++IL    I   I
Sbjct: 1026 CNS---DNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENI 1085

Query: 410  YMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRY 469
            Y+L+YVDD++I +          + L   F + DL  +++F+GI ++   D I LSQ  Y
Sbjct: 1086 YVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAY 1145

Query: 470  ALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFK--YRSTVGALQYLTM-TRP 529
               +L + NME C  +STP+ S  K+  E    L+++E      RS +G L Y+ + TRP
Sbjct: 1146 VKKILSKFNMENCNAVSTPLPS--KINYEL---LNSDEDCNTPCRSLIGCLMYIMLCTRP 1205

Query: 530  DLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMM---LSGFSDADWA 589
            DL  AVN + +Y        W  +KR+LRY+KGT+ + +  +K+      + G+ D+DWA
Sbjct: 1206 DLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWA 1265

Query: 590  GCPDDRRSTSGFAV-FLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKE 649
            G   DR+ST+G+       NLI W++++Q +V+ SSTEAEY A+     E +W+K LL  
Sbjct: 1266 GSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTS 1325

Query: 650  LGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQ 709
            + +      +++ DN G   + +NP  H R KHI++ +HF REQV    + + +I + +Q
Sbjct: 1326 INIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQ 1385

Query: 710  VADILTKPLSKTPF 715
            +ADI TKPL    F
Sbjct: 1386 LADIFTKPLPAARF 1391

BLAST of CmoCh10G002580 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 2.7e-55
Identity = 112/228 (49.12%), Positives = 151/228 (66.23%), Query Frame = 0

Query: 407 IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRR 466
           +Y+L+YVDDI++  SS+     LI +L   F++KDLG + YFLGI++K    G+ LSQ +
Sbjct: 1   MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60

Query: 467 YALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDL 526
           YA  +L    M  CKPMSTP+                 +   +RS VGALQYLT+TRPD+
Sbjct: 61  YAEQILNNAGMLDCKPMSTPL---PLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDI 120

Query: 527 AFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPD 586
           ++AVN VCQ +H PT A +  +KR+LRYVKGT+  G+ I K S + +  F D+DWAGC  
Sbjct: 121 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 180

Query: 587 DRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIW 634
            RRST+GF  FLG N+ISWS+++Q TVSRSSTE EY+A+A   AE+ W
Sbjct: 181 TRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh10G002580 vs. ExPASy TrEMBL
Match: Q2QRW4 (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica OX=39947 GN=LOC_Os12g26180 PE=4 SV=1)

HSP 1 Score: 772.3 bits (1993), Expect = 1.8e-219
Identity = 408/795 (51.32%), Positives = 536/795 (67.42%), Query Frame = 0

Query: 2    LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
            LR+FGCACWPNLRPYNN KL FR+ RC+FLG+S+ HKG+KCL  STGRIYISRDVVFDEN
Sbjct: 673  LRIFGCACWPNLRPYNNHKLQFRSKRCVFLGFSTMHKGFKCLEVSTGRIYISRDVVFDEN 732

Query: 62   IFPFEESK--------------------PPNKTTNPH--HPVLLPALAKLASFYTENA-- 121
            IFPF E                      P     N    HPV  P  A +++  +  A  
Sbjct: 733  IFPFTELHANAGARLRSEIDILTPELLGPIRSVGNEQCMHPVNNPLSADVSAALSNRANE 792

Query: 122  -------------------------------LTDIEPVVSNSHMNDGQTD--------NI 181
                                           +    P  ++S  + G T         + 
Sbjct: 793  PHRDGAVHPADAEDPPATPPLDASSGPEPDRVVHHSPAATSSGRHPGPTPGSVPRGAASS 852

Query: 182  ASDNLSGVSLSSADNTRSSEEIAEYEAES------SSINAQNQTHEHVSDQPTEAASQHP 241
             ++  +  S+S A   + S+ + E E  S       ++  +  T +H     T + +   
Sbjct: 853  LAEETAEDSVSQAVQEQESQVVQEQEQSSPAQEHAQAVTDETNTLQHADVTDTGSEAPAG 912

Query: 242  MRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRG 301
             RTRL++ + + K +TDGTI+Y  +                 + EP N  EA++   W+ 
Sbjct: 913  PRTRLQSGVRKEKVYTDGTIKYKHS-------------WFTASGEPTNDLEALKDKNWKL 972

Query: 302  AMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFG 361
            AM+ E  AL +N TW LVPP+ G N+I  KWVYK+KRKADG+++R KARLVAKGFKQR+G
Sbjct: 973  AMDSEYDALVKNKTWHLVPPQRGRNIIGCKWVYKIKRKADGTLDRYKARLVAKGFKQRYG 1032

Query: 362  VDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSA 421
            +DY DTFSPV+K +TIR+ILSLAV+KGW++RQ+D+QNAFLHG L+EEVYM QPPGF+D  
Sbjct: 1033 IDYEDTFSPVVKAATIRIILSLAVSKGWSLRQLDVQNAFLHGYLEEEVYMLQPPGFEDPT 1092

Query: 422  KPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIY 481
            KP +++CKL KALYGLKQAP+AW SRL+ KL++LGFK S  D+SLF L   +IT+++L+Y
Sbjct: 1093 KP-HHVCKLDKALYGLKQAPRAWFSRLSKKLMDLGFKGSKPDTSLFFLNKGDITMFVLVY 1152

Query: 482  VDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLL 541
            VDDII+ SSS++AT  L+Q LK +FA+KDLG L YFLGIEV K ++GI+L+Q +YA DLL
Sbjct: 1153 VDDIIVASSSEKATAALLQDLKGEFALKDLGELHYFLGIEVSKVQNGIVLNQDKYANDLL 1212

Query: 542  KRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNK 601
            K+V M  CKP +TP+  +EKL   +G  L  E+   YRS VGALQYLT+TRPD+AF+VNK
Sbjct: 1213 KKVGMIDCKPANTPLSVSEKLSLHEGSLLGPEDASHYRSVVGALQYLTLTRPDIAFSVNK 1272

Query: 602  VCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTS 661
            VCQ+LH PT  HW AVKRILRY+K    LG++I KS + ++SGFSDADWAGC DDRRST 
Sbjct: 1273 VCQFLHAPTTVHWIAVKRILRYLKQCTRLGLEIHKSGSTLVSGFSDADWAGCLDDRRSTG 1332

Query: 662  GFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRL 721
            GFA+FLG+NL+SW++RKQATVSRSSTE+EYKAIAN TAE++W+++LL EL +   KA ++
Sbjct: 1333 GFAIFLGSNLVSWNARKQATVSRSSTESEYKAIANATAEIMWVQTLLAELEIKSPKAAKI 1392

Query: 722  WCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSK 727
            WCDNLGA YL++NPVFHARTKHIEVD+HFVRE+V++K +E+ F+S++DQVAD  TKPLS 
Sbjct: 1393 WCDNLGAKYLSANPVFHARTKHIEVDYHFVRERVSQKLLEIDFVSTNDQVADGFTKPLSV 1452

BLAST of CmoCh10G002580 vs. ExPASy TrEMBL
Match: C7J5P9 (Os08g0544300 protein OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0544300 PE=4 SV=1)

HSP 1 Score: 771.5 bits (1991), Expect = 3.0e-219
Identity = 411/786 (52.29%), Positives = 540/786 (68.70%), Query Frame = 0

Query: 2   LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
           LR+FGCA WPNLRPYN  KL+FR+ RC+FLGYS+ HKG+KCL  +TGR+Y+SRDV FDE+
Sbjct: 44  LRIFGCAVWPNLRPYNKHKLAFRSKRCVFLGYSNLHKGFKCLEIATGRVYVSRDVTFDES 103

Query: 62  IFPFEESKPP-----NKTTNPHHPVLLPALAKLA----------------SFYTENALTD 121
           IFPF E             +   P L+P L+ L                  F  ENA   
Sbjct: 104 IFPFSELHSNAGACLRAEISLLPPSLVPHLSSLGGEQNNHVLNYPPNVTDQFGEENAEIG 163

Query: 122 IEPVVSNSHMN---------DGQTDNIASDNLSGVSLSSA---------DNTRSSEE--- 181
            E +V+N   N             +  A D++ GV+  ++         D T S+ E   
Sbjct: 164 -EEIVANGEENAAAAADENAAAAANGGAQDDVHGVAYDASPEHSSPVTDDATASAAEQHG 223

Query: 182 --IAE----------YEAESSSINAQNQTHEHV-------SDQPTEAASQHPMR--TRLR 241
             I E            + S S+ +    H+ V       +DQ    A+  P+R  TRL+
Sbjct: 224 NPIQEEHLVQASPQTASSTSPSVASSAGVHDDVTTDQSDQTDQAMPEAAVAPIRPKTRLQ 283

Query: 242 NNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDEL 301
           + I + K +TDGT+++                   ++ EP++L+EA+ +  W+ AM+ E 
Sbjct: 284 SGIRKEKVYTDGTVKWLN---------------FTSSGEPQSLEEAVNNKHWKEAMDAEY 343

Query: 302 SALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDT 361
            AL  N TW LVPP+ G N+ID KWVYKVKRKADGS++R KARLVAKGFKQR+G+DY DT
Sbjct: 344 MALIENKTWHLVPPQKGRNVIDCKWVYKVKRKADGSLDRYKARLVAKGFKQRYGIDYEDT 403

Query: 362 FSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYI 421
           FSPV+K +TIR++LSLAV++GW++RQ+D++NAFLHG+L+EEVYM QPPG++  + P NY+
Sbjct: 404 FSPVVKAATIRIVLSLAVSRGWSLRQLDVKNAFLHGVLEEEVYMEQPPGYEKKSMP-NYV 463

Query: 422 CKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIII 481
           CKL KALYGLKQAP+AW+SRL+ KL ELGF  S AD+SLF  K  +++I++LIYVDDII+
Sbjct: 464 CKLDKALYGLKQAPRAWYSRLSTKLSELGFVPSKADTSLFFYKKGQVSIFLLIYVDDIIM 523

Query: 482 VSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNME 541
            SS   AT  L+Q+L  DFA+KDLG L YFLGIEV K +DG++LSQ +YA DLL+RV M 
Sbjct: 524 ASSVPDATSTLLQELSKDFALKDLGDLHYFLGIEVHKVKDGLMLSQEKYASDLLRRVGMY 583

Query: 542 KCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLH 601
           +CKP+STP+ ++EKL   +G  L  ++  +YRS VGALQYLT+TRPD++F++NKVCQ+LH
Sbjct: 584 ECKPVSTPLSTSEKLSVNEGTLLGPQDSTQYRSVVGALQYLTLTRPDISFSINKVCQFLH 643

Query: 602 TPTDAHWGAVKRILRYVKGTLALGVKI-QKSTMMLSGFSDADWAGCPDDRRSTSGFAVFL 661
            PT  HW AVKRILRYVK T+  G+K  +  ++++SGFSDADWAG PDDRRST GFAVFL
Sbjct: 644 APTTTHWAAVKRILRYVKYTVDTGLKFCRNPSLLVSGFSDADWAGSPDDRRSTGGFAVFL 703

Query: 662 GANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLG 721
           G NL+SWS+RKQATVSRSSTEAEYKA+AN TAE++W+++LL+ELGV   +A +LWCDNLG
Sbjct: 704 GPNLVSWSARKQATVSRSSTEAEYKALANATAEIMWVQTLLQELGVESPRAAKLWCDNLG 763

Query: 722 ATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTH 724
           A YL++NP+FHARTKHIEVDFHFVRE+VARK +E+ +IS+ DQVAD  TK +        
Sbjct: 764 AKYLSANPIFHARTKHIEVDFHFVRERVARKLLEIAYISTKDQVADGFTKAIPVRQMEMF 812

BLAST of CmoCh10G002580 vs. ExPASy TrEMBL
Match: Q75HT9 (Putative polyprotein OS=Oryza sativa subsp. japonica OX=39947 GN=B1003C08.12 PE=4 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 5.7e-218
Identity = 399/780 (51.15%), Positives = 528/780 (67.69%), Query Frame = 0

Query: 2    LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
            LR FGCACWPNLRPYN  KL FR+ +C+FLG+S+ HKG+KCL+ STGR+Y+SRDV FDE 
Sbjct: 666  LRTFGCACWPNLRPYNTHKLQFRSKQCVFLGFSNIHKGFKCLDVSTGRVYVSRDVTFDEQ 725

Query: 62   IFPFEESKP------------------PNKTTN----PHHP----VLLPALAKLASFYT- 121
            +FPF    P                    KT++     HH     + +P    ++   + 
Sbjct: 726  VFPFANLHPNAGARLRAEISLLPPTLINEKTSDQGGEEHHDHLFNISMPNATDISCAESP 785

Query: 122  ENALTDIEPVVSNSHMNDGQ------------------------TDNIASDNLSGVSLSS 181
             N  +DI       H  +G                         T   +    SG+  + 
Sbjct: 786  RNVNSDIPGAFGRVHGANGDLAGESASDSASVQAQLQRQASGSATQGESEQQRSGIQPAR 845

Query: 182  ADNTRSSEEIA---EYEAESSSINAQNQTHEHVSDQPTEAASQHPM---RTRLRNNIVQA 241
            A +  +S   A      A +SS     Q+H+  S  P+E+A    +   +TRL++ I + 
Sbjct: 846  ATSPAASPTAAPPSPARAVASSSGGAQQSHQPGSSAPSESAQLSEVIRPKTRLQSGIRKE 905

Query: 242  KQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRN 301
            K +TDGT+RYS                  ++ EP+ L EA+    W+ AM+ E  AL +N
Sbjct: 906  KIYTDGTVRYS---------------CFTSSGEPQTLHEALGDKNWKEAMDSEYQALMKN 965

Query: 302  ATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIK 361
             TW LVP K G N+ID KWVYKVKRKADGS++R KAR+VAKGFKQR+G+DY DTF+PV+K
Sbjct: 966  KTWHLVPSKKGQNIIDCKWVYKVKRKADGSLDRYKARVVAKGFKQRYGIDYEDTFNPVVK 1025

Query: 362  PSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKA 421
             +TIR ILS+A+++GW +RQ+D+QNAFLHG+L+E+V+MRQPPG++   +   Y+CKL KA
Sbjct: 1026 AATIRTILSIAISRGWTLRQLDVQNAFLHGVLEEDVFMRQPPGYE---QKDGYVCKLDKA 1085

Query: 422  LYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQ 481
            LYGLKQAP+AW+SRL+ KL ELGFK+S +D+SLF     ++ ++ML+YVDDII+ SSS  
Sbjct: 1086 LYGLKQAPRAWYSRLSTKLHELGFKSSKSDTSLFFYSKGDVAMFMLVYVDDIIVASSSID 1145

Query: 482  ATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMS 541
            AT  L++ L  +FA+KDLG L YFLGIEVK+  +GI+L+Q +YA+D+LKRVNM  CK ++
Sbjct: 1146 ATNALLKNLNQEFALKDLGRLHYFLGIEVKEVNNGIVLTQEKYAMDVLKRVNMSDCKAVN 1205

Query: 542  TPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAH 601
            TP+  +EKL   +G P   E+  +YRS VGALQYLT+TRPDL+F+VNKVCQYLH PT  H
Sbjct: 1206 TPLSISEKLSAHEGNPFGPEDSTRYRSLVGALQYLTLTRPDLSFSVNKVCQYLHAPTTKH 1265

Query: 602  WGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLIS 661
            W A KRILRY+K T+ LG+KI KS ++++S FSDADWAGC DDR ST GFAVF+G NL+S
Sbjct: 1266 WAAAKRILRYLKHTVKLGLKISKSNSLLVSAFSDADWAGCLDDRHSTGGFAVFIGPNLVS 1325

Query: 662  WSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTS 721
            WS+RKQATVSRSSTEAEYKA+AN+TAE++WI++LL ELG+   K  ++WCDN+GA Y+T+
Sbjct: 1326 WSARKQATVSRSSTEAEYKALANVTAEIMWIQTLLHELGIQAPKIAKVWCDNIGAKYMTA 1385

Query: 722  NPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM 724
            NPVFHARTKHIEVD+HFVRE+VARK ++V +IS+ DQVAD  TK LS        NNLN+
Sbjct: 1386 NPVFHARTKHIEVDYHFVRERVARKFLQVEYISTKDQVADGFTKTLSVRQLEMFRNNLNL 1427

BLAST of CmoCh10G002580 vs. ExPASy TrEMBL
Match: Q75G45 (Putative polyprotein OS=Oryza sativa subsp. japonica OX=39947 GN=OSJNBb0043H23.10 PE=4 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 5.7e-218
Identity = 399/780 (51.15%), Positives = 528/780 (67.69%), Query Frame = 0

Query: 2    LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
            LR FGCACWPNLRPYN  KL FR+ +C+FLG+S+ HKG+KCL+ STGR+Y+SRDV FDE 
Sbjct: 666  LRTFGCACWPNLRPYNTHKLQFRSKQCVFLGFSNIHKGFKCLDVSTGRVYVSRDVTFDEQ 725

Query: 62   IFPFEESKP------------------PNKTTN----PHHP----VLLPALAKLASFYT- 121
            +FPF    P                    KT++     HH     + +P    ++   + 
Sbjct: 726  VFPFANLHPNAGARLRAEISLLPPTLINEKTSDQGGEEHHDHLFNISMPNATDISCAESP 785

Query: 122  ENALTDIEPVVSNSHMNDGQ------------------------TDNIASDNLSGVSLSS 181
             N  +DI       H  +G                         T   +    SG+  + 
Sbjct: 786  RNVNSDIPGAFGRVHGANGDLAGESASDSASVQAQLQRQASGSATQGESEQQRSGIQPAR 845

Query: 182  ADNTRSSEEIA---EYEAESSSINAQNQTHEHVSDQPTEAASQHPM---RTRLRNNIVQA 241
            A +  +S   A      A +SS     Q+H+  S  P+E+A    +   +TRL++ I + 
Sbjct: 846  ATSPAASPTAAPPSPARAVASSSGGAQQSHQPGSSAPSESAQLSEVIRPKTRLQSGIRKE 905

Query: 242  KQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRN 301
            K +TDGT+RYS                  ++ EP+ L EA+    W+ AM+ E  AL +N
Sbjct: 906  KIYTDGTVRYS---------------CFTSSGEPQTLHEALGDKNWKEAMDSEYQALMKN 965

Query: 302  ATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIK 361
             TW LVP K G N+ID KWVYKVKRKADGS++R KAR+VAKGFKQR+G+DY DTF+PV+K
Sbjct: 966  KTWHLVPSKKGQNIIDCKWVYKVKRKADGSLDRYKARVVAKGFKQRYGIDYEDTFNPVVK 1025

Query: 362  PSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKA 421
             +TIR ILS+A+++GW +RQ+D+QNAFLHG+L+E+V+MRQPPG++   +   Y+CKL KA
Sbjct: 1026 AATIRTILSIAISRGWTLRQLDVQNAFLHGVLEEDVFMRQPPGYE---QKDGYVCKLDKA 1085

Query: 422  LYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQ 481
            LYGLKQAP+AW+SRL+ KL ELGFK+S +D+SLF     ++ ++ML+YVDDII+ SSS  
Sbjct: 1086 LYGLKQAPRAWYSRLSTKLHELGFKSSKSDTSLFFYSKGDVAMFMLVYVDDIIVASSSID 1145

Query: 482  ATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMS 541
            AT  L++ L  +FA+KDLG L YFLGIEVK+  +GI+L+Q +YA+D+LKRVNM  CK ++
Sbjct: 1146 ATNALLKNLNQEFALKDLGRLHYFLGIEVKEVNNGIVLTQEKYAMDVLKRVNMSDCKAVN 1205

Query: 542  TPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAH 601
            TP+  +EKL   +G P   E+  +YRS VGALQYLT+TRPDL+F+VNKVCQYLH PT  H
Sbjct: 1206 TPLSISEKLSAHEGNPFGPEDSTRYRSLVGALQYLTLTRPDLSFSVNKVCQYLHAPTTKH 1265

Query: 602  WGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLIS 661
            W A KRILRY+K T+ LG+KI KS ++++S FSDADWAGC DDR ST GFAVF+G NL+S
Sbjct: 1266 WAAAKRILRYLKHTVKLGLKISKSNSLLVSAFSDADWAGCLDDRHSTGGFAVFIGPNLVS 1325

Query: 662  WSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTS 721
            WS+RKQATVSRSSTEAEYKA+AN+TAE++WI++LL ELG+   K  ++WCDN+GA Y+T+
Sbjct: 1326 WSARKQATVSRSSTEAEYKALANVTAEIMWIQTLLHELGIQAPKIAKVWCDNIGAKYMTA 1385

Query: 722  NPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM 724
            NPVFHARTKHIEVD+HFVRE+VARK ++V +IS+ DQVAD  TK LS        NNLN+
Sbjct: 1386 NPVFHARTKHIEVDYHFVRERVARKFLQVEYISTKDQVADGFTKTLSVRQLEMFRNNLNL 1427

BLAST of CmoCh10G002580 vs. ExPASy TrEMBL
Match: O24438 (Retrofit OS=Oryza longistaminata OX=4528 GN=gag PE=4 SV=1)

HSP 1 Score: 766.9 bits (1979), Expect = 7.5e-218
Identity = 404/788 (51.27%), Positives = 535/788 (67.89%), Query Frame = 0

Query: 2    LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
            LRVFGCACWP+LRPYN  KL FR+ +C+FLG+S+ HKG+KCL+ S+GR+YISRDVVFDEN
Sbjct: 679  LRVFGCACWPHLRPYNTHKLQFRSKQCVFLGFSTHHKGFKCLDVSSGRVYISRDVVFDEN 738

Query: 62   IFPFEESKPPNKTTNPHHPVLLPALAKLASFYTENA-LTDIEPVVSNSHMNDGQTDNIAS 121
            +FPF               +LLP  + L ++ T +A  T +   V+N+ +    +DN+ S
Sbjct: 739  VFPFSTLHSNAGARLRSEILLLP--SPLTNYNTASAGGTHVVAPVANTPL---PSDNLIS 798

Query: 122  DNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEH-----VSDQPTEAASQHP--- 181
               +   ++S +N+ + E+  E E E  ++   N  H       V DQPT  +S  P   
Sbjct: 799  ---NAADVTSGENSAAHEQEMENEQEIENVMHGNDVHGDAASGPVLDQPTADSSTAPDQG 858

Query: 182  --------------------------------------------------------MRTR 241
                                                                      TR
Sbjct: 859  ADTSDAVSGAASDAGGDTATLGAGAANSAAAGGEESQPVQPDVTGTVLATVAPASRPHTR 918

Query: 242  LRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMND 301
            LR+ I + K +TDGT++Y                   +  EP+N +EA+    WR AM  
Sbjct: 919  LRSGIRKEKVYTDGTVKYG---------------CFSSTGEPQNDKEALGDKNWRDAMET 978

Query: 302  ELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYT 361
            E +AL +N TW LVP + G N+I  KWVYK+KRKADG+++R KARLVAKGFKQR+G+DY 
Sbjct: 979  EYNALIKNDTWHLVPYEKGQNIIGCKWVYKIKRKADGTLDRYKARLVAKGFKQRYGIDYE 1038

Query: 362  DTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKN 421
            DTFSPV+K +TIR+ILS+AV++GW++RQ+D+QNAFLHG L+EEVYM+QPPGF+ S+KP +
Sbjct: 1039 DTFSPVVKAATIRIILSIAVSRGWSLRQLDVQNAFLHGFLEEEVYMQQPPGFESSSKP-D 1098

Query: 422  YICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDI 481
            Y+CKL KALYGLKQAP+AW+SRL+ KL+ELGF+AS AD+SLF L    I +++L+YVDDI
Sbjct: 1099 YVCKLDKALYGLKQAPRAWYSRLSKKLVELGFEASKADTSLFFLNKGGILMFVLVYVDDI 1158

Query: 482  IIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVN 541
            I+ SS+++AT  L++ L  +FA+KDLG L YFLGIEV K  +G+IL+Q +YA DLLKRVN
Sbjct: 1159 IVASSTEKATTALLKDLNKEFALKDLGDLHYFLGIEVTKVSNGVILTQEKYANDLLKRVN 1218

Query: 542  MEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQY 601
            M  CKP+STP+  +EKL   +G PL   +  +YRS VGALQYLT+TRPD+A++VNKVCQ+
Sbjct: 1219 MSNCKPVSTPLSVSEKLTLYEGSPLGPNDAIQYRSIVGALQYLTLTRPDIAYSVNKVCQF 1278

Query: 602  LHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAV 661
            LH PT +HW AVKRILRY+    +LG+ I KS + ++ G+SDADWAG  DDR+ST GFAV
Sbjct: 1279 LHAPTTSHWIAVKRILRYLNQCTSLGLHIHKSASTLVHGYSDADWAGSIDDRKSTGGFAV 1338

Query: 662  FLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDN 721
            FLG+NL+SWS+RKQ TVSRSSTEAEYKA+AN TAE+IW+++LLKELG+   KA ++WCDN
Sbjct: 1339 FLGSNLVSWSARKQPTVSRSSTEAEYKAVANTTAELIWVQTLLKELGIESPKAAKIWCDN 1398

Query: 722  LGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFT 724
            LGA YL++NPVFHARTKHIEVD+HFVRE+V++K +E+ F+ S DQVAD  TK LS     
Sbjct: 1399 LGAKYLSANPVFHARTKHIEVDYHFVRERVSQKLLEIDFVPSGDQVADGFTKALSACLLE 1442

BLAST of CmoCh10G002580 vs. NCBI nr
Match: XP_035816648.1 (uncharacterized protein LOC115072894 isoform X1 [Zea mays])

HSP 1 Score: 821.2 bits (2120), Expect = 6.9e-234
Identity = 407/602 (67.61%), Positives = 499/602 (82.89%), Query Frame = 0

Query: 135 RSSEEIAEYEAESSSINAQNQTHE------HVSDQPTEAASQHPMRTRLRNNIVQAKQFT 194
           +S EE  E E+E+ S + Q +T E       V     E   QH MRTRL++NIV+ K+ T
Sbjct: 90  QSEEEGQESESEAHSGSLQQETPEPEQIQGEVERNQQEQEVQHHMRTRLKDNIVKTKKLT 149

Query: 195 DGTIRYSETSRKFA------SAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALK 254
           DGT+RY++  R F       +   +   +  + +EP +LQ+A++ P W+ AM++E SAL+
Sbjct: 150 DGTVRYAQQGRGFVVTEENPTTTALIASVQNSVSEPYDLQQALKDPGWKQAMDEEYSALQ 209

Query: 255 RNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPV 314
           RN TW+LVPP+ G+NLIDSKWV+KVKRKAD SVERLKARLVAKGFKQR+G+DY DTF PV
Sbjct: 210 RNQTWELVPPRAGVNLIDSKWVFKVKRKADSSVERLKARLVAKGFKQRYGIDYFDTFCPV 269

Query: 315 IKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLK 374
           +KP+TIR+ILSLAV++GW+MRQ+DIQNAFLHG+L+EEVYMRQPPG+ D  KP NYICKLK
Sbjct: 270 VKPTTIRIILSLAVSQGWSMRQIDIQNAFLHGLLEEEVYMRQPPGYIDPNKP-NYICKLK 329

Query: 375 KALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSS 434
           KALYGLKQAP+AWHSRLT KL ELGF+ASVAD+SLF+ K   ++IYMLIYVDDIIIVSS+
Sbjct: 330 KALYGLKQAPRAWHSRLTRKLQELGFQASVADASLFVFKQNGLSIYMLIYVDDIIIVSSN 389

Query: 435 DQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKP 494
           D AT++LI+ L  DFAVKDLG LEYFLGIEVKKTR+GI+LSQ+ YALDLLK+ NMEKCK 
Sbjct: 390 DSATDKLIKNLADDFAVKDLGNLEYFLGIEVKKTREGILLSQKGYALDLLKKANMEKCKA 449

Query: 495 MSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTD 554
           +STPM + +KL + QG  L+ +E F+YRSTVG LQYLT+TRPDL+FAVNKV QYL +PTD
Sbjct: 450 ISTPMSATDKLSKNQGTTLNEKEHFRYRSTVGGLQYLTLTRPDLSFAVNKVSQYLQSPTD 509

Query: 555 AHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFLGANL 614
            HW AVKRILR+VKGT+  G++IQK+ ++MLS FSDADWAGCPDDR+STSGFA+FLG NL
Sbjct: 510 VHWTAVKRILRFVKGTIDYGLQIQKTPSVMLSSFSDADWAGCPDDRKSTSGFAIFLGDNL 569

Query: 615 ISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYL 674
           ++WSSRKQATVSRSSTEAEYKAIAN TAE+IWI++LLKELG+Y  + PR+WCDN+GATYL
Sbjct: 570 VAWSSRKQATVSRSSTEAEYKAIANATAELIWIQALLKELGIYLHRPPRMWCDNVGATYL 629

Query: 675 TSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNL 724
           T+NP F+ RTKH+EVDFHFVREQVARKAMEVR ISS DQ+AD++TKPLSK PF  +C+NL
Sbjct: 630 TANPTFNGRTKHVEVDFHFVREQVARKAMEVRIISSKDQLADVMTKPLSKAPFVKNCSNL 689

BLAST of CmoCh10G002580 vs. NCBI nr
Match: KAG8087752.1 (hypothetical protein GUJ93_ZPchr0010g8288 [Zizania palustris])

HSP 1 Score: 795.4 bits (2053), Expect = 4.1e-226
Identity = 408/726 (56.20%), Positives = 528/726 (72.73%), Query Frame = 0

Query: 1    MLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDE 60
            +LRVFGCACWP LRPY ++K+ FR+TRC+FLGYS  HKGYKCL+  TGRIYISRDV FDE
Sbjct: 655  LLRVFGCACWPYLRPYRSRKIEFRSTRCVFLGYSGKHKGYKCLHIPTGRIYISRDVTFDE 714

Query: 61   NIFPFEESKPPNKTTNPHHPVLLPALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIAS 120
             IFPF ++   + T + +      +  +L+S     + + +EPV+   H+   Q  N   
Sbjct: 715  RIFPFADTSTSDSTPSTNPTDKPGSTIQLSS----PSHSTLEPVLDYHHV---QRANPVL 774

Query: 121  DNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMR--TRLR 180
             + + +   + D+T +++  +    E+ S    +     +S +   A  Q   R  TRL 
Sbjct: 775  FSTNDMPCGAFDDTGAADLDSPTTGETLSPAVVDDASGMISQEDPSAPPQLTSRPTTRLS 834

Query: 181  NNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDEL 240
            + I + K+FTDGT+RY    R F +++++         EP +  EA + P WR AM  E 
Sbjct: 835  HGISKPKEFTDGTVRYPLNKRAFLASLSL------CPAEPVSYAEAAKFPEWRNAMTAEY 894

Query: 241  SALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDT 300
             AL RN TW L+P +   N++  +W++KVK KADG+++R KARLVAKGF QR G+DY DT
Sbjct: 895  DALMRNKTWHLIPREKHHNVVGCRWIFKVKHKADGTLDRYKARLVAKGFTQREGIDYGDT 954

Query: 301  FSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYI 360
            FSPV+KP+T+R++LSLAV++GW++RQ+DIQNAFLHG L EEVYM+QPPGF++S  P+ YI
Sbjct: 955  FSPVVKPTTVRLVLSLAVSRGWHLRQIDIQNAFLHGELTEEVYMQQPPGFENSQHPE-YI 1014

Query: 361  CKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIII 420
            CKL KALYGLKQAP+AW+ +L+ KL  LGF AS +DSSLFIL    ++IYML+YVDDIII
Sbjct: 1015 CKLDKALYGLKQAPRAWNIKLSTKLFSLGFTASKSDSSLFILHQPTVSIYMLVYVDDIII 1074

Query: 421  VSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNME 480
             SS+   +++L+Q+L  +FAVKDLG L YFLGIE     DG++L+Q +Y  DLL+R NM+
Sbjct: 1075 ASSNADVSDKLLQQLSSEFAVKDLGQLHYFLGIEASYHDDGVVLTQGKYVQDLLRRTNMQ 1134

Query: 481  KCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLH 540
             CKP  TPM S EKL RE G  L  +E F YRSTVGALQYLT+TRPD++FAVNKVCQ+LH
Sbjct: 1135 LCKPSDTPMCSTEKLSRELGKALEDQEVFLYRSTVGALQYLTLTRPDISFAVNKVCQFLH 1194

Query: 541  TPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFL 600
             PTDAHW AVKRILRY+KGT  +G+KI++S +  LS FSDADWAGCPDDRRST GFA+F 
Sbjct: 1195 CPTDAHWEAVKRILRYLKGTCYVGLKIRRSLSQGLSAFSDADWAGCPDDRRSTGGFAIFC 1254

Query: 601  GANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLG 660
            G NL+SWSSRKQ+T+SRSSTEAEYKA+AN TAE+IW++SLL+EL V     P+LWCDNLG
Sbjct: 1255 GPNLVSWSSRKQSTISRSSTEAEYKALANATAELIWMESLLQELKVPLQCKPKLWCDNLG 1314

Query: 661  ATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTH 720
            ATYLT+NPVFHARTKHIE+D HFVRE+V R  +EV+FISS+DQVADI TKPL +  F   
Sbjct: 1315 ATYLTANPVFHARTKHIEIDVHFVRERVTRGQLEVQFISSADQVADIFTKPLPRPLFRRF 1366

Query: 721  CNNLNM 724
              +LN+
Sbjct: 1375 FGDLNL 1366

BLAST of CmoCh10G002580 vs. NCBI nr
Match: KAG8087751.1 (hypothetical protein GUJ93_ZPchr0010g8326 [Zizania palustris])

HSP 1 Score: 795.4 bits (2053), Expect = 4.1e-226
Identity = 408/726 (56.20%), Positives = 528/726 (72.73%), Query Frame = 0

Query: 1   MLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDE 60
           +LRVFGCACWP LRPY ++K+ FR+TRC+FLGYS  HKGYKCL+  TGRIYISRDV FDE
Sbjct: 273 LLRVFGCACWPYLRPYRSRKIEFRSTRCVFLGYSGKHKGYKCLHIPTGRIYISRDVTFDE 332

Query: 61  NIFPFEESKPPNKTTNPHHPVLLPALAKLASFYTENALTDIEPVVSNSHMNDGQTDNIAS 120
            IFPF ++   + T + +      +  +L+S     + + +EPV+   H+   Q  N   
Sbjct: 333 RIFPFADTSTSDSTPSTNPTDKPGSTIQLSS----PSHSTLEPVLDYHHV---QRANPVL 392

Query: 121 DNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMR--TRLR 180
            + + +   + D+T +++  +    E+ S    +     +S +   A  Q   R  TRL 
Sbjct: 393 FSTNDMPCGAFDDTGAADLDSPTTGETLSPAVVDDASGMISQEDPSAPPQLTSRPTTRLS 452

Query: 181 NNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDEL 240
           + I + K+FTDGT+RY    R F +++++         EP +  EA + P WR AM  E 
Sbjct: 453 HGISKPKEFTDGTVRYPLNKRAFLASLSL------CPAEPVSYAEAAKFPEWRNAMTAEY 512

Query: 241 SALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDT 300
            AL RN TW L+P +   N++  +W++KVK KADG+++R KARLVAKGF QR G+DY DT
Sbjct: 513 DALMRNKTWHLIPREKHHNVVGCRWIFKVKHKADGTLDRYKARLVAKGFTQREGIDYGDT 572

Query: 301 FSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYI 360
           FSPV+KP+T+R++LSLAV++GW++RQ+DIQNAFLHG L EEVYM+QPPGF++S  P+ YI
Sbjct: 573 FSPVVKPTTVRLVLSLAVSRGWHLRQIDIQNAFLHGELTEEVYMQQPPGFENSQHPE-YI 632

Query: 361 CKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIII 420
           CKL KALYGLKQAP+AW+ +L+ KL  LGF AS +DSSLFIL    ++IYML+YVDDIII
Sbjct: 633 CKLDKALYGLKQAPRAWNIKLSTKLFSLGFTASKSDSSLFILHQPTVSIYMLVYVDDIII 692

Query: 421 VSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNME 480
            SS+   +++L+Q+L  +FAVKDLG L YFLGIE     DG++L+Q +Y  DLL+R NM+
Sbjct: 693 ASSNADVSDKLLQQLSSEFAVKDLGQLHYFLGIEASYHDDGVVLTQGKYVQDLLRRTNMQ 752

Query: 481 KCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLH 540
            CKP  TPM S EKL RE G  L  +E F YRSTVGALQYLT+TRPD++FAVNKVCQ+LH
Sbjct: 753 LCKPSDTPMCSTEKLSRELGKALEDQEVFLYRSTVGALQYLTLTRPDISFAVNKVCQFLH 812

Query: 541 TPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFL 600
            PTDAHW AVKRILRY+KGT  +G+KI++S +  LS FSDADWAGCPDDRRST GFA+F 
Sbjct: 813 CPTDAHWEAVKRILRYLKGTCYVGLKIRRSLSQGLSAFSDADWAGCPDDRRSTGGFAIFC 872

Query: 601 GANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLG 660
           G NL+SWSSRKQ+T+SRSSTEAEYKA+AN TAE+IW++SLL+EL V     P+LWCDNLG
Sbjct: 873 GPNLVSWSSRKQSTISRSSTEAEYKALANATAELIWMESLLQELKVPLQCKPKLWCDNLG 932

Query: 661 ATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTH 720
           ATYLT+NPVFHARTKHIE+D HFVRE+V R  +EV+FISS+DQVADI TKPL +  F   
Sbjct: 933 ATYLTANPVFHARTKHIEIDVHFVRERVTRGQLEVQFISSADQVADIFTKPLPRPLFRRF 984

Query: 721 CNNLNM 724
             +LN+
Sbjct: 993 FGDLNL 984

BLAST of CmoCh10G002580 vs. NCBI nr
Match: ABA98049.1 (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 772.3 bits (1993), Expect = 3.7e-219
Identity = 408/795 (51.32%), Positives = 536/795 (67.42%), Query Frame = 0

Query: 2    LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
            LR+FGCACWPNLRPYNN KL FR+ RC+FLG+S+ HKG+KCL  STGRIYISRDVVFDEN
Sbjct: 673  LRIFGCACWPNLRPYNNHKLQFRSKRCVFLGFSTMHKGFKCLEVSTGRIYISRDVVFDEN 732

Query: 62   IFPFEESK--------------------PPNKTTNPH--HPVLLPALAKLASFYTENA-- 121
            IFPF E                      P     N    HPV  P  A +++  +  A  
Sbjct: 733  IFPFTELHANAGARLRSEIDILTPELLGPIRSVGNEQCMHPVNNPLSADVSAALSNRANE 792

Query: 122  -------------------------------LTDIEPVVSNSHMNDGQTD--------NI 181
                                           +    P  ++S  + G T         + 
Sbjct: 793  PHRDGAVHPADAEDPPATPPLDASSGPEPDRVVHHSPAATSSGRHPGPTPGSVPRGAASS 852

Query: 182  ASDNLSGVSLSSADNTRSSEEIAEYEAES------SSINAQNQTHEHVSDQPTEAASQHP 241
             ++  +  S+S A   + S+ + E E  S       ++  +  T +H     T + +   
Sbjct: 853  LAEETAEDSVSQAVQEQESQVVQEQEQSSPAQEHAQAVTDETNTLQHADVTDTGSEAPAG 912

Query: 242  MRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRG 301
             RTRL++ + + K +TDGTI+Y  +                 + EP N  EA++   W+ 
Sbjct: 913  PRTRLQSGVRKEKVYTDGTIKYKHS-------------WFTASGEPTNDLEALKDKNWKL 972

Query: 302  AMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFG 361
            AM+ E  AL +N TW LVPP+ G N+I  KWVYK+KRKADG+++R KARLVAKGFKQR+G
Sbjct: 973  AMDSEYDALVKNKTWHLVPPQRGRNIIGCKWVYKIKRKADGTLDRYKARLVAKGFKQRYG 1032

Query: 362  VDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSA 421
            +DY DTFSPV+K +TIR+ILSLAV+KGW++RQ+D+QNAFLHG L+EEVYM QPPGF+D  
Sbjct: 1033 IDYEDTFSPVVKAATIRIILSLAVSKGWSLRQLDVQNAFLHGYLEEEVYMLQPPGFEDPT 1092

Query: 422  KPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIY 481
            KP +++CKL KALYGLKQAP+AW SRL+ KL++LGFK S  D+SLF L   +IT+++L+Y
Sbjct: 1093 KP-HHVCKLDKALYGLKQAPRAWFSRLSKKLMDLGFKGSKPDTSLFFLNKGDITMFVLVY 1152

Query: 482  VDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLL 541
            VDDII+ SSS++AT  L+Q LK +FA+KDLG L YFLGIEV K ++GI+L+Q +YA DLL
Sbjct: 1153 VDDIIVASSSEKATAALLQDLKGEFALKDLGELHYFLGIEVSKVQNGIVLNQDKYANDLL 1212

Query: 542  KRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNK 601
            K+V M  CKP +TP+  +EKL   +G  L  E+   YRS VGALQYLT+TRPD+AF+VNK
Sbjct: 1213 KKVGMIDCKPANTPLSVSEKLSLHEGSLLGPEDASHYRSVVGALQYLTLTRPDIAFSVNK 1272

Query: 602  VCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTS 661
            VCQ+LH PT  HW AVKRILRY+K    LG++I KS + ++SGFSDADWAGC DDRRST 
Sbjct: 1273 VCQFLHAPTTVHWIAVKRILRYLKQCTRLGLEIHKSGSTLVSGFSDADWAGCLDDRRSTG 1332

Query: 662  GFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRL 721
            GFA+FLG+NL+SW++RKQATVSRSSTE+EYKAIAN TAE++W+++LL EL +   KA ++
Sbjct: 1333 GFAIFLGSNLVSWNARKQATVSRSSTESEYKAIANATAEIMWVQTLLAELEIKSPKAAKI 1392

Query: 722  WCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSK 727
            WCDNLGA YL++NPVFHARTKHIEVD+HFVRE+V++K +E+ F+S++DQVAD  TKPLS 
Sbjct: 1393 WCDNLGAKYLSANPVFHARTKHIEVDYHFVRERVSQKLLEIDFVSTNDQVADGFTKPLSV 1452

BLAST of CmoCh10G002580 vs. NCBI nr
Match: BAH94406.1 (Os08g0544300 [Oryza sativa Japonica Group])

HSP 1 Score: 771.5 bits (1991), Expect = 6.3e-219
Identity = 411/786 (52.29%), Positives = 540/786 (68.70%), Query Frame = 0

Query: 2   LRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDEN 61
           LR+FGCA WPNLRPYN  KL+FR+ RC+FLGYS+ HKG+KCL  +TGR+Y+SRDV FDE+
Sbjct: 44  LRIFGCAVWPNLRPYNKHKLAFRSKRCVFLGYSNLHKGFKCLEIATGRVYVSRDVTFDES 103

Query: 62  IFPFEESKPP-----NKTTNPHHPVLLPALAKLA----------------SFYTENALTD 121
           IFPF E             +   P L+P L+ L                  F  ENA   
Sbjct: 104 IFPFSELHSNAGACLRAEISLLPPSLVPHLSSLGGEQNNHVLNYPPNVTDQFGEENAEIG 163

Query: 122 IEPVVSNSHMN---------DGQTDNIASDNLSGVSLSSA---------DNTRSSEE--- 181
            E +V+N   N             +  A D++ GV+  ++         D T S+ E   
Sbjct: 164 -EEIVANGEENAAAAADENAAAAANGGAQDDVHGVAYDASPEHSSPVTDDATASAAEQHG 223

Query: 182 --IAE----------YEAESSSINAQNQTHEHV-------SDQPTEAASQHPMR--TRLR 241
             I E            + S S+ +    H+ V       +DQ    A+  P+R  TRL+
Sbjct: 224 NPIQEEHLVQASPQTASSTSPSVASSAGVHDDVTTDQSDQTDQAMPEAAVAPIRPKTRLQ 283

Query: 242 NNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDEL 301
           + I + K +TDGT+++                   ++ EP++L+EA+ +  W+ AM+ E 
Sbjct: 284 SGIRKEKVYTDGTVKWLN---------------FTSSGEPQSLEEAVNNKHWKEAMDAEY 343

Query: 302 SALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDT 361
            AL  N TW LVPP+ G N+ID KWVYKVKRKADGS++R KARLVAKGFKQR+G+DY DT
Sbjct: 344 MALIENKTWHLVPPQKGRNVIDCKWVYKVKRKADGSLDRYKARLVAKGFKQRYGIDYEDT 403

Query: 362 FSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYI 421
           FSPV+K +TIR++LSLAV++GW++RQ+D++NAFLHG+L+EEVYM QPPG++  + P NY+
Sbjct: 404 FSPVVKAATIRIVLSLAVSRGWSLRQLDVKNAFLHGVLEEEVYMEQPPGYEKKSMP-NYV 463

Query: 422 CKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIII 481
           CKL KALYGLKQAP+AW+SRL+ KL ELGF  S AD+SLF  K  +++I++LIYVDDII+
Sbjct: 464 CKLDKALYGLKQAPRAWYSRLSTKLSELGFVPSKADTSLFFYKKGQVSIFLLIYVDDIIM 523

Query: 482 VSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNME 541
            SS   AT  L+Q+L  DFA+KDLG L YFLGIEV K +DG++LSQ +YA DLL+RV M 
Sbjct: 524 ASSVPDATSTLLQELSKDFALKDLGDLHYFLGIEVHKVKDGLMLSQEKYASDLLRRVGMY 583

Query: 542 KCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLH 601
           +CKP+STP+ ++EKL   +G  L  ++  +YRS VGALQYLT+TRPD++F++NKVCQ+LH
Sbjct: 584 ECKPVSTPLSTSEKLSVNEGTLLGPQDSTQYRSVVGALQYLTLTRPDISFSINKVCQFLH 643

Query: 602 TPTDAHWGAVKRILRYVKGTLALGVKI-QKSTMMLSGFSDADWAGCPDDRRSTSGFAVFL 661
            PT  HW AVKRILRYVK T+  G+K  +  ++++SGFSDADWAG PDDRRST GFAVFL
Sbjct: 644 APTTTHWAAVKRILRYVKYTVDTGLKFCRNPSLLVSGFSDADWAGSPDDRRSTGGFAVFL 703

Query: 662 GANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLG 721
           G NL+SWS+RKQATVSRSSTEAEYKA+AN TAE++W+++LL+ELGV   +A +LWCDNLG
Sbjct: 704 GPNLVSWSARKQATVSRSSTEAEYKALANATAEIMWVQTLLQELGVESPRAAKLWCDNLG 763

Query: 722 ATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTH 724
           A YL++NP+FHARTKHIEVDFHFVRE+VARK +E+ +IS+ DQVAD  TK +        
Sbjct: 764 AKYLSANPIFHARTKHIEVDFHFVRERVARKLLEIAYISTKDQVADGFTKAIPVRQMEMF 812

BLAST of CmoCh10G002580 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 408.3 bits (1048), Expect = 1.3e-113
Identity = 212/482 (43.98%), Positives = 305/482 (63.28%), Query Frame = 0

Query: 212 IETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKA 271
           I  A EP    EA +   W GAM+DE+ A++   TW++    P    I  KWVYK+K  +
Sbjct: 80  IAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNS 139

Query: 272 DGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAF 331
           DG++ER KARLVAKG+ Q+ G+D+ +TFSPV K +++++IL+++    + + Q+DI NAF
Sbjct: 140 DGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAF 199

Query: 332 LHGILKEEVYMRQPPGF---QDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGF 391
           L+G L EE+YM+ PPG+   Q  + P N +C LKK++YGLKQA + W  + +  LI  GF
Sbjct: 200 LNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGF 259

Query: 392 KASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYF 451
             S +D + F+     + + +L+YVDDIII S++D A + L  +LK  F ++DLG L+YF
Sbjct: 260 VQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYF 319

Query: 452 LGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFK 511
           LG+E+ ++  GI + QR+YALDLL    +  CKP S PM  +       G      +   
Sbjct: 320 LGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDF--VDAKA 379

Query: 512 YRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGV-KIQK 571
           YR  +G L YL +TR D++FAVNK+ Q+   P  AH  AV +IL Y+KGT+  G+    +
Sbjct: 380 YRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQ 439

Query: 572 STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANL 631
           + M L  FSDA +  C D RRST+G+ +FLG +LISW S+KQ  VS+SS EAEY+A++  
Sbjct: 440 AEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFA 499

Query: 632 TAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVAR 690
           T EM+W+    +EL +  SK   L+CDN  A ++ +N VFH RTKHIE D H VRE+   
Sbjct: 500 TDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSVY 559

BLAST of CmoCh10G002580 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 218.4 bits (555), Expect = 1.9e-56
Identity = 112/228 (49.12%), Positives = 151/228 (66.23%), Query Frame = 0

Query: 407 IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRR 466
           +Y+L+YVDDI++  SS+     LI +L   F++KDLG + YFLGI++K    G+ LSQ +
Sbjct: 1   MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60

Query: 467 YALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDL 526
           YA  +L    M  CKPMSTP+                 +   +RS VGALQYLT+TRPD+
Sbjct: 61  YAEQILNNAGMLDCKPMSTPL---PLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDI 120

Query: 527 AFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPD 586
           ++AVN VCQ +H PT A +  +KR+LRYVKGT+  G+ I K S + +  F D+DWAGC  
Sbjct: 121 SYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTS 180

Query: 587 DRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIW 634
            RRST+GF  FLG N+ISWS+++Q TVSRSSTE EY+A+A   AE+ W
Sbjct: 181 TRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CmoCh10G002580 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 111.3 bits (277), Expect = 3.3e-24
Identity = 55/112 (49.11%), Positives = 79/112 (70.54%), Query Frame = 0

Query: 204 AVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKW 263
           ++TITT I     EP+++  A++ P W  AM +EL AL RN TW LVPP    N++  KW
Sbjct: 17  SLTITTTI---KKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKW 76

Query: 264 VYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLA 316
           V+K K  +DG+++RLKARLVAKGF Q  G+ + +T+SPV++ +TIR IL++A
Sbjct: 77  VFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CmoCh10G002580 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 80.5 bits (197), Expect = 6.2e-15
Identity = 42/88 (47.73%), Positives = 54/88 (61.36%), Query Frame = 0

Query: 518 YLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQ-KSTMMLSGFS 577
           YLT+TRPDL FAVN++ Q+      A   AV ++L YVKGT+  G+     S + L  F+
Sbjct: 2   YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFA 61

Query: 578 DADWAGCPDDRRSTSGFAV-----FLGA 600
           D+DWA CPD RRS +GF       FLGA
Sbjct: 62  DSDWASCPDTRRSVTGFCSLVPLWFLGA 89
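
Each HSP header above reports identity and positive counts as a fraction of the alignment length, followed by a percentage. The sketch below is a minimal illustration of how those percentages can be recomputed from the counts. The function name parse_hsp_stats and the dictionary keys are hypothetical and are not part of any BLAST tool; the second label is matched loosely because its spelling can vary between exports.

```python
import re

# Matches HSP summary lines of the form:
#   Identity = 399/780 (51.15%), <Pos...> = 528/780 (67.69%), Query Frame = 0
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positive": pos,
        "alignment_length": ident_len,
        # Percentage = count / alignment length * 100, rounded to two decimals.
        "identity_pct": round(100.0 * ident / ident_len, 2),
        "positive_pct": round(100.0 * pos / pos_len, 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 112/228 (49.12%), Positives = 151/228 (66.23%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the ATMG00810.1 hit, 112 identical positions over an alignment length of 228 gives the reported 49.12% identity. Note that the percentage is relative to the aligned region (gaps included), not to the full query length.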

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q94HW2	6.4e-166	42.29	Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94	3.5e-164	42.17	Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978	1.4e-104	33.75	Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146	3.6e-92	38.26	Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92519	2.7e-55	49.12	Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...

Match Name	E-value	Identity	Description
Q2QRW4	1.8e-219	51.32	Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap...
C7J5P9	3.0e-219	52.29	Os08g0544300 protein OS=Oryza sativa subsp. japonica OX=39947 GN=Os08g0544300 PE...
Q75HT9	5.7e-218	51.15	Putative polyprotein OS=Oryza sativa subsp. japonica OX=39947 GN=B1003C08.12 PE=...
Q75G45	5.7e-218	51.15	Putative polyprotein OS=Oryza sativa subsp. japonica OX=39947 GN=OSJNBb0043H23.1...
O24438	7.5e-218	51.27	Retrofit OS=Oryza longistaminata OX=4528 GN=gag PE=4 SV=1

Match Name	E-value	Identity	Description
XP_035816648.1	6.9e-234	67.61	uncharacterized protein LOC115072894 isoform X1 [Zea mays]
KAG8087752.1	4.1e-226	56.20	hypothetical protein GUJ93_ZPchr0010g8288 [Zizania palustris]
KAG8087751.1	4.1e-226	56.20	hypothetical protein GUJ93_ZPchr0010g8326 [Zizania palustris]
ABA98049.1	3.7e-219	51.32	retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro...
BAH94406.1	6.3e-219	52.29	Os08g0544300 [Oryza sativa Japonica Group]

Match Name	E-value	Identity	Description
AT4G23160.1	1.3e-113	43.98	cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1	1.9e-56	49.12	DNA/RNA polymerases superfamily protein
ATMG00820.1	3.3e-24	49.11	Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00240.1	6.2e-15	47.73	Gag-Pol-related retrotransposon family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR013103	Reverse transcriptase, RNA-dependent DNA polymerase	PFAM	PF07727	RVT_2	coord: 244..487; e-value: 1.6E-65; score: 221.1
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 143..174
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 108..136
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 108..174
None	No IPR available	PANTHER	PTHR45895	FAMILY NOT NAMED	coord: 2..576
None	No IPR available	CDD	cd09272	RNase_HI_RT_Ty1	coord: 574..711; e-value: 8.09086E-78; score: 243.915
IPR043502	DNA/RNA polymerase superfamily	SUPERFAMILY	56672	DNA/RNA polymerases	coord: 243..680
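
The Alignment column gives each predicted domain as an inclusive residue range on the protein (for example "coord: 244..487"). The following is a minimal sketch, under the assumption that the ranges are 1-based and inclusive, of how two annotations can be compared for overlap. The helper names parse_coord and overlap are hypothetical and are not part of InterProScan.

```python
import re

# Extracts the two integers from text such as "coord: 244..487".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: start..end' string."""
    m = COORD.search(text)
    if m is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(m.group(1)), int(m.group(2))


def overlap(a, b):
    """Number of residues shared by two inclusive (start, end) ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


rvt2 = parse_coord("coord: 244..487")         # PFAM PF07727, RVT_2
rnase_h = parse_coord("coord: 574..711")      # CDD cd09272, RNase_HI_RT_Ty1
superfamily = parse_coord("coord: 243..680")  # SUPERFAMILY 56672

print(overlap(rvt2, rnase_h))         # RVT_2 vs RNase H region
print(overlap(rvt2, superfamily))     # RVT_2 vs DNA/RNA polymerase superfamily
print(overlap(rnase_h, superfamily))  # RNase H region vs superfamily
```

With the coordinates listed above, the RVT_2 and RNase_HI_RT_Ty1 ranges do not overlap, while the RVT_2 range lies entirely inside the SUPERFAMILY 56672 range.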

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh10G002580.1	CmoCh10G002580.1	mRNA