CmoCh05G009260 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh05G009260
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Cmo_Chr05: 7237226 .. 7240186 (-)
RNA-Seq Expression: CmoCh05G009260
Synteny: CmoCh05G009260
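
The location string in the overview can be unpacked into chromosome, coordinates, and strand. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Location string as given in the overview of this record.
LOCATION = "Cmo_Chr05: 7237226 .. 7240186 (-)"

def parse_location(text):
    """Split 'chrom: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    genomic span is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

if __name__ == "__main__":
    print(parse_location(LOCATION))
```

Under that assumption the feature spans 2961 bases on the minus strand of Cmo_Chr05.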
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACCACGAAACAGTGGAAGCTAAGGGATCGACAAGCCTTAGGGATGATCCGGTTGACGCTATCCAGAAACGTGGCGTTTAACATCATCAAGGAGAAGACAACGTCAGATCTGATGAAGGCGCTGTCAAATATGTATGAAAAACCGTCGGCTATGAACAAGGTGTATTTGATGCATAGATTGTTCAATCTACAGATGTCTAAAGGTGGACGTGTTGCGGATCATATAAATGAATTCAATATGATCATAAGTCAACTGGGTTCGATGAAAATTAATTTCGAAGATGAAATTAAAGCGTTGATTTTGATGTCATCCTTACCCGAGTCATGGGATACTGTTGTTGCCGCAATCAGCAGTTCCCGAGGATCTGATAAACTGAAGTTTGATGAAATTCGAAATGTAGTTCTCAGCGAAAGTATTCGCAAACGGGAAACTGGAGATTCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGGAAGAAGCAAATCAAAGGGCTCAAACAAACATGGGCGTTCAAAATCAAAGAACCGGGAAAAATCTTCAAACAAACCCAACGTAACGTGTTGGAGCTGTGGGGGAAAAGAACACTTTCGGACAGATTGTACAAAACTAAAGAAGAAGCAGAATCATAAATCTGAAGATGATGATGATTCTATATACACAACAGAAGATGCTGAGGACGTTTTAATCCTTAGTGTGGATAGCCCAGTTGAATCCTGGATTTTGGATTCTTGTGCATCGTTCCATTCGTCTCCAAGTAAGGAATTGTTTCAGAATTTCAAATCAGGAAATTTCGGGAAGGTGTATCTTGCCGACAACAAAGCCTTGGAGATTGAAGGAAAGGGAGATGTCTCTATACAAACTCCAGCAGGAAATCTATGGACATTACAAGATGTCAGATACATTCCTGGTCTCAAGAAGAACCTGATCTCTATTAGACAGTTGGATAGCACAGGCTATGCAGCAGAGTTTGGAAAGAGTTCCTGGATGATTGTGAAGGGCGCCATGGTTGTTGCGCGTGGCACAAAATCCGGAACCTTATACACAACTGCAGAGTGTATAAAAATGACTGCTGCTGCTGAGAGTGCTTCCAATTCAAGTCTATGGCACAATAGACTTGGACATATGAGCGTCAAAGGAATGAAGATGCTAACTGCGAAAGGAGCTTTAGAAGGCTTAAAATTTGTTGATATGGGTCTTTGTGAGAGCTGTGTTATGGGCAAACAAAAACGAGTTAGTTTCACAAAGGCTGCCAGAGAACCGAAGATAGTGCGGTTGAAAATGGTCCATACAGACGTCTGGGGACCATCTCCAGTTTCATCACTTGGTGGATCAAGGTACTACGTCACCTTCATCGATGACTTCAGCAGAAAGGTATGGGTTTACTTTCGGAAACACAAGTCAGATGTGTTTACCACCTTCAAGAAGTGGAAAGCTGAAGTTGAAAATCAGACCGGCTTGAAGATCAAATGCCTGAGGTCTAACAATGGAGGAGAATACAACAAATTAGAGTTTATAAAATTTTGTGCAGCTGAGGAAATTAGATTAATAAGAATAATTCCCGGTAAGGCAAGACAATGGTATTGCAGAAAGGATGAACAGAACATTGAATGAGCGAGCAAGAAGCATGAGGATTCATTCTGGATTGCCAAAGACATTCTGGGCTGATGCTGTGAACACAGCAGCATATTTGATTAATAGAAGGCCGTCAGTACCCTTGAAGTTCAAATTTCCCGAAGAAGTATGGACAGGAAATGAACTCAAGTACTCTCACTTGAGAACCTTTGGTTGTACTGCGTATGTTCACATTGATCTAGAGAAGAGAGATAAGCTTGATGCTAAGGCTGTAAAATGCTACTTCATAGGCTATGGATCTGACATGTTTGGGTACAGGTTTTGGGATGAGAAAAATATGAAGATCCTAAGACACTGTGATGTGATTTTTGATGAAAATGTCATGTACAAGGACAGAGAGAAGATAAACTCTGAGACTACAAAGC
AAGTGGGAGTTGAACTTGAGTGGCAAGAAAATTCACGCAGTGATGGTACAACAGAAGCTCAAGAAACTTCTGATCCTATTGCTGAAGAACCAGACGTGGAGCAAGTTGCACCTGAGCAAGTGTTGAGAAGATCATCCAGAACTACCAGAGCACTAGATAAATATTCAACCTCATTACATTATTTGTTGCTGACAGACGAAGAGAACCAGAGTCCTTTGATGAGGCCCTACAAGTGGAAGATTCAATCAAGTGGGAGCAAGCCGTGGATGATGAGATGAGCTCACTTAAAAGGAATAATACGTGGGTGCTGACTGAGTTGCCTACAGGAAAGAGAGCTCTGTTGAACAAATGGGTGTTCAAAATCAAGGTTGAACCAGATGGCAGAAGAAAGTTTAAGGCCCGGTTAGTAGTTAAAGGATATTCACAAAGAAAAGGCATTGATTATGTTGAGATCTTTTCTCCAGTTGTGAAATTAACTACTATCCGAATTCTGCTGAGTATTGTTGCATCGGAGAATTTGCACCTCGAGCAAATGGATGTAAAAATGACTTTTTTACATGGAGATCTAGATGAGGAGATCTATATGCAACAACCCGAAGGGTTTGCAGCTCCAGGCAAGAAGTACATGGTGTGTAAGCTCAATAAGAGCTTGTATGGACTGAAACAAGCACCGAGACAATGGTACAAGAAGTTTGACTCCTTCATGAGCAAGAGTGGTTTCCATAGAAGTGAAAAGAATCAGTGTTGCTACCTCAAGAAATACACTGATTCTTATGTGTTTCTACTCCTGTATGTGGATGATATAATAATTGTTGGATCAAGTATGAGGGAGATAAACCACCTGAAGGCAAGCTTGTCTTCAGTATTTGAGATGAAAGATTTAGGTGCAGCGAAGCAGATCCTTGGGATGAGAATTTCTCGAGATAGATCTGCTGGCACATTAAATCTATCCCAAGAATAG

mRNA sequence

ATGACCACGAAACAGTGGAAGCTAAGGGATCGACAAGCCTTAGGGATGATCCGGTTGACGCTATCCAGAAACGTGGCGTTTAACATCATCAAGGAGAAGACAACGTCAGATCTGATGAAGGCGCTGTCAAATATGTATGAAAAACCGTCGGCTATGAACAAGGTGTATTTGATGCATAGATTGTTCAATCTACAGATGTCTAAAGGTGGACGTGTTGCGGATCATATAAATGAATTCAATATGATCATAAGTCAACTGGGTTCGATGAAAATTAATTTCGAAGATGAAATTAAAGCGTTGATTTTGATGTCATCCTTACCCGAGTCATGGGATACTGTTGTTGCCGCAATCAGCAGTTCCCGAGGATCTGATAAACTGAAGTTTGATGAAATTCGAAATGTAGTTCTCAGCGAAAGTATTCGCAAACGGGAAACTGGAGATTCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGGAAGAAGCAAATCAAAGGGCTCAAACAAACATGGGCGTTCAAAATCAAAGAACCGGGAAAAATCTTCAAACAAACCCAACGTAACGTGTTGGAGCTGTGGGGGAAAAGAACACTTTCGGACAGATTGTACAAAACTAAAGAAGAAGCAGAATCATAAATCTGAAGATGATGATGATTCTATATACACAACAGAAGATGCTGAGGACGTTTTAATCCTTAGTGTGGATAGCCCAGTTGAATCCTGGATTTTGGATTCTTGTGCATCGTTCCATTCGTCTCCAAGTAAGGAATTGTTTCAGAATTTCAAATCAGGAAATTTCGGGAAGGTGTATCTTGCCGACAACAAAGCCTTGGAGATTGAAGGAAAGGGAGATGTCTCTATACAAACTCCAGCAGGAAATCTATGGACATTACAAGATGTCAGATACATTCCTGGTCTCAAGAAGAACCTGATCTCTATTAGACAGTTGGATAGCACAGGCTATGCAGCAGAGTTTGGAAAGAGTTCCTGGATGATTGTGAAGGGCGCCATGGTTGTTGCGCGTGGCACAAAATCCGGAACCTTATACACAACTGCAGAGTGTATAAAAATGACTGCTGCTGCTGAGAGTGCTTCCAATTCAAGTCTATGGCACAATAGACTTGGACATATGAGCGTCAAAGGAATGAAGATGCTAACTGCGAAAGGAGCTTTAGAAGGCTTAAAATTTGTTGATATGGGTCTTTGTGAGAGCTGTGTTATGGGCAAACAAAAACGAGTTAGTTTCACAAAGGCTGCCAGAGAACCGAAGATAGTGCGGTTGAAAATGGTCCATACAGACGTCTGGGGACCATCTCCAGTTTCATCACTTGGTGGATCAAGGTACTACGTCACCTTCATCGATGACTTCAGCAGAAAGGTATGGGTTTACTTTCGGAAACACAAGTCAGATGTGTTTACCACCTTCAAGAAGTGGAAAGCTGAAGTTGAAAATCAGACCGGCTTGAAGATCAAATGCCTGAGGCAAGACAATGGTATTGCAGAAAGGATGAACAGAACATTGAATGAGCGAGCAAGAAGCATGAGGATTCATTCTGGATTGCCAAAGACATTCTGGGCTGATGCTGTGAACACAGCAGCATATTTGATTAATAGAAGGCCGTCAGTACCCTTGAAGTTCAAATTTCCCGAAGAAGTATGGACAGGAAATGAACTCAAGTACTCTCACTTGAGAACCTTTGGTTGTACTGCGTATGTTCACATTGATCTAGAGAAGAGAGATAAGCTTGATGCTAAGGCTGTAAAATGCTACTTCATAGGCTATGGATCTGACATGTTTGGGTACAGGTTTTGGGATGAGAAAAATATGAAGATCCTAAGACACTGTGATGTGATTTTTGATGAAAATGTCATGTACAAGGACAGAGAGAAGATAAACTCTGAGACTACAAAGCAAGTGGGAGTTGAACTTGAGTGGCAAGAAAATTCACGCAGTGATGGTACAACAGAAGCTCAAGAAACTTCTGATCCTATTGCTGAAGAACC
AGACGTGGAGCAAGTTGCACCTGAGCAAGTACGAAGAGAACCAGAGTCCTTTGATGAGGCCCTACAAGTGGAAGATTCAATCAAGTGGGAGCAAGCCGTGGATGATGAGATGAGCTCACTTAAAAGGAATAATACGTGGGTGCTGACTGAGTTGCCTACAGGAAAGAGAGCTCTGTTGAACAAATGGGTGTTCAAAATCAAGGTTGAACCAGATGGCAGAAGAAAGTTTAAGGCCCGGTTAGTAGTTAAAGGATATTCACAAAGAAAAGGCATTGATTATGTTGAGATCTTTTCTCCAGTTGTGAAATTAACTACTATCCGAATTCTGCTGAGTATTGTTGCATCGGAGAATTTGCACCTCGAGCAAATGGATGTAAAAATGACTTTTTTACATGGAGATCTAGATGAGGAGATCTATATGCAACAACCCGAAGGGTTTGCAGCTCCAGGCAAGAAGTACATGGTGTGTAAGCTCAATAAGAGCTTGTATGGACTGAAACAAGCACCGAGACAATGGTACAAGAAGTTTGACTCCTTCATGAGCAAGAGTGGTTTCCATAGAAGTGAAAAGAATCAGTGTTGCTACCTCAAGAAATACACTGATTCTTATGTGTTTCTACTCCTGTATGTGGATGATATAATAATTGTTGGATCAAGTATGAGGGAGATAAACCACCTGAAGGCAAGCTTGTCTTCAGTATTTGAGATGAAAGATTTAGGTGCAGCGAAGCAGATCCTTGGGATGAGAATTTCTCGAGATAGATCTGCTGGCACATTAAATCTATCCCAAGAATAG

Coding sequence (CDS)

ATGACCACGAAACAGTGGAAGCTAAGGGATCGACAAGCCTTAGGGATGATCCGGTTGACGCTATCCAGAAACGTGGCGTTTAACATCATCAAGGAGAAGACAACGTCAGATCTGATGAAGGCGCTGTCAAATATGTATGAAAAACCGTCGGCTATGAACAAGGTGTATTTGATGCATAGATTGTTCAATCTACAGATGTCTAAAGGTGGACGTGTTGCGGATCATATAAATGAATTCAATATGATCATAAGTCAACTGGGTTCGATGAAAATTAATTTCGAAGATGAAATTAAAGCGTTGATTTTGATGTCATCCTTACCCGAGTCATGGGATACTGTTGTTGCCGCAATCAGCAGTTCCCGAGGATCTGATAAACTGAAGTTTGATGAAATTCGAAATGTAGTTCTCAGCGAAAGTATTCGCAAACGGGAAACTGGAGATTCATCAGGCAATGCTCTCAGTGTTGATCGAAGGGGAAGAAGCAAATCAAAGGGCTCAAACAAACATGGGCGTTCAAAATCAAAGAACCGGGAAAAATCTTCAAACAAACCCAACGTAACGTGTTGGAGCTGTGGGGGAAAAGAACACTTTCGGACAGATTGTACAAAACTAAAGAAGAAGCAGAATCATAAATCTGAAGATGATGATGATTCTATATACACAACAGAAGATGCTGAGGACGTTTTAATCCTTAGTGTGGATAGCCCAGTTGAATCCTGGATTTTGGATTCTTGTGCATCGTTCCATTCGTCTCCAAGTAAGGAATTGTTTCAGAATTTCAAATCAGGAAATTTCGGGAAGGTGTATCTTGCCGACAACAAAGCCTTGGAGATTGAAGGAAAGGGAGATGTCTCTATACAAACTCCAGCAGGAAATCTATGGACATTACAAGATGTCAGATACATTCCTGGTCTCAAGAAGAACCTGATCTCTATTAGACAGTTGGATAGCACAGGCTATGCAGCAGAGTTTGGAAAGAGTTCCTGGATGATTGTGAAGGGCGCCATGGTTGTTGCGCGTGGCACAAAATCCGGAACCTTATACACAACTGCAGAGTGTATAAAAATGACTGCTGCTGCTGAGAGTGCTTCCAATTCAAGTCTATGGCACAATAGACTTGGACATATGAGCGTCAAAGGAATGAAGATGCTAACTGCGAAAGGAGCTTTAGAAGGCTTAAAATTTGTTGATATGGGTCTTTGTGAGAGCTGTGTTATGGGCAAACAAAAACGAGTTAGTTTCACAAAGGCTGCCAGAGAACCGAAGATAGTGCGGTTGAAAATGGTCCATACAGACGTCTGGGGACCATCTCCAGTTTCATCACTTGGTGGATCAAGGTACTACGTCACCTTCATCGATGACTTCAGCAGAAAGGTATGGGTTTACTTTCGGAAACACAAGTCAGATGTGTTTACCACCTTCAAGAAGTGGAAAGCTGAAGTTGAAAATCAGACCGGCTTGAAGATCAAATGCCTGAGGCAAGACAATGGTATTGCAGAAAGGATGAACAGAACATTGAATGAGCGAGCAAGAAGCATGAGGATTCATTCTGGATTGCCAAAGACATTCTGGGCTGATGCTGTGAACACAGCAGCATATTTGATTAATAGAAGGCCGTCAGTACCCTTGAAGTTCAAATTTCCCGAAGAAGTATGGACAGGAAATGAACTCAAGTACTCTCACTTGAGAACCTTTGGTTGTACTGCGTATGTTCACATTGATCTAGAGAAGAGAGATAAGCTTGATGCTAAGGCTGTAAAATGCTACTTCATAGGCTATGGATCTGACATGTTTGGGTACAGGTTTTGGGATGAGAAAAATATGAAGATCCTAAGACACTGTGATGTGATTTTTGATGAAAATGTCATGTACAAGGACAGAGAGAAGATAAACTCTGAGACTACAAAGCAAGTGGGAGTTGAACTTGAGTGGCAAGAAAATTCACGCAGTGATGGTACAACAGAAGCTCAAGAAACTTCTGATCCTATTGCTGAAGAACC
AGACGTGGAGCAAGTTGCACCTGAGCAAGTACGAAGAGAACCAGAGTCCTTTGATGAGGCCCTACAAGTGGAAGATTCAATCAAGTGGGAGCAAGCCGTGGATGATGAGATGAGCTCACTTAAAAGGAATAATACGTGGGTGCTGACTGAGTTGCCTACAGGAAAGAGAGCTCTGTTGAACAAATGGGTGTTCAAAATCAAGGTTGAACCAGATGGCAGAAGAAAGTTTAAGGCCCGGTTAGTAGTTAAAGGATATTCACAAAGAAAAGGCATTGATTATGTTGAGATCTTTTCTCCAGTTGTGAAATTAACTACTATCCGAATTCTGCTGAGTATTGTTGCATCGGAGAATTTGCACCTCGAGCAAATGGATGTAAAAATGACTTTTTTACATGGAGATCTAGATGAGGAGATCTATATGCAACAACCCGAAGGGTTTGCAGCTCCAGGCAAGAAGTACATGGTGTGTAAGCTCAATAAGAGCTTGTATGGACTGAAACAAGCACCGAGACAATGGTACAAGAAGTTTGACTCCTTCATGAGCAAGAGTGGTTTCCATAGAAGTGAAAAGAATCAGTGTTGCTACCTCAAGAAATACACTGATTCTTATGTGTTTCTACTCCTGTATGTGGATGATATAATAATTGTTGGATCAAGTATGAGGGAGATAAACCACCTGAAGGCAAGCTTGTCTTCAGTATTTGAGATGAAAGATTTAGGTGCAGCGAAGCAGATCCTTGGGATGAGAATTTCTCGAGATAGATCTGCTGGCACATTAAATCTATCCCAAGAATAG

Protein sequence

MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHRLFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSSRGSDKLKFDEIRNVVLSESIRKRETGDSSGNALSVDRRGRSKSKGSNKHGRSKSKNREKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVDSPVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWTLQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIKMTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSFTKAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTFKKWKAEVENQTGLKIKCLRQDNGIAERMNRTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLRTFGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVMYKDREKINSETTKQVGVELEWQENSRSDGTTEAQETSDPIAEEPDVEQVAPEQVRREPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTELPTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILLSIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAPRQWYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASLSSVFEMKDLGAAKQILGMRISRDRSAGTLNLSQE
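
The protein sequence above corresponds to the coding sequence read codon by codon. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first ten and last three codons of the CDS are used here as a check.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

if __name__ == "__main__":
    # First 30 bases of the CDS listed in this record.
    print(translate("ATGACCACGAAACAGTGGAAGCTAAGGGAT"))
    # Last 9 bases of the CDS, ending in the TAG stop codon.
    print(translate("CAAGAATAG"))
```

The first call reproduces the opening residues of the protein (MTTKQWKLRD) and the second its final two residues (QE).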
Homology
BLAST of CmoCh05G009260 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 718.4 bits (1853), Expect = 1.1e-205
Identity = 425/1021 (41.63%), Positives = 609/1021 (59.65%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  + W   D +A   IRL LS +V  NII E T   +   L ++Y   +  NK+YL  +
Sbjct: 47   MKAEDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQ 106

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            L+ L MS+G     H+N FN +I+QL ++ +  E+E KA++L++SLP S+D +   I   
Sbjct: 107  LYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHG 166

Query: 121  RGSDKLKFDEIRNVVLSESIRKRETGDSSGNALSVDRRGRSKSKGSNKHGRSKSKNREKS 180
            + + +LK D    ++L+E +RK+   ++ G AL  + RGRS  + SN +GRS ++ + K+
Sbjct: 167  KTTIELK-DVTSALLLNEKMRKKP--ENQGQALITEGRGRSYQRSSNNYGRSGARGKSKN 226

Query: 181  SNKPNV-TCWSCGGKEHFRTDCTKLKKKQNHKS--EDDDDSIYTTEDAEDVLI------- 240
             +K  V  C++C    HF+ DC   +K +   S  ++DD++    ++ ++V++       
Sbjct: 227  RSKSRVRNCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEE 286

Query: 241  -LSVDSPVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTP 300
             + +  P   W++D+ AS H++P ++LF  + +G+FG V + +    +I G GD+ I+T 
Sbjct: 287  CMHLSGPESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTN 346

Query: 301  AGNLWTLQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLY- 360
             G    L+DVR++P L+ NLIS   LD  GY + F    W + KG++V+A+G   GTLY 
Sbjct: 347  VGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYR 406

Query: 361  TTAE-CIKMTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMG 420
            T AE C     AA+   +  LWH R+GHMS KG+++L  K  +   K   +  C+ C+ G
Sbjct: 407  TNAEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFG 466

Query: 421  KQKRVSFTKAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHK 480
            KQ RVSF + + E K+  L +V++DV GP  + S+GG++Y+VTFIDD SRK+WVY  K K
Sbjct: 467  KQHRVSF-QTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTK 526

Query: 481  SDVFTTFKKWKAEVENQTGLKIKCLRQD-------------------------------N 540
              VF  F+K+ A VE +TG K+K LR D                               N
Sbjct: 527  DQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHN 586

Query: 541  GIAERMNRTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNE 600
            G+AERMNRT+ E+ RSM   + LPK+FW +AV TA YLINR PSVPL F+ PE VWT  E
Sbjct: 587  GVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKE 646

Query: 601  LKYSHLRTFGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDV 660
            + YSHL+ FGC A+ H+  E+R KLD K++ C FIGYG + FGYR WD    K++R  DV
Sbjct: 647  VSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDV 706

Query: 661  IFDENVMYKD---REKIN---------------------------SETTKQVGVELEWQE 720
            +F E+ +       EK+                            SE  +Q G  +E  E
Sbjct: 707  VFRESEVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGE 766

Query: 721  N-----SRSDGTTEAQETSDPI--AEEPDVEQ---VAPEQV----RREPESFDEALQVED 780
                     +  T+ +E   P+  +E P VE     + E V     REPES  E L   +
Sbjct: 767  QLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPE 826

Query: 781  SIKWEQAVDDEMSSLKRNNTWVLTELPTGKRALLNKWVFKIKVEPDGRR-KFKARLVVKG 840
              +  +A+ +EM SL++N T+ L ELP GKR L  KWVFK+K + D +  ++KARLVVKG
Sbjct: 827  KNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKG 886

Query: 841  YSQRKGIDYVEIFSPVVKLTTIRILLSIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPE 900
            + Q+KGID+ EIFSPVVK+T+IR +LS+ AS +L +EQ+DVK  FLHGDL+EEIYM+QPE
Sbjct: 887  FEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPE 946

Query: 901  GFAAPGKKYMVCKLNKSLYGLKQAPRQWYKKFDSFMSKSGFHRSEKNQCCYLKKYTD-SY 932
            GF   GKK+MVCKLNKSLYGLKQAPRQWY KFDSFM    + ++  + C Y K++++ ++
Sbjct: 947  GFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNF 1006
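
The percentages in an HSP summary line are the identical and positive-scoring column counts divided by the alignment length. The sketch below recomputes them for the P10978 hit; the function name `hsp_percentages` is hypothetical, and the pattern accepts either spelling of "Positives" so that it works on the raw page text as well.

```python
import re

# HSP summary line for the P10978 match in this record.
HSP_LINE = (
    "Identity = 425/1021 (41.63%), "
    "Positives = 609/1021 (59.65%), Query Frame = 0"
)

# Tolerates the misspelling "Postives" found in some report output.
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

if __name__ == "__main__":
    print(hsp_percentages(HSP_LINE))
```

For this hit the recomputed values agree with the 41.63% and 59.65% printed in the report.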

BLAST of CmoCh05G009260 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 364.4 bits (934), Expect = 3.9e-99
Identity = 298/1094 (27.24%), Positives = 486/1094 (44.42%), Query Frame = 0

Query: 6    WKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHRLFNLQ 65
            WK  +R A   I   LS +       + T   +++ L  +YE+ S  +++ L  RL +L+
Sbjct: 48   WKKAERCAKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLK 107

Query: 66   MSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSSRGSDK 125
            +S    +  H + F+ +IS+L +     E+  K   L+ +LP  +D ++ AI  +   + 
Sbjct: 108  LSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAI-ETLSEEN 167

Query: 126  LKFDEIRNVVLSESIR-KRETGDSSGNALSVDRRGRSKSKGSN--KHGRSKSKNREKSSN 185
            L    ++N +L + I+ K +  D+S   ++      + +  +N  K+  +K K   K ++
Sbjct: 168  LTLAFVKNRLLDQEIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRVTKPKKIFKGNS 227

Query: 186  KPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVDSPVESWIL 245
            K  V C  CG + H + DC   K+  N+K+++++  + T       +   V     + ++
Sbjct: 228  KYKVKCHHCGREGHIKKDCFHYKRILNNKNKENEKQVQTA--TSHGIAFMVKEVNNTSVM 287

Query: 246  DSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAG-------NLWT 305
            D+C     S + +   N +S     V +     + +  +G+    T  G       +  T
Sbjct: 288  DNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEIT 347

Query: 306  LQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIK 365
            L+DV +      NL+S+++L   G + EF KS   I K  ++V + +             
Sbjct: 348  LEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMVVKNSGMLNNVPVINFQA 407

Query: 366  MTAAAESASNSSLWHNRLGHMS------VKGMKMLTAKGALEGLKFVDMGLCESCVMGKQ 425
             +  A+  +N  LWH R GH+S      +K   M + +  L  L+ +   +CE C+ GKQ
Sbjct: 408  YSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLE-LSCEICEPCLNGKQ 467

Query: 426  KRVSFTKAAREPKIVR-LKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKS 485
             R+ F +   +  I R L +VH+DV GP    +L    Y+V F+D F+     Y  K+KS
Sbjct: 468  ARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKS 527

Query: 486  DVFTTFKKWKAEVENQTGLKIKCLRQD-------------------------------NG 545
            DVF+ F+ + A+ E    LK+  L  D                               NG
Sbjct: 528  DVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNG 587

Query: 546  IAERMNRTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPL--KFKFPEEVWTGN 605
            ++ERM RT+ E+AR+M   + L K+FW +AV TA YLINR PS  L    K P E+W   
Sbjct: 588  VSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNK 647

Query: 606  ELKYSHLRTFGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCD 665
            +    HLR FG T YVHI   K+ K D K+ K  F+GY  +  G++ WD  N K +   D
Sbjct: 648  KPYLKHLRVFGATVYVHIK-NKQGKFDDKSFKSIFVGYEPN--GFKLWDAVNEKFIVARD 707

Query: 666  VIFD------------ENVMYKD------------------------------------- 725
            V+ D            E V  KD                                     
Sbjct: 708  VVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDS 767

Query: 726  ------------------------------------------------------------ 785
                                                                        
Sbjct: 768  KESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESK 827

Query: 786  --------REKINSETTKQVGVELEWQENSRSDGTTEAQETSDPIAEEPDVEQVAPEQVR 845
                    RE   +E  K++G++      +++DG       S+ +  +P +     +   
Sbjct: 828  GSGNPNESRESETAEHLKEIGID----NPTKNDGIEIINRRSERLKTKPQISYNEEDNSL 887

Query: 846  RE------------PESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTELPTGKRALL 905
             +            P SFDE    +D   WE+A++ E+++ K NNTW +T+ P  K  + 
Sbjct: 888  NKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVD 947

Query: 906  NKWVFKIKVEPDGRR-KFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILLSIVASENL 918
            ++WVF +K    G   ++KARLV +G++Q+  IDY E F+PV ++++ R +LS+V   NL
Sbjct: 948  SRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNL 1007

BLAST of CmoCh05G009260 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 2.0e-71
Identity = 268/1109 (24.17%), Positives = 454/1109 (40.94%), Query Frame = 0

Query: 5    QWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHRLFNL 64
            +W+ +D+     I   +S +V   + +  T + + + L  +Y  PS  +   L       
Sbjct: 76   RWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRFITRFD 135

Query: 65   QMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSSRGSD 124
            Q++  G+  DH  +   ++  L        D+I A     SL E  + ++   S     +
Sbjct: 136  QLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLALN 195

Query: 125  KLKFDEI-RNVVLSESIRKRETGDSSGNALSVDRRGRSKSKGSNKHGRSKSKNREKSSNK 184
              +   I  NVV   +       ++ G+  + +      +        S+S NR+    K
Sbjct: 196  SAEVVPITANVVTHRNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQP---K 255

Query: 185  PNV-TCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVDSPVESWIL 244
            P +  C  C  + H    C +L + Q+  ++    S +T       L ++      +W+L
Sbjct: 256  PYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRANLAVNSPYNANNWLL 315

Query: 245  DSCASFHSSP---SKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWTLQDV 304
            DS A+ H +    +    Q +  G+   V +AD   + I   G  S+ T + +L  L  V
Sbjct: 316  DSGATHHITSDFNNLSFHQPYTGGD--DVMIADGSTIPITHTGSASLPTSSRSL-DLNKV 375

Query: 305  RYIPGLKKNLISIRQLDSTG-YAAEFGKSSWMI--VKGAMVVARGTKSGTLY----TTAE 364
             Y+P + KNLIS+ +L +T   + EF  +S+ +  +   + + +G     LY     +++
Sbjct: 376  LYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQ 435

Query: 365  CIKMTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGL-CESCVMGKQKR 424
             + M A+  S +  S WH+RLGH S+  +  + +  +L  L      L C  C + K  +
Sbjct: 436  AVSMFASPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHK 495

Query: 425  VSFTKAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVF 484
            V F+ +        L+ +++DVW  SP+ S+   RYYV F+D F+R  W+Y  K KS V 
Sbjct: 496  VPFSNSTITSS-KPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVK 555

Query: 485  TTFKKWKAEVENQTGLKIKCLRQD-----------------------------NGIAERM 544
             TF  +K+ VEN+   +I  L  D                             NG++ER 
Sbjct: 556  DTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERK 615

Query: 545  NRTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHL 604
            +R + E   ++  H+ +PKT+W  A + A YLINR P+  L+ + P +   G    Y  L
Sbjct: 616  HRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKL 675

Query: 605  RTFGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGY--------RFWDEKNMKILRHC 664
            + FGC  Y  +    R KL+ K+ +C F+GY      Y        R +  ++++    C
Sbjct: 676  KVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERC 735

Query: 665  DVIFDENVMYKDREKINSET---------------------------------------- 724
                  N      ++  S++                                        
Sbjct: 736  FPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPL 795

Query: 725  -TKQVG----------------------------VELEWQENSRSDG------------- 784
             T QV                              +    +NS S+              
Sbjct: 796  CTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSP 855

Query: 785  ----------------------TTEAQETSD------------PIAEEPDVEQV------ 844
                                  +T   E +             P+   P + QV      
Sbjct: 856  NSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPV 915

Query: 845  --------APEQVRREPESFD------------EALQVEDSIKWEQAVDDEMSSLKRNNT 904
                    A + +R+  + +              A+Q     +W QA+  E+++   N+T
Sbjct: 916  NTHSMATRAKDGIRKPNQKYSYATSLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHT 975

Query: 905  WVLTELPTGKRALLN-KWVFKIKVEPDGR-RKFKARLVVKGYSQRKGIDYVEIFSPVVKL 920
            W L   P     ++  +W+F  K   DG   ++KARLV KGY+QR G+DY E FSPV+K 
Sbjct: 976  WDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKS 1035

BLAST of CmoCh05G009260 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 3.4e-71
Identity = 281/1142 (24.61%), Positives = 472/1142 (41.33%), Query Frame = 0

Query: 5    QWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHRLFNL 64
            +WK +D+     +   +S +V   + +  T + + + L  +Y  PS  +   L  +L   
Sbjct: 76   RWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQL--K 135

Query: 65   QMSKGGR-VADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISS---- 124
            Q +KG + + D++        QL  +    + + +   ++ +LPE +  V+  I++    
Sbjct: 136  QWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTP 195

Query: 125  ---SRGSDKLKFDEIRNVVLSE-----------SIRKRETGDSSGNALSVDRRGRSKSKG 184
               +   ++L   E + + +S            S R   T +++ N    +R     +  
Sbjct: 196  PTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNN 255

Query: 185  SNKHGRSKSKNREKSSN--KPNV-TCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTT 244
            ++K  +  S N   ++N  KP +  C  CG + H    C++L+   +  +     S +T 
Sbjct: 256  NSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTP 315

Query: 245  EDAEDVLILSVDSPVESWILDSCASFHSSP---SKELFQNFKSGNFGKVYLADNKALEIE 304
                  L L       +W+LDS A+ H +    +  L Q +  G+   V +AD   + I 
Sbjct: 316  WQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGD--DVMVADGSTIPIS 375

Query: 305  GKGDVSIQTPAGNLWTLQDVRYIPGLKKNLISIRQL-DSTGYAAEFGKSSWMI--VKGAM 364
              G  S+ T +  L  L ++ Y+P + KNLIS+ +L ++ G + EF  +S+ +  +   +
Sbjct: 376  HTGSTSLSTKSRPL-NLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGV 435

Query: 365  VVARGTKSGTLY----TTAECIKMTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEG 424
             + +G     LY     +++ + + A+  S +  S WH RLGH +   +  + +  +L  
Sbjct: 436  PLLQGKTKDELYEWPIASSQPVSLFASPSSKATHSSWHARLGHPAPSILNSVISNYSLSV 495

Query: 425  L----KFVDMGLCESCVMGKQKRVSFTKAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYY 484
            L    KF+    C  C++ K  +V F+++        L+ +++DVW  SP+ S    RYY
Sbjct: 496  LNPSHKFLS---CSDCLINKSNKVPFSQSTIN-STRPLEYIYSDVWS-SPILSHDNYRYY 555

Query: 485  VTFIDDFSRKVWVYFRKHKSDVFTTFKKWKAEVENQTGLKIKCLRQD------------- 544
            V F+D F+R  W+Y  K KS V  TF  +K  +EN+   +I     D             
Sbjct: 556  VIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFS 615

Query: 545  ----------------NGIAERMNRTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRP 604
                            NG++ER +R + E   ++  H+ +PKT+W  A   A YLINR P
Sbjct: 616  QHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLP 675

Query: 605  SVPLKFKFPEEVWTGNELKYSHLRTFGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFG 664
            +  L+ + P +   G    Y  LR FGC  Y  +    + KLD K+ +C F+GY      
Sbjct: 676  TPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSA 735

Query: 665  YRFWDEKNMKILRHCDVIFDEN-------------VMYKDREKI---------------- 724
            Y     +  ++     V FDEN             V  + RE                  
Sbjct: 736  YLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPVL 795

Query: 725  ------------------------------------------------------------ 784
                                                                        
Sbjct: 796  PAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQP 855

Query: 785  ------------------NSETTKQVGVELEWQENSRSDG---TTEAQETS--------- 844
                               +E+  Q+   L     S S     TT A  +S         
Sbjct: 856  TQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSIL 915

Query: 845  ----DPIAEEPDVEQVAPEQ--------------------------VRREPESFDEALQV 904
                 P+A+  +    AP                               EP +  +AL+ 
Sbjct: 916  IHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQALKD 975

Query: 905  EDSIKWEQAVDDEMSSLKRNNTWVLTELPTGKRALLN-KWVFKIKVEPDGR-RKFKARLV 931
            E   +W  A+  E+++   N+TW L   P     ++  +W+F  K   DG   ++KARLV
Sbjct: 976  E---RWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLV 1035

BLAST of CmoCh05G009260 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 92.4 bits (228), Expect = 2.8e-17
Identity = 49/140 (35.00%), Positives = 78/140 (55.71%), Query Frame = 0

Query: 790 MDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAPRQWYKKFDSFMSK 849
           MDV   FL+  +DE IY++QP GF        V +L   +YGLKQAP  W +  ++ + K
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 850 SGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASLSSVFEMKDLGAA 909
            GF R E     Y +  +D  +++ +YVDD+++   S +  + +K  L+ ++ MKDLG  
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 910 KQILGMRISRDRSAGTLNLS 930
            + LG+ I +  S G + LS
Sbjct: 121 DKFLGLNIHQS-SNGDITLS 139

BLAST of CmoCh05G009260 vs. ExPASy TrEMBL
Match: A0A151TNK0 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_022239 PE=4 SV=1)

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 565/989 (57.13%), Positives = 719/989 (72.70%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  ++W L DRQALG+IRLTL++NVAFNI+ EKTT+ LMK LS+MYEKPSA NKV++M R
Sbjct: 24   MKQEEWNLLDRQALGVIRLTLAKNVAFNIVNEKTTASLMKTLSDMYEKPSAANKVHIMRR 83

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+M +G  V +HIN+FN I++ L S++I FEDE+KALIL+SSLPESW   V A+SSS
Sbjct: 84   LFNLKMGEGNLVINHINDFNTILAMLESVQIKFEDEVKALILLSSLPESWAATVTAVSSS 143

Query: 121  RGSDKLKFDEIRNVVLSESIRKRE----TGDSSGNALSVDRRGRSKSKGSNKHGRSKSKN 180
               +KLK ++IR+++LSE +R+R+    +  +S +AL+ + RGR+  KG N  GRSKS+ 
Sbjct: 144  ARDNKLKLNDIRDLILSEDVRRRDSEEPSSSTSSSALNTESRGRTTQKGYNSRGRSKSRA 203

Query: 181  REKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSI-YTTEDAEDVLILSVDS 240
            + +   + ++ CW+C  + HF   C   KK +NHK  DDD+S    T++ +D LI S+DS
Sbjct: 204  KGQPKFRNDIVCWNCDKRGHFTNQCKAPKKNKNHKKRDDDESANAATDEIDDALICSLDS 263

Query: 241  PVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWT 300
            P+ESWI+DS ASFH++PS EL  N+ SG FGKVYLAD K L I GKGD++I+T +G+ WT
Sbjct: 264  PIESWIMDSGASFHTTPSNELLTNYVSGRFGKVYLADGKPLNIVGKGDIAIRTSSGSHWT 323

Query: 301  LQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIK 360
            L++VR+IP LK+NLIS+ QLD  G+   FG  +W + KG ++VARG K G+LY  A+   
Sbjct: 324  LKNVRHIPALKRNLISVGQLDDEGHETTFGDGAWKVKKGNLIVARGKKRGSLYMVAD-EN 383

Query: 361  MTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSFT 420
            M A  E+A+NS LWH RLGHMS KGMK++  KG L  LK VD+G CE C++GKQ+++SF+
Sbjct: 384  MIAVTEAANNSFLWHQRLGHMSEKGMKLMATKGKLSKLKHVDVGTCEHCILGKQRKISFS 443

Query: 421  KAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTFK 480
            +  +  K  RL++VHTDVWGP+PV SLGGS YYVTFIDD +RKVWVYF K+KSDVF+ FK
Sbjct: 444  RQGKTLKTERLELVHTDVWGPAPVKSLGGSCYYVTFIDDATRKVWVYFLKNKSDVFSVFK 503

Query: 481  KWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMNR 540
            +WK EVENQTGLK+K L+ D                               NG+AERMNR
Sbjct: 504  RWKTEVENQTGLKLKSLKSDNGGEYNSHEFKNFCSEHGIRMIKTIPGTPEQNGVAERMNR 563

Query: 541  TLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLRT 600
            TLNERAR MRI SGLPK FWADA+NTAAYLINR PSVPL ++ PEEVW+G E+  SHLR 
Sbjct: 564  TLNERARCMRIQSGLPKVFWADAINTAAYLINRGPSVPLGYQLPEEVWSGKEVNLSHLRV 623

Query: 601  FGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVMY 660
            FGC +YV ID + RDKLD KA KCYFIGYGSDM+GYRFWD++N KI+R  +V F+EN+ Y
Sbjct: 624  FGCVSYVLIDSDSRDKLDPKAKKCYFIGYGSDMYGYRFWDDQNKKIIRSRNVTFNENLFY 683

Query: 661  KDR---EKINSETTKQVGVELEWQENSRSDGTTEAQETSDPIAEEPDVE--------QVA 720
            KDR   E I+++   +   ++E +E S SD T  +Q T   +  EP            VA
Sbjct: 684  KDRFSAESIDTDKLPEPSEKVELEEISESDITNRSQSTDIEVESEPASPPLRRSCRVSVA 743

Query: 721  PEQV-----------RREPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTELPTGK 780
            PE+              EPE FDEA+Q  DS+KWE A+ DEM+SL++N TW LTELP GK
Sbjct: 744  PERYSPSLHYLLLTDAGEPEYFDEAIQGNDSVKWELAMKDEMTSLQKNGTWSLTELPEGK 803

Query: 781  RALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILLSIVAS 840
             AL NKWV+++K E DGR+++KARLVVKG+ Q+ GID+ EIFSPVVK+TTIR++LSIVA+
Sbjct: 804  MALQNKWVYRLKEESDGRKRYKARLVVKGFQQKPGIDFTEIFSPVVKMTTIRVILSIVAA 863

Query: 841  ENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAPRQWYKK 900
            ENLHLEQ+DVK  FLHGDLDE+IYM QPEGF   GK+ +VCKL+KSLYGLKQAPRQWYKK
Sbjct: 864  ENLHLEQLDVKTAFLHGDLDEDIYMTQPEGFKVSGKENLVCKLHKSLYGLKQAPRQWYKK 923

Query: 901  FDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASLSSVFE 932
            F+ FM  SGF R + + CCY+KKY +SY+ L LYVDD++I G +M EIN LK  L+  FE
Sbjct: 924  FNEFMQNSGFSRCDMDHCCYVKKYVNSYIILALYVDDMLIAGPNMTEINRLKQQLTDHFE 983

BLAST of CmoCh05G009260 vs. ExPASy TrEMBL
Match: A0A5B7BAK4 (Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_035380 PE=4 SV=1)

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 574/994 (57.75%), Positives = 721/994 (72.54%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  + W L DRQALG++RLTL+RNVAFNI KEKTT+ LM ALSNMYEKPSA NKVYLM R
Sbjct: 46   MKDEDWALLDRQALGVVRLTLARNVAFNIAKEKTTASLMAALSNMYEKPSASNKVYLMRR 105

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+MS+G  VA+H+NEFN++ +QL S++I F+DEI+ALIL+SSLPESW+  V A+SSS
Sbjct: 106  LFNLRMSEGASVANHLNEFNIVTTQLSSVEIEFDDEIRALILLSSLPESWNGTVTAVSSS 165

Query: 121  RGSDKLKFDEIRNVVLSESIRKRETGDSSGNALSVDRRGRSKSKGSNKHGRSKSKNREKS 180
             G+ KLK+D++R+++LSE IR+RE+G+SSG+AL+V+ RGR+  + S+ H RS+S+  +  
Sbjct: 166  SGTTKLKYDDVRDLILSEEIRRRESGESSGSALNVENRGRTFERNSD-HSRSQSRRSQSR 225

Query: 181  SNKPNVTCWSCGGKEHFRTDCTKLKKKQNHK-----SEDDDDSIYTTEDAEDVLILSVDS 240
              K    CW+CG   H + DC   KK++  K     +++ + +   TE+ +D LILS+D 
Sbjct: 226  GGKSKAECWNCGKLGHLKKDCYSPKKEKQGKGKGKGNKERETANAVTEEIQDALILSLDC 285

Query: 241  PVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWT 300
              ESW++DS ASFH++  +E+ +N+  G+FGKVYL D++   I GKGDV I+ P+G +W 
Sbjct: 286  HTESWVVDSGASFHATAHEEIMENYVRGDFGKVYLGDDEPCNIIGKGDVQIKLPSGVVWK 345

Query: 301  LQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIK 360
            L DVR++P LK+NLIS+ QL S+G    F   SW + KGAMV+ARG K GTLY T     
Sbjct: 346  LIDVRHVPSLKRNLISVGQLASSGCVTTFLGDSWKVTKGAMVMARGKKEGTLYVTTNSRD 405

Query: 361  MTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSFT 420
                AE+ S+S+LWH RLGHMS KGMK+L +KG L+GLK VD+ LCE C+ GKQK+VSF+
Sbjct: 406  AITIAETGSDSNLWHYRLGHMSQKGMKVLHSKGKLQGLKSVDIDLCEDCIFGKQKKVSFS 465

Query: 421  KAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTFK 480
            KA R PK  +L++VHTDVWGPSPVSSLGGS YYVTFIDD +RKVWVYF K KSDVF+TFK
Sbjct: 466  KAGRTPKAEKLELVHTDVWGPSPVSSLGGSSYYVTFIDDSTRKVWVYFLKSKSDVFSTFK 525

Query: 481  KWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMNR 540
            KWKA VEN+TGLKIKCLR D                               NG+AERMNR
Sbjct: 526  KWKAMVENETGLKIKCLRSDNGGEYTDKEFKEFCATNGIRMEKTIPKTPQQNGVAERMNR 585

Query: 541  TLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLRT 600
            TLNER RSMRIH+GLPK FWADAVNTAAYLINR PSVPL    PEE W+G E+  SHL+ 
Sbjct: 586  TLNERGRSMRIHAGLPKMFWADAVNTAAYLINRGPSVPLDDGLPEEAWSGKEVNLSHLKV 645

Query: 601  FGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVMY 660
            FGC +YVHID + R KLD K+ KC FIGYG+D FGYRFWD++N KI R  DVIF+E V+Y
Sbjct: 646  FGCVSYVHIDSDARSKLDPKSKKCTFIGYGTDEFGYRFWDDQNRKITRSRDVIFNEKVLY 705

Query: 661  KDR---EKINSETTKQVGVELEWQENSRSDGTTEAQETSDPIAEEPDVEQVAPEQ-VRR- 720
            KDR   E  N++T  +    +E +E S S+  +  Q   + I   P VE V P   +RR 
Sbjct: 706  KDRSGAESNNADTHAEKTGFVELEEPSESEVHSRVQNNPENII--PQVEPVTPATGLRRS 765

Query: 721  -----------------------EPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLT 780
                                   EPE +DEALQV DS KWE A+ DEM SL  N TW LT
Sbjct: 766  SRVSKAPQCYSPSLYYLLLSDSGEPECYDEALQVGDSAKWESAMQDEMDSLMSNQTWKLT 825

Query: 781  ELPTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRIL 840
            ELP GK+AL NKWV++IK E DG +++KARLVVKG+ Q++G+DY EIFSPVVK+TTIR++
Sbjct: 826  ELPKGKKALHNKWVYRIKEEHDGSKRYKARLVVKGFQQKEGVDYTEIFSPVVKMTTIRLV 885

Query: 841  LSIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAP 900
            L IVA+ENLHLEQ+DVK  FLHGDL+E+IYM+QP+GF APGK+ ++CKL KSLYGLKQAP
Sbjct: 886  LGIVAAENLHLEQLDVKTAFLHGDLEEDIYMKQPQGFTAPGKEGLICKLAKSLYGLKQAP 945

Query: 901  RQWYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKAS 931
            RQWYKKFD FM  +GF R + + CCY+K++ + Y+ LLLYVDD++I GSS++EI +LK  
Sbjct: 946  RQWYKKFDGFMCTNGFTRCQADHCCYMKRFDNDYIILLLYVDDMLIAGSSIQEIKNLKNQ 1005

BLAST of CmoCh05G009260 vs. ExPASy TrEMBL
Match: A0A7N2KSK9 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 1086.6 bits (2809), Expect = 0.0e+00
Identity = 560/993 (56.39%), Positives = 728/993 (73.31%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  ++W L DRQ LG+IRLTLSR+VA N++KEKTT+DLMKALS MYEKPSA NKV+LM +
Sbjct: 47   MKAEEWALLDRQVLGVIRLTLSRSVAHNVVKEKTTADLMKALSGMYEKPSANNKVHLMKK 106

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+M++   VA H+NEFN I +QL S++I+F+DEI+ALI+++SLP SW+ +  A+S+S
Sbjct: 107  LFNLKMAENASVAQHLNEFNTITNQLSSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNS 166

Query: 121  RGSDKLKFDEIRNVVLSESIRKRETGDSSGN--ALSVDRRGRSKSKGSNKHGRSKS---- 180
             G +KLK+++IR+++L+E IR+R+ G++SG+  AL+++ RGR  ++ SN+ GRSKS    
Sbjct: 167  TGKEKLKYNDIRDLILAEEIRRRDAGETSGSGFALNLETRGRGNNRNSNR-GRSKSRNSN 226

Query: 181  KNREKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVD 240
            +NR KS +   V CW+CG   HFR  C   KKK      +DD +   TE+ +D L+L VD
Sbjct: 227  RNRSKSRSGQQVQCWNCGKTGHFRNQCKSPKKK-----NEDDSANAVTEEVQDALLLEVD 286

Query: 241  SPVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLW 300
            SP++ W+LDS ASFH++P +E+ QN+ +G+FGKVYLAD  AL++ G GDV I  P G++W
Sbjct: 287  SPLDDWVLDSGASFHTTPHREIIQNYVAGDFGKVYLADGSALDVVGMGDVRILLPNGSVW 346

Query: 301  TLQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECI 360
             L+ +R+IP L++NLIS+ QLD  G+A  F   +W + KGA V+ARG K+GTLY T+   
Sbjct: 347  LLEKIRHIPDLRRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKTGTLYMTSSPR 406

Query: 361  KMTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSF 420
               A A+++ ++SLWH RLGHMS KGMKML +KG L  LK +D  +CESC++GKQK+VSF
Sbjct: 407  DTIAVADASIDTSLWHRRLGHMSEKGMKMLLSKGKLPELKSIDFDMCESCILGKQKKVSF 466

Query: 421  TKAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTF 480
             K  R PK  +L++VHTD+WGPSPV+SLGGSRYY+TFIDD SRKVWVYF K+KSDVF TF
Sbjct: 467  LKTGRTPKAEKLELVHTDLWGPSPVASLGGSRYYITFIDDSSRKVWVYFLKNKSDVFETF 526

Query: 481  KKWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMN 540
            KKWKA VE +TGLK+KCLR D                               NG+AERMN
Sbjct: 527  KKWKAMVETETGLKVKCLRSDNGGEYIDGGFSEYCAAQGIRMEKTIPGTPQQNGVAERMN 586

Query: 541  RTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLR 600
            RTLNERARSMR+H+GLPKTFWADAVNTAAYLINR PSVP++F+ PEEVW+G E+K+SHL+
Sbjct: 587  RTLNERARSMRLHAGLPKTFWADAVNTAAYLINRGPSVPMEFRLPEEVWSGKEVKFSHLK 646

Query: 601  TFGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVM 660
             FGC +YVHID + R KLDAK+  C+FIGYG + FGYRFWDE+N KI+R  +VIF+E VM
Sbjct: 647  VFGCVSYVHIDSDVRSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVM 706

Query: 661  YKDREKINSETTKQVGVELEWQE-NSRSDGTTEAQ------------ETSDPIAE-EPDV 720
            YKDR  + S+ T+    + E+   +  ++GT + +            + S P+AE     
Sbjct: 707  YKDRSTVVSDVTEIDQKKSEFVNLDELTEGTVQKRGEEDKENVNSQVDLSTPVAEARRSS 766

Query: 721  EQVAPEQVRR------------EPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTE 780
              + P Q               EPE +DEALQ E+S KWE A+ DEM SL  N TW LTE
Sbjct: 767  RNIRPPQRYSPTLNYLLLTDGGEPEYYDEALQDENSSKWELAMKDEMDSLLGNQTWELTE 826

Query: 781  LPTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILL 840
            LP GK+AL NKWV++IK E DG +++KARLVVKG+ Q++GIDY EIFSPVVK++TIR++L
Sbjct: 827  LPVGKKALHNKWVYRIKNEHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKMSTIRLVL 886

Query: 841  SIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAPR 900
             +VA+ENLHLEQ+DVK  FLHGDL+E++YM QPEGF A G++ +VCKL KSLYGLKQAPR
Sbjct: 887  GMVAAENLHLEQLDVKTAFLHGDLEEDLYMIQPEGFIAQGQENLVCKLKKSLYGLKQAPR 946

Query: 901  QWYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASL 931
            QWYKKFDSFM + GF R E + CCY+K + +SY+ LLLYVDD++I GSS+ EIN+LK  L
Sbjct: 947  QWYKKFDSFMHRIGFKRCEADHCCYVKFFDNSYIILLLYVDDMLIAGSSIEEINNLKKQL 1006

BLAST of CmoCh05G009260 vs. ExPASy TrEMBL
Match: A0A7N2KYF5 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 1085.9 bits (2807), Expect = 0.0e+00
Identity = 559/996 (56.12%), Positives = 726/996 (72.89%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M   +W L DRQ LG+IRLTLSR+VA N++KEKTT+DLMKALS MYEKPSA NKV+LM +
Sbjct: 47   MKADEWALLDRQVLGVIRLTLSRSVAHNVVKEKTTADLMKALSGMYEKPSANNKVHLMKK 106

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+M++   VA H+NEFN I +QL S++I+F+DEI+ALI+++SLP SW+ +  A+S+S
Sbjct: 107  LFNLKMAENASVAQHLNEFNTITNQLSSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNS 166

Query: 121  RGSDKLKFDEIRNVVLSESIRKRETGDS--SGNALSVDRRGRSKSKGSNKHGRSKS---- 180
             G +KLK+++IR+++L+E IR+R+ G+S  SG+AL+++ RGR  ++ SN+ GRSKS    
Sbjct: 167  TGKEKLKYNDIRDLILAEEIRRRDAGESSGSGSALNLETRGRGNNRNSNR-GRSKSRNSN 226

Query: 181  KNREKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVD 240
            +NR KS +   V CW+CG   HFR  C   KKK      +DD +   TE+ +D L+L+VD
Sbjct: 227  RNRSKSRSGQQVQCWNCGKTGHFRNQCKSPKKK-----NEDDSANAVTEEVQDALLLAVD 286

Query: 241  SPVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLW 300
            SP++ W+LDS ASFH++P +E+ QN+ +G+FGKVYLAD  AL++ G GDV I  P G++W
Sbjct: 287  SPLDDWVLDSGASFHTTPHREIIQNYVAGDFGKVYLADGLALDVVGMGDVRILLPNGSVW 346

Query: 301  TLQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECI 360
             L+ +R+IP L++NLIS+ QLD  G+A  F   +W + KGA V+ARG K+GTLY T+   
Sbjct: 347  LLEKIRHIPDLRRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKTGTLYMTSSPR 406

Query: 361  KMTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSF 420
               A A++++++SLWH RLGHMS KGMKML +KG L  LK +D  +CESC++GKQK+VSF
Sbjct: 407  DTIAVADASTDTSLWHRRLGHMSEKGMKMLLSKGKLPELKSIDFDMCESCILGKQKKVSF 466

Query: 421  TKAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTF 480
             K  R PK  +L++VHTD+WGPSPV+SLGGSRYY+TFIDD SRKVWVYF K+KSDVF TF
Sbjct: 467  LKTGRTPKTEKLELVHTDLWGPSPVASLGGSRYYITFIDDSSRKVWVYFLKNKSDVFETF 526

Query: 481  KKWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMN 540
            KKWKA VE +TGLK+KCLR D                               NG+AERMN
Sbjct: 527  KKWKAMVETETGLKVKCLRSDNGGEYIDGGFSEYCAAQGIRMEKTIPGTPQQNGVAERMN 586

Query: 541  RTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLR 600
            RTLNERARSMR+H+GLPKTFWADAV+TAAYLINR PSVP++F+ PEEVW+G E+K+SHL+
Sbjct: 587  RTLNERARSMRLHAGLPKTFWADAVSTAAYLINRGPSVPMEFRLPEEVWSGKEVKFSHLK 646

Query: 601  TFGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVM 660
             FGC +YVHID + R KLDAK+  C+FIGYG + FGYRFWDE+N KI+R  +VIF+E VM
Sbjct: 647  VFGCVSYVHIDSDARSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVM 706

Query: 661  YKDREKINSETTKQVGVELEWQENSRSDGTTE------AQETSDPIAEEPDVEQVAPEQV 720
            YKDR  + S+ T   G++ +  E    D  TE       +E  + +  + D+     E  
Sbjct: 707  YKDRSTVVSDVT---GIDQKKSEFVNLDELTEGTVQKRGEEDKENVNSQVDLSTPVVEAR 766

Query: 721  RR-----------------------EPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWV 780
            R                        EPE +DEALQ E+S KW+ A+ DEM SL  N TW 
Sbjct: 767  RSSRNIRPPQRYSPTLNYLLLTDGGEPECYDEALQDENSSKWKLAMKDEMDSLLGNQTWE 826

Query: 781  LTELPTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIR 840
            LTELP GK+AL NKWV++IK E DG +++KARLVVKG+ Q++GIDY EIFSPVVK++TIR
Sbjct: 827  LTELPVGKKALHNKWVYRIKNEHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKMSTIR 886

Query: 841  ILLSIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQ 900
            ++L +VA+ENLHLEQ+DVK  FLHGDL+E++YM QPEGF A G++ +VCKL KSLYGLKQ
Sbjct: 887  LVLGMVAAENLHLEQLDVKTAFLHGDLEEDLYMIQPEGFIAQGQENLVCKLKKSLYGLKQ 946

Query: 901  APRQWYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLK 931
            APRQWYKKFDSFM + GF R E + CCY+K + +SY+ LLLYVDD++I GSS+ EIN+LK
Sbjct: 947  APRQWYKKFDSFMHRIGFKRCEADHCCYVKFFDNSYIILLLYVDDMLIAGSSIEEINNLK 1006

BLAST of CmoCh05G009260 vs. ExPASy TrEMBL
Match: A0A7N2M4T2 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 1085.9 bits (2807), Expect = 0.0e+00
Identity = 557/993 (56.09%), Positives = 728/993 (73.31%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  ++W L DRQ LG+IRLTLSR+VA N++KEKTT+DLMKALS MYEKPSA NKV+LM +
Sbjct: 47   MKAEEWALLDRQVLGVIRLTLSRSVAHNVVKEKTTADLMKALSGMYEKPSANNKVHLMKK 106

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+M++   VA H+NEFN I +QL S++I+F+DEI+ALI+++SLP SW+ +  A+S+S
Sbjct: 107  LFNLKMAENASVAQHLNEFNTITNQLSSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNS 166

Query: 121  RGSDKLKFDEIRNVVLSESIRKRETGDS--SGNALSVDRRGRSKSKGSNKHGRSKS---- 180
             G +KLK+++IR+++L+E IR+R+ G++  SG+AL+++ RGR  ++ SN+ GRSKS    
Sbjct: 167  TGKEKLKYNDIRDLILAEEIRRRDAGETSGSGSALNLETRGRGNNRNSNR-GRSKSRNSN 226

Query: 181  KNREKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVD 240
            +NR KS +   V CW+CG   HFR  C   KKK      +DD +   TE+ +D L+L+VD
Sbjct: 227  RNRSKSRSGQQVQCWNCGKTGHFRNQCKNPKKK-----NEDDSANAVTEEVQDALLLAVD 286

Query: 241  SPVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLW 300
            SP++ W+LDS ASFH++P +E+ QN+ +G+FGKVYLAD  AL++ G GDV I  P G++W
Sbjct: 287  SPLDDWVLDSGASFHTTPHREIIQNYVAGDFGKVYLADGSALDVVGMGDVRILLPNGSVW 346

Query: 301  TLQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECI 360
             L+ +R+IP L++NLIS+ QLD  G+A  F   +W + KGA V+ARG K+GTLY T+   
Sbjct: 347  LLEKIRHIPDLRRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKTGTLYMTSSPR 406

Query: 361  KMTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSF 420
               A A++++++SLWH RLGHMS KGMKML +KG L  LK +D  +CESC++GKQK+VSF
Sbjct: 407  DTIAVADASTDTSLWHRRLGHMSEKGMKMLLSKGKLPELKSIDFDMCESCILGKQKKVSF 466

Query: 421  TKAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTF 480
             K  R PK  +L++VHTD+WGPSPV+SLGGSRYY+TFIDD SRKVWVYF K+KSDVF TF
Sbjct: 467  LKTGRTPKAEKLELVHTDLWGPSPVASLGGSRYYITFIDDSSRKVWVYFLKNKSDVFETF 526

Query: 481  KKWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMN 540
            KKWKA VE +TGLK+KCLR D                               NG+AERMN
Sbjct: 527  KKWKAMVETETGLKVKCLRSDNGGEYIDGGFSEYCAAQGIRMEKTIPGTPQQNGVAERMN 586

Query: 541  RTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLR 600
            RTLNERARSMR+H+GLPKTFWADAV+TAAYLINR PSVP++F+ PEEVW+G E+K+SHL+
Sbjct: 587  RTLNERARSMRLHAGLPKTFWADAVSTAAYLINRGPSVPMEFRLPEEVWSGKEVKFSHLK 646

Query: 601  TFGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVM 660
             FGC +YVHID + R KLDAK+  C+FIGYG + FGYRFWDE+N KI+R  +VIF+E VM
Sbjct: 647  VFGCVSYVHIDSDARSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVM 706

Query: 661  YKDREKINSETTKQVGVELEWQE-NSRSDGTTE--AQETSDPIAEEPDVEQVAPEQVRR- 720
            YKDR  + S+ T+    + E+   +  ++GT +   +E  + I  + D+     E  R  
Sbjct: 707  YKDRSTVVSDVTEIDQKKSEFVNLDELTEGTVQKRGEEDKENINSQVDLSTPVAEARRSS 766

Query: 721  ----------------------EPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTE 780
                                  EPE +DEALQ E+S KWE A+ DEM SL  N TW LTE
Sbjct: 767  RNIRPPQRYSPTLNYLLLTDGGEPECYDEALQDENSSKWELAMKDEMDSLLGNQTWELTE 826

Query: 781  LPTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILL 840
            LP GK+AL NKWV++IK E DG +++KARLVVKG+ Q++GIDY EIFSPVVK++TIR++L
Sbjct: 827  LPVGKKALHNKWVYRIKNEHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKMSTIRLVL 886

Query: 841  SIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAPR 900
             +VA+ENLHLEQ+DVK  FLHGDL+E++YM QPEGF   G++ +VCKL KSLYGLKQAPR
Sbjct: 887  GMVAAENLHLEQLDVKTAFLHGDLEEDLYMIQPEGFITQGQENLVCKLRKSLYGLKQAPR 946

Query: 901  QWYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASL 931
            QWYKKFDSFM + GF R E + CCY+K + +SY+ LLLYVDD++I GSS+ EIN+LK  L
Sbjct: 947  QWYKKFDSFMHRIGFKRCEADHCCYVKFFDNSYIILLLYVDDMLIAGSSIEEINNLKKQL 1006

BLAST of CmoCh05G009260 vs. NCBI nr
Match: KYP68607.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 565/989 (57.13%), Positives = 719/989 (72.70%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  ++W L DRQALG+IRLTL++NVAFNI+ EKTT+ LMK LS+MYEKPSA NKV++M R
Sbjct: 24   MKQEEWNLLDRQALGVIRLTLAKNVAFNIVNEKTTASLMKTLSDMYEKPSAANKVHIMRR 83

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+M +G  V +HIN+FN I++ L S++I FEDE+KALIL+SSLPESW   V A+SSS
Sbjct: 84   LFNLKMGEGNLVINHINDFNTILAMLESVQIKFEDEVKALILLSSLPESWAATVTAVSSS 143

Query: 121  RGSDKLKFDEIRNVVLSESIRKRE----TGDSSGNALSVDRRGRSKSKGSNKHGRSKSKN 180
               +KLK ++IR+++LSE +R+R+    +  +S +AL+ + RGR+  KG N  GRSKS+ 
Sbjct: 144  ARDNKLKLNDIRDLILSEDVRRRDSEEPSSSTSSSALNTESRGRTTQKGYNSRGRSKSRA 203

Query: 181  REKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSI-YTTEDAEDVLILSVDS 240
            + +   + ++ CW+C  + HF   C   KK +NHK  DDD+S    T++ +D LI S+DS
Sbjct: 204  KGQPKFRNDIVCWNCDKRGHFTNQCKAPKKNKNHKKRDDDESANAATDEIDDALICSLDS 263

Query: 241  PVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWT 300
            P+ESWI+DS ASFH++PS EL  N+ SG FGKVYLAD K L I GKGD++I+T +G+ WT
Sbjct: 264  PIESWIMDSGASFHTTPSNELLTNYVSGRFGKVYLADGKPLNIVGKGDIAIRTSSGSHWT 323

Query: 301  LQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIK 360
            L++VR+IP LK+NLIS+ QLD  G+   FG  +W + KG ++VARG K G+LY  A+   
Sbjct: 324  LKNVRHIPALKRNLISVGQLDDEGHETTFGDGAWKVKKGNLIVARGKKRGSLYMVAD-EN 383

Query: 361  MTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSFT 420
            M A  E+A+NS LWH RLGHMS KGMK++  KG L  LK VD+G CE C++GKQ+++SF+
Sbjct: 384  MIAVTEAANNSFLWHQRLGHMSEKGMKLMATKGKLSKLKHVDVGTCEHCILGKQRKISFS 443

Query: 421  KAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTFK 480
            +  +  K  RL++VHTDVWGP+PV SLGGS YYVTFIDD +RKVWVYF K+KSDVF+ FK
Sbjct: 444  RQGKTLKTERLELVHTDVWGPAPVKSLGGSCYYVTFIDDATRKVWVYFLKNKSDVFSVFK 503

Query: 481  KWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMNR 540
            +WK EVENQTGLK+K L+ D                               NG+AERMNR
Sbjct: 504  RWKTEVENQTGLKLKSLKSDNGGEYNSHEFKNFCSEHGIRMIKTIPGTPEQNGVAERMNR 563

Query: 541  TLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLRT 600
            TLNERAR MRI SGLPK FWADA+NTAAYLINR PSVPL ++ PEEVW+G E+  SHLR 
Sbjct: 564  TLNERARCMRIQSGLPKVFWADAINTAAYLINRGPSVPLGYQLPEEVWSGKEVNLSHLRV 623

Query: 601  FGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVMY 660
            FGC +YV ID + RDKLD KA KCYFIGYGSDM+GYRFWD++N KI+R  +V F+EN+ Y
Sbjct: 624  FGCVSYVLIDSDSRDKLDPKAKKCYFIGYGSDMYGYRFWDDQNKKIIRSRNVTFNENLFY 683

Query: 661  KDR---EKINSETTKQVGVELEWQENSRSDGTTEAQETSDPIAEEPDVE--------QVA 720
            KDR   E I+++   +   ++E +E S SD T  +Q T   +  EP            VA
Sbjct: 684  KDRFSAESIDTDKLPEPSEKVELEEISESDITNRSQSTDIEVESEPASPPLRRSCRVSVA 743

Query: 721  PEQV-----------RREPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTELPTGK 780
            PE+              EPE FDEA+Q  DS+KWE A+ DEM+SL++N TW LTELP GK
Sbjct: 744  PERYSPSLHYLLLTDAGEPEYFDEAIQGNDSVKWELAMKDEMTSLQKNGTWSLTELPEGK 803

Query: 781  RALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILLSIVAS 840
             AL NKWV+++K E DGR+++KARLVVKG+ Q+ GID+ EIFSPVVK+TTIR++LSIVA+
Sbjct: 804  MALQNKWVYRLKEESDGRKRYKARLVVKGFQQKPGIDFTEIFSPVVKMTTIRVILSIVAA 863

Query: 841  ENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAPRQWYKK 900
            ENLHLEQ+DVK  FLHGDLDE+IYM QPEGF   GK+ +VCKL+KSLYGLKQAPRQWYKK
Sbjct: 864  ENLHLEQLDVKTAFLHGDLDEDIYMTQPEGFKVSGKENLVCKLHKSLYGLKQAPRQWYKK 923

Query: 901  FDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASLSSVFE 932
            F+ FM  SGF R + + CCY+KKY +SY+ L LYVDD++I G +M EIN LK  L+  FE
Sbjct: 924  FNEFMQNSGFSRCDMDHCCYVKKYVNSYIILALYVDDMLIAGPNMTEINRLKQQLTDHFE 983

BLAST of CmoCh05G009260 vs. NCBI nr
Match: RVW23445.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 1081.6 bits (2796), Expect = 0.0e+00
Identity = 556/992 (56.05%), Positives = 726/992 (73.19%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  ++W L DRQ LG+IRLTLSR+VA N++KEKTT+DLMKALS MYEKPSA NKV+LM +
Sbjct: 47   MKAEEWALLDRQVLGVIRLTLSRSVAHNVVKEKTTADLMKALSGMYEKPSANNKVHLMKK 106

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+M++   VA H+NEFN I +QL S++I+F+DEI+ALI+++SLP SW+ +  A+S+S
Sbjct: 107  LFNLKMAENASVAQHLNEFNTITNQLSSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNS 166

Query: 121  RGSDKLKFDEIRNVVLSESIRKRETGDS--SGNALSVDRRGRSKSKGSNK---HGRSKSK 180
             G +KLK+++IR+++L+E IR+R+ G++  SG+AL+++ RGR  ++ SN+   + R+ ++
Sbjct: 167  TGKEKLKYNDIRDLILAEEIRRRDAGETSGSGSALNLETRGRGNNRNSNQGRSNSRNSNR 226

Query: 181  NREKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVDS 240
            NR KS +   V CW+CG   HF+  C   KKK      +DD +   TE+ +D L+L+VDS
Sbjct: 227  NRSKSRSGQQVQCWNCGKTGHFKRQCKSPKKK-----NEDDSANAVTEEVQDALLLAVDS 286

Query: 241  PVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWT 300
            P++ W+LDS ASFH++P +E+ QN+ +G+FGKVYLAD  AL++ G GDV I  P G++W 
Sbjct: 287  PLDDWVLDSGASFHTTPHREIIQNYVAGDFGKVYLADGSALDVVGLGDVRISLPNGSVWL 346

Query: 301  LQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIK 360
            L+ VR+IP L++NLIS+ QLD  G+A  F   +W + KGA V+ARG K+GTLY T+    
Sbjct: 347  LEKVRHIPDLRRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKTGTLYMTSCPRD 406

Query: 361  MTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSFT 420
              A A++++++SLWH RLGHMS KGMKML +KG L  LK +D  +CESC++GKQK+VSF 
Sbjct: 407  TIAVADASTDTSLWHRRLGHMSEKGMKMLLSKGKLPELKSIDFDMCESCILGKQKKVSFL 466

Query: 421  KAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTFK 480
            K  R PK  +L++VHTD+WGPSPV+SLGGSRYY+TFIDD SRKVWVYF K+KSDVF TFK
Sbjct: 467  KTGRTPKAEKLELVHTDLWGPSPVASLGGSRYYITFIDDSSRKVWVYFLKNKSDVFVTFK 526

Query: 481  KWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMNR 540
            KWKA VE +TGLK+KCLR D                               NG+AERMNR
Sbjct: 527  KWKAMVETETGLKVKCLRSDNGGEYIDGGFSEYCAAQGIRMEKTIPGTPQQNGVAERMNR 586

Query: 541  TLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLRT 600
            TLNERARSMR+H+GLPKTFWADAV+TAAYLINR PSVP++F+ PEEVW+G E+K+SHL+ 
Sbjct: 587  TLNERARSMRLHAGLPKTFWADAVSTAAYLINRGPSVPMEFRLPEEVWSGKEVKFSHLKV 646

Query: 601  FGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVMY 660
            FGC +YVHID + R KLDAK+  C+FIGYG + FGYRFWDE+N KI+R  +VIF+E VMY
Sbjct: 647  FGCVSYVHIDSDARSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVMY 706

Query: 661  KDREKINSETT-----KQVGVEL-EWQENSRSDGTTEAQET-------SDPIAE-EPDVE 720
            KDR  + S+ T     K   V L E  E++   G  E +E        S P+ E      
Sbjct: 707  KDRSTVTSDVTEIDQKKSEFVNLDELTESTVQKGGEEDKENVNSQVDLSTPVVEVRRSSR 766

Query: 721  QVAPEQVRR------------EPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTEL 780
             + P Q               EPE +DEALQ E+S KWE A+ DEM SL  N TW LTEL
Sbjct: 767  NIRPPQRYSPVLNYLLLTDGGEPECYDEALQDENSSKWELAMKDEMDSLLGNQTWELTEL 826

Query: 781  PTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILLS 840
            P GK+AL NKWV++IK E DG +++KARLVVKG+ Q++GIDY EIFSPVVK++TIR++L 
Sbjct: 827  PVGKKALHNKWVYRIKNEHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKMSTIRLVLG 886

Query: 841  IVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAPRQ 900
            +VA+ENLHLEQ+DVK  FLHGDL+E++YM QPEGF   G++ +VCKL KSLYGLKQAPRQ
Sbjct: 887  MVAAENLHLEQLDVKTAFLHGDLEEDLYMIQPEGFIVQGQENLVCKLRKSLYGLKQAPRQ 946

Query: 901  WYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASLS 931
            WYKKFD+FM + GF R E + CCY+K + +SY+ LLLYVDD++IVGS + +IN+LK  LS
Sbjct: 947  WYKKFDNFMHRIGFKRCEADHCCYVKSFDNSYIILLLYVDDMLIVGSDIEKINNLKKQLS 1006

BLAST of CmoCh05G009260 vs. NCBI nr
Match: RVW35576.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 1078.9 bits (2789), Expect = 0.0e+00
Identity = 550/993 (55.39%), Positives = 726/993 (73.11%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  ++W L DRQ LG+IRLTLSR+VA N++KEKTT+DLMKALS MYEKPSA NKV+LM +
Sbjct: 47   MKAEEWALLDRQVLGVIRLTLSRSVAHNVVKEKTTADLMKALSGMYEKPSANNKVHLMKK 106

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+M++   VA H+NEFN I +QL S++I+F+DEI+ALI+++SLP SW+ +  A+S+S
Sbjct: 107  LFNLKMAENASVAQHLNEFNTITNQLSSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNS 166

Query: 121  RGSDKLKFDEIRNVVLSESIRKRETGDS--SGNALSVDRRGRSKSKGSNK---HGRSKSK 180
             G +KLK+++IR+++L+E IR+R+ G++  SG+AL+++ RGR  ++ SN+   + R+ ++
Sbjct: 167  TGKEKLKYNDIRDLILAEEIRRRDAGETSGSGSALNLETRGRGNNRNSNQGRSNSRNSNR 226

Query: 181  NREKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVDS 240
            NR KS +   V CW+CG   HF+  C   KKK      +DD +   TE+ +D L+L+VDS
Sbjct: 227  NRSKSRSGQQVQCWNCGKTGHFKRQCKSPKKK-----NEDDSANAVTEEVQDALLLAVDS 286

Query: 241  PVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWT 300
            P++ W+LDS ASFH++P +E+ QN+ +G+FGKVYLAD  AL++ G GDV I  P G++W 
Sbjct: 287  PLDDWVLDSGASFHTTPHREIIQNYVAGDFGKVYLADGSALDVVGLGDVRISLPNGSVWL 346

Query: 301  LQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIK 360
            L+ VR+IP L++NLIS+ QLD  G+A  F   +W + KGA V+ARG K+GTLY T+    
Sbjct: 347  LEKVRHIPDLRRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKTGTLYMTSCPRD 406

Query: 361  MTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSFT 420
              A A++++++SLWH RLGHMS KGMKML +KG L  LK +D  +CESC++GKQK+VSF 
Sbjct: 407  TIAVADASTDTSLWHRRLGHMSEKGMKMLLSKGKLPELKSIDFDMCESCILGKQKKVSFL 466

Query: 421  KAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTFK 480
            K  R PK  +L++VHTD+WGPSPV+SLGGSRYY+TFIDD SRKVWVYF K+KSDVF TFK
Sbjct: 467  KTGRTPKSEKLELVHTDLWGPSPVASLGGSRYYITFIDDSSRKVWVYFLKNKSDVFVTFK 526

Query: 481  KWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMNR 540
            KWK  VE +TGLK+KCLR D                               NG+AERMNR
Sbjct: 527  KWKVMVETETGLKVKCLRSDNGGEYIDGGFSEYCAAQGIRMEKTIPGTPQQNGVAERMNR 586

Query: 541  TLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLRT 600
            TLNERARSMR+H+GLPKTFWADA++TAAYLINR PSVP++F+ PEEVW+G E+K+SHL+ 
Sbjct: 587  TLNERARSMRLHAGLPKTFWADAISTAAYLINRGPSVPMEFRLPEEVWSGKEVKFSHLKV 646

Query: 601  FGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVMY 660
            FGC +YVHID + R KLDAK+  C+FIGYG + FGYRFWDE+N KI+R  +VIF+E VMY
Sbjct: 647  FGCVSYVHIDSDARSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVMY 706

Query: 661  KDREKINSETTKQVGVELEW---QENSRSDGTTEAQETSDPIAEEPDVEQVAPEQVRR-- 720
            KDR  + S+ T+    + E+    E ++S      +E  + +  + D+     E VRR  
Sbjct: 707  KDRSTVTSDVTEIDQKKSEFVNLDELTKSTVQKGGEEDKENVNSQVDLSTPVVE-VRRSS 766

Query: 721  ----------------------EPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTE 780
                                  EPE +DEALQ E+S KWE A+ DEM SL  N TW LTE
Sbjct: 767  RNTRPPQRYSPVLNYLLLTDGGEPECYDEALQDENSSKWELAMKDEMDSLLGNQTWELTE 826

Query: 781  LPTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILL 840
            LP GK+AL NKWV++IK E DG +++KARLVVKG+ Q++GIDY EIFSPVVK++TIR++L
Sbjct: 827  LPVGKKALHNKWVYRIKNEHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKMSTIRLVL 886

Query: 841  SIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQAPR 900
             +VA+ENLHLEQ+DVK  FLHGDL+E++YM QPEGF   G++ +VCKL KSLYGLKQAPR
Sbjct: 887  GMVAAENLHLEQLDVKTAFLHGDLEEDLYMIQPEGFIVQGQENLVCKLRKSLYGLKQAPR 946

Query: 901  QWYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASL 931
            QWYKKFD+FM + GF R E + CCY+K + +SY+ LLLYVDD++IVGS + +IN+LK  L
Sbjct: 947  QWYKKFDNFMHRIGFKRCEADHCCYVKSFDNSYIILLLYVDDMLIVGSDIEKINNLKKQL 1006

BLAST of CmoCh05G009260 vs. NCBI nr
Match: RVW69546.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 1077.0 bits (2784), Expect = 0.0e+00
Identity = 554/998 (55.51%), Positives = 725/998 (72.65%), Query Frame = 0

Query: 1    MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
            M  ++W L DRQ LG+IRLTLSR+VA N++KEKTT+DLMKALS MYEKPSA NKV+LM +
Sbjct: 47   MKAEEWALLDRQVLGVIRLTLSRSVAHNVVKEKTTADLMKALSGMYEKPSANNKVHLMKK 106

Query: 61   LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
            LFNL+M++   VA H+NEFN I +QL S++I+F+DEI+ALI+++SLP SW+ +  A+S+S
Sbjct: 107  LFNLKMTENASVAQHLNEFNTITNQLSSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNS 166

Query: 121  RGSDKLKFDEIRNVVLSESIRKRETGDS--SGNALSVDRRGRSKSKGSNK---HGRSKSK 180
             G +KLK+++IR+++L+E IR+R+ G++  SG+AL+++ RGR  ++ SN+   + R+ ++
Sbjct: 167  TGKEKLKYNDIRDLILAEEIRRRDAGETLGSGSALNLETRGRGNNRNSNQGRSNFRNSNR 226

Query: 181  NREKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVDS 240
            NR KS +   V CW+CG   HF+  C   KKK      +DD +   TE+ +D L+L+VDS
Sbjct: 227  NRSKSRSGQQVQCWNCGKTGHFKRQCKSPKKK-----NEDDSANAVTEEVQDALLLAVDS 286

Query: 241  PVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWT 300
            P++ W+LDS ASFH++P +E+ QN+ +G+FGKVYLAD  AL++ G GDV I  P G++W 
Sbjct: 287  PLDDWVLDSGASFHTTPHREIIQNYVAGDFGKVYLADGSALDVVGLGDVRISLPNGSVWL 346

Query: 301  LQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIK 360
            L+ VR+IP L++NLIS+ QLD  G+A  F   +W + KGA V+ARG K+GTLY T+    
Sbjct: 347  LEKVRHIPDLRRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKAGTLYMTSCPRD 406

Query: 361  MTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSFT 420
              A A++++++SLWH RLGHMS KGMKML +KG L  LK +D  +CESC++GKQK+VSF 
Sbjct: 407  TIAVADASTDTSLWHRRLGHMSEKGMKMLLSKGKLPELKSIDFDMCESCILGKQKKVSFL 466

Query: 421  KAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTFK 480
            K  R PK  +L++VHTD+WGPSPV+SLGGSRYY+TFIDD SRKVWVYF K+K DVF TFK
Sbjct: 467  KTGRTPKAEKLELVHTDLWGPSPVASLGGSRYYITFIDDSSRKVWVYFLKNKFDVFVTFK 526

Query: 481  KWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMNR 540
            KWK  VE +TGLK+KCLR D                               NG+AERMNR
Sbjct: 527  KWKVMVETETGLKVKCLRSDNGGEYIDGGFSEYCVAQGIRMEKTIPGTPQQNGVAERMNR 586

Query: 541  TLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLRT 600
            TLNERARSMR+H+GLPKTFWADAV+TAAYLINR PSVP++F+ PEEVW+G E+K+SHL+ 
Sbjct: 587  TLNERARSMRLHAGLPKTFWADAVSTAAYLINRGPSVPMEFRLPEEVWSGKEVKFSHLKV 646

Query: 601  FGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVMY 660
            FGC +YVHID + R KLDAK+  C+FIGYG + FGYRFWDE+N KI+R  +VIF+E VMY
Sbjct: 647  FGCVSYVHIDSDARSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVMY 706

Query: 661  KDREKINSETTKQVGVELEWQENSRSDGTTEAQETSDPIAEEPDVEQVAPE--------Q 720
            KDR  + S+ T     E++ Q+ S      E  E++     E D E V  +        +
Sbjct: 707  KDRSTVTSDVT-----EID-QKKSEFVNLDELTESTIQKGGEKDKENVNSQVDLSTPVGE 766

Query: 721  VRR------------------------EPESFDEALQVEDSIKWEQAVDDEMSSLKRNNT 780
            VRR                        EPE +DEALQ E+S KWE A+ DEM SL  N T
Sbjct: 767  VRRSSRNIRPPQRYSPVLNYLLLTDGGEPECYDEALQDENSSKWELAMKDEMDSLLGNQT 826

Query: 781  WVLTELPTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTT 840
            W LTELP GK+AL NKWV++IK E DG +++KARLVVKG+ Q++GIDY EIFSPVVK++T
Sbjct: 827  WELTELPVGKKALHNKWVYRIKNEHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKMST 886

Query: 841  IRILLSIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGL 900
            IR++L +VA+ENLHLEQ+DVK TFLHGDL+E++YM QPEGF   G++ +VCKL KSLYGL
Sbjct: 887  IRLVLGMVAAENLHLEQLDVKTTFLHGDLEEDLYMIQPEGFIVQGQENLVCKLRKSLYGL 946

Query: 901  KQAPRQWYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINH 931
            KQAPRQWYKKFD+FM + GF R E + CCY K + +SY+ LLLYVDD++IVGS + +IN+
Sbjct: 947  KQAPRQWYKKFDNFMHRIGFKRCEADHCCYFKSFDNSYIILLLYVDDMLIVGSDIEKINN 1006

BLAST of CmoCh05G009260 vs. NCBI nr
Match: RVW81243.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 551/996 (55.32%), Positives = 725/996 (72.79%), Query Frame = 0

Query: 1   MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMHR 60
           M  ++W L DRQ LG+IRLTLSR+VA N++KEKTT+DLMKALS MYEKPSA NKV+LM +
Sbjct: 1   MKAEEWALLDRQVLGVIRLTLSRSVAHNVVKEKTTADLMKALSGMYEKPSANNKVHLMKK 60

Query: 61  LFNLQMSKGGRVADHINEFNMIISQLGSMKINFEDEIKALILMSSLPESWDTVVAAISSS 120
           LFNL+M++   VA H+NEFN I +QL S++I+F+DEI+ALI+++SLP SW+ +  A+S+S
Sbjct: 61  LFNLKMAENASVAQHLNEFNTITNQLSSVEIDFDDEIRALIVLASLPNSWEAMRMAVSNS 120

Query: 121 RGSDKLKFDEIRNVVLSESIRKRETGD--SSGNALSVDRRGRSKSKGSNK---HGRSKSK 180
            G +KLK+++IR+++L+E IR+R+ G+   SG+AL+++ RGR  ++ SN+   + R+ ++
Sbjct: 121 TGKEKLKYNDIRDLILAEEIRQRDAGEISGSGSALNLETRGRGNNRNSNQGRSNSRNSNR 180

Query: 181 NREKSSNKPNVTCWSCGGKEHFRTDCTKLKKKQNHKSEDDDDSIYTTEDAEDVLILSVDS 240
           NR KS +   + CW+CG   HF+  C   KKK      +DD +   TE+ +D L+L+VDS
Sbjct: 181 NRSKSRSGQQIQCWNCGKTGHFKRQCKSPKKK-----NEDDSANAVTEEVQDALLLAVDS 240

Query: 241 PVESWILDSCASFHSSPSKELFQNFKSGNFGKVYLADNKALEIEGKGDVSIQTPAGNLWT 300
           P++ W+LDS ASFH++P +E+ QN+ +G+FGKVYLAD  AL++ G GDV I  P G++W 
Sbjct: 241 PLDDWVLDSGASFHTTPHREIIQNYVAGDFGKVYLADGSALDVVGLGDVRISLPNGSVWL 300

Query: 301 LQDVRYIPGLKKNLISIRQLDSTGYAAEFGKSSWMIVKGAMVVARGTKSGTLYTTAECIK 360
           L+ VR+IP LK+NLIS+ QLD  G+A  F   +W + KGA V+ARG K+GTLY T+    
Sbjct: 301 LEKVRHIPDLKRNLISVGQLDDEGHAILFVGGTWKVTKGARVLARGKKTGTLYMTSCPRD 360

Query: 361 MTAAAESASNSSLWHNRLGHMSVKGMKMLTAKGALEGLKFVDMGLCESCVMGKQKRVSFT 420
             A A++++++SLWH RLGHMS K MKML +KG L  LK +D  +CESC++GKQK+VSF 
Sbjct: 361 TIAVADASTDTSLWHRRLGHMSEKEMKMLLSKGKLPELKSIDFDMCESCILGKQKKVSFL 420

Query: 421 KAAREPKIVRLKMVHTDVWGPSPVSSLGGSRYYVTFIDDFSRKVWVYFRKHKSDVFTTFK 480
           K  R PK  +L++VHTD+WGPSPV+SLGGSRYY+TFIDD SRKVWVYF K+KSDVF TFK
Sbjct: 421 KTGRTPKAEKLELVHTDLWGPSPVASLGGSRYYITFIDDSSRKVWVYFLKNKSDVFVTFK 480

Query: 481 KWKAEVENQTGLKIKCLRQD-------------------------------NGIAERMNR 540
           KWK  VE +TGLK+KCLR D                               NG+AERMNR
Sbjct: 481 KWKVMVETETGLKVKCLRSDNGGEYIDGGFSEYCAAQGIRMEKTIPGTPQQNGVAERMNR 540

Query: 541 TLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSHLRT 600
           TLNERARSMR+H+GLPKTFWADAV+TAAYLINR PSVP++F+ PEEVW+G E+K+SHL+ 
Sbjct: 541 TLNERARSMRLHAGLPKTFWADAVSTAAYLINRGPSVPMEFRLPEEVWSGKEVKFSHLKV 600

Query: 601 FGCTAYVHIDLEKRDKLDAKAVKCYFIGYGSDMFGYRFWDEKNMKILRHCDVIFDENVMY 660
           FGC +YVHID + R KLDAK+  C+FIGYG + FGYRFWDE+N KI+R  +VIF+E VMY
Sbjct: 601 FGCVSYVHIDSDARSKLDAKSKICFFIGYGDEKFGYRFWDEQNRKIIRSRNVIFNEQVMY 660

Query: 661 KDREKINSETTKQVGVELEWQENSRSDGTTEA------QETSDPIAEEPDVEQVAPEQVR 720
           KDR  + S+ T+   ++ +  E    D  TE+      +E  + +  + D+     E VR
Sbjct: 661 KDRSTVTSDVTE---IDQKKSEFVNLDELTESTVQKGGEEDKENVNSQVDLSTPVVE-VR 720

Query: 721 R------------------------EPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWV 780
           R                        EPE +DEALQ E+S KWE A+ DEM SL  N TW 
Sbjct: 721 RSSRNTRPPQRYSPVLNYLLLTDGGEPECYDEALQDENSSKWELAMKDEMDSLLGNQTWE 780

Query: 781 LTELPTGKRALLNKWVFKIKVEPDGRRKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIR 840
           LTELP GK+AL NKWV++IK E DG +++KARLVVKG+ Q++GIDY EIFSPVVK++TIR
Sbjct: 781 LTELPVGKKALHNKWVYRIKNEHDGSKRYKARLVVKGFQQKEGIDYTEIFSPVVKMSTIR 840

Query: 841 ILLSIVASENLHLEQMDVKMTFLHGDLDEEIYMQQPEGFAAPGKKYMVCKLNKSLYGLKQ 900
           ++L +VA+ENLHLEQ+DVK  FLHGDL+E++YM QPEGF   G++ +VCKL KSLYGLKQ
Sbjct: 841 LVLGMVAAENLHLEQLDVKTAFLHGDLEEDLYMIQPEGFIVQGQENLVCKLRKSLYGLKQ 900

Query: 901 APRQWYKKFDSFMSKSGFHRSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLK 931
           APRQWYKKFD+FM + GF R E + CCY+K + +SY+ LLLYVDD++IVGS + +IN+LK
Sbjct: 901 APRQWYKKFDNFMHRIGFKRCEADHCCYVKSFDNSYIILLLYVDDMLIVGSDIEKINNLK 960

BLAST of CmoCh05G009260 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 201.4 bits (511), Expect = 3.1e-51
Identity = 110/257 (42.80%), Positives = 168/257 (65.37%), Query Frame = 0

Query: 679 REPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTELPTGKRALLNKWVFKIKVEPD 738
           +EP +++EA   ++ + W  A+DDE+ +++  +TW +  LP  K+ +  KWV+KIK   D
Sbjct: 84  KEPSTYNEA---KEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 739 GR-RKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILLSIVASENLHLEQMDVKMTFL 798
           G   ++KARLV KGY+Q++GID++E FSPV KLT+++++L+I A  N  L Q+D+   FL
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 799 HGDLDEEIYMQQPEGFAAPGKKYM----VCKLNKSLYGLKQAPRQWYKKFDSFMSKSGFH 858
           +GDLDEEIYM+ P G+AA     +    VC L KS+YGLKQA RQW+ KF   +   GF 
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 859 RSEKNQCCYLKKYTDSYVFLLLYVDDIIIVGSSMREINHLKASLSSVFEMKDLGAAKQIL 918
           +S  +   +LK     ++ +L+YVDDIII  ++   ++ LK+ L S F+++DLG  K  L
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

Query: 919 GMRISRDRSAGTLNLSQ 931
           G+ I+  RSA  +N+ Q
Sbjct: 324 GLEIA--RSAAGINICQ 335

BLAST of CmoCh05G009260 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 91.7 bits (226), Expect = 3.4e-18
Identity = 44/86 (51.16%), Positives = 55/86 (63.95%), Query Frame = 0

Query: 502 MNRTLNERARSMRIHSGLPKTFWADAVNTAAYLINRRPSVPLKFKFPEEVWTGNELKYSH 561
           MNRT+ E+ RSM    GLPKTF ADA NTA ++IN+ PS  + F  P+EVW  +   YS+
Sbjct: 1   MNRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSY 60

Query: 562 LRTFGCTAYVHIDLEKRDKLDAKAVK 588
           LR FGC AY+H D     KL  +A K
Sbjct: 61  LRRFGCVAYIHCD---EGKLKPRAKK 83

BLAST of CmoCh05G009260 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 81.6 bits (200), Expect = 3.6e-15
Identity = 45/104 (43.27%), Positives = 66/104 (63.46%), Query Frame = 0

Query: 677 VRREPESFDEALQVEDSIKWEQAVDDEMSSLKRNNTWVLTELPTGKRALLNKWVFKIKVE 736
           +++EP+S   AL+      W QA+ +E+ +L RN TW+L   P  +  L  KWVFK K+ 
Sbjct: 24  IKKEPKSVIFALK---DPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLH 83

Query: 737 PDGR-RKFKARLVVKGYSQRKGIDYVEIFSPVVKLTTIRILLSI 780
            DG   + KARLV KG+ Q +GI +VE +SPVV+  TIR +L++
Sbjct: 84  SDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNV 124

BLAST of CmoCh05G009260 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 71.2 bits (173), Expect = 4.8e-12
Identity = 38/112 (33.93%), Positives = 59/112 (52.68%), Query Frame = 0

Query: 331 IVKGAMVVARGTKSGTLYT---TAECIKMTAAAESASNSSLWHNRLGHMSVKGMKMLTAK 390
           ++KG   + +G +  +LY    + E  +   A  +   + LWH+RL HMS +GM++L  K
Sbjct: 31  VLKGCRTILKGNRHDSLYILQGSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVKK 90

Query: 391 GALEGLKFVDMGLCESCVMGKQKRVSFTKAAREPKIVRLKMVHTDVWGPSPV 440
           G L+  K   +  CE C+ GK  RV+F+      K   L  VH+D+WG   V
Sbjct: 91  GFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTK-NPLDYVHSDLWGAPSV 141

BLAST of CmoCh05G009260 vs. TAIR 10
Match: AT3G29785.1 (unknown protein; Has 90 Blast hits to 90 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 90; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 57.0 bits (136), Expect = 9.4e-08
Identity = 27/55 (49.09%), Positives = 38/55 (69.09%), Query Frame = 0

Query: 1  MTTKQWKLRDRQALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKV 56
          M+   W +  RQ L +IRLT+S+N+A N+ KEK+   LMK LS++Y+KPS  N V
Sbjct: 41 MSQDDWNILYRQVLDVIRLTISKNIAHNVAKEKSPDGLMKVLSDIYKKPSTNNTV 95
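
The percentages in each HSP statistics line are the identical (or positive-scoring) positions divided by the alignment length. As a minimal sketch, the Python below parses one such line and recomputes both percentages so they can be checked against the reported values. The function name `parse_hsp_stats` and the returned field names are hypothetical and not part of any BLAST tool; the pattern also tolerates a dropped letter in the "Positives" label.

```python
import re

# Pattern for a per-HSP statistics line such as
# "Identity = 110/257 (42.80%), Positives = 168/257 (65.37%), Query Frame = 0".
# "Posi?tives" accepts the label with or without the first "i".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse an HSP statistics line and recompute its percentages.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 110/257 (42.80%), Positives = 168/257 (65.37%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a cheap consistency check when these lines are scraped from a web page rather than read from BLAST's tabular output.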

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
P10978 | 1.1e-205 | 41.63 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 3.9e-99 | 27.24 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q9ZT94 | 2.0e-71 | 24.17 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 3.4e-71 | 24.61 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P25600 | 2.8e-17 | 35.00 | Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...

Match Name | E-value | Identity (%) | Description
A0A151TNK0 | 0.0e+00 | 57.13 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A5B7BAK4 | 0.0e+00 | 57.75 | Uncharacterized protein OS=Davidia involucrata OX=16924 GN=Din_035380 PE=4 SV=1
A0A7N2KSK9 | 0.0e+00 | 56.39 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A7N2KYF5 | 0.0e+00 | 56.12 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A7N2M4T2 | 0.0e+00 | 56.09 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1

Match Name | E-value | Identity (%) | Description
KYP68607.1 | 0.0e+00 | 57.13 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
RVW23445.1 | 0.0e+00 | 56.05 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
RVW35576.1 | 0.0e+00 | 55.39 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
RVW69546.1 | 0.0e+00 | 55.51 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
RVW81243.1 | 0.0e+00 | 55.32 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]

Match Name | E-value | Identity (%) | Description
AT4G23160.1 | 3.1e-51 | 42.80 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00710.1 | 3.4e-18 | 51.16 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein
ATMG00820.1 | 3.6e-15 | 43.27 | Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00300.1 | 4.8e-12 | 33.93 | Gag-Pol-related retrotransposon family protein
AT3G29785.1 | 9.4e-08 | 49.09 | unknown protein; Has 90 Blast hits to 90 proteins in 7 species: Archae - 0; Bact...
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001878 | Zinc finger, CCHC-type | SMART | SM00343 | c2hcfinal6 | coord: 187..203; e-value: 0.0087; score: 23.9
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 424..495; e-value: 3.7E-9; score: 38.1
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 496..547; e-value: 1.1E-9; score: 40.2
None | No IPR available | PFAM | PF14223 | Retrotran_gag_2 | coord: 6..144; e-value: 4.5E-29; score: 101.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 638..660
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 638..683
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 162..176
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 143..185
None | No IPR available | PANTHER | PTHR34676 | FAMILY NOT NAMED | coord: 1..349
None | No IPR available | PANTHER | PTHR34676:SF1 | ZINC FINGER, CCHC-TYPE, TUBBY C-TERMINAL-LIKE DOMAIN PROTEIN-RELATED | coord: 1..349
IPR013103 | Reverse transcriptase, RNA-dependent DNA polymerase | PFAM | PF07727 | RVT_2 | coord: 710..929; e-value: 1.1E-67; score: 228.2
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 357..409; e-value: 1.4E-13; score: 50.5
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 360..556; score: 9.419276
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 427..550
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 712..920
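
The `coord:` ranges above give each domain's position on the protein, so sorting them by start coordinate recovers the order of domains along the polypeptide. The following Python is a minimal sketch of that step for the three Pfam hits listed here. It assumes the coordinates are 1-based and inclusive (so length is end - start + 1); the helper names `parse_coord` and `domain_layout` are hypothetical.

```python
import re

# Matches an alignment range such as "coord: 710..929".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: a..b' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(match.group(1)), int(match.group(2))


def domain_layout(rows):
    """Sort (name, alignment_text) rows by start position.

    Returns a list of (name, start, end, length) tuples. Length assumes
    1-based inclusive coordinates, which is an assumption of this sketch.
    """
    layout = []
    for name, alignment in rows:
        start, end = parse_coord(alignment)
        layout.append((name, start, end, end - start + 1))
    return sorted(layout, key=lambda item: item[1])


# Pfam hits copied from the InterPro table above, in table order.
PFAM_ROWS = [
    ("Retrotran_gag_2", "coord: 6..144"),
    ("RVT_2", "coord: 710..929"),
    ("gag_pre-integrs", "coord: 357..409"),
]

if __name__ == "__main__":
    for name, start, end, length in domain_layout(PFAM_ROWS):
        print(name, start, end, length)
```

For these rows the sorted order is Retrotran_gag_2, gag_pre-integrs, RVT_2, with the PROSITE integrase core (360..556) falling between the last two.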

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh05G009260.1 | CmoCh05G009260.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0016740 | transferase activity
molecular_function | GO:0008270 | zinc ion binding