Cmc04g0107081 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc04g0107081
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Integrase catalytic domain-containing protein
Location: CMiso1.1chr04: 25485910 .. 25487829 (+)
RNA-Seq Expression: Cmc04g0107081
Synteny: Cmc04g0107081
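
The Location field above can be turned into a feature length with a few lines of Python. This is a minimal sketch: the regular expression is an assumption about how this page prints coordinates, and the length formula assumes 1-based inclusive coordinates (as in GFF-style annotation); `parse_location` and `feature_length` are hypothetical helper names, not part of any database API.

```python
import re

# Location string copied from the Overview. The pattern below is an
# assumption about this page's layout, not a documented format.
LOCATION = "CMiso1.1chr04: 25485910 .. 25487829 (+)"


def parse_location(text):
    """Split 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location(LOCATION)
length = feature_length(start, end)
print(seqid, strand, length, "bp,", length // 3, "codons")
```

Under that assumption the feature spans 1920 bp, which is 640 codons. That would fit a 639-residue protein plus a stop codon, in line with the 639-column alignment reported for the top BLAST hit further down.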
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTAATGGCGATTTCTGGACGAAATAAGGCCGGGTTCATCACCGGAAAAATCCAGAAACCTTCTGATGGCGTCTTACTCGATGCCTGGATCTGCAACAATGATATTCTAGCCTCATGGATTCTCAACTCTGTTTCAAAGGAAATTGCAGCAAGCATTATCTACACAGGATCAATTAAAGAAATATGGGATGAATTACGCCAAAGATTCAAACAATCAAATGGTCCCAGCATATACCAACTTCGAAAAGAATTTGTCACGTTGCGGCAAGGAAACCTGACAATTGAAACATACTACACAAAACTCAAAACCATATGGCAGAATCTAAATGAATACCGATTTACAAATGACTGCACATGTGGAGGTTTGAAACCATTCATCGATCATCTTGAATCTGAATATATTATGGCCTTCCTGATGGGATTAAATGATTCCTATGCTGCTGTAAGAGCACAAATCCTCCTTATGCAACCTTTACCTTCAATCAACACTGTATTTTCTTTGCTGATTCAAGAAGAACAACAAAGATCTGCTGGCATTTTAACCCCTCCCATTGATCCTGTGGCTTTAAATATTGCTTCATCCATGGCTATCTCAACTGATCGAAATCGCAAAAAAGAGCGCCCTACTTGCTCCTACTGTGGAATTAAGGGACACATAGCAGACAAATGTTACAAGAAACATGGATATCCTCCAGGTTACAAGCCAAGAAACTCAAACTCCATTACTACTGCACCAGATACCTCAAAAACCAACAATGTTGCCAATACCAATTCAGCTGCTGCCAATCGTTCTCCAGATTTTTTCTCAAGCCTAAATTCAGAACAATACAGTCAATTGATGACTCTACTCAACAACCATCTCCAAGCAGCCACTACAGCACCTATCACTACGGCAACTGCCATAACCCACACTTCAGGTATCTTCGCCTTAACTTCACATAATAATCAGTCACATGATGAATGGATCATAGACTCAGGAGCATCTAGACACATATGTCATGATAAATCCCTTTTCAAAAATTGGAGTCATACAAATAACATGTTTGTTATGCTACCTAATGGCCATCGTATATCTGTTGATCTTATTGGAGATATCCAAATAAATGGGTCCCTGACATTGAAAGATGTACTGTTCGTGTCCCAATTTGCATACAACCTCATCTCGGTCAGTTGCTTATTGATCACTAAAAATATATCTCTTGACTTTCAAAGTACTTGTTGCATCATACAGGATCTTTCCCGACCAATGATGATTGGCAAGGCTAGTTGCCAAAATGGACTGTATGTTCTCAATAAAGAGGCCAATACCAACTGTATTGCTGCTGGAGTTAAAATCAATGCCATTTCAGTTGATACATGGCATCAACGTTTAGGTCACTTATCACCTAAATGTCTCTCTTCTTTGTCTTCAACCTTATGTTTGTCTAATCATTCCATACATCATTCATCATGTCATGTATGCCCATTAGCCAAACAAAAAAGGTTATCATTTCATTCAAATAATAATGTTGCTTCTTCTCCATTTGACCTTGTCCATGCTGATATATGGGGACCTTTTAAAATACCTTCTTATGGTGGCTACAAATATTTCTTGACACTAGTTGATGACTGCTTACGCTTCACATGGGTATATATGCTAAGGCAAAAATCTGATGTTCTTCACATTGTACCTAAATTTTTTCAGCTCATCGAAACTCAGTTTTCAAAAGTCATCAAGAGTTTTCGGTCAGACAATGCCCCTGAACTAAAGTTGACTGAATTCTTTGCTCAGAAGGGAACTGTCCACCAATTCTCTTGTGTTGAACAACCTCAACAAAACTCAGTAGTAGAGCGGAAACACCAGCACCTTCTTAATGTAGCTCGTGCTCTTTTTTTCAGTCAAGAATTCCAATCAGTTTTTGGGCAGATTGCATCCTAA

mRNA sequence

ATGTTAATGGCGATTTCTGGACGAAATAAGGCCGGGTTCATCACCGGAAAAATCCAGAAACCTTCTGATGGCGTCTTACTCGATGCCTGGATCTGCAACAATGATATTCTAGCCTCATGGATTCTCAACTCTGTTTCAAAGGAAATTGCAGCAAGCATTATCTACACAGGATCAATTAAAGAAATATGGGATGAATTACGCCAAAGATTCAAACAATCAAATGGTCCCAGCATATACCAACTTCGAAAAGAATTTGTCACGTTGCGGCAAGGAAACCTGACAATTGAAACATACTACACAAAACTCAAAACCATATGGCAGAATCTAAATGAATACCGATTTACAAATGACTGCACATGTGGAGGTTTGAAACCATTCATCGATCATCTTGAATCTGAATATATTATGGCCTTCCTGATGGGATTAAATGATTCCTATGCTGCTGTAAGAGCACAAATCCTCCTTATGCAACCTTTACCTTCAATCAACACTGTATTTTCTTTGCTGATTCAAGAAGAACAACAAAGATCTGCTGGCATTTTAACCCCTCCCATTGATCCTGTGGCTTTAAATATTGCTTCATCCATGGCTATCTCAACTGATCGAAATCGCAAAAAAGAGCGCCCTACTTGCTCCTACTGTGGAATTAAGGGACACATAGCAGACAAATGTTACAAGAAACATGGATATCCTCCAGGTTACAAGCCAAGAAACTCAAACTCCATTACTACTGCACCAGATACCTCAAAAACCAACAATGTTGCCAATACCAATTCAGCTGCTGCCAATCGTTCTCCAGATTTTTTCTCAAGCCTAAATTCAGAACAATACAGTCAATTGATGACTCTACTCAACAACCATCTCCAAGCAGCCACTACAGCACCTATCACTACGGCAACTGCCATAACCCACACTTCAGGTATCTTCGCCTTAACTTCACATAATAATCAGTCACATGATGAATGGATCATAGACTCAGGAGCATCTAGACACATATGTCATGATAAATCCCTTTTCAAAAATTGGAGTCATACAAATAACATGTTTGTTATGCTACCTAATGGCCATCGTATATCTGTTGATCTTATTGGAGATATCCAAATAAATGGGTCCCTGACATTGAAAGATGTACTGTTCGTGTCCCAATTTGCATACAACCTCATCTCGGTCAGTTGCTTATTGATCACTAAAAATATATCTCTTGACTTTCAAAGTACTTGTTGCATCATACAGGATCTTTCCCGACCAATGATGATTGGCAAGGCTAGTTGCCAAAATGGACTGTATGTTCTCAATAAAGAGGCCAATACCAACTGTATTGCTGCTGGAGTTAAAATCAATGCCATTTCAGTTGATACATGGCATCAACGTTTAGGTCACTTATCACCTAAATGTCTCTCTTCTTTGTCTTCAACCTTATGTTTGTCTAATCATTCCATACATCATTCATCATGTCATGTATGCCCATTAGCCAAACAAAAAAGGTTATCATTTCATTCAAATAATAATGTTGCTTCTTCTCCATTTGACCTTGTCCATGCTGATATATGGGGACCTTTTAAAATACCTTCTTATGGTGGCTACAAATATTTCTTGACACTAGTTGATGACTGCTTACGCTTCACATGGGTATATATGCTAAGGCAAAAATCTGATGTTCTTCACATTGTACCTAAATTTTTTCAGCTCATCGAAACTCAGTTTTCAAAAGTCATCAAGAGTTTTCGGTCAGACAATGCCCCTGAACTAAAGTTGACTGAATTCTTTGCTCAGAAGGGAACTGTCCACCAATTCTCTTGTGTTGAACAACCTCAACAAAACTCAGTAGTAGAGCGGAAACACCAGCACCTTCTTAATGTAGCTCGTGCTCTTTTTTTCAGTCAAGAATTCCAATCAGTTTTTGGGCAGATTGCATCCTAA

Coding sequence (CDS)

ATGTTAATGGCGATTTCTGGACGAAATAAGGCCGGGTTCATCACCGGAAAAATCCAGAAACCTTCTGATGGCGTCTTACTCGATGCCTGGATCTGCAACAATGATATTCTAGCCTCATGGATTCTCAACTCTGTTTCAAAGGAAATTGCAGCAAGCATTATCTACACAGGATCAATTAAAGAAATATGGGATGAATTACGCCAAAGATTCAAACAATCAAATGGTCCCAGCATATACCAACTTCGAAAAGAATTTGTCACGTTGCGGCAAGGAAACCTGACAATTGAAACATACTACACAAAACTCAAAACCATATGGCAGAATCTAAATGAATACCGATTTACAAATGACTGCACATGTGGAGGTTTGAAACCATTCATCGATCATCTTGAATCTGAATATATTATGGCCTTCCTGATGGGATTAAATGATTCCTATGCTGCTGTAAGAGCACAAATCCTCCTTATGCAACCTTTACCTTCAATCAACACTGTATTTTCTTTGCTGATTCAAGAAGAACAACAAAGATCTGCTGGCATTTTAACCCCTCCCATTGATCCTGTGGCTTTAAATATTGCTTCATCCATGGCTATCTCAACTGATCGAAATCGCAAAAAAGAGCGCCCTACTTGCTCCTACTGTGGAATTAAGGGACACATAGCAGACAAATGTTACAAGAAACATGGATATCCTCCAGGTTACAAGCCAAGAAACTCAAACTCCATTACTACTGCACCAGATACCTCAAAAACCAACAATGTTGCCAATACCAATTCAGCTGCTGCCAATCGTTCTCCAGATTTTTTCTCAAGCCTAAATTCAGAACAATACAGTCAATTGATGACTCTACTCAACAACCATCTCCAAGCAGCCACTACAGCACCTATCACTACGGCAACTGCCATAACCCACACTTCAGGTATCTTCGCCTTAACTTCACATAATAATCAGTCACATGATGAATGGATCATAGACTCAGGAGCATCTAGACACATATGTCATGATAAATCCCTTTTCAAAAATTGGAGTCATACAAATAACATGTTTGTTATGCTACCTAATGGCCATCGTATATCTGTTGATCTTATTGGAGATATCCAAATAAATGGGTCCCTGACATTGAAAGATGTACTGTTCGTGTCCCAATTTGCATACAACCTCATCTCGGTCAGTTGCTTATTGATCACTAAAAATATATCTCTTGACTTTCAAAGTACTTGTTGCATCATACAGGATCTTTCCCGACCAATGATGATTGGCAAGGCTAGTTGCCAAAATGGACTGTATGTTCTCAATAAAGAGGCCAATACCAACTGTATTGCTGCTGGAGTTAAAATCAATGCCATTTCAGTTGATACATGGCATCAACGTTTAGGTCACTTATCACCTAAATGTCTCTCTTCTTTGTCTTCAACCTTATGTTTGTCTAATCATTCCATACATCATTCATCATGTCATGTATGCCCATTAGCCAAACAAAAAAGGTTATCATTTCATTCAAATAATAATGTTGCTTCTTCTCCATTTGACCTTGTCCATGCTGATATATGGGGACCTTTTAAAATACCTTCTTATGGTGGCTACAAATATTTCTTGACACTAGTTGATGACTGCTTACGCTTCACATGGGTATATATGCTAAGGCAAAAATCTGATGTTCTTCACATTGTACCTAAATTTTTTCAGCTCATCGAAACTCAGTTTTCAAAAGTCATCAAGAGTTTTCGGTCAGACAATGCCCCTGAACTAAAGTTGACTGAATTCTTTGCTCAGAAGGGAACTGTCCACCAATTCTCTTGTGTTGAACAACCTCAACAAAACTCAGTAGTAGAGCGGAAACACCAGCACCTTCTTAATGTAGCTCGTGCTCTTTTTTTCAGTCAAGAATTCCAATCAGTTTTTGGGCAGATTGCATCCTAA

Protein sequence

MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTATAITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISVDLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIGKASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSIHHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLRFTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSCVEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS
Homology
BLAST of Cmc04g0107081 vs. NCBI nr
Match: KAA0065480.1 (Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] >TYK08721.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa])

HSP 1 Score: 1281.2 bits (3314), Expect = 0.0e+00
Identity = 638/639 (99.84%), Positives = 638/639 (99.84%), Query Frame = 0

Query: 1   MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGSIK 60
           MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIY GSIK
Sbjct: 59  MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIK 118

Query: 61  EIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTC 120
           EIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTC
Sbjct: 119 EIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTC 178

Query: 121 GGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGI 180
           GGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGI
Sbjct: 179 GGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGI 238

Query: 181 LTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSN 240
           LTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSN
Sbjct: 239 LTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSN 298

Query: 241 SITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTAT 300
           SITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTAT
Sbjct: 299 SITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTAT 358

Query: 301 AITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISV 360
           AITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISV
Sbjct: 359 AITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISV 418

Query: 361 DLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIG 420
           DLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIG
Sbjct: 419 DLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIG 478

Query: 421 KASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSI 480
           KASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSI
Sbjct: 479 KASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSI 538

Query: 481 HHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLR 540
           HHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLR
Sbjct: 539 HHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLR 598

Query: 541 FTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSC 600
           FTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSC
Sbjct: 599 FTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSC 658

Query: 601 VEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS 640
           VEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS
Sbjct: 659 VEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS 697
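
The percentages in the HSP summary for this hit are simply the matched columns divided by the alignment length. A minimal sketch of that arithmetic follows; `hsp_percentages` is a hypothetical helper, and the pattern it uses is an assumption about how this page prints the summary line (it accepts both "Positives" and the misspelling "Postives").

```python
import re


def hsp_percentages(stats_line):
    """Recompute identity/positives percentages from an HSP summary line."""
    # Matches 'Identity = 638/639' and 'Positives = 638/639'
    # (also tolerates the misspelling 'Postives').
    pattern = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)")
    result = {}
    for label, matched, aligned in pattern.findall(stats_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result


top_hit = "Identity = 638/639 (99.84%), Positives = 638/639 (99.84%), Query Frame = 0"
print(hsp_percentages(top_hit))
```

For the top hit this gives 99.84 for both values, so the query differs from KAA0065480.1 at a single aligned position (the T/I mismatch visible in the first alignment block).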

BLAST of Cmc04g0107081 vs. NCBI nr
Match: KAA0035612.1 (No apical meristem (NAM) protein [Cucumis melo var. makuwa] >TYK30930.1 No apical meristem (NAM) protein [Cucumis melo var. makuwa])

HSP 1 Score: 593.2 bits (1528), Expect = 2.7e-165
Identity = 362/643 (56.30%), Positives = 372/643 (57.85%), Query Frame = 0

Query: 1   MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGSIK 60
           MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGS  
Sbjct: 53  MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGS-- 112

Query: 61  EIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTC 120
                       +N P                     Y T           + F   C  
Sbjct: 113 -----------STNSP---------------------YAT-----------FTFNQHC-- 172

Query: 121 GGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGI 180
                                                         SLLIQEEQQRSAGI
Sbjct: 173 ---------------------------------------------ISLLIQEEQQRSAGI 232

Query: 181 LTPPIDPVAL----NIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKP 240
           LTPPIDPVAL    NIA +MAISTDRNRKKERPTC YCGIKGHIADKCYKKHGYPPGYKP
Sbjct: 233 LTPPIDPVALFTTQNIAPTMAISTDRNRKKERPTCCYCGIKGHIADKCYKKHGYPPGYKP 292

Query: 241 RNSNSITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPI 300
           RNSN IT   +TSKTN VANTNS AAN SPDFFSSLNSEQYSQLMTLLNNHLQAA TAPI
Sbjct: 293 RNSNPIT---NTSKTNKVANTNSTAANHSPDFFSSLNSEQYSQLMTLLNNHLQAAITAPI 352

Query: 301 TTATAITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGH 360
           T  T ITHT  IF+LTSHNNQSHDEWII SGASRH+CHDKSLF+NWSHTNNMFVMLPNGH
Sbjct: 353 TLTTTITHTPSIFSLTSHNNQSHDEWIIVSGASRHVCHDKSLFRNWSHTNNMFVMLPNGH 412

Query: 361 RISVDLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRP 420
           RISVDLIGDI INGSL LKDVLFV QFAY+LIS                      DLSRP
Sbjct: 413 RISVDLIGDILINGSLLLKDVLFVPQFAYDLIS----------------------DLSRP 458

Query: 421 MMIGKASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLS 480
           MMIG ASC NG                                                 
Sbjct: 473 MMIGNASCHNG------------------------------------------------- 458

Query: 481 NHSIHHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVD 540
                                                                       
Sbjct: 533 ------------------------------------------------------------ 458

Query: 541 DCLRFTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVH 600
                      RQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNA ELK TEFFAQKG+VH
Sbjct: 593 -----------RQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAHELKFTEFFAQKGSVH 458

Query: 601 QFSCVEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS 640
           QFSCVE+PQQNSVVERKHQHLLNVARALF SQEF SVFGQIAS
Sbjct: 653 QFSCVERPQQNSVVERKHQHLLNVARALFISQEFLSVFGQIAS 458

BLAST of Cmc04g0107081 vs. NCBI nr
Match: XP_008463248.1 (PREDICTED: uncharacterized protein LOC103501452 [Cucumis melo])

HSP 1 Score: 533.5 bits (1373), Expect = 2.5e-147
Identity = 314/488 (64.34%), Positives = 328/488 (67.21%), Query Frame = 0

Query: 140 MGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVAL----NIASS 199
           MGLNDSYAAVRAQILLMQPLPSIN +FSLLIQEEQQRS GILTPPIDP  L    NIA +
Sbjct: 1   MGLNDSYAAVRAQILLMQPLPSINILFSLLIQEEQQRSTGILTPPIDPATLITTQNIAPT 60

Query: 200 MAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVA 259
           MAISTDRNRKKE PTC+YCGIKGHIADKCY KHGYP GYKPRNSN ITTAPDTSKTN VA
Sbjct: 61  MAISTDRNRKKECPTCAYCGIKGHIADKCYNKHGYPLGYKPRNSNPITTAPDTSKTNKVA 120

Query: 260 NTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTATAITHTS--GIFALTS 319
           NTNSAAAN SPDFFSSLNSEQYSQLMT+LNNHLQAATTAPITTATAITHT   G+  +T 
Sbjct: 121 NTNSAAANHSPDFFSSLNSEQYSQLMTILNNHLQAATTAPITTATAITHTPEIGVIQITC 180

Query: 320 HNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISVDLIGDIQINGSLT 379
                             +C+  ++                   ISVDLIGDI INGSL 
Sbjct: 181 -----------------LLCYLMAIC------------------ISVDLIGDILINGSLL 240

Query: 380 LKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIGKASCQNGLYVLNK 439
            KDVLFV QFAYNLIS                      DLSRPMMIGKASCQNGLYVLNK
Sbjct: 241 FKDVLFVPQFAYNLIS----------------------DLSRPMMIGKASCQNGLYVLNK 300

Query: 440 EANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSIHHSSCHVCPLAKQ 499
           EANTNC+AA VKINAISVDTWHQRLGHLSPKCLS LSSTLCLSNHSIHHSSC        
Sbjct: 301 EANTNCVAARVKINAISVDTWHQRLGHLSPKCLSYLSSTLCLSNHSIHHSSC-------- 360

Query: 500 KRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLRFTWVYMLRQKSDV 559
                              HA+IW                   D L +            
Sbjct: 361 -------------------HANIW-------------------DLLTY------------ 367

Query: 560 LHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSCVEQPQQNSVVERK 619
                    LIETQFSKVIKSFRSDNA ELK TEFFAQKGTVHQFSCVE+PQQNSVVERK
Sbjct: 421 ------LLMLIETQFSKVIKSFRSDNAHELKFTEFFAQKGTVHQFSCVERPQQNSVVERK 367

Query: 620 HQHLLNVA 622
           HQHLLN A
Sbjct: 481 HQHLLNDA 367

BLAST of Cmc04g0107081 vs. NCBI nr
Match: RVW82526.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 508.1 bits (1307), Expect = 1.1e-139
Identity = 275/646 (42.57%), Positives = 403/646 (62.38%), Query Frame = 0

Query: 1   MLMAISGRNKAGFITGKIQKP-SDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGSI 60
           M+ A++ +NK GFI G I +P +  +L   W   N ++ SW+ NSV KEIA SI+Y  + 
Sbjct: 53  MVTALNAKNKLGFIDGTISRPAATDLLASPWSRCNSMVISWLSNSVCKEIAESILYHETA 112

Query: 61  KEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCT 120
            EIW++L +RF Q +GP I++L+++ +   QG+  + TYYT+LK++W  L E++    C 
Sbjct: 113 IEIWNDLYERFHQGSGPRIFELKQKILAHTQGSADVNTYYTRLKSLWDELREFKAIPICN 172

Query: 121 CGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAG 180
           CGG++ +++  + E +M FL+GLN+S+A ++AQILLM+P P +N VFSL++QEE QRS  
Sbjct: 173 CGGMRVYMEDQQRETVMQFLLGLNESFAPIQAQILLMEPTPPLNKVFSLVVQEEWQRSLT 232

Query: 181 ILTPP--IDPVALNIASSMAISTDRN---RKKERPTCSYCGIKGHIADKCYKKHGYPPG- 240
               P    PV+    ++   S+  N    +K+RP C++C I GH  D+CYK HGY PG 
Sbjct: 233 TSNSPAFTTPVSSRFQAASRASSPTNSSRSRKDRPLCTHCNILGHTVDRCYKIHGYTPGF 292

Query: 241 -----YKPRNSNSITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHL 300
                ++P  S      P++  TN +  T+ + A+ SP     L  +Q++QL+ LL+ H 
Sbjct: 293 RNRPNFRPNGSRPNQMLPNSLHTNQLTLTDGSIASASP---PPLTHDQHNQLLALLSLHS 352

Query: 301 QAATTAPITTAT----AITHTSGIFALT-SHNNQSHDEWIIDSGASRHICHDKSLFKNWS 360
            + ++A    +     +I++ +GI +L+ S +  +   WI+DSGA+ H+C + S+F +  
Sbjct: 353 SSGSSASFGDSNPLQQSISNFTGILSLSPSSSTLNPSIWILDSGATHHVCTNSSMFHSIH 412

Query: 361 HTNNMFVMLPNGHRISVDLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDF 420
             ++  V LP G +I +  IG I ++  L L+ VL++  F +NLIS+S L  T   S DF
Sbjct: 413 SFSSNTVTLPTGTKIPITGIGTIHLSPHLVLEHVLYIPTFQFNLISISALTQTNCFSFDF 472

Query: 421 QSTCCIIQDLSRPMMIGKASCQNGLYVLNKEANTNCIAAGVKINAISVDT---WHQRLGH 480
            +  C IQD S+  +IG    Q  LY+L+     +  +  V  N  S      WH RL H
Sbjct: 473 TAHFCFIQDHSQGKLIGMGRRQGNLYLLDSSVFRSISSVFVVDNNTSAHVNKLWHFRLSH 532

Query: 481 LSPKCLSSLSSTLCLSNHSIHHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPF 540
            S   LS L   L L ++   + SC +CPLAKQKRL F  +NN++SSPFDL+H DIWGPF
Sbjct: 533 PSNVKLSVLKPHLQLQSNGNTNLSCSICPLAKQKRLPFDCHNNLSSSPFDLIHCDIWGPF 592

Query: 541 KIPSYGGYKYFLTLVDDCLRFTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNA 600
            IP++ G++YFLT+VDDC R TWV++LR KSDV  I P+FF +++T+F   IK+ RSDNA
Sbjct: 593 HIPTHDGFRYFLTIVDDCTRNTWVHLLRAKSDVKTIFPQFFSMVKTKFGLTIKAVRSDNA 652

Query: 601 PELKLTEFFAQKGTVHQFSCVEQPQQNSVVERKHQHLLNVARALFF 627
           PEL L+  F Q   +H FSCVE PQQNSVVERKHQH+LNVARAL+F
Sbjct: 653 PELNLSNLFTQLDVLHFFSCVETPQQNSVVERKHQHILNVARALYF 695

BLAST of Cmc04g0107081 vs. NCBI nr
Match: XP_012857659.1 (PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata])

HSP 1 Score: 506.5 bits (1303), Expect = 3.3e-139
Identity = 280/668 (41.92%), Positives = 410/668 (61.38%), Query Frame = 0

Query: 1   MLMAISGRNKAGFITGKIQKP--SDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGS 60
           M+++++ +NK GFI G I KP   + +LL+AW+ NN I+ SWILN++S +I AS++Y+ S
Sbjct: 43  MMISLTVKNKLGFIDGSITKPPADEALLLNAWVRNNSIVISWILNAISPDIQASVMYSES 102

Query: 61  IKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFT--- 120
             +IW++L+ RF Q+NGP I+QLR+E   L Q   ++  Y+TKLK IW  L+ +R T   
Sbjct: 103 AHDIWNDLKIRFSQTNGPRIFQLRRELANLTQDQQSVNVYFTKLKAIWDELDNFRPTCTC 162

Query: 121 NDCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQ 180
             C+CGG+    DH   E++M+FLMGLNDS A+ R QILLM PLP IN VF+L+ QEE+ 
Sbjct: 163 GRCSCGGVDKLQDHHHIEHVMSFLMGLNDSLASTRGQILLMDPLPPINKVFALVSQEERH 222

Query: 181 RSAGILTP-----PIDPVALNIASSMAISTDRN--------RKKERPTCSYCGIKGHIAD 240
           RS  + +       +   A  I ++  +   +N        ++K++  C++C   GH  +
Sbjct: 223 RSVAVTSSSDVQHSLAFAARGIQTNQFVRRPQNNQFYGTTSQRKDKIYCTHCHKTGHTVE 282

Query: 241 KCYKKHGYPPGYKPRNSNSITTAPDTSK----------TNNVANTNSAAANR----SPDF 300
           KCY+ HG+PPGY+PR    +T+   +             ++ A+ NS + ++    S +F
Sbjct: 283 KCYRLHGFPPGYQPRQKPGMTSNQSSQTKFAVNQVSDIVHSDASLNSGSLSQSLPSSDNF 342

Query: 301 FSSLNSEQYSQLMTLLNNHLQAATTAP-------ITTATAITHTSGIFALTSHNNQSH-- 360
             ++ + Q  QL++ +++HL      P       I   + I+  +GI    + +  S   
Sbjct: 343 LDAMTASQCQQLLSYVSSHLANKANQPPHDKNSEIFDTSHISRVTGICLFNALHTPSFMP 402

Query: 361 DEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISVDLIGDIQINGSLTLKDVLF 420
             WI+DSGASRHICH+KSLF N    +N  V+LP+   + V+ IGD+Q+   L L +V +
Sbjct: 403 HHWILDSGASRHICHNKSLFLNMKSVSNARVVLPDSSMVLVNCIGDVQLTTHLVLHNVFY 462

Query: 421 VSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIGKASCQNGLYVLNKEANTNC 480
           V +F +NL+SVS LL   +  + F      IQD      IGK +   GLYVL+  + +  
Sbjct: 463 VPEFKFNLVSVSALLHGSSYVVIFDEFSFSIQD-RLMTQIGKGNKVQGLYVLDPVSASPI 522

Query: 481 IAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSIHHSS-CHVCPLAKQKRLSF 540
             A    N IS   WH RLGH+    L+ L+    LS   I  SS C+VCPLAKQKRL F
Sbjct: 523 EHA--FCNKISATVWHHRLGHIPQPKLAFLAKKFSLSVDKISESSCCYVCPLAKQKRLHF 582

Query: 541 HSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLRFTWVYMLRQKSDVLHIVP 600
            ++++V+++ FDL+H DIWGPFK+PSY G+ YF+TLVDD  RFTWV++L+ KS+V+ +VP
Sbjct: 583 SNSSSVSTAMFDLIHCDIWGPFKVPSYSGFHYFVTLVDDYSRFTWVHLLKTKSEVITVVP 642

Query: 601 KFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSCVEQPQQNSVVERKHQHLL 627
           +F +++  QF K IK FRSDNA EL+    F + G +HQFSCV  PQQN++VERKHQH+L
Sbjct: 643 RFLKMVLNQFGKSIKVFRSDNAYELQFKSLFDELGVIHQFSCVYTPQQNAIVERKHQHIL 702

BLAST of Cmc04g0107081 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 168.7 bits (426), Expect = 2.1e-40
Identity = 159/637 (24.96%), Positives = 269/637 (42.23%), Query Frame = 0

Query: 7   GRNKAGFITGKIQKPSDGVLLDA----------WICNNDILASWILNSVSKEIAASIIYT 66
           G   AGF+ G    P   +  DA          W   + ++ S +L ++S  +  ++   
Sbjct: 44  GYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRA 103

Query: 67  GSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTN 126
            +  +IW+ LR+ +   +   + QLR +     +G  TI+ Y   L T        RF  
Sbjct: 104 TTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVT--------RFDQ 163

Query: 127 DCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQR 186
               G  KP +DH   E +   L  L + Y  V  QI      P++  +   L+  E + 
Sbjct: 164 LALLG--KP-MDH--DEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKI 223

Query: 187 SAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKP 246
            A + +  + P+  N  S    +T  N         Y     +   K +++      + P
Sbjct: 224 LA-VSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNSKPWQQSS--TNFHP 283

Query: 247 RNSNSITTAPDTSKTN--NVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTA 306
            N+ S    P   K     V   ++   ++   F SS+NS+Q     T        A  +
Sbjct: 284 NNNQS---KPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQPRANLALGS 343

Query: 307 PITTATAITHTSGIFALTSHNNQSHDEWIIDSGASRHICHD-KSLFKNWSHTNNMFVMLP 366
           P                      S + W++DSGA+ HI  D  +L  +  +T    VM+ 
Sbjct: 344 P---------------------YSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVA 403

Query: 367 NGHRISVDLIGDIQINGS---LTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCII 426
           +G  I +   G   ++     L L ++L+V     NLISV  L     +S++F      +
Sbjct: 404 DGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQV 463

Query: 427 QDLSRPMMIGKASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLS 486
           +DL+  + + +   ++ LY     ++          +  +  +WH RLGH +P  L+S+ 
Sbjct: 464 KDLNTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSKATHSSWHARLGHPAPSILNSVI 523

Query: 487 STLCLS--NHSIHHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGY 546
           S   LS  N S    SC  C + K  ++ F  +   ++ P + +++D+W    I S+  Y
Sbjct: 524 SNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSS-PILSHDNY 583

Query: 547 KYFLTLVDDCLRFTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPE-LKLTE 606
           +Y++  VD   R+TW+Y L+QKS V      F  L+E +F   I +F SDN  E + L E
Sbjct: 584 RYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWE 639

Query: 607 FFAQKGTVHQFSCVEQPQQNSVVERKHQHLLNVARAL 625
           +F+Q G  H  S    P+ N + ERKH+H++     L
Sbjct: 644 YFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTL 639

BLAST of Cmc04g0107081 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 151.8 bits (382), Expect = 2.7e-35
Identity = 155/638 (24.29%), Positives = 265/638 (41.54%), Query Frame = 0

Query: 7   GRNKAGFITGKIQKPSDGVLLDA----------WICNNDILASWILNSVSKEIAASIIYT 66
           G   AGF+ G    P   +  DA          W   + ++ S IL ++S  +  ++   
Sbjct: 44  GYELAGFLDGSTPMPPATIGTDAVPRVNPDYTRWRRQDKLIYSAILGAISMSVQPAVSRA 103

Query: 67  GSIKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTN 126
            +  +IW+ LR+ +   +   + QLR  F+T                         RF  
Sbjct: 104 TTAAQIWETLRKIYANPSYGHVTQLR--FIT-------------------------RFDQ 163

Query: 127 DCTCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQR 186
               G  KP +DH   E +   L  L D Y  V  QI      PS+  +   LI  E + 
Sbjct: 164 LALLG--KP-MDH--DEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKL 223

Query: 187 SAGILTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKP 246
            A + +  + P+  N+ +    +T+RN+       +Y        +   + + + P    
Sbjct: 224 LA-LNSAEVVPITANVVTHRNTNTNRNQNNRGDNRNY-------NNNNNRSNSWQPSSSG 283

Query: 247 RNSNSITTAPDTSKTNNVANTNSAAANRSP---DFFSSLNSEQYSQLMTLLNNHLQAATT 306
             S++    P   +   + +    +A R P    F S+ N +Q +   T        A  
Sbjct: 284 SRSDNRQPKPYLGRC-QICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRANLAVN 343

Query: 307 APITTATAITHTSGIFALTSHNNQSHDEWIIDSGASRHICHD-KSLFKNWSHTNNMFVML 366
           +P                        + W++DSGA+ HI  D  +L  +  +T    VM+
Sbjct: 344 SPYNA---------------------NNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMI 403

Query: 367 PNGHRISVDLIGDIQI---NGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCI 426
            +G  I +   G   +   + SL L  VL+V     NLISV  L  T  +S++F      
Sbjct: 404 ADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQ 463

Query: 427 IQDLSRPMMIGKASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSL 486
           ++DL+  + + +   ++ LY     ++          +  +  +WH RLGH S   L+S+
Sbjct: 464 VKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFASPCSKATHSSWHSRLGHPSLAILNSV 523

Query: 487 SS--TLCLSNHSIHHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGG 546
            S  +L + N S    SC  C + K  ++ F ++   +S P + +++D+W    I S   
Sbjct: 524 ISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSS-PILSIDN 583

Query: 547 YKYFLTLVDDCLRFTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPE-LKLT 606
           Y+Y++  VD   R+TW+Y L+QKS V      F  L+E +F   I +  SDN  E + L 
Sbjct: 584 YRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLR 618

Query: 607 EFFAQKGTVHQFSCVEQPQQNSVVERKHQHLLNVARAL 625
           ++ +Q G  H  S    P+ N + ERKH+H++ +   L
Sbjct: 644 DYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTL 618

BLAST of Cmc04g0107081 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 7.6e-30
Identity = 156/631 (24.72%), Positives = 241/631 (38.19%), Query Frame = 0

Query: 20  KPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGSIKEIWDELRQRFKQSNGPSIY 79
           K  D +  + W   ++  AS I   +S ++  +II   + + IW  L   +      +  
Sbjct: 42  KKPDTMKAEDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKL 101

Query: 80  QLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYR--FTNDCTCGGLKPFIDHLESEYIMA 139
            L+K+   L     T             +LN +    T     G     +   E +  + 
Sbjct: 102 YLKKQLYALHMSEGT---------NFLSHLNVFNGLITQLANLG-----VKIEEEDKAIL 161

Query: 140 FLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQR------SAGILTP----PIDP 199
            L  L  SY  +   IL  +    +  V S L+  E+ R         ++T         
Sbjct: 162 LLNSLPSSYDNLATTILHGKTTIELKDVTSALLLNEKMRKKPENQGQALITEGRGRSYQR 221

Query: 200 VALNIASSMAISTDRNRKKER-PTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAP 259
            + N   S A    +NR K R   C  C   GH    C           PR         
Sbjct: 222 SSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDC---------PNPRKGKG----- 281

Query: 260 DTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTATAITHTS 319
           +TS   N  NT +A    + +    +N E+                           H S
Sbjct: 282 ETSGQKNDDNT-AAMVQNNDNVVLFINEEE------------------------ECMHLS 341

Query: 320 GIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISVDLIGDI 379
           G             EW++D+ AS H    + LF  +   +   V + N     +  IGDI
Sbjct: 342 G----------PESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDI 401

Query: 380 ----QINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQD-----LSRPM 439
                +  +L LKDV  V     NLIS         I+LD         +         +
Sbjct: 402 CIKTNVGCTLVLKDVRHVPDLRMNLIS--------GIALDRDGYESYFANQKWRLTKGSL 461

Query: 440 MIGKASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLS- 499
           +I K   +  LY  N E     + A    + ISVD WH+R+GH+S K L  L+    +S 
Sbjct: 462 VIAKGVARGTLYRTNAEICQGELNAAQ--DEISVDLWHKRMGHMSEKGLQILAKKSLISY 521

Query: 500 NHSIHHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVD 559
                   C  C   KQ R+SF +++    +  DLV++D+ GP +I S GG KYF+T +D
Sbjct: 522 AKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFID 581

Query: 560 DCLRFTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPEL---KLTEFFAQKG 619
           D  R  WVY+L+ K  V  +  KF  L+E +  + +K  RSDN  E    +  E+ +  G
Sbjct: 582 DASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHG 599

Query: 620 TVHQFSCVEQPQQNSVVERKHQHLLNVARAL 625
             H+ +    PQ N V ER ++ ++   R++
Sbjct: 642 IRHEKTVPGTPQHNGVAERMNRTIVEKVRSM 599

BLAST of Cmc04g0107081 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 6.7e-10
Identity = 95/415 (22.89%), Positives = 164/415 (39.52%), Query Frame = 0

Query: 243 TTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTATAI 302
           ++ P  +K +N+A ++  +   +     S  S QY      L+   Q   + P       
Sbjct: 388 SSKPRAAKAHNIATSSKFSRVNNDHINESTVSSQYLSDDNELSLGQQQKESKP------- 447

Query: 303 THTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISVDL 362
           THT         N++  D  +IDSGAS+ +        + +  + + ++      I ++ 
Sbjct: 448 THT------IDSNDELPDHLLIDSGASQTLVRSAHYLHHATPNSEINIVDAQKQDIPINA 507

Query: 363 IGDIQI---NGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMI 422
           IG++     NG+ T    L     AY+L+S+S  L  +NI+  F        + S   ++
Sbjct: 508 IGNLHFNFQNGTKTSIKALHTPNIAYDLLSLS-ELANQNITACFTRNTL---ERSDGTVL 567

Query: 423 GKASCQNGLYVLNK---------EANTNCIAAGVKINAISVDTWHQRLGHLS----PKCL 482
                    Y L+K         +   N +     +N       H+ LGH +     K L
Sbjct: 568 APIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSL 627

Query: 483 SSLSST-LCLSNHSIHHSSCHVCP-----------LAKQKRLSFHSNNNVASSPFDLVHA 542
              + T L  S+    ++S + CP             K  RL +      +  PF  +H 
Sbjct: 628 KKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQE----SYEPFQYLHT 687

Query: 543 DIWGPFKIPSYGGYKYFLTLVDDCLRFTWVYML--RQKSDVLHIVPKFFQLIETQFSK-- 602
           DI+GP          YF++  D+  RF WVY L  R++  +L++       I+ QF+   
Sbjct: 688 DIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARV 747

Query: 603 -VIKSFRSDNAPELKLTEFFAQKGTVHQFSCVEQPQQNSVVERKHQHLLNVARAL 625
            VI+  R        L +FF  +G    ++     + + V ER ++ LLN  R L
Sbjct: 748 LVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTL 781

BLAST of Cmc04g0107081 vs. ExPASy Swiss-Prot
Match: P25384 (Transposon Ty2-C Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-C PE=3 SV=2)

HSP 1 Score: 67.4 bits (163), Expect = 6.7e-10
Identity = 95/415 (22.89%), Positives = 164/415 (39.52%), Query Frame = 0

Query: 243 TTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTATAI 302
           ++ P  +K +N+A ++  +   +     S  S QY      L+   Q   + P       
Sbjct: 388 SSKPRAAKAHNIATSSKFSRVNNDHINESTVSSQYLSDDNELSLGQQQKESKP------- 447

Query: 303 THTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISVDL 362
           THT         N++  D  +IDSGAS+ +        + +  + + ++      I ++ 
Sbjct: 448 THT------IDSNDELPDHLLIDSGASQTLVRSAHYLHHATPNSEINIVDAQKQDIPINA 507

Query: 363 IGDIQI---NGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMI 422
           IG++     NG+ T    L     AY+L+S+S  L  +NI+  F        + S   ++
Sbjct: 508 IGNLHFNFQNGTKTSIKALHTPNIAYDLLSLS-ELANQNITACFTRNTL---ERSDGTVL 567

Query: 423 GKASCQNGLYVLNK---------EANTNCIAAGVKINAISVDTWHQRLGHLS----PKCL 482
                    Y L+K         +   N +     +N       H+ LGH +     K L
Sbjct: 568 APIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSL 627

Query: 483 SSLSST-LCLSNHSIHHSSCHVCP-----------LAKQKRLSFHSNNNVASSPFDLVHA 542
              + T L  S+    ++S + CP             K  RL +      +  PF  +H 
Sbjct: 628 KKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQE----SYEPFQYLHT 687

Query: 543 DIWGPFKIPSYGGYKYFLTLVDDCLRFTWVYML--RQKSDVLHIVPKFFQLIETQFSK-- 602
           DI+GP          YF++  D+  RF WVY L  R++  +L++       I+ QF+   
Sbjct: 688 DIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARV 747

Query: 603 -VIKSFRSDNAPELKLTEFFAQKGTVHQFSCVEQPQQNSVVERKHQHLLNVARAL 625
            VI+  R        L +FF  +G    ++     + + V ER ++ LLN  R L
Sbjct: 748 LVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTL 781

BLAST of Cmc04g0107081 vs. ExPASy TrEMBL
Match: A0A5A7VE66 (Cysteine-rich RLK (Receptor-like protein kinase) 8 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold76G00250 PE=4 SV=1)

HSP 1 Score: 1281.2 bits (3314), Expect = 0.0e+00
Identity = 638/639 (99.84%), Positives = 638/639 (99.84%), Query Frame = 0

Query: 1   MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGSIK 60
           MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIY GSIK
Sbjct: 59  MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYIGSIK 118

Query: 61  EIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTC 120
           EIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTC
Sbjct: 119 EIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTC 178

Query: 121 GGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGI 180
           GGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGI
Sbjct: 179 GGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGI 238

Query: 181 LTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSN 240
           LTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSN
Sbjct: 239 LTPPIDPVALNIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSN 298

Query: 241 SITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTAT 300
           SITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTAT
Sbjct: 299 SITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTAT 358

Query: 301 AITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISV 360
           AITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISV
Sbjct: 359 AITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISV 418

Query: 361 DLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIG 420
           DLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIG
Sbjct: 419 DLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIG 478

Query: 421 KASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSI 480
           KASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSI
Sbjct: 479 KASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSI 538

Query: 481 HHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLR 540
           HHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLR
Sbjct: 539 HHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLR 598

Query: 541 FTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSC 600
           FTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSC
Sbjct: 599 FTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSC 658

Query: 601 VEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS 640
           VEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS
Sbjct: 659 VEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS 697

BLAST of Cmc04g0107081 vs. ExPASy TrEMBL
Match: A0A5D3E5P0 (No apical meristem (NAM) protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G001460 PE=4 SV=1)

HSP 1 Score: 593.2 bits (1528), Expect = 1.3e-165
Identity = 362/643 (56.30%), Positives = 372/643 (57.85%), Query Frame = 0

Query: 1   MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGSIK 60
           MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGS  
Sbjct: 53  MLMAISGRNKAGFITGKIQKPSDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGS-- 112

Query: 61  EIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTC 120
                       +N P                     Y T           + F   C  
Sbjct: 113 -----------STNSP---------------------YAT-----------FTFNQHC-- 172

Query: 121 GGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGI 180
                                                         SLLIQEEQQRSAGI
Sbjct: 173 ---------------------------------------------ISLLIQEEQQRSAGI 232

Query: 181 LTPPIDPVAL----NIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKP 240
           LTPPIDPVAL    NIA +MAISTDRNRKKERPTC YCGIKGHIADKCYKKHGYPPGYKP
Sbjct: 233 LTPPIDPVALFTTQNIAPTMAISTDRNRKKERPTCCYCGIKGHIADKCYKKHGYPPGYKP 292

Query: 241 RNSNSITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPI 300
           RNSN IT   +TSKTN VANTNS AAN SPDFFSSLNSEQYSQLMTLLNNHLQAA TAPI
Sbjct: 293 RNSNPIT---NTSKTNKVANTNSTAANHSPDFFSSLNSEQYSQLMTLLNNHLQAAITAPI 352

Query: 301 TTATAITHTSGIFALTSHNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGH 360
           T  T ITHT  IF+LTSHNNQSHDEWII SGASRH+CHDKSLF+NWSHTNNMFVMLPNGH
Sbjct: 353 TLTTTITHTPSIFSLTSHNNQSHDEWIIVSGASRHVCHDKSLFRNWSHTNNMFVMLPNGH 412

Query: 361 RISVDLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRP 420
           RISVDLIGDI INGSL LKDVLFV QFAY+LIS                      DLSRP
Sbjct: 413 RISVDLIGDILINGSLLLKDVLFVPQFAYDLIS----------------------DLSRP 458

Query: 421 MMIGKASCQNGLYVLNKEANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLS 480
           MMIG ASC NG                                                 
Sbjct: 473 MMIGNASCHNG------------------------------------------------- 458

Query: 481 NHSIHHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVD 540
                                                                       
Sbjct: 533 ------------------------------------------------------------ 458

Query: 541 DCLRFTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVH 600
                      RQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNA ELK TEFFAQKG+VH
Sbjct: 593 -----------RQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNAHELKFTEFFAQKGSVH 458

Query: 601 QFSCVEQPQQNSVVERKHQHLLNVARALFFSQEFQSVFGQIAS 640
           QFSCVE+PQQNSVVERKHQHLLNVARALF SQEF SVFGQIAS
Sbjct: 653 QFSCVERPQQNSVVERKHQHLLNVARALFISQEFLSVFGQIAS 458

BLAST of Cmc04g0107081 vs. ExPASy TrEMBL
Match: A0A1S3CJ63 (uncharacterized protein LOC103501452 OS=Cucumis melo OX=3656 GN=LOC103501452 PE=4 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 1.2e-147
Identity = 314/488 (64.34%), Positives = 328/488 (67.21%), Query Frame = 0

Query: 140 MGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAGILTPPIDPVAL----NIASS 199
           MGLNDSYAAVRAQILLMQPLPSIN +FSLLIQEEQQRS GILTPPIDP  L    NIA +
Sbjct: 1   MGLNDSYAAVRAQILLMQPLPSINILFSLLIQEEQQRSTGILTPPIDPATLITTQNIAPT 60

Query: 200 MAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPGYKPRNSNSITTAPDTSKTNNVA 259
           MAISTDRNRKKE PTC+YCGIKGHIADKCY KHGYP GYKPRNSN ITTAPDTSKTN VA
Sbjct: 61  MAISTDRNRKKECPTCAYCGIKGHIADKCYNKHGYPLGYKPRNSNPITTAPDTSKTNKVA 120

Query: 260 NTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHLQAATTAPITTATAITHTS--GIFALTS 319
           NTNSAAAN SPDFFSSLNSEQYSQLMT+LNNHLQAATTAPITTATAITHT   G+  +T 
Sbjct: 121 NTNSAAANHSPDFFSSLNSEQYSQLMTILNNHLQAATTAPITTATAITHTPEIGVIQITC 180

Query: 320 HNNQSHDEWIIDSGASRHICHDKSLFKNWSHTNNMFVMLPNGHRISVDLIGDIQINGSLT 379
                             +C+  ++                   ISVDLIGDI INGSL 
Sbjct: 181 -----------------LLCYLMAIC------------------ISVDLIGDILINGSLL 240

Query: 380 LKDVLFVSQFAYNLISVSCLLITKNISLDFQSTCCIIQDLSRPMMIGKASCQNGLYVLNK 439
            KDVLFV QFAYNLIS                      DLSRPMMIGKASCQNGLYVLNK
Sbjct: 241 FKDVLFVPQFAYNLIS----------------------DLSRPMMIGKASCQNGLYVLNK 300

Query: 440 EANTNCIAAGVKINAISVDTWHQRLGHLSPKCLSSLSSTLCLSNHSIHHSSCHVCPLAKQ 499
           EANTNC+AA VKINAISVDTWHQRLGHLSPKCLS LSSTLCLSNHSIHHSSC        
Sbjct: 301 EANTNCVAARVKINAISVDTWHQRLGHLSPKCLSYLSSTLCLSNHSIHHSSC-------- 360

Query: 500 KRLSFHSNNNVASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLRFTWVYMLRQKSDV 559
                              HA+IW                   D L +            
Sbjct: 361 -------------------HANIW-------------------DLLTY------------ 367

Query: 560 LHIVPKFFQLIETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSCVEQPQQNSVVERK 619
                    LIETQFSKVIKSFRSDNA ELK TEFFAQKGTVHQFSCVE+PQQNSVVERK
Sbjct: 421 ------LLMLIETQFSKVIKSFRSDNAHELKFTEFFAQKGTVHQFSCVERPQQNSVVERK 367

Query: 620 HQHLLNVA 622
           HQHLLN A
Sbjct: 481 HQHLLNDA 367

BLAST of Cmc04g0107081 vs. ExPASy TrEMBL
Match: A0A438HDI8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2781 PE=4 SV=1)

HSP 1 Score: 508.1 bits (1307), Expect = 5.5e-140
Identity = 275/646 (42.57%), Positives = 403/646 (62.38%), Query Frame = 0

Query: 1   MLMAISGRNKAGFITGKIQKP-SDGVLLDAWICNNDILASWILNSVSKEIAASIIYTGSI 60
           M+ A++ +NK GFI G I +P +  +L   W   N ++ SW+ NSV KEIA SI+Y  + 
Sbjct: 53  MVTALNAKNKLGFIDGTISRPAATDLLASPWSRCNSMVISWLSNSVCKEIAESILYHETA 112

Query: 61  KEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCT 120
            EIW++L +RF Q +GP I++L+++ +   QG+  + TYYT+LK++W  L E++    C 
Sbjct: 113 IEIWNDLYERFHQGSGPRIFELKQKILAHTQGSADVNTYYTRLKSLWDELREFKAIPICN 172

Query: 121 CGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSAG 180
           CGG++ +++  + E +M FL+GLN+S+A ++AQILLM+P P +N VFSL++QEE QRS  
Sbjct: 173 CGGMRVYMEDQQRETVMQFLLGLNESFAPIQAQILLMEPTPPLNKVFSLVVQEEWQRSLT 232

Query: 181 ILTPP--IDPVALNIASSMAISTDRN---RKKERPTCSYCGIKGHIADKCYKKHGYPPG- 240
               P    PV+    ++   S+  N    +K+RP C++C I GH  D+CYK HGY PG 
Sbjct: 233 TSNSPAFTTPVSSRFQAASRASSPTNSSRSRKDRPLCTHCNILGHTVDRCYKIHGYTPGF 292

Query: 241 -----YKPRNSNSITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNNHL 300
                ++P  S      P++  TN +  T+ + A+ SP     L  +Q++QL+ LL+ H 
Sbjct: 293 RNRPNFRPNGSRPNQMLPNSLHTNQLTLTDGSIASASP---PPLTHDQHNQLLALLSLHS 352

Query: 301 QAATTAPITTAT----AITHTSGIFALT-SHNNQSHDEWIIDSGASRHICHDKSLFKNWS 360
            + ++A    +     +I++ +GI +L+ S +  +   WI+DSGA+ H+C + S+F +  
Sbjct: 353 SSGSSASFGDSNPLQQSISNFTGILSLSPSSSTLNPSIWILDSGATHHVCTNSSMFHSIH 412

Query: 361 HTNNMFVMLPNGHRISVDLIGDIQINGSLTLKDVLFVSQFAYNLISVSCLLITKNISLDF 420
             ++  V LP G +I +  IG I ++  L L+ VL++  F +NLIS+S L  T   S DF
Sbjct: 413 SFSSNTVTLPTGTKIPITGIGTIHLSPHLVLEHVLYIPTFQFNLISISALTQTNCFSFDF 472

Query: 421 QSTCCIIQDLSRPMMIGKASCQNGLYVLNKEANTNCIAAGVKINAISVDT---WHQRLGH 480
            +  C IQD S+  +IG    Q  LY+L+     +  +  V  N  S      WH RL H
Sbjct: 473 TAHFCFIQDHSQGKLIGMGRRQGNLYLLDSSVFRSISSVFVVDNNTSAHVNKLWHFRLSH 532

Query: 481 LSPKCLSSLSSTLCLSNHSIHHSSCHVCPLAKQKRLSFHSNNNVASSPFDLVHADIWGPF 540
            S   LS L   L L ++   + SC +CPLAKQKRL F  +NN++SSPFDL+H DIWGPF
Sbjct: 533 PSNVKLSVLKPHLQLQSNGNTNLSCSICPLAKQKRLPFDCHNNLSSSPFDLIHCDIWGPF 592

Query: 541 KIPSYGGYKYFLTLVDDCLRFTWVYMLRQKSDVLHIVPKFFQLIETQFSKVIKSFRSDNA 600
            IP++ G++YFLT+VDDC R TWV++LR KSDV  I P+FF +++T+F   IK+ RSDNA
Sbjct: 593 HIPTHDGFRYFLTIVDDCTRNTWVHLLRAKSDVKTIFPQFFSMVKTKFGLTIKAVRSDNA 652

Query: 601 PELKLTEFFAQKGTVHQFSCVEQPQQNSVVERKHQHLLNVARALFF 627
           PEL L+  F Q   +H FSCVE PQQNSVVERKHQH+LNVARAL+F
Sbjct: 653 PELNLSNLFTQLDVLHFFSCVETPQQNSVVERKHQHILNVARALYF 695

BLAST of Cmc04g0107081 vs. ExPASy TrEMBL
Match: A0A2N9GZW3 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS33057 PE=4 SV=1)

HSP 1 Score: 505.0 bits (1299), Expect = 4.6e-139
Identity = 283/662 (42.75%), Positives = 397/662 (59.97%), Query Frame = 0

Query: 1    MLMAISGRNKAGFITGKIQKPSD--GVLLDAWICNNDILASWILNSVSKEIAASIIYTGS 60
            M+MA++ +NK GF+ G I++P D      +AW+  N ++ SW+LNS+SKEIA+S+IY  +
Sbjct: 474  MVMALTAKNKIGFVNGVIEQPQDEFSPAYNAWVRCNTMVISWLLNSLSKEIASSVIYANT 533

Query: 61   IKEIWDELRQRFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDC 120
             KEIW++LR+RF Q NGP I++++K    L Q N ++ +YYT+LK++W  L+ +R   DC
Sbjct: 534  AKEIWEDLRERFAQGNGPRIFEIQKSISVLSQDNSSVSSYYTRLKSLWDELSNFRPIPDC 593

Query: 121  TCGGLKPFIDHLESEYIMAFLMGLNDSYAAVRAQILLMQPLPSINTVFSLLIQEEQQRSA 180
            +CG +K  +D+ + EY+M FLMGLNDS++ VRAQIL+  PLPSI   F+L+IQEE+QR+ 
Sbjct: 594  SCGAMKVLLDNKQHEYVMQFLMGLNDSFSHVRAQILMTDPLPSITKAFALVIQEERQRNI 653

Query: 181  GI--LTPPIDPVAL---NIASSMAISTDRNRKKERPTCSYCGIKGHIADKCYKKHGYPPG 240
             I  L P  D VAL     A+      +++ KK+RP CS+CGI GH  DKCYK HGYPPG
Sbjct: 654  NIPSLAPAADSVALFTRGEATRHNYGKNQSYKKDRPICSHCGITGHTVDKCYKLHGYPPG 713

Query: 241  YK-------PRNSNSITTAPDTSKTNNVANTNSAAANRSPDFFSSLNSEQYSQLMTLLNN 300
            YK          S+++   P    T        +  + S    +SL S Q+     +++ 
Sbjct: 714  YKFKAKMHSAHQSSAVVEDPHLPFTQAQCQQLLSMLS-SQASLASLQSSQHPVNNQVVSQ 773

Query: 301  HLQAATTAPITTATAITH-TSGIFALT-----------SHNNQ---SHDEWIIDSGASRH 360
                 ++ P   A+AI+H  SGI + +            H N+   SH  WI+D+GA+ H
Sbjct: 774  ESAGTSSTPHQAASAISHFMSGISSFSHTVPKHSIFSVQHVNKTRFSHSTWILDTGATDH 833

Query: 361  ICHDKSLFKNWSHTNNMFVMLPNGHRISVDLIGDIQINGSLTLKDVLFVSQFAYNLISVS 420
            + H    F + + + N ++ LPNG ++    IG +Q+  SL L DVL V  F++NLIS+S
Sbjct: 834  MVHSLRKFTSITSSINTYIHLPNGEKVLATHIGTVQVTTSLLLTDVLCVPSFSFNLISIS 893

Query: 421  CLLITKNISLDFQSTCCIIQDLSRPMMIGKASCQNGLYVLNKEANT------NCIAAGVK 480
             L  T +  + F S  C IQDL     IG    +NGLY L    +         +AA   
Sbjct: 894  KLTNTPSCCVFFLSHFCFIQDLVTWKRIGLGRKKNGLYFLQDSTDAVPSSSFPLVAAHTA 953

Query: 481  INAISV-DTWHQRLGHLSPKCLSSLSSTLCLSNHSIHHSSCHVCPLAKQKRLSFHSNNNV 540
            +N   V D WH RLGH S   LS L + +        +  C VC ++KQKRL FH+  + 
Sbjct: 954  VNNTPVFDVWHHRLGHPSLSRLSLLKNVISDLVMPSANEHCKVCHISKQKRLPFHTAVHF 1013

Query: 541  ASSPFDLVHADIWGPFKIPSYGGYKYFLTLVDDCLRFTWVYMLRQKSDVLHIVPKFFQLI 600
            A  PFDL+H DIWGP+ +P+    +YFLT+VDDC R TWV++++QKS+   ++  FF LI
Sbjct: 1014 ADLPFDLIHCDIWGPYHVPTIDQQRYFLTIVDDCTRCTWVFLMKQKSETSPLIQSFFALI 1073

Query: 601  ETQFSKVIKSFRSDNAPELKLTEFFAQKGTVHQFSCVEQPQQNSVVERKHQHLLNVARAL 627
            +TQFS  IK  RSDN PE K+  F+AQ GT+HQ SCV  PQQN+ VERKHQHLL VARAL
Sbjct: 1074 KTQFSASIKMVRSDNGPEFKMPSFYAQHGTLHQKSCVGTPQQNATVERKHQHLLMVARAL 1133

BLAST of Cmc04g0107081 vs. TAIR 10
Match: AT1G21280.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 104.8 bits (260), Expect = 2.7e-22
Identity = 58/172 (33.72%), Positives = 92/172 (53.49%), Query Frame = 0

Query: 10  KAGFITGKIQKPSD-GVLLDAWICNNDILASWILNSVSKEIAASIIYTGSIKEIWDELRQ 69
           K GFI G + KP     L   W   N ++  W++NS++ ++  S++Y  +  ++W++LR+
Sbjct: 58  KFGFIDGTLPKPDPFSPLYQPWEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMWEDLRR 117

Query: 70  RFKQSNGPSIYQLRKEFVTLRQGNLTIETYYTKLKTIWQNLNEYRFTNDCTCGG-----L 129
            F       IYQLR+   TLRQG  ++E Y+ KL  +W  L+EY    +C CGG      
Sbjct: 118 VFVPCVDLKIYQLRRRLATLRQGGDSVEEYFGKLSKVWMELSEYAPIPECKCGGCNCECT 177

Query: 130 KPFIDHLESEYIMAFLMG--LNDSYAAVRAQILLMQPLPSINTVFSLLIQEE 174
           K   +  E E    FLMG  LN  + AV  +I+  +P PS++  F+++   E
Sbjct: 178 KRAEEAREKEQRYEFLMGLKLNQGFEAVTTKIMFQKPPPSLHEAFAMVKDAE 229
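
The percentages in each HSP summary line follow directly from the two counts shown before them (matches divided by alignment length). The sketch below recomputes them as a quick consistency check. It is only an illustration: the function name `parse_hsp_summary` and the regular expression are assumptions made for this example and are not part of any tool used to produce this page.

```python
import re

# Matches an HSP summary line from the listings above. "Posi?tives" also
# accepts the "Postives" spelling that some exports of this page contain.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return reported and recomputed percentages for one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_computed": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    # Summary line of the TAIR 10 hit AT1G21280.1
    line = "Identity = 58/172 (33.72%), Positives = 92/172 (53.49%), Query Frame = 0"
    print(parse_hsp_summary(line))
```

A mismatch between the reported and recomputed values would point to a copying or extraction error in the summary line rather than to a problem with the alignment itself.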

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
KAA0065480.1    0.0e+00   99.84     Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] >T...
KAA0035612.1    2.7e-165  56.30     No apical meristem (NAM) protein [Cucumis melo var. makuwa] >TYK30930.1 No apica...
XP_008463248.1  2.5e-147  64.34     PREDICTED: uncharacterized protein LOC103501452 [Cucumis melo]
RVW82526.1      1.1e-139  42.57     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
XP_012857659.1  3.3e-139  41.92     PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata]

Match Name      E-value   Identity  Description
Q94HW2          2.1e-40   24.96     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94          2.7e-35   24.29     Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978          7.6e-30   24.72     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q12491          6.7e-10   22.89     Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P25384          6.7e-10   22.89     Transposon Ty2-C Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

Match Name      E-value   Identity  Description
A0A5A7VE66      0.0e+00   99.84     Cysteine-rich RLK (Receptor-like protein kinase) 8 OS=Cucumis melo var. makuwa O...
A0A5D3E5P0      1.3e-165  56.30     No apical meristem (NAM) protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676...
A0A1S3CJ63      1.2e-147  64.34     uncharacterized protein LOC103501452 OS=Cucumis melo OX=3656 GN=LOC103501452 PE=...
A0A438HDI8      5.5e-140  42.57     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
A0A2N9GZW3      4.6e-139  42.75     Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB...

Match Name      E-value   Identity  Description
AT1G21280.1     2.7e-22   33.72     CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Ha...
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term   IPR Description                     Source       Source Term    Source Description                   Alignment
IPR005162  Retrotransposon gag domain          PFAM         PF03732        Retrotrans_gag                       coord: 37..144; e-value: 1.1E-14; score: 54.5
IPR025724  GAG-pre-integrase domain            PFAM         PF13976        gag_pre-integrs                      coord: 427..493; e-value: 3.5E-10; score: 39.6
IPR001584  Integrase, catalytic core           PFAM         PF00665        rve                                  coord: 507..604; e-value: 3.9E-9; score: 36.7
IPR001584  Integrase, catalytic core           PROSITE      PS50994        INTEGRASE                            coord: 504..639; score: 13.366789
IPR036397  Ribonuclease H superfamily          GENE3D       3.30.420.10    -                                    coord: 500..636; e-value: 5.4E-27; score: 96.4
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 236..260
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 235..260
None       No IPR available                    PANTHER      PTHR34222:SF6  OS02G0671800 PROTEIN                 coord: 9..289
None       No IPR available                    PANTHER      PTHR34222      FAMILY NOT NAMED                     coord: 9..289
IPR036875  Zinc finger, CCHC-type superfamily  SUPERFAMILY  57756          Retrovirus zinc finger-like domains  coord: 197..232
IPR012337  Ribonuclease H-like superfamily     SUPERFAMILY  53098          Ribonuclease H-like                  coord: 504..629
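
The alignment coordinates in the InterPro table can be compared directly to see which signatures cover the same region of the protein. The sketch below copies the source terms and coordinates from the table and assumes they are 1-based and inclusive, which is an assumption of this example and is not stated on the page. The helper names `span_length` and `overlaps` are illustrative only.

```python
# (source term, start, end) copied from the InterPro table above.
# Coordinates are assumed to be 1-based and inclusive.
FEATURES = [
    ("PF03732", 37, 144),       # Retrotrans_gag
    ("57756", 197, 232),        # Retrovirus zinc finger-like domains
    ("PF13976", 427, 493),      # gag_pre-integrs
    ("3.30.420.10", 500, 636),  # Ribonuclease H superfamily (GENE3D)
    ("PS50994", 504, 639),      # INTEGRASE (PROSITE)
    ("53098", 504, 629),        # Ribonuclease H-like (SUPERFAMILY)
    ("PF00665", 507, 604),      # rve
]


def span_length(start, end):
    """Number of residues covered by an inclusive coordinate pair."""
    return end - start + 1


def overlaps(a, b):
    """True if two (term, start, end) features share at least one residue."""
    return a[1] <= b[2] and b[1] <= a[2]


if __name__ == "__main__":
    rve = FEATURES[-1]
    for feature in FEATURES:
        term, start, end = feature
        shared = overlaps(feature, rve) and feature is not rve
        note = "overlaps PF00665" if shared else ""
        print(f"{term:12s} {start:4d}..{end:<4d} {span_length(start, end):4d} aa  {note}")
```

Under these assumptions, the Pfam integrase core (PF00665) lies entirely inside the PROSITE, GENE3D and SUPERFAMILY regions, while the gag and zinc finger signatures sit upstream of it.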

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cmc04g0107081.1  Cmc04g0107081.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0016310 phosphorylation
molecular_function GO:0016301 kinase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding