CSPI01G19050 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G19050
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Integrase
Location: Chr1: 14450796 .. 14453762 (+)
RNA-Seq Expression: CSPI01G19050
Synteny: CSPI01G19050
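
The Location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly); the helper names `parse_location` and `span_length` are hypothetical and used only for illustration.

```python
import re


def parse_location(text):
    """Parse a location string such as 'Chr1: 14450796 .. 14453762 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr1: 14450796 .. 14453762 (+)")
print(chrom, strand, span_length(start, end))  # Chr1 + 2967
```

Under that assumption the gene spans 2967 bases on the forward strand of Chr1.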
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTAGAAGATATGAGCATATTGTTGTAGCAATTGAAGAATCCAAAGATTTGTCAACTCTCTCTATAAATAGCTTAATGGGATCTCTTCAATCTCATGAGCTCAGATTGAAGATGTTTGATTCTAATCCTTCAGAAGAAGCTTTTCATATGCAGTCCTCCTATAGAGGTCGATCCAATGGAAGAAGAGGTGGACGTGGTGGTAGAGGCAATGGACGATCCAACGTTGTAACAAATACAGAGTCAGAAAGCAGAGACAATCAATTTTTTTCAAATAGAGGACGAGGAAGAAGTTCAAATAGAGGAAGAGGTAGAAGTGGTGGTCGTGGAGATTTTTCTCACATACAATGTTTCAATTGTAGACGTTATGGACATTTTCAAGCAGACTGTTGGTCTAAGAAGACTAATTCTAATCAAGCAGAAACCACACTAATGCATGAGCAATCAAATAATGATCAAGGTCTTCTCTTCCTCACTCTCAATGTTCAAGAATCAAGCACTGAAGAAATATGGTATCTTGATAGTGGTTGTAGTAACCACATGACAGGAAGAAAGGATATTTTTATATCTTTAGATGAATCTCATCAAAATGTAGTGAAGACTGGTGACAACAAGATGCTTGAAGTCAAAGGAAAAGGAGATATTCTTGTCAAGACAAAAATGGGAGCAAAAAAAATTACTGATGTGTATTATGTTTCAGGTCTCAAACACAATCTTTTAAGTGTTGGACAACTTCTCCTAAGAGGACATGATGTTATTTTTAAAGATAAAATATGCGAGATTAGAACCAAGAATGGAGATCTCATAACGAAGGTTCGTATGACTCACAACAAAATGTTTCCAATTAAAATATGTTATGAGAAGCTTGTTTGTTTTGAGACTTTAGTAAATGACACCTCATGGTTATGGCATTGTCGATTTGGGCACCTAAGTTTTGACACTTTGTCTCACATGTGTCAACAACATATGGTGAGAGGAATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTTTCAGAAAGCATCATCGAAATTCATTTCCGACTGGAGGTTCTTGGAGAGCATCAAAACCACTCGAGCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGGTAACCGTTATTTTCTCACATTTATTGATGACTACAGTCGAAAAACATGGATTTATCTACTAAAAGAAAAGAGTGCTACTTTCGAATGTTTCAAGACATTCAAAGCAATGGTGGAAAATGAAAGTAACTTGAAATTGAAATCATTGCGTTCGGATCGTGGAGGAGAATATATTGTTTTTGCAGATTTCTTGAAGGAAAATGGAATCAAGCATCAGAAGACTGTTCGAAGAACTCCTCAACAAAACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGAACTTGCAAGAAGTATGTTGAAGGCAAAGAAGCTTCCTGATCAATTTTGGGGAGACGCAGTAACTTGTGCTGTTTATCTCCTAAATAGAGCTTCAACGAAAAGTGTGCAAGGTATTACTCCTCAAGAAGCATGGAGCGGATTGAAACCAACCGTTAGTCACCTAAGAGTGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAGAGGTAAGCTAGATGATAAATCAGAGAAATGCATTTTTGTTGGGTACAGTGAGAACTCTAAGGCCTACAGACTATACAATCCGATAAGTAAGAAAGTTATTATTAGTCGAGATGTCAAGTTCGATGAAGCAAAATTGTGGCAATGGAATGCACCAAATGAAGACCAAAATCCATTACATGTTGATATGGATGGAAAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATGAAGAAACTACTCCAAGGAAGACCAGAAATATTCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAA
TGTTGATCCTGTATACTTTGAAGAAGCAATTCAAGATGAAAATTGGAAAGATGCAATGAATCAAGAGATTGATGCAATAACAAGAAACGAGACATGGGAGTTAGTAAAATTACCAGAAAATAAAAAGGCTCTTGGAGTCAAATGGATCTATAGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGATACCACAACAAAATTTGTCAAAGAACTTTGGCAATACATATCATAAAATCAAGCTGCCCCTCCCAAGGAATCAAAATCACAACGAATCAAAATCTAGGGTTCTGCTTCCACATTCATTCAAGCATCGATCAAAGTCACAATTCGAAACAACAAAATTCACAAGATTTAAAGCCCAAGAAAACAAGTAAACCTACTAAGAAAACAAGTCTAGTTACTTCGCTACATAAACATCAATTACAGATGTCGAATTATAGGTTACCTGGAGAGAAGAAGACTCGTCGGATTGAAGAGGATCGAGAGAAAGAATGACCTTAGCGAGAGGAAGAGAATCAGATGGAGGATGAGCCTGCCACGACTTAGCGACTAGCAATGGACACTTCATTAGCCACACTGATCTATCCGCCTTGCCTGTATCCAAGTTACTGCTACTACCGCCGCTGCCGTGCTCATCTTCCATATCTATTAACTTACAAAATCACCCTCTCTCTCTCTTCTGAATTTTCAACGGCGGATTTACTGCAACTTTGTTCCTATTTTTTTTCTTTTCTGGAGACGACGGTGGACCCTAAAGAAGTGGAAGGCGGTGGTCGGAAATCAAAGGGAGGTTTGATATTACGCAATCAGAAGCAGAAGATTATCGTACAAGTGATGTAATCTGTATTCTAAAACGAGAGTATAGACTATAGAGTTGATCTCTCAATCGAGTATTGACTTCCTTCTGACGAAACACACTGATTTGTGGTGCGACCCGCCGCTTCTGATTGTTGC

mRNA sequence

ATGACTAGAAGATATGAGCATATTGTTGTAGCAATTGAAGAATCCAAAGATTTGTCAACTCTCTCTATAAATAGCTTAATGGGATCTCTTCAATCTCATGAGCTCAGATTGAAGATGTTTGATTCTAATCCTTCAGAAGAAGCTTTTCATATGCAGTCCTCCTATAGAGGTCGATCCAATGGAAGAAGAGGTGGACGTGGTGGTAGAGGCAATGGACGATCCAACGTTGTAACAAATACAGAGTCAGAAAGCAGAGACAATCAATTTTTTTCAAATAGAGGACGAGGAAGAAGTTCAAATAGAGGAAGAGGTAGAAGTGGTGGTCGTGGAGATTTTTCTCACATACAATGTTTCAATTGTAGACGTTATGGACATTTTCAAGCAGACTGTTGGTCTAAGAAGACTAATTCTAATCAAGCAGAAACCACACTAATGCATGAGCAATCAAATAATGATCAAGGTCTTCTCTTCCTCACTCTCAATGTTCAAGAATCAAGCACTGAAGAAATATGGTATCTTGATAGTGGTTGTAGTAACCACATGACAGGAAGAAAGGATATTTTTATATCTTTAGATGAATCTCATCAAAATGTAGTGAAGACTGGTGACAACAAGATGCTTGAAGTCAAAGGAAAAGGAGATATTCTTGTCAAGACAAAAATGGGAGCAAAAAAAATTACTGATGTGTATTATGTTTCAGGTCTCAAACACAATCTTTTAAGTGTTGGACAACTTCTCCTAAGAGGACATGATGTTATTTTTAAAGATAAAATATGCGAGATTAGAACCAAGAATGGAGATCTCATAACGAAGGTTCGTATGACTCACAACAAAATGTTTCCAATTAAAATATGTTATGAGAAGCTTGTTTGTTTTGAGACTTTAGTAAATGACACCTCATGGTTATGGCATTGTCGATTTGGGCACCTAAGTTTTGACACTTTGTCTCACATGTGTCAACAACATATGGTGAGAGGAATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTTTCAGAAAGCATCATCGAAATTCATTTCCGACTGGAGGTTCTTGGAGAGCATCAAAACCACTCGAGCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGGTAACCGTTATTTTCTCACATTTATTGATGACTACAGTCGAAAAACATGGATTTATCTACTAAAAGAAAAGAGTGCTACTTTCGAATGTTTCAAGACATTCAAAGCAATGGTGGAAAATGAAAGTAACTTGAAATTGAAATCATTGCGTTCGGATCGTGGAGGAGAATATATTGTTTTTGCAGATTTCTTGAAGGAAAATGGAATCAAGCATCAGAAGACTGTTCGAAGAACTCCTCAACAAAACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGAACTTGCAAGAAGTATGTTGAAGGCAAAGAAGCTTCCTGATCAATTTTGGGGAGACGCAGTAACTTGTGCTGTTTATCTCCTAAATAGAGCTTCAACGAAAAGTGTGCAAGGTATTACTCCTCAAGAAGCATGGAGCGGATTGAAACCAACCGTTAGTCACCTAAGAGTGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAGAGGTAAGCTAGATGATAAATCAGAGAAATGCATTTTTGTTGGGTACAGTGAGAACTCTAAGGCCTACAGACTATACAATCCGATAAGTAAGAAAGTTATTATTAGTCGAGATGTCAAGTTCGATGAAGCAAAATTGTGGCAATGGAATGCACCAAATGAAGACCAAAATCCATTACATGTTGATATGGATGGAAAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATGAAGAAACTACTCCAAGGAAGACCAGAAATATTCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAA
TGTTGATCCTGTATACTTTGAAGAAGCAATTCAAGATGAAAATTGGAAAGATGCAATGAATCAAGAGATTGATGCAATAACAAGAAACGAGACATGGGAGTTAGTAAAATTACCAGAAAATAAAAAGGCTCTTGGAGTCAAATGGATCTATAGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGATACCACAACAAAATTTGTCAAAGAACTTTGGCAATACATATCATAAAATCAAGCTGCCCCTCCCAAGGAATCAAAATCACAACGAATCAAAATCTAGGGTTCTGCTTCCACATTCATTCAAGCATCGATCAAAGTCACAATTCGAAACAACAAAATTCACAAGATTTAAAGCCCAAGAAAACAAGTAAACCTACTAAGAAAACAAGTCTAGTTACTTCGCTACATAAACATCAATTACAGATGTCGAATTATAGGTTACCTGGAGAGAAGAAGACTCGTCGGATTGAAGAGGATCGAGAGAAAGAATGACCTTAGCGAGAGGAAGAGAATCAGATGGAGGATGAGCCTGCCACGACTTAGCGACTAGCAATGGACACTTCATTAGCCACACTGATCTATCCGCCTTGCCTGTATCCAAGTTACTGCTACTACCGCCGCTGCCGTGCTCATCTTCCATATCTATTAACTTACAAAATCACCCTCTCTCTCTCTTCTGAATTTTCAACGGCGGATTTACTGCAACTTTGTTCCTATTTTTTTTCTTTTCTGGAGACGACGGTGGACCCTAAAGAAGTGGAAGGCGGTGGTCGGAAATCAAAGGGAGGTTTGATATTACGCAATCAGAAGCAGAAGATTATCGTACAAGTGATGTAATCTGTATTCTAAAACGAGAGTATAGACTATAGAGTTGATCTCTCAATCGAGTATTGACTTCCTTCTGACGAAACACACTGATTTGTGGTGCGACCCGCCGCTTCTGATTGTTGC

Coding sequence (CDS)

ATGACTAGAAGATATGAGCATATTGTTGTAGCAATTGAAGAATCCAAAGATTTGTCAACTCTCTCTATAAATAGCTTAATGGGATCTCTTCAATCTCATGAGCTCAGATTGAAGATGTTTGATTCTAATCCTTCAGAAGAAGCTTTTCATATGCAGTCCTCCTATAGAGGTCGATCCAATGGAAGAAGAGGTGGACGTGGTGGTAGAGGCAATGGACGATCCAACGTTGTAACAAATACAGAGTCAGAAAGCAGAGACAATCAATTTTTTTCAAATAGAGGACGAGGAAGAAGTTCAAATAGAGGAAGAGGTAGAAGTGGTGGTCGTGGAGATTTTTCTCACATACAATGTTTCAATTGTAGACGTTATGGACATTTTCAAGCAGACTGTTGGTCTAAGAAGACTAATTCTAATCAAGCAGAAACCACACTAATGCATGAGCAATCAAATAATGATCAAGGTCTTCTCTTCCTCACTCTCAATGTTCAAGAATCAAGCACTGAAGAAATATGGTATCTTGATAGTGGTTGTAGTAACCACATGACAGGAAGAAAGGATATTTTTATATCTTTAGATGAATCTCATCAAAATGTAGTGAAGACTGGTGACAACAAGATGCTTGAAGTCAAAGGAAAAGGAGATATTCTTGTCAAGACAAAAATGGGAGCAAAAAAAATTACTGATGTGTATTATGTTTCAGGTCTCAAACACAATCTTTTAAGTGTTGGACAACTTCTCCTAAGAGGACATGATGTTATTTTTAAAGATAAAATATGCGAGATTAGAACCAAGAATGGAGATCTCATAACGAAGGTTCGTATGACTCACAACAAAATGTTTCCAATTAAAATATGTTATGAGAAGCTTGTTTGTTTTGAGACTTTAGTAAATGACACCTCATGGTTATGGCATTGTCGATTTGGGCACCTAAGTTTTGACACTTTGTCTCACATGTGTCAACAACATATGGTGAGAGGAATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTTTCAGAAAGCATCATCGAAATTCATTTCCGACTGGAGGTTCTTGGAGAGCATCAAAACCACTCGAGCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGGTAACCGTTATTTTCTCACATTTATTGATGACTACAGTCGAAAAACATGGATTTATCTACTAAAAGAAAAGAGTGCTACTTTCGAATGTTTCAAGACATTCAAAGCAATGGTGGAAAATGAAAGTAACTTGAAATTGAAATCATTGCGTTCGGATCGTGGAGGAGAATATATTGTTTTTGCAGATTTCTTGAAGGAAAATGGAATCAAGCATCAGAAGACTGTTCGAAGAACTCCTCAACAAAACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGAACTTGCAAGAAGTATGTTGAAGGCAAAGAAGCTTCCTGATCAATTTTGGGGAGACGCAGTAACTTGTGCTGTTTATCTCCTAAATAGAGCTTCAACGAAAAGTGTGCAAGGTATTACTCCTCAAGAAGCATGGAGCGGATTGAAACCAACCGTTAGTCACCTAAGAGTGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAGAGGTAAGCTAGATGATAAATCAGAGAAATGCATTTTTGTTGGGTACAGTGAGAACTCTAAGGCCTACAGACTATACAATCCGATAAGTAAGAAAGTTATTATTAGTCGAGATGTCAAGTTCGATGAAGCAAAATTGTGGCAATGGAATGCACCAAATGAAGACCAAAATCCATTACATGTTGATATGGATGGAAAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATGAAGAAACTACTCCAAGGAAGACCAGAAATATTCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAA
TGTTGATCCTGTATACTTTGAAGAAGCAATTCAAGATGAAAATTGGAAAGATGCAATGAATCAAGAGATTGATGCAATAACAAGAAACGAGACATGGGAGTTAGTAAAATTACCAGAAAATAAAAAGGCTCTTGGAGTCAAATGGATCTATAGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGATACCACAACAAAATTTGTCAAAGAACTTTGGCAATACATATCATAAAATCAAGCTGCCCCTCCCAAGGAATCAAAATCACAACGAATCAAAATCTAGGGTTCTGCTTCCACATTCATTCAAGCATCGATCAAAGTCACAATTCGAAACAACAAAATTCACAAGATTTAAAGCCCAAGAAAACAAGTAAACCTACTAAGAAAACAAGTCTAGTTACTTCGCTACATAAACATCAATTACAGATGTCGAATTATAGGTTACCTGGAGAGAAGAAGACTCGTCGGATTGAAGAGGATCGAGAGAAAGAATGA

Protein sequence

MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSGCSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLKHNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQNPLHVDMDGKKDARDLELEVTQPLTSPSSSHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARYHNKICQRTLAIHIIKSSCPSQGIKITTNQNLGFCFHIHSSIDQSHNSKQQNSQDLKPKKTSKPTKKTSLVTSLHKHQLQMSNYRLPGEKKTRRIEEDREKE*
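
The protein sequence above can be reproduced from the coding sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1) applies; `CODON_TABLE` and `translate` are illustrative names, not part of any tool referenced on this page.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence; stop codons are written as '*'."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven and last seven codons of the CDS listed above.
print(translate("ATGACTAGAAGATATGAGCAT"))  # MTRRYEH
print(translate("GAGGATCGAGAGAAAGAATGA"))  # EDREKE*
```

The two fragments match the start (MTRRYEH) and the end (EDREKE*) of the listed protein sequence, where `*` marks the stop codon.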
Homology
BLAST of CSPI01G19050 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 364.4 bits (934), Expect = 3.5e-99
Identity = 241/704 (34.23%), Positives = 371/704 (52.70%), Query Frame = 0

Query: 88  QFFSNRGRGRS---SNRGRGRSGGRGDFSH------IQCFNCRRYGHFQADCWSKK---- 147
           Q     GRGRS   S+   GRSG RG   +        C+NC + GHF+ DC + +    
Sbjct: 194 QALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCPNPRKGKG 253

Query: 148 -TNSNQAETTLMHEQSNNDQGLLFLTLNVQE-----SSTEEIWYLDSGCSNHMTGRKDIF 207
            T+  + +        NND  +LF  +N +E     S  E  W +D+  S+H T  +D+F
Sbjct: 254 ETSGQKNDDNTAAMVQNNDNVVLF--INEEEECMHLSGPESEWVVDTAASHHATPVRDLF 313

Query: 208 ISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKI-TDVYYVSGLKHNLLSVGQLLL 267
                     VK G+    ++ G GDI +KT +G   +  DV +V  L+ NL+S   L  
Sbjct: 314 CRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDR 373

Query: 268 RGHDVIFKDKICEIRTKNGDLITKVRMTHNKMF--PIKICYEKLVCFETLVNDTSWLWHC 327
            G++  F ++  + R   G L+    +    ++    +IC  +L   +  ++    LWH 
Sbjct: 374 DGYESYFANQ--KWRLTKGSLVIAKGVARGTLYRTNAEICQGELNAAQDEISVD--LWHK 433

Query: 328 RFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELV 387
           R GH+S   L  + ++ ++        +   C+ C+F K HR SF T  S R    L+LV
Sbjct: 434 RMGHMSEKGLQILAKKSLISYAKGTTVKP--CDYCLFGKQHRVSFQT-SSERKLNILDLV 493

Query: 388 HTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKL 447
           ++D+CGPM   + GGN+YF+TFIDD SRK W+Y+LK K   F+ F+ F A+VE E+  KL
Sbjct: 494 YSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKL 553

Query: 448 KSLRSDRGGEYI--VFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKK 507
           K LRSD GGEY    F ++   +GI+H+KTV  TPQ NGVAER NR I+E  RSML+  K
Sbjct: 554 KRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAK 613

Query: 508 LPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISDEKR 567
           LP  FWG+AV  A YL+NR+ +  +    P+  W+  + + SHL+VFGC A++H+  E+R
Sbjct: 614 LPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQR 673

Query: 568 GKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQNPLHV 627
            KLDDKS  CIF+GY +    YRL++P+ KKVI SRDV F E+++               
Sbjct: 674 TKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEV-----------RTAA 733

Query: 628 DMDGKKDARDLELEVTQPLTS--PSSSHSTSDE------------ETTPRKTRNIQEIYN 687
           DM  K     +   VT P TS  P+S+ ST+DE            E   +    ++E+ +
Sbjct: 734 DMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEH 793

Query: 688 TSR--------RILDEEHVDFALFANVDPVYFEEAIQDENWKD------------AMNQE 734
            ++        R  +   V+   + + + V   +  + E+ K+            AM +E
Sbjct: 794 PTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEE 853
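
The percentages in each HSP header are the match counts divided by the alignment length. As a quick check, this sketch recomputes them for the P10978 hit (241 identical and 371 positive positions over 704 aligned columns); the `percent` helper is a hypothetical name used only here.

```python
def percent(matches, aligned_length):
    """Share of aligned columns that match, as a percentage."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return 100.0 * matches / aligned_length


# Counts taken from the HSP header of the P10978 hit.
print(f"Identity:  {percent(241, 704):.2f}%")  # 34.23%
print(f"Positives: {percent(371, 704):.2f}%")  # 52.70%
```

The same arithmetic applies to every other hit in this section, for example 480/740 and 578/740 for the Cucumis melo matches.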

BLAST of CSPI01G19050 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 249.2 bits (635), Expect = 1.6e-64
Identity = 208/783 (26.56%), Positives = 335/783 (42.78%), Query Frame = 0

Query: 75  NVVTNTESESRDNQFFSNR-GRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADCWS- 134
           N + +  + +  N  F NR  + +   +G  +         ++C +C R GH + DC+  
Sbjct: 196 NAIVHNNNNTYKNNLFKNRVTKPKKIFKGNSK-------YKVKCHHCGREGHIKKDCFHY 255

Query: 135 KKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEI--WYLDSGCSNHMTGRKDIFIS 194
           K+  +N+ +      Q+    G+ F+   V  +S  +   + LDSG S+H+       I+
Sbjct: 256 KRILNNKNKENEKQVQTATSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHL-------IN 315

Query: 195 LDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKK--------ITDVYYVSGLKHNLLSV 254
            +  + + V+      + V  +G+ +  TK G  +        + DV +      NL+SV
Sbjct: 316 DESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSV 375

Query: 255 GQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWL 314
            +L   G  + F DK     +KNG ++ K       +  + +   +         +   L
Sbjct: 376 KRLQEAGMSIEF-DKSGVTISKNGLMVVK---NSGMLNNVPVINFQAYSINAKHKNNFRL 435

Query: 315 WHCRFGHLSFDTLSHMCQQHMVRGMS---NIKKEDQLCEACVFRKHHRNSF-PTGGSWRA 374
           WH RFGH+S   L  + +++M    S   N++   ++CE C+  K  R  F         
Sbjct: 436 WHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHI 495

Query: 375 SKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVE 434
            +PL +VH+D+CGP+   T     YF+ F+D ++     YL+K KS  F  F+ F A  E
Sbjct: 496 KRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSE 555

Query: 435 NESNLKLKSLRSDRGGEYI--VFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 494
              NLK+  L  D G EY+      F  + GI +  TV  TPQ NGV+ER  R I E AR
Sbjct: 556 AHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKAR 615

Query: 495 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSV--QGITPQEAWSGLKPTVSHLRVFGCIA 554
           +M+   KL   FWG+AV  A YL+NR  ++++     TP E W   KP + HLRVFG   
Sbjct: 616 TMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATV 675

Query: 555 YSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEA-------- 614
           Y HI + K+GK DDKS K IFVGY  N   ++L++ +++K I++RDV  DE         
Sbjct: 676 YVHIKN-KQGKFDDKSFKSIFVGYEPN--GFKLWDAVNEKFIVARDVVVDETNMVNSRAV 735

Query: 615 -----------------------------------------------------------K 674
                                                                      K
Sbjct: 736 KFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRK 795

Query: 675 LWQWNAPNEDQNPLHVDM-------------DGKKDARDLELEVTQPLTSPSSSHSTSDE 734
           + Q   PNE +   ++               + KK  RD  L  ++   +P+ S  +   
Sbjct: 796 IIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETA 855

BLAST of CSPI01G19050 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 3.8e-53
Identity = 202/812 (24.88%), Positives = 334/812 (41.13%), Query Frame = 0

Query: 74   SNVVTNTESESRDNQFFSNRGRGRSSNRGRGR--------SGGRGDFSHI-----QCFNC 133
            +NVVT+  + +  NQ  +NRG  R+ N    R        SG R D         +C  C
Sbjct: 204  ANVVTHRNTNTNRNQ--NNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQIC 263

Query: 134  RRYGHFQADCWSKKTNSNQAETTLMHEQSNND----QGLLFLTLNVQESSTEEIWYLDSG 193
               GH    C       +Q ++T   +QS +     Q    L +N   ++    W LDSG
Sbjct: 264  SVQGHSAKRC----PQLHQFQSTTNQQQSTSPFTPWQPRANLAVNSPYNANN--WLLDSG 323

Query: 194  CSNHMTGR-KDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGL 253
             ++H+T    ++      +  + V   D   + +   G   + T   +  +  V YV  +
Sbjct: 324  ATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNI 383

Query: 254  KHNLLSVGQLLLRGH-DVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCF-- 313
              NL+SV +L       V F     +++  N  +      T ++++   I   + V    
Sbjct: 384  HKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFA 443

Query: 314  ETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPT 373
                  T   WH R GH S   L+ +   H +  + N   +   C  C   K H+  F +
Sbjct: 444  SPCSKATHSSWHSRLGHPSLAILNSVISNHSL-PVLNPSHKLLSCSDCFINKSHKVPF-S 503

Query: 374  GGSWRASKPLELVHTDL-CGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFK 433
              +  +SKPLE +++D+   P+ +  +   RY++ F+D ++R TW+Y LK+KS   + F 
Sbjct: 504  NSTITSSKPLEYIYSDVWSSPILSIDN--YRYYVIFVDHFTRYTWLYPLKQKSQVKDTFI 563

Query: 434  TFKAMVENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRII 493
             FK++VEN    ++ +L SD GGE++V  D+L ++GI H  +   TP+ NG++ERK+R I
Sbjct: 564  IFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHI 623

Query: 494  MELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFG 553
            +E+  ++L    +P  +W  A + AVYL+NR  T  +Q  +P +   G  P    L+VFG
Sbjct: 624  VEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFG 683

Query: 554  CIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDE------ 613
            C  Y  +    R KL+DKS++C F+GYS    AY   +  + ++  SR V+FDE      
Sbjct: 684  CACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFS 743

Query: 614  ---------AKLWQWNAPNEDQN------PL----------HVDMDGKKDARDLELEVTQ 673
                      +    +APN   +      PL          H+D   +  +    L  TQ
Sbjct: 744  TTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQ 803

Query: 674  ---------PLTSPSSSHSTSDEETTPRKTRNIQEIYNTS-------------------- 733
                      ++SPSSS  T+     P+ T    +  N++                    
Sbjct: 804  VSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPN 863

BLAST of CSPI01G19050 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 7.3e-49
Identity = 145/536 (27.05%), Positives = 246/536 (45.90%), Query Frame = 0

Query: 59  SNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCF 118
           +N   G R  R + R+N   +   +     F  N      +N+ +   G        +C 
Sbjct: 235 NNNNNGNRNNRYDNRNNNNNSKPWQQSSTNFHPN------NNQSKPYLG--------KCQ 294

Query: 119 NCRRYGHFQADC-----WSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYL 178
            C   GH    C     +    NS Q  +     Q   +     L L    SS    W L
Sbjct: 295 ICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQPRAN-----LALGSPYSSNN--WLL 354

Query: 179 DSGCSNHMTGR-KDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYV 238
           DSG ++H+T    ++ +    +  + V   D   + +   G   + TK     + ++ YV
Sbjct: 355 DSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYV 414

Query: 239 SGLKHNLLSVGQLL-LRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVC 298
             +  NL+SV +L    G  V F     +++  N  +      T ++++   I   + V 
Sbjct: 415 PNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVS 474

Query: 299 FETLVND--TSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNS 358
                +   T   WH R GH +   L+ +   + +  + N   +   C  C+  K ++  
Sbjct: 475 LFASPSSKATHSSWHARLGHPAPSILNSVISNYSL-SVLNPSHKFLSCSDCLINKSNKVP 534

Query: 359 FPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFEC 418
           F +  +  +++PLE +++D+       +H   RY++ F+D ++R TW+Y LK+KS   E 
Sbjct: 535 F-SQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKET 594

Query: 419 FKTFKAMVENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNR 478
           F TFK ++EN    ++ +  SD GGE++   ++  ++GI H  +   TP+ NG++ERK+R
Sbjct: 595 FITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHR 654

Query: 479 IIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRV 538
            I+E   ++L    +P  +W  A   AVYL+NR  T  +Q  +P +   G  P    LRV
Sbjct: 655 HIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRV 714

Query: 539 FGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDE 586
           FGC  Y  +    + KLDDKS +C+F+GYS    AY   +  + ++ ISR V+FDE
Sbjct: 715 FGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDE 746

BLAST of CSPI01G19050 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 1.5e-09
Identity = 34/111 (30.63%), Positives = 61/111 (54.95%), Query Frame = 0

Query: 676 AIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARYH 735
           A++D  W  AM +E+DA++RN+TW LV  P N+  LG KW+++TKL  +G + + KAR  
Sbjct: 34  ALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLV 93

Query: 736 NKICQRTLAIHIIKSSCP-----------SQGIKITTNQNLGFCFHIHSSI 776
            K   +   I+ +++  P           +   ++   Q++ + F +H S+
Sbjct: 94  AKGFHQEEGIYFVETYSPVVRTATIRTILNVAQQLEVGQSINWMFKMHFSM 144

BLAST of CSPI01G19050 vs. ExPASy TrEMBL
Match: A0A5A7UDJ2 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold174G001470 PE=4 SV=1)

HSP 1 Score: 967.2 bits (2499), Expect = 4.3e-278
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDDNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. ExPASy TrEMBL
Match: A0A5D3BQ81 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold451G00890 PE=4 SV=1)

HSP 1 Score: 966.5 bits (2497), Expect = 7.4e-278
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. ExPASy TrEMBL
Match: A0A5D3E3T2 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold156G00030 PE=4 SV=1)

HSP 1 Score: 966.5 bits (2497), Expect = 7.4e-278
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. ExPASy TrEMBL
Match: A0A5A7UZJ8 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold98G001310 PE=4 SV=1)

HSP 1 Score: 966.5 bits (2497), Expect = 7.4e-278
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. ExPASy TrEMBL
Match: A0A5D3CXM6 (Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold209G001540 PE=4 SV=1)

HSP 1 Score: 966.5 bits (2497), Expect = 7.4e-278
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1    MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
            M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 632  MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNP-EEAFQMQTSFRGGSR 691

Query: 61   GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
            GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 692  GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 751

Query: 121  CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
            CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 752  CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 811

Query: 181  CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
            CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 812  CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 871

Query: 241  HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
            HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 872  HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 931

Query: 301  NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
             D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 932  KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 991

Query: 361  RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
            RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 992  RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 1051

Query: 421  VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
             EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 1052 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 1111

Query: 481  SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
            SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 1112 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 1171

Query: 541  HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
            HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 1172 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 1231

Query: 601  EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
            E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 1232 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 1291

Query: 661  ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
            I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 1292 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 1351

Query: 721  KWIYRTKLKQNGEVQKYKAR 734
            KW+YRTKLK +G V+KYKAR
Sbjct: 1352 KWVYRTKLKSDGNVEKYKAR 1369

BLAST of CSPI01G19050 vs. NCBI nr
Match: KAA0051601.1 (integrase [Cucumis melo var. makuwa])

HSP 1 Score: 967.2 bits (2499), Expect = 8.9e-278
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDDNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. NCBI nr
Match: KAA0060377.1 (integrase [Cucumis melo var. makuwa])

HSP 1 Score: 966.5 bits (2497), Expect = 1.5e-277
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. NCBI nr
Match: KAA0039947.1 (integrase [Cucumis melo var. makuwa])

HSP 1 Score: 966.5 bits (2497), Expect = 1.5e-277
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. NCBI nr
Match: KAA0038926.1 (integrase [Cucumis melo var. makuwa])

HSP 1 Score: 966.5 bits (2497), Expect = 1.5e-277
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. NCBI nr
Match: KAA0057291.1 (integrase [Cucumis melo var. makuwa] >KAA0060890.1 integrase [Cucumis melo var. makuwa] >KAA0062702.1 integrase [Cucumis melo var. makuwa] >TYJ98712.1 integrase [Cucumis melo var. makuwa] >TYK13441.1 integrase [Cucumis melo var. makuwa])

HSP 1 Score: 966.5 bits (2497), Expect = 1.5e-277
Identity = 480/740 (64.86%), Positives = 578/740 (78.11%), Query Frame = 0

Query: 1   MTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSNPSEEAFHMQSSYRGRSN 60
           M R++EHIVVAIEESKDLSTLSINSLMGSLQSHELRLK FD NP EEAF MQ+S+RG S 
Sbjct: 167 MPRKFEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKQFDVNP-EEAFQMQTSFRGGSR 226

Query: 61  GRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQ 120
           GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQ
Sbjct: 227 GRRGGHGRRGGGR-NYDNRSGANSENSQESSSLSRGRGSGRGRGFGRNQGGGRGNFSQIQ 286

Query: 121 CFNCRRYGHFQADCWSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDSG 180
           CFNC +YGHFQA+CW+ K         +  EQ   D+G+LFL  +VQ++  E  WYLDSG
Sbjct: 287 CFNCGKYGHFQANCWALKNGVGNTTMNMHKEQKKIDEGILFLACSVQDNVVEPTWYLDSG 346

Query: 181 CSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKKITDVYYVSGLK 240
           CSNHMTG + IF++LDES Q+ VKTGDN  L+VKG+GDILVKTK G K++T+V+YV GLK
Sbjct: 347 CSNHMTGNRSIFVTLDESFQSEVKTGDNTRLQVKGQGDILVKTKKGTKRVTNVFYVPGLK 406

Query: 241 HNLLSVGQLLLRGHDVIFKDKICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLV 300
           HNLLS+GQLL RG  V F+  IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++
Sbjct: 407 HNLLSIGQLLQRGLKVSFEGDICAIKDQAGVLIAKVKMTANKMFPLNFTYGQISCFSSIL 466

Query: 301 NDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVFRKHHRNSFPTGGSW 360
            D SWLWH R+GHL+F +LS++C+ HMVRG+ NI  E  +CE C+  KHHR+SFPTG +W
Sbjct: 467 KDPSWLWHFRYGHLNFKSLSYLCKNHMVRGIQNINHETNICEVCILAKHHRDSFPTGKAW 526

Query: 361 RASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAM 420
           RASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA 
Sbjct: 527 RASKPLELIHTDLCGPMRTTTNGGNRYFITFIDDFSRKLWIYFLKEKSEALVCFKSFKAF 586

Query: 421 VENESNLKLKSLRSDRGGEYIVFADFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELAR 480
            EN+S  K+K+LRSDRGGEYI F +F KE GI HQ T R TPQQNGVAERKNR IME+AR
Sbjct: 587 TENQSGYKIKTLRSDRGGEYIAFGNFFKEQGIHHQMTARMTPQQNGVAERKNRTIMEMAR 646

Query: 481 SMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYS 540
           SMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSHLRVFG IAYS
Sbjct: 647 SMLKAKNLPNEFWGDAVACTVYILNRAPTKSVPGMTPYEAWCGEKPSVSHLRVFGSIAYS 706

Query: 541 HISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-N 600
           HI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +
Sbjct: 707 HIPNQLRGKLDDKSEKCIMVGYSENSKAYRLYNPVSRKIIISRDVIFSEDESWNWNDDVD 766

Query: 601 EDQNPLHVDMDGKKDARDLELEVTQPLTSPSS--SHSTSDEETTPRKTRNIQEIYNTSRR 660
           E ++P HV++D  + A++LE    Q + S SS  S STS++E +PR+ R+IQEIYNT+ R
Sbjct: 767 EAKSPFHVNIDENEVAQELEQAEIQAMESSSSSTSSSTSNDEISPRRMRSIQEIYNTTNR 826

Query: 661 ILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGV 720
           I D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAI RNETWEL++LP NK+ALGV
Sbjct: 827 INDDHFANFALFAGVDPVTFDEAIQDEKWKIAMDQEIDAIRRNETWELMELPTNKQALGV 886

Query: 721 KWIYRTKLKQNGEVQKYKAR 734
           KW+YRTKLK +G V+KYKAR
Sbjct: 887 KWVYRTKLKSDGNVEKYKAR 904

BLAST of CSPI01G19050 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 66.6 bits (161), Expect = 1.1e-10
Identity = 34/85 (40.00%), Positives = 48/85 (56.47%), Query Frame = 0

Query: 468 NRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHL 527
           NR I+E  RSML    LP  F  DA   AV+++N+  + ++    P E W    PT S+L
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 528 RVFGCIAYSHISDEKRGKLDDKSEK 553
           R FGC+AY H  +   GKL  +++K
Sbjct: 62  RRFGCVAYIHCDE---GKLKPRAKK 83

BLAST of CSPI01G19050 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 66.6 bits (161), Expect = 1.1e-10
Identity = 34/111 (30.63%), Positives = 61/111 (54.95%), Query Frame = 0

Query: 676 AIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARYH 735
           A++D  W  AM +E+DA++RN+TW LV  P N+  LG KW+++TKL  +G + + KAR  
Sbjct: 34  ALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLV 93

Query: 736 NKICQRTLAIHIIKSSCP-----------SQGIKITTNQNLGFCFHIHSSI 776
            K   +   I+ +++  P           +   ++   Q++ + F +H S+
Sbjct: 94  AKGFHQEEGIYFVETYSPVVRTATIRTILNVAQQLEVGQSINWMFKMHFSM 144

BLAST of CSPI01G19050 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 59.7 bits (143), Expect = 1.3e-08
Identity = 45/145 (31.03%), Positives = 65/145 (44.83%), Query Frame = 0

Query: 625 PSSSHSTSDEETTPRKTRNIQEIYNTSRRILDEEHV-DFALFANVDPVY----------- 684
           P  S  TS   T  RK   +Q+ Y  S   L    +  F  +  V P+Y           
Sbjct: 27  PEPSVHTSHRRT--RKPAYLQDYYCHSVASLTIHDISQFLSYEKVSPLYHSFLVCIAKAK 86

Query: 685 ----FEEAIQDENWKDAMNQEIDAITRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQ 744
               + EA +   W  AM+ EI A+    TWE+  LP NKK +G KW+Y+ K   +G ++
Sbjct: 87  EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 146

Query: 745 KYKARYHNKICQRTLAIHIIKSSCP 754
           +YKAR   K   +   I  I++  P
Sbjct: 147 RYKARLVAKGYTQQEGIDFIETFSP 169

BLAST of CSPI01G19050 vs. TAIR 10
Match: AT3G20980.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 57.4 bits (137), Expect = 6.5e-08
Identity = 37/90 (41.11%), Positives = 48/90 (53.33%), Query Frame = 0

Query: 168 EEIWYLDSGCSNHMTGRKDIFISLDESHQNVVK--TGDNK---MLEVKGKGDILVKTKMG 227
           E IW + S  SNHMT     F +LD S +  VK  +GD     +  V+G GD+   T  G
Sbjct: 266 ENIWLISSTNSNHMTPHVKFFTTLDRSRKCKVKFISGDKSETTVAMVEGIGDVTFITNEG 325

Query: 228 AKKITDVYYVSGLKHNLLSVGQLLLRGHDV 253
            K I +V YV G++ N LSV QL   G +V
Sbjct: 326 NKTIKNVLYVPGIEGNALSVSQLKRNGFEV 355

BLAST of CSPI01G19050 vs. TAIR 10
Match: AT3G21000.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 53.9 bits (128), Expect = 7.1e-07
Identity = 39/138 (28.26%), Positives = 69/138 (50.00%), Query Frame = 0

Query: 117 CFNCRRYGHFQADC-WSKKTNSNQAETTLMHEQSNNDQGLLFLTLNVQESSTEEIWYLDS 176
           C  C +  H Q DC +   T+  + E  ++      D  L  +     ++  ++IW +  
Sbjct: 230 CGLCYKNNHNQEDCKFRIHTDKEEKEDEIV-----VDYRLETVPNLGAKTYDDDIWIIHK 289

Query: 177 GCSNHMTGRKDIFISLDESHQNVVKTGDNKMLEVKGKGDILVKTKMGAKK-ITDVYYVSG 236
               +MT     F +LD + +  V T D  +L V+GKGD+ ++ K G KK I +V +V G
Sbjct: 290 MAPINMTPYVKYFTTLDRTFKATVGTVDGTVLLVEGKGDVKIRMKEGKKKTIRNVIFVPG 349

Query: 237 LKHNLLSVGQLLLRGHDV 253
           L  N+LS G+++ + + +
Sbjct: 350 LNRNVLSFGKMVSKRYSI 362

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
P10978        3.5e-99   34.23         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146        1.6e-64   26.56         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q9ZT94        3.8e-53   24.88         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2        7.3e-49   27.05         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P92520        1.5e-09   30.63         Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ...

Match Name    E-value   Identity (%)  Description
A0A5A7UDJ2    4.3e-278  64.86         Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold174G001470 PE=...
A0A5D3BQ81    7.4e-278  64.86         Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold451G00890 PE=4...
A0A5D3E3T2    7.4e-278  64.86         Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold156G00030 PE=4...
A0A5A7UZJ8    7.4e-278  64.86         Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold98G001310 PE=4...
A0A5D3CXM6    7.4e-278  64.86         Integrase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold209G001540 PE=...

Match Name    E-value   Identity (%)  Description
KAA0051601.1  8.9e-278  64.86         integrase [Cucumis melo var. makuwa]
KAA0060377.1  1.5e-277  64.86         integrase [Cucumis melo var. makuwa]
KAA0039947.1  1.5e-277  64.86         integrase [Cucumis melo var. makuwa]
KAA0038926.1  1.5e-277  64.86         integrase [Cucumis melo var. makuwa]
KAA0057291.1  1.5e-277  64.86         integrase [Cucumis melo var. makuwa] >KAA0060890.1 integrase [Cucumis melo var. ...

Match Name    E-value   Identity (%)  Description
ATMG00710.1   1.1e-10   40.00         Polynucleotidyl transferase, ribonuclease H-like superfamily protein
ATMG00820.1   1.1e-10   30.63         Reverse transcriptase (RNA-dependent DNA polymerase)
AT4G23160.1   1.3e-08   31.03         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
AT3G20980.1   6.5e-08   41.11         Gag-Pol-related retrotransposon family protein
AT3G21000.1   7.1e-07   28.26         Gag-Pol-related retrotransposon family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source       Source Term  Source Description                   Alignment
IPR025724  GAG-pre-integrase domain            PFAM         PF13976      gag_pre-integrs                      coord: 288..345; e-value: 9.5E-14; score: 51.0
None       No IPR available                    GENE3D       4.10.60.10   -                                    coord: 92..145; e-value: 1.8E-7; score: 32.9
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 820..835
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 72..97
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 618..645
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 798..813
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 778..835
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction                  coord: 46..108
None       No IPR available                    PANTHER      PTHR45895    FAMILY NOT NAMED                     coord: 170..746
IPR001584  Integrase, catalytic core           PFAM         PF00665      rve                                  coord: 359..456; e-value: 2.6E-13; score: 50.1
IPR001584  Integrase, catalytic core           PROSITE      PS50994      INTEGRASE                            coord: 357..521; score: 25.658638
IPR036397  Ribonuclease H superfamily          GENE3D       3.30.420.10  -                                    coord: 355..535; e-value: 4.2E-41; score: 142.4
IPR001878  Zinc finger, CCHC-type              PROSITE      PS50158      ZF_CCHC                              coord: 117..130; score: 9.059338
IPR012337  Ribonuclease H-like superfamily     SUPERFAMILY  53098        Ribonuclease H-like                  coord: 356..515
IPR036875  Zinc finger, CCHC-type superfamily  SUPERFAMILY  57756        Retrovirus zinc finger-like domains  coord: 99..139

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G19050.1  CSPI01G19050.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0006396      RNA processing
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding