Cla97C04G070495 (gene) Watermelon (97103) v2.5

Overview
NameCla97C04G070495
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionReverse transcriptase domain-containing protein
LocationCla97Chr04: 12905526 .. 12908343 (+)
RNA-Seq ExpressionCla97C04G070495
SyntenyCla97C04G070495
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGTTCGAAAAGCAATTCACTTATCGGAGTCGCATATGCAGTGGTTGGTAAAGCAACTTGCAGAGCTTCTTAACGTTTCAAGCGTTCGTTTTTTCTTAAAGGAAATGCGAGACACCTCAGGAGCAATGAGATTATCAAAATTCAAAAACAACATGGGGTGGAATTTGAGATGCGTGGTTTGGCCAGCTATCGGAGGTCGGTTCTTCATTCATATACCTGAAGGGTCTGCACAACAAGGATGGTCAGAGTTCTTGGAAATGCTTATTAGCTTCACCAATCGATCCAAGATTTATAGAGAAAAGACTGCGAGAAAAGAGAAGGCTCATGAATTTATTCTTCCCCTAATTAAGCAACAGCTGAGTTACGCAGAGGTACTATCAAATGGAATTAGTCAGCAAAAATCCCCTGTTCGGGCACCACTAAAACAAAGCAAGTTGGCGGCAACTCTCGTGCATTTGAACTCCTCTATTTTTGCACCTGATAAACAGAGCAAGCATCAATGCTGGATCCGAGAAGAAATTGAAGTGTTTAAAGAAGATTTTATCAAATTAAAGTAAGCATTAATCCTCTATTTGCCGATAAAGCTTTGGTTAAAGTTATCTCAAGGCAATTTAGAAGAGTGAATCGTCTGGAAAATGGTTCGACTATGGAAGATTCCATCTTCTTTTTGAAAAATGGTGTTCTAAGCAACATAGCCGGCCAACTTGCATCAAAGGCTACAGAGGATGGTTGGCAATAAGAAACCTGCTGCTGGAATACTGGAACAGAGCCACCTTTGAAGCTTTACGTTCCCATTTTGGAGGTTTGGAAGAGAGAGCTCTAGAAACACTTAACCTTTTGAATGTATGAGAAGCCAAAATCAAAGTCAAAAGGAATTTATGTGGTTTTATGCCATCAATTGAAATTAAGAATGAATTGAGAGGTATAATTTTTGTGAACTTCGGGGATATTGAAGCCATGGAAACTCTTTCTTACGTACACAAAGAATTATTCTTTAAAGATTTCAACAATCCAATTGATCAAAGTCGTTTGATTAAAGTAGCAATGGATGAAGATGTTAGGATTATTTTATCAAACCACAACAGTGAAGTCCAAAAACCGATAGGCTCACCGGAGAAGACAACGGAAAAATCTGATCTTGAGAAAGTTGCCGTGACTACTTCAAGGAATGTCGAGGGTCTCTTTCAACTTGAAAGTGCGCCACTCCAGGACACCAACCAAAAAGCTGAAAAGCTTATTGCGGGCGTAAGTGAAGTAGCTAAAAAGTCGGAAGTTATTAGGAGAGAGAATCTCGTTGGCCGAAATTGTAGTCTTCCAAGCAACCAAAGCAAAAATGCTCTAAATTCCTTAATAATTATAGAATCCAGGGTGTCGACATCCCCTAGTCTCCCCTCGATTGAAAATGCATTCAATCCTGCCCTCAGGTTTGTTACCAATATCGCAGAGGAGATCAACACATTAGAAACTAAACCATCATTTTTGCCGAGTCTCAAGAAAGAACACATTAACTCAAATGGGATCTCACTGATGATCCCTTTCATTCGCTCCGGTAATACGCTCCCCAATAAGAAATCGCCATTGTCTGCGGTCAGAAAAACAACCAACCTCTTCAAACCCTTTCCAAAACATTTTGTGAAAGGAAGAACATCATTCCTTAATTCATGGTCAGCCCTCTCTAATTTAAATTTGCTTGAAGAAAGCTATGCCAAACTTCAAATACCAGAATATCCTTCTAGTAGATCCGTAAAATATCCCTTTCCTTCTCAACTCTCTCAACAATCAGTGATAGTTCTCCTGGGCTCAAATATTCATTTTGTACGAGGTACCTTTTGTCCTTATCCTACCAAATCAAGTCTTAAATCTCAAGAATCAGTGGTGAGTGCAAGCAGTGATGAATTGAAAGTGCAAAGAAAAGAAACTCCATCAAATGATCTCGAAACAGAGGACTCTATTAACGAGGATCTCAACAGGCTATTTCAAGCTGTAGAAGGTAATGATAGTGGTGCACTTGATGAAGCAATCGTCTCCCATTATCCAAGCAAAGACATTCCTAAGCACTTAAAATTAATCATTGACATTTGCAGGATATCCCTGAGAACAACTAAGGCTGTGAGCGAAAGCTGAAAAATGAAAATTATTTCATGGAACACAAGAGGAATTGGGGATCAATCCAAGCAATTGGCCCTTAAACGCATGATTTTGAAGTCAAACCCGGATTTGGTGCTTATCCAAGAAACCAAGAAAGATGCCATTGAATTGGATGTTATCAAAGCTATCTGGAGTTCTAAGGACATTAGATGGATTTATGTTGAAGCCTTTGGCAAATCAGGGGGAACACTCACCATGTGGGATGAAAGTAAAGTATCAATTCTAGAATCCCTCAAAGGAGGTTACTCACTCTCGGTAAAATGCAAATCCCTAAGTAAAAAGGTAGATTGGGTGACTAATATATATGGTCCTACTTATTACAAAGAAAGAAGACACATTTGGCCGGAATTAGCTTCCCTCTCAGCTTATTGTACAGAGGCATGGTGTTTGGGGGGAGACTTGAACATGACCAGATTGATTCAAGAAAGATCCCCAACTGGAAGCTGGACAAAAGGTATGAAGAAATTCAATAAATTCATAGAAAACGCTCATCTCTTGGAAATTCCACTCTCAAATGGTCACTTTACTTGGTCAAGAGAAGGAAGAGGTTCGTCTTGCTCACTGCTGGACAAGTTTCTTGTTTCCAATAATTGGGAGGAGGCCTTCGATGACACGAGAGTGGCAAGGCAAGCAAGATTGTATTCTGATCACTTCCCCTATTATTAG

mRNA sequence

ATGGAAGTTCGAAAAGCAATTCACTTATCGGAGTCGCATATGCAGTGGTTGGTAAAGCAACTTGCAGAGCTTCTTAACGTTTCAAGCGTTCGTTTTTTCTTAAAGGAAATGCGAGACACCTCAGGAGCAATGAGATTATCAAAATTCAAAAACAACATGGGGTGGAATTTGAGATGCGTGGTTTGGCCAGCTATCGGAGGTCGGTTCTTCATTCATATACCTGAAGGGTCTGCACAACAAGGATGGTCAGAGTTCTTGGAAATGCTTATTAGCTTCACCAATCGATCCAAGATTTATAGAGAAAAGACTGCGAGAAAAGAGAAGGCTCATGAATTTATTCTTCCCCTAATTAAGCAACAGCTGAGTTACGCAGAGGTACTATCAAATGGAATTAGTCAGCAAAAATCCCCTGTTCGGGCACCACTAAAACAAAGCAAGTTGGCGGCAACTCTCGTGCATTTGAACTCCTCTATTTTTGCACCTGATAAACAGAGCAAGCATCAATGCTGGATCCGAGAAGAAATTGAAGTGTTTAAAGAAGATTTTATCAAATTAAACCGGCCAACTTGCATCAAAGGCTACAGAGGATGGTTGGCAATAAGAAACCTGCTGCTGGAATACTGGAACAGAGCCACCTTTGAAGCTTTACGTTCCCATTTTGGAGAAGCCAAAATCAAAGTCAAAAGGAATTTATGTGGTTTTATGCCATCAATTGAAATTAAGAATGAATTGAGAGGTATAATTTTTGTGAACTTCGGGGATATTGAAGCCATGGAAACTCTTTCTTACGTACACAAAGAATTATTCTTTAAAGATTTCAACAATCCAATTGATCAAAGTCGTTTGATTAAAGTAGCAATGGATGAAGATGTTAGGATTATTTTATCAAACCACAACAGTGAAGTCCAAAAACCGATAGGCTCACCGGAGAAGACAACGGAAAAATCTGATCTTGAGAAAGTTGCCGTGACTACTTCAAGGAATGTCGAGGGTCTCTTTCAACTTGAAAGTGCGCCACTCCAGGACACCAACCAAAAAGCTGAAAAGCTTATTGCGGGCGTAAGTGAAGTAGCTAAAAAGTCGGAAGTTATTAGGAGAGAGAATCTCGTTGGCCGAAATTGTAGTCTTCCAAGCAACCAAAGCAAAAATGCTCTAAATTCCTTAATAATTATAGAATCCAGGGTGTCGACATCCCCTAGTCTCCCCTCGATTGAAAATGCATTCAATCCTGCCCTCAGGTTTGTTACCAATATCGCAGAGGAGATCAACACATTAGAAACTAAACCATCATTTTTGCCGAGTCTCAAGAAAGAACACATTAACTCAAATGGGATCTCACTGATGATCCCTTTCATTCGCTCCGGTAATACGCTCCCCAATAAGAAATCGCCATTGTCTGCGGTCAGAAAAACAACCAACCTCTTCAAACCCTTTCCAAAACATTTTGTGAAAGGAAGAACATCATTCCTTAATTCATGGTCAGCCCTCTCTAATTTAAATTTGCTTGAAGAAAGCTATGCCAAACTTCAAATACCAGAATATCCTTCTAGTAGATCCGTAAAATATCCCTTTCCTTCTCAACTCTCTCAACAATCAGTGATAGTTCTCCTGGGCTCAAATATTCATTTTGTACGAGGTACCTTTTGTCCTTATCCTACCAAATCAAGTCTTAAATCTCAAGAATCAGTGGTGAGTGCAAGCAGTGATGAATTGAAAGTGCAAAGAAAAGAAACTCCATCAAATGATCTCGAAACAGAGGACTCTATTAACGAGGATCTCAACAGGCTATTTCAAGCTGTAGAAGGTAATGATAGTGGTGCACTTGATGAAGCAATCGTCTCCCATTATCCAAGCAAAGACATTCCTAAGCACTTAAAATTAATCATTGACATTTGCAGGATATCCCTGAGAACAACTAAGGCTTCAAACCCGGATTTGGTGCTTATCCAAGAAACCAAGAAAGATGCCATTGAATTGGATGTTATCAAAGCTATCTGGAGTTCTAAGGACATTAGATGGATTTATGTTGAAGCCTTTGGCAAATCAGGGGGAACACTCACCATGTGGGATGAAAGTAAAGTATCAATTCTAGAATCCCTCAAAGGAGGTTACTCACTCTCGGTAAAATGCAAATCCCTAAGTAAAAAGGTAGATTGGGTGACTAATATATATGGTCCTACTTATTACAAAGAAAGAAGACACATTTGGCCGGAATTAGCTTCCCTCTCAGCTTATTGTACAGAGGCATGGTGTTTGGGGGGAGACTTGAACATGACCAGATTGATTCAAGAAAGATCCCCAACTGGAAGCTGGACAAAAGGTATGAAGAAATTCAATAAATTCATAGAAAACGCTCATCTCTTGGAAATTCCACTCTCAAATGGTCACTTTACTTGGTCAAGAGAAGGAAGAGGTTCGTCTTGCTCACTGCTGGACAAGTTTCTTGTTTCCAATAATTGGGAGGAGGCCTTCGATGACACGAGAGTGGCAAGGCAAGCAAGATTGTATTCTGATCACTTCCCCTATTATTAG

Coding sequence (CDS)

ATGGAAGTTCGAAAAGCAATTCACTTATCGGAGTCGCATATGCAGTGGTTGGTAAAGCAACTTGCAGAGCTTCTTAACGTTTCAAGCGTTCGTTTTTTCTTAAAGGAAATGCGAGACACCTCAGGAGCAATGAGATTATCAAAATTCAAAAACAACATGGGGTGGAATTTGAGATGCGTGGTTTGGCCAGCTATCGGAGGTCGGTTCTTCATTCATATACCTGAAGGGTCTGCACAACAAGGATGGTCAGAGTTCTTGGAAATGCTTATTAGCTTCACCAATCGATCCAAGATTTATAGAGAAAAGACTGCGAGAAAAGAGAAGGCTCATGAATTTATTCTTCCCCTAATTAAGCAACAGCTGAGTTACGCAGAGGTACTATCAAATGGAATTAGTCAGCAAAAATCCCCTGTTCGGGCACCACTAAAACAAAGCAAGTTGGCGGCAACTCTCGTGCATTTGAACTCCTCTATTTTTGCACCTGATAAACAGAGCAAGCATCAATGCTGGATCCGAGAAGAAATTGAAGTGTTTAAAGAAGATTTTATCAAATTAAACCGGCCAACTTGCATCAAAGGCTACAGAGGATGGTTGGCAATAAGAAACCTGCTGCTGGAATACTGGAACAGAGCCACCTTTGAAGCTTTACGTTCCCATTTTGGAGAAGCCAAAATCAAAGTCAAAAGGAATTTATGTGGTTTTATGCCATCAATTGAAATTAAGAATGAATTGAGAGGTATAATTTTTGTGAACTTCGGGGATATTGAAGCCATGGAAACTCTTTCTTACGTACACAAAGAATTATTCTTTAAAGATTTCAACAATCCAATTGATCAAAGTCGTTTGATTAAAGTAGCAATGGATGAAGATGTTAGGATTATTTTATCAAACCACAACAGTGAAGTCCAAAAACCGATAGGCTCACCGGAGAAGACAACGGAAAAATCTGATCTTGAGAAAGTTGCCGTGACTACTTCAAGGAATGTCGAGGGTCTCTTTCAACTTGAAAGTGCGCCACTCCAGGACACCAACCAAAAAGCTGAAAAGCTTATTGCGGGCGTAAGTGAAGTAGCTAAAAAGTCGGAAGTTATTAGGAGAGAGAATCTCGTTGGCCGAAATTGTAGTCTTCCAAGCAACCAAAGCAAAAATGCTCTAAATTCCTTAATAATTATAGAATCCAGGGTGTCGACATCCCCTAGTCTCCCCTCGATTGAAAATGCATTCAATCCTGCCCTCAGGTTTGTTACCAATATCGCAGAGGAGATCAACACATTAGAAACTAAACCATCATTTTTGCCGAGTCTCAAGAAAGAACACATTAACTCAAATGGGATCTCACTGATGATCCCTTTCATTCGCTCCGGTAATACGCTCCCCAATAAGAAATCGCCATTGTCTGCGGTCAGAAAAACAACCAACCTCTTCAAACCCTTTCCAAAACATTTTGTGAAAGGAAGAACATCATTCCTTAATTCATGGTCAGCCCTCTCTAATTTAAATTTGCTTGAAGAAAGCTATGCCAAACTTCAAATACCAGAATATCCTTCTAGTAGATCCGTAAAATATCCCTTTCCTTCTCAACTCTCTCAACAATCAGTGATAGTTCTCCTGGGCTCAAATATTCATTTTGTACGAGGTACCTTTTGTCCTTATCCTACCAAATCAAGTCTTAAATCTCAAGAATCAGTGGTGAGTGCAAGCAGTGATGAATTGAAAGTGCAAAGAAAAGAAACTCCATCAAATGATCTCGAAACAGAGGACTCTATTAACGAGGATCTCAACAGGCTATTTCAAGCTGTAGAAGGTAATGATAGTGGTGCACTTGATGAAGCAATCGTCTCCCATTATCCAAGCAAAGACATTCCTAAGCACTTAAAATTAATCATTGACATTTGCAGGATATCCCTGAGAACAACTAAGGCTTCAAACCCGGATTTGGTGCTTATCCAAGAAACCAAGAAAGATGCCATTGAATTGGATGTTATCAAAGCTATCTGGAGTTCTAAGGACATTAGATGGATTTATGTTGAAGCCTTTGGCAAATCAGGGGGAACACTCACCATGTGGGATGAAAGTAAAGTATCAATTCTAGAATCCCTCAAAGGAGGTTACTCACTCTCGGTAAAATGCAAATCCCTAAGTAAAAAGGTAGATTGGGTGACTAATATATATGGTCCTACTTATTACAAAGAAAGAAGACACATTTGGCCGGAATTAGCTTCCCTCTCAGCTTATTGTACAGAGGCATGGTGTTTGGGGGGAGACTTGAACATGACCAGATTGATTCAAGAAAGATCCCCAACTGGAAGCTGGACAAAAGGTATGAAGAAATTCAATAAATTCATAGAAAACGCTCATCTCTTGGAAATTCCACTCTCAAATGGTCACTTTACTTGGTCAAGAGAAGGAAGAGGTTCGTCTTGCTCACTGCTGGACAAGTTTCTTGTTTCCAATAATTGGGAGGAGGCCTTCGATGACACGAGAGTGGCAAGGCAAGCAAGATTGTATTCTGATCACTTCCCCTATTATTAG

Protein sequence

MEVRKAIHLSESHMQWLVKQLAELLNVSSVRFFLKEMRDTSGAMRLSKFKNNMGWNLRCVVWPAIGGRFFIHIPEGSAQQGWSEFLEMLISFTNRSKIYREKTARKEKAHEFILPLIKQQLSYAEVLSNGISQQKSPVRAPLKQSKLAATLVHLNSSIFAPDKQSKHQCWIREEIEVFKEDFIKLNRPTCIKGYRGWLAIRNLLLEYWNRATFEALRSHFGEAKIKVKRNLCGFMPSIEIKNELRGIIFVNFGDIEAMETLSYVHKELFFKDFNNPIDQSRLIKVAMDEDVRIILSNHNSEVQKPIGSPEKTTEKSDLEKVAVTTSRNVEGLFQLESAPLQDTNQKAEKLIAGVSEVAKKSEVIRRENLVGRNCSLPSNQSKNALNSLIIIESRVSTSPSLPSIENAFNPALRFVTNIAEEINTLETKPSFLPSLKKEHINSNGISLMIPFIRSGNTLPNKKSPLSAVRKTTNLFKPFPKHFVKGRTSFLNSWSALSNLNLLEESYAKLQIPEYPSSRSVKYPFPSQLSQQSVIVLLGSNIHFVRGTFCPYPTKSSLKSQESVVSASSDELKVQRKETPSNDLETEDSINEDLNRLFQAVEGNDSGALDEAIVSHYPSKDIPKHLKLIIDICRISLRTTKASNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFPYY
Homology
BLAST of Cla97C04G070495 vs. NCBI nr
Match: TYJ98683.1 (hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa])

HSP 1 Score: 219.2 bits (557), Expect = 1.4e-52
Identity = 100/200 (50.00%), Postives = 136/200 (68.00%), Query Frame = 0

Query: 643 NPD-LVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLK 702
           +PD LV+    +   I++ +IK++WSSKDI W  VE+FG+ GG LTMWD SK+ ++E+LK
Sbjct: 67  DPDHLVICYRNQGQEIDIALIKSLWSSKDIGWELVESFGRFGGILTMWDMSKIKVVETLK 126

Query: 703 GGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRL 762
           GGYSLS+   +  KK  W+TN+YGP  Y+ERR +W  L SLS YCT AWC+GG  N+TR 
Sbjct: 127 GGYSLSINSITSCKKSCWITNVYGPYDYEERRFVWLVLVSLSGYCTGAWCIGGKCNITRW 186

Query: 763 IQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWE 822
             E  P    T+GM++FN  I++ ++ E+PL NG  TWSREG   S SLLD F +   W+
Sbjct: 187 AHECFPLEKQTRGMRQFNNPIDSLNIWELPLQNGRCTWSREGSSISRSLLDPFFIDKEWD 246

Query: 823 EAFDDTRVARQARLYSDHFP 842
           E  +++RV R+A   SDHFP
Sbjct: 247 EISENSRVGRKAHTISDHFP 266

BLAST of Cla97C04G070495 vs. NCBI nr
Match: XP_038876676.1 (uncharacterized protein LOC120069076 [Benincasa hispida])

HSP 1 Score: 205.3 bits (521), Expect = 2.1e-48
Identity = 98/192 (51.04%), Postives = 130/192 (67.71%), Query Frame = 0

Query: 637 RTTKASNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSI 696
           R  K  NPD+VLIQETKKD IE   IK++WSSK++   +VEA GKSGG LT+WD+SK+ +
Sbjct: 22  RFLKKVNPDIVLIQETKKDRIEGSFIKSLWSSKEVGCAFVEAKGKSGGLLTVWDDSKILV 81

Query: 697 LESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDL 756
               K  +SLS+KC++++KK+ W+TN+YGP  Y+ERR +W EL+SL+    + WC+GGD 
Sbjct: 82  SSISKDEFSLSIKCQTINKKICWITNVYGPCDYQERRRLWAELSSLAEKLDDPWCIGGDF 141

Query: 757 NMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLV 816
           N  R   ER P G  T+ M  FNKFI   +LLEIPLSNG FTWS+EG   S S L  FL+
Sbjct: 142 NSIRRRHERYPVGKATRDMNNFNKFIRLNNLLEIPLSNGQFTWSKEGDVVSKS-LKMFLI 201

Query: 817 SNNWEEAFDDTR 829
               ++  +  R
Sbjct: 202 IQGLQDKLEPCR 212

BLAST of Cla97C04G070495 vs. NCBI nr
Match: XP_030478286.1 (uncharacterized protein LOC115695356 [Cannabis sativa])

HSP 1 Score: 186.8 bits (473), Expect = 7.6e-43
Identity = 82/204 (40.20%), Postives = 121/204 (59.31%), Query Frame = 0

Query: 638 TTKASNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSIL 697
           T    NPD+V++QE KK +++   I +IW S+   WI + A G+SGGTL +WD   +++L
Sbjct: 23  TNSKVNPDMVILQEVKKASVDRMYIGSIWRSRFKAWILILAIGRSGGTLLVWDTRSITVL 82

Query: 698 ESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLN 757
           +S+ G +S+SV  ++  K+  W + +YGP  Y ER   W E+A LSA C + WCLG D N
Sbjct: 83  DSMVGEFSISVLIEAEGKRPWWFSGVYGPCSYTERLEFWDEMAGLSAICGDVWCLGRDFN 142

Query: 758 MTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVS 817
           + R +QE+  + SWTK MK F++ +    L++  L+NG FTWS       C  LD+F  +
Sbjct: 143 VVRRVQEKLNSNSWTKSMKMFDQLVRELKLMDPKLNNGKFTWSNFRNKPICCRLDRFFFT 202

Query: 818 NNWEEAFDDTRVARQARLYSDHFP 842
           NNW   F   +     R+ SDH P
Sbjct: 203 NNWSNLFTFVQQEMLVRIVSDHSP 226

BLAST of Cla97C04G070495 vs. NCBI nr
Match: VVA20479.1 (Hypothetical predicted protein, partial [Prunus dulcis])

HSP 1 Score: 172.9 bits (437), Expect = 1.1e-38
Identity = 76/198 (38.38%), Postives = 115/198 (58.08%), Query Frame = 0

Query: 644 PDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGG 703
           PD+V++ ETKK+ ++  ++  +W S+   W++  + G+SGG   +W+   VS+++S+ G 
Sbjct: 36  PDIVILLETKKEIVDRQLVTGVWGSRFKEWVFSPSLGRSGGIAVLWNSQSVSVIDSMVGD 95

Query: 704 YSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRLIQ 763
           +S+S++         W++ IYGP   +ER   W ELA L  YC + WCLGGD N+ R   
Sbjct: 96  FSVSIRIVENIGTDWWLSGIYGPCRQRERISFWEELADLYGYCGDKWCLGGDFNVVRFSA 155

Query: 764 ERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEA 823
           E+S  G  TK M+ FN FI+  +L +  L N  FTWS     + C  LD+FLVS +WE+ 
Sbjct: 156 EKSNEGRVTKSMRDFNDFIQETNLRDPNLLNASFTWSNLRENAVCRRLDRFLVSGSWEDH 215

Query: 824 FDDTRVARQARLYSDHFP 842
           F   R     R+ SDH P
Sbjct: 216 FPHYRHKALPRITSDHCP 233

BLAST of Cla97C04G070495 vs. NCBI nr
Match: BBH07150.1 (TatD related DNase [Prunus dulcis])

HSP 1 Score: 172.6 bits (436), Expect = 1.5e-38
Identity = 76/198 (38.38%), Postives = 115/198 (58.08%), Query Frame = 0

Query: 644 PDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLKGG 703
           PD+V++ ETKK+ ++  ++  +W S+   W++  + G+SGG   +W+   VS+++S+ G 
Sbjct: 17  PDIVILLETKKEIVDRQLVTGVWGSRFKEWVFSPSLGRSGGIAVLWNSQSVSVIDSMVGE 76

Query: 704 YSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRLIQ 763
           +S+S++         W++ IYGP   +ER   W ELA L  YC + WCLGGD N+ R   
Sbjct: 77  FSVSIRIVENIGTDWWLSGIYGPCRQRERISFWEELADLYGYCGDKWCLGGDFNVVRFSA 136

Query: 764 ERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWEEA 823
           E+S  G  TK M+ FN FI+  +L +  L N  FTWS     + C  LD+FLVS +WE+ 
Sbjct: 137 EKSNEGRVTKSMRDFNDFIQETNLRDPNLLNASFTWSNLRENAVCRRLDRFLVSGSWEDH 196

Query: 824 FDDTRVARQARLYSDHFP 842
           F   R     R+ SDH P
Sbjct: 197 FPHYRHKALPRITSDHCP 214

BLAST of Cla97C04G070495 vs. ExPASy TrEMBL
Match: A0A5D3BHE3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold429G00120 PE=4 SV=1)

HSP 1 Score: 219.2 bits (557), Expect = 6.7e-53
Identity = 100/200 (50.00%), Postives = 136/200 (68.00%), Query Frame = 0

Query: 643 NPD-LVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLK 702
           +PD LV+    +   I++ +IK++WSSKDI W  VE+FG+ GG LTMWD SK+ ++E+LK
Sbjct: 67  DPDHLVICYRNQGQEIDIALIKSLWSSKDIGWELVESFGRFGGILTMWDMSKIKVVETLK 126

Query: 703 GGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRL 762
           GGYSLS+   +  KK  W+TN+YGP  Y+ERR +W  L SLS YCT AWC+GG  N+TR 
Sbjct: 127 GGYSLSINSITSCKKSCWITNVYGPYDYEERRFVWLVLVSLSGYCTGAWCIGGKCNITRW 186

Query: 763 IQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWE 822
             E  P    T+GM++FN  I++ ++ E+PL NG  TWSREG   S SLLD F +   W+
Sbjct: 187 AHECFPLEKQTRGMRQFNNPIDSLNIWELPLQNGRCTWSREGSSISRSLLDPFFIDKEWD 246

Query: 823 EAFDDTRVARQARLYSDHFP 842
           E  +++RV R+A   SDHFP
Sbjct: 247 EISENSRVGRKAHTISDHFP 266

BLAST of Cla97C04G070495 vs. ExPASy TrEMBL
Match: A0A803QQM3 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 4.8e-43
Identity = 87/200 (43.50%), Postives = 120/200 (60.00%), Query Frame = 0

Query: 642  SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLK 701
            +NPDLV++QE K+  ++   I +IW S+   WI + A G+SGGTL +WD   +S+L+SL 
Sbjct: 950  ANPDLVILQEVKRATVDRRFIGSIWRSRFKAWILLPALGRSGGTLLIWDTRTISVLDSLV 1009

Query: 702  GGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRL 761
            G +S+SV   +  K+  W + +YGP  YK R   W ELA LS+ C E+WC+GGD N+TR 
Sbjct: 1010 GEFSISVLINAEGKEPWWFSGVYGPCSYKLRPEFWDELAGLSSICGESWCVGGDFNVTRR 1069

Query: 762  IQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWE 821
            + E+  + S T+ MK F+  I    L++  L NG FTWS       CS LD+FL SNNW 
Sbjct: 1070 VGEKLNSSSCTRSMKLFDGLIRELQLIDPKLENGSFTWSNFRASPVCSRLDRFLFSNNWN 1129

Query: 822  EAFDDTRVARQARLYSDHFP 842
              +   R     RL SDH P
Sbjct: 1130 VIYPFVRQEMLVRLVSDHSP 1149

BLAST of Cla97C04G070495 vs. ExPASy TrEMBL
Match: A0A803QI00 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 2.4e-42
Identity = 86/200 (43.00%), Postives = 120/200 (60.00%), Query Frame = 0

Query: 642  SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLK 701
            +NPDLV++QE K+ +++   I +IW S+   WI + A G+SGGTL +WD   +++L+SL 
Sbjct: 926  ANPDLVILQEVKRTSVDRRFIGSIWRSRFKAWIIIPAIGRSGGTLLIWDTRTITVLDSLV 985

Query: 702  GGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRL 761
            G +S+SV  K+  K   W + +YGP  YK R   W ELA LSA C ++WC+GGD N+TR 
Sbjct: 986  GEFSISVLIKAEGKDPWWFSGVYGPCSYKLRPAFWDELAGLSAICGDSWCVGGDFNVTRR 1045

Query: 762  IQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWE 821
              E+  + S T+ MK F+  I    L++  L NG FTWS       CS LD+FL +NNW 
Sbjct: 1046 PGEKLNSSSCTRSMKLFDGLIRELRLIDPKLENGRFTWSNFRTSPVCSRLDRFLFTNNWN 1105

Query: 822  EAFDDTRVARQARLYSDHFP 842
              +   R     RL SDH P
Sbjct: 1106 VIYPFVRQEMLVRLVSDHSP 1125

BLAST of Cla97C04G070495 vs. ExPASy TrEMBL
Match: A0A803P8A0 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.2e-41
Identity = 86/200 (43.00%), Postives = 119/200 (59.50%), Query Frame = 0

Query: 642 SNPDLVLIQETKKDAIELDVIKAIWSSKDIRWIYVEAFGKSGGTLTMWDESKVSILESLK 701
           +NPD+V++QE K+  ++   I +IW S+   WI + A G+SGGTL +WD   +S+L+SL 
Sbjct: 27  ANPDMVILQEVKRATVDRRFIGSIWRSRFKAWILLPAIGRSGGTLLIWDTRIISVLDSLV 86

Query: 702 GGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRHIWPELASLSAYCTEAWCLGGDLNMTRL 761
           G +S+SV   +  K+  W + +YGP  YK R   W ELA LS+ C E+WC+GGD N+TR 
Sbjct: 87  GEFSISVLINAEGKEPWWFSGVYGPCSYKIRHVFWDELAGLSSICGESWCVGGDFNVTRR 146

Query: 762 IQERSPTGSWTKGMKKFNKFIENAHLLEIPLSNGHFTWSREGRGSSCSLLDKFLVSNNWE 821
           + E+  + S T+ MK F+  I    L++  L NG FTWS       CS LD+FL  NNW 
Sbjct: 147 VGEKLNSSSSTRSMKLFDGLIRELQLIDPKLENGSFTWSNFRAIPICSRLDRFLFLNNWN 206

Query: 822 EAFDDTRVARQARLYSDHFP 842
             F   R     RL SDH P
Sbjct: 207 VVFPFVRQEMLVRLVSDHSP 226

BLAST of Cla97C04G070495 vs. ExPASy TrEMBL
Match: A0A803QEA6 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 177.9 bits (450), Expect = 1.7e-40
Identity = 101/287 (35.19%), Postives = 151/287 (52.61%), Query Frame = 0

Query: 557 LKSQESVVSASSDELKVQRKETPSNDLETEDSINEDLNRLFQAVEGNDSGALDEAIVSH- 616
           L   E      +DE+ ++     SN +E+   +  ++ +     E  DS    E I++  
Sbjct: 304 LDELEKEEGREADEIMIEATSW-SNIVESMAEMGMEITQ-----ENEDSDQKTEEILTWN 363

Query: 617 -YPSKDIPKHLKLIIDICRISLRTTKASNPDLVLIQETKKDAIELDVIKAIWSSKDIRWI 676
              S D  K   +   IC+        +NPDLV++QE K+  ++   I +IW S+   WI
Sbjct: 364 IRGSGDKGKRTAIKATICK--------ANPDLVILQEVKRATVDRRFIGSIWRSRFKAWI 423

Query: 677 YVEAFGKSGGTLTMWDESKVSILESLKGGYSLSVKCKSLSKKVDWVTNIYGPTYYKERRH 736
            + A G+SGGTL +WD   +S+L+SL G +S+SV   +  K+  W + +YGP  YK R  
Sbjct: 424 LIPAIGRSGGTLLIWDTRTISVLDSLVGEFSISVLINAEGKEPWWFSGVYGPCSYKLRPE 483

Query: 737 IWPELASLSAYCTEAWCLGGDLNMTRLIQERSPTGSWTKGMKKFNKFIENAHLLEIPLSN 796
            W ELA LS+ C ++WC+ GD N+TR + E+  + S+T+ MK F+  I    L++  L N
Sbjct: 484 FWDELAGLSSICGKSWCVAGDFNVTRRVGEKLNSSSFTRSMKLFDGLIRELQLIDPKLEN 543

Query: 797 GHFTWSREGRGSSCSLLDKFLVSNNWEEAFDDTRVARQARLYSDHFP 842
           G FTWS       CS LD+FL +NNW   F   R     R+ SDH P
Sbjct: 544 GSFTWSNFRASPVCSRLDRFLFTNNWNIIFPFVRQELLVRIVSDHSP 576

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
TYJ98683.11.4e-5250.00hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa][more]
XP_038876676.12.1e-4851.04uncharacterized protein LOC120069076 [Benincasa hispida][more]
XP_030478286.17.6e-4340.20uncharacterized protein LOC115695356 [Cannabis sativa][more]
VVA20479.11.1e-3838.38Hypothetical predicted protein, partial [Prunus dulcis][more]
BBH07150.11.5e-3838.38TatD related DNase [Prunus dulcis][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3BHE36.7e-5350.00Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A803QQM34.8e-4343.50Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803QI002.4e-4243.00Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803P8A01.2e-4143.00Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803QEA61.7e-4035.19Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 628..843
e-value: 4.0E-24
score: 87.7
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 639..842
NoneNo IPR availablePANTHERPTHR22748:SF11DNA-(APURINIC OR APYRIMIDINIC SITE) LYASE CHLOROPLASTICcoord: 640..779
IPR004808AP endonuclease 1PANTHERPTHR22748AP ENDONUCLEASEcoord: 640..779

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C04G070495.1Cla97C04G070495.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006281 DNA repair
molecular_function GO:0004518 nuclease activity