CcUC03G057900 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC03G057900
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Unknown protein
Location: CicolChr03: 29829698 .. 29833060 (+)
RNA-Seq Expression: CcUC03G057900
Synteny: CcUC03G057900
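
The Location value gives the chromosome, the start and end coordinates, and the strand. As a minimal sketch, the span covered by the gene can be derived from it, assuming the coordinates are 1-based and inclusive (the usual GFF convention; this record does not state it). The helper names `parse_location` and `span_length` are hypothetical and only serve the illustration.

```python
import re


def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1


chrom, start, end, strand = parse_location("CicolChr03: 29829698 .. 29833060 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 3363 bp on the forward strand of CicolChr03.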
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TATACATCATGATATTTTATCAACATATTTGTGATACAAACAAAATTCTCAATATCAAAATCCATATTTTAATCAAAAGTGCTCACAGCTTGAAGTTTCTTGTAAGATTATGCTTTATTTCATCCGAATTTTGAAAGTACAATCTCGTTGAAGCATTTTAACAAACAGTTTGAAGGCGATATGAAGAACAAGAAAGAAACTGCTAACAGTTCCATCTGAAAATCACACCGGCAGCAAAGCAATTACGAGATTCCGATGAAAATCTTTGCAATTACTGACCTGACGAGCAGCGCCGAAGCATCGAACGCCGTCGGGCTTGCCTTGCGATTCCTTGAGGAGGAAGAGAGCCGCTCGGTCATCGCCTTTGGCTAACTCTCTGTCGATTTGCTCGAGCACTTGTCGCCGCCGGAGACCGACCTCCGATGAACAATTCCAGATTCCGGTCGCTCGAATACATGGACGGCAACGTAAACCTGCATACTGCATTGAACTTCTTTAATCCGAAGCAATCGCACAGGAGGAGAATTTATTGTAAGAAATCTCAAGACGTTTTGTCTGGTTACTATTCTGAAGTTCGCAATAGTTTTCGACCTCACACTCACGGCTATGGATAATTTGAGAATAATTTGGAAATTCCAGAAGCTGAATTGAAAACGACGTCGTATATTCTAAAAAATTGTCTCCTGGTTGTGGATTAACTTCTTTTTCATTTCTTTCTCGGGAAACGCCAACCGGGAGTGTTATGTAATATGTTCTTCACTACTGCTGATTACGATTTCACTTCCAACCTACAGTTTCATCGGAGGATTCCGGTGACCGGAGACGTCATTTCATCGGTGAAGCGCGGGGAATCTGTTGATGGTGCAGCGAAACGACGTCGAGCTCTGAAGCTTGTGGATCGAGCACTCTCGAAGCGGCAATACAAATCCGCTCTTTCGTTGGTTAAGCAGTTGCAAGGGAAACCCTATGGCCTTCGTGCTTTCGGTGCCGCCAAGCAGGTCATCTCTTTCTTTCTTTCTGTTTGTTTCGAGTTTGAGAGATTCTTCTGTCAAAACATCCATTGATTAGTAATTCTTATCTTCGAAATCATATTGAAAATGGTGTAGATATTCAAGAGGCCTTCAGCAATGGACGAATCAGAGCTCAATAGAAAGGATATTTTATCCCTTCAACCACTAGTGGATTTAATTCTGGATTCAATTCAACAATGTCTTCAGATATCTTTATCTGAGAGGGTACATTCCCTGCTTCTTGTTATGCAAGTTTTCATTTCTGAAGATTTTGGTGTTATTGATGACTCTTCATTTGTGCTTCAAGAACTTCACTCTCTGTTTGTTGTTTTACAGACCTCTGCTGAAAAGCTAGAGAGTTTAATTGCTGAAGGTAGACATTCTTCTCGTTGTGAAGAAGAAGAACACCTCATATGTGCACAAGTGAGCTAAATTTGCTAATTCTTTCATGGATTTTTCCACTGTTGTCATTGAGTTTGCATATATAATTGGAAGTTGAGATTATGCTTGCAGCATGAAGCTGGCCATTTCCTTGTTGGCTATTTGATGGGTGTTCTTCCAAAAGAATATGAGGTGCCAAGCATTCAAGCTCTAAGCCAGAACAGATTTGCTGAAGGAAAAGTTTCATTTGTTGGTTTTGAATTTCTTGGGGAAGTAAGATCATTTTCACGCTGCCTATATCACCTGTTTTCGTCAAGATTCAAACTGGGTTTCTTCGGATCTTTCTTTCTATATTTTTGATCTATAAGGCTTCTTAGAAAATCTAGTCAAGACAATTTTGGGTTTTTACCTTGCTGTTTCAAAGAAATATGATAGATTATCAATGCCTGTTAAGAGCTGATTCTTGCTATTTCTTCCGCTCCTTCTACCCGATTGGTTTAGCATGTGAGATTTTAAAATTCAATATAATCTCATGAAGGTTTATAAATATTCTTATGGCTTTACTTTTACTTTTAAGGTTCACATAATAGATGACCCAAATTTTGGAAG
AAAAAATGAGTTTTGCCCTACTTTATAAAAATGAGATTTTTCGATATTTGTTGATGTCAAATTCGGGTTAAATTACCAAAAAAATGACCTTCGCACATGCGAAGAGTAAATTCATAACTTCACAAGATCATGACATCAACAAAAGGTCAATTACCATTGTTTTTAAAAAGTGGGACATTCTTTATTTTTTGACCTCCAAAATGGGTCAACTGTACAATTTTCTCTTACTTTTATCCCTCCATGCAGATTTATTCAGTAAGGATTTTGGGGGGAAATGTTGATGTCAGAAATTTGCATAACAGGGTGAGATTTCTTTCTATTAGGGTTGAATGGTTATTTGATTTCTGGTTATAGAAACAATATCAAACAATTGTTTTTGGTAACATAGAATATGATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCGTGTAATTAATTTATTGGGTTATAGCTTATTTTAATGTACCAACCCTGATTTCTAAAGCTTTAATCTATTCTGTATCATTCAGGCAAATAAAGGCAGAATTTCCTCAAAGGTGAACAGTTCACATCTTACTCTACATGCTATTCACTAAAGCCTTCAATGGCTTTCTCTTTGCAGCTTTGATCCATTGGAACATGTTCTTGTAACTCATCCAACCTTTTGAGAAAGACTACGACATTGAAAATTCATGAGTTTTAATGATCTTAAGCTTACATTCTTAAAACTGCATTATAGCCTGTCAAACAAATATTCAATTATCAATGACATTGAAAAAGAATTCATTTTGAAGATGATTTTGTTTTCCTCTGCTGTTTCAGATATTGAACCAGTTTTCATGTGTAGCATTAGGAGGCTTAGTGGCTGAGCTTCTAGTTGCTGGAAATTCTGATGGCCATCTAGCAGATATGCTCAAGGTAAAAGCAAGCAACTTCCTCTTCTTCTTTTCCATTTTGCACTACCTATTATTTCATGGTGTTGTTTGCAACAGCTGGGGAGTGTTCTTACATGGCTTGGCCTTTCCAAGTCTGAAGCTGATCTTCATTTAAAATGGGCTGCAACAAACACGGCATTCATAATGTCCCAGCATTGTGAAACAAGATCAAGACTCGCAGAGGCCATGGTGCTGGGGAAACCGATTGGGCTCTGTATCGACGCAATTGAAAACTGTTTGCAGGGAATGGAGATATGAAAACGAATTTTCGTGATATTTGAATAGCCATGGCCACCACTACTCAGGAGTGATCTGATGAGTTTTTTGGAAAATCAAATGTAGTTGAGTGAAGCAACTGTCTTGGAAGATAGTGCGCGCGACGTGGTCCAAACCATCGAGAAAAAATGTGATCGTCTTGGACGATGAGTCATTCATGAG

mRNA sequence

TATACATCATGATATTTTATCAACATATTTGTGATACAAACAAAATTCTCAATATCAAAATCCATATTTTAATCAAAAGTGCTCACAGCTTGAAGTTTCTTGTAAGATTATGCTTTATTTCATCCGAATTTTGAAAGTACAATCTCGTTGAAGCATTTTAACAAACAGTTTGAAGGCGATATGAAGAACAAGAAAGAAACTGCTAACAGTTCCATCTGAAAATCACACCGGCAGCAAAGCAATTACGAGATTCCGATGAAAATCTTTGCAATTACTGACCTGACGAGCAGCGCCGAAGCATCGAACGCCGTCGGGCTTGCCTTGCGATTCCTTGAGGAGGAAGAGAGCCGCTCGGTCATCGCCTTTGGCTAACTCTCTGTCGATTTGCTCGAGCACTTGTCGCCGCCGGAGACCGACCTCCGATGAACAATTCCAGATTCCGGTCGCTCGAATACATGGACGGCAACGTAAACCTGCATACTGCATTGAACTTCTTTAATCCGAAGCAATCGCACAGGAGGAGAATTTATTGTAAGAAATCTCAAGACGTTTTGTCTGGTTACTATTCTGAAGTTCGCAATAGTTTTCGACCTCACACTCACGGCTATGGATAATTTGAGAATAATTTGGAAATTCCAGAAGCTGAATTGAAAACGACGTCGTATATTCTAAAAAATTGTCTCCTGGTTGTGGATTAACTTCTTTTTCATTTCTTTCTCGGGAAACGCCAACCGGGAGTGTTATGTAATATGTTCTTCACTACTGCTGATTACGATTTCACTTCCAACCTACAGTTTCATCGGAGGATTCCGGTGACCGGAGACGTCATTTCATCGGTGAAGCGCGGGGAATCTGTTGATGGTGCAGCGAAACGACGTCGAGCTCTGAAGCTTGTGGATCGAGCACTCTCGAAGCGGCAATACAAATCCGCTCTTTCGTTGGTTAAGCAGTTGCAAGGGAAACCCTATGGCCTTCGTGCTTTCGGTGCCGCCAAGCAGATATTCAAGAGGCCTTCAGCAATGGACGAATCAGAGCTCAATAGAAAGGATATTTTATCCCTTCAACCACTAGTGGATTTAATTCTGGATTCAATTCAACAATGTCTTCAGATATCTTTATCTGAGAGGGTACATTCCCTGCTTCTTGTTATGCAAGTTTTCATTTCTGAAGATTTTGGTGTTATTGATGACTCTTCATTTGTGCTTCAAGAACTTCACTCTCTGTTTGTTGTTTTACAGACCTCTGCTGAAAAGCTAGAGAGTTTAATTGCTGAAGGTAGACATTCTTCTCGTTGTGAAGAAGAAGAACACCTCATATGTGCACAACATGAAGCTGGCCATTTCCTTGTTGGCTATTTGATGGGTGTTCTTCCAAAAGAATATGAGGTGCCAAGCATTCAAGCTCTAAGCCAGAACAGATTTGCTGAAGGAAAAGTTTCATTTGTTGGTTTTGAATTTCTTGGGGAAATATTGAACCAGTTTTCATGTGTAGCATTAGGAGGCTTAGTGGCTGAGCTTCTAGTTGCTGGAAATTCTGATGGCCATCTAGCAGATATGCTCAAGCTGGGGAGTGTTCTTACATGGCTTGGCCTTTCCAAGTCTGAAGCTGATCTTCATTTAAAATGGGCTGCAACAAACACGGCATTCATAATGTCCCAGCATTGTGAAACAAGATCAAGACTCGCAGAGGCCATGGTGCTGGGGAAACCGATTGGGCTCTGTATCGACGCAATTGAAAACTGTTTGCAGGGAATGGAGATATGAAAACGAATTTTCGTGATATTTGAATAGCCATGGCCACCACTACTCAGGAGTGATCTGATGAGTTTTTTGGAAAATCAAATGTAGTTGAGTGAAGCAACTGTCTTGGAAGATAGTGCGCGCGACGTGGTCCAAACCATCGAGAAAAAATGTGATCGTCTTGGACGATGAGTCATTCATGAG

Coding sequence (CDS)

ATGTTCTTCACTACTGCTGATTACGATTTCACTTCCAACCTACAGTTTCATCGGAGGATTCCGGTGACCGGAGACGTCATTTCATCGGTGAAGCGCGGGGAATCTGTTGATGGTGCAGCGAAACGACGTCGAGCTCTGAAGCTTGTGGATCGAGCACTCTCGAAGCGGCAATACAAATCCGCTCTTTCGTTGGTTAAGCAGTTGCAAGGGAAACCCTATGGCCTTCGTGCTTTCGGTGCCGCCAAGCAGATATTCAAGAGGCCTTCAGCAATGGACGAATCAGAGCTCAATAGAAAGGATATTTTATCCCTTCAACCACTAGTGGATTTAATTCTGGATTCAATTCAACAATGTCTTCAGATATCTTTATCTGAGAGGGTACATTCCCTGCTTCTTGTTATGCAAGTTTTCATTTCTGAAGATTTTGGTGTTATTGATGACTCTTCATTTGTGCTTCAAGAACTTCACTCTCTGTTTGTTGTTTTACAGACCTCTGCTGAAAAGCTAGAGAGTTTAATTGCTGAAGGTAGACATTCTTCTCGTTGTGAAGAAGAAGAACACCTCATATGTGCACAACATGAAGCTGGCCATTTCCTTGTTGGCTATTTGATGGGTGTTCTTCCAAAAGAATATGAGGTGCCAAGCATTCAAGCTCTAAGCCAGAACAGATTTGCTGAAGGAAAAGTTTCATTTGTTGGTTTTGAATTTCTTGGGGAAATATTGAACCAGTTTTCATGTGTAGCATTAGGAGGCTTAGTGGCTGAGCTTCTAGTTGCTGGAAATTCTGATGGCCATCTAGCAGATATGCTCAAGCTGGGGAGTGTTCTTACATGGCTTGGCCTTTCCAAGTCTGAAGCTGATCTTCATTTAAAATGGGCTGCAACAAACACGGCATTCATAATGTCCCAGCATTGTGAAACAAGATCAAGACTCGCAGAGGCCATGGTGCTGGGGAAACCGATTGGGCTCTGTATCGACGCAATTGAAAACTGTTTGCAGGGAATGGAGATATGA

Protein sequence

MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSSRCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCLQGMEI
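
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that step, assuming the standard genetic code (the record does not name the translation table used); `translate` is a hypothetical helper, not part of any database tooling. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGTTCTTCACTACTGCTGATTACGATTTC"))
print(translate("GGAATGGAGATATGA"))
```

The two fragments translate to the start (MFFTTADYDF) and the end (GMEI) of the listed protein sequence.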
Homology
BLAST of CcUC03G057900 vs. NCBI nr
Match: XP_038879283.1 (uncharacterized protein LOC120071224 isoform X1 [Benincasa hispida])

HSP 1 Score: 498.4 bits (1282), Expect = 4.7e-137
Identity = 271/365 (74.25%), Positives = 284/365 (77.81%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MFFT A YDFT NL+FHRRIPVTG+VISSVKRGES DGA KRRRALKLVDRALSKRQYKS
Sbjct: 1   MFFTAAVYDFTFNLEFHRRIPVTGEVISSVKRGESGDGAVKRRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           ALSLVKQLQGKPYGLRAFGAAKQI KR S MDE ELNRKDIL+LQPLV  ILDSIQQCLQ
Sbjct: 61  ALSLVKQLQGKPYGLRAFGAAKQIIKRRSEMDEPELNRKDILALQPLVVSILDSIQQCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           ISL E++                                     SAEKL+SL+A+GRHSS
Sbjct: 121 ISLLEKI-------------------------------------SAEKLQSLVADGRHSS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI 240
           RCEEEEH ICAQHEAGHFLVGYLMGVLPKEYEVPSIQAL+QNRFAEGKVSFVGFEFLGEI
Sbjct: 181 RCEEEEHFICAQHEAGHFLVGYLMGVLPKEYEVPSIQALNQNRFAEGKVSFVGFEFLGEI 240

Query: 241 ----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKL 300
                                       LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL
Sbjct: 241 DSVKILGENADIRNFHNRANEGRISSKTLNQFSCVTLGGLVAELLVAGNSDGHLADILKL 300

Query: 301 GSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCL 338
           GSVLTWLG SKSEAD+HLKWAATNTAFIMS+HCETRSRLAEAM LGKPIGLCIDAIENCL
Sbjct: 301 GSVLTWLGFSKSEADIHLKWAATNTAFIMSRHCETRSRLAEAMALGKPIGLCIDAIENCL 328
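
The percentages in each HSP header follow directly from the match counts: identical (or positive-scoring) positions divided by the alignment length. A minimal sketch that recomputes them from a header line is shown below; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of BLAST itself.

```python
import re

# Matches the statistics line of an HSP header; tolerant of spelling
# variants of "Positives" that some report renderers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


line = "Identity = 271/365 (74.25%), Positives = 284/365 (77.81%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Benincasa hispida hit, 271 of 365 aligned positions are identical and 284 of 365 are positive-scoring, which reproduces the reported 74.25% and 77.81%.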

BLAST of CcUC03G057900 vs. NCBI nr
Match: XP_022968757.1 (uncharacterized protein LOC111467900 isoform X2 [Cucurbita maxima])

HSP 1 Score: 462.6 bits (1189), Expect = 2.9e-126
Identity = 256/352 (72.73%), Positives = 267/352 (75.85%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MFFT AD DFTSNL+FHRRIPVTGDVISS KR +S DGAAKRRRALKLVDRALSKRQYKS
Sbjct: 1   MFFTAADCDFTSNLEFHRRIPVTGDVISSAKRWDSGDGAAKRRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           ALSLVKQLQGKPYGLRAFGAAKQI KRPSAMDESELN KDILSLQPLVD ILDSIQ CLQ
Sbjct: 61  ALSLVKQLQGKPYGLRAFGAAKQITKRPSAMDESELNTKDILSLQPLVDSILDSIQPCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           I                                           SAE+LESLIAEGR+ S
Sbjct: 121 I-------------------------------------------SAERLESLIAEGRYPS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGE- 240
           RCEEEEHLICAQHEAGHFLVGYLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLG+ 
Sbjct: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKQYEVPSIQALRQNRFAEGNVSFVGFEFLGQE 240

Query: 241 --------------ILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSE 300
                          LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL SVL WLGL KS+
Sbjct: 241 NKGRQENKGTISLTKLNQFSCVILGGLVAELLVAGNSDGHLADILKLESVLVWLGLPKSD 300

Query: 301 ADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCLQGMEI 338
           AD HLKWAA NTAFIMS+H ETR  LA+ M LGK IG CID IENCLQG+EI
Sbjct: 301 ADRHLKWAAMNTAFIMSRHSETRLILAKVMALGKSIGFCIDTIENCLQGIEI 309

BLAST of CcUC03G057900 vs. NCBI nr
Match: XP_022968755.1 (uncharacterized protein LOC111467900 isoform X1 [Cucurbita maxima] >XP_022968756.1 uncharacterized protein LOC111467900 isoform X1 [Cucurbita maxima])

HSP 1 Score: 458.0 bits (1177), Expect = 7.1e-125
Identity = 257/371 (69.27%), Positives = 268/371 (72.24%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MFFT AD DFTSNL+FHRRIPVTGDVISS KR +S DGAAKRRRALKLVDRALSKRQYKS
Sbjct: 1   MFFTAADCDFTSNLEFHRRIPVTGDVISSAKRWDSGDGAAKRRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           ALSLVKQLQGKPYGLRAFGAAKQI KRPSAMDESELN KDILSLQPLVD ILDSIQ CLQ
Sbjct: 61  ALSLVKQLQGKPYGLRAFGAAKQITKRPSAMDESELNTKDILSLQPLVDSILDSIQPCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           I                                           SAE+LESLIAEGR+ S
Sbjct: 121 I-------------------------------------------SAERLESLIAEGRYPS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI 240
           RCEEEEHLICAQHEAGHFLVGYLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLG+I
Sbjct: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKQYEVPSIQALRQNRFAEGNVSFVGFEFLGQI 240

Query: 241 ----------------------------------LNQFSCVALGGLVAELLVAGNSDGHL 300
                                             LNQFSCV LGGLVAELLVAGNSDGHL
Sbjct: 241 DSIKILVENADIKNLHERENKGRQENKGTISLTKLNQFSCVILGGLVAELLVAGNSDGHL 300

Query: 301 ADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCID 338
           AD+LKL SVL WLGL KS+AD HLKWAA NTAFIMS+H ETR  LA+ M LGK IG CID
Sbjct: 301 ADILKLESVLVWLGLPKSDADRHLKWAAMNTAFIMSRHSETRLILAKVMALGKSIGFCID 328

BLAST of CcUC03G057900 vs. NCBI nr
Match: XP_004135797.2 (uncharacterized protein LOC101213254 isoform X2 [Cucumis sativus])

HSP 1 Score: 449.1 bits (1154), Expect = 3.3e-122
Identity = 249/365 (68.22%), Positives = 265/365 (72.60%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MF TTA YDFT NL+FH R+PVTGDV+SS          AKRRRALKLVDRALSKRQYKS
Sbjct: 1   MFLTTAVYDFTFNLEFHLRVPVTGDVVSS----------AKRRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           A+SLVKQLQGKPYGLR FGAAKQI K+   +DESE+NR DILSLQPLVD ILDS+QQCLQ
Sbjct: 61  AVSLVKQLQGKPYGLRGFGAAKQIIKKRLELDESEVNRMDILSLQPLVDSILDSVQQCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           ISL E +                                     S EKLES +AEGRHSS
Sbjct: 121 ISLLEEI------------------------------------LSVEKLESSMAEGRHSS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI 240
           RCEE+EH ICAQHEAGHFLVGYLMGVLPK Y+VPSIQAL QNRFAEGKVSFVGFEFLGEI
Sbjct: 181 RCEEQEHFICAQHEAGHFLVGYLMGVLPKAYQVPSIQALRQNRFAEGKVSFVGFEFLGEI 240

Query: 241 ----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKL 300
                                       LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL
Sbjct: 241 DSAKILGENADIRSFNNRANKGTISSKTLNQFSCVTLGGLVAELLVAGNSDGHLADILKL 300

Query: 301 GSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCL 338
            SVLTWLGL KSEADLHL+WAATNTAFIMS+HCETRSRLAEAM L KPIGLCIDAIENCL
Sbjct: 301 WSVLTWLGLPKSEADLHLRWAATNTAFIMSRHCETRSRLAEAMALAKPIGLCIDAIENCL 319

BLAST of CcUC03G057900 vs. NCBI nr
Match: TYK10198.1 (uncharacterized protein E5676_scaffold16G003430 [Cucumis melo var. makuwa])

HSP 1 Score: 448.7 bits (1153), Expect = 4.3e-122
Identity = 240/334 (71.86%), Positives = 259/334 (77.54%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MF T A YDFT +L+FHRR+PVTG V+SS          A+RRRALKLVDRALSKRQYKS
Sbjct: 1   MFLTAAVYDFTFDLEFHRRVPVTGYVVSS----------AERRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           A+SLVKQLQGKPYGLR FGAAKQI KR   +DESE+N  D+LSLQPLVD ILDS+QQCLQ
Sbjct: 61  AVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDSILDSVQQCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           IS  E +                                     SAEK ES +AEGRHSS
Sbjct: 121 ISFLEEI------------------------------------LSAEKPESSMAEGRHSS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI 240
           RCEE+EH ICAQHEAGHFLVGYLMGVLPKEY+VPS+QALSQNRFAEGKVSFVGFEFLGE 
Sbjct: 181 RCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGET 240

Query: 241 LNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFI 300
           LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL SVLTW GL KSEADLHL+WAATNTAFI
Sbjct: 241 LNQFSCVTLGGLVAELLVAGNSDGHLADILKLWSVLTWFGLPKSEADLHLRWAATNTAFI 288

Query: 301 MSQHCETRSRLAEAMVLGKPIGLCIDAIENCLQG 335
           MS+HCETR RLAEAM L KPIGLCID IENCL+G
Sbjct: 301 MSRHCETRLRLAEAMTLAKPIGLCIDTIENCLEG 288

BLAST of CcUC03G057900 vs. ExPASy TrEMBL
Match: A0A6J1HUE1 (uncharacterized protein LOC111467900 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111467900 PE=4 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 1.4e-126
Identity = 256/352 (72.73%), Positives = 267/352 (75.85%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MFFT AD DFTSNL+FHRRIPVTGDVISS KR +S DGAAKRRRALKLVDRALSKRQYKS
Sbjct: 1   MFFTAADCDFTSNLEFHRRIPVTGDVISSAKRWDSGDGAAKRRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           ALSLVKQLQGKPYGLRAFGAAKQI KRPSAMDESELN KDILSLQPLVD ILDSIQ CLQ
Sbjct: 61  ALSLVKQLQGKPYGLRAFGAAKQITKRPSAMDESELNTKDILSLQPLVDSILDSIQPCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           I                                           SAE+LESLIAEGR+ S
Sbjct: 121 I-------------------------------------------SAERLESLIAEGRYPS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGE- 240
           RCEEEEHLICAQHEAGHFLVGYLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLG+ 
Sbjct: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKQYEVPSIQALRQNRFAEGNVSFVGFEFLGQE 240

Query: 241 --------------ILNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSE 300
                          LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL SVL WLGL KS+
Sbjct: 241 NKGRQENKGTISLTKLNQFSCVILGGLVAELLVAGNSDGHLADILKLESVLVWLGLPKSD 300

Query: 301 ADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCLQGMEI 338
           AD HLKWAA NTAFIMS+H ETR  LA+ M LGK IG CID IENCLQG+EI
Sbjct: 301 ADRHLKWAAMNTAFIMSRHSETRLILAKVMALGKSIGFCIDTIENCLQGIEI 309

BLAST of CcUC03G057900 vs. ExPASy TrEMBL
Match: A0A6J1HY40 (uncharacterized protein LOC111467900 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467900 PE=4 SV=1)

HSP 1 Score: 458.0 bits (1177), Expect = 3.4e-125
Identity = 257/371 (69.27%), Positives = 268/371 (72.24%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MFFT AD DFTSNL+FHRRIPVTGDVISS KR +S DGAAKRRRALKLVDRALSKRQYKS
Sbjct: 1   MFFTAADCDFTSNLEFHRRIPVTGDVISSAKRWDSGDGAAKRRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           ALSLVKQLQGKPYGLRAFGAAKQI KRPSAMDESELN KDILSLQPLVD ILDSIQ CLQ
Sbjct: 61  ALSLVKQLQGKPYGLRAFGAAKQITKRPSAMDESELNTKDILSLQPLVDSILDSIQPCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           I                                           SAE+LESLIAEGR+ S
Sbjct: 121 I-------------------------------------------SAERLESLIAEGRYPS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI 240
           RCEEEEHLICAQHEAGHFLVGYLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLG+I
Sbjct: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKQYEVPSIQALRQNRFAEGNVSFVGFEFLGQI 240

Query: 241 ----------------------------------LNQFSCVALGGLVAELLVAGNSDGHL 300
                                             LNQFSCV LGGLVAELLVAGNSDGHL
Sbjct: 241 DSIKILVENADIKNLHERENKGRQENKGTISLTKLNQFSCVILGGLVAELLVAGNSDGHL 300

Query: 301 ADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCID 338
           AD+LKL SVL WLGL KS+AD HLKWAA NTAFIMS+H ETR  LA+ M LGK IG CID
Sbjct: 301 ADILKLESVLVWLGLPKSDADRHLKWAAMNTAFIMSRHSETRLILAKVMALGKSIGFCID 328

BLAST of CcUC03G057900 vs. ExPASy TrEMBL
Match: A0A5D3CG48 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold16G003430 PE=4 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 2.1e-122
Identity = 240/334 (71.86%), Positives = 259/334 (77.54%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MF T A YDFT +L+FHRR+PVTG V+SS          A+RRRALKLVDRALSKRQYKS
Sbjct: 1   MFLTAAVYDFTFDLEFHRRVPVTGYVVSS----------AERRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           A+SLVKQLQGKPYGLR FGAAKQI KR   +DESE+N  D+LSLQPLVD ILDS+QQCLQ
Sbjct: 61  AVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDSILDSVQQCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           IS  E +                                     SAEK ES +AEGRHSS
Sbjct: 121 ISFLEEI------------------------------------LSAEKPESSMAEGRHSS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI 240
           RCEE+EH ICAQHEAGHFLVGYLMGVLPKEY+VPS+QALSQNRFAEGKVSFVGFEFLGE 
Sbjct: 181 RCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGET 240

Query: 241 LNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFI 300
           LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL SVLTW GL KSEADLHL+WAATNTAFI
Sbjct: 241 LNQFSCVTLGGLVAELLVAGNSDGHLADILKLWSVLTWFGLPKSEADLHLRWAATNTAFI 288

Query: 301 MSQHCETRSRLAEAMVLGKPIGLCIDAIENCLQG 335
           MS+HCETR RLAEAM L KPIGLCID IENCL+G
Sbjct: 301 MSRHCETRLRLAEAMTLAKPIGLCIDTIENCLEG 288

BLAST of CcUC03G057900 vs. ExPASy TrEMBL
Match: A0A1S3BP83 (uncharacterized protein LOC103492218 OS=Cucumis melo OX=3656 GN=LOC103492218 PE=4 SV=1)

HSP 1 Score: 435.6 bits (1119), Expect = 1.8e-118
Identity = 241/362 (66.57%), Positives = 261/362 (72.10%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MF T A YDFT +L+FHRR+PVTG V+SS          A+RRRALKLVDRALSKRQYKS
Sbjct: 1   MFLTAAVYDFTFDLEFHRRVPVTGYVVSS----------AERRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           A+SLVKQLQGKPYGLR FGAAKQI KR   +DESE+N  D+LSLQPLVD ILDS+QQCLQ
Sbjct: 61  AVSLVKQLQGKPYGLRGFGAAKQIIKRRLELDESEVNGMDMLSLQPLVDSILDSVQQCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           IS  E +                                     SAEK ES +AEGRHSS
Sbjct: 121 ISFLEEI------------------------------------LSAEKPESSMAEGRHSS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI 240
           RCEE+EH ICAQHEAGHFLVGYLMGVLPKEY+VPS+QALSQNRFAEGKVSFVGFEFLGEI
Sbjct: 181 RCEEQEHFICAQHEAGHFLVGYLMGVLPKEYQVPSVQALSQNRFAEGKVSFVGFEFLGEI 240

Query: 241 ----------------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKL 300
                                       LNQFSCV LGGLVAELLVAGNSDGHLAD+LKL
Sbjct: 241 DSVKILGQNADIKKFNKRANKGTISSKTLNQFSCVTLGGLVAELLVAGNSDGHLADILKL 300

Query: 301 GSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCL 335
            SVLTW GL KSEADLHL+WAATNTAFIMS+HCETR RLAEAM L KPIGLCI+AIENCL
Sbjct: 301 WSVLTWFGLPKSEADLHLRWAATNTAFIMSRHCETRLRLAEAMTLAKPIGLCIEAIENCL 316

BLAST of CcUC03G057900 vs. ExPASy TrEMBL
Match: A0A6J1HDU1 (uncharacterized protein LOC111461960 OS=Cucurbita moschata OX=3662 GN=LOC111461960 PE=4 SV=1)

HSP 1 Score: 431.8 bits (1109), Expect = 2.6e-117
Identity = 248/371 (66.85%), Positives = 258/371 (69.54%), Query Frame = 0

Query: 1   MFFTTADYDFTSNLQFHRRIPVTGDVISSVKRGESVDGAAKRRRALKLVDRALSKRQYKS 60
           MFFT AD DFT NL+FHRRIPVTGDVISS KRG+S DGAAKRRRALKLVDRALSKRQYKS
Sbjct: 1   MFFTAADCDFTFNLEFHRRIPVTGDVISSAKRGDSGDGAAKRRRALKLVDRALSKRQYKS 60

Query: 61  ALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDESELNRKDILSLQPLVDLILDSIQQCLQ 120
           ALSLVKQLQGKPYGLRAFGAAKQI KRPSAMD         LSLQPLVD ILDSIQ CLQ
Sbjct: 61  ALSLVKQLQGKPYGLRAFGAAKQITKRPSAMDN--------LSLQPLVDSILDSIQPCLQ 120

Query: 121 ISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQELHSLFVVLQTSAEKLESLIAEGRHSS 180
           I                                           SAE+LESLIAEGR+ S
Sbjct: 121 I-------------------------------------------SAERLESLIAEGRYPS 180

Query: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQNRFAEGKVSFVGFEFLGEI 240
           RCEEEEHLICAQHEAGHFLVGYLMGVLPK+YEVPSIQAL QNRFAEG VSFVGFEFLGEI
Sbjct: 181 RCEEEEHLICAQHEAGHFLVGYLMGVLPKQYEVPSIQALRQNRFAEGNVSFVGFEFLGEI 240

Query: 241 ----------------------------------LNQFSCVALGGLVAELLVAGNSDGHL 300
                                             L QFSCV LGGLVAELLVAGNSDGHL
Sbjct: 241 DSIKILVENADIINLHKRENKGRQENKGTISSTKLKQFSCVILGGLVAELLVAGNSDGHL 300

Query: 301 ADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCID 338
           AD+LKL SVL WLGL KS+AD   KWAA NTAFIMS+H ETRS LA+ M LGK IG CID
Sbjct: 301 ADILKLESVLIWLGLPKSDADRLFKWAAMNTAFIMSRHSETRSILAKVMALGKSIGFCID 320

BLAST of CcUC03G057900 vs. TAIR 10
Match: AT5G27290.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54680.3); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 159.1 bits (401), Expect = 6.3e-39
Identity = 116/319 (36.36%), Positives = 176/319 (55.17%), Query Frame = 0

Query: 35  SVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDES 94
           S  G + RR+AL+ VD  LS    ++ALSLVK LQGKP GLR FGAA+Q+ +R   ++E 
Sbjct: 35  SETGLSIRRQALEQVDSKLSSGDERAALSLVKDLQGKPDGLRCFGAARQVPQRLYTLEEL 94

Query: 95  ELNRKDILSLQPLVDLILDSIQQCLQISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQE 154
           +LN  +  SL    D  L SI++ LQI+    V   ++  + F        D SS  L  
Sbjct: 95  KLNGINAASLLSPTDTTLGSIERNLQIA---AVSGGIVAWKAF--------DLSSQQLFF 154

Query: 155 LHSLFVVLQT-----SAEKLESLIAEGRHSSRCEEEEHLICAQHEAGHFLVGYLMGVLPK 214
           L   F+ L T         + SL+ +        +  H    QHEAGHFLV YL+G+LP+
Sbjct: 155 LTLGFMFLWTLDLVSFNGGIGSLVLD-TTGHTFSQRYHNRVVQHEAGHFLVAYLVGILPR 214

Query: 215 EYEVPSIQALSQ--NRFAEGKVSFVGFEFLGEI---------LNQFSCVALGGLVAELLV 274
            Y + S++AL +  +   +   +FV +EFL E+         LN+FSC+AL G+  E L+
Sbjct: 215 GYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEVNSGKVSATMLNRFSCIALAGVATEYLL 274

Query: 275 AGNSDGHLADMLKLGSVLTWLGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLG 334
            G ++G L D+ KL  ++  LG ++ +AD  ++W+  NT  ++ +H   RS+LA+AM  G
Sbjct: 275 YGYAEGGLDDISKLDGLVKSLGFTQKKADSQVRWSVLNTILLLRRHEIARSKLAQAMSKG 334

Query: 335 KPIGLCIDAIENCLQGMEI 338
           + +G CI  IE+ +   +I
Sbjct: 335 ESVGSCIQIIEDSIDPSDI 341

BLAST of CcUC03G057900 vs. TAIR 10
Match: AT1G54680.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27290.1). )

HSP 1 Score: 154.5 bits (389), Expect = 1.6e-37
Identity = 78/173 (45.09%), Positives = 108/173 (62.43%), Query Frame = 0

Query: 184 EEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI-- 243
           EE+     QHE+GHFLVGYL+GVLP+ YE+P+++A+ QN     G+V FVGFEFL ++  
Sbjct: 45  EEDWFSVVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQLMK 104

Query: 244 ----------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTWLGLSKS 303
                           LN FSCV LGG+V E ++ G S+G  +D++KL  VL WLG ++S
Sbjct: 105 DDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLGFTES 164

Query: 304 EADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCLQGMEI 338
           E + H+KWA +NT  ++  H E R  LAE M   KPI  CI+AIE+ +   +I
Sbjct: 165 EKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 217

BLAST of CcUC03G057900 vs. TAIR 10
Match: AT1G54680.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27290.1); Has 200 Blast hits to 200 proteins in 57 species: Archae - 0; Bacteria - 59; Metazoa - 0; Fungi - 0; Plants - 127; Viruses - 0; Other Eukaryotes - 14 (source: NCBI BLink). )

HSP 1 Score: 152.5 bits (384), Expect = 5.9e-37
Identity = 78/179 (43.58%), Positives = 108/179 (60.34%), Query Frame = 0

Query: 184 EEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI-- 243
           EE+     QHE+GHFLVGYL+GVLP+ YE+P+++A+ QN     G+V FVGFEFL ++  
Sbjct: 45  EEDWFSVVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQVGA 104

Query: 244 ----------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTW 303
                                 LN FSCV LGG+V E ++ G S+G  +D++KL  VL W
Sbjct: 105 ANQLMKDDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRW 164

Query: 304 LGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCLQGMEI 338
           LG ++SE + H+KWA +NT  ++  H E R  LAE M   KPI  CI+AIE+ +   +I
Sbjct: 165 LGFTESEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 223

BLAST of CcUC03G057900 vs. TAIR 10
Match: AT1G54680.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G27290.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 152.5 bits (384), Expect = 5.9e-37
Identity = 78/179 (43.58%), Positives = 108/179 (60.34%), Query Frame = 0

Query: 184 EEEHLICAQHEAGHFLVGYLMGVLPKEYEVPSIQALSQN-RFAEGKVSFVGFEFLGEI-- 243
           EE+     QHE+GHFLVGYL+GVLP+ YE+P+++A+ QN     G+V FVGFEFL ++  
Sbjct: 41  EEDWFSVVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQVGA 100

Query: 244 ----------------------LNQFSCVALGGLVAELLVAGNSDGHLADMLKLGSVLTW 303
                                 LN FSCV LGG+V E ++ G S+G  +D++KL  VL W
Sbjct: 101 ANQLMKDDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRW 160

Query: 304 LGLSKSEADLHLKWAATNTAFIMSQHCETRSRLAEAMVLGKPIGLCIDAIENCLQGMEI 338
           LG ++SE + H+KWA +NT  ++  H E R  LAE M   KPI  CI+AIE+ +   +I
Sbjct: 161 LGFTESEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 219

BLAST of CcUC03G057900 vs. TAIR 10
Match: AT5G27290.2 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G54680.3); Has 199 Blast hits to 194 proteins in 57 species: Archae - 0; Bacteria - 61; Metazoa - 0; Fungi - 0; Plants - 129; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink). )

HSP 1 Score: 105.1 bits (261), Expect = 1.1e-22
Identity = 89/240 (37.08%), Positives = 128/240 (53.33%), Query Frame = 0

Query: 35  SVDGAAKRRRALKLVDRALSKRQYKSALSLVKQLQGKPYGLRAFGAAKQIFKRPSAMDES 94
           S  G + RR+AL+ VD  LS    ++ALSLVK LQGKP GLR FGAA+Q+ +R   ++E 
Sbjct: 35  SETGLSIRRQALEQVDSKLSSGDERAALSLVKDLQGKPDGLRCFGAARQVPQRLYTLEEL 94

Query: 95  ELNRKDILSLQPLVDLILDSIQQCLQISLSERVHSLLLVMQVFISEDFGVIDDSSFVLQE 154
           +LN  +  SL    D  L SI++ LQI+    V   ++  + F        D SS  L  
Sbjct: 95  KLNGINAASLLSPTDTTLGSIERNLQIA---AVSGGIVAWKAF--------DLSSQQLFF 154

Query: 155 LHSLFVVLQT-----SAEKLESLIAEGRHSSRCEEEEHLICAQHEAGHFLVGYLMGVLPK 214
           L   F+ L T         + SL+ +        +  H    QHEAGHFLV YL+G+LP+
Sbjct: 155 LTLGFMFLWTLDLVSFNGGIGSLVLD-TTGHTFSQRYHNRVVQHEAGHFLVAYLVGILPR 214

Query: 215 EYEVPSIQALSQ--NRFAEGKVSFVGFEFLGEI---------LNQFSCVALGGLVAELLV 259
            Y + S++AL +  +   +   +FV +EFL E+         LN+FSC+AL G+  E L+
Sbjct: 215 GYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEVNSGKVSATMLNRFSCIALAGVATEYLL 262

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038879283.1  4.7e-137  74.25         uncharacterized protein LOC120071224 isoform X1 [Benincasa hispida]
XP_022968757.1  2.9e-126  72.73         uncharacterized protein LOC111467900 isoform X2 [Cucurbita maxima]
XP_022968755.1  7.1e-125  69.27         uncharacterized protein LOC111467900 isoform X1 [Cucurbita maxima] >XP_022968756...
XP_004135797.2  3.3e-122  68.22         uncharacterized protein LOC101213254 isoform X2 [Cucumis sativus]
TYK10198.1      4.3e-122  71.86         uncharacterized protein E5676_scaffold16G003430 [Cucumis melo var. makuwa]

vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1HUE1      1.4e-126  72.73         uncharacterized protein LOC111467900 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HY40      3.4e-125  69.27         uncharacterized protein LOC111467900 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A5D3CG48      2.1e-122  71.86         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BP83      1.8e-118  66.57         uncharacterized protein LOC103492218 OS=Cucumis melo OX=3656 GN=LOC103492218 PE=...
A0A6J1HDU1      2.6e-117  66.85         uncharacterized protein LOC111461960 OS=Cucurbita moschata OX=3662 GN=LOC1114619...

vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G27290.1     6.3e-39   36.36         unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP...
AT1G54680.3     1.6e-37   45.09         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G54680.1     5.9e-37   43.58         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G54680.2     5.9e-37   43.58         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G27290.2     1.1e-22   37.08         unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP...
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term   IPR Description     Source       Source Term    Source Description         Alignment
IPR037219  Peptidase M41-like  GENE3D       1.20.58.760    Peptidase M41              coord: 175..317; e-value: 2.0E-8; score: 36.2
IPR037219  Peptidase M41-like  SUPERFAMILY  140990         FtsH protease domain-like  coord: 183..317
None       No IPR available    PANTHER      PTHR33471      FAMILY NOT NAMED           coord: 11..126; coord: 164..335
None       No IPR available    PANTHER      PTHR33471:SF4  T22H22.11 PROTEIN          coord: 11..126; coord: 164..335

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CcUC03G057900.1  CcUC03G057900.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0004176      ATP-dependent peptidase activity
molecular_function  GO:0004222      metalloendopeptidase activity