CsGy4G017480 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy4G017480
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Mitochondrial transcription termination factor family protein
Location: Gy14Chr4: 22542222 .. 22543460 (-)
RNA-Seq Expression: CsGy4G017480
Synteny: CsGy4G017480
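
The Location field can be read as a span on the minus strand of Gy14Chr4. The sketch below shows how the span length follows from the two coordinates and how a minus-strand feature is obtained from the reference strand. It assumes the coordinates are 1-based and inclusive, which is the usual convention for GFF-style annotation but is not stated on this page; the function names are illustrative only.

```python
# Assumption: coordinates are 1-based and inclusive (GFF-style convention).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive start..end interval."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, i.e. the minus-strand reading of seq."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Coordinates taken from the Location field above.
    print(span_length(22542222, 22543460))  # 1239
    # Toy input, not taken from the record.
    print(reverse_complement("ATGC"))  # GCAT
```

Under that assumption the gene spans 1239 bases on the chromosome.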
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAGAGGGGGAAGAAGAACTGAAGACGAAAGCTGAAAATCATGAAGTTGAAATTCAGGTGTGTGTGTGTGTTTTGTTGTTATAAGTTAACAATGATAATTAAGTTGATATAATTAATAGTTATGTAAATTTATAATAATTAGTTTTGAGTTATGGGAATTAATTATCAGGAGAGAGGAGAAATATTCTTCTTGTATAGGCCTAAAGTCGGAAAACAAGAAGTGCATGGCCCTGATGAGGTGCAACGCTTGTACATTATTCTTCGGCCACAGTCCGGTGAGAAGACGGTCGAGGAGAAACAATGTAGCTATGGTGGACAGAGTACCCACACCCAGGTTAATTAATTAATTCTACCAAATACCTACATTTACTTTTGTTTTGACCATTTTATTTAATAAGACATGTCCAACTTATTTTTGTCTAAATTAGAAGTAGTAGATGAGTTAAACATGACCATAAACCCTCATTTTGGCTCTCATACAATATTTGAACACAGGAAGTTAACATCGAAGAGCAACCCCTTTTACGGTTCATTATTATGGGTCGAAAAAGCCTTCCACACCCATCCCACAGGTCTCGACCTTACTGGGGATTTGTAGATATGGTAACAACCAACGTTCAAGATATCAAGACTGCGCTTCAAGGAGGTACATCCCACTGAATCTACATACCCCATGTAGTTTTTATAGTCAGGTGCAAAATAAAATGAGTTTCTAATATGGCAAAAGCAGAGGAATACGACACTTCAACTCGAGGACATCGCCATATTTCTGCGGCAAGAGCATTGGGGGAAGGCATTTACCGTATTCTAAGGCATAATCCAAGGAACAAAAACAATAATCACCACACTCATTTGATCTACAAGCTACAGTTTCCAGCGGCAGACGAGAAAAACGAGCCTCAGAAATCCTTTAATATTGAAAGGGAAGGGTCATTTGTGATACAAATAAAGAACCCAGAGCAAGGAGGCGCAGGTGGTTCGTCTTCTCAGCACAAACGCAGGGCTCAGTTTCCAGCGCATTTGCAAGGTCAATTTGGGCATAAACGGTATTACCCGGCCGACCCGCCTGAGTTCTTGAATTTTGAAGGGTGTGAGTTCTTGTTGATATCGGCTTCTGATGATATAGAACAGGAATTGGGGTTGGAGTTGATTACTGAAGGAGAAGAATGTGATTTAGTGAAGACTTTTGGAGATGCTGTTTCCACGAAGCCTCTTTTCGAAGGTACTTGGGTGTAG

mRNA sequence

ATGGGAGAGGGGGAAGAAGAACTGAAGACGAAAGCTGAAAATCATGAAGTTGAAATTCAGGAGAGAGGAGAAATATTCTTCTTGTATAGGCCTAAAGTCGGAAAACAAGAAGTGCATGGCCCTGATGAGGTGCAACGCTTGTACATTATTCTTCGGCCACAGTCCGGTGAGAAGACGGTCGAGGAGAAACAATGTAGCTATGGTGGACAGAGTACCCACACCCAGGAAGTTAACATCGAAGAGCAACCCCTTTTACGGTTCATTATTATGGGTCGAAAAAGCCTTCCACACCCATCCCACAGGTCTCGACCTTACTGGGGATTTGTAGATATGGTAACAACCAACGTTCAAGATATCAAGACTGCGCTTCAAGGAGAGGAATACGACACTTCAACTCGAGGACATCGCCATATTTCTGCGGCAAGAGCATTGGGGGAAGGCATTTACCGTATTCTAAGGCATAATCCAAGGAACAAAAACAATAATCACCACACTCATTTGATCTACAAGCTACAGTTTCCAGCGGCAGACGAGAAAAACGAGCCTCAGAAATCCTTTAATATTGAAAGGGAAGGGTCATTTGTGATACAAATAAAGAACCCAGAGCAAGGAGGCGCAGGTGGTTCGTCTTCTCAGCACAAACGCAGGGCTCAGTTTCCAGCGCATTTGCAAGGTCAATTTGGGCATAAACGGTATTACCCGGCCGACCCGCCTGAGTTCTTGAATTTTGAAGGGTGTGAGTTCTTGTTGATATCGGCTTCTGATGATATAGAACAGGAATTGGGGTTGGAGTTGATTACTGAAGGAGAAGAATGTGATTTAGTGAAGACTTTTGGAGATGCTGTTTCCACGAAGCCTCTTTTCGAAGGTACTTGGGTGTAG

Coding sequence (CDS)

ATGGGAGAGGGGGAAGAAGAACTGAAGACGAAAGCTGAAAATCATGAAGTTGAAATTCAGGAGAGAGGAGAAATATTCTTCTTGTATAGGCCTAAAGTCGGAAAACAAGAAGTGCATGGCCCTGATGAGGTGCAACGCTTGTACATTATTCTTCGGCCACAGTCCGGTGAGAAGACGGTCGAGGAGAAACAATGTAGCTATGGTGGACAGAGTACCCACACCCAGGAAGTTAACATCGAAGAGCAACCCCTTTTACGGTTCATTATTATGGGTCGAAAAAGCCTTCCACACCCATCCCACAGGTCTCGACCTTACTGGGGATTTGTAGATATGGTAACAACCAACGTTCAAGATATCAAGACTGCGCTTCAAGGAGAGGAATACGACACTTCAACTCGAGGACATCGCCATATTTCTGCGGCAAGAGCATTGGGGGAAGGCATTTACCGTATTCTAAGGCATAATCCAAGGAACAAAAACAATAATCACCACACTCATTTGATCTACAAGCTACAGTTTCCAGCGGCAGACGAGAAAAACGAGCCTCAGAAATCCTTTAATATTGAAAGGGAAGGGTCATTTGTGATACAAATAAAGAACCCAGAGCAAGGAGGCGCAGGTGGTTCGTCTTCTCAGCACAAACGCAGGGCTCAGTTTCCAGCGCATTTGCAAGGTCAATTTGGGCATAAACGGTATTACCCGGCCGACCCGCCTGAGTTCTTGAATTTTGAAGGGTGTGAGTTCTTGTTGATATCGGCTTCTGATGATATAGAACAGGAATTGGGGTTGGAGTTGATTACTGAAGGAGAAGAATGTGATTTAGTGAAGACTTTTGGAGATGCTGTTTCCACGAAGCCTCTTTTCGAAGGTACTTGGGTGTAG

Protein sequence

MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTVEEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIKTALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPRNKNNNHHTHLIYKLQFPAADEKNEPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADPPEFLNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV*
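
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First eight and last three codons of the CDS shown above.
    print(translate("ATGGGAGAGGGGGAAGAAGAACTG"))  # MGEGEEEL
    print(translate("TGGGTGTAG"))  # WV*
```

The trailing `*` in the protein sequence is the terminal TAG stop codon of the CDS.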
Homology
BLAST of CsGy4G017480 vs. NCBI nr
Match: XP_031740485.1 (uncharacterized protein LOC101213393 [Cucumis sativus] >KAE8649641.1 hypothetical protein Csa_012091 [Cucumis sativus])

HSP 1 Score: 603 bits (1555), Expect = 7.88e-218
Identity = 293/293 (100.00%), Positives = 293/293 (100.00%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV
Sbjct: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60

Query: 61  EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK 120
           EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK
Sbjct: 61  EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK 120

Query: 121 TALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPRNKNNNHHTHLIYKLQFPAADEKN 180
           TALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPRNKNNNHHTHLIYKLQFPAADEKN
Sbjct: 121 TALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPRNKNNNHHTHLIYKLQFPAADEKN 180

Query: 181 EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADPPEF 240
           EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADPPEF
Sbjct: 181 EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADPPEF 240

Query: 241 LNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293
           LNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV
Sbjct: 241 LNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293

BLAST of CsGy4G017480 vs. NCBI nr
Match: XP_008444096.1 (PREDICTED: uncharacterized protein LOC103487535 [Cucumis melo] >KAA0064208.1 uncharacterized protein E6C27_scaffold548G001530 [Cucumis melo var. makuwa] >TYK02820.1 uncharacterized protein E5676_scaffold218G00100 [Cucumis melo var. makuwa])

HSP 1 Score: 566 bits (1459), Expect = 3.28e-203
Identity = 278/293 (94.88%), Positives = 284/293 (96.93%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MGEGEEELKTKAE+HEVEIQERGEIFFLYRPKV KQEVH PDEVQRLYIILRP SGEKTV
Sbjct: 1   MGEGEEELKTKAEDHEVEIQERGEIFFLYRPKVEKQEVHSPDEVQRLYIILRPLSGEKTV 60

Query: 61  EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK 120
           EEKQC  GGQSTHTQEVNI++QPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQ+IK
Sbjct: 61  EEKQCKDGGQSTHTQEVNIKKQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQEIK 120

Query: 121 TALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPRNKNNNHHTHLIYKLQFPAADEKN 180
            ALQGEEYDTSTRGHRHISAARALGEGIYRILRHNP+NKNNNH THLIYKL+FPAADEKN
Sbjct: 121 IALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPKNKNNNH-THLIYKLEFPAADEKN 180

Query: 181 EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADPPEF 240
           EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRY PADPPEF
Sbjct: 181 EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYCPADPPEF 240

Query: 241 LNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293
           LNFEGCEFLLISASDDIEQELGLEL TEGEECDLVKTFGDAVSTKPLFEGTWV
Sbjct: 241 LNFEGCEFLLISASDDIEQELGLELFTEGEECDLVKTFGDAVSTKPLFEGTWV 292
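
The Identity and Positives figures reported for each HSP are simple ratios over the alignment length. The sketch below reproduces the percentages for the Cucumis melo hit and counts identical columns in a short aligned stretch. The function names are illustrative, and the example strings are the first 14 aligned residues of the Query and Sbjct lines above.

```python
def percent(count: int, aligned_length: int) -> float:
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)


def count_identities(query: str, sbjct: str) -> int:
    """Count columns where both aligned sequences carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))


if __name__ == "__main__":
    print(percent(278, 293))  # 94.88 (Identity)
    print(percent(284, 293))  # 96.93 (Positives)
    print(count_identities("MGEGEEELKTKAEN", "MGEGEEELKTKAED"))  # 13
```

Positives additionally count conservative substitutions (the `+` marks in the midline), which requires a scoring matrix and is not reimplemented here.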

BLAST of CsGy4G017480 vs. NCBI nr
Match: XP_038895444.1 (uncharacterized protein LOC120083676 [Benincasa hispida])

HSP 1 Score: 484 bits (1246), Expect = 9.76e-171
Identity = 244/300 (81.33%), Positives = 265/300 (88.33%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MGEG +E KTKAE+  VEIQERGEI+F YRPKV KQEVH PDEVQRLYIILRP+SGEK V
Sbjct: 1   MGEG-QESKTKAEDG-VEIQERGEIYFFYRPKVEKQEVHSPDEVQRLYIILRPESGEKAV 60

Query: 61  EEKQCSYG-------GQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVT 120
           EEKQ +         GQ THTQEVNIE+QPLLRFIIMGRKSLPHP+ R+RPYWGFVDMVT
Sbjct: 61  EEKQSTSSSSTGTQRGQGTHTQEVNIEKQPLLRFIIMGRKSLPHPAQRARPYWGFVDMVT 120

Query: 121 TNVQDIKTALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPRNKNNNHHTHLIYKLQF 180
           T+VQDIK ALQG EYDTSTRGHRHISAARALGEGIYRILRHNP+NK   +HTHLIYKL+F
Sbjct: 121 TDVQDIKNALQGGEYDTSTRGHRHISAARALGEGIYRILRHNPKNK---YHTHLIYKLEF 180

Query: 181 PAADEKNEPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYY 240
           P+ DEKNEPQK FNIEREGSFVIQIKNP+QGGAGGS    KRRAQFPAHLQGQFGHK Y+
Sbjct: 181 PSEDEKNEPQKWFNIEREGSFVIQIKNPDQGGAGGSHQ--KRRAQFPAHLQGQFGHKGYH 240

Query: 241 PADPPEFLNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293
           PADPP++LNFEGCEFLLISASDDIE+ELGLEL TEGEECDLVKTFG+ V T+PLF+GTWV
Sbjct: 241 PADPPDYLNFEGCEFLLISASDDIEEELGLELTTEGEECDLVKTFGETVPTEPLFKGTWV 293

BLAST of CsGy4G017480 vs. NCBI nr
Match: KAG6581755.1 (hypothetical protein SDJN03_21757, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 430 bits (1106), Expect = 1.82e-149
Identity = 215/296 (72.64%), Positives = 250/296 (84.46%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MGEG E+ KT+AE   VEIQERGEIFF YRPKVGKQ+VHGPD+VQRLYIILRP+SGE+ V
Sbjct: 1   MGEG-EDSKTRAEAG-VEIQERGEIFFFYRPKVGKQQVHGPDDVQRLYIILRPESGERAV 60

Query: 61  EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK 120
           EEKQ      S  TQEVNIE+QPLLRF+IMGRKSLP+P+ + RPYWGFVDMVTTNVQD+K
Sbjct: 61  EEKQLP-NASSRRTQEVNIEKQPLLRFMIMGRKSLPNPAQKRRPYWGFVDMVTTNVQDVK 120

Query: 121 TALQGEEYDTSTRGHRHISAARALGEGIYRILRH---NPRNKNNNHHTHLIYKLQFPAAD 180
            ALQ  EYD+STRGHRHISAARA+GEGIYR++RH   + +    ++HTHLIYKL+FP+ D
Sbjct: 121 AALQEGEYDSSTRGHRHISAARAVGEGIYRLVRHKQPDTQKSKKSYHTHLIYKLEFPSED 180

Query: 181 EKNEPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADP 240
           E+NEPQ SFNI REGSF+I IKNP+  G G   S++KRRAQFPAHLQG+FGH R++PADP
Sbjct: 181 EENEPQNSFNIGREGSFLIMIKNPDVEGDG---SRNKRRAQFPAHLQGEFGHTRFHPADP 240

Query: 241 PEFLNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293
           P++LNFEGCEFLLISASDDIEQELGLEL T   ECDLVKTFG+  ST+PL +GTWV
Sbjct: 241 PDYLNFEGCEFLLISASDDIEQELGLELTTAPHECDLVKTFGETTSTQPLLKGTWV 290

BLAST of CsGy4G017480 vs. NCBI nr
Match: XP_022956009.1 (uncharacterized protein LOC111457833 isoform X1 [Cucurbita moschata])

HSP 1 Score: 429 bits (1103), Expect = 5.20e-149
Identity = 215/296 (72.64%), Positives = 249/296 (84.12%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MGEG EE KT+AE   VEIQERGEIFF YRPKVGKQ+VHGPD+VQRLYIILRP+SGE+ V
Sbjct: 1   MGEG-EESKTRAEAG-VEIQERGEIFFFYRPKVGKQQVHGPDDVQRLYIILRPESGERAV 60

Query: 61  EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK 120
           EEKQ      S  TQEVNIE+QPLLRF+IMGRKSLP+P+ + RPYWGFVDMVTTNVQD+K
Sbjct: 61  EEKQLP-NASSRRTQEVNIEKQPLLRFMIMGRKSLPNPAQKRRPYWGFVDMVTTNVQDVK 120

Query: 121 TALQGEEYDTSTRGHRHISAARALGEGIYRILRH---NPRNKNNNHHTHLIYKLQFPAAD 180
            ALQ  EYD+STRGHRHISAARA+GEGIYR++RH   + +    ++HTHLIYKL+FP+ D
Sbjct: 121 AALQEGEYDSSTRGHRHISAARAVGEGIYRLVRHKQPDTQKSKKSYHTHLIYKLEFPSED 180

Query: 181 EKNEPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADP 240
           E+NEPQ SFNI REGSF+I IKNP+  G G   S++KRRAQFPAHLQG+FGH R++PADP
Sbjct: 181 EENEPQNSFNIGREGSFLIMIKNPDVEGDG---SRNKRRAQFPAHLQGEFGHTRFHPADP 240

Query: 241 PEFLNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293
           P++LNFEGCEFLLISASDDIEQELGLEL T   ECDLVK FG+  ST+PL +GTWV
Sbjct: 241 PDYLNFEGCEFLLISASDDIEQELGLELTTAPHECDLVKMFGETTSTQPLLKGTWV 290

BLAST of CsGy4G017480 vs. ExPASy TrEMBL
Match: A0A5D3BUL8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold218G00100 PE=4 SV=1)

HSP 1 Score: 566 bits (1459), Expect = 1.59e-203
Identity = 278/293 (94.88%), Positives = 284/293 (96.93%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MGEGEEELKTKAE+HEVEIQERGEIFFLYRPKV KQEVH PDEVQRLYIILRP SGEKTV
Sbjct: 1   MGEGEEELKTKAEDHEVEIQERGEIFFLYRPKVEKQEVHSPDEVQRLYIILRPLSGEKTV 60

Query: 61  EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK 120
           EEKQC  GGQSTHTQEVNI++QPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQ+IK
Sbjct: 61  EEKQCKDGGQSTHTQEVNIKKQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQEIK 120

Query: 121 TALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPRNKNNNHHTHLIYKLQFPAADEKN 180
            ALQGEEYDTSTRGHRHISAARALGEGIYRILRHNP+NKNNNH THLIYKL+FPAADEKN
Sbjct: 121 IALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPKNKNNNH-THLIYKLEFPAADEKN 180

Query: 181 EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADPPEF 240
           EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRY PADPPEF
Sbjct: 181 EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYCPADPPEF 240

Query: 241 LNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293
           LNFEGCEFLLISASDDIEQELGLEL TEGEECDLVKTFGDAVSTKPLFEGTWV
Sbjct: 241 LNFEGCEFLLISASDDIEQELGLELFTEGEECDLVKTFGDAVSTKPLFEGTWV 292

BLAST of CsGy4G017480 vs. ExPASy TrEMBL
Match: A0A1S3B9M2 (uncharacterized protein LOC103487535 OS=Cucumis melo OX=3656 GN=LOC103487535 PE=4 SV=1)

HSP 1 Score: 566 bits (1459), Expect = 1.59e-203
Identity = 278/293 (94.88%), Positives = 284/293 (96.93%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MGEGEEELKTKAE+HEVEIQERGEIFFLYRPKV KQEVH PDEVQRLYIILRP SGEKTV
Sbjct: 1   MGEGEEELKTKAEDHEVEIQERGEIFFLYRPKVEKQEVHSPDEVQRLYIILRPLSGEKTV 60

Query: 61  EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK 120
           EEKQC  GGQSTHTQEVNI++QPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQ+IK
Sbjct: 61  EEKQCKDGGQSTHTQEVNIKKQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQEIK 120

Query: 121 TALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPRNKNNNHHTHLIYKLQFPAADEKN 180
            ALQGEEYDTSTRGHRHISAARALGEGIYRILRHNP+NKNNNH THLIYKL+FPAADEKN
Sbjct: 121 IALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPKNKNNNH-THLIYKLEFPAADEKN 180

Query: 181 EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADPPEF 240
           EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRY PADPPEF
Sbjct: 181 EPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYCPADPPEF 240

Query: 241 LNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293
           LNFEGCEFLLISASDDIEQELGLEL TEGEECDLVKTFGDAVSTKPLFEGTWV
Sbjct: 241 LNFEGCEFLLISASDDIEQELGLELFTEGEECDLVKTFGDAVSTKPLFEGTWV 292

BLAST of CsGy4G017480 vs. ExPASy TrEMBL
Match: A0A6J1GXU8 (uncharacterized protein LOC111457833 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457833 PE=4 SV=1)

HSP 1 Score: 429 bits (1103), Expect = 2.52e-149
Identity = 215/296 (72.64%), Positives = 249/296 (84.12%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MGEG EE KT+AE   VEIQERGEIFF YRPKVGKQ+VHGPD+VQRLYIILRP+SGE+ V
Sbjct: 1   MGEG-EESKTRAEAG-VEIQERGEIFFFYRPKVGKQQVHGPDDVQRLYIILRPESGERAV 60

Query: 61  EEKQCSYGGQSTHTQEVNIEEQPLLRFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIK 120
           EEKQ      S  TQEVNIE+QPLLRF+IMGRKSLP+P+ + RPYWGFVDMVTTNVQD+K
Sbjct: 61  EEKQLP-NASSRRTQEVNIEKQPLLRFMIMGRKSLPNPAQKRRPYWGFVDMVTTNVQDVK 120

Query: 121 TALQGEEYDTSTRGHRHISAARALGEGIYRILRH---NPRNKNNNHHTHLIYKLQFPAAD 180
            ALQ  EYD+STRGHRHISAARA+GEGIYR++RH   + +    ++HTHLIYKL+FP+ D
Sbjct: 121 AALQEGEYDSSTRGHRHISAARAVGEGIYRLVRHKQPDTQKSKKSYHTHLIYKLEFPSED 180

Query: 181 EKNEPQKSFNIEREGSFVIQIKNPEQGGAGGSSSQHKRRAQFPAHLQGQFGHKRYYPADP 240
           E+NEPQ SFNI REGSF+I IKNP+  G G   S++KRRAQFPAHLQG+FGH R++PADP
Sbjct: 181 EENEPQNSFNIGREGSFLIMIKNPDVEGDG---SRNKRRAQFPAHLQGEFGHTRFHPADP 240

Query: 241 PEFLNFEGCEFLLISASDDIEQELGLELITEGEECDLVKTFGDAVSTKPLFEGTWV 293
           P++LNFEGCEFLLISASDDIEQELGLEL T   ECDLVK FG+  ST+PL +GTWV
Sbjct: 241 PDYLNFEGCEFLLISASDDIEQELGLELTTAPHECDLVKMFGETTSTQPLLKGTWV 290

BLAST of CsGy4G017480 vs. ExPASy TrEMBL
Match: A0A6J5UCV4 (Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS18607 PE=4 SV=1)

HSP 1 Score: 400 bits (1029), Expect = 1.31e-137
Identity = 211/324 (65.12%), Positives = 244/324 (75.31%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MG+G +E+KT+A+  +VEIQERGEIFF YRPKV K+E H PD+VQRLYI+LRP+SGE+ +
Sbjct: 1   MGQG-DEVKTRADA-QVEIQERGEIFFFYRPKVNKEEAHSPDDVQRLYIVLRPESGERPI 60

Query: 61  EEKQC-------------------SYGGQSTH----TQEVNIEEQPLLRFIIMGRKSLPH 120
           EEKQ                    S GGQS+      QEVNIE+QPLLRFI+MGRKSLP 
Sbjct: 61  EEKQDPDSGKEGAKKKRPNSGEKGSGGGQSSEGGHGRQEVNIEKQPLLRFIVMGRKSLPD 120

Query: 121 PSHRSRPYWGFVDMVTTNVQDIKTALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPR 180
           PS + RPYWGFV+MVTTN+ D+KTALQGEEYDT T GHRH SAARALGEGIYRI+RH   
Sbjct: 121 PSKKGRPYWGFVEMVTTNIDDVKTALQGEEYDTKTEGHRHTSAARALGEGIYRIVRHKEG 180

Query: 181 NKNNNHHTHLIYKLQFPAADEKNEPQKSFNIEREGSFVIQIKNPEQGGAGGSSS----QH 240
            K    HTHLIYKL+FP  DE NEPQ+S NI+ EGSF IQIKNP+Q G+  +S     Q+
Sbjct: 181 KKKP--HTHLIYKLEFPPEDENNEPQESLNIKHEGSFHIQIKNPDQHGSSSTSQFRGLQN 240

Query: 241 KRRAQFPAHLQGQFGHKRYYPADPPEFLNFEGCEFLLISASDDIEQELGLELITEGE--- 293
            RRA FPAHLQGQFG+ RY PADPP+FLN+EGCEFLLISASDDIE+ELGLEL TEGE   
Sbjct: 241 NRRAMFPAHLQGQFGNLRYCPADPPDFLNYEGCEFLLISASDDIEEELGLELQTEGEAVE 300

BLAST of CsGy4G017480 vs. ExPASy TrEMBL
Match: A0A6J5WRF5 (Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=ORAREDHAP_LOCUS18444 PE=4 SV=1)

HSP 1 Score: 399 bits (1025), Expect = 5.33e-137
Identity = 211/324 (65.12%), Positives = 243/324 (75.00%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MG+G +E+KT+A+  +VEIQERGEIFF YRPKV K+E H PD+VQRLYI+LRP+SGE+ +
Sbjct: 1   MGQG-DEVKTRADA-QVEIQERGEIFFFYRPKVNKEEAHSPDDVQRLYIVLRPESGERPI 60

Query: 61  EEKQC-------------------SYGGQSTH----TQEVNIEEQPLLRFIIMGRKSLPH 120
           EEKQ                    S GGQS+      QEVNIE+QPLLRFI+MGRKSLP 
Sbjct: 61  EEKQDPDSGKEGAKKKRPNSGEKGSGGGQSSEGGHGRQEVNIEKQPLLRFIVMGRKSLPD 120

Query: 121 PSHRSRPYWGFVDMVTTNVQDIKTALQGEEYDTSTRGHRHISAARALGEGIYRILRHNPR 180
           PS + RPYWGFV+MVTTN+ D+KTALQGEEYDT T GHRH SAARALGEGIYRI+RH   
Sbjct: 121 PSKKGRPYWGFVEMVTTNIDDVKTALQGEEYDTKTEGHRHTSAARALGEGIYRIVRHKEG 180

Query: 181 NKNNNHHTHLIYKLQFPAADEKNEPQKSFNIEREGSFVIQIKNPEQGGAGGSSS----QH 240
            K    HTHLIYKL+FP  DE NEPQ+S NI+ EGSF IQIKNP+Q G+  +S     Q+
Sbjct: 181 KKKP--HTHLIYKLEFPPEDENNEPQESLNIKHEGSFHIQIKNPDQHGSSSTSQFRGLQN 240

Query: 241 KRRAQFPAHLQGQFGHKRYYPADPPEFLNFEGCEFLLISASDDIEQELGLELITEGE--- 293
            RRA FPAHLQGQFG+ RY PADPP+FLN+EGCEFLLISASDDIE ELGLEL TEGE   
Sbjct: 241 NRRAMFPAHLQGQFGNLRYCPADPPDFLNYEGCEFLLISASDDIEGELGLELQTEGEAVE 300

BLAST of CsGy4G017480 vs. TAIR 10
Match: AT1G16770.1 (unknown protein; Has 109 Blast hits to 109 proteins in 52 species: Archae - 0; Bacteria - 4; Metazoa - 0; Fungi - 71; Plants - 32; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 366.3 bits (939), Expect = 2.3e-101
Identity = 190/326 (58.28%), Positives = 234/326 (71.78%), Query Frame = 0

Query: 1   MGEGEEELKTKAENHEVEIQERGEIFFLYRPKVGKQEVHGPDEVQRLYIILRPQSGEKTV 60
           MG+G +E+KT+ +  +VEIQERGEIFF YRPKV K+E H  D+VQRLYI++RP+SGE   
Sbjct: 1   MGQG-KEVKTRPD-PQVEIQERGEIFFFYRPKVNKEEAHSVDDVQRLYIVMRPESGENPT 60

Query: 61  EEKQ------------------------CSYGGQSTH-TQEVNIEEQPLLRFIIMGRKSL 120
           EEKQ                            G+  H  ++VNIE+Q LLRFI+MG+KSL
Sbjct: 61  EEKQDPLSGKEGSDKDSGDGEASGSSSGAKNQGEGGHGVEKVNIEKQLLLRFIVMGKKSL 120

Query: 121 PHPSHRSRPYWGFVDMVTTNVQDIKTALQGEEYDTSTRGHRHISAARALGEGIYRILRHN 180
           P PS +S+P+WGFV+MVTTNV+D+K AL+GEEY+T TRGHRH   ARA+GEGIYRILRH 
Sbjct: 121 PDPSKKSQPFWGFVEMVTTNVEDVKNALKGEEYETKTRGHRHKPPARAVGEGIYRILRHK 180

Query: 181 PRNKNNNHHTHLIYKLQFPAADE--KNEPQKSFNIEREGSFVIQIKNPEQGGAGGS---S 240
           P N    HHTHL+YKL+FP+  +  ++EPQ+S NIE EGSF+IQI+NPEQGG G S    
Sbjct: 181 P-NPTRKHHTHLVYKLEFPSVSQTREHEPQESLNIEPEGSFLIQIRNPEQGGGGRSGFGG 240

Query: 241 SQHKRRAQFPAHLQGQFGHKRYYPADPPEFLNFEGCEFLLISASDDIEQELGLELITEGE 293
            Q KR+AQFP H+Q   GH R+ PADPP+FLN+EGCE LLISASDDIE+ELG+EL  EG+
Sbjct: 241 LQRKRKAQFPVHIQAHLGHTRFGPADPPDFLNYEGCELLLISASDDIEEELGMELEPEGD 300

BLAST of CsGy4G017480 vs. TAIR 10
Match: AT1G16770.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; Has 103 Blast hits to 103 proteins in 50 species: Archae - 0; Bacteria - 4; Metazoa - 0; Fungi - 65; Plants - 32; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 307.4 bits (786), Expect = 1.3e-83
Identity = 158/276 (57.25%), Positives = 193/276 (69.93%), Query Frame = 0

Query: 51  LRPQSGEKTVEEKQ------------------------CSYGGQSTH-TQEVNIEEQPLL 110
           +RP+SGE   EEKQ                            G+  H  ++VNIE+Q LL
Sbjct: 1   MRPESGENPTEEKQDPLSGKEGSDKDSGDGEASGSSSGAKNQGEGGHGVEKVNIEKQLLL 60

Query: 111 RFIIMGRKSLPHPSHRSRPYWGFVDMVTTNVQDIKTALQGEEYDTSTRGHRHISAARALG 170
           RFI+MG+KSLP PS +S+P+WGFV+MVTTNV+D+K AL+GEEY+T TRGHRH   ARA+G
Sbjct: 61  RFIVMGKKSLPDPSKKSQPFWGFVEMVTTNVEDVKNALKGEEYETKTRGHRHKPPARAVG 120

Query: 171 EGIYRILRHNPRNKNNNHHTHLIYKLQFPAADE--KNEPQKSFNIEREGSFVIQIKNPEQ 230
           EGIYRILRH P N    HHTHL+YKL+FP+  +  ++EPQ+S NIE EGSF+IQI+NPEQ
Sbjct: 121 EGIYRILRHKP-NPTRKHHTHLVYKLEFPSVSQTREHEPQESLNIEPEGSFLIQIRNPEQ 180

Query: 231 GGAGGS---SSQHKRRAQFPAHLQGQFGHKRYYPADPPEFLNFEGCEFLLISASDDIEQE 290
           GG G S     Q KR+AQFP H+Q   GH R+ PADPP+FLN+EGCE LLISASDDIE+E
Sbjct: 181 GGGGRSGFGGLQRKRKAQFPVHIQAHLGHTRFGPADPPDFLNYEGCELLLISASDDIEEE 240

Query: 291 LGLELITEGE----ECDLVKTFGDAVSTKPLFEGTW 293
           LG+EL  EG+     CDL+KTFGD V   PL  GTW
Sbjct: 241 LGMELEPEGDGEESTCDLLKTFGDDVEATPLLRGTW 275

The following BLAST results are available for this feature:

NCBI nr
Match Name       E-value     Identity  Description
XP_031740485.1   7.88e-218   100.00    uncharacterized protein LOC101213393 [Cucumis sativus] >KAE8649641.1 hypothetica...
XP_008444096.1   3.28e-203   94.88     PREDICTED: uncharacterized protein LOC103487535 [Cucumis melo] >KAA0064208.1 unc...
XP_038895444.1   9.76e-171   81.33     uncharacterized protein LOC120083676 [Benincasa hispida]
KAG6581755.1     1.82e-149   72.64     hypothetical protein SDJN03_21757, partial [Cucurbita argyrosperma subsp. sorori...
XP_022956009.1   5.20e-149   72.64     uncharacterized protein LOC111457833 isoform X1 [Cucurbita moschata]

ExPASy TrEMBL
Match Name       E-value     Identity  Description
A0A5D3BUL8       1.59e-203   94.88     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B9M2       1.59e-203   94.88     uncharacterized protein LOC103487535 OS=Cucumis melo OX=3656 GN=LOC103487535 PE=...
A0A6J1GXU8       2.52e-149   72.64     uncharacterized protein LOC111457833 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J5UCV4       1.31e-137   65.12     Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS18607 PE=4 S...
A0A6J5WRF5       5.33e-137   65.12     Uncharacterized protein OS=Prunus armeniaca OX=36596 GN=ORAREDHAP_LOCUS18444 PE=...

TAIR 10
Match Name       E-value     Identity  Description
AT1G16770.1      2.3e-101    58.28     unknown protein; Has 109 Blast hits to 109 proteins in 52 species: Archae - 0; B...
AT1G16770.2      1.3e-83     57.25     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source    Source Term   Source Description   Alignment
None       No IPR available   PANTHER   PTHR34776     F17F16.3 PROTEIN     coord: 1..292

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CsGy4G017480.2   CsGy4G017480.2   mRNA