CmoCh19G000400 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh19G000400
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Unknown protein
Location: Cmo_Chr19: 219538 .. 222258 (+)
RNA-Seq Expression: CmoCh19G000400
Synteny: CmoCh19G000400
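The Location field in the overview can be turned into a gene span with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cmo_Chr19: 219538 .. 222258 (+)'.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr19: 219538 .. 222258 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2721 bp on the plus strand of Cmo_Chr19.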
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGAATTTACTTCTAAATCTTCATTTCCCTGAAGACTAGCCGTTGCAACCTTCTTCCTCTACCTCATGGAACCGTCTGTGGAGTCTCCATCATCATCTTCACATCCCAACCAGGTTCGTTTACTGGACACCCTTGTCGTGTCGGATTTTCATTTTCTGATGTATTTTCAGCTGATTTCTTAGGGACCGACCAGCTTCATGGGTTCGCCCCCATTGTCCAGCCCTGATTCCGACCAGCGTTTCTGGAGCTCCCTTCTAGGTCGGGTCGACTCGCTCCTTGAGCAACGAAATGCCAAATCTTCAAATCTTGTGAGTGCTTTTCTCTTCGAGTTCTTCGTTTTCGAGCTAAGCGTTTTTCCATGTCAATGGAGCTTGAAAATTCATGGGTTTGTGGGGATAGAACACGAACAAATCGGATAGGGCAAAGAGATTGAAGGAAGATTCTTTGCTTTTGATGAGAGGATTCGACTCTGTTGGCTATACTCTGTCTCAGCTTTCCAACAATTTGGATAACGCCTTGCAGGTAAACGCCACTCTCCACTCTTCCGTTTTTTGTATTCTGTACTGCGAATTGTTGGGAGGGAGTCTCACATGGGCTAATTTAGGGAATGATCGTAAGTTTACAAGTAGGAGATACATCTCCACTTGTACGAGGCCTTTTGGCCGCCCAAAGCAATGCCATGAGAGCCTATGCTCAAAGTGGACAATATGATACTACTGTGGAGAGTCGTGATTCCTAACATGGTATTAGAGCCATGCCCTTAACTTACCATGTCAATAGAATTCTCAAATGTGGAACAAAGAAGTTGTGAGCCTCAAAGGCGTAGTCAAAAGTGACTTGAATATCGAACAAAGGGTGTACTTTGTTCAAGGACTCCAGAGAAGGAGTCGAGCCTCGATTAAGGGGAGAGGCTGTTCGAGGGCTTCATAGGCCTCAGGAGAGGCTCTATAGTGTACTTTGTTCGAGGAGAGGATGGTTGGGAGGAAATCCTGCGTTGAATAATTTAGGGAATGATCATAGGTTTATAAGTAAAGAATACACCTTCATTGGTATGAGGCCTTTTGGGGGAGTCCAAAGTAAAGCCACGGGAGCTTATGCTCAAAGTGGACAATATCACACTATTGTGCAGAGTCGTGATTCCTAGCCCCACAAAACCAAGATTTGATTTCGAGGGCTGATTCCAGGGCGCTAAGGATCTGGTTCAAGCACCAACCTTGACGGAGATCTTCCAAAGCAACCTCAAGAACTCCGAGGTTGAGCATGAGTTGGTGGAGCTCAAGCAAGCAACAAAGAGAAAATATGATGATACTCATTGCTCAGAAGATTCAGAGGTTAATTTAGAGAAAGAAAACCGGCAAAACCCAACAGACAACCTTAAAAAGGCTAAACATGTTGGTTCCTTACCTCTGTCATCTGTATTTGAACAAGTTCTTATGCAATATTAACCTTTTGGCCTGCGTTTCTCCAGCTTGCAGTTGCAATGGCAACAAAAACATCATCTCTAGCAAGAGAATTAAAATCATTAAAATCCAATATATGTTTTATGCAAGAACGATGCGCCATACTTGATGAAGAGAATAGGAGACTTCGAAACGGTGTTTCCACCGGGGTCAAACCGGAAGAAGATGATCTGGTTCGTACTATTCTTATGATGATTATTTGTTCTTCCTTCCCACTTTTGTTTATGTGTAGTCTATGTTGATGAGAGGAAGTCCCGCTTATACCCAAAGTAGACAATATTATACCATTGTGGAGAGTCGTGATTCCAAACATAGTACCAGAGCCATACCCTTAACTTAGCTATGTCAATAGAATCCTCAAATGTCGAACAAAAAAGTTGTGAGCCTTGAAGATGTAACCAAAAGTGACTTAAGTTTCGAACAAGAGGTGTACTTTGTTCGAGGACTCCAAAGAAAGAGTTGAGTCTCGATTAAGGGGAGGCTGATCGAGGGCTCCATAGGCCTCAGGGGAGGCTCTATAGTGTACTTTGTTTGAGGAGAA
GATTGTTGGGAGGGAGTCCCACATTGGCTAGTTTAGGGAATGATCATGAGTTTATAAGTAATAAATATATCTCCGTTAGTATGAGGTCCTTTGGGGAAGCCTAAAACAAAGTCACGAGAGCTTATGCTCAAAGTGACAATATTATACCATTATGGAGAGTCGTGATTCCTAACTGTCTATTTGGTGATAAATCAGGTTAGGCTTCAAATGGAGGCACTGCTAGCTGAGAAATCAAGATTAGCAAACGAAAATGCAAACTTAACAAGGGAAAATCAATGCCTTTACCAGCTTGCGGAGTATCACCAGCTCACATCCCAAGATCTTTCGCTATCTTACGACGAAGCCATCGGCATGTGTTTGGACTTCTCGTCACCACCACCTGCCATTGTCGAAGGAGCTGAAGGACAAGGACAAGGACAGGGACGAGCTGAAGGACAGGGACAAGGACAAAGACAGGGACAGGGACAATGTGACTATTGACAAAGAAATTACTCAAACATCTAGAGCTGATCGTTTTAGCGTTTCTACCTCTCTTGATGAACTCCGCCAAGAGTAGAGGTGATTCCGGACCAGAAAACGCAGGCTCGAACATTGGTTGCTTCTCGTTCACAATACTGTAACCCAGTTAACTCAATCTTCACTGCATTTAGTTCAGAAATAAATCTATGGTCATAGCATTTTCTACACTGTCGTATTCGTCTACTATCAAACCAGGTTCCAT

mRNA sequence

TGAATTTACTTCTAAATCTTCATTTCCCTGAAGACTAGCCGTTGCAACCTTCTTCCTCTACCTCATGGAACCGTCTGTGGAGTCTCCATCATCATCTTCACATCCCAACCAGGGACCGACCAGCTTCATGGGTTCGCCCCCATTGTCCAGCCCTGATTCCGACCAGCGTTTCTGGAGCTCCCTTCTAGGTCGGGTCGACTCGCTCCTTGAGCAACGAAATGCCAAATCTTCAAATCTTAACACGAACAAATCGGATAGGGCAAAGAGATTGAAGGAAGATTCTTTGCTTTTGATGAGAGGATTCGACTCTGTTGGCTATACTCTGTCTCAGCTTTCCAACAATTTGGATAACGCCTTGCAGGGCGCTAAGGATCTGGTTCAAGCACCAACCTTGACGGAGATCTTCCAAAGCAACCTCAAGAACTCCGAGGTTGAGCATGAGTTGGTGGAGCTCAAGCAAGCAACAAAGAGAAAATATGATGATACTCATTGCTCAGAAGATTCAGAGCTTGCAGTTGCAATGGCAACAAAAACATCATCTCTAGCAAGAGAATTAAAATCATTAAAATCCAATATATGTTTTATGCAAGAACGATGCGCCATACTTGATGAAGAGAATAGGAGACTTCGAAACGGTGTTTCCACCGGGGTCAAACCGGAAGAAGATGATCTGGTTAGGCTTCAAATGGAGGCACTGCTAGCTGAGAAATCAAGATTAGCAAACGAAAATGCAAACTTAACAAGGGAAAATCAATGCCTTTACCAGCTTGCGGAGTATCACCAGCTCACATCCCAAGATCTTTCGCTATCTTACGACGAAGCCATCGGCATGTGTTTGGACTTCTCGTCACCACCACCTGCCATTGTCGAAGGAGCTGAAGGACAAGGACAAGGACAGGGACGAGCTGAAGGACAGGGACAAGGACAAAGACAGGGACAGGGACAATGTGACTATTGACAAAGAAATTACTCAAACATCTAGAGCTGATCGTTTTAGCGTTTCTACCTCTCTTGATGAACTCCGCCAAGAGTAGAGGTGATTCCGGACCAGAAAACGCAGGCTCGAACATTGGTTGCTTCTCGTTCACAATACTGTAACCCAGTTAACTCAATCTTCACTGCATTTAGTTCAGAAATAAATCTATGGTCATAGCATTTTCTACACTGTCGTATTCGTCTACTATCAAACCAGGTTCCAT

Coding sequence (CDS)

ATGGAACCGTCTGTGGAGTCTCCATCATCATCTTCACATCCCAACCAGGGACCGACCAGCTTCATGGGTTCGCCCCCATTGTCCAGCCCTGATTCCGACCAGCGTTTCTGGAGCTCCCTTCTAGGTCGGGTCGACTCGCTCCTTGAGCAACGAAATGCCAAATCTTCAAATCTTAACACGAACAAATCGGATAGGGCAAAGAGATTGAAGGAAGATTCTTTGCTTTTGATGAGAGGATTCGACTCTGTTGGCTATACTCTGTCTCAGCTTTCCAACAATTTGGATAACGCCTTGCAGGGCGCTAAGGATCTGGTTCAAGCACCAACCTTGACGGAGATCTTCCAAAGCAACCTCAAGAACTCCGAGGTTGAGCATGAGTTGGTGGAGCTCAAGCAAGCAACAAAGAGAAAATATGATGATACTCATTGCTCAGAAGATTCAGAGCTTGCAGTTGCAATGGCAACAAAAACATCATCTCTAGCAAGAGAATTAAAATCATTAAAATCCAATATATGTTTTATGCAAGAACGATGCGCCATACTTGATGAAGAGAATAGGAGACTTCGAAACGGTGTTTCCACCGGGGTCAAACCGGAAGAAGATGATCTGGTTAGGCTTCAAATGGAGGCACTGCTAGCTGAGAAATCAAGATTAGCAAACGAAAATGCAAACTTAACAAGGGAAAATCAATGCCTTTACCAGCTTGCGGAGTATCACCAGCTCACATCCCAAGATCTTTCGCTATCTTACGACGAAGCCATCGGCATGTGTTTGGACTTCTCGTCACCACCACCTGCCATTGTCGAAGGAGCTGAAGGACAAGGACAAGGACAGGGACGAGCTGAAGGACAGGGACAAGGACAAAGACAGGGACAGGGACAATGTGACTATTGA

Protein sequence

MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNTNKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKNSEVEHELVELKQATKRKYDDTHCSEDSELAVAMATKTSSLARELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQLTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGRAEGQGQGQRQGQGQCDY
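The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: the function name `translate` is hypothetical, the standard code (NCBI translation table 1) is assumed to apply to this nuclear gene, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAACCGTCTGTGGAGTCTCCATCATCA"  # first 30 nt of the CDS
cds_end = "CAATGTGACTATTGA"                   # last 15 nt of the CDS, including the stop codon

print(translate(cds_start))  # MEPSVESPSS
print(translate(cds_end))    # QCDY
```

Both fragments agree with the start (MEPSVESPSS...) and end (...QCDY) of the listed protein sequence. Passing the full CDS string to `translate` would give the complete check.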
Homology
BLAST of CmoCh19G000400 vs. ExPASy TrEMBL
Match: A0A6J1HHY1 (uncharacterized protein LOC111463807 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463807 PE=4 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 4.5e-153
Identity = 297/317 (93.69%), Positives = 297/317 (93.69%), Query Frame = 0

Query: 1   MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT 60
           MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT
Sbjct: 1   MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT 60

Query: 61  NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKN 120
           NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKN
Sbjct: 61  NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKN 120

Query: 121 SEVEHELVELKQATKRKYDDTHCSEDSE--------------------LAVAMATKTSSL 180
           SEVEHELVELKQATKRKYDDTHCSEDSE                    LAVAMATKTSSL
Sbjct: 121 SEVEHELVELKQATKRKYDDTHCSEDSEVNLEKENRQNPTDNLKKAKHLAVAMATKTSSL 180

Query: 181 ARELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLAN 240
           ARELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLAN
Sbjct: 181 ARELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLAN 240

Query: 241 ENANLTRENQCLYQLAEYHQLTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGR 298
           ENANLTRENQCLYQLAEYHQLTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGR
Sbjct: 241 ENANLTRENQCLYQLAEYHQLTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGR 300
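The percentages on the Identity and Positives lines are the match counts divided by the alignment length (which includes gap columns). A minimal sketch of that arithmetic, using a hypothetical helper `percent` and the counts reported for the hits on this page:

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage string with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}"

# Counts taken from the HSP summary lines of this record.
print(percent(297, 317))  # identity, A0A6J1HHY1: 93.69
print(percent(279, 302))  # identity, A0A6J1HK88: 92.38
print(percent(281, 302))  # positives, A0A6J1HK88: 93.05
print(percent(156, 302))  # identity, AT4G02800.1: 51.66
```

Positives count identical residues plus conservative substitutions (the `+` symbols in the alignment midline), so the Positives percentage is never lower than the Identity percentage.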

BLAST of CmoCh19G000400 vs. ExPASy TrEMBL
Match: A0A6J1HK88 (uncharacterized protein LOC111463807 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111463807 PE=4 SV=1)

HSP 1 Score: 519.2 bits (1336), Expect = 1.1e-143
Identity = 279/302 (92.38%), Positives = 281/302 (93.05%), Query Frame = 0

Query: 1   MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT 60
           MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT
Sbjct: 1   MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT 60

Query: 61  NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKN 120
           NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKN
Sbjct: 61  NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKN 120

Query: 121 SEVEHELVELKQATKRKYDDTHCSEDSE--------------------LAVAMATKTSSL 180
           SEVEHELVELKQATKRKYDDTHCSEDSE                    LAVAMATKTSSL
Sbjct: 121 SEVEHELVELKQATKRKYDDTHCSEDSEVNLEKENRQNPTDNLKKAKHLAVAMATKTSSL 180

Query: 181 ARELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLAN 240
           ARELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLAN
Sbjct: 181 ARELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLAN 240

Query: 241 ENANLTRENQCLYQLAEYHQLTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGR 283
           ENANLTRENQCLYQLAEYHQLTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQG+
Sbjct: 241 ENANLTRENQCLYQLAEYHQLTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGQ 300

BLAST of CmoCh19G000400 vs. ExPASy TrEMBL
Match: A0A6J1HG94 (uncharacterized protein LOC111463807 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111463807 PE=4 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 1.4e-141
Identity = 276/296 (93.24%), Positives = 276/296 (93.24%), Query Frame = 0

Query: 22  MGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNTNKSDRAKRLKEDSLLLMRGFD 81
           MGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNTNKSDRAKRLKEDSLLLMRGFD
Sbjct: 1   MGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNTNKSDRAKRLKEDSLLLMRGFD 60

Query: 82  SVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKNSEVEHELVELKQATKRKYDDT 141
           SVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKNSEVEHELVELKQATKRKYDDT
Sbjct: 61  SVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKNSEVEHELVELKQATKRKYDDT 120

Query: 142 HCSEDSE--------------------LAVAMATKTSSLARELKSLKSNICFMQERCAIL 201
           HCSEDSE                    LAVAMATKTSSLARELKSLKSNICFMQERCAIL
Sbjct: 121 HCSEDSEVNLEKENRQNPTDNLKKAKHLAVAMATKTSSLARELKSLKSNICFMQERCAIL 180

Query: 202 DEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQL 261
           DEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQL
Sbjct: 181 DEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQL 240

Query: 262 TSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGRAEGQGQGQRQGQGQCDY 298
           TSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGRAEGQGQGQRQGQGQCDY
Sbjct: 241 TSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGRAEGQGQGQRQGQGQCDY 296

BLAST of CmoCh19G000400 vs. ExPASy TrEMBL
Match: A0A6J1HWL1 (uncharacterized protein LOC111466940 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466940 PE=4 SV=1)

HSP 1 Score: 481.5 bits (1238), Expect = 2.6e-132
Identity = 260/300 (86.67%), Positives = 271/300 (90.33%), Query Frame = 0

Query: 1   MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT 60
           MEPSVESPSSSSHPNQGPTSFMGSPPL SPDSD+RFWSSL GRVDSLLEQRNAKSSNLN 
Sbjct: 1   MEPSVESPSSSSHPNQGPTSFMGSPPLPSPDSDKRFWSSLRGRVDSLLEQRNAKSSNLNM 60

Query: 61  NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQA-PTLTEIFQSNLK 120
           NKS+R KRLKEDSLLL+RGFDSVGYTLSQLSNNLDNALQGAKDLVQA PTLTEIFQSNLK
Sbjct: 61  NKSERVKRLKEDSLLLLRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPPTLTEIFQSNLK 120

Query: 121 NSEVEHELVELKQATKRKYDDTHCSEDS-----------------ELAVAMATKTSSLAR 180
           NSEVEHE +E KQATKRKYDDTHCSED+                  LAVAMATKT+SLAR
Sbjct: 121 NSEVEHEFLESKQATKRKYDDTHCSEDNLEKENQQNPTDNLKKAKNLAVAMATKTASLAR 180

Query: 181 ELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANEN 240
           ELKSLKSNICFMQERCAILDEENRRLRNGVSTGV+PEEDDLVRLQMEALLAEKSRLANEN
Sbjct: 181 ELKSLKSNICFMQERCAILDEENRRLRNGVSTGVRPEEDDLVRLQMEALLAEKSRLANEN 240

Query: 241 ANLTRENQCLYQLAEYHQLTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGRAE 283
           ANLTRENQCLYQLAEYHQLTSQDL LSYDEAIGMCLDFS+ PPAIV+ AEGQGQGQG+ +
Sbjct: 241 ANLTRENQCLYQLAEYHQLTSQDLLLSYDEAIGMCLDFSASPPAIVKEAEGQGQGQGQRD 300

BLAST of CmoCh19G000400 vs. ExPASy TrEMBL
Match: A0A6J1HFE4 (uncharacterized protein LOC111463807 isoform X4 OS=Cucurbita moschata OX=3662 GN=LOC111463807 PE=4 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 1.6e-121
Identity = 248/297 (83.50%), Positives = 248/297 (83.50%), Query Frame = 0

Query: 1   MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT 60
           MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT
Sbjct: 1   MEPSVESPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT 60

Query: 61  NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKN 120
           NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQ                     
Sbjct: 61  NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQ--------------------- 120

Query: 121 SEVEHELVELKQATKRKYDDTHCSEDSELAVAMATKTSSLARELKSLKSNICFMQERCAI 180
                                       LAVAMATKTSSLARELKSLKSNICFMQERCAI
Sbjct: 121 ----------------------------LAVAMATKTSSLARELKSLKSNICFMQERCAI 180

Query: 181 LDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQ 240
           LDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQ
Sbjct: 181 LDEENRRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQ 240

Query: 241 LTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGRAEGQGQGQRQGQGQCDY 298
           LTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGRAEGQGQGQRQGQGQCDY
Sbjct: 241 LTSQDLSLSYDEAIGMCLDFSSPPPAIVEGAEGQGQGQGRAEGQGQGQRQGQGQCDY 248

BLAST of CmoCh19G000400 vs. TAIR 10
Match: AT4G02800.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 3209 Blast hits to 2720 proteins in 308 species: Archae - 13; Bacteria - 213; Metazoa - 1207; Fungi - 247; Plants - 183; Viruses - 21; Other Eukaryotes - 1325 (source: NCBI BLink). )

HSP 1 Score: 260.0 bits (663), Expect = 2.3e-69
Identity = 156/302 (51.66%), Positives = 206/302 (68.21%), Query Frame = 0

Query: 1   MEPSVESPSSSSHPNQG-------PTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNA 60
           M  SVE+PS +   N+G        TSF  S P  SP SD+R WS++  RVD LLE+ N+
Sbjct: 1   MAASVETPSPNHTNNEGTRLNMVSATSFDSSSPSVSPSSDKRLWSNVRNRVDVLLEE-NS 60

Query: 61  KSSNLNTN----KSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPT 120
           K+    TN    +S+R+KR K DS+LL++GFDSV +TLS LS+NLDNALQG ++L + P+
Sbjct: 61  KNHKPVTNTIAIESERSKRFKNDSMLLLKGFDSVSHTLSLLSSNLDNALQGVRELAKPPS 120

Query: 121 LTEIFQSNLKNSEV------EHELVELKQATKRKYD-DTHCSEDS--------------- 180
            +EI  SNLK  ++      E E  E  +  KRK++ D   +EDS               
Sbjct: 121 YSEILHSNLKADQIQRQQKEEDEEEEESKGKKRKHESDVEQTEDSSNEEEKRPKERKIMK 180

Query: 181 ---ELAVAMATKTSSLARELKSLKSNICFMQERCAILDEENRRLRNGVSTGVKPEEDDLV 240
               +A++MA K +SLARELK++KS++ F+QERC +L+EEN+RLR+G   GV+PEEDDLV
Sbjct: 181 KAKNIAISMAAKANSLARELKTIKSDLSFIQERCGLLEEENKRLRDGFVKGVRPEEDDLV 240

Query: 241 RLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQLTSQDLSLSYDEAI-GMCLDFSSP 266
           RLQ+E LLAEK+RLANENANL RENQCL+Q+ EYHQ+TSQDLS SY++ + G CLDFSSP
Sbjct: 241 RLQLEVLLAEKARLANENANLVRENQCLHQMVEYHQITSQDLSPSYEQVVQGFCLDFSSP 300

BLAST of CmoCh19G000400 vs. TAIR 10
Match: AT5G01970.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G30050.1); Has 240 Blast hits to 236 proteins in 72 species: Archae - 0; Bacteria - 15; Metazoa - 51; Fungi - 19; Plants - 119; Viruses - 0; Other Eukaryotes - 36 (source: NCBI BLink). )

HSP 1 Score: 105.1 bits (261), Expect = 9.6e-23
Identity = 87/263 (33.08%), Positives = 136/263 (51.71%), Query Frame = 0

Query: 24  SPPL-SSPDSDQ-----------RFWSSLLGRVDSLLEQRNAKSSNLNTNKS-DRAKRL- 83
           SPP  SSP  DQ             W  +  +  S++E  + KSS+ +T  S  R   L 
Sbjct: 34  SPPARSSPAFDQPRSKNFTTEPKGLWGVIAQKAKSVIE--DDKSSDRSTTASQSRFSYLS 93

Query: 84  -----KEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLK----- 143
                K D+  L RG D +  +L+Q+ +  + A +  + LV+  T  +I Q   K     
Sbjct: 94  DEGFKKMDNPKLRRGLDKLTSSLNQIGDTFEKAFEDGRTLVENKT-ADIIQETRKLQTRR 153

Query: 144 -----NSEVEHELVELKQATKRKYD----------DTHCSEDSELAVAMATKTSSLAREL 203
                  E +++   +  + K+  +          +T      ++A+A A K   L REL
Sbjct: 154 RGTGGEDENQNQSYGVSSSWKKSPEQPMQLNHIEHETQLKASRDVAMATAAKAKLLLREL 213

Query: 204 KSLKSNICFMQERCAILDEENRRLRNG-VSTGVKPEEDDLVRLQMEALLAEKSRLANENA 247
           K++K+++ F +ERCA L+EEN+ LR      G  P ++DL+RLQ+E+LLAEK+RLA+EN+
Sbjct: 214 KTVKADLAFAKERCAQLEEENKHLRESHREKGSNPADEDLIRLQLESLLAEKARLAHENS 273

BLAST of CmoCh19G000400 vs. TAIR 10
Match: AT2G30530.1 (unknown protein; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 5513 Blast hits to 872 proteins in 154 species: Archae - 0; Bacteria - 30; Metazoa - 615; Fungi - 144; Plants - 149; Viruses - 12; Other Eukaryotes - 4563 (source: NCBI BLink). )

HSP 1 Score: 98.2 bits (243), Expect = 1.2e-20
Identity = 81/247 (32.79%), Positives = 131/247 (53.04%), Query Frame = 0

Query: 7   SPSSSSHPNQGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNTNKSDRA 66
           S  S + P +G    + S   +  D D     + L +    +EQ    ++   T ++ + 
Sbjct: 97  SMKSLNEPKRGFWGSLASKAKAFLDEDD---PNQLPQSPKRMEQSIPSATTSGTKEAGQT 156

Query: 67  KRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTEIFQSNLKNSEVEHE 126
            R K ++  L R  D++  +L+ +   +   ++     V+  T   I Q   K  + +  
Sbjct: 157 GR-KSENPSLQRRLDAITSSLNYIGGTIGTVVEEGITAVENRT-AGIIQETRKKIKKKPS 216

Query: 127 LVELKQATKRKYD-DTHCSEDSELAVAMATKTSSLARELKSLKSNICFMQERCAILDEEN 186
           L   +Q  + + D +       ++A+AMA K   L RELK +KS++ F ++RCA L+EEN
Sbjct: 217 LTRNQQNPEIQADLEIQLKASRDVAMAMAAKAKLLLRELKMVKSDLAFAKQRCAQLEEEN 276

Query: 187 RRLRNGVSTGVKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQLAEYHQLTSQD 246
           + LR   S   + ++DDLVRLQ+E LLAEK+RLA+EN+  TREN  L  + EYHQLT QD
Sbjct: 277 KVLRENRSGDSQTDDDDLVRLQLETLLAEKARLAHENSIYTRENLYLRGVVEYHQLTMQD 336

Query: 247 LSLSYDE 253
           + + +DE
Sbjct: 337 V-VYFDE 337

BLAST of CmoCh19G000400 vs. TAIR 10
Match: AT1G30050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G01970.1); Has 246 Blast hits to 244 proteins in 61 species: Archae - 0; Bacteria - 8; Metazoa - 78; Fungi - 10; Plants - 117; Viruses - 0; Other Eukaryotes - 33 (source: NCBI BLink). )

HSP 1 Score: 94.7 bits (234), Expect = 1.3e-19
Identity = 74/259 (28.57%), Positives = 127/259 (49.03%), Query Frame = 0

Query: 7   SPSSSSHPN------QGPTSFMGSPPLSSPDSDQRFWSSLLGRVDSLLEQRNAKSSNLNT 66
           S S SSH +      +  +   G          + FW  L  +  S+LE    +      
Sbjct: 43  SQSFSSHSSLAAQAIRASSQAQGFTAYEDKSESRGFWGILAQKAKSILEDEEEQQQQ--- 102

Query: 67  NKSDRAKRLKEDSLLLMRGFDSVGYTLSQLSNNLDNALQGAKDLVQAPTLTE----IFQS 126
            +       +  +  + +  D +  +L+ + ++ + A +  + +V +    +    I   
Sbjct: 103 QQQQNDVIFEPSNPTIRKSIDKITTSLNHIGDSFEKAFEEGRTIVASQIRRKGSDLIDSD 162

Query: 127 NLKNSEVEHELVELKQATKRKYDDTHCSEDSELAVAMATKTSSLARELKSLKSNICFMQE 186
           N    +        +  T+    ++      ++A+A A K   L RELK++K+++ F +E
Sbjct: 163 NNNYHQSSGSSSPWQPLTQPNPRESQLKASRDVAMATAAKAKLLLRELKTVKADLAFAKE 222

Query: 187 RCAILDEENRRLRNGVSTG-VKPEEDDLVRLQMEALLAEKSRLANENANLTRENQCLYQL 246
           RC+ L+EEN+RLR+    G   P +DDL+RLQ+E LLAEK+RLA+EN+   REN+ L ++
Sbjct: 223 RCSQLEEENKRLRDNRDKGNNNPADDDLIRLQLETLLAEKARLAHENSIYARENRFLREI 282

Query: 247 AEYHQLTSQDLSLSYDEAI 255
            EYHQLT QD+ +  DE I
Sbjct: 283 VEYHQLTMQDV-VYIDEGI 297

The following BLAST results are available for this feature:

vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1HHY1 | 4.5e-153 | 93.69 | uncharacterized protein LOC111463807 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HK88 | 1.1e-143 | 92.38 | uncharacterized protein LOC111463807 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HG94 | 1.4e-141 | 93.24 | uncharacterized protein LOC111463807 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HWL1 | 2.6e-132 | 86.67 | uncharacterized protein LOC111466940 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HFE4 | 1.6e-121 | 83.50 | uncharacterized protein LOC111463807 isoform X4 OS=Cucurbita moschata OX=3662 GN...

vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G02800.1 | 2.3e-69 | 51.66 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G01970.1 | 9.6e-23 | 33.08 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G30530.1 | 1.2e-20 | 32.79 | unknown protein; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant ...
AT1G30050.1 | 1.3e-19 | 28.57 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 87..107
None | No IPR available | COILS | Coil | Coil | coord: 208..235
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 265..297
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 279..297
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..35
None | No IPR available | PANTHER | PTHR31016:SF2 | OSJNBA0065B15.1 PROTEIN | coord: 148..272
None | No IPR available | PANTHER | PTHR31016 | UNCHARACTERIZED | coord: 1..147; coord: 148..272
None | No IPR available | PANTHER | PTHR31016:SF2 | OSJNBA0065B15.1 PROTEIN | coord: 1..147

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh19G000400.1 | CmoCh19G000400.1 | mRNA