Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTTAGTTATAAAATGTATTAACTTTCCAAAAAAATGACACCATAAAATCCAGATGGTGGGCCATCCTTATCCACACAAAACATCAGAGCCTGAAGTCGGACAAAGCCTTATCCATATTTGATCTCCAAAACATCTCTTTATCCATATTCAATCCCATTCTGACCATCAAAATATAAGCGCAAAACCTCGATCCATTCAAACGCTACCTTCGATCATGGCGTCCTTGTGTTCATTTCCACGCATTTCTTCTGCAGATCCCATCAAGCATCCCGCCGCCGCCCCTTTTCCGCCGTCCAATCACCCGAAAAGACCATCAGCGTTGTCTCTCCGGCAGAGCAGCCGCAACCAGAAAAGAACTTCTACGATTGTCGCCGCCATCGGAGACGTCTCCGCTGACGGCACCACGTATTTGATCGCCGGCGCCGTCGCTGTGGCTCTCGTTGGTACCGCCTTCCCTATCCTCTTCTCTCGCAAAGACCTGTAAGCAAAATCTCAATTTCTCCTATAAACCATTTGTTCATATTGAAATCATTTTAAAAATGCCGTTTAGTACCGCTGTTTGAATGGTCTGGTCCTCCGTAGGTGCCCGGTGTGCGACGGCGCAGGGTTTGTCCGGAAGTCGGGGGCGGCGCTGAGGGCGAATGCGGCTCGTAAAGACCAAGCTCAGATCGTTTGTTCTCGTTGCAATGGCCTGGGCAAGCTCAATCAAGTGGACAAATAA
mRNA sequence
ATTTAGTTATAAAATGTATTAACTTTCCAAAAAAATGACACCATAAAATCCAGATGGTGGGCCATCCTTATCCACACAAAACATCAGAGCCTGAAGTCGGACAAAGCCTTATCCATATTTGATCTCCAAAACATCTCTTTATCCATATTCAATCCCATTCTGACCATCAAAATATAAGCGCAAAACCTCGATCCATTCAAACGCTACCTTCGATCATGGCGTCCTTGTGTTCATTTCCACGCATTTCTTCTGCAGATCCCATCAAGCATCCCGCCGCCGCCCCTTTTCCGCCGTCCAATCACCCGAAAAGACCATCAGCGTTGTCTCTCCGGCAGAGCAGCCGCAACCAGAAAAGAACTTCTACGATTGTCGCCGCCATCGGAGACGTCTCCGCTGACGGCACCACGTATTTGATCGCCGGCGCCGTCGCTGTGGCTCTCGTTGGTACCGCCTTCCCTATCCTCTTCTCTCGCAAAGACCTGTGCCCGGTGTGCGACGGCGCAGGGTTTGTCCGGAAGTCGGGGGCGGCGCTGAGGGCGAATGCGGCTCGTAAAGACCAAGCTCAGATCGTTTGTTCTCGTTGCAATGGCCTGGGCAAGCTCAATCAAGTGGACAAATAA
Coding sequence (CDS)
ATGGCGTCCTTGTGTTCATTTCCACGCATTTCTTCTGCAGATCCCATCAAGCATCCCGCCGCCGCCCCTTTTCCGCCGTCCAATCACCCGAAAAGACCATCAGCGTTGTCTCTCCGGCAGAGCAGCCGCAACCAGAAAAGAACTTCTACGATTGTCGCCGCCATCGGAGACGTCTCCGCTGACGGCACCACGTATTTGATCGCCGGCGCCGTCGCTGTGGCTCTCGTTGGTACCGCCTTCCCTATCCTCTTCTCTCGCAAAGACCTGTGCCCGGTGTGCGACGGCGCAGGGTTTGTCCGGAAGTCGGGGGCGGCGCTGAGGGCGAATGCGGCTCGTAAAGACCAAGCTCAGATCGTTTGTTCTCGTTGCAATGGCCTGGGCAAGCTCAATCAAGTGGACAAATAA
Protein sequence
MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSADGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVCSRCNGLGKLNQVDK
Homology
BLAST of CmaCh06G002860 vs. ExPASy TrEMBL
Match:
A0A6J1I3R1 (uncharacterized protein LOC111470276 OS=Cucurbita maxima OX=3661 GN=LOC111470276 PE=4 SV=1)
HSP 1 Score: 258.5 bits (659), Expect = 1.6e-65
Identity = 134/134 (100.00%), Postives = 134/134 (100.00%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA
Sbjct: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
Query: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC
Sbjct: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
Query: 121 SRCNGLGKLNQVDK 135
SRCNGLGKLNQVDK
Sbjct: 121 SRCNGLGKLNQVDK 134
BLAST of CmaCh06G002860 vs. ExPASy TrEMBL
Match:
A0A6J1HBY8 (uncharacterized protein LOC111461414 OS=Cucurbita moschata OX=3662 GN=LOC111461414 PE=4 SV=1)
HSP 1 Score: 251.5 bits (641), Expect = 1.9e-63
Identity = 132/134 (98.51%), Postives = 132/134 (98.51%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
MASLCSFPRISSADPIKH AAAPFPPSNHP RPSALSLRQSSRNQKRTSTIVAAIGDVSA
Sbjct: 1 MASLCSFPRISSADPIKHLAAAPFPPSNHPIRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
Query: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC
Sbjct: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
Query: 121 SRCNGLGKLNQVDK 135
SRCNGLGKLNQVDK
Sbjct: 121 SRCNGLGKLNQVDK 134
BLAST of CmaCh06G002860 vs. ExPASy TrEMBL
Match:
A0A6J1F9E5 (uncharacterized protein LOC111442008 OS=Cucurbita moschata OX=3662 GN=LOC111442008 PE=4 SV=1)
HSP 1 Score: 224.6 bits (571), Expect = 2.5e-55
Identity = 118/135 (87.41%), Postives = 124/135 (91.85%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKH-PAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVS 60
MASLC+FPRISS +PIK PAAAPFPPSN P RPSALSLRQSS +R ST+VAAIGDVS
Sbjct: 1 MASLCTFPRISSTEPIKQTPAAAPFPPSNQPMRPSALSLRQSSSKHRRISTVVAAIGDVS 60
Query: 61 ADGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIV 120
ADGTTYLIAGAVAVALVGTAFPILFSRKDLCP CDGAGFVR+SGAALRANAARKDQ QIV
Sbjct: 61 ADGTTYLIAGAVAVALVGTAFPILFSRKDLCPECDGAGFVRRSGAALRANAARKDQTQIV 120
Query: 121 CSRCNGLGKLNQVDK 135
C+RCNGLGKLNQVDK
Sbjct: 121 CARCNGLGKLNQVDK 135
BLAST of CmaCh06G002860 vs. ExPASy TrEMBL
Match:
A0A1S3B5E7 (uncharacterized protein LOC103486375 OS=Cucumis melo OX=3656 GN=LOC103486375 PE=4 SV=1)
HSP 1 Score: 224.6 bits (571), Expect = 2.5e-55
Identity = 115/135 (85.19%), Postives = 124/135 (91.85%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKH-PAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVS 60
MASLCSFPRISS +PIK PA APFPPSNHP RPS LSLRQSSRN KR ST+VAA+GDVS
Sbjct: 1 MASLCSFPRISSTEPIKQSPATAPFPPSNHPIRPSTLSLRQSSRNHKRISTVVAAVGDVS 60
Query: 61 ADGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIV 120
+DGTTYLIAGA+AVALVGTAFPI FSRKDLCP C+GAGFVR+SG+ALRANAARKDQ QIV
Sbjct: 61 SDGTTYLIAGAIAVALVGTAFPIFFSRKDLCPECEGAGFVRRSGSALRANAARKDQTQIV 120
Query: 121 CSRCNGLGKLNQVDK 135
C+RCNGLGKLNQVDK
Sbjct: 121 CARCNGLGKLNQVDK 135
BLAST of CmaCh06G002860 vs. ExPASy TrEMBL
Match:
A0A6J1J3Y5 (uncharacterized protein LOC111482446 OS=Cucurbita maxima OX=3661 GN=LOC111482446 PE=4 SV=1)
HSP 1 Score: 224.2 bits (570), Expect = 3.3e-55
Identity = 118/135 (87.41%), Postives = 124/135 (91.85%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKH-PAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVS 60
MASLC+FPRISS +PIK PAAAPFPPSN P RPSALSLRQSS RTST+VAA+GDVS
Sbjct: 1 MASLCTFPRISSTEPIKQTPAAAPFPPSNQPMRPSALSLRQSSGKHWRTSTVVAAVGDVS 60
Query: 61 ADGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIV 120
ADGTTYLIAGAVAVALVGTAFPILFSRKDLCP CDGAGFVR+SGAALRANAARKDQ QIV
Sbjct: 61 ADGTTYLIAGAVAVALVGTAFPILFSRKDLCPECDGAGFVRRSGAALRANAARKDQTQIV 120
Query: 121 CSRCNGLGKLNQVDK 135
C+RCNGLGKLNQVDK
Sbjct: 121 CARCNGLGKLNQVDK 135
BLAST of CmaCh06G002860 vs. NCBI nr
Match:
XP_022971601.1 (uncharacterized protein LOC111470276 [Cucurbita maxima])
HSP 1 Score: 258.5 bits (659), Expect = 3.3e-65
Identity = 134/134 (100.00%), Postives = 134/134 (100.00%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA
Sbjct: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
Query: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC
Sbjct: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
Query: 121 SRCNGLGKLNQVDK 135
SRCNGLGKLNQVDK
Sbjct: 121 SRCNGLGKLNQVDK 134
BLAST of CmaCh06G002860 vs. NCBI nr
Match:
XP_022960729.1 (uncharacterized protein LOC111461414 [Cucurbita moschata])
HSP 1 Score: 251.5 bits (641), Expect = 4.0e-63
Identity = 132/134 (98.51%), Postives = 132/134 (98.51%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
MASLCSFPRISSADPIKH AAAPFPPSNHP RPSALSLRQSSRNQKRTSTIVAAIGDVSA
Sbjct: 1 MASLCSFPRISSADPIKHLAAAPFPPSNHPIRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
Query: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC
Sbjct: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
Query: 121 SRCNGLGKLNQVDK 135
SRCNGLGKLNQVDK
Sbjct: 121 SRCNGLGKLNQVDK 134
BLAST of CmaCh06G002860 vs. NCBI nr
Match:
XP_023539540.1 (uncharacterized protein LOC111800182 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 248.4 bits (633), Expect = 3.4e-62
Identity = 131/134 (97.76%), Postives = 131/134 (97.76%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
MASLCSFPRISSADPIKH AAAPFPPSN P RPSALSLRQSSRNQKRTSTIVAAIGDVSA
Sbjct: 1 MASLCSFPRISSADPIKHLAAAPFPPSNRPIRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
Query: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC
Sbjct: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
Query: 121 SRCNGLGKLNQVDK 135
SRCNGLGKLNQVDK
Sbjct: 121 SRCNGLGKLNQVDK 134
BLAST of CmaCh06G002860 vs. NCBI nr
Match:
KAG6596401.1 (Aspartic proteinase PCS1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 247.3 bits (630), Expect = 7.6e-62
Identity = 130/132 (98.48%), Postives = 130/132 (98.48%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
MASLCSFPRISSADPIKH AAAPFPPSNHP RPSALSLRQSSRNQKRTSTIVAAIGDVSA
Sbjct: 1 MASLCSFPRISSADPIKHLAAAPFPPSNHPIRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
Query: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC
Sbjct: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
Query: 121 SRCNGLGKLNQV 133
SRCNGLGKLNQV
Sbjct: 121 SRCNGLGKLNQV 132
BLAST of CmaCh06G002860 vs. NCBI nr
Match:
KAG7027943.1 (Aspartic proteinase PCS1 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 244.2 bits (622), Expect = 6.4e-61
Identity = 129/132 (97.73%), Postives = 129/132 (97.73%), Query Frame = 0
Query: 1 MASLCSFPRISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
MASLCSFPRISSADPIKH AAAPFPPSN P RPSALSLRQSSRNQKRTSTIVAAIGDVSA
Sbjct: 1 MASLCSFPRISSADPIKHLAAAPFPPSNRPIRPSALSLRQSSRNQKRTSTIVAAIGDVSA 60
Query: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC
Sbjct: 61 DGTTYLIAGAVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVC 120
Query: 121 SRCNGLGKLNQV 133
SRCNGLGKLNQV
Sbjct: 121 SRCNGLGKLNQV 132
BLAST of CmaCh06G002860 vs. TAIR 10
Match:
AT5G02160.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; Has 121 Blast hits to 121 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 121; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 140.2 bits (352), Expect = 1.2e-33
Identity = 75/125 (60.00%), Postives = 89/125 (71.20%), Query Frame = 0
Query: 10 ISSADPIKHPAAAPFPPSNHPKRPSALSLRQSSRNQKRTSTIVAAIGDVSADGTTYLIAG 69
+SS + +KH ++ P N+ P + +S +VAA+GDVS+DGT YLI G
Sbjct: 12 VSSTNFLKHSSSWGSPSPNNVILP--------KNKRSSSSVVVAAVGDVSSDGTIYLIGG 71
Query: 70 AVAVALVGTAFPILFSRKDLCPVCDGAGFVRKSGAALRANAARKDQAQIVCSRCNGLGKL 129
A+AVALVGTAFPILF RKD CP CDGAGFVRK G LRANAARKD QIVC+ CNGLGKL
Sbjct: 72 AIAVALVGTAFPILFKRKDTCPECDGAGFVRKGGVTLRANAARKDLPQIVCANCNGLGKL 128
Query: 130 NQVDK 135
NQ+DK
Sbjct: 132 NQIDK 128
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1I3R1 | 1.6e-65 | 100.00 | uncharacterized protein LOC111470276 OS=Cucurbita maxima OX=3661 GN=LOC111470276... | [more] |
A0A6J1HBY8 | 1.9e-63 | 98.51 | uncharacterized protein LOC111461414 OS=Cucurbita moschata OX=3662 GN=LOC1114614... | [more] |
A0A6J1F9E5 | 2.5e-55 | 87.41 | uncharacterized protein LOC111442008 OS=Cucurbita moschata OX=3662 GN=LOC1114420... | [more] |
A0A1S3B5E7 | 2.5e-55 | 85.19 | uncharacterized protein LOC103486375 OS=Cucumis melo OX=3656 GN=LOC103486375 PE=... | [more] |
A0A6J1J3Y5 | 3.3e-55 | 87.41 | uncharacterized protein LOC111482446 OS=Cucurbita maxima OX=3661 GN=LOC111482446... | [more] |
Match Name | E-value | Identity | Description | |
XP_022971601.1 | 3.3e-65 | 100.00 | uncharacterized protein LOC111470276 [Cucurbita maxima] | [more] |
XP_022960729.1 | 4.0e-63 | 98.51 | uncharacterized protein LOC111461414 [Cucurbita moschata] | [more] |
XP_023539540.1 | 3.4e-62 | 97.76 | uncharacterized protein LOC111800182 [Cucurbita pepo subsp. pepo] | [more] |
KAG6596401.1 | 7.6e-62 | 98.48 | Aspartic proteinase PCS1, partial [Cucurbita argyrosperma subsp. sororia] | [more] |
KAG7027943.1 | 6.4e-61 | 97.73 | Aspartic proteinase PCS1 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
AT5G02160.1 | 1.2e-33 | 60.00 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |