Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGACACCGATGAATTTTACCGGCAACCGGCTGCCGTTCCTTTCAAATGGGAGATTAAACCCGGCGTCCCCAGAAACCATCACCGCCTCCGCCACTCACCAACTCACTCTCCCCCTCAACACCACCGTCAAAAGCTGAAACCTCCTCCTGCTGTTTCCCATTTCCCCCATCCTCCAAATTCTCTTCACTCGTCTCCTCGAACCCAGTCCGAACGCTGGCGGTTCGTCCGATCCGAGCAAGTCTCGTCCTCTGGTTGCTTCCCATCGCCTTTGCCGAACCGGAAGTCCCCCAAGTCCGTCAGCCGGAAATTACCCGAACCGGATTACTCCTCTGACTTGGACACTTTGTCTCGCTGGTCGGTTTCCAGTCGGAAGTCGATTTCGCCGTTCCGATATTCAGTTTCGTCGTCGCCGTCGTCGTTCTCGTCGTACCAATCGTCTCCCCGTCCGACAAGTGATACCGAATGGGCAGGTTTTGGGCTATTTTGATTGGGCCTTAAGAGACGTTGCGGCCCATCTAGCTGGTGATACTGAATGGGCTTGGTTTTTGGGCTTTTTGGTTTAGTGGATTTGAAGAGACTGTGGGCTTTAAGTTGTTGTGTAGAAATAGGAAGCTCTAAATCCCTAATTGTATTGTGA
mRNA sequence
ATGGACACCGATGAATTTTACCGGCAACCGGCTGCCGTTCCTTTCAAATGGGAGATTAAACCCGGCGTCCCCAGAAACCATCACCGCCTCCGCCACTCACCAACTCACTCTCCCCCTCAACACCACCGTCAAAAGCTGAAACCTCCTCCTGCTGTTTCCCATTTCCCCCATCCTCCAAATTCTCTTCACTCGTCTCCTCGAACCCAGTCCGAACGCTGGCGGTTCGTCCGATCCGAGCAAGTCTCGTCCTCTGGTTGCTTCCCATCGCCTTTGCCGAACCGGAAGTCCCCCAAGTCCGTCAGCCGGAAATTACCCGAACCGGATTACTCCTCTGACTTGGACACTTTGTCTCGCTGGTCGGTTTCCAGTCGGAAGTCGATTTCGCCGTTCCGATATTCAGTTTCGTCGTCGCCGTCGTCGTTCTCGTCGTACCAATCGTCTCCCCGTCCGACAAGTGATACCGAATGGGCAGTGGATTTGAAGAGACTGTGGGCTTTAAGTTGTTGTGTAGAAATAGGAAGCTCTAAATCCCTAATTGTATTGTGA
Coding sequence (CDS)
ATGGACACCGATGAATTTTACCGGCAACCGGCTGCCGTTCCTTTCAAATGGGAGATTAAACCCGGCGTCCCCAGAAACCATCACCGCCTCCGCCACTCACCAACTCACTCTCCCCCTCAACACCACCGTCAAAAGCTGAAACCTCCTCCTGCTGTTTCCCATTTCCCCCATCCTCCAAATTCTCTTCACTCGTCTCCTCGAACCCAGTCCGAACGCTGGCGGTTCGTCCGATCCGAGCAAGTCTCGTCCTCTGGTTGCTTCCCATCGCCTTTGCCGAACCGGAAGTCCCCCAAGTCCGTCAGCCGGAAATTACCCGAACCGGATTACTCCTCTGACTTGGACACTTTGTCTCGCTGGTCGGTTTCCAGTCGGAAGTCGATTTCGCCGTTCCGATATTCAGTTTCGTCGTCGCCGTCGTCGTTCTCGTCGTACCAATCGTCTCCCCGTCCGACAAGTGATACCGAATGGGCAGTGGATTTGAAGAGACTGTGGGCTTTAAGTTGTTGTGTAGAAATAGGAAGCTCTAAATCCCTAATTGTATTGTGA
Protein sequence
MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPNSLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWSVSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWAVDLKRLWALSCCVEIGSSKSLIVL*
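The protein sequence above appears to be the conceptual translation of the CDS, with `*` marking the stop codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional T, C, A, G ordering of codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as '*', codons containing ambiguous bases
    as 'X', and any trailing incomplete codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First seven codons and last four codons of the CDS shown above.
    print(translate("ATGGACACCGATGAATTTTAC"))  # MDTDEFY
    print(translate("ATTGTATTGTGA"))           # IVL*
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check between the two records.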
Homology
BLAST of CsaV3_4G028070 vs. NCBI nr
Match:
KAE8649619.1 (hypothetical protein Csa_012837 [Cucumis sativus])
HSP 1 Score: 367.5 bits (942), Expect = 6.8e-98
Identity = 181/181 (100.00%), Positives = 181/181 (100.00%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN
Sbjct: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
Query: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS 120
SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS
Sbjct: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS 120
Query: 121 VSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWAVDLKRLWALSCCVEIGSSKSLIV 180
VSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWAVDLKRLWALSCCVEIGSSKSLIV
Sbjct: 121 VSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWAVDLKRLWALSCCVEIGSSKSLIV 180
Query: 181 L 182
L
Sbjct: 181 L 181
BLAST of CsaV3_4G028070 vs. NCBI nr
Match:
XP_004142634.1 (uncharacterized protein LOC101220757 [Cucumis sativus])
HSP 1 Score: 320.1 bits (819), Expect = 1.2e-83
Identity = 157/157 (100.00%), Positives = 157/157 (100.00%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN
Sbjct: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
Query: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS 120
SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS
Sbjct: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS 120
Query: 121 VSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWA 158
VSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWA
Sbjct: 121 VSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWA 157
BLAST of CsaV3_4G028070 vs. NCBI nr
Match:
XP_008444194.1 (PREDICTED: uncharacterized protein LOC103487607 [Cucumis melo])
HSP 1 Score: 298.5 bits (763), Expect = 3.9e-77
Identity = 148/158 (93.67%), Positives = 153/158 (96.84%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
MDTDEFYR+PAAVPFKWEIKPGVPRNHHR R SPTHSPPQHHRQKLKPPPAVSHFPHP N
Sbjct: 1 MDTDEFYRKPAAVPFKWEIKPGVPRNHHRPRQSPTHSPPQHHRQKLKPPPAVSHFPHPSN 60
Query: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS 120
SLHSSPRT+S+RWRFVRSEQVSSSGCFPSPLPNRKSPK++SRK PEPDYSSDLDTLSRWS
Sbjct: 61 SLHSSPRTRSDRWRFVRSEQVSSSGCFPSPLPNRKSPKALSRKFPEPDYSSDLDTLSRWS 120
Query: 121 VSSRKSISPFRYSV-SSSPSSFSSYQSSPRPTSDTEWA 158
VSSRKSISPFRYSV SSSPSSFSSYQSSPRPTSDTEWA
Sbjct: 121 VSSRKSISPFRYSVSSSSPSSFSSYQSSPRPTSDTEWA 158
BLAST of CsaV3_4G028070 vs. NCBI nr
Match:
XP_038899347.1 (uncharacterized protein LOC120086669 [Benincasa hispida])
HSP 1 Score: 261.2 bits (666), Expect = 6.9e-66
Identity = 134/157 (85.35%), Positives = 144/157 (91.72%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
MD DEFYRQPAAVPFKWEIKPGVP+NHHRLRHSPTHSPPQHH QKLKPPP+VS+F HP N
Sbjct: 1 MDVDEFYRQPAAVPFKWEIKPGVPKNHHRLRHSPTHSPPQHH-QKLKPPPSVSNFLHPSN 60
Query: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS 120
SLHSS RT+S+RWRF + EQV SSGCFPSPLPNRKS KS+SR PEPDYSS L++LSRWS
Sbjct: 61 SLHSSSRTRSDRWRFSQPEQV-SSGCFPSPLPNRKSAKSLSRN-PEPDYSSGLESLSRWS 120
Query: 121 VSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWA 158
VSSRKSISPFRYSVSSSPSS+SSY SSPRPTSDTEWA
Sbjct: 121 VSSRKSISPFRYSVSSSPSSYSSYHSSPRPTSDTEWA 154
BLAST of CsaV3_4G028070 vs. NCBI nr
Match:
XP_022131529.1 (uncharacterized protein DKFZp434B061-like [Momordica charantia])
HSP 1 Score: 215.3 bits (547), Expect = 4.3e-52
Identity = 123/167 (73.65%), Positives = 135/167 (80.84%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
MD DEFYR+PAAVPFKWEIKPGVPR HHRL SP SPP QKLKPPP VSHF P
Sbjct: 1 MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHFRRPSE 60
Query: 61 ----SLHSSPRTQSERWRFVRSE-----QVS-SSGCFPSPLPNRKSPKSVSRKLPEPDYS 120
SLHSS RT+S+RWRF RS QVS ++GCFPSP PNRKS KS++RK PEP+Y+
Sbjct: 61 SSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRKSGKSMNRK-PEPNYT 120
Query: 121 SDLDTLSRWSVSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWA 158
++L+TLSRWSVSSRKSISPFR SVSSSPSSFSSYQSSPRPTSDTEWA
Sbjct: 121 TELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWA 161
BLAST of CsaV3_4G028070 vs. ExPASy TrEMBL
Match:
A0A1S3BAM4 (uncharacterized protein LOC103487607 OS=Cucumis melo OX=3656 GN=LOC103487607 PE=4 SV=1)
HSP 1 Score: 298.5 bits (763), Expect = 1.9e-77
Identity = 148/158 (93.67%), Positives = 153/158 (96.84%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
MDTDEFYR+PAAVPFKWEIKPGVPRNHHR R SPTHSPPQHHRQKLKPPPAVSHFPHP N
Sbjct: 1 MDTDEFYRKPAAVPFKWEIKPGVPRNHHRPRQSPTHSPPQHHRQKLKPPPAVSHFPHPSN 60
Query: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSRWS 120
SLHSSPRT+S+RWRFVRSEQVSSSGCFPSPLPNRKSPK++SRK PEPDYSSDLDTLSRWS
Sbjct: 61 SLHSSPRTRSDRWRFVRSEQVSSSGCFPSPLPNRKSPKALSRKFPEPDYSSDLDTLSRWS 120
Query: 121 VSSRKSISPFRYSV-SSSPSSFSSYQSSPRPTSDTEWA 158
VSSRKSISPFRYSV SSSPSSFSSYQSSPRPTSDTEWA
Sbjct: 121 VSSRKSISPFRYSVSSSSPSSFSSYQSSPRPTSDTEWA 158
BLAST of CsaV3_4G028070 vs. ExPASy TrEMBL
Match:
A0A0A0KY52 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G361790 PE=4 SV=1)
HSP 1 Score: 248.1 bits (632), Expect = 2.9e-62
Identity = 118/118 (100.00%), Positives = 118/118 (100.00%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN
Sbjct: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
Query: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSR 119
SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSR
Sbjct: 61 SLHSSPRTQSERWRFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDLDTLSR 118
BLAST of CsaV3_4G028070 vs. ExPASy TrEMBL
Match:
A0A6J1BQH3 (uncharacterized protein DKFZp434B061-like OS=Momordica charantia OX=3673 GN=LOC111004696 PE=4 SV=1)
HSP 1 Score: 215.3 bits (547), Expect = 2.1e-52
Identity = 123/167 (73.65%), Positives = 135/167 (80.84%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAVSHFPHPPN 60
MD DEFYR+PAAVPFKWEIKPGVPR HHRL SP SPP QKLKPPP VSHF P
Sbjct: 1 MDADEFYRRPAAVPFKWEIKPGVPRAHHRLCPSP--SPPP---QKLKPPPVVSHFRRPSE 60
Query: 61 ----SLHSSPRTQSERWRFVRSE-----QVS-SSGCFPSPLPNRKSPKSVSRKLPEPDYS 120
SLHSS RT+S+RWRF RS QVS ++GCFPSP PNRKS KS++RK PEP+Y+
Sbjct: 61 SSSCSLHSSSRTRSDRWRFARSSLAEPPQVSPATGCFPSPSPNRKSGKSMNRK-PEPNYT 120
Query: 121 SDLDTLSRWSVSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWA 158
++L+TLSRWSVSSRKSISPFR SVSSSPSSFSSYQSSPRPTSDTEWA
Sbjct: 121 TELETLSRWSVSSRKSISPFRDSVSSSPSSFSSYQSSPRPTSDTEWA 161
BLAST of CsaV3_4G028070 vs. ExPASy TrEMBL
Match:
A0A6J1FHC7 (uncharacterized protein LOC111445775 OS=Cucurbita moschata OX=3662 GN=LOC111445775 PE=4 SV=1)
HSP 1 Score: 195.3 bits (495), Expect = 2.2e-46
Identity = 115/164 (70.12%), Positives = 127/164 (77.44%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAV--SHFPHP 60
MD DEFYRQPAAVPFKWEIKPGVPRNHHRL PTHSP QH +KLKPPPAV + F
Sbjct: 1 MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLHQFPTHSPQQH--KKLKPPPAVTATQFHRS 60
Query: 61 PNSLHSSPRTQSERWRFVRS-----EQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDL 120
NSL RT+S+RW +S EQV S GCF SPLPNRK+ K V+RK PEPDY+S+L
Sbjct: 61 SNSL----RTRSDRWSSTQSKLAEPEQV-SVGCFSSPLPNRKASKIVNRK-PEPDYASEL 120
Query: 121 DTLSRWSVSSRKSISPFRYSVSSSPSSFSSYQSSPRPTSDTEWA 158
+TL RWSVSS+KSISPFR SVSS SS SSYQSSPRPTSD+EWA
Sbjct: 121 ETLPRWSVSSKKSISPFRNSVSS--SSLSSYQSSPRPTSDSEWA 154
BLAST of CsaV3_4G028070 vs. ExPASy TrEMBL
Match:
A0A6J1ISY3 (uncharacterized protein LOC111480325 OS=Cucurbita maxima OX=3661 GN=LOC111480325 PE=4 SV=1)
HSP 1 Score: 188.7 bits (478), Expect = 2.1e-44
Identity = 112/166 (67.47%), Positives = 126/166 (75.90%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNHHRLRHSPTHSPPQHHRQKLKPPPAV--SHFPHP 60
MD DEFYRQPAAVPFKWEIKPGVPRNHH L PTHSP QH +KLKPPPAV + F
Sbjct: 1 MDVDEFYRQPAAVPFKWEIKPGVPRNHHCLHPFPTHSPQQH--KKLKPPPAVTATQFHRS 60
Query: 61 PNSLHSSPRTQSERW-----RFVRSEQVSSSGCFPSPLPNRKSPKSVSRKLPEPDYSSDL 120
NSL RT+S+RW + EQV S GCF SPLPNRK+ K ++RK PEPD +S+L
Sbjct: 61 SNSL----RTRSDRWSSSQSKLAEPEQV-SVGCFSSPLPNRKATKILNRK-PEPDCASEL 120
Query: 121 DTLSRWSVSSRKSISPFRYSVSS--SPSSFSSYQSSPRPTSDTEWA 158
+TL RWS+SS+KSISPFR SVSS SPSS SSYQSSPRPTSD+EWA
Sbjct: 121 ETLPRWSLSSKKSISPFRNSVSSSPSPSSLSSYQSSPRPTSDSEWA 158
BLAST of CsaV3_4G028070 vs. TAIR 10
Match:
AT1G77400.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF688 (InterPro:IPR007789); BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G21695.1); Has 328 Blast hits to 314 proteins in 61 species: Archae - 0; Bacteria - 12; Metazoa - 130; Fungi - 28; Plants - 92; Viruses - 10; Other Eukaryotes - 56 (source: NCBI BLink).)
HSP 1 Score: 73.9 bits (180), Expect = 1.5e-13
Identity = 73/226 (32.30%), Positives = 96/226 (42.48%), Query Frame = 0
Query: 1 MDTDEFYRQPAAVPFKWEIKPGVPRNH---------------------HRLRHS-----P 60
+D D+ +++P +PF WEI+PGVP+ L HS P
Sbjct: 4 IDVDDSFKRPGTIPFSWEIRPGVPKTRMSQPGNTTPLQPPKKLSPLRFKPLSHSQPLLPP 63
Query: 61 THSPPQHH---------------------RQKLKP---PPAVSHFPHPPNSLHSSPRTQS 120
SPP KLKP P ++S F P S SSPR S
Sbjct: 64 ALSPPSSSFISNSKSRPLSPLTPHSFSTTPSKLKPPRTPSSLSGFYSPGPSFRSSPRAFS 123
Query: 121 ERWRFVRSEQ--------------VSSSGCFPSPLPNRKSPKS-----VSRKLPEPD-YS 157
ERW+ R + V+ GCFPSP + KS S E D Y
Sbjct: 124 ERWQLHRPNRIRPESEPEPSSDFSVAGFGCFPSPKFRLRKVKSGGSRRKSGSRSENDYYC 183
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A1S3BAM4 | 1.9e-77 | 93.67 | uncharacterized protein LOC103487607 OS=Cucumis melo OX=3656 GN=LOC103487607 PE=...
A0A0A0KY52 | 2.9e-62 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G361790 PE=4 SV=1
A0A6J1BQH3 | 2.1e-52 | 73.65 | uncharacterized protein DKFZp434B061-like OS=Momordica charantia OX=3673 GN=LOC1...
A0A6J1FHC7 | 2.2e-46 | 70.12 | uncharacterized protein LOC111445775 OS=Cucurbita moschata OX=3662 GN=LOC1114457...
A0A6J1ISY3 | 2.1e-44 | 67.47 | uncharacterized protein LOC111480325 OS=Cucurbita maxima OX=3661 GN=LOC111480325...
Match Name | E-value | Identity | Description
AT1G77400.1 | 1.5e-13 | 32.30 | CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF688 (InterPro:IPR0077...
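The Identity column in the summary tables matches the percentage printed on each HSP statistics line, for example 148/158 = 93.67% for A0A1S3BAM4. Below is a minimal sketch of recomputing those percentages from a statistics line. The function name `parse_hsp_stats` and the returned keys are hypothetical, and the pattern assumes the line layout used on this page (it accepts both the "Positives" and "Postives" spellings).

```python
import re

# Matches lines such as:
#   Identity = 148/158 (93.67%), Positives = 153/158 (96.84%), Query Frame = 0
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos[a-z]*tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, id_length, positive, pos_length = map(int, match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned_length": id_length,
        "identity_pct": round(100 * identical / id_length, 2),
        "positives_pct": round(100 * positive / pos_length, 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 148/158 (93.67%), Positives = 153/158 (96.84%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

Note that the percentage is taken over the aligned HSP length and not over the full query: A0A0A0KY52 is listed at 100.00% identity, but its alignment covers only 118 residues of the query.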