Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTCACTATCACTTCCCACAAACTTTCTCTTTCAAATTCTACAGACCTCAGAGCTCTTCAACAATGGAGTTTCTTCGCTTTCTCATCCTTACCTTCCATATTCTCGCTTCCACCTCCATCTCCGGCCACCGCACCGCCTCCTCCGCAGCCCCTCCCTGCCGGAACACATGCGGCGCGCTCACCATCAAGTACCCGTTCGGCACCGGCTACGGCTGCGGCTCGCCGAGATTCTCCTCTCACGTCACCTGCTCCTCCGACGACCGTCTCCTCCTCACCACTCACACCGGCGACTATCCGATCACTTCGATTTCATACTCCGACTCCACCGTCATAATCTCGCCGCCGTCGATGTCGACCTGCTCTAAGATGCAGGAATCGAAAACCCTAGGCATCGATTGGACTGGCCCCTTCCAACTCGGCTCATCGGTGTTCCTCCTCCTCGATTGCGAATCTCCCTCAGCCTCCCTCTCGATTCGAGGCTCCGCCGTGTGCGATTTGTCGTACTCTCACCTCTGCGCTTCGATCTACGCCTGTCCGGCCGTCGTCGACCTCGGCCTGCCGCTCTTTCCGCCGACGAACTCCTGCTGCGTCTACTCGCCGGCGAACTTCGACGGCGACGGCGAGCTCGACCTGCGCGAGCTCAAGTGCGGCGGATTCTCGTCGGTGGTGTCGCTCGGAGAGTATCAGACGGATCCGGCGCGGTGGGAGTACGGAGTGGAGTTGAAGTACGGATATGGAGCGCTGGAGAGCAGTGCGGTGGAAACCAAATGTACGGGCTGCGAAATGAGCGGCGGCGCGTGTGGATTTGCTCCGCCGGAGAATGCGTTCGTTTGCGTGTGTGAAAGAGGGTTCAATACGACGACGGACTGCAATAGTAATGATCTCACTCAGGAGTTCTTCTGGAGCTCTGATTCCTCTCCGGCTGCAGTATTCTTTTGTAAGTTGCCGCCATTCATATATTTCCTTTTTATTTTAGTTTTTTTTTTCGTAATATTTTTCTGAACTTTTGAAAAGGTAAATATATAAGTATAACATGTATTTATTAGATATAAAAACTAATAAAATAGAATAGACTTTAAAAGTTGCTGTTTTTTTTTTTTAAAAAAAAAAGAAGAATAGACTTTAAAGTTATTATTATTATTATTTTAGTTTCTATGTTTTTAAATATTTTATTATATCTTTTTTAGTAGAAATATTTTATTATATCTGAATCTTATAAAACAATATATATATATATATATATTTAATTATGACCATTAATTTTCTAGTCATTTAGTAATAACTTAACTTAGAACCCGTTTGATAACCATTTCGTTTTTTGTTTTTGAAAAAAACAACTTCTATCCACAACTTTCTTTGTTTTGTTATCTACATCCTTCCATTGTTTTCAAAAATCAAGTCAAATTTTGAAAAAAAAAAAAAAAATAGTTTTCAAAACTTGTTTTTCTTTTTGGAATTTGACTAAGAACTCAAATGTCTTCTTAACAAAAATAAAAACCATTGAAAGGAAATTGAGGGAAAACAAGCATAATTTTCAAAAACCAAAAGCCAAAAACCAAATGGTTATCAAACGAGACCTAAAAGTGTTTGGAACTTTATATAATAAAGGTCCGTTTGGAATACATTTTAATTTAAATGTTTTTCAAGAATGCATTTTTTAAAAAACATTTTGTATAAAAATTGTTTAGAATTTAAACACTCATATGTTTGGTTGCATTTCCTCACAAATGTTTTTATACACAAGTTATTGTTTACCTAAAAAGTCTTTTTTAAATAGTTGATTTCAAGTGTTTTTCTCAAAATGGTTTATTTATCAATCTCATTTTCTAATAAGTCATTTTTAGTGGTTGCCGAACACTAAATTTTTTTTTCAAAATGATTTATTTTCAAAATTATACACTTGAAAAGTTAAACCAAACACACCCTAAGAGTAGGTTTGGTTTAACTTTCCAAGTGTTTAATTAATTTAATTTTGAAAAAAAATCATTTTGAAAAAATTTCTAGTGTTTGGCAATCACTAAAAATGACTTATTAGAAAATGAGATTGATAAATAAACCATTTTGAAAAAAACACTTGGAACCAACTTTTTAAAGAAGAGTGTTTAGTATACAATAACTTGTGTATGAAAACACCTATGAGGAAATGCAACCAAACATATAAGTGTTTAAATTCTAAATAATTTTTATACAAAATTAATGTATTTTAAAAAATGCATTCTTGAAAAACATTTAAATCAAAATGTATTTCAAACGGACACATTGATTTATATTATGCAACAACCACTAAAATGAATTATTAGGCAAAGTCTAAGTGGTAAATAAGAATAATTGAAATTTTATGCACCAAGATAAAAAGAAGTATATATATATATTTTGAGAAAAAAGAAGAATAAAAATTGAGATTTAGACCCCTTGTCAATTTTTTTTTTTTTTTTTTTGAGATTTAGACCATTATTTCATTGATTTAATGTAATTTTTGTATAAATCATTTTTTTTTTGTTGAAATCATGAATGCAAGCGGAAATAGTTTACTATTTTAATTTTTAAATTATTTTTTAAGAATGATGGTTTGTAAATGTTAGATGTCGTCGATATACATTTGGGGAAAAAAAAAAAAAAACTATCAATACATATTTTTACCGATTTTTATTTTTATTTTTACATGATAGATTATTATTTTTTAGGGGGAAAAAAGATAGAGAAGTTCTTTTCAGGAAAGAAAAAAGAATGATCTCTATTTTAACTGTGTTTTTTTTTGTTTTCTTTAAAAATATTTATGGTGGATACCCTTGCTTGGTACTAGATAGCTATTAAAAATAATGGATTTTTTTATAGGGGATCAGATGCTTATGCTTTTATCATGGAATTTTCTCACAATATATTTCATTTTTGTTAGGGATACACTTAAATTTTTAGTTAAATTCTAAAAACAAAAACAAGTTTTCGAAAACTATTTTAACCGTTTTATAACCATTTGATTGTTGATTTTTGAAAATTATGCTTGTTTTCTCTAAATTCCATACAATAGTTTTCATTTGTGCTAAGGACACATTTGAATTCTTAGCTAAATTCCAAAAACAAAAACAAACTTTTGAAAACTATTTTTTGTTACCCATTTGATGACTACTTTGGTTTTTGAAAATTAAGCCTATAAACACTATTTACACCTATAAGTTTCTTTGTTTTGTTATCTATCATTTTCTTGTTTTCAAAAACCAAGTCAAGTTTTGAAAACTAAAAAAATAGTTTTCAAAAACTTGTGTTTGTTTTTTAAATTTGACTAAGAATTCAAATGTGTCTTTAACAAAAATGAAAACTATTCTAAAGAAATTGGGAGAAAACAAGAATGATTTTAAAAAACGAAAAAAGAAAAACCAAATGGTTATCAAACGAACCTTTAATTTTTAAAACTTGACTTGGTTTTTGAAAACATGGGAAGAAGGTTTAAAAAAAAAAGTTATGGAAGTAGTGTTTATACGCTTAATTTTCGACAATCAAATTGTTATTAAACAGAGTCTAAATGTTTAGGATTGAGTTTATTTAGCTACCAAACTCTTAAAAAGTGTTAATAGGTTTTTAACTTTTCATTTTGTGTTCAATAGATCTCCAAAATTTAAAAGTATGCGGTAGGTTCTTAAATTTTCAATTTAGTGTTTTTTTTTTTGAACCGGCAATTTAGTGTTTAATTGATCATTCATCTATTTGACATTTTTTTAAAATCTACAAACCTAACAGACACAAAATAGAAAGTTTAGGGTCTTGTTAGACATTAAGTTGAATTTTATAATTTGATAGATTAGTTAATTTTTGAAATTTTAAATTTATTAGACACACATTTGAAAATTCATAAATCCATTAGATATTTTTAAAGTTGGAAAACTTATTTGAAACAAATTTTAAAATTAAATCCATTAGATATTTTTAAAGTTGGAAAACTTATTTGACACAAATTTAAAAGTCGAAGAATTTATTAGACACTTCTAAAATTTAGAATCCTTTTAAAAACAATTTTGGAAGTGAATGAACTAAGTTCTAATTTAACATTTTATTTCTTTGTGTTTTCTCTTCAAAAGTTGTAATAGGATAGTAATTTTATTTTCTTAAAAATTAGAATCTTTTTTAATAGGCTTAATTTTTTATATTGTGATATATATATTTAAAATGTTTTAAAATTTGTGCTCATATTTTAAAAATTAGGGGTAGTTTGAAAATGATTATCTTTTCTATTTTTTGTTTTCAAGTTTATTTGTTTTGTAAGCTACCTAAATAAGTTTTTAAGATTATAGCTTATTTTAAAAAATCAAAACATTTGTTTGGACATGCATTTTTTTTTTTTTTAGTTTAAACAATACGTTTAGTTCTAAATTTTGAGGGTTATGTCAATTTAGTCCTTGAACTTTAAAAAGTGTCTAGTAGATTCTTGAATTTTCAATTTCGTGTCTAATAAGTCCCTAACTTTAAAAATATCTAATAGGTCCCCAAATTTTCAATTTTGTATCTAATAAATCTCTCACATATCGAAGAATTTTAAAAATGAACCAATCTATTAGATAAAAATTGAATTTATGCCTATTGTGACCTTAGCTTCTAATTTTGTATCTTATAGGCTTGTAA
mRNA sequence
ATGGAGTTTCTTCGCTTTCTCATCCTTACCTTCCATATTCTCGCTTCCACCTCCATCTCCGGCCACCGCACCGCCTCCTCCGCAGCCCCTCCCTGCCGGAACACATGCGGCGCGCTCACCATCAAGTACCCGTTCGGCACCGGCTACGGCTGCGGCTCGCCGAGATTCTCCTCTCACGTCACCTGCTCCTCCGACGACCGTCTCCTCCTCACCACTCACACCGGCGACTATCCGATCACTTCGATTTCATACTCCGACTCCACCGTCATAATCTCGCCGCCGTCGATGTCGACCTGCTCTAAGATGCAGGAATCGAAAACCCTAGGCATCGATTGGACTGGCCCCTTCCAACTCGGCTCATCGGTGTTCCTCCTCCTCGATTGCGAATCTCCCTCAGCCTCCCTCTCGATTCGAGGCTCCGCCGTGTGCGATTTGTCGTACTCTCACCTCTGCGCTTCGATCTACGCCTGTCCGGCCGTCGTCGACCTCGGCCTGCCGCTCTTTCCGCCGACGAACTCCTGCTGCGTCTACTCGCCGGCGAACTTCGACGGCGACGGCGAGCTCGACCTGCGCGAGCTCAAGTGCGGCGGATTCTCGTCGGTGGTGTCGCTCGGAGAGTATCAGACGGATCCGGCGCGGTGGGAGTACGGAGTGGAGTTGAAGTACGGATATGGAGCGCTGGAGAGCAGTGCGGTGGAAACCAAATGTACGGGCTGCGAAATGAGCGGCGGCGCGTGTGGATTTGCTCCGCCGGAGAATGCGTTCGTTTGCGTGTGTGAAAGAGGGTTCAATACGACGACGGACTGCAATAGTAATGATCTCACTCAGGAGTTCTTCTGGAGCTCTGATTCCTCTCCGGCTGCAGTATTCTTTTGCTTGTAA
Coding sequence (CDS)
ATGGAGTTTCTTCGCTTTCTCATCCTTACCTTCCATATTCTCGCTTCCACCTCCATCTCCGGCCACCGCACCGCCTCCTCCGCAGCCCCTCCCTGCCGGAACACATGCGGCGCGCTCACCATCAAGTACCCGTTCGGCACCGGCTACGGCTGCGGCTCGCCGAGATTCTCCTCTCACGTCACCTGCTCCTCCGACGACCGTCTCCTCCTCACCACTCACACCGGCGACTATCCGATCACTTCGATTTCATACTCCGACTCCACCGTCATAATCTCGCCGCCGTCGATGTCGACCTGCTCTAAGATGCAGGAATCGAAAACCCTAGGCATCGATTGGACTGGCCCCTTCCAACTCGGCTCATCGGTGTTCCTCCTCCTCGATTGCGAATCTCCCTCAGCCTCCCTCTCGATTCGAGGCTCCGCCGTGTGCGATTTGTCGTACTCTCACCTCTGCGCTTCGATCTACGCCTGTCCGGCCGTCGTCGACCTCGGCCTGCCGCTCTTTCCGCCGACGAACTCCTGCTGCGTCTACTCGCCGGCGAACTTCGACGGCGACGGCGAGCTCGACCTGCGCGAGCTCAAGTGCGGCGGATTCTCGTCGGTGGTGTCGCTCGGAGAGTATCAGACGGATCCGGCGCGGTGGGAGTACGGAGTGGAGTTGAAGTACGGATATGGAGCGCTGGAGAGCAGTGCGGTGGAAACCAAATGTACGGGCTGCGAAATGAGCGGCGGCGCGTGTGGATTTGCTCCGCCGGAGAATGCGTTCGTTTGCGTGTGTGAAAGAGGGTTCAATACGACGACGGACTGCAATAGTAATGATCTCACTCAGGAGTTCTTCTGGAGCTCTGATTCCTCTCCGGCTGCAGTATTCTTTTGCTTGTAA
Protein sequence
MEFLRFLILTFHILASTSISGHRTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSDDRLLLTTHTGDYPITSISYSDSTVIISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLLDCESPSASLSIRGSAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDGELDLRELKCGGFSSVVSLGEYQTDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGFAPPENAFVCVCERGFNTTTDCNSNDLTQEFFWSSDSSPAAVFFCL
Homology
BLAST of Spg029868 vs. NCBI nr
Match:
XP_038903465.1 (uncharacterized protein LOC120090047 [Benincasa hispida])
HSP 1 Score: 518.1 bits (1333), Expect = 5.0e-143
Identity = 246/292 (84.25%), Postives = 264/292 (90.41%), Query Frame = 0
Query: 1 MEFLRFLILTFHILASTSISGHRTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHV 60
MEF LI TFHIL S SIS HR+A + +PPCR TCG L+IKYPFGTGYGCGSPRFSS V
Sbjct: 16 MEFFHSLIFTFHILTSISISAHRSA-AVSPPCRTTCGTLSIKYPFGTGYGCGSPRFSSQV 75
Query: 61 TCSSDDRLLLTTHTGDYPITSISYSDSTVIISPPSMSTCSKMQE-SKTLGIDWTGPFQLG 120
TCSSDDR LL THTG+YPITSISYSDSTV+ISPPSMSTCSKM E S+TLGIDWTGPFQLG
Sbjct: 76 TCSSDDRFLLNTHTGNYPITSISYSDSTVVISPPSMSTCSKMHESSQTLGIDWTGPFQLG 135
Query: 121 SSVFLLLDCESPSASLSIRGSAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVYSP 180
SS FLLLDC+SPS SLSIRGS VCDLSY+HLCASIY+CP+VVDLGLPLFPPTNSCCVYSP
Sbjct: 136 SSAFLLLDCKSPSVSLSIRGSGVCDLSYAHLCASIYSCPSVVDLGLPLFPPTNSCCVYSP 195
Query: 181 ANFDGDGELDLRELKCGGFSSVVSLGEYQTDPARWEYGVELKYGYGALESSAVETKCTGC 240
ANFDG GELDLRELKCGGFSS+VSLGEY+T+P RWEYGVELKYGYGALE+S +E+KC GC
Sbjct: 196 ANFDGKGELDLRELKCGGFSSIVSLGEYETNPMRWEYGVELKYGYGALENSVMESKCKGC 255
Query: 241 EMSGGACGFAPPENAFVCVCERGFNTTTDCNSNDLTQEFFWSSDSSPAAVFF 292
EMSGGACGF PPEN FVCVCERGFNTTTDC SNDLTQEFFWSS SSP AV+F
Sbjct: 256 EMSGGACGFTPPENLFVCVCERGFNTTTDCKSNDLTQEFFWSSASSPVAVWF 306
BLAST of Spg029868 vs. NCBI nr
Match:
XP_021912082.1 (uncharacterized protein LOC110825865 isoform X3 [Carica papaya])
HSP 1 Score: 370.5 bits (950), Expect = 1.3e-98
Identity = 168/259 (64.86%), Postives = 206/259 (79.54%), Query Frame = 0
Query: 30 PPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSD-DRLLLTTHTGDYPITSISYSDST 89
PPC +TCG+L +KYPFG+GYGCGSPRF +V+CS+D ++LLLTTHTG YPITSISYS ST
Sbjct: 34 PPCHDTCGSLQLKYPFGSGYGCGSPRFYPYVSCSTDSNQLLLTTHTGSYPITSISYSTST 93
Query: 90 VIISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLLDCESPSASLSIRGSAVCDLSYS 149
+ I+PPSMSTC+ MQ+S LG+DW PFQLG S FLLL C P+++L+++GS +CD S +
Sbjct: 94 LTITPPSMSTCTSMQQSPNLGLDWASPFQLGPSTFLLLACAPPTSALTVKGSPICDASSA 153
Query: 150 HLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDGELDLRELKCGGFSSVVSLGEYQ 209
HLCASIYACPAV+ LGLPLFPPTN+CCVYSPANF+G GEL++R L C GF+SVVS+G+Y
Sbjct: 154 HLCASIYACPAVIGLGLPLFPPTNTCCVYSPANFNGKGELEVRALGCSGFASVVSVGDYP 213
Query: 210 TDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGFAP--PENAFVCVCERGFNTT 269
TDP RWEYGV LKY +GA +S ++ KC CEM GG CG++P E + VCVC G NTT
Sbjct: 214 TDPTRWEYGVALKYTHGAFDSFYMDEKCNTCEMRGGVCGYSPGGGEKSSVCVCRNGINTT 273
Query: 270 TDCNSNDLTQEFFWSSDSS 286
TDCN N Q + WSS SS
Sbjct: 274 TDCNKNTQDQNYLWSSTSS 292
BLAST of Spg029868 vs. NCBI nr
Match:
XP_021912081.1 (uncharacterized protein LOC110825865 isoform X2 [Carica papaya])
HSP 1 Score: 370.5 bits (950), Expect = 1.3e-98
Identity = 168/259 (64.86%), Postives = 206/259 (79.54%), Query Frame = 0
Query: 30 PPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSD-DRLLLTTHTGDYPITSISYSDST 89
PPC +TCG+L +KYPFG+GYGCGSPRF +V+CS+D ++LLLTTHTG YPITSISYS ST
Sbjct: 34 PPCHDTCGSLQLKYPFGSGYGCGSPRFYPYVSCSTDSNQLLLTTHTGSYPITSISYSTST 93
Query: 90 VIISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLLDCESPSASLSIRGSAVCDLSYS 149
+ I+PPSMSTC+ MQ+S LG+DW PFQLG S FLLL C P+++L+++GS +CD S +
Sbjct: 94 LTITPPSMSTCTSMQQSPNLGLDWASPFQLGPSTFLLLACAPPTSALTVKGSPICDASSA 153
Query: 150 HLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDGELDLRELKCGGFSSVVSLGEYQ 209
HLCASIYACPAV+ LGLPLFPPTN+CCVYSPANF+G GEL++R L C GF+SVVS+G+Y
Sbjct: 154 HLCASIYACPAVIGLGLPLFPPTNTCCVYSPANFNGKGELEVRALGCSGFASVVSVGDYP 213
Query: 210 TDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGFAP--PENAFVCVCERGFNTT 269
TDP RWEYGV LKY +GA +S ++ KC CEM GG CG++P E + VCVC G NTT
Sbjct: 214 TDPTRWEYGVALKYTHGAFDSFYMDEKCNTCEMRGGVCGYSPGGGEKSSVCVCRNGINTT 273
Query: 270 TDCNSNDLTQEFFWSSDSS 286
TDCN N Q + WSS SS
Sbjct: 274 TDCNKNTQDQNYLWSSTSS 292
BLAST of Spg029868 vs. NCBI nr
Match:
XP_021912080.1 (uncharacterized protein LOC110825865 isoform X1 [Carica papaya])
HSP 1 Score: 370.5 bits (950), Expect = 1.3e-98
Identity = 168/259 (64.86%), Postives = 206/259 (79.54%), Query Frame = 0
Query: 30 PPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSD-DRLLLTTHTGDYPITSISYSDST 89
PPC +TCG+L +KYPFG+GYGCGSPRF +V+CS+D ++LLLTTHTG YPITSISYS ST
Sbjct: 34 PPCHDTCGSLQLKYPFGSGYGCGSPRFYPYVSCSTDSNQLLLTTHTGSYPITSISYSTST 93
Query: 90 VIISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLLDCESPSASLSIRGSAVCDLSYS 149
+ I+PPSMSTC+ MQ+S LG+DW PFQLG S FLLL C P+++L+++GS +CD S +
Sbjct: 94 LTITPPSMSTCTSMQQSPNLGLDWASPFQLGPSTFLLLACAPPTSALTVKGSPICDASSA 153
Query: 150 HLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDGELDLRELKCGGFSSVVSLGEYQ 209
HLCASIYACPAV+ LGLPLFPPTN+CCVYSPANF+G GEL++R L C GF+SVVS+G+Y
Sbjct: 154 HLCASIYACPAVIGLGLPLFPPTNTCCVYSPANFNGKGELEVRALGCSGFASVVSVGDYP 213
Query: 210 TDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGFAP--PENAFVCVCERGFNTT 269
TDP RWEYGV LKY +GA +S ++ KC CEM GG CG++P E + VCVC G NTT
Sbjct: 214 TDPTRWEYGVALKYTHGAFDSFYMDEKCNTCEMRGGVCGYSPGGGEKSSVCVCRNGINTT 273
Query: 270 TDCNSNDLTQEFFWSSDSS 286
TDCN N Q + WSS SS
Sbjct: 274 TDCNKNTQDQNYLWSSTSS 292
BLAST of Spg029868 vs. NCBI nr
Match:
KAF8020139.1 (hypothetical protein BT93_G0748 [Corymbia citriodora subsp. variegata])
HSP 1 Score: 368.2 bits (944), Expect = 6.4e-98
Identity = 171/268 (63.81%), Postives = 208/268 (77.61%), Query Frame = 0
Query: 23 RTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTC---SSDDRLLLTTHTGDYPI 82
R A S P CRNTCG+L +KYPFGTG GCGSPRF +VTC D L+L THTG YP+
Sbjct: 18 RPAESPDPACRNTCGSLAVKYPFGTGSGCGSPRFHPYVTCVHVEDHDDLVLATHTGSYPV 77
Query: 83 TSISYSDSTVIISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLLDCESPSASLSIRG 142
TSISY+ ST+IISPP+MSTC+ M+ S LG+DW+GPF+LGSS F+LL CE P+ASL+I
Sbjct: 78 TSISYTGSTLIISPPNMSTCTSMKPSPNLGLDWSGPFELGSSTFVLLHCEPPTASLTIND 137
Query: 143 SAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDGELDLRELKCGGFS 202
S++CDLS S+LCAS+YACPAV++LGLPLF PTN+CCVYSPANFD GELDL+ LKCGG++
Sbjct: 138 SSICDLSSSNLCASVYACPAVLNLGLPLFSPTNTCCVYSPANFDSKGELDLQALKCGGYA 197
Query: 203 SVVSLGEYQTDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGFAPPENAFVCVC 262
V+SLG+ TDP WEYGVELKY GAL+S+ ++TKC CE SGG CG+ P AFVCVC
Sbjct: 198 PVMSLGDDPTDPTGWEYGVELKYTQGALDSNHMDTKCPTCENSGGVCGYEPASLAFVCVC 257
Query: 263 ERGFNTTTDCNSNDLTQEFFWSSDSSPA 288
G+NTTTDC+ + Q WSS S P+
Sbjct: 258 AGGYNTTTDCHPYNQVQGIMWSSASFPS 285
BLAST of Spg029868 vs. ExPASy TrEMBL
Match:
A0A2I4HW38 (uncharacterized protein LOC109022025 OS=Juglans regia OX=51240 GN=LOC109022025 PE=4 SV=1)
HSP 1 Score: 364.8 bits (935), Expect = 3.4e-97
Identity = 171/298 (57.38%), Postives = 220/298 (73.83%), Query Frame = 0
Query: 9 LTFHILASTSISGHRTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSD-DR 68
+ FH+L + T + P C+ TCG++ +KYPFGTG GCGSPRF +VTC SD D+
Sbjct: 8 ILFHLLPVILFTAQITL-APDPVCKGTCGSVQVKYPFGTGTGCGSPRFQPYVTCKSDEDQ 67
Query: 69 LLLTTHTGDYPITSISYSDSTVIISPPSMSTCSKMQE-SKTLGIDWTGPFQLGSSVFLLL 128
L+ TTHTG YPITSISY+ ST+IISP MSTC M++ S LG+DW PFQLG S F+L+
Sbjct: 68 LVFTTHTGSYPITSISYTTSTLIISPSDMSTCISMRQCSSILGVDWASPFQLGPSTFILI 127
Query: 129 DCESPSASLSIRGSAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDG 188
C P++SL+++G+ +CD SYSHLCAS+Y CPAV+ LGLPLFPPTNSCCVYSPAN +G G
Sbjct: 128 SCTPPTSSLTLKGTPICDPSYSHLCASLYTCPAVLSLGLPLFPPTNSCCVYSPANLNGKG 187
Query: 189 ELDLRELKCGGFSSVVSLGEYQTDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGAC 248
ELDLR+LKC G++SV+SLG Y TDP++W+YGV LKY G L+S+ ++TKC CEMSGG C
Sbjct: 188 ELDLRKLKCEGYASVLSLGMYPTDPSQWQYGVALKYVNGVLDSNIIDTKCNACEMSGGVC 247
Query: 249 GFAPPENAFVCVCERGFNTTTDCNSNDLTQE--FFWSSDSSP---------AAVFFCL 294
G+APP N+FVC+C+ G+NT+ DCN+N + FWS+ S P A + FCL
Sbjct: 248 GYAPPSNSFVCICKNGYNTSMDCNNNYGQGQLGIFWSTASLPTGELWFGFLAGLIFCL 304
BLAST of Spg029868 vs. ExPASy TrEMBL
Match:
A0A061DHT7 (Membrane lipoprotein OS=Theobroma cacao OX=3641 GN=TCM_001086 PE=4 SV=1)
HSP 1 Score: 358.6 bits (919), Expect = 2.5e-95
Identity = 172/284 (60.56%), Postives = 207/284 (72.89%), Query Frame = 0
Query: 9 LTFHILASTSISGHRTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSDDRL 68
L F I T + T+S+ CR+TCG+L +KYPFGTGYGCGSPRF +VTCSS DRL
Sbjct: 6 LPFIIFLITLFTPLHTSSAPGSACRSTCGSLQVKYPFGTGYGCGSPRFQPYVTCSS-DRL 65
Query: 69 LLTTHTGDYPITSISYSDSTVIISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLLDC 128
LLTTHTG YPITS+SY DST+II+PP MSTCS MQ+S LG+DW PFQLG S+FLLL C
Sbjct: 66 LLTTHTGSYPITSVSYKDSTLIITPPYMSTCSSMQQSPNLGLDWASPFQLGPSIFLLLSC 125
Query: 129 ESPSASLSIRGSAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDGEL 188
P++SL+I+GS VCD S S LCASIY+CP+VV LGL LFPPTN+CCVYSPANF+ GEL
Sbjct: 126 TPPTSSLTIKGSPVCDPSSSDLCASIYSCPSVVSLGLHLFPPTNTCCVYSPANFNSKGEL 185
Query: 189 DLRELKCGGFSSVVSLGEYQTDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGF 248
DLR +KC G++S+ S + TDP+RW+YGV LKY GA + + KC CE SGG CG+
Sbjct: 186 DLRAMKCIGYASIASFQDSPTDPSRWQYGVTLKYTNGAFDDYYMNNKCNTCEDSGGLCGY 245
Query: 249 APPENAFVCVCERGFNTTTDCNSN------DLTQEFFWSSDSSP 287
+PP N+FVC C+ GFNTTT C +N D Q WSS P
Sbjct: 246 SPPSNSFVCACKNGFNTTTACYNNYNPIQDDEDQGITWSSTLLP 288
BLAST of Spg029868 vs. ExPASy TrEMBL
Match:
A0A5N5LZF2 (Uncharacterized protein OS=Salix brachista OX=2182728 GN=DKX38_011586 PE=4 SV=1)
HSP 1 Score: 357.5 bits (916), Expect = 5.5e-95
Identity = 165/261 (63.22%), Postives = 205/261 (78.54%), Query Frame = 0
Query: 31 PCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSD-DRLLLTTHTGDYPITSISYSDSTV 90
PCR TCG++ +KYPFG+G+GCGSPRF ++ CS + D+LLLTTHTG YP+TSISY+ ST+
Sbjct: 33 PCRTTCGSIQVKYPFGSGHGCGSPRFHPYIACSPEGDQLLLTTHTGSYPVTSISYTTSTL 92
Query: 91 IISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLLDCESPSASLSIRGSAVCDLSYSH 150
II+PP MSTC+ MQ+S LGIDW PFQLG S FLLL C P++SL+I+GS VCD S SH
Sbjct: 93 IITPPQMSTCTSMQQSPGLGIDWASPFQLGQSTFLLLSCTPPTSSLTIKGSPVCDTS-SH 152
Query: 151 LCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDGELDLRELKCGGFSSVVSLGEYQT 210
LCASIY CP+V+ LGLPLFPPTN+CCVYSPANF+ GELDL++LKC G++SVVSL EY T
Sbjct: 153 LCASIYTCPSVIGLGLPLFPPTNTCCVYSPANFNSKGELDLQKLKCMGYASVVSLQEYPT 212
Query: 211 DPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGFAPPENAFVCVCERGFNTTTDC 270
DP+RW+YGVEL+ GAL+ + KC CE+SGG CG+APP N+FVC+C NTT DC
Sbjct: 213 DPSRWQYGVELQSRGGALDDYYTDNKCNACEISGGLCGYAPPVNSFVCLCSDTINTTIDC 272
Query: 271 NSNDLTQ--EFFWSSDSSPAA 289
+SN L E W+S S P++
Sbjct: 273 HSNSLQNQAELTWNSVSLPSS 292
BLAST of Spg029868 vs. ExPASy TrEMBL
Match:
A0A6J0ZKD8 (uncharacterized protein LOC110409598 OS=Herrania umbratica OX=108875 GN=LOC110409598 PE=4 SV=1)
HSP 1 Score: 357.1 bits (915), Expect = 7.2e-95
Identity = 174/302 (57.62%), Postives = 215/302 (71.19%), Query Frame = 0
Query: 7 LILTFHILASTSISGHRTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSDD 66
L L F +L T + +T+S+ C++TCG+L +KYPFGTGYGCGSPRF +VTCSS D
Sbjct: 4 LPLPFIVLLITLFTPLQTSSAPGSACQSTCGSLQVKYPFGTGYGCGSPRFQPYVTCSS-D 63
Query: 67 RLLLTTHTGDYPITSISYSDSTVIISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLL 126
LLLTTHTG YPITS+SY DST+ I+PP MSTCS MQ+S LG+DW PFQLG S+FLLL
Sbjct: 64 HLLLTTHTGSYPITSVSYKDSTLTITPPYMSTCSSMQQSPNLGLDWASPFQLGPSIFLLL 123
Query: 127 DCESPSASLSIRGSAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDG 186
C P++SL+I+GS VCD S S LCASIY+CP+VV LGL LFPPTN+CCVYSPANF+ G
Sbjct: 124 SCTPPTSSLTIKGSPVCDPSSSDLCASIYSCPSVVSLGLHLFPPTNTCCVYSPANFNSKG 183
Query: 187 ELDLRELKCGGFSSVVSLGEYQTDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGAC 246
ELDLR +KC G++S+ S + TDP+RW+YGV LKY GA + + KC CE SGG C
Sbjct: 184 ELDLRAMKCIGYASIASFQDSPTDPSRWQYGVTLKYTNGAFDDYDMNNKCNTCEDSGGLC 243
Query: 247 GFAPPENAFVCVCERGFNTTTDCNSN------DLTQEFFWSSDSSP---------AAVFF 294
G++PP ++FVC C+ GFNT+T C +N D QE WSS S P A + F
Sbjct: 244 GYSPPSDSFVCACKNGFNTSTACYNNYNPFQDDEDQEITWSSTSLPTWKIWLGLLAGLTF 303
BLAST of Spg029868 vs. ExPASy TrEMBL
Match:
A0A059AMD5 (Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_I00510 PE=4 SV=1)
HSP 1 Score: 356.7 bits (914), Expect = 9.4e-95
Identity = 163/268 (60.82%), Postives = 206/268 (76.87%), Query Frame = 0
Query: 23 RTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTC---SSDDRLLLTTHTGDYPI 82
R A S P CRN CG+LT+KYPFGTGYGCGSPRF +VTC DD L+LTTHTG YP+
Sbjct: 18 RPAESPNPACRNACGSLTVKYPFGTGYGCGSPRFHPYVTCVHGQDDDTLVLTTHTGSYPV 77
Query: 83 TSISYSDSTVIISPPSMSTCSKMQESKTLGIDWTGPFQLGSSVFLLLDCESPSASLSIRG 142
TSISY+ S++IISP MSTC+ M+ S LG+DW+GPF+LGSS F+LL C+SP+ SL+I+
Sbjct: 78 TSISYTTSSLIISPSDMSTCTSMKPSPNLGLDWSGPFELGSSTFVLLHCKSPNTSLTIKD 137
Query: 143 SAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVYSPANFDGDGELDLRELKCGGFS 202
++CD S+LCAS+YACPA++ LGLPLF PT++CCVYSPANFD GELDL+ +C G++
Sbjct: 138 LSICDHGSSNLCASMYACPAILALGLPLFSPTDTCCVYSPANFDSKGELDLQAWECDGYA 197
Query: 203 SVVSLGEYQTDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGFAPPENAFVCVC 262
V++LG+Y TDPARWEYGVELKY GA +S+ ++TKC CE SGG CG+ P AFVCVC
Sbjct: 198 PVLTLGDYPTDPARWEYGVELKYTQGAFDSNHMDTKCPTCENSGGVCGYDSPSLAFVCVC 257
Query: 263 ERGFNTTTDCNSNDLTQEFFWSSDSSPA 288
G+NT+TDC+ + Q WSS S P+
Sbjct: 258 GGGYNTSTDCHPYNEVQGIIWSSASFPS 285
BLAST of Spg029868 vs. TAIR 10
Match:
AT1G10380.1 (Putative membrane lipoprotein )
HSP 1 Score: 199.1 bits (505), Expect = 4.8e-51
Identity = 105/248 (42.34%), Postives = 143/248 (57.66%), Query Frame = 0
Query: 32 CRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSDDR-LLLTTHTGDYPITSISYSDSTVI 91
C+ TCG + IKYP GTG GCG PRF+ ++TC D + L LTTHTG YPITS+ Y+ +
Sbjct: 29 CQKTCGQIPIKYPLGTGSGCGDPRFTRYITCDPDQQTLTLTTHTGSYPITSVDYAKQEIY 88
Query: 92 ISPPSMSTCSKMQESKTLGIDWTGPFQL-GSSVFLLLDC---ESPSASLSIRGS---AVC 151
++ PSMSTC+ + S G+DW PF +VF LLDC ESP + GS ++C
Sbjct: 89 VTDPSMSTCACTRPSHGFGLDWDAPFSFHDDTVFTLLDCSVDESPVFTPLSNGSGRVSLC 148
Query: 152 DLSYSHLCASIYA-CPAVVDLGLPLFPPTNSCCVYSPANFDGDGELDLRELKCGGFSSVV 211
D S +C +Y+ C A+ + L + ++CCVY P + E+DL +LKC +S
Sbjct: 149 DRQSSSICTFLYSNCRAISLINLQV----STCCVYVPLDLGPSFEMDLNKLKCSSYSGFY 208
Query: 212 SLGEYQ-TDPARWEYGVELKYGYGALESSAVETKCTGCEMSGGACGFAPPENAFVCVCER 270
+LG Q + P W YG+ LKY + + C CE S GACGF ++FVC C
Sbjct: 209 NLGPGQESHPENWNYGIALKYKFNVFDE--YPGVCGSCERSNGACGFNTQSSSFVCNCPG 268
BLAST of Spg029868 vs. TAIR 10
Match:
AT3G17350.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G50290.1); Has 203 Blast hits to 203 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 203; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 121.7 bits (304), Expect = 9.8e-28
Identity = 86/276 (31.16%), Postives = 126/276 (45.65%), Query Frame = 0
Query: 1 MEFLRFLILTFHILASTSISGHRTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHV 60
M L F +F + T+++ +SAA CR CG + I YPFG GCGSP++
Sbjct: 1 MTKLFFFFFSF-LYTITTLTFPPLTTSAATSCRTLCGNIPINYPFGIDGGCGSPQYRGMF 60
Query: 61 TCSSDDRLLLTTHTGDYPITSISYSDSTVIISPPSMSTCSKMQ---ESKTLGIDWTGPFQ 120
CS+D L TT +G Y + SI Y T++I P+MSTCS +Q + K I T
Sbjct: 61 NCSTD--LYFTTPSGSYKVQSIDYEKKTMVIFDPAMSTCSILQPHHDFKMADIQNTLIRP 120
Query: 121 LGSSVFLLLDCESPSASLSIRGSAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVY 180
+VF L +C + S + R +C + H C +Y+ + P N+ V+
Sbjct: 121 SYDTVFALFNCSNDS-PVHNRYRNLCFNAAGHSCDELYSSCTSFRIFNTTSPYGNNSTVH 180
Query: 181 SP-----ANFDGDGELDLRELKCGGFSSVVSLGEYQ-TDPARWEYGVELKYGYGALESSA 240
+ N+D + + L C +++V+ G+ + P W YG+EL Y S
Sbjct: 181 TTPYCCFTNYDTVRVMSMNILDCSHYTTVIDNGKMRGVGPLDWSYGIELSY-------SV 240
Query: 241 VETKCTGCEMSGGACGFAPPENAFVCVCERGFNTTT 268
E C C SGG CGF F+C C N T
Sbjct: 241 TEIGCDRCRKSGGTCGFDAETEIFLCQCSGSNNNPT 265
BLAST of Spg029868 vs. TAIR 10
Match:
AT1G11915.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: root; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17350.1); Has 261 Blast hits to 261 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 261; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 103.6 bits (257), Expect = 2.8e-22
Identity = 88/286 (30.77%), Postives = 126/286 (44.06%), Query Frame = 0
Query: 9 LTFHILASTSISGHRTASSAAPPCRNTCGALTIKYPFGTGYGCGSPRFSSHVTCSSDD-R 68
+ F L T + T SS + CR++CG + I YPF GCGSP + + CS +D +
Sbjct: 8 IIFFSLLMTILLQSSTTSSQSNLCRSSCGNIPINYPFSIDDGCGSPYYRHMLICSDNDTK 67
Query: 69 LLLTTHTGDYPITSISYSDSTVIISPPSMSTC---SKMQESKTLGIDWTGPFQLG-SSVF 128
L L T +G YP+ SISYSD +++S P M C + +++ ID + F + + +
Sbjct: 68 LELRTPSGKYPVKSISYSDPHLLVSDPFMWNCQDRDNFRPTRSFSIDSSTHFTVSPQNDY 127
Query: 129 LLLDCES------PSASLSIRGSAVCDLSYSHLCASIYACPAVVDLGLPLFPPTNSCCVY 188
L +C + P R CD S +S Y C + + G L SCC Y
Sbjct: 128 LFFNCNTDKVIVEPKPLFCERFPDRCDSSCD---SSSYLCRHLPECGSALGSRV-SCCSY 187
Query: 189 SPANFDGDGELDLRELKCGGFSSV------VSLGEYQTDPARWEYGVELKYGYGALESSA 248
P L L C ++SV V Y P EYG+ + Y +
Sbjct: 188 YP---KATQSLRLMLQDCATYTSVYWRSTGVENAPYDQFP---EYGIRVDYEF------P 247
Query: 249 VETKCTGCE---MSGGACGFAPPENAFVCVCERGFNTTTDCNSNDL 275
V KC C+ GG CGF F+C+C++G N TT C L
Sbjct: 248 VTMKCLLCQETTKGGGVCGFNTRTRDFLCLCKQG-NVTTYCKDPSL 276
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038903465.1 | 5.0e-143 | 84.25 | uncharacterized protein LOC120090047 [Benincasa hispida] | [more] |
XP_021912082.1 | 1.3e-98 | 64.86 | uncharacterized protein LOC110825865 isoform X3 [Carica papaya] | [more] |
XP_021912081.1 | 1.3e-98 | 64.86 | uncharacterized protein LOC110825865 isoform X2 [Carica papaya] | [more] |
XP_021912080.1 | 1.3e-98 | 64.86 | uncharacterized protein LOC110825865 isoform X1 [Carica papaya] | [more] |
KAF8020139.1 | 6.4e-98 | 63.81 | hypothetical protein BT93_G0748 [Corymbia citriodora subsp. variegata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A2I4HW38 | 3.4e-97 | 57.38 | uncharacterized protein LOC109022025 OS=Juglans regia OX=51240 GN=LOC109022025 P... | [more] |
A0A061DHT7 | 2.5e-95 | 60.56 | Membrane lipoprotein OS=Theobroma cacao OX=3641 GN=TCM_001086 PE=4 SV=1 | [more] |
A0A5N5LZF2 | 5.5e-95 | 63.22 | Uncharacterized protein OS=Salix brachista OX=2182728 GN=DKX38_011586 PE=4 SV=1 | [more] |
A0A6J0ZKD8 | 7.2e-95 | 57.62 | uncharacterized protein LOC110409598 OS=Herrania umbratica OX=108875 GN=LOC11040... | [more] |
A0A059AMD5 | 9.4e-95 | 60.82 | Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_I00510 PE=4 SV... | [more] |
Match Name | E-value | Identity | Description | |
AT1G10380.1 | 4.8e-51 | 42.34 | Putative membrane lipoprotein | [more] |
AT3G17350.1 | 9.8e-28 | 31.16 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT1G11915.1 | 2.8e-22 | 30.77 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |