Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TACAAAGACAAGGCTTTTCAATATATCTCTGCCTCTAATTTTTATATTCTATTTGGATTTTCATTTTCTTAGCTTTTCAAAACAATTATGTCAAATTTTCCTCGCTTTGGCCGTACACGTCAACGTCCTTCTGTGGTGGCGCCGCCAGTTCTCGCCGCCGCACAGCCTGCTGCAGAACCGAAGCCTGAATTTCTACCGATTGCTCCGGCCACCCAAACTCTTCAACCTTTAGACCCCAACCTGGCGGTGCCGGCGCCGGCTTCTCCTCTGATTGTGTCCCCTCGCCAGCTTCCCTCTCCGGTGAAGAAAGCCATGTCACCCTTTGCTTCGCCGAGATATGGTGGTCCTGCCACACGTGTGCCCAGCCCGCCGCCGGCGAGGGTCACACGATCGCCTCCGATTTCGCCTGCAAAGAAATATCCCGAAAGGAGAATTGGCCAAACAAGCCCACCTCACTCGCCGGCAAAGTCCCGGCGAACGACTCCGCCGCCTTCTCCTCTCGCTCTGCCTCGTACCCAGGTCACCGCCGTAAATGGGACCACGACTCAGCCCAGGTATTAAAGTTCAATTTTTGTTTTTCTTTCATGTTTTGAGTACGAGTGGATAAATTACTTGGTAATAATATCTTAATATAAATTCGATCTATTTCTCTCTTCATTTAAACGCCGGACATGTGGTTAGTCTAATAATTTCTCGTAAAATTGAGATTTAATACTTTAATATATGTCAAAGATGTAATATTTTAATTCATTAAATCTATGCTCAACATGATATGTTCAATTTAATATAGATGAAATTGAAAATTTGGTCACTAAAATTGTTAGGAGTTTAGATTTAGTTTTAATTTGGTCTTGGTAATTTTAAAGTTTTAAACGGGTTCATTTCATCTATTTTAATTTTTAAAATTAATTGTCATTGATTTAAGCTTTGTTTCAATAATTCGTTTCTAAATTAAGTTTACCAACTCTATTGTCACATTAAAAAAAAAAAGGTCATTATGATAACAAAATGAATCATTTGTAAAGTTTAGTTCAATTATAGAAACTAAATTAAAATTAAACTCAAATTATAAGTAAAATAGAATTTTAGAATTTAAACAACGTAGACTAAAATTTTTGGGATTTCAACGTCTTAATTTCAGGCAAATTAAAGGTATATCTCTTCACTAGAACCAAACCTTATTAAAAAAAATGAATTATTTATTCAAATTTTCAATTCTAGGATTCAGCCGGAGGTGGAGCAGAAAAACATTGTATACAACAAGACCGTCGAGAGGCCGGCGAAGTCTGATCGGCCGTCGGAGTACGGATCCGGCAAGCCACCGCAGAGGGAGCAGGCGGAGGTTATAAACCTCACCGGACATAACGTCGGCGCCGTCATGGAAGTTAATCAGTCCTCCCCCGGCCACCGTTTGGGGGGAGAAACTACCAAAAAGAAGGAAACAGAAGGCGGCGGCGGCTGCGGCGTCCGCCATGGAAATGATCAGAAGAAAACGGAGGCAAAGAAGGAACTCCCGATGACGGCATTTATGAACAGCAACTTTCAGAGTGTAAACAATTCGGTTCTGTACGACTCGTCGTGCAACCACCGTGATCCCGGTCTGCATCTCGCGTTTTCCGATGCGGCGGACGGCGACGGAGCCACTGTTGACGGCCGTAAGAATTACAAGCCATAGAGATAATGGCTGGAGAACTATTTAATCATAAGAAATATGCCATTTACTTGGTGTTTAAAAAAAGCCATTTACTCCTCCTTTTTTTTTTGAAATAAAAAAACCATTTACTTGGAATCGTATTTTAATCGATATGTAAGCATTAATAAAATATATATTATGATTAATTTCCTATTT
mRNA sequence
TACAAAGACAAGGCTTTTCAATATATCTCTGCCTCTAATTTTTATATTCTATTTGGATTTTCATTTTCTTAGCTTTTCAAAACAATTATGTCAAATTTTCCTCGCTTTGGCCGTACACGTCAACGTCCTTCTGTGGTGGCGCCGCCAGTTCTCGCCGCCGCACAGCCTGCTGCAGAACCGAAGCCTGAATTTCTACCGATTGCTCCGGCCACCCAAACTCTTCAACCTTTAGACCCCAACCTGGCGGTGCCGGCGCCGGCTTCTCCTCTGATTGTGTCCCCTCGCCAGCTTCCCTCTCCGGTGAAGAAAGCCATGTCACCCTTTGCTTCGCCGAGATATGGTGGTCCTGCCACACGTGTGCCCAGCCCGCCGCCGGCGAGGGTCACACGATCGCCTCCGATTTCGCCTGCAAAGAAATATCCCGAAAGGAGAATTGGCCAAACAAGCCCACCTCACTCGCCGGCAAAGTCCCGGCGAACGACTCCGCCGCCTTCTCCTCTCGCTCTGCCTCGTACCCAGGTCACCGCCGTAAATGGGACCACGACTCAGCCCAGGATTCAGCCGGAGGTGGAGCAGAAAAACATTGTATACAACAAGACCGTCGAGAGGCCGGCGAAGTCTGATCGGCCGTCGGAGTACGGATCCGGCAAGCCACCGCAGAGGGAGCAGGCGGAGGTTATAAACCTCACCGGACATAACGTCGGCGCCGTCATGGAAGTTAATCAGTCCTCCCCCGGCCACCGTTTGGGGGGAGAAACTACCAAAAAGAAGGAAACAGAAGGCGGCGGCGGCTGCGGCGTCCGCCATGGAAATGATCAGAAGAAAACGGAGGCAAAGAAGGAACTCCCGATGACGGCATTTATGAACAGCAACTTTCAGAGTGTAAACAATTCGGTTCTGTACGACTCGTCGTGCAACCACCGTGATCCCGGTCTGCATCTCGCGTTTTCCGATGCGGCGGACGGCGACGGAGCCACTGTTGACGGCCGTAAGAATTACAAGCCATAGAGATAATGGCTGGAGAACTATTTAATCATAAGAAATATGCCATTTACTTGGTGTTTAAAAAAAGCCATTTACTCCTCCTTTTTTTTTTGAAATAAAAAAACCATTTACTTGGAATCGTATTTTAATCGATATGTAAGCATTAATAAAATATATATTATGATTAATTTCCTATTT
Coding sequence (CDS)
ATGTCAAATTTTCCTCGCTTTGGCCGTACACGTCAACGTCCTTCTGTGGTGGCGCCGCCAGTTCTCGCCGCCGCACAGCCTGCTGCAGAACCGAAGCCTGAATTTCTACCGATTGCTCCGGCCACCCAAACTCTTCAACCTTTAGACCCCAACCTGGCGGTGCCGGCGCCGGCTTCTCCTCTGATTGTGTCCCCTCGCCAGCTTCCCTCTCCGGTGAAGAAAGCCATGTCACCCTTTGCTTCGCCGAGATATGGTGGTCCTGCCACACGTGTGCCCAGCCCGCCGCCGGCGAGGGTCACACGATCGCCTCCGATTTCGCCTGCAAAGAAATATCCCGAAAGGAGAATTGGCCAAACAAGCCCACCTCACTCGCCGGCAAAGTCCCGGCGAACGACTCCGCCGCCTTCTCCTCTCGCTCTGCCTCGTACCCAGGTCACCGCCGTAAATGGGACCACGACTCAGCCCAGGATTCAGCCGGAGGTGGAGCAGAAAAACATTGTATACAACAAGACCGTCGAGAGGCCGGCGAAGTCTGATCGGCCGTCGGAGTACGGATCCGGCAAGCCACCGCAGAGGGAGCAGGCGGAGGTTATAAACCTCACCGGACATAACGTCGGCGCCGTCATGGAAGTTAATCAGTCCTCCCCCGGCCACCGTTTGGGGGGAGAAACTACCAAAAAGAAGGAAACAGAAGGCGGCGGCGGCTGCGGCGTCCGCCATGGAAATGATCAGAAGAAAACGGAGGCAAAGAAGGAACTCCCGATGACGGCATTTATGAACAGCAACTTTCAGAGTGTAAACAATTCGGTTCTGTACGACTCGTCGTGCAACCACCGTGATCCCGGTCTGCATCTCGCGTTTTCCGATGCGGCGGACGGCGACGGAGCCACTGTTGACGGCCGTAAGAATTACAAGCCATAG
Protein sequence
MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASPLIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTSPPHSPAKSRRTTPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSDRPSEYGSGKPPQREQAEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGVRHGNDQKKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDGATVDGRKNYKP
Homology
BLAST of Tan0005219 vs. NCBI nr
Match:
XP_038879417.1 (proline-rich receptor-like protein kinase PERK8 [Benincasa hispida])
HSP 1 Score: 367.1 bits (941), Expect = 1.5e-97
Identity = 205/309 (66.34%), Postives = 235/309 (76.05%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
M+N PR GRTRQRPS VAPP+LAA QP AEPKPE P AP + QP++P PAPASP
Sbjct: 1 MANLPRIGRTRQRPSAVAPPLLAAVQPTAEPKPEISPFAPTSIQTQPIEP--TTPAPASP 60
Query: 61 LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
L SPR + SP KKA SPFASP+YG TRV SPP A+ TRSPP SP KY E R G+T+
Sbjct: 61 LRQSPRLITSPPKKATSPFASPKYGDSLTRVDSPPAAKATRSPPDSPTNKYLETRNGETT 120
Query: 121 PPHSPAKSRRT-TPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
PP SPAKSRRT TPP SPL LPRT V + + TT PR QP VE K IVYNK VE+P K+D
Sbjct: 121 PPLSPAKSRRTKTPPLSPLTLPRTPVISWDETTALPRTQPVVETKGIVYNKAVEKPTKTD 180
Query: 181 RPSEYGSGKPPQREQ--AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
RPSEYGSGKP Q++Q AEVINL GHNVGAVME+N+SS G+RLGGET K KET+GG G
Sbjct: 181 RPSEYGSGKPHQKQQATAEVINLNGHNVGAVMEINKSSDGYRLGGETVKNKETKGG---G 240
Query: 241 VRHGNDQKKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDGAT 300
V HG+ +K AK P+TAFMN+NFQS+NNS+LYDSSCNH DPGLHL+ ++ DGDGAT
Sbjct: 241 VHHGHKEKNKGAKIVPPVTAFMNTNFQSINNSILYDSSCNHHDPGLHLSLPESVDGDGAT 300
Query: 301 VDGRKNYKP 307
V G K+YKP
Sbjct: 301 VYGHKSYKP 304
BLAST of Tan0005219 vs. NCBI nr
Match:
KAG6597060.1 (hypothetical protein SDJN03_10240, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 349.0 bits (894), Expect = 4.2e-92
Identity = 198/310 (63.87%), Postives = 230/310 (74.19%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
MSNFPRFG +RQRPS++APPVL + PAA PLDP P P
Sbjct: 1 MSNFPRFGLSRQRPSLMAPPVLPTSLPAA-----------------PLDP----IEPPKP 60
Query: 61 LIVS----PRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRI 120
+++S PR L SP +K SP ASP YG TRVPSPPPA+ SPP+SPAKKYP+R I
Sbjct: 61 VLLSRKSPPRWLTSPEEKPTSPIASPEYGSSVTRVPSPPPAKDQLSPPVSPAKKYPDRWI 120
Query: 121 GQTSPPHSPAKSRRTTPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPA 180
QTSP SP++S RT+PPP P ALP TQ TAVNGTTTQP+IQPEVE+K+ VYNKTVE+PA
Sbjct: 121 AQTSPLQSPSRSLRTSPPPPPFALPPTQFTAVNGTTTQPKIQPEVEKKSFVYNKTVEKPA 180
Query: 181 KSDRPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGG 240
KSD SEYGSGKP ++++ E INL GHNVGAVME+++S+ GHRLGGET +
Sbjct: 181 KSDWTSEYGSGKPHEKQKVTEAINLAGHNVGAVMEIDKSTVGHRLGGETVRG-------- 240
Query: 241 CGVRHGNDQKKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDG 300
GVR GN++KK + KKE+PMTAFMNSNFQSVNNSVLY SSCNHRDPGLHL F+DAADGDG
Sbjct: 241 -GVRDGNEEKKKKEKKEVPMTAFMNSNFQSVNNSVLYGSSCNHRDPGLHLKFADAADGDG 280
Query: 301 ATVDGRKNYK 306
ATVDGRKNYK
Sbjct: 301 ATVDGRKNYK 280
BLAST of Tan0005219 vs. NCBI nr
Match:
XP_022951357.1 (sulfated surface glycoprotein 185-like [Cucurbita moschata])
HSP 1 Score: 344.7 bits (883), Expect = 8.0e-91
Identity = 200/315 (63.49%), Postives = 231/315 (73.33%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
MSNFPRFG +RQRPS+ APPVL PAAE KPE P +Q+LQ
Sbjct: 1 MSNFPRFGLSRQRPSMAAPPVLPTTLPAAELKPETQPFYKRSQSLQ-------------- 60
Query: 61 LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
P +K SP ASP+YG TRVPSPPP + SPP+SPAKKYP+ R+ QTS
Sbjct: 61 ----------PAEKPTSPIASPKYGPSVTRVPSPPPPKDELSPPVSPAKKYPD-RLSQTS 120
Query: 121 PPHSPAKSRRTT--PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKS 180
P SP++S RT+ PPP PLALP TQ TAVN TTTQPRIQPEVE+K+IVYNKTVE+ KS
Sbjct: 121 PLQSPSRSLRTSPPPPPPPLALPPTQFTAVNETTTQPRIQPEVEKKSIVYNKTVEKLVKS 180
Query: 181 DRPSEYGSGKPPQREQA-EVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
DRPSEYGSGKP ++++A E INL GHNVGAVME+++SS GHRLGGET +K ETEGGG
Sbjct: 181 DRPSEYGSGKPYEKQKAVESINLAGHNVGAVMEIDKSSVGHRLGGETVRKNETEGGG--- 240
Query: 241 VRHGNDQKKTEAKKE-------LPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDA 300
R GN++KK E KK+ +PMTAFMNSNFQSVNNSVLYDSSC+HRDPGLHL F+DA
Sbjct: 241 -RDGNEEKKKEEKKKKEKKKKNVPMTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLKFADA 286
Query: 301 ADGDGATVDGRKNYK 306
ADGDGA VDGRK+YK
Sbjct: 301 ADGDGAAVDGRKSYK 286
BLAST of Tan0005219 vs. NCBI nr
Match:
XP_022974816.1 (wiskott-Aldrich syndrome protein family member 2-like [Cucurbita maxima] >XP_022975275.1 wiskott-Aldrich syndrome protein family member 2-like [Cucurbita maxima])
HSP 1 Score: 339.0 bits (868), Expect = 4.4e-89
Identity = 197/312 (63.14%), Postives = 226/312 (72.44%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
MSNFPRFGR RQRPSV APPVL PAAE KPE LP +QTLQ
Sbjct: 1 MSNFPRFGRPRQRPSVAAPPVLPTTLPAAEHKPETLPFYRTSQTLQ-------------- 60
Query: 61 LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
P +K SP ASP+YG T VPSPPPA+ SPP+SPAKKYP+ R+ QTS
Sbjct: 61 ----------PAEKPTSPIASPKYGPSVTLVPSPPPAKDELSPPVSPAKKYPD-RLSQTS 120
Query: 121 PPHSPAKSRRTT-PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
P SP++S RT+ PPP PLALP T TA +G TTQPRIQ EVE+K+IVYNKTVE+P KSD
Sbjct: 121 PLQSPSRSLRTSPPPPPPLALPPTHFTAKDGNTTQPRIQTEVEKKSIVYNKTVEKPVKSD 180
Query: 181 RPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGV 240
RP EYGSGK ++++ AE INL GHNVGAVME+++ S HRLGGET +K +TEGG G
Sbjct: 181 RPLEYGSGKSHEKQKTAESINLAGHNVGAVMEIDKLSASHRLGGETVRKNKTEGGD--GG 240
Query: 241 RHGNDQKKTEAKKE-----LPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADG 300
R GN++KK + KK+ +PMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHL F+DAADG
Sbjct: 241 RDGNEEKKKKEKKDKKKKKVPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLKFADAADG 285
Query: 301 DGATVDGRKNYK 306
DGA VDGRKNYK
Sbjct: 301 DGAAVDGRKNYK 285
BLAST of Tan0005219 vs. NCBI nr
Match:
KAG6597059.1 (hypothetical protein SDJN03_10239, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 337.8 bits (865), Expect = 9.7e-89
Identity = 200/325 (61.54%), Postives = 229/325 (70.46%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
MSNFPRFG +RQRPS+ APPVL PAAE KPE P +Q+LQ
Sbjct: 1 MSNFPRFGLSRQRPSMAAPPVLPTTLPAAELKPETQPFYKRSQSLQ-------------- 60
Query: 61 LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
P +K SP ASP+YG TRVPSPPP + SPP+SPAKKYP+ R+ QTS
Sbjct: 61 ----------PAEKPTSPIASPKYGPSVTRVPSPPPPKDELSPPVSPAKKYPD-RLSQTS 120
Query: 121 PPHSPAKSRRTT-PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
P SP++S RT+ PPP PLALP T TAVN TTTQPRIQPEVE+K+IVYNKTVERP KSD
Sbjct: 121 PLQSPSRSLRTSPPPPPPLALPPTHFTAVNRTTTQPRIQPEVEKKSIVYNKTVERPVKSD 180
Query: 181 RPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGV 240
R SEYGSGKP ++++ AE INL GHNVGAVME+++SS GHRLGGET +K ETEGG
Sbjct: 181 RLSEYGSGKPHEKQKAAESINLAGHNVGAVMEIDKSSVGHRLGGETVRKNETEGGD--VG 240
Query: 241 RHGNDQKKTEAK------------------KELPMTAFMNSNFQSVNNSVLYDSSCNHRD 300
R GN++KK E K K++PMTAFMNSNFQSVNNSVLYDSSC+HRD
Sbjct: 241 RDGNEEKKKEEKKKEEKKKKEKKEKKETKEKKVPMTAFMNSNFQSVNNSVLYDSSCSHRD 298
Query: 301 PGLHLAFSDAADGDGATVDGRKNYK 306
PGLHL F+DAADGDGA VDGRKNYK
Sbjct: 301 PGLHLKFADAADGDGAAVDGRKNYK 298
BLAST of Tan0005219 vs. ExPASy TrEMBL
Match:
A0A6J1GHE5 (sulfated surface glycoprotein 185-like OS=Cucurbita moschata OX=3662 GN=LOC111454206 PE=4 SV=1)
HSP 1 Score: 344.7 bits (883), Expect = 3.8e-91
Identity = 200/315 (63.49%), Postives = 231/315 (73.33%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
MSNFPRFG +RQRPS+ APPVL PAAE KPE P +Q+LQ
Sbjct: 1 MSNFPRFGLSRQRPSMAAPPVLPTTLPAAELKPETQPFYKRSQSLQ-------------- 60
Query: 61 LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
P +K SP ASP+YG TRVPSPPP + SPP+SPAKKYP+ R+ QTS
Sbjct: 61 ----------PAEKPTSPIASPKYGPSVTRVPSPPPPKDELSPPVSPAKKYPD-RLSQTS 120
Query: 121 PPHSPAKSRRTT--PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKS 180
P SP++S RT+ PPP PLALP TQ TAVN TTTQPRIQPEVE+K+IVYNKTVE+ KS
Sbjct: 121 PLQSPSRSLRTSPPPPPPPLALPPTQFTAVNETTTQPRIQPEVEKKSIVYNKTVEKLVKS 180
Query: 181 DRPSEYGSGKPPQREQA-EVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
DRPSEYGSGKP ++++A E INL GHNVGAVME+++SS GHRLGGET +K ETEGGG
Sbjct: 181 DRPSEYGSGKPYEKQKAVESINLAGHNVGAVMEIDKSSVGHRLGGETVRKNETEGGG--- 240
Query: 241 VRHGNDQKKTEAKKE-------LPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDA 300
R GN++KK E KK+ +PMTAFMNSNFQSVNNSVLYDSSC+HRDPGLHL F+DA
Sbjct: 241 -RDGNEEKKKEEKKKKEKKKKNVPMTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLKFADA 286
Query: 301 ADGDGATVDGRKNYK 306
ADGDGA VDGRK+YK
Sbjct: 301 ADGDGAAVDGRKSYK 286
BLAST of Tan0005219 vs. ExPASy TrEMBL
Match:
A0A6J1ICG8 (wiskott-Aldrich syndrome protein family member 2-like OS=Cucurbita maxima OX=3661 GN=LOC111473600 PE=4 SV=1)
HSP 1 Score: 339.0 bits (868), Expect = 2.1e-89
Identity = 197/312 (63.14%), Postives = 226/312 (72.44%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
MSNFPRFGR RQRPSV APPVL PAAE KPE LP +QTLQ
Sbjct: 1 MSNFPRFGRPRQRPSVAAPPVLPTTLPAAEHKPETLPFYRTSQTLQ-------------- 60
Query: 61 LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
P +K SP ASP+YG T VPSPPPA+ SPP+SPAKKYP+ R+ QTS
Sbjct: 61 ----------PAEKPTSPIASPKYGPSVTLVPSPPPAKDELSPPVSPAKKYPD-RLSQTS 120
Query: 121 PPHSPAKSRRTT-PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
P SP++S RT+ PPP PLALP T TA +G TTQPRIQ EVE+K+IVYNKTVE+P KSD
Sbjct: 121 PLQSPSRSLRTSPPPPPPLALPPTHFTAKDGNTTQPRIQTEVEKKSIVYNKTVEKPVKSD 180
Query: 181 RPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGV 240
RP EYGSGK ++++ AE INL GHNVGAVME+++ S HRLGGET +K +TEGG G
Sbjct: 181 RPLEYGSGKSHEKQKTAESINLAGHNVGAVMEIDKLSASHRLGGETVRKNKTEGGD--GG 240
Query: 241 RHGNDQKKTEAKKE-----LPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADG 300
R GN++KK + KK+ +PMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHL F+DAADG
Sbjct: 241 RDGNEEKKKKEKKDKKKKKVPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLKFADAADG 285
Query: 301 DGATVDGRKNYK 306
DGA VDGRKNYK
Sbjct: 301 DGAAVDGRKNYK 285
BLAST of Tan0005219 vs. ExPASy TrEMBL
Match:
A0A5A7TZ24 (Zyxin-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00760 PE=4 SV=1)
HSP 1 Score: 332.8 bits (852), Expect = 1.5e-87
Identity = 201/310 (64.84%), Postives = 226/310 (72.90%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
M+N PRFGR RQR V PPV AAAQPA EP+ + LP A T PAPASP
Sbjct: 1 MANLPRFGRARQRLPAVPPPVPAAAQPAVEPRYQILPFATTITT----------PAPASP 60
Query: 61 LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
SPR L SP KKA SPFASP+YG TR+ P A+ T SPP S KY ERR G+T+
Sbjct: 61 RRESPRPLSSPPKKATSPFASPKYGDSRTRLDRSPAAKATMSPPDSVRDKYFERRNGETT 120
Query: 121 PPHSPAKSRRT-TPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
PP SPAKSRR TPP SPLALPR QV NGTT QPR+QPEVE K IVYNKTVE+P+KS+
Sbjct: 121 PPLSPAKSRRAKTPPLSPLALPRNQVFTGNGTTAQPRVQPEVETKGIVYNKTVEKPSKSN 180
Query: 181 RPS-EYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
R S EYGS K Q++Q EVI L GHNVGAVME+N+SS G+RLGGET KK ETE G
Sbjct: 181 RSSGEYGSSKSHQKKQKPEVIKLKGHNVGAVMEINKSSTGYRLGGETLKKNETEDVGDVH 240
Query: 241 VRHGNDQKKTEA-KKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDGA 300
+G++ KKTE KKE P+TAFMNSNFQSVNNS+L+DSSCNHRDPGLHLAF DA DGDGA
Sbjct: 241 -GYGHEDKKTETKKKEPPITAFMNSNFQSVNNSLLFDSSCNHRDPGLHLAFPDAVDGDGA 299
Query: 301 TVDGRKNYKP 307
VDG+K+YKP
Sbjct: 301 IVDGQKSYKP 299
BLAST of Tan0005219 vs. ExPASy TrEMBL
Match:
A0A1S3AV38 (zyxin-like OS=Cucumis melo OX=3656 GN=LOC103483158 PE=4 SV=1)
HSP 1 Score: 332.8 bits (852), Expect = 1.5e-87
Identity = 201/310 (64.84%), Postives = 226/310 (72.90%), Query Frame = 0
Query: 1 MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
M+N PRFGR RQR V PPV AAAQPA EP+ + LP A T PAPASP
Sbjct: 1 MANLPRFGRARQRLPAVPPPVPAAAQPAVEPRYQILPFATTITT----------PAPASP 60
Query: 61 LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
SPR L SP KKA SPFASP+YG TR+ P A+ T SPP S KY ERR G+T+
Sbjct: 61 RRESPRPLSSPPKKATSPFASPKYGDSRTRLDRSPAAKATMSPPDSVRDKYFERRNGETT 120
Query: 121 PPHSPAKSRRT-TPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
PP SPAKSRR TPP SPLALPR QV NGTT QPR+QPEVE K IVYNKTVE+P+KS+
Sbjct: 121 PPLSPAKSRRAKTPPLSPLALPRNQVFTGNGTTAQPRVQPEVETKGIVYNKTVEKPSKSN 180
Query: 181 RPS-EYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
R S EYGS K Q++Q EVI L GHNVGAVME+N+SS G+RLGGET KK ETE G
Sbjct: 181 RSSGEYGSSKSHQKKQKPEVIKLKGHNVGAVMEINKSSTGYRLGGETLKKNETEDVGDVH 240
Query: 241 VRHGNDQKKTEA-KKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDGA 300
+G++ KKTE KKE P+TAFMNSNFQSVNNS+L+DSSCNHRDPGLHLAF DA DGDGA
Sbjct: 241 -GYGHEDKKTETKKKEPPITAFMNSNFQSVNNSLLFDSSCNHRDPGLHLAFPDAVDGDGA 299
Query: 301 TVDGRKNYKP 307
VDG+K+YKP
Sbjct: 301 IVDGQKSYKP 299
BLAST of Tan0005219 vs. ExPASy TrEMBL
Match:
A0A6J1GDX3 (wiskott-Aldrich syndrome protein family member 2-like OS=Cucurbita moschata OX=3662 GN=LOC111453294 PE=4 SV=1)
HSP 1 Score: 324.7 bits (831), Expect = 4.1e-85
Identity = 179/255 (70.20%), Postives = 202/255 (79.22%), Query Frame = 0
Query: 55 PAPASPLIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPAR-VTRSPPISPAKKYPE 114
P PASPL SP LPSP KK MSP ASP+YG TRVPSPPP + SPP+SPAKKYP+
Sbjct: 27 PEPASPLKSSPHWLPSPAKKPMSPIASPKYGSSVTRVPSPPPPKDELLSPPVSPAKKYPD 86
Query: 115 RRIGQTSPPHSPAKSRRTT--PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKT 174
R I QTSP SP++S RT+ PPP PLALP TQ TAVNGTTTQP+IQPE+EQK+IVYNKT
Sbjct: 87 RWIAQTSPLQSPSRSLRTSPPPPPPPLALPPTQFTAVNGTTTQPKIQPEIEQKSIVYNKT 146
Query: 175 VERPAKSDRPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKET 234
VE+PAK DR SEYGSGKP ++++ AE INL GHNVGAVME+++SS GHRLGGET +
Sbjct: 147 VEKPAKFDRASEYGSGKPHEKQKAAEAINLAGHNVGAVMEIDKSSVGHRLGGETVRG--- 206
Query: 235 EGGGGCGVRHGNDQKKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDA 294
VR GN++KK + KKELPMTAF NSNFQSVNNSVLY SSCNHRDPGLHL F+DA
Sbjct: 207 ------SVRDGNEEKKKKEKKELPMTAFTNSNFQSVNNSVLYGSSCNHRDPGLHLKFADA 266
Query: 295 ADGDGATVDGRKNYK 306
ADGDGA DGRKNYK
Sbjct: 267 ADGDGAAADGRKNYK 272
BLAST of Tan0005219 vs. TAIR 10
Match:
AT2G46630.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 110095 Blast hits to 59224 proteins in 2216 species: Archae - 177; Bacteria - 15429; Metazoa - 38345; Fungi - 18843; Plants - 13341; Viruses - 3084; Other Eukaryotes - 20876 (source: NCBI BLink). )
HSP 1 Score: 65.5 bits (158), Expect = 8.7e-11
Identity = 99/349 (28.37%), Postives = 139/349 (39.83%), Query Frame = 0
Query: 13 RPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASPLIVSPRQ---LP 72
RP P QP + P+ + P +P Q QPL P P SP P++
Sbjct: 40 RPPAKQPSPPRQRQPRSPPRQQD-PPSPPRQQQQPLTPPRQKAPPTSP----PQERSPYH 99
Query: 73 SPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTSPPHSPAKSR 132
SP + MSP P+ P PPP R + + P SP + + + P SPA S
Sbjct: 100 SPPSRHMSPPTPPK-----AATPPPPPPRSSYTSPPSPKEVQEALPPRKPNSPPSPAHSS 159
Query: 133 RTT-------------------PPP---SPLALPRT------QVTAVNGTTTQPRIQP-E 192
R+T P P SP +LP + + T N T + Q E
Sbjct: 160 RSTTSESVKTRSPSESENHRKAPSPRVLSPYSLPASLLHSERETTQKNILTAEKTSQTHE 219
Query: 193 VEQKNIVYNKTVERPAKSDRPSEYGSG------------KPPQREQAE------VINLTG 252
N +N + ++ Y + P +E VI + G
Sbjct: 220 TNHHNQNHNHDYNQNHNYNQNHSYNQNQNHQGNNPKKMHRQPSSSDSENIMSTRVITIAG 279
Query: 253 HNVGAVMEVNQSSPGHRLGGE-TTKKKETEGGGGCGVR----------HGNDQKKT---- 295
N GAVME+ +S G++ GG T + + G G G R G +KKT
Sbjct: 280 ENKGAVMEILRSPQGNKTGGSGTHSSRVSHGTGEKGRRLQSSSSSSSDEGEGKKKTTKNV 339
BLAST of Tan0005219 vs. TAIR 10
Match:
AT1G75260.1 (oxidoreductases, acting on NADH or NADPH )
HSP 1 Score: 48.9 bits (115), Expect = 8.4e-06
Identity = 34/107 (31.78%), Postives = 50/107 (46.73%), Query Frame = 0
Query: 193 EQAEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGVRHGNDQ-------- 252
+ V LTG N GA M + G KK+ E G R D+
Sbjct: 341 KSVSVYTLTGENKGATMGI----------GSEKDKKDGEVHIRRGYRSNPDESSNTTATE 400
Query: 253 ----KKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAF 288
K EA++E TA++N N Q +NNS++ +SS + DPG+H++F
Sbjct: 401 TENPKDDEAEEEASFTAYINGNTQGINNSIVVESSVSENDPGVHMSF 437
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_038879417.1 | 1.5e-97 | 66.34 | proline-rich receptor-like protein kinase PERK8 [Benincasa hispida] | [more] |
KAG6597060.1 | 4.2e-92 | 63.87 | hypothetical protein SDJN03_10240, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022951357.1 | 8.0e-91 | 63.49 | sulfated surface glycoprotein 185-like [Cucurbita moschata] | [more] |
XP_022974816.1 | 4.4e-89 | 63.14 | wiskott-Aldrich syndrome protein family member 2-like [Cucurbita maxima] >XP_022... | [more] |
KAG6597059.1 | 9.7e-89 | 61.54 | hypothetical protein SDJN03_10239, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1GHE5 | 3.8e-91 | 63.49 | sulfated surface glycoprotein 185-like OS=Cucurbita moschata OX=3662 GN=LOC11145... | [more] |
A0A6J1ICG8 | 2.1e-89 | 63.14 | wiskott-Aldrich syndrome protein family member 2-like OS=Cucurbita maxima OX=366... | [more] |
A0A5A7TZ24 | 1.5e-87 | 64.84 | Zyxin-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00760 PE=... | [more] |
A0A1S3AV38 | 1.5e-87 | 64.84 | zyxin-like OS=Cucumis melo OX=3656 GN=LOC103483158 PE=4 SV=1 | [more] |
A0A6J1GDX3 | 4.1e-85 | 70.20 | wiskott-Aldrich syndrome protein family member 2-like OS=Cucurbita moschata OX=3... | [more] |
Match Name | E-value | Identity | Description | |
AT2G46630.1 | 8.7e-11 | 28.37 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP... | [more] |
AT1G75260.1 | 8.4e-06 | 31.78 | oxidoreductases, acting on NADH or NADPH | [more] |