Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
TCAGATTCCTTCTTCAAAATCAAAACAGCCCTAAATTCGCCACTGATCGCCAACGCCGAGCATGGAGGACAATTTTAAAGTTCGTGTGGACAGAATCTTTGGTTCTCTGTCTTCATCCTCTACTTCTTCTACGGCAGGTTTCAGTTCACCTCTGAATTCTCTGTGGTGCCTTACGGACGATGAAATCGAGAGGAGAGAGTGGATCAAGAGGAAGGAGGACCAACCGGAACCGGAGCTCGAGCCGACTTCGTTCTTTCCCGGCCGAAGGAAAGTGAACGAGAGGAATTCATTTGGATTTGGAGATGATTTTGAGGATGACCTCGATGATTTGGACGAAAATCCAGAGTCGAATGGGTCGTCTGGTAAGTTTCTTAAGCCTGACGATTATGGAGATGAAGAATGGGAAATCAAGTCCTCGATTGGCCGGGACTGCACCCTTGATTACGAGGTACTTCTCTCTCTGGCTCTTTGTCGATGTTCGTTAAGAACTATTCTGATTAGTCTATGCTTAGTTTTTGCTGCTTGACGGGTAGGGTAAGTTGAACTATCGTGAAATATGTTGGTAGATTATAGCAGCTTCGAGTGCAAACTTGAGGATACAACAGACAAACAAAATGAGAATTCGATGTTTTGATTTTATGCCTTGTCCATAGTGTTTAAATCGAGCAAATGCTGTCAGTACAGGAGTTCTTAAAACCTAATTCAATGTCGTGTGTTTGGGAAGCTTGTTTAACGTGTGGAATATGGAGTTGATTTAGGTTTTCTATCCTCTATTTGCTTAGTAGCTTGCCTCCTGAAATCTACAGCTTGAATTGATGAGTAACTGAAAATTTTATGCAGGAAGAGGAAGATGAGTATGATAAAGTTGCTGTTGGCAAGGAGAAAGGTGGAGATCGGTTGTATATGAAGGACATAACTGACTGTGGTGTTGAAATAGATTCGTGCACCGAACTTCCTACTTCGATACGAAATTTCACCAGAGATCCACGAGCTAATCATTTGGCTGCAAAAGTTAGGCTCAAGGAAGATGCAGAAGCTTCTAAAAGAATGGATTCTTTGGATGTCTCTGAGAACGGTACAGTAGTCAACACTGATTCTGAATGCAATACTTCCCAGAACCCAAAATCTATTCTGAAGAGGAAAGATAATCATCTAGATGCAAAGTCGCACAAACGTGTCCGATTTGACTCTGAATGTAAAATTTCTCAAGAGTCTCAAGGATCAAAAGATATGTATATGGAATCAAGCTCTTCGCTTGAAGCTGCAGAAGGAACCAATCAAGCGACCCTCCAATCTCAAGGATATCCTCCTCAAGTTCCGGACTACTTACGAAATCCATCAAGATACACACATTACACTTTTGACTCATCCAGTGAGGTGGATGAAGAGTCCAACAAGAAAGCCTATATGGAATTCCTTCAGTTGGTTAGAGGGTCAAAGACAATGGGGTTGTCGCATCAGGATGATGCTTTAACTGGTCCCCCAAAATCAATCACCTTCACTCCAAAAAAGAAAGCAGGTGATACAATAATGCTTGAAAATTCTGCCCTAGGGGAGCAAAATCATCAGAATGGATTTGGCAAGGAGCTTGTGCACCAGAAAAGCATGCCCATCGGCATTGCAGCTGTGGACGATGATGTTTGTTCAATGGAGGAAGATGAGCCTGATAAATTAGAAATTAAAGGAAACAGCTCACAGAAGCCAGGCCGCCAGTACAGGATGAGAGCTAAGACGGAGGAGGAGGAGGAGGCCTGATTGATACCAAGGCACTGCTCTATCTTTGTTCTTCCTTTCTCATGATCTTTTTTTTTTTTTTTAAAACTTGTCTTTAACGTACCCTGTAACAACAACAATTATTGACAACCCTATGTAGGATCAGGCGAGTTATCGAGTTACTAACTGCAAAATTTCCAGTTGGCACAATGGGCTTCCAACCATAGCCATTGTCAATAACTGAGCTTCGAGCTCCAACCTCGGTGGGCAAGCGCCATGTATGAATA
CGATAACAGGCAATGGACAATAACTCTATTTCCATAGTAATTCAATTAGCTAGTTGTGTAATTGGTTTGATAATGCATGTTTCTGGAATTAGGGATATGGTTGAAATTGAGTGCTTGAGGAAGCAATCTCTACTTGAGCCATCGACTTTAAAAGGAATAACTTAGAATTTTTAGTTTATTAGCTTGCCTTCAAAGGTTCAGAATTTTACGGCAAATTCAAATGTCTCTACTAAGGGTGCAGGCGACTCGGTTCGACTCCGATACCCAACCACATCGACGTTTAGAGGATGTTTGGGGCGTTGGGCGGTTATAATATTATGTGGGTTATTATAATCTGTAAAATCATATAATATTATTTAAAAGGCAGAGTAGTATAGTTTGGGATTATAATAGTTTGTGTTTGGGGTGCAGAATATTTCACAGGTTATAGTAAAAATAATATTATTTAAACTACAGAGTACGATAGTCTGAGGTTATAATAGTCTGTGTTTGGGGTGCAGAGTATTTCACAGGTTATTATAACACCTGTTATAATAACCAG
mRNA sequence
TCAGATTCCTTCTTCAAAATCAAAACAGCCCTAAATTCGCCACTGATCGCCAACGCCGAGCATGGAGGACAATTTTAAAGTTCGTGTGGACAGAATCTTTGGTTCTCTGTCTTCATCCTCTACTTCTTCTACGGCAGGTTTCAGTTCACCTCTGAATTCTCTGTGGTGCCTTACGGACGATGAAATCGAGAGGAGAGAGTGGATCAAGAGGAAGGAGGACCAACCGGAACCGGAGCTCGAGCCGACTTCGTTCTTTCCCGGCCGAAGGAAAGTGAACGAGAGGAATTCATTTGGATTTGGAGATGATTTTGAGGATGACCTCGATGATTTGGACGAAAATCCAGAGTCGAATGGGTCGTCTGGTAAGTTTCTTAAGCCTGACGATTATGGAGATGAAGAATGGGAAATCAAGTCCTCGATTGGCCGGGACTGCACCCTTGATTACGAGGAAGAGGAAGATGAGTATGATAAAGTTGCTGTTGGCAAGGAGAAAGGTGGAGATCGGTTGTATATGAAGGACATAACTGACTGTGGTGTTGAAATAGATTCGTGCACCGAACTTCCTACTTCGATACGAAATTTCACCAGAGATCCACGAGCTAATCATTTGGCTGCAAAAGTTAGGCTCAAGGAAGATGCAGAAGCTTCTAAAAGAATGGATTCTTTGGATGTCTCTGAGAACGGTACAGTAGTCAACACTGATTCTGAATGCAATACTTCCCAGAACCCAAAATCTATTCTGAAGAGGAAAGATAATCATCTAGATGCAAAGTCGCACAAACGTGTCCGATTTGACTCTGAATGTAAAATTTCTCAAGAGTCTCAAGGATCAAAAGATATGTATATGGAATCAAGCTCTTCGCTTGAAGCTGCAGAAGGAACCAATCAAGCGACCCTCCAATCTCAAGGATATCCTCCTCAAGTTCCGGACTACTTACGAAATCCATCAAGATACACACATTACACTTTTGACTCATCCAGTGAGGTGGATGAAGAGTCCAACAAGAAAGCCTATATGGAATTCCTTCAGTTGGTTAGAGGGTCAAAGACAATGGGGTTGTCGCATCAGGATGATGCTTTAACTGGTCCCCCAAAATCAATCACCTTCACTCCAAAAAAGAAAGCAGGTGATACAATAATGCTTGAAAATTCTGCCCTAGGGGAGCAAAATCATCAGAATGGATTTGGCAAGGAGCTTGTGCACCAGAAAAGCATGCCCATCGGCATTGCAGCTGTGGACGATGATGTTTGTTCAATGGAGGAAGATGAGCCTGATAAATTAGAAATTAAAGGAAACAGCTCACAGAAGCCAGGCCGCCAGTACAGGATGAGAGCTAAGACGGAGGAGGAGGAGGAGGCCTGATTGATACCAAGGCACTGCTCTATCTTTGTTCTTCCTTTCTCATGATCTTTTTTTTTTTTTTTAAAACTTGTCTTTAACGTACCCTGTAACAACAACAATTATTGACAACCCTATGTAGGATCAGGCGAGTTATCGAGTTACTAACTGCAAAATTTCCAGTTGGCACAATGGGCTTCCAACCATAGCCATTGTCAATAACTGAGCTTCGAGCTCCAACCTCGGTGGGCAAGCGCCATGTATGAATACGATAACAGGCAATGGACAATAACTCTATTTCCATAGTAATTCAATTAGCTAGTTGTGTAATTGGTTTGATAATGCATGTTTCTGGAATTAGGGATATGGTTGAAATTGAGTGCTTGAGGAAGCAATCTCTACTTGAGCCATCGACTTTAAAAGGAATAACTTAGAATTTTTAGTTTATTAGCTTGCCTTCAAAGGTTCAGAATTTTACGGCAAATTCAAATGTCTCTACTAAGGGTGCAGGCGACTCGGTTCGACTCCGATACCCAACCACATCGACGTTTAGAGGATGTTTGGGGCGTTGGGCGGTTATAATATTATGTGGGTTATTATAATCTGTAAAATCATATAATATTATTTAAAAGGCAGAGTAGTATAGTTTGGGATTATAATA
GTTTGTGTTTGGGGTGCAGAATATTTCACAGGTTATAGTAAAAATAATATTATTTAAACTACAGAGTACGATAGTCTGAGGTTATAATAGTCTGTGTTTGGGGTGCAGAGTATTTCACAGGTTATTATAACACCTGTTATAATAACCAG
Coding sequence (CDS)
ATGGAGGACAATTTTAAAGTTCGTGTGGACAGAATCTTTGGTTCTCTGTCTTCATCCTCTACTTCTTCTACGGCAGGTTTCAGTTCACCTCTGAATTCTCTGTGGTGCCTTACGGACGATGAAATCGAGAGGAGAGAGTGGATCAAGAGGAAGGAGGACCAACCGGAACCGGAGCTCGAGCCGACTTCGTTCTTTCCCGGCCGAAGGAAAGTGAACGAGAGGAATTCATTTGGATTTGGAGATGATTTTGAGGATGACCTCGATGATTTGGACGAAAATCCAGAGTCGAATGGGTCGTCTGGTAAGTTTCTTAAGCCTGACGATTATGGAGATGAAGAATGGGAAATCAAGTCCTCGATTGGCCGGGACTGCACCCTTGATTACGAGGAAGAGGAAGATGAGTATGATAAAGTTGCTGTTGGCAAGGAGAAAGGTGGAGATCGGTTGTATATGAAGGACATAACTGACTGTGGTGTTGAAATAGATTCGTGCACCGAACTTCCTACTTCGATACGAAATTTCACCAGAGATCCACGAGCTAATCATTTGGCTGCAAAAGTTAGGCTCAAGGAAGATGCAGAAGCTTCTAAAAGAATGGATTCTTTGGATGTCTCTGAGAACGGTACAGTAGTCAACACTGATTCTGAATGCAATACTTCCCAGAACCCAAAATCTATTCTGAAGAGGAAAGATAATCATCTAGATGCAAAGTCGCACAAACGTGTCCGATTTGACTCTGAATGTAAAATTTCTCAAGAGTCTCAAGGATCAAAAGATATGTATATGGAATCAAGCTCTTCGCTTGAAGCTGCAGAAGGAACCAATCAAGCGACCCTCCAATCTCAAGGATATCCTCCTCAAGTTCCGGACTACTTACGAAATCCATCAAGATACACACATTACACTTTTGACTCATCCAGTGAGGTGGATGAAGAGTCCAACAAGAAAGCCTATATGGAATTCCTTCAGTTGGTTAGAGGGTCAAAGACAATGGGGTTGTCGCATCAGGATGATGCTTTAACTGGTCCCCCAAAATCAATCACCTTCACTCCAAAAAAGAAAGCAGGTGATACAATAATGCTTGAAAATTCTGCCCTAGGGGAGCAAAATCATCAGAATGGATTTGGCAAGGAGCTTGTGCACCAGAAAAGCATGCCCATCGGCATTGCAGCTGTGGACGATGATGTTTGTTCAATGGAGGAAGATGAGCCTGATAAATTAGAAATTAAAGGAAACAGCTCACAGAAGCCAGGCCGCCAGTACAGGATGAGAGCTAAGACGGAGGAGGAGGAGGAGGCCTGA
Protein sequence
MEDNFKVRVDRIFGSLSSSSTSSTAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEPELEPTSFFPGRRKVNERNSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEEWEIKSSIGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTRDPRANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDAKSHKRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPSRYTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAGDTIMLENSALGEQNHQNGFGKELVHQKSMPIGIAAVDDDVCSMEEDEPDKLEIKGNSSQKPGRQYRMRAKTEEEEEA
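The protein sequence above is the conceptual translation of the coding sequence (CDS). As a minimal sketch of that relationship, assuming the standard nuclear genetic code applies to this feature, the following Python snippet translates a CDS string codon by codon and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGAGGACAATTTTAAAGTTCGTGTGGAC"))  # MEDNFKVRVD
```

Applied to the full CDS, the same function should reproduce the protein sequence shown above, which ends at the terminal TGA stop codon.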
Homology
BLAST of Tan0011843 vs. NCBI nr
Match:
XP_022989648.1 (uncharacterized protein LOC111486668 [Cucurbita maxima])
HSP 1 Score: 677.9 bits (1748), Expect = 5.6e-191
Identity = 355/437 (81.24%), Positives = 379/437 (86.73%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTS---STAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEP 60
MEDNFKVRVDRIFGSLSSSS+S STA F+SPL+SLWCLTDDE+ERREWIK K DQPEP
Sbjct: 1 MEDNFKVRVDRIFGSLSSSSSSSSYSTAAFNSPLSSLWCLTDDEVERREWIKGKVDQPEP 60
Query: 61 ELEPTSFFPGRRKVNERNSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEEWEIK 120
ELEPTSF G RKVN RNSF F DDFEDD+ DLDENPESNGSS F KPDDYGDEEWEIK
Sbjct: 61 ELEPTSFLAGERKVNGRNSFRFRDDFEDDVGDLDENPESNGSSSNFPKPDDYGDEEWEIK 120
Query: 121 SSIGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTRD 180
S IGRDCTLD+EEEEDEYDKVAVGKEK GDRLYMKDITDCGVEIDSCTELPTSIRNFTRD
Sbjct: 121 SLIGRDCTLDFEEEEDEYDKVAVGKEKVGDRLYMKDITDCGVEIDSCTELPTSIRNFTRD 180
Query: 181 PRANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDAK 240
PRANHLAAKVRLKEDAEASK MDSL VSEN V SECN SQNPKSILKRKDNHLDAK
Sbjct: 181 PRANHLAAKVRLKEDAEASKTMDSLHVSENDAVAIAVSECNVSQNPKSILKRKDNHLDAK 240
Query: 241 SHKRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPSR 300
SHKRVRFD EC+IS+ESQGS+D+ ME++SS+ AAE TN+AT QSQG+ Q PDYLRNPSR
Sbjct: 241 SHKRVRFDPECEISRESQGSEDISMEANSSIGAAEVTNEATFQSQGFCRQAPDYLRNPSR 300
Query: 301 YTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAGD 360
YTHYTFDSS+EVDEESNK AYM+FLQLVRGSKT+ H DD+ TGPPKSITF PKKKAGD
Sbjct: 301 YTHYTFDSSNEVDEESNKNAYMDFLQLVRGSKTIEPLHLDDSSTGPPKSITFIPKKKAGD 360
Query: 361 TIMLENSALGEQNHQNGFGKELVHQKSMPIGIAAVD---DDVCSMEEDEPDKLEIKGNSS 420
T+MLEN GEQNHQNG GKELVHQK MPI IA+VD DDVCSMEEDEP+KLE + N S
Sbjct: 361 TVMLENPTPGEQNHQNGVGKELVHQKGMPISIASVDAQTDDVCSMEEDEPEKLETRRNIS 420
Query: 421 QKPGRQYRMRAKTEEEE 432
QK RQYRMRAK + EE
Sbjct: 421 QKTARQYRMRAKMDSEE 437
BLAST of Tan0011843 vs. NCBI nr
Match:
KAG6588861.1 (hypothetical protein SDJN03_17426, partial [Cucurbita argyrosperma subsp. sororia] >KAG7022621.1 hypothetical protein SDJN02_16355, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 677.6 bits (1747), Expect = 7.3e-191
Identity = 356/439 (81.09%), Positives = 380/439 (86.56%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTS-----STAGFSSPLNSLWCLTDDEIERREWIKRKEDQP 60
MEDNFKVRVDRIFGSLSSSS+S STA F+SPL+SLWCLTDDE+ERREWIK K DQP
Sbjct: 1 MEDNFKVRVDRIFGSLSSSSSSSSSSYSTAAFNSPLSSLWCLTDDEVERREWIKGKVDQP 60
Query: 61 EPELEPTSFFPGRRKVNERNSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEEWE 120
EPELEPTSF G RKV+ RNSF F DDFEDD+ DLDENPESNGSS F KPDDYGDEEWE
Sbjct: 61 EPELEPTSFLAGERKVSGRNSFRFRDDFEDDVGDLDENPESNGSSSNFPKPDDYGDEEWE 120
Query: 121 IKSSIGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFT 180
IKS IGRDCTLD+EEEEDEYDKVAVGKEK GDRLYMKDITDCGVEIDSCTELPTSIRNFT
Sbjct: 121 IKSLIGRDCTLDFEEEEDEYDKVAVGKEKVGDRLYMKDITDCGVEIDSCTELPTSIRNFT 180
Query: 181 RDPRANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLD 240
RDPRANHLAAKVRLKEDAEASK MDSL VSENG V SECN SQNPKSILKRKDNHLD
Sbjct: 181 RDPRANHLAAKVRLKEDAEASKTMDSLHVSENGAVAIAVSECNASQNPKSILKRKDNHLD 240
Query: 241 AKSHKRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNP 300
AKSHKRVRFD EC+IS+ESQGS+D+ ME++SS+ AAE TN+AT SQG+ PQ PDYLRNP
Sbjct: 241 AKSHKRVRFDPECEISRESQGSEDISMEANSSIGAAEVTNEATFHSQGFCPQAPDYLRNP 300
Query: 301 SRYTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKA 360
SRYTHYTFDSS+EVDEESNK AYM+FLQLVRGSKT+ H DD TGPPKSITF PKKKA
Sbjct: 301 SRYTHYTFDSSNEVDEESNKNAYMDFLQLVRGSKTIEPLHLDDTSTGPPKSITFIPKKKA 360
Query: 361 GDTIMLENSALGEQNHQNGFGKELVHQKSMPIGIAAVD---DDVCSMEEDEPDKLEIKGN 420
GDT+MLEN GEQNHQNG GKEL HQK MPIGIA+VD DDVCSMEEDEP+KLE + N
Sbjct: 361 GDTVMLENPTPGEQNHQNGVGKEL-HQKGMPIGIASVDAQTDDVCSMEEDEPEKLETRRN 420
Query: 421 SSQKPGRQYRMRAKTEEEE 432
SSQK RQYRMRAK + EE
Sbjct: 421 SSQKTARQYRMRAKMDSEE 438
BLAST of Tan0011843 vs. NCBI nr
Match:
XP_022928536.1 (uncharacterized protein LOC111435316 [Cucurbita moschata])
HSP 1 Score: 676.8 bits (1745), Expect = 1.2e-190
Identity = 355/438 (81.05%), Positives = 380/438 (86.76%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTS----STAGFSSPLNSLWCLTDDEIERREWIKRKEDQPE 60
MEDNFKVRVDRIFGSLSSSS+S STA F+SPL+SLWCLTDDE+ERREWIK K DQPE
Sbjct: 1 MEDNFKVRVDRIFGSLSSSSSSSSSYSTAAFNSPLSSLWCLTDDEVERREWIKGKVDQPE 60
Query: 61 PELEPTSFFPGRRKVNERNSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEEWEI 120
PELEPTSF G RKV+ RNSF F DDFEDD+ DLDENPESNGSS F KPDDYGDEEWEI
Sbjct: 61 PELEPTSFLAGERKVSGRNSFRFRDDFEDDVGDLDENPESNGSSSNFPKPDDYGDEEWEI 120
Query: 121 KSSIGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTR 180
KS IGRDCTLD+EEEEDEYDKVAVGKEK GDRLYMKDITDCGVEIDSCTELPTSI+NFTR
Sbjct: 121 KSLIGRDCTLDFEEEEDEYDKVAVGKEKVGDRLYMKDITDCGVEIDSCTELPTSIQNFTR 180
Query: 181 DPRANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDA 240
DPRANHLAAKVRLKEDAEASK MDSL VSENG V SECN SQNPKSILKRKDNHLDA
Sbjct: 181 DPRANHLAAKVRLKEDAEASKTMDSLHVSENGAVAIAVSECNASQNPKSILKRKDNHLDA 240
Query: 241 KSHKRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPS 300
KSHKRVRFD EC+IS+ESQGS+D+ ME++SS+ AAE TN+AT SQG+ PQ PDYLRNPS
Sbjct: 241 KSHKRVRFDPECEISRESQGSEDISMEANSSIGAAEVTNEATFHSQGFRPQAPDYLRNPS 300
Query: 301 RYTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAG 360
RYTHYTFDSS+EVDEESNK AYM+FLQLVRGSKT+ H DD TGPPKSITF PKKKAG
Sbjct: 301 RYTHYTFDSSNEVDEESNKNAYMDFLQLVRGSKTIEPLHLDDTSTGPPKSITFIPKKKAG 360
Query: 361 DTIMLENSALGEQNHQNGFGKELVHQKSMPIGIAAVD---DDVCSMEEDEPDKLEIKGNS 420
DT+MLEN GEQNHQNG GKEL HQK MPIGIA+VD DDVCSMEEDEP+KLE + NS
Sbjct: 361 DTVMLENPTPGEQNHQNGVGKEL-HQKGMPIGIASVDAQTDDVCSMEEDEPEKLETRRNS 420
Query: 421 SQKPGRQYRMRAKTEEEE 432
SQK RQYRMRAK + EE
Sbjct: 421 SQKTARQYRMRAKMDSEE 437
BLAST of Tan0011843 vs. NCBI nr
Match:
XP_023531936.1 (uncharacterized protein LOC111794053 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 676.4 bits (1744), Expect = 1.6e-190
Identity = 354/437 (81.01%), Positives = 378/437 (86.50%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTS---STAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEP 60
MEDNFKVRVDRIFGSLSSSS+S STA F+SPL+SLWCLTDDE+ERREWIK K DQPEP
Sbjct: 1 MEDNFKVRVDRIFGSLSSSSSSSSYSTAAFNSPLSSLWCLTDDEVERREWIKGKVDQPEP 60
Query: 61 ELEPTSFFPGRRKVNERNSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEEWEIK 120
ELEP SF G RKVN RNSF F DDFEDD+ DLDE+PESNGSS F KPDDYGDEEWEIK
Sbjct: 61 ELEPNSFLAGERKVNGRNSFRFRDDFEDDVGDLDEDPESNGSSSNFPKPDDYGDEEWEIK 120
Query: 121 SSIGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTRD 180
S IGRDCTLD+EEEEDEYDKVAVGKEK GDRLYMKDITDCGVEIDSCTELPTSIRNFTRD
Sbjct: 121 SLIGRDCTLDFEEEEDEYDKVAVGKEKVGDRLYMKDITDCGVEIDSCTELPTSIRNFTRD 180
Query: 181 PRANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDAK 240
PRANHLAAKVRLKEDAEASK MDSL VSENG V SECN S NPKSILKRKDN LDAK
Sbjct: 181 PRANHLAAKVRLKEDAEASKTMDSLHVSENGAVAIAVSECNASHNPKSILKRKDNPLDAK 240
Query: 241 SHKRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPSR 300
SHKRVRFD EC+IS+E QGS+D+ ME++SS+ AAE T +AT QSQG+ PQ PDYLRNPSR
Sbjct: 241 SHKRVRFDPECEISREPQGSEDISMEANSSIGAAEVTTEATFQSQGFRPQAPDYLRNPSR 300
Query: 301 YTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAGD 360
YTHYTFDSS+EVDEESNK AYM+FLQLVRGSKT+ SH DD TGPPKSITF PKKKAGD
Sbjct: 301 YTHYTFDSSNEVDEESNKNAYMDFLQLVRGSKTIEPSHLDDFSTGPPKSITFIPKKKAGD 360
Query: 361 TIMLENSALGEQNHQNGFGKELVHQKSMPIGIAAVD---DDVCSMEEDEPDKLEIKGNSS 420
T+MLEN GEQNHQNG GKELVHQK MPIGIA+VD DDVCSMEEDEP+KLE + NSS
Sbjct: 361 TVMLENPTPGEQNHQNGVGKELVHQKGMPIGIASVDAQTDDVCSMEEDEPEKLETRRNSS 420
Query: 421 QKPGRQYRMRAKTEEEE 432
QK RQYRMRAK + EE
Sbjct: 421 QKTARQYRMRAKMDSEE 437
BLAST of Tan0011843 vs. NCBI nr
Match:
KAG7017870.1 (hypothetical protein SDJN02_19736 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 655.2 bits (1689), Expect = 3.9e-184
Identity = 351/436 (80.50%), Positives = 376/436 (86.24%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTSSTAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEPELE 60
M+D+FKVRVDRIFGSLSS+STSS A FSS SLWCLTDDEIERREWIK KE+Q EPELE
Sbjct: 1 MDDSFKVRVDRIFGSLSSTSTSSPAAFSSTARSLWCLTDDEIERREWIKEKEEQLEPELE 60
Query: 61 PTSFFPGRRKVNERNSFGFGDDFEDD-LDDLDENPESNGSSGKFLKPDDYGDEEWEIKSS 120
PTSFF G RKV+ R+SFGF DDFEDD LDDLDENP SNGSS KF KPDDYGDEEWEIKSS
Sbjct: 61 PTSFFDGGRKVSGRSSFGFQDDFEDDELDDLDENPVSNGSSSKFPKPDDYGDEEWEIKSS 120
Query: 121 IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTRDPR 180
IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKD+TDCG+EI SCTEL TS+RNFTRDPR
Sbjct: 121 IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDVTDCGIEIGSCTELSTSVRNFTRDPR 180
Query: 181 ANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDAKSH 240
ANHLAAKVRLKEDAEASK +DSL VSEN TV DSECNTSQNPKSILKRKDN+LD KSH
Sbjct: 181 ANHLAAKVRLKEDAEASKTIDSLHVSENSTVAIADSECNTSQNPKSILKRKDNYLDGKSH 240
Query: 241 KRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPSRYT 300
KRVRFD ECK+SQESQG KD ME++S AE N+A SQ QVPDYLRNPSRYT
Sbjct: 241 KRVRFDPECKVSQESQGFKDFAMEANSLPGVAEVGNEANFTSQA--TQVPDYLRNPSRYT 300
Query: 301 HYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAGDTI 360
HYTFDSS+EVDEESNKKAYM+FLQ+VR SKTM SHQDDA TGPPKSITF PKKKAGDTI
Sbjct: 301 HYTFDSSNEVDEESNKKAYMDFLQMVRRSKTME-SHQDDASTGPPKSITFIPKKKAGDTI 360
Query: 361 MLENSALGEQ-NHQNGFGKELVHQKSMPIGIAAV---DDDVCSMEEDEPDKLEIKGNSSQ 420
MLE+S+L EQ +HQNG GKELVHQK MPIGIAAV +DDVCSMEEDEPD+L+I+ NS Q
Sbjct: 361 MLESSSLEEQSHHQNGVGKELVHQKGMPIGIAAVNTQNDDVCSMEEDEPDRLDIRKNSLQ 420
Query: 421 KPGRQYRMRAKTEEEE 432
K GRQYR RA EEE
Sbjct: 421 KSGRQYRTRALESEEE 433
BLAST of Tan0011843 vs. ExPASy TrEMBL
Match:
A0A6J1JQY5 (Protein TSSC4 OS=Cucurbita maxima OX=3661 GN=LOC111486668 PE=3 SV=1)
HSP 1 Score: 677.9 bits (1748), Expect = 2.7e-191
Identity = 355/437 (81.24%), Positives = 379/437 (86.73%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTS---STAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEP 60
MEDNFKVRVDRIFGSLSSSS+S STA F+SPL+SLWCLTDDE+ERREWIK K DQPEP
Sbjct: 1 MEDNFKVRVDRIFGSLSSSSSSSSYSTAAFNSPLSSLWCLTDDEVERREWIKGKVDQPEP 60
Query: 61 ELEPTSFFPGRRKVNERNSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEEWEIK 120
ELEPTSF G RKVN RNSF F DDFEDD+ DLDENPESNGSS F KPDDYGDEEWEIK
Sbjct: 61 ELEPTSFLAGERKVNGRNSFRFRDDFEDDVGDLDENPESNGSSSNFPKPDDYGDEEWEIK 120
Query: 121 SSIGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTRD 180
S IGRDCTLD+EEEEDEYDKVAVGKEK GDRLYMKDITDCGVEIDSCTELPTSIRNFTRD
Sbjct: 121 SLIGRDCTLDFEEEEDEYDKVAVGKEKVGDRLYMKDITDCGVEIDSCTELPTSIRNFTRD 180
Query: 181 PRANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDAK 240
PRANHLAAKVRLKEDAEASK MDSL VSEN V SECN SQNPKSILKRKDNHLDAK
Sbjct: 181 PRANHLAAKVRLKEDAEASKTMDSLHVSENDAVAIAVSECNVSQNPKSILKRKDNHLDAK 240
Query: 241 SHKRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPSR 300
SHKRVRFD EC+IS+ESQGS+D+ ME++SS+ AAE TN+AT QSQG+ Q PDYLRNPSR
Sbjct: 241 SHKRVRFDPECEISRESQGSEDISMEANSSIGAAEVTNEATFQSQGFCRQAPDYLRNPSR 300
Query: 301 YTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAGD 360
YTHYTFDSS+EVDEESNK AYM+FLQLVRGSKT+ H DD+ TGPPKSITF PKKKAGD
Sbjct: 301 YTHYTFDSSNEVDEESNKNAYMDFLQLVRGSKTIEPLHLDDSSTGPPKSITFIPKKKAGD 360
Query: 361 TIMLENSALGEQNHQNGFGKELVHQKSMPIGIAAVD---DDVCSMEEDEPDKLEIKGNSS 420
T+MLEN GEQNHQNG GKELVHQK MPI IA+VD DDVCSMEEDEP+KLE + N S
Sbjct: 361 TVMLENPTPGEQNHQNGVGKELVHQKGMPISIASVDAQTDDVCSMEEDEPEKLETRRNIS 420
Query: 421 QKPGRQYRMRAKTEEEE 432
QK RQYRMRAK + EE
Sbjct: 421 QKTARQYRMRAKMDSEE 437
BLAST of Tan0011843 vs. ExPASy TrEMBL
Match:
A0A6J1EKJ9 (Protein TSSC4 OS=Cucurbita moschata OX=3662 GN=LOC111435316 PE=3 SV=1)
HSP 1 Score: 676.8 bits (1745), Expect = 6.0e-191
Identity = 355/438 (81.05%), Positives = 380/438 (86.76%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTS----STAGFSSPLNSLWCLTDDEIERREWIKRKEDQPE 60
MEDNFKVRVDRIFGSLSSSS+S STA F+SPL+SLWCLTDDE+ERREWIK K DQPE
Sbjct: 1 MEDNFKVRVDRIFGSLSSSSSSSSSYSTAAFNSPLSSLWCLTDDEVERREWIKGKVDQPE 60
Query: 61 PELEPTSFFPGRRKVNERNSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEEWEI 120
PELEPTSF G RKV+ RNSF F DDFEDD+ DLDENPESNGSS F KPDDYGDEEWEI
Sbjct: 61 PELEPTSFLAGERKVSGRNSFRFRDDFEDDVGDLDENPESNGSSSNFPKPDDYGDEEWEI 120
Query: 121 KSSIGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTR 180
KS IGRDCTLD+EEEEDEYDKVAVGKEK GDRLYMKDITDCGVEIDSCTELPTSI+NFTR
Sbjct: 121 KSLIGRDCTLDFEEEEDEYDKVAVGKEKVGDRLYMKDITDCGVEIDSCTELPTSIQNFTR 180
Query: 181 DPRANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDA 240
DPRANHLAAKVRLKEDAEASK MDSL VSENG V SECN SQNPKSILKRKDNHLDA
Sbjct: 181 DPRANHLAAKVRLKEDAEASKTMDSLHVSENGAVAIAVSECNASQNPKSILKRKDNHLDA 240
Query: 241 KSHKRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPS 300
KSHKRVRFD EC+IS+ESQGS+D+ ME++SS+ AAE TN+AT SQG+ PQ PDYLRNPS
Sbjct: 241 KSHKRVRFDPECEISRESQGSEDISMEANSSIGAAEVTNEATFHSQGFRPQAPDYLRNPS 300
Query: 301 RYTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAG 360
RYTHYTFDSS+EVDEESNK AYM+FLQLVRGSKT+ H DD TGPPKSITF PKKKAG
Sbjct: 301 RYTHYTFDSSNEVDEESNKNAYMDFLQLVRGSKTIEPLHLDDTSTGPPKSITFIPKKKAG 360
Query: 361 DTIMLENSALGEQNHQNGFGKELVHQKSMPIGIAAVD---DDVCSMEEDEPDKLEIKGNS 420
DT+MLEN GEQNHQNG GKEL HQK MPIGIA+VD DDVCSMEEDEP+KLE + NS
Sbjct: 361 DTVMLENPTPGEQNHQNGVGKEL-HQKGMPIGIASVDAQTDDVCSMEEDEPEKLETRRNS 420
Query: 421 SQKPGRQYRMRAKTEEEE 432
SQK RQYRMRAK + EE
Sbjct: 421 SQKTARQYRMRAKMDSEE 437
BLAST of Tan0011843 vs. ExPASy TrEMBL
Match:
A0A6J1F221 (Protein TSSC4 OS=Cucurbita moschata OX=3662 GN=LOC111441676 PE=3 SV=1)
HSP 1 Score: 644.4 bits (1661), Expect = 3.3e-181
Identity = 346/435 (79.54%), Positives = 373/435 (85.75%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTSSTAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEPELE 60
M+D+FKVRVDRIFGSLSS+STSS A FSS SLWCLTDDEIERREWIK KE+Q EPELE
Sbjct: 1 MDDSFKVRVDRIFGSLSSTSTSSPAAFSSTARSLWCLTDDEIERREWIKEKEEQLEPELE 60
Query: 61 PTSFFPGRRKVNERNSFGFGDDFEDD-LDDLDENPESNGSSGKFLKPDDYGDEEWEIKSS 120
PTSF G RKV R+SFGF DDFEDD LDDLDENP SNGSS KF KPDDYGDEEWEIKSS
Sbjct: 61 PTSFCDGGRKVRGRSSFGFQDDFEDDELDDLDENPGSNGSSSKFPKPDDYGDEEWEIKSS 120
Query: 121 IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTRDPR 180
IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKD TDCG+EI SCT+LPTS++NFTRDPR
Sbjct: 121 IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDATDCGIEIGSCTKLPTSVQNFTRDPR 180
Query: 181 ANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDAKSH 240
ANHLAAKVRL+EDAEASK +DSL +SEN TV DSECNTSQNPKSILKRKDN+LDAKSH
Sbjct: 181 ANHLAAKVRLEEDAEASKTIDSLHMSENSTVAIADSECNTSQNPKSILKRKDNYLDAKSH 240
Query: 241 KRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPSRYT 300
KRVRFD ECK+SQESQG KD ME++S AE N+A SQ QVPDYLRNPSRYT
Sbjct: 241 KRVRFDPECKVSQESQGFKDFAMEANSLPGVAEVGNEANFPSQA--TQVPDYLRNPSRYT 300
Query: 301 HYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAGDTI 360
HYTFDSS+EVDEESNKKAYM+FLQ+VR SKTM SHQDDA TGPPKSITF PKKKAGDTI
Sbjct: 301 HYTFDSSNEVDEESNKKAYMDFLQMVRRSKTME-SHQDDASTGPPKSITFIPKKKAGDTI 360
Query: 361 MLENSALGEQ-NHQNGFGKELVHQKSMPIGIAAV---DDDVCSMEEDEPDKLEIKGNSSQ 420
MLE+S+L E+ +HQNG GKELVHQK MPIGIAAV +DDVCSMEEDE D+LE + NS Q
Sbjct: 361 MLESSSLEERSHHQNGVGKELVHQKGMPIGIAAVNTQNDDVCSMEEDESDRLESRKNSLQ 420
Query: 421 KPGRQYRMRAKTEEE 431
K GRQYRMRA EE
Sbjct: 421 KSGRQYRMRALESEE 432
BLAST of Tan0011843 vs. ExPASy TrEMBL
Match:
A0A6J1IZZ9 (Protein TSSC4 OS=Cucurbita maxima OX=3661 GN=LOC111481475 PE=3 SV=1)
HSP 1 Score: 643.3 bits (1658), Expect = 7.4e-181
Identity = 346/436 (79.36%), Positives = 373/436 (85.55%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTSSTAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEPELE 60
M+D+FKVRVDRIFGSLSS+STSS A F+S + SLWCLTDDEIERREWIK KE+Q EPELE
Sbjct: 1 MDDSFKVRVDRIFGSLSSTSTSSPAAFNSSVRSLWCLTDDEIERREWIKEKEEQLEPELE 60
Query: 61 PTSFFPGRRKVNERNSFGFGDDFEDD-LDDLDENPESNGSSGKFLKPDDYGDEEWEIKSS 120
PTSFF G +KV+ R+SFGF DDFEDD LDDLDENP SNGSS KF KPDDYGDEEWEIKSS
Sbjct: 61 PTSFFDGGKKVSGRSSFGFQDDFEDDELDDLDENPGSNGSSSKFPKPDDYGDEEWEIKSS 120
Query: 121 IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTRDPR 180
IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKD TDCG+EI SCTELPTS+RNFTRDPR
Sbjct: 121 IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDATDCGIEIGSCTELPTSVRNFTRDPR 180
Query: 181 ANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDAKSH 240
ANHLAAKVRLKEDAEASK +DSL VS N TV DSECN SQNPKSILKRKDN+LDAK H
Sbjct: 181 ANHLAAKVRLKEDAEASKTIDSLHVSVNSTVAIADSECNASQNPKSILKRKDNYLDAKLH 240
Query: 241 KRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPSRYT 300
KRVRFD ECK SQ+SQG KD M+++S AE N+A SQ QVPDYLRNPSRYT
Sbjct: 241 KRVRFDPECKASQDSQGFKDFAMQANSLPGVAEVGNEANYPSQA--TQVPDYLRNPSRYT 300
Query: 301 HYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAGDTI 360
HYTFDSS EVDEESNKKAYM+FLQLVR SKTM S+QDD TGPP+SITF PKKKAGDTI
Sbjct: 301 HYTFDSSKEVDEESNKKAYMDFLQLVRRSKTME-SNQDDDSTGPPRSITFIPKKKAGDTI 360
Query: 361 MLENSALGEQ-NHQNGFGKELVHQKSMPIGIAAV---DDDVCSMEEDEPDKLEIKGNSSQ 420
MLE+S+L EQ +HQNG GKELVHQK MPIGIAAV +DDVCSMEEDEPD+LEI+ NS Q
Sbjct: 361 MLESSSLEEQSHHQNGVGKELVHQKGMPIGIAAVYTQNDDVCSMEEDEPDRLEIRKNSLQ 420
Query: 421 KPGRQYRMRAKTEEEE 432
K GRQYRMRA EEE
Sbjct: 421 KSGRQYRMRALESEEE 433
BLAST of Tan0011843 vs. ExPASy TrEMBL
Match:
A0A6J1D3P7 (Protein TSSC4 OS=Momordica charantia OX=3673 GN=LOC111016977 PE=3 SV=1)
HSP 1 Score: 630.2 bits (1624), Expect = 6.5e-177
Identity = 331/433 (76.44%), Positives = 365/433 (84.30%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTSSTAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEPELE 60
MED+FKVRVDRIFGSLSSSST STA +S L+SLW LTDDEIER+EWIK KED PEPE E
Sbjct: 1 MEDSFKVRVDRIFGSLSSSST-STAALNSSLSSLWSLTDDEIERKEWIKEKEDPPEPEPE 60
Query: 61 PTSFFPGRRKVNERNSFGFGDDFEDDLD-DLDENPESNGSSGKFLKPDDYGDEEWEIKSS 120
P+SFF G RK+N RNSFGF D FEDDL+ DLDENPESNGSS KF KPDDYGDEEWEIKSS
Sbjct: 61 PSSFFAGGRKLNHRNSFGFRDGFEDDLEADLDENPESNGSSSKFPKPDDYGDEEWEIKSS 120
Query: 121 IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRNFTRDPR 180
IGRDCTLDYEEEED YDKVAVGKEK GDRLYMKDI DCG+EIDSCTELPTS+RNFTRDPR
Sbjct: 121 IGRDCTLDYEEEEDVYDKVAVGKEKSGDRLYMKDIADCGIEIDSCTELPTSLRNFTRDPR 180
Query: 181 ANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNHLDAKSH 240
ANHLAAKVRLKEDAEASK DSL +SEN + ECN S NPKSILKRKDNH+D KS
Sbjct: 181 ANHLAAKVRLKEDAEASKTTDSLPISENVALPIAVPECNASPNPKSILKRKDNHIDVKSQ 240
Query: 241 KRVRFDSECKISQESQGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVPDYLRNPSRYT 300
KRVRF+ ECKISQ+ +GSKD+ ME +SS E A N+AT S GY QVPDY++NPSRYT
Sbjct: 241 KRVRFNPECKISQDFRGSKDISMEGNSSPEDAYVCNEATFSSAGYRTQVPDYIQNPSRYT 300
Query: 301 HYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITFTPKKKAGDTI 360
HYTFDSS +VD+E N+KAY++FLQLVRGSKTM SH+DDA GPPK ITF PKKKAGDT+
Sbjct: 301 HYTFDSSYDVDDEFNRKAYLDFLQLVRGSKTME-SHEDDASAGPPKYITFIPKKKAGDTV 360
Query: 361 MLENSALGEQNHQNGFGKELVHQKSMPIGIAAVDDDVCSMEEDEPDKLEIKGNSSQKPGR 420
M+ENSALG QN QNG GKE+V Q+ + IGI D+VCSMEEDEPDKLE++ NSSQKPGR
Sbjct: 361 MIENSALGSQNDQNGAGKEVVQQRGIAIGIDNQTDEVCSMEEDEPDKLELRRNSSQKPGR 420
Query: 421 QYRMRAKTEEEEE 433
QYRMRAK E EEE
Sbjct: 421 QYRMRAKMESEEE 431
BLAST of Tan0011843 vs. TAIR 10
Match:
AT5G13970.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G13310.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 238.8 bits (608), Expect = 8.1e-63
Identity = 180/464 (38.79%), Positives = 261/464 (56.25%), Query Frame = 0
Query: 1 MEDNFKVRVDRIFGSLSSSSTSSTAGFSSPLNSLWCLTDDEIERREWIKRKEDQPEPELE 60
M+++F+VR+D++FGSL+SSSTS S+P++SLWCL +DEI+ +
Sbjct: 1 MDESFRVRIDKVFGSLASSSTS-----SAPVSSLWCLAEDEIDGNQ-------------- 60
Query: 61 PTSFFPGRRKVNER-NSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEEWEIKSS 120
G ++++E NSF E + DDL + E G S + KP DY DEEWEIK+S
Sbjct: 61 ----RSGEKELSESLNSFSDNGKQEHESDDLSID-EEKGRSSELQKPSDYDDEEWEIKNS 120
Query: 121 IGRDCTLDYEEEEDEYDKVAVGKEKGGDRLY--MKDIT-DCGVEIDSCTELPTSIRNFTR 180
IG D TLD EEE D+ DKVA+ G+++Y MKD+ D E D ELP S +
Sbjct: 121 IGMDSTLDMEEELDDNDKVAL-----GEKVYCCMKDVNDDYETEADEWVELPASFNEREK 180
Query: 181 DPRANHLAAKVRLKEDAEASKRMDSLDVSE---NGTVVNTDSE----------------- 240
DPRAN +AAK+RLKEDAEA +++SL VSE + ++T++E
Sbjct: 181 DPRANLIAAKLRLKEDAEAVNKLNSLHVSEELQDNLSMSTENEKPFVVSEDNLLGAFKES 240
Query: 241 ---CNTSQNPKSILKRKDNHL-DAKSHKRVRFDSECKISQESQGSKDMYMESSSSLEAAE 300
+ K ILKR++N D+KS KRVRF S+ K ++G D ME+SS
Sbjct: 241 HVGSSDENGLKPILKRRENQADDSKSPKRVRFSSDVKDRTLTEGDNDSVMEASS------ 300
Query: 301 GTNQATLQSQGYPPQVPDYLRNPSRYTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMG 360
N+ +++ YP +PDY+RNPS+YT YTF+ S EVDEESN+KAYM+FL ++R
Sbjct: 301 -PNEDKVEAV-YPTGIPDYMRNPSKYTRYTFE-SGEVDEESNRKAYMDFLNMIRS----- 360
Query: 361 LSHQDDALTGP----PKSITFTPKKKAGDTIMLENSALGEQNHQNGFGKELVHQKSMPIG 420
+D++L P P+S+ F PK+K +EN K+ + + I
Sbjct: 361 ---KDESLVDPLMELPRSVAFVPKRKPMAESKVEN-----------IDKD-CEGRRVAIA 403
Query: 421 IAAVDD-DVCSMEEDEPDKLEIKGNSSQKPGRQYRMRAKTEEEE 432
+ ++D + +MEEDEP+ + + +++PGRQYR RAK + EE
Sbjct: 421 VDTIEDCTISAMEEDEPETAQ---HVTKRPGRQYRARAKEDPEE 403
BLAST of Tan0011843 vs. TAIR 10
Match:
AT5G13310.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G13970.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 127.1 bits (318), Expect = 3.4e-29
Identity = 137/443 (30.93%), Positives = 211/443 (47.63%), Query Frame = 0
Query: 2 EDNFKVRVDRIFGSL-------SSSSTSSTAGFS-SPLNSLWCLTDDEIERREWIKRKED 61
+++F+ RV +IFGSL S STSS+A S S+W L+D E+E+REW +++
Sbjct: 8 DEDFQSRVAKIFGSLPFSRPSSSKFSTSSSAPASRQQSGSVWTLSDTEVEKREW--KRDS 67
Query: 62 QPEPELEPTSFFPGRRKVNERNSFGFGDDFEDDLDDLDENPESNGSSGKFLKPDDYGDEE 121
E+ S F K + + EDDL D+D + +G
Sbjct: 68 YDRDEIPCASSFDELLKQQKPS--------EDDLKDIDCGEDFDG--------------V 127
Query: 122 WEIKSSIGRDCTLDYEEEEDEYDKVAVGKEKGGDRLYMKDITDCGVEIDSCTELPTSIRN 181
W I++S+G D TLD E EEDEYDKVA+G+E G+ CG
Sbjct: 128 WSIRASMGLDRTLDDEAEEDEYDKVALGEENDGE--------GCG--------------- 187
Query: 182 FTRDPRANHLAAKVRLKEDAEASKRMDSLDVSENGTVVNTDSECNTSQNPKSILKRKDNH 241
RDPRAN++AA++RLKED + + ++ + + E + + K ILKRK+N
Sbjct: 188 RIRDPRANYVAARIRLKEDEIEANKFNTSASQPSESKEPHAEESSEAMPRKPILKRKENS 247
Query: 242 LD--AKSHKRVRFDSECKISQES--QGSKDMYMESSSSLEAAEGTNQATLQSQGYPPQVP 301
D A++ KRVRFDS + +E+ + +SS + + +G + A +VP
Sbjct: 248 SDSEARTSKRVRFDS---VPEETLKKPEDTCSASASSKIVSHQGKSGA---------RVP 307
Query: 302 DYLRNPSRYTHYTFDSSSEVDEESNKKAYMEFLQLVRGSKTMGLSHQDDALTGPPKSITF 361
DYL NPS YT Y+FD S E+D ES YM+ V G K + + ++ PK ++F
Sbjct: 308 DYLLNPSSYTRYSFDPSCELDVESPTGEYMDTPNAVEGLK----NPESESF---PK-VSF 367
Query: 362 TPKKKAGDTIMLENSALGEQNHQNGFGKELVHQKSMPIGIAAVDDDVCSMEEDEPDKLEI 421
P+ K D + ++S E K + G A ++ + E+ + + E
Sbjct: 368 IPQNKTKD-VSEDSSNCNE-------------TKPIVAGELAEEERPSATEDGDTEIRES 369
Query: 422 KG-NSSQKPGRQYRMRAKTEEEE 432
+ S Q GRQYR + +E +
Sbjct: 428 ESCTSFQSKGRQYRAKLSLDETD 369
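The percentages in each HSP header above are the counts of identical (or positively scoring) alignment columns divided by the alignment length. As a small sketch of that arithmetic, the following Python snippet recomputes both values from a header line. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool; the pattern also tolerates the "Postives" spelling that some report renderers emit.

```python
import re

# Matches e.g. "Identity = 355/437" and "Positives = 379/437".
HSP_STATS = re.compile(r"(Identity|Pos\w*tives) = (\d+)/(\d+)")


def hsp_percentages(header):
    """Recompute identity and positives percentages from an HSP header line."""
    result = {}
    for label, matches, length in HSP_STATS.findall(header):
        key = "identity" if label == "Identity" else "positives"
        # Percentage of alignment columns that match, to two decimals.
        result[key] = round(100 * int(matches) / int(length), 2)
    return result


line = "Identity = 355/437 (81.24%), Positives = 379/437 (86.73%), Query Frame = 0"
print(hsp_percentages(line))  # {'identity': 81.24, 'positives': 86.73}
```

Note that the denominator is the alignment length including gap columns (437 here), not the query length of 432 residues.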
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022989648.1 | 5.6e-191 | 81.24 | uncharacterized protein LOC111486668 [Cucurbita maxima]
KAG6588861.1 | 7.3e-191 | 81.09 | hypothetical protein SDJN03_17426, partial [Cucurbita argyrosperma subsp. sorori...
XP_022928536.1 | 1.2e-190 | 81.05 | uncharacterized protein LOC111435316 [Cucurbita moschata]
XP_023531936.1 | 1.6e-190 | 81.01 | uncharacterized protein LOC111794053 [Cucurbita pepo subsp. pepo]
KAG7017870.1 | 3.9e-184 | 80.50 | hypothetical protein SDJN02_19736 [Cucurbita argyrosperma subsp. argyrosperma]
Match Name | E-value | Identity (%) | Description
A0A6J1JQY5 | 2.7e-191 | 81.24 | Protein TSSC4 OS=Cucurbita maxima OX=3661 GN=LOC111486668 PE=3 SV=1
A0A6J1EKJ9 | 6.0e-191 | 81.05 | Protein TSSC4 OS=Cucurbita moschata OX=3662 GN=LOC111435316 PE=3 SV=1
A0A6J1F221 | 3.3e-181 | 79.54 | Protein TSSC4 OS=Cucurbita moschata OX=3662 GN=LOC111441676 PE=3 SV=1
A0A6J1IZZ9 | 7.4e-181 | 79.36 | Protein TSSC4 OS=Cucurbita maxima OX=3661 GN=LOC111481475 PE=3 SV=1
A0A6J1D3P7 | 6.5e-177 | 76.44 | Protein TSSC4 OS=Momordica charantia OX=3673 GN=LOC111016977 PE=3 SV=1
Match Name | E-value | Identity (%) | Description
AT5G13970.1 | 8.1e-63 | 38.79 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G13310.1 | 3.4e-29 | 30.93 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...