Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS
TAATGGAGGGAGAAGGTAGCTCAGAAATGGAGTATACGGAGATCGAAGCCTCCGCCGATTGCCTTGACAGCTCCATACTGTTCAATATCATCAACGATGTCTCCGCCTTCGTCTTGTACATGCATCAACAAGTCCCTTCGTAAGTTACTCCTTTCCAATTCCAACATATTTTCCTTCCGGTAACTTAGAATCTGCAAGAAACGCAGGGAAAATCCGTACGGTTTTGATTTGTGATGGAAAATGTGTGATTGCGAAACAATCGTGTGATCGATATTAAGCTTGTTCATCCTTGGCGCATATGATCATTTGTTTTTGATCGTGGAGTTTTGATTCAATTCCGATATATGCACATGGTTGGATTTCATTTCCTCAATCACTCTGTTTGAATTTTGAAATTGTGCTTTTTAGGTTTGGGGGAAGGAAGATTAGGAATTATATTAGCTGCGAGCTTTGTTCAGTTCCAAATTTATTGTGGTCTTTTAAGAACGATGAAATTCATGCAGAGAAATGATAAGGAAATAAGTAAAATTTTTAAAAATTAAAAAACAAATGGTTATCAAACCCGCCCTCATTTTTTGCTGTATTTTTGTTAAATTGACTGGTTGTTATTATTTATGTGTTGCTGATGTTATTCTTTGAATGTAATATAATATTAGTGCATTCTTTCTAGGATTTGTAAATGGCAGTTGTTGTAATTTTAAAAATGCTTTTTGGATGAAAGAAATAAATAGAAGAATGCCAACAAGAGTCGGCTAAGGTATGTAATGAGTAATCTCAATGGGTGGTTCAAACCATGCTTTCACATTTGTTGTACTCCAAAAAATGATAAAAATAAGAATGACGTTGAACTAACAGAATATTTATTTTGCATACCATTGCTTAATTATTTCTTCTGTTTATCTGTTTTGAAGTGTCTTCTCCATGATTTTTGCATCAAGATCGTGTGGCATGTAATGCAACTTTTGTTTGATTTTCTTGCCTCGTATTACAGGAAGCCCTTGCAAGAATACCTTTATAAAATAGATATTGTGGAGAAAAGGGTCACTCCTTAAGGATTTATGATGATTTAACACTGTATGAGATATCTATTTGGTATGTTTGTACAATAAGTACTAAGGTTAGGTGGCTTATGTGCAGAATTCTTCAAGATATGAGCATTGAATTTGATACTTTGCATGAAGAATACAAAGAGCTGGTATGTCAGCTGAGAATGGATAGTTTCCTTTACATTTTTTTGCTTCAATATGAGGATCATAAGACCTATCGGTTTTAGGGGAGTGAGCTAGCACAAAATGAACTAAAAGCGTCATCACGAAGAAAGCATACTGGCAGAATGAGGGAGGTCAGACAGGGAATTAAGAGAATGGAGAAGTTAATGAATTCAGTCTCTGGTTTTCAAGTTGCCATCAAATCGTTGATTAGTGAGACTCCTAACATCCAAGAAGTCTTATTAATTCTTGGAGCAACCCCACTACGACCTCAATATGTCTATGAGCTGTGCTTTTTACATAAAAAAGTTGTGGGGAGAGGTGCAGATGACTTCGTCAAGCACAAAGTAGCAGAAGTTCTTTCAAGAAAGGTGCCTGTGAATTTGTAGTACCTTTCCTAAAATTCATCACTATTTGATACAAATTTCGTTGTTAATAACAGGCTATTCGAACGCTAATCTCGAAAGATGCTGGGTCTGCCTCGTATCCAGGTACACAGTTGTGCATGTTAAAGCAAATTCATCAAGTTGGGAGAAGAATTCTCATCCTTGAAATTTCCCTCATTAAGAAATGATAACATATTCTTTTAGGCCCTACTAAGTTGTTTCTATTGGTGAAGGCTCCTTCTTCTTTCAATCTGCCCTTGCACTTCATTCCAAAACGTGAGTTTCGATACAGCAAAAAGGTAAATGTTAGACGTGTCTGCTTTTATTTTCATCTGAGGATATTAATATGAGATCATCTTGAAGATTCTGTGATCTATCTCCTCTATTAAAGAGTGCTTTATTTTGT
TTCCGGTTAAATCAGTTATTAATGGTAAAGAGAATCTTCTTAGTTTCAGTGAGCTGATGGACTGGAGTACCAACCAGTCATCTGATGAAAAGAGCATAGCATATCCTTGTGTAAAAAAATATGAGTTACTTGTTAGCTGTCAATAATATCAATAGTATCCTGAATCTACCGATCATATCAGACTATGGAGTTCCATAAGGATTTGACTGCTTAATATAATATCCTGCATCACTTTTCATTTATCAACATCATTTGGCATAAAATCTTTCAGTTTGATCCAAATGCATAATATGATGTCAAACCATTAAGCACCATTGTGTACATCTTTCTGGGTGCATTACAATCTGTTAATCAATTTGATAGCATCTAAAAAAAGATCTTTGGTGCAGATAGTGCCTTTCAAACTGCGATTTAAGTGCAAGGCCCAAATTCAGCAGATGAATGGTTCTGATCGTGAATCTCAAGTTGGAAACTCTGATGACTTAACCAATCCAGAAGATTCAATCTGGTATGCAAGGCTTTAATCTTCCATTATTATTATGATTTTTTTTATTTATTAATTTGTGTTGGGGATGCGACCAAAAGTTTTATGTTGGCTAGACGAGGGGAAGATTATGGATATATAAGTGTACAGTGATATGAGATCTTTTAAGTGAAACCAAAAACAAAATTATGAGAGCTTATGTTTAAAGTGGACAATACCAAACAATTGTGGAAATGAATGAAGAGTTGTCCTTAACACATTGTCATTCATTGTTGAAGTAAAAATATCAAGAGAGACACTTGATACATACAGCCCCGGCTAGGTATTCAAATGTTGTGGTATTTGTGAGTGTTAAACTTCACTTTGAATTGATTATGGCAGATATCAATTAAAAGAGGGGTTAAGAAATAAAGTTAGTTATGTATAGTATAATAAGATAAAGTCACATTAGGGCCCTTGGATAACATTTTTGCTTTTAGAAACATGCTCATTTACTAGTGTTTTCCTTTATTTTCTAATGTGATTTTATAGTAAAAAATTGCTCCTAACTATAATTTGAAATGTTTTTTTTGAGAGTGAAACTAAAGAAAACAGAGGTATAAAGAATTTTTATAATCTCAATATTTAAAAATAATTAAACATGCCTTAGTTGGTGTAATTCGTCATTAAATAGTCAATTTGAGTAGCTCCACTATAAGGTCAACTAAAAGGTTAGAGATTCGAATTCTTGCCCTATATCTTTGAACTAAAAAAAAGTGCACAATCAAATAGCTATCTTTGCCAAATTTAAAGTTATATACCTGTTGTTGATCCTTGTATGCAACAATTATAGCAACAAGGGGCCTTGATGACCATTTTCCAATATGATTTGTCGCATTTCCAGGTTTCAATGTCGACATGCAATCAAGGGGCTAGCATTGAACAAGCCTGATGAAGATTGA
mRNA sequence
TAATGGAGGGAGAAGGTAGCTCAGAAATGGAGTATACGGAGATCGAAGCCTCCGCCGATTGCCTTGACAGCTCCATACTGTTCAATATCATCAACGATGTCTCCGCCTTCGTCTTGTACATGCATCAACAAGTCCCTTCAATTCTTCAAGATATGAGCATTGAATTTGATACTTTGCATGAAGAATACAAAGAGCTGGGGAGTGAGCTAGCACAAAATGAACTAAAAGCGTCATCACGAAGAAAGCATACTGGCAGAATGAGGGAGGTCAGACAGGGAATTAAGAGAATGGAGAAGTTAATGAATTCAGTCTCTGGTTTTCAAGTTGCCATCAAATCGTTGATTAGTGAGACTCCTAACATCCAAGAAGTCTTATTAATTCTTGGAGCAACCCCACTACGACCTCAATATGTCTATGAGCTGTGCTTTTTACATAAAAAAGTTGTGGGGAGAGGTGCAGATGACTTCGTCAAGCACAAAGTAGCAGAAGTTCTTTCAAGAAAGGCTATTCGAACGCTAATCTCGAAAGATGCTGGGTCTGCCTCGTATCCAGGCCCTACTAAGTTGTTTCTATTGGTGAAGGCTCCTTCTTCTTTCAATCTGCCCTTGCACTTCATTCCAAAACGTGAGTTTCGATACAGCAAAAAGATAGTGCCTTTCAAACTGCGATTTAAGTGCAAGGCCCAAATTCAGCAGATGAATGGTTCTGATCGTGAATCTCAAGTTGGAAACTCTGATGACTTAACCAATCCAGAAGATTCAATCTGGTTTCAATGTCGACATGCAATCAAGGGGCTAGCATTGAACAAGCCTGATGAAGATTGA
Coding sequence (CDS)
ATGGAGGGAGAAGGTAGCTCAGAAATGGAGTATACGGAGATCGAAGCCTCCGCCGATTGCCTTGACAGCTCCATACTGTTCAATATCATCAACGATGTCTCCGCCTTCGTCTTGTACATGCATCAACAAGTCCCTTCAATTCTTCAAGATATGAGCATTGAATTTGATACTTTGCATGAAGAATACAAAGAGCTGGGGAGTGAGCTAGCACAAAATGAACTAAAAGCGTCATCACGAAGAAAGCATACTGGCAGAATGAGGGAGGTCAGACAGGGAATTAAGAGAATGGAGAAGTTAATGAATTCAGTCTCTGGTTTTCAAGTTGCCATCAAATCGTTGATTAGTGAGACTCCTAACATCCAAGAAGTCTTATTAATTCTTGGAGCAACCCCACTACGACCTCAATATGTCTATGAGCTGTGCTTTTTACATAAAAAAGTTGTGGGGAGAGGTGCAGATGACTTCGTCAAGCACAAAGTAGCAGAAGTTCTTTCAAGAAAGGCTATTCGAACGCTAATCTCGAAAGATGCTGGGTCTGCCTCGTATCCAGGCCCTACTAAGTTGTTTCTATTGGTGAAGGCTCCTTCTTCTTTCAATCTGCCCTTGCACTTCATTCCAAAACGTGAGTTTCGATACAGCAAAAAGATAGTGCCTTTCAAACTGCGATTTAAGTGCAAGGCCCAAATTCAGCAGATGAATGGTTCTGATCGTGAATCTCAAGTTGGAAACTCTGATGACTTAACCAATCCAGAAGATTCAATCTGGTTTCAATGTCGACATGCAATCAAGGGGCTAGCATTGAACAAGCCTGATGAAGATTGA
Protein sequence
MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHEEYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNIQEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSASYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMNGSDRESQVGNSDDLTNPEDSIWFQCRHAIKGLALNKPDED
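The protein sequence above should correspond to a direct translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the Python below translates the first ten codons and the last five codons of the CDS as listed. The function name `translate` is hypothetical and is not part of this database or of any particular library.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this gene is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown above.
cds_start = "ATGGAGGGAGAAGGTAGCTCAGAAATGGAG"
cds_end = "CCTGATGAAGATTGA"

print(translate(cds_start))  # MEGEGSSEME
print(translate(cds_end))    # PDED
```

Both fragments agree with the start (MEGEGSSEME) and end (PDED) of the listed protein sequence; passing the full CDS string to the same function would be the complete check.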
Homology
BLAST of Tan0004403 vs. NCBI nr
Match:
XP_038897381.1 (uncharacterized protein LOC120085475 [Benincasa hispida])
HSP 1 Score: 493.8 bits (1270), Expect = 9.5e-136
Identity = 255/277 (92.06%), Positives = 263/277 (94.95%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEG+GSSEM+YTEIE+SADC DSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE
Sbjct: 1 MEGKGSSEMQYTEIESSADCFDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI
Sbjct: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
QEVLLILGATPLRPQYVYE+CF HKKVV RGAD+FVKHK AE LSRKAIRTLISKDAGS
Sbjct: 121 QEVLLILGATPLRPQYVYEMCFSHKKVVVRGADNFVKHKAAEALSRKAIRTLISKDAGSV 180
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSSFNLP+HFIPKREFRYSKKIVPFKLRFKCK+QIQQMN G DRE
Sbjct: 181 SYPGPTKLFLLVKAPSSFNLPMHFIPKREFRYSKKIVPFKLRFKCKSQIQQMNNPGPDRE 240
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
SQVG SDDLTN EDSIWFQCRHAIKGLA N+PDED
Sbjct: 241 SQVGTSDDLTNSSVEDSIWFQCRHAIKGLAFNRPDED 277
BLAST of Tan0004403 vs. NCBI nr
Match:
XP_008451371.1 (PREDICTED: uncharacterized protein LOC103492681 isoform X1 [Cucumis melo] >KAA0064091.1 F15k9.21, putative isoform 1 [Cucumis melo var. makuwa] >TYK18490.1 F15k9.21, putative isoform 1 [Cucumis melo var. makuwa])
HSP 1 Score: 480.7 bits (1236), Expect = 8.3e-132
Identity = 249/277 (89.89%), Positives = 257/277 (92.78%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEG SEMEYTEIE+SADC DSSILFNIINDVSAFVLYMHQQ+PS LQDMSIEFDTLHE
Sbjct: 1 MEGEGRSEMEYTEIESSADCFDSSILFNIINDVSAFVLYMHQQLPSTLQDMSIEFDTLHE 60
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISE PNI
Sbjct: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISEAPNI 120
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
+EVLLILGATPLRPQYVYE+CF HK+ RGAD+FVKHK AEVLSRKAIRTLISKDAGS
Sbjct: 121 EEVLLILGATPLRPQYVYEMCFSHKRFGLRGADNFVKHKAAEVLSRKAIRTLISKDAGSV 180
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYS+KIVPFKLRFKCKAQIQQM G DRE
Sbjct: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSRKIVPFKLRFKCKAQIQQMKNPGHDRE 240
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
S VGNSDDLTN ED IWFQCRHAIKGLA N+PDED
Sbjct: 241 SHVGNSDDLTNSSVEDPIWFQCRHAIKGLAFNRPDED 277
BLAST of Tan0004403 vs. NCBI nr
Match:
XP_023548798.1 (uncharacterized protein LOC111807345 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 480.7 bits (1236), Expect = 8.3e-132
Identity = 251/277 (90.61%), Positives = 257/277 (92.78%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEG+SEMEYTEIE+SAD DSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE
Sbjct: 1 MEGEGTSEMEYTEIESSADYFDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQ AIK+LISETPN+
Sbjct: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQAAIKTLISETPNV 120
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
QEVLLILGATPLRPQYVYELCF HK VV RGAD FVKHK AEVLSRKAIRTLISKDAGSA
Sbjct: 121 QEVLLILGATPLRPQYVYELCFSHKNVVVRGADYFVKHKAAEVLSRKAIRTLISKDAGSA 180
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSS NLPLHFIPKREFRYSKKI PFKLRFKCKAQI QMN G DRE
Sbjct: 181 SYPGPTKLFLLVKAPSSINLPLHFIPKREFRYSKKIKPFKLRFKCKAQIHQMNDPGLDRE 240
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
QVGNSDDL N EDSIWFQCRHAIKG+A N+PDED
Sbjct: 241 FQVGNSDDLANSSVEDSIWFQCRHAIKGIAFNRPDED 277
BLAST of Tan0004403 vs. NCBI nr
Match:
XP_022991769.1 (uncharacterized protein LOC111488302 [Cucurbita maxima])
HSP 1 Score: 479.9 bits (1234), Expect = 1.4e-131
Identity = 250/277 (90.25%), Positives = 256/277 (92.42%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEG+SEMEYTEIE+SAD DSS+LFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE
Sbjct: 34 MEGEGTSEMEYTEIESSADYFDSSLLFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 93
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQ AIKSLISETPN+
Sbjct: 94 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQAAIKSLISETPNV 153
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
QEVLLILGATPLRPQYVYELCF HK V RGAD FVKHK AEVLSRKAIRTLISKDAGSA
Sbjct: 154 QEVLLILGATPLRPQYVYELCFSHKNAVVRGADYFVKHKAAEVLSRKAIRTLISKDAGSA 213
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSS NLPLHFIPKREFRYSKKI PFKLRFKCKAQI QMN G DRE
Sbjct: 214 SYPGPTKLFLLVKAPSSINLPLHFIPKREFRYSKKIKPFKLRFKCKAQIHQMNDPGPDRE 273
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
QVGNSDDL N EDSIWFQCRHAIKG+A N+PDED
Sbjct: 274 FQVGNSDDLVNSSVEDSIWFQCRHAIKGIAFNRPDED 310
BLAST of Tan0004403 vs. NCBI nr
Match:
KAG6575900.1 (hypothetical protein SDJN03_26539, partial [Cucurbita argyrosperma subsp. sororia] >KAG7014433.1 hypothetical protein SDJN02_24610 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 479.2 bits (1232), Expect = 2.4e-131
Identity = 250/277 (90.25%), Positives = 255/277 (92.06%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEG+SEMEYTEIE+SAD DSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE
Sbjct: 1 MEGEGTSEMEYTEIESSADYFDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQ AIKSLISETPN+
Sbjct: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQAAIKSLISETPNV 120
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
QEVLLILGATPLRPQYVYELCF HK VV RGAD FVKHK AEVLSRKAIRTLISKDAGSA
Sbjct: 121 QEVLLILGATPLRPQYVYELCFSHKNVVVRGADYFVKHKAAEVLSRKAIRTLISKDAGSA 180
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSS NLPLHFIPKREFRYSKKI PFKLRFKCK QI QMN G DRE
Sbjct: 181 SYPGPTKLFLLVKAPSSINLPLHFIPKREFRYSKKIKPFKLRFKCKGQIHQMNDPGPDRE 240
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
QVGNSDDL N EDSIWFQCRH IKG+A N+PDED
Sbjct: 241 FQVGNSDDLANSSVEDSIWFQCRHTIKGIAFNRPDED 277
BLAST of Tan0004403 vs. ExPASy TrEMBL
Match:
A0A5D3D4N9 (F15k9.21, putative isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2032G00070 PE=4 SV=1)
HSP 1 Score: 480.7 bits (1236), Expect = 4.0e-132
Identity = 249/277 (89.89%), Positives = 257/277 (92.78%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEG SEMEYTEIE+SADC DSSILFNIINDVSAFVLYMHQQ+PS LQDMSIEFDTLHE
Sbjct: 1 MEGEGRSEMEYTEIESSADCFDSSILFNIINDVSAFVLYMHQQLPSTLQDMSIEFDTLHE 60
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISE PNI
Sbjct: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISEAPNI 120
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
+EVLLILGATPLRPQYVYE+CF HK+ RGAD+FVKHK AEVLSRKAIRTLISKDAGS
Sbjct: 121 EEVLLILGATPLRPQYVYEMCFSHKRFGLRGADNFVKHKAAEVLSRKAIRTLISKDAGSV 180
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYS+KIVPFKLRFKCKAQIQQM G DRE
Sbjct: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSRKIVPFKLRFKCKAQIQQMKNPGHDRE 240
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
S VGNSDDLTN ED IWFQCRHAIKGLA N+PDED
Sbjct: 241 SHVGNSDDLTNSSVEDPIWFQCRHAIKGLAFNRPDED 277
BLAST of Tan0004403 vs. ExPASy TrEMBL
Match:
A0A1S3BSF3 (uncharacterized protein LOC103492681 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492681 PE=4 SV=1)
HSP 1 Score: 480.7 bits (1236), Expect = 4.0e-132
Identity = 249/277 (89.89%), Positives = 257/277 (92.78%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEG SEMEYTEIE+SADC DSSILFNIINDVSAFVLYMHQQ+PS LQDMSIEFDTLHE
Sbjct: 1 MEGEGRSEMEYTEIESSADCFDSSILFNIINDVSAFVLYMHQQLPSTLQDMSIEFDTLHE 60
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISE PNI
Sbjct: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISEAPNI 120
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
+EVLLILGATPLRPQYVYE+CF HK+ RGAD+FVKHK AEVLSRKAIRTLISKDAGS
Sbjct: 121 EEVLLILGATPLRPQYVYEMCFSHKRFGLRGADNFVKHKAAEVLSRKAIRTLISKDAGSV 180
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYS+KIVPFKLRFKCKAQIQQM G DRE
Sbjct: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSRKIVPFKLRFKCKAQIQQMKNPGHDRE 240
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
S VGNSDDLTN ED IWFQCRHAIKGLA N+PDED
Sbjct: 241 SHVGNSDDLTNSSVEDPIWFQCRHAIKGLAFNRPDED 277
BLAST of Tan0004403 vs. ExPASy TrEMBL
Match:
A0A6J1JVR3 (uncharacterized protein LOC111488302 OS=Cucurbita maxima OX=3661 GN=LOC111488302 PE=4 SV=1)
HSP 1 Score: 479.9 bits (1234), Expect = 6.8e-132
Identity = 250/277 (90.25%), Positives = 256/277 (92.42%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEG+SEMEYTEIE+SAD DSS+LFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE
Sbjct: 34 MEGEGTSEMEYTEIESSADYFDSSLLFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 93
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQ AIKSLISETPN+
Sbjct: 94 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQAAIKSLISETPNV 153
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
QEVLLILGATPLRPQYVYELCF HK V RGAD FVKHK AEVLSRKAIRTLISKDAGSA
Sbjct: 154 QEVLLILGATPLRPQYVYELCFSHKNAVVRGADYFVKHKAAEVLSRKAIRTLISKDAGSA 213
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSS NLPLHFIPKREFRYSKKI PFKLRFKCKAQI QMN G DRE
Sbjct: 214 SYPGPTKLFLLVKAPSSINLPLHFIPKREFRYSKKIKPFKLRFKCKAQIHQMNDPGPDRE 273
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
QVGNSDDL N EDSIWFQCRHAIKG+A N+PDED
Sbjct: 274 FQVGNSDDLVNSSVEDSIWFQCRHAIKGIAFNRPDED 310
BLAST of Tan0004403 vs. ExPASy TrEMBL
Match:
A0A0A0K8W9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G390030 PE=4 SV=1)
HSP 1 Score: 479.2 bits (1232), Expect = 1.2e-131
Identity = 248/277 (89.53%), Positives = 255/277 (92.06%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEGSSEMEYTEIE+S DC DSSILFNIINDVSAFVLYMHQQVPS LQDMSIEFDTLHE
Sbjct: 1 MEGEGSSEMEYTEIESSTDCFDSSILFNIINDVSAFVLYMHQQVPSTLQDMSIEFDTLHE 60
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSEL QNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISE PNI
Sbjct: 61 EYKELGSELEQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISEAPNI 120
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
+EVLLILGATPLRPQYVYE+CF HK+ RGAD+F KHK AEVLSRKAIRTLISKDAGS
Sbjct: 121 EEVLLILGATPLRPQYVYEMCFSHKRFALRGADNFAKHKAAEVLSRKAIRTLISKDAGSV 180
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMN--GSDRE 240
SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYS+KIVPFKLRFKCKAQIQQM DRE
Sbjct: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSRKIVPFKLRFKCKAQIQQMKHPDHDRE 240
Query: 241 SQVGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
SQVGNSDDLTN ED IWFQCRHAIKGLA N+PDED
Sbjct: 241 SQVGNSDDLTNSSVEDPIWFQCRHAIKGLAFNRPDED 277
BLAST of Tan0004403 vs. ExPASy TrEMBL
Match:
A0A6J1D966 (uncharacterized protein LOC111018414 OS=Momordica charantia OX=3673 GN=LOC111018414 PE=4 SV=1)
HSP 1 Score: 474.6 bits (1220), Expect = 2.9e-130
Identity = 243/275 (88.36%), Positives = 256/275 (93.09%), Query Frame = 0
Query: 1 MEGEGSSEMEYTEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 60
MEGEGSSEMEYTEIEASAD DSSILFNIINDVSAFVL+MHQQ+PSILQDMSIEFDTLHE
Sbjct: 1 MEGEGSSEMEYTEIEASADYFDSSILFNIINDVSAFVLFMHQQLPSILQDMSIEFDTLHE 60
Query: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 120
EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQ AIK LI+E PNI
Sbjct: 61 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQSAIKLLITEIPNI 120
Query: 121 QEVLLILGATPLRPQYVYELCFLHKKVVGRGADDFVKHKVAEVLSRKAIRTLISKDAGSA 180
+EVLLILGATPLRPQ+VY+LCFL +K GADDF+KHK AEVLSRKAIRTLISKDAGS+
Sbjct: 121 EEVLLILGATPLRPQHVYQLCFLQRKAAVGGADDFIKHKAAEVLSRKAIRTLISKDAGSS 180
Query: 181 SYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMNGSDRESQ 240
SYPGPTKLFLLVKAP+SFNLPLHFIPKR+FRYSKKIVPFKLRFKCK QIQ+MN DRESQ
Sbjct: 181 SYPGPTKLFLLVKAPASFNLPLHFIPKRDFRYSKKIVPFKLRFKCKGQIQKMNAPDRESQ 240
Query: 241 VGNSDDLTNP--EDSIWFQCRHAIKGLALNKPDED 274
VGN DDLTN EDSIWFQCRHAIKGLA N+PDED
Sbjct: 241 VGNCDDLTNSSVEDSIWFQCRHAIKGLAFNRPDED 275
BLAST of Tan0004403 vs. TAIR 10
Match:
AT1G03180.1 (unknown protein; Has 36 Blast hits to 36 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi - 0; Plants - 33; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 255.8 bits (652), Expect = 4.0e-68
Identity = 137/274 (50.00%), Positives = 184/274 (67.15%), Query Frame = 0
Query: 2 EGEGSSEMEY-TEIEASADCLDSSILFNIINDVSAFVLYMHQQVPSILQDMSIEFDTLHE 61
EGEG++E Y +I +A L S +F+IIND+ FVLYMHQQ+PS+LQDMS+EF+ L
Sbjct: 5 EGEGTTEENYDVDIATTASSLGGSGVFHIINDIVGFVLYMHQQIPSVLQDMSLEFEGLQT 64
Query: 62 EYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEKLMNSVSGFQVAIKSLISETPNI 121
E+ +L + LA+ ++K RRK R REV+ IK++EKLM ++S + A++ +I E P I
Sbjct: 65 EFMDLETNLAEPQVKPLVRRKLMSRKREVKNEIKKLEKLMKTISSLRSALQLMIREAPGI 124
Query: 122 QEVLLILGATPLRPQYVYELCFLHKKVVGRGAD-DFVKHKVAEVLSRKAIRTLISKDAGS 181
Q+V+LILG +PLRPQ YEL F ++ G + DF K K AE LS+K IR LIS AGS
Sbjct: 125 QKVVLILGGSPLRPQNAYELLFTQRRDHVLGYEGDFAKSKAAEALSKKTIRALISTGAGS 184
Query: 182 ASYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIVPFKLRFKCKAQIQQMNGSDRES 241
SYPGP +LF+LV AP + NLP HF+PKR+FRY++K VP KLRFKC+ Q
Sbjct: 185 TSYPGPMRLFILVHAPPTLNLPQHFLPKRDFRYNRKFVPSKLRFKCRTQ----------- 244
Query: 242 QVGNSDDLTNPEDSIWFQCRHAIKGLALNKPDED 274
N+ + D IW+QCRH IKGLA ++P E+
Sbjct: 245 --DNATNSPPTNDLIWYQCRHVIKGLAFHQPVEE 265
BLAST of Tan0004403 vs. TAIR 10
Match:
AT1G03180.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 8 growth stages; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 212.6 bits (540), Expect = 3.9e-55
Identity = 114/236 (48.31%), Positives = 155/236 (65.68%), Query Frame = 0
Query: 39 YMHQQVPSILQDMSIEFDTLHEEYKELGSELAQNELKASSRRKHTGRMREVRQGIKRMEK 98
Y+H + +LQDMS+EF+ L E+ +L + LA+ ++K RRK R REV+ IK++EK
Sbjct: 31 YLH--LCRVLQDMSLEFEGLQTEFMDLETNLAEPQVKPLVRRKLMSRKREVKNEIKKLEK 90
Query: 99 LMNSVSGFQVAIKSLISETPNIQEVLLILGATPLRPQYVYELCFLHKKVVGRGAD-DFVK 158
LM ++S + A++ +I E P IQ+V+LILG +PLRPQ YEL F ++ G + DF K
Sbjct: 91 LMKTISSLRSALQLMIREAPGIQKVVLILGGSPLRPQNAYELLFTQRRDHVLGYEGDFAK 150
Query: 159 HKVAEVLSRKAIRTLISKDAGSASYPGPTKLFLLVKAPSSFNLPLHFIPKREFRYSKKIV 218
K AE LS+K IR LIS AGS SYPGP +LF+LV AP + NLP HF+PKR+FRY++K V
Sbjct: 151 SKAAEALSKKTIRALISTGAGSTSYPGPMRLFILVHAPPTLNLPQHFLPKRDFRYNRKFV 210
Query: 219 PFKLRFKCKAQIQQMNGSDRESQVGNSDDLTNPEDSIWFQCRHAIKGLALNKPDED 274
P KLRFKC+ Q N+ + D IW+QCRH IKGLA ++P E+
Sbjct: 211 PSKLRFKCRTQ-------------DNATNSPPTNDLIWYQCRHVIKGLAFHQPVEE 251
The following BLAST results are available for this feature:
BLAST of Tan0004403 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038897381.1 | 9.5e-136 | 92.06 | uncharacterized protein LOC120085475 [Benincasa hispida]
XP_008451371.1 | 8.3e-132 | 89.89 | PREDICTED: uncharacterized protein LOC103492681 isoform X1 [Cucumis melo] >KAA00...
XP_023548798.1 | 8.3e-132 | 90.61 | uncharacterized protein LOC111807345 isoform X2 [Cucurbita pepo subsp. pepo]
XP_022991769.1 | 1.4e-131 | 90.25 | uncharacterized protein LOC111488302 [Cucurbita maxima]
KAG6575900.1 | 2.4e-131 | 90.25 | hypothetical protein SDJN03_26539, partial [Cucurbita argyrosperma subsp. sorori...
BLAST of Tan0004403 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3D4N9 | 4.0e-132 | 89.89 | F15k9.21, putative isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A1S3BSF3 | 4.0e-132 | 89.89 | uncharacterized protein LOC103492681 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1JVR3 | 6.8e-132 | 90.25 | uncharacterized protein LOC111488302 OS=Cucurbita maxima OX=3661 GN=LOC111488302...
A0A0A0K8W9 | 1.2e-131 | 89.53 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G390030 PE=4 SV=1
A0A6J1D966 | 2.9e-130 | 88.36 | uncharacterized protein LOC111018414 OS=Momordica charantia OX=3673 GN=LOC111018...
BLAST of Tan0004403 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G03180.1 | 4.0e-68 | 50.00 | unknown protein; Has 36 Blast hits to 36 proteins in 15 species: Archae - 0; Bac...
AT1G03180.2 | 3.9e-55 | 48.31 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...