Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
AGAATAAATGCATCCAATTTCTCTCAAGAACAAGGCCAAGGCGTGACCGTGAGAGGCCGTTTTTCTGGCGTGGGCTCGCCAACTTGGAACCATCATTACACAATGAAACAAAGCTCGTGGCCCTTCCACACCTTTTACCATAAAATTTTATTTCCAAAGTCCTTTCTTTTTAGTATGCATTTGGACGAAAAGTAAAGGCCAAGAAACTCAGCCCCGCCCTGTTCTTCAATCTCCACCCTTATAATTTATTCCATTTCTAACCCAGGCAAAAAGGAGTGTTTCTGTTTTTTCAAAATTTCACAGATGATAAAGTAACACCCAACAAAAAGCAGTAGGGGGCCGGATCCTGACTCACTCATCAGTTCTTCTTTTTTCTTCTTCGATTCTCTGTAAGGATTTGTCTTTTAAAAACCCCACATCCCGAAACCCAGATGCCAGAAAACCACTTTTTTTCCACGCCGCTATGAAATTCGGTAGAGAGACTAAAGGGATCCCTTCCACGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTGATGCCCAACCCGCTTTGCAGTCCTGCGAGAGGATCTGATTCGAATAAGCTCCGCCGTAGCCGCCTCCACCACCGGCGGAGGAAATCGGCGGAGAGTCCGGTGGTATGGGCGAAGGCGAAGACGATGGGGTCCGAGATGTCGGAACCGTCGTCGCCGAAAGTGACTTGTGCAGGGCAGATTAAGATCAGGCCGAAGAGCAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCACAATCGGAGGAAATTACGGAGGAGGCGGTTCAATTGGGTCGAGTCGTTAGGGTTCAAGAAGGATCTAATGCAATTCTTGACTTGTTTGAGAAGCATACGGTTTGATTTTAGGTGTTTCAGAGCATTCCCAGAAGCGGATTTCACTACTGAAGAAGATGAAGAAGACGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAAAATCTCCGGGAAATCAAGTGGGGGTTGAGGGAAGTGAAAGCTCCAGCACTGCATTTTCCAAATGGTTTATGGTTTTACAGGAAAGTGGAAGAGAAAGAAGAAGTTTCTGTAGCGTTGAGGATGGTTCGATCGAGCCACCAATGGCGCCGCCGAAAAACGCCCTTTTGCTAATGCGTTGCAGGTCGGCTCCGGCAAAGAGTTGGCTGGAACTGGAGGAAGAAGAAGAAGAAACTGAAGAGGAAGAAGAAGAAGCGGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATTGTCATCGGACATTGTAAGTGAAAAGAGCAGGGATTTGTTTTCAAGGAGTCGAAGTTGGAAAGTTTGATCCATCTCTGAATTCTCATCTTCTTCTAGACTTTTTTTTTTAAAAAAAAAAAAAAAACATGCTTGGAATATTTTTAAAATTCTGGAGTCTGAATCTCTGAGTATTATTGGGTTTGGTTGTATGTTGTACAGTTACTTTATAATTTCGTTGGGGGAGGATTTTAATGTCATGTATGAGACAGGCTTATAGTCAAGTTAAATATAATATTGGTTTGTTAATTATGAAATGTAATTATGTGAGGGGTTTCTTTTTTGTTAGTCCTTTATTTGTATTTTTCTGTTTCTCTTACAATAATT
mRNA sequence
AGAATAAATGCATCCAATTTCTCTCAAGAACAAGGCCAAGGCGTGACCGTGAGAGGCCGTTTTTCTGGCGTGGGCTCGCCAACTTGGAACCATCATTACACAATGAAACAAAGCTCGTGGCCCTTCCACACCTTTTACCATAAAATTTTATTTCCAAAGTCCTTTCTTTTTAGTATGCATTTGGACGAAAAGTAAAGGCCAAGAAACTCAGCCCCGCCCTGTTCTTCAATCTCCACCCTTATAATTTATTCCATTTCTAACCCAGGCAAAAAGGAGTGTTTCTGTTTTTTCAAAATTTCACAGATGATAAAGTAACACCCAACAAAAAGCAGTAGGGGGCCGGATCCTGACTCACTCATCAGTTCTTCTTTTTTCTTCTTCGATTCTCTGTAAGGATTTGTCTTTTAAAAACCCCACATCCCGAAACCCAGATGCCAGAAAACCACTTTTTTTCCACGCCGCTATGAAATTCGGTAGAGAGACTAAAGGGATCCCTTCCACGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTGATGCCCAACCCGCTTTGCAGTCCTGCGAGAGGATCTGATTCGAATAAGCTCCGCCGTAGCCGCCTCCACCACCGGCGGAGGAAATCGGCGGAGAGTCCGGTGGTATGGGCGAAGGCGAAGACGATGGGGTCCGAGATGTCGGAACCGTCGTCGCCGAAAGTGACTTGTGCAGGGCAGATTAAGATCAGGCCGAAGAGCAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCACAATCGGAGGAAATTACGGAGGAGGCGGTTCAATTGGGTCGAGTCGTTAGGGTTCAAGAAGGATCTAATGCAATTCTTGACTTGTTTGAGAAGCATACGGTTTGATTTTAGGTGTTTCAGAGCATTCCCAGAAGCGGATTTCACTACTGAAGAAGATGAAGAAGACGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAAAATCTCCGGGAAATCAAGTGGGGGTTGAGGGAAGTGAAAGCTCCAGCACTGCATTTTCCAAATGGTTTATGGTTTTACAGGAAAGTGGAAGAGAAAGAAGAAGTTTCTGTAGCGTTGAGGATGGTTCGATCGAGCCACCAATGGCGCCGCCGAAAAACGCCCTTTTGCTAATGCGTTGCAGGTCGGCTCCGGCAAAGAGTTGGCTGGAACTGGAGGAAGAAGAAGAAGAAACTGAAGAGGAAGAAGAAGAAGCGGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATTGTCATCGGACATTGTAAGTGAAAAGAGCAGGGATTTGTTTTCAAGGAGTCGAAGTTGGAAAGTTTGATCCATCTCTGAATTCTCATCTTCTTCTAGACTTTTTTTTTTAAAAAAAAAAAAAAAACATGCTTGGAATATTTTTAAAATTCTGGAGTCTGAATCTCTGAGTATTATTGGGTTTGGTTGTATGTTGTACAGTTACTTTATAATTTCGTTGGGGGAGGATTTTAATGTCATGTATGAGACAGGCTTATAGTCAAGTTAAATATAATATTGGTTTGTTAATTATGAAATGTAATTATGTGAGGGGTTTCTTTTTTGTTAGTCCTTTATTTGTATTTTTCTGTTTCTCTTACAATAATT
Coding sequence (CDS)
ATGAAATTCGGTAGAGAGACTAAAGGGATCCCTTCCACGGATTTGTTGGTTTGTTTTCCTTCTCGCTCGCATTTGGCTTTGATGCCCAACCCGCTTTGCAGTCCTGCGAGAGGATCTGATTCGAATAAGCTCCGCCGTAGCCGCCTCCACCACCGGCGGAGGAAATCGGCGGAGAGTCCGGTGGTATGGGCGAAGGCGAAGACGATGGGGTCCGAGATGTCGGAACCGTCGTCGCCGAAAGTGACTTGTGCAGGGCAGATTAAGATCAGGCCGAAGAGCAGCAAGAGCTGGCAATCGGTGATGGAGGAGATAGAGAGAATTCACAATCGGAGGAAATTACGGAGGAGGCGGTTCAATTGGGTCGAGTCGTTAGGGTTCAAGAAGGATCTAATGCAATTCTTGACTTGTTTGAGAAGCATACGGTTTGATTTTAGGTGTTTCAGAGCATTCCCAGAAGCGGATTTCACTACTGAAGAAGATGAAGAAGACGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAAAATCTCCGGGAAATCAAGTGGGGGTTGAGGGAAGTGAAAGCTCCAGCACTGCATTTTCCAAATGGTTTATGGTTTTACAGGAAAGTGGAAGAGAAAGAAGAAGTTTCTGTAGCGTTGAGGATGGTTCGATCGAGCCACCAATGGCGCCGCCGAAAAACGCCCTTTTGCTAATGCGTTGCAGGTCGGCTCCGGCAAAGAGTTGGCTGGAACTGGAGGAAGAAGAAGAAGAAACTGAAGAGGAAGAAGAAGAAGCGGAAAAGGAAGAAGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGATTGTCATCGGACATTGTAAGTGAAAAGAGCAGGGATTTGTTTTCAAGGAGTCGAAGTTGGAAAGTTTGA
Protein sequence
MKFGRETKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRSRLHHRRRKSAESPVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRFNWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGNQVGVEGSESSSTAFSKWFMVLQESGRERRSFCSVEDGSIEPPMAPPKNALLLMRCRSAPAKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERLSSDIVSEKSRDLFSRSRSWKV
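The protein sequence above should correspond to a translation of the coding sequence. A minimal Python sketch of that translation follows, assuming the standard nuclear genetic code applies to this gene; the function name `translate` and the table construction are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest). Assumed to apply here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGAAATTCGGTAGAGAGACTAAAGGG"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein is a quick consistency check between the two records.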
Homology
BLAST of Lcy12g014400 vs. ExPASy TrEMBL
Match:
A0A0A0L1Z4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1)
HSP 1 Score: 420.2 bits (1079), Expect = 7.1e-114
Identity = 235/316 (74.37%), Positives = 271/316 (85.76%), Query Frame = 0
Query: 1 MKFGRE-TKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRS-RLHHRRRKSAE 60
MK RE +KGIPS+DLLVCFPSRSHLALMPNPLCSPARGSDS+K R R +HRRRKSAE
Sbjct: 1 MKLNREKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAE 60
Query: 61 SPVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSE+SEPSSPKVTCAGQIKIRPK+SKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 61 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 120
Query: 121 NWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGN 180
NW+ES GFKKD+MQFLTCLR++RFDFRCFRAFPE DFTTEE+EE+EEEEEEEEE+ N
Sbjct: 121 NWIESFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEK----N 180
Query: 181 QVGVEGSESSSTAFSKWFMVLQESG-----RERRSFCSVEDGSIEPPMAPPKNALLLMRC 240
QVG+E +ESS TAFSKWFMVLQE+G R+ S C +D SIE MAPP+NALLLMRC
Sbjct: 181 QVGIEENESSRTAFSKWFMVLQENGSNELKRDSNSRCYEDDESIEATMAPPRNALLLMRC 240
Query: 241 RSAPAKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERLSSD-------IV 300
+SAPA+ W+E EE EEE +E+E+E EKE+VKVKKSLKWLMEEENRER+ + ++
Sbjct: 241 KSAPARRWME-EESEEEDDEKEKEKEKEKVKVKKSLKWLMEEENRERVVMEMGTDFCRMI 300
Query: 301 SEKSRDLFSRSRSWKV 303
S+ +++ F+RS+SWKV
Sbjct: 301 SDNAKE-FTRSQSWKV 310
BLAST of Lcy12g014400 vs. ExPASy TrEMBL
Match:
A0A6J1D3C2 (uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016595 PE=4 SV=1)
HSP 1 Score: 413.3 bits (1061), Expect = 8.7e-112
Identity = 246/327 (75.23%), Positives = 261/327 (79.82%), Query Frame = 0
Query: 1 MKFGRETKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRS-RLHHRRRK--SA 60
MK GR+ K I S DLLVCFPSRS+L LMP PLCSPARG DSNKLRRS R HHRRRK SA
Sbjct: 1 MKLGRDAKAIHSADLLVCFPSRSNLTLMPKPLCSPARGLDSNKLRRSHRHHHRRRKSTSA 60
Query: 61 ESPVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPK--SSKSWQSVMEEIERIHNRRKLRR 120
SP++WAK KTMGSE+SEPSSPKVTCAGQIKIRPK S KSWQSVMEEIERIHNRRKLRR
Sbjct: 61 ASPLIWAKPKTMGSEISEPSSPKVTCAGQIKIRPKTGSCKSWQSVMEEIERIHNRRKLRR 120
Query: 121 RRFNWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKS 180
RR NWVESLGFKKD+MQFLTCLR+IRFDFRCF+AFPEADFTT EE++EEEEEEEE KS
Sbjct: 121 RRSNWVESLGFKKDIMQFLTCLRNIRFDFRCFKAFPEADFTT---EEEDEEEEEEEEGKS 180
Query: 181 PGNQVGVEGSESSSTAFSKWFMVLQESGRERRSFCSVEDGSIEPPMAPPKNALLLMRCRS 240
NQVGVEG+ESS TAFSKWFMVLQESG C +G PP+APPKNALLLMRCRS
Sbjct: 181 QENQVGVEGNESSRTAFSKWFMVLQESGAS-NGICRESNG---PPLAPPKNALLLMRCRS 240
Query: 241 APAKSWLELEEEEEETEEEEEEAEKE--------EVKVKKSLKWLMEEENRERL------ 300
APAKSW E EEEEEE EEEEEE E+E EVKVKKSLKWLMEEENRERL
Sbjct: 241 APAKSWQEEEEEEEEEEEEEEEEEEEAAAEEDEKEVKVKKSLKWLMEEENRERLVMEMGP 300
Query: 301 -----SSDIVSEK--SRDLFSRSRSWK 302
SS+I E RDLFSRSRSWK
Sbjct: 301 DFCRMSSEIAKETWVGRDLFSRSRSWK 320
BLAST of Lcy12g014400 vs. ExPASy TrEMBL
Match:
A0A5D3D503 (Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G001490 PE=4 SV=1)
HSP 1 Score: 405.2 bits (1040), Expect = 2.4e-109
Identity = 230/316 (72.78%), Positives = 259/316 (81.96%), Query Frame = 0
Query: 1 MKFGRE-TKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRS-RLHHRRRKSAE 60
MK RE +KGIPS DLLVCFPSRSHLALMPNPLCSPARGSDS+K R R HRRRKSAE
Sbjct: 8 MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67
Query: 61 SPVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSE+SEPSSPKVTCAGQIKIRPK+SKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127
Query: 121 NWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGN 180
WVES GFKKD+MQFLTCLR+IRFDFRCFRAFPE DFTTEE+EE+EEEEE+E+ N
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------N 187
Query: 181 QVGVEGSESSSTAFSKWFMVLQESG-----RERRSFCSVEDGSIEPPMAPPKNALLLMRC 240
QVG+E +ESS TAFSKWFMVLQE+G R+ +S C+ +D SIE MAPP NALLLMRC
Sbjct: 188 QVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESIEAIMAPPINALLLMRC 247
Query: 241 RSAPAKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERLSSD-------IV 300
RSAPA+ W+ E E EE + EKE+VKVKKSLKWLMEEENRERL + +
Sbjct: 248 RSAPARRWM-------EEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMT 307
Query: 301 SEKSRDLFSRSRSWKV 303
S+ +++ F+RS+SWKV
Sbjct: 308 SDNAKE-FTRSQSWKV 309
BLAST of Lcy12g014400 vs. ExPASy TrEMBL
Match:
A0A1S3B949 (uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=4 SV=1)
HSP 1 Score: 405.2 bits (1040), Expect = 2.4e-109
Identity = 230/316 (72.78%), Positives = 259/316 (81.96%), Query Frame = 0
Query: 1 MKFGRE-TKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRS-RLHHRRRKSAE 60
MK RE +KGIPS DLLVCFPSRSHLALMPNPLCSPARGSDS+K R R HRRRKSAE
Sbjct: 8 MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67
Query: 61 SPVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSE+SEPSSPKVTCAGQIKIRPK+SKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127
Query: 121 NWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGN 180
WVES GFKKD+MQFLTCLR+IRFDFRCFRAFPE DFTTEE+EE+EEEEE+E+ N
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------N 187
Query: 181 QVGVEGSESSSTAFSKWFMVLQESG-----RERRSFCSVEDGSIEPPMAPPKNALLLMRC 240
QVG+E +ESS TAFSKWFMVLQE+G R+ +S C+ +D SIE MAPP NALLLMRC
Sbjct: 188 QVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESIEAIMAPPINALLLMRC 247
Query: 241 RSAPAKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERLSSD-------IV 300
RSAPA+ W+ E E EE + EKE+VKVKKSLKWLMEEENRERL + +
Sbjct: 248 RSAPARRWM-------EEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMT 307
Query: 301 SEKSRDLFSRSRSWKV 303
S+ +++ F+RS+SWKV
Sbjct: 308 SDNAKE-FTRSQSWKV 309
BLAST of Lcy12g014400 vs. ExPASy TrEMBL
Match:
A0A6J1FBW7 (uncharacterized protein LOC111442647 OS=Cucurbita moschata OX=3662 GN=LOC111442647 PE=4 SV=1)
HSP 1 Score: 387.5 bits (994), Expect = 5.1e-104
Identity = 219/305 (71.80%), Positives = 241/305 (79.02%), Query Frame = 0
Query: 1 MKFGRETKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRSRLHHRRRKSAESP 60
MK R+TK PS DLLVCFPSRSH ALMPNPLCSPAR SDSNKLRR +HRRRKSAESP
Sbjct: 1 MKSIRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRR---YHRRRKSAESP 60
Query: 61 VVWAKAKTM-GSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRFN 120
VVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPKS KSW+SVMEEIERIHNRR+LRRRRFN
Sbjct: 61 VVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFN 120
Query: 121 WVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGNQ 180
WVESLGFKKD+MQFLTCLRS+RFDF CF AFPEA+FT+E++EE+E
Sbjct: 121 WVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE--------------- 180
Query: 181 VGVEGSESSSTAFSKWFMVLQESG--RERRSFCSVEDGSIEPPMAPPKNALLLMRCRSAP 240
VGVEGS+ S TAFSKWFMVLQ SG R+ C+V+D SI PPMAPP+NALLLMRCRSAP
Sbjct: 181 VGVEGSDGSRTAFSKWFMVLQGSGVRRDGNGLCTVDDASIGPPMAPPRNALLLMRCRSAP 240
Query: 241 AKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERLSSDIVSEKSRDLFSRS 300
AKSW+E E EE E+ EVKVKKSLKWLMEEENRE SRDL +RS
Sbjct: 241 AKSWVE--------EGCSEEGEETEVKVKKSLKWLMEEENRE----------SRDLVTRS 269
Query: 301 RSWKV 303
+SWKV
Sbjct: 301 QSWKV 269
BLAST of Lcy12g014400 vs. NCBI nr
Match:
XP_038898663.1 (transcription initiation factor IIE subunit alpha [Benincasa hispida])
HSP 1 Score: 446.8 bits (1148), Expect = 1.5e-121
Identity = 254/325 (78.15%), Positives = 273/325 (84.00%), Query Frame = 0
Query: 1 MKFGRETKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRSRLH-HRRRKSAES 60
MK GRE KGIPS DLLVCFPSRSHLALMPNPLCSPARGSDS+K R S H HRRRKSAES
Sbjct: 1 MKLGRENKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLSHRHYHRRRKSAES 60
Query: 61 PVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRFN 120
PVVWAKAKTMGSE+SEPSSPKVTCAGQIKIRPK+SKSWQSVMEEIERIHNRRKLRRRRF+
Sbjct: 61 PVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFH 120
Query: 121 WVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGNQ 180
WVESLGFKKD+MQFLTCLR+IRFDFRCFRAFP DFTTEE EEEEEEEEEEKS GNQ
Sbjct: 121 WVESLGFKKDIMQFLTCLRNIRFDFRCFRAFPATDFTTEE----EEEEEEEEEEKSQGNQ 180
Query: 181 VGVEGSESSSTAFSKWFMVLQESG-----RERRSFCSVEDGSIEPPMAPPKNALLLMRCR 240
VGV+ +ESS TAFSKWFMVLQE+G RE + CS +D SIE MAPPKNALLLMRCR
Sbjct: 181 VGVDENESSRTAFSKWFMVLQENGSNELKRESKILCSDDDVSIEAAMAPPKNALLLMRCR 240
Query: 241 SAPAKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERL-----------SS 300
SAPAK WLE EE EEE ++++++ EKEEVKVKKSLKWLMEEENRERL +S
Sbjct: 241 SAPAKRWLE-EESEEEEDDDDDDDEKEEVKVKKSLKWLMEEENRERLVMETGTDFCRMTS 300
Query: 301 DI------VSEKSRDLFSRSRSWKV 303
DI VSEKSRDLF+RS SWKV
Sbjct: 301 DIAKETWVVSEKSRDLFTRSHSWKV 320
BLAST of Lcy12g014400 vs. NCBI nr
Match:
XP_004142611.2 (transcription initiation factor IIE subunit alpha [Cucumis sativus] >KAE8649637.1 hypothetical protein Csa_012410 [Cucumis sativus])
HSP 1 Score: 420.2 bits (1079), Expect = 1.5e-113
Identity = 235/316 (74.37%), Positives = 271/316 (85.76%), Query Frame = 0
Query: 1 MKFGRE-TKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRS-RLHHRRRKSAE 60
MK RE +KGIPS+DLLVCFPSRSHLALMPNPLCSPARGSDS+K R R +HRRRKSAE
Sbjct: 11 MKLNREKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAE 70
Query: 61 SPVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSE+SEPSSPKVTCAGQIKIRPK+SKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 71 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 130
Query: 121 NWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGN 180
NW+ES GFKKD+MQFLTCLR++RFDFRCFRAFPE DFTTEE+EE+EEEEEEEEE+ N
Sbjct: 131 NWIESFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEEEEEEK----N 190
Query: 181 QVGVEGSESSSTAFSKWFMVLQESG-----RERRSFCSVEDGSIEPPMAPPKNALLLMRC 240
QVG+E +ESS TAFSKWFMVLQE+G R+ S C +D SIE MAPP+NALLLMRC
Sbjct: 191 QVGIEENESSRTAFSKWFMVLQENGSNELKRDSNSRCYEDDESIEATMAPPRNALLLMRC 250
Query: 241 RSAPAKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERLSSD-------IV 300
+SAPA+ W+E EE EEE +E+E+E EKE+VKVKKSLKWLMEEENRER+ + ++
Sbjct: 251 KSAPARRWME-EESEEEDDEKEKEKEKEKVKVKKSLKWLMEEENRERVVMEMGTDFCRMI 310
Query: 301 SEKSRDLFSRSRSWKV 303
S+ +++ F+RS+SWKV
Sbjct: 311 SDNAKE-FTRSQSWKV 320
BLAST of Lcy12g014400 vs. NCBI nr
Match:
XP_022147766.1 (uncharacterized protein LOC111016595 [Momordica charantia])
HSP 1 Score: 413.3 bits (1061), Expect = 1.8e-111
Identity = 246/327 (75.23%), Positives = 261/327 (79.82%), Query Frame = 0
Query: 1 MKFGRETKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRS-RLHHRRRK--SA 60
MK GR+ K I S DLLVCFPSRS+L LMP PLCSPARG DSNKLRRS R HHRRRK SA
Sbjct: 1 MKLGRDAKAIHSADLLVCFPSRSNLTLMPKPLCSPARGLDSNKLRRSHRHHHRRRKSTSA 60
Query: 61 ESPVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPK--SSKSWQSVMEEIERIHNRRKLRR 120
SP++WAK KTMGSE+SEPSSPKVTCAGQIKIRPK S KSWQSVMEEIERIHNRRKLRR
Sbjct: 61 ASPLIWAKPKTMGSEISEPSSPKVTCAGQIKIRPKTGSCKSWQSVMEEIERIHNRRKLRR 120
Query: 121 RRFNWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKS 180
RR NWVESLGFKKD+MQFLTCLR+IRFDFRCF+AFPEADFTT EE++EEEEEEEE KS
Sbjct: 121 RRSNWVESLGFKKDIMQFLTCLRNIRFDFRCFKAFPEADFTT---EEEDEEEEEEEEGKS 180
Query: 181 PGNQVGVEGSESSSTAFSKWFMVLQESGRERRSFCSVEDGSIEPPMAPPKNALLLMRCRS 240
NQVGVEG+ESS TAFSKWFMVLQESG C +G PP+APPKNALLLMRCRS
Sbjct: 181 QENQVGVEGNESSRTAFSKWFMVLQESGAS-NGICRESNG---PPLAPPKNALLLMRCRS 240
Query: 241 APAKSWLELEEEEEETEEEEEEAEKE--------EVKVKKSLKWLMEEENRERL------ 300
APAKSW E EEEEEE EEEEEE E+E EVKVKKSLKWLMEEENRERL
Sbjct: 241 APAKSWQEEEEEEEEEEEEEEEEEEEAAAEEDEKEVKVKKSLKWLMEEENRERLVMEMGP 300
Query: 301 -----SSDIVSEK--SRDLFSRSRSWK 302
SS+I E RDLFSRSRSWK
Sbjct: 301 DFCRMSSEIAKETWVGRDLFSRSRSWK 320
BLAST of Lcy12g014400 vs. NCBI nr
Match:
XP_008444111.1 (PREDICTED: uncharacterized protein LOC103487551 [Cucumis melo] >KAA0064246.1 transcription initiation factor IIE subunit alpha-like [Cucumis melo var. makuwa] >TYK18616.1 transcription initiation factor IIE subunit alpha-like [Cucumis melo var. makuwa])
HSP 1 Score: 405.2 bits (1040), Expect = 4.9e-109
Identity = 230/316 (72.78%), Positives = 259/316 (81.96%), Query Frame = 0
Query: 1 MKFGRE-TKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRS-RLHHRRRKSAE 60
MK RE +KGIPS DLLVCFPSRSHLALMPNPLCSPARGSDS+K R R HRRRKSAE
Sbjct: 8 MKLNREKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAE 67
Query: 61 SPVVWAKAKTMGSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRF 120
SPVVWAKAKTMGSE+SEPSSPKVTCAGQIKIRPK+SKSWQSVMEEIERIHNRRKLRRRRF
Sbjct: 68 SPVVWAKAKTMGSEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRF 127
Query: 121 NWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGN 180
WVES GFKKD+MQFLTCLR+IRFDFRCFRAFPE DFTTEE+EE+EEEEE+E+ N
Sbjct: 128 RWVESFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEEEEEDEK------N 187
Query: 181 QVGVEGSESSSTAFSKWFMVLQESG-----RERRSFCSVEDGSIEPPMAPPKNALLLMRC 240
QVG+E +ESS TAFSKWFMVLQE+G R+ +S C+ +D SIE MAPP NALLLMRC
Sbjct: 188 QVGIEENESSRTAFSKWFMVLQENGSNELKRDSKSLCNEDDESIEAIMAPPINALLLMRC 247
Query: 241 RSAPAKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERLSSD-------IV 300
RSAPA+ W+ E E EE + EKE+VKVKKSLKWLMEEENRERL + +
Sbjct: 248 RSAPARRWM-------EEESEEGDDEKEKVKVKKSLKWLMEEENRERLVVEMGTDFCRMT 307
Query: 301 SEKSRDLFSRSRSWKV 303
S+ +++ F+RS+SWKV
Sbjct: 308 SDNAKE-FTRSQSWKV 309
BLAST of Lcy12g014400 vs. NCBI nr
Match:
XP_022935869.1 (uncharacterized protein LOC111442647 [Cucurbita moschata] >XP_022935870.1 uncharacterized protein LOC111442647 [Cucurbita moschata])
HSP 1 Score: 387.5 bits (994), Expect = 1.1e-103
Identity = 219/305 (71.80%), Positives = 241/305 (79.02%), Query Frame = 0
Query: 1 MKFGRETKGIPSTDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRSRLHHRRRKSAESP 60
MK R+TK PS DLLVCFPSRSH ALMPNPLCSPAR SDSNKLRR +HRRRKSAESP
Sbjct: 1 MKSIRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRR---YHRRRKSAESP 60
Query: 61 VVWAKAKTM-GSEMSEPSSPKVTCAGQIKIRPKSSKSWQSVMEEIERIHNRRKLRRRRFN 120
VVWAKAKTM GSE+SEPSSPKVTCAGQIK+RPKS KSW+SVMEEIERIHNRR+LRRRRFN
Sbjct: 61 VVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFN 120
Query: 121 WVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGNQ 180
WVESLGFKKD+MQFLTCLRS+RFDF CF AFPEA+FT+E++EE+E
Sbjct: 121 WVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEE--------------- 180
Query: 181 VGVEGSESSSTAFSKWFMVLQESG--RERRSFCSVEDGSIEPPMAPPKNALLLMRCRSAP 240
VGVEGS+ S TAFSKWFMVLQ SG R+ C+V+D SI PPMAPP+NALLLMRCRSAP
Sbjct: 181 VGVEGSDGSRTAFSKWFMVLQGSGVRRDGNGLCTVDDASIGPPMAPPRNALLLMRCRSAP 240
Query: 241 AKSWLELEEEEEETEEEEEEAEKEEVKVKKSLKWLMEEENRERLSSDIVSEKSRDLFSRS 300
AKSW+E E EE E+ EVKVKKSLKWLMEEENRE SRDL +RS
Sbjct: 241 AKSWVE--------EGCSEEGEETEVKVKKSLKWLMEEENRE----------SRDLVTRS 269
Query: 301 RSWKV 303
+SWKV
Sbjct: 301 QSWKV 269
BLAST of Lcy12g014400 vs. TAIR 10
Match:
AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )
HSP 1 Score: 233.8 bits (595), Expect = 1.8e-61
Identity = 170/341 (49.85%), Positives = 210/341 (61.58%), Query Frame = 0
Query: 12 STDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRSRLHHRRRKSA---------ESPVV 71
S DLLVCFPSR+HLAL P P+CSP+R SDS+ RR HHRR+ S SPV+
Sbjct: 17 SADLLVCFPSRTHLALTPKPICSPSRPSDSSTNRRP--HHRRQLSKLSGGGGGGHGSPVL 76
Query: 72 WAK---AKTM-GSEMSEPSSPKVTCAGQIKIRPKS----SKSWQSVMEEIERIHNRRKLR 131
WAK +K M G E++EP+SPKVTCAGQIK+RP K+WQSVMEEIERIH+ R
Sbjct: 77 WAKQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIHDNRSQS 136
Query: 132 RRRFNWVESLGFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEK 191
+ G KKD+M FLTCLR+I+FDFRCF F AD T+++DEE++++++EEEE
Sbjct: 137 K-------FFGLKKDVMGFLTCLRNIKFDFRCFGDFRHADVTSDDDEEEDDDDDEEEE-- 196
Query: 192 SPGNQVGVEGSESSSTAFSKWFMVLQESGRER-----RSFC----SVEDGSIEPPMAPPK 251
V E E+S T FSKWFMVLQE + + C +ED EP + PP
Sbjct: 197 ----VVEGEEEENSKTVFSKWFMVLQEEQNNKDDDKNNNKCDEKRDLEDTETEPAV-PPP 256
Query: 252 NALLLMRCRSAPAKSWLE------LEEEEEETEEEEEEAEKEEVKV---KKSLKWLMEEE 302
NALLLMRCRSAPAKSWLE E+E+ E ++EE+E E +E + KK L+ LMEEE
Sbjct: 257 NALLLMRCRSAPAKSWLEERMKVKTEQEKREEQKEEKETEDQETSMKTKKKDLRSLMEEE 316
BLAST of Lcy12g014400 vs. TAIR 10
Match:
AT1G22230.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78110.1); Has 2358 Blast hits to 1759 proteins in 159 species: Archae - 2; Bacteria - 36; Metazoa - 1046; Fungi - 203; Plants - 157; Viruses - 72; Other Eukaryotes - 842 (source: NCBI BLink). )
HSP 1 Score: 178.7 bits (452), Expect = 6.9e-45
Identity = 148/324 (45.68%), Positives = 180/324 (55.56%), Query Frame = 0
Query: 12 STDLLVCFPSRSHLALMPNPLCSPARGSDSNKLRRSRLHHRRRKSAESPVVWAKAKTMG- 71
S DL+VCFPSR+HL+L + SP S S R++ HHRR S S + G
Sbjct: 13 SADLMVCFPSRAHLSLPSKSISSP---SSSFNRRQNAPHHRRSISKLSSSGGGVRQNRGG 72
Query: 72 --SEMSEPSSPKVTCAGQIKIRPK----SSKSWQSVMEEIERIHNRRKLRRRRFNWVESL 131
+ EP+SPKVTCAGQIK+R K+WQS+M EIE+IH R K + F
Sbjct: 73 GREVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIH-RSKSESKFF------ 132
Query: 132 GFKKDLMQFLTCLRSIRFDFRCFRAFPEADFTTEEDEEDEEEEEEEEEEKSPGNQVGVEG 191
G K+D+M FLTCLR FDFRCF AFP D ++++EEDEEEEEE+EEE +
Sbjct: 133 GIKRDVMGFLTCLRD--FDFRCFGAFPPVDIISDDEEEDEEEEEEDEEE---------DE 192
Query: 192 SESSSTAFSKWFMVLQESGRERRSFCSVEDGSIEPPMA-PPKNALLLMRCRSAPAKSWLE 251
ESS T FSKW MVL E E+ + A PP NALLLMRCRSAP K+W
Sbjct: 193 DESSGTVFSKWLMVLHEKQNNEECVDGKENVFSDVETAVPPPNALLLMRCRSAPVKNW-- 252
Query: 252 LEEEEEETEE---------EEEEAEKEEVKVKKSLKWLMEEE------------NRERLS 302
EE++EETEE EEEE EK+ V KK L+ LMEEE N +LS
Sbjct: 253 SEEKKEETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEEKKMNLVVMNYDTNYYKLS 312
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A0A0L1Z4 | 7.1e-114 | 74.37 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1
A0A6J1D3C2 | 8.7e-112 | 75.23 | uncharacterized protein LOC111016595 OS=Momordica charantia OX=3673 GN=LOC111016...
A0A5D3D503 | 2.4e-109 | 72.78 | Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. maku...
A0A1S3B949 | 2.4e-109 | 72.78 | uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=...
A0A6J1FBW7 | 5.1e-104 | 71.80 | uncharacterized protein LOC111442647 OS=Cucurbita moschata OX=3662 GN=LOC1114426...
Match Name | E-value | Identity (%) | Description
XP_038898663.1 | 1.5e-121 | 78.15 | transcription initiation factor IIE subunit alpha [Benincasa hispida]
XP_004142611.2 | 1.5e-113 | 74.37 | transcription initiation factor IIE subunit alpha [Cucumis sativus] >KAE8649637....
XP_022147766.1 | 1.8e-111 | 75.23 | uncharacterized protein LOC111016595 [Momordica charantia]
XP_008444111.1 | 4.9e-109 | 72.78 | PREDICTED: uncharacterized protein LOC103487551 [Cucumis melo] >KAA0064246.1 tra...
XP_022935869.1 | 1.1e-103 | 71.80 | uncharacterized protein LOC111442647 [Cucurbita moschata] >XP_022935870.1 unchar...
Match Name | E-value | Identity (%) | Description
AT1G78110.1 | 1.8e-61 | 49.85 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G22230.1 | 6.9e-45 | 45.68 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
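The Identity values in these tables match the percentages printed in the HSP statistics lines of the alignments, which are the matched-position counts divided by the alignment length. The Python sketch below recomputes them from such a line; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches "Identity = 235/316" and "Positives = 271/316".
# The optional "i" also tolerates the "Postives" spelling seen in some report pages.
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(stat_line):
    """Return recomputed percentages (two decimals, as strings) for one HSP line."""
    result = {}
    for name, matched, length in STAT_RE.findall(stat_line):
        key = "Identity" if name == "Identity" else "Positives"
        result[key] = f"{100 * int(matched) / int(length):.2f}"
    return result


line = "Identity = 235/316 (74.37%), Positives = 271/316 (85.76%), Query Frame = 0"
print(hsp_percentages(line))
```

The percentages are returned as formatted strings so that they can be compared directly with the two-decimal values printed in the report.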