Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS, three_prime_UTR
ATGGCGGCTGTTAATTCCGTGTCATTCTCAACGGTAAGTCAGTTTTCTGATCGAAGGTCGCCGGTTCCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTTCGTTTCCGTTCTAGTGTTTTCTATCATCATTCGGGAGTTCGAACCTCGAGTTTCAGTTCTCGCTTGGTCATTCATTGCATGTCCGCCGGAACAGGTAGCTTCTGGTGATTTGTTTTCCTTACTCGCTTACTAGACTGTTCCTTTGCTAGATTATGGTAGCAGAAACGAATTGGCTTGTTTCGATGATTTCGATGTGAACTTGTGTTTTGTTTGTTCGTTAATTGTTGAGATGGATCGTTATGGACGGAAATTTAGGTTTTAAGTAACAATTAGGCTTTCAATAGTTCGGTTTGATCAATTCTAAACTTGTTCAGTATGTTGAGATTTCTCTAGTTCGATTTCTTAGTATTTGAGTGTAGAAAATGATGTGCTATTTTCGGATTTTAGATGTGACGACTGTAGCTGAGACTAAATTGAACTTTCTAAAGGCGTATAAACGACCTATCCCCAGCATATACAACTCTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACGTACCGTTATGATCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAGCTTATGGATGGGTACCCTAGCGATGAGGATCGGGAGGCCATTTTCCAAGCCTACATTAAGGCGTTGAATGAGGATCCAGAGCAATATAGGTTTGGCCATGTATTCCCCTTTGCTTTTACCATCCCTGATCTCTATCTCTTCTTGCCATCTAATAAATTCACAAGCCTTGTTCATAGATAGTAAACACTGTTAAGTTGAAAAAAACATGACATGATTCCACTTACAGCAAAAATACGACGATTCTTATTTCAACTTTAATCTTCTTTCCTAGTATTTCACATCTTAATTTGGACTTAGTTGTCTAAATGACTAGATTATTTGGTCAAATCACATGGTAATCAAATTAACGGATGTTTACTCTTTGTTCAATATCTCCAAGAGTGAAATTTTCTTCACTGGAAATTATAATCCATCCTTCGACCATTATTCCAATAATTTCAAGAAGGTTTTTCTTCATTTTTAGCTCTACTTTCTGTTATACCGTCCAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGTCTGCAGCTTCATTGGTTGAATTTGCATCAAAAGAAGGAGAAGTTGAGAGTGTTTTGAAAGACATTGCAGAACGAGCTGCGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTCGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGGTTTATTTTATCTTCCTTTTTGTGATATAGCTTTTAAGTAGATGACGATTTTGAGGTGATAGTATATCAAGCCTATTATTGCTAAATTAAACATTTACATGATAGCCAGCCCCTCCTCTCATAGTACTTTTACTTATAGATTATATTCTTTCTCTGAGTGCTGCCATTCGTAAAGTGTAATTGTGTTTCTTGCTCATTATTATCTTTTTACTCATTAGAGAAATTGGATCCTGATGGAAGAGTGCTTAACCTTATTCAGTTCGATAGAATTAGGTTCAAGGTACTCTTTCCTGCATAAGTCTCCATTTTACTTTTCTAGGTAACACCAATCTTTTTGTAGTATTTGGTTGAAGAGAAATATGAGCTAACCATTTTTTGGAACATTCCCCTTTGCTCGGGACAATCCCACATACACAATTACAGTGTTTTTGTGGGATTTCCTTCTCCTTTGTGGAGATTGTACTGATCTTGTGGATGTAGTTTATGCTCTTTAGTAGTCTCAGCTGGTCCTTTTCATTCAGTTTATTAGCTCTTGTGAAGGGAAACATACATATGTTCCACTGCCCCCATCAACAACAGTTATCATGAACTATTGCTTTAATTGGCCATCTCTATTTAGCGTGT
CTATCTCCAAACATACTTTTATTTTTCTAAAGCCATGTAATGGAGATTTTACCAAGTGTTGACATCTAACACTTTGCTGCTTCTGAACAGCTCTGTGCCGCTTTAAATGTTGACAAAAAAAGTGTGGACCGAGACCTTGATGTATACCGCAACCTGCTTTCGAAGTTGGTTCAGGCGAAGGAGCTCCTAAAGGAATACGTTGATAGGTAAGAGATTTTTAAGTTCATGGTTCCTGGCCTGGTAGATTAACATACCCACAATCTGCTCTTTCACACCACCATATTTTCTTGGATTTTATGAACCCATAATCGTTTTGAAAACAATCCAGTATTTGTCAAATCTTCTGCAACTCACCGTCATATCATTTATATTCACTAGATATCCACCGGTTGGCTGTGTTTTAAGATGAAAGTGATGTTTTATGAAAATTTAAATGAAGTATTTCTGCATTCAATGTTCATGATATGTATACAGTGGTGACCTTAGATTAAGTGGAAAATTTCGGGTCGTTACATGGATCTTGAGTGAGGGTGGCTGTGTTTTAAGATGAAGGTGATGTTTTATGAAAGTTTAAATGAAGTATTTCTGCATTCAATGTTCATGATATGTATACAGTAGCGGCCTTAAATTAAGTAGAAAATTTCGGATCGTTACATGGATCTTGAGTGAGGCTGATTGTGTTTTAAAATGAAGGTGATGTTTTATGAAAGTTTAAATGAAGTATTTTTGCATTCAATGTTCATAACATGTATACGGTGACGACCTTAAATTAAGTAAAAAATTTCGAGTTGTTACATGGATCTTGAGTGAGGTTGGCTGTGTTTTAAGATGAAGGTGATGTTATGAAAGTTTAAATGAAGTATTTCTGCATTCAATGGTTATGATATGTATACAATGGCGACTTTAAATGAAGTAGAAAATTTCGGATTATTACATGGGTCTTGGGTCTTGAGTCTTGTGGAGGCCTAAAAATGAGGCAGAATTGGGTGGATCATGGTATAATTAACCAGTGCTCTTCCATTCTGTTTTTGCCTATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAACGAGGCCGTAACAAAATGAATGATTGGGTGAATACAGCATAGAGAGTGGTTTTTGAGAGCTGATGACATCAATTGGAGCACTCACCGCCTAATTTGAGATAAGACTTTAAAGAGTTATAGCAATATTCTAAATACCTGATAGATATGCATTTGTACTGTATTGTTGGGTCTTCTGCATTTGGTAAATTTTGTATTCTGCCAGCTTCTACTTTTATTCTCATTTGTATTACATATTTTGAGTGCAATTGTCTATACAAATGCTTTCGCACTCTGTGCAGTAATCATATTTGTCTCTCTCTTGACCACCA
mRNA sequence
ATGGCGGCTGTTAATTCCGTGTCATTCTCAACGGTAAGTCAGTTTTCTGATCGAAGGTCGCCGGTTCCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTTCGTTTCCGTTCTAGTGTTTTCTATCATCATTCGGGAGTTCGAACCTCGAGTTTCAGTTCTCGCTTGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACGACTGTAGCTGAGACTAAATTGAACTTTCTAAAGGCGTATAAACGACCTATCCCCAGCATATACAACTCTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACGTACCGTTATGATCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAGCTTATGGATGGGTACCCTAGCGATGAGGATCGGGAGGCCATTTTCCAAGCCTACATTAAGGCGTTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGTCTGCAGCTTCATTGGTTGAATTTGCATCAAAAGAAGGAGAAGTTGAGAGTGTTTTGAAAGACATTGCAGAACGAGCTGCGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTCGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATGTTGACAAAAAAAGTGTGGACCGAGACCTTGATGTATACCGCAACCTGCTTTCGAAGTTGGTTCAGGCGAAGGAGCTCCTAAAGGAATACGTTGATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAACGAGGCCGTAACAAAATGAATGATTGGGTGAATACAGCATAGAGAGTGGTTTTTGAGAGCTGATGACATCAATTGGAGCACTCACCGCCTAATTTGAGATAAGACTTTAAAGAGTTATAGCAATATTCTAAATACCTGATAGATATGCATTTGTACTGTATTGTTGGGTCTTCTGCATTTGGTAAATTTTGTATTCTGCCAGCTTCTACTTTTATTCTCATTTGTATTACATATTTTGAGTGCAATTGTCTATACAAATGCTTTCGCACTCTGTGCAGTAATCATATTTGTCTCTCTCTTGACCACCA
Coding sequence (CDS)
ATGGCGGCTGTTAATTCCGTGTCATTCTCAACGGTAAGTCAGTTTTCTGATCGAAGGTCGCCGGTTCCGTCGGCTCGTTCACTCGCCTCGAATTTCGACGGGTTTCGTTTCCGTTCTAGTGTTTTCTATCATCATTCGGGAGTTCGAACCTCGAGTTTCAGTTCTCGCTTGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACGACTGTAGCTGAGACTAAATTGAACTTTCTAAAGGCGTATAAACGACCTATCCCCAGCATATACAACTCTGTTCTGCAAGAGTTGATTGTGCAGCAGCATTTGATGAGGTATAAGAGGACGTACCGTTATGATCCTGTTTTCGCCCTCGGTTTTGTTACTGTATATGATCAGCTTATGGATGGGTACCCTAGCGATGAGGATCGGGAGGCCATTTTCCAAGCCTACATTAAGGCGTTGAATGAGGATCCAGAGCAATATAGAATTGATGCTCAAAAATTGGAAGAGTGGGCTCGGTCTCAGTCTGCAGCTTCATTGGTTGAATTTGCATCAAAAGAAGGAGAAGTTGAGAGTGTTTTGAAAGACATTGCAGAACGAGCTGCGAGTAAGGGGAGTTTCAGTTACAGCCGATTTTTTGCTATTGGGCTATTTCGACTCCTCGAATTGGCAAATGCTACTGAACCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATGTTGACAAAAAAAGTGTGGACCGAGACCTTGATGTATACCGCAACCTGCTTTCGAAGTTGGTTCAGGCGAAGGAGCTCCTAAAGGAATACGTTGATAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAACGAGGCCGTAACAAAATGA
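Comparing the two sequences above, the CDS appears to be a prefix of the mRNA, so the bases after the stop codon would correspond to the three_prime_UTR named in the legend. The following is a minimal Python sketch of that check. The function name `three_prime_utr` is hypothetical, and the short strings stand in for the full sequences.

```python
def three_prime_utr(mrna: str, cds: str) -> str:
    """Return the part of the mRNA that follows the CDS.

    Assumption: the transcript has no 5' UTR, so the CDS starts at
    position 0 of the mRNA. A ValueError is raised otherwise.
    """
    if not mrna.startswith(cds):
        raise ValueError("CDS is not a prefix of the mRNA")
    return mrna[len(cds):]


# Toy input: the last CDS codons, then the first bases after the stop codon.
cds_tail = "GCCGTAACAAAATGA"
mrna_tail = cds_tail + "ATGATTGGG"
print(three_prime_utr(mrna_tail, cds_tail))
```

With the full mRNA and CDS strings passed in, the same call would return the complete 3' UTR.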
Protein sequence
MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKEGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
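The protein sequence above can be reproduced from the CDS by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGCGGCTGTTAATTCCGTG"))  # first seven codons of the CDS
print(translate("GCCGTAACAAAATGA"))        # last codons, including the stop
```

The two calls print `MAAVNSV` and `AVTK`, which match the start and end of the protein sequence listed above.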
Homology
BLAST of CmoCh11G019300 vs. ExPASy Swiss-Prot
Match:
Q7XAB8 (Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1 PE=2 SV=1)
HSP 1 Score: 386.3 bits (991), Expect = 2.9e-106
Identity = 204/288 (70.83%), Positives = 246/288 (85.42%), Query Frame = 0
Query: 1 MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
MAAV SVSFS ++Q ++R+S V S+RS+ D FRFRS+ + VR+S+ +SR V+H
Sbjct: 1 MAAVTSVSFSAITQSAERKSSVSSSRSI----DTFRFRSNFSFDSVNVRSSNSTSRFVVH 60
Query: 61 C-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALG 120
C S+ D+ TVA+TKL FL AYKRPIP++YN+VLQELIVQQHL RYK++Y+YDPVFALG
Sbjct: 61 CTSSSAADLPTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFALG 120
Query: 121 FVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFAS 180
FVTVYDQLM+GYPS+EDR AIF+AYI+AL EDPEQYR DAQKLEEWAR+Q+A +LV+F+S
Sbjct: 121 FVTVYDQLMEGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFSS 180
Query: 181 KEGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKS 240
KEGE+E++ KDIA+RA +K F YSR FA+GLFRLLELAN T+P+ILEKLCAALNV+KKS
Sbjct: 181 KEGEIENIFKDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKKS 240
Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER +Q ANE VTK
Sbjct: 241 VDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRGERE-TQKANETVTK 283
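The percentages in each HSP header are the count of matching columns divided by the alignment length (here 288 columns, one more than the 287-residue query because of a gap). A small Python sketch of that arithmetic follows; the function name `hsp_percentages` is hypothetical, and it reads only the `count/length` pairs from the header line.

```python
import re


def hsp_percentages(header: str) -> list[float]:
    """Recompute each 'count/length' pair in an HSP header as a percentage."""
    pairs = re.findall(r"= (\d+)/(\d+)", header)
    return [round(100 * int(count) / int(length), 2) for count, length in pairs]


header = "Identity = 204/288 (70.83%), Positives = 246/288 (85.42%), Query Frame = 0"
print(hsp_percentages(header))
```

For the Solanum tuberosum hit this gives 70.83 and 85.42, in agreement with the reported values.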
BLAST of CmoCh11G019300 vs. ExPASy Swiss-Prot
Match:
Q9SKT0 (Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THF1 PE=1 SV=1)
HSP 1 Score: 370.9 bits (951), Expect = 1.3e-101
Identity = 201/286 (70.28%), Positives = 243/286 (84.97%), Query Frame = 0
Query: 3 AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCM 62
A++S+SF + Q SD+ S S+R LAS R + + + S +S+ +IHCM
Sbjct: 5 AISSLSFPALGQ-SDKISNFASSRPLAS-----AIRICTKFSRLSLNSRS-TSKSLIHCM 64
Query: 63 SAGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFV 122
S T DV V+ETK FLKAYKRPIPSIYN+VLQELIVQQHLMRYK+TYRYDPVFALGFV
Sbjct: 65 SNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFV 124
Query: 123 TVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKE 182
TVYDQLM+GYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQ++ASLV+F+SKE
Sbjct: 125 TVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKE 184
Query: 183 GEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVD 242
G++E+VLKDIA RA SK FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN++KKSVD
Sbjct: 185 GDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVD 244
Query: 243 RDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
RDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE ++K
Sbjct: 245 RDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISK 283
BLAST of CmoCh11G019300 vs. ExPASy Swiss-Prot
Match:
Q84PB7 (Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=THF1 PE=2 SV=1)
HSP 1 Score: 358.2 bits (918), Expect = 8.5e-98
Identity = 194/288 (67.36%), Positives = 236/288 (81.94%), Query Frame = 0
Query: 1 MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
MAA++S+ F+ + + +D R PS + A+ SV SR V+
Sbjct: 1 MAAISSLPFAALRRAADCR---PSTAAAAAGAGAGAVVLSV--------RPRRGSRSVVR 60
Query: 61 CMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALG 120
C++ DV TVAETK+NFLK+YKRPI SIY++VLQEL+VQQHLMRYK TY+YD VFALG
Sbjct: 61 CVATAGDVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQYDAVFALG 120
Query: 121 FVTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFAS 180
FVTVYDQLM+GYPS+EDR+AIF+AYI ALNEDPEQYR DAQK+EEWARSQ+ SLVEF+S
Sbjct: 121 FVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSS 180
Query: 181 KEGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKS 240
K+GE+E++LKDI+ERA KGSFSYSRFFA+GLFRLLELANATEP+IL+KLCAALN++K+S
Sbjct: 181 KDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRS 240
Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER+ + +NEAVTK
Sbjct: 241 VDRDLDVYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTK 277
BLAST of CmoCh11G019300 vs. ExPASy Swiss-Prot
Match:
B0C3M8 (Protein Thf1 OS=Acaryochloris marina (strain MBIC 11017) OX=329726 GN=thf1 PE=3 SV=1)
HSP 1 Score: 152.1 bits (383), Expect = 9.3e-36
Identity = 84/219 (38.36%), Positives = 136/219 (62.10%), Query Frame = 0
Query: 67 DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQ 126
++ TV++TK F + RP+ S+Y V++EL+V+ HL+R +RYDP+FALG T +D+
Sbjct: 3 NLRTVSDTKRAFYSIHTRPVNSVYRRVVEELMVEMHLLRVNEDFRYDPIFALGVTTSFDR 62
Query: 127 LMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEF---ASKEG- 186
MDGY + D++AIF A KA DP Q + D Q+L E A+S+SA ++++ A+ G
Sbjct: 63 FMDGYQPENDKDAIFSAICKAQEADPVQMKKDGQRLTELAQSKSAQEMLDWITQAANSGG 122
Query: 187 -EVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELA--NATE-----PSILEKLCAALN 246
E++ L++IA+ F YSR FAIGLF LLEL+ N T+ L +C LN
Sbjct: 123 DELQWQLRNIAQNP----KFKYSRLFAIGLFTLLELSEGNITQDEESLAEFLPNICTVLN 182
Query: 247 VDKKSVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRD 274
+ + + +DL++YR L K+ Q ++ + + ++ +KK+R+
Sbjct: 183 ISESKLQKDLEIYRGNLDKIAQVRQAMDDILEAQKKRRE 217
BLAST of CmoCh11G019300 vs. ExPASy Swiss-Prot
Match:
Q116P5 (Protein Thf1 OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=thf1 PE=3 SV=1)
HSP 1 Score: 141.4 bits (355), Expect = 1.6e-32
Identity = 80/216 (37.04%), Positives = 129/216 (59.72%), Query Frame = 0
Query: 70 TVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMD 129
TV++TK F + RPI SIYN V++EL+V+ HL+ Y Y+P +ALG VT +D+ M
Sbjct: 6 TVSDTKKTFYHFHTRPINSIYNRVIEELLVEMHLISVNVDYSYNPFYALGVVTAFDRFMQ 65
Query: 130 GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEF--ASKEGEVESV 189
GY ED+ +IF A I+ EDP +YR DA+ LE+ A SA+ ++ + SK +
Sbjct: 66 GYSPQEDKTSIFNALIQGQEEDPNKYRSDAKGLEDLAGKISASDILSWICLSKNIDNTQY 125
Query: 190 LKDIAERAASKGSFSYSRFFAIGLFRLLELANA-------TEPSILEKLCAALNVDKKSV 249
L+D + F YSR FAIGLF LLE+ + L+K+C +LN+ ++ +
Sbjct: 126 LQDDLRAISENSKFRYSRLFAIGLFTLLEIVDTELIKEQEKRTEALKKICQSLNLVEEKL 185
Query: 250 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERA 277
+D+D+Y + L ++ QA+ +++ + +KKR++R+
Sbjct: 186 LKDIDLYLSNLERVAQARSAMEDTLAAMRKKREKRS 221
BLAST of CmoCh11G019300 vs. ExPASy TrEMBL
Match:
A0A6J1ES50 (protein THYLAKOID FORMATION1, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111437233 PE=3 SV=1)
HSP 1 Score: 543.1 bits (1398), Expect = 6.9e-151
Identity = 287/287 (100.00%), Positives = 287/287 (100.00%), Query Frame = 0
Query: 1 MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH
Sbjct: 1 MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
Query: 61 CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF 120
CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61 CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF 120
Query: 121 VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK 180
VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK
Sbjct: 121 VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK 180
Query: 181 EGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
EGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV
Sbjct: 181 EGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 287
BLAST of CmoCh11G019300 vs. ExPASy TrEMBL
Match:
A0A6J1JMT7 (protein THYLAKOID FORMATION1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111486096 PE=3 SV=1)
HSP 1 Score: 540.8 bits (1392), Expect = 3.4e-150
Identity = 284/287 (98.95%), Positives = 286/287 (99.65%), Query Frame = 0
Query: 1 MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
MAAVNSVSFSTVSQFSDRRSP+PSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH
Sbjct: 1 MAAVNSVSFSTVSQFSDRRSPIPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
Query: 61 CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF 120
CMS GTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61 CMSTGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF 120
Query: 121 VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK 180
VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK
Sbjct: 121 VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK 180
Query: 181 EGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
EGEVES+LKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV
Sbjct: 181 EGEVESILKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 287
BLAST of CmoCh11G019300 vs. ExPASy TrEMBL
Match:
A0A5D3C7D3 (Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G003760 PE=3 SV=1)
HSP 1 Score: 506.5 bits (1303), Expect = 7.2e-140
Identity = 261/287 (90.94%), Positives = 280/287 (97.56%), Query Frame = 0
Query: 1 MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
MAAVNS+SFST++Q SDRR PVPS+RSL+SNFDGFRFR+S+F H+S VR S+FSSR+VIH
Sbjct: 1 MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60
Query: 61 CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF 120
CMSAGTDVTTVAETKLNFLKAYKRPIPSIYN+VLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61 CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
Query: 121 VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK 180
VTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ+AASLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180
Query: 181 EGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
EGEVES+LKDIAERA SKG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEA+TK
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITK 287
BLAST of CmoCh11G019300 vs. ExPASy TrEMBL
Match:
A0A1S3C0V5 (protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495547 PE=3 SV=1)
HSP 1 Score: 506.5 bits (1303), Expect = 7.2e-140
Identity = 261/287 (90.94%), Positives = 280/287 (97.56%), Query Frame = 0
Query: 1 MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
MAAVNS+SFST++Q SDRR PVPS+RSL+SNFDGFRFR+S+F H+S VR S+FSSR+VIH
Sbjct: 1 MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60
Query: 61 CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF 120
CMSAGTDVTTVAETKLNFLKAYKRPIPSIYN+VLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61 CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
Query: 121 VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK 180
VTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQ+AASLVEFAS+
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180
Query: 181 EGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
EGEVES+LKDIAERA SKG+FSYSRFFAIGLFRLLELANATEPSILEKLCAALN+DKK V
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEA+TK
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITK 287
BLAST of CmoCh11G019300 vs. ExPASy TrEMBL
Match:
A0A6J1C3B8 (protein THYLAKOID FORMATION1, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007981 PE=3 SV=1)
HSP 1 Score: 505.4 bits (1300), Expect = 1.6e-139
Identity = 264/287 (91.99%), Positives = 278/287 (96.86%), Query Frame = 0
Query: 1 MAAVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIH 60
MAAVNSVSFS +SQ S+RR VPSARSLASNFDGFRFR+SVF H+SGVRTSS+SSR+V+H
Sbjct: 1 MAAVNSVSFSALSQCSERRLLVPSARSLASNFDGFRFRTSVFCHYSGVRTSSYSSRMVVH 60
Query: 61 CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGF 120
CMSAGTDVTTVAETK NFLK YKRPIPSIYN+VLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61 CMSAGTDVTTVAETKANFLKVYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
Query: 121 VTVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASK 180
VTVYDQLM+GYPSDEDREAIFQAYIKALNEDPEQYRIDA+KLEEWARSQ+AASLVEFASK
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKLEEWARSQTAASLVEFASK 180
Query: 181 EGEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSV 240
EGEVES+LKDIAERA KGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNV+KKSV
Sbjct: 181 EGEVESILKDIAERAGGKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVNKKSV 240
Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEA+TK
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITK 287
BLAST of CmoCh11G019300 vs. TAIR 10
Match:
AT2G20890.1 (photosystem II reaction center PSB29 protein )
HSP 1 Score: 370.9 bits (951), Expect = 9.1e-103
Identity = 201/286 (70.28%), Positives = 243/286 (84.97%), Query Frame = 0
Query: 3 AVNSVSFSTVSQFSDRRSPVPSARSLASNFDGFRFRSSVFYHHSGVRTSSFSSRLVIHCM 62
A++S+SF + Q SD+ S S+R LAS R + + + S +S+ +IHCM
Sbjct: 5 AISSLSFPALGQ-SDKISNFASSRPLAS-----AIRICTKFSRLSLNSRS-TSKSLIHCM 64
Query: 63 SAGT-DVTTVAETKLNFLKAYKRPIPSIYNSVLQELIVQQHLMRYKRTYRYDPVFALGFV 122
S T DV V+ETK FLKAYKRPIPSIYN+VLQELIVQQHLMRYK+TYRYDPVFALGFV
Sbjct: 65 SNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFV 124
Query: 123 TVYDQLMDGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQSAASLVEFASKE 182
TVYDQLM+GYPSD+DR+AIF+AYI+ALNEDP+QYRIDAQK+EEWARSQ++ASLV+F+SKE
Sbjct: 125 TVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKE 184
Query: 183 GEVESVLKDIAERAASKGSFSYSRFFAIGLFRLLELANATEPSILEKLCAALNVDKKSVD 242
G++E+VLKDIA RA SK FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LN++KKSVD
Sbjct: 185 GDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVD 244
Query: 243 RDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAVTK 288
RDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE ++K
Sbjct: 245 RDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISK 283
The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q7XAB8 | 2.9e-106 | 70.83 | Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1...
Q9SKT0 | 1.3e-101 | 70.28 | Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=...
Q84PB7 | 8.5e-98 | 67.36 | Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=3...
B0C3M8 | 9.3e-36 | 38.36 | Protein Thf1 OS=Acaryochloris marina (strain MBIC 11017) OX=329726 GN=thf1 PE=3 ...
Q116P5 | 1.6e-32 | 37.04 | Protein Thf1 OS=Trichodesmium erythraeum (strain IMS101) OX=203124 GN=thf1 PE=3 ...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1ES50 | 6.9e-151 | 100.00 | protein THYLAKOID FORMATION1, chloroplastic-like OS=Cucurbita moschata OX=3662 G...
A0A6J1JMT7 | 3.4e-150 | 98.95 | protein THYLAKOID FORMATION1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=...
A0A5D3C7D3 | 7.2e-140 | 90.94 | Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A1S3C0V5 | 7.2e-140 | 90.94 | protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495...
A0A6J1C3B8 | 1.6e-139 | 91.99 | protein THYLAKOID FORMATION1, chloroplastic isoform X1 OS=Momordica charantia OX...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G20890.1 | 9.1e-103 | 70.28 | photosystem II reaction center PSB29 protein