Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCCGTCCCATTCCGAATGGTTCTCTTTGGATTCGAAACGAAGGTACTCTTGGAAGTTGGAAACCCTATCCTCCCCCGGCATTCCCGTTCTCGTTCATACGCTATGGCGTCTCTTTATATTTTCCAGTCTTCGCATCTCTAATTTCCCTCTTCATTTCTTTCAGTATGTTTTCCTGCGCTTGCAATGGCATCCAATTGGACTCTTCAATTACATTCTCATTCTTTCCAGGTAATCGCCCGTTCTAGTTTTCTTTTTCCTTTTAATCATGTTTCTTCTAATTTTATCAGGAAATGTTTTCTGCCATGTGTATTAAGTTTTTAGATCATGTGTGCAGCCATCCGTATGGGTTCGAAAGCCTCCTGCTTTGTCCGGCATATTTGAACTTGGATTTGGTTATGAATGCACTAGCGTTAGGTATGGACTTTTGAACTGTGACTGGAATCGTAATCGAAAGGGCAAGTTTCTAATAAGAGCGGCTGAGGGTTCCGCGACCTCTGATTCAGGCCAAAATGTTGAAGATAATGAAATTGTCGTCAAGAGTGGTACTGGTGCTGTGGTTTCCAAAGATTACATCGGAAAAATGCAGGAGATGATTAGTTCGTCGCCTGCCGGCCTTTTCTTGGTAAATATATTTTGTTCTCAATCGTTTAGGAGTCTGTCGTTTTAAAGAAGGGGCTAAAAACCACTACCCGACTCTTCCTTCTCTATAGTTTTTGCACGTTCGAGAATGGCGTCTGTAGTGACTCTCTTCTTACTGTTATATTCACAATTTGTCATCACCTCATACCGCAAATGAATGAAGAACTGTCTCCTGAATTGGCATGGATTATAACACTGATACTTCTTGTACTACTTGCTTGGAACCTGCATTGCATCAATTAATGGTTGTGGGCAAATGGTTCATGATATCAGTGGAATTAATATAAACAGATGCAGGTTTTGAATGTAAAGAATTACTTGCAATATGCAACAATGCATGATGCTTTTTGTTTTAAACCCTAAACCCTTGTGAAAGAACTCTGAGCTTTGTTTTGGAGTTGCGGGTATCTTACACTGTTATCCCATTCTGTACCAGATGAACAAATGTGCTGGAAATGGTCTTGCAATTGGGTTTTGCATTGCAACTGCTTGTTTAGCAATAGTTGCAAGAGTGTATTTGATGGGGAAGTCCGGGAATAGTAATTCCGGGTCAGTGGCTGATTTAGTCAGGCGTGGTCAGCTAAGATCTGACAGAAGAGGCATGTATGATTGAAATCTTTGGAAGCTTTATCTTTGTAACGTGCTAATATGTTTATTATGTCCTGTTTCCATTATATTCTTATAATTTTCTCTATAATCATCTTTTTTGCAGTTCCAAGCCTTTAAAATACAACGATCCCTTTAATAATCCATTGGTAAAGGTTGACAAAAGAAATTCATCTGTGGAAATGTGTGGAAAGGTTTATCGATTGGCCCCAGTTACTCTTACTAAGGAGGAACAAAGTATTCATCAGAAACGGAGGTCTCGAGCATATAAGTGGAAGAGACCAACCATGTTTCTCAAGGAAGGAGATTCAATACCTCCTGATGTTGACCCCGATACAGTCAGGTGGATTCCTGCAAACCATCCTTTTGCAACAACAGCTAGTGATATTGATGAAGACTTGGCCCAGAACAATGTGTACCAAAAGCATGGTGTTCCTTTCCGTATTCAAGCTGAGCATGAGGCACTGCAGAGAAAGCTTGAAGCACTCCAGAGT
mRNA sequence
ATGGCCGTCCCATTCCGAATGGTTCTCTTTGGATTCGAAACGAAGTATGTTTTCCTGCGCTTGCAATGGCATCCAATTGGACTCTTCAATTACATTCTCATTCTTTCCAGATCATGTGTGCAGCCATCCGTATGGGTTCGAAAGCCTCCTGCTTTGTCCGGCATATTTGAACTTGGATTTGGTTATGAATGCACTAGCGTTAGGTATGGACTTTTGAACTGTGACTGGAATCGTAATCGAAAGGGCAAGTTTCTAATAAGAGCGGCTGAGGGTTCCGCGACCTCTGATTCAGGCCAAAATGTTGAAGATAATGAAATTGTCGTCAAGAGTGGTACTGGTGCTGTGGTTTCCAAAGATTACATCGGAAAAATGCAGGAGATGATTAGTTCGTCGCCTGCCGGCCTTTTCTTGATGAACAAATGTGCTGGAAATGGTCTTGCAATTGGGTTTTGCATTGCAACTGCTTGTTTAGCAATAGTTGCAAGAGTGTATTTGATGGGGAAGTCCGGGAATAGTAATTCCGGGTCAGTGGCTGATTTAGTCAGGCGTGGTCAGCTAAGATCTGACAGAAGAGGCATTTCCAAGCCTTTAAAATACAACGATCCCTTTAATAATCCATTGGTAAAGGTTGACAAAAGAAATTCATCTGTGGAAATGTGTGGAAAGGTTTATCGATTGGCCCCAGTTACTCTTACTAAGGAGGAACAAAGTATTCATCAGAAACGGAGGTCTCGAGCATATAAGTGGAAGAGACCAACCATGTTTCTCAAGGAAGGAGATTCAATACCTCCTGATGTTGACCCCGATACAGTCAGGTGGATTCCTGCAAACCATCCTTTTGCAACAACAGCTAGTGATATTGATGAAGACTTGGCCCAGAACAATGTGTACCAAAAGCATGGTGTTCCTTTCCGTATTCAAGCTGAGCATGAGGCACTGCAGAGAAAGCTTGAAGCACTCCAGAGT
Coding sequence (CDS)
ATGGCCGTCCCATTCCGAATGGTTCTCTTTGGATTCGAAACGAAGTATGTTTTCCTGCGCTTGCAATGGCATCCAATTGGACTCTTCAATTACATTCTCATTCTTTCCAGATCATGTGTGCAGCCATCCGTATGGGTTCGAAAGCCTCCTGCTTTGTCCGGCATATTTGAACTTGGATTTGGTTATGAATGCACTAGCGTTAGGTATGGACTTTTGAACTGTGACTGGAATCGTAATCGAAAGGGCAAGTTTCTAATAAGAGCGGCTGAGGGTTCCGCGACCTCTGATTCAGGCCAAAATGTTGAAGATAATGAAATTGTCGTCAAGAGTGGTACTGGTGCTGTGGTTTCCAAAGATTACATCGGAAAAATGCAGGAGATGATTAGTTCGTCGCCTGCCGGCCTTTTCTTGATGAACAAATGTGCTGGAAATGGTCTTGCAATTGGGTTTTGCATTGCAACTGCTTGTTTAGCAATAGTTGCAAGAGTGTATTTGATGGGGAAGTCCGGGAATAGTAATTCCGGGTCAGTGGCTGATTTAGTCAGGCGTGGTCAGCTAAGATCTGACAGAAGAGGCATTTCCAAGCCTTTAAAATACAACGATCCCTTTAATAATCCATTGGTAAAGGTTGACAAAAGAAATTCATCTGTGGAAATGTGTGGAAAGGTTTATCGATTGGCCCCAGTTACTCTTACTAAGGAGGAACAAAGTATTCATCAGAAACGGAGGTCTCGAGCATATAAGTGGAAGAGACCAACCATGTTTCTCAAGGAAGGAGATTCAATACCTCCTGATGTTGACCCCGATACAGTCAGGTGGATTCCTGCAAACCATCCTTTTGCAACAACAGCTAGTGATATTGATGAAGACTTGGCCCAGAACAATGTGTACCAAAAGCATGGTGTTCCTTTCCGTATTCAAGCTGAGCATGAGGCACTGCAGAGAAAGCTTGAAGCACTCCAGAGT
Protein sequence
MAVPFRMVLFGFETKYVFLRLQWHPIGLFNYILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRNRKGKFLIRAAEGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAIGFCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVKVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDPDTVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS
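The protein sequence above appears to be the conceptual translation of the CDS. The following is a minimal sketch of that step, assuming the standard genetic code (the page does not state which translation table was used). The `translate` helper and its handling of unknown codons are illustrative choices, not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumed; the page names no table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # "X" marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the CDS listed above.
cds_prefix = "ATGGCCGTCCCATTCCGAATGGTTCTCTTT"
print(translate(cds_prefix))  # MAVPFRMVLF
```

The output matches the first ten residues of the protein sequence; passing the full CDS string to `translate` should work the same way.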
Homology
BLAST of Sgr017724 vs. NCBI nr
Match:
XP_022135712.1 (protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Momordica charantia])
HSP 1 Score: 504.2 bits (1297), Expect = 8.3e-139
Identity = 249/283 (87.99%), Positives = 268/283 (94.70%), Query Frame = 0
Query: 41 QPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRNRKG-KFLIRAAEGSATSDSGQ 100
QPSV VRKPPALSGI FGYECT + Y LNC+WNR+R G KFLIRAAEGSA+SDSGQ
Sbjct: 15 QPSVSVRKPPALSGI----FGYECTGISYRHLNCNWNRHRMGCKFLIRAAEGSASSDSGQ 74
Query: 101 NVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAIGFCIATACLAI 160
NVE++E+VVK+GTG+V SKDYIGKMQEMI+SSPAG+FLM+KC GNGLAIGFC+ATACLAI
Sbjct: 75 NVEEDEMVVKTGTGSVASKDYIGKMQEMINSSPAGIFLMSKCTGNGLAIGFCVATACLAI 134
Query: 161 VARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVKVDKRNSSVEM 220
+ARVYLMGKS NS+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVKVDK NSSVEM
Sbjct: 135 IARVYLMGKSRNSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVKVDKSNSSVEM 194
Query: 221 CGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDPDTVRWIPANHP 280
CGKVYRLAPVTLTKEEQ+IHQKRRSRAY+WKRPTMFLKEGDSIPPDVDP+T+RWIPANHP
Sbjct: 195 CGKVYRLAPVTLTKEEQNIHQKRRSRAYQWKRPTMFLKEGDSIPPDVDPETIRWIPANHP 254
Query: 281 FATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
FATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS
Sbjct: 255 FATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 293
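The percentages in each HSP statistics line are simply the match counts divided by the alignment length. As a rough sketch, the line can be parsed and the percentages recomputed as below. The `parse_hsp_stats` function and its regular expression are hypothetical helpers written for this page's line format, not part of any BLAST toolkit; the pattern accepts both "Positives" and the "Postives" spelling that some report generators emit.

```python
import re

# Matches lines such as:
#   Identity = 249/283 (87.99%), Postives = 268/283 (94.70%), Query Frame = 0
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute their percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_pct": (float(ident_pct), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 249/283 (87.99%), Postives = 268/283 (94.70%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the first hit, 249/283 and 268/283 reproduce the reported 87.99% and 94.70%.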
BLAST of Sgr017724 vs. NCBI nr
Match:
XP_038899779.1 (protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Benincasa hispida] >XP_038899780.1 protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Benincasa hispida])
HSP 1 Score: 496.1 bits (1276), Expect = 2.2e-136
Identity = 245/294 (83.33%), Positives = 264/294 (89.80%), Query Frame = 0
Query: 30 NYILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRNRKG-KFLIRA 89
N+ L L QP WVRKP AL GI + G EC ++RYG LNC+WNR+R G KF I++
Sbjct: 4 NWALQLHSLSFQPHAWVRKPSALPGISQREIGNECAAIRYGHLNCNWNRHRIGRKFQIKS 63
Query: 90 AEGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAI 149
AEGS ++DSG++VED+EIVVKSG+G V SKDYIGKMQEMI SSP G+FLMNKC GNGLAI
Sbjct: 64 AEGSGSTDSGRDVEDDEIVVKSGSGGVASKDYIGKMQEMIISSPPGVFLMNKCTGNGLAI 123
Query: 150 GFCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 209
GFCI TACLAI+ARVYLMGKS NS+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV
Sbjct: 124 GFCIVTACLAILARVYLMGKSRNSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 183
Query: 210 KVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDP 269
KVDK NSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAY+WKRPTMFLKEGDSIPPDVDP
Sbjct: 184 KVDKSNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYQWKRPTMFLKEGDSIPPDVDP 243
Query: 270 DTVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
DT+RWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS
Sbjct: 244 DTIRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 297
BLAST of Sgr017724 vs. NCBI nr
Match:
XP_022940742.1 (protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Cucurbita moschata])
HSP 1 Score: 490.3 bits (1261), Expect = 1.2e-134
Identity = 247/294 (84.01%), Positives = 265/294 (90.14%), Query Frame = 0
Query: 30 NYILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRN-RKGKFLIRA 89
N+ L L QPSVW RKPPALSGIFELG GY+CTS+RY LN WNR+ + K LIRA
Sbjct: 4 NWTLQLHSHSFQPSVWARKPPALSGIFELGIGYKCTSIRYEHLN--WNRHGMRRKSLIRA 63
Query: 90 AEGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAI 149
+ GS ++DSG+NVED+EIVVKSGT AV SKD+IGKM+EM SSSP G+F+MNKC GNGLAI
Sbjct: 64 SWGSGSTDSGRNVEDDEIVVKSGTVAVASKDFIGKMKEMTSSSPLGVFVMNKCTGNGLAI 123
Query: 150 GFCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 209
GF I TACLAIVARVY MGKS NS+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV
Sbjct: 124 GFFIVTACLAIVARVYFMGKSRNSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 183
Query: 210 KVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDP 269
KVDK NSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAY+WKRPTMFLKEGDSIPPDVDP
Sbjct: 184 KVDKSNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYQWKRPTMFLKEGDSIPPDVDP 243
Query: 270 DTVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
D+VRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQ+
Sbjct: 244 DSVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQN 295
BLAST of Sgr017724 vs. NCBI nr
Match:
KAG7037967.1 (Protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 489.6 bits (1259), Expect = 2.1e-134
Identity = 246/293 (83.96%), Positives = 266/293 (90.78%), Query Frame = 0
Query: 31 YILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRN-RKGKFLIRAA 90
++ L RS +QPSVW RKPPALSGIFELG GY+CTS+RY LN WNR+ + K LIRA+
Sbjct: 13 HVYSLFRSYMQPSVWARKPPALSGIFELGIGYKCTSIRYEHLN--WNRHGMRRKSLIRAS 72
Query: 91 EGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAIG 150
GS ++DSG+NVED+EIVVKSGT AV SKD+IGKM+EM SSSP G+F+MNKC GNGLAIG
Sbjct: 73 WGSGSTDSGRNVEDDEIVVKSGTVAVASKDFIGKMKEMTSSSPLGVFVMNKCTGNGLAIG 132
Query: 151 FCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVK 210
F I TACLAIVARVY MGKS NS+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVK
Sbjct: 133 FFIITACLAIVARVYFMGKSRNSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVK 192
Query: 211 VDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDPD 270
VDK NSSVEMCGKVYRLAPVTLTKEEQS HQKRRSRAY+WKRPTMFLKEGDSIPPDVDPD
Sbjct: 193 VDKSNSSVEMCGKVYRLAPVTLTKEEQSTHQKRRSRAYQWKRPTMFLKEGDSIPPDVDPD 252
Query: 271 TVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
+VRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQ+
Sbjct: 253 SVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQN 303
BLAST of Sgr017724 vs. NCBI nr
Match:
KAG6608650.1 (Protein MULTIPLE CHLOROPLAST DIVISION SITE 1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 488.0 bits (1255), Expect = 6.1e-134
Identity = 246/294 (83.67%), Positives = 264/294 (89.80%), Query Frame = 0
Query: 30 NYILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRN-RKGKFLIRA 89
N+ L L QPSVW RKPPALSGIFELG GY+CTS+RY LN WNR+ + K LIRA
Sbjct: 4 NWTLQLHSHSFQPSVWARKPPALSGIFELGIGYKCTSIRYEHLN--WNRHGMRRKSLIRA 63
Query: 90 AEGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAI 149
+ GS ++DSG+NVED+EIVVKSGT AV SKD+IGKM+EM SSSP G+F+MNKC GNGLAI
Sbjct: 64 SWGSGSTDSGRNVEDDEIVVKSGTVAVASKDFIGKMKEMTSSSPLGVFVMNKCTGNGLAI 123
Query: 150 GFCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 209
GF I TACLAIVARVY MGKS NS+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV
Sbjct: 124 GFFIITACLAIVARVYFMGKSRNSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 183
Query: 210 KVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDP 269
KVDK NSSVEMCGKVYRLAPVTLTKEEQS HQKRRSRAY+WKRPTMFLKEGDSIPPDVDP
Sbjct: 184 KVDKSNSSVEMCGKVYRLAPVTLTKEEQSTHQKRRSRAYQWKRPTMFLKEGDSIPPDVDP 243
Query: 270 DTVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
D+VRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQ+
Sbjct: 244 DSVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQN 295
BLAST of Sgr017724 vs. ExPASy Swiss-Prot
Match:
Q8GWA7 (Protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Arabidopsis thaliana OX=3702 GN=MCD1 PE=1 SV=1)
HSP 1 Score: 310.8 bits (795), Expect = 1.8e-83
Identity = 164/260 (63.08%), Positives = 199/260 (76.54%), Query Frame = 0
Query: 72 LNCDWNR-NRKGKFLIRAAEGSATSDSG-QNV--EDNEIVVKSGTGAVVSKD---YIGKM 131
+N +W + KG F+ +A S+T D QN +DN +VV + T + + D I +
Sbjct: 38 VNLNWVQFETKGSFVCKAIGDSSTPDEDIQNTQSDDNVVVVTATTQSDIPHDSEYSISRF 97
Query: 132 QEMISSSPAGLFLMNKCAGNGLAIGFCIATACLAIVARVYLMGKS-GNSNSGSVADLVRR 191
+ M+++ P +FLM KC+ N + IG CI L R Y++ KS N +GSVADLVRR
Sbjct: 98 RSMVTTLPPVVFLMKKCSVNSIWIGVCITATVLVAAIRAYVVRKSRDNQRAGSVADLVRR 157
Query: 192 GQLRS-DRRGISKPLKYNDPFNNPLVKVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKR 251
GQLRS DRRGISK L Y DPFNNP VK+DK +S+VEMCGKVYRLAPVTLT++EQ+IHQKR
Sbjct: 158 GQLRSGDRRGISKSLNYEDPFNNPFVKLDKGSSTVEMCGKVYRLAPVTLTEKEQTIHQKR 217
Query: 252 RSRAYKWKRPTMFLKEGDSIPPDVDPDTVRWIPANHPFATTASDIDEDLAQNNVYQKHGV 311
RSRAY+WKRPT+FLKEGDSIPPDVDPDTVRWIPANHPFATT SDID+DLAQNNVYQK GV
Sbjct: 218 RSRAYQWKRPTIFLKEGDSIPPDVDPDTVRWIPANHPFATTVSDIDQDLAQNNVYQKQGV 277
Query: 312 PFRIQAEHEALQRKLEALQS 323
PFRI+AEHEA+Q+KLEALQ+
Sbjct: 278 PFRIRAEHEAMQKKLEALQN 297
BLAST of Sgr017724 vs. ExPASy TrEMBL
Match:
A0A6J1C3I1 (protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Momordica charantia OX=3673 GN=LOC111007603 PE=4 SV=1)
HSP 1 Score: 504.2 bits (1297), Expect = 4.0e-139
Identity = 249/283 (87.99%), Positives = 268/283 (94.70%), Query Frame = 0
Query: 41 QPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRNRKG-KFLIRAAEGSATSDSGQ 100
QPSV VRKPPALSGI FGYECT + Y LNC+WNR+R G KFLIRAAEGSA+SDSGQ
Sbjct: 15 QPSVSVRKPPALSGI----FGYECTGISYRHLNCNWNRHRMGCKFLIRAAEGSASSDSGQ 74
Query: 101 NVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAIGFCIATACLAI 160
NVE++E+VVK+GTG+V SKDYIGKMQEMI+SSPAG+FLM+KC GNGLAIGFC+ATACLAI
Sbjct: 75 NVEEDEMVVKTGTGSVASKDYIGKMQEMINSSPAGIFLMSKCTGNGLAIGFCVATACLAI 134
Query: 161 VARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVKVDKRNSSVEM 220
+ARVYLMGKS NS+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVKVDK NSSVEM
Sbjct: 135 IARVYLMGKSRNSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLVKVDKSNSSVEM 194
Query: 221 CGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDPDTVRWIPANHP 280
CGKVYRLAPVTLTKEEQ+IHQKRRSRAY+WKRPTMFLKEGDSIPPDVDP+T+RWIPANHP
Sbjct: 195 CGKVYRLAPVTLTKEEQNIHQKRRSRAYQWKRPTMFLKEGDSIPPDVDPETIRWIPANHP 254
Query: 281 FATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
FATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS
Sbjct: 255 FATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 293
BLAST of Sgr017724 vs. ExPASy TrEMBL
Match:
A0A6J1FKG4 (protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Cucurbita moschata OX=3662 GN=LOC111446245 PE=4 SV=1)
HSP 1 Score: 490.3 bits (1261), Expect = 6.0e-135
Identity = 247/294 (84.01%), Positives = 265/294 (90.14%), Query Frame = 0
Query: 30 NYILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRN-RKGKFLIRA 89
N+ L L QPSVW RKPPALSGIFELG GY+CTS+RY LN WNR+ + K LIRA
Sbjct: 4 NWTLQLHSHSFQPSVWARKPPALSGIFELGIGYKCTSIRYEHLN--WNRHGMRRKSLIRA 63
Query: 90 AEGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAI 149
+ GS ++DSG+NVED+EIVVKSGT AV SKD+IGKM+EM SSSP G+F+MNKC GNGLAI
Sbjct: 64 SWGSGSTDSGRNVEDDEIVVKSGTVAVASKDFIGKMKEMTSSSPLGVFVMNKCTGNGLAI 123
Query: 150 GFCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 209
GF I TACLAIVARVY MGKS NS+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV
Sbjct: 124 GFFIVTACLAIVARVYFMGKSRNSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 183
Query: 210 KVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDP 269
KVDK NSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAY+WKRPTMFLKEGDSIPPDVDP
Sbjct: 184 KVDKSNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYQWKRPTMFLKEGDSIPPDVDP 243
Query: 270 DTVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
D+VRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQ+
Sbjct: 244 DSVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQN 295
BLAST of Sgr017724 vs. ExPASy TrEMBL
Match:
A0A5D3D8N6 (Protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G00430 PE=4 SV=1)
HSP 1 Score: 485.7 bits (1249), Expect = 1.5e-133
Identity = 241/295 (81.69%), Positives = 264/295 (89.49%), Query Frame = 0
Query: 29 FNYILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRNR-KGKFLIR 88
FN+ L L QP VWVRKPPALS I +LG G + +RYG LNC+ NR+R GKF+I+
Sbjct: 3 FNWTLQLHSLSFQPYVWVRKPPALSVISQLGLGSKWDGIRYGHLNCNSNRSRIGGKFVIK 62
Query: 89 AAEGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLA 148
+AEGS ++DSG+NVE EIVVKSGTG V SKDYIGKMQE+I+ SP G+FLMNKC NGLA
Sbjct: 63 SAEGSGSTDSGRNVEGEEIVVKSGTGGVASKDYIGKMQELIALSPPGVFLMNKCTRNGLA 122
Query: 149 IGFCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPL 208
IGFC+ TACLAIVARVYLMGKS +S+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPL
Sbjct: 123 IGFCVVTACLAIVARVYLMGKSRSSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPL 182
Query: 209 VKVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVD 268
VKVDK NSSVEMCGKVYRLAPVTLTKEEQ+IHQKRRSRAY+WKRPTMFLKEGDSIPPDVD
Sbjct: 183 VKVDKSNSSVEMCGKVYRLAPVTLTKEEQNIHQKRRSRAYQWKRPTMFLKEGDSIPPDVD 242
Query: 269 PDTVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
P+T+RWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQ+KLEALQS
Sbjct: 243 PETIRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQKKLEALQS 297
BLAST of Sgr017724 vs. ExPASy TrEMBL
Match:
A0A1S3BVZ2 (protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Cucumis melo OX=3656 GN=LOC103493840 PE=4 SV=1)
HSP 1 Score: 485.7 bits (1249), Expect = 1.5e-133
Identity = 241/295 (81.69%), Positives = 264/295 (89.49%), Query Frame = 0
Query: 29 FNYILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRNR-KGKFLIR 88
FN+ L L QP VWVRKPPALS I +LG G + +RYG LNC+ NR+R GKF+I+
Sbjct: 3 FNWTLQLHSLSFQPYVWVRKPPALSVISQLGLGSKWDGIRYGHLNCNSNRSRIGGKFVIK 62
Query: 89 AAEGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLA 148
+AEGS ++DSG+NVE EIVVKSGTG V SKDYIGKMQE+I+ SP G+FLMNKC NGLA
Sbjct: 63 SAEGSGSTDSGRNVEGEEIVVKSGTGGVASKDYIGKMQELIALSPPGVFLMNKCTRNGLA 122
Query: 149 IGFCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPL 208
IGFC+ TACLAIVARVYLMGKS +S+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPL
Sbjct: 123 IGFCVVTACLAIVARVYLMGKSRSSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPL 182
Query: 209 VKVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVD 268
VKVDK NSSVEMCGKVYRLAPVTLTKEEQ+IHQKRRSRAY+WKRPTMFLKEGDSIPPDVD
Sbjct: 183 VKVDKSNSSVEMCGKVYRLAPVTLTKEEQNIHQKRRSRAYQWKRPTMFLKEGDSIPPDVD 242
Query: 269 PDTVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
P+T+RWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQ+KLEALQS
Sbjct: 243 PETIRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQKKLEALQS 297
BLAST of Sgr017724 vs. ExPASy TrEMBL
Match:
A0A6J1IW72 (protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Cucurbita maxima OX=3661 GN=LOC111480480 PE=4 SV=1)
HSP 1 Score: 481.5 bits (1238), Expect = 2.8e-132
Identity = 244/294 (82.99%), Positives = 261/294 (88.78%), Query Frame = 0
Query: 30 NYILILSRSCVQPSVWVRKPPALSGIFELGFGYECTSVRYGLLNCDWNRN-RKGKFLIRA 89
N+ L L QPSVW RKPPALSGIFELG GY+CT +RY LN WNR+ K LIRA
Sbjct: 4 NWTLQLHSHSFQPSVWARKPPALSGIFELGIGYKCTGIRYEHLN--WNRHGMDRKSLIRA 63
Query: 90 AEGSATSDSGQNVEDNEIVVKSGTGAVVSKDYIGKMQEMISSSPAGLFLMNKCAGNGLAI 149
+ GS ++DSG+NVED+EIVVKSGT AV SKD+IGKM+EM SSSP G+F+MNKC N LAI
Sbjct: 64 SWGSGSTDSGRNVEDDEIVVKSGTVAVASKDFIGKMKEMTSSSPLGVFVMNKCTRNRLAI 123
Query: 150 GFCIATACLAIVARVYLMGKSGNSNSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 209
GF I TACLAIVARVY MGKS NS+SGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV
Sbjct: 124 GFFIVTACLAIVARVYFMGKSRNSHSGSVADLVRRGQLRSDRRGISKPLKYNDPFNNPLV 183
Query: 210 KVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYKWKRPTMFLKEGDSIPPDVDP 269
KVDK NSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAY+WKRPTMFLKEGDSIPPDVDP
Sbjct: 184 KVDKSNSSVEMCGKVYRLAPVTLTKEEQSIHQKRRSRAYQWKRPTMFLKEGDSIPPDVDP 243
Query: 270 DTVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQS 323
D+VRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQ+
Sbjct: 244 DSVRWIPANHPFATTASDIDEDLAQNNVYQKHGVPFRIQAEHEALQRKLEALQN 295
BLAST of Sgr017724 vs. TAIR 10
Match:
AT1G20830.1 (multiple chloroplast division site 1)
HSP 1 Score: 310.8 bits (795), Expect = 1.2e-84
Identity = 164/260 (63.08%), Positives = 199/260 (76.54%), Query Frame = 0
Query: 72 LNCDWNR-NRKGKFLIRAAEGSATSDSG-QNV--EDNEIVVKSGTGAVVSKD---YIGKM 131
+N +W + KG F+ +A S+T D QN +DN +VV + T + + D I +
Sbjct: 38 VNLNWVQFETKGSFVCKAIGDSSTPDEDIQNTQSDDNVVVVTATTQSDIPHDSEYSISRF 97
Query: 132 QEMISSSPAGLFLMNKCAGNGLAIGFCIATACLAIVARVYLMGKS-GNSNSGSVADLVRR 191
+ M+++ P +FLM KC+ N + IG CI L R Y++ KS N +GSVADLVRR
Sbjct: 98 RSMVTTLPPVVFLMKKCSVNSIWIGVCITATVLVAAIRAYVVRKSRDNQRAGSVADLVRR 157
Query: 192 GQLRS-DRRGISKPLKYNDPFNNPLVKVDKRNSSVEMCGKVYRLAPVTLTKEEQSIHQKR 251
GQLRS DRRGISK L Y DPFNNP VK+DK +S+VEMCGKVYRLAPVTLT++EQ+IHQKR
Sbjct: 158 GQLRSGDRRGISKSLNYEDPFNNPFVKLDKGSSTVEMCGKVYRLAPVTLTEKEQTIHQKR 217
Query: 252 RSRAYKWKRPTMFLKEGDSIPPDVDPDTVRWIPANHPFATTASDIDEDLAQNNVYQKHGV 311
RSRAY+WKRPT+FLKEGDSIPPDVDPDTVRWIPANHPFATT SDID+DLAQNNVYQK GV
Sbjct: 218 RSRAYQWKRPTIFLKEGDSIPPDVDPDTVRWIPANHPFATTVSDIDQDLAQNNVYQKQGV 277
Query: 312 PFRIQAEHEALQRKLEALQS 323
PFRI+AEHEA+Q+KLEALQ+
Sbjct: 278 PFRIRAEHEAMQKKLEALQN 297
The following BLAST results are available for this feature:
NCBI nr:
Match Name | E-value | Identity (%) | Description
XP_022135712.1 | 8.3e-139 | 87.99 | protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Momordica charantia]
XP_038899779.1 | 2.2e-136 | 83.33 | protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Benincasa hispida] >XP_038899780.1...
XP_022940742.1 | 1.2e-134 | 84.01 | protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Cucurbita moschata]
KAG7037967.1 | 2.1e-134 | 83.96 | Protein MULTIPLE CHLOROPLAST DIVISION SITE 1 [Cucurbita argyrosperma subsp. argy...
KAG6608650.1 | 6.1e-134 | 83.67 | Protein MULTIPLE CHLOROPLAST DIVISION SITE 1, partial [Cucurbita argyrosperma su...
ExPASy Swiss-Prot:
Match Name | E-value | Identity (%) | Description
Q8GWA7 | 1.8e-83 | 63.08 | Protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Arabidopsis thaliana OX=3702 GN=...
ExPASy TrEMBL:
Match Name | E-value | Identity (%) | Description
A0A6J1C3I1 | 4.0e-139 | 87.99 | protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Momordica charantia OX=3673 GN=L...
A0A6J1FKG4 | 6.0e-135 | 84.01 | protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Cucurbita moschata OX=3662 GN=LO...
A0A5D3D8N6 | 1.5e-133 | 81.69 | Protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Cucumis melo var. makuwa OX=1194...
A0A1S3BVZ2 | 1.5e-133 | 81.69 | protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Cucumis melo OX=3656 GN=LOC10349...
A0A6J1IW72 | 2.8e-132 | 82.99 | protein MULTIPLE CHLOROPLAST DIVISION SITE 1 OS=Cucurbita maxima OX=3661 GN=LOC1...
TAIR 10:
Match Name | E-value | Identity (%) | Description
AT1G20830.1 | 1.2e-84 | 63.08 | multiple chloroplast division site 1