Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCGCCAAAACTCAGGCGTTCACCGCCGCCGTTGCGGCGGCGTAACTACGCCACCGATTATGATGCTTCCCTATCTCAATCTCTCGACGCATCAAACGAAGACGACTACGACGCTTCTGAATCTAATAACTTCCAAACCAGCGGCCACAAATCAAAATCCCTAGAAATCAATGAAGAATCCGCCACGAACAGTCCAACGAATTTACAGAGTCCAAACGCCGCCGCAACAGTATTTCCATACATTAATATCGCACCGTTGCCTGTTTTCCACGGCGGCTCCGATGAATGTCCGGCTACGCATTTAAGCAGATTCGCCAAAGTTTGCCGTGCGAACAACGCGGCCTCCGTCGAGATTATGATGAGAATCTTTCCGGTAACGTTACAGGGTGAGGCTCTGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCAATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCGTATAACAAAATCGAATTGGCTGAGCAGTTGCGATCGGAGCTTATGACGATCAGTCAACGGCCGGAGGAGAATGTTCGTTCTTATTTTCTGAGGCTGCAGTTGATCCTGAAGAAATGGCCGCCGGGAAACGAACTTTCCGATGGGTTTTTGAAAGCGATTTTCATGGATGGATTGAGGGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCGGATTCTCTGAACGAGGCGTTGCGACTTGCATTTGGGCTCGAACAAGTTACGGTCATCCGTACTTCCGGCGGAAAGCGGTTTCTCCGGTGTGGGTTTTGTGAGGGGCGGCATGAGGAATTGGTTTGTGAGGTTAGGGAAAGAATGAGACGGTTGTGGAAGAGTAGGGAAAAGAAGAATGGCGGCGATATGGCGGAGAGCGAGGGGCATAATACGGCGGAGCTTGTGCGGTCGGTTTCGGCGATAAGCAGAAATGAAGCGGAGGTTGGGAAGGACGGCGGGGAAATGGTGGGTTTGAAGAAGAAAGGTCAGTGTCAGTGCTGGAAGCATCAGTGTGGGATGAAGAAATTGGATCGAAACCTTAGCATGCTATCCAAAACTTCTAAACCCTAA
mRNA sequence
ATGGCGCCAAAACTCAGGCGTTCACCGCCGCCGTTGCGGCGGCGTAACTACGCCACCGATTATGATGCTTCCCTATCTCAATCTCTCGACGCATCAAACGAAGACGACTACGACGCTTCTGAATCTAATAACTTCCAAACCAGCGGCCACAAATCAAAATCCCTAGAAATCAATGAAGAATCCGCCACGAACAGTCCAACGAATTTACAGAGTCCAAACGCCGCCGCAACAGTATTTCCATACATTAATATCGCACCGTTGCCTGTTTTCCACGGCGGCTCCGATGAATGTCCGGCTACGCATTTAAGCAGATTCGCCAAAGTTTGCCGTGCGAACAACGCGGCCTCCGTCGAGATTATGATGAGAATCTTTCCGGTAACGTTACAGGGTGAGGCTCTGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCAATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCGTATAACAAAATCGAATTGGCTGAGCAGTTGCGATCGGAGCTTATGACGATCAGTCAACGGCCGGAGGAGAATGTTCGTTCTTATTTTCTGAGGCTGCAGTTGATCCTGAAGAAATGGCCGCCGGGAAACGAACTTTCCGATGGGTTTTTGAAAGCGATTTTCATGGATGGATTGAGGGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCGGATTCTCTGAACGAGGCGTTGCGACTTGCATTTGGGCTCGAACAAGTTACGGTCATCCGTACTTCCGGCGGAAAGCGGTTTCTCCGGTGTGGGTTTTGTGAGGGGCGGCATGAGGAATTGGTTTGTGAGGTTAGGGAAAGAATGAGACGGTTGTGGAAGAGTAGGGAAAAGAAGAATGGCGGCGATATGGCGGAGAGCGAGGGGCATAATACGGCGGAGCTTGTGCGGTCGGTTTCGGCGATAAGCAGAAATGAAGCGGAGGTTGGGAAGGACGGCGGGGAAATGGTGGGTTTGAAGAAGAAAGGTCAGTGTCAGTGCTGGAAGCATCAGTGTGGGATGAAGAAATTGGATCGAAACCTTAGCATGCTATCCAAAACTTCTAAACCCTAA
Coding sequence (CDS)
ATGGCGCCAAAACTCAGGCGTTCACCGCCGCCGTTGCGGCGGCGTAACTACGCCACCGATTATGATGCTTCCCTATCTCAATCTCTCGACGCATCAAACGAAGACGACTACGACGCTTCTGAATCTAATAACTTCCAAACCAGCGGCCACAAATCAAAATCCCTAGAAATCAATGAAGAATCCGCCACGAACAGTCCAACGAATTTACAGAGTCCAAACGCCGCCGCAACAGTATTTCCATACATTAATATCGCACCGTTGCCTGTTTTCCACGGCGGCTCCGATGAATGTCCGGCTACGCATTTAAGCAGATTCGCCAAAGTTTGCCGTGCGAACAACGCGGCCTCCGTCGAGATTATGATGAGAATCTTTCCGGTAACGTTACAGGGTGAGGCTCTGCTTTGGTACGACTTGAACATTGAACCGTACCCTCCAATTTCTTGGGAAGAATTGAAGTCTTCGTTCTTGGACGCGTATAACAAAATCGAATTGGCTGAGCAGTTGCGATCGGAGCTTATGACGATCAGTCAACGGCCGGAGGAGAATGTTCGTTCTTATTTTCTGAGGCTGCAGTTGATCCTGAAGAAATGGCCGCCGGGAAACGAACTTTCCGATGGGTTTTTGAAAGCGATTTTCATGGATGGATTGAGGGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCGGATTCTCTGAACGAGGCGTTGCGACTTGCATTTGGGCTCGAACAAGTTACGGTCATCCGTACTTCCGGCGGAAAGCGGTTTCTCCGGTGTGGGTTTTGTGAGGGGCGGCATGAGGAATTGGTTTGTGAGGTTAGGGAAAGAATGAGACGGTTGTGGAAGAGTAGGGAAAAGAAGAATGGCGGCGATATGGCGGAGAGCGAGGGGCATAATACGGCGGAGCTTGTGCGGTCGGTTTCGGCGATAAGCAGAAATGAAGCGGAGGTTGGGAAGGACGGCGGGGAAATGGTGGGTTTGAAGAAGAAAGGTCAGTGTCAGTGCTGGAAGCATCAGTGTGGGATGAAGAAATTGGATCGAAACCTTAGCATGCTATCCAAAACTTCTAAACCCTAA
Protein sequence
MAPKLRRSPPPLRRRNYATDYDASLSQSLDASNEDDYDASESNNFQTSGHKSKSLEINEESATNSPTNLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRANNAASVEIMMRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRPEENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLAFGLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNGGDMAESEGHNTAELVRSVSAISRNEAEVGKDGGEMVGLKKKGQCQCWKHQCGMKKLDRNLSMLSKTSKP
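The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate_cds` and the constants are hypothetical and are not part of this database. The demonstration input is the first 18 nucleotides of the CDS with a `TAA` stop codon appended for illustration.

```python
# Standard genetic code (NCBI translation table 1), laid out in the
# conventional T, C, A, G codon ordering. Names here are illustrative only.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate_cds(cds: str) -> str:
    """Translate a coding sequence into protein, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons containing ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the CDS above, plus an appended stop codon.
print(translate_cds("ATGGCGCCAAAACTCAGGTAA"))  # MAPKLR
```

Passing the full CDS string to the same function should reproduce the protein sequence shown above, provided the record uses the standard code.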
Homology
BLAST of Csor.00g132210 vs. NCBI nr
Match:
KAG6604769.1 (hypothetical protein SDJN03_02086, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 714 bits (1842), Expect = 2.13e-259
Identity = 358/358 (100.00%), Positives = 358/358 (100.00%), Query Frame = 0
Query: 1 MAPKLRRSPPPLRRRNYATDYDASLSQSLDASNEDDYDASESNNFQTSGHKSKSLEINEE 60
MAPKLRRSPPPLRRRNYATDYDASLSQSLDASNEDDYDASESNNFQTSGHKSKSLEINEE
Sbjct: 1 MAPKLRRSPPPLRRRNYATDYDASLSQSLDASNEDDYDASESNNFQTSGHKSKSLEINEE 60
Query: 61 SATNSPTNLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRANNAASVEIM 120
SATNSPTNLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRANNAASVEIM
Sbjct: 61 SATNSPTNLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRANNAASVEIM 120
Query: 121 MRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRPE 180
MRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRPE
Sbjct: 121 MRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRPE 180
Query: 181 ENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLAF 240
ENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLAF
Sbjct: 181 ENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLAF 240
Query: 241 GLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNGGDMAESEGHNT 300
GLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNGGDMAESEGHNT
Sbjct: 241 GLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNGGDMAESEGHNT 300
Query: 301 AELVRSVSAISRNEAEVGKDGGEMVGLKKKGQCQCWKHQCGMKKLDRNLSMLSKTSKP 358
AELVRSVSAISRNEAEVGKDGGEMVGLKKKGQCQCWKHQCGMKKLDRNLSMLSKTSKP
Sbjct: 301 AELVRSVSAISRNEAEVGKDGGEMVGLKKKGQCQCWKHQCGMKKLDRNLSMLSKTSKP 358
BLAST of Csor.00g132210 vs. NCBI nr
Match:
CAN62167.1 (hypothetical protein VITISV_007470 [Vitis vinifera])
HSP 1 Score: 325 bits (832), Expect = 1.06e-103
Identity = 172/306 (56.21%), Positives = 216/306 (70.59%), Query Frame = 0
Query: 53 KSLEINEESATNSPTNLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRAN 112
K S+ NS +N +P ++ YINIAPLP+F G SDECP THLSRF KVCRAN
Sbjct: 211 KGKSFRPSSSLNSSSNSLNPFXQSS---YINIAPLPIFRGSSDECPVTHLSRFTKVCRAN 270
Query: 113 NAASVEIMMRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSEL 172
N +SVE++MRIFPVTL GEA LWYDLNIEPY +SWEE+KSSFL AY+++ L ++LRSEL
Sbjct: 271 NVSSVEMIMRIFPVTLDGEAALWYDLNIEPYSSLSWEEIKSSFLQAYHRJGLTDELRSEL 330
Query: 173 MTISQRPEENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSL 232
M I+Q EE+VRSYFLRLQ ILK+WP + L DG L+ IF+DGLR++F++W+IPQKP SL
Sbjct: 331 MMINQGTEESVRSYFLRLQWILKRWPD-HGLPDGLLEGIFIDGLRKDFQDWIIPQKPSSL 390
Query: 233 NEALRLAFGLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKK----N 292
NEALRLAF E+V IR G R CGFC G H+E CE+RERMR LW +K+ +
Sbjct: 391 NEALRLAFAWEKVQSIR---GGREKECGFCSGGHDEEGCEIRERMRXLWVKSKKQTRDYS 450
Query: 293 GGDMAESEGHNTAELVRSVSAISRNEAEVGKDGGE-MVGLKKKGQCQCWKHQCGMKKLDR 352
G + + +G E SV SR+ + ++G E +G KKK QCQC KHQC KKL+R
Sbjct: 451 GRIVNDEDGEKEFERRVSVGGESRBVGKNEEEGEEGXMGWKKKSQCQCGKHQCWKKKLER 509
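The percentages in each HSP statistics line are the match counts divided by the alignment length. The sketch below, with the hypothetical helper `check_hsp_line`, parses one such line and recomputes both percentages; it assumes the line layout used on this page and tolerates the alternative spelling "Postives" that some BLAST displays emit.

```python
import re

# Pattern for lines such as:
#   Identity = 172/306 (56.21%), Positives = 216/306 (70.59%), Query Frame = 0
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_line(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = 100 * int(ident) / int(ident_len)
    positives = 100 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(identity, 2),
        "positives_pct": round(positives, 2),
        # True when the recomputed values agree with the reported ones.
        "matches_reported": (
            abs(identity - float(ident_pct)) < 0.01
            and abs(positives - float(pos_pct)) < 0.01
        ),
    }


result = check_hsp_line(
    "Identity = 172/306 (56.21%), Positives = 216/306 (70.59%), Query Frame = 0"
)
print(result)
```

For the Vitis vinifera hit, 172/306 gives 56.21% identity and 216/306 gives 70.59% positives, which agrees with the reported values.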
BLAST of Csor.00g132210 vs. NCBI nr
Match:
EXB78111.1 (hypothetical protein L484_004813 [Morus notabilis])
HSP 1 Score: 312 bits (800), Expect = 2.47e-100
Identity = 172/303 (56.77%), Positives = 205/303 (67.66%), Query Frame = 0
Query: 70 QSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRANNAASVEIMMRIFPVTLQ 129
Q P + Y+NIA P+F GGS+ECP HLSRFAKVCRANN +S+++MM+IFPVTL+
Sbjct: 103 QQPVSQTGYNSYMNIAQFPIFRGGSEECPFAHLSRFAKVCRANNVSSIDMMMKIFPVTLE 162
Query: 130 GEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRPEENVRSYFLR 189
EA LWYDLN+EPY +SWEE+KSSF AY KIEL EQLRS+LMTI+Q E+VRSYFLR
Sbjct: 163 DEAALWYDLNVEPYEELSWEEIKSSFYHAYGKIELTEQLRSQLMTINQGDAESVRSYFLR 222
Query: 190 LQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLAFGLEQVTVIR 249
LQ ILKKWP + LSD LK +F+DGLR +F+EWM PQKP SLN+ALRLAF EQV IR
Sbjct: 223 LQWILKKWPE-HGLSDDLLKGVFVDGLRGDFQEWMAPQKPGSLNKALRLAFCFEQVKSIR 282
Query: 250 TSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNG--------GDMAESEGHNTA 309
++CGFC G HEE CEVRERMR LW K +G + +SEG
Sbjct: 283 NVRRNASVKCGFCGGLHEERGCEVRERMRELWLKSNKDDGLGKGMLERNLIEKSEG--VK 342
Query: 310 ELVRSVS-AISRNEAEVGK------DG----GEMVGLKKKGQCQCWKHQCGMKKLDRNLS 353
EL RSVS A SR+ VGK DG E+ KK+ QCQC KHQC K ++RN S
Sbjct: 343 ELGRSVSMATSRSTCVVGKNDQVEEDGKEEEDELGSKKKRSQCQCGKHQCWKKNIERNNS 402
BLAST of Csor.00g132210 vs. NCBI nr
Match:
CAB4273215.1 (unnamed protein product [Prunus armeniaca])
HSP 1 Score: 309 bits (791), Expect = 1.05e-99
Identity = 162/291 (55.67%), Positives = 208/291 (71.48%), Query Frame = 0
Query: 68 NLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRAN-NAASVEIMMRIFPV 127
N P T + YI IAPLP+F GGS+ECP THL+RFAK+CRAN + +V++M+RIFPV
Sbjct: 71 NFSEPTTNQTPY-YIPIAPLPIFRGGSNECPVTHLTRFAKICRANFSCPTVDVMVRIFPV 130
Query: 128 TLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRPEENVRSY 187
TL+ EA LWYDLNI+PYP +SWEE++S F AY++I+ QLRSEL I Q +E VRSY
Sbjct: 131 TLENEAALWYDLNIDPYPSLSWEEIRSLFFQAYDQID---QLRSELTMIKQGRDETVRSY 190
Query: 188 FLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLAFGLEQVT 247
FLRLQ ILK+WP + L D LK +F+DGLR+EFK+W++ +KP SLN+ALRLAFG E+V
Sbjct: 191 FLRLQWILKRWPD-HGLQDNVLKGVFIDGLRKEFKDWIVAEKPSSLNDALRLAFGFEKVK 250
Query: 248 VIR--TSGGKRFLRCGFCEGRHEELVCEVRERMRRLW-KSREKKNGGDMAESEGHNTAEL 307
+R T+ ++ + CGFC G HEE CEVRERMR+LW KS+E EG L
Sbjct: 251 SVRATTAAKEKAVECGFCGGGHEEKGCEVRERMRKLWVKSKE----------EG-----L 310
Query: 308 VRSVSAISRNEAE--VGKDGGEMVGLKKKGQCQCWKHQCGMKKLDRNLSML 352
VR VS + + E E ++ GE+V LKKKGQCQCWKHQC KKL+R+ S++
Sbjct: 311 VRMVSVVGKREEEGVEREEEGELVDLKKKGQCQCWKHQCWKKKLERSKSLV 341
BLAST of Csor.00g132210 vs. NCBI nr
Match:
KAF3973300.1 (hypothetical protein CMV_003263 [Castanea mollissima])
HSP 1 Score: 311 bits (797), Expect = 2.39e-99
Identity = 176/333 (52.85%), Positives = 225/333 (67.57%), Query Frame = 0
Query: 57 INEESATNSP----------TNLQSPNAAATVFP-------------------YINIAPL 116
I ES TN+P + QS N + T FP YINIAP
Sbjct: 68 IGSESETNAPGDRFSSQLRDPDSQSVNLSTTAFPNSTSNFPKISQPPSTHLASYINIAPF 127
Query: 117 PVFHGGSDECPATHLSRFAKVCRANNAASVEIMMRIFPVTLQGEALLWYDLNIEPYPPIS 176
P+FHG +ECP H+SRFAKVC ANN ++ ++MM IFPVTL+ EA LWYDLNI+PYP ++
Sbjct: 128 PIFHGNPNECPVKHVSRFAKVCVANNVSTTDMMMSIFPVTLEDEAALWYDLNIDPYPSLT 187
Query: 177 WEELKSSFLDAYNKIELAEQLRSELMTISQRPEENVRSYFLRLQLILKKWPPGNELSDGF 236
WEE+KSSFL AY+KI++ +QLRSELM I+Q EE+VRSYFLRLQ ILK+WP + + DG
Sbjct: 188 WEEIKSSFLHAYHKIQVVDQLRSELMMINQGDEESVRSYFLRLQWILKQWPD-HGIPDGL 247
Query: 237 LKAIFMDGLREEFKEWMIPQKPDSLNEALRLAFGLEQVTVIRTSGGKRFLRCGFCEGRHE 296
LK +F+DGLREEF++W+ PQKPDSL+EALRLAF EQV IR ++ L+CGFC+G HE
Sbjct: 248 LKGVFIDGLREEFRDWIFPQKPDSLHEALRLAFAFEQVKSIRAV--RKELKCGFCDGMHE 307
Query: 297 ELVCEVRERMRRLWK-SREKKNGGDMAESEGHNT---AELVRSVS---AISRNEAEVGKD 353
E CEVRERMR+LW+ S+EK+ +A+S + ELVRSVS + S + G++
Sbjct: 308 ERDCEVRERMRKLWRESKEKEEPVVLAKSTRSDDDLGKELVRSVSIGASSSVGKNNEGEE 367
BLAST of Csor.00g132210 vs. ExPASy TrEMBL
Match:
A0A7N2R9A7 (Retrotrans_gag domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 322 bits (826), Expect = 1.22e-104
Identity = 191/375 (50.93%), Positives = 246/375 (65.60%), Query Frame = 0
Query: 15 RNYATDYDASLSQSLDASNEDDYDASESNNFQTSGHKSKSLEINEESATNSP-------- 74
R YA D SL SNE+ Y+ + + + I+ ES TN+P
Sbjct: 30 REYAYKDDNYSDASLSESNENGYEYE-----RPAKDDNDDAYISSESETNAPGDRFSSQL 89
Query: 75 --TNLQSPNAAATVFP-------------------YINIAPLPVFHGGSDECPATHLSRF 134
+ QS N + T FP Y+NIAP+P+FHG ++ECP H+SRF
Sbjct: 90 RDPDSQSINLSTTAFPNSTSNFPKISQPPSTHLASYMNIAPIPIFHGNTNECPVKHVSRF 149
Query: 135 AKVCRANNAASVEIMMRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELA 194
AKVC ANN ++ ++MMRIFPVTL+ EA LWYDLNIEPYP ++WEE+KSSFL AY+KIE+
Sbjct: 150 AKVCVANNVSTTDMMMRIFPVTLEDEAALWYDLNIEPYPSLTWEEIKSSFLHAYHKIEVV 209
Query: 195 EQLRSELMTISQRPEENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMI 254
+QLRSELM I+Q EE+VRSYFLRLQ ILK+WP + +SDG LK +F+DGLREEF+ W+I
Sbjct: 210 DQLRSELMMINQGDEESVRSYFLRLQWILKQWPD-HGISDGLLKGVFIDGLREEFRGWII 269
Query: 255 PQKPDSLNEALRLAFGLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWK-SR 314
PQKPDSL+EALRLAFG EQV IR ++ L+CGFC+G HEE CEVRERMR+LW+ S+
Sbjct: 270 PQKPDSLHEALRLAFGFEQVKSIRAV--RKELKCGFCDGMHEERDCEVRERMRKLWRESK 329
Query: 315 EKKNGGDMAESEGHNTA---ELVRSVS---AISRNEAEVGKDGGEMVGLKKKGQCQCWKH 353
EK+ +A+S G + ELVRSVS + S + G++ G M G KK Q Q K+
Sbjct: 330 EKEEAVVLAKSTGGDDELGKELVRSVSIGASSSVGKNNEGEEAGFMDG--KKNQFQYGKY 389
BLAST of Csor.00g132210 vs. ExPASy TrEMBL
Match:
A5C7E6 (Retrotrans_gag domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_007470 PE=4 SV=1)
HSP 1 Score: 323 bits (828), Expect = 2.06e-103
Identity = 172/306 (56.21%), Positives = 215/306 (70.26%), Query Frame = 0
Query: 53 KSLEINEESATNSPTNLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRAN 112
K S+ NS +N +P ++ YINIAPLP+F G SDECP THLSRF KVCRAN
Sbjct: 211 KGKSFRPSSSLNSSSNSLNPFXQSS---YINIAPLPIFRGSSDECPVTHLSRFTKVCRAN 270
Query: 113 NAASVEIMMRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSEL 172
N +SVE++MRIFPVTL GEA LWYDLNIEPY +SWEE+KSSFL AY++ L ++LRSEL
Sbjct: 271 NVSSVEMIMRIFPVTLDGEAALWYDLNIEPYSSLSWEEIKSSFLQAYHRXGLTDELRSEL 330
Query: 173 MTISQRPEENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSL 232
M I+Q EE+VRSYFLRLQ ILK+WP + L DG L+ IF+DGLR++F++W+IPQKP SL
Sbjct: 331 MMINQGTEESVRSYFLRLQWILKRWPD-HGLPDGLLEGIFIDGLRKDFQDWIIPQKPSSL 390
Query: 233 NEALRLAFGLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKK----N 292
NEALRLAF E+V IR G R CGFC G H+E CE+RERMR LW +K+ +
Sbjct: 391 NEALRLAFAWEKVQSIR---GGREKECGFCSGGHDEEGCEIRERMRXLWVKSKKQTRDYS 450
Query: 293 GGDMAESEGHNTAELVRSVSAISRNEAEVGKDGGE-MVGLKKKGQCQCWKHQCGMKKLDR 352
G + + +G E SV SR+ + ++G E +G KKK QCQC KHQC KKL+R
Sbjct: 451 GRIVNDEDGEKEFERRVSVGGESRBVGKNEEEGEEGXMGWKKKSQCQCGKHQCWKKKLER 509
BLAST of Csor.00g132210 vs. ExPASy TrEMBL
Match:
W9R9S0 (Retrotrans_gag domain-containing protein OS=Morus notabilis OX=981085 GN=L484_004813 PE=4 SV=1)
HSP 1 Score: 312 bits (800), Expect = 1.19e-100
Identity = 172/303 (56.77%), Positives = 205/303 (67.66%), Query Frame = 0
Query: 70 QSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRANNAASVEIMMRIFPVTLQ 129
Q P + Y+NIA P+F GGS+ECP HLSRFAKVCRANN +S+++MM+IFPVTL+
Sbjct: 103 QQPVSQTGYNSYMNIAQFPIFRGGSEECPFAHLSRFAKVCRANNVSSIDMMMKIFPVTLE 162
Query: 130 GEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRPEENVRSYFLR 189
EA LWYDLN+EPY +SWEE+KSSF AY KIEL EQLRS+LMTI+Q E+VRSYFLR
Sbjct: 163 DEAALWYDLNVEPYEELSWEEIKSSFYHAYGKIELTEQLRSQLMTINQGDAESVRSYFLR 222
Query: 190 LQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLAFGLEQVTVIR 249
LQ ILKKWP + LSD LK +F+DGLR +F+EWM PQKP SLN+ALRLAF EQV IR
Sbjct: 223 LQWILKKWPE-HGLSDDLLKGVFVDGLRGDFQEWMAPQKPGSLNKALRLAFCFEQVKSIR 282
Query: 250 TSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNG--------GDMAESEGHNTA 309
++CGFC G HEE CEVRERMR LW K +G + +SEG
Sbjct: 283 NVRRNASVKCGFCGGLHEERGCEVRERMRELWLKSNKDDGLGKGMLERNLIEKSEG--VK 342
Query: 310 ELVRSVS-AISRNEAEVGK------DG----GEMVGLKKKGQCQCWKHQCGMKKLDRNLS 353
EL RSVS A SR+ VGK DG E+ KK+ QCQC KHQC K ++RN S
Sbjct: 343 ELGRSVSMATSRSTCVVGKNDQVEEDGKEEEDELGSKKKRSQCQCGKHQCWKKNIERNNS 402
BLAST of Csor.00g132210 vs. ExPASy TrEMBL
Match:
A0A6J5UDI4 (Retrotrans_gag domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS20487 PE=4 SV=1)
HSP 1 Score: 309 bits (791), Expect = 5.06e-100
Identity = 162/291 (55.67%), Positives = 208/291 (71.48%), Query Frame = 0
Query: 68 NLQSPNAAATVFPYINIAPLPVFHGGSDECPATHLSRFAKVCRAN-NAASVEIMMRIFPV 127
N P T + YI IAPLP+F GGS+ECP THL+RFAK+CRAN + +V++M+RIFPV
Sbjct: 71 NFSEPTTNQTPY-YIPIAPLPIFRGGSNECPVTHLTRFAKICRANFSCPTVDVMVRIFPV 130
Query: 128 TLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRPEENVRSY 187
TL+ EA LWYDLNI+PYP +SWEE++S F AY++I+ QLRSEL I Q +E VRSY
Sbjct: 131 TLENEAALWYDLNIDPYPSLSWEEIRSLFFQAYDQID---QLRSELTMIKQGRDETVRSY 190
Query: 188 FLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLAFGLEQVT 247
FLRLQ ILK+WP + L D LK +F+DGLR+EFK+W++ +KP SLN+ALRLAFG E+V
Sbjct: 191 FLRLQWILKRWPD-HGLQDNVLKGVFIDGLRKEFKDWIVAEKPSSLNDALRLAFGFEKVK 250
Query: 248 VIR--TSGGKRFLRCGFCEGRHEELVCEVRERMRRLW-KSREKKNGGDMAESEGHNTAEL 307
+R T+ ++ + CGFC G HEE CEVRERMR+LW KS+E EG L
Sbjct: 251 SVRATTAAKEKAVECGFCGGGHEEKGCEVRERMRKLWVKSKE----------EG-----L 310
Query: 308 VRSVSAISRNEAE--VGKDGGEMVGLKKKGQCQCWKHQCGMKKLDRNLSML 352
VR VS + + E E ++ GE+V LKKKGQCQCWKHQC KKL+R+ S++
Sbjct: 311 VRMVSVVGKREEEGVEREEEGELVDLKKKGQCQCWKHQCWKKKLERSKSLV 341
BLAST of Csor.00g132210 vs. ExPASy TrEMBL
Match:
A0A061DJI4 (Retrotrans_gag domain-containing protein OS=Theobroma cacao OX=3641 GN=TCM_001704 PE=4 SV=1)
HSP 1 Score: 306 bits (785), Expect = 2.05e-98
Identity = 181/360 (50.28%), Positives = 229/360 (63.61%), Query Frame = 0
Query: 24 SLSQSLDASNEDDYDASESNN----FQTSGHKSKSLEINEESATNSPTNLQSPN---AAA 83
SLS S D SN DD + + N F S +S+S+ +A N+P L N AAA
Sbjct: 47 SLSHSPDESNGDDLEQPRNENDYDDFDASDFQSESM----TNAPNAPKTLLRGNGLSAAA 106
Query: 84 TV-----------------FPYINIAPLPVFHGGSDECPATHLSRFAKVCRANNAASVEI 143
++ YINIAPLP+F G +CP THLSRFAKVCRANN +SV++
Sbjct: 107 SLNSVSNSAIWSRSNLIEATSYINIAPLPIFQGSPSDCPVTHLSRFAKVCRANNVSSVDM 166
Query: 144 MMRIFPVTLQGEALLWYDLNIEPYPPISWEELKSSFLDAYNKIELAEQLRSELMTISQRP 203
MMRIFPVTL+ EA LWYDLNIEPYP + WEE+KSSFL AY+K ++ EQLR ELM I+Q
Sbjct: 167 MMRIFPVTLENEAGLWYDLNIEPYPSLRWEEIKSSFLQAYHKTQVTEQLRHELMMINQGS 226
Query: 204 EENVRSYFLRLQLILKKWPPGNELSDGFLKAIFMDGLREEFKEWMIPQKPDSLNEALRLA 263
EE VRSYFLRLQ L++WP + + + LK IF+DGLRE+F++W++PQKPDSL EALRLA
Sbjct: 227 EERVRSYFLRLQWSLQRWPD-HGIPENLLKEIFVDGLREDFQDWIVPQKPDSLVEALRLA 286
Query: 264 FGLEQVTVIRTSGGKRFLRCGFCEGRHEELVCEVRERMRRLWKSREKKNGGDMAESEGHN 323
EQ+ I+ S K+ L+C FCEG HEE C+VRERM+ LW+ + K D +E N
Sbjct: 287 IAFEQLKSIKISR-KKDLKCDFCEGSHEERNCQVRERMKELWRKTKDKEWMDSSEKNQSN 346
Query: 324 TAELVRSV-SAISRNEAEVGKDGGEMVGLK--KKGQCQCWKHQCGMKKLDRNLSMLSKTS 356
A + SA R E E +G + G K KK CQC KHQC K+LDR S++S+ S
Sbjct: 347 EAVNESAEGSAEDRIEEENVVEGEMLSGRKQKKKSPCQCCKHQCWKKQLDRTNSLVSRNS 400
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
KAG6604769.1 | 2.13e-259 | 100.00 | hypothetical protein SDJN03_02086, partial [Cucurbita argyrosperma subsp. sorori...
CAN62167.1 | 1.06e-103 | 56.21 | hypothetical protein VITISV_007470 [Vitis vinifera]
EXB78111.1 | 2.47e-100 | 56.77 | hypothetical protein L484_004813 [Morus notabilis]
CAB4273215.1 | 1.05e-99 | 55.67 | unnamed protein product [Prunus armeniaca]
KAF3973300.1 | 2.39e-99 | 52.85 | hypothetical protein CMV_003263 [Castanea mollissima]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A7N2R9A7 | 1.22e-104 | 50.93 | Retrotrans_gag domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1
A5C7E6 | 2.06e-103 | 56.21 | Retrotrans_gag domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_00...
W9R9S0 | 1.19e-100 | 56.77 | Retrotrans_gag domain-containing protein OS=Morus notabilis OX=981085 GN=L484_00...
A0A6J5UDI4 | 5.06e-100 | 55.67 | Retrotrans_gag domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_...
A0A061DJI4 | 2.05e-98 | 50.28 | Retrotrans_gag domain-containing protein OS=Theobroma cacao OX=3641 GN=TCM_00170...
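The summary rows above are pipe-delimited, so they can be loaded into records for sorting or filtering. This is a rough sketch under that assumption; `BlastHit` and `parse_hit_row` are hypothetical names, and the parser simply skips header rows, empty cells, and any trailing "[more]" cell.

```python
from typing import NamedTuple, Optional


class BlastHit(NamedTuple):
    name: str
    evalue: float
    identity: float
    description: str


def parse_hit_row(row: str) -> Optional[BlastHit]:
    """Parse one pipe-delimited summary row; return None for non-data rows."""
    cells = [cell.strip() for cell in row.split("|")]
    # Drop empty cells and the "[more]" link residue, if present.
    cells = [cell for cell in cells if cell and cell != "[more]"]
    if len(cells) != 4 or cells[0] == "Match Name":
        return None
    name, evalue, identity, description = cells
    return BlastHit(name, float(evalue), float(identity), description)


rows = [
    "Match Name | E-value | Identity | Description | |",
    "CAN62167.1 | 1.06e-103 | 56.21 | hypothetical protein VITISV_007470 [Vitis vinifera] | [more] |",
    "EXB78111.1 | 2.47e-100 | 56.77 | hypothetical protein L484_004813 [Morus notabilis] | [more] |",
]
hits = [hit for hit in map(parse_hit_row, rows) if hit is not None]
best = min(hits, key=lambda hit: hit.evalue)
print(best.name)  # CAN62167.1
```

Sorting by E-value rather than by identity mirrors the order in which the hits are listed, since a lower E-value indicates a more significant match.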