Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACGTGTTTCTACGGACATCCTGAGGAAGAAAAGAGACAGCTCTCATGGGACCTTATGCGATGGCTGAAAGGGAGTGATTCAATCTCATGGCTGATAGGAAGAGACTTTAATGGCATATTAAGTCATGATGAGAAATGGGGGGGAAACCAGAAGAATGATAAGTTGATCAATGACTTCAGTCAGGCGATGGATGATTGCAATTTAATAGATTTAGGCTTCTCTGGTGAGCCGTTTACGTGGTGTAATAGAAGGCCAAATGGGAAGGCTGTATATGAATGATTGATCGGGTGTGTTTTAACCCAGCGTGGGCAAGTTTATTTCCAAATAGTATTAAACACCACTTGGAGTATAGGACATCCAATCATATACCACTGGCGCTAAACATAGGGGAAGAATTGGTATGGAAGGCAAAGAGGTCCGCAAGAATTTACAGATTTGAGGAAACCTGGCTTGAAGATATAAACTGCAAACATATAGTTAAGAATAGTTGGAGTAGTGGAAATTTTGATGGCAGCCCTCAGTCTCTGATCTCAAAGCTGAATTATTGTGCAACTCACCTATCCCATTGGGGATGAAATAAAGCTGGAAATTATCGTAGAAGACTGAAAGAAGCGGAAACCCAACTGCAAAAAGCAATTAAAAACCTCCCAAGCGTGGGAGACCGAACAGAAATTCTTATGGCAGAGATGAATATGGCAAAGCTTTTAAACGAGGAAGAAATATATTGGCGACAACGCTCAAGAGAGCAATGGCTTAAATGGGGGTATCAGAATACTAAGTGGTTCCATAATAAAGCATCTCATAGAAAGCGGAAGAACGAAATCAAGGGATTGTTTTACTCGTAAGGAAGATGGGAAGCGGACCAAAATAAAATCGCCGATATGATATCTAACTACTTCTCTGATTTATTTTCTTCTTCAGGTCCTAGGAGGCTGGACATTTTTCATGTGTCCCGTTGTATTGTGCCCCGAGTCACTTCGGCCATAAACTGAGAGCTTCTAAAGAGTTTCAGTGAAGATGAAATTCAAGAGGCTATGGGTCAAATCCATCCAAATAAATGGCCAGGTCCGGATGGTTTCTCTGGAGCTTTCTACAAAAATTTTTTAGCACATTGTGGGCAAGGATGTTATAGCCTGTTGTTTGAATGTTTTGAATAATGATTTGGATATGAGACCTCTAAATGAAACAATGCTAGTACTGATCCTAAAATGCAATAACCCTCAGCGAGTGACTGATTTTAGGCCAATCTCTCTGTGTAATGTTTGTTATAAGATCATCTCAAAGGTGTTGGTTAATAGGCTGAAGAAGGTGCTTAATGCGATCATTTCACCCAACCAAAGTGCATTTATTCCTAAAAGACATATCACTGACAATGCAATATTGGGATATGAATGCATTCATTCCCTTAGAAGCAAGAAAGGAGGAAAATCCGGTTGGGCGGCTTTGAAACTTGACATGAGTAAGGCCTATGACCGTGTTGAATGGGCGTTCCTGGAAGAGATAATGTTGAGGCTAGGGTTTGATAAGGCCTCAGTTGATTTAATTATGAGGTGTGTTCGATCAGTTACGTTTTCCTTCAAGCTCAATGGAGAGAAAGTTGGCCATGTCACTCCCCAAAGGGGACTCTGTCAGGGGGACTCATTATCGCCATACTTGTTTCTCATGTGCGCAGAAGGGTTGTCTAGCCTTCTCCATCAGTTCTCAATTGAGAAAAAAGTCTTTGGACTATCTATTGCTCGGCGAAGCCCTCCTATCTCTCACATCTTCTTCGCGGTGACAGCCTCTTATTTTTTAAGGCCAAAATTAGTGAAGGAGCTTGCATTGGTGAATGTCTTAAACGGTATGAAGTTGCTTCGGGGCAGGTTATAAATTTCGATAAATCCGTTCTTGCATTTAGTCCAAACACGAATGAAGCTTTTAAGGAACAACTAAGGGATCTTCTTTCTGTTAGAGTTGTGGCCTGCCATCACCAATATTTGGGTCTCCCTTCCTTTCTATCCCGGAACAAAACTATGCATTTCAATTACATTAAGGACCGTGTGTGGAAGAACCTTCAAGGCTGGAAAAGTAGACTGTTTTCAATGGGAGGTAAGGAGGTGTTAATTAAAGCAGTTATACAGGCTATTCCGTGCTACACGATGTCTTGTTTTAGACTGCCGAAGAAACTTATTCAGGAGCTTAATCAACTGGTGGCTCGTTTTTGGTGGGGTGTAGATGGAGAGGACAGGAAGATCCATTGGGTTTGCTGGAAATTAATGTGTAAACCGAAATGCTTGGGAGGAATGGGCTTTAGGGATCTTGAAATTTTCAACAAGGCTCTGCTAGCAAAACAAGGTTGGAGGATTTTAAATGACCTGAACTCAATGCTTGCACAAGTCCTAAAGGGACGTTATTTCAAATAGGGTACCTTTTTATCTACAAAACTCGGTTGGAATCCTTCGTTCATTTGGAGAAATATTCTGTGGGGTCGGGATTTACTTAAAGAGGGAACCCGATGGCGCATTGGGAATGGTGAAAGGGTGAAGATTTATGGTGATAATTGGGTTCCTAATCAACCAACTCTGAAAATTCTATCTCGGCCCCAAATTCCCATTGACACAAACGTTAGTCATCTGATTGATAATGAGCTAGGCCAGTGGAAGGCAGATATAGTTCAAGATATTTTCTCGCCAGATGAAGCTAAGGGTATTATATTGATTCCTATTGGGATGAGCCATGGATGGGATCGTTTGATTTGGCACCATGAGAAATTGGGTATATACACAGTTAAAAGTGGATATAGAGCTGCTCAGATGGCATTATCAAATAACTTGGCCTCATCCTCTTCTACGGATGGCTTGGCGAATTGGTGGGGAGGAGTGTGGAATTTATCTTTCCCTAGTAAAATAAAAGTCTTCTTTTGGCGTTTGTGTTTAGACAGGCTCCCAATGAGAGCAAATTTGACTCAAAGGGAGGTGGATGTCCAAAATATTTGTGTTTTCTGTGGGAAGAAAGGAGAAGATTCTTACCATTTATTTTGGCTTCGTAAATATACCAAACATAAATGGATGAATTCTAAATTTCACCATCTCTCTGTTCTTCATCCTCAGGCACCTATGATTAATATCCTAAGGGACTGGCGAGACATGCTAAATTGGGAAGATTTTGAGGAATTGGTCGTGTTTTTGTGGGGTTTATGGAATCGTAGAAATGCATTTGTGTTTAATAAGAGAAGAGCTGAGGATGATGATTTGGCAGGATGGGTCAGTACTTACATTGCTACATTCAGGGCTACTAATACCAATCATGCAACAGCCAATCAAGACTAGCTTTCAACAGCTTTCACAAATACATCAAGCTCAAAATCACACTACTTGGTGCCCACCAGAAGAGGGGATCTTTAAATTAAACACAGATGCATCTTTCTCCTCTATCGATTTTAATGCAGGTCTGGGAGTCATCATCAGAGACCATAGAGGGCAAGTTCTAGCTTCGGCTACGAAATACCTAGAGCATGTGGCGTCCGGGGATGATGCTGAAGCGCTTGCTGCAGTGGAAGGCTTTCGTGTGGCAATGGAGACTGGAATTTCTCCGATCCTTTTGGTAACTGACTCTTTGCGTATCTACAACCTTTTGCTCGAGATAAAGAAGTCCTATCAGAGACGAGATCAATTATTGAATATGCGAAAACTCATCTTGCTACTAGATTGTAGGTATCCTACAGCTTCACAAAAAGAGGTGGAAATACGATCGCTCACCTTTTGGCGAGAAGAGCCCTCCGGTCTCAGGAAAATTTCGTCTGGCTTGAGGAGGGGCCGGAGGAGATCTCAAACACTCTAG
mRNA sequence
ATGACGTGTTTCTACGGACATCCTGAGGAAGAAAAGAGACAGCTCTCATGGGACCTTATGCGATGGCTGAAAGGGAGTGATTCAATCTCATGGCTGATAGGAAGAGACTTTAATGGCATATTAAGTCATGATGAGAAATGGGGGGGAAACCAGAAGAATGATAAGTTGATCAATGACTTCAGTCAGGCGATGGATGATTGCAATTTAATAGATTTAGGCTTCTCTGAGGGAACCCGATGGCGCATTGGGAATGGTGAAAGGGTGAAGATTTATGGTGATAATTGGGTTCCTAATCAACCAACTCTGAAAATTCTATCTCGGCCCCAAATTCCCATTGACACAAACGTTAGTCATCTGATTGATAATGAGCTAGGCCAGTGGAAGGCAGATATAGTTCAAGATATTTTCTCGCCAGATGAAGCTAAGGGTATTATATTGATTCCTATTGGGATGAGCCATGGATGGGATCGTTTGATTTGGCACCATGAGAAATTGGGTATATACACAGTTAAAAGTGGATATAGAGCTGCTCAGATGGCATTATCAAATAACTTGGCCTCATCCTCTTCTACGGATGGCTTGGCGAATTGGTGGGGAGGAGTGTGGAATTTATCTTTCCCTAGTAAAATAAAAGTCTTCTTTTGGCGTTTGTGTTTAGACAGGCTCCCAATGAGAGCAAATTTGACTCAAAGGGAGGTGGATGTCCAAAATATTTGTGTTTTCTGTGGGAAGAAAGGAGAAGATTCTTACCATTTATTTTGGCTTCGTAAATATACCAAACATAAATGGATGAATTCTAAATTTCACCATCTCTCTGTTCTTCATCCTCAGGCACCTATGATTAATATCCTAAGGGACTGGCGAGACATGCTAAATTGGGAAGATTTTGAGGAATTGGTCGTGTTTTTGTGGGGTTTATGGAATCGTAGAAATGCATTTGTGTTTAATAAGAGAAGAGCTGAGGATGATGATTTGGCAGGATGGGTCAGTCTGGGAGTCATCATCAGAGACCATAGAGGGCAAGTTCTAGCTTCGGCTACGAAATACCTAGAGCATGTGGCGTCCGGGGATGATGCTGAAGCGCTTGCTGCAGTGGAAGGCTTTCGTGTGGCAATGGAGACTGGAATTTCTCCGATCCTTTTGGTAACTGACTCTTTGCGTATCTACAACCTTTTGCTCGAGATAAAGAAGTCCTATCAGAGACGAGATCAATTATTGAATATGCGAAAACTCATCTTGCTACTAGATTGTAGGTATCCTACAGCTTCACAAAAAGAGGTGGAAATACGATCGCTCACCTTTTGGCGAGAAGAGCCCTCCGGTCTCAGGAAAATTTCGTCTGGCTTGAGGAGGGGCCGGAGGAGATCTCAAACACTCTAG
Coding sequence (CDS)
ATGACGTGTTTCTACGGACATCCTGAGGAAGAAAAGAGACAGCTCTCATGGGACCTTATGCGATGGCTGAAAGGGAGTGATTCAATCTCATGGCTGATAGGAAGAGACTTTAATGGCATATTAAGTCATGATGAGAAATGGGGGGGAAACCAGAAGAATGATAAGTTGATCAATGACTTCAGTCAGGCGATGGATGATTGCAATTTAATAGATTTAGGCTTCTCTGAGGGAACCCGATGGCGCATTGGGAATGGTGAAAGGGTGAAGATTTATGGTGATAATTGGGTTCCTAATCAACCAACTCTGAAAATTCTATCTCGGCCCCAAATTCCCATTGACACAAACGTTAGTCATCTGATTGATAATGAGCTAGGCCAGTGGAAGGCAGATATAGTTCAAGATATTTTCTCGCCAGATGAAGCTAAGGGTATTATATTGATTCCTATTGGGATGAGCCATGGATGGGATCGTTTGATTTGGCACCATGAGAAATTGGGTATATACACAGTTAAAAGTGGATATAGAGCTGCTCAGATGGCATTATCAAATAACTTGGCCTCATCCTCTTCTACGGATGGCTTGGCGAATTGGTGGGGAGGAGTGTGGAATTTATCTTTCCCTAGTAAAATAAAAGTCTTCTTTTGGCGTTTGTGTTTAGACAGGCTCCCAATGAGAGCAAATTTGACTCAAAGGGAGGTGGATGTCCAAAATATTTGTGTTTTCTGTGGGAAGAAAGGAGAAGATTCTTACCATTTATTTTGGCTTCGTAAATATACCAAACATAAATGGATGAATTCTAAATTTCACCATCTCTCTGTTCTTCATCCTCAGGCACCTATGATTAATATCCTAAGGGACTGGCGAGACATGCTAAATTGGGAAGATTTTGAGGAATTGGTCGTGTTTTTGTGGGGTTTATGGAATCGTAGAAATGCATTTGTGTTTAATAAGAGAAGAGCTGAGGATGATGATTTGGCAGGATGGGTCAGTCTGGGAGTCATCATCAGAGACCATAGAGGGCAAGTTCTAGCTTCGGCTACGAAATACCTAGAGCATGTGGCGTCCGGGGATGATGCTGAAGCGCTTGCTGCAGTGGAAGGCTTTCGTGTGGCAATGGAGACTGGAATTTCTCCGATCCTTTTGGTAACTGACTCTTTGCGTATCTACAACCTTTTGCTCGAGATAAAGAAGTCCTATCAGAGACGAGATCAATTATTGAATATGCGAAAACTCATCTTGCTACTAGATTGTAGGTATCCTACAGCTTCACAAAAAGAGGTGGAAATACGATCGCTCACCTTTTGGCGAGAAGAGCCCTCCGGTCTCAGGAAAATTTCGTCTGGCTTGAGGAGGGGCCGGAGGAGATCTCAAACACTCTAG
Protein sequence
MTCFYGHPEEEKRQLSWDLMRWLKGSDSISWLIGRDFNGILSHDEKWGGNQKNDKLINDFSQAMDDCNLIDLGFSEGTRWRIGNGERVKIYGDNWVPNQPTLKILSRPQIPIDTNVSHLIDNELGQWKADIVQDIFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNNLASSSSTDGLANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHLFWLRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNWEDFEELVVFLWGLWNRRNAFVFNKRRAEDDDLAGWVSLGVIIRDHRGQVLASATKYLEHVASGDDAEALAAVEGFRVAMETGISPILLVTDSLRIYNLLLEIKKSYQRRDQLLNMRKLILLLDCRYPTASQKEVEIRSLTFWREEPSGLRKISSGLRRGRRRSQTL
Homology
BLAST of Moc09g10720 vs. NCBI nr
Match:
XP_022150918.1 (uncharacterized protein LOC111018954 [Momordica charantia])
HSP 1 Score: 322.4 bits (825), Expect = 6.3e-84
Identity = 170/363 (46.83%), Postives = 224/363 (61.71%), Query Frame = 0
Query: 76 EGTRWRIGNGERVKIYGDNWVPNQPTLKILSRPQIPIDTNVSHLIDNELGQWKADIVQDI 135
+G RWRIGNG+ V IYGDNWVPNQPTLKILS P++P+ + VS L+D+E G W+ D+V+D
Sbjct: 713 KGLRWRIGNGDSVFIYGDNWVPNQPTLKILSSPRLPLVSRVSSLVDHEEGGWQGDVVRDE 772
Query: 136 FSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNN----LASSSST 195
F+PDEAKGI+ IPIG DRLIW++EK G+Y+V+SGY+ +AL NN SSSS+
Sbjct: 773 FTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK---VALLNNPCVQAPSSSSS 832
Query: 196 DGLANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYH 255
+ + WW G W + P+KIKVF WRLCLDRLP NL++R V++ N C FCG+ GEDS H
Sbjct: 833 EEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIH 892
Query: 256 LFWLRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNWEDFEELVVFLWGLWNRRN 315
LFW+ K+ + W+NSKF LS P + ILR+ + L+ DFEEL V +WGLWN+RN
Sbjct: 893 LFWICKFAEALWINSKFGKLS------PFL-ILRESHESLSKADFEELCVVIWGLWNQRN 952
Query: 316 AFVFNK-----------------------RRAEDDDLAGWVS------------------ 375
A FN R A+ + + G V+
Sbjct: 953 ARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQPPDEGIYKIN 1012
Query: 376 -------------LGVIIRDHRGQVLASATKYLEHVASGDDAEALAAVEGFRVAMETGIS 381
LG+II + RGQV+A+ATKYLE++ S D AEA+AAVEG ++A E G+
Sbjct: 1013 TDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEIGMH 1065
BLAST of Moc09g10720 vs. NCBI nr
Match:
XP_022143319.1 (uncharacterized protein LOC111013220 [Momordica charantia])
HSP 1 Score: 208.0 bits (528), Expect = 1.7e-49
Identity = 102/194 (52.58%), Postives = 128/194 (65.98%), Query Frame = 0
Query: 135 IFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNNLASSSSTDGL 194
+F+ DE K I+ IP+G+ DRLIW+ EK GI TVKS Y+ A M + AS+S ++ L
Sbjct: 1 MFTYDEVKTILSIPLGIGLAADRLIWNFEKNGICTVKSDYKLAHMQSPDTSASTSLSECL 60
Query: 195 ANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHLFW 254
A WW VW L+ PSKIKVFFWR CLDRLP ANL R VDV + FCGKKGED+ HLFW
Sbjct: 61 AKWWKDVWQLNLPSKIKVFFWRPCLDRLPTGANLILRGVDVPDCYAFCGKKGEDALHLFW 120
Query: 255 LRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNWEDFEELVVFLWGLWNRRNAFV 314
K +K++ SKF HL ++++LRD +L+W DFEELVVFLWG+W++RN V
Sbjct: 121 TCKVSKNQRQVSKFSHLPQDVRPLSLLHLLRDCEGILSWSDFEELVVFLWGIWSKRNVKV 180
Query: 315 FNKRRAEDDDLAGW 329
F R DL GW
Sbjct: 181 FLNGREMVRDLDGW 194
BLAST of Moc09g10720 vs. NCBI nr
Match:
XP_023925698.1 (uncharacterized protein LOC112037116 [Quercus suber])
HSP 1 Score: 193.4 bits (490), Expect = 4.4e-45
Identity = 109/321 (33.96%), Postives = 164/321 (51.09%), Query Frame = 0
Query: 76 EGTRWRIGNGERVKIYGDNWVPNQPTLKILSRPQ-IPIDTNVSHLIDNELGQWKADIVQD 135
+G+ WR+GNG +++ GD W+PN PT K+L Q + D V LID W D +
Sbjct: 470 QGSCWRVGNGASIRVLGDKWLPNHPTKKVLLPIQSVDNDLIVEELIDPVTRWWNRDFIMQ 529
Query: 136 IFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNNLASSSSTDGL 195
F+ ++A+ I+ +P+ + D L W K G YTV+SGY+ A+ L + S++G+
Sbjct: 530 NFNHEDAEAILRVPLSRRYISDSLFWTVNKSGEYTVRSGYQVAR-KLQKEADWAESSNGV 589
Query: 196 ANW--WGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHL 255
W +W L P+KIKVF WR C + LP R NL QR V N C C + E H
Sbjct: 590 VGGLVWRTLWKLKVPNKIKVFGWRACRNILPTRVNLVQRRVIQDNKCEACKIEAETGIHA 649
Query: 256 FWLRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNWEDFEELVVFLWGLWNRRNA 315
W + W QA M+ ++ + + L+ E+ E+ +V W +WN+RN
Sbjct: 650 LWNCGVARDVWAGYTARVQKCSGDQADMLQLMEEMINRLSTEELEQFLVQSWIIWNQRNG 709
Query: 316 FVFNKR-RAEDDDLAGWVSLGVIIRDHRGQVLASATKYLEHVASGDDAEALAAVEGFRVA 375
+ K+ +A ++D G +G IIR+ RG+V+AS + V +DAE LA A
Sbjct: 710 LIHGKKLQAPEED--GTSGIGAIIRNDRGEVMASLSAKGPPVTCSEDAEILACRRAVNFA 769
Query: 376 METGISPILLVTDSLRIYNLL 393
ME G S ++L D+ I L
Sbjct: 770 MECGFSELVLEGDNQAIMTAL 787
BLAST of Moc09g10720 vs. NCBI nr
Match:
XP_030969741.1 (uncharacterized protein LOC115990018 [Quercus lobata])
HSP 1 Score: 188.3 bits (477), Expect = 1.4e-43
Identity = 101/321 (31.46%), Postives = 159/321 (49.53%), Query Frame = 0
Query: 76 EGTRWRIGNGERVKIYGDNWVPNQPTLKILS-RPQIPIDTNVSHLIDNELGQWKADIVQD 135
+G W IG GE V+I D W+P + ++S P + D VS LID + WK + VQ
Sbjct: 588 QGMVWHIGTGEAVRIKEDRWLPGRANCSVISPLPSLVPDVKVSTLIDQDTNAWKTEAVQQ 647
Query: 136 IFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNNLASSSSTDGL 195
+F P EA+ I+ IP+ DR+IW H G++T S Y+ +++ A SS+ +
Sbjct: 648 LFLPQEAEIILGIPLSTRRPVDRIIWAHTPSGMFTTCSAYKLLVSCDASSSAGSSNPEAQ 707
Query: 196 ANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHLFW 255
+W G+W L P+KI+ F W +C + LP NL +R++ C C EDS H W
Sbjct: 708 KKFWKGIWQLRVPNKIRHFVWGICNNALPTMVNLHRRQIVPSASCALCNVLPEDSLHAVW 767
Query: 256 LRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNWEDF--EELVVFLWGLWNRRNA 315
+ W + H + +L + + N E+F E V+ +W LWNRRNA
Sbjct: 768 YCEAISGAWSTLDWFHQTAPPRPTSFTELLSSF--LCNKEEFRAEIFVIMVWLLWNRRNA 827
Query: 316 FVFNKRRAEDDDLAGWVSLGVIIRDHRGQVLASATKYLEHVASGDDAEALAAVEGFRVAM 375
F + +G+I R+H G+ + + + + S D EALA ++ + A+
Sbjct: 828 IQFGHPPLPVASICSSAGIGIIARNHVGEAVGALSSPIPMAQSVADIEALACLKAVQFAL 887
Query: 376 ETGISPILLVTDSLRIYNLLL 394
E G++ +++ DS I N LL
Sbjct: 888 EIGLNRVVIEGDSAVIINALL 906
BLAST of Moc09g10720 vs. NCBI nr
Match:
KAF8408042.1 (hypothetical protein HHK36_007182 [Tetracentron sinense])
HSP 1 Score: 185.7 bits (470), Expect = 9.3e-43
Identity = 141/535 (26.36%), Postives = 224/535 (41.87%), Query Frame = 0
Query: 1 MTCFYGHPEEEKRQLSWDLMRWLKGSDSISWLIGRDFNGILSHDEKWGGNQKNDKLINDF 60
+T YGHPE K+ +W+L+R+L S S+ W+ DFN I +EK G +K + F
Sbjct: 103 LTGMYGHPEAAKKWETWELIRYLSRSYSMPWVCFGDFNEITCAEEKSGRVEKAAWKMRKF 162
Query: 61 SQAMDDCNLIDLGFSEGT-RW---RIGNG------------------------------- 120
+A+ DC+LI LGF T W R G G
Sbjct: 163 KEAILDCHLIGLGFEGNTFTWCNKRSGEGNVRERLDRAMATSDWCFLFPFTTVKHLSCHT 222
Query: 121 ------------ERVKI--------YGDNWVPNQPTLKIL----------SRPQIPIDTN 180
E KI + W+ + +I+ S +P D
Sbjct: 223 SDHSPLLLAFDKEAPKIARKKRSFRFEAMWIHSPECAEIIDSAWTGCHQVSAQVLPRDAK 282
Query: 181 VSHLIDNELGQWKADIVQDIFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYR 240
VS LID + W ++ +F P EA+ I IP+ D+ +WH G ++V+S Y
Sbjct: 283 VSLLIDKDQKTWNHTLLMTVFMPHEAELISSIPLSERLPPDKRVWHFTSKG-FSVRSAYH 342
Query: 241 AAQMALSNNLASSSSTDGLANW--------WGGVWNLSFPSKIKVFFWRLCLDRLPMRAN 300
A+SSST L +W W VW L+ P K+K+F W++ L+ LP+RAN
Sbjct: 343 LTSTLRDRESATSSSTSSL-SWNGSLSGIKWSQVWQLAIPPKVKIFIWKVALNILPVRAN 402
Query: 301 LTQREVDVQNICVFCGKKGEDSYHLFWLRKYTKHKWMNSKFHHLSVLHPQAPMINILRDW 360
L +R++ V+N+C CG++GE H+ Y + W+ S+ L A + L W
Sbjct: 403 LCKRKIPVENVCGVCGEEGETILHVLKNCHYARQVWLLSQLG----LRSDATSADSLSSW 462
Query: 361 RDMLNWEDFEE----LVVFLWGLWNRRNAFVF------------------------NKRR 411
+ + EE + W +W RN ++F N R
Sbjct: 463 VEEIMKSHGEEGLSAFFMIAWSIWKHRNEYIFSGVKMTPFNCVQRANKLLADFHNANDRA 522
BLAST of Moc09g10720 vs. ExPASy TrEMBL
Match:
A0A6J1DAR4 (uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018954 PE=4 SV=1)
HSP 1 Score: 322.4 bits (825), Expect = 3.1e-84
Identity = 170/363 (46.83%), Postives = 224/363 (61.71%), Query Frame = 0
Query: 76 EGTRWRIGNGERVKIYGDNWVPNQPTLKILSRPQIPIDTNVSHLIDNELGQWKADIVQDI 135
+G RWRIGNG+ V IYGDNWVPNQPTLKILS P++P+ + VS L+D+E G W+ D+V+D
Sbjct: 713 KGLRWRIGNGDSVFIYGDNWVPNQPTLKILSSPRLPLVSRVSSLVDHEEGGWQGDVVRDE 772
Query: 136 FSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNN----LASSSST 195
F+PDEAKGI+ IPIG DRLIW++EK G+Y+V+SGY+ +AL NN SSSS+
Sbjct: 773 FTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK---VALLNNPCVQAPSSSSS 832
Query: 196 DGLANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYH 255
+ + WW G W + P+KIKVF WRLCLDRLP NL++R V++ N C FCG+ GEDS H
Sbjct: 833 EEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIH 892
Query: 256 LFWLRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNWEDFEELVVFLWGLWNRRN 315
LFW+ K+ + W+NSKF LS P + ILR+ + L+ DFEEL V +WGLWN+RN
Sbjct: 893 LFWICKFAEALWINSKFGKLS------PFL-ILRESHESLSKADFEELCVVIWGLWNQRN 952
Query: 316 AFVFNK-----------------------RRAEDDDLAGWVS------------------ 375
A FN R A+ + + G V+
Sbjct: 953 ARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQPPDEGIYKIN 1012
Query: 376 -------------LGVIIRDHRGQVLASATKYLEHVASGDDAEALAAVEGFRVAMETGIS 381
LG+II + RGQV+A+ATKYLE++ S D AEA+AAVEG ++A E G+
Sbjct: 1013 TDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEIGMH 1065
BLAST of Moc09g10720 vs. ExPASy TrEMBL
Match:
A0A6J1CNZ5 (uncharacterized protein LOC111013220 OS=Momordica charantia OX=3673 GN=LOC111013220 PE=4 SV=1)
HSP 1 Score: 208.0 bits (528), Expect = 8.4e-50
Identity = 102/194 (52.58%), Postives = 128/194 (65.98%), Query Frame = 0
Query: 135 IFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNNLASSSSTDGL 194
+F+ DE K I+ IP+G+ DRLIW+ EK GI TVKS Y+ A M + AS+S ++ L
Sbjct: 1 MFTYDEVKTILSIPLGIGLAADRLIWNFEKNGICTVKSDYKLAHMQSPDTSASTSLSECL 60
Query: 195 ANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHLFW 254
A WW VW L+ PSKIKVFFWR CLDRLP ANL R VDV + FCGKKGED+ HLFW
Sbjct: 61 AKWWKDVWQLNLPSKIKVFFWRPCLDRLPTGANLILRGVDVPDCYAFCGKKGEDALHLFW 120
Query: 255 LRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNWEDFEELVVFLWGLWNRRNAFV 314
K +K++ SKF HL ++++LRD +L+W DFEELVVFLWG+W++RN V
Sbjct: 121 TCKVSKNQRQVSKFSHLPQDVRPLSLLHLLRDCEGILSWSDFEELVVFLWGIWSKRNVKV 180
Query: 315 FNKRRAEDDDLAGW 329
F R DL GW
Sbjct: 181 FLNGREMVRDLDGW 194
BLAST of Moc09g10720 vs. ExPASy TrEMBL
Match:
A0A6J1CDQ4 (uncharacterized protein LOC111010533 OS=Momordica charantia OX=3673 GN=LOC111010533 PE=4 SV=1)
HSP 1 Score: 184.1 bits (466), Expect = 1.3e-42
Identity = 108/195 (55.38%), Postives = 113/195 (57.95%), Query Frame = 0
Query: 280 MINILRDWRDMLNWEDFEELVVFLWGLWNRRNAFVFNKRRAEDDDLAGWVS--------- 339
MINILRDWRDMLNW+DFEELVVFLW LWNRRNAFVFNKRR EDDDLAGWVS
Sbjct: 1 MINILRDWRDMLNWKDFEELVVFLWSLWNRRNAFVFNKRRVEDDDLAGWVSTYIATFKAT 60
Query: 340 ------------------------------------------------------LGV-II 399
LGV II
Sbjct: 61 NTNHATANQDVSQSFQQSSQIHQAQNHTIWCPAEEGVFKLKTDASFSSIDFNAGLGVIII 120
Query: 400 RDHRGQVLASATKYLEHVASGDDAEALAAVEGFRVAMETGISPILLVTDSLRIYNLLLEI 411
RDHRGQVLASATKYLEHVAS DDAEALAAVEG RVAMETGISPILL TDSLRIYNL
Sbjct: 121 RDHRGQVLASATKYLEHVASVDDAEALAAVEGLRVAMETGISPILLETDSLRIYNLFARD 180
BLAST of Moc09g10720 vs. ExPASy TrEMBL
Match:
M5W5F3 (Reverse transcriptase domain-containing protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa026368mg PE=4 SV=1)
HSP 1 Score: 183.3 bits (464), Expect = 2.2e-42
Identity = 108/372 (29.03%), Postives = 171/372 (45.97%), Query Frame = 0
Query: 75 SEGTRWRIGNGERVKIYGDNWVPNQPTLKILSRPQIPIDTNVSHLIDNELGQWKADIVQD 134
++G RWR+G+G +++Y D W+P KI+S PQ+P+ T V L + GQW +++D
Sbjct: 622 NKGLRWRVGSGVSIQVYTDKWLPAPSCFKIMSPPQLPLSTRVCDLFTSS-GQWNVPLLKD 681
Query: 135 IFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNNLAS-SSSTDG 194
IF E I+ IP+ G D LIWH+E+ G+Y+VKSGYR A + S+ D
Sbjct: 682 IFWDQEVDAILQIPLASLAGHDCLIWHYERNGMYSVKSGYRLAGLEKDKMSGEPSARVDL 741
Query: 195 LANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHLF 254
+ +W +W L P+KIK F WR D LP L R++ IC C +K E H
Sbjct: 742 NSKFWKKIWALKIPNKIKFFLWRCAWDFLPCGQILFNRKIAPTPICPKCHRKAESVLHAV 801
Query: 255 WLRKYTKHKWMNSKFHHLSVLHPQAPMINILRD-WRDMLNWEDFEELVVF---LWGLWNR 314
WL + K W NS + ++ + +N R+ W + EE +F WGLWNR
Sbjct: 802 WLCEAAKEVWRNSAWGNVC----EVWRVNSFRELWHALQLSSSGEEQGLFAYLCWGLWNR 861
Query: 315 RNAFVF--------------------------------NKRRAEDDDLAGW--------- 374
RN+F+F ++ + L GW
Sbjct: 862 RNSFIFEGKSETAIQLLSRMTKLAQEFSDANNILHTIHGRQSSPQAPLQGWRPPPAVKSG 921
Query: 375 ---VSLGVIIRDHRGQVLASATKYLEHVASGDDAEALAAVEGFRVAMETGISPILLVTDS 398
+GV++R+ G+ +A+ + + E +A +EG R A++ G + +L D+
Sbjct: 922 DSVRGVGVVVRNANGEFMAACVRRIHASYGARQTELMATIEGLRFAIDMGFTDAILEMDA 981
BLAST of Moc09g10720 vs. ExPASy TrEMBL
Match:
A0A251NPF0 (Reverse transcriptase domain-containing protein OS=Prunus persica OX=3760 GN=PRUPE_6G123900 PE=4 SV=1)
HSP 1 Score: 183.3 bits (464), Expect = 2.2e-42
Identity = 108/372 (29.03%), Postives = 171/372 (45.97%), Query Frame = 0
Query: 75 SEGTRWRIGNGERVKIYGDNWVPNQPTLKILSRPQIPIDTNVSHLIDNELGQWKADIVQD 134
++G RWR+G+G +++Y D W+P KI+S PQ+P+ T V L + GQW +++D
Sbjct: 585 NKGLRWRVGSGVSIQVYTDKWLPAPSCFKIMSPPQLPLSTRVCDLFTSS-GQWNVPLLKD 644
Query: 135 IFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNNLAS-SSSTDG 194
IF E I+ IP+ G D LIWH+E+ G+Y+VKSGYR A + S+ D
Sbjct: 645 IFWDQEVDAILQIPLASLAGHDCLIWHYERNGMYSVKSGYRLAGLEKDKMSGEPSARVDL 704
Query: 195 LANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHLF 254
+ +W +W L P+KIK F WR D LP L R++ IC C +K E H
Sbjct: 705 NSKFWKKIWALKIPNKIKFFLWRCAWDFLPCGQILFNRKIAPTPICPKCHRKAESVLHAV 764
Query: 255 WLRKYTKHKWMNSKFHHLSVLHPQAPMINILRD-WRDMLNWEDFEELVVF---LWGLWNR 314
WL + K W NS + ++ + +N R+ W + EE +F WGLWNR
Sbjct: 765 WLCEAAKEVWRNSAWGNVC----EVWRVNSFRELWHALQLSSSGEEQGLFAYLCWGLWNR 824
Query: 315 RNAFVF--------------------------------NKRRAEDDDLAGW--------- 374
RN+F+F ++ + L GW
Sbjct: 825 RNSFIFEGKSETAIQLLSRMTKLAQEFSDANNILHTIHGRQSSPQAPLQGWRPPPAVKSG 884
Query: 375 ---VSLGVIIRDHRGQVLASATKYLEHVASGDDAEALAAVEGFRVAMETGISPILLVTDS 398
+GV++R+ G+ +A+ + + E +A +EG R A++ G + +L D+
Sbjct: 885 DSVRGVGVVVRNANGEFMAACVRRIHASYGARQTELMATIEGLRFAIDMGFTDAILEMDA 944
BLAST of Moc09g10720 vs. TAIR 10
Match:
AT3G09510.1 (Ribonuclease H-like superfamily protein )
HSP 1 Score: 89.4 bits (220), Expect = 8.4e-18
Identity = 64/254 (25.20%), Postives = 110/254 (43.31%), Query Frame = 0
Query: 76 EGTRWRIGNGERVKIYGDNWVPNQPTLKILSRPQIPIDTNVSHLIDNELGQ--WKADIVQ 135
+GTR IG+G+ ++I DN V + P + L+ + + +++L + + W +
Sbjct: 36 KGTRHLIGDGQNIRIGLDNIVDSHPP-RPLNTEETYKEMTINNLFERKGSYYFWDDSKIS 95
Query: 136 DIFSPDEAKGIILIPIGMSHGWDRLIWHHEKLGIYTVKSGYRAAQMALSNNLASSSSTDG 195
+ I I + S D++IW++ G YTV+SGY S N+ + + G
Sbjct: 96 QFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHG 155
Query: 196 LANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHLF 255
+ +WNL K+K F WR L LT R + + C C ++ E H
Sbjct: 156 SIDLKTRIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHAL 215
Query: 256 WLRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNW------EDFEEL--VVFLWG 315
+ + W + S++ Q + + ++LN+ DF +L V +W
Sbjct: 216 FTCPFATMAW---RLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWR 275
Query: 316 LWNRRNAFVFNKRR 320
+W RN VFNK R
Sbjct: 276 IWKARNNVVFNKFR 285
BLAST of Moc09g10720 vs. TAIR 10
Match:
AT1G33710.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )
HSP 1 Score: 48.9 bits (115), Expect = 1.3e-05
Identity = 33/134 (24.63%), Postives = 53/134 (39.55%), Query Frame = 0
Query: 194 LANWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFCGKKGEDSYHLF 253
+ +W VW K W LDRLP + L + +Q C C ED HLF
Sbjct: 45 VVSWAKTVWFKGATPKHAFHMWVTNLDRLPTKTRLASWGMQLQTTCGLCSLDIEDRDHLF 104
Query: 254 WLRKYTKHKWMNSKFHHLSVLHPQAPMINILRDWRDMLNW---------EDFEELVV--F 313
++ W H + + P + + W D+++W +L+V
Sbjct: 105 LTCEFACFLW------HTVSVRLELPAFSFV-VWNDLMDWTLQRNRRSPPTLRKLIVQSV 164
Query: 314 LWGLWNRRNAFVFN 317
L+ +W +RN F+ N
Sbjct: 165 LYAIWKQRNNFLHN 171
BLAST of Moc09g10720 vs. TAIR 10
Match:
AT3G26855.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )
HSP 1 Score: 45.4 bits (106), Expect = 1.4e-04
Identity = 16/47 (34.04%), Postives = 25/47 (53.19%), Query Frame = 0
Query: 196 NWWGGVWNLSFPSKIKVFFWRLCLDRLPMRANLTQREVDVQNICVFC 243
NW G +W+L KIK+ W+ + LP+ A L R + ++ C C
Sbjct: 4 NWIGDIWSLKISPKIKLLIWKALNNALPVGAQLLSRNISIEPFCTRC 50
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DAR4 | 3.1e-84 | 46.83 | uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1CNZ5 | 8.4e-50 | 52.58 | uncharacterized protein LOC111013220 OS=Momordica charantia OX=3673 GN=LOC111013... | [more] |
A0A6J1CDQ4 | 1.3e-42 | 55.38 | uncharacterized protein LOC111010533 OS=Momordica charantia OX=3673 GN=LOC111010... | [more] |
M5W5F3 | 2.2e-42 | 29.03 | Reverse transcriptase domain-containing protein (Fragment) OS=Prunus persica OX=... | [more] |
A0A251NPF0 | 2.2e-42 | 29.03 | Reverse transcriptase domain-containing protein OS=Prunus persica OX=3760 GN=PRU... | [more] |
Match Name | E-value | Identity | Description | |
AT3G09510.1 | 8.4e-18 | 25.20 | Ribonuclease H-like superfamily protein | [more] |
AT1G33710.1 | 1.3e-05 | 24.63 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein | [more] |
AT3G26855.1 | 1.4e-04 | 34.04 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein | [more] |