Moc03g01560 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc03g01560
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Locationchr3: 1189001 .. 1195439 (+)
RNA-Seq ExpressionMoc03g01560
SyntenyMoc03g01560
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAAGTTCGTAGCGCCATGGCGCTACGGTACAGCGCTGCGGCGCTGCTAATTCGGGATTTTGCTATCGATTCTGACCTGTACGCGCGCCATGGCGCTAAATGGTGGCACTGCGGCGCTGATGTGGTTTCAACAAGCTCCTCTTTTTCGCTCTAGTTTCGCTCAAATGATCGCTTATTCTTCAATTCTTTCTAATAGCTTGATCCTATAAAAAAACATCAGTTATGCACGTAAATAAGCCACAAATGGCCGAAAATTGTTAAATGATTTCTACCTGAATATCATGAAATGTGGTGCATAAGGTATACTATCAGTCAGATGGTGAAGGATTGCATCAAGAAGTATAAGGTAGCTTCGGGTCAGTTGGTGAATTTTGATAAATCTGTTATAGCTTTCAGCCTGAATACCAAGACAGACAGTCAAGCCTTATTCAAAATATCTTATCTGTTAATATGGTGGAATGCCAGCTGCAATACTTGGGCCTCCCGACGTTTATGCCTAGAAATAGAAGAATGCATTTTAATTATATTAAAGATCGTGTTTGGAAGCACCTTCAAGGGTGGAAAGCGAAATTATTTTCCATTGGTGGGAAGGAGGTGCTTATTAAAGCTGTTGCCCAGGCTATTCCTTGCTATACTATGTCTTGTTTTCGGTTGCCAAAAAGGCTTATTAGAGAGTTTCATCATATTACGGCGCGATTTTGGTGGGGATCGAGTAAGGAAGATAAAAAAATTCATTGGGTGGCCTGGAATTCCTTGTATTTGCCAAAATGTGAGGGGGGAATGGGCTTTCGTGATTTGGAATTGTTTAATAAAGCTTTGCTTGCAAAGCAATGTTGGCGGATTCTTAATCATCCCAATTCGATGCTATCGCGTGTGCTAAAGGGCCGTTATTTTAAAGATTGCAGCTTTATGGAAGCAAAAATTAGTGGAAATCCTTCTTATATATGGAGAAGCATATTATGGGGTCGAGATCTATTAAAAAAGGGGTTGCGATGGCGGATTGGGAATGGTGACAGTGTTTTTATATATGGTGACAATTGGGTCCCAAATCAACCCACTCTCAAGATCCTTTCAAGTCCAAGGCTTCCTCTGGTTTCAAGAGTGAGTTCCTTGGTTGATCATGAGGAGGGTGGTTGGCAGGGGGATGTTGTTCGAGATGAATTTACTCCAGATGAGGCTAAAGGAATTCTATCTATTCCAATTGGTAGGGGTGCGGAAGAGGATAGGCTAATCTGGAACTATGAGAAGACAGGGGTGTACTCGGTCAGAAGTGGGTATAAGGTAGCTTTGCTGAATAATCCATGTGTTCAGGCCCCCTCCTCTTCATCTTCTGAGGAGGTTCGATGTTGGTGGAACGGTTTCTGGAAAATGCACATCCCTAATAAAATAAAAGTGTTTCTATGGCGTCTATGTCTAGATAGACTTCCAACAGGATGCAATCTATCTAAAAGGGGTGTGGAGATTACAAATTGCTGCTATTTTTGTGGGAGGAATGGTGAAGACAGTATCCATCTGTTTTGGATATGCAAATTTGCTGAAGCTTTGTGGATTAATTCAAAATTTGGAAAGCTATCTCCATTTCTTATTCTTCGAGAGTCGCATGAAAGTCTAAGCAAGGCGGATTTTGAGGAATTATGTGTTGTTATTTGGGGGCTGTGGAATCAAAGGAATGCACGAGCTTTCAATGATAGTACAAAAACAGTTTTTAAGATAGGAATGGAGCTTGTTGAATGGGCAAATAAATATGCTATGGAGTTTAGGGAAGCTAAATCTAATCCTATTACTGGGAGAGTTACAAATACAGCAGAGATTTTATGGCAACCACCGGACGAAGGAATATATAAAATTAACACTGATGCCTCATTTTTAGCTTCAGATCAACATGCAGGATTGGGAATCATCATCCATAATGACAGAGGGCAAGTTATGGCTGCAGCTACGAAGTACTTGGAGAATATTCAATCGGTGGATATGGCAGAAGCGATTGCTGCAGTGGAGGGACTTCAACTGGCGTCGGAAATTGGTATGCACCCAGTGATTTTGGAGACGTATTCATCTCGTATTTTCAATCTTTTCTCTTAGGCTTTGGAGGACCTGTCGGAAACGGGAGAAATCGTTTTGAAGGCGAAGAATTTCTGGACTCAATCTTTACATGCAAGTTTCAATTTCGTGAAGAGGGAGGGTAATAAAGCGGCACACATGTTGGCTCGACGGGCTCTCCTTCTTCATGAGTTTTCGATCTGGATGGAGGATTGGCCACTGGAGCTGAAGAGCTGTTTAGAAATGGAATGTTTGGAGGAGCTTCTGTAATGTTTTGCTTTCCTTTTCTTTCAAAAAAAAAATACTAGTCTATTAATTACCACATGCATTGCATACTGTATTTTTTTCATTTAATTTAAATTAATTTTATAAAATGACACTAATTATTTGGGAAAAAATAATAGTAACCAAAGATAAAACTAAGAATGTGTCTATATAGTTGCTAGACAAATACCATTCTAAATTTAGTTTGGTAATTGTTGGACAAACACCATTATACAACTTCAAGTGAGAAATTGTAACGTCTTAAGCTAGTGGTGAAAGAATGGTGGAAAAGTTGTACTCAACCCAAAAAAAACAAAAACAGAATGGTTGAAAAGTTTTTGTTATAGTGAGTTTTCATGTAGAAATTTGAGCTCCTGGGATTCTGCTAAATTCGTACCAATCTTCTAGAATATAAGGGACCGACGAAATCTACAAAACCAAATAGCACTGAGTATCGGTGTGCCTCTTGCCACACAGGCTCCAATGCTTAAGTCAGAATTGGAGTAATTGGGTATGGAGGGTAGGATATTTCAGCGTACCTTTAGGGTTCAGCTTCCTGTTATATTTATAGAGGTCGAATCTTGATGCCCTTAGGAAACATACCTTGTTGGGTTGCGGGCATCTTGTGGGCTGCATGTTGAATTAACAGATCGGTAACCTGTAGTAGACTTTCCAATAGTCAGATATGGTCGGCCGAGATTTGGTCGGCTGAGACCTGGTTAGCCGAGGTCGACTAGGACCTAGTCGGTCAGGATCTCATGGAGCCCTGGATCTGGTCATGCCTTGGACATAATCATGCCTTTGATCTAATCATGCCCTGGATCTAGTCATGCCTTGGATCTATTGCCCTGAATCTGGTCATACCTTGGTTCTGATCATGCCTCGGCCCCGGTCACGCCTCGAGCTTGGTCATGGCCTGGATCTAGTCATGCCCCGGGTTTGGTCATGCCTCAGGTTTTGTCATGCCCTAAATCTAGTCATGCGTTGGGCCTGTGATGTAACTTGCCTGTTGAGTCTAACTTTGTCTTCCCACAACAGTAGCTCCACTCTCAAAATAGAAGGCAATAAGTATAGATAAAATACTAGACTTCTATATTGAGAAAACTTGTTTTGTTTTATAAGACCTCGTTGTTCATGGGCCGACCATTTTTATGGAACAGCCCTCATAATTGATATCGTTTTTCGTAATTAGTTTTCAATAACTCTTCTACTTGCTTGGTCATAACTTCATTCTTTGCTTAGTTGAAGGGTTGATGCTTCTACCTTACTGGTTTAAGTCTCGATTGATCCCAGACATGCCTTCGTGTGATCATACAAACACATCAGTTGCTTTTGAGAAAATTGATGAGACTCGCTCTCTCCTCAACTCCCGGCTTGGTTTCGATGTTGATTCGTCGCTCTTCAAATGTAAGAGAGATGGATTTCTGATCTTCAGCGGGGTTCTCTCTTCCTAAAGTGGGTGACTCCATGTAACTGAGGCTTGGGTCGACCTTCTCCTCTCCATCGATCGAATTGTCATTCACTGGAAACACTAAATTGGTTGCACGATCTGACCACATGGCGTGAGATGTTCACATTCTACTGGTGGTTAAGTTGTTATCCAGTGGAGACATCGAGTCAATGGCATGACATGTTCTCATCTCATCACAAACATCTCATCGTTCACCTTCAAGAAACGAATACGAGGTGGTGGAGGTCGAAAATCCAGTCGCTGAGACCTAGTCGTTGGTTGGTCGAGACCTGGTCGGTCGGGTATCGGTTCTTTGTTGGCTAGGCGCAGTTCGTTATTTAGGCCTCTAGGTCTTTGCTCTTCTCTCACTTGGGCCCCCTCGCTTTCCCCTTGTTTGTAGGCTTGTGTCTCGTTTTTTTGGCCCAACCACATTCTCTATAACATAAGCCCTCACTCTTAAAGTATAAGTATTTTTCAAATTGTGCATTATGCTTTGAGAGTATTTTTTGTATTTGTTGCTCAGTTCTTTTGTCGAAGCTAGCTTTTAATCGTCGAGAATAAACAACTTCAAAGTTTGTCAATCTCTTAGTTTCGTCGCTTTATTATTGTACTAGTCAAGTCTCTTGGGCACTACTACCCGATTTAATTTAACCGTTAGTTTTTCGTTGGCCTTGGGATTAAACTTAGCTGAGTTGCCTTTACCACCCTTGGGTTTCTAACCTATTTGGCTTTTGCCTATGCTTTTACACCCTCGGTTTAATCAATACCTTTGGGTTTATAACCTATTTGGCTTTTACCCAGTGTTTTATACTATGAATTTAATCAACACCCTCGGGTTTGTAACTTGTTTTGGCTTTTCACCAATGGTTTTATACCCTGGGTTTTGTTGATTGTTTTGGCTCATTTGCCACTTTTTGTTTTAACCGTGCTAGGCTCGTACGCATGCCTTTTTCAGTACCTTTGGGGGATTTTTTTATCATTTAGGCTTAATCCGCCACTGATCTTCCCAAGTTTCACTGACTGGGTTTTTAATTATTGGCTTGCTTATGCTCTGTCCCAATTCGAGTTCCTTAACCAGTCATCATGATACCTGCTCGAGTAATCGAGAAAGATATATAATCAAACTCATCATTTGAACAAATAATAGACATATTGTAGAGATATGAAGTCTAATTTACATGCAATTATCAGGAGGAGAGAAGATCTACCCTAGGTACCCTATAATCAATTATATTTAAAAAGAATACTAAAAGTTGTTACGGAAAAGGAAGTTCGAGAAAAAGATGTTTGCCACTTTGTCCTCTAAGGTAGGGCAGTAGCACTCCCCTAATACGAGTCGTTCCTTCAATGTTTTTGGTCCTCGACTGTTCGTTCTCTCTGTTTGGGAATTTGTCGATTTGTTTCACCTAGGATCTGTCCTATAACTTGTCGATTCGTTCTTCTAAGTCCTACCCCAAAACTTATCGACTCGCTTTACGATAGACCTCCTCGGGTCCAAGCAGTTATCAGTTCATCTTCCTCCTCAAATGGCATTTATCGTTGATCTGCTACTGACTCTCTTCAGGATTTGTGTCGATCATTTTGGCTTTTAGATCCTAGTCGTCCCCACAGACGGGTCAATTGTTGATGTAGAAATTTGACCTCCTGAGATTCTGCCAAATTCACACCAATATTCAAGAATAAAAGAGATCGACGAAACTTGCAAAACCAAATAGCACTATGTACCGGTGTGGCTCTTGCCACATAAGCTCCGATGCTTAAGTCAGAATCAGAGTAGTCGAGTATGAAGAGTAGGGTATTTCAGCATACCTTTAGGATTCAACTTACTGTTATATTTATATAGGTCAAATTCTGATGCCCTTAGGAAACATACCTTACTGAGTTGTGGGCCGAATGTTGAATTAAGGGATCGATAGCCTGCACTAGACTTTCCAATAGTCGGGTCTGGTCAATTGGAATAATTTGGCCTCTTGGGATTCCGCCAAATTCGCACAAATCTTCTAGAATAAAAAGGATCGACGAAACTTGCAAAACCAAATAGCACTGAGTATCGGCCACACAGGCTCTGATGCTTAAGTCAGAATCGGAGTAGTCGAGTAGGGTATTTCAGCGTACCTTTAGGGTTCACCTTATTTTTATATTTATAGAGGTCAAATCATGATGCCTTTAGGAAACATACCTTGCTGGGTTGTGGGTATCTTGTGGGCCGAATGTTGAATTAATTGACCGGTAGCCTGCACTAGACTTTCGAGTAGTCGGGTCTTGTCGGCTTGACCTGGTCGACCGAGGCCTGATCGACTGGGATCTCGCCGGGCCCTGAATTTGGTCATACCCTGGCTCTGGTCATGCCTTGGATCTTGTCTAGTCGGATTTGGTCGGTTGGACCTAGTCGGCCAGCCAGACTTGGTAGGCCTGCACCTAGTCGGTCGGGATCTCACGGGGCCCTGGATCTGGTCATGCCCTAGTCATGCCTTGGATCTGGTCATGCCTTGGGCTTGGTCATACCTTAGATCTGGTCATGTCTTAGGCCTATGATGTAGCTCGCATGTCGGGTCTAGCTGTGTCTTCTCGTAATAGTGAGAATAGAGATGCAAGAGCCCCACATGGCAATGGAGACCAACTCTACTATAGAGTGACGACCAATGGAGGTAGCAAAGATGATCTGACAAAGACGTACCGATAA

mRNA sequence

ATGCAAGTTCGTAGCGCCATGGCGCTACGGTACAGCGCTGCGGCGCTGCTAATTCGGGATTTTGCTATCGATTCTGACCTGTATACTATCAGTCAGATGGTGAAGGATTGCATCAAGAAGTATAAGGTAGCTTCGGGTCAGTTGGTGAATTTTGATAAATCTGTTATAGCTTTCAGCCTGAATACCAAGACAGACATGAGTTCCTTGGTTGATCATGAGGAGGGTGGTTGGCAGGGGGATGTTGTTCGAGATGAATTTACTCCAGATGAGGCTAAAGGAATTCTATCTATTCCAATTGGTAGGGGTGCGGAAGAGGATAGGCTAATCTGGAACTATGAGAAGACAGGGGTGTACTCGGTCAGAAGTGGGTATAAGGTAGCTTTGCTGAATAATCCATGTGTTCAGGCCCCCTCCTCTTCATCTTCTGAGGAGGTTCGATGTTGGTGGAACGGTTTCTGGAAAATGCACATCCCTAATAAAATAAAAGTGTTTCTATGGCGTCTATGTCTAGATAGACTTCCAACAGGATGCAATCTATCTAAAAGGGGTGTGGAGATTACAAATTGCTGCTATTTTTGTGGGAGGAATGGTGAAGACAGTATCCATCTGTTTTGGATATGCAAATTTGCTGAAGCTTTGTGGATTAATTCAAAATTTGGAAAGCTATCTCCATTTCTTATTCTTCGAGAGTCGCATGAAAGTCTAAGCAAGGCGGATTTTGAGGAATTATGTGTTGTTATTTGGGGGCTGTGGAATCAAAGGAATGCACGAGCTTTCAATGATAGTACAAAAACAGTTTTTAAGATAGGAATGGAGCTTGTTGAATGGGCAAATAAATATGCTATGGAGTTTAGGGAAGCTAAATCTAATCCTATTACTGGGAGAGTTACAAATACAGCAGAGATTTTATGGCAACCACCGGACGAAGGAATATATAAAATTAACACTGATGCCTCATTTTTAGCTTCAGATCAACATGCAGGATTGGGAATCATCATCCATAATGACAGAGGGCAAGTTATGGCTGCAGCTACGAAGTACTTGGAGAATATTCAATCGGTGGATATGGCAGAAGCGATTGCTGCAGTGGAGGGACTTCAACTGGCGTCGGAAATTGAAATCGTTTTGAAGGCGAAGAATTTCTGGACTCAATCTTTACATGCAAGTTTCAATTTCGTGAAGAGGGAGGGTAATAAAGCGGCACACATGTTGGCTCGACGGGCTCTCCTTCTTCATGAGTTTTCGATCTGGATGGAGGATTGGCCACTGGAGCTGAAGAGCTGTTTAGAAATGGAATGTTTGGAGGAGCTTCTATATGGTCGGCCGAGATTTGGTCGGCTGAGACCTGTCATGCCTTGGATCTATTGCCCTGAATCTGGTCATACCTTGGTTCTGATCATGCCTCGGCCCCGGTCACGCCTCGAGCTTGGTCATGGCCTGGATCTAGTCATGCCCCGGGTTTGGTCATGCCTCAGGTTTTGTCATGCCCTAAATCTAGTCATGCGTTGGGCCTGTGATGTAACTTGCCTGTTGAGTCTAACTTTGTCTTCCCACAACATTGAAGGGTTGATGCTTCTACCTTACTGGTTTAAGTCTCGATTGATCCCAGACATGCCTTCGTGTGATCATACAAACACATCAGTTGCTTTTGAGAAAATTGATGAGACTCGCTCTCTCCTCAACTCCCGGCTTGGTTTCGATGTTGATTCGTCGCTCTTCAAATTCGGATTTGGTCGGTTGGACCTAGTCGGCCAGCCAGACTTGGCCTATGATGTAGCTCGCATGTCGGGTCTAGCTGTGTCTTCTCGTAATAGTGAGAATAGAGATGCAAGAGCCCCACATGGCAATGGAGACCAACTCTACTATAGAGTGACGACCAATGGAGGTAGCAAAGATGATCTGACAAAGACGTACCGATAA

Coding sequence (CDS)

ATGCAAGTTCGTAGCGCCATGGCGCTACGGTACAGCGCTGCGGCGCTGCTAATTCGGGATTTTGCTATCGATTCTGACCTGTATACTATCAGTCAGATGGTGAAGGATTGCATCAAGAAGTATAAGGTAGCTTCGGGTCAGTTGGTGAATTTTGATAAATCTGTTATAGCTTTCAGCCTGAATACCAAGACAGACATGAGTTCCTTGGTTGATCATGAGGAGGGTGGTTGGCAGGGGGATGTTGTTCGAGATGAATTTACTCCAGATGAGGCTAAAGGAATTCTATCTATTCCAATTGGTAGGGGTGCGGAAGAGGATAGGCTAATCTGGAACTATGAGAAGACAGGGGTGTACTCGGTCAGAAGTGGGTATAAGGTAGCTTTGCTGAATAATCCATGTGTTCAGGCCCCCTCCTCTTCATCTTCTGAGGAGGTTCGATGTTGGTGGAACGGTTTCTGGAAAATGCACATCCCTAATAAAATAAAAGTGTTTCTATGGCGTCTATGTCTAGATAGACTTCCAACAGGATGCAATCTATCTAAAAGGGGTGTGGAGATTACAAATTGCTGCTATTTTTGTGGGAGGAATGGTGAAGACAGTATCCATCTGTTTTGGATATGCAAATTTGCTGAAGCTTTGTGGATTAATTCAAAATTTGGAAAGCTATCTCCATTTCTTATTCTTCGAGAGTCGCATGAAAGTCTAAGCAAGGCGGATTTTGAGGAATTATGTGTTGTTATTTGGGGGCTGTGGAATCAAAGGAATGCACGAGCTTTCAATGATAGTACAAAAACAGTTTTTAAGATAGGAATGGAGCTTGTTGAATGGGCAAATAAATATGCTATGGAGTTTAGGGAAGCTAAATCTAATCCTATTACTGGGAGAGTTACAAATACAGCAGAGATTTTATGGCAACCACCGGACGAAGGAATATATAAAATTAACACTGATGCCTCATTTTTAGCTTCAGATCAACATGCAGGATTGGGAATCATCATCCATAATGACAGAGGGCAAGTTATGGCTGCAGCTACGAAGTACTTGGAGAATATTCAATCGGTGGATATGGCAGAAGCGATTGCTGCAGTGGAGGGACTTCAACTGGCGTCGGAAATTGAAATCGTTTTGAAGGCGAAGAATTTCTGGACTCAATCTTTACATGCAAGTTTCAATTTCGTGAAGAGGGAGGGTAATAAAGCGGCACACATGTTGGCTCGACGGGCTCTCCTTCTTCATGAGTTTTCGATCTGGATGGAGGATTGGCCACTGGAGCTGAAGAGCTGTTTAGAAATGGAATGTTTGGAGGAGCTTCTATATGGTCGGCCGAGATTTGGTCGGCTGAGACCTGTCATGCCTTGGATCTATTGCCCTGAATCTGGTCATACCTTGGTTCTGATCATGCCTCGGCCCCGGTCACGCCTCGAGCTTGGTCATGGCCTGGATCTAGTCATGCCCCGGGTTTGGTCATGCCTCAGGTTTTGTCATGCCCTAAATCTAGTCATGCGTTGGGCCTGTGATGTAACTTGCCTGTTGAGTCTAACTTTGTCTTCCCACAACATTGAAGGGTTGATGCTTCTACCTTACTGGTTTAAGTCTCGATTGATCCCAGACATGCCTTCGTGTGATCATACAAACACATCAGTTGCTTTTGAGAAAATTGATGAGACTCGCTCTCTCCTCAACTCCCGGCTTGGTTTCGATGTTGATTCGTCGCTCTTCAAATTCGGATTTGGTCGGTTGGACCTAGTCGGCCAGCCAGACTTGGCCTATGATGTAGCTCGCATGTCGGGTCTAGCTGTGTCTTCTCGTAATAGTGAGAATAGAGATGCAAGAGCCCCACATGGCAATGGAGACCAACTCTACTATAGAGTGACGACCAATGGAGGTAGCAAAGATGATCTGACAAAGACGTACCGATAA

Protein sequence

MQVRSAMALRYSAAALLIRDFAIDSDLYTISQMVKDCIKKYKVASGQLVNFDKSVIAFSLNTKTDMSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCVVIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEIEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLHEFSIWMEDWPLELKSCLEMECLEELLYGRPRFGRLRPVMPWIYCPESGHTLVLIMPRPRSRLELGHGLDLVMPRVWSCLRFCHALNLVMRWACDVTCLLSLTLSSHNIEGLMLLPYWFKSRLIPDMPSCDHTNTSVAFEKIDETRSLLNSRLGFDVDSSLFKFGFGRLDLVGQPDLAYDVARMSGLAVSSRNSENRDARAPHGNGDQLYYRVTTNGGSKDDLTKTYR
Homology
BLAST of Moc03g01560 vs. NCBI nr
Match: XP_022150918.1 (uncharacterized protein LOC111018954 [Momordica charantia])

HSP 1 Score: 771.9 bits (1992), Expect = 4.2e-219
Identity = 372/386 (96.37%), Postives = 373/386 (96.63%), Query Frame = 0

Query: 66   MSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK 125
            +SSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK
Sbjct: 753  VSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK 812

Query: 126  VALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVE 185
            VALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVE
Sbjct: 813  VALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVE 872

Query: 186  ITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCV 245
            ITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCV
Sbjct: 873  ITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCV 932

Query: 246  VIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQ 305
            VIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQ
Sbjct: 933  VIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQ 992

Query: 306  PPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEG 365
            PPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEG
Sbjct: 993  PPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEG 1052

Query: 366  LQLASEI-------------EIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLH 425
            LQLASEI             EIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLH
Sbjct: 1053 LQLASEIGMHPALEDLSETGEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLH 1112

Query: 426  EFSIWMEDWPLELKSCLEMECLEELL 439
            EFSIWMEDWPLELKSCLEMECLEELL
Sbjct: 1113 EFSIWMEDWPLELKSCLEMECLEELL 1138

BLAST of Moc03g01560 vs. NCBI nr
Match: XP_022140628.1 (uncharacterized protein LOC111011237 [Momordica charantia])

HSP 1 Score: 294.3 bits (752), Expect = 2.6e-75
Identity = 153/197 (77.66%), Postives = 159/197 (80.71%), Query Frame = 0

Query: 271 MELVEWANKYAMEFREAKSNPITGRVTNTAEILWQPPDEGIYKINTDASFLASDQHAGLG 330
           M+LVEWANKY MEFREA SNP  GRVTNTAE+LW PPD+ IYKINTDASFLASDQHAGLG
Sbjct: 1   MKLVEWANKYVMEFREANSNPFPGRVTNTAEVLWLPPDKRIYKINTDASFLASDQHAGLG 60

Query: 331 IIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEI------------------ 390
           III NDRGQVMA+ATKYLENIQSVDMAEAI AVEGLQLAS+I                  
Sbjct: 61  IIIRNDRGQVMASATKYLENIQSVDMAEAIVAVEGLQLASKIGVNPVILETDSSRIFNLF 120

Query: 391 -----------EIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLHEFSIWMEDW 439
                      EIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLL EFSIWMEDW
Sbjct: 121 SQPSEDLSETGEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLREFSIWMEDW 180

BLAST of Moc03g01560 vs. NCBI nr
Match: XP_030942103.1 (uncharacterized protein LOC115967179 [Quercus lobata])

HSP 1 Score: 203.8 bits (517), Expect = 4.6e-48
Identity = 123/405 (30.37%), Postives = 182/405 (44.94%), Query Frame = 0

Query: 66   MSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK 125
            +++L+D E+G W+ +VVR  F P EA  I SI +     ED+  W     G++SVRS YK
Sbjct: 882  VAALIDEEKGAWKKEVVRQTFLPHEADLICSIALSANLPEDKQAWALTHNGIFSVRSAYK 941

Query: 126  VALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVE 185
            +A+  +  V A S S+  ++R +W   W  +IP+KI+ F WR C D LPT  NL +R V 
Sbjct: 942  LAVEMSSTVPAGSVSNGNQLRRFWKYLWSCNIPHKIRHFAWRACKDVLPTKENLVRRKVL 1001

Query: 186  ITNCCYFCGRNGEDSIHLFWICKFAEALW---------INSKFGKLSPFLILRESHESLS 245
            + + C  C    E S HLFW C+ A  +W         +   FG     L          
Sbjct: 1002 LDSVCEECQMEDESSGHLFWRCQRAREVWRTSSLFLGSVEHHFGSFMDLLWHVVMIAQWD 1061

Query: 246  KADFEELCVVIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRV 305
             +  E L V+ W LW+ +N      + K+   I    +E+  KY     +  S       
Sbjct: 1062 HSGVEHLIVIAWALWSNQNEHRHGGAKKSAQAIVQGALEYLVKYQACLEDTDSK------ 1121

Query: 306  TNTAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDM 365
               A ++W PP    YKIN D +     + AG+GI+I +   Q++ A++K L+       
Sbjct: 1122 QPAAAVVWTPPSPNRYKINVDGAVFKEQKMAGVGILIRDAASQMIGASSKKLDAPLGAME 1181

Query: 366  AEAIAAVEGLQLASEIEI-----------------------------VLKAKNFWTQSLH 425
             EA A   GLQ A ++ I                             V  +        H
Sbjct: 1182 VEAKAVELGLQFAKDMSIQDFTLEGDSLSLVNALRELSPPPLSVAALVFSSITVAHSFRH 1241

Query: 426  ASFNFVKREGNKAAHMLARRALLLHEFSIWMEDWPLELKSCLEME 433
              F  V R GNK AH+LAR AL + + S+W+E+ P  L+  L  +
Sbjct: 1242 VDFAHVGRNGNKPAHLLARHALGIADLSVWVEETPCFLEQALNQD 1280

BLAST of Moc03g01560 vs. NCBI nr
Match: XP_030939647.1 (uncharacterized protein LOC115964488 [Quercus lobata])

HSP 1 Score: 203.4 bits (516), Expect = 6.0e-48
Identity = 122/404 (30.20%), Postives = 195/404 (48.27%), Query Frame = 0

Query: 64   TDMSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSG 123
            T +S L+D E   W+ DVV   F   + + ILSIP+      D+++W  +K G + VRS 
Sbjct: 832  TMVSDLIDAESNEWKIDVVHQNFLSQDVEAILSIPLCASGARDKIVWAEDKNGRFLVRSP 891

Query: 124  YKVALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRG 183
            YKVA+ +       S SS  E+R  W G W M++PNK+K F W+ C + L T  NL KR 
Sbjct: 892  YKVAMKDGVDGGRSSCSSQTELRKVWKGLWGMNLPNKVKHFAWKACRNILTTKENLCKRK 951

Query: 184  VEITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILR--ESHESLSKAD-- 243
            +   + C +CG + E   HLFW C+ A+ +W +SK   + PF +L   +  E + +A   
Sbjct: 952  ITSNDICDWCGTHTETVSHLFWFCEHAKTIWSSSKL--IIPFQVLPSWDFMEVMCQAQQW 1011

Query: 244  -------FEELCVVIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPI 303
                    E   +V WG+W  RN        +T    G  +V  A     E++ A  +  
Sbjct: 1012 SVSLPGLVERTVMVCWGIWKNRNELHHGGKGRT----GSAVVRGALLLLEEYQRANESSK 1071

Query: 304  TGRVTNTAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYL---- 363
                  +  + W PP +G YK+N D +     Q  G+G+II NDRG+V+AA +K +    
Sbjct: 1072 VAGERMSLTVQWSPPAQGSYKVNVDGAVFTKCQQMGIGVIIRNDRGEVIAAMSKRMAVPL 1131

Query: 364  ----ENIQSVDMAEAIAAVEGLQLA--------------------SEIEIVLKAKNFWTQ 423
                   ++++ A   AA  G++ A                    + I+ ++       Q
Sbjct: 1132 GALETEAKAMETAVRFAANVGIRDAIFEGDSLTIYNALHGLSSPSAAIQNIVTGILRQAQ 1191

Query: 424  SLHA-SFNFVKREGNKAAHMLARRALLLHEFSIWMEDWPLELKS 428
            S    +F+ +KR+GN  AH+LA+ A  + ++  W+E+ P ++ S
Sbjct: 1192 SFRTFAFSHIKRQGNVPAHVLAQHACNVDDYVTWLEECPSQIVS 1229

BLAST of Moc03g01560 vs. NCBI nr
Match: TXG50387.1 (hypothetical protein EZV62_022911 [Acer yangbiense])

HSP 1 Score: 201.8 bits (512), Expect = 1.7e-47
Identity = 127/404 (31.44%), Postives = 194/404 (48.02%), Query Frame = 0

Query: 59  SLNTKTDMSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVY 118
           S++ K  + S + H  G W  +++R+ F PD+A  ILS+P    A++D L W+++K G Y
Sbjct: 31  SVDLKNVVVSELCHPSGAWNSELIRNSFLPDDASLILSLPRLSPAQDDSLCWHFDKRGFY 90

Query: 119 SVRSGYKVALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCN 178
           +VRSGYKVAL     ++    SSS ++  WW   WK +IPNK K+F W+     LPT   
Sbjct: 91  TVRSGYKVAL----DLKKGIGSSSMQLSPWWRFLWKCNIPNKCKIFFWKAFNGWLPTFAT 150

Query: 179 LSKRGVEITNCCYFCGRNGEDSIHLFWICKFAEALW--------INSKFGKLSPFLILRE 238
           L++R V++ + C  C  + E   H+ W C  A  +W        I        P +IL  
Sbjct: 151 LARRRVDVVDYCQVCNDDSESITHILWSCNSAVEVWRQLLGDDVIQRIVVSDFPSIIL-S 210

Query: 239 SHESLSKADFEELCVVIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSN 298
              S+    F  L +  W LW  RN+     +         ++V W + +A EFR A  N
Sbjct: 211 LWRSVDSVVFNLLIIGYWRLWTNRNSVVHGSAGWA----AADMVTWIDNFANEFRLA--N 270

Query: 299 PITGRVTNTAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLEN 358
            +  +   + +  W+ P  G +KIN DASF      AG+G+II + +G  +AA +  +  
Sbjct: 271 ELNHKEVLSHQPSWKTPKRGEFKINCDASFQLRSGKAGVGVIIRDYKGSAIAARSSPVLC 330

Query: 359 IQSVDMAEAIAAVEGLQLASEIEI------------------------VLKAKNFWTQSL 418
             SV+M EA A +EG+ LA +I +                         L A    + +L
Sbjct: 331 CSSVEMLEAQACLEGIHLAIDIGVSGVIIESDAASVIQLLSDQTVPRTELGAIIHTSLAL 390

Query: 419 HASFNF-----VKREGNKAAHMLARRALLLHEFSIWMEDWPLEL 426
            AS N      V+RE N  AH +A+ AL L    +W E+ P ++
Sbjct: 391 GASVNLLSYVAVRREANSVAHCIAQHALSLDSPVVWFEEMPPDI 423

BLAST of Moc03g01560 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 6.9e-15
Identity = 62/249 (24.90%), Postives = 105/249 (42.17%), Query Frame = 0

Query: 106 DRLIWNYEKTGVYSVRSGYKVALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFL 165
           DRL W + + G +SVRS Y++  ++   V  P+ +S      ++N  WK+ +P ++K FL
Sbjct: 253 DRLSWKFSQDGQFSVRSAYEMLTVDE--VPRPNMAS------FFNCLWKVRVPERVKTFL 312

Query: 166 WRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIHLFWICKFAEALWI---------- 225
           W +    + T     +R +  +N C  C    E  +H+   C     +W+          
Sbjct: 313 WLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQQG 372

Query: 226 ---NSKFGKLSPFLILRESHESLSKADFEELCVVIWGLWNQRNARAFNDSTKTVFKIGME 285
               S F  L   L  R   E +  +      V+IW  W  R    F ++TK       +
Sbjct: 373 FFSKSLFEWLYDNLGDRSGCEDIPWSTI--FAVIIWWGWKWRCGNIFGENTKC-----RD 432

Query: 286 LVEWANKYAME-FREAKSNPITGRVTNTAE--ILWQPPDEGIYKINTDASFLASDQHAGL 339
            V++  ++A+E +R    N + G      E  I W  P  G  K+NTD +   +   A  
Sbjct: 433 RVKFVKEWAVEVYRAHSGNVLVGITQPRVERMIGWVSPCVGWVKVNTDGASRGNPGLASA 486

BLAST of Moc03g01560 vs. ExPASy TrEMBL
Match: A0A6J1DAR4 (uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018954 PE=4 SV=1)

HSP 1 Score: 771.9 bits (1992), Expect = 2.0e-219
Identity = 372/386 (96.37%), Postives = 373/386 (96.63%), Query Frame = 0

Query: 66   MSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK 125
            +SSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK
Sbjct: 753  VSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYK 812

Query: 126  VALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVE 185
            VALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVE
Sbjct: 813  VALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVE 872

Query: 186  ITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCV 245
            ITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCV
Sbjct: 873  ITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCV 932

Query: 246  VIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQ 305
            VIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQ
Sbjct: 933  VIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQ 992

Query: 306  PPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEG 365
            PPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEG
Sbjct: 993  PPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEG 1052

Query: 366  LQLASEI-------------EIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLH 425
            LQLASEI             EIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLH
Sbjct: 1053 LQLASEIGMHPALEDLSETGEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLH 1112

Query: 426  EFSIWMEDWPLELKSCLEMECLEELL 439
            EFSIWMEDWPLELKSCLEMECLEELL
Sbjct: 1113 EFSIWMEDWPLELKSCLEMECLEELL 1138

BLAST of Moc03g01560 vs. ExPASy TrEMBL
Match: A0A6J1CIF1 (uncharacterized protein LOC111011237 OS=Momordica charantia OX=3673 GN=LOC111011237 PE=4 SV=1)

HSP 1 Score: 294.3 bits (752), Expect = 1.2e-75
Identity = 153/197 (77.66%), Postives = 159/197 (80.71%), Query Frame = 0

Query: 271 MELVEWANKYAMEFREAKSNPITGRVTNTAEILWQPPDEGIYKINTDASFLASDQHAGLG 330
           M+LVEWANKY MEFREA SNP  GRVTNTAE+LW PPD+ IYKINTDASFLASDQHAGLG
Sbjct: 1   MKLVEWANKYVMEFREANSNPFPGRVTNTAEVLWLPPDKRIYKINTDASFLASDQHAGLG 60

Query: 331 IIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEI------------------ 390
           III NDRGQVMA+ATKYLENIQSVDMAEAI AVEGLQLAS+I                  
Sbjct: 61  IIIRNDRGQVMASATKYLENIQSVDMAEAIVAVEGLQLASKIGVNPVILETDSSRIFNLF 120

Query: 391 -----------EIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLHEFSIWMEDW 439
                      EIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLL EFSIWMEDW
Sbjct: 121 SQPSEDLSETGEIVLKAKNFWTQSLHASFNFVKREGNKAAHMLARRALLLREFSIWMEDW 180

BLAST of Moc03g01560 vs. ExPASy TrEMBL
Match: A0A2N9J0V8 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS59068 PE=4 SV=1)

HSP 1 Score: 211.1 bits (536), Expect = 1.4e-50
Identity = 126/401 (31.42%), Postives = 189/401 (47.13%), Query Frame = 0

Query: 64  TDMSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSG 123
           T +S L+DH+   W  DVVR  F P EA+ IL IP+    +ED+LIW   KTG Y VRSG
Sbjct: 222 THVSQLIDHQIRAWDKDVVRSTFIPFEAENILGIPLSHTQQEDKLIWGGTKTGDYVVRSG 281

Query: 124 YKVALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRG 183
           Y + LL       P SS++      WN  W + IP+K++ FLW  C D LPT  NL  R 
Sbjct: 282 YHL-LLEESHSSDPGSSNTASQAQIWNSIWSLSIPHKVRHFLWHACHDSLPTRKNLHHRH 341

Query: 184 VEITNCCYFCGRNGEDSIHLFWICKFAEALWINSKFGKL---SPFLILRESHESLSKA-- 243
           V     C  C  + ED+ H  W CK  + +W  +++G+    S ++   E + +L++   
Sbjct: 342 VIDDPSCPNCTASIEDTYHALWTCKNLQEVWQATEWGRKLQGSHYVDFMELYCALTRTLR 401

Query: 244 --DFEELCVVIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAK--SNPITG 303
             + +   +  W +W +RN +      + + +    L+  + +  MEF +A+  ++P   
Sbjct: 402 TEELQIFAMASWSIWYRRNRQRLGQPIEEIHR----LIPRSLELLMEFHQAQDDNSPKPE 461

Query: 304 RVTNTAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSV 363
                    W PP  G YK N D +       AG+G+II ND+G VMA+ ++ +    SV
Sbjct: 462 PTRPACLAKWNPPAAGSYKANFDGAIFTETSEAGIGVIIRNDQGAVMASLSQRIPYPHSV 521

Query: 364 DMAEAIAAVEGLQLASEI-----------------------------EIVLKAKNFWTQS 423
           +  EA AA    Q A ++                              ++   K      
Sbjct: 522 EAVEAFAARSAAQFALDLGLRDVSFEGDSLKIITALQHHSPCNTQYGYLITDTKTIAQDF 581

Query: 424 LHASFNFVKREGNKAAHMLARRALLLHEFSIWMEDWPLELK 427
           +   F  VKREGN+ AH LA+RA     F +WMED P +L+
Sbjct: 582 ISCHFVHVKREGNRVAHSLAQRARQHEPFQVWMEDVPPDLQ 617

BLAST of Moc03g01560 vs. ExPASy TrEMBL
Match: A0A2N9EMZ0 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8088 PE=4 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 2.0e-49
Identity = 129/404 (31.93%), Postives = 198/404 (49.01%), Query Frame = 0

Query: 69  LVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVAL 128
           L+D E   W+ ++V+  F P EA  IL IP+      D L+W   K G Y+VRSGY   L
Sbjct: 545 LIDSELRSWKAELVKRIFLPHEASVILGIPLSIRDPIDSLVWKATKNGAYTVRSGYH--L 604

Query: 129 LNNPCVQA-PSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEIT 188
           L N C QA PSSS + ++   W+  W +H+P KI+ FLWR C + LPT  NL  R +   
Sbjct: 605 LLNECHQAEPSSSDTTKMTQLWDAIWSLHVPPKIRHFLWRACHNSLPTRSNLHHRHILAD 664

Query: 189 NCCYFCGRNGEDSIHLFWICKFAEALWINSKFGK------LSPFL-ILRESHESLSKADF 248
             C  C    E +IH  W CK  + +W +  +G+       + F+ ++ +  ++LS  + 
Sbjct: 665 PSCSSCTNQIESTIHALWQCKEIKPVWQSIPWGRKLREISYAGFIDLMYQCFQTLSTNEL 724

Query: 249 EELCVVIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSN--PITGRVTN 308
           +   +  WG+W++RN          +     +L+  A    +EF+ A+++    + +  +
Sbjct: 725 QLFSMTSWGIWHRRNRLRLQQPVDNL----SQLIPRALDTLLEFQTAQNSDPQPSPKPNH 784

Query: 309 TAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAE 368
           T    W+PP+EG YK+N D +  +    AG+G+II N RG+VM + +  +    SV+  E
Sbjct: 785 TKSTTWKPPEEGRYKVNYDGAVFSERNEAGVGVIIRNYRGEVMGSLSHRIPYPHSVEAVE 844

Query: 369 AIAAVEGLQLASEIEIVL---------------------------------KAKNFWTQS 428
           A AA   +Q A ++  +L                                  A+N   QS
Sbjct: 845 ASAASCAIQFAKDLGFMLIDLEGDSKIIVEALLLKAPCTTIYGNVIEDIKQSAQNL--QS 904

Query: 429 LHASFNFVKREGNKAAHMLARRALLLHEFSIWMEDWPLELKSCL 430
           +H  F  + REGN  AH+LA+RA L   F +WME  P EL S L
Sbjct: 905 VH--FLHINREGNAMAHLLAKRARLNKPFEVWMESVPPELISNL 938

BLAST of Moc03g01560 vs. ExPASy TrEMBL
Match: A0A2N9HYE3 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS44563 PE=4 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 2.0e-49
Identity = 129/404 (31.93%), Postives = 198/404 (49.01%), Query Frame = 0

Query: 69   LVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVAL 128
            L+D E   W+ ++V+  F P EA  IL IP+      D L+W   K G Y+VRSGY   L
Sbjct: 1426 LIDSELRSWKAELVKRIFLPHEASVILGIPLSIRDPIDSLVWKATKNGAYTVRSGYH--L 1485

Query: 129  LNNPCVQA-PSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEIT 188
            L N C QA PSSS + ++   W+  W +H+P KI+ FLWR C + LPT  NL  R +   
Sbjct: 1486 LLNECHQAEPSSSDTTKMTQLWDAIWSLHVPPKIRHFLWRACHNSLPTRSNLHHRHILAD 1545

Query: 189  NCCYFCGRNGEDSIHLFWICKFAEALWINSKFGK------LSPFL-ILRESHESLSKADF 248
              C  C    E +IH  W CK  + +W +  +G+       + F+ ++ +  ++LS  + 
Sbjct: 1546 PSCSSCTNQIESTIHALWQCKEIKPVWQSIPWGRKLREISYAGFIDLMYQCFQTLSTNEL 1605

Query: 249  EELCVVIWGLWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSN--PITGRVTN 308
            +   +  WG+W++RN          +     +L+  A    +EF+ A+++    + +  +
Sbjct: 1606 QLFSMTSWGIWHRRNRLRLQQPVDNL----SQLIPRALDTLLEFQTAQNSDPQPSPKPNH 1665

Query: 309  TAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAE 368
            T    W+PP+EG YK+N D +  +    AG+G+II N RG+VM + +  +    SV+  E
Sbjct: 1666 TKSTTWKPPEEGRYKVNYDGAVFSERNEAGVGVIIRNYRGEVMGSLSHRIPYPHSVEAVE 1725

Query: 369  AIAAVEGLQLASEIEIVL---------------------------------KAKNFWTQS 428
            A AA   +Q A ++  +L                                  A+N   QS
Sbjct: 1726 ASAASCAIQFAKDLGFMLIDLEGDSKIIVEALLLKAPCTTIYGNVIEDIKQSAQNL--QS 1785

Query: 429  LHASFNFVKREGNKAAHMLARRALLLHEFSIWMEDWPLELKSCL 430
            +H  F  + REGN  AH+LA+RA L   F +WME  P EL S L
Sbjct: 1786 VH--FLHINREGNAMAHLLAKRARLNKPFEVWMESVPPELISNL 1819

BLAST of Moc03g01560 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 127.9 bits (320), Expect = 3.0e-29
Identity = 101/358 (28.21%), Postives = 141/358 (39.39%), Query Frame = 0

Query: 97  IPIGRGAEEDRLIWNYEKTGVYSVRSGYKVAL----LNNPCVQAPSSSSSEEVRCWWNGF 156
           I + +  + D++IWNY  TG Y+VRSGY +       N P +  P  S   + R      
Sbjct: 108 IYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTR-----I 167

Query: 157 WKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIHLFWICKFAEA 216
           W + I  K+K FLWR     L T   L+ RG+ I   C  C R  E   H  + C FA  
Sbjct: 168 WNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATM 227

Query: 217 LWINSKFGKLSPFLILRESHESLSK----------ADFEELCVV--IWGLWNQRNARAFN 276
            W  S    +   L+  +  E++S           +DF +L  V  IW +W  RN   FN
Sbjct: 228 AWRLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFN 287

Query: 277 ----DSTKTVFKIGMELVEWANKYAMEFREAKSNPITGRVTNTAEILWQPPDEGIYKINT 336
                 +KTV     E  +W N      +  K  P   R     +I W+ P     K N 
Sbjct: 288 KFRESPSKTVLSAKAETHDWLN----ATQSHKKTPSPTRQIAENKIEWRNPPATYVKCNF 347

Query: 337 DASFLASDQHAGLGIIIHNDRG--------------QVMAAATK-------------YLE 396
           DA F      A  G II N  G                + A TK             Y +
Sbjct: 348 DAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQ 407

Query: 397 NIQSVDMAEAIAAVEGLQLASEIEIVLKAKNFWTQSLHA-SFNFVKREGNKAAHMLAR 407
                D    I  + G+   S +   L+  +FW     +  F F++R+GNK AH+LA+
Sbjct: 408 VFMEGDCQTLINLINGISFHSSLANHLEDISFWANKFASIQFGFIRRKGNKLAHVLAK 456

BLAST of Moc03g01560 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 112.8 bits (281), Expect = 9.9e-25
Identity = 96/396 (24.24%), Postives = 163/396 (41.16%), Query Frame = 0

Query: 59  SLNTKTDMSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVY 118
           S+++   +S L+D     W+ DV+   F   E K I  +  G     D   W+Y  +G Y
Sbjct: 164 SVSSILKVSDLIDESGREWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDY 223

Query: 119 SVRSGYKVALLNNPCVQAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCN 178
           +V+SGY V         +P   S   +   +   WK     KI+ FLW+   + LP    
Sbjct: 224 TVKSGYWVLTQIINKRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGA 283

Query: 179 LSKRGVEITNCCYFCGRNGEDSIHLFWICKFAEAL-------------WINSKFGKLSPF 238
           L+ R +   + C  C    E   HL + C FA                W +S +  L   
Sbjct: 284 LAYRHLSKESACIRCPSCKETVNHLLFKCTFARLTWAISSIPIPLGGEWADSIYVNLYWV 343

Query: 239 LILRESHESLSKADFEELCVVIWGLWNQRNARAFN----DSTKTVFKIGMELVEWANKYA 298
             L   +    KA  + +  ++W LW  RN   F     ++ + + +   +L EW  +  
Sbjct: 344 FNLGNGNPQWEKAS-QLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIR-- 403

Query: 299 MEFREAKSNPITGRVTNTAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVM 358
               EA+S     +V  ++   W+PP     K NTDA++   ++  G+G ++ N++G+V 
Sbjct: 404 ---TEAESCGTKPQVNRSSCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVK 463

Query: 359 AAATKYLENIQSVDMAE------AIAAVEGLQ-----LASEIEIVLKAKN---FW----- 410
               + L  ++SV  AE      A+ ++   Q       S+ +++++  N    W     
Sbjct: 464 WMGARALPKLKSVLEAELEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEIWPSLKP 523

BLAST of Moc03g01560 vs. TAIR 10
Match: AT5G18880.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 62.0 bits (149), Expect = 2.0e-09
Identity = 58/261 (22.22%), Postives = 96/261 (36.78%), Query Frame = 0

Query: 46  GQLVNFDKSVIAFSLNTKTDMSSLVDHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAE- 105
           GQL+ F  +     L  + D   +     G W     R + +      +   P+   +  
Sbjct: 33  GQLLTFLGAAGPRQLRIRQDARVVEASRNGDWFLPAARSDNSQLFLAALTMAPVPHESRG 92

Query: 106 EDRLIWNYEKTGVY----SVRSGYKVALLNNPCVQAPSSSSSEEVRCWWNGFW-KMHIPN 165
           +D  +W     G Y    S R  ++   +++P V             W    W K +IP 
Sbjct: 93  QDSFLWR-NAAGSYLPSFSSRDTWEQIRVHSPTVP------------WAKVVWFKEYIP- 152

Query: 166 KIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIHLFWICKFAEALW--INS 225
           +  +  W   L+RLPT   L   G+ I +    C    E   HLF+ C F+ A+W    S
Sbjct: 153 RFSLITWMSFLERLPTRDRLRGWGMNIPSSWVLCSNGDETHAHLFFECSFSLAIWEFFAS 212

Query: 226 KFGKLSPF-----------LILRESHESLSKADFEELCVVIWGLWNQRNARAFNDSTKTV 285
           KF    PF           L LR    ++ K   +     ++ +W +RNAR F   + + 
Sbjct: 213 KFRPSPPFGLPAASSWILQLPLRSHSTTILKLLLQS---AVYHVWKERNARIFTSISSSA 272

Query: 286 FKIGMELVEWANKYAMEFREA 288
             + + +        + F  A
Sbjct: 273 SSLRLAIDRTMRNRLLSFPSA 276

BLAST of Moc03g01560 vs. TAIR 10
Match: AT2G34320.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 60.1 bits (144), Expect = 7.6e-09
Identity = 62/265 (23.40%), Postives = 99/265 (37.36%), Query Frame = 0

Query: 190 CYFCGRNGEDSIHLFWICKFA-------------EALWINSKFGKLSPFLILRESHESLS 249
           C  C  + E   HL + C FA             E  W +S +  L   L L      L 
Sbjct: 12  CVRCPDSRETVNHLLFKCCFARLVWAISPIPAYPEGEWTDSLYANLYWVLNLEVEIPKLG 71

Query: 250 KADFEELCVVIWGLWNQRNARAFN----DSTKTVFKIGMELVEWANKYAMEFREAKSNPI 309
           K     +  ++W LW  RN   F     D+ + + +   +  EW+ +     RE +    
Sbjct: 72  KIG-NLVPWLLWRLWKSRNELMFKGKEYDAPEVLRRAMEDFEEWSTR-----RELEGKAS 131

Query: 310 TGRVTNTAEILWQPPDEGIYKINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQ 369
             +V     + W+ P     K NTDA++   +   G+G I+ N+ G V+    + L   +
Sbjct: 132 GPQVERNLSVQWKAPPYQWVKCNTDATWQLENPRCGIGWILRNESGGVLWMGARALPRTK 191

Query: 370 SVDMAE------AIAAVEGLQL--------ASEIEIVLKAKNFW----------TQSLH- 410
           +V  AE      A+  +             A  +  +L + +FW           Q LH 
Sbjct: 192 NVLEAELEALRWAVLTMSRFNYKRIIFESDAQALVNLLNSDDFWPTLQPALEDIQQLLHH 251

BLAST of Moc03g01560 vs. TAIR 10
Match: AT1G33710.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 55.8 bits (133), Expect = 1.4e-07
Identity = 40/139 (28.78%), Postives = 55/139 (39.57%), Query Frame = 0

Query: 144 EVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCGRNGEDSIHL 203
           +V  W    W      K    +W   LDRLPT   L+  G+++   C  C  + ED  HL
Sbjct: 44  DVVSWAKTVWFKGATPKHAFHMWVTNLDRLPTKTRLASWGMQLQTTCGLCSLDIEDRDHL 103

Query: 204 FWICKFAEALWINSKFGKLSP---FLILRE------SHESLSKADFEELCV--VIWGLWN 263
           F  C+FA  LW         P   F++  +           S     +L V  V++ +W 
Sbjct: 104 FLTCEFACFLWHTVSVRLELPAFSFVVWNDLMDWTLQRNRRSPPTLRKLIVQSVLYAIWK 163

Query: 264 QRNARAFNDST---KTVFK 269
           QRN    N  T     VFK
Sbjct: 164 QRNNFLHNHETILPSVVFK 182

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022150918.14.2e-21996.37uncharacterized protein LOC111018954 [Momordica charantia][more]
XP_022140628.12.6e-7577.66uncharacterized protein LOC111011237 [Momordica charantia][more]
XP_030942103.14.6e-4830.37uncharacterized protein LOC115967179 [Quercus lobata][more]
XP_030939647.16.0e-4830.20uncharacterized protein LOC115964488 [Quercus lobata][more]
TXG50387.11.7e-4731.44hypothetical protein EZV62_022911 [Acer yangbiense][more]
Match NameE-valueIdentityDescription
P0C2F66.9e-1524.90Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... [more]
Match NameE-valueIdentityDescription
A0A6J1DAR42.0e-21996.37uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018... [more]
A0A6J1CIF11.2e-7577.66uncharacterized protein LOC111011237 OS=Momordica charantia OX=3673 GN=LOC111011... [more]
A0A2N9J0V81.4e-5031.42Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS59068 PE=4 SV=1[more]
A0A2N9EMZ02.0e-4931.93Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A2N9HYE32.0e-4931.93Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
Match NameE-valueIdentityDescription
AT3G09510.13.0e-2928.21Ribonuclease H-like superfamily protein [more]
AT4G29090.19.9e-2524.24Ribonuclease H-like superfamily protein [more]
AT5G18880.12.0e-0922.22RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT2G34320.17.6e-0923.40Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
AT1G33710.11.4e-0728.78RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 311..377
e-value: 3.3E-6
score: 29.2
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 315..372
e-value: 8.5E-9
score: 35.3
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 118..214
e-value: 2.4E-18
score: 66.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 603..639
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 620..639
NoneNo IPR availablePANTHERPTHR46736:SF6SUBFAMILY NOT NAMEDcoord: 86..373
NoneNo IPR availablePANTHERPTHR46736FAMILY NOT NAMEDcoord: 86..373
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 314..406
e-value: 4.64613E-13
score: 64.2576
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 311..411

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc03g01560.1Moc03g01560.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090502 RNA phosphodiester bond hydrolysis, endonucleolytic
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity