Lcy07g005030 (gene) Sponge gourd (P93075) v1

Overview
NameLcy07g005030
Typegene
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
LocationChr07: 26727801 .. 26729910 (-)
RNA-Seq ExpressionLcy07g005030
SyntenyLcy07g005030
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTCCTCTTGTGCGCTGAGGGTTTGTCTAGTTTGCTTCGTGGGGCAGAACGAAGGTCACTTATCACAGGTTTTCGGCTTACGCTCTCCTGTCCATCAGTTTCGCATCTGTTTTTTGCTGACGATAACCTGCTCTTCTTTCAGGCAAATGCTGCTGAGAGTAGTGTTATTCGGGGTCTTCTGTTGCTCTATGAGCGTGCCTCTGGTCAAACCATTAACTACGAAAAGTCTGTTGTGGCCTTTAGCCCAAACACAAGGGACGATTGCAAACATTATGTTAGCCATGTTCTATCTGTTGTGTGCAAGCCATGTCACAACCAGTATCTGGGCCTCCCCTCATTTATGCCACGTAGTCGTTCTGGGACCCTGAAGTTCATTAGAGATCAGGTCTGGAAACAAGTCCAAGGGTGGAAGGGAAAATTTTTCTCCACAGGTGGCAAAAAGGTGCTGCTTAAATCGACTGTGCAAGCTATTCCCTGCTATACGATGAATTGCTTCAGACTTCCGAAATGCTTGATTGGTGAAATCCATCGCCTCATGGCAAACTTTTGGTGGGATAATCCAAAAGGGGGAAAGAAAAATCATTGGTTGAGTTGGAAATCCCTTTGTCTCCCAAAGTGTCGTGGTGGTTTGGGGTTCAGAGATATGGAGCTGTTTAATCAGTCACTTTTGGCAAAACAGTGCTGGTGGGTGTTTCATGACCCTTCCTCTCTCCTTAGCTTGGTGCTTAAGGGTCGCTACTTCCCAGGTACGGATTTTTTCGGGGCTAGTTTGGGTCATCGGCCTTCTTATGTTTGGAGAAGCTTATTATGGGGGTGTGATCTTCTGACGAGAGGATGTCGTTGGCGTATAGGTAATGGTTCTTCAGTTCCTATTTACTTTTCAAATTGGCTCCCAAATGAGTTTTCACTCCAGATTCACTCCTTCCCTTCTTTACGGTTGACTAGTTGTGTTCGGGACTTGTTTACTGGTTCAGGACATTGGGATGAGGTGAAAATTCGGTCCCATTTTACCACGGCAGATTGCGAAGCTATTTTGAGGATCCCACTTGGAAATCTTTTATCAGAAGATCAATTGATATGGCATTTTGAGAAGAATGGCCTTTTCCCTGTTAAAAGTGGATACCGTTTGGCTCATAGTTTGAGTGTTCAAGATCAACCTTCTTCATCAGATTCAGCCTTATGGCAAGGGTGGTGGTCTAGTCTCTGGAAGATGAATGTTCCAAGCAAGATTAAATTCTTCTTCTGGAGGTTATGTCACAACCGGTTGCCCACCAAAGATAATCTTCTTAAAATAGGTATGGATGTTTCTAATATGTGTGTGATATGCCAAAGTTTCTCGGAGGATTATTTTCATGTTTTTTGGGATTGTCCTATGATTAAATCCATGTGGTGCTGCTCAAAGTTTGTATCTTTATATCAGTCTTTATCTAATTTGAATTTTGATTCACTGTTGTGGGTTTTAAAGGAAAGAGTGAGTAGGCTGGATTTTGAGCTTTTAGTTGTTTTCTGGTGGGCTGTTTGGAACTTACGAAACTCATTGTTATGGGGGGGGGGGGTGTTTTGACGACCGAAACTTGTTACAGTGGGCGAGGGATTACTTAGAGTCTTTCCAAAAGGCGTCGGTGAGGAACTTGCTATCTCCGACGGCATTACAGCAGTCTTCGGGGAGGGCTGTTGTGTCGTGGGTTCCACCAACCGAGAATGAGTTAAAGCTGAATATTGATGCATCGGTAAGGCCTGAGACGGGGGCGGCGAGTGGTGGTTTCGTCCTGCACAATGATAGAGGGGAAGTTCTATTGACGGCTTGTGAAATTCTGCCCTTGTGTTGGAATGTCGACTTAGTTGAGGGATGGGCAATGTTGAGAGGTATACAAATTGCTCGACAAATGGGCTTTTTCAGTTTTCATGTTGAGACCGACTCGTTGAGATTATCAAGAGTTCTAATTGACGATGTGGATGATATTTCGGAAGTAGGAGCAATTATGGATGTGATTCGTGGCTTGCTCCCGCAAGATTCTTCATGTAGGGTGTTGTTCACGCCTAGGCAAGGCAACATGGTAGCACATACCATGACTTCTTTAGCTTTTGTTTATGCTAATTTTATTTGA

mRNA sequence

ATGTTCCTCTTGTGCGCTGAGGGTTTGTCTAGTTTGCTTCGTGGGGCAGAACGAAGGTCACTTATCACAGGTTTTCGGCTTACGCTCTCCTGTCCATCAGTTTCGCATCTGTTTTTTGCTGACGATAACCTGCTCTTCTTTCAGGCAAATGCTGCTGAGAGTAGTGTTATTCGGGGTCTTCTGTTGCTCTATGAGCGTGCCTCTGGTCAAACCATTAACTACGAAAAGTCTGTTGTGGCCTTTAGCCCAAACACAAGGGACGATTGCAAACATTATGTTAGCCATGTTCTATCTGTTGTGTGCAAGCCATGTCACAACCAGTATCTGGGCCTCCCCTCATTTATGCCACGTAGTCGTTCTGGGACCCTGAAGTTCATTAGAGATCAGTGTCGTGGTGGTTTGGGGTTCAGAGATATGGAGCTGTTTAATCAGTCACTTTTGGCAAAACAGTGCTGGTGGGTGTTTCATGACCCTTCCTCTCTCCTTAGCTTGGTGCTTAAGGGTCGCTACTTCCCAGGTAATGGTTCTTCAGTTCCTATTTACTTTTCAAATTGGCTCCCAAATGAGTTTTCACTCCAGATTCACTCCTTCCCTTCTTTACGGTTGACTAGTTGTGTTCGGGACTTGTTTACTGGTTCAGGACATTGGGATGAGGTGAAAATTCGGTCCCATTTTACCACGGCAGATTGCGAAGCTATTTTGAGGATCCCACTTGGAAATCTTTTATCAGAAGATCAATTGATATGGCATTTTGAGAAGAATGGCCTTTTCCCTGTTAAAAGTGGATACCGTTTGGCTCATAGTTTGAGTGTTCAAGATCAACCTTCTTCATCAGATTCAGCCTTATGGCAAGGGTGGTGGTCTAGTCTCTGGAAGATGAATGTTCCAAGCAAGATTAAATTCTTCTTCTGGAGGTTATGTCACAACCGGTTGCCCACCAAAGATAATCTTCTTAAAATAGGTATGGATGTTTCTAATATGTGTGTGATATGCCAAAGTTTCTCGGAGGATTATTTTCATGTTTTTTGGGATTGTCCTATGATTAAATCCATGTGGTGCTGCTCAAAGTTTGTATCTTTATATCAGTCTTTATCTAATTTGAATTTTGATTCACTGTTGTGGGTTTTAAAGGAAAGATGGGCGAGGGATTACTTAGAGTCTTTCCAAAAGGCGTCGGTGAGGAACTTGCTATCTCCGACGGCATTACAGCAGTCTTCGGGGAGGGCTGTTGTGTCGTGGGTTCCACCAACCGAGAATGAGTTAAAGCTGAATATTGATGCATCGGTAAGGCCTGAGACGGGGGCGGCGAGTGGTGGTTTCGTCCTGCACAATGATAGAGGGGAAGTTCTATTGACGGCTTGTGAAATTCTGCCCTTGTGTTGGAATGTCGACTTAGTTGAGGGATGGGCAATGTTGAGAGGTATACAAATTGCTCGACAAATGGGCTTTTTCAGTTTTCATGTTGAGACCGACTCGTTGAGATTATCAAGAGTTCTAATTGACGATGTGGATGATATTTCGGAAGTAGGAGCAATTATGGATGTGATTCGTGGCTTGCTCCCGCAAGATTCTTCATGTAGGGTGTTGTTCACGCCTAGGCAAGGCAACATGGTAGCACATACCATGACTTCTTTAGCTTTTGTTTATGCTAATTTTATTTGA

Coding sequence (CDS)

ATGTTCCTCTTGTGCGCTGAGGGTTTGTCTAGTTTGCTTCGTGGGGCAGAACGAAGGTCACTTATCACAGGTTTTCGGCTTACGCTCTCCTGTCCATCAGTTTCGCATCTGTTTTTTGCTGACGATAACCTGCTCTTCTTTCAGGCAAATGCTGCTGAGAGTAGTGTTATTCGGGGTCTTCTGTTGCTCTATGAGCGTGCCTCTGGTCAAACCATTAACTACGAAAAGTCTGTTGTGGCCTTTAGCCCAAACACAAGGGACGATTGCAAACATTATGTTAGCCATGTTCTATCTGTTGTGTGCAAGCCATGTCACAACCAGTATCTGGGCCTCCCCTCATTTATGCCACGTAGTCGTTCTGGGACCCTGAAGTTCATTAGAGATCAGTGTCGTGGTGGTTTGGGGTTCAGAGATATGGAGCTGTTTAATCAGTCACTTTTGGCAAAACAGTGCTGGTGGGTGTTTCATGACCCTTCCTCTCTCCTTAGCTTGGTGCTTAAGGGTCGCTACTTCCCAGGTAATGGTTCTTCAGTTCCTATTTACTTTTCAAATTGGCTCCCAAATGAGTTTTCACTCCAGATTCACTCCTTCCCTTCTTTACGGTTGACTAGTTGTGTTCGGGACTTGTTTACTGGTTCAGGACATTGGGATGAGGTGAAAATTCGGTCCCATTTTACCACGGCAGATTGCGAAGCTATTTTGAGGATCCCACTTGGAAATCTTTTATCAGAAGATCAATTGATATGGCATTTTGAGAAGAATGGCCTTTTCCCTGTTAAAAGTGGATACCGTTTGGCTCATAGTTTGAGTGTTCAAGATCAACCTTCTTCATCAGATTCAGCCTTATGGCAAGGGTGGTGGTCTAGTCTCTGGAAGATGAATGTTCCAAGCAAGATTAAATTCTTCTTCTGGAGGTTATGTCACAACCGGTTGCCCACCAAAGATAATCTTCTTAAAATAGGTATGGATGTTTCTAATATGTGTGTGATATGCCAAAGTTTCTCGGAGGATTATTTTCATGTTTTTTGGGATTGTCCTATGATTAAATCCATGTGGTGCTGCTCAAAGTTTGTATCTTTATATCAGTCTTTATCTAATTTGAATTTTGATTCACTGTTGTGGGTTTTAAAGGAAAGATGGGCGAGGGATTACTTAGAGTCTTTCCAAAAGGCGTCGGTGAGGAACTTGCTATCTCCGACGGCATTACAGCAGTCTTCGGGGAGGGCTGTTGTGTCGTGGGTTCCACCAACCGAGAATGAGTTAAAGCTGAATATTGATGCATCGGTAAGGCCTGAGACGGGGGCGGCGAGTGGTGGTTTCGTCCTGCACAATGATAGAGGGGAAGTTCTATTGACGGCTTGTGAAATTCTGCCCTTGTGTTGGAATGTCGACTTAGTTGAGGGATGGGCAATGTTGAGAGGTATACAAATTGCTCGACAAATGGGCTTTTTCAGTTTTCATGTTGAGACCGACTCGTTGAGATTATCAAGAGTTCTAATTGACGATGTGGATGATATTTCGGAAGTAGGAGCAATTATGGATGTGATTCGTGGCTTGCTCCCGCAAGATTCTTCATGTAGGGTGTTGTTCACGCCTAGGCAAGGCAACATGGTAGCACATACCATGACTTCTTTAGCTTTTGTTTATGCTAATTTTATTTGA

Protein sequence

MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGLLLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRSGTLKFIRDQCRGGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSLVLKGRYFPGNGSSVPIYFSNWLPNEFSLQIHSFPSLRLTSCVRDLFTGSGHWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLIWHFEKNGLFPVKSGYRLAHSLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCHNRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMWCCSKFVSLYQSLSNLNFDSLLWVLKERWARDYLESFQKASVRNLLSPTALQQSSGRAVVSWVPPTENELKLNIDASVRPETGAASGGFVLHNDRGEVLLTACEILPLCWNVDLVEGWAMLRGIQIARQMGFFSFHVETDSLRLSRVLIDDVDDISEVGAIMDVIRGLLPQDSSCRVLFTPRQGNMVAHTMTSLAFVYANFI
Homology
BLAST of Lcy07g005030 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 123.6 bits (309), Expect = 6.8e-27
Identity = 111/500 (22.20%), Postives = 204/500 (40.80%), Query Frame = 0

Query: 132 GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSLVLKGRY--------------------- 191
           GGLG R  +  N++L++K  W +  + +SL +LVL+ +Y                     
Sbjct: 99  GGLGVRAAKSMNRALISKVGWRLLQEKNSLWTLVLQKKYHVGEIRDSRWLIPKGSWSSTW 158

Query: 192 -----------------FPGNGSSVPIYFSNWLPNEFSLQIHSFPSLRLTSC----VRDL 251
                             PG+G  +  +   W+  +  L++ +    R T C     +DL
Sbjct: 159 RSIAIGLRDVVSHGVGWIPGDGQQIRFWTDRWVSGKPLLELDN--GERPTDCDTVVAKDL 218

Query: 252 FTGSGHWDEVKIRSHFTTADCEAILRIPLGNLL--SEDQLIWHFEKNGLFPVKSGYRLAH 311
           +     WD  KI   +TT +    LR  + +L+  + D+L W F ++G F V+S Y +  
Sbjct: 219 WIPGRGWDFAKI-DPYTTNNTRLELRAVVLDLVTGARDRLSWKFSQDGQFSVRSAYEM-- 278

Query: 312 SLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCHNRLPTKDNLLKIGMDVSNM 371
            L+V + P  + ++    +++ LWK+ VP ++K F W + +  + T++   +  +  SN+
Sbjct: 279 -LTVDEVPRPNMAS----FFNCLWKVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASNV 338

Query: 372 CVICQSFSEDYFHVFWDCPMIKSMW-------------CCSKFVSLYQSLSNLN------ 431
           C +C+   E   HV  DCP    +W               S F  LY +L + +      
Sbjct: 339 CQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIP 398

Query: 432 ----FDSLLW---------VLKE--------RWARDYLESFQKASVRNLLSPTALQQSSG 491
               F  ++W         +  E        ++ +++     +A   N+L    + Q   
Sbjct: 399 WSTIFAVIIWWGWKWRCGNIFGENTKCRDRVKFVKEWAVEVYRAHSGNVL--VGITQPRV 458

Query: 492 RAVVSWVPPTENELKLNIDASVRPETGAASGGFVLHNDRGEVLLTACEILPLCWNVDLVE 548
             ++ WV P    +K+N D + R   G AS G VL +  G         +  C +    E
Sbjct: 459 ERMIGWVSPCVGWVKVNTDGASRGNPGLASAGGVLRDCTGAWCGGFSLNIGRC-SAPQAE 518

BLAST of Lcy07g005030 vs. ExPASy Swiss-Prot
Match: P93295 (Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 GN=AtMg00310 PE=4 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 6.0e-07
Identity = 25/41 (60.98%), Postives = 32/41 (78.05%), Query Frame = 0

Query: 132 GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSLVLKGRYFP 173
           GGLGFRD+  FNQ+LLAKQ + + H P +LLS +L+ RYFP
Sbjct: 55  GGLGFRDLGWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFP 95

BLAST of Lcy07g005030 vs. ExPASy TrEMBL
Match: A0A5E4FZN9 (PREDICTED: retrotransposon OS=Prunus dulcis OX=3755 GN=ALMOND_2B007697 PE=4 SV=1)

HSP 1 Score: 286.6 bits (732), Expect = 2.2e-73
Identity = 201/704 (28.55%), Postives = 292/704 (41.48%), Query Frame = 0

Query: 1    MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGL 60
            +FL+C EG S LLRGAERR  + G ++    PSV+HL FADD++LF +A       +  L
Sbjct: 575  LFLMCTEGFSCLLRGAERRGDLVGVQVARGGPSVTHLLFADDSILFMKATNEACRALETL 634

Query: 61   LLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRS 120
               YE  SGQ INY KS  + SPN        +  VL+V    CH +YLGLP+   + R 
Sbjct: 635  FQTYEEVSGQQINYSKSAFSLSPNATRADFDMIKGVLNVPVVQCHEKYLGLPTIAGKGRK 694

Query: 121  GTLKFIRDQ--------------------------------------------------- 180
               + ++D+                                                   
Sbjct: 695  QLFQHLKDKLWKHISGWKEKLLSRAGKEILMKAVLQAIPTYSMSCFRIPKGLCKELNGIM 754

Query: 181  ----------------------CR----GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLS 240
                                  C+    GGLGFRD+E FNQ+LLAKQCW +   P SL++
Sbjct: 755  ARFWWAKAKDKRGIHWVKWELLCKSKFAGGLGFRDLEAFNQALLAKQCWRILRTPESLVA 814

Query: 241  LVLKGRYFP-----------------------------------GNGSSVPIYFSNWLPN 300
             + + RY P                                   GNG S+ +Y   WLP 
Sbjct: 815  RIFRARYHPSVPFLEAEVGTNPSFIWRSLQWGKELLNKGLRWRVGNGVSIQVYTDKWLPA 874

Query: 301  EFSLQIHSFPSLRLTSCVRDLFTGSGHWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLI 360
                +I S P L L++ V DLFT SG W+   ++  F   + +A L+IPL +L   D LI
Sbjct: 875  PSFFKIMSPPQLPLSTLVCDLFTSSGQWNVPLLKDIFWDQEVDAKLQIPLASLAGHDCLI 934

Query: 361  WHFEKNGLFPVKSGYRLAHSLSVQDQPSSSDSA---LWQGWWSSLWKMNVPSKIKFFFWR 420
            WH+E+NG++ VKSGYRLA     +D+ S   S    L   +W  +W + +P+KIKFF WR
Sbjct: 935  WHYERNGMYSVKSGYRLA--CLEKDKMSGEPSVRVDLNSKFWKKIWALKIPNKIKFFLWR 994

Query: 421  LCHNRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMW--------C---- 480
               + LP    L    +  + +C  C   +E   H  W C   K +W        C    
Sbjct: 995  CAWDFLPCGQILFNRKIAPTPICPNCHRKAESVLHAVWLCETAKEVWRNSAWGNVCEEWR 1054

Query: 481  CSKFVSLYQSLSNLN-------FDSLLWVL-----------KERWARDYLESFQK----- 540
             + F  L+ +L   +       F  L W L           K   A   L    K     
Sbjct: 1055 VNSFRELWHALQLSSSGEEQGLFAYLCWGLWNRRNSFIFEGKSETATQLLHRMTKLAQEF 1114

Query: 541  ASVRNLLSPTALQQSSGRAVV-SWVPPTENELKLNIDASVRPETGAASGGFVLHNDRGEV 554
            ++  NL      +QSS +A +  W PP     K+N+D +V+        G V+ N  GE 
Sbjct: 1115 SNANNLSHTIHGRQSSPQAPLHGWRPPPAGIYKINVDGAVKSGDSVRGVGVVVRNANGEF 1174

BLAST of Lcy07g005030 vs. ExPASy TrEMBL
Match: A0A6J1DAR4 (uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018954 PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 6.5e-73
Identity = 189/619 (30.53%), Postives = 270/619 (43.62%), Query Frame = 0

Query: 83   PNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRSGTLKFIRD-------------- 142
            P  +D     + ++LSV    C  QYLGLP+FMPR+R     +I+D              
Sbjct: 518  PEYQDRQSSLIQNILSVNMVECQLQYLGLPTFMPRNRRMHFNYIKDRVWKHLQGWKAKLF 577

Query: 143  ------------------------------------------------------------ 202
                                                                        
Sbjct: 578  SIGGKEVLIKAVAQAIPCYTMSCFRLPKRLIREFHHITARFWWGSSKEDKKIHWVAWNSL 637

Query: 203  ---QCRGGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSLVLKGRYFP------------- 262
               +C GG+GFRD+ELFN++LLAKQCW + + P+S+LS VLKGRYF              
Sbjct: 638  YLPKCEGGMGFRDLELFNKALLAKQCWRILNHPNSMLSRVLKGRYFKDCSFMEAKISGNP 697

Query: 263  ----------------------GNGSSVPIYFSNWLPNEFSLQIHSFPSLRLTSCVRDLF 322
                                  GNG SV IY  NW+PN+ +L+I S P L L S V  L 
Sbjct: 698  SYIWRSILWGRDLLKKGLRWRIGNGDSVFIYGDNWVPNQPTLKILSSPRLPLVSRVSSLV 757

Query: 323  T-GSGHWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLIWHFEKNGLFPVKSGYRLA--H 382
                G W    +R  FT  + + IL IP+G    ED+LIW++EK G++ V+SGY++A  +
Sbjct: 758  DHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVALLN 817

Query: 383  SLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCHNRLPTKDNLLKIGMDVSNM 442
            +  VQ  PSSS S   + WW+  WKM++P+KIK F WRLC +RLPT  NL K G++++N 
Sbjct: 818  NPCVQ-APSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNC 877

Query: 443  CVICQSFSEDYFHVFWDCPMIKSMWCCSKFVSL---------YQSLSNLNFDSL------ 502
            C  C    ED  H+FW C   +++W  SKF  L         ++SLS  +F+ L      
Sbjct: 878  CYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCVVIWG 937

Query: 503  LWVLKE-------------------RWARDYLESFQKASVRNLLSPTALQQSSGRAVVSW 553
            LW  +                     WA  Y   F++A      S     + +  A + W
Sbjct: 938  LWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAK-----SNPITGRVTNTAEILW 997

BLAST of Lcy07g005030 vs. ExPASy TrEMBL
Match: A0A803QGT2 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 7.2e-72
Identity = 196/698 (28.08%), Postives = 292/698 (41.83%), Query Frame = 0

Query: 1    MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGL 60
            +FL+C+EGLS LL+  E+   + G  ++   PS+SHLFFADD+LLF QAN      I+  
Sbjct: 608  LFLICSEGLSRLLQYEEQIGRLKGLAVSRHSPSISHLFFADDSLLFCQANDRSCGAIKRA 667

Query: 61   LLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRS 120
            L +Y RASGQ +N +KSV++FSPNT    ++    +L +    CH  YLGLP++  R +S
Sbjct: 668  LDIYHRASGQRLNADKSVMSFSPNTPIAVQNSFQQILGMPICECHEAYLGLPAYSERDKS 727

Query: 121  GTLKFIRDQ--------------------------------------------------- 180
                 I+++                                                   
Sbjct: 728  QLFSNIKEKIWKLMHAWNDKIFSIGGKEVLLKAVVQSIPTYAMSCFRLPVKLCNEIEAMM 787

Query: 181  ----------------------CR----GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLS 240
                                  C+    GG+GFR    FNQ+LLAKQ W +F DP+SLLS
Sbjct: 788  AKFWWGSSSDNKKIHWKKWRFLCKSKGDGGMGFRSFVHFNQALLAKQAWRIFQDPTSLLS 847

Query: 241  LVLKGRYFP-----------------------------------GNGSSVPIYFSNWLPN 300
             VLKG YF                                    G G+++     +W+P 
Sbjct: 848  RVLKGHYFSQNDFMTARGGGLSSLTWQGIVWGRELLVKGLRLKVGTGNNIRCAVDSWIPG 907

Query: 301  EFSLQIHSFPSLRLTSCVRDLFTGSGHWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLI 360
                + + +     T+ V D  T +  W+   ++S F+T D + IL+IPL  L   D+ I
Sbjct: 908  HKGFKPYCYTGAH-TNHVADYITATREWNIECLQSDFSTPDVDNILKIPLSFLPVNDRWI 967

Query: 361  WHFEKNGLFPVKSGYRLAHSLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCH 420
            WH+E +G + V SGY LA SL  +D  S S +   + WW S WK+N+PSK+K F W++  
Sbjct: 968  WHYEDSGDYSVSSGYTLASSLGEEDLSSCSHTQ--ETWWKSFWKLNLPSKVKIFGWKVIQ 1027

Query: 421  NRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMWCCSKF----------- 480
            + +P   +L    +  S  C +CQS  E   H  + C   K +W  S F           
Sbjct: 1028 SSIPVAKSLYHRKILTSATCSLCQSAWESIGHALFSCCHAKEVWKFSGFSIDFTNADRLQ 1087

Query: 481  --------VSLYQSLSNLNFDSLLWVL----------------KERWARDYLESFQKASV 540
                     S+Y+  +  N   L+W +                 + W +      Q  S+
Sbjct: 1088 DGDYLMHLSSIYEKSAFENILCLMWFIWSDRNNFIHGKKVKTPLQMWTQSVAYMDQYRSI 1147

Query: 541  RNLLSPTALQQSSGRAVVSWVPPTENELKLNIDASVRPETGAASGGFVLHNDRGEVLLTA 552
             + ++P A  ++S  +V SW PP EN  KLN+DA++         G ++ N  G+V    
Sbjct: 1148 TSAVTPAASNRTSQASVASWKPPPENTFKLNVDAALDSSRSKIGIGVIVRNSAGQVKAAL 1207

BLAST of Lcy07g005030 vs. ExPASy TrEMBL
Match: A0A6J5UN51 (Reverse transcriptase domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS28124 PE=4 SV=1)

HSP 1 Score: 278.5 bits (711), Expect = 6.1e-71
Identity = 204/695 (29.35%), Postives = 301/695 (43.31%), Query Frame = 0

Query: 1    MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGL 60
            +FL+ AE  S+LL+ AER S + G  +  S PS++HLFFADD+LLF  A   E+  ++ +
Sbjct: 1030 LFLIVAEAFSALLQQAERDSRLHGVSIAPSAPSINHLFFADDSLLFCNAGTTEALELKRI 1089

Query: 61   LLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRS 120
              +YE ASGQ +N  KS + FSP+T    +  +  +L+V   PCH +YLGLP+ + + + 
Sbjct: 1090 FGVYESASGQKVNLGKSALCFSPSTPRVLQDDIRQLLNVTLVPCHERYLGLPTIVGKDKK 1149

Query: 121  GTLKFIRDQ-------------------------------------------CR------ 180
               + ++D+                                           CR      
Sbjct: 1150 KMFRMVKDRVWNKVNGWQGKLLSKAGKEVLIKSVCQAIPSYSMSVFRLPVGLCREIESII 1209

Query: 181  ---------------------------GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSL 240
                                       GG+GFR++  FNQ+LL KQ W +   P SL++ 
Sbjct: 1210 AKFWWAKNDGRGIHWKTWRFMCQHKSDGGIGFRELISFNQALLCKQGWRLLEFPHSLIAR 1269

Query: 241  VLKGRYFP-----------------------------------GNGSSVPIYFSNWLPNE 300
            + K RYFP                                   G+G  V IY   W+P +
Sbjct: 1270 MFKARYFPHSDFLATSSGSLPSFTWQSILWGRDLLRLGLRWRIGDGRLVNIYGDPWVPYD 1329

Query: 301  FSLQIHSFPSLRLTSCVRDLFTGSGHWDEVKIRSHFTTADCEAILRIPL-GNLLSEDQLI 360
                I S P+L +TS V DLFT SG WD  K+ + F+  + EAIL IPL G+ L  D+ I
Sbjct: 1330 RFFTIQSIPTLPVTSRVCDLFTASGGWDVGKVFASFSFPEAEAILSIPLMGDTL--DRRI 1389

Query: 361  WHFEKNGLFPVKSGYRLA------HSLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFF 420
            W+F KNG + VKSGY  A        LS       S S+L    W  LWK+ VP KI   
Sbjct: 1390 WNFTKNGRYSVKSGYWAALEYKRLEELSAGGVAGPSSSSLKS--WKHLWKLKVPQKIMHL 1449

Query: 421  FWRLCHNRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMWCCSKFVS--L 480
             WR+  + LP+K+ L +  +    +C  C +  E   H    C +   +W    F S  L
Sbjct: 1450 LWRVAQDILPSKEVLFRRRITQGEVCCRCFAARETTLHALVGCDVCLQVWKALDFPSDFL 1509

Query: 481  YQSLSNLN-----------------FDSLLWVL-KER-----------------WARDYL 540
              +L+++                  F   +WVL  ER                  A+DY 
Sbjct: 1510 LPTLADVGTWMDAVWSIIPPDKQSLFAFTVWVLWNERNGVLFGSQPTPSGILVQRAKDYD 1569

BLAST of Lcy07g005030 vs. ExPASy TrEMBL
Match: A0A803Q2K8 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 3.0e-70
Identity = 195/679 (28.72%), Postives = 289/679 (42.56%), Query Frame = 0

Query: 1    MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGL 60
            +FL+CAEGLS LL+  E    + G +++ + PSVSHLFFADD++LF +AN   +  I   
Sbjct: 1037 LFLICAEGLSRLLQHEESTGALQGLKISRNAPSVSHLFFADDSVLFCRANRQSARSIHRC 1096

Query: 61   LLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRS 120
            L  Y +ASGQ IN +K V++FS NT+   + + + +L +  +PCH QYLGLPSF  R + 
Sbjct: 1097 LQTYSQASGQVINPDKCVLSFSDNTKRHEQDFFTALLGMPIQPCHEQYLGLPSFAGRDKK 1156

Query: 121  GTLKFIRD---------------------------------------------------- 180
                 I D                                                    
Sbjct: 1157 KLFGGITDKIWKLLSSWKEHLFSAGGKEILLKAVVQAIPTYAMSCFRLPITLCHQIESMM 1216

Query: 181  -------------------------QCRGGLGFRDMELFNQSLLAKQCWWVFHDPSSLLS 240
                                     + +GGLGFR+   FNQ+LLAKQ W +   P+SLLS
Sbjct: 1217 ANFWWGSSASGKSIHWKNWNFLCKAKVQGGLGFRNFIHFNQALLAKQAWRLIEFPNSLLS 1276

Query: 241  LVLKGRYFP------------GNGSSVPIYFSNWLPNEFSLQIHSFPSLRLTSCVRDLFT 300
             +L+ RYF             G+G S+      WLP   + + + F        V DL T
Sbjct: 1277 KLLRHRYFSNELLLKGLRWRVGSGLSINCATDAWLPGTTTFKPYFFKGADPNLLVADLIT 1336

Query: 301  GSGHWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLIWHFEKNGLFPVKSGYRLAHSLSV 360
                WD + +R++F+  D + IL IPL     +D +IW     G++ VKSGY+LA S + 
Sbjct: 1337 EQRQWDLISLRANFSQPDVDRILSIPLSLFPHDDAMIWSHSFTGIYNVKSGYQLAVSYAE 1396

Query: 361  QDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCHNRLPTKDNLLKIGMDVSNMCVIC 420
            QD  +SS S   + WWS+ WKM +P K++ F W++ H+ LP    L +  +  S  C IC
Sbjct: 1397 QDDTASSHS--MENWWSTFWKMKLPPKVRIFVWKVFHSTLPVAAELFRRHIATSPHCSIC 1456

Query: 421  QSFSEDYFHVFWDCPMIKSMWCCSK----FVSLYQS------------LSNLNFDSLLWV 480
             S  E   H  +DCP  K++W  S     F +L QS            LS   F+  L +
Sbjct: 1457 NSAEESINHALFDCPRAKAVWELSSLHIDFHTLRQSASADILLHLSTALSTSEFELFLVL 1516

Query: 481  LKERW---------------------ARDYLESFQKA-SVRNLLSPTALQQSSGRA---- 540
                W                     A  YL  FQ A + R+  +P ++  +  R     
Sbjct: 1517 CWCNWHERNAIYHGNTVRSSQAVASYAPSYLAEFQNARAKRSQPTPISIAATDPRPSSEF 1576

Query: 541  --VVSWVPPTENELKLNIDASVRPETGAASGGFVLHNDRGEVLLTACEILPLCWNVDLVE 547
                 W  P +  LKLN DA++         G  L N  G ++    + L   +  + +E
Sbjct: 1577 THAPKWTAPPQGRLKLNTDAAIDKARNKVGIGATLRNSDGFIVAAISKPLLGNYKAEEME 1636

BLAST of Lcy07g005030 vs. NCBI nr
Match: VVA32947.1 (PREDICTED: retrotransposon [Prunus dulcis])

HSP 1 Score: 286.6 bits (732), Expect = 4.6e-73
Identity = 201/704 (28.55%), Postives = 292/704 (41.48%), Query Frame = 0

Query: 1    MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGL 60
            +FL+C EG S LLRGAERR  + G ++    PSV+HL FADD++LF +A       +  L
Sbjct: 575  LFLMCTEGFSCLLRGAERRGDLVGVQVARGGPSVTHLLFADDSILFMKATNEACRALETL 634

Query: 61   LLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRS 120
               YE  SGQ INY KS  + SPN        +  VL+V    CH +YLGLP+   + R 
Sbjct: 635  FQTYEEVSGQQINYSKSAFSLSPNATRADFDMIKGVLNVPVVQCHEKYLGLPTIAGKGRK 694

Query: 121  GTLKFIRDQ--------------------------------------------------- 180
               + ++D+                                                   
Sbjct: 695  QLFQHLKDKLWKHISGWKEKLLSRAGKEILMKAVLQAIPTYSMSCFRIPKGLCKELNGIM 754

Query: 181  ----------------------CR----GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLS 240
                                  C+    GGLGFRD+E FNQ+LLAKQCW +   P SL++
Sbjct: 755  ARFWWAKAKDKRGIHWVKWELLCKSKFAGGLGFRDLEAFNQALLAKQCWRILRTPESLVA 814

Query: 241  LVLKGRYFP-----------------------------------GNGSSVPIYFSNWLPN 300
             + + RY P                                   GNG S+ +Y   WLP 
Sbjct: 815  RIFRARYHPSVPFLEAEVGTNPSFIWRSLQWGKELLNKGLRWRVGNGVSIQVYTDKWLPA 874

Query: 301  EFSLQIHSFPSLRLTSCVRDLFTGSGHWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLI 360
                +I S P L L++ V DLFT SG W+   ++  F   + +A L+IPL +L   D LI
Sbjct: 875  PSFFKIMSPPQLPLSTLVCDLFTSSGQWNVPLLKDIFWDQEVDAKLQIPLASLAGHDCLI 934

Query: 361  WHFEKNGLFPVKSGYRLAHSLSVQDQPSSSDSA---LWQGWWSSLWKMNVPSKIKFFFWR 420
            WH+E+NG++ VKSGYRLA     +D+ S   S    L   +W  +W + +P+KIKFF WR
Sbjct: 935  WHYERNGMYSVKSGYRLA--CLEKDKMSGEPSVRVDLNSKFWKKIWALKIPNKIKFFLWR 994

Query: 421  LCHNRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMW--------C---- 480
               + LP    L    +  + +C  C   +E   H  W C   K +W        C    
Sbjct: 995  CAWDFLPCGQILFNRKIAPTPICPNCHRKAESVLHAVWLCETAKEVWRNSAWGNVCEEWR 1054

Query: 481  CSKFVSLYQSLSNLN-------FDSLLWVL-----------KERWARDYLESFQK----- 540
             + F  L+ +L   +       F  L W L           K   A   L    K     
Sbjct: 1055 VNSFRELWHALQLSSSGEEQGLFAYLCWGLWNRRNSFIFEGKSETATQLLHRMTKLAQEF 1114

Query: 541  ASVRNLLSPTALQQSSGRAVV-SWVPPTENELKLNIDASVRPETGAASGGFVLHNDRGEV 554
            ++  NL      +QSS +A +  W PP     K+N+D +V+        G V+ N  GE 
Sbjct: 1115 SNANNLSHTIHGRQSSPQAPLHGWRPPPAGIYKINVDGAVKSGDSVRGVGVVVRNANGEF 1174

BLAST of Lcy07g005030 vs. NCBI nr
Match: XP_022150918.1 (uncharacterized protein LOC111018954 [Momordica charantia])

HSP 1 Score: 285.0 bits (728), Expect = 1.4e-72
Identity = 189/619 (30.53%), Postives = 270/619 (43.62%), Query Frame = 0

Query: 83   PNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRSGTLKFIRD-------------- 142
            P  +D     + ++LSV    C  QYLGLP+FMPR+R     +I+D              
Sbjct: 518  PEYQDRQSSLIQNILSVNMVECQLQYLGLPTFMPRNRRMHFNYIKDRVWKHLQGWKAKLF 577

Query: 143  ------------------------------------------------------------ 202
                                                                        
Sbjct: 578  SIGGKEVLIKAVAQAIPCYTMSCFRLPKRLIREFHHITARFWWGSSKEDKKIHWVAWNSL 637

Query: 203  ---QCRGGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSLVLKGRYFP------------- 262
               +C GG+GFRD+ELFN++LLAKQCW + + P+S+LS VLKGRYF              
Sbjct: 638  YLPKCEGGMGFRDLELFNKALLAKQCWRILNHPNSMLSRVLKGRYFKDCSFMEAKISGNP 697

Query: 263  ----------------------GNGSSVPIYFSNWLPNEFSLQIHSFPSLRLTSCVRDLF 322
                                  GNG SV IY  NW+PN+ +L+I S P L L S V  L 
Sbjct: 698  SYIWRSILWGRDLLKKGLRWRIGNGDSVFIYGDNWVPNQPTLKILSSPRLPLVSRVSSLV 757

Query: 323  T-GSGHWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLIWHFEKNGLFPVKSGYRLA--H 382
                G W    +R  FT  + + IL IP+G    ED+LIW++EK G++ V+SGY++A  +
Sbjct: 758  DHEEGGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVALLN 817

Query: 383  SLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCHNRLPTKDNLLKIGMDVSNM 442
            +  VQ  PSSS S   + WW+  WKM++P+KIK F WRLC +RLPT  NL K G++++N 
Sbjct: 818  NPCVQ-APSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNC 877

Query: 443  CVICQSFSEDYFHVFWDCPMIKSMWCCSKFVSL---------YQSLSNLNFDSL------ 502
            C  C    ED  H+FW C   +++W  SKF  L         ++SLS  +F+ L      
Sbjct: 878  CYFCGRNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCVVIWG 937

Query: 503  LWVLKE-------------------RWARDYLESFQKASVRNLLSPTALQQSSGRAVVSW 553
            LW  +                     WA  Y   F++A      S     + +  A + W
Sbjct: 938  LWNQRNARAFNDSTKTVFKIGMELVEWANKYAMEFREAK-----SNPITGRVTNTAEILW 997

BLAST of Lcy07g005030 vs. NCBI nr
Match: XP_030479133.1 (uncharacterized protein LOC115696372 [Cannabis sativa])

HSP 1 Score: 281.6 bits (719), Expect = 1.5e-71
Identity = 196/698 (28.08%), Postives = 292/698 (41.83%), Query Frame = 0

Query: 1    MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGL 60
            +FL+C+EGLS LL+  E+   + G  ++   PS+SHLFFADD+LLF QAN      I+  
Sbjct: 607  LFLICSEGLSRLLQYEEQIGRLKGLAVSRHSPSISHLFFADDSLLFCQANDRSCGAIKRA 666

Query: 61   LLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRS 120
            L +Y RASGQ +N +KSV++FSPNT    ++    +L +    CH  YLGLP++  R +S
Sbjct: 667  LDIYHRASGQRLNADKSVMSFSPNTPIAVQNSFQQILGMPICECHEAYLGLPAYSERDKS 726

Query: 121  GTLKFIRDQ--------------------------------------------------- 180
                 I+++                                                   
Sbjct: 727  QLFSNIKEKIWKLMHAWNDKIFSIGGKEVLLKAVVQSIPTYAMSCFRLPVKLCNEIEAMM 786

Query: 181  ----------------------CR----GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLS 240
                                  C+    GG+GFR    FNQ+LLAKQ W +F DP+SLLS
Sbjct: 787  AKFWWGSSSDNKKIHWKKWRFLCKSKGDGGMGFRSFVHFNQALLAKQAWRIFQDPTSLLS 846

Query: 241  LVLKGRYFP-----------------------------------GNGSSVPIYFSNWLPN 300
             VLKG YF                                    G G+++     +W+P 
Sbjct: 847  RVLKGHYFSQNDFMTARGGGLSSLTWQGIVWGRELLVKGLRLKVGTGNNIRCAVDSWIPG 906

Query: 301  EFSLQIHSFPSLRLTSCVRDLFTGSGHWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLI 360
                + + +     T+ V D  T +  W+   ++S F+T D + IL+IPL  L   D+ I
Sbjct: 907  HKGFKPYCYTGAH-TNHVADYITATREWNIECLQSDFSTPDVDNILKIPLSFLPVNDRWI 966

Query: 361  WHFEKNGLFPVKSGYRLAHSLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCH 420
            WH+E +G + V SGY LA SL  +D  S S +   + WW S WK+N+PSK+K F W++  
Sbjct: 967  WHYEDSGDYSVSSGYTLASSLGEEDLSSCSHTQ--ETWWKSFWKLNLPSKVKIFGWKVIQ 1026

Query: 421  NRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMWCCSKF----------- 480
            + +P   +L    +  S  C +CQS  E   H  + C   K +W  S F           
Sbjct: 1027 SSIPVAKSLYHRKILTSATCSLCQSAWESIGHALFSCCHAKEVWKFSGFSIDFTNADRLQ 1086

Query: 481  --------VSLYQSLSNLNFDSLLWVL----------------KERWARDYLESFQKASV 540
                     S+Y+  +  N   L+W +                 + W +      Q  S+
Sbjct: 1087 DGDYLMHLSSIYEKSAFENILCLMWFIWSDRNNFIHGKKVKTPLQMWTQSVAYMDQYRSI 1146

Query: 541  RNLLSPTALQQSSGRAVVSWVPPTENELKLNIDASVRPETGAASGGFVLHNDRGEVLLTA 552
             + ++P A  ++S  +V SW PP EN  KLN+DA++         G ++ N  G+V    
Sbjct: 1147 TSAVTPAASNRTSQASVASWKPPPENTFKLNVDAALDSSRSKIGIGVIVRNSAGQVKAAL 1206

BLAST of Lcy07g005030 vs. NCBI nr
Match: XP_023878301.1 (uncharacterized protein LOC111990748 [Quercus suber])

HSP 1 Score: 280.8 bits (717), Expect = 2.5e-71
Identity = 204/701 (29.10%), Postives = 300/701 (42.80%), Query Frame = 0

Query: 1    MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGL 60
            +FLLCAEGLS+L+  A R  LITG  +   CP V+HLFFADD++LF +A   E  ++R +
Sbjct: 607  LFLLCAEGLSALINQAARNKLITGISINRGCPKVTHLFFADDSILFCKAAYEECHLLRSI 666

Query: 61   LLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRS 120
            L  YE ASGQ IN +KS + FSPNT  + +  + ++L  +    H +YLGLPS + RS+S
Sbjct: 667  LGQYEEASGQKINTDKSSIFFSPNTAQETRDEIFNILGPMQNSRHTKYLGLPSLIGRSKS 726

Query: 121  GTLKFIRD---------------------------------------------------- 180
                 +++                                                    
Sbjct: 727  QVFAMLKEKVGHKLAGWKGKLLSMGGKEILIKAVAQAIPTYTMSCFLLPQGLCDDMERMM 786

Query: 181  -------------------------QCRGGLGFRDMELFNQSLLAKQCWWVFHDPSSLLS 240
                                     +  GGLGFR+++ FN ++LAKQ W + ++P+SL+ 
Sbjct: 787  KNFWWGQRNQETKMGWISWKRMCNSKASGGLGFRNLKAFNLAMLAKQAWRILYNPNSLVG 846

Query: 241  LVLKGRYFP-----------------------------------GNGSSVPIYFSNWLPN 300
             VLK RYFP                                   GNG  + I+   WLP 
Sbjct: 847  RVLKARYFPTGDLLNAKLGSSPSYSWRSIHSSLEVIRRGTRWRVGNGKQIHIWEDRWLPT 906

Query: 301  E-----FSLQIHSFPSLRLTSCVRDLFTGSGHWDEVKIRSHFTTADCEAILRIPLGNLLS 360
                   S QIH+F    L S + D    +  W    +RS F   + E ILRIPL   L 
Sbjct: 907  PSTYKVISPQIHNF-EFPLVSSLID--PDTKWWKVEALRSIFLPFEVETILRIPLSYNLP 966

Query: 361  EDQLIWHFEKNGLFPVKSGYRLAHS-LSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFF 420
            ED+LIW   K G F VKS Y +AHS +   ++   S+   ++  W  LW +N+P KIK F
Sbjct: 967  EDKLIWIGNKKGEFSVKSAYHIAHSIIDPNERGECSNGDPYRLLWKKLWLLNLPGKIKIF 1026

Query: 421  FWRLCHNRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMWC-CSKFVSLY 480
             WR C + LPT DN+ K G+  S+ C IC   +ED  H    C     +WC  S +    
Sbjct: 1027 AWRACVDGLPTYDNISKRGICCSSTCPICGLVTEDVNHALLYCEAASLVWCFWSDYPETP 1086

Query: 481  QSLSNLNFDSLL------------------WVL----------------KERW--ARDYL 540
            QS +    D  L                  W +                 + W  A + L
Sbjct: 1087 QSHNGSFLDMALHLCHSKASQVLELFFVLSWAIWYNRNKIVHNDSPLSPSQVWLMANNTL 1146

Query: 541  ESFQKASVRNLLSPTALQQSSGRAVVSWVPPTENELKLNIDASVRPETGAASGGFVLHND 547
            E F+KA+  +++ P   Q       + W  P     K+N+D +   +   +S G ++ + 
Sbjct: 1147 EDFKKAASLDIIPPRHSQ-------IRWEAPPLGIFKVNVDGATSDQGRNSSIGVIIRDS 1206

BLAST of Lcy07g005030 vs. NCBI nr
Match: CAB4277969.1 (unnamed protein product [Prunus armeniaca])

HSP 1 Score: 278.5 bits (711), Expect = 1.3e-70
Identity = 204/695 (29.35%), Postives = 301/695 (43.31%), Query Frame = 0

Query: 1    MFLLCAEGLSSLLRGAERRSLITGFRLTLSCPSVSHLFFADDNLLFFQANAAESSVIRGL 60
            +FL+ AE  S+LL+ AER S + G  +  S PS++HLFFADD+LLF  A   E+  ++ +
Sbjct: 1030 LFLIVAEAFSALLQQAERDSRLHGVSIAPSAPSINHLFFADDSLLFCNAGTTEALELKRI 1089

Query: 61   LLLYERASGQTINYEKSVVAFSPNTRDDCKHYVSHVLSVVCKPCHNQYLGLPSFMPRSRS 120
              +YE ASGQ +N  KS + FSP+T    +  +  +L+V   PCH +YLGLP+ + + + 
Sbjct: 1090 FGVYESASGQKVNLGKSALCFSPSTPRVLQDDIRQLLNVTLVPCHERYLGLPTIVGKDKK 1149

Query: 121  GTLKFIRDQ-------------------------------------------CR------ 180
               + ++D+                                           CR      
Sbjct: 1150 KMFRMVKDRVWNKVNGWQGKLLSKAGKEVLIKSVCQAIPSYSMSVFRLPVGLCREIESII 1209

Query: 181  ---------------------------GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSL 240
                                       GG+GFR++  FNQ+LL KQ W +   P SL++ 
Sbjct: 1210 AKFWWAKNDGRGIHWKTWRFMCQHKSDGGIGFRELISFNQALLCKQGWRLLEFPHSLIAR 1269

Query: 241  VLKGRYFP-----------------------------------GNGSSVPIYFSNWLPNE 300
            + K RYFP                                   G+G  V IY   W+P +
Sbjct: 1270 MFKARYFPHSDFLATSSGSLPSFTWQSILWGRDLLRLGLRWRIGDGRLVNIYGDPWVPYD 1329

Query: 301  FSLQIHSFPSLRLTSCVRDLFTGSGHWDEVKIRSHFTTADCEAILRIPL-GNLLSEDQLI 360
                I S P+L +TS V DLFT SG WD  K+ + F+  + EAIL IPL G+ L  D+ I
Sbjct: 1330 RFFTIQSIPTLPVTSRVCDLFTASGGWDVGKVFASFSFPEAEAILSIPLMGDTL--DRRI 1389

Query: 361  WHFEKNGLFPVKSGYRLA------HSLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFF 420
            W+F KNG + VKSGY  A        LS       S S+L    W  LWK+ VP KI   
Sbjct: 1390 WNFTKNGRYSVKSGYWAALEYKRLEELSAGGVAGPSSSSLKS--WKHLWKLKVPQKIMHL 1449

Query: 421  FWRLCHNRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMWCCSKFVS--L 480
             WR+  + LP+K+ L +  +    +C  C +  E   H    C +   +W    F S  L
Sbjct: 1450 LWRVAQDILPSKEVLFRRRITQGEVCCRCFAARETTLHALVGCDVCLQVWKALDFPSDFL 1509

Query: 481  YQSLSNLN-----------------FDSLLWVL-KER-----------------WARDYL 540
              +L+++                  F   +WVL  ER                  A+DY 
Sbjct: 1510 LPTLADVGTWMDAVWSIIPPDKQSLFAFTVWVLWNERNGVLFGSQPTPSGILVQRAKDYD 1569

BLAST of Lcy07g005030 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 99.8 bits (247), Expect = 7.5e-21
Identity = 73/293 (24.91%), Postives = 111/293 (37.88%), Query Frame = 0

Query: 129 QCRGGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSLVLKGRYF----------------- 188
           +  GG+GF+D+E FN +LL KQ W +   P SL++ V K RYF                 
Sbjct: 51  KAEGGIGFKDIEAFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFV 110

Query: 189 ------------------PGNGSSVPIYFSNWL---PNEFSLQIHSFPSLRLTSC----- 248
                              GNG  + I+   WL   P   +L++   P     S      
Sbjct: 111 WKSIHASQEILRQGARAVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILK 170

Query: 249 VRDLFTGSG-HWDEVKIRSHFTTADCEAILRIPLGNLLSEDQLIWHFEKNGLFPVKSGY- 308
           V DL   SG  W +  I   F   + + I  +  G     D   W +  +G + VKSGY 
Sbjct: 171 VSDLIDESGREWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYW 230

Query: 309 RLAHSLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCHNRLPTKDNLLKIGMD 368
            L   ++ +  P           +  +WK     KI+ F W+   N LP    L    + 
Sbjct: 231 VLTQIINKRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLS 290

Query: 369 VSNMCVICQSFSEDYFHVFWDCPMIKSMWCCSKF-VSLYQSLSNLNFDSLLWV 376
             + C+ C S  E   H+ + C   +  W  S   + L    ++  + +L WV
Sbjct: 291 KESACIRCPSCKETVNHLLFKCTFARLTWAISSIPIPLGGEWADSIYVNLYWV 343

BLAST of Lcy07g005030 vs. TAIR 10
Match: AT2G02650.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 74.3 bits (181), Expect = 3.4e-13
Identity = 72/338 (21.30%), Postives = 125/338 (36.98%), Query Frame = 0

Query: 259 VKSGYRLA------HSLSVQDQPSSSDSALWQGWWSSLWKMNVPSKIKFFFWRLCHNRLP 318
           ++SGY +A         ++Q  P S++         ++WK++V  KIK F WR     L 
Sbjct: 6   LRSGYWVATHEDLLEEEAIQPPPGSTEVK------QAIWKLHVAPKIKHFLWRCVTGALA 65

Query: 319 TKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDCPMIKSMWCCSKFV-------------- 378
           T   L    +D   +C  C    E   H+ ++CP  +S+W  +  +              
Sbjct: 66  TNTRLRSRNIDADPICQRCCIEEETIHHIMFNCPYTQSVWRSANIIIGNQWGPPSSFEDN 125

Query: 379 -------SLYQSLSNLNFDSLLWVLKERW-------------------------ARDYLE 438
                  S  Q+ ++L+     W++   W                         A ++L 
Sbjct: 126 LNRLIQLSKTQTTNSLDRFLPFWIMWRLWKSRNVFLFQQKCQSPDYEARKGIQDATEWLN 185

Query: 439 SFQKASVRNLLSPTALQQSSGRAVVSWVPPTENELKLNIDASVRPETGAASGGFVLHNDR 498
           + +     N+   T   Q+S R    W PP E  +K N D+     +     G+ +    
Sbjct: 186 ANETTENTNVHVATNPIQTSRRDSSQWNPPPEGWVKCNFDSGYTQGSPYTRSGWTIRECN 245

Query: 499 GEVLLTACEILPLCWNVDLVEGWAMLRGIQIARQMGFFSFHVETDSLRLSRVLIDDVDDI 545
           G ++L     L         E    L  +Q+    G      E+DS  L   LI++ +D 
Sbjct: 246 GHIVLCGNAKLQSSTCSLHAEALGFLHALQVIWAHGLRYVWFESDSKSLV-TLINNGEDH 305

BLAST of Lcy07g005030 vs. TAIR 10
Match: AT1G43730.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 57.4 bits (137), Expect = 4.3e-08
Identity = 22/67 (32.84%), Postives = 37/67 (55.22%), Query Frame = 0

Query: 286 WWSSLWKMNVPSKIKFFFWRLCHNRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDC 345
           W+ ++W  N   K  F  W +  NRL T+D L   G+ +  +C++C S  E   H+F++C
Sbjct: 151 WYKAVWFKNHVPKHAFICWVVAWNRLHTRDRLRSWGLSIPAVCLLCNSHDESRAHLFFEC 210

Query: 346 PMIKSMW 353
           P   ++W
Sbjct: 211 PFCGAVW 217

BLAST of Lcy07g005030 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 57.4 bits (137), Expect = 4.3e-08
Identity = 25/41 (60.98%), Postives = 32/41 (78.05%), Query Frame = 0

Query: 132 GGLGFRDMELFNQSLLAKQCWWVFHDPSSLLSLVLKGRYFP 173
           GGLGFRD+  FNQ+LLAKQ + + H P +LLS +L+ RYFP
Sbjct: 55  GGLGFRDLGWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFP 95

BLAST of Lcy07g005030 vs. TAIR 10
Match: AT5G16486.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 53.5 bits (127), Expect = 6.2e-07
Identity = 22/60 (36.67%), Postives = 34/60 (56.67%), Query Frame = 0

Query: 286 WWSSLWKMNVPSKIKFFFWRLCHNRLPTKDNLLKIGMDVSNMCVICQSFSEDYFHVFWDC 345
           W  ++W      K  F  W    +RLPT+D LL  G+ V ++C++C +F E   H+F+DC
Sbjct: 144 WHKAIWFKGRIPKHAFISWVNIRHRLPTRDKLLSWGLHVPSLCLLCNAFDETRQHLFFDC 203

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P0C2F66.8e-2722.20Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... [more]
P932956.0e-0760.98Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A5E4FZN92.2e-7328.55PREDICTED: retrotransposon OS=Prunus dulcis OX=3755 GN=ALMOND_2B007697 PE=4 SV=1[more]
A0A6J1DAR46.5e-7330.53uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018... [more]
A0A803QGT27.2e-7228.08Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A6J5UN516.1e-7129.35Reverse transcriptase domain-containing protein OS=Prunus armeniaca OX=36596 GN=... [more]
A0A803Q2K83.0e-7028.72Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
VVA32947.14.6e-7328.55PREDICTED: retrotransposon [Prunus dulcis][more]
XP_022150918.11.4e-7230.53uncharacterized protein LOC111018954 [Momordica charantia][more]
XP_030479133.11.5e-7128.08uncharacterized protein LOC115696372 [Cannabis sativa][more]
XP_023878301.12.5e-7129.10uncharacterized protein LOC111990748 [Quercus suber][more]
CAB4277969.11.3e-7029.35unnamed protein product [Prunus armeniaca][more]
Match NameE-valueIdentityDescription
AT4G29090.17.5e-2124.91Ribonuclease H-like superfamily protein [more]
AT2G02650.13.4e-1321.30Ribonuclease H-like superfamily protein [more]
AT1G43730.14.3e-0832.84RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
ATMG00310.14.3e-0860.98RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT5G16486.16.2e-0736.67RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (P93075) v1
Date Performed: 2021-12-06
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 418..547
e-value: 1.2E-6
score: 30.6
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 259..352
e-value: 3.4E-20
score: 72.5
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 424..544
e-value: 1.4E-18
score: 66.9
NoneNo IPR availablePANTHERPTHR46736:SF6SUBFAMILY NOT NAMEDcoord: 37..493
NoneNo IPR availablePANTHERPTHR46736FAMILY NOT NAMEDcoord: 37..493
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 423..540
e-value: 1.56521E-15
score: 71.1912
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 421..545

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lcy07g005030.1Lcy07g005030.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity