Lcy01g006040 (gene) Sponge gourd (P93075) v1

Overview
NameLcy01g006040
Typegene
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
LocationChr01: 8630165 .. 8635406 (-)
RNA-Seq ExpressionLcy01g006040
SyntenyLcy01g006040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAGAGATTCAGCAGAAGAAACAGGCCATTAAAGATGCATATTCGGTGATACCTGTGGATTTTTCGATTATTCACTCTTTAGAGGCAGAGTTGGCAAGACTTTTGGAGGATGAAGAAATATATTGGCATCAGCGTTCTAGGGAAAACTGGCTCAAATGGGGTGATAGAAATACAAGATGGTTTCATCTTCGGGCCTCAGAGCGGAAAAAGCGCAATGATATTCATGAGATTTGTAGGGATGATGGTACCTGGGCTACTTCTGAGTCTGAGGTGGAGTCGATCTTTCTGGGTTATTTTCAGAATATTTGTACGACATCGAATCCGTCTGTGGTTCAGCAGTCTGCTATCTTGAATCATATTCCTCCTATTATCTCCCCTGAGATGAATGCAAAGTTGACTGCGCCGTTTTGTAAGGCGGAAATTGAGCGAGTTGTATCTCAAATGTTCCCAACTAAGGCCCCGGGTCCGGATGGTTTTCCTGCTATTTTCTACCAGTCTTATTGGGATATTGTTGGTGCTCAAACGGTTGCGAGCTGTCTTGAGGTACTGAATAACAAAAGATCTGTTCTAACTGGAATAAGACAAACATCGCTCTTATTCCGAAAACTAACGCCCCATCTGTGGTGGGTGACTTTCGCCCGATCAGCCTATGCAATGTCTCGTATAAGATAGTGGCTAAGGTTTTGGCTAACAGGTTGAAACATGTTCTGTGCTCTGTGGTTTCGGATGCCCAATCGGCTTTTGTGCCGGGTCGTGCGATTTCTGATAATGTTATAATCGGTCATGAATGTCTGCATTATATTCAGCATCGGAAAAGGGGGCGAGTTGGGTTTGCGGCGTTGAAGTTAGACATGATCAAGGCCTACGATCGTGTTGAATGGTCCTTTTTGGAGCGACTAATGCGTTTGTATGGGGTTTGTTGATGCGTGGATTAGTCTGATCATGGATTGTATTACAACGGTTGAGTTTGCAGTTCTTATTAATCGAGTGGCCTATGGGAGTATTTTTCCGAGCCGCAGTCTCCGACAGGGGGATCCTCTCTCGCCTTATTTGTTCGTCTTATGTGCCGAGGGCTTGTCTCATGCTCTATCAGCTGCTCATGCTTCACGTCTCATCTCGGGGGTCCAAATTGGTACGTACTGCCCTTCTGTTTCTCATTTATTTTTCGCGGATGATAGCTTGGTCTTTTTTAAGGCAAATATAGTGGAGGGTTGGCATATTAAACGGATTTTGAATGAGTATGAGTCAGCATCGGGCCAGTGTGTGAACTTTTCAAAATCGGCTTTACTCATTTCGCCAAATATTTCTGATGATGGTAGAGCTGCTTTGGGCTCTGTGTTGGGTGTTCCTTTTGTGGATGATTTAGGAACTTATCTGGGGTTGCCATCCCGTTTTCCACGATCCAAAAAGTTGTGTTTTCGGAAAATTCTTGAGAAAGTTAAAAAGGTGGTGCAAGGATGGAAGCGTTTTTTTTTTTTTCTACAGGTGGGAAGGAGACTCTTATTAAGAGTGTGGCGCATGCAATCCCAACATATGCAATGAGCTGTTTTCGACTTCCCAAGTCTATCTATCAGGAAATAACTAGGGAGATATCTCGGTTCTGGTGGGGTTCTTCCGAATCTCGCATGAAGATGCATTGGAAATCGTGGGAGAAGATGTGTTTACCAAAAGAAGCTAGGGGTCTAAGTTTTCGGGATGTGGAGTGCTTTAACCAAGCCTTGTTGGCAAAGCAAGTTTGGCGGGTGCTGAAATTTCCTTCTCTTTTGGTCTCTCGTGTACTTAAAGGGAAGTATTTTCAGGATGGTTTAGTGCTTTCGGTGGCTCGCTCTTTGTCAGGATCTTATTTTTGGCGTGGATTTTTATGGGGGAGGGAGTTGTTGTATGGGGGTCTTCGGAAGCGTATTGGTGATGGCACCTCCACATATTTTTATTTGGATCCGTGGCTTCCTCGTGCTGTGACTTTTAAGCCTCTTCTTGGGCCCTCTCAGTCTCTTGGTAATTCTTTGGCCCGGGTGTGTGAGTTTATCACTGATTCTAGAGAGTGAGATGTTGAGAAGTTGTCAGAACTGCTTACTTTTGATGATCTTCAGCTAGTGCAAAGTATTCTTATTGGTCGGTTGGGGGCTGAGGATATTTGGTTATGGCATTATGATAAGCGAGGGGTGTATACGGTAAAGAGTGGGTATAAGCTTCGAATGCTCCAGGGTCAGGTGTCACAATCCTCTGATTCTTCCTGTTGGAATTTGTGGTGGTCGTATTTATGGTTGCAACAGATTCCTGCTAAGGTTAAGATATTTATGTGGCGTGTGGTTTTGTCCATCCTCCCATCTATGACTAATCTTATTCAGCGTGGTATTGCTGCTGATCCTATGTGTTCCTTGTGTAAGAAATTTCCTGAGATTACAGACCATGCCTTAGTGACTTGTGCGCGAGCGAAGCGATTGTGGAAGACTTTATTGTCGCAGGTTGATTGGTCGCTGAACTTCAACAATAGTTTTTTGGATCGCTGTTTATTTTTACAAGGACTTTTGTCAGTTTCGGATTTTGGGTTGTGGTGTGTGTTGGGTTTTATGCCCTAAAACTCGTAGATAGTGAATGTAACCGTTGACCGGATATTAATGAAATAATATTGATATGTTTTTTATTAATTGTTATTAGTTATTTTATATTTGTCTTTGCCTAATAATAACCCTAATCCAATAAACTAGACATCCAAGGTTGTAATATGAGTCTTGAACTATATGTAGCGACATATGGGATCATTGTTCAAGAAACAACCTAAAGGGTCTATAGTATAGGGATAAGACTGGGTGCCTTATCCTGGTGACACTATGGATACGGCCCACTTTGTATTTGATACAAACGCTGCGATCCAACGCGTTCGTGTAGGAGACATGCGAGTGGGGGTATCATATGCAATGAGTTTGCATAAGACCGGACCGCGAAATAGTAACCACTGGTTATAACACCGTTAACTAGTTGGTTTTCTATTTCACTAGGACGACCTAGACAACTTAGTCTTAATCCTGAGTTGATTATGGACTCCTGCTCATGAGAGATTTTCCTTTGATTTGTATGGGTGAGAGTGGCCAATACGCCGACTCAATAAGCCTACCACTTTGGGGACTAGACCGAATGGGGAGCTGGGAACATAATCGTACAAGATGGAATTCACTCCTTCCCGACTTTAGGGAAGCAGATGAGTGTTCCCTTAAGTGGTTACTCCGAGTCTTGAACAAAGGGCCCTACCCTCTCAATGGCACGAGAGGGTTTTCTGTTTGATGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGAATTGGTACTTAAGGAAATAGAGGTAACCCAGGGGTAAAACGGTAATTTGACCCAGCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTGTGAGTCTATAGTGGAATGAACACACAGTTAACGAATATTGATTAATGTGGTTAATGAGTTTGACCAATTAATCTCACATCGTTGGAGCTTCTGATCTGCAGGTCCATTAGGTCACACTGGTAGCTCATAAAGTAATTTGAGGCAACTAAGTAATAAATGTAAATTGAAAATGTTCAATTTACTAAGGGAAACAAATATAATATATATTGATATATTATTATGAGAGAAATTAATATTTGAAGAAATTCAAATATAAATTTATGGAAGTTTGAAAGAGTTCAAATAATGAATTAATTAATTTAAGGCCTAAAATGTGTTTGGGCAAATTAATTAATTCTAGGCCGGAAATTAAATATGTGATATTTAATTGGTGGGAATTAATCAATTATGGAAATAATTAATCACTTAAGGAAGTGATTAATCATTTATGGAAATGATCAAATCACTTATGGAAGTAATTAGTCATTTGTGGAAAAGATTAATCAATTGTGGAATTGATTGATCATGATTGATTAATTAAGGAATAAGATTGAATCAATTTTGGAATAGATTGATTTAATTGAATATTTTATTCATGGCCTCTCTTCTATAAATAGAACACTAGACCTAATTGGTCTAGGACACTTGACACTTTGCTCTTGCTCTTAGACTCTCAAGACAAGACCTAGAATTCTCTCTAGCCTCCCTCTTAGAGAAAGACTCCCACAAGTCTTTTGCCTCCTAAACCTAGAGTCATACCGGTGTAACCTCTGTGGTTATTGTGTCATTCAAGAAGAATTTTCCAGCGATCACAAGACAAAGGAGGCTGCTGCGTTTTGTTCGTTGGAGCATCGTTGGCGACGAACGGTCAAGTCTACAACGGAGGTTTGTAACGGTTCTTCCCCTTTTCTTTAGAAAAGCATGAAAATAAATTTAGATTTATTTTAAGCATGTTTAAGCTGATTTTCCAGCAACTTTAAAGAGTGTTTAAAGTTTGTTTGAGTTTTCGTAACGGTATTTTAATGTTAATTAATATACGGCGTAAGGTCCAATCGATGCGCTTCCGCTGCGTGGGGACTTTTATTCCCTTCAGTGTGTTGGTTGTTGGGCATTGTGGAATGATAAGAATGCGTATTTGAACGGTTCGGTCCTCCCGGATATCCCTGTTAAAGCTGAGTGGGTTGGAAAGTATTTGAGTTCCTTTTCATCTGTGCATGAGAGTCGGAGCTCGGGTATTTGTGGGTCTATTGTTCAAGGGCCAATTTCGGTTTCGCAGTGGGTCTGTCCTCCCCAGGGCTGGTTCAAAATTAATACTGATGCTTCCTGTTCATCAAAATATAGTTTGACTGGCGTCGGCGTGGTTGTTCGAATGGCGTCGGGTCGCCTGTATGCTGCCCAAATGGAGGTCGTTCCACTTGTTCTTGCTCCTTTGATCGCTGAAGCTCGTGCGGTCCTAGTGGGTTTGAAGTTGGCACTGGTATTGGGTCTTGTGTGTGTTGAGGTGGAATCAGATTGCCTTTCCCTCATTTCCATGCTGACTGGTTCCTTCATCTCCCTCCATGAAGAGGGTACTTATGTTGATGAAATTCTTGAGCTGGCCTCTCACTTTACGAGTGTTGTTTTTCGTCACGTCCGCAGGGGAGGAAATAGACCAGCGCACATTTTGGCCTCTCACGCTGGTGTTGAAGGTTCAATGTTGTGGTGTGCCTCTTTCCCTGCGTGGTTGACAGATGTTGTTGGTCAGGATTCTTTTCCTGCTAGTTGTAACCCTTGTGGTGATTTGATTCTCTAA

mRNA sequence

ATGAAAGAGATTCAGCAGAAGAAACAGGCCATTAAAGATGCATATTCGGTGATACCTGTGGATTTTTCGATTATTCACTCTTTAGAGGCAGAGTTGGCAAGACTTTTGGAGGATGAAGAAATATATTGGCATCAGCGTTCTAGGGAAAACTGGCTCAAATGGGGTGATAGAAATACAAGATGGTTTCATCTTCGGGCCTCAGAGCGGAAAAAGCGCAATGATATTCATGAGATTTGTAGGGATGATGGTACCTGGGCTACTTCTGAGTCTGAGGTGGAGTCGATCTTTCTGGGTTATTTTCAGAATATTTGTACGACATCGAATCCGTCTGTGGTTCAGCAGTCTGCTATCTTGAATCATATTCCTCCTATTATCTCCCCTGAGATGAATGCAAAGTTGACTGCGCCGTTTTGTAAGGCGGAAATTGAGCGAGTTGTATCTCAAATGTTCCCAACTAAGGCCCCGGGTCCGGATGGTTTTCCTGCTATTTTCTACCAGTCTTATTGGGATATTGTTGGTGCTCAAACGGTTGCGAGCTGTCTTGAGCTAGTGCAAAGTATTCTTATTGGTCGGTTGGGGGCTGAGGATATTTGGTTATGGCATTATGATAAGCGAGGGGTGTATACGGTAAAGAGTGGGTATAAGCTTCGAATGCTCCAGGGTCAGGTGTCACAATCCTCTGATTCTTCCTGTTGGAATTTGTGGTGGTCGTATTTATGGTTGCAACAGATTCCTGCTAAGGTTAAGATATTTATGTGGCGTGTGGTTTTGTCCATCCTCCCATCTATGACTAATCTTATTCAGCGTGGTATTGCTGCTGATCCTATGTGTTCCTTGTGTAAGAAATTTCCTGAGATTACAGACCATGCCTTAGTGACTTGTGCGCGAGCGAAGCGATTGTGGAAGACTTTATTGTCGCAGAATGCGTATTTGAACGGTTCGGTCCTCCCGGATATCCCTGTTAAAGCTGAGTGGGTTGGAAAGTATTTGAGTTCCTTTTCATCTGTGCATGAGAGTCGGAGCTCGGGTATTTGTGGGTCTATTGTTCAAGGGCCAATTTCGGTTTCGCAGTGGGTCTGTCCTCCCCAGGGCTGGTTCAAAATTAATACTGATGCTTCCTGTTCATCAAAATATAGTTTGACTGGCGTCGGCGTGGTTGTTCGAATGGCGTCGGGTCGCCTGTATGCTGCCCAAATGGAGGTCGTTCCACTTGTTCTTGCTCCTTTGATCGCTGAAGCTCGTGCGGTCCTAGTGGGTTTGAAGTTGGCACTGGTATTGGGTCTTGTGTGTGTTGAGGTGGAATCAGATTGCCTTTCCCTCATTTCCATGCTGACTGGTTCCTTCATCTCCCTCCATGAAGAGGGTACTTATGTTGATGAAATTCTTGAGCTGGCCTCTCACTTTACGAGTGTTGTTTTTCGTCACGTCCGCAGGGGAGGAAATAGACCAGCGCACATTTTGGCCTCTCACGCTGGTGTTGAAGGTTCAATGTTGTGGTGTGCCTCTTTCCCTGCGTGGTTGACAGATGTTGTTGGTCAGGATTCTTTTCCTGCTAGTTGTAACCCTTGTGGTGATTTGATTCTCTAA

Coding sequence (CDS)

ATGAAAGAGATTCAGCAGAAGAAACAGGCCATTAAAGATGCATATTCGGTGATACCTGTGGATTTTTCGATTATTCACTCTTTAGAGGCAGAGTTGGCAAGACTTTTGGAGGATGAAGAAATATATTGGCATCAGCGTTCTAGGGAAAACTGGCTCAAATGGGGTGATAGAAATACAAGATGGTTTCATCTTCGGGCCTCAGAGCGGAAAAAGCGCAATGATATTCATGAGATTTGTAGGGATGATGGTACCTGGGCTACTTCTGAGTCTGAGGTGGAGTCGATCTTTCTGGGTTATTTTCAGAATATTTGTACGACATCGAATCCGTCTGTGGTTCAGCAGTCTGCTATCTTGAATCATATTCCTCCTATTATCTCCCCTGAGATGAATGCAAAGTTGACTGCGCCGTTTTGTAAGGCGGAAATTGAGCGAGTTGTATCTCAAATGTTCCCAACTAAGGCCCCGGGTCCGGATGGTTTTCCTGCTATTTTCTACCAGTCTTATTGGGATATTGTTGGTGCTCAAACGGTTGCGAGCTGTCTTGAGCTAGTGCAAAGTATTCTTATTGGTCGGTTGGGGGCTGAGGATATTTGGTTATGGCATTATGATAAGCGAGGGGTGTATACGGTAAAGAGTGGGTATAAGCTTCGAATGCTCCAGGGTCAGGTGTCACAATCCTCTGATTCTTCCTGTTGGAATTTGTGGTGGTCGTATTTATGGTTGCAACAGATTCCTGCTAAGGTTAAGATATTTATGTGGCGTGTGGTTTTGTCCATCCTCCCATCTATGACTAATCTTATTCAGCGTGGTATTGCTGCTGATCCTATGTGTTCCTTGTGTAAGAAATTTCCTGAGATTACAGACCATGCCTTAGTGACTTGTGCGCGAGCGAAGCGATTGTGGAAGACTTTATTGTCGCAGAATGCGTATTTGAACGGTTCGGTCCTCCCGGATATCCCTGTTAAAGCTGAGTGGGTTGGAAAGTATTTGAGTTCCTTTTCATCTGTGCATGAGAGTCGGAGCTCGGGTATTTGTGGGTCTATTGTTCAAGGGCCAATTTCGGTTTCGCAGTGGGTCTGTCCTCCCCAGGGCTGGTTCAAAATTAATACTGATGCTTCCTGTTCATCAAAATATAGTTTGACTGGCGTCGGCGTGGTTGTTCGAATGGCGTCGGGTCGCCTGTATGCTGCCCAAATGGAGGTCGTTCCACTTGTTCTTGCTCCTTTGATCGCTGAAGCTCGTGCGGTCCTAGTGGGTTTGAAGTTGGCACTGGTATTGGGTCTTGTGTGTGTTGAGGTGGAATCAGATTGCCTTTCCCTCATTTCCATGCTGACTGGTTCCTTCATCTCCCTCCATGAAGAGGGTACTTATGTTGATGAAATTCTTGAGCTGGCCTCTCACTTTACGAGTGTTGTTTTTCGTCACGTCCGCAGGGGAGGAAATAGACCAGCGCACATTTTGGCCTCTCACGCTGGTGTTGAAGGTTCAATGTTGTGGTGTGCCTCTTTCCCTGCGTGGTTGACAGATGTTGTTGGTCAGGATTCTTTTCCTGCTAGTTGTAACCCTTGTGGTGATTTGATTCTCTAA

Protein sequence

MKEIQQKKQAIKDAYSVIPVDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTRWFHLRASERKKRNDIHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNHIPPIISPEMNAKLTAPFCKAEIERVVSQMFPTKAPGPDGFPAIFYQSYWDIVGAQTVASCLELVQSILIGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQVSQSSDSSCWNLWWSYLWLQQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKRLWKTLLSQNAYLNGSVLPDIPVKAEWVGKYLSSFSSVHESRSSGICGSIVQGPISVSQWVCPPQGWFKINTDASCSSKYSLTGVGVVVRMASGRLYAAQMEVVPLVLAPLIAEARAVLVGLKLALVLGLVCVEVESDCLSLISMLTGSFISLHEEGTYVDEILELASHFTSVVFRHVRRGGNRPAHILASHAGVEGSMLWCASFPAWLTDVVGQDSFPASCNPCGDLIL
Homology
BLAST of Lcy01g006040 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 1.8e-08
Identity = 84/353 (23.80%), Postives = 135/353 (38.24%), Query Frame = 0

Query: 181 LELVQSILIGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQVSQSSDSSCWNLWWSYLW 240
           LEL   +L    GA D   W + + G ++V+S Y++ +   +V + + +S +N     LW
Sbjct: 238 LELRAVVLDLVTGARDRLSWKFSQDGQFSVRSAYEM-LTVDEVPRPNMASFFNC----LW 297

Query: 241 LQQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKRL 300
             ++P +VK F+W V    + +     +R ++A  +C +CK   E   H L  C     +
Sbjct: 298 KVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGI 357

Query: 301 WKTLL---SQNAYLNGSVL-------------PDIP-----VKAEWVG------------ 360
           W  ++    Q  + + S+               DIP         W G            
Sbjct: 358 WVRVVPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIPWSTIFAVIIWWGWKWRCGNIFGEN 417

Query: 361 -------KYLSSFS-SVHESRSSGICGSIVQGPIS-VSQWVCPPQGWFKINTDASCSSKY 420
                  K++  ++  V+ + S  +   I Q  +  +  WV P  GW K+NTD +     
Sbjct: 418 TKCRDRVKFVKEWAVEVYRAHSGNVLVGITQPRVERMIGWVSPCVGWVKVNTDGASRGNP 477

Query: 421 SLTGVGVVVRMASGRLYAAQMEVVPLVLAPLIAEARAVLVGLKLALVLGLVCVEVESDCL 480
            L   G V+R  +G         +    AP  AE   V  GL  A    +  VE+E D  
Sbjct: 478 GLASAGGVLRDCTGAWCGGFSLNIGRCSAPQ-AELWGVYYGLYFAWEKKVPRVELEVDSE 537

Query: 481 SLISMLTGSFISLHEEGTYVDEILELASHFTSVVFRHVRRGGNRPAHILASHA 492
            ++  L       H     V            V   HV R  NR A  LA++A
Sbjct: 538 VIVGFLKTGISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLANYA 584

BLAST of Lcy01g006040 vs. ExPASy Swiss-Prot
Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 4.9e-06
Identity = 40/164 (24.39%), Postives = 77/164 (46.95%), Query Frame = 0

Query: 25  IHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTRWFHLRASERKKRNDIHEICRDDGT 84
           I  + AEL  +   + +     SR  + +  ++  R       +++++N I  I  D G 
Sbjct: 340 ITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKGD 399

Query: 85  WATSESEVESIFLGYFQNICTTSNPSVVQQSAILN-HIPPIISPEMNAKLTAPFCKAEIE 144
             T  +E+++    Y++++      ++ +    L+ +  P ++ E    L  P   +EI 
Sbjct: 400 ITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEIV 459

Query: 145 RVVSQMFPTKAPGPDGFPAIFYQSYWDIVGAQTVASCLELVQSI 188
            +++ +   K+PGPDGF A FYQ Y +    + V   L+L QSI
Sbjct: 460 AIINSLPTKKSPGPDGFTAEFYQRYKE----ELVPFLLKLFQSI 499

BLAST of Lcy01g006040 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 50.1 bits (118), Expect = 9.2e-05
Identity = 25/95 (26.32%), Postives = 50/95 (52.63%), Query Frame = 0

Query: 75  IHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNHIP-PIISPEMNAKL 134
           I++I  + G   T   E+++    +++ + +T   ++ +    L+    P ++ +    L
Sbjct: 397 INKIRNEKGDITTDPEEIQNTIRSFYKRLYSTKLENLDEMDKFLDRYQVPKLNQDQVDHL 456

Query: 135 TAPFCKAEIERVVSQMFPTKAPGPDGFPAIFYQSY 169
            +P    EIE V++ +   K+PGPDGF A FYQ++
Sbjct: 457 NSPISPKEIEAVINSLPTKKSPGPDGFSAEFYQTF 491

BLAST of Lcy01g006040 vs. ExPASy TrEMBL
Match: A0A803QEQ9 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 8.2e-49
Identity = 165/622 (26.53%), Postives = 269/622 (43.25%), Query Frame = 0

Query: 1   MKEIQQKKQAIKDAYSVIPVDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTR 60
           +KE  Q   A++++ S  P  F+ + + E  L  LL  EE YWHQR+R +W+K GD NT+
Sbjct: 207 IKESHQHVSALQNSSSTDPQHFAALKNSELILDELLAKEEDYWHQRARISWMKSGDSNTK 266

Query: 61  WFHLRASERKKRNDIHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNH 120
           +FH RA+ R   N I ++  + G   TSE  +  I   YFQ+I  +        +AIL  
Sbjct: 267 FFHQRANARSINNRIKKLRDEAGNTQTSEPTLLDIIQTYFQSIFRSQGVHDHAINAILEV 326

Query: 121 IPPIISPEMNAKLTAPFCKAEIERVVSQMFPTKAPGPDG-FPAIFYQS------------ 180
           IP  I+ +    ++AP+ +A++   ++ M   K+PG DG  P++ ++S            
Sbjct: 327 IPTPINEQSGETISAPYTEADVFTALNSMAEDKSPGVDGACPSLTWRSIVWGKELLAKGL 386

Query: 181 ----------------------------------------------YWDI--VGAQTVAS 240
                                                          WD+  + A    +
Sbjct: 387 RWRVGNGNRIICKSDPWLPGHTEFTPFNFIGRDNSLQVADLITQHRQWDLTAISANFGQA 446

Query: 241 CLELVQSILIGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQVSQSSDSSCWNLWWSYL 300
            ++ + SI +    ++D+ +W+    G Y VKSGY       +++    +     WW+  
Sbjct: 447 DIDRILSIPLAIYPSDDMLIWNGTNSGNYMVKSGYYFASSLAELNDPGSTFSSENWWTKF 506

Query: 301 WLQQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKR 360
           W   +P+K++IF+W+V  ++LP    L ++ IA  P C LCK   E  +HAL  C+RAK 
Sbjct: 507 WKLHLPSKLRIFVWKVYHNVLPVAAELNRKHIAESPFCPLCKMQRESINHALFLCSRAKE 566

Query: 361 LWK-TLLSQNAYLNGSVLPD--------------------------IPVKAEWVGK---- 420
           +W  + L  N  L  +  P+                              AE+ GK    
Sbjct: 567 VWSLSHLHLNFKLAATSTPEEFLLYASANSSTQEFELFLTICWSIWYERNAEYHGKLPKL 626

Query: 421 ----------YLSSFSSVHESRSSGICGSIVQG-PISVS-------QWVCPPQGWFKINT 480
                     YL  + S H S ++    ++    P++V+        W+ PP+G +K+NT
Sbjct: 627 AAAILVFATQYLIKYQSAHASTAASASPAVSNTLPVNVTPVASLVDPWIAPPEGKWKLNT 686

Query: 481 DASCSSKYSLTGVGVVVRMASGRLYAAQMEVVPLVLAPLIAEARAVLVGLKLALVLGLVC 512
           DA+C+    L G+G V+R ++G + AA  +           EA A+ + L+  L LGL  
Sbjct: 687 DAACNKSSKLIGIGAVLRDSNGYIKAALSKSFLGCFKAEEMEATALALTLQWLLSLGLTA 746

BLAST of Lcy01g006040 vs. ExPASy TrEMBL
Match: A0A803PUL2 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 5.3e-48
Identity = 166/668 (24.85%), Postives = 264/668 (39.52%), Query Frame = 0

Query: 1    MKEIQQKKQAIKDAYSVIPVDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTR 60
            +K+ Q++ + + ++ S     F  +   E+ L  LLE EE+YW QRSR +WL  GDRNT+
Sbjct: 723  IKKCQKQVEQLNNSSSHSSSHFDDLKQAESILDELLEQEEVYWQQRSRVDWLACGDRNTK 782

Query: 61   WFHLRASERKKRNDIHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNH 120
            +FH +AS R+  N I  +  + G  A+S +++ ++   ++ ++ T  N      +  L+ 
Sbjct: 783  YFHTKASARRTNNHIKFLFNNSGGKASSIADISTVVQDFYADLFTAGNIDECALAHTLDC 842

Query: 121  IPPIISPEMNAKLTAPFCKAEIERVVSQMFPTKAPGPDGFPAIFYQSYWDIVGAQTVASC 180
            IP +++   N  L APF  AE++  +  M   K+PG DG  A+FYQ +W IVG     + 
Sbjct: 843  IPTLVTNAHNDALLAPFTPAEVDSALKTMSLDKSPGIDGISAMFYQQHWSIVGDLVSHAV 902

Query: 181  LELVQS-ILIGRLGAEDIW----------------------------------------- 240
            L ++ + I +  L    +W                                         
Sbjct: 903  LNILNTGIFLTTLSWMQVWATPLSLTWQGIRWGRELLIKGLRWKIGEGRLIRSGFDPWIP 962

Query: 241  -----------------------------------------------------------L 300
                                                                       +
Sbjct: 963  RHTSFLPLTYSGPSNGVVANLITDERQWNATLLQQYFSPIDVDKILTLPLSYFPSRDKLI 1022

Query: 301  WHYDKRGVYTVKSGYKLRMLQGQVSQSSDSSCWNLWWSYLWLQQIPAKVKIFMWRVVLSI 360
            WH+   G +TV+S Y L         SS S+    WW   W  Q+  KVKIF WR +   
Sbjct: 1023 WHHHSSGNFTVQSAYHLATSLEDEDLSSTSTSTVAWWKLFWSLQVSQKVKIFPWRAIHDA 1082

Query: 361  LPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKRLWKTL--------------- 420
            LP  T+L++R I  D  CS+CK+  E T HAL +C  AK +W++L               
Sbjct: 1083 LPVATSLVRRKIITDSTCSICKQACESTGHALFSCKYAKAVWRSLGLSFNWSAAASMKNG 1142

Query: 421  --------------------------LSQNAYLNGSVLPDIPVKAEWVGKYLSSFSSVHE 480
                                      + +N  ++G         A +   +L +F +  +
Sbjct: 1143 DYVTHLSSMYNKTEMEQLFCTMWAIWIERNNIIHGKKARCAQNLAAFASVFLQNFRAAQQ 1202

Query: 481  SRSSGICGSIVQGPISVSQ----------WVCPPQGWFKINTDASCSSKYSLTGVGVVVR 516
               + I  +    P    Q          W  P  G FK+NTDA+       TG+G V+R
Sbjct: 1203 RSCAAIPVTTPATPALPHQPLPRLSASAAWHPPAAGTFKLNTDAAVDISTHKTGLGAVLR 1262

BLAST of Lcy01g006040 vs. ExPASy TrEMBL
Match: A0A803QDL0 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 4.2e-45
Identity = 147/500 (29.40%), Postives = 237/500 (47.40%), Query Frame = 0

Query: 25  IHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTRWFHLRASERKKRNDIHEICRDDGT 84
           +  L+++L  LL  EEIYW QRSR +WLK GD+NT++FH  AS+RKK N I  +  D+  
Sbjct: 193 LQCLQSQLDALLYKEEIYWKQRSRTHWLKAGDKNTKFFHRFASKRKKNNTIKFLKDDNQR 252

Query: 85  WATSESEVESIFLGYFQNICTTSNPSVVQQSAILNHIPPIISPEMNAKLTAPFCKAE--- 144
             +S S++ ++ + YF ++ ++          IL+ + P +     A L APF   E   
Sbjct: 253 IVSSHSDMSNLLVSYFNDLFSSPGSDADAVHLILDCLGPPLDDLDYAFLDAPFSTKEVGD 312

Query: 145 ---IERVVSQMFPTKAPG--PDGFPAIFYQSY-------WDIVGAQTVASCLELVQSIL- 204
              I  +     P        D  P   + SY       WD+       S   LV  IL 
Sbjct: 313 GSDIRTIEDHWIPNNRFKFFSDNLPPSPFLSYFITASGDWDVAKLTNCFSA-PLVDEILS 372

Query: 205 ---IGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQVSQSSDSSCWNLWWSYLWLQQIP 264
              +G  G +DI +W +   G +TVK+ Y L      +  SS       +WS +W  +IP
Sbjct: 373 VPVLGEFGKDDI-IWGHHSSGEFTVKTAYHLAFSSQDLPSSSSFFASKKFWSKIWNSKIP 432

Query: 265 AKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKRLWKTLL 324
            KVK+F+WR++ + LP   +L +R +   P+C LCK  PE   HAL+ C+R+++ WK+  
Sbjct: 433 PKVKVFIWRMLSNALPVSFSLNKRCVIDSPLCPLCKIHPETVKHALLDCSRSRKAWKS-- 492

Query: 325 SQNAYLNGSVLPDIPVKAEWVGKYLSSFSSVHESRSSGICGSIVQGPISVSQWVCPPQGW 384
           S+N   + ++        +W    ++ F    +S+   +         ++ Q   PP+G 
Sbjct: 493 SRNNIFHHNLCNQPLDVYDW---SVTFFFKYLDSQQDQVLPCASNSDQTILQQF-PPEGD 552

Query: 385 FKINTDASCSSKYSLTGVGVVVRMASGRLYAAQMEVVPLVLAPLIAEARAVLVGLKLALV 444
           F+I TDA+         +GV V   SG++ A   +     ++P++AEA+AV+  L+ A  
Sbjct: 553 FQIFTDAAIDLNRQKHSIGVAVLNHSGQVVAGLAKPFSGCVSPMVAEAKAVVHALQWAYS 612

Query: 445 LGLVCVEVESDCLSLISMLTGSFISLHEEGTYVDEILELASHFTSVVFRHVRRGGNRPAH 504
           + L    +++DC S++  L    +        +  I  L S   S+   HV R  N  AH
Sbjct: 613 ICLPVDVLKTDCKSIVDKLHHGTLGCSSVDDLIVCIKNLLSIRPSLRVAHVNREFNTIAH 672

BLAST of Lcy01g006040 vs. ExPASy TrEMBL
Match: A0A6J1DX30 (uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024874 PE=4 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 4.1e-40
Identity = 121/382 (31.68%), Postives = 177/382 (46.34%), Query Frame = 0

Query: 182  ELVQSILIGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQVSQSSDSSCWNLWWSYLWL 241
            +L+ S+ I     +D WLWHYDKRG Y+V+SGYKL M     + S+ ++     W+ +W 
Sbjct: 1078 DLILSMPISSYNLQDSWLWHYDKRGNYSVRSGYKLYMHLKCNATSASTNYRGTQWNSIWK 1137

Query: 242  QQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKRLW 301
              +P K+KIF+WR     +P+  NL+ RGI   P C++C    E   HA   C RA+++W
Sbjct: 1138 LTVPTKIKIFIWRSAHEHIPTAQNLLLRGIGELPACTICGDRRESIIHAFFHCKRARQIW 1197

Query: 302  KTLL------------------------------------------SQNAYLNGSVLPDI 361
            +TL                                            +N+ ++G  +  +
Sbjct: 1198 RTLFPFLTCLSAEDNISFLELWSSLTEQLEPKDLNLAAITGWGIWNDRNSLIHGKQVSPV 1257

Query: 362  PVKAEWVGKYLSSFSSVHESRSSGICGSIVQGPISVSQWVCPPQGWFKINTDASCSSKYS 421
              K EW+  +L S S    S  S    S    P+ V  W        K+NTDA+C  + +
Sbjct: 1258 EFKCEWLTPFLDSHSQAQMSNYSPRTQS-NHRPV-VQYWRPSSSVSLKLNTDAAC--RGA 1317

Query: 422  LTGVGVVVRMASGRLYAAQMEVVPLVLAPLIAEARAVLVGLKLALVLGLVCVEVESDCLS 481
             T  G ++R +S  L AA    VP  L+PL+AE R +L GLK A       +EVESD L 
Sbjct: 1318 STSFGCIIRDSSCSLVAATSIRVPFPLSPLLAEIRCILEGLKFAAASNFTHLEVESDSLL 1377

Query: 482  LISMLTGSFISLHEEGTYVDEILELASHFTSVVFRHVRRGGNRPAHILASH--AGVEGSM 520
             I ++     +  +E  +V EI  L   F  + F H  R  NR AH LA         + 
Sbjct: 1378 AIQLIRNEIHTRGDEQNWVMEIQALTCCFAFISFSHSSRQCNRAAHGLAKWGITSPSATY 1437

BLAST of Lcy01g006040 vs. ExPASy TrEMBL
Match: A0A6J1DX30 (uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024874 PE=4 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 4.4e-26
Identity = 77/201 (38.31%), Postives = 107/201 (53.23%), Query Frame = 0

Query: 2   KEIQQKKQAIKDAYS-VIPVDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTR 61
           K+I+ +K AI DAY+  +P+DF+IIH+LE +LA LLE EEI+W QRSRE+WLKWG     
Sbjct: 504 KQIKAQKAAIIDAYNQPLPLDFTIIHALENDLAGLLELEEIFWKQRSREDWLKWG----- 563

Query: 62  WFHLRASERKKRNDIHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNH 121
                                                         +  + +   AI+N 
Sbjct: 564 ---------------------------------------------IAILNALDIEAIINL 623

Query: 122 IPPIISPEMNAKLTAPFCKAEIERVVSQMFPTKAPGPDGFPAIFYQSYWDIVGAQTVASC 181
           IP  I+ E+N +L AP+ K EIE  + QMFPTKA GPDGFPA+FYQ+YW +VG +T+ +C
Sbjct: 624 IPTRITSEVNEQLLAPYTKEEIELAIRQMFPTKALGPDGFPALFYQTYWHVVGPKTLEAC 647

Query: 182 LELVQSILIGRLGAEDIWLWH 202
           L  + +        +DI  W+
Sbjct: 684 LNALNN-------GDDIKKWN 647


HSP 2 Score: 165.2 bits (417), Expect = 7.2e-37
Identity = 117/435 (26.90%), Postives = 188/435 (43.22%), Query Frame = 0

Query: 2   KEIQQKKQAIKDA-------YSVIPVDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKW 61
           K++Q+K++ +K A        S +P+ F     L  E++ LL  EE  W Q SR  WLK 
Sbjct: 250 KQLQEKRKELKIAEEKAMQGESSVPITF-----LRNEVSMLLAKEERMWRQCSRSQWLKH 309

Query: 62  GDRNTRWFHLRASERKKRNDIHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQ 121
           G RNT +FH RA+ R++RN I  +   DG W T+ ++V+++   YFQNI  TSNPS +  
Sbjct: 310 GGRNTNFFHSRATHRQRRNSIVGLRDSDGEWRTNPNQVQNMLTSYFQNIFLTSNPSSI-- 369

Query: 122 SAILNHIPPIISPEMNAKLTAPFCKAEIERVVSQMFPTKAPGPDGFPAIFYQSYWDIVGA 181
             +L + P  I   MN  L+ P+   ++E  + QM P  APGPDGFP +FYQ++W ++G 
Sbjct: 370 DTVLQYAPTTIIDSMNEALSKPYTTTKVETALKQMAPLTAPGPDGFPLVFYQNHWHLIGE 429

Query: 182 QTVASCLELVQS-----ILIGRLGAEDIW-LWHYDKRGVYTVKSG---------YKLRML 241
             +   L  + S          L A+ +W L H     +Y V            Y   ++
Sbjct: 430 DVIRGVLSSLNSEKDLQNFNDALLAKQVWRLMHNTNTLLYAVFRAKFFPHDNVLYAKDLI 489

Query: 242 QGQVS------------------------------------QSSDSSCWNLWWSYLWLQQ 301
           +G  +                                      S+S+     W  +W   
Sbjct: 490 RGSYAWTSICQATHVIMKGMVWHIGDVHIISFTKEETNTRPSPSNSNAQKSLWKKVWSIH 549

Query: 302 IPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKRLWKT 361
           IP KVK F+WR     LP+  NL +R +  +  C  C++  E   HA+  C    ++W  
Sbjct: 550 IPPKVKNFLWRACSESLPTKLNLWKRKVLRNAWCERCEEEVEDIGHAVWHCVVNSQVW-- 609

Query: 362 LLSQNAYLNGSVLPDIPVKAEWVGKYLSSFSSVHESRSSGIC--GSIVQGPISVSQWVC- 376
                             + EW  ++  +  S  +  +  +    + V G  ++  W+  
Sbjct: 610 -----------------ARKEWTHRFNDNLGSFGDLATKMLMEESNEVAGRFAIISWLLW 658

BLAST of Lcy01g006040 vs. NCBI nr
Match: KAF7153144.1 (hypothetical protein RHSIM_Rhsim01G0167200 [Rhododendron simsii])

HSP 1 Score: 176.8 bits (447), Expect = 4.9e-40
Identity = 156/594 (26.26%), Postives = 238/594 (40.07%), Query Frame = 0

Query: 20  VDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTRWFHLRASERKKRNDIHEIC 79
           V   I  ++E  LAR    EE+Y HQRSR NW   GDRN+ +FH    +R++RN I  + 
Sbjct: 52  VQAGIQEAIEVVLAR----EEMYLHQRSRVNWFNHGDRNSSFFHASMIQRRQRNQILRLK 111

Query: 80  RDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNHIPPIISPEMNAKLTAPFCK 139
             +G W  ++  ++     YF +I     P     +++L H+P  I+ EMN  L      
Sbjct: 112 TANGNWKDTDDGIDQEIASYFSSIFHDDGPR--DMASVLAHVPLSITEEMNTLLLRSVED 171

Query: 140 AEIERVVSQMFPTKAPGPDGFPAIFYQSYWDIVGAQTVASCLEL---------------- 199
            EI+    QM   KAPG DGFP +F+Q +W+++   T A+                    
Sbjct: 172 HEIKAATFQMGALKAPGSDGFPGLFFQQFWEVIKGDTCAAIKSFFSGNYLLKKLNHTNIV 231

Query: 200 ---------------------------------------------------------VQS 259
                                                                    +  
Sbjct: 232 LVPKIPHPEALPHFKPISLCNFSVKIISKVVDFINTTSGAWSIPKLKQVLSEEEALSISC 291

Query: 260 ILIGRLGAEDIWLWHYDKRGVYTVKSGYK-------LRMLQGQVSQSSDSSCWNLWWSYL 319
           I I + G ED  LW +   G Y+VKSGY        L  L+   S  +  S    +W +L
Sbjct: 292 IPISKTGTEDSMLWDFTTNGKYSVKSGYHKTWNDLILSKLEKPTSSINPPSA---FWKFL 351

Query: 320 WLQQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKR 379
           W   IP K+K F W+V  + L +  NL++R  A  PMC  C K  E  +H L  C  AK+
Sbjct: 352 WNLSIPPKLKHFWWKVCRNRLATKENLVRRNCANTPMCPRCGKHSESIEHLLCHCKWAKK 411

Query: 380 LW-KTLLSQNAY---LNGSVLPDIPVKAE-------------WVGKYLSSFSSVHESRSS 439
           +W K+ +S   +   +N ++   I  K E             W+G ++   +S    + S
Sbjct: 412 VWFKSPISGGFFPNNINSALSWSIAAKEEMDKGARVCLHMERWLGLWVRGVNSRLPKKLS 471

Query: 440 GICGSIVQGPISVSQW-VCPPQGWFKINTDASCSSKYSLTGVGVVVRMASGRLYAAQMEV 499
                   G   + ++   PP    KIN DAS       +  G+++R + G L   +   
Sbjct: 472 RFMLLPPTGHPKLPKFGQPPPPDMLKINYDASWIDSAQKSWGGIILRNSRGCLLDGRRFC 531

Query: 500 VPLVLAPLIAEARAVLVGLKLALVLGLVCVEVESDCLSLISMLTGSFISLHEEGTYVDEI 516
           +    A  IAEA         A  L L  V +ESDC SLIS+     +   E    + +I
Sbjct: 532 ISANSA-FIAEAFIFREACLFAKALNLQNVSIESDCASLISLSVSELVPPWEVLALITDI 591

BLAST of Lcy01g006040 vs. NCBI nr
Match: XP_022158377.1 (uncharacterized protein LOC111024874 [Momordica charantia])

HSP 1 Score: 176.0 bits (445), Expect = 8.4e-40
Identity = 121/382 (31.68%), Postives = 177/382 (46.34%), Query Frame = 0

Query: 182  ELVQSILIGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQVSQSSDSSCWNLWWSYLWL 241
            +L+ S+ I     +D WLWHYDKRG Y+V+SGYKL M     + S+ ++     W+ +W 
Sbjct: 1078 DLILSMPISSYNLQDSWLWHYDKRGNYSVRSGYKLYMHLKCNATSASTNYRGTQWNSIWK 1137

Query: 242  QQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKRLW 301
              +P K+KIF+WR     +P+  NL+ RGI   P C++C    E   HA   C RA+++W
Sbjct: 1138 LTVPTKIKIFIWRSAHEHIPTAQNLLLRGIGELPACTICGDRRESIIHAFFHCKRARQIW 1197

Query: 302  KTLL------------------------------------------SQNAYLNGSVLPDI 361
            +TL                                            +N+ ++G  +  +
Sbjct: 1198 RTLFPFLTCLSAEDNISFLELWSSLTEQLEPKDLNLAAITGWGIWNDRNSLIHGKQVSPV 1257

Query: 362  PVKAEWVGKYLSSFSSVHESRSSGICGSIVQGPISVSQWVCPPQGWFKINTDASCSSKYS 421
              K EW+  +L S S    S  S    S    P+ V  W        K+NTDA+C  + +
Sbjct: 1258 EFKCEWLTPFLDSHSQAQMSNYSPRTQS-NHRPV-VQYWRPSSSVSLKLNTDAAC--RGA 1317

Query: 422  LTGVGVVVRMASGRLYAAQMEVVPLVLAPLIAEARAVLVGLKLALVLGLVCVEVESDCLS 481
             T  G ++R +S  L AA    VP  L+PL+AE R +L GLK A       +EVESD L 
Sbjct: 1318 STSFGCIIRDSSCSLVAATSIRVPFPLSPLLAEIRCILEGLKFAAASNFTHLEVESDSLL 1377

Query: 482  LISMLTGSFISLHEEGTYVDEILELASHFTSVVFRHVRRGGNRPAHILASH--AGVEGSM 520
             I ++     +  +E  +V EI  L   F  + F H  R  NR AH LA         + 
Sbjct: 1378 AIQLIRNEIHTRGDEQNWVMEIQALTCCFAFISFSHSSRQCNRAAHGLAKWGITSPSATY 1437

BLAST of Lcy01g006040 vs. NCBI nr
Match: XP_022158377.1 (uncharacterized protein LOC111024874 [Momordica charantia])

HSP 1 Score: 129.4 bits (324), Expect = 9.1e-26
Identity = 77/201 (38.31%), Postives = 107/201 (53.23%), Query Frame = 0

Query: 2   KEIQQKKQAIKDAYS-VIPVDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTR 61
           K+I+ +K AI DAY+  +P+DF+IIH+LE +LA LLE EEI+W QRSRE+WLKWG     
Sbjct: 504 KQIKAQKAAIIDAYNQPLPLDFTIIHALENDLAGLLELEEIFWKQRSREDWLKWG----- 563

Query: 62  WFHLRASERKKRNDIHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNH 121
                                                         +  + +   AI+N 
Sbjct: 564 ---------------------------------------------IAILNALDIEAIINL 623

Query: 122 IPPIISPEMNAKLTAPFCKAEIERVVSQMFPTKAPGPDGFPAIFYQSYWDIVGAQTVASC 181
           IP  I+ E+N +L AP+ K EIE  + QMFPTKA GPDGFPA+FYQ+YW +VG +T+ +C
Sbjct: 624 IPTRITSEVNEQLLAPYTKEEIELAIRQMFPTKALGPDGFPALFYQTYWHVVGPKTLEAC 647

Query: 182 LELVQSILIGRLGAEDIWLWH 202
           L  + +        +DI  W+
Sbjct: 684 LNALNN-------GDDIKKWN 647


HSP 2 Score: 162.5 bits (410), Expect = 9.7e-36
Identity = 155/532 (29.14%), Postives = 226/532 (42.48%), Query Frame = 0

Query: 15  YSVIPVDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTRWFHLRASERKKRND 74
           +S I  +   I +L  +L ++   EE+YWHQRSR  WLK GD+NT+    + S       
Sbjct: 262 FSSIMKETCRIAALAVKLDQVTSAEELYWHQRSRVQWLKSGDQNTKISIQQPS------- 321

Query: 75  IHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNHIPPIISPEMNAKLT 134
                            V   F   F  +                 +P  +SP M + L 
Sbjct: 322 ---------------IIVAETFCPVFARLM----------------VPTRLSPNMLSSLN 381

Query: 135 APFCKAEIERVVSQMFPTKAPGPDGFPAIFYQSYWDIVGAQTVASCLELVQ------SIL 194
            PF +AEI   +SQ+ P KAPGP+G+PA F+QS W  +G+  V    +L+Q      +I 
Sbjct: 382 CPFSEAEISSALSQIGPLKAPGPNGYPAQFFQSEWSTIGSDIVQVIEQLLQEREAILAIH 441

Query: 195 IGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQ--------VSQSSDSSCWNLWWSYLW 254
             R G ED W+WH+D +G + VK  Y++    G          S   DS    + W  +W
Sbjct: 442 PSRRGVEDKWVWHFDPKGRFYVKLAYQVATNGGSSPIIGIGATSVPPDS----VLWKSIW 501

Query: 255 LQQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVTCARAKR- 314
             +IP K++ F+WR   + L + + L +R IA    C  C    E  +H L+ C  A+  
Sbjct: 502 QAKIPRKIQFFLWRGCSNALVAGSILARRHIAMGVSCPWCGDIDESIEHCLMLCNFARAV 561

Query: 315 LWKTLLSQNAYLNGSV------------LPDIPVKAEWVGKYLSSFSSVHESRSSGICGS 374
           L+ +LL+     N S             L   P K   V    S    + ++ +  +  S
Sbjct: 562 LFSSLLAIYHQPNASQSFREWWLFLVNRLQTQPDKDTLVLNLASILWYIWKACNEKLFSS 621

Query: 375 IVQGPISVSQ---------------------------WVCPPQGWFKINTDASCSSKYSL 434
           +   P S++                            WV PP G  K+N D +     S+
Sbjct: 622 LSSSPRSIAARASAYAAEVQSVWSPTHAISPHPVLHFWVPPPLGLIKLNCDGAFFVGESI 681

Query: 435 TGVGVVVRMASGRLYAAQMEVVPLVLAPLIAEARAVLVGLKLALVLGLVCVEVESDCLSL 490
            G GVV R + G +   +++VV    A ++AEA+A++ G+ LA   G   + VESD LSL
Sbjct: 682 GGAGVVGRDSVGSVLDFRVQVVRCASA-IMAEAKAIVFGISLAGERGWSNIVVESDSLSL 741

BLAST of Lcy01g006040 vs. NCBI nr
Match: KAG7599644.1 (hypothetical protein ISN44_As06g038180 [Arabidopsis suecica])

HSP 1 Score: 162.5 bits (410), Expect = 9.7e-36
Identity = 98/318 (30.82%), Postives = 159/318 (50.00%), Query Frame = 0

Query: 2   KEIQQKKQAIKDAYSVIPVDFSIIHSLEAELARLLEDEEIYWHQRSRENWLKWGDRNTRW 61
           K+IQQ +  ++  Y+   +D++ I  ++ +L    + EE YW  +SR  WL  GD+NTR+
Sbjct: 550 KKIQQLQMVLQRLYNSSYLDYNSISEVKLQLQHEYQMEEEYWRTKSRIQWLYLGDKNTRY 609

Query: 62  FHLRASERKKRNDIHEICRDDGTWATSESEVESIFLGYFQNICTTSNPSVVQQSAILNHI 121
           FH R  +R+  N I  +  ++G    SE E+  I   YFQ I T+S   + Q   +L HI
Sbjct: 610 FHERTKQRRSHNRITSLQDEEGNIRNSEEEIYKIIHSYFQQIYTSS--GMQQLEGVLQHI 669

Query: 122 PPIISPEMNAKLTAPFCKAEIERVVSQMFPTKAPGPDGFPAIFYQSYWDIVGAQTVA--- 181
            P ++PE+ +KL  P  + EI + ++ M   KAPGPDGF A FY+ +WD + +  +    
Sbjct: 670 QPKVTPEIKSKLLEPVTEDEIFQALTHMNADKAPGPDGFNAGFYKYHWDTIKSGLMVPNT 729

Query: 182 --------------SCLELVQSILIGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQVS 241
                           +++++ I    + + D+  W + + G YTVKSGY  ++ +   +
Sbjct: 730 NLWDEAKLQAYIHPEDIKIIKKIRPQVVKSPDMPTWIHTRDGQYTVKSGYH-QLTKPPSA 789

Query: 242 QSSDSSCWNLWWSYLWLQQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFP 301
             SDS   N     +W   IP KVK   W+++ + LP    L +R +   P C  C +  
Sbjct: 790 DISDSLRVNNLCKSIWSLNIPPKVKHLWWKIIHNALPIAEALGRRRLRISPECLFCGEAC 849

Query: 302 EITDHALVTCARAKRLWK 303
           E   H    C  A  +W+
Sbjct: 850 ESIYHLFFQCRVANEIWE 864

BLAST of Lcy01g006040 vs. NCBI nr
Match: OMP06477.1 (hypothetical protein CCACVL1_01552 [Corchorus capsularis])

HSP 1 Score: 161.8 bits (408), Expect = 1.6e-35
Identity = 168/691 (24.31%), Postives = 247/691 (35.75%), Query Frame = 0

Query: 31   ELARLLEDEEIYWHQRSRENWLKWGDRNTRWFHLRASERKKRNDIHEICRDDGTWATSES 90
            +L +LL  EE+ W QR + +WLK  DRNTR+FH  AS RK++  I  I  D     T ++
Sbjct: 710  DLDKLLHQEELLWRQRPKTHWLKVRDRNTRFFHAVASSRKQKKQILSIKEDARNTHTEQT 769

Query: 91   EVESIFLGYFQNICTTSNPSVVQQSAILNHIPPIISPEMNAKLTAPFCKAEIERVVSQMF 150
             + S F  YF+ + TTSNP+      +L H+   ++ +M  +L  PF   EI+    QM 
Sbjct: 770  GIMSTFTNYFKGVFTTSNPTQAAIHEVLQHMECRVTEQMQIQLEQPFTAREIQHAAFQMG 829

Query: 151  PTKAPGPDGFPAIFYQSYWDIVGAQTV--------------------------------- 210
             +KAPGPDG   +F+Q  W +VG   V                                 
Sbjct: 830  GSKAPGPDGMSPLFFQKCWSVVGKDVVNYALKFLNNNESLPDVNHTNVVLIPKIDDPKLA 889

Query: 211  ----------------------ASCLELVQ------------------------------ 270
                                   SC E++                               
Sbjct: 890  KDFRPISLCNVIFRIVSKALANRSCEEVISLLDMFEAASGQKININKSAVLFSANTTSGV 949

Query: 271  ----------------------SILIG--------------------------------- 330
                                   I+IG                                 
Sbjct: 950  KDELMNFLGVQRVLDNDKYLGLPIMIGRSKCREFRFLKDRLQKRINAWNSKLFSKAGKAV 1009

Query: 331  ---------------------------------------RLGAEDIWLWHYDKRGVYTVK 390
                                                   R   ED  +W+    G +TV 
Sbjct: 1010 MIQAVAQATPVYLMSVFLFPKSFLQELNAMIARFCLVVPRQNEEDRLIWNGTMLGEFTVC 1069

Query: 391  SGYKL-RMLQGQVSQSSDSSCWNLWWSYLWLQQIPAKVKIFMWRVVLSILPSMTNLIQRG 450
            S Y + R + G+  Q       +  W Y+W   I  K++ FMWR+V +ILP+ +NL +RG
Sbjct: 1070 SAYHVARRVIGR--QELPLQLRSPIWRYIWSAGIMPKIQYFMWRLVWNILPTKSNLNKRG 1129

Query: 451  IAADPMCSLCKKFPEITDHALVTCARAKRLWKT----LLS--QNAYLNGSVLPDIPVKAE 505
            +     C +C        H    C  +K +W+     +LS  +   LNG+       KA+
Sbjct: 1130 MEIAGTCEVCGGEESADAHVFFNCHLSKLVWEDACPWVLSCIEQWDLNGNFWEFFLEKAK 1189

BLAST of Lcy01g006040 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 93.6 bits (231), Expect = 5.1e-19
Identity = 99/393 (25.19%), Postives = 152/393 (38.68%), Query Frame = 0

Query: 184 VQSILIGRLGAEDIWLWHYDKRGVYTVKSGYKLRMLQGQV----------SQSSDSSCWN 243
           +  I + +    D  +W+Y+  G YTV+SGY L                 S    +  WN
Sbjct: 105 IHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTRIWN 164

Query: 244 LWWSYLWLQQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALVT 303
           L         I  K+K F+WR +   L +   L  RG+  DP C  C +  E  +HAL T
Sbjct: 165 L--------PIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFT 224

Query: 304 CARAKRLWK---TLLSQNAYLNG----------------------SVLPDIPVKAEWVGK 363
           C  A   W+   + L +N  ++                        +LP   +   W  +
Sbjct: 225 CPFATMAWRLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKAR 284

Query: 364 YLSSFSSVHESRSSGICGS-----------------------IVQGPISVSQWVCPPQGW 423
               F+   ES S  +  +                       I +  I   +W  PP  +
Sbjct: 285 NNVVFNKFRESPSKTVLSAKAETHDWLNATQSHKKTPSPTRQIAENKI---EWRNPPATY 344

Query: 424 FKINTDASCS-SKYSLTGVGVVVRMASGRLYAAQMEVVPLVLAPLIAEARAVLVGLKLAL 483
            K N DA     K   TG G ++R   G   +     +     PL AE +A+L  L+   
Sbjct: 345 VKCNFDAGFDVQKLEATG-GWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQTW 404

Query: 484 VLGLVCVEVESDCLSLISMLTGSFISLHEE-GTYVDEILELASHFTSVVFRHVRRGGNRP 516
           + G   V +E DC +LI+++ G  IS H     ++++I   A+ F S+ F  +RR GN+ 
Sbjct: 405 IRGYTQVFMEGDCQTLINLING--ISFHSSLANHLEDISFWANKFASIQFGFIRRKGNKL 464

BLAST of Lcy01g006040 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 65.1 bits (157), Expect = 2.0e-10
Identity = 47/163 (28.83%), Postives = 80/163 (49.08%), Query Frame = 0

Query: 40  EIYWHQRSRENWLKWGDRNTRWFHLRASERKKRNDIHEICRDDGTWATSESEVESIFLGY 99
           E ++ Q+SR  WL+ GD NTR+FH      + +N I  +  DD     + ++V+ + + Y
Sbjct: 432 ESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAY 491

Query: 100 FQNICTTSNPSVVQQSA--ILNHIPPIISPEMNAKLTAPFCKAEIERVVSQMFPTKAPGP 159
           + ++  + +  +   S   I +  P   +  + ++L+A     EI   V  M   KAPGP
Sbjct: 492 YTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPGP 551

Query: 160 DGFPAIFYQSYWDIVGAQTVASCLELVQS-ILIGRLGAEDIWL 200
           D F A F+   W +V   T+A+  E  ++  L+ R  A  I L
Sbjct: 552 DSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAITL 594

BLAST of Lcy01g006040 vs. TAIR 10
Match: AT3G26855.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 54.3 bits (129), Expect = 3.5e-07
Identity = 34/115 (29.57%), Postives = 56/115 (48.70%), Query Frame = 0

Query: 233 NLWWSYLWLQQIPAKVKIFMWRVVLSILPSMTNLIQRGIAADPMCSLCKKFPEITDHALV 292
           N W   +W  +I  K+K+ +W+ + + LP    L+ R I+ +P C+ C+ F  IT H L 
Sbjct: 3   NNWIGDIWSLKISPKIKLLIWKALNNALPVGAQLLSRNISIEPFCTRCRDFETIT-HILF 62

Query: 293 TCARAKR--LWKTLLS----QNAYLNGSVLPDIPVKAEWVGKYLSSFSSVHESRS 342
            C  A+R  + K+++     Q A   GS+L   P          S F  +H+ R+
Sbjct: 63  NCPFAQREVIMKSIIDAKEWQTAQQMGSMLGSTP----------SLFPRIHDERT 106

BLAST of Lcy01g006040 vs. TAIR 10
Match: AT2G13980.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 52.8 bits (125), Expect = 1.0e-06
Identity = 49/167 (29.34%), Postives = 78/167 (46.71%), Query Frame = 0

Query: 343 GICGSIVQGPISVSQWVCPPQGWFKINTDASCSSKYSLTGVGVVVRMASGRLYAAQMEVV 402
           G C ++V      ++W  PP    K N DA  + +   +    ++R   G         +
Sbjct: 2   GWCITVVHN----TKWKAPPDTHIKCNYDAGFNVQNLDSTARWIIRNHDGIAQHWGSLQL 61

Query: 403 PLVLAPLIAEARAVLVGLKLALVLGLVCVEVESDCLSLISMLTGSFISLHEEGTYVDEIL 462
                PL AE +A+L  L+   + G + V +E DC +L ++++GS  S +     +D+I 
Sbjct: 62  DNTSTPLEAETKALLAALQQTWIRGYLRVIMEGDCETLTNLVSGS-SSHNHLANLLDDIR 121

Query: 463 ELASHFTSVVFRHVRRGGNRPAHILASHAGVEGSMLWCAS--FPAWL 508
             A  F++V F  VRRGGN+ AH LA   G   +  +  S   P WL
Sbjct: 122 IWAKKFSNVQFSFVRRGGNKVAHELAK-LGCNSTCFYSDSCTLPIWL 162

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P0C2F61.8e-0823.80Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... [more]
O003704.9e-0624.39LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1[more]
P113699.2e-0526.32LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE... [more]
Match NameE-valueIdentityDescription
A0A803QEQ98.2e-4926.53Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803PUL25.3e-4824.85Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803QDL04.2e-4529.40Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A6J1DX304.1e-4031.68uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
A0A6J1DX304.4e-2638.31uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
Match NameE-valueIdentityDescription
KAF7153144.14.9e-4026.26hypothetical protein RHSIM_Rhsim01G0167200 [Rhododendron simsii][more]
XP_022158377.18.4e-4031.68uncharacterized protein LOC111024874 [Momordica charantia][more]
XP_022158377.19.1e-2638.31uncharacterized protein LOC111024874 [Momordica charantia][more]
KAG7599644.19.7e-3630.82hypothetical protein ISN44_As06g038180 [Arabidopsis suecica][more]
OMP06477.11.6e-3524.31hypothetical protein CCACVL1_01552 [Corchorus capsularis][more]
Match NameE-valueIdentityDescription
AT3G09510.15.1e-1925.19Ribonuclease H-like superfamily protein [more]
AT1G43760.12.0e-1028.83DNAse I-like superfamily protein [more]
AT3G26855.13.5e-0729.57RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT2G13980.11.0e-0629.34Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (P93075) v1
Date Performed: 2021-12-06
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 527..528
NoneNo IPR availablePANTHERPTHR46736:SF6SUBFAMILY NOT NAMEDcoord: 82..433
NoneNo IPR availablePANTHERPTHR46736FAMILY NOT NAMEDcoord: 82..433
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 365..492
e-value: 8.3E-16
score: 60.2
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 369..491
e-value: 5.9E-25
score: 87.5
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 208..301
e-value: 3.7E-17
score: 62.8
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 368..488
e-value: 1.52524E-22
score: 90.8364
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 364..491

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lcy01g006040.1Lcy01g006040.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity