Lag0038384 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0038384
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Retrotransposon protein
Location: chr2: 16471809 .. 16476428 (+)
RNA-Seq Expression: Lag0038384
Synteny: Lag0038384
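
The location field of this record can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page). The helper name `parse_location` is hypothetical and is not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not confirmed
    by the source record).
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("chr2: 16471809 .. 16476428 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 4620 bp on the plus strand of chr2.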
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCTTAACTCTGTGTTACATTTACATGAGTTATTACTTAAACAACCTGAGCCAGTTCATTCCAACTGCTTGGACGAAAAGTGGAAGTGGTTTAAGGTAAAATTTTGAAGGACTTTAATCATGTTTAAATCTCAATAAGTGGTCTATATTGAATGTTCTCTTAATTAATTTAAGTGTTCTTTGTATTTAGAATTGTCTAGGTGCATTAGATGGAACCTACATTAAAGTCAACGTAGGTATTCTTGATAGACCTAGGTACCGAACAAGGAAGAATGAAATTGCTACCAATGTGTTAGGAGTTTGCTCCCAAGATATGCAATTCATCTATGTTTTACCTGGATGTGAAGGTTCGGCTTCTGACTCGAGAGTTTTGCGAGATGCTGTATCTAGGAGGAATGGATTAAAAATTCCAAAAGGTAACATGTATAATGTTTGGATGGGTAGATATGCAATTCACCTATGCTTTATGTACATTCTCAAGTATTCAATTGATGTATTTAGGTTGTTACTATCTATATGATGCTGGCTATACAAATGGTGAAGGATTTTTGGCACCTTACCGAGGACAACGATATCATTTAAATGATTGGAAGCAAGGATATCAACCAAGAACTCCAGGTGAATTTTTTAATATGAAACATTCCTCGGCAAGAAATGTAATAGAACGAGCATTTGGACTATTAAAAATTTGATGGGCTATTCTTCGAGTGAAATCTTATTATCCAGTAAAAGTTCAATGTCGAATGACCACCGCTTGTTGCCTCATTCATAATCTTATAAGAAGAGAAATGCCTGTAGATCCTTTAGAACAAGAAGTTGGAGACACTCATTCAAGTACTTTTGACGTAGATAGTGATCGAATAGCACATGTTGAAAGTTCAAGCGAATGGACGAATTGGAGAGATAATTTAGCAATTGAGATGTTTGAGGAATGGAGAGGCATTAGACATTAGTATTGTTATCTAGGGAGTGTTTGATTGATGATGTAAACATTCATGTTTTTGCTTTTTTAGATCTTATAAGACTTTATGAAATATTATGAATATTAGTTTGCTACTTATTTCATATTTATGATATCTATTCATGTATGTTTTCTCTCATATTATGTTATATAAATTTGAGTTTTAATAGTTTAATACATTATGTGAATCATAGATATGGATGACAACGAACCATCTAAAGCTGGTTCAAGTAGGAAACGCATGTGGAGTAAGGCCGAAGATGAGAAATTAGTGGAATGTTTACTGGAGCTATCTAATATTGGCACTTGGAAAGCTGACAATGGTACTTTTAAACCAGGATTTCTCATTCAAATAGAAAAATGGATAGTTGAAAAAATTCCTATGTGCGATCTTAAGGCTCAACCACACATAGAGTCTAGAGTTAAAATATTGAAGAAGCAATACAATGCAATAGCTGAAATGTTAGGCCCAAATTGTAGTGGCTTTGGATGGAATGATAGAGACAAGTGCATAGAAGCAGAAAAACATATATTTGATGAATGGGTGAAGGTGAGTAAAAAGAATTTTTTATTTAATAGTTTATGATATAAATATCAAAATGAATTAAAATGGATTTAATTTGTTTTTACAGAGTCATCCAACTGCAAAGGGACTCAGAAACAAGCCTTTTCCATTCTTTGAACAATTGGCCATTGTTTTTGGAAAAGATAGGGCCAATGGACTTGGTGCAGAAGGCCAACTGATATGTTTGAAGCAGTGGAACGAGAAATGACCGACAATGAATTTTGGAGAGGAGACAGCTCTTATGTGGCAATAGTTGGAAGGGAAGAAGTGGATGAACGAACCTCAATGAGTGAAGCACCAATGAATACACAATCTACTGCACATACATCAAGTAGGTCTAGTAAGAAAAGAGCAAAGAGTGTGGACCCATTGGTAGCGGCAGTGAATGGACTTGAGAATGTTATGAGCAGTCATCTTTCAAATGCTAATGAAAATATTCAAGAGATTGCTTTGTTTTATCGACAAGTGGCT
GAACGAGAATCTACAAGAGAGGAACGTCGAAACTCATTAGTTAGTGAAATTAGAAAGGTGGATGGATTGAGTGTACGACAAAGAGTTCGAGCTGGTAGGCTTATCACCAAAGATCAATCCCAGATTGATTACTTTTTTAATCTTCCAGCTGATGAAAGGTATGAATTTCTAATGGAAGTTTTAGGAGAGAATACTGATCTTTAGTCTATGTGTGAGTGTTGATTCTTCAATGTAATAGGAGACCTTATAACATGTTATGTATGGACATTTAGATTTTTTGGACCCTAATTAGATGATGTATCATCTTTATTTTGAGTTAGCATGTCATTGACATGCTATGGACGAACATTTATATTTTTTGTGCCCTATCAAATGATGTAACATTCTTATTTTGGGTTAGCATGTTATTGACATGCTGTGGACGAACATTTATATTTTGTGTGTCTTATCAGATGATTAAATATCCTTGTTTTGAATTAGCTTGTCACTATATCTTGTATTTGTATGGTTCACTGTGAAGATAAATGTTCATTTTGTGTATGTTTAGTACACTTTGATATTGATTATATTTTATTTTTTAAATATAATATTTATCATTAATATTTACATGATTGTTGCACTAATTAATTATGATTAAAAGTTTATATAATTGTATTAAATACATTTCTTTGAGGATAATTATTAATTCATTGCTAAATAAAATAAACCGTCTAAAGTTCAAGGTATATTATTTATAATAATTTACTAATGAATGACATTTTAATTGCTTAAAATGATTTTAATAATTTGAGTTAATAAAAATAAATTAACTAACAACAATTAATTATTTAAAATTAACTGATATAATTTTATTTTTTGAAACGCACTAAGGTAAATTAATTGTTGGAAGTTAATTTTGATTAATTTTAATGTTTTATAATTGCATTAAATTGTTGAAAGTAATTGCATGAACCAAGTAAAAATATCATTATTTCCAAAAAAAAAAAAAGTAAAAATATCATTTATTTATAGGCAAATCTGACAACATTCCATGCAACACCAAACACAAACATCATTCATTCCCAGGCAATGTGATGACATTCCAGACACAACCAAACACAGTTATCTTTGATTCTCAGGCATATTATTCCCAGACATATTATTCCAGGCATCCTTTATTCCCAGACATCTGTAAAACTTCAAACCAAACGAGCTCTTAGGGTTTAACTTAAAAAAGGCCAACAATGGGAATATGAGGTGGAAGGACAAACATGGAGAAAAATGTAGGCAAAATTAGCATAAAATCTAGTTTCGCCATATTTAAAATTCATGATACATTTAGTCATTTTATTCAAAATCACTCTCTAAATATGGACATCTAGTTTTTTCATTAATTACTTTAACATTTCATCAAATCTAGCGGAAAAAATGTAGGCAAAATTAGCATAAAATCTAGGTGGGAAATTTTTCATCAATTGTTTTCCTTTAGGTTCTGTTTCACATAAGTAGAAAGTTTCAAGCTTGGTCAACTTAAAATCTTCAAATAAACTCATAATTTATCGTTGAACATTAATTTTTCAATCCATCAAAACATTAAACATTATTTTAATTCTCAAATGATTAATCCTACAAATTTAATCCATACCCAAGTTCTTTAGGTATCAAAACACCACATCCAAAAATAAAAAACCTAGAATTTAGTTCAATTATTGCTTAGTTTTATAAAACAACAATGTAATTTATAAAATACAAATTTCATTATTTTAGAAAAGAGACAAAGTAGAGCACTAAAGAAATAATTGTAAATTTGCAATTGTCACCAAAAGTTGAGATGGCCGAGTTGGTCTAAGGCGCCAGATTAAGGTTCTGGTCCGAAAGGGCGTGGGTTCAAATCCCACTCTCAACATATTTAGCTTTTTCCCCCAAGATACAAATAACTCCATTTCCAATTTCTTACATATATTACTAGGGGTGTTCATCTTCAGACCGATCGAAACCAACCGAACCGACGTCGGTCGGTC
GGTGTCGGTTCCAATTTTTTGAAAACCGACAAGTCGGTCGGTCGGTTCAGTTATGTAGCAAGAACCGACCGACCGACCATAGTATATGTAAAATTAAAAAAAAAACGAAAATTTTCTTTAATCCTTCAGCTTCACCCCACCCTCAAAGCCCAAGGCCCAAGCCCACCTTCAGCCACCTCCTTCAGCCTTCAGGGGTCACACTCAGTCACTCACGCAGTCACGCACGCCGCACGCTCCACCTGCAAAAACAAAAATGAAAAAACCCTAAGTAACCACAGCCCCTCACCATCTCTGGTTGTTCTCTCATCTCTCGAACGACTCCTTCGCCTCGCCGTCTCTGGTTATCCCTCGCTCGCCATCTATCTCGACTCGCTCGCCGTCGCTCTCAACTCCTCGTCGTTTCTGTTCGAAATCACAGCATAGCCACAACCCCTCGTTCTTCTCCCTCGCCGTCTCTCTCGAATCCCTCTGTTCTCGAATCTCGCCCCTCGTTCGACTCCCTCACCGTTACTTTTAACTCGCCGTCGCGCCTCCGTTCTCCACCATCGTCTGTTCTCCATCGCCGTCTGTTCTCCACCGCCGTCTCTAACTCGCCGCCGTCGAGATTCCGTTATTACTAA

mRNA sequence

ATGGTTCTTAACTCTGTGTTACATTTACATGAGTTATTACTTAAACAACCTGAGCCAGTTCATTCCAACTGCTTGGACGAAAAGTGGAAGTGGTTTAAGAATTGTCTAGGTGCATTAGATGGAACCTACATTAAAGTCAACGTAGGTATTCTTGATAGACCTAGGTACCGAACAAGGAAGAATGAAATTGCTACCAATGTGTTAGGAGTTTGCTCCCAAGATATGCAATTCATCTATGTTTTACCTGGATGTGAAGGTTCGGCTTCTGACTCGAGAGTTTTGCGAGATGCTGTATCTAGGAGGAATGGATTAAAAATTCCAAAAGGTTGTTACTATCTATATGATGCTGGCTATACAAATGGTGAAGGATTTTTGGCACCTTACCGAGGACAACGATATCATTTAAATGATTGGAAGCAAGGATATCAACCAAGAACTCCAGTAAAAGTTCAATGTCGAATGACCACCGCTTGTTGCCTCATTCATAATCTTATAAGAAGAGAAATGCCTGTAGATCCTTTAGAACAAGAAGTTGGAGACACTCATTCAAATATGGATGACAACGAACCATCTAAAGCTGGTTCAAGTAGGAAACGCATGTGGAGTAAGGCCGAAGATGAGAAATTAGTGGAATGTTTACTGGAGCTATCTAATATTGGCACTTGGAAAGCTGACAATGGTACTTTTAAACCAGGATTTCTCATTCAAATAGAAAAATGGATAGTTGAAAAAATTCCTATGTGCGATCTTAAGGCTCAACCACACATAGAGTCTAGAGTTAAAATATTGAAGAAGCAATACAATGCAATAGCTGAAATGTTAGGCCCAAATTGTAGTGGCTTTGGATGGAATGATAGAGACAAGTGCATAGAAGCAGAAAAACATATATTTGATGAATGGGTGAAGGGCCAATGGACTTGGTGCAGAAGGCCAACTGATATGTTTGAAGCAGTGGAACGAGAAATGACCGACAATGAATTTTGGAGAGGAGACAGCTCTTATGTGGCAATAGTTGGAAGGGAAGAAGTGGATGAACGAACCTCAATGAGTGAAGCACCAATGAATACACAATCTACTGCACATACATCAAGTAGGTCTAGTAAGAAAAGAGCAAAGAGTGTGGACCCATTGGTAGCGGCAGTGAATGGACTTGAGAATGTTATGAGCAGTCATCTTTCAAATGCTAATGAAAATATTCAAGAGATTGCTTTGTTTTATCGACAAGTGGCTGAACGAGAATCTACAAGAGAGGAACGTCGAAACTCATTAGTTAGTGAAATTAGAAAGGTGGATGGATTGAGTGTACGACAAAGAGTTCGAGCTGGTAGGCTTATCACCAAAGATCAATCCCAGATTGATTACTTTTTTAATCTTCCAGCTGATGAAAGCATAGCCACAACCCCTCGTTCTTCTCCCTCGCCGTCTCTCTCGAATCCCTCTGTTCTCGAATCTCGCCCCTCGTTCGACTCCCTCACCGTTACTTTTAACTCGCCGTCGCGCCTCCGTTCTCCACCATCGTCTGTTCTCCATCGCCGTCTGTTCTCCACCGCCGTCTCTAACTCGCCGCCGTCGAGATTCCGTTATTACTAA

Coding sequence (CDS)

ATGGTTCTTAACTCTGTGTTACATTTACATGAGTTATTACTTAAACAACCTGAGCCAGTTCATTCCAACTGCTTGGACGAAAAGTGGAAGTGGTTTAAGAATTGTCTAGGTGCATTAGATGGAACCTACATTAAAGTCAACGTAGGTATTCTTGATAGACCTAGGTACCGAACAAGGAAGAATGAAATTGCTACCAATGTGTTAGGAGTTTGCTCCCAAGATATGCAATTCATCTATGTTTTACCTGGATGTGAAGGTTCGGCTTCTGACTCGAGAGTTTTGCGAGATGCTGTATCTAGGAGGAATGGATTAAAAATTCCAAAAGGTTGTTACTATCTATATGATGCTGGCTATACAAATGGTGAAGGATTTTTGGCACCTTACCGAGGACAACGATATCATTTAAATGATTGGAAGCAAGGATATCAACCAAGAACTCCAGTAAAAGTTCAATGTCGAATGACCACCGCTTGTTGCCTCATTCATAATCTTATAAGAAGAGAAATGCCTGTAGATCCTTTAGAACAAGAAGTTGGAGACACTCATTCAAATATGGATGACAACGAACCATCTAAAGCTGGTTCAAGTAGGAAACGCATGTGGAGTAAGGCCGAAGATGAGAAATTAGTGGAATGTTTACTGGAGCTATCTAATATTGGCACTTGGAAAGCTGACAATGGTACTTTTAAACCAGGATTTCTCATTCAAATAGAAAAATGGATAGTTGAAAAAATTCCTATGTGCGATCTTAAGGCTCAACCACACATAGAGTCTAGAGTTAAAATATTGAAGAAGCAATACAATGCAATAGCTGAAATGTTAGGCCCAAATTGTAGTGGCTTTGGATGGAATGATAGAGACAAGTGCATAGAAGCAGAAAAACATATATTTGATGAATGGGTGAAGGGCCAATGGACTTGGTGCAGAAGGCCAACTGATATGTTTGAAGCAGTGGAACGAGAAATGACCGACAATGAATTTTGGAGAGGAGACAGCTCTTATGTGGCAATAGTTGGAAGGGAAGAAGTGGATGAACGAACCTCAATGAGTGAAGCACCAATGAATACACAATCTACTGCACATACATCAAGTAGGTCTAGTAAGAAAAGAGCAAAGAGTGTGGACCCATTGGTAGCGGCAGTGAATGGACTTGAGAATGTTATGAGCAGTCATCTTTCAAATGCTAATGAAAATATTCAAGAGATTGCTTTGTTTTATCGACAAGTGGCTGAACGAGAATCTACAAGAGAGGAACGTCGAAACTCATTAGTTAGTGAAATTAGAAAGGTGGATGGATTGAGTGTACGACAAAGAGTTCGAGCTGGTAGGCTTATCACCAAAGATCAATCCCAGATTGATTACTTTTTTAATCTTCCAGCTGATGAAAGCATAGCCACAACCCCTCGTTCTTCTCCCTCGCCGTCTCTCTCGAATCCCTCTGTTCTCGAATCTCGCCCCTCGTTCGACTCCCTCACCGTTACTTTTAACTCGCCGTCGCGCCTCCGTTCTCCACCATCGTCTGTTCTCCATCGCCGTCTGTTCTCCACCGCCGTCTCTAACTCGCCGCCGTCGAGATTCCGTTATTACTAA

Protein sequence

MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRTPVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGDTHSNMDDNEPSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEWVKGQWTWCRRPTDMFEAVEREMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNLPADESIATTPRSSPSPSLSNPSVLESRPSFDSLTVTFNSPSRLRSPPSSVLHRRLFSTAVSNSPPSRFRYY
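
The protein sequence is the conceptual translation of the CDS. The sketch below shows how such a translation could be checked, assuming the standard nuclear genetic code was used for this annotation (the page does not say which table was applied). The function name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last six codons of the CDS shown in this record.
print(translate("ATGGTTCTTAACTCTGTGTTACAT"))
print(translate("AGATTCCGTTATTACTAA"))
```

The two fragments translate to `MVLNSVLH` and `RFRYY`, which agree with the start and end of the protein sequence listed here.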
Homology
BLAST of Lag0038384 vs. NCBI nr
Match: XP_042426186.1 (uncharacterized protein LOC122014061 [Zingiber officinale])

HSP 1 Score: 379.8 bits (974), Expect = 3.9e-101
Identity = 189/358 (52.79%), Positives = 232/358 (64.80%), Query Frame = 0

Query: 1   MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRK 60
           +VLNSVL LH +LLK+ EP+  NC +E+WKWFK C GALDGTYI VN  I D+PRYRTR 
Sbjct: 107 LVLNSVLRLHNILLKKLEPIPENCTNERWKWFKGCFGALDGTYINVNAPIDDKPRYRTRN 166

Query: 61  NEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTN 120
            EIATNVLGVC+ +MQF Y+LPG EGSA+D RVLRDA+SRRNGLKIP+GCYYL DAGYTN
Sbjct: 167 GEIATNVLGVCTPNMQFSYILPGWEGSAADGRVLRDAISRRNGLKIPQGCYYLCDAGYTN 226

Query: 121 GEGFLAPYRGQRYHLNDWKQGYQPRT---------------------------------- 180
           GEGFLAPYRGQRYHL +W+QGYQP T                                  
Sbjct: 227 GEGFLAPYRGQRYHLTEWRQGYQPATSKEYFNMKHSQARNCIERCFGILKARWAILRDKS 286

Query: 181 --PVKVQCRMTTACCLIHNLIRREMPVDPLEQEV--------------GDTHSNMDDNEP 240
              VK QCR+ +ACC++ N IR EM +DP+E E+              G     ++  E 
Sbjct: 287 FYSVKTQCRIISACCILRNFIRYEMAIDPIETELESNEIDDAVDLSMEGKMPKVVEQVES 346

Query: 241 SKA------GSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEK 300
           S A        + K +W+K ED  LV+CL+ELS    WK++NG F+ G+L+ +EK +  K
Sbjct: 347 SMAPKFKRKTQNTKHLWTKQEDAVLVDCLVELSKDSAWKSENG-FRTGYLLHLEKLMAAK 406

Query: 301 IPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEWVK 303
           +P   LKA PHIESR K+LK+Q+ AI EML  + SGFGWND +KCI   K +FDEWVK
Sbjct: 407 LPSSSLKATPHIESRYKLLKRQFQAITEMLN-HSSGFGWNDVEKCIITTKDVFDEWVK 462

BLAST of Lag0038384 vs. NCBI nr
Match: XP_026662506.2 (uncharacterized protein LOC113463064 [Phoenix dactylifera])

HSP 1 Score: 361.7 bits (927), Expect = 1.1e-95
Identity = 205/487 (42.09%), Positives = 278/487 (57.08%), Query Frame = 0

Query: 1   MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRK 60
           +VLN VL LH +LL++PE V  N  DE+WK FKNCLGALDGTYIKVNV  +++PRYR RK
Sbjct: 35  VVLNFVLCLHSMLLRKPEAVTGNYTDERWKLFKNCLGALDGTYIKVNVEEVNKPRYRCRK 94

Query: 61  NEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTN 120
            EIATNVLGVC++DMQFIY+LP  EGSA+D R+LRDA+ RRNGLK+P+  YYL +AGY N
Sbjct: 95  GEIATNVLGVCTRDMQFIYILPRWEGSATDFRLLRDAILRRNGLKVPQDYYYLCNAGYAN 154

Query: 121 GEGFLAPYRGQRYHLNDWKQGYQPRTPVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGD 180
            EGFLAPYRGQRYHLN+W+Q  QP    +             N+I R      +E+++  
Sbjct: 155 TEGFLAPYRGQRYHLNEWRQSQQPTNAHEF---FNMKHSRARNVIER------MEEQL-- 214

Query: 181 THSNMDDNEPSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKW 240
              N   N  +   +  KR+W+K ED KLVECL+++ N G WK DNG F+PGF   +E+ 
Sbjct: 215 LQENKKANSRADRTTFLKRIWTKTEDAKLVECLIKVVNAGGWKGDNGIFRPGFHQHLERM 274

Query: 241 IVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW 300
           + +K+P C L+  PH+E+ VK+LKKQYNAIAEMLGPNC  FGWNDRDKC+ A+K ++D W
Sbjct: 275 MEKKLPGCRLRGNPHVENHVKLLKKQYNAIAEMLGPNCLEFGWNDRDKCVVADKDVYDLW 334

Query: 301 VK---------------------------GQWTWCRRPTDMFEAVER--EMTDNEFWRGD 360
           +K                                   P D  E +E+  E   +    GD
Sbjct: 335 MKSHPHAAGLRNKPFPYYEDLSIVFGKDRANGEGAENPVDACEQIEKEEEALGDTLSLGD 394

Query: 361 SSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSS---KKRAKSVDPLVAAVNGLENVM 420
                      +D   S+ +      STA    +      K+ K  D ++  +      +
Sbjct: 395 DMDAKGGSSNAIDSPPSICQPTDVGTSTAIIGKKKGGLVGKKRKHNDTIIENLVTEMGRI 454

Query: 421 SSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKD 456
           SS      E+ ++IA F+    E++   +ERR +L  EI K++ LS    + AG  + K 
Sbjct: 455 SSACEGNREDFKKIANFF----EKKGQSDERRMALFEEIMKIEDLSKDDILIAGGSLAKI 506

BLAST of Lag0038384 vs. NCBI nr
Match: ADN34114.1 (retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 353.6 bits (906), Expect = 3.0e-93
Identity = 183/361 (50.69%), Positives = 226/361 (62.60%), Query Frame = 0

Query: 1   MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRK 60
           MVL +V+ LHE LLK+P+PV + C D++W+WF+NCLGALDGTYIKVNV   DR RYRTRK
Sbjct: 106 MVLLAVIRLHEELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVPASDRARYRTRK 165

Query: 61  NEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTN 120
            E+ATNVLGVC     F+YVL G EGSA+DSR+LRDA+SR N LK+PKG YYL D GY N
Sbjct: 166 GEVATNVLGVCDTKGDFVYVLAGWEGSAADSRILRDALSRPNRLKVPKGYYYLVDVGYPN 225

Query: 121 GEGFLAPYRGQRYHLNDWK---------------QGYQPRT------------------- 180
            EGFLAPYRGQRYHL +W+               + Y  R                    
Sbjct: 226 AEGFLAPYRGQRYHLQEWRGPENAPSTSKEFFNMKHYSARNVIERAFGVLKGRWAILRGK 285

Query: 181 ---PVKVQCRMTTACCLIHNLIRREMPVDPLE---QEVGDTH-----------------S 240
              PV+VQCR   ACCL+HNLI REM    +E    EV  TH                 S
Sbjct: 286 SYYPVEVQCRTILACCLLHNLINREMTNFDIEDNIDEVDSTHATTAADDIHYIETSNEWS 345

Query: 241 NMDDN--EPSKAGSSR--KRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEK 300
              DN  E     SSR  K  W+K E+  LVECL+EL N G W++DNGTF+PG+L Q+ +
Sbjct: 346 QWRDNLAEEIMTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNGTFRPGYLNQLAR 405

BLAST of Lag0038384 vs. NCBI nr
Match: XP_028060687.1 (uncharacterized protein LOC114264281 [Camellia sinensis])

HSP 1 Score: 350.5 bits (898), Expect = 2.5e-92
Identity = 215/570 (37.72%), Positives = 300/570 (52.63%), Query Frame = 0

Query: 2   VLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKN 61
           VL +VL  H LLLK+PEP+ +NC D++W  F+NCLGALDGTY+KV   ++D+PRYRTRK 
Sbjct: 105 VLKAVLRFHGLLLKKPEPITANCTDDRWSCFQNCLGALDGTYVKVLAPVIDKPRYRTRKG 164

Query: 62  EIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTNG 121
           EIATNVLGVCSQDMQFIYVLPG EGSASDSRVLRDAVSR NGLK+P G YYL DAGYTNG
Sbjct: 165 EIATNVLGVCSQDMQFIYVLPGWEGSASDSRVLRDAVSRPNGLKVPTGHYYLVDAGYTNG 224

Query: 122 EGFLAPYRGQRYHLNDWKQGYQPRT----------------------------------- 181
           EGFLAPYRGQ YHL+ W++G  P T                                   
Sbjct: 225 EGFLAPYRGQCYHLSTWREGGAPTTPQEFFNMRHSSARNVIERCFGLLKMRWAILRTYSY 284

Query: 182 -PVKVQCRMTTACCLIHNLIRREMPVD------------PLEQEVGD------------- 241
            P+K Q R+ TACCL+HNLI+REMP+D            PL  E+GD             
Sbjct: 285 FPIKTQFRIITACCLLHNLIQREMPMDLDDDENEEMNPPPLATELGDEMIDVVKASDQWS 344

Query: 242 ---------------THSNMDD--NEPSKAGSSRKRMWSKAEDEKLVECLLE-LSNIGTW 301
                              MDD     S+  +  +R W+  E+  L+  + + +++   W
Sbjct: 345 EWRTALATQMFNEWQASRGMDDTGTTSSRKKAKPRRFWNHREEVFLITTMKDVIASNPRW 404

Query: 302 KADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFG 361
           K DN  F+ GF  + EK I+   P  DL+A PHI+S++K  +KQYNA+ +ML  N SGFG
Sbjct: 405 KLDNNQFRAGFYNECEKKILSAFPGTDLRASPHIDSKIKFWRKQYNALQDML--NMSGFG 464

Query: 362 WNDRDKCIEAEKH----------------------IFDEWV------KGQWTWCRRPTDM 421
           WND  K +  +                         +++W+      +        P D 
Sbjct: 465 WNDEQKMVLVDSDDVWQNYVRRVPDAKGMRNRPFPFYEDWLILFGKDRATGELAEDPADA 524

Query: 422 FEAVEREMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSV 465
             A+E+E  +     G+ S V        D   SMS A  N  + A  S+++ KKRA+  
Sbjct: 525 VAAMEKEDANATTEEGEQSPVEQFSMNMGDTDYSMSTAG-NVPNRA-DSAKTGKKRARPT 584

BLAST of Lag0038384 vs. NCBI nr
Match: GFS42850.1 (hypothetical protein Acr_00g0082040 [Actinidia rufa])

HSP 1 Score: 349.0 bits (894), Expect = 7.3e-92
Identity = 220/566 (38.87%), Positives = 305/566 (53.89%), Query Frame = 0

Query: 2   VLNSVLHLHELLLKQPEPVHSNCLDE--KWKWFKNCLGALDGTYIKVNVGILDRPRYRTR 61
           VL+SVL L  +LLK PEP+ +NC DE  +W+WF+NCLGALDGTY+KV V  +D+PRYRTR
Sbjct: 33  VLHSVLRLQGVLLKMPEPIVANCTDERWRWRWFQNCLGALDGTYVKVLVPSVDKPRYRTR 92

Query: 62  KNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYT 121
           K +IATNVL VCSQDMQFIYVLPG EGSASDSRVLRDA++R+NGL++P G YYL DAGYT
Sbjct: 93  KGKIATNVLDVCSQDMQFIYVLPGWEGSASDSRVLRDAITRQNGLRVPTGQYYLVDAGYT 152

Query: 122 NGEGFLAPYRGQRYHLNDWKQGYQPRT--------------------------------- 181
           NGEGFLAPYRGQRYHL+ W+ G  P T                                 
Sbjct: 153 NGEGFLAPYRGQRYHLSTWRGGPIPNTPEEYFNMKHSSARNIIERAFGLLKIRWAILRSY 212

Query: 182 ---PVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGDTHSN--------MDDNEPS---- 241
              P+K Q R+  ACCL+HNLI+REMPVDP E  + +   +        +D  EPS    
Sbjct: 213 SYFPIKTQSRIIIACCLLHNLIKREMPVDPREHLLDENQLSPPPLVDEYIDVVEPSDQWS 272

Query: 242 -----------------------KAGSSRKRMWSKAEDEKLVECLLELSNIGT-WKADNG 301
                                   A SS +R W+K E+E L+ C+ +L +  T WK D G
Sbjct: 273 DWRATLASQMYNEWKTNRGMSKRPAKSSTRRFWTKREEEFLMGCMKDLFDQDTKWKLDCG 332

Query: 302 TFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRD 361
            FK GF  + EK I    P  DL+A PHIES++K+ ++QY+ + +ML    SGFGW+D +
Sbjct: 333 QFKGGFYGECEKKIRCAFPGTDLRANPHIESKIKMWRRQYHLLQDML--KTSGFGWDDVE 392

Query: 362 KCI------------EAEKHI----------FDEWV------KGQWTWCRRPTDMFEAVE 421
           K I              EK +          +++W+      +      + PTD   A+E
Sbjct: 393 KSILVDSDDVWDNYVRREKDVKGMRNKSFPYYEDWLVLFGKDRANGDLAKGPTDSVAAIE 452

Query: 422 REMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVA 465
            + T  E         ++V +    +  SMS A   T S   +S  +SKKR ++ + +  
Sbjct: 453 TKETTKE-----QEPESLVLQFSAADMESMS-ATGGTSSAPSSSHANSKKRGRAAEGISK 512

BLAST of Lag0038384 vs. ExPASy TrEMBL
Match: A0A803QNC5 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 1.5e-111
Identity = 227/490 (46.33%), Positives = 314/490 (64.08%), Query Frame = 0

Query: 1   MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRK 60
           MVLN++LHLH+LLLK+P  +  +C+DE+WKWFKNCLGALDGTYIKVNV   +RPRYRTRK
Sbjct: 64  MVLNALLHLHDLLLKKPVAIRDDCIDERWKWFKNCLGALDGTYIKVNVLASNRPRYRTRK 123

Query: 61  NEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTN 120
           NEIATNVLGV SQDMQFIYVLPG EGSA+DSRVLRDA+  RNG K+P+G YYL DAGY N
Sbjct: 124 NEIATNVLGVVSQDMQFIYVLPGWEGSAADSRVLRDAI-HRNGFKVPQGYYYLCDAGYPN 183

Query: 121 GEGFLAPYRGQRYHLNDWKQGYQPRTPVK-VQCRMTTACCLIHN---------LIRREMP 180
           GEGFL PYRGQRYHLNDW   + P +P +    R ++A  ++            I R   
Sbjct: 184 GEGFLTPYRGQRYHLNDWT--HPPNSPREFFNMRHSSARNVVERAFGLLKGRWAILRSRS 243

Query: 181 VDPLEQE----VGDTHSNMDDNEPSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADN 240
             P++ +    +GD    M+    S     RK  W+  +D KLVECL+++ N G WKADN
Sbjct: 244 YYPVKIQCRIILGDI---MEATSQSTPIGGRKHQWTSIQDSKLVECLVDMCNSGKWKADN 303

Query: 241 GTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDR 300
           GTFKPG+L Q+EK + ++IP   +KAQPHI+SR+KILK+QY AI++MLGP+ SGFGWN++
Sbjct: 304 GTFKPGYLQQLEKMMNDRIPNSGIKAQPHIDSRLKILKRQYTAISDMLGPSASGFGWNEQ 363

Query: 301 DKCIEAEKHIFDEWVKGQWT---WCRRPTDMFEAVE----REMTDNEFWRGDSSYVAIVG 360
            KC+ A+K +FDEWVK   T      +P   ++ +     ++    +   G S  +  + 
Sbjct: 364 LKCVVADKIVFDEWVKSHPTAKGLLHKPFPYYDELAIVYGKDRATGDGAMGFSETLDEIA 423

Query: 361 RE-------EVDERTSMSE----APMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVM 420
            E       + D    + E    A MN+   +  ++R +K+++ + DPLV  ++      
Sbjct: 424 EEINNGWNDDFDPFDPLDEMNANASMNSSIPSSQTTRKAKRKSNNGDPLVELLSKSVQEF 483

Query: 421 SSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKD 459
           S+  ++A+++I+++A       + E+    RR  L  EI+KVDGL+  QR++ G+L+  +
Sbjct: 484 STMQASASDSIKKLA----DCFQHEADGAARRMKLYEEIKKVDGLTNSQRLKIGKLLVSN 543

BLAST of Lag0038384 vs. ExPASy TrEMBL
Match: E5GCB5 (Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1)

HSP 1 Score: 353.6 bits (906), Expect = 1.4e-93
Identity = 183/361 (50.69%), Positives = 226/361 (62.60%), Query Frame = 0

Query: 1   MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRK 60
           MVL +V+ LHE LLK+P+PV + C D++W+WF+NCLGALDGTYIKVNV   DR RYRTRK
Sbjct: 106 MVLLAVIRLHEELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVPASDRARYRTRK 165

Query: 61  NEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTN 120
            E+ATNVLGVC     F+YVL G EGSA+DSR+LRDA+SR N LK+PKG YYL D GY N
Sbjct: 166 GEVATNVLGVCDTKGDFVYVLAGWEGSAADSRILRDALSRPNRLKVPKGYYYLVDVGYPN 225

Query: 121 GEGFLAPYRGQRYHLNDWK---------------QGYQPRT------------------- 180
            EGFLAPYRGQRYHL +W+               + Y  R                    
Sbjct: 226 AEGFLAPYRGQRYHLQEWRGPENAPSTSKEFFNMKHYSARNVIERAFGVLKGRWAILRGK 285

Query: 181 ---PVKVQCRMTTACCLIHNLIRREMPVDPLE---QEVGDTH-----------------S 240
              PV+VQCR   ACCL+HNLI REM    +E    EV  TH                 S
Sbjct: 286 SYYPVEVQCRTILACCLLHNLINREMTNFDIEDNIDEVDSTHATTAADDIHYIETSNEWS 345

Query: 241 NMDDN--EPSKAGSSR--KRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEK 300
              DN  E     SSR  K  W+K E+  LVECL+EL N G W++DNGTF+PG+L Q+ +
Sbjct: 346 QWRDNLAEEIMTSSSRLPKHTWTKEEEAGLVECLVELVNAGGWRSDNGTFRPGYLNQLAR 405

BLAST of Lag0038384 vs. ExPASy TrEMBL
Match: A0A7J0DWA5 (Uncharacterized protein OS=Actinidia rufa OX=165716 GN=Acr_00g0082040 PE=3 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 3.5e-92
Identity = 220/566 (38.87%), Positives = 305/566 (53.89%), Query Frame = 0

Query: 2   VLNSVLHLHELLLKQPEPVHSNCLDE--KWKWFKNCLGALDGTYIKVNVGILDRPRYRTR 61
           VL+SVL L  +LLK PEP+ +NC DE  +W+WF+NCLGALDGTY+KV V  +D+PRYRTR
Sbjct: 33  VLHSVLRLQGVLLKMPEPIVANCTDERWRWRWFQNCLGALDGTYVKVLVPSVDKPRYRTR 92

Query: 62  KNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYT 121
           K +IATNVL VCSQDMQFIYVLPG EGSASDSRVLRDA++R+NGL++P G YYL DAGYT
Sbjct: 93  KGKIATNVLDVCSQDMQFIYVLPGWEGSASDSRVLRDAITRQNGLRVPTGQYYLVDAGYT 152

Query: 122 NGEGFLAPYRGQRYHLNDWKQGYQPRT--------------------------------- 181
           NGEGFLAPYRGQRYHL+ W+ G  P T                                 
Sbjct: 153 NGEGFLAPYRGQRYHLSTWRGGPIPNTPEEYFNMKHSSARNIIERAFGLLKIRWAILRSY 212

Query: 182 ---PVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGDTHSN--------MDDNEPS---- 241
              P+K Q R+  ACCL+HNLI+REMPVDP E  + +   +        +D  EPS    
Sbjct: 213 SYFPIKTQSRIIIACCLLHNLIKREMPVDPREHLLDENQLSPPPLVDEYIDVVEPSDQWS 272

Query: 242 -----------------------KAGSSRKRMWSKAEDEKLVECLLELSNIGT-WKADNG 301
                                   A SS +R W+K E+E L+ C+ +L +  T WK D G
Sbjct: 273 DWRATLASQMYNEWKTNRGMSKRPAKSSTRRFWTKREEEFLMGCMKDLFDQDTKWKLDCG 332

Query: 302 TFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRD 361
            FK GF  + EK I    P  DL+A PHIES++K+ ++QY+ + +ML    SGFGW+D +
Sbjct: 333 QFKGGFYGECEKKIRCAFPGTDLRANPHIESKIKMWRRQYHLLQDML--KTSGFGWDDVE 392

Query: 362 KCI------------EAEKHI----------FDEWV------KGQWTWCRRPTDMFEAVE 421
           K I              EK +          +++W+      +      + PTD   A+E
Sbjct: 393 KSILVDSDDVWDNYVRREKDVKGMRNKSFPYYEDWLVLFGKDRANGDLAKGPTDSVAAIE 452

Query: 422 REMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVA 465
            + T  E         ++V +    +  SMS A   T S   +S  +SKKR ++ + +  
Sbjct: 453 TKETTKE-----QEPESLVLQFSAADMESMS-ATGGTSSAPSSSHANSKKRGRAAEGISK 512

BLAST of Lag0038384 vs. ExPASy TrEMBL
Match: A0A803PDI8 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 1.5e-90
Identity = 209/537 (38.92%), Positives = 289/537 (53.82%), Query Frame = 0

Query: 1   MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRK 60
           MVLN++LHLH +LLK+P  +  +C DE+WKWFKNCLGALDGTYIKVN   LDRPRYRT K
Sbjct: 1   MVLNALLHLHGVLLKKPVAIRDDCTDERWKWFKNCLGALDGTYIKVNDLALDRPRYRTGK 60

Query: 61  NEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTN 120
           N+IATNVLGV SQDM+FIYVLPG +GSA+D RVLRDA++ RN  K+P+G YYL DAGY N
Sbjct: 61  NKIATNVLGVVSQDMKFIYVLPGWKGSAADFRVLRDAIN-RNEFKVPQGYYYLCDAGYPN 120

Query: 121 GEGFLAPYRGQRYHLNDW-------KQGYQPR---------------------------T 180
           GE FL PYRG RYHLNDW       ++ +  R                            
Sbjct: 121 GERFLTPYRGHRYHLNDWTHPPNSSREFFNMRHSSARNVVERAFGLLKGRWAIIRGRSYY 180

Query: 181 PVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGDTHSNMDDN------------EPSKAG 240
           PVK+QCR+  ACC +HNLIR EM +DPLE    D  ++ D++            EPS A 
Sbjct: 181 PVKIQCRIILACCHLHNLIRGEMHMDPLEHTNHDNGNDSDEDGDYADNDCYTYIEPSNAW 240

Query: 241 SS--------------------------------RKRMWSKAEDEKLVECLLELSNIGTW 300
           ++                                +K  W+  ED KLVECL+++ NIG W
Sbjct: 241 TAWRDNLAREMFEQWQGNQRDIMEATSQSTPIGGKKHQWTSIEDLKLVECLIDMCNIGKW 300

Query: 301 KADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFG 360
           KA+N                         AQPHI SR+KILK+QY  I+ MLGP+ SGFG
Sbjct: 301 KANN-------------------------AQPHINSRLKILKRQYTTISHMLGPSASGFG 360

Query: 361 WNDRDKCIEAEKHIFDEWVKGQWTWCRRPTDMFEAV--------EREMTDNEFWRGDSSY 420
           W++  KC+ A+K +FD+WVK   T       +F           +   T +   R   + 
Sbjct: 361 WSEELKCVVADKIVFDDWVKSHPTTKGLLNKLFSYYNELAIVYGKNRATGDGTIRFSETL 420

Query: 421 VAIV------GREEVDERTSMSE----APMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGL 442
             I         ++ D   ++ E    A MN+   +  ++R +K+++ S DP V  ++  
Sbjct: 421 DEIAEEINNGWNDDYDPFNTLDEMNANASMNSNIPSSQTARKAKRKSNSGDPSVELLSKS 480

BLAST of Lag0038384 vs. ExPASy TrEMBL
Match: A0A5A7SWD8 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold515G00010 PE=3 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 2.5e-90
Identity = 174/339 (51.33%), Positives = 219/339 (64.60%), Query Frame = 0

Query: 1   MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRK 60
           MVL +V+ LH+ LLK+P+PV + C D++W+WF+NCLGALDGTYIKVNV   DR RYRTRK
Sbjct: 130 MVLLAVIRLHDELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVPASDRARYRTRK 189

Query: 61  NEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTN 120
            E+ATNVLGV      F+YVL G EGSA+DSR+LRDA+SR N LK+PKG YYL DAGY N
Sbjct: 190 GEVATNVLGVYDTKGDFVYVLTGWEGSAADSRILRDALSRPNRLKVPKGYYYLVDAGYPN 249

Query: 121 GEGFLAPYRGQRYHLNDWK-QGYQPRT--------------------------------- 180
            EGFLAPYRGQRYHL +W+     P T                                 
Sbjct: 250 AEGFLAPYRGQRYHLQEWRGPKNAPSTSKEFFNMKHSSARNVIERAFGVLKGRWAILRGK 309

Query: 181 ---PVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGDTHSNMDDNEPSKAGSSR--KRMW 240
              PV+VQC    ACCL+HNLI REM           T+ +++DN  S   SSR  K  W
Sbjct: 310 SYHPVEVQCHTILACCLLHNLINREM-----------TNFDIEDNIVSMTSSSRLPKHTW 369

Query: 241 SKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVK 300
           +K E+  LV    EL N G W++DNGTF+PG+L Q+ + +  KIP C++ A   I+SR+K
Sbjct: 370 TKEEEAGLV----ELVNAGGWRSDNGTFRPGYLNQLARMMAFKIPGCNIHAST-IDSRIK 429

BLAST of Lag0038384 vs. TAIR 10
Match: AT5G41980.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 111.3 bits (277), Expect = 2.4e-24
Identity = 68/199 (34.17%), Positives = 105/199 (52.76%), Query Frame = 0

Query: 2   VLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKN 61
           VLN+V+ + +    QP   +S+ L+    +FK+C+G +D  +I V VG+ ++  +R    
Sbjct: 112 VLNAVIAISKDFF-QPNS-NSDTLENDDPYFKDCVGVVDSFHIPVMVGVDEQGPFRNGNG 171

Query: 62  EIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTNG 121
            +  NVL   S D++F YVL G EGSASD +VL  A++RRN L++P+G YY+ D  Y N 
Sbjct: 172 LLTQNVLAASSFDLRFNYVLAGWEGSASDQQVLNAALTRRNKLQVPQGKYYIVDNKYPNL 231

Query: 122 EGFLAPYRG--------------QRYHL---------NDWKQGY-----QPRTPVKVQCR 173
            GF+APY G              +R+ L            K+ +      P  P++ Q +
Sbjct: 232 PGFIAPYHGVSTNSREEAKEMFNERHKLLHRAIHRTFGALKERFPILLSAPPYPLQTQVK 291

BLAST of Lag0038384 vs. TAIR 10
Match: AT5G28950.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 448 Blast hits to 446 proteins in 74 species: Archae - 0; Bacteria - 0; Metazoa - 31; Fungi - 21; Plants - 396; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 85.1 bits (209), Expect = 1.8e-16
Identity = 43/116 (37.07%), Positives = 68/116 (58.62%), Query Frame = 0

Query: 29  WKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSA 88
           + +FK+C+GA+D T+I   V     P +R RK +I+ N+L  C+ D++F+YVL G EGSA
Sbjct: 19  YPYFKDCVGAIDDTHIFAMVSQKKMPSFRNRKGDISQNMLAACNFDVEFMYVLSGWEGSA 78

Query: 89  SDSRVLRDAVSRR-NGLKIP---KGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQ 141
            DS+VL DA++R  N L +P   +    + +    N +  L     QR + N W++
Sbjct: 79  HDSKVLNDALTRNSNRLPVPEEDESAEEVVEEVNDNNDEVLTTQDQQREYANQWRE 134

BLAST of Lag0038384 vs. TAIR 10
Match: AT1G43722.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28730.1); Has 924 Blast hits to 912 proteins in 109 species: Archae - 0; Bacteria - 0; Metazoa - 222; Fungi - 31; Plants - 661; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 83.6 bits (205), Expect = 5.3e-16
Identity = 43/123 (34.96%), Positives = 65/123 (52.85%), Query Frame = 0

Query: 29  WKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSA 88
           W +F   +GA+DGT++ V V    +  Y  R +  + N++ +C   M F Y+  G  GS 
Sbjct: 170 WPYFSGFVGAMDGTHVCVKVKPDLQGMYWNRHDNASLNIMAICDLKMLFTYIWNGAPGSC 229

Query: 89  SDSRVLRDAVSRRNGLKIPKG-CYYLYDAGYTNGEGFLAPYRGQ-----RYHLNDWKQGY 146
            D+ VL+ A    +   +P    YYL D+GY N +G LAPYR       RYH++ +  G 
Sbjct: 230 YDTAVLQIAQQSDSEFPLPPSEKYYLVDSGYPNKQGLLAPYRSSRNRVVRYHMSQFYYGP 289

BLAST of Lag0038384 vs. TAIR 10
Match: AT5G35695.1 (CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G41980.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 66.2 bits (160), Expect = 8.8e-11
Identity = 55/187 (29.41%), Positives = 73/187 (39.04%), Query Frame = 0

Query: 77  FIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLN 136
           FIYVL G EGSA DSRVL DA+ +          +YL D G+ N   FLAP+RG RYHL 
Sbjct: 25  FIYVLSGWEGSAHDSRVLSDALRK----------FYLVDCGFANRLNFLAPFRGVRYHLQ 84

Query: 137 DWK-QGYQPRTP------------------------------------VKVQCRMTTACC 196
           ++  Q   P TP                                     K Q  +   C 
Sbjct: 85  EFAGQRRDPETPHELFNLRHVSLRNVIERIFGIFKSRFAIFKSAPPFSYKKQAGLVLTCA 144

Query: 197 LIHNLIRREMPVDPLE--QEVGD------------THSNMDDNEP---SKAGSSRKRMWS 210
            +HN +R+E   D  +   EVG+              + +D+ EP    K       MW 
Sbjct: 145 ALHNFLRKECRSDEADFPDEVGNEGDVVNNEGNAMNTNEIDNEEPLEAQKQDRENTNMWR 201

BLAST of Lag0038384 vs. TAIR 10
Match: AT5G28730.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G43722.1); Has 496 Blast hits to 496 proteins in 68 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi - 23; Plants - 470; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 63.2 bits (152), Expect = 7.4e-10
Identity = 32/73 (43.84%), Positives = 39/73 (53.42%), Query Frame = 0

Query: 66  NVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGC-YYLYDAGYTNGEGF 125
           NVL +C  DM F Y   G  GS  D+RVL  A+S      +P    YYL D+GY N  G+
Sbjct: 141 NVLAICDLDMLFTYCFVGMAGSTHDARVLSAAISDDPLFHVPPDSKYYLVDSGYANKRGY 200

Query: 126 LAPYRGQRYHLND 138
           LAPYR +     D
Sbjct: 201 LAPYRREHREAQD 213

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_042426186.1 | 3.9e-101 | 52.79 | uncharacterized protein LOC122014061 [Zingiber officinale]
XP_026662506.2 | 1.1e-95 | 42.09 | uncharacterized protein LOC113463064 [Phoenix dactylifera]
ADN34114.1 | 3.0e-93 | 50.69 | retrotransposon protein [Cucumis melo subsp. melo]
XP_028060687.1 | 2.5e-92 | 37.72 | uncharacterized protein LOC114264281 [Camellia sinensis]
GFS42850.1 | 7.3e-92 | 38.87 | hypothetical protein Acr_00g0082040 [Actinidia rufa]

Match Name | E-value | Identity (%) | Description
A0A803QNC5 | 1.5e-111 | 46.33 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
E5GCB5 | 1.4e-93 | 50.69 | Retrotransposon protein OS=Cucumis melo subsp. melo OX=412675 PE=3 SV=1
A0A7J0DWA5 | 3.5e-92 | 38.87 | Uncharacterized protein OS=Actinidia rufa OX=165716 GN=Acr_00g0082040 PE=3 SV=1
A0A803PDI8 | 1.5e-90 | 38.92 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A5A7SWD8 | 2.5e-90 | 51.33 | Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...

Match Name | E-value | Identity (%) | Description
AT5G41980.1 | 2.4e-24 | 34.17 | CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G28950.1 | 1.8e-16 | 37.07 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G43722.1 | 5.3e-16 | 34.96 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G35695.1 | 8.8e-11 | 29.41 | CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (Int...
AT5G28730.1 | 7.4e-10 | 43.84 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR027806 | Harbinger transposase-derived nuclease domain | PFAM | PF13359 | DDE_Tnp_4 | coord: 40..134; e-value: 2.7E-7; score: 30.4
IPR024752 | Myb/SANT-like domain | PFAM | PF12776 | Myb_DNA-bind_3 | coord: 200..298; e-value: 2.1E-7; score: 31.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 465..485
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 345..372
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 176..200
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 349..364
None | No IPR available | PANTHER | PTHR46250 | MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATED | coord: 190..462
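
The `coord` values in the InterPro annotations are positions on the protein sequence. The sketch below shows how a domain could be cut out of the protein, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The helper names are hypothetical.

```python
def domain_length(start, end):
    """Number of residues covered by an inclusive, 1-based coordinate pair."""
    return end - start + 1

def domain_slice(protein, start, end):
    """Return the residues from start to end (1-based, inclusive)."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the protein")
    return protein[start - 1:end]

# Domain spans reported above for the two Pfam hits.
print(domain_length(40, 134))    # DDE_Tnp_4
print(domain_length(200, 298))   # Myb_DNA-bind_3

# Demonstration on the first ten residues of the protein in this record.
print(domain_slice("MVLNSVLHLH", 2, 4))
```

Under that assumption the DDE_Tnp_4 hit covers 95 residues and the Myb_DNA-bind_3 hit covers 99 residues.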

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0038384.1 | Lag0038384.1 | mRNA