Lag0035147 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0035147
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Retrotran_gag_3 domain-containing protein
Location: chr3: 15724916 .. 15726207 (+)
RNA-Seq Expression: Lag0035147
Synteny: Lag0035147
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTCACGCGCATTCCCTCTTTGATATAGTTGATGGATCAAAGTCGTGTCCCAATGAGTTCTTGAAAGATAGTGATGGTAATCGTCTTCCTACACCTAATCTGGCTTACGATCAGTGGATAGCTCAAGATTGAGCACTGATCACTCTGATTAACGCAACACTTTCGAAGGTTGCCTTTTCCTTTGTTATAGGATGCAAGTCTTCCAAAGAGGTATGGACTGCCCGAGAAAAGCGTTTTTCCTCTCTTACACACTCTCATATTCATGAATTGAAATCGGCACTACACTCTGTAGCTAAGAGCCCGACTGAGTCAATTGATGAGTATTTGATTCGAATCAAAGAGATTGTTGATAAACTTGTCACTGTCTCTGTGAAAGTTGATGATGAAGACTTACTTCTGTATACTCTAAATGGTCTGTCATCTAAATTTAACTCTTTCAAGACCTCATTTCGTACAAGAAGTGGTTCTGTAACGCTTGACGAATTACACGCCTTACTGAAGTCTAAATTGAAATTTATTGAGCAACATAACAAATTATCAACTGCCTCCATCAATCCTACAGCAATGTTCGCTCGAGGTGTCAATCAGAATCAGTCCTCTCGCGGTCGTGGTCGAAATCAAAATAATCAAGCAGGTAGAGGCTTCTTTCCTGGAAATCAAAGTGAAAGAGGAACCAGACACCTCAAGGCTCCTTCATTAATCAAAGTGGTCGTGGCAATTTCTCCCAAGGTCAGTTTAATTCTGGCCGTGGCAATTCAGATGGGAGCCAAGGAGGTTATTCTAGTGCAAATTTTGGGAGAAATCCTGGTCGTAGTCTGCCAAATTTGCAATCATCCTGGTCATGGAGCTCTCGATTGCTAAATTGTCTCAATCTATCTTATCAAGGTCGCCATCCCCCTTCCAAGCTCGCTGCAATGGCTATTGCTAATGATCCCTCATCAACCACAGCTACTTGGCTTGCCGACAGCGGATGTAACACTCATGTTACCCCTGACACTTCTTGTTTGGCCTTGAATTCTAACTTCAATGGCAAAGAGGTTCTTACTGTTGCAAACGACCAAGGACTCCCAGCTGCTCAAGCTGGTATCAGTACTCTTCTTACACCTCAAAGTGATCTTCATATGTCCAGTTTATTATGTGTACCGGATCTTTCAGCCAACTTACTATCTGTCTCTCAGTGTTGTGTGGATAATAATTTTGTTTTCACTTTTGGTGCTAATTGGTTTACAATTCAGGACAAGGACACGGGCCAAATTTTAGCTACAAGTGGAAAAATCCTGGCCGTGTAG

mRNA sequence

ATGCTTCACGCGCATTCCCTCTTTGATATAGTTGATGGATCAAAGTCGTGTCCCAATGAGTTCTTGAAAGATAGTGATGGATGCAAGTCTTCCAAAGAGGTATGGACTGCCCGAGAAAAGCGTTTTTCCTCTCTTACACACTCTCATATTCATGAATTGAAATCGGCACTACACTCTGTAGCTAAGAGCCCGACTGAGTCAATTGATGAGTATTTGATTCGAATCAAAGAGATTGTTGATAAACTTGTCACTGTCTCTGTGAAAGTTGATGATGAAGACTTACTTCTGTATACTCTAAATGGTCTGTCATCTAAATTTAACTCTTTCAAGACCTCATTTCGTACAAGAAGTGGTTCTGTAACGCTTGACGAATTACACGCCTTACTGAAGTCTAAATTGAAATTTATTGAGCAACATAACAAATTATCAACTGCCTCCATCAATCCTACAGCAATGTTCGCTCGAGGTGTCAATCAGAATCAGTCCTCTCGCGGTCGTGGTCGAAATCAAAATAATCAAGCAGGTCAGTTTAATTCTGGCCGTGGCAATTCAGATGGGAGCCAAGGAGGTTATTCTAGTGCAAATTTTGGGAGAAATCCTGGTCGTAGTCTGCCAAATTTGCAATCATCCTGGTCATGGAGCTCTCGATTGCTAAATTGTCTCAATCTATCTTATCAAGGTCGCCATCCCCCTTCCAAGCTCGCTGCAATGGCTATTGCTAATGATCCCTCATCAACCACAGCTACTTGGCTTGCCGACAGCGGATGTAACACTCATGTTACCCCTGACACTTCTTGTTTGGCCTTGAATTCTAACTTCAATGGCAAAGAGGTTCTTACTGTTGCAAACGACCAAGGACTCCCAGCTGCTCAAGCTGGTATCAGTACTCTTCTTACACCTCAAAGTGATCTTCATATGTCCAGTTTATTATGTGTACCGGATCTTTCAGCCAACTTACTATCTGTCTCTCAGTGTTGTGTGGATAATAATTTTGTTTTCACTTTTGGTGCTAATTGGTTTACAATTCAGGACAAGGACACGGGCCAAATTTTAGCTACAAGTGGAAAAATCCTGGCCGTGTAG

Coding sequence (CDS)

ATGCTTCACGCGCATTCCCTCTTTGATATAGTTGATGGATCAAAGTCGTGTCCCAATGAGTTCTTGAAAGATAGTGATGGATGCAAGTCTTCCAAAGAGGTATGGACTGCCCGAGAAAAGCGTTTTTCCTCTCTTACACACTCTCATATTCATGAATTGAAATCGGCACTACACTCTGTAGCTAAGAGCCCGACTGAGTCAATTGATGAGTATTTGATTCGAATCAAAGAGATTGTTGATAAACTTGTCACTGTCTCTGTGAAAGTTGATGATGAAGACTTACTTCTGTATACTCTAAATGGTCTGTCATCTAAATTTAACTCTTTCAAGACCTCATTTCGTACAAGAAGTGGTTCTGTAACGCTTGACGAATTACACGCCTTACTGAAGTCTAAATTGAAATTTATTGAGCAACATAACAAATTATCAACTGCCTCCATCAATCCTACAGCAATGTTCGCTCGAGGTGTCAATCAGAATCAGTCCTCTCGCGGTCGTGGTCGAAATCAAAATAATCAAGCAGGTCAGTTTAATTCTGGCCGTGGCAATTCAGATGGGAGCCAAGGAGGTTATTCTAGTGCAAATTTTGGGAGAAATCCTGGTCGTAGTCTGCCAAATTTGCAATCATCCTGGTCATGGAGCTCTCGATTGCTAAATTGTCTCAATCTATCTTATCAAGGTCGCCATCCCCCTTCCAAGCTCGCTGCAATGGCTATTGCTAATGATCCCTCATCAACCACAGCTACTTGGCTTGCCGACAGCGGATGTAACACTCATGTTACCCCTGACACTTCTTGTTTGGCCTTGAATTCTAACTTCAATGGCAAAGAGGTTCTTACTGTTGCAAACGACCAAGGACTCCCAGCTGCTCAAGCTGGTATCAGTACTCTTCTTACACCTCAAAGTGATCTTCATATGTCCAGTTTATTATGTGTACCGGATCTTTCAGCCAACTTACTATCTGTCTCTCAGTGTTGTGTGGATAATAATTTTGTTTTCACTTTTGGTGCTAATTGGTTTACAATTCAGGACAAGGACACGGGCCAAATTTTAGCTACAAGTGGAAAAATCCTGGCCGTGTAG

Protein sequence

MLHAHSLFDIVDGSKSCPNEFLKDSDGCKSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFNSGRGNSDGSQGGYSSANFGRNPGRSLPNLQSSWSWSSRLLNCLNLSYQGRHPPSKLAAMAIANDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFGANWFTIQDKDTGQILATSGKILAV
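
The protein sequence above corresponds to a translation of the CDS under the standard genetic code. The following is a minimal sketch of that translation in plain Python, not the pipeline that produced this record; the `translate` helper and the `CODON_TABLE` construction are illustrative names, and only the first 39 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 39 bases of the CDS listed above.
cds_start = "ATGCTTCACGCGCATTCCCTCTTTGATATAGTTGATGGA"
print(translate(cds_start))  # MLHAHSLFDIVDG
```

Passing the full CDS string instead of `cds_start` should reproduce the complete protein sequence, since translation stops at the terminal TAG codon.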
Homology
BLAST of Lag0035147 vs. NCBI nr
Match: KAA8524269.1 (hypothetical protein F0562_010692 [Nyssa sinensis])

HSP 1 Score: 191.8 bits (486), Expect = 1.0e-44
Identity = 140/415 (33.73%), Positives = 217/415 (52.29%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 77  ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 136

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK+  D L
Sbjct: 137 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKQARDSL 196

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 197 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 256

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 257 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 316

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 317 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 376

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 377 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 436
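
The identity and positives percentages in an HSP header are the match counts divided by the alignment length (gaps included). A small sketch that reproduces the figures for this hit is shown here; the `percent` helper is a hypothetical name, not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP header above: 140 identical and 217 positive
# positions over an alignment of 415 columns.
print(percent(140, 415))  # 33.73
print(percent(217, 415))  # 52.29
```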

BLAST of Lag0035147 vs. NCBI nr
Match: KAA8519786.1 (hypothetical protein F0562_014124 [Nyssa sinensis])

HSP 1 Score: 191.8 bits (486), Expect = 1.0e-44
Identity = 140/415 (33.73%), Positives = 217/415 (52.29%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 77  ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 136

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK+  D L
Sbjct: 137 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKQARDSL 196

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 197 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 256

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 257 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 316

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 317 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 376

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 377 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 436

BLAST of Lag0035147 vs. NCBI nr
Match: KAA8535282.1 (hypothetical protein F0562_030285 [Nyssa sinensis])

HSP 1 Score: 191.0 bits (484), Expect = 1.7e-44
Identity = 140/415 (33.73%), Positives = 216/415 (52.05%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 77  ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 136

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK   D L
Sbjct: 137 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKRARDSL 196

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 197 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 256

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 257 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 316

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 317 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 376

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 377 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 436

BLAST of Lag0035147 vs. NCBI nr
Match: KAA8516701.1 (hypothetical protein F0562_016793 [Nyssa sinensis])

HSP 1 Score: 190.7 bits (483), Expect = 2.3e-44
Identity = 140/415 (33.73%), Positives = 217/415 (52.29%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 77  ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 136

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK+  D L
Sbjct: 137 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKQARDSL 196

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 197 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 256

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 257 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 316

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 317 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 376

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 377 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 436

BLAST of Lag0035147 vs. NCBI nr
Match: KAA8521875.1 (hypothetical protein F0562_012811 [Nyssa sinensis])

HSP 1 Score: 190.7 bits (483), Expect = 2.3e-44
Identity = 140/415 (33.73%), Positives = 217/415 (52.29%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 383 ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 442

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK+  D L
Sbjct: 443 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKQARDSL 502

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 503 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 562

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 563 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 622

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 623 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 682

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 683 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 742

BLAST of Lag0035147 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 3.3e-14
Identity = 82/335 (24.48%), Positives = 158/335 (47.16%), Query Frame = 0

Query: 30  SSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKV 89
           ++ ++W    K +++ ++ H+ +L++ L    K  T++ID+Y+  +    D+L  +   +
Sbjct: 105 TAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKG-TKTIDDYMQGLVTRFDQLALLGKPM 164

Query: 90  DDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELH-ALLKSKLKFIEQHNKLSTASIN 149
           D ++ +   L  L  ++         +    TL E+H  LL  + K +     +S+A++ 
Sbjct: 165 DHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKIL----AVSSATVI 224

Query: 150 PTAMFARGVNQNQSSRGRGRNQNNQAGQFNSGRGNSDGSQGGYSSANFGRNPGRSLPNLQ 209
           P  + A  V+   ++     N  N+  ++++   N++      SS NF  N  +S P L 
Sbjct: 225 P--ITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLG 284

Query: 210 S---------SWSWSSRLLNCLNLSYQGRHPPSKL------AAMAIANDPSSTTATWLAD 269
                     S    S+L + L+ S   + PPS        A +A+ +  SS    WL D
Sbjct: 285 KCQICGVQGHSAKRCSQLQHFLS-SVNSQQPPSPFTPWQPRANLALGSPYSSN--NWLLD 344

Query: 270 SGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVP 329
           SG   H+T D + L+L+  + G + + VA+   +P +  G ++L T    L++ ++L VP
Sbjct: 345 SGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVP 404

Query: 330 DLSANLLSVSQCCVDNNFVFTFGANWFTIQDKDTG 349
           ++  NL+SV + C  N     F    F ++D +TG
Sbjct: 405 NIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTG 429

BLAST of Lag0035147 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 4.6e-08
Identity = 73/331 (22.05%), Positives = 132/331 (39.88%), Query Frame = 0

Query: 30  SSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKV 89
           ++ ++W    K +++ ++ H+ +L+                ++ R     D+L  +   +
Sbjct: 105 TAAQIWETLRKIYANPSYGHVTQLR----------------FITRF----DQLALLGKPM 164

Query: 90  DDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINP 149
           D ++ +   L  L   +         +    +L E+H  L ++   +   N      I  
Sbjct: 165 DHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLALNSAEVVPITA 224

Query: 150 TAMFAR--GVNQNQSSRGRGRNQNNQAGQFNSGRGNSDGSQGGYSSANFGRNPGRSLPNL 209
             +  R    N+NQ++RG  RN NN   + NS + +S GS+         R P   L   
Sbjct: 225 NVVTHRNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRSD------NRQPKPYLGRC 284

Query: 210 QSSWSWSSRLLNCLNL--------SYQGRHP--PSKLAAMAIANDPSSTTATWLADSGCN 269
           Q           C  L          Q   P  P +  A    N P +    WL DSG  
Sbjct: 285 QICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRANLAVNSPYNAN-NWLLDSGAT 344

Query: 270 THVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSA 329
            H+T D + L+ +  + G + + +A+   +P    G ++L T    L ++ +L VP++  
Sbjct: 345 HHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHK 404

Query: 330 NLLSVSQCCVDNNFVFTFGANWFTIQDKDTG 349
           NL+SV + C  N     F    F ++D +TG
Sbjct: 405 NLISVYRLCNTNRVSVEFFPASFQVKDLNTG 408

BLAST of Lag0035147 vs. ExPASy TrEMBL
Match: A0A5J5A1U7 (Integrase catalytic domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_010692 PE=4 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 4.9e-45
Identity = 140/415 (33.73%), Positives = 217/415 (52.29%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 77  ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 136

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK+  D L
Sbjct: 137 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKQARDSL 196

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 197 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 256

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 257 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 316

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 317 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 376

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 377 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 436

BLAST of Lag0035147 vs. ExPASy TrEMBL
Match: A0A5J4ZPW7 (Retrotran_gag_3 domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_014124 PE=4 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 4.9e-45
Identity = 140/415 (33.73%), Positives = 217/415 (52.29%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 77  ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 136

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK+  D L
Sbjct: 137 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKQARDSL 196

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 197 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 256

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 257 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 316

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 317 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 376

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 377 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 436

BLAST of Lag0035147 vs. ExPASy TrEMBL
Match: A0A5J5B049 (Retrotran_gag_3 domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_030285 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 8.4e-45
Identity = 140/415 (33.73%), Positives = 216/415 (52.05%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 77  ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 136

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK   D L
Sbjct: 137 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKRARDSL 196

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 197 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 256

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 257 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 316

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 317 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 376

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 377 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 436

BLAST of Lag0035147 vs. ExPASy TrEMBL
Match: A0A2N9I6U7 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49609 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 8.4e-45
Identity = 127/369 (34.42%), Positives = 211/369 (57.18%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSD--GCKSSKEVWTAREKRFSSLTHSHIHELKSALH 60
           +L A+++   VDG++ CP +F+ +S+  G  ++  VW+  EKR++S + S+I  LK  LH
Sbjct: 138 ILKAYAILSFVDGTQLCPPQFVTNSEVVGETTAHGVWSILEKRYTSASRSNILNLKMDLH 197

Query: 61  SVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSG 120
           ++ K   +S++ +L ++K+  D+L  V V++D+E++L   L GL  ++++F T+ RTR+ 
Sbjct: 198 NIKKETNDSVNTFLQKVKDARDRLAAVGVQIDNEEILHIVLRGLPHEYHAFSTAIRTRND 257

Query: 121 SVTLDELHAL-------LKSKLKFIEQHNKLS-TASIN-PTAMFA----RGVNQNQSSRG 180
           + + +++H L       LKS +   + H+ ++  A+ N   A+F+    RG  +N  +RG
Sbjct: 258 ATSFEDIHVLVTVEEQSLKSSIDLAKDHSHMAMVANTNRNNALFSYQGNRGRGRNNFTRG 317

Query: 181 RGRNQNNQAGQFNSGRG--NSDGSQGGYSSANFGRNPGRSLPNLQSSWSWSSRLLNCLNL 240
           RGRN N      N GRG  N+ G   G SS +F RN         +S   +    + ++ 
Sbjct: 318 RGRNFN------NGGRGNYNNSGGNSGSSSGHFNRN---------NSGHAAIDCYHRMDY 377

Query: 241 SYQGRHPPSKLAAMAIA-NDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVA 300
           +YQG+ PPSKLAAMA   N  +S  + W++D+G   H TP+ S +  + ++ G ++ TV 
Sbjct: 378 AYQGKQPPSKLAAMATTYNAQNSYQSYWISDTGATDHFTPNLSTIPDHQDYAGGDLATVG 437

Query: 301 NDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFGANWFTI 352
           N   LP    G S L        +  +LCVP +S+NLLSV++ C DNN  F F A+ F I
Sbjct: 438 NGNALPITHIGNSQLKASSHLFQLRKILCVPSMSSNLLSVNKFCRDNNCCFQFDAHQFKI 491

BLAST of Lag0035147 vs. ExPASy TrEMBL
Match: A0A5J4ZT09 (Flavin-containing monooxygenase OS=Nyssa sinensis OX=561372 GN=F0562_012811 PE=3 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 1.1e-44
Identity = 140/415 (33.73%), Positives = 217/415 (52.29%), Query Frame = 0

Query: 1   MLHAHSLFDIVDGSKSCPNEFLKDSDGC-------------------------------- 60
           +L AHSL   +DG+  CPN+F++D  G                                 
Sbjct: 383 ILKAHSLIGYIDGTYPCPNKFVQDERGAATAQINPEYQIWNTQDQALMTLLNATLSQTAL 442

Query: 61  ------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKL 120
                  +S+E W A E+RFS+ T S+I +LKSALH+++K   +SID Y+ +IK+  D L
Sbjct: 443 SHVIGYSTSREAWLALERRFSASTRSNILQLKSALHNISKG-KDSIDSYIQKIKQARDSL 502

Query: 121 VTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKL 180
            +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K 
Sbjct: 503 ASVSVLIEDEDILIYVLNGLPQEYNAFKTSIRTKSENITLEEVYAMLKIEEQTIESVHKQ 562

Query: 181 STASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG 240
           + +   P AM A     N SS  RG + +N +G+       S RG    S G + S NFG
Sbjct: 563 NNSPPFPGAMMATNYRPNFSS-NRGYSPSNFSGRGRGRGRFSNRGGRMHSFGRFQSPNFG 622

Query: 241 --------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA 300
                   + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Sbjct: 623 QSNLPYPTKQPQQSNQRSNNSHPVVCQICNKNGHSALDCYHRMDFSYQGKPPSPQLTAMS 682

Query: 301 IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGIST 352
              +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S+
Sbjct: 683 ATYNTGSDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSS 742

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAA8524269.1 | 1.0e-44 | 33.73 | hypothetical protein F0562_010692 [Nyssa sinensis]
KAA8519786.1 | 1.0e-44 | 33.73 | hypothetical protein F0562_014124 [Nyssa sinensis]
KAA8535282.1 | 1.7e-44 | 33.73 | hypothetical protein F0562_030285 [Nyssa sinensis]
KAA8516701.1 | 2.3e-44 | 33.73 | hypothetical protein F0562_016793 [Nyssa sinensis]
KAA8521875.1 | 2.3e-44 | 33.73 | hypothetical protein F0562_012811 [Nyssa sinensis]

Match Name | E-value | Identity | Description
Q94HW2 | 3.3e-14 | 24.48 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 4.6e-08 | 22.05 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...

Match Name | E-value | Identity | Description
A0A5J5A1U7 | 4.9e-45 | 33.73 | Integrase catalytic domain-containing protein OS=Nyssa sinensis OX=561372 GN=F05...
A0A5J4ZPW7 | 4.9e-45 | 33.73 | Retrotran_gag_3 domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_0...
A0A5J5B049 | 8.4e-45 | 33.73 | Retrotran_gag_3 domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_0...
A0A2N9I6U7 | 8.4e-45 | 34.42 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49609 PE=4 SV=1
A0A5J4ZT09 | 1.1e-44 | 33.73 | Flavin-containing monooxygenase OS=Nyssa sinensis OX=561372 GN=F0562_012811 PE=3...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | PFAM | PF14223 | Retrotran_gag_2 | coord: 19..129; e-value: 2.1E-15; score: 56.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 158..199
None | No IPR available | PANTHER | PTHR47481 | FAMILY NOT NAMED | coord: 27..320

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0035147.1 | Lag0035147.1 | mRNA