Spg033450 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg033450
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Location: scaffold5: 4817049 .. 4823425 (+)
RNA-Seq Expression: Spg033450
Synteny: Spg033450
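The location field in the overview can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(text):
    """Split a location such as 'scaffold5: 4817049 .. 4823425 (+)'
    into (sequence id, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end, strand = parse_location("scaffold5: 4817049 .. 4823425 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 6377 bases on the forward strand of scaffold5.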
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGATAAAGGAATCTCAAAAAACCTGTTGTCGACGTCGAACGACAAAGGGAGCACCGAAGAGAACCCTGCCGCAACTCGTGCTAAGAAGTGCGCTACCTTGTTGCTTGCCCTAGGGCATTTGGAAAAGGATACCACCTTTGCTTTCTCCGCTAAAGAGACGATTTCATTCAGCACCACTCTCGTATCAGAGATGTCTTCTTCTTCAAAATTCATCGCTCTAATGACGTTCAAAGCATCAGATTCAACCAAAATATGGGCCTTCTCTCTTCGAGAGGTGTAAGCTCTCAGCCCCTCCTTAATCGCTAGCGTTTCCATGTATTTTATTGACCATTTTCGCTTAATTTGCTTGCAGCCCGCTTCGACAAGAGATCTTGAGGAATCACGGATGATCCAACCTACTCCACTGGTCTCTGATTCTTTGTTCATCGAGACATCAGAGTTCATTTTCCATAAGCTCGCCGGGGGGGTCCATTTTCCATGATTCCAGAGGCTCTCCATCTTATGATCCGTCAGGTTAGGAATGGTTTTTTCTCTTCTGCTCTTCCAAATTTCTCTTGATCCTTCTTGATATCTGCCCGCTCATTGTTGATTGATACTCTGTTTCTGTAAATCCATATAGACCACATGGTCATAGCAATCTGTTCTACCTCCTCGCTGCTCATAGCTTTAGAAATCTCATCCCAGCAATCCTCCATAATCACCTTGTCCCTGCAAAAATTGGAGATGTTCATTAGTTTAGAGAAAAGCTCTAACCATACTTGTTTTACCTTCTTGCAGTTCCATATTATGTGGTCCGTATCCTCCCTCGAACTCCTGCACAAATCACACACAACATCAATGTCCAAGCCTTTGTTTGCTAAGTTAACTTTGGAAGGGATGAGGTTCTTAAGGATTTTCCAAGACAAAAATAAAACTTTAAATAGGTTTCGATCTTTTATCTAATAGATCTCTCAGTTTTAAGGAGTTGTTTGTTTTTTTTGGAATCTAGATAACCGAAATAATCTTCAATTGTCAGAAATTATAGATGTTTGACATTTTAGATAATCGTGTTTGTTTTGAAGGAATAATTTATAAGAATGTATAGAAATATTTTTTTTAGAGAATATACAGAAATATTTAGATAACTATGTTTGTTTCGCAGGATTTTTATATAATAATATCTTTAGATTATGTATTAGACCAATAATGTCCTTATTAATATTTTTGCTAATTTACTTATTAGTTTAACACAATTTAAATTTATATAAGGTCTTGGTCTTTATTAATTTAGAGACGGTTTGATAACCCTTTTATTTTCTGTTTTTGTTTTTCATTTTAAAAAAAATAGAGTTATTTGATAACCATTTCTCTTTTATGTTATCTGTCTTTTTTTGAACTTTTTGAAATTATAAGCAAATTATTAAAAAAAATCATTTTTAAAAACATATTTCGGTTTTTCTAAAACTCATTTTTCTTTTAATGCACATAATCTTCCTCTTTTTAAACATCACTCTCCATCAAATTCCTTTTTGTTCCCTCACTAAAACTTTTTTCTCTCTAAATTTCTCTCCTCTAAAATTTTGATCTCAATATAATTTTTTATTTATCTTTCACTTTCTAAAAAATTCATCTCTACCTAGTTTTTTCTCTCTAAAATGTTTATCTTTATTTAACTTTCATTTATATTTCTCTTCTAACTTATCTCTCCAATTTTTCTCTCTAAACATCAAATAATTTTCCTTTTCCCCAAAATTTTCTCTCTCAAAACTTTTTTTCCTTTATAACTTATCTATTTTTAAAGATTTCTTCCTTAAGTTTTCTCTCTCTAATAAACATATCTCTAACTTTTATTTTGAAAAAAAAAATCTTCCTAAATTTTCTCTCGCTACAATTTTTCTTTATTTTCTATAATTTATCATTATAAACTCTCTAAAATTTCTCTTTTTACTTAATTTTTCTCTCTAAAATATATATCTTCCGCTCTAACTTTCTTTTTCCTTTTTCTCTAACTAATAT
TATTTATAGAGTCTACCCTAAACTAATCTTCTCTCTCTAAATATTTTCTTTTATGTCTCTAAATTTACTATGTTTAACTTTTCTCTAAAAAAACATTTATATTTATGTCTCTAAATTTTGTTTCAATGTCGGCACACATTTTTCTCTCTCTAAAACACCTTTTCATCTCTCTAATTTTTTAAAAACTAAAAACCAAAAACAATTATTAAACATATATAGTTTTTGTTTTTAAAAACCATAAAACTGAAAACTGAAAAATAAAAAACTGAAAAGTTGAAATGGTTATCAAACAGAACCTTAATGTTTTATGTTTATTCTATTTCTATTTGTTATTTTGTAGAACAAATTAATTCATAGTTTAAGGTTTTATAAAATATTTAATATTAAAAATTAATTTCAATTTTATTTTTTTTTTCGAAAATAGGACAATTAATATAATTTGTGGCGTTTTTAGTGGATTATATTGATCACATTATCAAATTAGAATTTTTAAAGGAATAATATTTGTTTGTAATGGATTTTTTGTGTGTATTTTGAAATGTACAATTAGCATTAATTTTAAGAGAAATTACAAATAAATAACAAAAATTTTCAGTAGATGCATTAGAGAAAAAATTCACTAGATACAAAAGTATTGAGAAACTATTGAGAAATAAAATAAAATGTATATTCCACTTTAAATTTCAAGTGAAATAAATTGAAATATTATTCAATATAATTGTTGTTTATTTTATATCTAAATGTAAAATTTAAAAACAATGGCTAACTACATATAATTGTTTATTTGTGTATGAGTAAATTAAGGTATTTGATTTTGTCATTCTTACAGAAACTAAATTGTGCAACACAAATAAGCGCATCATTAAATCTTTGTGGAGTTCCATTAGTGTTAATTGGATTGCTCTTGATGCCTATGGATCCTCTGGAGGTATTATTATTATGTGGGATGAATTATGTTGCAACATCACTGATCATGTCAAAGGTCTGTTCTCTGTTTCTGTTCTCGTTACTCTCTCAGATGGCTTCTCTTGGTGGTTGTCCGGTATTTATGGCCCTGCGAGTAGGAAAAAACGTAAGTCTTTTTGGAGAGAACTTTATGATCTTCACGGCTTATGTGGTGACTGTTGGCTTCTTGGTGGTGATTTTAATGTTTTCAGGCATTCTTCGGAAACATCCTCCAACAATCCTGCCAAATTAAGTATGTCGAAGTTTAACAAATTCATTTCGGACACTGACCTCCTCGACCCGCCGCTTATTAATGGTCCATATACTTGGACAAATCTTCGCAGTGAGCCTGTTATGTCCCGTCTTGACAGATTTCTTTTCTCTAATAGCTGGTGTATTAAATTTAACGATCATCATTCGAAGAAGCTTTCTCGTTGCACATCAGATCATTTTCCTATTCTTCTGGATGATTCTTCTTCTACTTGGGGCCCTTGTCCTTTTCGTTTCGACAATTATCTTCTGGACAATAAATCCTTCATTGGCAATGTTGAGCATTGGTGGACTGATACTTTCTGTGAGGGTTTTCCTGGATTTTCTTTTATTCGCAGACTAAAGTTCTTAGCAAGGAAAGTCAAAAACTGGAAGCTTTCCAACACAGATTCCTTCAAGGAAAAGAAAAAGGCCATTTCCAACGAGATTGATCGCATTGATGCTCTTGAATCTTCGGGTTCTTTGGACGATATGGCAAAGCAACTTAGAAAATCTCTTAAAGCTGATCTGCAAGAAACAGCTCTCCTTGAAGCCCGTTATTGGAATCAGCGTTGTAAAAAGCTTTGGCTCAGCGATAGAGATGAGAACTCTGCTTTTTTCCATAAAATTTGTACTGCTCGCCGCCGAAGAAACCAAATCCATGAGTTATTTACAAAAGAAGGTATTAGCATTGTTTCTGATTTTTTGCTGGAAAAGGAAGTAATCGATCATTTTGCGGATATTTATGACTACAATCAGAATTCTGAATGGATTATTGTCAATCTTAATTGGGAACCAATTAATTC
TGTTTCTGCTAGAGGTCTGATATCTCCTTTTAGGGATGATGAGGTTTTCGAATGTATTAAGTCCATTGGCCAAAATAAAGTTCCGTGCCCAGATGGTTTCACCATTGAGTTTTTTAAGAAATTCTGGAATACTTTCAAACCTTCTATTATGTCAGTCTTCCACGATTTCTACCACAGCAAGGTTATCAATCGAAATATGAATCACACCAATATTGCGCTCATCCCCAAAAGGAACATGACTGGGAAAATTTCAGATTTCCGTCCTATTAGCCTCACCACCTCTCTTTACAAAATTATGGCTAAGGTTTTGGCGGAGCGCTTAAAGTCCACTTTGGAAGACACTATAAGCTTGAATCAGTCGGCTTTTGTTCGAAAGAGACAAATCTCTGACGCCATTCTGCTAGCTAACGAAGCTGTTGACTTCTGGAGAGTTTCTAGAAAGAAAGGTGTCCTTATAAAGCTTGATGTGGAGAAAGCTTTTGACAAAATTAGTTGGAATTTCATCGATTGTGTTCTCCTTAAAAAAGGATACCCGACAATTTGGCGAGAATGGATTAGAGCTTGCATATCTTCAGTTTCTTATTCTATCATTCTCAACGGCAAGCCTCGAGGTAACATTCAAGCCAAAAGAGGAATTAGACAAGGCGATCCTCTATCTCCTTTTCTTTTTGTCCTTGCCATGGATTACCTTAGTAGATTAATCGAAGCTGTCGAAAAAAAAGGGCTCGTTTCTGGAGTTGTTATGGGGGATATTTCTGTCACACACCTTCTATTTGCTGATGATATTTTACTTTTTGTTCAAGATGATGACAAAGCCATTGAAAATATGTATCTTATCATTAAGTCTTTTGAACATGGTTCTGGTTTGCGTATAAATCTCAACAAATCCACTATCTCTGGTATCAACCTATCGAACCAAAGAACATCTGAGATCGCATCCTCTTGGGGCTGTAATCTTCACCCCCTACCCATTGATTATCTTGGCGCTCCTTTGGGTGGAATTCCTAAGAATAACCTGTTTTGGGAGCCCATGATAGAGAAGATTCAGCATAGAATTCACAATTGGCGGTTTGTATCTCTTTCTAAAGGAGGTCGTCTCACTCTTATTCACTCGGTTCTCAATAGTATGCCCCTCTACATCCTCTCGATTTTCAAAGCCCCAACGTCTATCTGCAACAAAATTGAAAAAATTTTCCGAAAATTTCTTTGGGAGGGAGCTTCTTCTTCAGGCTCCACTAATCTTGTGAGATGGGAAATTGTTTCTTCTTCTAAAGCTGAAGGTGGTCTTGGCATTCACAAAATTAAAGAGACTAACGACGCTCTTCTCCTTAAGTGGATCTGGCGTTTCTTCAATGAGGAAAACACTCTTTGGAGGAATTTCATCAACAATAAATATTCCAGTCTTCATTTTGAGTGCTTTCCTTCAAGCAGCAAAGTCTCCAGCTCCAGATCTCCTTGGCATGCTATCTCTAAGCTTCAAGACATTTTCTTTGCCAATTTCAGATGGGATATTCGCAATGGTCGCTCTACTCTGTTTTGGCATGATAATTGGTCTTCCTTTGGCCCCCTAAAATTTGTCTGCAACCGTTTATATCAGCTATCATCCAACAAAAATCTCTCTATAGCTGAAGTTTGGTCTTCTTCGGACAGAATGTGGAACTTTCAGCCCCGAAGACCTTTGTTCGATAGAGATTTACAAAGGTGGAGCGAATTTGCGGAATTATTGCCCACCCCTAATCCTCAAAGGGGTTCGGATATTCGGCGTTGGATGGTATCTGGAGATGGTCTATTTACTACTAAATCTGCGCGCGCTATTTTATCTGTTCTCCCTTCAAGACCGTTTCACAGTCCTGGAGAAAAAATTCTCAACAATCTTTGGACCGCTGACATTCCAAAGAAAATCAAGGTCTTCATTTGGTCTCTCTTTCATCGTAGCATAAATACGTCGGATAGACTTCAAGCGATTTTTCAGAATTCTCTTCACAACCCATCAGTCTGCA
TTCTTTGTTGGAAGAATTCTGAAGATATTGATCATCTTTTCATCCACTGTAGTCGCGCATCCTTTTTCAGGAACAAAATCAACCTTGCTTTGGGCCTCTCTATGGTTCCTCCGGCAACTATAGATTCTTTTTGCGCTGATTTGTTTACATCCAAAGCTATTTCGCAAAGGCAATTGCTCAGAAGAAACGTTTTCATAGCTACCCTTTGGTTATTATGGAACGAGCGTAATCGCCGTATTTTTGAAGATAAAGCTCGCACTCGAAATCAACTCTGGGAGGACATCGTCTCTCTTGCTGCTCTTTGGGCTACGAAATCCAAAGTTTTCTCTGATTATAGTGCTTCTCATATTGCTTTAAATTGGAAATCTTTTCTGTAG

mRNA sequence

ATGGAAGATAAAGGAATCTCAAAAAACCTGTTGTCGACGTCGAACGACAAAGGGAGCACCGAAGAGAACCCTGCCGCAACTCGTGCTAAGAAGTGCGCTACCTTGTTGCTTGCCCTAGGGCATTTGGAAAAGGATACCACCTTTGCTTTCTCCGCTAAAGAGACGATTTCATTCAGCACCACTCTCCCCGCTTCGACAAGAGATCTTGAGGAATCACGGATGATCCAACCTACTCCACTGGTCTCTGATTCTTTGTTCATCGAGACATCAGATTCCATATTATGTGAAACTAAATTGTGCAACACAAATAAGCGCATCATTAAATCTTTGTGGAGTTCCATTAGTGTTAATTGGATTGCTCTTGATGCCTATGGATCCTCTGGAGGTATTATTATTATGTGGGATGAATTATGTTGCAACATCACTGATCATGTCAAAGGTCTGTTCTCTGTTTCTGTTCTCGTTACTCTCTCAGATGGCTTCTCTTGGTGGTTGTCCGGTATTTATGGCCCTGCGAGTAGGAAAAAACGTAAGTCTTTTTGGAGAGAACTTTATGATCTTCACGGCTTATGTGGTGACTGTTGGCTTCTTGGTGGTGATTTTAATGTTTTCAGGCATTCTTCGGAAACATCCTCCAACAATCCTGCCAAATTAAGTATGTCGAAGTTTAACAAATTCATTTCGGACACTGACCTCCTCGACCCGCCGCTTATTAATGGTCCATATACTTGGACAAATCTTCGCAGTGAGCCTGTTATGTCCCGTCTTGACAGATTTCTTTTCTCTAATAGCTGGTGTATTAAATTTAACGATCATCATTCGAAGAAGCTTTCTCGTTGCACATCAGATCATTTTCCTATTCTTCTGGATGATTCTTCTTCTACTTGGGGCCCTTGTCCTTTTCGTTTCGACAATTATCTTCTGGACAATAAATCCTTCATTGGCAATGTTGAGCATTGGTGGACTGATACTTTCTGTGAGGGTTTTCCTGGATTTTCTTTTATTCGCAGACTAAAGTTCTTAGCAAGGAAAGTCAAAAACTGGAAGCTTTCCAACACAGATTCCTTCAAGGAAAAGAAAAAGGCCATTTCCAACGAGATTGATCGCATTGATGCTCTTGAATCTTCGGGTTCTTTGGACGATATGGCAAAGCAACTTAGAAAATCTCTTAAAGCTGATCTGCAAGAAACAGCTCTCCTTGAAGCCCGTTATTGGAATCAGCGTTGTAAAAAGCTTTGGCTCAGCGATAGAGATGAGAACTCTGCTTTTTTCCATAAAATTTGTACTGCTCGCCGCCGAAGAAACCAAATCCATGAGTTATTTACAAAAGAAGGTATTAGCATTGTTTCTGATTTTTTGCTGGAAAAGGAAGTAATCGATCATTTTGCGGATATTTATGACTACAATCAGAATTCTGAATGGATTATTGTCAATCTTAATTGGGAACCAATTAATTCTGTTTCTGCTAGAGGCTCCACTAATCTTGTGAGATGGGAAATTGTTTCTTCTTCTAAAGCTGAAGGTGGTCTTGGCATTCACAAAATTAAAGAGACTAACGACGCTCTTCTCCTTAAGTGGATCTGGCGTTTCTTCAATGAGGAAAACACTCTTTGGAGGAATTTCATCAACAATAAATATTCCAGTCTTCATTTTGAGTGCTTTCCTTCAAGCAGCAAAGTCTCCAGCTCCAGATCTCCTTGGCATGCTATCTCTAAGCTTCAAGACATTTTCTTTGCCAATTTCAGATGGGATATTCGCAATGGTCGCTCTACTCTGTTTTGGCATGATAATTGGTCTTCCTTTGGCCCCCTAAAATTTGTCTGCAACCGTTTATATCAGCTATCATCCAACAAAAATCTCTCTATAGCTGAAGTTTGGTCTTCTTCGGACAGAATGTGGAACTTTCAGCCCCGAAGACCTTTGTTCGATAGAGATTTACAAAGGTGGAGCGAATTTGCGGAATTATTGCCCACCCCTAATCCTCAAAGGGGTTCGGATAT
TCGGCGTTGGATGGTATCTGGAGATGGTCTATTTACTACTAAATCTGCGCGCGCTATTTTATCTGTTCTCCCTTCAAGACCGTTTCACAGTCCTGGAGAAAAAATTCTCAACAATCTTTGGACCGCTGACATTCCAAAGAAAATCAAGGTCTTCATTTGGTCTCTCTTTCATCGTAGCATAAATACGTCGGATAGACTTCAAGCGATTTTTCAGAATTCTCTTCACAACCCATCAGTCTGCATTCTTTGTTGGAAGAATTCTGAAGATATTGATCATCTTTTCATCCACTGTAGTCGCGCATCCTTTTTCAGGAACAAAATCAACCTTGCTTTGGGCCTCTCTATGGTTCCTCCGGCAACTATAGATTCTTTTTGCGCTGATTTGTTTACATCCAAAGCTATTTCGCAAAGGCAATTGCTCAGAAGAAACGTTTTCATAGCTACCCTTTGGTTATTATGGAACGAGCGTAATCGCCGTATTTTTGAAGATAAAGCTCGCACTCGAAATCAACTCTGGGAGGACATCGTCTCTCTTGCTGCTCTTTGGGCTACGAAATCCAAAGTTTTCTCTGATTATAGTGCTTCTCATATTGCTTTAAATTGGAAATCTTTTCTGTAG

Coding sequence (CDS)

ATGGAAGATAAAGGAATCTCAAAAAACCTGTTGTCGACGTCGAACGACAAAGGGAGCACCGAAGAGAACCCTGCCGCAACTCGTGCTAAGAAGTGCGCTACCTTGTTGCTTGCCCTAGGGCATTTGGAAAAGGATACCACCTTTGCTTTCTCCGCTAAAGAGACGATTTCATTCAGCACCACTCTCCCCGCTTCGACAAGAGATCTTGAGGAATCACGGATGATCCAACCTACTCCACTGGTCTCTGATTCTTTGTTCATCGAGACATCAGATTCCATATTATGTGAAACTAAATTGTGCAACACAAATAAGCGCATCATTAAATCTTTGTGGAGTTCCATTAGTGTTAATTGGATTGCTCTTGATGCCTATGGATCCTCTGGAGGTATTATTATTATGTGGGATGAATTATGTTGCAACATCACTGATCATGTCAAAGGTCTGTTCTCTGTTTCTGTTCTCGTTACTCTCTCAGATGGCTTCTCTTGGTGGTTGTCCGGTATTTATGGCCCTGCGAGTAGGAAAAAACGTAAGTCTTTTTGGAGAGAACTTTATGATCTTCACGGCTTATGTGGTGACTGTTGGCTTCTTGGTGGTGATTTTAATGTTTTCAGGCATTCTTCGGAAACATCCTCCAACAATCCTGCCAAATTAAGTATGTCGAAGTTTAACAAATTCATTTCGGACACTGACCTCCTCGACCCGCCGCTTATTAATGGTCCATATACTTGGACAAATCTTCGCAGTGAGCCTGTTATGTCCCGTCTTGACAGATTTCTTTTCTCTAATAGCTGGTGTATTAAATTTAACGATCATCATTCGAAGAAGCTTTCTCGTTGCACATCAGATCATTTTCCTATTCTTCTGGATGATTCTTCTTCTACTTGGGGCCCTTGTCCTTTTCGTTTCGACAATTATCTTCTGGACAATAAATCCTTCATTGGCAATGTTGAGCATTGGTGGACTGATACTTTCTGTGAGGGTTTTCCTGGATTTTCTTTTATTCGCAGACTAAAGTTCTTAGCAAGGAAAGTCAAAAACTGGAAGCTTTCCAACACAGATTCCTTCAAGGAAAAGAAAAAGGCCATTTCCAACGAGATTGATCGCATTGATGCTCTTGAATCTTCGGGTTCTTTGGACGATATGGCAAAGCAACTTAGAAAATCTCTTAAAGCTGATCTGCAAGAAACAGCTCTCCTTGAAGCCCGTTATTGGAATCAGCGTTGTAAAAAGCTTTGGCTCAGCGATAGAGATGAGAACTCTGCTTTTTTCCATAAAATTTGTACTGCTCGCCGCCGAAGAAACCAAATCCATGAGTTATTTACAAAAGAAGGTATTAGCATTGTTTCTGATTTTTTGCTGGAAAAGGAAGTAATCGATCATTTTGCGGATATTTATGACTACAATCAGAATTCTGAATGGATTATTGTCAATCTTAATTGGGAACCAATTAATTCTGTTTCTGCTAGAGGCTCCACTAATCTTGTGAGATGGGAAATTGTTTCTTCTTCTAAAGCTGAAGGTGGTCTTGGCATTCACAAAATTAAAGAGACTAACGACGCTCTTCTCCTTAAGTGGATCTGGCGTTTCTTCAATGAGGAAAACACTCTTTGGAGGAATTTCATCAACAATAAATATTCCAGTCTTCATTTTGAGTGCTTTCCTTCAAGCAGCAAAGTCTCCAGCTCCAGATCTCCTTGGCATGCTATCTCTAAGCTTCAAGACATTTTCTTTGCCAATTTCAGATGGGATATTCGCAATGGTCGCTCTACTCTGTTTTGGCATGATAATTGGTCTTCCTTTGGCCCCCTAAAATTTGTCTGCAACCGTTTATATCAGCTATCATCCAACAAAAATCTCTCTATAGCTGAAGTTTGGTCTTCTTCGGACAGAATGTGGAACTTTCAGCCCCGAAGACCTTTGTTCGATAGAGATTTACAAAGGTGGAGCGAATTTGCGGAATTATTGCCCACCCCTAATCCTCAAAGGGGTTCGGATAT
TCGGCGTTGGATGGTATCTGGAGATGGTCTATTTACTACTAAATCTGCGCGCGCTATTTTATCTGTTCTCCCTTCAAGACCGTTTCACAGTCCTGGAGAAAAAATTCTCAACAATCTTTGGACCGCTGACATTCCAAAGAAAATCAAGGTCTTCATTTGGTCTCTCTTTCATCGTAGCATAAATACGTCGGATAGACTTCAAGCGATTTTTCAGAATTCTCTTCACAACCCATCAGTCTGCATTCTTTGTTGGAAGAATTCTGAAGATATTGATCATCTTTTCATCCACTGTAGTCGCGCATCCTTTTTCAGGAACAAAATCAACCTTGCTTTGGGCCTCTCTATGGTTCCTCCGGCAACTATAGATTCTTTTTGCGCTGATTTGTTTACATCCAAAGCTATTTCGCAAAGGCAATTGCTCAGAAGAAACGTTTTCATAGCTACCCTTTGGTTATTATGGAACGAGCGTAATCGCCGTATTTTTGAAGATAAAGCTCGCACTCGAAATCAACTCTGGGAGGACATCGTCTCTCTTGCTGCTCTTTGGGCTACGAAATCCAAAGTTTTCTCTGATTATAGTGCTTCTCATATTGCTTTAAATTGGAAATCTTTTCTGTAG

Protein sequence

MEDKGISKNLLSTSNDKGSTEENPAATRAKKCATLLLALGHLEKDTTFAFSAKETISFSTTLPASTRDLEESRMIQPTPLVSDSLFIETSDSILCETKLCNTNKRIIKSLWSSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL
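The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates this with the standard genetic code; it is an illustration only, and the names `CODON_TABLE` and `translate` are hypothetical rather than anything used by the database. It assumes the CDS is in frame, starts at the first base, and uses the standard (not an organelle) code.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon.
    Codons containing characters other than A, C, G, T become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last seven codons of the CDS shown on this page.
print(translate("ATGGAAGATAAAGGAATCTCA"))
print(translate("AATTGGAAATCTTTTCTGTAG"))
```

The two fragments translate to the first residues (MEDKGIS) and last residues (NWKSFL) of the listed protein, with the final TAG read as the stop codon.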
Homology
BLAST of Spg033450 vs. NCBI nr
Match: KAA0039950.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] >TYK24553.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 486.1 bits (1250), Expect = 6.3e-133
Identity = 315/1148 (27.44%), Positives = 444/1148 (38.68%), Query Frame = 0

Query: 133  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCG 192
            MW++   +I    KG FSVS+ V  ++G  WWLS IYGPA RK R  FW EL  L  +C 
Sbjct: 1    MWNDQNFSILSVFKGAFSVSIQVGSNNGAFWWLSAIYGPAKRKNRPLFWEELEHLKSICL 60

Query: 193  DCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPV 252
              W+LGGDFNV R   ET++ NPA LSM +FN FIS+ +L+DPPL N  YTW+NLR++  
Sbjct: 61   PTWILGGDFNVIRWKEETTTKNPALLSMRRFNSFISNCNLIDPPLSNAKYTWSNLRAQAT 120

Query: 253  MSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKS 312
            +SRLDRFLF++ W   F  H SK L+R TSDHFPI+L+ S+ +WGP PFRF N  L +  
Sbjct: 121  LSRLDRFLFTSQWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPD 180

Query: 313  FIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDA 372
            +  N+E WW +T   G+ G+SF+RRLK LA  +K W        +  KKA   EID+ID 
Sbjct: 181  YKKNIEFWWGNTSQPGYAGYSFMRRLKQLALIIKTWGRDKKGKNEASKKACIKEIDQIDK 240

Query: 373  LESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR 432
            LE+ GS  ++ ++ R +LKADL +  L EA+ W Q+CK++W+ + DENS+FFHKICTAR+
Sbjct: 241  LEAEGSATEIHREKRTALKADLSQINLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ 300

Query: 433  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSA--- 492
            ++  I ++    G + ++D  +    I HF DIY  N+NS+  I NL+W PI+++++   
Sbjct: 301  KKCLISKIINNSGQNCLNDSDIADAFIQHFEDIYTDNRNSQLFIENLDWCPISNINSELL 360

Query: 493  ------------------------------------------------------------ 552
                                                                        
Sbjct: 361  DKPFNEAEIWLTLKSFAKNKAPGPDGYAMDFLQKSWSFMKQNICDIFKDFHSTHIINKVV 420

Query: 553  ------------------------------------------------------------ 612
                                                                        
Sbjct: 421  NETLITLIAKKEHCETAADFRPISLTTAIYKLIAKTLADRLKQTLPDTISESQMAFVKGR 480

Query: 613  ------------------------------------------------------------ 672
                                                                        
Sbjct: 481  QITEAILIANEALDFWRSKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA 540

Query: 673  ------------------------------------------------------------ 732
                                                                        
Sbjct: 541  SCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV 600

Query: 733  ------------------------------------------------------------ 792
                                                                        
Sbjct: 601  KFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPT 660

Query: 793  ------------------------------------------------------------ 852
                                                                        
Sbjct: 661  DRAKSIADSWGISKGHLPTSYLGMPLGGRPSSSNFWDNVLQKIQKKLSNWKYSQLSKGGR 720

Query: 853  --------------------------------------RGSTN-----LVRWEIVSSSKA 873
                                                   G++N     L+RW  + S K 
Sbjct: 721  ITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGASNGHNISLIRWNQIVSPKE 780
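The identity and positives percentages reported for the hit above are simple ratios: matching (or positively scoring) positions divided by the alignment length, which counts gap columns and can therefore exceed the length of the query. A minimal sketch of that arithmetic follows; the function name `hsp_fractions` is hypothetical, and the regular expression is only an assumption about how such a summary line might be parsed.

```python
import re


def hsp_fractions(line):
    """Recompute the percentages in an HSP summary line from its
    'label = hits/length' fields, rounded to two decimals."""
    out = {}
    for label, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        out[label] = round(100 * int(hits) / int(length), 2)
    return out


summary = "Identity = 315/1148 (27.44%), Positives = 444/1148 (38.68%), Query Frame = 0"
print(hsp_fractions(summary))
```

Recomputing the values this way reproduces the 27.44% identity and 38.68% positives printed in the report.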

BLAST of Spg033450 vs. NCBI nr
Match: TYK06777.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 481.1 bits (1237), Expect = 2.0e-131
Identity = 318/1120 (28.39%), Positives = 448/1120 (40.00%), Query Frame = 0

Query: 133  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHG 192
            MWD+L  N+TD ++G FS+S+ +   DG S   WWLS IYGP+  + RKSFW EL DL  
Sbjct: 1    MWDDLRFNVTDFIEGNFSLSININSPDGPSNSAWWLSAIYGPSGGRNRKSFWAELLDLKN 60

Query: 193  LCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRS 252
             C   WLL GDFNV R  SETS+ NP+K SM  FNKFI+D++L+DPPL N  +TW+NLR 
Sbjct: 61   KCSPTWLLAGDFNVVRFPSETSAQNPSKHSMRCFNKFIADSNLIDPPLSNAKFTWSNLRV 120

Query: 253  EPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLD 312
             PV+SR+DRFL++ +W   F  H+SK LSR TSDHFPI+L+ S  +WGP PF+  N  L 
Sbjct: 121  HPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIVLESSLISWGPSPFKLINVHLK 180

Query: 313  NKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDR 372
               F  N+ +WW +   EG PGFSF+R+LK L+  ++N +  N     E K A   EID 
Sbjct: 181  EPWFKNNITNWWKNLRQEGHPGFSFMRKLKQLSTIIRNEQRKNKCYSDEDKNAWIKEIDS 240

Query: 373  IDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT 432
            ID LE+ G+L +     R  LKAD+  +   EA+ W Q+ K+LW+++ DEN++FFHKIC+
Sbjct: 241  IDRLEAEGNLSEELSLRRTRLKADVLTSGFKEAQIWYQKSKRLWITEGDENTSFFHKICS 300

Query: 433  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINS-- 492
            AR+RR+ I  + + +G+   ++  + K  +DHF DIY    + S W+I NLNW PI++  
Sbjct: 301  ARQRRSIISNINSADGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLNWSPISTNQ 360

Query: 493  ------------------------------------------------------------ 552
                                                                        
Sbjct: 361  AQILCSMFTEEEIHEALTAFSSNKSPETSTQTVSSTINITNIALIAKKEKCAEPADYRPI 420

Query: 553  ------------------------------------------------------------ 612
                                                                        
Sbjct: 421  SLTTSIYKLIAKVIAERLKDTLPYTVAENQMAFVKDRQIIDAILVANEAIDYWRFKKIQG 480

Query: 613  ------------------------------------------------------------ 672
                                                                        
Sbjct: 481  FVIKLDIEKAFDKLNWRFIDFMLMKKGYPFKWRSWIRACISSVQYSIIINGRPRGKIQPS 540

Query: 673  ----------------------------------VSARGSTN------------------ 732
                                              V   G+ N                  
Sbjct: 541  RGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKLEGNINLTHLLFADDILLFVEDDE 600

Query: 733  ------------------------------------------------------------ 792
                                                                        
Sbjct: 601  HSIQNLKNIINLFQLASGLSINLNKSTISPINVDASRTEQIASQWGISTKFLPINYLGVP 660

Query: 793  ------------------------------------------------------------ 852
                                                                        
Sbjct: 661  LGGKQTTKAFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVST 720

Query: 853  -----------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFF 872
                                   LV W  ++SSK +GGLGI ++K+TN ALL KW+WR+ 
Sbjct: 721  CKNIEKTWRNFLWKNPPETHKLHLVNWAKITSSKEKGGLGISRLKDTNFALLTKWLWRYI 780

BLAST of Spg033450 vs. NCBI nr
Match: TYK00493.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 441.0 bits (1133), Expect = 2.3e-119
Identity = 307/1103 (27.83%), Positives = 451/1103 (40.89%), Query Frame = 0

Query: 112  SSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGP 171
            ++ S N +      S+GGI+I+WD    ++    +G FS+S   + S   SWWL+G+YGP
Sbjct: 692  ATTSTNALFSQLGSSAGGILILWDAQHHSLLSQEEGKFSLSANFS-SFNNSWWLTGLYGP 751

Query: 172  ASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTD 231
              R++R + W +L++LH L    W++GGD NV R   E+++   +  S +  N FIS+  
Sbjct: 752  VKRRERLNVWEDLHNLHHLNSSPWIIGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNL 811

Query: 232  LLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDD 291
            L+DPPL N  YTW+NLR+ P  SRLDRFL+++ W I FN H ++ L R TSDHFP++ +D
Sbjct: 812  LIDPPLTNNRYTWSNLRNPPTFSRLDRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCED 871

Query: 292  SSST--WGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWK 351
            S+ST  WGP PFR ++  L++  F  N+E WW  +   G PGF FI+RLK LA  +K W+
Sbjct: 872  STSTLRWGPAPFRLNSIALNDPEFKRNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQ 931

Query: 352  LSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRC 411
                 S    K+ I  E+D ID  E    L       R +LKA+L + +L E+++W QR 
Sbjct: 932  KEKFQSLTSAKENIIREVDSIDKNELDTPLSLEESNRRLALKAELNDLSLKESQFWFQRA 991

Query: 412  KKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEG----------ISIVSDF------- 471
            KKLWL + DENSAFFH+IC++R++RN IHE+  +EG          ++ V+ F       
Sbjct: 992  KKLWLKEGDENSAFFHRICSSRQKRNLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCS 1051

Query: 472  ------------------------------------------------------------ 531
                                                                        
Sbjct: 1052 TKKDPLFIENLEWNPIDYSDWSLLCAPFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYW 1111

Query: 532  -LLEKEVIDHFADIYD-----YNQNSEWIIV----------------------------- 591
             LL+++++D F D ++      N N+ +I +                             
Sbjct: 1112 HLLKEDILDIFKDFFEKGVINKNMNNTYIALIEKKKDYSHPKDFRPISLTTSIYKTIAKT 1171

Query: 592  -----------------------------------------------------------N 651
                                                                       N
Sbjct: 1172 LSNRLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDN 1231

Query: 652  LNWE-------------------------------------------------------- 711
            LNW                                                         
Sbjct: 1232 LNWNFIDLVLKKNNYPNSWRKWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSLFL 1291

Query: 712  ------------------------------------------------------------ 771
                                                                        
Sbjct: 1292 FVIAMDYLSRLLSHLESTGAIKGGILCHTLPLTYLGVPLGGNPKSNLFWRNIEDRIQKKL 1351

Query: 772  ------------------------PINSVSA--------------------RGS-----T 831
                                    PI  +S                     +GS     +
Sbjct: 1352 SNWKYAHISKGGRLTLIKSTLSSLPIYKLSVFQAPSSTYKNIEKLWRNFLWKGSCGLKGS 1411

Query: 832  NLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFEC 872
            +L+ W IV+  K EGGLGI +++ TN ALL KW+WR+++E N+LWR  I+ KY   H   
Sbjct: 1412 HLINWSIVTKPKEEGGLGISRLQVTNQALLSKWLWRYYSEPNSLWRRLIHIKYKGKHPGD 1471

BLAST of Spg033450 vs. NCBI nr
Match: TYJ99315.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 412.1 bits (1058), Expect = 1.2e-110
Identity = 281/1012 (27.77%), Positives = 400/1012 (39.53%), Query Frame = 0

Query: 82   SDSLFIETSDSIL---CETKLCNTNKRIIKSLWSSISVNWIALDAYGSSGGIIIMWDELC 141
            +DS    TS ++L     + L  TNKRIIKSLW S S+NWIA +A GSSGGI+I+WD   
Sbjct: 746  TDSSGATTSTNVLLNQMNSGLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQN 805

Query: 142  CNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLG 201
             ++    +GLFS+S    L++  SWWL+G+YGP  R++R  FW EL++L  L    W+LG
Sbjct: 806  HSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILG 865

Query: 202  GDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDR 261
            GD NV R   E++S   +  +    N FIS+  L+DPPL N  +TW+NLR+ P  SR+DR
Sbjct: 866  GDLNVIRMREESTSVLSSSHNSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDR 925

Query: 262  FLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSS--TWGPCPFRFDNYLLDNKSFIGN 321
            FL+++SW   F+ H ++ L R TSDHFP++ +DS+   +WGP PFR ++  L +  F  N
Sbjct: 926  FLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPEFKRN 985

Query: 322  VEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESS 381
            +  WW ++   G+PGFSFI+RLK LA  +K W+     S    K+AI  E+D ID  E  
Sbjct: 986  MGRWWENSIQAGYPGFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELD 1045

Query: 382  GSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQ 441
              L       R +LKADL E +L E+++W QR KKLWL + DENS+FFH+IC++R++R+ 
Sbjct: 1046 TPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSF 1105

Query: 442  IHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIV-NLNWEPINS--------- 501
            IHE+  +EG    ++  +    I  F+ IY  +  S+ + + NL+W PI S         
Sbjct: 1106 IHEIQDEEGSIQNTNNSISTAFIKFFSRIYRSSTKSDPLFIENLDWNPIASSEWSHLCAP 1165

Query: 502  ------------------------------------------------------------ 561
                                                                        
Sbjct: 1166 FLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWLKTTLPNTISGNQLAFVKNRQITDAIL 1225

Query: 562  ------------------------------------------------------------ 621
                                                                        
Sbjct: 1226 MANEAVDYWKVKKIKGFILKLDIEKAFDNLNLDFIDNVLEKKNFPNPWRKWIRGCISNVT 1285

Query: 622  ----------------------------------------------------VSARGSTN 681
                                                                VS  G+ N
Sbjct: 1286 YSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCN 1345

Query: 682  ------------------------------------------------------------ 741
                                                                        
Sbjct: 1346 ISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLKRAKECA 1405

Query: 742  ------------------------------------------------------------ 746
                                                                        
Sbjct: 1406 SFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKST 1465

BLAST of Spg033450 vs. NCBI nr
Match: TYK08190.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 411.4 bits (1056), Expect = 2.0e-110
Identity = 292/1152 (25.35%), Positives = 419/1152 (36.37%), Query Frame = 0

Query: 125  GSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWREL 184
            G  GGI+++WD+    + D   G +S+S+ +  ++G +WWL+ +YGP     R   W EL
Sbjct: 674  GDKGGILVLWDDTNFKVNDIKVGNYSISLNILNTNG-NWWLTSVYGPYKYNDRTKLWPEL 733

Query: 185  YDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTW 244
              L  LC   WL+ GDFN+ R   ET++ +  K +M+ FN FIS  +L+DPP +N  +TW
Sbjct: 734  EILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTW 793

Query: 245  TNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFD 304
            +NLR  P  SRLDRFL S  W   F  H S+ L R  SDHFPILL+     WGPCPFR +
Sbjct: 794  SNLRVNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLN 853

Query: 305  NYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAIS 364
            N  L +K F  N  +WW  +   GFPG++FI+ L  L++ +K W+ +  + +   KKA+ 
Sbjct: 854  NSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL 913

Query: 365  NEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFF 424
             EID ID LE  G +     Q R SLK+DL      +A+ W+QR ++ W    DEN+++F
Sbjct: 914  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYF 973

Query: 425  HKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPI 484
            H+ICT  +R+N I  +    G S+ S   + +  I HF +IY      E +I NL+W PI
Sbjct: 974  HRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWNPI 1033

Query: 485  N----------------------------------------------------------- 544
            +                                                           
Sbjct: 1034 SRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHK 1093

Query: 545  ------------------------------------------------------------ 604
                                                                        
Sbjct: 1094 AGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAEN 1153

Query: 605  ------------------------------------------------------------ 664
                                                                        
Sbjct: 1154 QMAFIKGRQINDAILIANEVIDTWKQRKIKGFVLKLDIEKAFDKISWSFIDYMLAKKHFP 1213

Query: 665  ------------------------------------------------------------ 724
                                                                        
Sbjct: 1214 HKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLE 1273

Query: 725  ------------------------------------------------------------ 784
                                                                        
Sbjct: 1274 SKGAIKGVSFNNYCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKST 1333

Query: 785  ----SVSA---------------------------------------------------- 844
                ++SA                                                    
Sbjct: 1334 ISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWDQTIECIHKKLNGWKY 1393

Query: 845  ---------------------------------------------------RGSTNLVRW 871
                                                               + + +L+ W
Sbjct: 1394 SQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINW 1453

BLAST of Spg033450 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 2.2e-19
Identity = 91/363 (25.07%), Positives = 151/363 (41.60%), Query Frame = 0

Query: 486 SVSARGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNK 545
           S + +   +LV+W  V S K EGGLG+   K  N AL+ K  WR   E+N+LW   +  K
Sbjct: 77  STAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWRLLQEKNSLWTLVLQKK 136

Query: 546 YSSLHFECFPSSSKVSSSRSPWHAIS-KLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPL 605
           Y               S  S W +I+  L+D+      W   +G+   FW D W S  PL
Sbjct: 137 YHVGEIRDSRWLIPKGSWSSTWRSIAIGLRDVVSHGVGWIPGDGQQIRFWTDRWVSGKPL 196

Query: 606 KFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRG 665
             + N   +  ++ +  +A+      R W+F    P +  +  R    A +L      R 
Sbjct: 197 LELDNG--ERPTDCDTVVAKDLWIPGRGWDFAKIDP-YTTNNTRLELRAVVLDLVTGAR- 256

Query: 666 SDIRRWMVSGDGLFTTKSARAILSV--LPSRPFHSPGEKILNNLWTADIPKKIKVFIWSL 725
            D   W  S DG F+ +SA  +L+V  +P RP  +      N LW   +P+++K F+W +
Sbjct: 257 -DRLSWKFSQDGQFSVRSAYEMLTVDEVP-RPNMA---SFFNCLWKVRVPERVKTFLWLV 316

Query: 726 FHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSM 785
            ++++ T +      +  L   +VC +C    E + H+   C           L + + +
Sbjct: 317 GNQAVMTEEERH---RRHLSASNVCQVCKGGVESMLHVLRDC--------PAQLGIWVRV 376

Query: 786 VPPATIDSFCA-DLF------TSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTR 839
           VP      F +  LF               +    +F   +W  W  R   IF +  + R
Sbjct: 377 VPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIPWSTIFAVIIWWGWKWRCGNIFGENTKCR 419

BLAST of Spg033450 vs. ExPASy TrEMBL
Match: A0A5A7T9I7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G00980 PE=4 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 3.0e-133
Identity = 315/1148 (27.44%), Positives = 444/1148 (38.68%), Query Frame = 0

Query: 133  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCG 192
            MW++   +I    KG FSVS+ V  ++G  WWLS IYGPA RK R  FW EL  L  +C 
Sbjct: 1    MWNDQNFSILSVFKGAFSVSIQVGSNNGAFWWLSAIYGPAKRKNRPLFWEELEHLKSICL 60

Query: 193  DCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPV 252
              W+LGGDFNV R   ET++ NPA LSM +FN FIS+ +L+DPPL N  YTW+NLR++  
Sbjct: 61   PTWILGGDFNVIRWKEETTTKNPALLSMRRFNSFISNCNLIDPPLSNAKYTWSNLRAQAT 120

Query: 253  MSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKS 312
            +SRLDRFLF++ W   F  H SK L+R TSDHFPI+L+ S+ +WGP PFRF N  L +  
Sbjct: 121  LSRLDRFLFTSQWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPD 180

Query: 313  FIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDA 372
            +  N+E WW +T   G+ G+SF+RRLK LA  +K W        +  KKA   EID+ID 
Sbjct: 181  YKKNIEFWWGNTSQPGYAGYSFMRRLKQLALIIKTWGRDKKGKNEASKKACIKEIDQIDK 240

Query: 373  LESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR 432
            LE+ GS  ++ ++ R +LKADL +  L EA+ W Q+CK++W+ + DENS+FFHKICTAR+
Sbjct: 241  LEAEGSATEIHREKRTALKADLSQINLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ 300

Query: 433  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSA--- 492
            ++  I ++    G + ++D  +    I HF DIY  N+NS+  I NL+W PI+++++   
Sbjct: 301  KKCLISKIINNSGQNCLNDSDIADAFIQHFEDIYTDNRNSQLFIENLDWCPISNINSELL 360

Query: 493  ------------------------------------------------------------ 552
                                                                        
Sbjct: 361  DKPFNEAEIWLTLKSFAKNKAPGPDGYAMDFLQKSWSFMKQNICDIFKDFHSTHIINKVV 420

Query: 553  ------------------------------------------------------------ 612
                                                                        
Sbjct: 421  NETLITLIAKKEHCETAADFRPISLTTAIYKLIAKTLADRLKQTLPDTISESQMAFVKGR 480

Query: 613  ------------------------------------------------------------ 672
                                                                        
Sbjct: 481  QITEAILIANEALDFWRSKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA 540

Query: 673  ------------------------------------------------------------ 732
                                                                        
Sbjct: 541  SCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV 600

Query: 733  ------------------------------------------------------------ 792
                                                                        
Sbjct: 601  KFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPT 660

Query: 793  ------------------------------------------------------------ 852
                                                                        
Sbjct: 661  DRAKSIADSWGISKGHLPTSYLGMPLGGRPSSSNFWDNVLQKIQKKLSNWKYSQLSKGGR 720

Query: 853  --------------------------------------RGSTN-----LVRWEIVSSSKA 873
                                                   G++N     L+RW  + S K 
Sbjct: 721  ITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGASNGHNISLIRWNQIVSPKE 780

BLAST of Spg033450 vs. ExPASy TrEMBL
Match: A0A5D3C4J1 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G00790 PE=4 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 9.8e-132
Identity = 318/1120 (28.39%), Positives = 448/1120 (40.00%), Query Frame = 0

Query: 133  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHG 192
            MWD+L  N+TD ++G FS+S+ +   DG S   WWLS IYGP+  + RKSFW EL DL  
Sbjct: 1    MWDDLRFNVTDFIEGNFSLSININSPDGPSNSAWWLSAIYGPSGGRNRKSFWAELLDLKN 60

Query: 193  LCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRS 252
             C   WLL GDFNV R  SETS+ NP+K SM  FNKFI+D++L+DPPL N  +TW+NLR 
Sbjct: 61   KCSPTWLLAGDFNVVRFPSETSAQNPSKHSMRCFNKFIADSNLIDPPLSNAKFTWSNLRV 120

Query: 253  EPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLD 312
             PV+SR+DRFL++ +W   F  H+SK LSR TSDHFPI+L+ S  +WGP PF+  N  L 
Sbjct: 121  HPVLSRIDRFLYTTNWENLFTAHYSKTLSRVTSDHFPIVLESSLISWGPSPFKLINVHLK 180

Query: 313  NKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDR 372
               F  N+ +WW +   EG PGFSF+R+LK L+  ++N +  N     E K A   EID 
Sbjct: 181  EPWFKNNITNWWKNLRQEGHPGFSFMRKLKQLSTIIRNEQRKNKCYSDEDKNAWIKEIDS 240

Query: 373  IDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT 432
            ID LE+ G+L +     R  LKAD+  +   EA+ W Q+ K+LW+++ DEN++FFHKIC+
Sbjct: 241  IDRLEAEGNLSEELSLRRTRLKADVLTSGFKEAQIWYQKSKRLWITEGDENTSFFHKICS 300

Query: 433  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINS-- 492
            AR+RR+ I  + + +G+   ++  + K  +DHF DIY    + S W+I NLNW PI++  
Sbjct: 301  ARQRRSIISNINSADGVPCSTNESIAKAFLDHFEDIYKGGGEESPWLIDNLNWSPISTNQ 360

Query: 493  ------------------------------------------------------------ 552
                                                                        
Sbjct: 361  AQILCSMFTEEEIHEALTAFSSNKSPETSTQTVSSTINITNIALIAKKEKCAEPADYRPI 420

Query: 553  ------------------------------------------------------------ 612
                                                                        
Sbjct: 421  SLTTSIYKLIAKVIAERLKDTLPYTVAENQMAFVKDRQIIDAILVANEAIDYWRFKKIQG 480

Query: 613  ------------------------------------------------------------ 672
                                                                        
Sbjct: 481  FVIKLDIEKAFDKLNWRFIDFMLMKKGYPFKWRSWIRACISSVQYSIIINGRPRGKIQPS 540

Query: 673  ----------------------------------VSARGSTN------------------ 732
                                              V   G+ N                  
Sbjct: 541  RGIRQGDPISPFIFVLAMDYISRLLNSVGEKIKGVKLEGNINLTHLLFADDILLFVEDDE 600

Query: 733  ------------------------------------------------------------ 792
                                                                        
Sbjct: 601  HSIQNLKNIINLFQLASGLSINLNKSTISPINVDASRTEQIASQWGISTKFLPINYLGVP 660

Query: 793  ------------------------------------------------------------ 852
                                                                        
Sbjct: 661  LGGKQTTKAFWKNVEEKINKKLASWKYSMLSKGGKITLIKSSLASLPTYQLSIFKAPVST 720

Query: 853  -----------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFF 872
                                   LV W  ++SSK +GGLGI ++K+TN ALL KW+WR+ 
Sbjct: 721  CKNIEKTWRNFLWKNPPETHKLHLVNWAKITSSKEKGGLGISRLKDTNFALLTKWLWRYI 780

BLAST of Spg033450 vs. ExPASy TrEMBL
Match: A0A5D3BL61 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G001020 PE=4 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 1.1e-119
Identity = 307/1103 (27.83%), Positives = 451/1103 (40.89%), Query Frame = 0

Query: 112  SSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGP 171
            ++ S N +      S+GGI+I+WD    ++    +G FS+S   + S   SWWL+G+YGP
Sbjct: 692  ATTSTNALFSQLGSSAGGILILWDAQHHSLLSQEEGKFSLSANFS-SFNNSWWLTGLYGP 751

Query: 172  ASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTD 231
              R++R + W +L++LH L    W++GGD NV R   E+++   +  S +  N FIS+  
Sbjct: 752  VKRRERLNVWEDLHNLHHLNSSPWIIGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNL 811

Query: 232  LLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDD 291
            L+DPPL N  YTW+NLR+ P  SRLDRFL+++ W I FN H ++ L R TSDHFP++ +D
Sbjct: 812  LIDPPLTNNRYTWSNLRNPPTFSRLDRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCED 871

Query: 292  SSST--WGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWK 351
            S+ST  WGP PFR ++  L++  F  N+E WW  +   G PGF FI+RLK LA  +K W+
Sbjct: 872  STSTLRWGPAPFRLNSIALNDPEFKRNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQ 931

Query: 352  LSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRC 411
                 S    K+ I  E+D ID  E    L       R +LKA+L + +L E+++W QR 
Sbjct: 932  KEKFQSLTSAKENIIREVDSIDKNELDTPLSLEESNRRLALKAELNDLSLKESQFWFQRA 991

Query: 412  KKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEG----------ISIVSDF------- 471
            KKLWL + DENSAFFH+IC++R++RN IHE+  +EG          ++ V+ F       
Sbjct: 992  KKLWLKEGDENSAFFHRICSSRQKRNLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCS 1051

Query: 472  ------------------------------------------------------------ 531
                                                                        
Sbjct: 1052 TKKDPLFIENLEWNPIDYSDWSLLCAPFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYW 1111

Query: 532  -LLEKEVIDHFADIYD-----YNQNSEWIIV----------------------------- 591
             LL+++++D F D ++      N N+ +I +                             
Sbjct: 1112 HLLKEDILDIFKDFFEKGVINKNMNNTYIALIEKKKDYSHPKDFRPISLTTSIYKTIAKT 1171

Query: 592  -----------------------------------------------------------N 651
                                                                       N
Sbjct: 1172 LSNRLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKVKKIKGFILKLDIEKAFDN 1231

Query: 652  LNWE-------------------------------------------------------- 711
            LNW                                                         
Sbjct: 1232 LNWNFIDLVLKKNNYPNSWRKWIRGCISNVTYSIIVNGKPQGRIKANRGLRQGDPLSLFL 1291

Query: 712  ------------------------------------------------------------ 771
                                                                        
Sbjct: 1292 FVIAMDYLSRLLSHLESTGAIKGGILCHTLPLTYLGVPLGGNPKSNLFWRNIEDRIQKKL 1351

Query: 772  ------------------------PINSVSA--------------------RGS-----T 831
                                    PI  +S                     +GS     +
Sbjct: 1352 SNWKYAHISKGGRLTLIKSTLSSLPIYKLSVFQAPSSTYKNIEKLWRNFLWKGSCGLKGS 1411

Query: 832  NLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFEC 872
            +L+ W IV+  K EGGLGI +++ TN ALL KW+WR+++E N+LWR  I+ KY   H   
Sbjct: 1412 HLINWSIVTKPKEEGGLGISRLQVTNQALLSKWLWRYYSEPNSLWRRLIHIKYKGKHPGD 1471

BLAST of Spg033450 vs. ExPASy TrEMBL
Match: A0A5D3BLV7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005290 PE=4 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 5.6e-111
Identity = 281/1012 (27.77%), Positives = 400/1012 (39.53%), Query Frame = 0

Query: 82   SDSLFIETSDSIL---CETKLCNTNKRIIKSLWSSISVNWIALDAYGSSGGIIIMWDELC 141
            +DS    TS ++L     + L  TNKRIIKSLW S S+NWIA +A GSSGGI+I+WD   
Sbjct: 746  TDSSGATTSTNVLLNQMNSGLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQN 805

Query: 142  CNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLG 201
             ++    +GLFS+S    L++  SWWL+G+YGP  R++R  FW EL++L  L    W+LG
Sbjct: 806  HSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILG 865

Query: 202  GDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDR 261
            GD NV R   E++S   +  +    N FIS+  L+DPPL N  +TW+NLR+ P  SR+DR
Sbjct: 866  GDLNVIRMREESTSVLSSSHNSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDR 925

Query: 262  FLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSS--TWGPCPFRFDNYLLDNKSFIGN 321
            FL+++SW   F+ H ++ L R TSDHFP++ +DS+   +WGP PFR ++  L +  F  N
Sbjct: 926  FLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPEFKRN 985

Query: 322  VEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESS 381
            +  WW ++   G+PGFSFI+RLK LA  +K W+     S    K+AI  E+D ID  E  
Sbjct: 986  MGRWWENSIQAGYPGFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELD 1045

Query: 382  GSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQ 441
              L       R +LKADL E +L E+++W QR KKLWL + DENS+FFH+IC++R++R+ 
Sbjct: 1046 TPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSF 1105

Query: 442  IHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIV-NLNWEPINS--------- 501
            IHE+  +EG    ++  +    I  F+ IY  +  S+ + + NL+W PI S         
Sbjct: 1106 IHEIQDEEGSIQNTNNSISTAFIKFFSRIYRSSTKSDPLFIENLDWNPIASSEWSHLCAP 1165

Query: 502  ------------------------------------------------------------ 561
                                                                        
Sbjct: 1166 FLEGEIKGVINSFDGKKTPGPDGFPISFFKSHWLKTTLPNTISGNQLAFVKNRQITDAIL 1225

Query: 562  ------------------------------------------------------------ 621
                                                                        
Sbjct: 1226 MANEAVDYWKVKKIKGFILKLDIEKAFDNLNLDFIDNVLEKKNFPNPWRKWIRGCISNVT 1285

Query: 622  ----------------------------------------------------VSARGSTN 681
                                                                VS  G+ N
Sbjct: 1286 YSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCN 1345

Query: 682  ------------------------------------------------------------ 741
                                                                        
Sbjct: 1346 ISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLKRAKECA 1405

Query: 742  ------------------------------------------------------------ 746
                                                                        
Sbjct: 1406 SFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKST 1465

BLAST of Spg033450 vs. ExPASy TrEMBL
Match: A0A5A7US62 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold280G003960 PE=4 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 9.5e-111
Identity = 292/1152 (25.35%), Positives = 419/1152 (36.37%), Query Frame = 0

Query: 125  GSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWREL 184
            G  GGI+++WD+    + D   G +S+S+ +  ++G +WWL+ +YGP     R   W EL
Sbjct: 674  GDKGGILVLWDDTNFKVNDIKVGNYSISLNILNTNG-NWWLTSVYGPYKYNDRTKLWPEL 733

Query: 185  YDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTW 244
              L  LC   WL+ GDFN+ R   ET++ +  K +M+ FN FIS  +L+DPP +N  +TW
Sbjct: 734  EILQSLCLPNWLIAGDFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTW 793

Query: 245  TNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFD 304
            +NLR  P  SRLDRFL S  W   F  H S+ L R  SDHFPILL+     WGPCPFR +
Sbjct: 794  SNLRVNPTYSRLDRFLLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLN 853

Query: 305  NYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAIS 364
            N  L +K F  N  +WW  +   GFPG++FI+ L  L++ +K W+ +  + +   KKA+ 
Sbjct: 854  NSSLRDKEFQKNFINWWNSSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALL 913

Query: 365  NEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFF 424
             EID ID LE  G +     Q R SLK+DL      +A+ W+QR ++ W    DEN+++F
Sbjct: 914  KEIDIIDKLEFQGEMSTTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYF 973

Query: 425  HKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPI 484
            H+ICT  +R+N I  +    G S+ S   + +  I HF +IY      E +I NL+W PI
Sbjct: 974  HRICTINQRKNLIKSICDPAGTSLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWNPI 1033

Query: 485  N----------------------------------------------------------- 544
            +                                                           
Sbjct: 1034 SRLCQSELCKPFDESEIKSTIMSFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHK 1093

Query: 545  ------------------------------------------------------------ 604
                                                                        
Sbjct: 1094 AGIVNNNVNNTFIALISKKEKCSKPSDYRPISLTTSLYKIMAKALANRLKSALPDTIAEN 1153

Query: 605  ------------------------------------------------------------ 664
                                                                        
Sbjct: 1154 QMAFIKGRQINDAILIANEAIDTWKQRKIKGFVLKLDIEKAFDKISWSFIDYMLAKKHFP 1213

Query: 665  ------------------------------------------------------------ 724
                                                                        
Sbjct: 1214 HKWRKWIKACISNVQYSILLNGAPKGRIKAERGIRQGDPLSPFIFVLAMDYLSRLLSHLE 1273

Query: 725  ------------------------------------------------------------ 784
                                                                        
Sbjct: 1274 SKGAIKGVSFNNCCNISHLLFADDVLIFVEDNERYLNNLQMALTLFEKASGLTFNNSKST 1333

Query: 785  ----SVSA---------------------------------------------------- 844
                ++SA                                                    
Sbjct: 1334 ISPINISAGRTDQIASFFGFQTKFLPVNYLGVPLGGNPRSRSFWDQTIECIHKKLNGWKY 1393

Query: 845  ---------------------------------------------------RGSTNLVRW 871
                                                               + + +L+ W
Sbjct: 1394 SQISKGGRLTLLKASLSSLPTYQLSTFKAPVSVYKEIEKHWRDFLWGGSEDKQNAHLINW 1453

BLAST of Spg033450 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 87.4 bits (215), Expect = 6.1e-17
Identity = 86/353 (24.36%), Positives = 143/353 (40.51%), Query Frame = 0

Query: 498 WEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSS 557
           W+ +S  KAEGG+G   I+  N ALL K +WR  +   +L      ++Y     +  P +
Sbjct: 44  WDHLSCYKAEGGIGFKDIEAFNLALLGKQMWRMLSRPESLMAKVFKSRY---FHKSDPLN 103

Query: 558 SKVSSSRS-PWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSS 617
           + + S  S  W +I   Q+I     R  + NG   + W   W    P      R+ ++  
Sbjct: 104 APLGSRPSFVWKSIHASQEILRQGARAVVGNGEDIIIWRHKWLDSKPAS-AALRMQRVPP 163

Query: 618 NKNLSIAEVWSSSD------RMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRW 677
            +  S++ +   SD      R W       LF  +++R     EL   P  +R  D   W
Sbjct: 164 QEYASVSSILKVSDLIDESGREWRKDVIEMLFP-EVER-KLIGEL--RPGGRRILDSYTW 223

Query: 678 MVSGDGLFTTKSARAILSVLPSRPFHSPGE-------KILNNLWTADIPKKIKVFIWSLF 737
             +  G +T KS   +L+ + ++   SP E        I   +W +    KI+ F+W   
Sbjct: 224 DYTSSGDYTVKSGYWVLTQIINKR-SSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCL 283

Query: 738 HRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHC--SRASFFRNKINLALGLS 797
             S+  +    A+    L   S CI C    E ++HL   C  +R ++  + I + LG  
Sbjct: 284 SNSLPVAG---ALAYRHLSKESACIRCPSCKETVNHLLFKCTFARLTWAISSIPIPLGGE 343

Query: 798 MVPPATIDSFCADLF--TSKAISQRQLLRRNVFIA-TLWLLWNERNRRIFEDK 832
                  DS   +L+   +      Q  + +  +   LW LW  RN  +F  +
Sbjct: 344 WA-----DSIYVNLYWVFNLGNGNPQWEKASQLVPWLLWRLWKNRNELVFRGR 379

BLAST of Spg033450 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 76.3 bits (186), Expect = 1.4e-13
Identity = 68/279 (24.37%), Positives = 130/279 (46.59%), Query Frame = 0

Query: 196 LLGGDFNVFRHSSETSSNNPAKLSM---SKFNKFISDTDLLDPPLINGPYTWTNLRSE-P 255
           +L GDF+    +S+  S     + M    +F   + D+DL+D P     YTW+N + + P
Sbjct: 222 ILVGDFDQIAATSDHYSVLQTSIPMRGLEEFQNCLRDSDLVDIPSRGVHYTWSNHQDDNP 281

Query: 256 VMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFP--ILLDDSSSTWGPCPFRFDNYLLD 315
           ++ +LDR + +  W   F    +       SDH P  I+L++       C FR+ ++L  
Sbjct: 282 IIRKLDRAIANGDWFSSFPSAIAVFELSGVSDHSPCIIILENLPKRSKKC-FRYFSFLST 341

Query: 316 NKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDR 375
           + +F+ ++   W +    G   FS    LK  A K K  KL N   F   +      +D 
Sbjct: 342 HPTFLVSLTVAWEEQIPVGSHMFSLGEHLK--AAK-KCCKLLNRQGFGNIQHKTKEALDS 401

Query: 376 IDALES---SGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHK 435
           +++++S   +   D + +    + K      A LE+ ++ Q+ +  WL D D N+ FFHK
Sbjct: 402 LESIQSQLLTNPSDSLFRVEHVARKKWNFFAAALES-FYRQKSRIKWLQDGDANTRFFHK 461

Query: 436 ICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADI 466
           +  A + +N I  L   + + + +   +++ ++ ++  +
Sbjct: 462 VILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHL 495

BLAST of Spg033450 vs. TAIR 10
Match: AT5G18880.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 73.2 bits (178), Expect = 1.2e-12
Identity = 62/263 (23.57%), Positives = 101/263 (38.40%), Query Frame = 0

Query: 582 RWDIRNGRSTLFWHDNWSSFGPLKFVCNRL--YQLSSNKNLSIAEVWSSSDRMWNFQPRR 641
           R D+ NG S  FW+D W+ FG L          QL   ++  + E   + D  W     R
Sbjct: 13  RCDMGNGESASFWYDAWTDFGQLLTFLGAAGPRQLRIRQDARVVEASRNGD--WFLPAAR 72

Query: 642 PLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPG 701
                + Q +     + P P+  RG D   W  +      + S+R     +     HSP 
Sbjct: 73  ---SDNSQLFLAALTMAPVPHESRGQDSFLWRNAAGSYLPSFSSRDTWEQI---RVHSPT 132

Query: 702 EKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDH 761
                 +W  +   +  +  W  F   + T DRL+    N    PS  +LC    E   H
Sbjct: 133 VPWAKVVWFKEYIPRFSLITWMSFLERLPTRDRLRGWGMNI---PSSWVLCSNGDETHAH 192

Query: 762 LFIHCSRA----SFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIAT 821
           LF  CS +     FF +K   +      PP  + +  + +      S    + + +  + 
Sbjct: 193 LFFECSFSLAIWEFFASKFRPS------PPFGLPAASSWILQLPLRSHSTTILKLLLQSA 252

Query: 822 LWLLWNERNRRIFEDKARTRNQL 839
           ++ +W ERN RIF   + + + L
Sbjct: 253 VYHVWKERNARIFTSISSSASSL 258

BLAST of Spg033450 vs. TAIR 10
Match: AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 72.4 bits (176), Expect = 2.0e-12
Identity = 71/277 (25.63%), Positives = 101/277 (36.46%), Query Frame = 0

Query: 496 VRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFP 555
           V W  V + K EGGLGI  +KE N              + + W                 
Sbjct: 116 VAWSDVCTPKDEGGLGIRSLKEAN--------------KGSFW----------------S 175

Query: 556 SSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLS 615
            S   +     W  I K + +     + DI NG +T FW DNWS  G       RL  ++
Sbjct: 176 ISGNTTLGSWMWKKILKHRALASGFVKHDIHNGSNTSFWFDNWSKIG-------RLIDVT 235

Query: 616 SNK---NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMV 675
            ++   ++ I    S ++ + N +PRR   D  L+     AE +       G D  RW  
Sbjct: 236 GHRGCIDMGITLHASVAEAVVNHRPRRHRHDTLLRIEDVIAE-VRHQGLTSGEDTVRWKG 295

Query: 676 SGD---GLFTTKSARAILSVLPSRPFHSPGEKI--LNNLWTADIPKKIKVFIWSLFHRSI 735
           +GD     F TK   A            P  K+     +W +    K  V  W      +
Sbjct: 296 NGDIFKPCFNTKETWAAT--------REPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRL 343

Query: 736 NTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHC 765
            T DR+ +    +    S C+LC    E  DHLF  C
Sbjct: 356 TTGDRMLSWNAGA---DSSCVLCHHLVETRDHLFFTC 343

BLAST of Spg033450 vs. TAIR 10
Match: AT3G25720.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 64.3 bits (155), Expect = 5.5e-10
Identity = 37/106 (34.91%), Positives = 55/106 (51.89%), Query Frame = 0

Query: 505 KAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLH------FECFPSSS 564
           KAEGGLG+    E N  L LK +WR F+   +LW ++  ++Y  L          F +S 
Sbjct: 22  KAEGGLGLRWFSEWNTTLNLKLVWRLFSGGGSLWVDW--HRYHHLQGLGDASVSKFWTSQ 81

Query: 565 KVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPL 605
           ++ S    W  + +L+ +     R +I NG +  FW DNW+ FGPL
Sbjct: 82  ELVSDSWNWKCLLRLRPLAERFLRCNIGNGLTARFWTDNWTPFGPL 125

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
KAA0039950.1  6.3e-133  27.44         LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] >TYK245...
TYK06777.1    2.0e-131  28.39         LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYK00493.1    2.3e-119  27.83         LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYJ99315.1    1.2e-110  27.77         LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYK08190.1    2.0e-110  25.35         LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
Match Name  E-value  Identity (%)  Description
P0C2F6      2.2e-19  25.07         Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1...
Match Name  E-value   Identity (%)  Description
A0A5A7T9I7  3.0e-133  27.44         LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3C4J1  9.8e-132  28.39         LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3BL61  1.1e-119  27.83         LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3BLV7  5.6e-111  27.77         LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5A7US62  9.5e-111  25.35         LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
Match Name   E-value  Identity (%)  Description
AT4G29090.1  6.1e-17  24.36         Ribonuclease H-like superfamily protein
AT1G43760.1  1.4e-13  24.37         DNAse I-like superfamily protein
AT5G18880.1  1.2e-12  23.57         RNA-directed DNA polymerase (reverse transcriptase)-related family protein
AT3G24255.1  2.0e-12  25.63         RNA-directed DNA polymerase (reverse transcriptase)-related family protein
AT3G25720.1  5.5e-10  34.91         RNA-directed DNA polymerase (reverse transcriptase)-related family protein
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                   Source       Source Term     Source Description                    Alignment
IPR036691  Endonuclease/exonuclease/phosphatase superfamily  GENE3D       3.60.10.10      Endonuclease/exonuclease/phosphatase  coord: 92..290; e-value: 4.0E-23; score: 84.4
IPR036691  Endonuclease/exonuclease/phosphatase superfamily  SUPERFAMILY  56219           DNase I-like                          coord: 94..290
IPR026960  Reverse transcriptase zinc-binding domain         PFAM         PF13966         zf-RVT                                coord: 698..768; e-value: 1.2E-12; score: 48.3
None       No IPR available                                  MOBIDB_LITE  mobidb-lite     disorder_prediction                   coord: 1..25
None       No IPR available                                  MOBIDB_LITE  mobidb-lite     disorder_prediction                   coord: 7..22
None       No IPR available                                  PANTHER      PTHR33710       BNAC02G09200D PROTEIN                 coord: 150..404
None       No IPR available                                  PANTHER      PTHR33710:SF32  SUBFAMILY NOT NAMED                   coord: 150..404

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Spg033450.1   Spg033450.1  mRNA
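
Each HSP header in the BLAST reports on this page gives match counts as fractions with a percentage in parentheses, for example 315/1148 (27.44%). The percentage is the match count divided by the alignment length. The sketch below recomputes it from one statistics line as a consistency check. It is only an illustration: `parse_hsp_stats` and `HSP_STATS` are hypothetical names, not part of BLAST or of this database, and the pattern assumes the line layout used on this page.

```python
import re

# Pattern for the statistics line of an HSP header as laid out on this page.
# The optional "i" accepts either spelling of the positives label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line.

    Hypothetical helper for illustration; it is not part of any BLAST tool.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Percentages are recomputed from the counts, rounded to two decimals
        # to match the precision shown in the report.
        "computed_identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "computed_positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    example = "Identity = 315/1148 (27.44%), Positives = 444/1148 (38.68%), Query Frame = 0"
    stats = parse_hsp_stats(example)
    print(stats["computed_identity_pct"], stats["reported_identity_pct"])
```

The denominator is the length of the aligned region including gap columns, not the length of the query protein, which is why the long gapped stretches in the Cucumis melo alignments pull the identity values down to about 25 to 28%.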