HG10010471 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10010471
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Location: Chr06: 22530163 .. 22531780 (+)
RNA-Seq Expression: HG10010471
Synteny: HG10010471
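
The Location field gives the span of the gene on Chr06. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly), the length of the feature can be computed as follows. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field: Chr06: 22530163 .. 22531780 (+)
gene_length = feature_length(22530163, 22531780)
print(gene_length)  # 1618
```

Under that assumption the gene spans 1618 bp, introns included.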
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCTCTATCAACGTTGACGCATTAGATATTTTCTTAGATTTAATCACCCAATTCACTGAAATTATGAACGATGAAATCTGCCTGAAAATCACACCGTCGGAGTTCTCGATAATCGCTCGGTACGGGTGCCCTCTTTTCTTTGCAATGATGGTTATACCGCCTGTATTCTTCCCCGAGTATTCCGTTGATCGGAATCACAATTCAAAGATTTCCCTCCAGTACTTGCACAATGCTGTCTTGGAAGCTGAAAGCTTTTCTTCTTCAATGACCATGCAACTTCAAGAACAAGAAAATACTATACACCTTATATTTGAGCCTTCAAGTTAGTATACATAATACCATTTTTCCTTTGTTTCTTCTCTTCAAAAAATTAATCTTGATTCTTTTTTTGGTTTTTTTTTGTTTTTTTGTCATTTCTTGATTCTTTTATGCATTCCGAAGAATTTGTTTACGTTTGATAATTGAACGTAGGGCCAATTTGTGTGGATTTTTGTTTTCATAAAATATGATTCTTGTAAACTAAATTTATGTATAACTTTGTCCTTAATTTTTCAGGCCATTGGAAATCGATTCTTGAAAATTAAAACTCATCGCATCAAATCATATCAATTTGTTTATGAACATATTAGATCATTCAACACTGCTATTCTGTGTGGATTCTTGTCTTCATAAAATTTGAATCTTGTTTACGTAAAATTTTGTCGAGGCATTTTTTTAGAGAAAGATATGATAGGATATGAGGGTCAGTAGGTAATTTTTTGAGCATTGGAAATCGATTTTCTTGCAATTTCTTTAATTAAAAGCCCACAATATCATTTTTTTTACGAACATCATATCAAATACTTAAAAAGGATGTTTTATACCTCAATCTTGCTTGCTTTAGAATTGGGGAATTTTTAAAATTAAAAAAAATAAGGGAAACTATTTGCACAAAATAACAAAATGTTTAAATAGTTGTGATAGACGCTGATAGAAGTCTATCAAGGTCTATCAATAATATAGATGCTGATAGAATTCTATCAATGTCTATCAATTTTTTTTTTTTGCTATTTTCCGTAAATAATTTGACATTTTTTCTATTTGTGAAAATTTCCCTTTTCACATGTTACCGATTAATTATATATCATTTTATTCTTGTATTTTGAATTTCATTACATTTTAATTAATTCTTGTACTTCTACTAAATCTAACATTATTATTATTACTATTATTATATAACTTTCTAGTTTTATTTTAAGGGCCTTCTGCACTACCATTGCATCATGAATTGAGATTCTTGCCTATGGAAGATTGGTCTCTTGGCGAAGTTGCCTTATGTGGTGATAAAATTTTATCCGTTGAGGCAGATATGTATAGCGATATTATTACAACATTTTCTGTCTACAATGAAGCTGATTCAAGTAATGCAATAATTCTTCTCTCATTTTTCATAAATTTTACTTCTTTTTTTTTTTTTAATAAAAAATATGGTGTTTATATCTTAGCTTGCTAAAATCTTCTTTCATTTCTCACAGTTTTGATTACTTTAGTGGGTTCTCAAGCCACCTTCTCTGTCGTTCCATTTGATCACGAGATAACTGTTACCCAAGAAGTATGTTTTGTGAACTTTGTGTAA

mRNA sequence

ATGTTCTCTATCAACGTTGACGCATTAGATATTTTCTTAGATTTAATCACCCAATTCACTGAAATTATGAACGATGAAATCTGCCTGAAAATCACACCGTCGGAGTTCTCGATAATCGCTCGGTACGGGTGCCCTCTTTTCTTTGCAATGATGGTTATACCGCCTGTATTCTTCCCCGAGTATTCCGTTGATCGGAATCACAATTCAAAGATTTCCCTCCAGTACTTGCACAATGCTGTCTTGGAAGCTGAAAGCTTTTCTTCTTCAATGACCATGCAACTTCAAGAACAAGAAAATACTATACACCTTATATTTGAGCCTTCAAGGCCTTCTGCACTACCATTGCATCATGAATTGAGATTCTTGCCTATGGAAGATTGGTCTCTTGGCGAAGTTGCCTTATGTGGTGATAAAATTTTATCCGTTGAGGCAGATATGTATAGCGATATTATTACAACATTTTCTGTCTACAATGAAGCTGATTCAATTTTGATTACTTTAGTGGGTTCTCAAGCCACCTTCTCTGTCGTTCCATTTGATCACGAGATAACTGTTACCCAAGAAGTATGTTTTGTGAACTTTGTGTAA

Coding sequence (CDS)

ATGTTCTCTATCAACGTTGACGCATTAGATATTTTCTTAGATTTAATCACCCAATTCACTGAAATTATGAACGATGAAATCTGCCTGAAAATCACACCGTCGGAGTTCTCGATAATCGCTCGGTACGGGTGCCCTCTTTTCTTTGCAATGATGGTTATACCGCCTGTATTCTTCCCCGAGTATTCCGTTGATCGGAATCACAATTCAAAGATTTCCCTCCAGTACTTGCACAATGCTGTCTTGGAAGCTGAAAGCTTTTCTTCTTCAATGACCATGCAACTTCAAGAACAAGAAAATACTATACACCTTATATTTGAGCCTTCAAGGCCTTCTGCACTACCATTGCATCATGAATTGAGATTCTTGCCTATGGAAGATTGGTCTCTTGGCGAAGTTGCCTTATGTGGTGATAAAATTTTATCCGTTGAGGCAGATATGTATAGCGATATTATTACAACATTTTCTGTCTACAATGAAGCTGATTCAATTTTGATTACTTTAGTGGGTTCTCAAGCCACCTTCTCTGTCGTTCCATTTGATCACGAGATAACTGTTACCCAAGAAGTATGTTTTGTGAACTTTGTGTAA

Protein sequence

MFSINVDALDIFLDLITQFTEIMNDEICLKITPSEFSIIARYGCPLFFAMMVIPPVFFPEYSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSRPSALPLHHELRFLPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLVGSQATFSVVPFDHEITVTQEVCFVNFV
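
The protein sequence above corresponds to a conceptual translation of the CDS. The sketch below illustrates that relationship on the first and last codons of the CDS only. It assumes the standard genetic code, which the page does not state explicitly, and the `translate` helper is hypothetical rather than part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGTTCTCTATCAACGTTGAC"
cds_end = "GTGAACTTTGTGTAA"
print(translate(cds_start))  # MFSINVD
print(translate(cds_end))    # VNFV
```

The two fragments reproduce the first seven and last four residues of the protein sequence (MFSINVD ... VNFV). The full CDS can be passed to the same function.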
Homology
BLAST of HG10010471 vs. NCBI nr
Match: XP_031736681.1 (uncharacterized protein LOC116402062 [Cucumis sativus] >XP_031741497.1 uncharacterized protein LOC116403863 [Cucumis sativus])

HSP 1 Score: 173.7 bits (439), Expect = 1.5e-39
Identity = 95/189 (50.26%), Positives = 129/189 (68.25%), Query Frame = 0

Query: 1   MFSINVDALDIFLDLITQFTEIMNDEICLKITPSEFSIIARYGCPLFFAMMVIPPVFFPE 60
           MFSI +D +D FLD    F E+ +DEICLK  PS  SII RY CP+FF  M +P   F E
Sbjct: 1   MFSIKIDDIDPFLDATILFAELTHDEICLKFLPSTMSIIVRYECPIFFVTMSLPQPLFVE 60

Query: 61  YSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSRPSALPLHHELR 120
           YSVDRNH S+ISL   ++A+LE ++F +++ + LQE +N I L FEPS PS LPL+ EL 
Sbjct: 61  YSVDRNHISRISLLSFYSALLECQTF-AALNIHLQEDQNKILLTFEPSSPSTLPLNRELT 120

Query: 121 F-LPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLVGSQATFSVVPF 180
           F +PMEDWS  +V   G K+ S+E+ ++ DII  FS + EA++ILI   GS+ +FS++PF
Sbjct: 121 FVVPMEDWSTAQVNFDG-KVFSIESQLFIDIIQLFSSFTEANTILIASFGSKVSFSILPF 180

Query: 181 DHEITVTQE 189
             E  +T+E
Sbjct: 181 -AETPLTEE 186
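
The Identity and Positives percentages in each HSP header are the corresponding counts divided by the alignment length. As a small sketch, the figures for the XP_031736681.1 hit above can be reproduced as follows. The helper name `percent` is hypothetical, and rounding to two decimals is an assumption that matches the values shown.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Percentage of aligned columns counted as matches, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP header of the XP_031736681.1 hit.
identity = percent(95, 189)
positives = percent(129, 189)
print(identity, positives)  # 50.26 68.25
```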

BLAST of HG10010471 vs. NCBI nr
Match: KGN45637.1 (hypothetical protein Csa_005027 [Cucumis sativus])

HSP 1 Score: 155.2 bits (391), Expect = 5.7e-34
Identity = 84/163 (51.53%), Positives = 113/163 (69.33%), Query Frame = 0

Query: 1   MFSINVDALDIFLDLITQFTEIMNDEICLKITPSEFSIIARYGCPLFFAMMVIPPVFFPE 60
           MFSI +D LD  LD I+ FT+I+ND+ICLK + S FSIIARY  P FFAM+ IP   F E
Sbjct: 1   MFSIKIDKLDPLLDAISLFTDIVNDKICLKFSLSTFSIIARYQHPFFFAMLFIPEPLFAE 60

Query: 61  YSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSRPSALPLHHELR 120
           Y V R+H  ++SL  LH A+   +++ SS+ + LQE++N I L FEPSR S +P+  ++R
Sbjct: 61  YFVGRDHILRVSLLSLHTALARGQTY-SSLRIHLQEEQNIICLAFEPSRHSPVPMRRKMR 120

Query: 121 F-LPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADS 163
           F +PMEDWS GE+     K  S+E+D++ DIITTF  YNE D+
Sbjct: 121 FEVPMEDWSAGEIDF-DAKSFSIESDLFRDIITTFYDYNEVDT 161

BLAST of HG10010471 vs. NCBI nr
Match: KAA0047513.1 (hypothetical protein E6C27_scaffold498G001420 [Cucumis melo var. makuwa])

HSP 1 Score: 131.7 bits (330), Expect = 6.7e-27
Identity = 73/141 (51.77%), Positives = 101/141 (71.63%), Query Frame = 0

Query: 50  MMVIPPVFFPEYSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSR 109
           MM +P   F EYSVDRNH S +SL  LHNA+++ E+F +++ + ++E++N I L FE S 
Sbjct: 1   MMSMPQPLFAEYSVDRNHISIVSLPPLHNALVDGETF-ATLNISIEEEQNKIFLTFEASS 60

Query: 110 PSALPLHHELRF-LPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLV 169
           PS LPL+ EL F +PMEDWS G V   G KI S+E+++++DII  FS +NEAD+ILIT  
Sbjct: 61  PSTLPLNRELTFVVPMEDWSPGPVKFDG-KIFSIESELFTDIIQLFSAFNEADAILITAF 120

Query: 170 GSQATFSVVPFDHEITVTQEV 190
           GS+ TFSV PF  E  +T+E+
Sbjct: 121 GSKVTFSVPPF-AETPLTEEI 138

BLAST of HG10010471 vs. NCBI nr
Match: KGN61523.1 (hypothetical protein Csa_006491 [Cucumis sativus])

HSP 1 Score: 131.0 bits (328), Expect = 1.1e-26
Identity = 87/195 (44.62%), Positives = 105/195 (53.85%), Query Frame = 0

Query: 1   MFSINVDALDIFLDLITQFTEIMNDEICLKITPSEFSIIARYGCPLFFAMMVIPPVFFPE 60
           MFSI VD LD FLD  + FTE +NDEICLK +PS FS+IARY CP FFAM+ +P   F E
Sbjct: 1   MFSIKVDNLDPFLDATSLFTEFINDEICLKFSPSTFSMIARYQCPTFFAMLFMPHPLFVE 60

Query: 61  YSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSRPSALPLHHELR 120
           YSVDRNH S+ISL+   NA+LE +S+ SSM++ L+E +NTI   FEP             
Sbjct: 61  YSVDRNHISRISLRCFRNALLEGQSY-SSMSIHLREPQNTILFKFEP------------- 120

Query: 121 FLPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLVGSQATFSVVPFD 180
                                                    SILIT  GSQ TFS VP +
Sbjct: 121 -----------------------------------------SILITSRGSQITFSAVPRE 139

Query: 181 HEITVTQEVCFVNFV 196
            EI + +EVCFV FV
Sbjct: 181 -EIIIREEVCFVKFV 139

BLAST of HG10010471 vs. NCBI nr
Match: TYK26483.1 (hypothetical protein E5676_scaffold313G00490 [Cucumis melo var. makuwa])

HSP 1 Score: 130.6 bits (327), Expect = 1.5e-26
Identity = 73/140 (52.14%), Positives = 100/140 (71.43%), Query Frame = 0

Query: 50  MMVIPPVFFPEYSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSR 109
           MM +P   F EYSVDRNH S +SL  LHNA+++ E+F +++ + ++E++N I L FE S 
Sbjct: 1   MMSMPQPLFAEYSVDRNHISIVSLPPLHNALVDGETF-ATLNISIEEEQNKIFLTFEASS 60

Query: 110 PSALPLHHELRF-LPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLV 169
           PS LPL+ EL F +PMEDWS G V   G KI S+E+++++DII  FS +NEAD+ILIT  
Sbjct: 61  PSTLPLNRELTFVVPMEDWSPGPVKFDG-KIFSIESELFTDIIQLFSAFNEADAILITAF 120

Query: 170 GSQATFSVVPFDHEITVTQE 189
           GS+ TFSV PF  E  +T+E
Sbjct: 121 GSKVTFSVPPF-AETPLTEE 137

BLAST of HG10010471 vs. ExPASy TrEMBL
Match: A0A0A0KB76 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G002310 PE=4 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 2.8e-34
Identity = 84/163 (51.53%), Positives = 113/163 (69.33%), Query Frame = 0

Query: 1   MFSINVDALDIFLDLITQFTEIMNDEICLKITPSEFSIIARYGCPLFFAMMVIPPVFFPE 60
           MFSI +D LD  LD I+ FT+I+ND+ICLK + S FSIIARY  P FFAM+ IP   F E
Sbjct: 1   MFSIKIDKLDPLLDAISLFTDIVNDKICLKFSLSTFSIIARYQHPFFFAMLFIPEPLFAE 60

Query: 61  YSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSRPSALPLHHELR 120
           Y V R+H  ++SL  LH A+   +++ SS+ + LQE++N I L FEPSR S +P+  ++R
Sbjct: 61  YFVGRDHILRVSLLSLHTALARGQTY-SSLRIHLQEEQNIICLAFEPSRHSPVPMRRKMR 120

Query: 121 F-LPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADS 163
           F +PMEDWS GE+     K  S+E+D++ DIITTF  YNE D+
Sbjct: 121 FEVPMEDWSAGEIDF-DAKSFSIESDLFRDIITTFYDYNEVDT 161

BLAST of HG10010471 vs. ExPASy TrEMBL
Match: A0A5A7TVD7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold498G001420 PE=4 SV=1)

HSP 1 Score: 131.7 bits (330), Expect = 3.3e-27
Identity = 73/141 (51.77%), Positives = 101/141 (71.63%), Query Frame = 0

Query: 50  MMVIPPVFFPEYSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSR 109
           MM +P   F EYSVDRNH S +SL  LHNA+++ E+F +++ + ++E++N I L FE S 
Sbjct: 1   MMSMPQPLFAEYSVDRNHISIVSLPPLHNALVDGETF-ATLNISIEEEQNKIFLTFEASS 60

Query: 110 PSALPLHHELRF-LPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLV 169
           PS LPL+ EL F +PMEDWS G V   G KI S+E+++++DII  FS +NEAD+ILIT  
Sbjct: 61  PSTLPLNRELTFVVPMEDWSPGPVKFDG-KIFSIESELFTDIIQLFSAFNEADAILITAF 120

Query: 170 GSQATFSVVPFDHEITVTQEV 190
           GS+ TFSV PF  E  +T+E+
Sbjct: 121 GSKVTFSVPPF-AETPLTEEI 138

BLAST of HG10010471 vs. ExPASy TrEMBL
Match: A0A0A0LIE0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G155090 PE=4 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 5.6e-27
Identity = 87/195 (44.62%), Positives = 105/195 (53.85%), Query Frame = 0

Query: 1   MFSINVDALDIFLDLITQFTEIMNDEICLKITPSEFSIIARYGCPLFFAMMVIPPVFFPE 60
           MFSI VD LD FLD  + FTE +NDEICLK +PS FS+IARY CP FFAM+ +P   F E
Sbjct: 1   MFSIKVDNLDPFLDATSLFTEFINDEICLKFSPSTFSMIARYQCPTFFAMLFMPHPLFVE 60

Query: 61  YSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSRPSALPLHHELR 120
           YSVDRNH S+ISL+   NA+LE +S+ SSM++ L+E +NTI   FEP             
Sbjct: 61  YSVDRNHISRISLRCFRNALLEGQSY-SSMSIHLREPQNTILFKFEP------------- 120

Query: 121 FLPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLVGSQATFSVVPFD 180
                                                    SILIT  GSQ TFS VP +
Sbjct: 121 -----------------------------------------SILITSRGSQITFSAVPRE 139

Query: 181 HEITVTQEVCFVNFV 196
            EI + +EVCFV FV
Sbjct: 181 -EIIIREEVCFVKFV 139

BLAST of HG10010471 vs. ExPASy TrEMBL
Match: A0A5D3DSC2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold313G00490 PE=4 SV=1)

HSP 1 Score: 130.6 bits (327), Expect = 7.3e-27
Identity = 73/140 (52.14%), Positives = 100/140 (71.43%), Query Frame = 0

Query: 50  MMVIPPVFFPEYSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSR 109
           MM +P   F EYSVDRNH S +SL  LHNA+++ E+F +++ + ++E++N I L FE S 
Sbjct: 1   MMSMPQPLFAEYSVDRNHISIVSLPPLHNALVDGETF-ATLNISIEEEQNKIFLTFEASS 60

Query: 110 PSALPLHHELRF-LPMEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLV 169
           PS LPL+ EL F +PMEDWS G V   G KI S+E+++++DII  FS +NEAD+ILIT  
Sbjct: 61  PSTLPLNRELTFVVPMEDWSPGPVKFDG-KIFSIESELFTDIIQLFSAFNEADAILITAF 120

Query: 170 GSQATFSVVPFDHEITVTQE 189
           GS+ TFSV PF  E  +T+E
Sbjct: 121 GSKVTFSVPPF-AETPLTEE 137

BLAST of HG10010471 vs. ExPASy TrEMBL
Match: A0A5D3DZ07 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1810G00070 PE=4 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 4.4e-16
Identity = 63/189 (33.33%), Positives = 98/189 (51.85%), Query Frame = 0

Query: 1   MFSINVDALDIFLDLITQFTEIMNDEICLKITPSEFSIIARYGCPLFFAMMVIPPVFFPE 60
           MF + +   D  LD  +   +I  D   LK TPS+F IIA +  P F A + + P +F  
Sbjct: 1   MFLVKLKNFDPLLDATSFLAQISFDNADLKFTPSKFFIIASHRSPRFIATLQLSPQWFTT 60

Query: 61  YSVDRNHNSKISLQYLHNAVLEAESFSSSMTMQLQEQENTIHLIFEPSRPSALPLHHELR 120
           +SVD +H+SK+SL+  H+A+L+  SF +SMT+ L ++ N + L F+       PLHHEL 
Sbjct: 61  FSVDNDHSSKVSLESFHDAILDGGSF-ASMTIHLLDKTNQMILRFDTPSSEIQPLHHELT 120

Query: 121 FLP--MEDWSLGEVALCGDKILSVEADMYSDIITTFSVYNEADSILITLVGSQATFSVVP 180
             P   ED  +G+  L   K   V++     II    ++     I + +  S+  FS+  
Sbjct: 121 LSPPQAEDNQIGQHELDERKYFIVKSKALRRIIKDLPIFQNDSIISVDVTNSRVKFSIA- 180

Query: 181 FDHEITVTQ 188
              EI +T+
Sbjct: 181 -SKEIILTE 186

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_031736681.1  1.5e-39   50.26         uncharacterized protein LOC116402062 [Cucumis sativus] >XP_031741497.1 uncharact...
KGN45637.1      5.7e-34   51.53         hypothetical protein Csa_005027 [Cucumis sativus]
KAA0047513.1    6.7e-27   51.77         hypothetical protein E6C27_scaffold498G001420 [Cucumis melo var. makuwa]
KGN61523.1      1.1e-26   44.62         hypothetical protein Csa_006491 [Cucumis sativus]
TYK26483.1      1.5e-26   52.14         hypothetical protein E5676_scaffold313G00490 [Cucumis melo var. makuwa]

Match Name      E-value   Identity (%)  Description
A0A0A0KB76      2.8e-34   51.53         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G002310 PE=4 SV=1
A0A5A7TVD7      3.3e-27   51.77         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A0A0LIE0      5.6e-27   44.62         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G155090 PE=4 SV=1
A0A5D3DSC2      7.3e-27   52.14         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5D3DZ07      4.4e-16   33.33         LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term  Source Description  Alignment
None      No IPR available  GENE3D       3.70.10.10   -                   coord: 1..189; e-value: 1.2E-11; score: 46.4
None      No IPR available  SUPERFAMILY  55979        DNA clamp           coord: 1..108

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10010471.1  HG10010471.1  mRNA