CmUC08G147640 (gene) Watermelon (USVL531) v1

Overview
NameCmUC08G147640
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionRetrotran_gag_3 domain-containing protein
LocationCmU531Chr08: 16824657 .. 16825382 (-)
RNA-Seq ExpressionCmUC08G147640
SyntenyCmUC08G147640
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATTCTTGACCTCGCTGGGAAAAATAAGCTAGGGTTTGTGGATGACAGTTTGAAGCGACTGAATGATGAATTGAAGAATTTATGGATTATCTATAATAATGTAGTCACTGCCTGGATCTTGAATTCCTTGTCCAAAGAAATTTCTGCCAGTGTTAGTTTCTTTGATATTGCTCTTGATATTTGGCTTGATCTTCAACAATGATATTAGAGGAAGAATCATCCACACATTTTTCAATTGCATCGAGAACAATCCAATTTGAATCAAGATCAGCCCTTGGTTATCACTTATTTCGCAAAACTCAAGACTCTGTGGAATGAACTTGCCTTGTATCGTCCAGTTTGTTCCTATGGTTGATATTCTTGCGGTTGAGTTAAAGATCTGATTGATTTCTTTCAGACTGCGTATGTCATGGCTTTCTTGATGGGTTTAAATGAATCTTGTGCCCAGGTTCATACTCAATTGCTTTTAATGGAGCCTGAACCTACTATTCAACGAGCTTTTTCTCTTGATGCTCAAGAAGTTGACCAGCGATCTTTGCTTTCTTTGGAGAGTCCTGCAACGATTAATGCTACTGTCTTGATGGCTACGTCTTCTGGATTCAACTCATCTCCTCGTTCTTTGTCGAATCAATTGAAAAAGAAAGAACGTCTGGTTTGCACTCATTGTCATCTTCTTGGCCATACTATTGACATGAGGGTTCATGGACATCCTGGATATCGATAA

mRNA sequence

ATGATTCTTGACCTCGCTGGGAAAAATAAGCTAGGGTTTGTGGATGACAGTTTGAAGCGACTGAATGATGAATTGAAGAATTTATGGATTATCTATAATAATGTAGTCACTGCCTGGATCTTGAATTCCTTGTCCAAAGAAATTTCTGCCAGTACTGCGTATGTCATGGCTTTCTTGATGGGTTTAAATGAATCTTGTGCCCAGGTTCATACTCAATTGCTTTTAATGGAGCCTGAACCTACTATTCAACGAGCTTTTTCTCTTGATGCTCAAGAAGTTGACCAGCGATCTTTGCTTTCTTTGGAGAGTCCTGCAACGATTAATGCTACTGTCTTGATGGCTACGTCTTCTGGATTCAACTCATCTCCTCGTTCTTTGTCGAATCAATTGAAAAAGAAAGAACGTCTGGTTTGCACTCATTGTCATCTTCTTGGCCATACTATTGACATGAGGGTTCATGGACATCCTGGATATCGATAA

Coding sequence (CDS)

ATGATTCTTGACCTCGCTGGGAAAAATAAGCTAGGGTTTGTGGATGACAGTTTGAAGCGACTGAATGATGAATTGAAGAATTTATGGATTATCTATAATAATGTAGTCACTGCCTGGATCTTGAATTCCTTGTCCAAAGAAATTTCTGCCAGTACTGCGTATGTCATGGCTTTCTTGATGGGTTTAAATGAATCTTGTGCCCAGGTTCATACTCAATTGCTTTTAATGGAGCCTGAACCTACTATTCAACGAGCTTTTTCTCTTGATGCTCAAGAAGTTGACCAGCGATCTTTGCTTTCTTTGGAGAGTCCTGCAACGATTAATGCTACTGTCTTGATGGCTACGTCTTCTGGATTCAACTCATCTCCTCGTTCTTTGTCGAATCAATTGAAAAAGAAAGAACGTCTGGTTTGCACTCATTGTCATCTTCTTGGCCATACTATTGACATGAGGGTTCATGGACATCCTGGATATCGATAA

Protein sequence

MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISASTAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPATINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTIDMRVHGHPGYR
Homology
BLAST of CmUC08G147640 vs. NCBI nr
Match: XP_038905564.1 (uncharacterized protein LOC120091546 [Benincasa hispida])

HSP 1 Score: 122.1 bits (305), Expect = 4.4e-24
Identity = 90/245 (36.73%), Postives = 116/245 (47.35%), Query Frame = 0

Query: 1   MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS--------- 60
           M + L  KNKLGF+D S+     E+   WI+ N+VVT WILNSLSKEISAS         
Sbjct: 1   MKIGLTMKNKLGFIDSSIVHPTGEMHQSWIVCNSVVTTWILNSLSKEISASVYFSDLAQD 60

Query: 61  ------------------------------------------------------------ 120
                                                                       
Sbjct: 61  IWVDLQEQYQRKNCLHAYQIRRELSNLTQSQDSVTTYYAKWKTLWNELASYRPSCSCGRC 120

Query: 121 -------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSL 157
                        T +VM FLMGLNES +Q+HTQLLLME EP+I +AFS   QEV+QR++
Sbjct: 121 TCDGVKDLNTYLQTEHVMTFLMGLNESFSQIHTQLLLMELEPSINKAFSSVIQEVEQRTI 180

BLAST of CmUC08G147640 vs. NCBI nr
Match: XP_038904477.1 (uncharacterized protein LOC120090845 [Benincasa hispida])

HSP 1 Score: 118.2 bits (295), Expect = 6.3e-23
Identity = 86/248 (34.68%), Postives = 119/248 (47.98%), Query Frame = 0

Query: 1   MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS--------- 60
           M + L  KNKLGF++  + R + EL + WII N +VT WILNSLSKEISAS         
Sbjct: 1   MKIGLTVKNKLGFINGEISRPSGELLSSWIICNGIVTTWILNSLSKEISASINFSDSAQE 60

Query: 61  ------------------------------------------------------------ 120
                                                                       
Sbjct: 61  IWVDLQERYQRKNRPRVFQLRRENSNLSQNQDSITTYYAKLKTLWNELISYRPSCSCGKC 120

Query: 121 -------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSL 157
                        T YV+AFLMGLN+S A + +QLLLMEP+PTI RAFSL AQE+DQ++ 
Sbjct: 121 TCGGVKNLQTYFQTEYVIAFLMGLNDSSAPIRSQLLLMEPKPTINRAFSLVAQEIDQKAY 180

BLAST of CmUC08G147640 vs. NCBI nr
Match: XP_022148562.1 (uncharacterized protein LOC111017196 [Momordica charantia])

HSP 1 Score: 117.5 bits (293), Expect = 1.1e-22
Identity = 94/255 (36.86%), Postives = 112/255 (43.92%), Query Frame = 0

Query: 1   MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS--------- 60
           M + L  KNK G VD S+ R +DE  N WII NNVV AWILNSLSKEISAS         
Sbjct: 1   MKIALTVKNKFGXVDGSIPRPDDEHLNSWIICNNVVIAWILNSLSKEISASVLFADSARE 60

Query: 61  ------------------------------------------------------------ 120
                                                                       
Sbjct: 61  IWLDLQERFQRQNRPRIFQLRRDLSTLVQDQLSVSAYFTKLKTLWTELAAYRPNCSCGRC 120

Query: 121 -------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSL 160
                        T YVM FLMGLN+S +Q+   LLLM P PTI  AF L AQEV QR +
Sbjct: 121 TCGGVKSLVEYFQTEYVMCFLMGLNDSFSQIRAXLLLMXPPPTINXAFXLIAQEVQQRXI 180

BLAST of CmUC08G147640 vs. NCBI nr
Match: XP_022154919.1 (uncharacterized protein LOC111022065 [Momordica charantia])

HSP 1 Score: 116.7 bits (291), Expect = 1.8e-22
Identity = 87/245 (35.51%), Postives = 115/245 (46.94%), Query Frame = 0

Query: 1   MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAST-------- 60
           +++ L  KNK+GFVD S+ R  D   + WII NNVV +WI NSLSK+ISAS         
Sbjct: 58  IVIALTVKNKIGFVDGSISRPTDGRLHSWIICNNVVISWIFNSLSKKISASVLFSDSAHE 117

Query: 61  ------------------------------------------------------------ 120
                                                                       
Sbjct: 118 IWLDLKERFQRQNRPRIFQLRRELSNLTQDQLSVTAYFTRLKTLWSELALYRPACSCGRC 177

Query: 121 --------------AYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSL 160
                          YVMAFLMGLN S +Q+  QLLLMEP PTI RAF+L AQE+ QRS 
Sbjct: 178 SYGGVKSIEAHYQQEYVMAFLMGLNVSFSQIRAQLLLMEPAPTINRAFALVAQEMQQRS- 237

BLAST of CmUC08G147640 vs. NCBI nr
Match: XP_022141216.1 (uncharacterized protein LOC111011669 [Momordica charantia])

HSP 1 Score: 101.3 bits (251), Expect = 8.0e-18
Identity = 62/110 (56.36%), Postives = 77/110 (70.00%), Query Frame = 0

Query: 56  MAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPA---TINATVL 115
           MAFLMGLNES  QV  QLLLMEPE TI RAFSL AQEV+QR+ L+  S A   +I A  L
Sbjct: 1   MAFLMGLNESFNQVRAQLLLMEPEXTINRAFSLVAQEVEQRTSLATASSAFASSIPAAFL 60

Query: 116 MATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR 160
             TS+  N++    S+Q ++KER  CTHCHL GHT+D   ++HG+ PG+R
Sbjct: 61  ARTSASSNNT--RASSQPRRKERPYCTHCHLQGHTVDRCYKLHGYPPGFR 108

BLAST of CmUC08G147640 vs. ExPASy TrEMBL
Match: A0A6J1D5E3 (uncharacterized protein LOC111017196 OS=Momordica charantia OX=3673 GN=LOC111017196 PE=4 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 5.2e-23
Identity = 94/255 (36.86%), Postives = 112/255 (43.92%), Query Frame = 0

Query: 1   MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAS--------- 60
           M + L  KNK G VD S+ R +DE  N WII NNVV AWILNSLSKEISAS         
Sbjct: 1   MKIALTVKNKFGXVDGSIPRPDDEHLNSWIICNNVVIAWILNSLSKEISASVLFADSARE 60

Query: 61  ------------------------------------------------------------ 120
                                                                       
Sbjct: 61  IWLDLQERFQRQNRPRIFQLRRDLSTLVQDQLSVSAYFTKLKTLWTELAAYRPNCSCGRC 120

Query: 121 -------------TAYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSL 160
                        T YVM FLMGLN+S +Q+   LLLM P PTI  AF L AQEV QR +
Sbjct: 121 TCGGVKSLVEYFQTEYVMCFLMGLNDSFSQIRAXLLLMXPPPTINXAFXLIAQEVQQRXI 180

BLAST of CmUC08G147640 vs. ExPASy TrEMBL
Match: A0A6J1DNP7 (uncharacterized protein LOC111022065 OS=Momordica charantia OX=3673 GN=LOC111022065 PE=4 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 8.9e-23
Identity = 87/245 (35.51%), Postives = 115/245 (46.94%), Query Frame = 0

Query: 1   MILDLAGKNKLGFVDDSLKRLNDELKNLWIIYNNVVTAWILNSLSKEISAST-------- 60
           +++ L  KNK+GFVD S+ R  D   + WII NNVV +WI NSLSK+ISAS         
Sbjct: 58  IVIALTVKNKIGFVDGSISRPTDGRLHSWIICNNVVISWIFNSLSKKISASVLFSDSAHE 117

Query: 61  ------------------------------------------------------------ 120
                                                                       
Sbjct: 118 IWLDLKERFQRQNRPRIFQLRRELSNLTQDQLSVTAYFTRLKTLWSELALYRPACSCGRC 177

Query: 121 --------------AYVMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSL 160
                          YVMAFLMGLN S +Q+  QLLLMEP PTI RAF+L AQE+ QRS 
Sbjct: 178 SYGGVKSIEAHYQQEYVMAFLMGLNVSFSQIRAQLLLMEPAPTINRAFALVAQEMQQRS- 237

BLAST of CmUC08G147640 vs. ExPASy TrEMBL
Match: A0A2N9H1Z3 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS33502 PE=4 SV=1)

HSP 1 Score: 102.8 bits (255), Expect = 1.3e-18
Identity = 69/193 (35.75%), Postives = 99/193 (51.30%), Query Frame = 0

Query: 1   MILDLAGKNKLGFVDDSLKRLNDELK---NLWIIYNNVVTAWILNSLSKEISASTAY--- 60
           MI+ L  KNK+GF++ ++   NDE     NLW   N +V +WILNS+SK+I++S  Y   
Sbjct: 491 MIMALTAKNKIGFINGTITAPNDETLPSFNLWTRCNTMVISWILNSVSKDIASSVIYANT 550

Query: 61  -------------------------VMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLD 120
                                    VM FLMGLN+S A V  Q+L+MEP P + + FSL 
Sbjct: 551 AQEMSFPPCSCGALKILTENKQHENVMQFLMGLNDSFANVRAQILMMEPLPAMNKVFSLV 610

Query: 121 AQEVDQRSLLSLESPATINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID 160
            QE  QR  + + S  T   ++ + T S           Q  KK+R +C+HC + GH +D
Sbjct: 611 VQEERQRG-IGVPSMTTNGDSIALYTRSEMPRHNYGGRGQFGKKDRPMCSHCGVAGHIVD 670

BLAST of CmUC08G147640 vs. ExPASy TrEMBL
Match: A0A2N9HCD3 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS39909 PE=4 SV=1)

HSP 1 Score: 102.4 bits (254), Expect = 1.7e-18
Identity = 68/189 (35.98%), Postives = 98/189 (51.85%), Query Frame = 0

Query: 1   MILDLAGKNKLGFVDDSLKRLNDELK---NLWIIYNNVVTAWILNSLSKEISASTAY--- 60
           MI+ L  KNK+GF++ ++   NDE     NLW   N +V +WILNS+SK+I++S  Y   
Sbjct: 61  MIMALTAKNKIGFINGTITAPNDETLPSFNLWTRCNTMVISWILNSVSKDIASSVIYANT 120

Query: 61  ---------------------VMAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEV 120
                                VM FLMGLN+S A V  Q+L+MEP P + + FSL  QE 
Sbjct: 121 AQEMWEDLKERFAQEECNMRIVMQFLMGLNDSFANVRAQILMMEPLPAMNKVFSLVVQEE 180

Query: 121 DQRSLLSLESPATINATVLMATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MR 160
            QR  + + S      ++ + T S           Q  KK+R +C+HC + GH +D   +
Sbjct: 181 RQRG-IGVPSMTANGDSIALYTRSEMPRHNYGGRGQFGKKDRPMCSHCGVAGHIVDKCYK 240

BLAST of CmUC08G147640 vs. ExPASy TrEMBL
Match: A0A6J1CIG1 (uncharacterized protein LOC111011669 OS=Momordica charantia OX=3673 GN=LOC111011669 PE=4 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 3.9e-18
Identity = 62/110 (56.36%), Postives = 77/110 (70.00%), Query Frame = 0

Query: 56  MAFLMGLNESCAQVHTQLLLMEPEPTIQRAFSLDAQEVDQRSLLSLESPA---TINATVL 115
           MAFLMGLNES  QV  QLLLMEPE TI RAFSL AQEV+QR+ L+  S A   +I A  L
Sbjct: 1   MAFLMGLNESFNQVRAQLLLMEPEXTINRAFSLVAQEVEQRTSLATASSAFASSIPAAFL 60

Query: 116 MATSSGFNSSPRSLSNQLKKKERLVCTHCHLLGHTID--MRVHGH-PGYR 160
             TS+  N++    S+Q ++KER  CTHCHL GHT+D   ++HG+ PG+R
Sbjct: 61  ARTSASSNNT--RASSQPRRKERPYCTHCHLQGHTVDRCYKLHGYPPGFR 108

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038905564.14.4e-2436.73uncharacterized protein LOC120091546 [Benincasa hispida][more]
XP_038904477.16.3e-2334.68uncharacterized protein LOC120090845 [Benincasa hispida][more]
XP_022148562.11.1e-2236.86uncharacterized protein LOC111017196 [Momordica charantia][more]
XP_022154919.11.8e-2235.51uncharacterized protein LOC111022065 [Momordica charantia][more]
XP_022141216.18.0e-1856.36uncharacterized protein LOC111011669 [Momordica charantia][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1D5E35.2e-2336.86uncharacterized protein LOC111017196 OS=Momordica charantia OX=3673 GN=LOC111017... [more]
A0A6J1DNP78.9e-2335.51uncharacterized protein LOC111022065 OS=Momordica charantia OX=3673 GN=LOC111022... [more]
A0A2N9H1Z31.3e-1835.75Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB... [more]
A0A2N9HCD31.7e-1835.98Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB... [more]
A0A6J1CIG13.9e-1856.36uncharacterized protein LOC111011669 OS=Momordica charantia OX=3673 GN=LOC111011... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR34222:SF6OS02G0671800 PROTEINcoord: 18..149
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 18..149

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC08G147640.1CmUC08G147640.1mRNA