CmUC05G080590 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC05G080590
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Late cornified envelope protein 1E
Location: CmU531Chr05: 417489 .. 418062 (+)
RNA-Seq Expression: CmUC05G080590
Synteny: CmUC05G080590
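
The location string in the overview can be split into its parts programmatically. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the GFF3 convention); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into a dictionary.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("CmU531Chr05: 417489 .. 418062 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # CmU531Chr05 + 574
```

Under that assumption the gene spans 574 bases on the forward strand of CmU531Chr05.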
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCGCCCCATGGATTTGAATCCCCCACCCCCACTCTGTTGCTTGTGCGGCGACGTTGGTTTTCCAGCCAAACTCTTCCGCTGCGTTAGCTGCTCCACTCGATTCCAGCACTCGTAACACACGCATCCCCATTCCACTTGTGTCCTTTTAATTACATGCCTATTTTCTTAAAACTTACATTTTGTAATTACTTCAGTTACTGCAGCAACTTCTACTGTGGGGATCAATCGGGGGATCCCATACTTCAAATATGCGATTGGTGTCGAAGCCAACAGATAACCCGCCGCCCTGCCTCTACAACTGCGACTAATAATACCGGTACTACCTCCCACAAGGATCAGATCACGCAAAGAAGAAGCTCCGGCCCCGGATTCCCCTCCCCTCGCCCTGCTCCACGCCGCTACAAGCTCCTCAAGGATGTCTTGTGCTGAACAAACCAAATACTTTTTCAATTTTCATTTTCTTATATATGATCTTAATCTAATACTGTATGTGTATCTCGCTTTTCCAAATTATATATCAACAAACTACATGACTCTGTGTATTTTATATAATCAACGCCTTTTCGATTACTTC

mRNA sequence

TCGCCCCATGGATTTGAATCCCCCACCCCCACTCTGTTGCTTGTGCGGCGACGTTGGTTTTCCAGCCAAACTCTTCCGCTGCGTTAGCTGCTCCACTCGATTCCAGCACTCTTACTGCAGCAACTTCTACTGTGGGGATCAATCGGGGGATCCCATACTTCAAATATGCGATTGGTGTCGAAGCCAACAGATAACCCGCCGCCCTGCCTCTACAACTGCGACTAATAATACCGGTACTACCTCCCACAAGGATCAGATCACGCAAAGAAGAAGCTCCGGCCCCGGATTCCCCTCCCCTCGCCCTGCTCCACGCCGCTACAAGCTCCTCAAGGATGTCTTGTGCTGAACAAACCAAATACTTTTTCAATTTTCATTTTCTTATATATGATCTTAATCTAATACTGTATGTGTATCTCGCTTTTCCAAATTATATATCAACAAACTACATGACTCTGTGTATTTTATATAATCAACGCCTTTTCGATTACTTC

Coding sequence (CDS)

ATGGATTTGAATCCCCCACCCCCACTCTGTTGCTTGTGCGGCGACGTTGGTTTTCCAGCCAAACTCTTCCGCTGCGTTAGCTGCTCCACTCGATTCCAGCACTCTTACTGCAGCAACTTCTACTGTGGGGATCAATCGGGGGATCCCATACTTCAAATATGCGATTGGTGTCGAAGCCAACAGATAACCCGCCGCCCTGCCTCTACAACTGCGACTAATAATACCGGTACTACCTCCCACAAGGATCAGATCACGCAAAGAAGAAGCTCCGGCCCCGGATTCCCCTCCCCTCGCCCTGCTCCACGCCGCTACAAGCTCCTCAAGGATGTCTTGTGCTGA

Protein sequence

MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQQITRRPASTTATNNTGTTSHKDQITQRRSSGPGFPSPRPAPRRYKLLKDVLC
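
The protein sequence corresponds to the CDS read codon by codon. The following is a minimal Python sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative, and the example input is only the first ten codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    A codon containing a character other than A, C, G or T raises KeyError.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGGATTTGAATCCCCCACCCCCACTCTGT"))  # MDLNPPPPLC
```

Passing the full CDS string to the same function gives a way to check it against the listed protein sequence.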
Homology
BLAST of CmUC05G080590 vs. NCBI nr
Match: XP_022924678.1 (uncharacterized protein LOC111432109 [Cucurbita moschata])

HSP 1 Score: 164.9 bits (416), Expect = 4.1e-37
Identity = 80/117 (68.38%), Positives = 90/117 (76.92%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN   PLCCLCGDVGFPAKLFRC +CS RFQHSYCSNFYCG +S DPI ++CDWCR++
Sbjct: 1   MDLN--RPLCCLCGDVGFPAKLFRCANCSNRFQHSYCSNFYCG-ESADPITRLCDWCRTE 60

Query: 61  QITRRPASTTATNNTGTTSHK-----DQITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
             TRRP+   A  N G TS K     DQIT+ +SS  G PSPRPAPRRYKLLKDV+C
Sbjct: 61  HTTRRPSPAAA--NNGATSQKPHKMTDQITETKSSATGVPSPRPAPRRYKLLKDVMC 112
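
The percentages on each HSP statistics line are the match counts divided by the alignment length, which includes gap columns. The following is a minimal Python sketch that recomputes them from the fractions; the function name `hsp_fractions` is hypothetical and the parsing assumes the "Label = a/b" layout used on this page.

```python
import re

def hsp_fractions(stats_line):
    """Return {label: (matches, length, percent)} for each 'Label = a/b' pair.

    Fields without a fraction (for example 'Query Frame = 0') are skipped.
    """
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        matches, length = int(num), int(den)
        result[label] = (matches, length, round(100 * matches / length, 2))
    return result

line = "Identity = 80/117 (68.38%), Positives = 90/117 (76.92%), Query Frame = 0"
for label, (matches, length, percent) in hsp_fractions(line).items():
    print(f"{label}: {matches}/{length} = {percent}%")
```

For the first hit this reproduces 68.38% identity and 76.92% positives over 117 alignment columns.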

BLAST of CmUC05G080590 vs. NCBI nr
Match: XP_023526855.1 (uncharacterized protein LOC111790233 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 164.5 bits (415), Expect = 5.4e-37
Identity = 80/117 (68.38%), Positives = 89/117 (76.07%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN   PLCCLCGDVGFPAKLFRC +CS RFQHSYCSNFYCG +S DPI ++CDWCR++
Sbjct: 1   MDLN--RPLCCLCGDVGFPAKLFRCANCSNRFQHSYCSNFYCG-ESADPITRLCDWCRTE 60

Query: 61  QITRRPASTTATNNTGTTSHK-----DQITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
             TRRP    A  N G TS K     DQIT+ +SS  G PSPRPAPRRYKLLKDV+C
Sbjct: 61  HTTRRPGPAAA--NNGATSQKPHKITDQITETKSSATGVPSPRPAPRRYKLLKDVMC 112

BLAST of CmUC05G080590 vs. NCBI nr
Match: XP_004134287.1 (uncharacterized protein LOC101222685 [Cucumis sativus] >KGN56359.1 hypothetical protein Csa_010459 [Cucumis sativus])

HSP 1 Score: 162.5 bits (410), Expect = 2.0e-36
Identity = 79/113 (69.91%), Positives = 91/113 (80.53%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN  PPLCCLCGDVGFPAKLFRC +CS RFQHSYCSN+YCG +SGD  +++CDWCRS+
Sbjct: 1   MDLN--PPLCCLCGDVGFPAKLFRCTNCSNRFQHSYCSNYYCG-ESGDATIRVCDWCRSE 60

Query: 61  QITRRPASTTATNNTGTTSHK-DQITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
           Q T RPA  +      TTS K ++IT+RRSS  G PSPRPAPRRYKLLKDV+C
Sbjct: 61  QRTCRPAFAS------TTSQKSNKITERRSSAVGLPSPRPAPRRYKLLKDVMC 104

BLAST of CmUC05G080590 vs. NCBI nr
Match: XP_008437789.1 (PREDICTED: uncharacterized protein LOC103483122 [Cucumis melo])

HSP 1 Score: 161.4 bits (407), Expect = 4.6e-36
Identity = 79/113 (69.91%), Positives = 90/113 (79.65%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN  PPLCCLCGDVGFPA LFRC +CS RFQHSYCSN+Y G +SGD I+++CDWCRS+
Sbjct: 41  MDLN--PPLCCLCGDVGFPANLFRCTNCSNRFQHSYCSNYYSG-ESGDAIIRVCDWCRSE 100

Query: 61  QITRRPASTTATNNTGTTSHKD-QITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
           Q TRRPA+        TTS KD +IT+ RSS  G PSPRPAPRRYKLLKDV+C
Sbjct: 101 QRTRRPAAFAT-----TTSQKDRKITEIRSSAAGLPSPRPAPRRYKLLKDVMC 145

BLAST of CmUC05G080590 vs. NCBI nr
Match: KAA0048831.1 (uncharacterized protein E6C27_scaffold171G00440 [Cucumis melo var. makuwa] >TYK20783.1 uncharacterized protein E5676_scaffold291G00440 [Cucumis melo var. makuwa])

HSP 1 Score: 161.4 bits (407), Expect = 4.6e-36
Identity = 79/113 (69.91%), Positives = 90/113 (79.65%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN  PPLCCLCGDVGFPA LFRC +CS RFQHSYCSN+Y G +SGD I+++CDWCRS+
Sbjct: 1   MDLN--PPLCCLCGDVGFPANLFRCTNCSNRFQHSYCSNYYSG-ESGDAIIRVCDWCRSE 60

Query: 61  QITRRPASTTATNNTGTTSHKD-QITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
           Q TRRPA+        TTS KD +IT+ RSS  G PSPRPAPRRYKLLKDV+C
Sbjct: 61  QRTRRPAAFAT-----TTSQKDRKITEIRSSAAGLPSPRPAPRRYKLLKDVMC 105

BLAST of CmUC05G080590 vs. ExPASy TrEMBL
Match: A0A6J1ED69 (uncharacterized protein LOC111432109 OS=Cucurbita moschata OX=3662 GN=LOC111432109 PE=4 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 2.0e-37
Identity = 80/117 (68.38%), Positives = 90/117 (76.92%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN   PLCCLCGDVGFPAKLFRC +CS RFQHSYCSNFYCG +S DPI ++CDWCR++
Sbjct: 1   MDLN--RPLCCLCGDVGFPAKLFRCANCSNRFQHSYCSNFYCG-ESADPITRLCDWCRTE 60

Query: 61  QITRRPASTTATNNTGTTSHK-----DQITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
             TRRP+   A  N G TS K     DQIT+ +SS  G PSPRPAPRRYKLLKDV+C
Sbjct: 61  HTTRRPSPAAA--NNGATSQKPHKMTDQITETKSSATGVPSPRPAPRRYKLLKDVMC 112

BLAST of CmUC05G080590 vs. ExPASy TrEMBL
Match: A0A0A0L5P1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G117940 PE=4 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 9.9e-37
Identity = 79/113 (69.91%), Positives = 91/113 (80.53%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN  PPLCCLCGDVGFPAKLFRC +CS RFQHSYCSN+YCG +SGD  +++CDWCRS+
Sbjct: 1   MDLN--PPLCCLCGDVGFPAKLFRCTNCSNRFQHSYCSNYYCG-ESGDATIRVCDWCRSE 60

Query: 61  QITRRPASTTATNNTGTTSHK-DQITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
           Q T RPA  +      TTS K ++IT+RRSS  G PSPRPAPRRYKLLKDV+C
Sbjct: 61  QRTCRPAFAS------TTSQKSNKITERRSSAVGLPSPRPAPRRYKLLKDVMC 104

BLAST of CmUC05G080590 vs. ExPASy TrEMBL
Match: A0A5D3DB67 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00440 PE=4 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 2.2e-36
Identity = 79/113 (69.91%), Positives = 90/113 (79.65%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN  PPLCCLCGDVGFPA LFRC +CS RFQHSYCSN+Y G +SGD I+++CDWCRS+
Sbjct: 1   MDLN--PPLCCLCGDVGFPANLFRCTNCSNRFQHSYCSNYYSG-ESGDAIIRVCDWCRSE 60

Query: 61  QITRRPASTTATNNTGTTSHKD-QITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
           Q TRRPA+        TTS KD +IT+ RSS  G PSPRPAPRRYKLLKDV+C
Sbjct: 61  QRTRRPAAFAT-----TTSQKDRKITEIRSSAAGLPSPRPAPRRYKLLKDVMC 105

BLAST of CmUC05G080590 vs. ExPASy TrEMBL
Match: A0A1S3AVF1 (uncharacterized protein LOC103483122 OS=Cucumis melo OX=3656 GN=LOC103483122 PE=4 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 2.2e-36
Identity = 79/113 (69.91%), Positives = 90/113 (79.65%), Query Frame = 0

Query: 1   MDLNPPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ 60
           MDLN  PPLCCLCGDVGFPA LFRC +CS RFQHSYCSN+Y G +SGD I+++CDWCRS+
Sbjct: 41  MDLN--PPLCCLCGDVGFPANLFRCTNCSNRFQHSYCSNYYSG-ESGDAIIRVCDWCRSE 100

Query: 61  QITRRPASTTATNNTGTTSHKD-QITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
           Q TRRPA+        TTS KD +IT+ RSS  G PSPRPAPRRYKLLKDV+C
Sbjct: 101 QRTRRPAAFAT-----TTSQKDRKITEIRSSAAGLPSPRPAPRRYKLLKDVMC 145

BLAST of CmUC05G080590 vs. ExPASy TrEMBL
Match: A0A6J1IAU2 (uncharacterized protein LOC111472884 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111472884 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 3.3e-24
Identity = 60/108 (55.56%), Positives = 72/108 (66.67%), Query Frame = 0

Query: 5   PPPPLCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQQITR 64
           P PP+CCLCGDVGFPA LFRC  CS RFQHSYCSN+Y   +S + I ++CDWCR ++   
Sbjct: 14  PLPPVCCLCGDVGFPANLFRCTLCSHRFQHSYCSNYY--GESAEAI-EVCDWCRCERRCG 73

Query: 65  RPASTTATNNTGTTSHKDQITQRRSSGPGFPSPRPAPRRYKLLKDVLC 113
           R  S        +     Q  + R+SG G PSPR APRRYKLLKDV+C
Sbjct: 74  RRGSAARKFGVASQKSSGQDKRERNSG-GMPSPRVAPRRYKLLKDVMC 117

BLAST of CmUC05G080590 vs. TAIR 10
Match: AT3G60520.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02070.1); Has 107 Blast hits to 107 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 107; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 97.1 bits (240), Expect = 9.8e-21
Identity = 54/125 (43.20%), Positives = 68/125 (54.40%), Query Frame = 0

Query: 9   LCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQQITRRPAS 68
           +CC+CGDVGF  KLF C  C  RFQHSYCS++Y   +  DPI +ICDWC+ +  +R  A 
Sbjct: 8   VCCMCGDVGFFDKLFHCSKCLNRFQHSYCSSYY--KEQADPI-KICDWCQCEAKSRTGAK 67

Query: 69  TTATNNTGTTSHK------------DQITQRRSSG---------PGFPSPRPAPRRYKLL 113
                 +   S++             +I Q  SS           G PSPRPA RRYKLL
Sbjct: 68  HGVNGGSSKRSYRSEYSSPHHQIKQQEINQTTSSSIPPAADKGKTGVPSPRPATRRYKLL 127

BLAST of CmUC05G080590 vs. TAIR 10
Match: AT1G02070.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G60520.1); Has 98 Blast hits to 98 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 89.0 bits (219), Expect = 2.7e-18
Identity = 52/131 (39.69%), Positives = 67/131 (51.15%), Query Frame = 0

Query: 9   LCCLCGDVGFPAKLFRCVSCSTRFQHSYCSNFYCGDQSGDPILQICDWCRSQ-------- 68
           +CC+CGDVGF  KLF C  C  RFQHSYCSN+Y   Q  +P  +ICDWCRS         
Sbjct: 4   VCCMCGDVGFSDKLFSCGHCRCRFQHSYCSNYY--GQFAEP-TEICDWCRSDDRKLSNVA 63

Query: 69  ----QITRRPASTTATNN--------------TGTTSHKDQITQR-RSSGPGFPSPRPAP 113
                 +++P+S+    N                  +  DQ+ +     G G  SP+ A 
Sbjct: 64  RHGGSSSKKPSSSVKYENDFSNRSEYSPGHRIKHNNNRHDQVAKGVAGDGGGVTSPKTAT 123

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value    Identity (%)    Description
XP_022924678.1    4.1e-37    68.38           uncharacterized protein LOC111432109 [Cucurbita moschata]
XP_023526855.1    5.4e-37    68.38           uncharacterized protein LOC111790233 [Cucurbita pepo subsp. pepo]
XP_004134287.1    2.0e-36    69.91           uncharacterized protein LOC101222685 [Cucumis sativus] >KGN56359.1 hypothetical ...
XP_008437789.1    4.6e-36    69.91           PREDICTED: uncharacterized protein LOC103483122 [Cucumis melo]
KAA0048831.1      4.6e-36    69.91           uncharacterized protein E6C27_scaffold171G00440 [Cucumis melo var. makuwa] >TYK2...

ExPASy TrEMBL
Match Name    E-value    Identity (%)    Description
A0A6J1ED69    2.0e-37    68.38           uncharacterized protein LOC111432109 OS=Cucurbita moschata OX=3662 GN=LOC1114321...
A0A0A0L5P1    9.9e-37    69.91           Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G117940 PE=4 SV=1
A0A5D3DB67    2.2e-36    69.91           Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3AVF1    2.2e-36    69.91           uncharacterized protein LOC103483122 OS=Cucumis melo OX=3656 GN=LOC103483122 PE=...
A0A6J1IAU2    3.3e-24    55.56           uncharacterized protein LOC111472884 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...

TAIR 10
Match Name     E-value    Identity (%)    Description
AT3G60520.1    9.8e-21    43.20           unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G02070.1    2.7e-18    39.69           unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term    IPR Description     Source         Source Term       Source Description      Alignment
None        No IPR available    MOBIDB_LITE    mobidb-lite       disorder_prediction     coord: 64..101
None        No IPR available    MOBIDB_LITE    mobidb-lite       disorder_prediction     coord: 64..92
None        No IPR available    PANTHER        PTHR33779:SF11    OS02G0658200 PROTEIN    coord: 9..81
None        No IPR available    PANTHER        PTHR33779         EXPRESSED PROTEIN       coord: 77..112
None        No IPR available    PANTHER        PTHR33779         EXPRESSED PROTEIN       coord: 9..81
None        No IPR available    PANTHER        PTHR33779:SF11    OS02G0658200 PROTEIN    coord: 77..112

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmUC05G080590.1    CmUC05G080590.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
molecular_function    GO:0046872        metal ion binding