CaUC02G035090.1 (mRNA) Watermelon (USVL246-FR2) v1

Overview
NameCaUC02G035090.1
TypemRNA
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCiama_Chr02: 13688365 .. 13690282 (-)
Sequence length1918
RNA-Seq ExpressionCaUC02G035090.1
SyntenyCaUC02G035090.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GGCGAAGAGAGCTTCAAAAATCAAAATGGAAAGCGCCAGCTTCTACTGTTCAGCCTAAGCTTCGCAGCTAGTAGGATTAGAGGTTTCATTCGTTAATTTTGTTGCCCACTCACCGAGAAACCTTCGCAATCCAATTTCTCGGGCGAACTAATTTGCAATCCTTCTACAATCAATTCGGTTAGTTGATCTTCGAATCCCTGTTTTTCGATTTCCTTCCATTTGCATCTGAATTGGCCTGCAAATCAATTGAATAATAAGTTTTTTGAATTTATGTATCAATCACAAAACTCTGACTTGTGCGAAAATCATGAGGCTTTGGCCATTTCGTACATTTTCCACTCTGCTGGGCTCTGTAGCTAGAACTTCTGCTTCTGTAAACCCTAAATTGGATGTGAGTTTAGGGAAAAGGAATTGGAATTCAGTTCCAATACCTCACCGATCAATTCCCGAACCCAGAGGACAAGACCTTGATTTTGTTAACGTGGTACATAGCCACCTGATTCATTCGGATTGGACTAAGCTTGATTGTTTGTCATTGGGTCTGACAGCATTCAGAATCAAACACATTCTTCTGAAAACCCAGAAGGACTATGTACTCTCGCTCGAGTTCTTCAATTGGGTTTCGACTCAAAATCCTTCTTCTCATACCCTTGAGACCCATTGCATAATCCTCCATATACTTTCTAAACATAGAAAATTTAAATCTGCCGAATCAATTTTGAGGAGTATCATAGGATCTTGCTCTATTGACTTTCCTTCTAAGTTGTTTGAATCTTTACTGTACTCCTACCGACTGTGTGATTCTTCCCCCCATGTTTTTGATCTGCTTTTCAAGACTTTTGCTCATTTGAAGAAGTTTAGATATGCCTCTGATACATTTTGTAAGATGAAGGATTATGGCTTTTTACCCACTGTGGAATCATGTAATGCATTTCTCAGCTCCTTGCTTAAATTCAATAGGTACCGTATTGTGTTGGCCTTTTATGGGGAAATGCGGCGTAGTCGGATTTCCCCAAATTTGTATACAGTCAACATGGTTATTTGTGCTTCCTGTAAATTGGGAAGATTAGATAAGGCTACTGTGGTGTTTGAGGAAATGAGAACCATGGGGTTTTCTCCAAATGTTGTATCTTATAATACACTTATTTCCGGCTACTGCAATAAAGGTCTTCTGAGCTCAGCCATGAAGCTTAGAAGTGCAATGGAGAAAAATGGAGTTCCTCCAAATGCTGTTACATTTAGTATACTTATCAATGGATTCTGCAAGGATGTGAAGCTACAAGAAGCAAGTAAATTATTTGGTGAGATGAAAGGGATGAATTTATCTCCTACTACTGTCACCTACAATATTTTGATAAATGGCTACAGCAAAGTTGGCAATGGTGAGATGGGCAATAGGCTTTTTGAAGAGATGTCAAGGTTTCAAGTTAAAGCTGATATCCTTACTTACAATGCCCTGATATTGGGACTGTGCAAAGAGGGAAAGACAAAGAAAGCAGCATATCTGGTTAAAGAGCTTGATGAGAAGAGTCTTGTTCCGAATGCTTCAACCTTTTCTGCTCTAATTTATGGGCAATGTGTACTGAAGAAGTCGGAGCATGCATTTCAAATATATAAAAGCATGATAAAAAGTGGTTTTATTCCTTGCGATCAGACCTTTAGGATGTTGCTGTCCACTTTCTGCGAGAATGAGGATTATGATGGAGCAGTCCAGTTGTTGAAGGAAATGTTGAAGAGACACAAGGCTCCTGGTTTGAATAACTTATATGAGCTTTGTGCTGGACTTGGCCGGTGTGGAAAAGTTCAAACAGCTATAATGTTATGCACTGAGTTGGAAGCTCAACATCTCTTGCCAGAAGGTTTTGACAAATCCAAAGCATTTGGTTTGTTGCACAATAACGAAGATGCATTTGAATAG

mRNA sequence

GGCGAAGAGAGCTTCAAAAATCAAAATGGAAAGCGCCAGCTTCTACTGTTCAGCCTAAGCTTCGCAGCTAGTAGGATTAGAGGTTTCATTCGTTAATTTTGTTGCCCACTCACCGAGAAACCTTCGCAATCCAATTTCTCGGGCGAACTAATTTGCAATCCTTCTACAATCAATTCGGTTAGTTGATCTTCGAATCCCTGTTTTTCGATTTCCTTCCATTTGCATCTGAATTGGCCTGCAAATCAATTGAATAATAAGTTTTTTGAATTTATGTATCAATCACAAAACTCTGACTTGTGCGAAAATCATGAGGCTTTGGCCATTTCGTACATTTTCCACTCTGCTGGGCTCTGTAGCTAGAACTTCTGCTTCTGTAAACCCTAAATTGGATGTGAGTTTAGGGAAAAGGAATTGGAATTCAGTTCCAATACCTCACCGATCAATTCCCGAACCCAGAGGACAAGACCTTGATTTTGTTAACGTGGTACATAGCCACCTGATTCATTCGGATTGGACTAAGCTTGATTGTTTGTCATTGGGTCTGACAGCATTCAGAATCAAACACATTCTTCTGAAAACCCAGAAGGACTATGTACTCTCGCTCGAGTTCTTCAATTGGGTTTCGACTCAAAATCCTTCTTCTCATACCCTTGAGACCCATTGCATAATCCTCCATATACTTTCTAAACATAGAAAATTTAAATCTGCCGAATCAATTTTGAGGAGTATCATAGGATCTTGCTCTATTGACTTTCCTTCTAAGTTGTTTGAATCTTTACTGTACTCCTACCGACTGTGTGATTCTTCCCCCCATGTTTTTGATCTGCTTTTCAAGACTTTTGCTCATTTGAAGAAGTTTAGATATGCCTCTGATACATTTTGTAAGATGAAGGATTATGGCTTTTTACCCACTGTGGAATCATGTAATGCATTTCTCAGCTCCTTGCTTAAATTCAATAGGTACCGTATTGTGTTGGCCTTTTATGGGGAAATGCGGCGTAGTCGGATTTCCCCAAATTTGTATACAGTCAACATGGTTATTTGTGCTTCCTGTAAATTGGGAAGATTAGATAAGGCTACTGTGGTGTTTGAGGAAATGAGAACCATGGGGTTTTCTCCAAATGTTGTATCTTATAATACACTTATTTCCGGCTACTGCAATAAAGGTCTTCTGAGCTCAGCCATGAAGCTTAGAAGTGCAATGGAGAAAAATGGAGTTCCTCCAAATGCTGTTACATTTAGTATACTTATCAATGGATTCTGCAAGGATGTGAAGCTACAAGAAGCAAGTAAATTATTTGGTGAGATGAAAGGGATGAATTTATCTCCTACTACTGTCACCTACAATATTTTGATAAATGGCTACAGCAAAGTTGGCAATGGTGAGATGGGCAATAGGCTTTTTGAAGAGATGTCAAGGTTTCAAGTTAAAGCTGATATCCTTACTTACAATGCCCTGATATTGGGACTGTGCAAAGAGGGAAAGACAAAGAAAGCAGCATATCTGGTTAAAGAGCTTGATGAGAAGAGTCTTGTTCCGAATGCTTCAACCTTTTCTGCTCTAATTTATGGGCAATGTGTACTGAAGAAGTCGGAGCATGCATTTCAAATATATAAAAGCATGATAAAAAGTGGTTTTATTCCTTGCGATCAGACCTTTAGGATGTTGCTGTCCACTTTCTGCGAGAATGAGGATTATGATGGAGCAGTCCAGTTGTTGAAGGAAATGTTGAAGAGACACAAGGCTCCTGGTTTGAATAACTTATATGAGCTTTGTGCTGGACTTGGCCGGTGTGGAAAAGTTCAAACAGCTATAATGTTATGCACTGAGTTGGAAGCTCAACATCTCTTGCCAGAAGGTTTTGACAAATCCAAAGCATTTGGTTTGTTGCACAATAACGAAGATGCATTTGAATAG

Coding sequence (CDS)

ATGAGGCTTTGGCCATTTCGTACATTTTCCACTCTGCTGGGCTCTGTAGCTAGAACTTCTGCTTCTGTAAACCCTAAATTGGATGTGAGTTTAGGGAAAAGGAATTGGAATTCAGTTCCAATACCTCACCGATCAATTCCCGAACCCAGAGGACAAGACCTTGATTTTGTTAACGTGGTACATAGCCACCTGATTCATTCGGATTGGACTAAGCTTGATTGTTTGTCATTGGGTCTGACAGCATTCAGAATCAAACACATTCTTCTGAAAACCCAGAAGGACTATGTACTCTCGCTCGAGTTCTTCAATTGGGTTTCGACTCAAAATCCTTCTTCTCATACCCTTGAGACCCATTGCATAATCCTCCATATACTTTCTAAACATAGAAAATTTAAATCTGCCGAATCAATTTTGAGGAGTATCATAGGATCTTGCTCTATTGACTTTCCTTCTAAGTTGTTTGAATCTTTACTGTACTCCTACCGACTGTGTGATTCTTCCCCCCATGTTTTTGATCTGCTTTTCAAGACTTTTGCTCATTTGAAGAAGTTTAGATATGCCTCTGATACATTTTGTAAGATGAAGGATTATGGCTTTTTACCCACTGTGGAATCATGTAATGCATTTCTCAGCTCCTTGCTTAAATTCAATAGGTACCGTATTGTGTTGGCCTTTTATGGGGAAATGCGGCGTAGTCGGATTTCCCCAAATTTGTATACAGTCAACATGGTTATTTGTGCTTCCTGTAAATTGGGAAGATTAGATAAGGCTACTGTGGTGTTTGAGGAAATGAGAACCATGGGGTTTTCTCCAAATGTTGTATCTTATAATACACTTATTTCCGGCTACTGCAATAAAGGTCTTCTGAGCTCAGCCATGAAGCTTAGAAGTGCAATGGAGAAAAATGGAGTTCCTCCAAATGCTGTTACATTTAGTATACTTATCAATGGATTCTGCAAGGATGTGAAGCTACAAGAAGCAAGTAAATTATTTGGTGAGATGAAAGGGATGAATTTATCTCCTACTACTGTCACCTACAATATTTTGATAAATGGCTACAGCAAAGTTGGCAATGGTGAGATGGGCAATAGGCTTTTTGAAGAGATGTCAAGGTTTCAAGTTAAAGCTGATATCCTTACTTACAATGCCCTGATATTGGGACTGTGCAAAGAGGGAAAGACAAAGAAAGCAGCATATCTGGTTAAAGAGCTTGATGAGAAGAGTCTTGTTCCGAATGCTTCAACCTTTTCTGCTCTAATTTATGGGCAATGTGTACTGAAGAAGTCGGAGCATGCATTTCAAATATATAAAAGCATGATAAAAAGTGGTTTTATTCCTTGCGATCAGACCTTTAGGATGTTGCTGTCCACTTTCTGCGAGAATGAGGATTATGATGGAGCAGTCCAGTTGTTGAAGGAAATGTTGAAGAGACACAAGGCTCCTGGTTTGAATAACTTATATGAGCTTTGTGCTGGACTTGGCCGGTGTGGAAAAGTTCAAACAGCTATAATGTTATGCACTGAGTTGGAAGCTCAACATCTCTTGCCAGAAGGTTTTGACAAATCCAAAGCATTTGGTTTGTTGCACAATAACGAAGATGCATTTGAATAG

Protein sequence

MRLWPFRTFSTLLGSVARTSASVNPKLDVSLGKRNWNSVPIPHRSIPEPRGQDLDFVNVVHSHLIHSDWTKLDCLSLGLTAFRIKHILLKTQKDYVLSLEFFNWVSTQNPSSHTLETHCIILHILSKHRKFKSAESILRSIIGSCSIDFPSKLFESLLYSYRLCDSSPHVFDLLFKTFAHLKKFRYASDTFCKMKDYGFLPTVESCNAFLSSLLKFNRYRIVLAFYGEMRRSRISPNLYTVNMVICASCKLGRLDKATVVFEEMRTMGFSPNVVSYNTLISGYCNKGLLSSAMKLRSAMEKNGVPPNAVTFSILINGFCKDVKLQEASKLFGEMKGMNLSPTTVTYNILINGYSKVGNGEMGNRLFEEMSRFQVKADILTYNALILGLCKEGKTKKAAYLVKELDEKSLVPNASTFSALIYGQCVLKKSEHAFQIYKSMIKSGFIPCDQTFRMLLSTFCENEDYDGAVQLLKEMLKRHKAPGLNNLYELCAGLGRCGKVQTAIMLCTELEAQHLLPEGFDKSKAFGLLHNNEDAFE
Homology
The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038900606.12.0e-28691.04pentatricopeptide repeat-containing protein At4g26680, mitochondrial [Benincasa ... [more]
XP_008464167.11.9e-27387.66PREDICTED: pentatricopeptide repeat-containing protein At4g26680, mitochondrial ... [more]
XP_004140069.12.8e-27287.10pentatricopeptide repeat-containing protein At4g26680, mitochondrial [Cucumis sa... [more]
KAA0060788.14.7e-27287.48pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
TYK11943.11.4e-26887.50pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
Q9SZ106.1e-17459.68Pentatricopeptide repeat-containing protein At4g26680, mitochondrial OS=Arabidop... [more]
Q9FIX32.0e-6331.56Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Q0WVK75.7e-6332.79Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
O045041.5e-5830.89Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana OX... [more]
Q9LVQ52.3e-5630.90Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A1S3CKV39.2e-27487.66pentatricopeptide repeat-containing protein At4g26680, mitochondrial OS=Cucumis ... [more]
A0A0A0KFT31.3e-27287.10Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G111910 PE=4 SV=1[more]
A0A5A7UZU72.3e-27287.48Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5D3CJ346.8e-26987.50Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1CFK68.7e-26485.28pentatricopeptide repeat-containing protein At4g26680, mitochondrial isoform X1 ... [more]
Match NameE-valueIdentityDescription
AT4G26680.14.4e-17559.68Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G26680.24.4e-17559.68Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G39710.11.4e-6431.56Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G05670.14.1e-6432.79Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G05670.24.1e-6432.79Pentatricopeptide repeat (PPR-like) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 266..335
e-value: 5.5E-23
score: 83.5
coord: 336..409
e-value: 4.8E-20
score: 73.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 83..265
e-value: 5.0E-22
score: 80.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 415..530
e-value: 6.7E-13
score: 50.7
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 271..320
e-value: 5.9E-21
score: 74.4
coord: 377..424
e-value: 6.6E-11
score: 42.2
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 234..265
e-value: 2.2E-8
score: 33.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 344..372
e-value: 2.8E-7
score: 30.3
coord: 450..477
e-value: 0.015
score: 15.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 239..273
e-value: 2.5E-8
score: 31.6
coord: 380..413
e-value: 3.5E-7
score: 28.0
coord: 450..477
e-value: 2.9E-4
score: 18.8
coord: 309..342
e-value: 9.3E-6
score: 23.5
coord: 415..446
e-value: 9.4E-5
score: 20.3
coord: 274..308
e-value: 9.4E-8
score: 29.8
coord: 205..237
e-value: 0.0012
score: 16.9
coord: 344..377
e-value: 4.7E-7
score: 27.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 202..236
score: 8.736214
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 237..271
score: 11.849223
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 377..411
score: 11.41077
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 307..341
score: 11.706726
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 342..376
score: 10.928473
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 412..446
score: 10.413293
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 272..306
score: 13.044004
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 447..481
score: 9.174665
NoneNo IPR availablePANTHERPTHR47932ATPASE EXPRESSION PROTEIN 3coord: 1..533
NoneNo IPR availablePANTHERPTHR47932:SF32PPR CONTAINING PLANT-LIKE PROTEINcoord: 1..533

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CaUC02G035090CaUC02G035090gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CaUC02G035090.1-exonCaUC02G035090.1-exon-Ciama_Chr02:13688365..13690282exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CaUC02G035090.1-cdsCaUC02G035090.1-cds-Ciama_Chr02:13688365..13689975CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CaUC02G035090.1-five_prime_utrCaUC02G035090.1-five_prime_utr-Ciama_Chr02:13689976..13690282five_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CaUC02G035090.1CaUC02G035090.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding